🌐 AI-Powered Web Scraper + Q&A with Ollama + FAISS

Scrape website content, store it in a vector DB, and ask questions about it using a local LLM (Mistral via Ollama). Built with LangChain, FAISS, and Streamlit.

🛠 Features

🌍 Web scraping using requests + BeautifulSoup
🔍 Embedding text chunks via sentence-transformers
💾 Semantic search using FAISS vector database
🤖 Local LLM (Mistral via Ollama) for Q&A
🖥️ Easy-to-use Streamlit UI

📦 Installation

1. Clone the repository

git clone https://github.com/AzkaSahar/AI-Web-Scraper.git
cd AI-Web-Scraper

2. Install dependencies

pip install -r requirements.txt

🧠 Prerequisites

Install and run Ollama on your machine
Pull the Mistral model:

ollama pull mistral
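
Before launching the app, it can help to confirm that Ollama is running and that the model has been pulled. The sketch below is not part of this repository: it assumes Ollama is listening on its default local address (http://localhost:11434) and uses its /api/tags endpoint, which lists locally available models. The function names are hypothetical.

```python
import json
import urllib.error
import urllib.request
from typing import Optional

OLLAMA_URL = "http://localhost:11434"  # assumption: Ollama's default local address


def model_names(tags_response: dict) -> list:
    """Extract model names from the JSON body returned by GET /api/tags."""
    return [m.get("name", "") for m in tags_response.get("models", [])]


def has_model(tags_response: dict, wanted: str = "mistral") -> bool:
    """True if a pulled model matches `wanted`, ignoring any ':tag' suffix."""
    return any(name.split(":")[0] == wanted for name in model_names(tags_response))


def fetch_tags(base_url: str = OLLAMA_URL, timeout: float = 5.0) -> Optional[dict]:
    """Return the parsed /api/tags response, or None if Ollama is unreachable."""
    try:
        with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
            return json.load(resp)
    except (urllib.error.URLError, OSError, ValueError):
        return None


if __name__ == "__main__":
    tags = fetch_tags()
    if tags is None:
        print("Ollama does not appear to be running.")
    elif has_model(tags):
        print("Mistral is available.")
    else:
        print("Ollama is running, but Mistral has not been pulled yet.")
```

If the check reports that Mistral is missing, running `ollama pull mistral` again should resolve it.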

🚀 Usage

streamlit run ai_webscraper.py

💡 How it works:

Enter a website URL
The app scrapes the page, splits the text into chunks, embeds them, and stores them in a FAISS index
Ask a question; the app retrieves the most relevant chunks and passes them to the LLM
The LLM answers based on the retrieved content
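
The retrieval steps above can be sketched in plain Python. This is a simplified illustration, not the code in ai_webscraper.py: a bag-of-words vector and brute-force cosine similarity stand in for the sentence-transformers embeddings and the FAISS index, and the function names, chunk sizes, and prompt wording are all assumptions.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_words=50, overlap=10):
    """Split text into overlapping windows of words."""
    words = text.split()
    step = max(1, chunk_words - overlap)
    return [" ".join(words[i:i + chunk_words]) for i in range(0, len(words), step)]


def embed(text):
    """Toy embedding: lowercase word counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question (stand-in for a FAISS search)."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(question, context_chunks):
    """Combine the retrieved chunks and the question into one prompt for the LLM."""
    context = "\n\n".join(context_chunks)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    )


if __name__ == "__main__":
    page_text = (
        "FAISS is a library for vector similarity search. "
        "Streamlit builds web apps in Python. "
        "Ollama runs language models locally."
    )
    chunks = chunk_text(page_text, chunk_words=8, overlap=0)
    question = "What does FAISS search?"
    print(build_prompt(question, retrieve(question, chunks, k=1)))
```

In the app, the resulting prompt would be sent to Mistral through Ollama. The overlap between chunks is a common way to avoid cutting a relevant sentence in half at a chunk boundary.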

🗂️ Optional Folder Structure (if you want to organize it)


AI-Web-Scraper/
├── ai_webscraper.py             # Main Streamlit script
├── requirements.txt
└── README.md

📝 License

MIT — free to use, modify, and distribute.
