Welcome to our AI-based web scraper project, designed to efficiently extract data using advanced machine learning models to gather and process real estate listings, providing users with structured and valuable information.
Ensure you have Python 3.12.5 installed for compatibility with all dependencies and features. Note that Python 3.13 does not work in this project as of October 2024.
Download Python 3.12.5 from the official Python website.
git clone https://github.com/tomycabre/AI-Web-Scrapper.git
cd AI-Web-Scrapper
py -m venv ai
python3 -m venv ai
.\ai\Scripts\activate.bat
source ai/bin/activate
py -m pip install -r requirements.txt
pip install -r requirements.txt
py -m streamlit run main.py
streamlit run main.py
- Download Chromedriver from here.
- Select "Stable" and download the (
win64
orwin32
) zip file. - Extract the zip file and copy
chromedriver.exe
to your AI-WEB-SCRAPPER folder.
- Delete the existing
chromedriver.exe
from the project. - Download Chromedriver from here.
- Select "Stable" and download the appropriate zip file (
mac-arm64
ormac-x64
). - Extract the zip file and copy
chromedriver
to your AI-WEB-SCRAPPER folder.
-
Download Ollama from here.
-
Select your platform (MacOS, Linux, Windows) and complete the installation.
-
Open Command Prompt and type:
ollama
-
Download your desired Ollama version (e.g., llama3):
ollama run llama3
-
Update the model version in
parse.py
(line 17):model = OllamaLLM(model="your model version here")
Congratulations! You have successfully set up the AI-Based Real Estate Web Scraper.
- Automated Data Extraction: Scrapes real estate listings from multiple websites.
- AI Integration: Uses machine learning models to enhance data extraction accuracy.
- Customizable: Easily update the model version and scraping parameters.
Instructions on how to use the web scraper will be provided here.
We welcome contributions! Please open issues or submit pull requests.
This project is licensed under the MIT License.