FastAPI for efficient, AI-driven web scraping using Scrapegraph-ai
Note
This project is a fork of scrapegraph-ai-fastapi, fixed and adapted to support multi-page scraping.
Copy `.env.example` to `.env` and configure your API keys. Due to the special nature of the Gemini model, it is configured separately; other models are configured via `API_KEY` and `API_BASE_URL`.
```
GOOGLE_API_KEY=
GOOGLE_API_ENDPOINT=
API_KEY=
API_BASE_URL=
```
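If you want to check that your `.env` is picked up, here is a minimal sketch using `python-dotenv`; this is an assumption for illustration, as the project may load its settings differently (e.g. via pydantic-settings).

```python
# Minimal sketch: load .env and inspect the keys mentioned above.
# Assumes python-dotenv is installed; the project's own settings
# loading may differ.
import os

from dotenv import load_dotenv

load_dotenv()  # reads key=value pairs from .env into os.environ

for name in ("GOOGLE_API_KEY", "GOOGLE_API_ENDPOINT", "API_KEY", "API_BASE_URL"):
    value = os.getenv(name)
    print(f"{name} is {'set' if value else 'not set'}")
```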
Ensure you have a Docker instance running. For macOS, I recommend using OrbStack.
Available commands:

- `npm run docker:build` - Build the Docker image
- `npm run docker:dev` - Run the container in development mode
- `npm run dev` - Build and run in one command
- `npm run docker:stop` - Stop running containers
- `npm run docker:clean` - Clean up Docker resources
The API supports multiple model providers and models, using LangChain's `init_chat_model` (a minimal sketch follows the provider list below).
- **Google Gemini**
  - Provider: `google_genai`
  - Model: `google_genai/gemini-1.5-flash-latest` (or another model)
  - Requires: `GOOGLE_API_KEY` or `GOOGLE_API_ENDPOINT` in `.env`
- **OpenAI**
  - Provider: `openai`
  - Model: `gpt-4o-mini` (or another model)
  - Requires: `API_KEY` or `API_BASE_URL` in `.env`
- **Ollama**
  - Provider: `ollama`
  - Model: `ollama/llama3.1` (or another model)
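For reference, here is a minimal sketch of how a provider/model pair maps onto `init_chat_model` when called directly. The model names mirror the examples above, but the exact wiring inside this project may differ; each provider also needs its LangChain integration package installed (`langchain-google-genai`, `langchain-openai`, `langchain-ollama`) and the corresponding keys from `.env`.

```python
# Minimal sketch, not the project's actual implementation.
import os

from langchain.chat_models import init_chat_model

# Google Gemini: expects GOOGLE_API_KEY in the environment.
gemini = init_chat_model("gemini-1.5-flash-latest", model_provider="google_genai")

# OpenAI-compatible endpoint: api_key / base_url correspond to the
# API_KEY and API_BASE_URL values from .env (passed explicitly here).
gpt = init_chat_model(
    "gpt-4o-mini",
    model_provider="openai",
    api_key=os.getenv("API_KEY"),
    base_url=os.getenv("API_BASE_URL") or None,
)

# Ollama: talks to a locally running Ollama server.
llama = init_chat_model("llama3.1", model_provider="ollama")

print(gpt.invoke("Reply with a single word: ready?").content)
```

Passing `model_provider` explicitly avoids relying on LangChain's provider inference from the model name.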
You can find more supported models in the LangChain documentation for `init_chat_model`.
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
If you're interested in contributing to this project, please read the contribution guide.