Skip to content

Latest commit

 

History

History
51 lines (33 loc) · 1.29 KB

README.md

File metadata and controls

51 lines (33 loc) · 1.29 KB

Vacancies Scraper for dou.ua

This tool scrapes and analysis vacancies and technologies trends from the dou.ua, allowing users to specify domains and technologies to collect the desired information.

Features

  • Scrape vacancies using the Scrapy and Selenium frameworks.
  • Specify domains for search (e.g., Python, Java, QA, etc.).
  • Specify technologies to measure their popularity.
  • Use Jupyter Notebook to analyze data and generate charts in a PDF file.
  • Support automated execution via script, as well as manual launch with Scrapy and Jupyter Notebook.

Examples

image image image

Configuration

You can modify the scraping configuration in the app.config section to specify the domain and technologies to scrape.

Launch

After filling out the app.config file, you can run the automated script using:

python app/main.py

Alternatively, you can manually scrape data using:

scrapy crawl vacancies

To launch the Jupyter Notebook manually, run it yourself as needed.

Requirements

  • Python 3.10+
  • Scrapy
  • Selenium
  • Jupyter Notebook

You can install all dependencies by running:

pip install -r requirements.txt