Skip to content

Tool designed to scrape and analyze job vacancies and technology trends from dou.ua, allowing to gather relevant data

Notifications You must be signed in to change notification settings

lilarin/dou-scrape-and-analyse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Vacancies Scraper for dou.ua

This tool scrapes and analysis vacancies and technologies trends from the dou.ua, allowing users to specify domains and technologies to collect the desired information.

Features

  • Scrape vacancies using the Scrapy and Selenium frameworks.
  • Specify domains for search (e.g., Python, Java, QA, etc.).
  • Specify technologies to measure their popularity.
  • Use Jupyter Notebook to analyze data and generate charts in a PDF file.
  • Support automated execution via script, as well as manual launch with Scrapy and Jupyter Notebook.

Examples

image image image

Configuration

You can modify the scraping configuration in the app.config section to specify the domain and technologies to scrape.

Launch

After filling out the app.config file, you can run the automated script using:

python app/main.py

Alternatively, you can manually scrape data using:

scrapy crawl vacancies

To launch the Jupyter Notebook manually, run it yourself as needed.

Requirements

  • Python 3.10+
  • Scrapy
  • Selenium
  • Jupyter Notebook

You can install all dependencies by running:

pip install -r requirements.txt

About

Tool designed to scrape and analyze job vacancies and technology trends from dou.ua, allowing to gather relevant data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published