Skip to content

Latest commit

 

History

History
97 lines (72 loc) · 3.51 KB

README.md

File metadata and controls

97 lines (72 loc) · 3.51 KB

Housing scrapper

CI

This is a tool that notifies through Telegram when a new property becomes available on any of the listing websites.

This repository is based on https://github.com/rodrigouroz/housing_scrapper.

I rewrote much of the code as a personal challenge. New features include:

  • Asynchronous requests
  • Type annotations
  • Automated tests
  • Continuous integration setup

Installation

The recommended way to install dependencies is using a virtual environment:

cd <PATH_TO_REPOSITORY>           # Change directory to this repository
python3 -m venv env               # Create a virtual environment folder
source env/bin/activate           # Activate virtual environment
pip3 install -r requirements.txt  # Install dependencies

Configuration

There is a configuration.example.yml that you can use as a template. Copy that file and rename it to configuration.yml.

Notifier

First you need to create a Telegram Bot and configure the notifier with the following two parameters:

  • The token is a string along the lines of 110201543:AAHdqTcvCH1vGWJxfSeofSAs0K5PALDsaw that is required to authorize the bot and send requests to the Bot API. Keep your token secure and store it safely, it can be used by anyone to control your bot.

  • The chat_id is a number that identifies the chat your bot will use to notify you. After talking to your bot first or adding it to a telegram group, you can visit https://api.telegram.org/botXXX:YYYYY/getUpdates to see the chat_ids (replace XXX:YYYYY with your authentication token).

Providers

You can setup any of the providers as shown below:

providers:
  zonaprop:
    base_url: 'https://www.zonaprop.com.ar'
    sources:
      - '/departamentos-alquiler-2-habitaciones.html'
      - '/ph-alquiler-2-habitaciones.html'
  argenprop:
    base_url: 'https://www.argenprop.com'
    sources:
      - '/departamento-alquiler-pais-argentina-2-dormitorios'
      - '/ph-alquiler-pais-argentina-2-dormitorios'
  mercadolibre:
    base_url: 'https://inmuebles.mercadolibre.com.ar'
    sources:
      - '/departamentos/alquiler/2-dormitorios/'
      - '/casas/alquiler/2-dormitorios/'
  properati:
    base_url: 'https://www.properati.com.ar'
    sources:
      - '/departamento/alquiler/ambientes:2'
  inmobusqueda:
    base_url: 'https://www.inmobusqueda.com.ar'
    sources:
      - '/departamento-alquiler-la-plata-casco-urbano.html?cambientes=2.'

Usage

There is a main script to run the program with the following options:

usage: main.py [-h] [-c CONFIG] [-d DATABASE] [--notify] [--no-notify]

Housing scrapper tool

optional arguments:
  -h, --help            show this help message and exit
  -c CONFIG, --config CONFIG
                        path to configuration file. It defaults to `configuration.yml`
  -d DATABASE, --database DATABASE
                        path to database file. It defaults to `properties.db`
  --notify              notify results through a telegram bot [default]
  --no-notify           avoid notifying results

You can add the following line to your crontab to run once every hour (using the virtual environment mentioned in the installation instructions):

0 * * * * cd <PATH_TO_REPOSITORY> && env/bin/python3 -m main >> run.log 2>&1

License

MIT