This project provides a tiny web client for automated domain crawling. Its primary purpose is to generate baseline traffic by crawling Alexa's top 500 domains.
- requests
- eliot
- eliot-tree
- schedule
$mkdir -p ~/tools && cd ~/tools
$git clone https://github.com/sujitawake/webclient.git
$cd webclient
$pipenv install requests eliot eliot-tree schedule
$cd ~/tools/webclient
$pipenv shell # You should be inside a virtualenv now
$chmod +x run.py
$python run.py
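run.py itself isn't reproduced in this README, but a minimal sketch of the kind of crawl loop it implements, using requests for the HTTP fetches and schedule for the periodic trigger, might look like the following. The domains.txt filename and the one-hour interval are illustrative assumptions, not taken from the project.

```python
# Hypothetical sketch of the crawl loop -- not the actual run.py.
# "domains.txt" and the one-hour interval are illustrative assumptions.
import time

import requests
import schedule


def crawl(domain):
    url = "http://" + domain
    try:
        # requests follows redirects by default; response.url is the
        # final address after any redirection
        response = requests.get(url, timeout=10)
        print(url, "->", response.url, response.status_code)
    except requests.RequestException as exc:
        print(url, "failed:", exc)


def crawl_all():
    with open("domains.txt") as fh:
        for line in fh:
            domain = line.strip()
            if domain:
                crawl(domain)


if __name__ == "__main__":
    crawl_all()                     # run one pass immediately
    schedule.every(1).hours.do(crawl_all)
    while True:
        schedule.run_pending()      # then re-run on the schedule
        time.sleep(1)
```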
The program writes structured logs during execution, which helps with further debugging if anything goes wrong in a run. The Eliot module is integrated to generate the log file on the fly; a sketch of that integration follows the list below, and the commands after it show how to walk through the logs. The log file gives detailed information on the following:
- When program execution started and ended
- Details for each crawled domain:
  - Source domain address
  - Redirect destination address (when applicable)
  - Start/end time and how long each URL took to crawl
- Exceptions are caught and logged to the same log file (logs)
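The logging code isn't shown in this README; a minimal sketch of how the Eliot pieces could fit together — to_file to pick the destination, start_action to wrap each crawl with start/end timestamps and automatic exception logging — might look like this. The action_type and field names are assumptions, not taken from the project's source.

```python
# Hypothetical sketch of the Eliot wiring -- action_type and field
# names are assumptions, not taken from the project's source.
import requests
from eliot import start_action, to_file

# Send Eliot's structured messages to the "logs" file. Opening with
# mode "w" matches the overwrite-on-start behaviour noted below.
to_file(open("logs", "w"))


def crawl(url):
    # start_action records start and end timestamps automatically,
    # and logs the exception (then re-raises) if the block fails.
    with start_action(action_type="crawl", source=url) as action:
        response = requests.get(url, timeout=10)
        if response.history:  # at least one redirect occurred
            action.add_success_fields(redirected_to=response.url)
```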
$cat logs | eliot-tree --local-timezone
# OR
$eliot-tree logs --local-timezone
This gives a glimpse of the output you would see when debugging a crawling event that went wrong.
NOTE:
The generated log file is overwritten each time the program starts. So if you want to debug previously generated errors, don't forget to back up the old log file before rerunning.
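For example, one way to keep a timestamped copy before restarting:

$cp logs logs.$(date +%Y%m%d-%H%M%S).bak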