hyperlink_crawler

Traverses the Web as a linked graph, starting from the page given by --url. For each page it finds all outgoing links (<a> tags) and stores them against that URL, then repeats the process for each of them until --limit URLs have been traversed. The output is a JSON file with all incoming and outgoing link information.
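
The traversal described above can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the contents of hyperlinks.py: the names `LinkParser`, `extract_links`, `fetch_html`, and `crawl` are hypothetical, the breadth-first visiting order is an assumption, and so is the JSON layout (one entry per traversed URL with `outgoing` and `incoming` lists).

```python
import json
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin
from urllib.request import urlopen


class LinkParser(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_links(base_url, html):
    """Return absolute, de-duplicated http(s) links found in html."""
    parser = LinkParser()
    parser.feed(html)
    links = []
    for href in parser.hrefs:
        # Resolve relative links and drop any #fragment part.
        absolute, _fragment = urldefrag(urljoin(base_url, href))
        if absolute.startswith(("http://", "https://")) and absolute not in links:
            links.append(absolute)
    return links


def fetch_html(url):
    """Download a page body as text (assumed default fetcher)."""
    with urlopen(url, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def crawl(start_url, limit, fetch=fetch_html):
    """Traverse up to `limit` URLs breadth-first from start_url."""
    graph = {}
    queue = deque([start_url])
    while queue and len(graph) < limit:
        url = queue.popleft()
        if url in graph:
            continue
        try:
            html = fetch(url)
        except Exception:
            html = ""  # unreachable pages are recorded with no outgoing links
        outgoing = extract_links(url, html)
        graph[url] = {"outgoing": outgoing, "incoming": []}
        queue.extend(link for link in outgoing if link not in graph)
    # Second pass: derive incoming links among the traversed URLs.
    for source, info in graph.items():
        for target in info["outgoing"]:
            if target in graph:
                graph[target]["incoming"].append(source)
    return graph
```

Under these assumptions, `json.dump(crawl(url, limit), open("links.json", "w"), indent=2)` would write the result to a file; `links.json` is a placeholder name. Passing a custom `fetch` function makes the traversal easy to exercise without network access.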

Files: README.md, hyperlinks.py


04msambit/hyperlink_crawler
