Simple web crawer Base python3 and requests and BeautifulSoup4 Cancel Regular Expression Html parsing requirements vary: get text get image get pdf get ... You can do your parse with py code.