From 2e8d5c5ef41f59bc085954816835f524ef53c754 Mon Sep 17 00:00:00 2001
From: noorbasha19 <60005351+noorbasha19@users.noreply.github.com>
Date: Thu, 1 Oct 2020 08:03:13 +0530
Subject: [PATCH] Updated README.md

Added one more module, which is useful for most web scraping and for
parsing scraped data.
---
 README.md | 1 +
 1 file changed, 1 insertion(+)

diff --git a/README.md b/README.md
index 8ff2b36..4a5c4c6 100644
--- a/README.md
+++ b/README.md
@@ -27,6 +27,7 @@ We don't need [a lot of libraries](https://github.com/vinta/awesome-python). We
 
 #### Web Crawling
 * [Scrapy](http://scrapy.org/) - A fast high-level screen scraping and web crawling framework.
+* [beautifulsoup4](https://www.crummy.com/software/BeautifulSoup/bs4/doc/) - A library for parsing HTML and XML documents and extracting data from them.
 
 #### Web Content Extracting
 * [newspaper](https://github.com/codelucas/newspaper) - News extraction, article extraction and content curation in Python.