Skip to content

megasiska86/web-scrapers

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

web-scrapers

This is a repository for my web-scraping projects.

Requirements

Text Summarization

News articles and their bullet-point summaries scraped from Times of India News Archive.

Medical NER

Diseases and treatments/tests scraped from medical websites. Following gazetteers have been been created:

  1. malacards-diseases scraped from malacards.org (18455 entries).
  2. medicinenet-diseases scraped from medicinenet.com (4969 entries).
  3. medicinenet-treatments scraped from medicinenet.com (931 entries).

References

About

A repository of my web-scraping projects

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%