Parses text from Wikipedia articles and provides the most common words used.

jreinlein/wiki-count

WikiCount

Parses the text of a Wikipedia article and lists the most common words used in it.

Input: The URL of an English Wikipedia article.

Output: The most common words used in that article, in descending order of frequency.
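
The counting step behind this input/output contract can be sketched roughly as follows. This is a minimal illustration, not WikiCount's actual code: it assumes the article text has already been extracted to a plain string (for example with JSoup), and the class name `WordCountSketch` and method `mostCommonWords` are hypothetical. It also splits on every non-letter character, so a word like "don't" is counted as "don" and "t".

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical sketch; names are illustrative and not taken from the WikiCount source.
final class WordCountSketch {

    /** Returns up to {@code limit} (word, count) pairs, highest count first; ties are ordered alphabetically. */
    static List<Map.Entry<String, Integer>> mostCommonWords(String text, int limit) {
        Map<String, Integer> counts = new HashMap<>();

        // Lower-case the text and split on any run of non-letter characters.
        for (String token : text.toLowerCase(Locale.ENGLISH).split("[^\\p{L}]+")) {
            if (!token.isEmpty()) {
                counts.merge(token, 1, Integer::sum);
            }
        }

        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });

        // Copy the sublist so the result does not depend on the larger list.
        return new ArrayList<>(entries.subList(0, Math.min(limit, entries.size())));
    }

    public static void main(String[] args) {
        String sample = "The cat and the dog and the bird";
        for (Map.Entry<String, Integer> entry : mostCommonWords(sample, 3)) {
            System.out.println(entry.getKey() + ": " + entry.getValue());
        }
    }
}
```

A real implementation would probably also filter out very common stop words such as "the" and "and"; that step is left out here to keep the sketch short.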

Download

⬇️ Version 1.1

Future Plans

  1. Support non-English articles
  2. Generate a word cloud from the most common words
  3. Export results to a .csv or .xls file
  4. Show the article title for randomly generated articles
    1. Implement a quiz-type game (match a word list to the article name)
  5. Support non-Wikipedia URLs
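
As a rough idea of what the planned .csv export (item 3) could look like, here is a minimal sketch. The feature is not implemented yet, so everything in it is an assumption: the class `CsvExportSketch`, the method `toCsv`, and the `word,count` header are hypothetical, and the word counts are assumed to arrive in a map that already iterates in the desired order.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch of a planned feature; not part of WikiCount 1.1.
final class CsvExportSketch {

    /** Renders word counts as CSV text with a header row, one word per line. */
    static String toCsv(Map<String, Integer> counts) {
        StringBuilder csv = new StringBuilder("word,count\n");
        for (Map.Entry<String, Integer> entry : counts.entrySet()) {
            csv.append(escape(entry.getKey()))
               .append(',')
               .append(entry.getValue())
               .append('\n');
        }
        return csv.toString();
    }

    /** Quotes a field if it contains a comma, a double quote, or a line break. */
    static String escape(String field) {
        if (field.contains(",") || field.contains("\"") || field.contains("\n")) {
            return "\"" + field.replace("\"", "\"\"") + "\"";
        }
        return field;
    }

    public static void main(String[] args) {
        // LinkedHashMap keeps insertion order, so rows come out most common first.
        Map<String, Integer> counts = new LinkedHashMap<>();
        counts.put("the", 3);
        counts.put("and", 2);
        System.out.print(toCsv(counts));
    }
}
```

The resulting string could then be written to disk with `java.nio.file.Files`; .xls output would need a separate library and is not covered here.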

Libraries

  • JSoup (parses HTML)
  • Apache Commons Lang (3.x)
  • JavaFX 2
