Skip to content

revature-scalawags/jeroen-proj1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

jeroen-proj1

Requirements

- Scala and Hadoop application for answering interesting questions about large data sets
- Use following technologies
    - Hadoop MapReduce
    - YARN
    - HDFS
    - Scala
    - Hive
    - Git + GitHub
- Dataset
    - [All Analytics](https://dumps.wikimedia.org/other/analytics/)
- Question
    - Which English wikipedia article got the most traffic on October 20?
    - Analyze how many users will see the average vandalized wikipedia page before the offending edit is reversed.

Features

- CLI (low)(easy)
- Menu provides users with queries to make on the data (high level not actual Hive syntax) (low)(easy)

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages