Skip to content

revature-scalawags/Zeshawn_Project1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Zeshawn_Project1

Project Description

A simple Hive MapReduce application that utilizes Hive to analyze very large data sets

Technologies Used

-Scala 2.13.3 -Hadoop 3.2.1 -Hive -YARN -sbt 1.4.4 -Docker container

Features

  • InputStream - Retrieves twitter stream with Spark session
  • dataMapper - Maps every key in the dataframe to a value
  • dataReduce - Reduces the datasets so that all the keys are distinct values

Getting Started

  • MapReduce
  • Install & Configure git
  • Install xCode for easy access

Usage

  • sbt assembly to package files
  • sbt compile to build
  • sbt run to output

Contributors

Zeshawn Manzoor

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published