A pie chart plotter for Spark-streamed Twitter data collected from all over India during Valentine's week, built using plotly and geopy.

sahilsareen/LoveIsInTheAir-Stats-Plotter

LoveIsInTheAir-Stats-Plotter

A pie chart plotter for Spark-streamed Twitter data collected from all over India during Valentine's week, built using plotly and geopy. It plots the distribution of tweets across the states of India.
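
As a rough illustration of the idea, the sketch below sums per-city tweet counts into per-state totals and derives the labels, values, and percentage shares a pie chart needs. It is not the code in StatesTweetCount.py: the function names, the input shape (a dict of city to count), and the sample numbers are all assumptions made for this example, and the city-to-state lookup is a plain dict standing in for geopy.

```python
from collections import Counter


def tweets_per_state(city_counts, city_to_state):
    """Sum per-city tweet counts into per-state totals.

    city_counts: dict of city name -> tweet count (assumed input shape).
    city_to_state: callable returning a state name, or None if unknown.
    """
    totals = Counter()
    for city, count in city_counts.items():
        state = city_to_state(city)
        if state is None:
            continue  # skip cities that could not be resolved to a state
        totals[state] += count
    return dict(totals)


def pie_slices(state_totals):
    """Return (labels, values, shares) sorted by descending tweet count."""
    grand_total = sum(state_totals.values())
    ordered = sorted(state_totals.items(), key=lambda kv: (-kv[1], kv[0]))
    labels = [state for state, _ in ordered]
    values = [count for _, count in ordered]
    shares = [
        round(100.0 * value / grand_total, 1) if grand_total else 0.0
        for value in values
    ]
    return labels, values, shares


# Illustrative data only; these numbers are not from the real dataset.
SAMPLE_LOOKUP = {
    "Mumbai": "Maharashtra",
    "Pune": "Maharashtra",
    "Bengaluru": "Karnataka",
}
sample_counts = {"Mumbai": 6, "Pune": 2, "Bengaluru": 2, "Atlantis": 5}

totals = tweets_per_state(sample_counts, SAMPLE_LOOKUP.get)
labels, values, shares = pie_slices(totals)
```

The labels and values lists are the two inputs a plotly pie trace takes, so swapping the dict lookup for a geopy-backed one would give the general shape of the pipeline.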

Tweets of love from India (Feb 7-14, 2016)

  • Click to see the pie chart on plot.ly

Setup and HowTo

  1. Install python-2.7

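  2. Install the required third-party modules using pip install: geopy, plotly. The json and logging modules are also used, but they ship with the Python standard library and need no installation.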

  3. Clone LoveIsInTheAir-Stats-Plotter: git clone https://github.com/sahilsareen/LoveIsInTheAir-Stats-Plotter.git

  4. Get a plot.ly account

  5. Update the configuration

  • Set the Spark config:
    • spark_file_name: Location of the cached files of Spark-streamed Twitter data stored by LoveIsInTheAir
    • spark_num_partitions: Number of partitions (cached files) stored
  • geopy_timeout: Timeout for each city-to-state lookup made by geopy
  • Set the plotly config:
    • plotly_username: Your username on plot.ly
    • plotly_api_key: Your plot.ly API key
    • plotly_plot_name: Name of the pie chart
  6. Run: cd LoveIsInTheAir-Stats-Plotter/src && python StatesTweetCount.py
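
To make the configuration step more concrete, here is a minimal sketch of the six settings as a Python dict, a check for missing keys, and one way geopy_timeout could be passed to a city-to-state lookup. This README does not say where or in what format the project stores its configuration, so the dict form, the placeholder values, and the helper names (missing_keys, state_for_city, nominatim_geocode) are assumptions. The choice of geopy's Nominatim geocoder is likewise a guess, not something the README states.

```python
REQUIRED_KEYS = (
    "spark_file_name",
    "spark_num_partitions",
    "geopy_timeout",
    "plotly_username",
    "plotly_api_key",
    "plotly_plot_name",
)

# Placeholder values only; replace every one with your own.
example_config = {
    "spark_file_name": "/path/to/cached/tweets",  # hypothetical location
    "spark_num_partitions": 4,                    # hypothetical count
    "geopy_timeout": 10,                          # geopy timeouts are in seconds
    "plotly_username": "your-username",
    "plotly_api_key": "your-api-key",
    "plotly_plot_name": "Tweets of love from India",
}


def missing_keys(config):
    """Return the required settings that are absent from config."""
    return [key for key in REQUIRED_KEYS if key not in config]


def state_for_city(city, geocode, timeout):
    """Resolve an Indian city name to its state, or None if not found.

    geocode is any callable with the calling convention of geopy's
    Nominatim.geocode; passing it in keeps this function testable offline.
    """
    location = geocode(city + ", India", timeout=timeout, addressdetails=True)
    if location is None:
        return None
    # With addressdetails=True, Nominatim includes an "address" dict in raw.
    return location.raw.get("address", {}).get("state")


def nominatim_geocode():
    """Return a Nominatim geocode method (needs geopy and network access)."""
    # Imported lazily so the rest of the sketch loads without geopy installed.
    from geopy.geocoders import Nominatim
    return Nominatim(user_agent="loveisintheair-stats-plotter-example").geocode
```

With geopy installed, a call such as state_for_city("Pune", nominatim_geocode(), example_config["geopy_timeout"]) would perform a live lookup. Passing the geocoder in as an argument lets the same function be exercised with a stub when no network is available.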

Contributing

  1. Open a pull request, OR
  2. Email patches to sahil [DOT] sareen [AT] hotmail [DOT] com

License

See License

Author

  • Sahil Sareen (sahil [DOT] sareen [AT] hotmail [DOT] com)
