A pie chart plotter for Spark Twitter streaming data collected from all over India during Valentine's week, built using plotly and geopy. It plots the distribution of tweets across the states of India.
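As a rough illustration of the kind of chart described above, the sketch below turns a per-state tweet count into a figure description with a single pie trace, in the plain-dict form that plotly accepts. The function name `build_pie_figure`, the title, and the sample counts are assumptions made for this example; they are not taken from the project's code or data.

```python
import json


def build_pie_figure(state_counts, title="Tweets per state"):
    """Build a plotly-style figure dict containing one pie trace."""
    # Sort by count (largest first), then by name, so the output is stable.
    items = sorted(state_counts.items(), key=lambda kv: (-kv[1], kv[0]))
    labels = [state for state, _ in items]
    values = [count for _, count in items]
    return {
        "data": [{"type": "pie", "labels": labels, "values": values}],
        "layout": {"title": title},
    }


# Made-up counts, for illustration only.
sample_counts = {"Maharashtra": 120, "Karnataka": 80, "Delhi": 100}
figure = build_pie_figure(sample_counts)
print(json.dumps(figure, indent=2))
```

The resulting dict only describes the chart; uploading or rendering it is left to plotly and depends on the plotly version and account settings in use.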
- Click to see the pie chart on plot.ly
- Install python-2.7
- Install the following modules using pip install: geopy, plotly. The json and logging modules are part of the Python standard library and need no separate install.
- Clone LoveIsInTheAir-Stats-Plotter: git clone https://github.com/sahilsareen/LoveIsInTheAir-Stats-Plotter.git
- Get a plot.ly account
- Update the configuration
  - Set the spark config:
    - spark_file_name: The location where the Spark-streamed Twitter cache files are stored by LoveIsInTheAir
    - spark_num_partitions: The number of cached files stored as partitions
    - geopy_timeout: Timeout per city-to-state lookup by geopy
  - Set the plotly config:
- Run: cd LoveIsInTheAir-Stats-Plotter/src && python StatesTweetCount.py
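To make the role of the spark settings more concrete, here is a minimal sketch of how the partition files could be located and how cities could be tallied per state. It is not the code in StatesTweetCount.py. The helper names `partition_paths` and `count_tweets_by_state`, the `part-00000` file naming, and the injected `lookup_state` callable are all assumptions for illustration.

```python
import os
from collections import Counter


def partition_paths(spark_file_name, spark_num_partitions):
    """Yield one path per partition (assumes Spark-style part-00000 names)."""
    for index in range(spark_num_partitions):
        yield os.path.join(spark_file_name, "part-%05d" % index)


def count_tweets_by_state(cities, lookup_state):
    """Count cities per state.

    lookup_state is any callable mapping a city name to a state name,
    or to None when the city cannot be resolved.
    """
    counts = Counter()
    cache = {}  # remember results so each city is looked up only once
    for city in cities:
        city = city.strip()
        if not city:
            continue  # skip blank entries
        if city not in cache:
            try:
                cache[city] = lookup_state(city)
            except Exception:
                # A failed or timed-out lookup is treated as "unknown".
                cache[city] = None
        state = cache[city]
        if state is not None:
            counts[state] += 1
    return counts


# Stand-in lookup table used instead of a real geocoder (hypothetical data).
fake_states = {"Mumbai": "Maharashtra", "Pune": "Maharashtra",
               "Bengaluru": "Karnataka"}
cities = ["Mumbai", "Pune", "Bengaluru", "Mumbai\n", "", "Atlantis"]
counts = count_tweets_by_state(cities, fake_states.get)
print(dict(counts))
print(list(partition_paths("cache", 2)))
```

In a real run, `lookup_state` would presumably wrap a geopy geocoder call and pass `geopy_timeout` as its timeout. Caching results per city keeps the number of network lookups down when the same city appears in many tweets.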
- Generate a pull request, OR
- Email patches to
sahil [DOT] sareen [AT] hotmail [DOT] com
- Stick to the Python style guide
See License
- Sahil Sareen (sahil [DOT] sareen [AT] hotmail [DOT] com)