Skip to content

philipppahl/pydata_berlin

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PyData Berlin Meetup

PyData Meetup 11/11/2014

Slides and Python notebook of the talk on PySpark.

To get started download Spark, set the environment variables and start the notebook.

export SPARK_HOME="<path_to_spark>";                                                                                        
export PYTHONPATH="$SPARK_HOME/python/:$PYTHONPATH";  

The data is a small sub-sample of the original dataset: "Amazon movie reviews": http://snap.stanford.edu/data/web-Movies.html

About

PyData Meetup 11/11/2014

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published