Analyzing Data with Python and Apache Spark
The sample data is a table with three columns: City, Date, Temperature.
The task includes the following:
- Calculate maximum, minimum and average temperature for all cities per month.
- Calculate maximum, minimum and average temperature for all cities per month, but only for months that have at least a given number of records.
- For each daily record, calculate the difference between that day's Temperature and the Min, Max and Avg Temperature of the same month.
- Calculate maximum, minimum and average temperature per month for a given set of cities.
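The four tasks above can be pinned down with a small plain-Python reference, which may be handy for checking Spark results on a tiny sample. This is a minimal sketch, not the Spark solution itself. The sample rows and the names `ROWS`, `monthly_stats`, and `daily_differences` are hypothetical. It also assumes that "per month" means per city and calendar month, and that the record threshold applies to each such group. In Spark the same logic would presumably map onto a grouped aggregation plus a join or window over the same keys.

```python
from collections import defaultdict
from datetime import date
from statistics import mean

# Hypothetical sample rows in the (City, Date, Temperature) layout.
ROWS = [
    ("Oslo", date(2020, 1, 1), -3.0),
    ("Oslo", date(2020, 1, 2), -1.0),
    ("Oslo", date(2020, 2, 1), 1.0),
    ("Rome", date(2020, 1, 1), 9.0),
    ("Rome", date(2020, 1, 2), 11.0),
]


def monthly_stats(rows, min_records=1, cities=None):
    """Return {(city, year, month): (max, min, avg)}.

    Assumption: groups are per city and calendar month.
    min_records drops groups with fewer records (task 2);
    cities, if given, restricts the result to that set (task 4).
    """
    groups = defaultdict(list)
    for city, day, temp in rows:
        if cities is None or city in cities:
            groups[(city, day.year, day.month)].append(temp)
    return {
        key: (max(temps), min(temps), mean(temps))
        for key, temps in groups.items()
        if len(temps) >= min_records
    }


def daily_differences(rows):
    """Return (city, date, temp - min, temp - max, temp - avg) per row (task 3)."""
    stats = monthly_stats(rows)
    result = []
    for city, day, temp in rows:
        month_max, month_min, month_avg = stats[(city, day.year, day.month)]
        result.append(
            (city, day, temp - month_min, temp - month_max, temp - month_avg)
        )
    return result


if __name__ == "__main__":
    print(monthly_stats(ROWS))                   # task 1
    print(monthly_stats(ROWS, min_records=2))    # task 2
    print(daily_differences(ROWS))               # task 3
    print(monthly_stats(ROWS, cities={"Rome"}))  # task 4
```

One function with optional `min_records` and `cities` parameters covers tasks 1, 2 and 4, since they differ only in which groups are kept. Task 3 reuses the unfiltered monthly statistics and looks them up for each daily row.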