Skip to content

EugeunieOrz/Analyzing-Data-with-PySpark

Repository files navigation

Analyzing Data with PySpark

Analyzing Data on Python and Apache Spark

The sample data is in table: City, Date, Temperature

Task includes the following:

  1. Calculate maximum, minimum and average temperature for all cities per month.
  2. Calculate maximum, minimum and average temperature for all cities per month, if a given month has at least given number of records.
  3. Calculate the difference between Temperature per day and Min Temperature per month, difference between Temperature per day and Max Temperature per month, difference between Temperature per day and Avg Temperature per month.
  4. Calculate maximum, minimum and average temperature per month for a given set of cities.

Releases

No releases published

Packages

No packages published

Languages