Skip to content

pmandiola/nycsecondavesubway

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Impact Assessment after opening of 2nd Avenue Subway in NYC on taxis

This is an updated version of the work we did for the December 2018 Hackathon at CUSP. This project has been done by Max Brueckner-Humphreys, Devashish Khulbe, Pablo Mandiola and Soham Mody.

We used timeseries analysis and point of change detection to assess the impact of the opening of the NYC Second Ave Subway on the taxi ridership in the area.

Data Processing

The Jupyter Notebook data_processing.ipynb downloads 3 years of NYC yellow taxi data from the TLC and preprocesses it to get a timeseries with daily pickups in the Taxi Zones arround the Second Ave Subway. The TLC uploads a dataset of 1-2gb for each month so the data processing takes a few hours. The final file is yellowtaxi_all.csv.

Data Exploration and Analysis

The Jupyter Notebook data_analysis.ipynb has the analysis of the timeseries: periodicity, seasonality and trend, and finally point of change detection using bayesian inference.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.7%
  • Python 0.3%