Analyzing COVID-19's impact on commercial airlines and their passengers.
Our dashboard presentation is available in the video below.
- Milestone 1
- Add data
- Add project vision
- Milestone 2
- Add analyses
- Fill out README
- Milestone 3
- Complete EDAs
- Revise research questions
- Milestone 4
- Partial analysis pipelines
- Step-by-step pipeine
- Rewrite with method chaining
- Create
project_functions.py
files - Conduct analysis
- Partial analysis pipelines
- Milestone 5
- Update the processed dataset for Tableau Dashboard if needed
- Create Tableau Dashboards
- Record a 10 minute video presenting the dashboard
- Milestone 6
- Address project feedback
- Write final report
- Implement project feedback
- Feedback 1
- Feedback 2
- Feedback 3
-
Feedback 4
- Data loading
- Complex data visualization
- Appropriate titles and labels for all plots
- At least one key insight per plot
- Complex data cleaning, wrangling, processing
- Handle missing values
- Update column names
- Remove unnecessary columns
- Use groupby objects
- Create aggregate dataframe
- Complex research questions
- Narrative with insightful commentary
- Sophisticated Tableau Dashboard
- Use of functions in wrangling, processing, cleaning
- 4+ basic operations
- 3+ advanced operations
- 2+ method chains
- Use of git branches
- Use of GitHub features
- Pull requests
- Code reviews
- Issues
- Extra flair (we hope)
We are interested in how airlines initially responded to COVID-19, how those responses disrupted consumer flights, and how we can use flight and health data to predict flight delays and cancellations during future pandemics.
Despite their sound engineering and safety protocols, airlines are still at the mercy of nature. They contend with various logistical challenges that create turbulence in their operation such as the winter storm in December 2022 that left many UBCO students unable to see their families.
In the wake of this, we are investigating flight delays and cancellations during COVID-19 due to the wealth of data available during that period. By analyzing this data, relating them to specific airlines, and comparing the effects of COVID-19 in different regions, we hope to predict these disruptions.
We are also interested in relating the flight data to notable events in the early timeline of COVID-19, e.g., its first reported fatality, the health emergency declared in the United States, and the second case in Thailand. Spikes in flight delays might relate to such health events, so gaining insight into them might improve our predictions.
Unlike weather issues, which have always been a factor in aviation, an international pandemic would have been an extraordinary circumstance for airlines. Therefore, we aim to understand how airlines handled the pandemic by analyzing their flights near the beginning of the pandemic. The first known case was in December 2019, so any data during or after that month should be relevant.
Our dataset contains a list of flights from January 2020 sourced from the US Government. For each flight, it lists the IATA codes for origin and destination airports, the distance of each flight, the tail number of the aircraft, the date of the flights, and other values. Most notably, it also contains both the planned and actual times of departure and arrival times, and it tells us whether the flight was diverted or canceled. This allows us to find information about flight delays without having to futz around with multiple data sources.
If possible, we will incorporate health data and airport location data in the near future.
- Aakash Tirathdas: Need to fix my sleep schedule.
- Bradan Fleming: I need a hug.
- Kaiden Merchant: I need a car. :(