Skip to content

Commit

Permalink
Updated data processing to be more clear with comments
Browse files Browse the repository at this point in the history
  • Loading branch information
danyoungday committed Jun 6, 2024
1 parent e5d757e commit 8afe5e9
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion use_cases/eluc/app/process_data.py
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,9 @@ def main():
"""
Main function that loads the data and saves it.
"""
dataset = ELUCData(APP_START_YEAR-1, APP_START_YEAR, 2022)
# Subsets the dataset so train_df is from start_year-1 to test year which we discard.
# Then we take the app data as the test def which is from the app start year to the end of the dataset.
dataset = ELUCData(start_year=APP_START_YEAR-1, test_year=APP_START_YEAR)
test_df = dataset.test_df
save_dir = Path("data/processed")
save_dir.mkdir(exist_ok=True)
Expand Down

0 comments on commit 8afe5e9

Please sign in to comment.