This repo accompanies the paper "Missing Data Multiple Imputation for Tabular Q-Learning in Online RL" written for Susan Murphy's Stat 234 class on reinforcement learning at Harvard in Spring 2024 by Kyla Chasalow and Skyler Wu.
Within the src
directory:
LakeWorldEnvironment
contains functions for setting up an instance of our LakeWorld RL environment, instances of which include methods for generating missingness in their statesLakeWorldMain
contains the main function for running the Lake World EnvironmentImputerTools
contains functions for implementing imputation ensemblesRLTools
contains the functions for running our RL pipelineMissingMechanisms
contains general functions for generating missingness that are used in multiple other placesSimulationHelpers
contains functions that aid in running our simulationsLakeWorldTests
contains some tests of various functions in the above scripts
To run our simulation: run pmdi_main_v3_runscript_driver.sh
after making appropriate updates to filepaths. TODO update this
To generate analysis files: run analyzer_main_runscript_driver.sh
and use analyzer.ipynb
to combine the resulting files.
To generate figures: run figures.ipynb