TBQN

The code base for my Master thesis "Transformer based action sequence generation in reinforcement learning settings". It includes fully modularized code to train, Hp optimize and evaluate a DQN agent with a Transformer based architecture as its network (TBQN). The basic network structure is depicted below.

Getting Started

This repository includes the following useful things:

Fully modularized code to Run a DQN agent with a Transformer based architecture as its network (TBQN).
Simple scripts to run TBQN with a mountain of different Parameters and model variations.
Scripts to perform parameter optimization for TBQN using the Optuna Library
The code and the results of the experiments I conducted during my thesis work.
Notebooks that can be used to evaluate and display either single model performance or whole studies.

Build on

TF-agents - The RL library used
Optuna - The HP optimization library

Installing

To install dependencies simply run

pip install -r requirements.txt

You should be good to go.

Example usage

To run an experiment or a study simply run one of the scripts like this:

python experiment_script_3.py --output_dir Acrobot-v1 --env Acrobot-v1

Parameters can be added and changed accordingly to the script.

To evaluate the results of an experiment simply load your results into Result_display.ipynb. Inside the Notebook specify the saved experiment path:

directory = "experiment_scripts/example_result"

To evaluate the results of a study use Study_display.ipynb. or Study_display_2.ipynb.

Authors

Gideon Stein - Initial work - Github

Comments

I am currently working on an PPO version which is not finished yet. Therefore, PPO named files are not finalized neither complete. They should be ignored.
Due to storage I cannot include concrete studies and model runs in this repository. If you are interested in anything additional data, feel free to write me.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
experiment_scripts		experiment_scripts
hp_optm_scripts		hp_optm_scripts
modules		modules
saves		saves
README.md		README.md
approx_Acrobot-v1.npy		approx_Acrobot-v1.npy
approx_CartPole-v0.npy		approx_CartPole-v0.npy
approx_CartPole-v1.npy		approx_CartPole-v1.npy
approx_LunarLander-v2.npy		approx_LunarLander-v2.npy
approx_MountainCar-v0.npy		approx_MountainCar-v0.npy
architecture.png		architecture.png
requirements.txt		requirements.txt
result_display.ipynb		result_display.ipynb
result_display_ppo.ipynb		result_display_ppo.ipynb
study_display.ipynb		study_display.ipynb
study_display_2.ipynb		study_display_2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TBQN

Getting Started

Build on

Installing

Example usage

Authors

Comments

About

Releases

Packages

Contributors 2

Languages

Gideon-Stein/TBQN

Folders and files

Latest commit

History

Repository files navigation

TBQN

Getting Started

Build on

Installing

Example usage

Authors

Comments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages