ibrahim-elshar/Deep_PG: A deep policy gradient implementation (REINFORCE and N-step A2C) for LunarLander-v2

Assignment 3 part 1 for 10-703 Deep Reinforcement Learning and Control - CMU

This repository contains two Python scripts that solve the LunarLander-v2 OpenAI Gym environment with deep reinforcement learning: reinforce.py implements the REINFORCE algorithm (without a baseline), and a2c.py implements the N-step Advantage Actor-Critic (A2C) algorithm.
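
As a rough illustration of what REINFORCE without a baseline computes per episode, the sketch below builds the discounted returns and the Monte Carlo policy-gradient loss in plain Python. It is not taken from reinforce.py: the function names, the default `gamma`, and the choice to average (rather than sum) over time steps are all assumptions made for the example.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t = sum_{k>=0} gamma^k * r_{t+k} for each step t of one episode.

    gamma=0.99 is an assumed default, not a value taken from the repository.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def reinforce_loss(log_probs, returns):
    """Loss whose gradient is the REINFORCE estimate (no baseline):
    -mean_t( G_t * log pi(a_t | s_t) ).
    """
    weighted = [g * lp for g, lp in zip(returns, log_probs)]
    return -sum(weighted) / len(weighted)
```

In a real training loop the log-probabilities would come from the policy network and the loss would be minimized with an optimizer using the `--lr` learning rate.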

Note: the current implementation of a2c.py contains critic network parameters specific to the n=1 A2C algorithm (namely, a 30x30x30 MLP instead of the 20x20x20 MLP used for n=20, 50, and 100).
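
One way to avoid editing a2c.py by hand when switching n is to look up the critic's layer widths from a table. The sketch below is hypothetical and not part of the repository: `CRITIC_HIDDEN_SIZES` and `critic_hidden_sizes` are illustrative names, and reading "30x30x30" as three hidden layers of 30 units each is an assumption.

```python
# Hidden-layer widths per n, as listed in the note above.
# Assumption: "30x30x30" means three hidden layers of 30 units each.
CRITIC_HIDDEN_SIZES = {
    1: (30, 30, 30),
    20: (20, 20, 20),
    50: (20, 20, 20),
    100: (20, 20, 20),
}


def critic_hidden_sizes(n):
    """Return the critic's hidden-layer widths for a given n.

    Only the four n values mentioned in the note are known; anything else
    raises instead of guessing an architecture.
    """
    try:
        return CRITIC_HIDDEN_SIZES[n]
    except KeyError:
        raise ValueError(f"no critic architecture recorded for n={n}") from None
```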

usage: reinforce.py [-h] [--model-config-path MODEL_CONFIG_PATH]
                    [--num-episodes NUM_EPISODES] [--lr LR]
                    [--render | --no-render]

optional arguments:
  -h, --help            show this help message and exit
  --model-config-path MODEL_CONFIG_PATH
                        Path to the model config file.
  --num-episodes NUM_EPISODES
                        Number of episodes to train on.
  --lr LR               The learning rate.
  --render              Whether to render the environment.
  --no-render           Whether to render the environment.
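
For reference, a parser along the following lines would produce help output of the shape shown above. This is a reconstruction from the help text, not the code in reinforce.py: the default values (config path, episode count, learning rate, rendering off) are placeholders chosen for the example.

```python
import argparse


def build_parser():
    """Build a parser matching the reinforce.py usage text (defaults are assumed)."""
    parser = argparse.ArgumentParser(prog="reinforce.py")
    parser.add_argument("--model-config-path", dest="model_config_path", type=str,
                        default="LunarLander-v2-config.json",  # assumed default
                        help="Path to the model config file.")
    parser.add_argument("--num-episodes", dest="num_episodes", type=int,
                        default=50000,  # placeholder value
                        help="Number of episodes to train on.")
    parser.add_argument("--lr", dest="lr", type=float,
                        default=5e-4,  # placeholder value
                        help="The learning rate.")

    # --render and --no-render write to the same destination and cannot be combined,
    # which is what "[--render | --no-render]" in the usage line indicates.
    group = parser.add_mutually_exclusive_group(required=False)
    group.add_argument("--render", dest="render", action="store_true",
                       help="Whether to render the environment.")
    group.add_argument("--no-render", dest="render", action="store_false",
                       help="Whether to render the environment.")
    parser.set_defaults(render=False)  # assumed: rendering is off unless requested
    return parser
```

Calling `build_parser().parse_args(["--lr", "0.001", "--render"])` yields a namespace with `lr == 0.001` and `render == True`.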

=========================================================

usage: a2c.py [-h] [--model-config-path MODEL_CONFIG_PATH]
              [--num-episodes NUM_EPISODES] [--lr LR] [--critic-lr CRITIC_LR]
              [--n N] [--render | --no-render]

optional arguments:
  -h, --help            show this help message and exit
  --model-config-path MODEL_CONFIG_PATH
                        Path to the actor model config file.
  --num-episodes NUM_EPISODES
                        Number of episodes to train on.
  --lr LR               The actor's learning rate.
  --critic-lr CRITIC_LR
                        The critic's learning rate.
  --n N                 The value of N in N-step A2C.
  --render              Whether to render the environment.
  --no-render           Whether to render the environment.
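
To make the role of `--n` concrete, the sketch below computes N-step bootstrapped targets for one finished episode, in plain Python. It is an illustration under stated assumptions rather than the code in a2c.py: `n_step_targets` is a hypothetical name, `gamma=0.99` is an assumed default, and the value of any state past the end of the episode is taken to be 0.

```python
def n_step_targets(rewards, values, n, gamma=0.99):
    """N-step targets R_t = sum_{k=0}^{n-1} gamma^k r_{t+k} + gamma^n V(s_{t+n}).

    rewards[t] is the reward after step t, values[t] is the critic's V(s_t).
    V is treated as 0 once t + n runs past the end of the episode (assumption).
    """
    T = len(rewards)
    targets = []
    for t in range(T):
        end = min(t + n, T)
        # Discounted sum of up to n real rewards.
        g = sum(gamma ** (k - t) * rewards[k] for k in range(t, end))
        # Bootstrap from the critic only if s_{t+n} is still inside the episode.
        if t + n < T:
            g += gamma ** n * values[t + n]
        targets.append(g)
    return targets
```

The advantage used to weight the actor's log-probabilities would then be `targets[t] - values[t]`, and the critic would be regressed toward `targets[t]`. With n at least as large as the episode length, the targets reduce to plain Monte Carlo returns.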

Repository files:
  .DS_Store
  .gitignore
  LunarLander-v2-config.json
  README.md
  a2c.py
  reinforce.py
  requirements.txt
