YADQN

YADQN is yet another implementation of Deep Q-Networks (DQN) described in the Deepmind paper Playing Atari with Deep Reinforcement Learning.

In this repository, DQN is used to control the CartPole-v1 environment on the openai gym. To do that, a Q-function is implemented in Keras to map a state into the relative value of applying a LEFT or a RIGHT force to the cart. The Q-function is then trained by minimising the TD error from a set of S,A,R,S' tuples randomly sampled from an experience replay buffer. Lastly, an e-greedy policy selects an action for a given state to control the cart.

Results

On most runs, the algorithm was able to learn to balance the cart after a few hundred episodes:

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
__init__.py		__init__.py
agent.py		agent.py
cart_pole.py		cart_pole.py
experience_replay.py		experience_replay.py
policy.py		policy.py
value_function.py		value_function.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YADQN

Results

Still learning:

Getting there...

Nailed it!

About

Releases

Packages

Languages

tomstewart89/YADQN

Folders and files

Latest commit

History

Repository files navigation

YADQN

Results

Still learning:

Getting there...

Nailed it!

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages