This package provides a Chainer implementation of Double DQN, as described in *Deep Reinforcement Learning with Double Q-learning*.
This is the code implemented for this article.
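The key difference from vanilla DQN is how the bootstrap target is computed: the online network selects the greedy next action and the target network evaluates it. Below is a minimal, illustrative sketch of that target computation in plain NumPy (the function and argument names are hypothetical and may not match this repository's actual code):

```python
import numpy as np

def double_dqn_targets(q_online_next, q_target_next, rewards, terminals, gamma=0.99):
    """Compute Double DQN bootstrap targets.

    q_online_next, q_target_next: (batch, n_actions) Q-values for the next
    states from the online and target networks, respectively.
    rewards, terminals: (batch,) arrays; terminals is 1.0 for terminal states.
    """
    # Online network picks the greedy action for the next state ...
    greedy_actions = np.argmax(q_online_next, axis=1)
    # ... and the target network evaluates that action.
    evaluated = q_target_next[np.arange(len(greedy_actions)), greedy_actions]
    # Terminal states bootstrap from zero.
    return rewards + gamma * (1.0 - terminals) * evaluated
```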
- Arcade Learning Environment (ALE)
- RL-Glue
- RL-Glue Python codec
- Atari 2600 VCS ROM Collection
- Chainer 1.6+
- Seaborn
- Pandas
For setting up the environment, the article DQN-chainerリポジトリを動かすだけ is a useful reference.
e.g. Atari Breakout
Open four terminal windows and run the following commands, one set per terminal:
Terminal #1

```
rl_glue
```

Terminal #2

```
cd path_to_deep-q-network
python experiment.py --csv_dir breakout/csv --plot_dir breakout/plot
```

Terminal #3

```
cd path_to_deep-q-network/breakout
python train.py
```

Terminal #4

```
cd /home/your_name/ALE
./ale -game_controller rlglue -use_starting_actions true -random_seed time -display_screen true -frame_skip 4 -send_rgb true /path_to_rom/breakout.bin
```
The specs of the machine used for the experiments are as follows:
| Component | Spec |
| --- | --- |
| OS | Ubuntu 14.04 LTS |
| CPU | Core i7 |
| RAM | 16GB |
| GPU | GTX 970M 6GB |
We extract the luminance from the RGB frame and rescale it to 84x84.
Then we stack the 4 most recent frames to produce the input to the DQN.
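Here is a minimal sketch of this preprocessing using OpenCV and NumPy; the function names and the use of a deque for the frame history are illustrative assumptions, not the repository's actual API:

```python
import collections
import cv2
import numpy as np

def preprocess(rgb_frame):
    # Extract luminance (grayscale) and rescale to 84x84.
    gray = cv2.cvtColor(rgb_frame, cv2.COLOR_RGB2GRAY)
    return cv2.resize(gray, (84, 84), interpolation=cv2.INTER_LINEAR)

# Keep the 4 most recent preprocessed frames as the DQN input.
frame_history = collections.deque(maxlen=4)

def make_state(rgb_frame):
    frame_history.append(preprocess(rgb_frame))
    while len(frame_history) < 4:            # pad at the start of an episode
        frame_history.append(frame_history[-1])
    return np.stack(frame_history, axis=0)   # shape: (4, 84, 84)
```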
We trained Double DQN for a total of 46 hours (7600 episodes, 95 epochs, 4791K frames).
We trained Double DQN for a total of 55 hours (1500 episodes, 99 epochs, 4964K frames).