Overview

This is an A3C (Asynchronous Advantage Actor-Critic) method for solving the Find-the-Goal minitask in the Minecraft (MARLO) environment.

Technologies Used

ChainerRL
PyTorch
Autoencoding

How it works

The core implementation uses A3C, which combines policy and value estimation in one architecture. We used ChainerRL's A3C implementation, with slight changes to incorporate our autoencoder's encoder model, discussed below.
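To illustrate how a single A3C update ties the policy (actor) and value (critic) together, here is a minimal pure-Python sketch of the standard n-step return, advantage, and loss computation. It is not ChainerRL's code: the function names `n_step_returns` and `a3c_losses`, their parameters, and the default discount factor are assumptions made for this example, and the entropy bonus that A3C usually adds is left out for brevity.

```python
def n_step_returns(rewards, bootstrap_value, gamma=0.99):
    """Discounted returns R_t = r_t + gamma * R_{t+1}.

    The recursion is seeded with the critic's value estimate for the state
    that follows the last reward (use 0.0 if the episode terminated).
    """
    returns = []
    running = bootstrap_value
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def a3c_losses(rewards, values, log_probs, bootstrap_value, gamma=0.99):
    """Return (policy_loss, value_loss) for one rollout.

    rewards   -- rewards r_t collected during the rollout
    values    -- critic estimates V(s_t) for the visited states
    log_probs -- log pi(a_t | s_t) of the actions actually taken
    """
    returns = n_step_returns(rewards, bootstrap_value, gamma)
    # Advantage: how much better the observed return was than the critic expected.
    advantages = [ret - val for ret, val in zip(returns, values)]
    # Actor: raise the log-probability of actions with positive advantage.
    policy_loss = -sum(lp * adv for lp, adv in zip(log_probs, advantages))
    # Critic: regress V(s_t) toward the observed n-step return.
    value_loss = 0.5 * sum(adv ** 2 for adv in advantages)
    return policy_loss, value_loss
```

In a real implementation the advantage is treated as a constant when differentiating the policy loss, and both losses are backpropagated through the shared network by each asynchronous worker.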

Autoencoder

We used a (non-variational) autoencoder to compress the frame inputs by a factor of 250. The structure of this autoencoder is defined in the file AutoEncoderModels.py in the path /chainerrl_autoencoder/experimetes_ae/. It involves 10 residual blocks and 2 deconvolution layers. chainerrl_autoencoder is essentially the original implementation of ChainerRL, with small dimensional changes to account for the autoencoder.
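As a rough illustration of what a 250-fold compression means, the sketch below computes the ratio between the number of values in an input frame and in its encoded representation. The shapes used here are hypothetical, chosen only so that the arithmetic works out to 250; the actual frame and latent dimensions are defined in AutoEncoderModels.py and are not stated in this README.

```python
from math import prod


def compression_factor(input_shape, latent_shape):
    """Ratio of input element count to latent element count."""
    return prod(input_shape) / prod(latent_shape)


# Hypothetical shapes, NOT taken from the repository:
frame_shape = (3, 100, 100)  # an RGB frame of 100 x 100 pixels (assumed)
latent_shape = (120,)        # a flat latent vector of 120 values (assumed)

factor = compression_factor(frame_shape, latent_shape)
```

With these assumed shapes, 30,000 input values are reduced to 120, so the A3C network receives an input 250 times smaller than the raw frame.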

The following are compressed images of the Minecraft environment by the autoencoder:

Results

Results for training the A3C model with the autoencoder on the Find-the-Goal minitask.

hsezhiyan/MARLO_A3C