A generic MDP gym environment.

I am building this environment primarily for my Reinforcement Learning research. Purpose of this python module is to enable creation and simulation of Markov Decision Processes.

The environment is accessible through the OpenAI gym wrapper. An example to use it as follows.

import gym
import mdp_environment

env = gym.make("mdp-v0")
env.reset()
for _ in range(1000):
    _, _, done, _ = env.step(env.action_space.sample())
    if done:
        env.reset()

There are two custom MDP environments with the following details.

mdp-v0:
mdp-v1:

The MDP chain looks like this. For both the MPDs, the parameter N and p are adjustible.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

A generic MDP gym environment.

Files

README.md

Latest commit

History

README.md

File metadata and controls

A generic MDP gym environment.