Skip to content

Rishav1/mdp_environment

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A generic MDP gym environment.

I am building this environment primarily for my Reinforcement Learning research. Purpose of this python module is to enable creation and simulation of Markov Decision Processes.

The environment is accessible through the OpenAI gym wrapper. An example to use it as follows.

import gym
import mdp_environment

env = gym.make("mdp-v0")
env.reset()
for _ in range(1000):
    _, _, done, _ = env.step(env.action_space.sample())
    if done:
        env.reset()

There are two custom MDP environments with the following details.

  • mdp-v0:

    • S:

    • A:

    • T:

    • R:

    • P:

    • γ: 1

  • mdp-v1:

    • S:

    • A:

    • T:

    • R:

    • P:

    • γ: 1

The MDP chain looks like this. For both the MPDs, the parameter N and p are adjustible. MDP transtion

About

A general Markov Decision Process environment.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages