NES

Simple Feed Forward Example of a Natural Evolution Strategy as described by the Evolution Strategies as a Scalable Alternative to Reinforcement Learning paper authored by OpenAI

This is a good example of a modular separation between environments, training, model building, etc. in order to allow for a plug-and-play style of machine learning.

Only Config.yaml should be changed between different training examples. Major code changes should come with a version number.

Known Issues

- Converges to sub-optimal policy of moving right until stuck.
- Resolve gaussian standardization of rewards when std dev of rewards is 0 (Converged to policy).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

NES

Known Issues

Files

README.md

Latest commit

History

README.md

File metadata and controls

NES

Known Issues