Algorithms (DDPG, TD3), Tests, Docs, and Environment overhaul
Phew, that's been a journey.
Features:
- Base algorithms (DDPG, TD3) are added and tested
- Environments are completely redone and can now be used with your own data
- Online tutorial
- Code Climate grades my code B
- CircleCI tests written
- Something resembling documentation. It will be more complete soon.
Coming soon:
- PyPI page
- BCQ implementation will be stress tested and tweaked
- Top-K Off-Policy Correction