-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HER Wrappers #340
base: master
Are you sure you want to change the base?
HER Wrappers #340
Conversation
This pull request introduces 7 alerts when merging acd87f6 into 9b7400e - view on LGTM.com new alerts:
|
Codecov Report
@@ Coverage Diff @@
## master #340 +/- ##
==========================================
- Coverage 91.22% 90.45% -0.77%
==========================================
Files 89 93 +4
Lines 3772 4014 +242
==========================================
+ Hits 3441 3631 +190
- Misses 331 383 +52
|
This pull request introduces 2 alerts when merging 06cb5a7 into 9b7400e - view on LGTM.com new alerts:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome work! @hades-rp2010
Few questions:
- Have you tried training?
- Does the wrapper work with all off policy algorithms?
- Have you checked out https://github.com/eleurent/highway-env? Highway is a standard goal based env.
from genrl.trainers import OffPolicyTrainer | ||
|
||
|
||
class HERTrainer(OffPolicyTrainer): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Might need to refactor this -- arent there functions here which are exact duplicates of the ones in OffPolicyTrainer
?
from genrl.trainers.her_trainer import HERTrainer | ||
|
||
|
||
class BitFlippingEnv(GoalEnv): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should this be in the tests?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe we could move this somewhere in the environments
module.
|
This pull request introduces 5 alerts when merging b654d8c into 147d373 - view on LGTM.com new alerts:
|
Add |
This pull request introduces 5 alerts when merging da535e2 into 147d373 - view on LGTM.com new alerts:
|
This pull request introduces 5 alerts when merging 9edae64 into 147d373 - view on LGTM.com new alerts:
|
Are you done here? @hades-rp2010 If you can resolve the merge conflicts and maybe the codeclimate issues then we can merge this. |
I think the algorithms weren’t training here. Please ensure you get a reasonable reward :)
… On 12-Oct-2020, at 12:17 PM, Sampreet ***@***.***> wrote:
Are you done here? @hades-rp2010 <https://github.com/hades-rp2010> If you can resolve the merge conflicts and maybe the codeclimate issues then we can merge this.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub <#340 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AH72FJYHZVDUIXFRDDDOUELSKKQ7TANCNFSM4RJ6TMOA>.
|
Wrt #171
Have added a
HERTrainer
,HERGoalEnvWrapper
, and aHERWrapper
for the replay buffer.Some changes in the locations of the tests might be needed.. Wasnt too sure of where to put them