You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
I have a question about the entropy coefficient c2 of PPO and its standard value in SIMPLE.
In the original paper, the "standard" value is c2=0.01 but in SIMPLE its set to c2=0.1
"parser.add_argument("--entcoeff", "-ent", type = float, default = 0.1, help="The entropy coefficient in PPO")"
Is there a reason to put the standard value so high in SIMPLE? I am currently trying to tune that value and I am just curious.
Hi,
I have a question about the entropy coefficient c2 of PPO and its standard value in SIMPLE.
In the original paper, the "standard" value is c2=0.01 but in SIMPLE its set to c2=0.1
"parser.add_argument("--entcoeff", "-ent", type = float, default = 0.1, help="The entropy coefficient in PPO")"
Is there a reason to put the standard value so high in SIMPLE? I am currently trying to tune that value and I am just curious.
Kind regards,
Markus
(Paper: https://arxiv.org/pdf/1707.06347.pdf)
The text was updated successfully, but these errors were encountered: