When training fullplace #8

PKUTAN · 2022-04-20T04:00:59Z

when I train, it do not seem to coverge

Rogerity · 2022-05-06T13:57:32Z

Due to the immense action space, placement is actually a difficult RL learning task, so it is common to face such situation. There are several possible reasons for your curve, such as lack of pre-training or overfitting. To make training more stable, tuning hyperparameters such as reward funtion and learning rate may be helpful as well.

PKUTAN · 2022-05-18T10:50:17Z

Can you show the hyperparameters to train fullplace environment?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

When training fullplace #8

When training fullplace #8

PKUTAN commented Apr 20, 2022

Rogerity commented May 6, 2022

PKUTAN commented May 18, 2022

When training fullplace #8

When training fullplace #8

Comments

PKUTAN commented Apr 20, 2022

Rogerity commented May 6, 2022

PKUTAN commented May 18, 2022