v0.4.6
API Change
- middleware:
CkptSaver(cfg, policy, train_freq=100)
->CkptSaver(policy, cfg.exp_name, train_freq=100)
Env
- add metadrive env and related ppo config (#574)
- add acrobot env and related dqn config (#577)
- add carracing in box2d (#575)
- add new gym hybrid viz (#563)
- update cartpole IL config (#578)
Algorithm
Enhancement
Fix
- fix to_device and prev_state bug when using ttorch (#571)
- fix py38 and numpy unittest bugs (#565)
- fix typo in contrastive_loss.py (#572)
- fix dizoo envs pkg installation bugs
- fix multi_trainer middleware unittest bug
Style
- add evogym docker (#580)
- fix metaworld docker bug
- fix setuptools high version incompatibility bug
- extend treetensor lowest version
New Paper
- GoBigger: [ICLR 2023] A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation
Contributors: @PaParaZz1 @puyuan1996 @timothijoe @Cloud-Pku @ruoyuGao @Super1ce @karroyan @kxzxvbk @eltociear