v0.4.6

PaParaZz1 released this 18 Feb 13:49

· 207 commits to main since this release

API Change

middleware: CkptSaver(cfg, policy, train_freq=100) -> CkptSaver(policy, cfg.exp_name, train_freq=100)

Env

add metadrive env and related ppo config (#574)
add acrobot env and related dqn config (#577)
add carracing in box2d (#575)
add new gym hybrid viz (#563)
update cartpole IL config (#578）

Algorithm

add BDQ algorithm (#558)
add procedure cloning model (#573)

Enhancement

add simplified PPOF (PPO × Family) interface (#567) (#568) (#581) (#582)

Fix

fix to_device and prev_state bug when using ttorch (#571)
fix py38 and numpy unittest bugs (#565)
fix typo in contrastive_loss.py (#572)
fix dizoo envs pkg installation bugs
fix multi_trainer middleware unittest bug

Style

add evogym docker (#580)
fix metaworld docker bug
fix setuptools high version incompatibility bug
extend treetensor lowest version

New Paper

GoBigger: [ICLR 2023] A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation

Contributors: @PaParaZz1 @puyuan1996 @timothijoe @Cloud-Pku @ruoyuGao @Super1ce @karroyan @kxzxvbk @eltociear

Contributors

eltociear, karroyan, and 7 other contributors

Assets 2