Skip to content

v0.4.6

Compare
Choose a tag to compare
@PaParaZz1 PaParaZz1 released this 18 Feb 13:49
· 207 commits to main since this release

API Change

  1. middleware: CkptSaver(cfg, policy, train_freq=100) -> CkptSaver(policy, cfg.exp_name, train_freq=100)

Env

  1. add metadrive env and related ppo config (#574)
  2. add acrobot env and related dqn config (#577)
  3. add carracing in box2d (#575)
  4. add new gym hybrid viz (#563)
  5. update cartpole IL config (#578

Algorithm

  1. add BDQ algorithm (#558)
  2. add procedure cloning model (#573)

Enhancement

  1. add simplified PPOF (PPO × Family) interface (#567) (#568) (#581) (#582)

Fix

  1. fix to_device and prev_state bug when using ttorch (#571)
  2. fix py38 and numpy unittest bugs (#565)
  3. fix typo in contrastive_loss.py (#572)
  4. fix dizoo envs pkg installation bugs
  5. fix multi_trainer middleware unittest bug

Style

  1. add evogym docker (#580)
  2. fix metaworld docker bug
  3. fix setuptools high version incompatibility bug
  4. extend treetensor lowest version

New Paper

  1. GoBigger: [ICLR 2023] A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation

Contributors: @PaParaZz1 @puyuan1996 @timothijoe @Cloud-Pku @ruoyuGao @Super1ce @karroyan @kxzxvbk @eltociear