diff --git a/README.md b/README.md
index 69dd40b..9a3a407 100644
--- a/README.md
+++ b/README.md
@@ -138,6 +138,11 @@ format:
 (Click to Collapse)
+- [Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent](https://proceedings.mlr.press/v235/li24by.html)
+  - Yingru Li, Jiawei Xu, Lei Han, Zhi-Quan Luo
+  - Key: ensembles, Thompson sampling, scalable exploration, regret analysis, complexity theory
+  - ExpEnv: Atari, DeepSea
+
 - [ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization](https://openreview.net/pdf?id=OwtMhMSybu)
   - Tianying Ji, Yongyuan Liang, Yan Zeng, Yu Luo, Guowei Xu, Jiawei Guo, Ruijie Zheng, Furong Huang, Fuchun Sun, Huazhe Xu
   - Key: Off-policy Actor-critic with Causality-aware Entropy regularization, exploration, causality-aware entropy regularization