Hi, I've run into some confusion while trying to reproduce your work, particularly experiment (1) on the Gaussian squeeze task. As I understand it, to implement the MAA2C algorithm as described in DeepMind's NeurIPS 2017 paper, the critic network should represent the Q-value function, which takes the joint action of the players as input. However, the Gaussian squeeze task appears to be a stateless environment. According to your implementation details, there is a discount factor \gamma for the AC methods but not for the Q-learning method. So how do you define the state for the Gaussian squeeze task? And if it is stateless, how can one use A2C methods at all?
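To make the question concrete, here is a minimal sketch (not the authors' code) of the workaround I would expect: treat the stateless task as a one-step episodic MDP whose "state" is a single constant dummy observation, so the actor's logits and the critic's value are unconditioned and \gamma never actually enters the return. The environment below is a hypothetical single-agent Gaussian squeeze with made-up constants (`mu`, `sigma`, `n_actions`); the real multi-agent setup would presumably feed the joint action into the critic rather than using a scalar baseline.

```python
# Minimal A2C-style sketch on a stateless task, modelled as a one-step MDP
# with a constant dummy state. All constants here are illustrative.
import numpy as np

rng = np.random.default_rng(0)
n_actions = 10          # each action is a different "resource" level
mu, sigma = 5.0, 1.0    # assumed Gaussian-squeeze parameters

def reward(a):
    # Gaussian squeeze: x * exp(-((x - mu) / sigma)^2), with x the chosen level
    x = float(a)
    return x * np.exp(-((x - mu) / sigma) ** 2)

theta = np.zeros(n_actions)  # actor logits for the single dummy state
v = 0.0                      # critic: value of the dummy state (a scalar)
lr_actor, lr_critic = 0.1, 0.1

for step in range(2000):
    probs = np.exp(theta - theta.max())
    probs /= probs.sum()
    a = rng.choice(n_actions, p=probs)
    r = reward(a)

    # One-step episode: the return is just r, so advantage = r - V(dummy).
    adv = r - v
    grad_logp = -probs           # d/dtheta log softmax(theta)[a] = e_a - probs
    grad_logp[a] += 1.0
    theta += lr_actor * adv * grad_logp   # policy-gradient (actor) update
    v += lr_critic * adv                  # baseline (critic) update

print("learned policy mode:", probs.argmax(), "value estimate:", round(v, 3))
```

Under this reading, the episodes have length one, so the discount factor is vacuous for the AC methods and the critic degenerates to a constant baseline. Is that what your implementation does, or is the state defined some other way?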