Adding Truncated Quantile Critics #58
Hi everyone! I am modifying tmrl (my repo) by adding a custom interface, a new model, new inputs from my custom plugin (link to plugin here), and more wandb features such as visualizing the model architecture and gradients. I am trying to add TQC (Truncated Quantile Critics) (paper here). However, because the Q-functions in TQC produce a larger output, I get a dimensionality problem that I have not been able to fix:
The PyTorch code for TQC is available here. Could someone help me fix this issue or advise me on what to do?
Replies: 1 comment 2 replies
Hi, this sounds like a cool project!
I am not very familiar with the TQC algorithm but it looks promising, it would be cool if you could push the implementation to tmrl when this works :)
I see that SB3-contrib also has an implementation of TQC.
For the error you get, it looks like one of your tensors (`r`, `d`, `sorted_z_part`, `next_log_pi`) is missing the batch dimension for some reason?
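To make that concrete, here is a minimal sketch of what such a shape mismatch could look like and one way to guard against it. It is not taken from tmrl or from the TQC code: the function name `tqc_target`, the default `gamma` and `alpha` values, and the shapes are all assumptions. I am assuming `r`, `d`, and `next_log_pi` arrive as 1-D tensors of shape `(batch,)` while `sorted_z_part` has shape `(batch, n_kept)`, so the per-sample tensors need a trailing dimension before they can broadcast across the quantiles.

```python
import torch


def tqc_target(r, d, sorted_z_part, next_log_pi, gamma=0.99, alpha=0.2):
    """Hypothetical helper: build target quantiles of shape (batch, n_kept).

    Assumes sorted_z_part is (batch, n_kept) and that r, d, next_log_pi
    hold one value per sample, as either (batch,) or (batch, 1).
    """
    batch = sorted_z_part.shape[0]
    # Give per-sample tensors an explicit (batch, 1) shape so that they
    # broadcast over the quantile dimension instead of clashing with it.
    r = r.reshape(batch, 1)
    d = d.reshape(batch, 1).float()
    next_log_pi = next_log_pi.reshape(batch, 1)
    return r + gamma * (1.0 - d) * (sorted_z_part - alpha * next_log_pi)


# Toy shapes, chosen only for illustration.
batch, n_kept = 4, 6
r = torch.ones(batch)                       # rewards, shape (4,)
d = torch.zeros(batch)                      # done flags, shape (4,)
next_log_pi = torch.zeros(batch)            # log-probs, shape (4,)
sorted_z_part = torch.zeros(batch, n_kept)  # kept quantiles, shape (4, 6)

target = tqc_target(r, d, sorted_z_part, next_log_pi)
print(target.shape)  # torch.Size([4, 6])
```

Without the reshape, `r + sorted_z_part` with these shapes raises a `RuntimeError`, because PyTorch aligns the trailing dimensions (4 against 6). If the batch size happened to equal the number of kept quantiles, it would broadcast silently along the wrong axis, so printing the shapes of all four tensors right before the failing line is probably the quickest check.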