Sending human created episodes to trainer to speed up model training #66
Replies: 2 comments
-
Hi, this is a cool project, but I can see the main technical challenge here: you have to act as an This is doable, but you will have to decouple the game from your gamepad/keyboard input. In other words, you need to make sure that whatever inputs you send have no effect in the game and are instead intercepted by your program, which then translates them into actions for the You also need to be careful about which action corresponds to which observation: don't forget that, in the real-time setting, your neural network needs one time-step to compute an action (this will be natural if you use something like the
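To make the action/observation alignment concrete, here is a minimal sketch of a recording loop with a one-step action delay. Everything in it is hypothetical and only for illustration: `DummyEnv` stands in for the game, and `read_human_action` stands in for whatever function returns the intercepted controller input. It is not the project's actual interface.

```python
class DummyEnv:
    """Hypothetical stand-in for the game: the observation is a step counter."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        reward = float(action)  # toy reward, only here to have a value to store
        done = self.t >= self.horizon
        return self.t, reward, done


def record_episode(env, read_human_action, default_action=0):
    """Record one episode in which each chosen action is applied one step later."""
    transitions = []
    obs = env.reset()
    pending = default_action  # action applied during the current time-step
    done = False
    while not done:
        # The human picks the NEXT action while the pending one is being applied,
        # mirroring the one time-step a network needs to compute its action.
        chosen = read_human_action(obs)
        next_obs, reward, done = env.step(pending)
        transitions.append({
            "obs": obs,
            "applied": pending,
            "chosen": chosen,
            "reward": reward,
            "next_obs": next_obs,
            "done": done,
        })
        obs, pending = next_obs, chosen
    return transitions


episode = record_episode(DummyEnv(), lambda obs: obs + 10)
```

Storing both the applied and the chosen action keeps the recorded data consistent with what a delayed policy would have experienced, so the two can be matched up later however the trainer expects.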
-
I've already captured the input from an Xbox controller and converted it to the action order. I've also changed the step function to get the action from the controller instead of from the model. However, since this functionality is quite difficult and the end result is uncertain, I am switching to writing IQN (TQC performs quite poorly in Trackmania, either because of its different purpose or because my code is poorly written).
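For anyone attempting the same thing, the two changes described above could look roughly like this. This is only a sketch under assumptions: the `[gas, brake, steer]` ordering, the value ranges, the deadzone, and the names `controller_to_action`, `make_human_step`, `env_step`, and `read_controller` are all hypothetical, not taken from the project.

```python
def _clamp(value, low, high):
    return max(low, min(high, value))


def controller_to_action(right_trigger, left_trigger, stick_x, deadzone=0.05):
    """Map raw controller readings to an assumed [gas, brake, steer] action."""
    gas = _clamp(right_trigger, 0.0, 1.0)
    brake = _clamp(left_trigger, 0.0, 1.0)
    # Ignore tiny stick movements so a resting thumb does not steer the car.
    steer = 0.0 if abs(stick_x) < deadzone else _clamp(stick_x, -1.0, 1.0)
    return [gas, brake, steer]


def make_human_step(env_step, read_controller):
    """Wrap a step function so the action comes from the controller, not the model."""

    def step(_model_action=None):
        action = controller_to_action(*read_controller())
        # Return the action as well, so it can be stored with the transition.
        return env_step(action), action

    return step


# Usage with stand-ins for the real step function and controller reader.
human_step = make_human_step(lambda a: ("obs", sum(a)), lambda: (0.5, 0.25, -0.5))
result, used_action = human_step()
```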
-
Hi! I'm currently working on incorporating my own episodes into the replay buffer to improve critic training, but I'm facing some major challenges in the process. :( So at least I can share the idea: record human-made episodes at the beginning of the critic's training to make training more time-efficient.
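The idea can be sketched with a plain in-memory buffer. This is an assumption-laden illustration, not the project's replay memory: `ReplayBuffer`, `prefill_with_human_episodes`, and the `(obs, action, reward, next_obs, done)` tuple layout are all hypothetical.

```python
import random
from collections import deque


class ReplayBuffer:
    """Minimal FIFO replay buffer holding transition tuples."""

    def __init__(self, capacity):
        self.data = deque(maxlen=capacity)

    def add(self, transition):
        self.data.append(transition)

    def sample(self, batch_size):
        # Uniform sampling without replacement, capped at the buffer size.
        return random.sample(list(self.data), min(batch_size, len(self.data)))

    def __len__(self):
        return len(self.data)


def prefill_with_human_episodes(buffer, human_episodes):
    """Add every transition of every recorded human episode before training starts."""
    added = 0
    for episode in human_episodes:
        for transition in episode:
            buffer.add(transition)
            added += 1
    return added


# Two tiny recorded episodes: (obs, action, reward, next_obs, done).
human_episodes = [
    [(0, 1, 0.5, 1, False), (1, 0, 1.0, 2, True)],
    [(0, 0, 0.0, 1, True)],
]
buffer = ReplayBuffer(capacity=100)
added = prefill_with_human_episodes(buffer, human_episodes)
batch = buffer.sample(2)
```

With the buffer seeded this way, the critic's first updates would draw on human transitions instead of waiting for the policy to collect its own.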