Skip to content

boyuandeng/rl_locomotion_go1

Repository files navigation

RL_LOCOMOTION For Go1-NX

This repo builds on the code from rl_locomotion

Usage

Completely follow the steps of the original warehouse for installation and use

Depending on each individual's environment, 
there may be some minor issues when configuring the environment,
but you can solve your problems through Google

Change

First

Reset rewards to correct unstable standing issues. see video: IMAGE ALT TEXT HERE

second

Crop the Go1 description file provided by Unitree to adapt to the old version of raisim

Third

Updated jt_mean_pos in this file and this

Results

I conducted experimental comparisons, including modifying different hyperparameters and adjusting rewards.

The blue curve represents the results of training the initial code under a1, while the other curves represent the results of training under go1. The purple and yellow curves represent the training results when the RL_coeff parameters are 0.2 and 0.5, respectively, with a greater error compared to the original curve. The green curve is the result of setting the RL parameter to 0.8 and modifying the reward. It can be seen that only the green curve is closest to the original training result

Figure 1

Figure 2

About

This repo builds on the code from rl_locomotion for Go1

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published