jannerm

Highlights

  • Pro

Pinned

  1. diffuser Public

    Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

    Python, 911 stars, 145 forks

  2. trajectory-transformer Public

    Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

    Python, 466 stars, 65 forks

  3. gamma-models Public

    Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"

    Python, 43 stars, 8 forks

  4. mbpo Public

    Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"

    Python, 477 stars, 83 forks

  5. ddpo Public

    Code for the paper "Training Diffusion Models with Reinforcement Learning"

    Python, 367 stars, 26 forks

  6. berkeleydeeprlcourse/homework_fall2020 Public

    Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2020)

    Jupyter Notebook, 250 stars, 246 forks