Skip to content
View sanowl's full-sized avatar
👽
👽

Block or report sanowl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. LSLM-Listening-while-Speaking-Language-Model LSLM-Listening-while-Speaking-Language-Model Public

    LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…

    Python 56 6

  2. Self-Correcting-LLM--Reinforcement-Learning- Self-Correcting-LLM--Reinforcement-Learning- Public

    This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by google

    Python 22 5

  3. THINKING-LLMS THINKING-LLMS Public

    this is based on the paper THINKING LLMS: GENERAL INSTRUCTION FOLLOWING WITH THOUGHT GENERATION I might add new stuff that is not related to the paper

    Python 4 1

  4. ConsistAI ConsistAI Public

    evaluation methods for knowledge editing

    Python

  5. Self-Taught-Evaluator Self-Taught-Evaluator Public

    this is based on the paper Self-Taught Evaluators

    Python 6 1

  6. OmegaPRM OmegaPRM Public

    this is an implementation for the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google deepmind

    Python 18 1