Zyphra

Popular repositories

  1. BlackMamba Public

    Code repository for Black Mamba

    Python 234 18

  2. Zamba2 Public

    PyTorch implementation of models from the Zamba2 series.

    Python 166 17

  3. tree_attention Public

    Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

    Python 109 5

  4. transformers_zamba2 Public

    Python 42 1

  5. Zyda_processing Public

    Python 31 1

  6. zcookbook Public

    Training hybrid models for dummies.

    Python 16 1

Repositories

Showing 10 of 22 repositories
  • transformers_zamba Public Forked from huggingface/transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Python 3 Apache-2.0 27,892 0 0 Updated Jan 7, 2025
  • transformers_zamba2 Public
    Python 42 Apache-2.0 1 6 0 Updated Dec 19, 2024
  • zcookbook Public

    Training hybrid models for dummies.

    Python 16 Apache-2.0 1 0 1 Updated Dec 16, 2024
  • tree_attention Public

    Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

    Python 109 5 1 0 Updated Dec 3, 2024
  • Zamba2 Public

    PyTorch implementation of models from the Zamba2 series.

    Python 166 Apache-2.0 17 1 1 Updated Nov 26, 2024
  • FastChat Public Forked from lm-sys/FastChat

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

    Python 0 Apache-2.0 4,725 0 0 Updated Nov 6, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 0 2,536 10 4 Updated Aug 20, 2024
  • Megatron-DeepSpeed Public Forked from microsoft/Megatron-DeepSpeed

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python 0 2,536 0 2 Updated Aug 19, 2024
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python 0 BSD-3-Clause 1,419 0 0 Updated Jul 8, 2024
  • Zamba-torch Public
    Python 7 Apache-2.0 1 0 0 Updated Jul 1, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.
