Repositories list
40 repositories
lightllm (Public): LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
quant_horizon (Public)
general-sam-py (Public)
mtc-token-healing (Public)
EasyLLM (Public)
DeepSpeed (Public)
opencompass (Public)
xtuner (Public)
InternVL (Public)
OmniBal (Public)
[CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Models".
L2_Compression (Public)
msbench (Public)
FCPTS (Public template)
statecs (Public)
greedy-tokenizer (Public)
QLLM (Public): [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models".
Dipoorlet (Public)
awesome-lm-system (Public)
Outlier_Suppression_Plus (Public)
UP_LPCV2023_Plugin (Public)
ChatGLM-6B (Public)
pyvlova (Public)
systemnoise_web (Public)
NART (Public)
United-Perception (Public)