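Data updated: 2024-10-10 / Note: a "Chinese project" here means any project whose documentation is natively written in Chinese OR includes a Chinese translation; this can usually be found in the project's README, wiki, or official website.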
| # | Repository | Description | Stars | Average daily growth | Updated |
|---|---|---|---|---|---|
1 | 2noise/ChatTTS | A generative speech model for daily dialogue. | 31333 | 230 | 2024-10-09 |
2 | All-Hands-AI/OpenHands | 🙌 OpenHands: Code Less, Make More | 32822 | 156 | 2024-10-09 |
3 | Ucas-HaoranWei/GOT-OCR2.0 | Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model | 5121 | 135 | 2024-10-02 |
4 | RVC-Boss/GPT-SoVITS | 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) | 33797 | 125 | 2024-10-02 |
5 | KwaiVGI/LivePortrait | Bring portraits to life! | 12216 | 123 | 2024-10-07 |
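6 | binary-husky/gpt_academic | A practical interactive interface for LLMs such as GPT/GLM, specially optimized for paper reading, polishing, and writing. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-translation for Python, C++, and other languages, PDF/LaTeX paper translation and summarization, parallel queries to multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFLYTEK Spark, ERNIE Bot (Wenxin Yiyan), llama2, rwkv, claude2, moss ... | 64638 | 113 | 2024-10-07 |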
7 | hpcaitech/Open-Sora | Open-Sora: Democratizing Efficient Video Production for All | 21814 | 94 | 2024-08-09 |
8 | myshell-ai/OpenVoice | Instant voice cloning by MIT and MyShell. | 29071 | 92 | 2024-08-21 |
9 | harry0703/MoneyPrinterTurbo | Generate high-definition short videos with one click using AI large language models. | 16408 | 77 | 2024-07-26 |
10 | fudan-generative-vision/hallo | Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation | 9281 | 77 | 2024-09-14 |
11 | THUDM/ChatGLM-6B | ChatGLM-6B: An Open Bilingual Dialogue Language Model | 40484 | 70 | 2024-06-27 |
12 | InternLM/MindSearch | 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) | 4813 | 65 | 2024-09-25 |
13 | lm-sys/FastChat | An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. | 36614 | 64 | 2024-10-06 |
14 | hiyouga/LLaMA-Factory | Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024) | 32079 | 64 | 2024-10-08 |
15 | gpt-omni/mini-omni | open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities. | 2775 | 63 | 2024-09-25 |
16 | infiniflow/ragflow | RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. | 18857 | 62 | 2024-10-09 |
17 | huggingface/transformers | 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX. | 133123 | 61 | 2024-10-09 |
18 | QwenLM/Qwen2-VL | Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. | 2508 | 60 | 2024-10-04 |
19 | ScrapeGraphAI/Scrapegraph-ai | Python scraper based on AI | 14788 | 58 | 2024-10-09 |
20 | FunAudioLLM/CosyVoice | Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. | 5386 | 54 | 2024-09-29 |
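21 | LC044/WeChatMsg | Extract WeChat chat history and export it to HTML, Word, or Excel documents for permanent storage; analyze the chat history to generate an annual chat report; and use the chat data to train a personal AI chat assistant. | 33731 | 53 | 2024-09-23 |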
22 | huggingface/speech-to-speech | Speech To Speech: an effort for an open-sourced and modular GPT4-o | 3200 | 50 | 2024-09-27 |
23 | VikParuchuri/marker | Convert PDF to markdown quickly with high accuracy | 16882 | 49 | 2024-09-07 |
24 | PKU-YuanGroup/Open-Sora-Plan | This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project. | 11308 | 49 | 2024-10-08 |
25 | OpenBMB/MiniCPM-V | MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone | 12189 | 48 | 2024-09-13 |
26 | opendatalab/PDF-Extract-Kit | A Comprehensive Toolkit for High-Quality PDF Content Extraction | 5017 | 48 | 2024-10-09 |
27 | jianchang512/ChatTTS-ui | A simple local web interface that uses ChatTTS to synthesize text into speech, with support for exposing an external API. | 6020 | 45 | 2024-08-29 |
28 | RVC-Project/Retrieval-based-Voice-Conversion-WebUI | Easily train a good VC model with voice data <= 10 mins! | 23638 | 42 | 2024-09-05 |
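29 | chatanywhere/GPT_API_free | Free ChatGPT API key. Free ChatGPT API with GPT-4 API support (free); a free forwarding API for ChatGPT that is usable in mainland China over a direct connection, with no proxy required. Works with software and plugins such as ChatBox, greatly reducing API usage costs. Chat without restrictions from within China. | 22088 | 41 | 2024-09-26 |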
30 | netease-youdao/QAnything | Question and Answer based on Anything. | 11580 | 41 | 2024-09-27 |
31 | adithya-s-k/omniparse | Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks | 5138 | 40 | 2024-09-23 |
32 | ultralytics/ultralytics | Ultralytics YOLO11 🚀 | 29925 | 39 | 2024-10-09 |
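33 | zhayujie/chatgpt-on-wechat | A chatbot built on large models that supports integration with WeChat Official Accounts, WeCom apps, Feishu, DingTalk, and more. Selectable models include GPT3.5/GPT-4o/GPT-o1/Claude/ERNIE Bot (Wenxin Yiyan)/iFLYTEK Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI. It handles text, voice, and images, can access the operating system and the internet, and supports customized enterprise customer-service bots built on a private knowledge base. | 30365 | 38 | 2024-09-26 |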
34 | Kwai-Kolors/Kolors | Kolors Team | 3683 | 38 | 2024-09-04 |
35 | Upsonic/gpt-computer-assistant | Intelligence development framework in python for your product like Apple Intelligence | 5212 | 38 | 2024-09-10 |
36 | THUDM/ChatGLM3 | ChatGLM3 series: Open Bilingual Chat LLMs | 13379 | 38 | 2024-07-10 |
37 | VikParuchuri/surya | OCR, layout analysis, reading order, table recognition in 90+ languages | 10036 | 37 | 2024-10-08 |
38 | hpcaitech/ColossalAI | Making large AI models cheaper, faster and more accessible | 38711 | 36 | 2024-10-09 |
39 | fishaudio/fish-speech | Brand new TTS solution | 13098 | 36 | 2024-10-08 |
40 | NexaAI/nexa-sdk | Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech ... | 1891 | 34 | 2024-10-09 |
41 | THUDM/ChatGLM2-6B | ChatGLM2-6B: An Open Bilingual Chat LLM | 15702 | 33 | 2024-06-27 |
42 | THUDM/GLM-4 | GLM-4 series: Open Multilingual Multimodal Chat LMs | 4800 | 32 | 2024-10-06 |
43 | ymcui/Chinese-LLaMA-Alpaca | Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment | 18246 | 32 | 2024-04-30 |
44 | QwenLM/Qwen | The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. | 13637 | 31 | 2024-09-24 |
45 | ultralytics/yolov5 | YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite | 50086 | 31 | 2024-10-05 |
46 | FunAudioLLM/SenseVoice | Multilingual Voice Understanding Model | 2869 | 29 | 2024-09-25 |
47 | jingyaogong/minimind | [LLM] Train a small 26M-parameter GPT entirely from scratch in 3 hours; a personal GPU is enough for both inference and training! | 2203 | 29 | 2024-10-09 |
48 | microsoft/UFO | A UI-Focused Agent for Windows OS Interaction. | 7633 | 28 | 2024-09-25 |
49 | qhjqhj00/MemoRAG | Empowering RAG with a memory-based data interface for all-purpose applications! | 1002 | 28 | 2024-09-29 |
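50 | hiroi-sora/Umi-OCR | OCR software, free, open-source, and offline. Supports screenshot OCR and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and QR code scanning and generation. Ships with built-in multilingual libraries. | 26210 | 28 | 2024-10-09 |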
51 | assafelovic/gpt-researcher | LLM based autonomous agent that does online comprehensive research on any given topic | 14340 | 28 | 2024-10-07 |
52 | reflex-dev/reflex | 🕸️ Web apps in pure Python 🐍 | 19655 | 27 | 2024-10-09 |
53 | PaddlePaddle/PaddleOCR | Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and de ... | 43079 | 27 | 2024-10-09 |
54 | Textualize/rich | Rich is a Python library for rich text and beautiful formatting in the terminal. | 49116 | 27 | 2024-10-04 |
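55 | 1Panel-dev/MaxKB | 🚀 A knowledge-base question-answering system based on large language models and RAG. Ready to use out of the box, model-neutral, with flexible orchestration and support for quick embedding into third-party business systems. | 10600 | 27 | 2024-10-09 |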
56 | BadToBest/EchoMimic | Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning | 2595 | 26 | 2024-08-15 |
57 | GaiZhenbiao/ChuanhuChatGPT | GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. | 15183 | 26 | 2024-09-25 |
58 | linyqh/NarratoAI | Use AI large models to automatically provide commentary and edit videos with a single click. | 1484 | 25 | 2024-10-08 |
59 | eosphoros-ai/DB-GPT | AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents | 13435 | 25 | 2024-10-08 |
60 | Sinaptik-AI/pandas-ai | Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG. | 12829 | 24 | 2024-09-25 |
61 | xinntao/Real-ESRGAN | Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration. | 27979 | 24 | 2024-08-06 |
62 | TeamWiseFlow/wiseflow | Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and upl ... | 4071 | 24 | 2024-09-04 |
63 | jianchang512/clone-voice | A sound cloning tool with a web interface, using your voice or any sound to record audio | 7355 | 23 | 2024-08-22 |
64 | Zeyi-Lin/HivisionIDPhotos | ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool. | 10875 | 23 | 2024-09-28 |
65 | THUDM/LongWriter | LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs | 1386 | 23 | 2024-09-27 |
66 | guoyww/AnimateDiff | Official implementation of AnimateDiff. | 10379 | 22 | 2024-07-31 |
67 | Tencent/HunyuanDiT | Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding | 3337 | 22 | 2024-08-15 |
68 | OpenMOSS/MOSS | An open-source tool-augmented conversational language model from Fudan University | 11928 | 22 | 2024-07-13 |
69 | OpenBMB/XAgent | An Autonomous LLM Agent for Complex Task Solving | 8078 | 22 | 2024-08-12 |
70 | AiuniAI/Unique3D | Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image | 2954 | 22 | 2024-09-18 |
71 | netease-youdao/EmotiVoice | EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine | 7314 | 22 | 2024-08-13 |
72 | dataelement/bisheng | BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF ... | 8655 | 21 | 2024-10-09 |
73 | Kanaries/pygwalker | PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis | 12867 | 21 | 2024-10-02 |
74 | modelscope/DiffSynth-Studio | Enjoy the magic of Diffusion models! | 6409 | 21 | 2024-10-09 |
75 | vvbbnn00/WARP-Clash-API | This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic. | 8453 | 20 | 2024-09-04 |
76 | myshell-ai/MeloTTS | High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. | 4581 | 20 | 2024-08-09 |
77 | microsoft/DeepSpeed | DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. | 35020 | 20 | 2024-10-09 |
78 | deepseek-ai/DeepSeek-Coder | DeepSeek Coder: Let the Code Write Itself | 6640 | 19 | 2024-05-21 |
79 | jzhang38/TinyLlama | The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. | 7730 | 19 | 2024-05-03 |
80 | ageitgey/face_recognition | The world's simplest facial recognition api for Python and the command line | 53078 | 19 | 2024-08-21 |
81 | RUCAIBox/LLMSurvey | The official GitHub page for the survey paper "A Survey of Large Language Models". | 10151 | 18 | 2024-08-20 |
82 | facebookresearch/nougat | Implementation of Nougat Neural Optical Understanding for Academic Documents | 8850 | 18 | 2024-04-16 |
83 | modelscope/agentscope | Start building LLM-empowered multi-agent applications in an easier way. | 4983 | 18 | 2024-10-08 |
84 | OpenGVLab/InternVL | [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance | 5692 | 18 | 2024-09-19 |
85 | OpenInterpreter/01 | The #1 open-source voice interface for desktop, mobile, and ESP32 chips. | 4924 | 18 | 2024-10-02 |
86 | 3b1b/manim | Animation engine for explanatory math videos | 63144 | 18 | 2024-10-04 |
87 | fishaudio/Bert-VITS2 | vits2 backbone with multilingual-bert | 7881 | 18 | 2024-10-07 |
88 | xinsir6/ControlNetPlus | ControlNet++: All-in-one ControlNet for image generations and editing! | 1685 | 17 | 2024-09-30 |
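89 | honmashironeko/ProxyCat | A proxy-pool middleware deployable in the cloud or locally that flexibly turns static proxy IPs into tunnel IPs and exposes a fixed request address; deploy once, use indefinitely | 866 | 17 | 2024-10-06 |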
90 | dyang886/Game-Cheats-Manager | Easily download and manage game cheats for your convenience | 4693 | 17 | 2024-10-05 |
91 | THUDM/CodeGeeX2 | CodeGeeX2: A More Powerful Multilingual Code Generation Model | 7620 | 17 | 2024-07-10 |
92 | buaacyw/MeshAnything | From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers" | 1999 | 17 | 2024-08-05 |
93 | FlagOpen/FlagEmbedding | Retrieval and Retrieval-augmented LLMs | 7068 | 16 | 2024-10-07 |
94 | ymcui/Chinese-LLaMA-Alpaca-2 | Phase two of the Chinese LLaMA-2 & Alpaca-2 LLM project, with 64K long-context models | 7068 | 16 | 2024-09-23 |
95 | marimo-team/marimo | A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git. | 6661 | 16 | 2024-10-09 |
96 | yixiu001/serv00-login | Automated batch account keep-alive for both serv00 and ct8: logs in to the panel automatically every 3 days and sends a message to Telegram | 1455 | 15 | 2024-07-19 |
97 | voicepaw/so-vits-svc-fork | so-vits-svc fork with realtime support, improved interface and more features. | 8728 | 15 | 2024-10-08 |
98 | gradio-app/gradio | Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! | 32610 | 15 | 2024-10-09 |
99 | THUDM/CogVLM | A state-of-the-art open visual language model (multimodal pretrained model) | 5928 | 15 | 2024-05-29 |
100 | OptimalScale/LMFlow | An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All. | 8234 | 15 | 2024-10-05 |
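101 | 6drf21e/ChatTTS_colab | 🚀 One-click deployment (offline bundle included)! Based on ChatTTS, with support for streaming output, random voice sampling ("voice gacha"), long audio generation, and multi-role reading. Simple to use, with no complicated installation. | 1979 | 15 | 2024-07-02 |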
102 | BlinkDL/ChatRWKV | ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. | 9395 | 15 | 2024-07-11 |
103 | InternLM/InternLM | Official release of InternLM2.5 base and chat models. 1M context support | 6299 | 14 | 2024-09-06 |
104 | VisionRush/DeepFakeDefenders | Image forgery recognition algorithm | 529 | 13 | 2024-09-09 |
105 | THUDM/CogVLM2 | GPT4V-level open-source multi-modal model based on Llama3-8B | 2040 | 13 | 2024-09-03 |
106 | xxlong0/Wonder3D | Single Image to 3D using Cross-Domain Diffusion for 3D Generation | 4723 | 13 | 2024-08-29 |
107 | open-mmlab/mmdetection | OpenMMLab Detection Toolbox and Benchmark | 29262 | 13 | 2024-08-21 |
108 | jxxghp/MoviePilot | Automated management tool for NAS media libraries | 6335 | 13 | 2024-10-09 |
109 | THUDM/CodeGeeX4 | CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more. | 1306 | 13 | 2024-08-25 |
110 | WZMIAOMIAO/deep-learning-for-image-processing | deep learning for image processing including classification and object-detection etc. | 22624 | 13 | 2024-07-25 |
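111 | xaoyaoo/PyWxDump | Retrieve WeChat information; read the database, view chat history locally, and export it to CSV, HTML, and other formats for AI training, auto-reply, and so on. Supports retrieving information for multiple accounts and all WeChat versions. | 5470 | 13 | 2024-10-07 |
112 | PeterH0323/Streamer-Sales | Streamer-Sales (销冠, "top seller"): a sales-streamer LLM 🛒🎁 that presents a product from its given features with the aim of sparking viewers' desire to buy. 🚀⭐ Includes a detailed data-generation pipeline❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that looks up real-time information online 🌐, ASR (speech-to-text) 🎙️, a Vue-based frontend 🍍, a FastAPI backend ... | 2427 | 13 | 2024-09-29 |
113 | HZJQF/help_tool | Algorithm inference assistant (an overwhelming advantage) | 539 | 13 | 2024-10-05 |
114 | 521xueweihan/GitHub520 | 😘 Makes you "love" GitHub by fixing broken images and slow loading when accessing it. (No installation required) | 21205 | 13 | 2024-10-09 |
115 | TMElyralab/MuseTalk | MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting | 2546 | 13 | 2024-09-23 |
116 | sMythicalBird/ZenlessZoneZero-Auto | Zenless Zone Zero (绝区零): Hollow Zero, auto-combat, automation, image classification, OCR recognition | 1164 | 13 | 2024-10-06 |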
117 | QwenLM/Qwen-VL | The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. | 4900 | 12 | 2024-08-07 |
118 | llmware-ai/llmware | Unified framework for building enterprise RAG pipelines with small, specialized models | 4654 | 12 | 2024-10-07 |
119 | YaoFANGUK/video-subtitle-remover | AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or pictures, producing output files at the original resolution. Runs locally with no third-party API required. | 4100 | 12 | 2024-10-09 |
120 | fufankeji/MateGen | Next-Generation Interactive Intelligent Programming Assistant | 1033 | 12 | 2024-09-20 |
121 | aigc-apps/sd-webui-EasyPhoto | 📷 EasyPhoto Your Smart AI Photo Generator. | 4929 | 12 | 2024-07-10 |
122 | lipku/livetalking | Real time interactive streaming digital human | 3587 | 12 | 2024-10-05 |
123 | barry-far/V2ray-Configs | 🛰️✨ Free V2ray Configs , Updating Every 10 minutes. | 4510 | 12 | 2024-10-09 |
124 | TMElyralab/MuseV | MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising | 2377 | 12 | 2024-06-28 |
125 | RayVentura/ShortGPT | 🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation | 5653 | 12 | 2024-09-19 |
126 | MustardChef/WSABuilds | Run Windows Subsystem For Android on your Windows 10 and Windows 11 PC using prebuilt binaries with Google Play Store (MindTheGapps) and/or Magisk or KernelSU (root solutions) built in. | 7868 | 12 | 2024-08-16 |
127 | taosdata/TDengine | High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios | 23298 | 12 | 2024-10-09 |
128 | moesnow/March7thAssistant | March 7th Assistant: full automation for Honkai: Star Rail | 5024 | 12 | 2024-09-28 |
129 | gusye1234/nano-graphrag | A simple, easy-to-hack GraphRAG implementation | 906 | 12 | 2024-10-01 |
130 | baichuan-inc/Baichuan-7B | A large-scale 7B pretraining language model developed by BaiChuan-Inc. | 5670 | 12 | 2024-07-18 |
131 | luosiallen/latent-consistency-model | Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference | 4311 | 12 | 2024-06-14 |
132 | linyiLYi/street-fighter-ai | This is an AI agent for Street Fighter II Champion Edition. | 6319 | 11 | 2024-05-14 |
133 | tyxsspa/AnyText | Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing> | 4250 | 11 | 2024-06-21 |
134 | THUDM/CodeGeeX | CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023) | 8173 | 11 | 2024-08-13 |
135 | BlinkDL/RWKV-LM | RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa ... | 12493 | 11 | 2024-09-23 |
136 | X-PLUG/MobileAgent | Mobile-Agent: The Powerful Mobile Device Operation Assistant Family | 2792 | 11 | 2024-09-26 |
137 | thuml/Time-Series-Library | A Library for Advanced Deep Time Series Models. | 6545 | 11 | 2024-09-29 |
138 | aixcoder-plugin/aiXcoder-7B | official repository of aiXcoder-7B Code Large Language Model | 2194 | 11 | 2024-08-29 |
139 | Langboat/Mengzi3 | - | 2032 | 10 | 2024-10-09 |
140 | FujiwaraChoki/MoneyPrinterV2 | Automate the process of making money online. | 2369 | 10 | 2024-04-17 |
141 | yihong0618/xiaogpt | Play ChatGPT and other LLM with Xiaomi AI Speaker | 6163 | 10 | 2024-09-22 |
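142 | yangjianxin1/Firefly | Firefly: an LLM training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models | 5700 | 10 | 2024-09-19 |
143 | ihmily/DouyinLiveRecorder | Live-stream recording software with continuous monitoring and multi-streamer recording; supports Douyin, TikTok, Kuaishou, Huya, Douyu, Bilibili, Xiaohongshu, pandatv, afreecatv, flextv, popkontv, twitcasting, winktv, Baidu, Weibo, Kugou, Huajiao, Twitch, Acfun, CHZZK, and other platforms | 4535 | 10 | 2024-10-08 |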
144 | lanqian528/chat2api | A service that can convert ChatGPT on the web to OpenAI API format. | 1894 | 10 | 2024-10-07 |
145 | Alpha-VLLM/Lumina-T2X | Lumina-T2X is a unified framework for Text to Any Modality Generation | 2040 | 10 | 2024-08-06 |
146 | QwenLM/Qwen2-Audio | The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud. | 1132 | 10 | 2024-08-13 |
147 | yl4579/StyleTTS2 | StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models | 4820 | 10 | 2024-08-10 |
148 | reorx/awesome-chatgpt-api | Curated list of apps and tools that not only use the new ChatGPT API, but also allow users to configure their own API keys, enabling free and on-demand usage of their own quota. | 5919 | 10 | 2024-09-26 |
149 | cubiq/ComfyUI_IPAdapter_plus | - | 3949 | 10 | 2024-09-13 |
150 | ViggoZ/producthunt-daily-hot | Automatically generates a daily Chinese-language list of trending Product Hunt products, committing Markdown files via GitHub Actions | 604 | 10 | 2024-10-09 |
151 | infrost/DeeplxFile | Easy-to-use, fast, free, unlimited file size and cross platform file translation tool based on Deeplx & Playwright that supports long tex ... | 535 | 10 | 2024-09-09 |
152 | xorbitsai/inference | Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any ... | 5027 | 10 | 2024-10-09 |
153 | modelscope/modelscope | ModelScope: bring the notion of Model-as-a-Service to life. | 6886 | 9 | 2024-10-09 |
154 | google-deepmind/penzai | A JAX research toolkit for building, editing, and visualizing neural networks. | 1654 | 9 | 2024-09-11 |
155 | modelscope/ms-swift | Use PEFT or Full-parameter to finetune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vis ... | 3734 | 9 | 2024-10-09 |
156 | THUDM/CogVideo | text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) | 7928 | 9 | 2024-10-09 |
157 | InternLM/lmdeploy | LMDeploy is a toolkit for compressing, deploying, and serving LLMs. | 4370 | 9 | 2024-10-09 |
158 | hankcs/HanLP | Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification | 33649 | 9 | 2024-10-08 |
159 | recommenders-team/recommenders | Best Practices on Recommendation Systems | 18925 | 9 | 2024-10-09 |
160 | mli/autocut | Cut videos with a text editor | 6609 | 9 | 2024-10-05 |
161 | PaddlePaddle/PaddleNLP | 👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ... | 12012 | 9 | 2024-10-09 |
162 | modelscope/FunASR | A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc. | 6220 | 9 | 2024-10-09 |
163 | xlang-ai/OpenAgents | [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild | 3942 | 9 | 2024-07-08 |
164 | LinYuanovo/pikpak_auto_invite | PikPak auto-invite program with image-recognition CAPTCHA solving; runs locally or in the cloud via GitHub Actions | 1058 | 9 | 2024-07-04 |
165 | CVHub520/X-AnyLabeling | Effortless data labeling with AI support from Segment Anything and other awesome models. | 3858 | 8 | 2024-10-02 |
166 | kohya-ss/sd-scripts | - | 5062 | 8 | 2024-10-07 |
167 | om-ai-lab/OmAgent | A multimodal agent framework for solving complex tasks [EMNLP'2024] | 779 | 8 | 2024-10-09 |
168 | jianchang512/stt | Voice Recognition to Text Tool: a local speech-to-text service that runs offline and outputs JSON, timestamped SRT subtitles, or plain text | 2243 | 8 | 2024-10-07 |
169 | tgbot-collection/YYeTsBot | 🎬 YYeTs (人人影视) bot and website, including all YYeTs resources plus many user-shared cloud-drive links | 14183 | 8 | 2024-07-21 |
170 | AutoGPTQ/AutoGPTQ | An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm. | 4384 | 8 | 2024-09-28 |
171 | THUDM/VisualGLM-6B | Chinese and English multimodal conversational language model | 4077 | 8 | 2024-08-23 |
172 | pkuliyi2015/multidiffusion-upscaler-for-automatic1111 | Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0 | 4731 | 8 | 2024-08-07 |
173 | open-compass/opencompass | OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc) over 100+ datasets. | 3853 | 8 | 2024-10-09 |
174 | Plachtaa/VITS-fast-fine-tuning | This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion | 4709 | 8 | 2024-07-03 |
175 | MzeroMiko/VMamba | VMamba: Visual State Space Models, code is based on mamba | 2078 | 8 | 2024-09-25 |
176 | EstrellaXD/Auto_Bangumi | AutoBangumi - fully automatic anime tracking tool | 6772 | 8 | 2024-09-26 |
177 | XPixelGroup/DiffBIR | Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior | 3299 | 8 | 2024-07-03 |
178 | hitsz-ids/synthetic-data-generator | SDG is a specialized framework designed to generate high-quality structured tabular data. | 3265 | 8 | 2024-10-07 |
179 | sml2h3/ddddocr | ddddocr (带带弟弟): general-purpose CAPTCHA-recognition OCR, PyPI version | 9772 | 8 | 2024-07-25 |
180 | madawei2699/myGPTReader | A community-driven way to read and chat with AI bots - powered by chatGPT. | 4426 | 8 | 2024-04-25 |
181 | QiuChenly/InjectLib | You know what I'm going to say | 945 | 8 | 2024-10-08 |
182 | ok-oldking/ok-wuthering-waves | Automation for Wuthering Waves (鸣潮): background auto-combat, automatic Echo farming, locking, and merging | 1032 | 8 | 2024-10-08 |
183 | z1069614715/objectdetection_script | Code for improvement ideas for object-detection scripts; see readme.md for details | 5172 | 8 | 2024-10-07 |
184 | InternLM/xtuner | An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...) | 3821 | 8 | 2024-09-29 |
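185 | Evil0ctal/Douyin_TikTok_Download_API | 🚀 Douyin_TikTok_Download_API is an out-of-the-box, high-performance asynchronous data scraping tool for Douyin, Kuaishou, TikTok, and Bilibili, with support for API calls and online batch parsing and downloading. | 8876 | 8 | 2024-09-26 |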
186 | QwenLM/Qwen-Agent | Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. | 3258 | 8 | 2024-10-08 |
187 | fxsjy/jieba | Jieba (结巴) Chinese word segmentation | 33170 | 8 | 2024-08-21 |
188 | DennisThink/awesome_twitter_CN | Chinese-language Twitter users worth following | 616 | 7 | 2024-09-26 |
189 | continue-revolution/sd-webui-animatediff | AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI | 3067 | 7 | 2024-09-22 |
190 | aigc-apps/EasyAnimate | 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion | 1221 | 7 | 2024-08-22 |
191 | xingpingcn/enhanced-FaaS-in-China | Improve the access speed and stability in China of web pages hosted on cloudflare, vercel or netlify by merely changing your CNAME record. Cloudflare preferred domains, Cloudflare preferred IPs ... | 1543 | 7 | 2024-10-09 |
192 | malinkang/weread2notion-pro | - | 2048 | 7 | 2024-10-09 |
193 | modelscope/FunClip | Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM-based AI clipping integrated. | 3397 | 7 | 2024-08-22 |
194 | bilibili/Index-1.9B | A SOTA lightweight multilingual LLM | 881 | 7 | 2024-09-20 |
195 | InternLM/InternLM-XComposer | InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output | 2473 | 7 | 2024-08-30 |
196 | DachunKai/EvTexture | [ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution | 994 | 7 | 2024-09-17 |
197 | sqlmapproject/sqlmap | Automatic SQL injection and database takeover tool | 32201 | 7 | 2024-09-25 |
198 | DeepInsight-AI/DeepBI | LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI. | 2329 | 7 | 2024-10-08 |
199 | JaveleyQAQ/WeChatOpenDevTools-Python | WeChatOpenDevTool: force-enable developer tools for WeChat Mini Programs | 1943 | 7 | 2024-09-15 |
200 | ParthJadhav/Tkinter-Designer | An easy and fast way to create a Python GUI 🐍 | 9065 | 7 | 2024-08-21 |
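↓ -- Thanks to our readers -- ↓
This list is updated continuously. If it helps, please star the repository so you can find it again later. Thank you for your support!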