Skip to content

Latest commit

 

History

History
216 lines (211 loc) · 38.3 KB

File metadata and controls

216 lines (211 loc) · 38.3 KB

返回目录问题反馈

中文增速榜 > 软件类 > Python

数据更新: 2024-10-10   /   温馨提示:中文项目泛指「文档母语为中文」OR「含有中文翻译」的项目,通常在项目的「readme/wiki/官网」可以找到

# Repository Description Stars Average daily growth Updated
1 2noise/ChatTTS A generative speech model for daily dialogue. 31333 230 2024-10-09
2 All-Hands-AI/OpenHands 🙌 OpenHands: Code Less, Make More 32822 156 2024-10-09
3 Ucas-HaoranWei/GOT-OCR2.0 Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model 5121 135 2024-10-02
4 RVC-Boss/GPT-SoVITS 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) 33797 125 2024-10-02
5 KwaiVGI/LivePortrait Bring portraits to life! 12216 123 2024-10-07
6 binary-husky/gpt_academic 为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss ... 64638 113 2024-10-07
7 hpcaitech/Open-Sora Open-Sora: Democratizing Efficient Video Production for All 21814 94 2024-08-09
8 myshell-ai/OpenVoice Instant voice cloning by MIT and MyShell. 29071 92 2024-08-21
9 harry0703/MoneyPrinterTurbo 利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM. 16408 77 2024-07-26
10 fudan-generative-vision/hallo Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation 9281 77 2024-09-14
11 THUDM/ChatGLM-6B ChatGLM-6B: An Open Bilingual Dialogue Language Model 开源双语对话语言模型 40484 70 2024-06-27
12 InternLM/MindSearch 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) 4813 65 2024-09-25
13 lm-sys/FastChat An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. 36614 64 2024-10-06
14 hiyouga/LLaMA-Factory Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024) 32079 64 2024-10-08
15 gpt-omni/mini-omni open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities. 2775 63 2024-09-25
16 infiniflow/ragflow RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. 18857 62 2024-10-09
17 huggingface/transformers 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. 133123 61 2024-10-09
18 QwenLM/Qwen2-VL Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. 2508 60 2024-10-04
19 ScrapeGraphAI/Scrapegraph-ai Python scraper based on AI 14788 58 2024-10-09
20 FunAudioLLM/CosyVoice Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. 5386 54 2024-09-29
21 LC044/WeChatMsg 提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手 33731 53 2024-09-23
22 huggingface/speech-to-speech Speech To Speech: an effort for an open-sourced and modular GPT4-o 3200 50 2024-09-27
23 VikParuchuri/marker Convert PDF to markdown quickly with high accuracy 16882 49 2024-09-07
24 PKU-YuanGroup/Open-Sora-Plan This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project. 11308 49 2024-10-08
25 OpenBMB/MiniCPM-V MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone 12189 48 2024-09-13
26 opendatalab/PDF-Extract-Kit A Comprehensive Toolkit for High-Quality PDF Content Extraction 5017 48 2024-10-09
27 jianchang512/ChatTTS-ui 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces. 6020 45 2024-08-29
28 RVC-Project/Retrieval-based-Voice-Conversion-WebUI Easily train a good VC model with voice data <= 10 mins! 23638 42 2024-09-05
29 chatanywhere/GPT_API_free Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。 22088 41 2024-09-26
30 netease-youdao/QAnything Question and Answer based on Anything. 11580 41 2024-09-27
31 adithya-s-k/omniparse Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks 5138 40 2024-09-23
32 ultralytics/ultralytics Ultralytics YOLO11 🚀 29925 39 2024-10-09
33 zhayujie/chatgpt-on-wechat 基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。 30365 38 2024-09-26
34 Kwai-Kolors/Kolors Kolors Team 3683 38 2024-09-04
35 Upsonic/gpt-computer-assistant Intelligence development framework in python for your product like Apple Intelligence 5212 38 2024-09-10
36 THUDM/ChatGLM3 ChatGLM3 series: Open Bilingual Chat LLMs 开源双语对话语言模型 13379 38 2024-07-10
37 VikParuchuri/surya OCR, layout analysis, reading order, table recognition in 90+ languages 10036 37 2024-10-08
38 hpcaitech/ColossalAI Making large AI models cheaper, faster and more accessible 38711 36 2024-10-09
39 fishaudio/fish-speech Brand new TTS solution 13098 36 2024-10-08
40 NexaAI/nexa-sdk Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-speech ... 1891 34 2024-10-09
41 THUDM/ChatGLM2-6B ChatGLM2-6B: An Open Bilingual Chat LLM 开源双语对话语言模型 15702 33 2024-06-27
42 THUDM/GLM-4 GLM-4 series: Open Multilingual Multimodal Chat LMs 开源多语言多模态对话模型 4800 32 2024-10-06
43 ymcui/Chinese-LLaMA-Alpaca 中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs) 18246 32 2024-04-30
44 QwenLM/Qwen The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. 13637 31 2024-09-24
45 ultralytics/yolov5 YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite 50086 31 2024-10-05
46 FunAudioLLM/SenseVoice Multilingual Voice Understanding Model 2869 29 2024-09-25
47 jingyaogong/minimind 「大模型」3小时完全从0训练26M的小参数GPT,个人显卡即可推理训练! 2203 29 2024-10-09
48 microsoft/UFO A UI-Focused Agent for Windows OS Interaction. 7633 28 2024-09-25
49 qhjqhj00/MemoRAG Empowering RAG with a memory-based data interface for all-purpose applications! 1002 28 2024-09-29
50 hiroi-sora/Umi-OCR OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。 26210 28 2024-10-09
51 assafelovic/gpt-researcher LLM based autonomous agent that does online comprehensive research on any given topic 14340 28 2024-10-07
52 reflex-dev/reflex 🕸️ Web apps in pure Python 🐍 19655 27 2024-10-09
53 PaddlePaddle/PaddleOCR Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and de ... 43079 27 2024-10-09
54 Textualize/rich Rich is a Python library for rich text and beautiful formatting in the terminal. 49116 27 2024-10-04
55 1Panel-dev/MaxKB 🚀 基于大语言模型和 RAG 的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统。 10600 27 2024-10-09
56 BadToBest/EchoMimic Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning 2595 26 2024-08-15
57 GaiZhenbiao/ChuanhuChatGPT GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. 15183 26 2024-09-25
58 linyqh/NarratoAI 利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click. 1484 25 2024-10-08
59 eosphoros-ai/DB-GPT AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents 13435 25 2024-10-08
60 Sinaptik-AI/pandas-ai Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG. 12829 24 2024-09-25
61 xinntao/Real-ESRGAN Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration. 27979 24 2024-08-06
62 TeamWiseFlow/wiseflow Wiseflow is an agile information mining tool that extracts concise messages from various sources such as websites, WeChat official accounts, social platforms, etc. It automatically categorizes and upl ... 4071 24 2024-09-04
63 jianchang512/clone-voice A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频 7355 23 2024-08-22
64 Zeyi-Lin/HivisionIDPhotos ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。 10875 23 2024-09-28
65 THUDM/LongWriter LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs 1386 23 2024-09-27
66 guoyww/AnimateDiff Official implementation of AnimateDiff. 10379 22 2024-07-31
67 Tencent/HunyuanDiT Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding 3337 22 2024-08-15
68 OpenMOSS/MOSS An open-source tool-augmented conversational language model from Fudan University 11928 22 2024-07-13
69 OpenBMB/XAgent An Autonomous LLM Agent for Complex Task Solving 8078 22 2024-08-12
70 AiuniAI/Unique3D Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image 2954 22 2024-09-18
71 netease-youdao/EmotiVoice EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine 7314 22 2024-08-13
72 dataelement/bisheng BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SF ... 8655 21 2024-10-09
73 Kanaries/pygwalker PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis 12867 21 2024-10-02
74 modelscope/DiffSynth-Studio Enjoy the magic of Diffusion models! 6409 21 2024-10-09
75 vvbbnn00/WARP-Clash-API 该项目可以让你通过订阅的方式使用Cloudflare WARP+,自动获取流量。This project enables you to use Cloudflare WARP+ through subscription, automatically acquiring traffic. 8453 20 2024-09-04
76 myshell-ai/MeloTTS High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. 4581 20 2024-08-09
77 microsoft/DeepSpeed DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. 35020 20 2024-10-09
78 deepseek-ai/DeepSeek-Coder DeepSeek Coder: Let the Code Write Itself 6640 19 2024-05-21
79 jzhang38/TinyLlama The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. 7730 19 2024-05-03
80 ageitgey/face_recognition The world's simplest facial recognition api for Python and the command line 53078 19 2024-08-21
81 RUCAIBox/LLMSurvey The official GitHub page for the survey paper "A Survey of Large Language Models". 10151 18 2024-08-20
82 facebookresearch/nougat Implementation of Nougat Neural Optical Understanding for Academic Documents 8850 18 2024-04-16
83 modelscope/agentscope Start building LLM-empowered multi-agent applications in an easier way. 4983 18 2024-10-08
84 OpenGVLab/InternVL [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型 5692 18 2024-09-19
85 OpenInterpreter/01 The #1 open-source voice interface for desktop, mobile, and ESP32 chips. 4924 18 2024-10-02
86 3b1b/manim Animation engine for explanatory math videos 63144 18 2024-10-04
87 fishaudio/Bert-VITS2 vits2 backbone with multilingual-bert 7881 18 2024-10-07
88 xinsir6/ControlNetPlus ControlNet++: All-in-one ControlNet for image generations and editing! 1685 17 2024-09-30
89 honmashironeko/ProxyCat 一款部署于云端或本地的代理池中间件,可将静态代理IP灵活运用成隧道IP,提供固定请求地址,一次部署终身使用 866 17 2024-10-06
90 dyang886/Game-Cheats-Manager Easily download and manage game cheats for your convenience 4693 17 2024-10-05
91 THUDM/CodeGeeX2 CodeGeeX2: A More Powerful Multilingual Code Generation Model 7620 17 2024-07-10
92 buaacyw/MeshAnything From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers" 1999 17 2024-08-05
93 FlagOpen/FlagEmbedding Retrieval and Retrieval-augmented LLMs 7068 16 2024-10-07
94 ymcui/Chinese-LLaMA-Alpaca-2 中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models) 7068 16 2024-09-23
95 marimo-team/marimo A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git. 6661 16 2024-10-09
96 yixiu001/serv00-login 同时支持serv00与ct8自动化批量保号,每3天自动登录一次面板,并且发送消息到Telegram 1455 15 2024-07-19
97 voicepaw/so-vits-svc-fork so-vits-svc fork with realtime support, improved interface and more features. 8728 15 2024-10-08
98 gradio-app/gradio Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! 32610 15 2024-10-09
99 THUDM/CogVLM a state-of-the-art-level open visual language model 多模态预训练模型 5928 15 2024-05-29
100 OptimalScale/LMFlow An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All. 8234 15 2024-10-05
101 6drf21e/ChatTTS_colab 🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。 1979 15 2024-07-02
102 BlinkDL/ChatRWKV ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. 9395 15 2024-07-11
103 InternLM/InternLM Official release of InternLM2.5 base and chat models. 1M context support 6299 14 2024-09-06
104 VisionRush/DeepFakeDefenders Image forgery recognition algorithm 529 13 2024-09-09
105 THUDM/CogVLM2 GPT4V-level open-source multi-modal model based on Llama3-8B 2040 13 2024-09-03
106 xxlong0/Wonder3D Single Image to 3D using Cross-Domain Diffusion for 3D Generation 4723 13 2024-08-29
107 open-mmlab/mmdetection OpenMMLab Detection Toolbox and Benchmark 29262 13 2024-08-21
108 jxxghp/MoviePilot NAS媒体库自动化管理工具 6335 13 2024-10-09
109 THUDM/CodeGeeX4 CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, function calling, repository-level Q&A and much more. 1306 13 2024-08-25
110 WZMIAOMIAO/deep-learning-for-image-processing deep learning for image processing including classification and object-detection etc. 22624 13 2024-07-25
111 xaoyaoo/PyWxDump 获取微信信息;读取数据库,本地查看聊天记录并导出为csv、html等格式用于AI训练,自动回复等。支持多账户信息获取,支持所有微信版本。 5470 13 2024-10-07
112 PeterH0323/Streamer-Sales Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭建后端 ... 2427 13 2024-09-29
113 HZJQF/help_tool 推理算法助手(降维打击) 539 13 2024-10-05
114 521xueweihan/GitHub520 😘 让你“爱”上 GitHub,解决访问时图裂、加载慢的问题。(无需安装) 21205 13 2024-10-09
115 TMElyralab/MuseTalk MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting 2546 13 2024-09-23
116 sMythicalBird/ZenlessZoneZero-Auto 绝区零 ZenlessZoneZero 零号空洞 自动战斗 自动化 图片分类 OCR识别 1164 13 2024-10-06
117 QwenLM/Qwen-VL The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. 4900 12 2024-08-07
118 llmware-ai/llmware Unified framework for building enterprise RAG pipelines with small, specialized models 4654 12 2024-10-07
119 YaoFANGUK/video-subtitle-remover 基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures. 4100 12 2024-10-09
120 fufankeji/MateGen Next-Generation Interactive Intelligent Programming Assistant 1033 12 2024-09-20
121 aigc-apps/sd-webui-EasyPhoto 📷 EasyPhoto Your Smart AI Photo Generator. 4929 12 2024-07-10
122 lipku/livetalking Real time interactive streaming digital human 3587 12 2024-10-05
123 barry-far/V2ray-Configs 🛰️✨ Free V2ray Configs , Updating Every 10 minutes. 4510 12 2024-10-09
124 TMElyralab/MuseV MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising 2377 12 2024-06-28
125 RayVentura/ShortGPT 🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation 5653 12 2024-09-19
126 MustardChef/WSABuilds Run Windows Subsystem For Android on your Windows 10 and Windows 11 PC using prebuilt binaries with Google Play Store (MindTheGapps) and/or Magisk or KernelSU (root solutions) built in. 7868 12 2024-08-16
127 taosdata/TDengine High-performance, scalable time-series database designed for Industrial IoT (IIoT) scenarios 23298 12 2024-10-09
128 moesnow/March7thAssistant 崩坏:星穹铁道全自动 三月七小助手 5024 12 2024-09-28
129 gusye1234/nano-graphrag A simple, easy-to-hack GraphRAG implementation 906 12 2024-10-01
130 baichuan-inc/Baichuan-7B A large-scale 7B pretraining language model developed by BaiChuan-Inc. 5670 12 2024-07-18
131 luosiallen/latent-consistency-model Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference 4311 12 2024-06-14
132 linyiLYi/street-fighter-ai This is an AI agent for Street Fighter II Champion Edition. 6319 11 2024-05-14
133 tyxsspa/AnyText Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing> 4250 11 2024-06-21
134 THUDM/CodeGeeX CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023) 8173 11 2024-08-13
135 BlinkDL/RWKV-LM RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, sa ... 12493 11 2024-09-23
136 X-PLUG/MobileAgent Mobile-Agent: The Powerful Mobile Device Operation Assistant Family 2792 11 2024-09-26
137 thuml/Time-Series-Library A Library for Advanced Deep Time Series Models. 6545 11 2024-09-29
138 aixcoder-plugin/aiXcoder-7B official repository of aiXcoder-7B Code Large Language Model 2194 11 2024-08-29
139 Langboat/Mengzi3 - 2032 10 2024-10-09
140 FujiwaraChoki/MoneyPrinterV2 Automate the process of making money online. 2369 10 2024-04-17
141 yihong0618/xiaogpt Play ChatGPT and other LLM with Xiaomi AI Speaker 6163 10 2024-09-22
142 yangjianxin1/Firefly Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型 5700 10 2024-09-19
143 ihmily/DouyinLiveRecorder 可循环值守和多人录制的直播录制软件,支持抖音、TikTok、快手、虎牙、斗鱼、B站、小红书、pandatv、afreecatv、flextv、popkontv、twitcasting、winktv、百度、微博、酷狗、花椒、Twitch、Acfun、CHZZK等平台直播录制 4535 10 2024-10-08
144 lanqian528/chat2api A service that can convert ChatGPT on the web to OpenAI API format. 1894 10 2024-10-07
145 Alpha-VLLM/Lumina-T2X Lumina-T2X is a unified framework for Text to Any Modality Generation 2040 10 2024-08-06
146 QwenLM/Qwen2-Audio The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud. 1132 10 2024-08-13
147 yl4579/StyleTTS2 StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models 4820 10 2024-08-10
148 reorx/awesome-chatgpt-api Curated list of apps and tools that not only use the new ChatGPT API, but also allow users to configure their own API keys, enabling free and on-demand usage of their own quota. 5919 10 2024-09-26
149 cubiq/ComfyUI_IPAdapter_plus - 3949 10 2024-09-13
150 ViggoZ/producthunt-daily-hot 自动生成每日Product Hunt热门产品中文榜单,基于GitHub Actions自动提交Markdown文件 604 10 2024-10-09
151 infrost/DeeplxFile 基于Deeplx和Playwright提供的简单易用,快速,免费,不限制文件大小,支持超长文本翻译,跨平台的文件翻译工具 / Easy-to-use, fast, free, unlimited file size and cross platform file translation tool based on Deeplx & Playwright that supports long tex ... 535 10 2024-09-09
152 xorbitsai/inference Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any ... 5027 10 2024-10-09
153 modelscope/modelscope ModelScope: bring the notion of Model-as-a-Service to life. 6886 9 2024-10-09
154 google-deepmind/penzai A JAX research toolkit for building, editing, and visualizing neural networks. 1654 9 2024-09-11
155 modelscope/ms-swift Use PEFT or Full-parameter to finetune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vis ... 3734 9 2024-10-09
156 THUDM/CogVideo text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023) 7928 9 2024-10-09
157 InternLM/lmdeploy LMDeploy is a toolkit for compressing, deploying, and serving LLMs. 4370 9 2024-10-09
158 hankcs/HanLP Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named Entity Recognition, Syntactic & Semantic Dependency Parsing, Document Classification 33649 9 2024-10-08
159 recommenders-team/recommenders Best Practices on Recommendation Systems 18925 9 2024-10-09
160 mli/autocut 用文本编辑器剪视频 6609 9 2024-10-05
161 PaddlePaddle/PaddleNLP 👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ... 12012 9 2024-10-09
162 modelscope/FunASR A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc. 6220 9 2024-10-09
163 xlang-ai/OpenAgents [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild 3942 9 2024-07-08
164 LinYuanovo/pikpak_auto_invite PikPak自动邀请程序,附带图像识别过验证码,支持本地及GitHub Actions云端运行 1058 9 2024-07-04
165 CVHub520/X-AnyLabeling Effortless data labeling with AI support from Segment Anything and other awesome models. 3858 8 2024-10-02
166 kohya-ss/sd-scripts - 5062 8 2024-10-07
167 om-ai-lab/OmAgent A multimodal agent framework for solving complex tasks [EMNLP'2024] 779 8 2024-10-09
168 jianchang512/stt Voice Recognition to Text Tool / 一个离线运行的本地语音识别转文字服务,输出json、srt字幕带时间戳、纯文字格式 2243 8 2024-10-07
169 tgbot-collection/YYeTsBot 🎬 人人影视 机器人和网站,包含人人影视全部资源以及众多网友的网盘分享 14183 8 2024-07-21
170 AutoGPTQ/AutoGPTQ An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. 4384 8 2024-09-28
171 THUDM/VisualGLM-6B Chinese and English multimodal conversational language model 多模态中英双语对话语言模型 4077 8 2024-08-23
172 pkuliyi2015/multidiffusion-upscaler-for-automatic1111 Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0 4731 8 2024-08-07
173 open-compass/opencompass OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets. 3853 8 2024-10-09
174 Plachtaa/VITS-fast-fine-tuning This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion 4709 8 2024-07-03
175 MzeroMiko/VMamba VMamba: Visual State Space Models,code is based on mamba 2078 8 2024-09-25
176 EstrellaXD/Auto_Bangumi AutoBangumi - 全自动追番工具 6772 8 2024-09-26
177 XPixelGroup/DiffBIR Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior 3299 8 2024-07-03
178 hitsz-ids/synthetic-data-generator SDG is a specialized framework designed to generate high-quality structured tabular data. 3265 8 2024-10-07
179 sml2h3/ddddocr 带带弟弟 通用验证码识别OCR pypi版 9772 8 2024-07-25
180 madawei2699/myGPTReader A community-driven way to read and chat with AI bots - powered by chatGPT. 4426 8 2024-04-25
181 QiuChenly/InjectLib 你知道我要说什么 945 8 2024-10-08
182 ok-oldking/ok-wuthering-waves 鸣潮 后台自动战斗 自动刷声骸上锁合成 Automation for Wuthering Waves 1032 8 2024-10-08
183 z1069614715/objectdetection_script 一些关于目标检测的脚本的改进思路代码,详细请看readme.md 5172 8 2024-10-07
184 InternLM/xtuner An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...) 3821 8 2024-09-29
185 Evil0ctal/Douyin_TikTok_Download_API 🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用,在线批量解析及下载。 8876 8 2024-09-26
186 QwenLM/Qwen-Agent Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. 3258 8 2024-10-08
187 fxsjy/jieba 结巴中文分词 33170 8 2024-08-21
188 DennisThink/awesome_twitter_CN 值得关注的中文twitter用户 616 7 2024-09-26
189 continue-revolution/sd-webui-animatediff AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI 3067 7 2024-09-22
190 aigc-apps/EasyAnimate 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion 1221 7 2024-08-22
191 xingpingcn/enhanced-FaaS-in-China 提升部署在cloudflare、vercel或netlify的网页在中国的访问速度和稳定性 Improve the access speed and stability in China of web pages hosted on cloudflare, vercel or netlify by merely changing your CNAME record. cf优选域名 cf优选ip ... 1543 7 2024-10-09
192 malinkang/weread2notion-pro - 2048 7 2024-10-09
193 modelscope/FunClip Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated. 3397 7 2024-08-22
194 bilibili/Index-1.9B A SOTA lightweight multilingual LLM 881 7 2024-09-20
195 InternLM/InternLM-XComposer InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output 2473 7 2024-08-30
196 DachunKai/EvTexture [ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution 994 7 2024-09-17
197 sqlmapproject/sqlmap Automatic SQL injection and database takeover tool 32201 7 2024-09-25
198 DeepInsight-AI/DeepBI LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI. 2329 7 2024-10-08
199 JaveleyQAQ/WeChatOpenDevTools-Python WeChatOpenDevTool 微信小程序强制开启开发者工具 1943 7 2024-09-15
200 ParthJadhav/Tkinter-Designer An easy and fast way to create a Python GUI 🐍 9065 7 2024-08-21

↓ -- 感谢读者 -- ↓

榜单持续更新,如有帮助请加星收藏,方便后续浏览,感谢你的支持!