English | 中文
Collect practical AI repos, tools, websites, papers and tutorials on AI.
Translated from ChatGPT, picture from Midjourney.
updated repos and stars every 2 hours and re-ranking automatically.
No. |
Repos |
Description |
---|---|---|
1 | public-apis/public-apis |
A collective list of free APIs |
2 | kamranahmedse/developer-roadmap |
Interactive roadmaps, guides and other educational content to help developers grow in their careers. |
3 | vinta/awesome-python |
A curated list of awesome Python frameworks, libraries, software and resources |
4 | tensorflow/tensorflow |
An Open Source Machine Learning Framework for Everyone |
5 | practical-tutorials/project-based-learning |
Curated list of project-based tutorials |
6 | Significant-Gravitas/AutoGPT |
An experimental open-source attempt to make GPT-4 fully autonomous. |
7 | AUTOMATIC1111/stable-diffusion-webui |
Stable Diffusion web UI |
8 | huggingface/transformers |
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. |
9 | justjavac/free-programming-books-zh_CN |
📚 免费的计算机编程类中文书籍,欢迎投稿 |
10 | f/awesome-chatgpt-prompts |
This repo includes ChatGPT prompt curation to use ChatGPT better. |
11 | langchain-ai/langchain |
⚡ Building applications with LLMs through composability ⚡ |
12 | pytorch/pytorch |
Tensors and Dynamic neural networks in Python with strong GPU acceleration |
13 | ChatGPTNextWeb/ChatGPT-Next-Web |
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。 |
14 | home-assistant/core |
🏡 Open source home automation that puts local control and privacy first. |
15 | supabase/supabase |
The open source Firebase alternative. |
16 | ollama/ollama |
Get up and running with Llama 2, Mistral, Gemma, and other large language models. |
17 | nomic-ai/gpt4all |
gpt4all: an ecosystem of open-source chatbots trained on a massive collection of clean assistant data including code, stories and dialogue |
18 | fighting41love/funNLP |
The Most Powerful NLP-Weapon Arsenal |
19 | bregman-arie/devops-exercises |
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions |
20 | josephmisiti/awesome-machine-learning |
A curated list of awesome Machine Learning frameworks, libraries and software. |
21 | twitter/the-algorithm |
Source code for Twitter's Recommendation Algorithm |
22 | openai/whisper |
Robust Speech Recognition via Large-Scale Weak Supervision |
23 | keras-team/keras |
Deep Learning for humans |
24 | apache/superset |
Apache Superset is a Data Visualization and Data Exploration Platform |
25 | 3b1b/manim |
Animation engine for explanatory math videos |
26 | scikit-learn/scikit-learn |
scikit-learn: machine learning in Python |
27 | ggerganov/llama.cpp |
Port of Facebook's LLaMA model in C/C++ |
28 | xtekky/gpt4free |
decentralizing the Ai Industry, free gpt-4/3.5 scripts through several reverse engineered API's ( poe.com, phind.com, chat.openai.com etc...) |
29 | binary-husky/gpt_academic |
Academic Optimization of GPT |
30 | d2l-ai/d2l-zh |
Targeting Chinese readers, functional and open for discussion. The Chinese and English versions are used for teaching in over 400 universities across more than 60 countries |
31 | openai/openai-cookbook |
Examples and guides for using the OpenAI API |
32 | binhnguyennus/awesome-scalability |
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems |
33 | meta-llama/llama |
Inference code for Llama models |
34 | imartinez/privateGPT |
Interact with your documents using the power of GPT, 100% privately, no data leaks |
35 | ageitgey/face_recognition |
The world's simplest facial recognition api for Python and the command line |
36 | CorentinJ/Real-Time-Voice-Cloning |
Clone a voice in 5 seconds to generate arbitrary speech in real-time |
37 | gpt-engineer-org/gpt-engineer |
Specify what you want it to build, the AI asks for clarification, and then builds it. |
38 | PlexPt/awesome-chatgpt-prompts-zh |
ChatGPT Chinese Training Guide. Guidelines for various scenarios. Learn how to make it listen to you |
39 | abi/screenshot-to-code |
Drop in a screenshot and convert it to clean HTML/Tailwind/JS code |
40 | OpenInterpreter/open-interpreter |
A natural language interface for computers |
41 | labmlai/annotated_deep_learning_paper_implementations |
🧑🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠 |
42 | xai-org/grok-1 |
Grok open release |
⭐ 43 | commaai/openpilot |
openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for over 200 supported car makes and models. |
44 | lencx/ChatGPT |
🔮 ChatGPT Desktop Application (Mac, Windows and Linux) |
45 | v2ray/v2ray-core |
A platform for building proxies to bypass network restrictions. |
46 | facebookresearch/segment-anything |
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model. |
47 | dair-ai/Prompt-Engineering-Guide |
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering |
48 | microsoft/generative-ai-for-beginners |
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/ |
49 | Avik-Jain/100-Days-Of-ML-Code |
100 Days of ML Coding |
50 | n8n-io/n8n |
Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services. |
51 | geekan/MetaGPT |
The Multi-Agent Meta Programming Framework: Given one line Requirement, return PRD, Design, Tasks, Repo |
52 | THUDM/ChatGLM-6B |
ChatGLM-6B: An Open Bilingual Dialogue Language Model |
53 | PaddlePaddle/PaddleOCR |
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices) |
54 | fastlane/fastlane |
🚀 The easiest way to automate building and releasing your iOS and Android apps |
55 | hpcaitech/ColossalAI |
Making large AI models cheaper, faster and more accessible |
56 | psf/black |
The uncompromising Python code formatter |
57 | oobabooga/text-generation-webui |
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, OPT, and GALACTICA. |
58 | LAION-AI/Open-Assistant |
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so. |
59 | Stability-AI/stablediffusion |
High-Resolution Image Synthesis with Latent Diffusion Models |
60 | lllyasviel/Fooocus |
Focus on prompting and generating |
61 | XingangPan/DragGAN |
Code for DragGAN (SIGGRAPH 2023) |
62 | mingrammer/diagrams |
🎨 Diagram as Code for prototyping cloud system architectures |
63 | TencentARC/GFPGAN |
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration. |
64 | apache/airflow |
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows |
65 | microsoft/TaskMatrix |
Talking, Drawing and Editing with Visual Foundation Models |
66 | lm-sys/FastChat |
An open platform for training, serving, and evaluating large languages. Release repo for Vicuna and FastChat-T5. |
67 | comfyanonymous/ComfyUI |
A powerful and modular stable diffusion GUI with a graph/nodes interface. |
68 | babysor/MockingBird |
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time |
69 | lapce/lapce |
Lightning-fast and Powerful Code Editor written in Rust |
70 | QuivrHQ/quivr |
Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation. |
71 | google-research/google-research |
Google Research |
72 | microsoft/DeepSpeed |
A deep learning optimization library that makes distributed training and inference easy, efficient, and effective |
73 | suno-ai/bark |
🔊 Text-Prompted Generative Audio Model |
74 | Asabeneh/30-Days-Of-Python |
30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may take more than100 days, follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw |
75 | karpathy/nanoGPT |
The simplest, fastest repository for training/finetuning medium-sized GPTs |
76 | streamlit/streamlit |
Streamlit — A faster way to build and share data apps. |
77 | ggerganov/whisper.cpp |
Port of OpenAI's Whisper model in C/C++ |
78 | ray-project/ray |
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads. |
79 | Chanzhaoyu/chatgpt-web |
A demonstration website built with Express and Vue3 called ChatGPT |
80 | lobehub/lobe-chat |
🤖 Lobe Chat - an open-source, extensible (Function Calling), high-performance chatbot framework. It supports one-click free deployment of your private ChatGPT/LLM web application. |
81 | coqui-ai/TTS |
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production |
82 | mlabonne/llm-course |
Course with a roadmap and notebooks to get into Large Language Models (LLMs). |
83 | facebookresearch/fairseq |
Facebook AI Research Sequence-to-Sequence Toolkit written in Python. |
84 | karanpratapsingh/system-design |
Learn how to design systems at scale and prepare for system design interviews |
85 | gradio-app/gradio |
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! |
86 | TheAlgorithms/C-Plus-Plus |
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes. |
87 | yunjey/pytorch-tutorial |
PyTorch Tutorial for Deep Learning Researchers |
88 | tatsu-lab/stanford_alpaca |
Code and documentation to train Stanford's Alpaca models, and generate the data. |
89 | facebookresearch/detectron2 |
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks. |
90 | Pythagora-io/gpt-pilot |
PoC for a scalable dev tool that writes entire apps from scratch while the developer oversees the implementation |
91 | langgenius/dify |
One API for plugins and datasets, one interface for prompt engineering and visual operation, all for creating powerful AI applications. |
92 | google/jax |
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more |
93 | lllyasviel/ControlNet |
Let us control diffusion models! |
94 | acheong08/ChatGPT |
Reverse engineered ChatGPT API |
95 | chatchat-space/Langchain-Chatchat |
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain |
96 | v2fly/v2ray-core |
A platform for building proxies to bypass network restrictions. |
97 | milvus-io/milvus |
A cloud-native vector database, storage for next generation AI applications |
98 | Lightning-AI/pytorch-lightning |
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes. |
99 | upscayl/upscayl |
🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy. |
100 | JushBJJ/Mr.-Ranedeer-AI-Tutor |
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences. |
101 | linexjlin/GPTs |
leaked prompts of GPTs |
102 | pola-rs/polars |
Fast multi-threaded, hybrid-out-of-core query engine focussing on DataFrame front-ends |
103 | mckaywrigley/chatbot-ui |
The open-source AI chat interface for everyone. |
104 | xinntao/Real-ESRGAN |
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration. |
105 | OpenBB-finance/OpenBBTerminal |
Investment Research for Everyone, Anywhere. |
106 | eugeneyan/applied-ml |
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production. |
107 | freqtrade/freqtrade |
Free, open source crypto trading bot |
108 | microsoft/autogen |
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ |
109 | google/mediapipe |
Cross-platform, customizable ML solutions for live and streaming media. |
110 | zhayujie/chatgpt-on-wechat |
Wechat robot based on ChatGPT, which uses OpenAI api and itchat library |
111 | google-research/tuning_playbook |
A playbook for systematically maximizing the performance of deep learning models. |
112 | s0md3v/roop |
one-click deepfake (face swap) |
113 | Vision-CAIR/MiniGPT-4 |
Enhancing Vision-language Understanding with Advanced Large Language Models |
114 | FlowiseAI/Flowise |
Drag & drop UI to build your customized LLM flow using LangchainJS |
115 | myshell-ai/OpenVoice |
Instant voice cloning by MyShell |
116 | mli/paper-reading |
Classic Deep Learning and In-Depth Reading of New Papers Paragraph by Paragraph |
117 | tinygrad/tinygrad |
You like pytorch? You like micrograd? You love tinygrad! ❤️ |
118 | RVC-Boss/GPT-SoVITS |
1 min voice data can also be used to train a good TTS model! (few shot voice cloning) |
119 | svc-develop-team/so-vits-svc |
SoftVC VITS Singing Voice Conversion |
120 | academic/awesome-datascience |
📝 An awesome Data Science repository to learn and apply for real world problems. |
121 | iperov/DeepFaceLive |
Real-time face swap for PC streaming or video calls |
122 | ultralytics/ultralytics |
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite |
123 | OpenBMB/ChatDev |
Create Customized Software using Natural Language Idea (through Multi-Agent Collaboration) |
124 | apache/flink |
Apache Flink |
125 | microsoft/JARVIS |
a system to connect LLMs with ML community |
126 | yetone/openai-translator |
Browser extension and cross-platform desktop application for translation based on ChatGPT API |
127 | DataTalksClub/data-engineering-zoomcamp |
Free Data Engineering course! |
128 | mouredev/Hello-Python |
Curso para aprender el lenguaje de programación Python desde cero y para principiantes. 75 clases, 37 horas en vídeo, código, proyectos y grupo de chat. Fundamentos, frontend, backend, testing, IA... |
129 | Stability-AI/generative-models |
Generative Models by Stability AI |
130 | bazelbuild/bazel |
a fast, scalable, multi-language and extensible build system |
131 | nrwl/nx |
Smart Monorepos · Fast CI |
132 | hiyouga/LLaMA-Factory |
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM) |
133 | modularml/mojo |
The Mojo Programming Language |
134 | hiroi-sora/Umi-OCR |
OCR图片转文字识别软件,完全离线。截屏/批量导入图片,支持多国语言、合并段落、竖排文字。可排除水印区域,提取干净的文本。基于 PaddleOCR 。 |
135 | invoke-ai/InvokeAI |
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products. |
136 | mindsdb/mindsdb |
The platform for customizing AI from enterprise data |
137 | deepinsight/insightface |
State-of-the-art 2D and 3D Face Analysis Project |
138 | openai/chatgpt-retrieval-plugin |
Plugins are chat extensions designed specifically for language models like ChatGPT, enabling them to access up-to-date information, run computations, or interact with third-party services in response to a user's request. |
139 | opentofu/opentofu |
OpenTofu lets you declaratively manage your cloud infrastructure. |
140 | getcursor/cursor |
An editor made for programming with AI |
141 | mudler/LocalAI |
🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. It allows to generate Text, Audio, Video, Images. Also with voice cloning capabilities. |
142 | openai/openai-python |
The OpenAI Python library provides convenient access to the OpenAI API from applications written in the Python language. |
143 | grpc/grpc-go |
The Go language implementation of gRPC. HTTP/2 based RPC |
144 | meta-llama/llama3 |
The official Meta Llama 3 GitHub site |
145 | facebookresearch/audiocraft |
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning. |
146 | open-webui/open-webui |
User-friendly WebUI for LLMs (Formerly Ollama WebUI) |
147 | RVC-Project/Retrieval-based-Voice-Conversion-WebUI |
Voice data <= 10 mins can also be used to train a good VC model! |
148 | yoheinakajima/babyagi |
uses OpenAI and Pinecone APIs to create, prioritize, and execute tasks, This is a pared-down version of the original Task-Driven Autonomous Agent |
149 | vllm-project/vllm |
A high-throughput and memory-efficient inference and serving engine for LLMs |
150 | PromtEngineer/localGPT |
Chat with your documents on your local device using GPT models. No data leaves your device and 100% private. |
151 | clash-verge-rev/clash-verge-rev |
Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux) |
152 | karpathy/minGPT |
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training |
153 | Bin-Huang/chatbox |
A desktop app for GPT-4 / GPT-3.5 (OpenAI API) that supports Windows, Mac & Linux |
154 | karpathy/llm.c |
LLM training in simple, raw C/CUDA |
155 | microsoft/unilm |
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities |
156 | microsoft/semantic-kernel |
Integrate cutting-edge LLM technology quickly and easily into your apps |
157 | tloen/alpaca-lora |
Instruct-tune LLaMA on consumer hardware |
158 | qdrant/qdrant |
Qdrant - Vector Database for the next generation of AI applications. Also available in the cloud https://cloud.qdrant.io/ |
159 | janhq/jan |
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer |
160 | BuilderIO/gpt-crawler |
Crawl a site to generate knowledge files to create your own custom GPT from a URL |
161 | logspace-ai/langflow |
⛓️ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity. |
162 | ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code |
500 AI Machine learning Deep learning Computer vision NLP Projects with code |
163 | guidance-ai/guidance |
A guidance language for controlling large language models. |
164 | ymcui/Chinese-LLaMA-Alpaca |
Chinese LLaMA & Alpaca LLMs |
165 | TabbyML/tabby |
Self-hosted AI coding assistant |
166 | mlflow/mlflow |
Open source platform for the machine learning lifecycle |
167 | Sanster/lama-cleaner |
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures. |
168 | mlc-ai/mlc-llm |
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices. |
169 | stitionai/devika |
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI. |
170 | tree-sitter/tree-sitter |
An incremental parsing system for programming tools |
171 | haotian-liu/LLaVA |
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities. |
172 | chatanywhere/GPT_API_free |
Free ChatGPT API Key, Free ChatGPT API, supports GPT-4 API (free), ChatGPT offers a free domestic forwarding API that allows direct connections without the need for a proxy. It can be used in conjunction with software/plugins like ChatBox, significantly reducing interface usage costs. Enjoy unlimited and unrestricted chatting within China |
173 | xx025/carrot |
Free ChatGPT Site List |
174 | yuliskov/SmartTube |
SmartTube - an advanced player for set-top boxes and tvs running Android OS |
175 | LiLittleCat/awesome-free-chatgpt |
🆓免费的 ChatGPT 镜像网站列表,持续更新。List of free ChatGPT mirror sites, continuously updated. |
176 | emilwallner/Screenshot-to-code |
A neural network that transforms a design mock-up into a static website. |
177 | apple/ml-stable-diffusion |
Stable Diffusion with Core ML on Apple Silicon |
178 | microsoft/LightGBM |
A fast, distributed, high-performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. |
179 | Mikubill/sd-webui-controlnet |
WebUI extension for ControlNet |
180 | renovatebot/renovate |
Universal dependency update tool that fits into your workflows. |
181 | rasbt/LLMs-from-scratch |
Implementing a ChatGPT-like LLM from scratch, step by step |
182 | rlabbe/Kalman-and-Bayesian-Filters-in-Python |
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions. |
183 | Stability-AI/StableLM |
Stability AI Language Models |
184 | transitive-bullshit/chatgpt-api |
Node.js client for the official ChatGPT API. |
185 | THUDM/ChatGLM2-6B |
ChatGLM2-6B: An Open Bilingual Chat LLM |
186 | joonspk-research/generative_agents |
Generative Agents: Interactive Simulacra of Human Behavior |
187 | Mozilla-Ocho/llamafile |
Distribute and run LLMs with a single file. |
188 | meta-llama/codellama |
Inference code for CodeLlama models |
189 | blakeblackshear/frigate |
NVR with realtime local object detection for IP cameras |
190 | pybind/pybind11 |
Seamless operability between C++11 and Python |
191 | w-okada/voice-changer |
リアルタイムボイスチェンジャー Realtime Voice Changer |
192 | GaiZhenbiao/ChuanhuChatGPT |
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI. |
193 | facefusion/facefusion |
Next generation face swapper and enhancer |
194 | kenjihiranabe/The-Art-of-Linear-Algebra |
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone" |
195 | mayooear/gpt4-pdf-chatbot-langchain |
GPT4 & LangChain Chatbot for large PDF docs |
196 | ddbourgin/numpy-ml |
Machine learning, in numpy |
197 | TransformerOptimus/SuperAGI |
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably. |
198 | ml-explore/mlx |
MLX: An array framework for Apple silicon |
199 | microsoft/Bringing-Old-Photos-Back-to-Life |
Bringing Old Photo Back to Life (CVPR 2020 oral) |
200 | dair-ai/ML-YouTube-Courses |
📺 Discover the latest machine learning / AI courses on YouTube. |
201 | fauxpilot/fauxpilot |
An open-source GitHub Copilot server |
202 | sunner/ChatALL |
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vincuna, Claude, ChatGLM, MOSS, iFlytek Spark, ERNIE and more, discover the best answers |
203 | microsoft/qlib |
Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, market dynamics modeling, and RL. |
roboflow/supervision |
We write your reusable computer vision tools. 💜 | |
arc53/DocsGPT |
GPT-powered chat for documentation, chat with your documents | |
206 | songquanpeng/one-api |
OpenAI 接口管理 & 分发系统,支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元,可用于二次分发管理 key,仅单可执行文件,已打包好 Docker 镜像,一键部署,开箱即用. OpenAI key management & redistribution system, using a single API for all LLMs, and features an English UI. |
207 | unifyai/ivy |
Unified AI |
208 | alibaba/lowcode-engine |
An enterprise-class low-code technology stack with scale-out design / 一套面向扩展设计的企业级低代码技术体系 |
209 | HumanAIGC/AnimateAnyone |
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation |
210 | openai/evals |
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks. |
211 | joaomdmoura/crewAI |
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks. |
212 | deepset-ai/haystack |
🔍 Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs (GPT-4, ChatGPT and alike). Haystack offers production-ready tools to quickly build complex question answering, semantic search, text generation applications, and more. |
213 | chat2db/Chat2DB |
An intelligent and versatile general-purpose SQL client and reporting tool for databases which integrates ChatGPT capabilities |
214 | xcanwin/KeepChatGPT |
Using ChatGPT is more efficient and smoother, perfectly solving ChatGPT network errors. No longer do you need to frequently refresh the webpage, saving over 10 unnecessary steps |
215 | IDEA-Research/Grounded-Segment-Anything |
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect, Segment and Generate Anything with Image and Text Inputs |
216 | wong2/chatgpt-google-extension |
A browser extension that enhances search engines with ChatGPT, this repos will not be updated from 2023-02-20 |
217 | labring/FastGPT |
A platform that uses the OpenAI API to quickly build an AI knowledge base, supporting many-to-many relationships. |
218 | fuergaosi233/wechat-chatgpt |
Use ChatGPT On Wechat via wechaty |
219 | Mintplex-Labs/anything-llm |
A full-stack application that turns any documents into an intelligent chatbot with a sleek UI and easier way to manage your workspaces. |
220 | microsoft/onnxruntime |
ONNX Runtime: cross-platform, high-performance ML inferencing and training accelerator |
221 | chroma-core/chroma |
the AI-native open-source embedding database |
222 | THUDM/ChatGLM3 |
ChatGLM3 series: Open Bilingual Chat LLMs |
223 | neonbjb/tortoise-tts |
A multi-voice TTS system trained with an emphasis on quality |
224 | stefan-jansen/machine-learning-for-trading |
Code for Machine Learning for Algorithmic Trading, 2nd edition. |
225 | OpenLMLab/MOSS |
An open-source tool-augmented conversational language model from Fudan University |
226 | adobe/react-spectrum |
A collection of libraries and tools that help you build adaptive, accessible, and robust user experiences. |
227 | LlamaFamily/Llama-Chinese |
Llama Chinese Community, the best Chinese Llama large model, fully open source and commercially available |
228 | BlinkDL/RWKV-LM |
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it combines the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding. |
229 | willwulfken/MidJourney-Styles-and-Keywords-Reference |
A reference containing Styles and Keywords that you can use with MidJourney AI |
230 | smol-ai/developer |
With 100k context windows on the way, it's now feasible for every dev to have their own smol developer |
231 | graphdeco-inria/gaussian-splatting |
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering" |
232 | AI4Finance-Foundation/FinGPT |
Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We'll soon release the trained model. |
233 | ashishps1/awesome-system-design-resources |
This repository contains System Design resources which are useful while preparing for interviews and learning Distributed Systems |
234 | continuedev/continue |
⏩ the open-source copilot chat for software development—bring the power of ChatGPT to VS Code |
danny-avila/LibreChat |
Enhanced ChatGPT Clone: Features OpenAI, GPT-4 Vision, Bing, Anthropic, OpenRouter, Google Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development | |
openai/shap-e |
Generate 3D objects conditioned on text or images | |
237 | QwenLM/Qwen-7B |
The official repo of Qwen-7B (通义千问-7B) chat & pretrained large language model proposed by Alibaba Cloud. |
⭐ 238 | harry0703/MoneyPrinterTurbo |
Generate short videos with one click using a large model |
239 | stanfordnlp/dspy |
Stanford DSPy: The framework for programming—not prompting—foundation models |
240 | plasma-umass/scalene |
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals |
241 | Koenkk/zigbee2mqtt |
Zigbee 🐝 to MQTT bridge 🌉, get rid of your proprietary Zigbee bridges 🔨 |
242 | openai/triton |
Development repository for the Triton language and compiler |
243 | eosphoros-ai/DB-GPT |
Revolutionizing Database Interactions with Private LLM Technology |
244 | gventuri/pandas-ai |
Pandas AI is a Python library that integrates generative artificial intelligence capabilities into Pandas, making dataframes conversational |
245 | steven-tey/novel |
Notion-style WYSIWYG editor with AI-powered autocompletions |
246 | Dao-AILab/flash-attention |
Fast and memory-efficient exact attention |
247 | lukas-blecher/LaTeX-OCR |
pix2tex: Using a ViT to convert images of equations into LaTeX code. |
248 | databrickslabs/dolly |
A large language model trained on the Databricks Machine Learning Platform |
HqWu-HITCS/Awesome-Chinese-LLM |
Organizing smaller, cost-effective, privately deployable open-source Chinese language models, including related datasets and tutorials | |
illacloud/illa-builder |
Create AI-Driven Apps like Assembling Blocks | |
251 | kubeshark/kubeshark |
The API traffic analyzer for Kubernetes providing real-time K8s protocol-level visibility, capturing and monitoring all traffic and payloads going in, out and across containers, pods, nodes and clusters. Inspired by Wireshark, purposely built for Kubernetes |
252 | OpenTalker/SadTalker |
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation |
253 | official-stockfish/Stockfish |
UCI chess engine |
254 | h2oai/h2ogpt |
Come join the movement to make the world's best open source GPT led by H2O.ai - 100% private chat and document search, no data leaks, Apache 2.0 |
255 | kgrzybek/modular-monolith-with-ddd |
Full Modular Monolith application with Domain-Driven Design approach. |
256 | getumbrel/llama-gpt |
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. |
257 | princeton-nlp/SWE-agent |
SWE-agent: Agent Computer Interfaces Enable Software Engineering Language Models |
258 | facebookresearch/seamless_communication |
Foundational Models for State-of-the-Art Speech and Text Translation |
259 | PKU-YuanGroup/Open-Sora-Plan |
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project. |
260 | eugeneyan/open-llms |
A list of open LLMs available for commercial use. |
261 | facebookresearch/AnimatedDrawings |
Code to accompany "A Method for Animating Children's Drawings of the Human Figure" |
262 | ShishirPatil/gorilla |
Gorilla: An API store for LLMs |
263 | bytebase/bytebase |
World's most advanced database DevOps and CI/CD for Developer, DBA and Platform Engineering teams. The GitLab/GitHub for database DevOps. |
264 | chidiwilliams/buzz |
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper. |
265 | InstantID/InstantID |
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥 |
266 | paul-gauthier/aider |
aider is GPT powered coding in your terminal |
267 | danielmiessler/fabric |
fabric is an open-source framework for augmenting humans using AI. |
268 | stas00/ml-engineering |
Machine Learning Engineering Guides and Tools |
269 | magic-research/magic-animate |
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model |
⭐ 270 | assafelovic/gpt-researcher |
GPT based autonomous agent that does online comprehensive research on any given topic |
271 | ggerganov/ggml |
Tensor library for machine learning |
272 | AIGC-Audio/AudioGPT |
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head |
273 | Z3Prover/z3 |
The Z3 Theorem Prover |
274 | cpacker/MemGPT |
Teaching LLMs memory management for unbounded context 📚🦙 |
275 | state-spaces/mamba |
Mamba: Linear-Time Sequence Modeling with Selective State Spaces |
KindXiaoming/pykan |
Kolmogorov Arnold Networks | |
owainlewis/awesome-artificial-intelligence |
A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers. | |
278 | mlc-ai/web-llm |
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support. |
279 | meta-llama/llama-recipes |
Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger |
280 | chathub-dev/chathub |
All-in-one chatbot client |
281 | artidoro/qlora |
QLoRA: Efficient Finetuning of Quantized LLMs |
282 | plandex-ai/plandex |
An AI coding engine for complex tasks |
283 | kedro-org/kedro |
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular. |
284 | dice2o/BingGPT |
Desktop application of new Bing's AI-powered chat (Windows, macOS and Linux) |
285 | Rudrabha/Wav2Lip |
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. |
286 | danswer-ai/danswer |
Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc. |
287 | BlinkDL/ChatRWKV |
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. |
288 | BradyFU/Awesome-Multimodal-Large-Language-Models |
Latest Papers and Datasets on Multimodal Large Language Models |
289 | netease-youdao/QAnything |
Question and Answer based on Anything. |
290 | DataTalksClub/mlops-zoomcamp |
Free MLOps course from DataTalks.Club |
291 | m-bain/whisperX |
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) |
292 | guillaumekln/faster-whisper |
Faster Whisper transcription with CTranslate2 |
unslothai/unsloth |
5X faster 50% less memory LLM finetuning | |
bleedline/aimoneyhunter |
AI Side Hustle Money Mega Collection: Teaching You How to Utilize AI for Various Side Projects to Earn Extra Income. | |
295 | togethercomputer/OpenChatKit |
OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots for various applications |
296 | cumulo-autumn/StreamDiffusion |
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation |
297 | RUCAIBox/LLMSurvey |
A collection of papers and resources related to Large Language Models. |
298 | dair-ai/ML-Papers-of-the-Week |
🔥Highlighting the top ML papers every week. |
299 | bentoml/OpenLLM |
An open platform for operating large language models (LLMs) in production. Fine-tune, serve, deploy, and monitor any LLMs with ease. |
300 | salesforce/LAVIS |
LAVIS - A One-stop Library for Language-Vision Intelligence |
301 | adams549659584/go-proxy-bingai |
A Microsoft New Bing demo site built with Vue3 and Go, providing a consistent UI experience, supporting ChatGPT prompts, and accessible within China |
302 | Avaiga/taipy |
Turns Data and AI algorithms into production-ready web applications in no time. |
303 | BerriAI/litellm |
Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs) |
304 | NVIDIA/Megatron-LM |
Ongoing research training transformer models at scale |
305 | mistralai/mistral-src |
Reference implementation of Mistral AI 7B v0.1 model. |
306 | bigscience-workshop/petals |
🌸 Run large language models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading |
307 | DataEngineer-io/data-engineer-handbook |
This is a repo with links to everything you'd ever want to learn about data engineering |
308 | BloopAI/bloop |
A fast code search engine written in Rust |
309 | nerfstudio-project/nerfstudio |
A collaboration friendly studio for NeRFs |
310 | embedchain/embedchain |
Framework to easily create LLM powered bots over any dataset. |
311 | Stability-AI/StableStudio |
Community interface for generative AI |
312 | modelscope/facechain |
FaceChain is a deep-learning toolchain for generating your Digital-Twin. |
313 | manticoresoftware/manticoresearch |
Easy to use open source fast database for search |
314 | voicepaw/so-vits-svc-fork |
so-vits-svc fork with realtime support, improved interface and more features. |
315 | DataTalksClub/machine-learning-zoomcamp |
The code from the Machine Learning Bookcamp book and a free course based on the book |
316 | TheR1D/shell_gpt |
A command-line productivity tool powered by ChatGPT, will help you accomplish your tasks faster and more efficiently |
317 | wandb/wandb |
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API. |
318 | sashabaranov/go-openai |
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go |
319 | microsoft/promptflow |
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring. |
320 | huggingface/trl |
Train transformer language models with reinforcement learning. |
321 | gorse-io/gorse |
Gorse open source recommender system engine |
322 | facebookresearch/nougat |
Implementation of Nougat Neural Optical Understanding for Academic Documents |
323 | acheong08/EdgeGPT |
Reverse engineered API of Microsoft's Bing Chat |
324 | TheRamU/Fay |
Fay is a complete open source project that includes Fay controller and numeral models, which can be used in different applications such as virtual hosts, live promotion, numeral human interaction and so on |
325 | mshumer/gpt-prompt-engineer |
|
326 | WongKinYiu/yolov9 |
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information |
327 | OptimalScale/LMFlow |
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All. |
328 | brexhq/prompt-engineering |
Tips and tricks for working with Large Language Models like OpenAI's GPT-4. |
329 | karpathy/minbpe |
Minimal, clean, code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. |
330 | huggingface/text-generation-inference |
Large Language Model Text Generation Inference |
331 | anse-app/chatgpt-demo |
A demo repo based on OpenAI API (gpt-3.5-turbo) |
332 | facebookresearch/dinov2 |
PyTorch code and models for the DINOv2 self-supervised learning method. |
333 | espnet/espnet |
End-to-End Speech Processing Toolkit |
334 | facebookresearch/ImageBind |
ImageBind One Embedding Space to Bind Them All |
335 | microsoft/TypeChat |
TypeChat is a library that makes it easy to build natural language interfaces using types. |
⭐ 336 | NielsRogge/Transformers-Tutorials |
This repository contains demos I made with the Transformers library by HuggingFace. |
337 | vercel-labs/ai |
Build AI-powered applications with React, Svelte, and Vue |
338 | ashawkey/stable-dreamfusion |
A pytorch implementation of text-to-3D dreamfusion, powered by stable diffusion. |
339 | kedacore/keda |
KEDA is a Kubernetes-based Event Driven Autoscaling component. It provides event driven scale for any container running in Kubernetes |
340 | kroma-network/tachyon |
Modular ZK(Zero Knowledge) backend accelerated by GPU |
341 | vosen/ZLUDA |
CUDA on AMD GPUs |
342 | Visualize-ML/Book4_Power-of-Matrix |
Book_4 'Power of Matrix' |
343 | nashsu/FreeAskInternet |
FreeAskInternet is a completely free, private and locally running search aggregator & answer generate using LLM, without GPU needed. The user can ask a question and the system will make a multi engine search and combine the search result to the ChatGPT3.5 LLM and generate the answer based on search results. |
344 | OpenBMB/XAgent |
An Autonomous LLM Agent for Complex Task Solving |
345 | deep-floyd/IF |
A novel state-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding |
346 | xiangsx/gpt4free-ts |
Providing a free OpenAI GPT-4 API ! This is a replication project for the typescript version of xtekky/gpt4free |
347 | LouisShark/chatgpt_system_prompt |
store all agent's system prompt |
348 | o3de/o3de |
Open 3D Engine (O3DE) is an Apache 2.0-licensed multi-platform 3D engine that enables developers and content creators to build AAA games, cinema-quality 3D worlds, and high-fidelity simulations without any fees or commercial obligations. |
349 | google/magika |
Detect file content types with deep learning |
350 | ahmedbahaaeldin/From-0-to-Research-Scientist-resources-guide |
Detailed and tailored guide for undergraduate students or anybody want to dig deep into the field of AI with solid foundation. |
351 | zyronon/douyin |
Vue3 + Pinia + Vite5 仿抖音,Vue 在移动端的最佳实践 . Imitate TikTok ,Vue Best practices on Mobile |
352 | enso-org/enso |
Hybrid visual and textual functional programming. |
353 | Const-me/Whisper |
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model |
354 | THUDM/CodeGeeX2 |
CodeGeeX2: A More Powerful Multilingual Code Generation Model |
355 | Plachtaa/VALL-E-X |
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. The demo is available at https://plachtaa.github.io |
356 | openlm-research/open_llama |
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset |
357 | burn-rs/burn |
Burn - A Flexible and Comprehensive Deep Learning Framework in Rust |
358 | 01-ai/Yi |
A series of large language models trained from scratch by developers @01-ai |
⭐ 359 | infiniflow/ragflow |
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding. |
360 | bigcode-project/starcoder |
Home of StarCoder: fine-tuning & inference! |
361 | sweepai/sweep |
Sweep is an AI junior developer |
362 | lucidrains/denoising-diffusion-pytorch |
Implementation of Denoising Diffusion Probabilistic Model in Pytorch |
363 | vanna-ai/vanna |
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄. |
364 | leptonai/search_with_lepton |
Building a quick conversation-based search demo with Lepton AI. |
365 | OthersideAI/self-operating-computer |
A framework to enable multimodal models to operate a computer. |
366 | fishaudio/Bert-VITS2 |
vits2 backbone with multilingual-bert |
367 | SJTU-IPADS/PowerInfer |
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs |
368 | jzhang38/TinyLlama |
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. |
369 | e2b-dev/awesome-ai-agents |
A list of AI autonomous agents |
370 | ymcui/Chinese-LLaMA-Alpaca-2 |
Chinese LLaMA-2 & Alpaca-2 LLMs |
371 | CASIA-IVA-Lab/FastSAM |
Fast Segment Anything |
372 | jasonppy/VoiceCraft |
Zero-Shot Speech Editing and Text-to-Speech in the Wild |
373 | Lightning-AI/litgpt |
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more. |
374 | bhaskatripathi/pdfGPT |
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The only open source solution to turn your pdf files in a chatbot! |
375 | dair-ai/ML-Papers-Explained |
Explanation to key concepts in ML |
376 | NVIDIA/TensorRT-LLM |
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines. |
377 | VikParuchuri/surya |
Accurate line-level text detection and recognition (OCR) in any language |
378 | ai-collection/ai-collection |
The Generative AI Landscape - A Collection of Awesome Generative AI Applications |
379 | Unstructured-IO/unstructured |
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. |
380 | abhishekkrthakur/approachingalmost |
Approaching (Almost) Any Machine Learning Problem |
381 | open-mmlab/mmagic |
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox |
382 | PWhiddy/PokemonRedExperiments |
Playing Pokemon Red with Reinforcement Learning |
383 | spdustin/ChatGPT-AutoExpert |
🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding). |
384 | zilliztech/GPTCache |
GPTCache is a library for creating semantic cache to store responses from LLM queries. |
385 | PKU-YuanGroup/ChatLaw |
Chinese Legal Large Model |
386 | GreyDGL/PentestGPT |
A GPT-empowered penetration testing tool |
387 | apple/corenet |
CoreNet: A library for training deep neural networks |
388 | qunash/chatgpt-advanced |
A browser extension that augments your ChatGPT prompts with web results. |
389 | netease-youdao/EmotiVoice |
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine |
390 | AbdullahAlfaraj/Auto-Photoshop-StableDiffusion-Plugin |
A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using Automatic1111-sd-webui as a backend. |
391 | kuafuai/DevOpsGPT |
Multi agent system for AI-driven software development. Convert natural language requirements into working software. Supports any development language and extends the existing base code. |
392 | huggingface/chat-ui |
Open source codebase powering the HuggingChat app |
393 | jaywalnut310/vits |
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech |
394 | nadermx/backgroundremover |
Background Remover lets you Remove Background from images and video using AI with a simple command line interface that is free and open source. |
395 | mit-han-lab/streaming-llm |
Efficient Streaming Language Models with Attention Sinks |
396 | react-native-webview/react-native-webview |
React Native Cross-Platform WebView |
🔥🔥🔥VinciGit00/Scrapegraph-ai |
Python scraper based on AI | |
linyiLYi/street-fighter-ai |
This is an AI agent for Street Fighter II Champion Edition. | |
e2b-dev/e2b |
Vercel for AI agents. We help developers to build, deploy, and monitor AI agents. Focusing on specialized AI agents that build software for you - your personal software developers. | |
langchain-ai/opengpts |
This is an open source effort to create a similar experience to OpenAI's GPTs and Assistants API | |
wenda-LLM/wenda |
Wenda: An LLM invocation platform. Its objective is to achieve efficient content generation tailored to specific environments while considering the limited computing resources of individuals and small businesses, as well as knowledge security and privacy concerns | |
mylxsw/aidea |
AIdea is a versatile app that supports GPT and domestic large language models,also supports "Stable Diffusion" text-to-image generation, image-to-image generation, SDXL 1.0, super-resolution, and image colorization | |
gaomingqi/Track-Anything |
A flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI. | |
dataelement/bisheng |
Bisheng is an open LLM devops platform for next generation AI applications. | |
405 | intel-analytics/ipex-llm |
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with llama.cpp, HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc. |
406 | Licoy/ChatGPT-Midjourney |
🎨 Own your own ChatGPT+Midjourney web service with one click |
407 | UFund-Me/Qbot |
Qbot is an AI-oriented quantitative investment platform, which aims to realize the potential, empower AI technologies in quantitative investment |
408 | openai/consistency_models |
Official repo for consistency models. |
409 | rustformers/llm |
Run inference for Large Language Models on CPU, with Rust |
410 | run-llama/rags |
Build ChatGPT over your data, all with natural language |
411 | normal-computing/outlines |
Generative Model Programming |
412 | CopilotKit/CopilotKit |
Build in-app AI chatbots 🤖, and AI-powered Textareas ✨, into react web apps. |
413 | Shaunwei/RealChar |
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime(All in One Codebase!). Have a natural seamless conversation with AI everywhere(mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖 |
414 | LiheYoung/Depth-Anything |
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data |
415 | Moonvy/OpenPromptStudio |
AIGC Hint Word Visualization Editor |
416 | reorproject/reor |
AI note-taking app that runs models locally. |
417 | ramonvc/freegpt-webui |
GPT 3.5/4 with a Chat Web UI. No API key is required. |
418 | OpenTalker/video-retalking |
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild |
419 | microsoft/DeepSpeedExamples |
Example models using DeepSpeed |
420 | threestudio-project/threestudio |
A unified framework for 3D content generation. |
421 | phidatahq/phidata |
Build AI Assistants using function calling |
422 | civitai/civitai |
Build a platform where people can share their stable diffusion models |
423 | a16z-infra/companion-app |
AI companions with memory: a lightweight stack to create and host your own AI companions |
424 | baichuan-inc/baichuan-7B |
A large-scale 7B pretraining language model developed by BaiChuan-Inc. |
425 | pengxiao-song/LaWGPT |
Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge |
426 | SevaSk/ecoute |
Ecoute is a live transcription tool that provides real-time transcripts for both the user's microphone input (You) and the user's speakers output (Speaker) in a textbox. It also generates a suggested response using OpenAI's GPT-3.5 for the user to say based on the live transcription of the conversation. |
427 | geekyutao/Inpaint-Anything |
Inpaint anything using Segment Anything and inpainting models. |
428 | GoogleCloudPlatform/generative-ai |
Sample code and notebooks for Generative AI on Google Cloud |
429 | nsarrazin/serge |
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API |
430 | firmai/financial-machine-learning |
A curated list of practical financial machine learning tools and applications. |
431 | google/gemma.cpp |
lightweight, standalone C++ inference engine for Google's Gemma models. |
432 | OpenGVLab/LLaMA-Adapter |
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters |
433 | deepseek-ai/DeepSeek-Coder |
DeepSeek Coder: Let the Code Write Itself |
434 | adam-maj/tiny-gpu |
A minimal GPU design in Verilog to learn how GPUs work from the ground up |
435 | yetone/bob-plugin-openai-translator |
A Bob Plugin base ChatGPT API |
436 | dsdanielpark/Bard-API |
The python package that returns a response of Google Bard through API. |
437 | Project-MONAI/MONAI |
AI Toolkit for Healthcare Imaging |
438 | vespa-engine/vespa |
The open big data serving engine. https://vespa.ai |
439 | Azure-Samples/azure-search-openai-demo |
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences. |
440 | clovaai/donut |
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 |
441 | WooooDyy/LLM-Agent-Paper-List |
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al. |
442 | jxnl/instructor |
structured outputs for llms |
443 | yihong0618/xiaogpt |
Play ChatGPT with xiaomi ai speaker |
444 | firebase/firebase-ios-sdk |
Firebase SDK for Apple App Development |
445 | InternLM/InternLM |
InternLM has open-sourced a 7 billion parameter base model, a chat model tailored for practical scenarios and the training system. |
446 | Tohrusky/Final2x |
2^x Image Super-Resolution |
447 | biobootloader/wolverine |
Automatically repair python scripts through GPT-4 to give them regenerative abilities. |
448 | facebookresearch/DiT |
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" |
449 | EleutherAI/lm-evaluation-harness |
A framework for few-shot evaluation of autoregressive language models. |
450 | MineDojo/Voyager |
An Open-Ended Embodied Agent with Large Language Models |
451 | pytorch-labs/gpt-fast |
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. |
452 | apache/hudi |
Upserts, Deletes And Incremental Processing on Big Data. |
453 | FlagOpen/FlagEmbedding |
Dense Retrieval and Retrieval-augmented LLMs |
454 | IDEA-Research/GroundingDINO |
Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection" |
455 | HumanAIGC/OutfitAnyone |
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person |
456 | microsoft/promptbase |
All things prompt engineering |
457 | nilsherzig/LLocalSearch |
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progress of the agents and the final answer. No OpenAI or Google API keys are needed. |
458 | openchatai/OpenChat |
Run and create custom ChatGPT-like bots with OpenChat, embed and share these bots anywhere, the open-source chatbot console. |
459 | RayVentura/ShortGPT |
🚀🎬 ShortGPT - An experimental AI framework for automated short/video content creation. Enables creators to rapidly produce, manage, and deliver content using AI and automation. |
460 | google/gemma_pytorch |
The official PyTorch implementation of Google's Gemma models |
461 | ml-explore/mlx-examples |
Examples in the MLX framework |
462 | vercel-labs/ai-chatbot |
A full-featured, hackable Next.js AI chatbot built by Vercel Labs |
463 | mosaicml/composer |
Train neural networks up to 7x faster |
464 | OpenGVLab/DragGAN |
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux) |
465 | imoneoi/openchat |
OpenChat: Advancing Open-source Language Models with Imperfect Data |
466 | microsoft/SynapseML |
Simple and Distributed Machine Learning |
467 | mpociot/chatgpt-vscode |
A VSCode extension that allows you to use ChatGPT |
468 | k8sgpt-ai/k8sgpt |
Giving Kubernetes Superpowers to everyone |
469 | openchatai/OpenCopilot |
🤖 🔥 Let your users chat with your product features and execute things by text - open source Shopify sidekick |
470 | 1Panel-dev/MaxKB |
💬 基于 LLM 大语言模型的知识库问答系统。开箱即用,支持快速嵌入到第三方业务系统,1Panel 官方出品。 |
471 | steven2358/awesome-generative-ai |
A curated list of modern Generative Artificial Intelligence projects and services |
472 | meshery/meshery |
Meshery, the cloud native manager |
473 | udlbook/udlbook |
Understanding Deep Learning - Simon J.D. Prince |
474 | aishwaryanr/awesome-generative-ai-guide |
A one stop repository for generative AI research updates, interview resources, notebooks and much more! |
475 | PrefectHQ/marvin |
A batteries-included library for building AI-powered software |
PawanOsman/ChatGPT |
OpenAI API Free Reverse Proxy | |
fr0gger/Awesome-GPT-Agents |
A curated list of GPT agents for cybersecurity | |
478 | lvwzhen/law-cn-ai |
⚖️ AI Legal Assistant |
479 | microsoft/TaskWeaver |
A code-first agent framework for seamlessly planning and executing data analytics tasks. |
480 | qiuyu96/CoDeF |
Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing |
481 | yangjianxin1/Firefly |
Firefly: Chinese conversational large language model (full-scale fine-tuning + QLoRA), supporting fine-tuning of Llma2, Llama, Baichuan, InternLM, Ziya, Bloom, and other large models |
482 | Acly/krita-ai-diffusion |
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required. |
483 | Portkey-AI/gateway |
A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API. |
484 | sczhou/ProPainter |
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting |
485 | TaskingAI/TaskingAI |
The open source platform for AI-native application development. |
486 | levihsu/OOTDiffusion |
Official implementation of OOTDiffusion |
⭐ 487 | Skyvern-AI/skyvern |
Automate browser-based workflows with LLMs and Computer Vision |
488 | aigc-apps/sd-webui-EasyPhoto |
📷 EasyPhoto |
lllyasviel/stable-diffusion-webui-forge |
a platform on top of Stable Diffusion WebUI (based on Gradio) to make development easier, optimize resource management, and speed up inference | |
josStorer/RWKV-Runner |
A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for commercial use. | |
491 | SkalskiP/courses |
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI) |
492 | aiwaves-cn/agents |
An Open-source Framework for Autonomous Language Agents |
493 | srbhr/Resume-Matcher |
Open Source Free ATS Tool to compare Resumes with Job Descriptions and create a score to rank them. |
494 | Plachtaa/VITS-fast-fine-tuning |
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion |
495 | OpenInterpreter/01 |
The open-source language model computer |
496 | lightaime/camel |
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society |
497 | lxfater/inpaint-web |
A free and open-source inpainting & image-upscaling tool powered by webgpu and wasm on the browser。 |
498 | thuml/Time-Series-Library |
A Library for Advanced Deep Time Series Models. |
499 | facebookincubator/AITemplate |
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference. |
500 | homanp/superagent |
🥷 Superagent - Build, deploy, and manage LLM-powered agents |
501 | OpenBMB/ToolBench |
An open platform for training, serving, and evaluating large language model for tool learning. |
502 | madawei2699/myGPTReader |
A slack bot that can read any webpage, ebook or document and summarize it with chatGPT |
503 | SuperDuperDB/superduperdb |
🔮 SuperDuperDB: Bring AI to your database: Integrate, train and manage any AI models and APIs directly with your database and your data. |
504 | togethercomputer/RedPajama-Data |
code for preparing large datasets for training large language models |
505 | enricoros/big-AGI |
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, much more. Deploy on-prem or in the cloud. |
ai-boost/awesome-prompts |
Curated list of chatgpt prompts from the top-rated GPTs in the GPTs Store. Prompt Engineering, prompt attack & prompt protect. Advanced Prompt Engineering papers. | |
build-trust/ockam |
Orchestrate end-to-end encryption, cryptographic identities, mutual authentication, and authorization policies between distributed applications – at massive scale. | |
508 | mnotgod96/AppAgent |
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps. |
509 | Deci-AI/super-gradients |
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS. |
510 | ChaoningZhang/MobileSAM |
This is the official code for Faster Segment Anything (MobileSAM) project that makes SAM lightweight |
511 | xxlong0/Wonder3D |
A cross-domain diffusion model for 3D reconstruction from a single image |
512 | SCIR-HI/Huatuo-Llama-Med-Chinese |
Repo for HuaTuo (华驼), Llama-7B tuned with Chinese medical knowledge |
513 | digitalinnovationone/dio-lab-open-source |
Repositório do lab "Contribuindo em um Projeto Open Source no GitHub" da Digital Innovation One. |
514 | microsoft/Mastering-GitHub-Copilot-for-Paired-Programming |
A 6 Lesson course teaching everything you need to know about harnessing GitHub Copilot and an AI Paired Programing resource. |
515 | google-deepmind/graphcast |
|
516 | openai/plugins-quickstart |
Get a ChatGPT plugin up and running in under 5 minutes! |
517 | NVlabs/neuralangelo |
Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023) |
518 | roboflow/notebooks |
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM. |
519 | terraform-aws-modules/terraform-aws-eks |
Terraform module to create AWS Elastic Kubernetes (EKS) resources 🇺🇦 |
520 | miurla/morphic |
An AI-powered answer engine with a generative UI |
stanford-oval/storm |
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. | |
Facico/Chinese-Vicuna |
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model | |
sjvasquez/handwriting-synthesis |
Handwriting Synthesis with RNNs ✏️ | |
524 | sanchit-gandhi/whisper-jax |
Optimised JAX code for OpenAI's Whisper Model, largely built on the Hugging Face Transformers Whisper implementation |
525 | luosiallen/latent-consistency-model |
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference |
526 | AILab-CVC/VideoCrafter |
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation |
527 | UX-Decoder/Segment-Everything-Everywhere-All-At-Once |
Official implementation of the paper "Segment Everything Everywhere All at Once" |
528 | kyegomez/tree-of-thoughts |
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70% |
529 | marimo-team/marimo |
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git. |
530 | smol-ai/GodMode |
AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day. |
531 | openai/grok |
|
532 | allenai/OLMo |
Modeling, training, eval, and inference code for OLMo |
533 | jina-ai/reader |
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ |
534 | lavague-ai/LaVague |
Automate automation with Large Action Model framework |
535 | open-mmlab/Amphion |
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development. |
friuns2/BlackFriday-GPTs-Prompts |
List of free GPTs that doesn't require plus subscription | |
a16z-infra/ai-getting-started |
A Javascript AI getting started stack for weekend projects, including image/text models, vector stores, auth, and deployment configs | |
keijiro/AICommand |
ChatGPT integration with Unity Editor | |
539 | microsoft/LLMLingua |
To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss. |
540 | ray-project/llm-numbers |
Numbers every LLM developer should know |
541 | Significant-Gravitas/Auto-GPT-Plugins |
Plugins for Auto-GPT |
542 | Akegarasu/lora-scripts |
LoRA training scripts use kohya-ss's trainer, for diffusion model. |
543 | huggingface/alignment-handbook |
Robust recipes for to align language models with human and AI preferences |
544 | OpenBMB/MiniCPM |
MiniCPM-2B: An end-side LLM outperforms Llama2-13B. |
545 | developersdigest/llm-answer-engine |
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Mixtral, Langchain, OpenAI, Brave & Serper |
546 | 1rgs/jsonformer |
A Bulletproof Way to Generate Structured JSON from Language Models |
547 | Zejun-Yang/AniPortrait |
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation |
548 | ravenscroftj/turbopilot |
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU |
549 | mshumer/gpt-llm-trainer |
|
550 | vikhyat/moondream |
tiny vision language model |
551 | llSourcell/DoctorGPT |
DoctorGPT is an LLM that can pass the US Medical Licensing Exam. It works offline, it's cross-platform, & your health data stays private. |
552 | leetcode-mafia/cheetah |
Whisper & GPT-based app for passing remote SWE interviews |
apple/ml-mgie |
||
FlagAI-Open/FlagAI |
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model. | |
555 | lencx/Noi |
🦄 AI + Tools + Plugins + Community |
556 | damo-vilab/AnyDoor |
Official implementations for paper: Anydoor: zero-shot object-level image customization |
557 | google-deepmind/alphageometry |
Solving Olympiad Geometry without Human Demonstrations |
558 | Nukem9/dlssg-to-fsr3 |
Adds AMD FSR3 Frame Generation to games by replacing Nvidia DLSS-G Frame Generation (nvngx_dlssg). |
559 | espeak-ng/espeak-ng |
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents. |
560 | OpenBMB/AgentVerse |
🤖 AgentVerse 🪐 provides a flexible framework that simplifies the process of building custom multi-agent environments for large language models (LLMs). |
561 | shroominic/codeinterpreter-api |
Open source implementation of the ChatGPT Code Interpreter 👾 |
562 | myshell-ai/MeloTTS |
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. |
563 | BuilderIO/ai-shell |
A CLI that converts natural language to shell commands. |
llmware-ai/llmware |
Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models. | |
langfuse/langfuse |
🪢 Open source LLM engineering platform. Observability, metrics, evals, prompt management, testing, prompt playground, datasets, LLM evaluations -- 🍊YC W23 🤖 integrate via Typescript, Python / Decorators, OpenAI, Langchain, LlamaIndex, Litellm, Instructor, Mistral, Perplexity, Claude, Gemini, Vertex | |
hiyouga/ChatGLM-Efficient-Tuning |
Fine-tuning ChatGLM-6B with PEFT | |
alibaba-damo-academy/FunASR |
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models. | |
xlang-ai/OpenAgents |
OpenAgents: An Open Platform for Language Agents in the Wild | |
569 | Speykious/cve-rs |
Blazingly 🔥 fast 🚀 memory vulnerabilities, written in 100% safe Rust. 🦀 |
570 | 0hq/WebGPT |
Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~2000 lines of vanilla Javascript. |
571 | Yue-Yang/ChatGPT-Siri |
Shortcuts for Siri using ChatGPT API gpt-3.5-turbo model |
572 | whoiskatrin/chart-gpt |
AI tool to build charts based on text input |
573 | ricklamers/gpt-code-ui |
An open source implementation of OpenAI's ChatGPT Code interpreter |
574 | Fanghua-Yu/SUPIR |
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild |
575 | hemansnation/God-Level-Data-Science-ML-Full-Stack |
A collection of scientific methods, processes, algorithms, and systems to build stories & models. This roadmap contains 16 Chapters, whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI |
576 | AILab-CVC/YOLO-World |
Real-Time Open-Vocabulary Object Detection |
577 | luban-agi/Awesome-AIGC-Tutorials |
Curated tutorials and resources for Large Language Models, AI Painting, and more. |
578 | Luodian/Otter |
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability. |
579 | FoundationVision/VAR |
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction" |
580 | jackMort/ChatGPT.nvim |
ChatGPT Neovim Plugin: Effortless Natural Language Generation with OpenAI's ChatGPT API |
581 | NVIDIA/NeMo-Guardrails |
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. |
582 | run-llama/llama-hub |
A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain |
583 | Dooy/chatgpt-web-midjourney-proxy |
chatgpt web, midjourney, gpts,tts, whisper 一套ui全搞定 |
584 | collabora/WhisperSpeech |
An Open Source text-to-speech system built by inverting Whisper. |
585 | MarkFzp/mobile-aloha |
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation |
586 | SysCV/sam-hq |
Segment Anything in High Quality |
587 | minimaxir/simpleaichat |
Python package for easily interfacing with chat apps, with robust features and minimal code complexity. |
588 | xtekky/chatgpt-clone |
ChatGPT interface with better UI |
589 | Kent0n-Li/ChatDoctor |
A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge |
590 | ItzCrazyKns/Perplexica |
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI |
591 | pashpashpash/vault-ai |
OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, etc) using a simple React frontend. |
592 | huggingface/distil-whisper |
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate. |
593 | fudan-generative-vision/champ |
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance |
594 | project-baize/baize-chatbot |
Let ChatGPT teach your own chatbot in hours with a single GPU! |
595 | OpenGVLab/InternGPT |
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统) |
596 | AprilNEA/ChatGPT-Admin-Web |
ChatGPT WebUI with user management and admin dashboard system |
597 | ethen8181/machine-learning |
🌎 machine learning tutorials (mainly in Python3) |
598 | pytorch/torchtune |
A Native-PyTorch Library for LLM Fine-tuning |
599 | Codium-ai/AlphaCodium |
code generation tool that surpasses most human competitors in CodeContests |
600 | metavoiceio/metavoice-src |
AI for human-level speech intelligence |
601 | apple/swift-syntax |
A set of Swift libraries for parsing, inspecting, generating, and transforming Swift source code. |
602 | cvg/LightGlue |
LightGlue: Local Feature Matching at Light Speed (ICCV 2023) |
603 | morph-labs/rift |
Rift: an AI-native language server for your personal AI software engineer |
604 | CVI-SZU/Linly |
Chinese-LLaMA basic model; ChatFlow Chinese conversation model; NLP pre-training/command fine-tuning dataset |
605 | anthropics/anthropic-cookbook |
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude. |
606 | langchain-ai/langgraph |
|
607 | dvlab-research/MiniGemini |
Official implementation for Mini-Gemini |
608 | baichuan-inc/Baichuan-13B |
A 13B large language model developed by Baichuan Intelligent Technology |
609 | microsoft/torchscale |
Foundation Architecture for (M)LLMs |
610 | iryna-kondr/scikit-llm |
Seamlessly integrate powerful language models like ChatGPT into scikit-learn for enhanced text analysis tasks. |
611 | daveshap/OpenAI_Agent_Swarm |
HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!" |
612 | williamyang1991/Rerender_A_Video |
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation |
613 | NExT-GPT/NExT-GPT |
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model |
OpenDriveLab/UniAD |
[CVPR 2023 Best Paper] Planning-oriented Autonomous Driving | |
neuralmagic/deepsparse |
Sparsity-aware deep learning inference runtime for CPUs | |
616 | gmpetrov/databerry |
The no-code platform for building custom LLM Agents |
617 | jupyterlab/jupyter-ai |
A generative AI extension for JupyterLab |
618 | deanxv/coze-discord-proxy |
代理Discord-Bot对话Coze-Bot,实现API形式请求GPT4对话模型/微调模型 |
619 | muellerberndt/mini-agi |
A minimal generic autonomous agent based on GPT3.5/4. Can analyze stock prices, perform network security tests, create art, and order pizza. |
620 | SamurAIGPT/privateGPT |
An app to interact privately with your documents using the power of GPT, 100% privately, no data leaks |
621 | ai-boost/Awesome-GPTs |
Curated list of awesome GPTs 👍. |
622 | adamcohenhillel/ADeus |
An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own server. You can then chat with Adeus using the app, and it will have all the right context about what you want to talk about - a truly personalized, personal AI. |
623 | LLM-Red-Team/kimi-free-api |
🚀 KIMI AI 长文本大模型白嫖服务,支持高速流式输出、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。 |
624 | mendableai/firecrawl |
🔥 Turn entire websites into LLM-ready markdown |
625 | MarkFzp/act-plus-plus |
Imitation Learning algorithms with Co-traing for Mobile ALOHA: ACT, Diffusion Policy, VINN |
626 | facebookresearch/ijepa |
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture." |
627 | opengeos/segment-geospatial |
A Python package for segmenting geospatial data with the Segment Anything Model (SAM) |
628 | philz1337x/clarity-upscaler |
Clarity-Upscaler: Reimagined image upscaling for everyone |
629 | OpenBMB/CPM-Bee |
A bilingual large-scale model with trillions of parameters |
630 | open-compass/opencompass |
OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets. |
631 | CrazyBoyM/llama3-Chinese-chat |
Llama3 Chinese Repository with modified versions, and training and deployment resources |
632 | eureka-research/Eureka |
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" |
633 | li-plus/chatglm.cpp |
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs |
634 | damo-vilab/i2vgen-xl |
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models |
635 | salesforce/CodeT5 |
Home of CodeT5: Open Code LLMs for Code Understanding and Generation |
636 | gptlink/gptlink |
Build your own free commercial ChatGPT environment in 10 minutes. The setup is simple and includes features such as user management, orders, tasks, and payments |
637 | agiresearch/AIOS |
AIOS: LLM Agent Operating System |
638 | georgia-tech-db/eva |
AI-Relational Database System |
639 | InternLM/xtuner |
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM, Llama, Baichuan, Qwen, ChatGLM) |
640 | SCUTlihaoyu/open-chat-video-editor |
Open source short video automatic generation tool |
641 | Alpha-VLLM/LLaMA2-Accessory |
An Open-source Toolkit for LLM Development |
huggingface/lerobot |
🤗 LeRobot: State-of-the-art Machine Learning for Real-World Robotics in Pytorch | |
facebookresearch/audio2photoreal |
Code and dataset for photorealistic Codec Avatars driven from audio | |
gptscript-ai/gptscript |
Natural Language Programming | |
645 | albertan017/LLM4Decompile |
Reverse Engineering: Decompiling Binary Code with Large Language Models |
646 | cvlab-columbia/zero123 |
Zero-1-to-3: Zero-shot One Image to 3D Object: https://zero123.cs.columbia.edu/ |
647 | liou666/polyglot |
Desktop AI Language Practice Application |
648 | srush/Tensor-Puzzles |
Solve puzzles. Improve your pytorch. |
649 | Azure/azure-rest-api-specs |
The source for REST API specifications for Microsoft Azure. |
650 | Josh-XT/AGiXT |
AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions. |
651 | ishan0102/vimGPT |
Browse the web with GPT-4V and Vimium |
652 | huggingface/parler-tts |
Inference and training library for high-quality TTS models. |
653 | huggingface/safetensors |
Simple, safe way to store and distribute tensors |
654 | leptonai/leptonai |
A Pythonic framework to simplify AI service building |
655 | krishnaik06/Roadmap-To-Learn-Generative-AI-In-2024 |
Roadmap To Learn Generative AI In 2024 |
656 | Deeptrain-Community/chatnio |
A one-stop chat relay API site supporting multiple models including OpenAI, Midjourney, Google Gemini, etc. It supports custom presets, cloud synchronization, elastic billing, and image parsing |
657 | PKU-YuanGroup/Video-LLaVA |
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection |
658 | hegelai/prompttools |
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate). |
659 | SuperTux/supertux |
SuperTux source code |
660 | mazzzystar/Queryable |
Run CLIP on iPhone to Search Photos. |
661 | Ironclad/rivet |
The open-source visual AI programming environment and TypeScript library |
662 | baaivision/Painter |
Painter & SegGPT Series: Vision Foundation Models from BAAI |
663 | mshumer/gpt-author |
|
sgl-project/sglang |
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with LLMs faster and more controllable. | |
lamini-ai/lamini |
Official repo for Lamini's data generator for generating instructions to train instruction-following LLMs | |
666 | databricks/dbrx |
Code examples and resources for DBRX, a large language model developed by Databricks |
667 | facebookresearch/jepa |
PyTorch code and models for V-JEPA self-supervised learning from video. |
668 | iusztinpaul/hands-on-llms |
🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴 |
669 | facebookresearch/habitat-sim |
A flexible, high-performance 3D simulator for Embodied AI research. |
670 | facebookresearch/Pearl |
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta. |
671 | OpenPipe/OpenPipe |
Turn expensive prompts into cheap fine-tuned models |
672 | FranxYao/chain-of-thought-hub |
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting |
673 | unit-mesh/auto-dev |
🧙AutoDev: The AI-powered coding wizard with multilingual support 🌐, auto code generation 🏗️, and a helpful bug-slaying assistant 🐞! Customizable prompts 🎨 and a magic Auto Dev/Testing/Document/Agent feature 🧪 included! 🚀 |
674 | kevmo314/magic-copy |
Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard. |
675 | NVIDIA/trt-llm-rag-windows |
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM |
676 | argmaxinc/WhisperKit |
Swift native speech recognition on-device for iOS and macOS applications. |
677 | johnma2006/mamba-minimal |
Simple, minimal implementation of the Mamba SSM in one file of PyTorch. |
dvmazur/mixtral-offloading |
Run Mixtral-8x7B models in Colab or consumer desktops | |
jqnatividad/qsv |
CSVs sliced, diced & analyzed. | |
680 | hustvl/Vim |
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model |
681 | mshumer/gpt-investor |
|
682 | paulpierre/RasaGPT |
💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram |
683 | emcf/engshell |
An English-language shell for any OS, powered by LLMs |
684 | JiauZhang/DragGAN |
Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold |
685 | dnhkng/GlaDOS |
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve. |
686 | nus-apr/auto-code-rover |
A project structure aware autonomous software engineer aiming for autonomous program improvement |
687 | SoraWebui/SoraWebui |
SoraWebui is an open-source Sora web client, enabling users to easily create videos from text with OpenAI's Sora model. |
yisol/IDM-VTON |
IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild | |
aixcoder-plugin/aiXcoder-7B |
official repository of aiXcoder-7B Code Large Language Model | |
690 | spring-projects/spring-ai |
An Application Framework for AI Engineering |
691 | microsoft/promptbench |
A unified evaluation framework for large language models |
692 | openai/consistencydecoder |
Consistency Distilled Diff VAE |
693 | cgpotts/cs224u |
Code for Stanford CS224u |
694 | google-deepmind/gemma |
Open weights LLM from Google DeepMind. |
695 | girafe-ai/ml-course |
Open Machine Learning course |
696 | hncboy/chatgpt-web-java |
ChatGPT project developed in Java, based on Spring Boot 3 and JDK 17, supports both AccessToken and ApiKey modes |
697 | cohere-ai/cohere-toolkit |
Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. |
698 | TracecatHQ/tracecat |
😼 The AI-native, open source alternative to Tines / Splunk SOAR. |
699 | elastic/otel-profiling-agent |
The production-scale datacenter profiler |
700 | meta-llama/PurpleLlama |
Set of tools to assess and improve LLM security. |
701 | AI-Citizen/SolidGPT |
Chat everything with your code repository, ask repository level code questions, and discuss your requirements. AI Scan and learning your code repository, provide you code repository level answer🧱 🧱 |
Blealtan/efficient-kan |
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN). | |
sugarforever/chat-ollama |
ChatOllama is an open source chatbot based on LLMs. It supports a wide range of language models, and knowledge base management. | |
liltom-eth/llama2-webui |
Run Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Supporting Llama-2-7B/13B/70B with 8-bit, 4-bit. Supporting GPU inference (6 GB VRAM) and CPU inference. | |
705 | abi/secret-llama |
Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3. |
706 | IDEA-Research/T-Rex |
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy |
707 | PRIS-CV/DemoFusion |
Let us democratise high-resolution generation! (arXiv 2023) |
708 | X-PLUG/MobileAgent |
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception |
709 | semanser/codel |
✨ Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor. |
710 | billmei/every-chatgpt-gui |
Every front-end GUI client for ChatGPT |
711 | flowtyone/flowty-realtime-lcm-canvas |
A realtime sketch to image demo using LCM and the gradio library. |
712 | microsoft/aici |
AICI: Prompts as (Wasm) Programs |
713 | luijait/DarkGPT |
DarkGPT is an OSINT assistant based on GPT-4-200K (recommended use) designed to perform queries on leaked databases, thus providing an artificial intelligence assistant that can be useful in your traditional OSINT processes. |
714 | amazon-science/chronos-forecasting |
Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting |
715 | Niek/chatgpt-web |
ChatGPT web interface using the OpenAI API |
716 | PKU-YuanGroup/MoE-LLaVA |
Mixture-of-Experts for Large Vision-Language Models |
717 | Doubiiu/DynamiCrafter |
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors |
718 | OpenGVLab/InternVL |
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源模型 |
719 | Eladlev/AutoPrompt |
A framework for prompt tuning using Intent-based Prompt Calibration |
720 | suyu-emu/suyu |
suyu is the continuation of the world's most popular, open-source, Nintendo Switch emulator, yuzu. It is written in C++ with portability in mind, and we actively maintain builds for Windows, Linux and Android. |
721 | ashishpatel26/LLM-Finetuning |
LLM Finetuning with peft |
722 | janhq/nitro |
A fast, lightweight, embeddable inference engine to supercharge your apps with local AI. OpenAI-compatible API |
723 | TMElyralab/MuseV |
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising |
724 | ytongbai/LVM |
Sequential Modeling Enables Scalable Learning for Large Vision Models |
725 | deepseek-ai/DeepSeek-VL |
DeepSeek-VL: Towards Real-World Vision-Language Understanding |
726 | flowdriveai/flowpilot |
flow-pilot is an openpilot based driver assistance system that runs on linux, windows and android powered machines. |
727 | idaholab/moose |
Multiphysics Object Oriented Simulation Environment |
728 | NVIDIA/GenerativeAIExamples |
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture. |
729 | EricLBuehler/mistral.rs |
Blazingly fast LLM inference. |
730 | baaivision/Emu |
Emu Series: Generative Multimodal Models from BAAI |
731 | Nutlope/notesGPT |
Record voice notes & transcribe, summarize, and get tasks |
732 | Doriandarko/maestro |
A framework for Claude Opus to intelligently orchestrate subagents. |
733 | YangLing0818/RPG-DiffusionMaster |
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG) |
734 | linyiLYi/snake-ai |
An AI agent that beats the classic game "Snake". |
735 | mishushakov/llm-scraper |
Turn any webpage into structured data using LLMs |
736 | Kludex/fastapi-tips |
FastAPI Tips by The FastAPI Expert! |
737 | GoogleCloudPlatform/localllm |
Run LLMs locally on Cloud Workstations |
738 | MrForExample/ComfyUI-3D-Pack |
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.) |
⭐ 739 | 🔥lllyasviel/IC-Light |
More relighting! |
collabora/WhisperFusion |
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI. | |
OpenBMB/MiniCPM-V |
MiniCPM-V 2.0: An Efficient End-side MLLM with Strong OCR and Understanding Capabilities | |
huggingface/cookbook |
Open-source AI cookbook | |
The-OpenROAD-Project/OpenROAD |
OpenROAD's unified application implementing an RTL-to-GDS Flow. Documentation at https://openroad.readthedocs.io/en/latest/ | |
kyegomez/BitNet |
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch | |
truefoundry/cognita |
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry | |
Azure/PyRIT |
The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and machine learning engineers to proactively find risks in their generative AI systems. | |
google/maxtext |
A simple, performant and scalable Jax LLM! | |
microsoft/sample-app-aoai-chatGPT |
[PREVIEW] Sample code for a simple web chat experience targeting chatGPT through AOAI. | |
andrewnguonly/Lumos |
A RAG LLM co-pilot for browsing the web, powered by local LLMs | |
lucidrains/self-rewarding-lm-pytorch |
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI | |
lichao-sun/Mora |
Mora: More like Sora for Generalist Video Generation | |
InstantStyle/InstantStyle |
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥 | |
AnswerDotAI/fsdp_qlora |
Training LLMs with QLoRA + FSDP | |
SciPhi-AI/R2R |
A framework for rapid development and deployment of production-ready RAG systems | |
princeton-nlp/SWE-bench |
[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues? | |
pytorch/executorch |
On-device AI across mobile, embedded and edge for PyTorch | |
GaParmar/img2img-turbo |
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more | |
pytorch/torchtitan |
A native PyTorch Library for large model training | |
PKU-YuanGroup/MagicTime |
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators | |
McGill-NLP/webllama |
Llama-3 agents that can browse the web by following instructions and talking to you | |
elfvingralf/macOSpilot-ai-assistant |
Voice + Vision powered AI assistant that answers questions about any application, in context and in audio. | |
mhamilton723/FeatUp |
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024 | |
wpilibsuite/allwpilib |
Official Repository of WPILibJ and WPILibC | |
Lightning-AI/lightning-thunder |
Make PyTorch models Lightning fast! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once. | |
SakanaAI/evolutionary-model-merge |
Official repository of Evolutionary Optimization of Model Merging Recipes | |
TencentARC/BrushNet |
The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion" | |
DataTalksClub/llm-zoomcamp |
LLM Zoomcamp - a free online course about building an AI bot that can answer questions about your knowledge base | |
Profluent-AI/OpenCRISPR |
AI-generated gene editing systems | |
openai/simple-evals |
||
myshell-ai/JetMoE |
Reaching LLaMA2 Performance with 0.1M Dollars | |
FoundationVision/GLEE |
【CVPR2024】GLEE: General Object Foundation Model for Images and Videos at Scale | |
xlang-ai/OSWorld |
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments | |
langchain-ai/langchain-extract |
🦜⛏️ Did you say you like data? | |
a-real-ai/pywinassistant |
The first open source Large Action Model generalist Artificial Narrow Intelligence that controls completely human user interfaces by only using natural language. PyWinAssistant utilizes Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models. |
Tip:
symbol | rule |
---|---|
🔥 | 256 < stars today <= 512 |
🔥🔥 | 512 < stars today <= 1k |
🔥🔥🔥 | stars today > 1k |
ranking up / down | |
⭐ | on trending page today |
No. |
Tool | Description |
---|---|---|
1 | ChatGPT | A sibling model to InstructGPT, which is trained to follow instructions in a prompt and provide a detailed response |
2 | DALL·E 2 | Create original, realistic images and art from a text description |
3 | Murf AI | AI enabled, real people's voices |
4 | Midjourney | An independent research lab that produces an artificial intelligence program under the same name that creates images from textual descriptions, used in Discord |
5 | Make-A-Video | Make-A-Video is a state-of-the-art AI system that generates videos from text |
6 | Creative Reality™ Studio by D-ID | Use generative AI to create future-facing videos |
7 | chat.D-ID | The First App Enabling Face-to-Face Conversations with ChatGPT |
8 | Notion AI | Access the limitless power of AI, right inside Notion. Work faster. Write better. Think bigger. |
9 | Runway | Text to Video with Gen-2 |
10 | Resemble AI | Resemble’s AI voice generator lets you create human–like voice overs in seconds |
11 | Cursor | Write, edit, and chat about your code with a powerful AI |
12 | Hugging Face | Build, train and deploy state of the art models powered by the reference open source in machine learning |
13 | Claude |
A next-generation AI assistant for your tasks, no matter the scale |
14 | Poe | Poe lets you ask questions, get instant answers, and have back-and-forth conversations with AI. Gives access to GPT-4, gpt-3.5-turbo, Claude from Anthropic, and a variety of other bots |
No. |
WebSite |
Description |
---|---|---|
1 | OpenAI | An artificial intelligence research lab |
2 | Bard | Base Google's LaMDA chatbots and pull from internet |
3 | ERNIE Bot | Baidu’s new generation knowledge-enhanced large language model is a new member of the Wenxin large model family |
4 | DALL·E 2 | An AI system that can create realistic images and art from a description in natural language |
5 | Whisper | A general-purpose speech recognition model |
6 | CivitAI | A platform that makes it easy for people to share and discover resources for creating AI art |
7 | D-ID | D-ID’s Generative AI enables users to transform any picture or video into extraordinary experiences |
8 | Nvidia eDiff-I | Text-to-Image Diffusion Models with Ensemble of Expert Denoisers |
9 | Stability AI | The world's leading open source generative AI company which opened source Stable Diffusion |
10 | Meta AI | Whether it be research, product or infrastructure development, we’re driven to innovate responsibly with AI to benefit the world |
11 | ANTHROPIC | AI research and products that put safety at the frontier |
No. |
Report&Paper |
Description |
---|---|---|
1 | GPT-4 Technical Report | GPT-4 Technical Report |
2 | mli/paper-reading | Deep learning classics and new papers are read carefully paragraph by paragraph. |
3 | labmlai/annotated_deep_learning_paper_implementations | A collection of simple PyTorch implementations of neural networks and related algorithms, which are documented with explanations |
4 | Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models | Talking, Drawing and Editing with Visual Foundation Models |
5 | OpenAI Research | The latest research report and papers from OpenAI |
6 | Make-A-Video: Text-to-Video Generation without Text-Video Data | Meta's Text-to-Video Generation |
7 | eDiff-I: Text-to-Image Diffusion Models with Ensemble of Expert Denoisers | Nvidia eDiff-I - New generation of generative AI content creation tool |
8 | Training an Assistant-style Chatbot with Large Scale Data Distillation from GPT-3.5-Turbo | 2023 GPT4All Technical Report |
9 | Segment Anything | Meta Segment Anything |
10 | LLaMA: Open and Efficient Foundation Language Models | LLaMA: a collection of foundation language models ranging from 7B to 65B parameters |
11 | papers-we-love/papers-we-love | Papers from the computer science community to read and discuss |
12 | CVPR 2023 papers | The most exciting and influential CVPR 2023 papers |
No. |
Tutorial | Description |
---|---|---|
1 | Coursera - Machine Learning | The Machine Learning Specialization Course taught by Dr. Andrew Ng |
2 | microsoft/ML-For-Beginners | 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all |
3 | ChatGPT Prompt Engineering for Developers | This short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI) will teach how to use a large language model (LLM) to quickly build new and powerful applications |
4 | Dive into Deep Learning | Targeting Chinese readers, functional and open for discussion. The Chinese and English versions are used for teaching in over 400 universities across more than 60 countries |
5 | AI Expert Roadmap | Roadmap to becoming an Artificial Intelligence Expert in 2022 |
6 | Computer Science courses | List of Computer Science courses with video lectures |
7 | Machine Learning with Python | Machine Learning with Python Certification on freeCodeCamp |
8 | Building Systems with the ChatGPT API | This short course taught by Isa Fulford (OpenAI) and Andrew Ng (DeepLearning.AI), you will learn how to automate complex workflows using chain calls to a large language model |
9 | LangChain for LLM Application Development | This short course taught by Harrison Chase (Co-Founder and CEO at LangChain) and Andrew Ng. you will gain essential skills in expanding the use cases and capabilities of language models in application development using the LangChain framework |
10 | How Diffusion Models Work | This short course taught by Sharon Zhou (CEO, Co-founder, Lamini). you will gain a deep familiarity with the diffusion process and the models which carry it out. More than simply pulling in a pre-built model or using an API, this course will teach you to build a diffusion model from scratch |
11 | Free Programming Books For AI | 📚 Freely available programming books for AI |
12 | microsoft/AI-For-Beginners | 12 Weeks, 24 Lessons, AI for All! |
13 | hemansnation/God-Level-Data-Science-ML-Full-Stack | A collection of scientific methods, processes, algorithms, and systems to build stories & models. This roadmap contains 16 Chapters, whether you are a fresher in the field or an experienced professional who wants to transition into Data Science & AI |
14 | datawhalechina/prompt-engineering-for-developers | Chinese version of Andrew Ng's Big Model Series Courses, including "Prompt Engineering", "Building System", and "LangChain" |
15 | ossu/computer-science | 🎓 Path to a free self-taught education in Computer Science! |
16 | microsoft/Data-Science-For-Beginners | 10 Weeks, 20 Lessons, Data Science for All! |
17 | jwasham/coding-interview-university |
A complete computer science study plan to become a software engineer. |
If this project has been helpful to you in any way, please give it a ⭐️ by clicking on the star.