constantwangheng Goto Github PK
Type: User
Type: User
多维度中文对齐评测基准 | Benchmarking Chinese Alignment of LLMs
Instruct-tune LLaMA on consumer hardware
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
A large-scale 7B pretraining language model developed by Baichuan
A series of large language models developed by Baichuan Intelligent Technology
Netease Youdao's open-source embedding and reranker models for RAG products.
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
Hierarchical-Attention-Network
An implementation of pointer generator network by using bert as encoder and transformer decoder.
中文bigbird预训练模型
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
中文 LLaMA-2 & Alpaca-2 大模型二期项目 (Chinese LLaMA-2 & Alpaca-2 LLMs)
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
CMATH: Can your language model pass Chinese elementary school math test?
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Example models using DeepSpeed
Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"
Face recognition using Tensorflow
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/
Code for the paper "Efficient Training of Language Models to Fill in the Middle"
Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Text classification with Foundation Language Model LLaMA
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
Scripts for fine-tuning Llama2 via SFT and DPO.
Inference Llama 2 in one file of pure C
The official Meta Llama 3 GitHub site
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.