cdj0311 Goto Github PK
Type: User
Company: Antgroup
Bio: LLM
Location: Beijing
Type: User
Company: Antgroup
Bio: LLM
Location: Beijing
Kernl lets you run Pytorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Minimal keyword extraction with BERT
some demos of Knowledge Distillation in NLP
Knowledge distillation in text classification with pytorch. 知识蒸馏,中文文本分类,教师模型BERT、XLNET,学生模型biLSTM。
史上最大规模1.4亿中文知识图谱开源下载
百度NLP:分词,词性标注,命名实体识别
LambdaRank Neural Network model using Keras.
Landmark Attention: Random-Access Infinite Context Length for Transformers
我的 LeetCode 做题记录,使用 Python 语言作答。
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
LightSeq: A High Performance Library for Sequence Processing and Generation
Lion: Adversarial Distillation of Closed-Source Large Language Model
Inference code for LLaMA models
Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA)
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
Machine learning resources,including algorithm, paper, dataset, example and so on.
distributed trainer for LLMs
Ongoing research training transformer models at scale
Tools for merging pretrained large language models.
Codebase for Merging Language Models
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
An open-source tool-augmented conversational language model from Fudan University
《机器翻译:统计建模与深度学习方法》肖桐 朱靖波 著 - Machine Translation: Statistical Modeling and Deep Learning Methods
transform multi-label classification as sentence pair task, with more training data and information
A Multi-View DSSM for Recommendation System with tensorflow estimator.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.