valkryhx's Projects

chatglm-lora-rlhf-pytorch

A full pipeline to fine-tune the ChatGLM LLM with LoRA and RLHF on consumer hardware. Implements RLHF (Reinforcement Learning from Human Feedback) on top of the ChatGLM architecture: essentially ChatGPT, but built on ChatGLM.

chatglm-rlhf-lora-rm-ppo

Adds an RLHF implementation to ChatGLM-6B, with a line-by-line walkthrough of some of the core code. The worked example generates short news headlines.

chatglm2-6b

ChatGLM2-6B: An Open Bilingual Chat LLM

chatglm3

ChatGLM3 series: Open Bilingual Chat LLMs

fastllm

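A pure C++, cross-platform LLM acceleration library that can be called from Python. ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile devices.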

firefly

Firefly (流萤): a Chinese conversational large language model (full-parameter fine-tuning + QLoRA)

hierarchical-clustering-java

Implementation of an agglomerative hierarchical clustering algorithm in Java. Different linkage approaches are supported.

lightzero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

llm-tuning

Tuning LLMs with no tears💦, sharing LLM-tools with love❤️.

localgpt

Chat with your documents on your local device using GPT models. No data leaves your device, so everything stays 100% private.

lora_bnb_ft_int8_chatyuan_large_v2

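Fine-tuning_ChatYuan-largeV2_测试alpaca格式数据集_LoRA+bitsandbytes_int8微调_去掉全量finetune_保留int8模型加载测试_0415.ipynb (notebook: fine-tuning ChatYuan-largeV2 on a test Alpaca-format dataset with LoRA + bitsandbytes int8; full fine-tuning removed, int8 model-loading test kept)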

medicalgpt

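MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains a medical large language model, covering continued pretraining, supervised fine-tuning, reward modeling, and reinforcement learning.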

stochastic-muzero

PyTorch implementation of Stochastic MuZero for Gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.

winutils

winutils.exe, hadoop.dll, and hdfs.dll binaries for Hadoop on Windows.
