Comments (9)
Hi, we currently have no one free to help debug this. If you are interested, you can try it yourself; we will support QLoRA in the future. Thanks. I guess it may be caused by a compatibility issue between ZeRO-3 and PEFT.
from openrlhf.
This week we'll try to figure it out
Please test with the latest code.
This issue is related to the model being incorrectly set to eval mode and not being reverted back to train mode during the backward operation. It should be fixed in the latest code.
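For anyone hitting the same symptom, here is a minimal sketch of the general pattern: switch to eval mode only for a bounded block and always restore the previous mode afterwards. This is not the actual OpenRLHF fix. The helper name `eval_mode` and the `TinyModule` stand-in are hypothetical; `TinyModule` only mimics the `training` / `train()` / `eval()` interface of `torch.nn.Module` so the example runs without PyTorch installed.

```python
from contextlib import contextmanager


@contextmanager
def eval_mode(model):
    """Temporarily put `model` in eval mode, then restore its previous mode.

    Hypothetical helper: works with any object exposing `training`,
    `train(mode)` and `eval()`, as torch.nn.Module does.
    """
    was_training = model.training
    model.eval()
    try:
        yield model
    finally:
        # Restore the original mode even if the block raised.
        model.train(was_training)


class TinyModule:
    """Stand-in (assumption) that mimics torch.nn.Module's mode switching."""

    def __init__(self):
        self.training = True

    def train(self, mode=True):
        self.training = mode
        return self

    def eval(self):
        return self.train(False)


model = TinyModule()
with eval_mode(model):
    mode_inside = model.training  # eval mode while generating / scoring
mode_after = model.training       # back in train mode before backward
```

With a real `torch.nn.Module`, the same helper would wrap the generation or evaluation step so that the subsequent backward pass runs in train mode again.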
feel free to reopen it~
When I try this code with stage 2, it works! But I don't know why stage 3 does not work.
Thank you for the reply.
Thank you, and sorry for the late response. I will check this tomorrow! Thank you!
I ran this code and it works! Thank you!
nice!