tothemoon96 / doremi Goto Github PK
View Code? Open in Web Editor NEWThis project forked from sangmichaelxie/doremi
Pytorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
Home Page: https://arxiv.org/abs/2305.10429
License: MIT License