haileyschoelkopf / megatron-deepspeed-mt0 Goto Github PK
View Code? Open in Web Editor NEWThis project forked from lintangsutawika/megatron-deepspeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
License: Other