manas-embold / gpt-neox Goto Github PK
View Code? Open in Web Editor NEWThis project forked from eleutherai/gpt-neox
An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.
License: Apache License 2.0