A multimodal language model from scratch with capabilities to add new modalities, beyond image, text or video.
anoushkrit / nanomlm Goto Github PK
View Code? Open in Web Editor NEWA multimodal language model from scratch with capabilities to add new modalities, beyond image, text or video.