- Reading List for Topics in Multimodal Machine Learning --- from CMU LTI&MLD
- Multimodal Machine Learning Reading List (Updated by Feiyang Chen)
- Core Areas
- Applications
- Language and Visual QA
- Language Grounding in Vision
- Language Grouding in Navigation
- Multimodal Machine Translation
- Multi-agent Communication
- Commonsense Reasoning
- Multimodal Reinforcement Learning
- Multimodal Dialog
- Language and Audio
- Audio and Visual
- Media Description
- Video Generation from Text
- Affect Recognition and Multimodal Language
- Healthcare
- Robotics
- Workshops
- Tutorials
- Courses
- CMU --- MultiComp Lab
- MIT --- SYNTHETIC INTELLIGENCE LABORATORY
- NTU --- SenticNet Team
- SenticNet GitHub
- CMU MultimodalSDK --- Affect Recognition and Multimodal Language
- AMHUSE --- Affect Recognition and Multimodal Language
- Multi30k Dataset --- Multimodal Machine Translation
- VATEX --- Multimodal Machine Translation
- MELD --- Multimodal Dialog
- CLEVR-Dialog --- Multimodal Dialog
- Charades-Ego --- Media Description
- MPII --- Media Description
- RecipeQA --- Language and Visual QA
- GQA --- Language and Visual QA
- CLEVR --- Language and Visual QA