Code Monkey home page Code Monkey logo

awesome-multimodal-ml's People

Contributors

astrosaeed avatar biophysninja avatar bryanbocao avatar catalina17 avatar chahuja avatar cocoxili avatar echo0409 avatar eurus-holmes avatar evinpinar avatar gaurav22verma avatar gchhablani avatar hanmenghan avatar henryjunw avatar hi-zhenyu avatar imantdaunhawer avatar jia-honghenrylee avatar jivatneet avatar kaichen-z avatar kmario23 avatar mariyahendriksen avatar markhershey avatar pengboxiangshang avatar peter-yh-wu avatar pliang279 avatar richarizardd avatar sverma88 avatar ttengwang avatar xingbow avatar zhimin-z avatar zubair-irshad avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

awesome-multimodal-ml's Issues

Proposal to tweak the title for consistency

Thanks for your contribution!

I wonder if it is possible to keep the repo name and the title consistent so the title is written as "Awesome Multimodal ML" rather than "Reading List for Topics in Multimodal Machine Learning", I this might be greater!

About the missing research areas

Sorry I didn't see the topic like ''Social Impact โ€“ Fairness and Misinformation''. But I saw this topic in your course. Thank you.

wrong url

The url for "Unified Visual-Semantic Embeddings: Bridging Vision and Language With Structured Meaning Representations" is wrong

Consistency and complementary information in multiview or multimodal

Hi everyone, I open this issue for the discussion of consistency and complementary information in multiview or multimodal. After reading some papers, I find that many authors would like to talk about consistency between modalities (e.g., the similarity between modalities) or the complementary information across the modalities. Yes, the consistency can enhance some signals that are not so remarkable in one modality and the complementary information can supplement the information that one view or modal does not exist. But I do not clearly understand why we need them? What's more, I do not find any mathematical explanation about it. Can anybody provide some comprehensive understanding about them?

guidance to handle missing modality at test time

Hi Paul,

I have read about Co-learning where we can train model on 3 modalities however at test time we can use only one modality. I am struggling to understand how this will be implemented in a code. Once we train a model with 3 modalities, it will expect 3 modalities at test time. Do we need to handle this scenario by passing zero or some random values for modalities to be dropped. Please help, also any sample implementation of the same.
Thanks a lot for all your awesome repo.

Papers in later 2022

after chatgpt, especially gpt-4 was released, there are more multi-modal pre-train model released like mini-gpt4, instructBLIP, is there any plan to add these paper into the list?

Add a paper about efficient multimodal models

Hi, @pliang279, very thanks for your great list from which I learned a lot!

Recently, we have a new work about compressing multimodal models, i.e., making them more lightweight and friendly for custom-level devices to use. However, it seems that there isn't a proper subsection to cover this work. And it would be nice if there was a subsection about efficient multimodal models or something similar! Looking forward to your opinions on this or which of the existing subsections could include the work.

Paper: https://proceedings.mlr.press/v202/shi23e.html
Code: https://github.com/sdc17/UPop
Project: https://dachuanshi.com/UPop-Project/

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.