Code Monkey home page Code Monkey logo

awesome-model-merging's Introduction

Awesome-Model-Merging

Awesome Awesome Model Merging

A curated list of Model Merging methods: Combining different pre-trained models.

Contributions are welcome!

Acknowledgments: This wonderful template is from https://github.com/VainF/Awesome-Anything by Gongfan Fang.

Title & Authors Intro Useful Links
Model Fusion via Optimal Transport
Sidak Pal Singh, Martin Jaggi
> ETH Zurich, EPFL
> NeurIPS'20

intro [Github]
[PDF]
Git Re-Basin: Merging Models modulo Permutation Symmetries
Samuel K. Ainsworth, Jonathan Hayase, Siddhartha Srinivasa
> University of Washington
> ICLR'23

intro [Github]
[PDF]
ZipIt! Merging Models from Different Tasks without Training
George Stoica, Daniel Bolya, Jakob Bjorner, Taylor Hearn, Judy Hoffman
> Georgia Tech
> arXiv'23

intro [Github]
[PDF]
Model Soups: Averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
Mitchell Wortsman, Gabriel Ilharco, Samir Yitzhak Gadre, Rebecca Roelofs, Raphael Gontijo-Lopes, Ari S. Morcos, Hongseok Namkoong, Ali Farhadi, Yair Carmon, Simon Kornblith, Ludwig Schmidt
> University of Washington, Columbia University, Google Research, Meta AI Research, Tel Aviv University
> ICML'22

intro [Github]
[PDF]
Deep Model Reassembly
Xingyi Yang, Daquan Zhou, Songhua Liu, Jingwen Ye, Xinchao Wang
> National University of Singapore, Bytedance
> NeurIPS'22

intro [Github]
[PDF]
Factorizing Knowledge in Neural Networks
Xingyi Yang, Jingwen Ye, Xinchao Wang
> National University of Singapore
> ECCV'22

intro [Github]
[PDF]
REPAIR: REnormalizing Permuted Activations for Interpolation Repair
Keller Jordan, Hanie Sedghi, Olga Saukh, Rahim Entezari, Behnam Neyshabur
> Hive AI, Google Research, TU Graz / CSH Vienna
> ICLR'23

intro [Github]
[PDF]
An Empirical Study of Multimodal Model Merging
Yi-Lin Sung, Linjie Li, Kevin Lin, Zhe Gan, Mohit Bansal, Lijuan Wang
> UNC Chapel Hill, Microsoft
> arXiv'23

intro [Github]
[PDF]
Deep Neural Network Fusion via Graph Matching with Applications to Model Ensemble and Federated Learning
Chang Liu, Chenfei Lou, Runzhong Wang, Alan Yuhan Xi, Li Shen, Junchi Yan
> Shanghai Jiao Tong University, University of Wisconsin Madison, JD Explore Academy, Shanghai AI Laboratory
> ICML'22

intro [Github]
[PDF]
Merging Models with Fisher-Weighted Averaging
Michael Matena, Colin Raffel
> UNC Chapel Hill
> arXiv'21

intro [Github]
[PDF]
Re-basin via implicit Sinkhorn differentiation
Fidel A. Guerrero Peña, Heitor Rapela Medeiros, Thomas Dubail, Masih Aminbeidokhti, Eric Granger, Marco Pedersoli
> ÉTS
> CVPR'23

intro [Github]
[PDF]
Dataless Knowledge Fusion by Merging Weights of Language Models
Xisen Jin, Xiang Ren, Daniel Preotiuc-Pietro, Pengxiang Cheng
> University of Southern California, Bloomberg
> ICLR'23

intro [Github]
[PDF]
Amalgamating Knowledge From Heterogeneous Graph Neural Networks
Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao
> The University of Sydney, National University of Singapore, Stevens Institute of Technology, Zhejiang University
> CVPR'21

intro [Github]
[PDF]
Amalgamating Knowledge towards Comprehensive Classification
Chengchao Shen, Xinchao Wang, Jie Song, Li Sun, Mingli Song
> Zhejiang University, Stevens Institute of Technology
> AAAI'19

intro [Github]
[PDF]
Knowledge Amalgamation from Heterogeneous Networks by Common Feature Learning
Sihui Luo, Xinchao Wang, Gongfan Fang, Yao Hu, Dapeng Tao, Mingli Song
> Zhejiang University, Stevens Institute of Technology, Alibaba Group, Yunnan University
> IJCAI'19

intro [Github]
[PDF]
Student Becoming the Master: Knowledge Amalgamation for Joint Scene Parsing, Depth Estimation, and More
Jingwen Ye, Yixin Ji, Xinchao Wang, Kairi Ou, Dapeng Tao, Mingli Song
> Zhejiang University, Stevens Institute of Technology, Alibaba Group, Yunnan University
> CVPR'19

intro [Github]
[PDF]
Customizing Student Networks From Heterogeneous Teachers via Adaptive Knowledge Amalgamation
Chengchao Shen, Mengqi Xue, Xinchao Wang, Jie Song, Li Sun, Mingli Song
> Zhejiang University, Stevens Institute of Technology
> ICCV'19

intro [Github]
[PDF]
Data-Free Knowledge Amalgamation via Group-Stack Dual-GAN
Jingwen Ye, Yixin Ji, Xinchao Wang, Kairi Ou, Dapeng Tao, Mingli Song
> Zhejiang University, Stevens Institute of Technology, Alibaba Group
> CVPR'20

intro [Github]
[PDF]
CNN LEGO: Disassembling and Assembling Convolutional Neural Network
Jiacong Hu, Jing Gao, Zunlei Feng, Lechao Cheng, Jie Lei, Hujun Bao, Mingli Song
> Zhejiang University
> arXiv'22

intro [Github]
[PDF]
Collaboration by Competition: Self-coordinated Knowledge Amalgamation for Multi-talent Student Learning
Sihui Luo, Wenwen Pan, Xinchao Wang, Dazhou Wang, Haihong Tang, Mingli Song
> Zhejiang University, Stevens Institute of Technology, Alibaba Group
> ECCV'20

intro [Github]
[PDF]
Amalgamating Filtered Knowledge: Learning Task-customized Student from Multi-task Teachers
Jingwen Ye, Xinchao Wang, Yixin Ji, Kairi Ou, Mingli Song
> Zhejiang University, Stevens Institute of Technology, Alibaba Group
> IJCAI'19

intro [Github]
[PDF]

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.