Code Monkey home page Code Monkey logo

maoal's Introduction

Module-Aware Optimization for Auxiliary Learning

by Hong Chen, Xin Wang, Yue Liu, Yuwei Zhou, Chaoyu Guan and Wenwu Zhu.

Introduction

This work can be used when you want to optimize a model for the target task with the help of several auxiliary losses. It can automatically find how much each auxiliary loss should contribute to each part of the whole model, preventing the potential module-wise conflicts.You can download the paper here. Some of the auxilearn codes are modified from the AuxLearn Repo, we thank them for offering the code. _

Dataset

The provided code is for reproducing the MovieLens experiment in the paper. Create a folder named data in the repo using:

mkdir data

Then download the preprocessed dataset MovieLens-1M from the link and unzip it in the data folder.

Run the experiment

You can run the script to reproduce the MovieLens experiments. The argument "exp_name" is the name for the experiment.

sh train.sh

Additionally, if you want to run some of the baselines or tune hyperparameters, change the params.json in the config folder.

Explanation for key configs

+ ['main']['lr']: learning rate for the lower optimization
+ ['hyper']['lr]: learning rate for the upper optimization
+ use_aux: 1 for methods using bi-level optimization, 0 for methods that do not use bi-level optimization
+ interval: iteration between two upper optimization
+ mode: 'modular(MAOAL)', 'GCS', 'common'(SLL/Equal, change ['main']['aux_weight] to 0.0 for SLL, 1.0 for Equal), 'aux'(AuxL)

Lower and Upper Optimization

+ lower: The importance parameterized gradient is implemented in the hypermodel class in the train_regularizer.py.
+ upper: In line 274-line 315, the upper optimization is conducted. gauxlearn package includes the algorithm for upper gradient calculation.

You may also find the papers in the citation useful.

Citation

@inproceedings{chen2022auxiliary,
title = {Auxiliary Learning with Joint Task and Data Scheduling},
author = {Chen, Hong and Wang Xin, and Guan, Chaoyu and Liu, Yue and Zhu Wenwu},
booktitle = {International Conference on Machine Learning},
pages = {3634--3647},
year = {2022},
organization = {PMLR}
}
@inproceedings{
chen2022moduleaware,
title={Module-Aware Optimization for Auxiliary Learning},
author={Hong Chen and Xin Wang and Yue Liu and Yuwei Zhou and Chaoyu Guan and Wenwu Zhu},
booktitle={Thirty-Sixth Conference on Neural Information Processing Systems},
year={2022},
url={https://openreview.net/forum?id=x-i37an3uym}
}

maoal's People

Contributors

forchchch avatar

Stargazers

ceeyeo Y avatar  avatar  avatar  avatar Generall avatar  avatar  avatar

Watchers

 avatar

Forkers

daonanzhang

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.