crossmodalgroup / maskedvectorquantization Goto Github PK

Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation"

License: MIT License

Python 100.00%

maskedvectorquantization's People

Stargazers

Watchers

Forkers

yjyoo3312 dongso peterzs

maskedvectorquantization's Issues

Query regarding typing mistake in the paper

hi, thanks for the great paper. in Fig. 4 you mentioned "red denotes high scores while blue denotes low scores." but shouldn't it will be opposite blue denotes high scores while red denotes low scores? because why backgorund is getting high score

Speed of the MaskedVectorQunantization

Thanks for releasing the nice work! I test the maskedVectorQuantization module on my task. However, the masked version is 4x slower than the version without the masker and demasker modules. Is there any suggestions to accelerate the training? Thank you in advance!

Can't find "modules" floder of this project

This is a valuable job, I want to train on my own dataset. Would you like to share the core part such as Encoder， Decoder， Masker, and Demasker？

Regarding obtaining the configuration of the first stage DQVAE used in the second stage training.

Hello author, I am very interested in your project! Due to my low computer configuration, may I obtain your pre-trained DQVAE's ckpt to debug the code for the second stage?

How to train the lightweight scoring network to pickup import patches to be VQ-ed.

Hi, I have a question as described in the issue, the way/rules to separate all feature vectors in the grid feature map from the encoder into important and unimportant samples confused me. As depicted in Sec 3.2: The larger the score s_l is, the more important the region feature z_l is. However, I don't understand the label assignment strategy to distinguish the important samples from the unimportant ones to train the lightweight scoring function. Would you please kindly specify it in this issue? I appreciate it in advance.

crossmodalgroup / maskedvectorquantization Goto Github PK

maskedvectorquantization's People

Stargazers

Watchers

Forkers

maskedvectorquantization's Issues

Query regarding typing mistake in the paper

Speed of the MaskedVectorQunantization

Can't find "modules" floder of this project

Regarding obtaining the configuration of the first stage DQVAE used in the second stage training.

How to train the lightweight scoring network to pickup import patches to be VQ-ed.

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent