Comments (8)
hi @manuelknott, the code for the stage 2 transformer is currently buggy, so once I have fixed everything I will try to train and release a pretrained model. That may take a while, though, since I'm still learning about autoregressive modeling with transformers.
from enhancing-transformers.
Awesome! Already looking forward to it.
Then, if you don't have any further questions, I will close this issue. Feel free to reopen it.
Is there any update on this? I have seen that there have been some modifications to the stage 2 code. If the bugs are fixed, I can also try to train it on my own. Thank you!
hi @manuelknott, sorry for the late reply. The issue should be fixed now, so you can train a stage 2 transformer.
Awesome, thank you. Just to be sure: the main.py file only covers stage 1 training for now, right? Is there any chance you could share the code for training stage 2 if it is not part of the repo yet?
hi @manuelknott, main.py supports training the stage 2 model; you just need to use the correct config. For example, you can refer to imagenet_gpt_vitvq_base.yaml to get a glimpse of a stage 2 config. Note that this example stage 2 config is quite big and only fits on at least 8 A100s, so you might want to reduce the parameters.
Thanks for the explanation! I will try to train a model with my limited resources (2x A5000). Let's see how it goes. Do you plan to publish a pretrained stage 2 model anytime soon?