Dear zihan,
Thank you for your impressive work!
I have been following your work and I have successfully run the code with the setting 'model_type="LViT"', which does not require a pretrained model. However, I am facing some challenges in understanding how to obtain the pretained model for the setting 'model_type="LViT"'.
As discussed in Section 2.1 of your instructions:
It appears that a U-Net model might be a prerequistite for the LViT model. Could you please clarifiy if this is the case? If so, does this imply I need to train a U-Net model first before proceeding with 'LViT_pretrain'? Also, in such a scenario, should I change the model type and write the corresponding code here?
Furthermore, I am also curious about how to load the pretrained U-Net model once it is obtained. Is this U-Net model directly applicable to LViT_pretrain, or are there additional steps or modifications required?
Your guidance on these matters would be greatly appreciated, as it would greatly assist me in understanding and utilizing your work more effectively.
Thank you for your time and consideration. I am looking forward to your valuable insights.
Best regards,
Pengyu Zhao