The tryondiffusion from tryonlabs

Different noise augmentation levels for different items in batch?

The paper does not clearly state how noise augmentation levels are applied. Using different noise augmentation levels for different items in the batch is worth experimenting with. For now, I have kept the noise augmentation level the same across the batch.
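One way to try per-item levels is to draw a separate level for each sample and broadcast it over the image dimensions. This is a minimal NumPy sketch, not the repository's code: the function name `noise_augment`, the uniform sampling range, and the plain additive Gaussian noise are all assumptions for illustration.

```python
import numpy as np

def noise_augment(batch, rng, per_item=True, max_level=0.3):
    """Add Gaussian noise to a batch of images shaped (B, C, H, W).

    Returns the noisy batch and the level used for each item, shape (B,).
    The uniform range [0, max_level) is an assumption, not from the paper.
    """
    b = batch.shape[0]
    if per_item:
        # One independent level per item in the batch.
        levels = rng.uniform(0.0, max_level, size=b)
    else:
        # A single level shared by the whole batch (the current behaviour).
        levels = np.full(b, rng.uniform(0.0, max_level))
    noise = rng.standard_normal(batch.shape)
    # Broadcast each item's level over its channel and spatial dimensions.
    noisy = batch + levels[:, None, None, None] * noise
    return noisy, levels

rng = np.random.default_rng(0)
batch = np.zeros((4, 3, 8, 8))
noisy_per_item, levels_per_item = noise_augment(batch, rng, per_item=True)
noisy_shared, levels_shared = noise_augment(batch, rng, per_item=False)
```

The returned levels can then be fed to the model as the conditioning value for each item, whichever sampling scheme is used.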

What is the dataset format?

I am looking into training the model. But I cannot understand the format of the dataset.

tryondiffusion/tryondiffusion/trainer.py, lines 10 to 20 in d471b91:

    self.train_ip_folder = "data/test_flow/train/ip"
    self.train_jp_folder = "data/test_flow/train/jp"
    self.train_ia_folder = "data/test_flow/train/ia"
    self.train_ic_folder = "data/test_flow/train/ic"
    self.train_jg_folder = "data/test_flow/train/jg"
    self.validation_ip_folder = "data/test_flow/validation/ip"
    self.validation_jp_folder = "data/test_flow/validation/jp"
    self.validation_ia_folder = "data/test_flow/validation/ia"
    self.validation_ic_folder = "data/test_flow/validation/ic"
    self.validation_jg_folder = "data/test_flow/validation/jg"

Can you tell me what the abbreviated folder names ip, jp, ia, ic and jg stand for? I can probably guess image-parse for ip, but the rest are a mystery.

Add positional encoding and noise augmentation level

Pooled pose embeddings must be summed with the positional encodings of the diffusion timestep t and the noise augmentation levels tna, as explained at the end of section 3.2 in the paper.
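The summation could look roughly like the NumPy sketch below. It assumes a Transformer-style sinusoidal encoding and that all three terms share the same embedding size. The names `sinusoidal_encoding` and `conditioning_embedding` are hypothetical, and any learned projection the paper may apply afterwards is left out.

```python
import numpy as np

def sinusoidal_encoding(values, dim):
    """Transformer-style sinusoidal encoding of a batch of scalars.

    values: shape (B,); dim: even embedding size. Returns shape (B, dim).
    """
    half = dim // 2
    # Geometric progression of frequencies from 1 down towards 1/10000.
    freqs = np.exp(-np.log(10000.0) * np.arange(half) / half)
    angles = np.asarray(values, dtype=np.float64)[:, None] * freqs[None, :]
    return np.concatenate([np.sin(angles), np.cos(angles)], axis=1)

def conditioning_embedding(pooled_pose, t, t_na):
    """Sum pooled pose embeddings (B, D) with encodings of t and t_na."""
    dim = pooled_pose.shape[1]
    return (pooled_pose
            + sinusoidal_encoding(t, dim)
            + sinusoidal_encoding(t_na, dim))

pooled = np.zeros((2, 8))        # stand-in for pooled pose embeddings
t = np.array([0, 10])            # diffusion timesteps, one per item
t_na = np.array([0.0, 0.0])      # noise augmentation levels, one per item
cond = conditioning_embedding(pooled, t, t_na)
```

Because t and tna are passed per item, the same function works whether the noise augmentation level is shared across the batch or varies between items.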

Two mistakes

1. dataloader_train.py, lines 72-75: T.ToTensor() is missing. ToTensor includes the division by 255, whereas T.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5)) only subtracts 0.5 and divides by 0.5. I recommend using PIL to preprocess the images instead of OpenCV.
2. diffusion.py, lines 79-83: add_noise_to_img(self, img, t) returns (sqrt_alpha_timestep * epsilon) + (sqrt_one_minus_alpha_timestep * epsilon), epsilon. It should return (sqrt_alpha_timestep * img) + (sqrt_one_minus_alpha_timestep * epsilon), epsilon.
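Both fixes can be checked in isolation with a small NumPy sketch. It is not the repository's code: `to_model_range` is a hypothetical helper that mirrors ToTensor followed by Normalize, and `add_noise_to_img` here takes the cumulative alpha value directly instead of looking it up from a timestep.

```python
import numpy as np

def to_model_range(img_uint8):
    """Map a uint8 image in [0, 255] to floats in [-1, 1].

    Mirrors ToTensor (divide by 255) followed by Normalize(0.5, 0.5),
    without the HWC -> CHW transpose that ToTensor also performs.
    """
    x = img_uint8.astype(np.float32) / 255.0
    return (x - 0.5) / 0.5

def add_noise_to_img(img, alpha_bar_t, rng):
    """Forward diffusion: x_t = sqrt(a) * x_0 + sqrt(1 - a) * epsilon."""
    epsilon = rng.standard_normal(img.shape)
    # The first term scales the image, not the noise.
    noisy = np.sqrt(alpha_bar_t) * img + np.sqrt(1.0 - alpha_bar_t) * epsilon
    return noisy, epsilon

rng = np.random.default_rng(0)
pixels = np.array([[0, 255]], dtype=np.uint8)
x0 = to_model_range(pixels)                    # values -1 and 1
clean, _ = add_noise_to_img(x0, 1.0, rng)      # no noise: returns the image
pure, eps = add_noise_to_img(x0, 0.0, rng)     # all noise: returns epsilon
```

With the original expression, the result would never depend on the image at all, which the first boundary case (cumulative alpha of 1) makes easy to spot.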

Pass `device` to the `GaussianSmoothening` function

Pass device to both the UNets while initializing them, and use self.device to pass it to the GaussianSmoothening function in AttentionPool1d.

draw_bodypose is not included in body_pose.py

The function draw_bodypose in utils.py is not included in body_pose.py

It is required to draw the estimated pose.

Recommended Change: Include draw_bodypose in body_pose.py and call it after calling body_estimation.

Maybe I am not familiar with the concept, but how would it work without RGB-agnostic images, and how would the 6 channels be passed? Do we set the values to 0 for the RGB-agnostic images? Any comments are welcome.

Error in human and garment pose training scripts

There is a small bug in the train.py scripts of the human and garment pose models.

The bug is in the lines below:

for test_keypoints in test_dataloader:
    model.eval()
    test_predictions, _ = model(test_keypoints)

The correction should be:

for test_keypoints, _ in test_dataloader:
    model.eval()
    test_predictions, _ = model(test_keypoints)
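A minimal sketch of why the unpacking matters, assuming (as the fix implies) that each batch from the test dataloader is a pair of keypoints and targets. The list standing in for the dataloader and `toy_model` are hypothetical.

```python
def toy_model(keypoints):
    """Hypothetical stand-in for the pose model: returns (predictions, extra)."""
    return [2 * k for k in keypoints], None

# Hypothetical stand-in for test_dataloader: each batch is (keypoints, target).
test_dataloader = [([1, 2], [0, 0]), ([3, 4], [1, 1])]

# Without unpacking, the loop variable is the whole pair, not the keypoints.
batch_lengths = [len(batch) for batch in test_dataloader]

# With unpacking, only the keypoints reach the model.
all_predictions = []
for test_keypoints, _ in test_dataloader:
    test_predictions, _ = toy_model(test_keypoints)
    all_predictions.append(test_predictions)
```

In the uncorrected loop, the model would receive the pair itself, which typically fails inside the first layer rather than at the loop header.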

