Code Monkey home page Code Monkey logo

tryondiffusion's People

Contributors

kailashahirwar avatar rsamf avatar tanayag avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

tryondiffusion's Issues

What is the dataset format?

I am looking into training the model. But I cannot understand the format of the dataset.

self.train_ip_folder = "data/test_flow/train/ip"
self.train_jp_folder = "data/test_flow/train/jp"
self.train_ia_folder = "data/test_flow/train/ia"
self.train_ic_folder = "data/test_flow/train/ic"
self.train_jg_folder = "data/test_flow/train/jg"
self.validation_ip_folder = "data/test_flow/validation/ip"
self.validation_jp_folder = "data/test_flow/validation/jp"
self.validation_ia_folder = "data/test_flow/validation/ia"
self.validation_ic_folder = "data/test_flow/validation/ic"
self.validation_jg_folder = "data/test_flow/validation/jg"

Can you tell me what the different abbreveated folders stand for? The folders: ip, jp, ia, ic and jg. Can probably get image-parse from ip but the rest are a mistery.

two mistakes

1.dataloader_train.py line 72-75,missed T.ToTensor(), which is include /255. T.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5)), only ensure -0.5 and /0.5. recommend use PIL to preprocess image instead of use opencv.
2. diffusion.py line 79-83, def add_noise_to_img(self, img, t): return (sqrt_alpha_timestep * epsilon) + (sqrt_one_minus_alpha_timestep * epsilon), epsilon, which is should be return (sqrt_alpha_timestep * img) + (sqrt_one_minus_alpha_timestep * epsilon), epsilon.

draw_bodypose is not included in body_pose.py

The function draw_bodypose in utils.py is not included in body_pose.py

It is required to draw the estimated pose.

Recommended Change: Include draw_bodypose in body_pose.py and call it after calling body_estimation.

discord

please add a link to the discord server

Concatenate instead of add

I made a little mistake while implementing the UNet architecture. Instead of concatenating, I added(elementwise summation) the channels in UNet's skip connections.

Ideally, we must concatenate them channel-wise and reduce the channels during convolution. It's a straightforward fix. I am creating the issue to keep track of and fix the problem before training.

Run the tryondiffusion Code

Is this project available for use or is it under construction? Plus if it is available kindly guide how to run this since there is only setup file to download dependencies only.

Better Documentation

I want to know how to use this in production.
Is this just a model implementation? If it is the implementation of the architecture then how can we train it over our dataset?

how to get the garment landmarks

can you help me out in finding the garment landmarks keypoints? and what are the techniques that u have used to get the garment landmark keypoints

Add support for Classifier Free Guidance in Network

The paper says, "We follow classifier-free guidance and train our models with conditioning dropout: conditional inputs are set to 0 for 10% of training time."

This means during 10% of the training time, only Person UNet is to be trained, without any cross-attention, or self attention or anything in conditional inputs.

Maybe I am not familiar with the concept but how would it work without RGB-agnostic images, or how 6 channels would be passed? Do we make values 0 for RGB agnostic images? Any comments are welcome.

Error in human and garment pose training scripts

There is a small bug in train.py script of human and garment pose models.

The bug is present in below lines:

for test_keypoints in test_dataloader:
            model.eval()
            test_predictions, _ = model(test_keypoints)

The correction should be:

for test_keypoints, _ in test_dataloader:
            model.eval()
            test_predictions, _ = model(test_keypoints)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.