kaichen1998 / geodiffusion Goto Github PK

View Code? Open in Web Editor NEW

55.0 55.0 0.0 4.2 MB

Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)

Home Page: https://kaichen1998.github.io/projects/geodiffusion/

License: MIT License

Python 98.73% Shell 1.27%

geodiffusion's People

Contributors

Stargazers

Watchers

geodiffusion's Issues

Generation quality of the model

Thanks for your inspiring work!

However, I encountered a problem. When I use the model trained on COCO-Stuff and image size is 512*512, the generation quality seems poor.

The prompt from coco-stuff is:

  layout = {
    "bbox":
      [
        ['metal', 0.04218750074505806, 0.25647059082984924, 0.10000000149011612, 0.5247058868408203],
        ['chair', 0.17940625548362732, 0.4312705993652344, 0.35014063119888306, 0.5062353014945984],
        ['sky-other', 0.606249988079071, 0.0, 0.734375, 0.09882353246212006],
        ['person', 0.0, 0.5493882298469543, 0.07332812249660492, 0.7298117876052856],
        ['pavement', 0.0, 0.5976470708847046, 0.9781249761581421, 1.0],
        ['building-other', 0.0, 0.0, 1.0, 0.7152941226959229],
        ['person', 0.8331093788146973, 0.5236706137657166, 0.913937509059906, 0.8113176226615906],
        ['chair', 0.422062486410141, 0.4221176505088806, 0.6030937433242798, 0.499505877494812],
        ['bus', 0.1626562476158142, 0.29044705629348755, 0.8476094007492065, 0.9376470446586609],
        ['person', 0.32343751192092896, 0.3623529374599457, 0.792187511920929, 0.5176470875740051],
        ['person', 0.9270156025886536, 0.49814116954803467, 0.9953437447547913, 0.8023764491081238],
        ['clothes', 0.15000000596046448, 0.567058801651001, 1.0, 1.0]
      ]
  }

The generation config is:

{
 "dataset": "coco_stuff",
 "num_bucket_per_side": [256, 256],
 "width": 512,
 "height": 512,
 "prompt_template": "An image with {bbox}",
 "cfg_scale": 4.5,
 "num_inference_steps": 50,
 "max_num_bbox": 18
}

However, the generation result seems strange using run_layout_to_image.py:

I've tried different prompts and the results are very confusing.

What's wrong with my operation? Thanks!

bbox_visualizer function is missing.

Why are the results or md5 all the same after multiple inferences when using the same layout？

Normally，the generate result should be different every time when use the same layout and prompt. But, use your model, the result and md5 of image are all the same

When will the training code be released?

Excuse me , When will the complete code (including training code)be released ? Thank you!

请问代码什么时候开源呢

期待

Congratulations on Your Paper Being Accepted to ICLR 2024!

Dear Chen, Kai and Xie, Enze and Chen, Zhe and Hong, Lanqing and Li, Zhenguo and Yeung, Dit-Yan,

I hope this message finds you well. I am writing to extend my heartfelt congratulations to you and your co-authors on the acceptance of your paper, "GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation," to the International Conference on Learning Representations (ICLR) 2024!

Your achievement reflects the dedication, hard work, and innovative thinking that went into your research. ICLR is one of the premier conferences in machine learning and artificial intelligence, and being accepted to present your work there is a significant recognition of its quality and impact on the field.

On behalf of the open source community, I want to express our sincere appreciation for your contributions to advancing the frontiers of knowledge in machine learning. Your research not only pushes the boundaries of what is possible but also inspires others in our community to explore new ideas and approaches.

Additionally, I would like to extend our gratitude for your commitment to open science and sharing your knowledge with the community. It is through the generosity of researchers like you that we can collectively accelerate progress and foster collaboration in the field.

In light of the importance of reproducibility and open access to research, we kindly request your assistance in making the training code associated with your paper publicly available. Having access to the training code would greatly benefit researchers and practitioners interested in replicating and building upon your findings.

As a token of our appreciation, we would like to offer our assistance in any way possible. Whether it be providing support with open-sourcing the training code associated with your paper or promoting your work through our channels, please do not hesitate to reach out to us.

Once again, congratulations on this well-deserved accomplishment! We look forward to seeing your presentation at ICLR 2024 and witnessing the continued impact of your research in the years to come.

Warm regards,
CatLoves

kaichen1998 / geodiffusion Goto Github PK

geodiffusion's People

Contributors

Stargazers

Watchers

geodiffusion's Issues

Generation quality of the model

bbox_visualizer function is missing.

Why are the results or md5 all the same after multiple inferences when using the same layout？

When will the training code be released?

请问代码什么时候开源呢

Congratulations on Your Paper Being Accepted to ICLR 2024!

When will open the full training code? Thanks

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent