sled-group / cyclenet Goto Github PK

Official Code for NeurIPS 2023 Paper: CycleNet: Rethinking Cycle Consistent in Text‑Guided Diffusion for Image Manipulation

Home Page: https://cyclenetweb.github.io/

License: Apache License 2.0

Python 100.00%

diffusion-models image-edit cyclegan cycle-consistency

cyclenet's People

Contributors

Stargazers

Watchers

Forkers

mars-tin paperwave mobled37 standardgalactic

cyclenet's Issues

Make images as condition not text

great job! I have a question that is ok that make images as condition instead of text? for example, as for original dataset the dataset structure is like this:

fill50k/train.jsonl
fill50k/images/X.png
fill50k/conditioning_images/X.png

can i use the conditioning_images as condition, and change the relevent part of the code in here:

class MyDataset(Dataset):
    def __init__(self):
        self.data = []
        with open('./training/cfill50k/prompt.json', 'rt') as f:
            for line in f:
                self.data.append(json.loads(line))

    def __len__(self):
        return len(self.data)

    def __getitem__(self, idx):
        item = self.data[idx]

        image_filename = item['image']
        source = item['source']
        target = item['target']

        image = cv2.imread('./training/cfill50k/' + image_filename)

        # Do not forget that OpenCV read images in BGR order.
        image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)

        image = (image.astype(np.float32) / 127.5) - 1.0

        return dict(jpg=image, source=source, txt=target)

Thank you!

How to provide appropriate source prompts and target prompts

Your work looks impressive. I would like to ask, in the summer2winter task, what kind of source prompts and target prompts are provided?

Too much VRAM cost and ckpt file volume compared to the paper

Thanks for great work!

I followed the example provided with cfill dataset.
-> https://github.com/sled-group/CycleNet/blob/main/docs/train.md

I found 2 differences with the original works.

Training with batch size = 1 cost over 30GB VRAM, which is different in paper (batch size = 4 cost 27.9GB VRAM).
My ckpt file (saved in training) is 13.7 GB, which is different in provided ckpt (summer2winter.ckpt = 6.7 GB)

Would you explain how to reproduce the original works?

How to train the model based on Scene/Object-Level Manipulation dataset or ManiCups dataset?

hello, great work!
How to train the model based on Scene/Object-Level Manipulation dataset or ManiCups dataset?
@Mars-tin @Roihn @h6kplus @SihanXU
Or how to sample based on pre-trained models?

CycleFill dataset link down

In the 'docs/train.md' section, a link to the CycleFill50k dataset is provided, but it leads to a 404 error page. Could you please provide the dataset or the 'prompt.json' file? I have located the checkpoint, but I am looking to train with my own data.

input image:

output image

sled-group / cyclenet Goto Github PK

cyclenet's People

Contributors

Stargazers

Watchers

Forkers

cyclenet's Issues

Make images as condition not text

How to provide appropriate source prompts and target prompts

Too much VRAM cost and ckpt file volume compared to the paper

How to train the model based on Scene/Object-Level Manipulation dataset or ManiCups dataset?

CycleFill dataset link down

Training cyclefill50

will you release the full training code of winter to summer?

Pre-trained weights for CycleNet style transfer

when will the paper relsease

Has anyone reproduced this method？

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent