Code Monkey home page Code Monkey logo

gigagan's Issues

clip loss?

Hello, I only saw the definition of clip loss in clip.py. I didn't find where you used it? But your paper points out that this loss is used.

training new SG model with our own CLIP "captions"

Hello @JiauZhang - I was on StyleClips repo and we were trying to confirm something.

I need to train a new SG model, with our own CLIP "captions" and not use LAION or anything else.

So, my alpha test case was to build a new SG model with images of Men and Women.

My question is - Did you train a new model with your own "captions" or how are you adding NEW keywords/captions?

In our case we have our own internal nomenclature which we must train into some medical images for a use-case, and we have to figure out how to do this.

Inference is failing on running generate.py script

The error is IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1) at images = g(z, text_embeds)[0] statement.

rahulbhalley@192 GigaGAN % python3 generate.py
Traceback (most recent call last):
  File "/Users/rahulbhalley/Desktop/Hide-and-Seek/image-synth/GigaGAN/generate.py", line 17, in <module>
    images = g(z, text_embeds)[0]
             ^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/rahulbhalley/Desktop/Hide-and-Seek/image-synth/GigaGAN/model.py", line 140, in forward
    styles = [self.style(s) for s in styles]
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/rahulbhalley/Desktop/Hide-and-Seek/image-synth/GigaGAN/model.py", line 140, in <listcomp>
    styles = [self.style(s) for s in styles]
              ^^^^^^^^^^^^^
  File "/opt/homebrew/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/lib/python3.11/site-packages/torch/nn/modules/container.py", line 217, in forward
    input = module(input)
            ^^^^^^^^^^^^^
  File "/opt/homebrew/lib/python3.11/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/rahulbhalley/Desktop/Hide-and-Seek/image-synth/GigaGAN/layers.py", line 11, in forward
    return input * torch.rsqrt(torch.mean(input ** 2, dim=1, keepdim=True) + 1e-8)
                               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)

How can I contribute to your work

Hello, great effort on this re-implementation.
Would you need GPU resources / other assistance to complete your work? How far are you from achieving the paper results for the upscaler?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.