Comments (3)
I have realized that the model can be improved by making the generation of the background and foreground conditional dependent. It is unclear if the background should b e generated first and then the foreground or the other way around.
In the inference I have not the trick to rescale the bounding box probabilities based on (img_raw-bg) which clearly suggests that, in the inference, the background is inferred first and the foreground is conditional dependent.
I am pretty sure my trick can be removed by extending the model and making foreground and background conditional dependent
from spacetx-research.
One idea to improve this is to take the mask, send it to a CNN, and use z what as AdaIN mu and std.
The idea is to take the mask and massage it with content by changing the activation of conv filters using z_what
One doesn't need to condition z_what on z_mask
since the image is generated by modulating the mask, the necessary mask/content correlations will be learnt automatically
Just a thought ;-)
We justify conditioning the inference of z_what on z_mask because, well, you need both the image AND the mask to infer z_what, even if z_mask is indep of z_what
AdaIN is this paper: https://arxiv.org/pdf/1703.06868.pdf
from spacetx-research.
Still issue is solved since in the new graphical model there is only one Z which is decoded to both image and mask
from spacetx-research.
Related Issues (20)
- problem at TEST time HOT 1
- things to test
- replace all RELU with LeakyReLU
- MEMO OF THINGS TO DO HOT 3
- FIX MOVIES in MAIN
- USE toarch.save instead of pickle.save
- dataloader HOT 1
- put tiling function on CPU if memory is a problem
- memory footprint might be too large HOT 1
- to do tomorrow
- to do when coming back from vacation
- make encoder/decoder of different size and check them on single cells HOT 1
- to do october HOT 1
- Prior for the probability is wrong HOT 1
- robustness to different inputs size
- IDEA TO CHECKS
- build a k-nn graph not a graph based on spatial vicinity....
- strategy to deal with high-resolution images
- WHAT I AM LEARNING
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from spacetx-research.