Code Monkey home page Code Monkey logo

deepnude-an-image-to-image-technology's Introduction

DeepNude-an-Image-to-Image-technology

If you are not interested in theory and want to build your own Image-to-Image application directly, click here. 如果你对理论不感兴趣,想直接打造自己的Image-to-Image应用,点击这儿

What is DeepNude? 什么是 DeepNude?

Conditional Adversarial Networks learns the mapping of pixel levels at the corresponding image by training on the original collection and the target collection. For example, the neural network model in DeepNude trains on 10,000 ordinary female photos (source collections) and their corresponding nude photos (target collections). Entering a normal female photo into a trained model produces a corresponding nude photo.

Conditional Adversarial Networks 通过在原图片集和目标图片集上训练,学习对应图片像素级别的映射关系。例如,DeepNude中的神经网络模型就是在10000张普通女性照片(源图片集)和其对应的裸体照片上(目标图片集)进行训练。把普通女性照片输入训练好的模型就能生成其对应的裸体照片。

Obviously, DeepNude is an inappropriate application of artificial intelligence technology, which makes some people have aversion to artificial intelligence technology. It is necessary to clarify the use of the technology itself and the technology. History has long proven that the benefits of using technology are far greater than its drawbacks. We can't lose confidence in artificial intelligence because of squandering food. Here is the place to see the essence of the problem!

显然 DeepNude 是一种人工智能技术不恰当的应用,这让一部分人对人工智能技术产生了厌恶。这儿有必要厘清技术本身和技术的使用。历史早已证明合理的使用技术带来的好处远远大于其弊端。我们不能因噎废食,从而对人工智能失去信心。这儿就是来看清问题本质的地方!

Content of this resource 本资源内容

English 中文
Image-to-Image Demo 试玩Demo
Image-to-Image Theoretical Research 理论研究
Image-to-Image Practice Research 实践研究
DeepNude software itself 软件自身
Future 展望未来

Image-to-Image Demo

This section provides a demo of Image-to-Image Demo: Black and white stick figures to colorful cats, shoes, handbags.

这一部分提供一个试玩的 Image-to-Image Demo:黑白简笔画到色彩丰富的猫、鞋、手袋。

DeepNude software mainly uses Image-to-Image technology, which theoretically converts the images you enter into any image you want. You can experience Image-to-Image technology in your browser by clicking Image-to-Image Demo below.

DeepNude 软件主要使用了Image-to-Image技术,该技术理论上可以把你输入的图片转换成任何你想要的图片。你可以点击下方的Image-to-Image Demo在浏览器中体验Image-to-Image技术。

Image-to-Image Demo

An example of using this demo is as follows:

In the left side box, draw a cat as you imagine, and then click the process button, you can output a model generated cat.

在左侧框中按照自己想象画一个简笔画的猫,再点击process按钮,就能输出一个模型生成的猫。


Image-to-Image Theoretical Research

This section describes DeepNude-related AI/Deep Learning theory (especially computer vision) research. If you like to read the paper and use the latest papers, enjoy it.

这一部分阐述DeepNude相关的人工智能/深度学习理论(特别是计算机视觉)研究,如果你喜欢阅读论文使用最新论文成果,尽情享用吧。

1. Image Inpainting 图像修复

效果

In the image interface of Image_Inpainting(NVIDIA_2018).mp4 video, you only need to use tools to simply smear the unwanted content in the image. Even if the shape is very irregular, NVIDIA's model can “restore” the image with very realistic The picture fills the smeared blank. It can be described as a one-click P picture, and "no ps traces." The study was based on a team from Nvidia's Guilin Liu et al. who published a deep learning method that could edit images or reconstruct corrupted images, even if the images were worn or lost pixels. This is the current 2018 state-of-the-art approach.

Image_Inpainting(NVIDIA_2018).mp4 视频中左侧的操作界面,只需用工具将图像中不需要的内容简单涂抹掉,哪怕形状很不规则,NVIDIA的模型能够将图像“复原”,用非常逼真的画面填补被涂抹的空白。可谓是一键P图,而且“毫无ps痕迹”。 该研究来自Nvidia的Guilin Liu等人的团队,他们发布了一种可以编辑图像或重建已损坏图像的深度学习方法,即使图像穿了个洞或丢失了像素。这是目前2018 state-of-the-art的方法。

2. Image-to-Image or Pix2Pix (need for paired train data)

DeepNude mainly uses this Image-to-Image(Pix2Pix) technology.

效果

Image-to-Image Translation with Conditional Adversarial Networks is a general solution for the use of conditional confrontation networks as an image-to-image conversion problem proposed by the University of Berkeley.

Image-to-Image Translation with Conditional Adversarial Networks 是伯克利大学研究提出的使用条件对抗网络作为图像到图像转换问题的通用解决方案。

3. CycleGAN (without the need for paired train data)

效果

CycleGAN uses a cycle consistency loss to enable training without the need for paired data. In other words, it can translate from one domain to another without a one-to-one mapping between the source and target domain. This opens up the possibility to do a lot of interesting tasks like photo-enhancement, image colorization, style transfer, etc. All you need is the source and the target dataset.

CycleGAN使用循环一致性损失函数来实现训练,而无需配对数据。换句话说,它可以从一个域转换到另一个域,而无需在源域和目标域之间进行一对一映射。 这开启了执行许多有趣任务的可能性,例如照片增强,图像着色,样式传输等。您只需要源和目标数据集。

horse2zebra 马变斑马


Image-to-Image Practice Research

More models and functions will be added in the future

This section explains DeepNude-related AI/Deep Learning (especially computer vision) code practices, and if you like to experiment, enjoy them.

这一部分阐述DeepNude相关的人工智能/深度学习(特别是计算机视觉)代码实践,如果你喜欢动手做实验,尽情享用它们。

The CycleGAN neural network model is used to realize the four functions of photo style conversion, photo effect enhancement, landscape season change, and object conversion.

使用CycleGAN神经网络模型实现照片风格转换、照片效果增强、照片中风景季节变换、物体转换四大功能。

Click Start Experience A


DeepNude software itself

After researching DeepNude technology, I have removed data related to DeepNude. Please don't ask me to get DeepNude program.

DeepNude's technology stack

Windows version of DeepNude use process

DeepNude can really achieve the purpose of Image-to-Image, and the generated image is more realistic.

Delete the color.cp36-win_amd64.pyd file in the deepnude root directory, and then add the color.py file to get the advanced version of deepnude.

What can be improved?

DeepNude software shortcomings

  1. Size. Including 156M DeepNude_Windows_v2.0.0.zip and 1.90G pyqtlib.rar;
  2. Speed. It takes 30 seconds to convert a picture;
  3. Content. Use the Image-to-Image neural network to automatically remove the clothes from women to reveal their nudity. This application applies the wrong application of deep learning.

Where DeepNude can be improved

  • DeepNude can be implemented using Tensorflow and uses model compression techniques.
  • DeepNude should change the current practice of not respecting women.

Future

In fact, we don't need Image-to-Image. We can use GANs to generate images directly from random values or generate images from text.

The new AI technology Obj-GAN developed by Microsoft Research AI understands natural language descriptions, sketches, composite images, and then refines the details based on individual words provided by sketch frames and text. In other words, the network can generate images of the same scene based on textual descriptions that describe everyday scenes.

微软人工智能研究院(Microsoft Research AI)开发的新 AI 技术Obj-GAN可以理解自然语言描述、绘制草图、合成图像,然后根据草图框架和文字提供的个别单词细化细节。换句话说,这个网络可以根据描述日常场景的文字描述生成同样场景的图像。

效果

模型

Advanced version of the pen: just one sentence, one story, you can generate a picture.

Microsoft's new research proposes a new GAN, ObjGAN, which can generate complex scenes based on textual descriptions. They also proposed another GAN, StoryGAN, which can draw a story. Enter the text of a story and output the "picture book".

进阶版神笔:只需一句话、一个故事,即可生成画面

微软新研究提出新型 GAN——ObjGAN,可根据文字描述生成复杂场景。他们还提出另一个可以画故事的 GAN——StoryGAN,输入一个故事的文本,即可输出「连环画」。

当前最优的文本到图像生成模型可以基于单句描述生成逼真的鸟类图像。然而,文本到图像生成器远远不止仅对一个句子生成单个图像。给定一个多句段落,生成一系列图像,每个图像对应一个句子,完整地可视化整个故事。

效果


The most commonly used image-to-image technology should be Beauty App, so why don't we develop a smarter Beauty Camera?

现在用得最多的Image-to-Image技术应该就是美颜APP了,所以我们为什么不开发一个更加智能的美颜相机呢?

If you have interesting ideas or data, please contact me quickly at [email protected].

deepnude-an-image-to-image-technology's People

Contributors

yuanxiaosc avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.