Abstract

We present a new method for synthesizing high-resolution, photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs). Conditional GANs have enabled a variety of applications, but their results are often limited to low resolution and still far from realistic. In this work, we generate visually appealing 2048 × 1024 results with a novel adversarial loss, as well as new multi-scale generator and discriminator architectures. Furthermore, we extend our framework to interactive visual manipulation with two additional features. First, we incorporate object instance segmentation information, which enables object manipulations such as removing/adding objects and changing the object category. Second, we propose a method to generate diverse results from the same input, allowing users to edit the object appearance interactively. Human opinion studies demonstrate that our method significantly outperforms existing methods, advancing both the quality and the resolution of deep image synthesis and editing.
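The multi-scale discriminator design mentioned in the abstract amounts to scoring the same image at several resolutions of an image pyramid. The sketch below illustrates only that pyramid idea in plain NumPy: the 2×2 average-pooling downsampler, the `toy_disc` scorer, and the three-scale default are illustrative assumptions, not the paper's actual convolutional architecture.

```python
import numpy as np

def downsample(img):
    """Halve spatial resolution with 2x2 average pooling.

    Multi-scale discriminators operate on such a pyramid: the original
    image, a 2x-downsampled copy, a 4x-downsampled copy, and so on.
    """
    h, w = img.shape[0] // 2 * 2, img.shape[1] // 2 * 2  # crop odd edges
    img = img[:h, :w]
    return (img[0::2, 0::2] + img[1::2, 0::2]
            + img[0::2, 1::2] + img[1::2, 1::2]) / 4.0

def multiscale_scores(img, score_fn, num_scales=3):
    """Apply the same scoring function at each pyramid level and
    return one realism score per scale (coarse scales see global
    structure, fine scales see local detail)."""
    scores = []
    for _ in range(num_scales):
        scores.append(score_fn(img))
        img = downsample(img)
    return scores

# Hypothetical stand-in for a learned discriminator: mean activation.
toy_disc = lambda x: float(x.mean())

img = np.random.rand(256, 512)  # grayscale stand-in for a label map
scores = multiscale_scores(img, toy_disc)
print(len(scores))  # one score per scale
```

In the paper's setting each pyramid level would instead be fed to its own patch-based discriminator network; the point here is only that all scales share one input image.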

Keywords

Computer science, Discriminator, Generator, Artificial intelligence, Object manipulation, Generative adversarial networks, Image synthesis, Segmentation, Semantics, Resolution, Computer vision, Pattern recognition

Publication Info

Year: 2018
Type: article
Pages: 8798–8807
Citations: 4266
Access: Closed

Citation Metrics

4266 citations (source: OpenAlex)
Cite This

Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu et al. (2018). High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 8798–8807. https://doi.org/10.1109/cvpr.2018.00917

Identifiers

DOI: 10.1109/cvpr.2018.00917