Paper
10 November 2022 Conditional generative adversarial networks in computer vision: an introduction and outlook
Mengkun Gao
Author Affiliations +
Proceedings Volume 12348, 2nd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2022); 1234804 (2022) https://doi.org/10.1117/12.2641644
Event: 2nd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2022), 2022, Zhuhai, China
Abstract
Conditional Generative Adversarial Networks (CGAN) is an architecture-variant Generative Adversarial Network (GAN) that inputs conditional data into generator and discriminator simultaneously. CGAN has been applied in a wide range of fields in recent years. One of the most famous usages is its application in computer vision, which enables to perform images and videos transformation with specified conditions. In this paper, we first introduce the structure of GAN, which supports every CGAN system, and the basic CGAN that has been demonstrated in computer vision. Then, we review different implementations of CGAN in two major computer vision domains, i.e., the image processing domain and the videos processing domain. Besides, we review three representative research orientations in each domain, which all implement CGAN or variants of CGAN. Moreover, there is a comparison or declaration of several outstanding approaches in each direction.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Mengkun Gao "Conditional generative adversarial networks in computer vision: an introduction and outlook", Proc. SPIE 12348, 2nd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2022), 1234804 (10 November 2022); https://doi.org/10.1117/12.2641644
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

Computer vision technology

Machine vision

Video processing

Image processing

Neural networks

3D modeling

RELATED CONTENT

Swimmer position estimation by lane rectification
Proceedings of SPIE (March 22 2019)
Novel receipt recognition with deep learning algorithms
Proceedings of SPIE (April 22 2020)
Neural model for feature matching in stereo vision
Proceedings of SPIE (February 01 1991)
Real Time Pyramid Transform Architecture
Proceedings of SPIE (December 11 1985)

Back to Top