StyleGANs and Transfer Learning for Generating Synthetic Images in Industrial Applications

Deep learning applications on computer vision involve the use of large-volume and representative data to obtain state-of-the-art results due to the massive number of parameters to optimise in deep models. However, data are limited with asymmetric distributions in industrial applications due to rare...

Full description

Saved in:

Bibliographic Details
Published in	Symmetry (Basel) Vol. 13; no. 8; p. 1497
Main Authors	Achicanoy, Harold, Chaves, Deisy, Trujillo, Maria
Format	Journal Article
Language	English
Published	Basel MDPI AG 01.08.2021
Subjects	Bedrooms Charcoal Computer vision Data augmentation Deep learning Domains Generative adversarial networks Image acquisition Image processors Image quality Industrial applications Probability distribution Skewed distributions Synthetic data Training
Online Access	Get full text
ISSN	2073-8994 2073-8994
DOI	10.3390/sym13081497

Cover

Loading…

More Information
Summary:	Deep learning applications on computer vision involve the use of large-volume and representative data to obtain state-of-the-art results due to the massive number of parameters to optimise in deep models. However, data are limited with asymmetric distributions in industrial applications due to rare cases, legal restrictions, and high image-acquisition costs. Data augmentation based on deep learning generative adversarial networks, such as StyleGAN, has arisen as a way to create training data with symmetric distributions that may improve the generalisation capability of built models. StyleGAN generates highly realistic images in a variety of domains as a data augmentation strategy but requires a large amount of data to build image generators. Thus, transfer learning in conjunction with generative models are used to build models with small datasets. However, there are no reports on the impact of pre-trained generative models, using transfer learning. In this paper, we evaluate a StyleGAN generative model with transfer learning on different application domains—training with paintings, portraits, Pokémon, bedrooms, and cats—to generate target images with different levels of content variability: bean seeds (low variability), faces of subjects between 5 and 19 years old (medium variability), and charcoal (high variability). We used the first version of StyleGAN due to the large number of publicly available pre-trained models. The Fréchet Inception Distance was used for evaluating the quality of synthetic images. We found that StyleGAN with transfer learning produced good quality images, being an alternative for generating realistic synthetic images in the evaluated domains.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2073-8994 2073-8994
DOI:	10.3390/sym13081497