Jinhyun Jang, Taeyong Song, Kwanghoon Sohn

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)


Aerial-to-ground image synthesis is an emerging and challenging problem that aims to synthesize a ground image from an aerial image. Due to the highly different layout and object representation between the aerial and ground images, existing approaches usually fail to transfer the components of the aerial scene into the ground scene. In this paper, we propose a novel framework to explore the challenges by imposing enhanced structural alignment and semantic awareness. We introduce a novel semantic-attentive feature transformation module that allows to reconstruct the complex geographic structures by aligning the aerial feature to the ground layout. Furthermore, we propose semantic-aware loss functions by leveraging a pre-trained segmentation network. The network is enforced to synthesize realistic objects across various classes by separately calculating losses for different classes and balancing them. Extensive experiments including comparisons with previous methods and ablation studies show the effectiveness of the proposed framework both qualitatively and quantitatively.

Original languageEnglish
Title of host publication2021 IEEE International Conference on Image Processing, ICIP 2021 - Proceedings
PublisherIEEE Computer Society
Number of pages5
ISBN (Electronic)9781665441155
Publication statusPublished - 2021
Event2021 IEEE International Conference on Image Processing, ICIP 2021 - Anchorage, United States
Duration: 2021 Sept 192021 Sept 22

Publication series

NameProceedings - International Conference on Image Processing, ICIP
ISSN (Print)1522-4880


Conference2021 IEEE International Conference on Image Processing, ICIP 2021
Country/TerritoryUnited States

Bibliographical note

Publisher Copyright:
© 2021 IEEE

All Science Journal Classification (ASJC) codes

  • Software
  • Computer Vision and Pattern Recognition
  • Signal Processing


Dive into the research topics of 'SEMANTIC-AWARE NETWORK FOR AERIAL-TO-GROUND IMAGE SYNTHESIS'. Together they form a unique fingerprint.

Cite this