Synthetic data generation using building information models

Yeji Hong, Somin Park, Hongjo Kim, Hyoungkwan Kim

Research output: Contribution to journalArticlepeer-review

24 Citations (Scopus)


Infrastructure scene understanding from image data aids diverse applications in construction and maintenance. Recently, deep learning models have been employed to extract information regarding infrastructure from visual data. The performance of these models depends significantly on the volume of training data. However, preparing the training data is time-consuming and laborious, as it entails labeling numerous images. To address this issue, this paper proposes a method for generating high-quality synthetic data that includes the automatic annotation of infrastructure scenes. The method consists of three steps: 1) translating building information model (BIM) images into real-world images, 2) automatically labeling them using the spatial information contained in the BIM to generate various synthetic datasets, and 3) splicing the selected synthetic datasets together to form the final synthetic dataset. The Mask R-CNN models trained with building and bridge synthetic data achieved average precisions of 71.6% and 84.9%, respectively.

Original languageEnglish
Article number103871
JournalAutomation in Construction
Publication statusPublished - 2021 Oct

Bibliographical note

Publisher Copyright:
© 2021

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Civil and Structural Engineering
  • Building and Construction


Dive into the research topics of 'Synthetic data generation using building information models'. Together they form a unique fingerprint.

Cite this