On the Role of ViT and CNN in Semantic Communications: Analysis and Prototype Validation

Hanju Yoo, Linglong Dai, Songkuk Kim, Chan Byoung Chae

Research output: Contribution to journalArticlepeer-review

10 Citations (Scopus)

Abstract

Semantic communications have shown promising advancements by optimizing source and channel coding jointly. However, the dynamics of these systems remain understudied, limiting research and performance gains. Inspired by the robustness of Vision Transformers (ViTs) in handling image nuisances, we propose a ViT-based model for semantic communications. Our approach achieves a peak signal-To-noise ratio (PSNR) gain of +0.5 dB over convolutional neural network variants. We introduce novel measures, average cosine similarity and Fourier analysis, to analyze the inner workings of semantic communications and optimize the system's performance. We also validate our approach through a real wireless channel prototype using software-defined radio (SDR). To the best of our knowledge, this is the first investigation of the fundamental workings of a semantic communications system, accompanied by the pioneering hardware implementation. To facilitate reproducibility and encourage further research, we provide open-source code, including neural network implementations and LabVIEW codes for SDR-based wireless transmission systems (Source codes available at https://bit.ly/SemViT).

Original languageEnglish
Pages (from-to)71528-71541
Number of pages14
JournalIEEE Access
Volume11
DOIs
Publication statusPublished - 2023

Bibliographical note

Publisher Copyright:
© 2013 IEEE.

All Science Journal Classification (ASJC) codes

  • General Computer Science
  • General Materials Science
  • General Engineering

Fingerprint

Dive into the research topics of 'On the Role of ViT and CNN in Semantic Communications: Analysis and Prototype Validation'. Together they form a unique fingerprint.

Cite this