TY - JOUR
T1 - On the Role of ViT and CNN in Semantic Communications
T2 - Analysis and Prototype Validation
AU - Yoo, Hanju
AU - Dai, Linglong
AU - Kim, Songkuk
AU - Chae, Chan Byoung
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2023
Y1 - 2023
N2 - Semantic communications have shown promising advancements by optimizing source and channel coding jointly. However, the dynamics of these systems remain understudied, limiting research and performance gains. Inspired by the robustness of Vision Transformers (ViTs) in handling image nuisances, we propose a ViT-based model for semantic communications. Our approach achieves a peak signal-To-noise ratio (PSNR) gain of +0.5 dB over convolutional neural network variants. We introduce novel measures, average cosine similarity and Fourier analysis, to analyze the inner workings of semantic communications and optimize the system's performance. We also validate our approach through a real wireless channel prototype using software-defined radio (SDR). To the best of our knowledge, this is the first investigation of the fundamental workings of a semantic communications system, accompanied by the pioneering hardware implementation. To facilitate reproducibility and encourage further research, we provide open-source code, including neural network implementations and LabVIEW codes for SDR-based wireless transmission systems (Source codes available at https://bit.ly/SemViT).
AB - Semantic communications have shown promising advancements by optimizing source and channel coding jointly. However, the dynamics of these systems remain understudied, limiting research and performance gains. Inspired by the robustness of Vision Transformers (ViTs) in handling image nuisances, we propose a ViT-based model for semantic communications. Our approach achieves a peak signal-To-noise ratio (PSNR) gain of +0.5 dB over convolutional neural network variants. We introduce novel measures, average cosine similarity and Fourier analysis, to analyze the inner workings of semantic communications and optimize the system's performance. We also validate our approach through a real wireless channel prototype using software-defined radio (SDR). To the best of our knowledge, this is the first investigation of the fundamental workings of a semantic communications system, accompanied by the pioneering hardware implementation. To facilitate reproducibility and encourage further research, we provide open-source code, including neural network implementations and LabVIEW codes for SDR-based wireless transmission systems (Source codes available at https://bit.ly/SemViT).
KW - 6G
KW - deep neural network
KW - real-Time wireless communications
KW - semantic communications
KW - wireless image transmission
UR - http://www.scopus.com/inward/record.url?scp=85164414598&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85164414598&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2023.3291405
DO - 10.1109/ACCESS.2023.3291405
M3 - Article
AN - SCOPUS:85164414598
SN - 2169-3536
VL - 11
SP - 71528
EP - 71541
JO - IEEE Access
JF - IEEE Access
ER -