智能超表面賦能語義通信系統(tǒng)研究綜述
doi: 10.11999/JEIT240984 cstr: 32379.14.JEIT240984
-
1.
鄭州大學電氣與信息工程學院 鄭州 450001
-
2.
北京郵電大學人工智能學院 北京 100876
-
3.
寧波諾丁漢大學電氣與電子工程系 寧波 315100
-
4.
浙江大學電子與信息工程學院 杭州 310058
-
5.
科學技術部新技術中心 北京 100036
-
6.
蘭卡斯特大學計算與通信學院 英格蘭 LA14YW
Research Overview of Reconfigurable Intelligent Surface Enabled Semantic Communication Systems
-
1.
School of Information Engineering, Zhengzhou University, Zhengzhou 450001, China
-
2.
School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China
-
3.
Department of Electrical and Electronics Engineering, University of Nottingham Ningbo China, Ningbo 315100, China
-
4.
School of Electronics and Information Engineering, Zhejiang University, Hangzhou 310058, China
-
5.
Innovative Technology Center, Most, Beijing 100036, China
-
6.
School of Computing and Communications, Lancaster University, Lancashire LA1 4YW, England
-
摘要: 智能超表面(RIS)以其卓越的成本效益、低能耗及獨特的可編程性,在調控無線環(huán)境方面展現(xiàn)出顯著優(yōu)勢,已成為6G通信技術的關鍵組成部分。語義通信(SemCom)突破香農極限,同時確保關鍵信息的精確傳遞,同樣被視為6G的核心技術之一。該文首先回顧了語義通信的發(fā)展歷程,闡述其從理論走向實踐的過程,并分析了RIS在提高通信性能方面的突出優(yōu)勢。接著,提出RIS賦能語義通信系統(tǒng)模型,展示了RIS在提升通信質量方面的顯著效果。最后,對RIS賦能語義通信系統(tǒng)的未來發(fā)展趨勢進行展望,其將邁向智能化、個性化信息傳遞新階段,為6G通信技術發(fā)展奠定堅實基礎,并有望成為6G核心關鍵技術,引導通信革命。Abstract:
Objective The proliferation of sixth-Generation (6G) wireless network technologies has led to an exponential demand for intelligent devices, such as autonomous transportation, environmental monitoring, and consumer robotics. These applications will generate vast amounts of data, reaching zetta-bytes in scale. Furthermore, they require support for massive connectivity over limited spectrum resources, and low latency, presenting critical challenges to traditional source-channel coding methods. Therefore, the 6G architecture is shifting from a traditional framework focused on high transmission rates to a novel paradigm centered on the intelligent interconnection of all things. Semantic Communication (SemCom) is considered an extension of the Shannon communication paradigm, aiming to extract the meaning from data and filter out unnecessary, irrelevant, or unessential information. As a core paradigm in 6G, SemCom enhances transmission accuracy and spectral efficiency, optimizing service quality. Despite its significant potential, challenges remain in implementing SemCom systems. Reconfigurable Intelligent Surfaces (RIS) are seen as key enablers for 6G networks. RIS can be dynamically deployed in wireless environments to manipulate electromagnetic wave characteristics (such as frequency, phase, and polarization) via programmable reflection and refraction, reshaping wireless channels to amplify signal strength, extend coverage, and optimize performance. Integrating RIS into SemCom systems helps address limitations like coverage voids while enhancing the precision and efficiency of semantic information delivery. This paper proposes an RIS-enabled SemCom framework, with numerical simulations validating its effectiveness in improving system accuracy and robustness. Methods This paper integrates RIS into the SemCom system. The transmitted signal reaches the receiver through both the direct link and the RIS-reflected link, mitigating communication interruptions caused by obstructions. Additionally, the Bilingual Evaluation Understudy (BLEU) metric is used to evaluate performance. Simulations compare RIS-enhanced channels with conventional channels (e.g., AWGN and Rayleigh), demonstrating the performance gain of RIS in SemCom systems. Results and Discussions A positive correlation is observed between Signal-To-Noise Ratio (SNR) increases and improvements in the BLEU score, where higher BLEU scores indicate better text reconstruction fidelity to the source content, reflecting enhanced semantic accuracy and communication quality ( Fig. 4 ). Under RIS-enhanced channel conditions, SemCom systems not only show higher BLEU scores but also exhibit greater stability, with reduced sensitivity to SNR fluctuations. This validates the advantages of RIS channels in semantic information recovery. The performance gap between RIS and conventional channels widens significantly under low SNR conditions, suggesting that RIS-enabled systems maintain robust communication quality and semantic fidelity even with signal degradation, highlighting their stronger practical competitiveness. Additionally, the comparative analysis shows performance differences across N-gram models (Figs. 4(a) and(b) ). Practical implementations, therefore, require model selection based on computational constraints and task requirements, with potential for exploring higher-order N-gram architectures.Conclusions This paper systematically examines the evolution of SemCom and the theoretical foundations of RIS. SemCom, aimed at overcoming the bandwidth limitations of traditional systems and enabling natural human-machine interactions, has shown transformative potential across various domains. At the same time, the paper highlights RIS’s advantages in improving wireless system performance and its potential integration with SemCom paradigms. A novel RIS-enabled SemCom architecture is proposed, with experimental validation confirming its effectiveness in enhancing information recovery accuracy. Additionally, the paper outlines future research directions for RIS-enhanced SemCom, urging the research community to address emerging challenges. Prospects Current research on RIS-enabled SemCom is still in its early stages, primarily focusing on resource allocation, performance enhancement, and architectural design. However, it faces fundamental challenges, such as the lack of Shannon-like theoretical foundations and vulnerabilities in knowledge base synchronization and updating. Three critical challenges emerge: (1) Cross-modal semantic fusion architecture, which requires adaptive frameworks to support diverse 6G services beyond single-modality paradigms; (2) Dynamic knowledge base optimization, requiring efficient update mechanisms to balance semantic consistency with computational and communication overhead; (3) Semantic-aware security protocols, which must incorporate hybrid defenses against AI-specific attacks (e.g., adversarial perturbations) and RIS-enabled channel manipulation threats. -
[1] 汪丙炎, 李榮鵬, 趙志峰, 等. 融合知識圖譜的語義通信系統(tǒng)[J]. 移動通信, 2024, 48(2): 11–15,62. doi: 10.3969/j.issn.1006-1010.20231223-0001.WANG Bingyan, LI Rongpeng, ZHAO Zhifeng, et al. Semantic communication system incorporating knowledge graphs[J]. Mobile Communications, 2024, 48(2): 11–15,62. doi: 10.3969/j.issn.1006-1010.20231223-0001. [2] YANG Wanting, DU Hongyang, LIEW Z Q, et al. Semantic communications for future internet: Fundamentals, applications, and challenges[J]. IEEE Communications Surveys & Tutorials, 2023, 25(1): 213–250. doi: 10.1109/COMST.2022.3223224. [3] 朱政宇, 王梓晅, 徐金雷, 等. 智能反射面輔助的未來無線通信: 現(xiàn)狀與展望[J]. 航空學報, 2022, 43(2): 025014. doi: 10.7527/S1000-6893.2021.25014.ZHU Zhengyu, WANG Zixuan, XU Jinlei, et al. Future wireless communication assisted by intelligent reflecting surface: State of art and prospects[J]. Acta Aeronautica et Astronautia Sinica, 2022, 43(2): 025014. doi: 10.7527/S1000-6893.2021.25014. [4] SHANNON C E. A mathematical theory of communication[J]. The Bell System Technical Journal, 1948, 27(3): 379–423. doi: 10.1002/j.1538-7305.1948.tb01338.x. [5] WENG Zhenzi and QIN Zhijin. Semantic communication systems for speech transmission[J]. IEEE Journal on Selected Areas in Communications, 2021, 39(8): 2434–2444. doi: 10.1109/JSAC.2021.3087240. [6] 牛凱, 張平. 語義通信的數(shù)學理論[J]. 通信學報, 2024, 45(6): 7–59. doi: 10.11959/j.issn.1000-436x.2024111.NIU Kai and ZHANG Ping. A mathematical theory of semantic communication[J]. Journal on Communications, 2024, 45(6): 7–59. doi: 10.11959/j.issn.1000-436x.2024111. [7] 秦志金, 趙菼菼, 李凡, 等. 多模態(tài)語義通信研究綜述[J]. 通信學報, 2023, 44(5): 28–41. doi: 10.11959/j.issn.1000?436x.2023105.QIN Zhijin, ZHAO Tantan, LI Fan, et al. Survey of research on multimodal semantic communication[J]. Journal on Communications, 2023, 44(5): 28–41. doi: 10.11959/j.issn.1000?436x.2023105. [8] NIU Kai, DAI Jincheng, YAO Shengshi, et al. A paradigm shift toward semantic communications[J]. IEEE Communications Magazine, 2022, 60(11): 113–119. doi: 10.1109/MCOM.001.2200099. [9] SHI Guangming, XIAO Yong, LI Yingyu, et al. From semantic communication to semantic-aware networking: Model, architecture, and open problems[J]. IEEE Communications Magazine, 2021, 59(8): 44–50. doi: 10.1109/MCOM.001.2001239. [10] FARSAD N, RAO M, and GOLDSMITH A. Deep learning for joint source-channel coding of text[C]. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, 2018: 2326–2330. doi: 10.1109/ICASSP.2018.8461983. [11] XIE Huiqiang, QIN Zhijin, TAO Xiaoming, et al. Task-oriented multi-user semantic communications[J]. IEEE Journal on Selected Areas in Communications, 2022, 40(9): 2584–2597. doi: 10.1109/JSAC.2022.3191326. [12] XU Jialong, AI Bo, CHEN Wei, et al. Wireless image transmission using deep source channel coding with attention modules[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32(4): 2315–2328. doi: 10.1109/TCSVT.2021.3082521. [13] HU Qiyu, ZHANG Guangyi, QIN Zhijin, et al. Robust semantic communications against semantic noise[C]. 2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall), London, United Kingdom, 2022: 1–6. doi: 10.1109/VTC2022-Fall57202.2022.10012843. [14] TONG Haonan, YANG Zhaohui, WANG Sihua, et al. Federated learning based audio semantic communication over wireless networks[C]. 2021 IEEE Global Communications Conference (GLOBECOM), Madrid, Spain, 2021: 1–6. doi: 10.1109/GLOBECOM46510.2021.9685654. [15] TUNG T and GüNDüZ D. DeepWiVe: Deep-learning-aided wireless video transmission[J]. IEEE Journal on Selected Areas in Communications, 2022, 40(9): 2570–2583. doi: 10.1109/JSAC.2022.3191354. [16] XIE Huiqiang, QIN Zhijin, and LI G Y. Task-oriented multi-user semantic communications for VQA[J]. IEEE Wireless Communications Letters, 2022, 11(3): 553–557. doi: 10.1109/LWC.2021.3136045. [17] HUDSON D A and MANNING C D. Compositional attention networks for machine reasoning[C]. The 6th International Conference on Learning Representations (ICLR), Vancouver, Canada, 2018. [18] ZHANG Guangyi, HU Qiyu, QIN Zhijin, et al. A unified multi-task semantic communication system with domain adaptation[C]. GLOBECOM 2022 - 2022 IEEE Global Communications Conference, Rio de Janeiro, Brazil, 2022: 3971–3976. doi: 10.1109/GLOBECOM48099.2022.10000850. [19] HU Xuejie, TIAN Yue, LI Qinying, et al. A novel RIS-aided optimization strategy for semantic communication system[J]. IEEE Wireless Communications Letters, 2024, 13(6): 1655–1659. doi: 10.1109/LWC.2024.3385492. [20] JIANG Peiwen, WEN Chaokai, JIN Shi, et al. RIS-enhanced semantic communications adaptive to user requirements[J]. IEEE Transactions on Communications, 2024, 72(7): 4134–4148. doi: 10.1109/TCOMM.2024.3369697. [21] ZHAO Zhuoxiang, YANG Zhaohui, HUANG Chongwen, et al. A joint communication and computation design for distributed RIS-assisted probabilistic semantic communication in IIoT[J]. IEEE Internet of Things Journal, 2024, 11(16): 26568–26579. doi: 10.1109/JIOT.2024.3409271. [22] DU Hongyang, WANG Jiacheng, NIYATO D, et al. Lightweight wireless sensing through RIS and inverse semantic communications[C]. 2023 IEEE Wireless Communications and Networking Conference (WCNC), Glasgow, United Kingdom, 2023: 1–6. doi: 10.1109/WCNC55385.2023.10119005. [23] DEY S, VINAYAKARAO V, GUPTA M, et al. Evaluating commit message generation: To BLEU or not to BLEU?[C]. 2022 IEEE/ACM 44th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER), Pittsburgh, USA, 2022: 31–35. doi: 10.1145/3510455.3512790. [24] KOEHN P. Europarl: A parallel corpus for statistical machine translation[C]. Machine Translation Summit X: Papers, Phuket, Thailand, 2005: 79–86. -