Open access peer-reviewed chapter

Deep Learning for MIMO Communications

Written By

Yunlong Cai, Qiyu Hu, Guangyi Zhang and Kai Kang

Reviewed: 01 June 2023 Published: 24 June 2023

DOI: 10.5772/intechopen.112038

From the Edited Volume

MIMO Communications - Fundamental Theory, Propagation Channels, and Antenna Systems

Edited by Ahmed A. Kishk and Xiaoming Chen

Chapter metrics overview

121 Chapter Downloads

View Full Metrics

Abstract

Recently, deep learning (DL) is becoming a key feature of next-generation multiple-input multiple-output (MIMO) transceiver design with learning and inference capabilities embedded in the network, which achieves greatly enhanced system performance. Popular topics include end-to-end (E2E) learning for transceiver design, deep reinforcement learning (DRL) for communications, and model-driven deep unfolding techniques. In particular, E2E learning treats the communication system design as an E2E data reconstruction task that seeks to jointly optimize transceiver components, so that encoding and decoding are fostered by the learned weights of deep neural networks (DNN). E2E learning can be employed to solve various problems in MIMO communications, such as channel state information (CSI) feedback, beamforming, signal detection, and channel estimation. Moreover, DRL has been widely applied to solve high-dimensional non-convex optimization problems in designing the transceiver. However, these DNNs generally suffer from an inability to be interpreted or generalized, and they often lack performance guarantees. To overcome such drawbacks, substantial researches have proposed to unfold the iterations of an iterative optimization algorithm into a layer-wise structure analogous to a DNN. Inspired by the great potential of these DL methods, it is important to investigate AI-empowered transceivers for future MIMO systems.

Keywords

  • deep learning
  • deep unfolding
  • beamforming
  • transceiver design
  • channel estimation

1. Introduction

In physical layer communications, the transceiver design is a core technology in multiple-input multiple-output (MIMO) systems, as shown in Figure 1. Iterative optimization algorithms for the transceiver design have achieved satisfactory system performance, but they generally require a large number of iterations and have the high-complexity computation, which makes it difficult to be deployed in practical systems. Recently, the deep learning (DL) method, as a primary technique in artificial intelligence, has received great attention in wireless communications, especially in physical layer communications. DL methods employ the deep neural networks (DNNs) and treat the algorithm as a “black-box”. Compared to conventional optimization algorithms, DL methods can approximate high-complexity operations with lower computational complexity. These DNNs are usually data-driven models, which rely on a large number of data for training. However, it is difficult to obtain training samples in practical communication systems and these data-driven DNNs suffer from poor interpretability and generalization ability. In contrast, model-driven methods exploit known physical mechanisms and domain knowledge. Thus, they require less training samples and it makes the DNNs explainable. Some studies unfolded the iterative optimization algorithms into layer-wise networks with introduced trainable parameters, which reduce the iteration numbers and improve the system performance. This chapter discusses the application of DL-based approaches in physical layer communications, which includes parts of channel estimation and feedback, beamforming, detection, channel decoding and end-to-end learning. Each part will be introduced with data-driven and model-driven approaches.

Figure 1.

The architecture of transceiver in physical layer communications.

1.1 Channel estimation and feedback

In massive MIMO systems, the base station (BS) relies on accurate channel state information (CSI) to achieve potential gains from multiple antennas. However, the large number of antennas brings challenges and huge overhead for channel estimation and feedback, where many DL-based methods have been proposed to exploit the features of CSI and reduce the overhead [1, 2, 3, 4, 5, 6, 7]. In [2], the authors exploited the channel sparsity in the angle domain and proposed a DNN for channel estimation and direction-of-arrival (DoA) estimation. The proposed DL method can learn the spatial structures of channels and achieve better performance than conventional methods. As for channel feedback, the CsiNet has been developed in ref. [3] for channel compression, feedback, and reconstruction. It employs the structure of an autoencoder, where an encoder and a decoder are designed for channel compression and construction, respectively. Compared to the traditional compressive sensing (CS) algorithm, the CsiNet improves the CSI recovery quality and compression ratio.

Model-driven DL approaches have also been applied for channel estimation and feedback [5, 6, 7]. In ref. [5], a learned denoising-based approximate message passing (LDAMP) network has been proposed for beamspace millimeter-wave (mmWave) MIMO channel estimation, where the convolutional denoising NN is merged into the AMP channel estimation algorithm. In addition, the authors of ref. [6] proposed a dynamic deep-unfolding neural networks (NNs) with adaptive depth for channel estimation, where the layers of NN vary from different inputs. As shown in Figure 2(a), denotes each layer of the NN and a function ϕ is defined to control the depth of the NN. When the output of the function τ>ε, the NN stops to output results. To estimate CSI, the sparse Bayesian learning (SBL) algorithm is unfolded into a layered network with introduced trainable parameters in the framework. In particular, some priori parameters which are difficult to determine in the SBL algorithm are set as trainable parameters. The other trainable parameters are introduced to approximate the operations with high computational complexity. Besides, to avoid gradient explosion, the trainable parameters are updated by deep deterministic policy gradient (DDPG), rather than updated by the stochastic gradient descent (SGD) algorithm directly. As shown in Figure 2(b), the state, action, and state transition of DDPG correspond to the optimization variables, trainable parameters, and architecture of NN, respectively. As for channel feedback, the authors of ref. [7] proposed a model-driven multiple-measurement-vectors learned approximate message passing (MMV-LAMP) network for channel estimation and feedback in frequency division duplex (FDD) systems, which reduces the pilot feedback overhead.

Figure 2.

DDPG-driven deep-unfolding framework with adaptive depth.

1.2 Beamforming

In massive MIMO systems, beamforming has been a key technique to improve the spectrum efficiency and achieve spatial multiplexing gains. Traditional beamforming algorithms require a large number of iterations and high-complexity computations, which impedes their application in practical systems, especially when the number of antennas is large. Thus, many DL-based approaches for beamforming design have been proposed [8, 9, 10, 11, 12, 13, 14]. In ref. [8], the authors proposed a DNN-enabled massive MIMO framework for effective hybrid beamforming. Compared to conventional schemes, the proposed framework achieves better performance with lower computational complexity. Besides, a DL-based joint channel feedback and beamforming approach has been designed in ref. [9]. In addition, deep reinforcement learning (DRL) based beamforming algorithms have also been developed in ref. [10, 11]. The authors of ref. [10] proposed a DRL hybrid beamforming scheme to improve the coverage range of THz communications in the reconfigurable intelligent surfaces (RIS) assisted system.

Apart from the aforementioned data-driven NNs, researchers developed the model-driven methods where iterative beamforming algorithms are unfolded into networks [12, 13, 14]. In ref. [12], the authors proposed an iterative algorithm-induced deep-unfolding neural network (IAIDNN) for digital beamforming shown in Figure 3, where the weighted minimum mean-square error (WMMSE) iterative algorithm is unfolded into a layer-wise network. ,G, and J denote the layers of the network for updating different variables. In particular, inspired by the first-order Taylor expansion, the matrix inversion A1 is approximated by AX+AY+Z, where X, Y, and Z are introduced trainable parameters. A represents the proposed non-linear operation where the diagonal elements of A are taken the reciprocal and non-diagonal elements are set as 0. The computational complexity of matrix inversion is On3 while that of the proposed approximation is On2.37. In the backpropagation, the authors derived the generalized chain rule (GCR) in matrix form and the trainable parameters are updated based on it. Simulations have shown that the proposed IAIDNN achieves the performance of the WMMSE algorithm with much less iterations. Besides, the authors of ref. [13] developed a deep-unfolding framework for the passive and active beamforming joint design in a RIS-assisted MIMO system, which outperforms the conventional iterative algorithms.

Figure 3.

The architecture of IAIDNN which unfolds the WMMSE algorithm into a layer-wise network.

1.3 MIMO detection

In MIMO systems, the detector plays an important role in the receiver. Traditional iterative detection algorithms are designed based on the assumption that the channel model is subject to a specific distribution, thus the performance is unsatisfactory in variable environments. To tackle the issue, DL-based detectors have been proposed [1, 15, 16, 17, 18, 19, 20]. In refs. [1, 15, 16], the authors proposed a DNN-based joint channel estimation and signal detection algorithm for the receiver design where the detectors are designed to adapt to different wireless channels.

Several model-driven DL-based methods based on conventional iterative detectors have also been investigated in recent years [17, 18, 19, 20]. The authors of ref. [17] unfolded the orthogonal approximate message passing (OAMP) detector into a layer-wise structure named OAMP-Net2. It only introduces a few learnable parameters to improve the stability and speed of convergence, and the parameters are optimized to adapt to different channel environments. Besides, a deep detector named LoRD-Net has been proposed for signal detection [18]. The LoRD-Net incorporates domain knowledge in its architecture design, thus requiring much fewer parameters than data-driven NNs. Furthermore, a joint channel estimation and signal detection model-driven NN has been proposed in ref. [20] to reduce the effect of channel estimation errors on detection.

1.4 Channel decoding

With the development of fifth generation (5G), user data and system capacity have rapidly increased and a higher transmission rate means lower decoding latency demand. However, traditional decoders require high-complexity computation and a large number of iterations. To address the issue, DL-based decoding algorithms have been developed [21, 22, 23, 24, 25, 26]. In ref. [21], researchers explored that it is easier for DL decoders to learn the structured codes than random codes and verified that NNs can learn a form of decoding algorithm, rather than only a classifier. The article [22] focuses on the issue that the successive interference cancelation (SIC) decoding is imperfect in the nonorthogonal multiple access (NOMA) system and proposes a novel DL-based scheme for decoding in MIMO-NOMA systems. A non-linear precoder and SIC decoder have been constructed by deep feedforward neural networks (FNNs) which help received signals decode accurately in the SIC manner.

The prior parameters play an important role in conventional iterative decoders but are usually set by experience. Thus, DL is a proper method to find the optimal value for the prior parameters and thus model-driven based decoding methods are promising techniques [24, 25, 26]. The authors of ref. [24] utilized the DL method to find the proper weights to the passing messages in the Tanner graph and achieved comparable performance with belief propagation (BP) decoders with less iterations. Furthermore, considering many expensive multiplication operations in ref. [24] which make it difficult to implement, Lugosch and Gross [25] proposed a neural offset min-sum decoding algorithm with no multiplications and less parameter computation. The proposed approach speeds up the training process and is friendly for hardware implementation. In addition, a model-driven low-density parity-check (LDPC) decoding network has been developed in ref. [26]. The iterative decoding progress between checking nodes and variable nodes is unfolded into a propagation network, which combines the advantages of deep learning and conventional normalized min-sum LDPC decoding methods.

1.5 End-to-end learning

The aforementioned DL-based approaches are optimized locally for individual modules, where global optimality cannot be guaranteed. The modules in the transceiver are usually highly correlated with each other, and thus a joint design can achieve better performance than a separate design. To fulfill the global optimization, several DL-based end-to-end communication systems have been proposed [27, 28, 29, 30, 31, 32], where all the trainable parameters are updated based on an end-to-end loss. A DNN-based based end-to-end wireless communication system has been proposed in ref. [27], which includes channel encoding, decoding, modulation, and equalization. A conditional generative adversarial net (GAN) has been designed to model the channel distribution and the proposed end-to-end approach is effective on Rayleigh fading channels. In ref. [28], the authors proposed a DNN-based end-to-end joint transceiver design algorithm for FDD mmWave MIMO systems, which consists of the modules of pilot training, channel feedback and reconstruction, and hybrid beamforming. To avoid CSI mismatch caused by the transmission delay and feedback overhead, a two-timescale scheme has been considered. Specifically, a superframe is introduced as the long-timescale, where the CSI statistics remain constant. Each superframe consists of several frames, each of which contains a number of time slots that defined the short timescale. During each time slot, the instantaneous CSI remains unchanged. Correspondingly, a two-timescale DNN is developed as shown in Figure 4, which consists of a long-term DNN and a short-term DNN. The long-term DNN consists of modules of pilot training, high-dimensional CSI estimation and feedback, and hybrid beamforming, while the short-term DNN composes of modules of pilot training, low-dimensional equivalent CSI estimation and feedback, and digital beamforming. At the end of each frame, the long-term DNN is employed to obtain the high-dimensional full CSI to update the long-term analog beamformers. The short-term digital beamformers are updated based on the low-dimensional equivalent CSI acquired by the short-term DNN. The trainable parameters of all the modules are optimized to minimize the bit-error-rate (BER), which is the system’s global optimization objective.

Figure 4.

The architecture of the proposed DNN-based end-to-end framework.

Inspired by ref. [28], the authors of ref. [31] designed a model-driven based end-to-end framework for joint transceiver design in time division duplexing (TDD) systems, which consists of a channel estimation deep-unfolding NN (CEDUN) and a hybrid beamforming deep-unfolding NN (HBDUN). As shown in Figure 5, the CEDUN is comprised of a pilot training NN and a recursive least squares (RLS) algorithm-induced deep-unfolding NN, where a set of trainable parameters are introduced to increase the degrees of freedom. For hybrid beamforming, the stochastic successive convex approximation (SSCA) algorithm is unfolded into a layer-wise structure in the HBDUN, which consists of an analog NN and a digital NN. Specifically, the phase of analog beamformers is set as trainable parameters in the analog NN. In the digital NN, two non-linear operations are introduced to approximate the matrix inversion. In addition, the authors consider the mixed-timescale scheme, where long-term analog beamformers are optimized based on the CSI statistics during a superframe, and short-term digital beamformers are updated in each time slot based on equivalent CSI. According to the mixed-timescale scheme, a novel two-stage training method is investigated to jointly train the framework. Figure 5(a) shows the first stage of training, where the trainable parameters of HBDUN are optimized with the loss function of the negative system sum rate. The second training stage is shown in Figure 5(b), where the parameters of analog NN are fixed, and low-dimensional equivalent CSI is obtained. The parameters of CEDUN and digital NN are optimized in this stage with the same loss function in the first stage. Simulation results have shown that the deep-unfolding NNs perform comparable with the traditional algorithms with reduced complexity and the joint design method achieves better performance than the separate design.

Figure 5.

The architecture of the end-to-end deep-unfolding framework.

In addition, the authors of ref. [32] proposed an end-to-end DRL and deep-unfolding framework for joint beam selection and digital beamforming design, the architecture of which is shown in Figure 6. Specifically, the framework consists of a DRL-based NN for beam selection and a model-driven based NN for digital beamforming. A novel training method has been developed to jointly train the DRL-based NN and unfolding NN in an end-to-end way. This work indicates that the model-driven NNs can be trained with other DL methods such as DRL-based NN.

Figure 6.

The architecture of the end-to-end DRL and deep-unfolding framework.

Advertisement

2. Deep learning for MIMO-based semantic communications

Future 6G wireless networks are expected to bridge the physical and cyber worlds, enabling human interactions with multiple intelligent devices through various data modalities like images and text [33]. This introduces a number of applications from autonomous driving to the Internet of Everything, which involves intelligent human-to-machine and machine-to-machine connections. These new fascinating applications have imposed challenging requirements on communication networks, including ultra-high reliability, ultra-low latency, and extremely high data rates [34]. However, supporting and enabling such applications will require coping with explosive growth in bandwidth and complexity, due to the transmission of these massive datasets and large models. Driven by the aforementioned requirements, there is a springing up of semantic communication research in both academia and industry. The growing trend of semantic communications aims at accurately recovering the statistical structure of the underlying information of the source signal and designing the communication transceiver in an end-to-end fashion, similar to joint source and channel coding (JSCC) by taking the source semantics into account. A data-aware communication transceiver with intelligence that is able to understand the relevance and meaning of data traffic is of paramount importance as it would significantly improve the transmission efficiency.

2.1 Formulation of semantic communications

In general, a semantic communication MIMO transceiver can be modeled as the framework shown in Figure 7, where an end-to-end communication system is developed to incorporate coding and modulation [35]. In particular, the encoding, decoding, and transmission procedures are parameterized by the DNNs, and the system is optimized in a back-propagation manner with the data-driven method. In particular, as shown in Figure 7, the transmitter maps the source, s, into a symbol stream, x, and then passes it through the physical channel with transmission impairments. The received symbol stream, y, is decoded at the receiver to have an estimation of the source, ŝ. Both the transmitter and the receiver are represented by DNNs. In particular, the DNNs at the transmitter consist of the semantic encoder and channel encoder, while the DNNs at the receiver consist of the semantic decoder and channel decoder. The semantic encoder learns to transform the transmitted data into an encoded feature vector while the semantic decoder learns to recover the transmitted data from the received signals. Moreover, the channel encoder and channel decoder aim at eliminating the signal distortion caused by the wireless channel.

Figure 7.

The framework of a typical semantic communication system.

We consider a MIMO system with Nt transmit antennas and Nr receive antennas. The encoded symbol stream can be represented by

x=f2f1sθ1θ2,E1

where xCNt×1,θ1 and θ2 denote the trainable parameters of the semantic encoder, f1, and the channel encoder, f2, respectively. Subsequently, the signal received at the receiver, yCNr×1, is given by

y=Hx+n,E2

where HCNr×Nt denotes the channel matrix and nCN0σ2I is the additive white Gaussian noise (AWGN). Correspondingly, the decoded signal is given as

ŝ=g1g2yΘ2Θ1,E3

where Θ1 and Θ2 denote the trainable parameters of the semantic decoder, g1, and the channel decoder, g2, respectively. For clarity, we denote θ as the set of trainable parameters and fθ as the DNNs in the considered semantic communication systems. Thus, we have ŝ=fθs.

2.2 Semantic importance-gudied design for MIMO transceivers

Regarding the physical layer transceiver, the modules of the transceiver are often optimized independently. In particular, the modulation, beamforming, and signal detection modules are designed to minimize the bit-error-rate (BER), and the channel feedback and estimation aim to optimize the mean-square-error (MSE). In the case of semantic communications, the MIMO transceiver can be designed by revising the modules in the traditional transceiver. Next, we will discuss some advancements in the MIMO transceiver design in semantic communications.

  1. Measure of Semantic Importance: In semantic communications, the source data is mapped into different semantic features by the DNN models. Different semantic features are of different importance for completing target tasks, where semantic importance is defined as the correlation between the semantic features and the target task [36]. The method to measure the importance of semantic features can be variable in different semantic communication systems. Specifically, there are gradient-based approaches that draw on ideas related to the interpretability of DNNs [37], and the entropy-modeling approach that focuses on the regional complexity in terms of the source data content [38], etc. In semantic resource allocation, the model distinguishes the importance levels of different features and adaptively allocates the resource according to channel conditions and users’ requirements. In particular, the semantic importance has shown great potential for the designs in DL-based multi-user MIMO systems, e.g., the design of precoding, the allocation of subcarriers, and various adaptive schemes.

  2. MIMO Precoding: One of the main issues in multi-user MIMO systems is the mutual interference between the signals of different users. Limited by the number of receiving antennas, it is difficult for each user to eliminate the interference from other users alone. It is worth noting that the precoding algorithms such as singular value decomposition (SVD)-based precoding, convert the MIMO channel into a set of parallel subchannels with different SNRs. Intuitively, the performance of the MIMO systems can be significantly enhanced by allocating subchannels with high SNRs to features with high importance levels, as these features would play a more important role in the target task. For instance, the semantic MIMO system designed in ref. [39] significantly outperforms traditional MIMO systems by jointly considering the CSI and entropy distribution of the semantic features, where the entropy can be regarded as a measure of semantic importance.

  3. Allocation of Subcarriers in OFDM Systems: Orthogonal frequency-division multiplexing (OFDM) technique has been widely employed in MIMO systems to realize high-speed data transmission. In OFDM, each subcarrier is equivalent to a subchannel with a certain CSI. The CSI of each subcarrier is instrumental in power allocation to boost the communication rate. However, in the case of semantic communications, the CSI can also be exploited to determine the allocation of subcarriers to different semantic features according to their importance. A typical example is the dual-attention mechanism proposed in ref. [40], which employs both channel-wise attention and spatial attention, and jointly learns to transmit important features with better subcarriers, which achieves state-of-the-art performance among existing JSCC schemes.

  4. Adaptive Design Based on CSI: Semantic importance can also be exploited to design adaptive schemes for MIMO systems based on CSI. These adaptive designs consider both CSI and semantic importance in an adaptable manner, which significantly improves the system performance and reduces the training overhead. For image transmission over MIMO channels, the authors in ref. [41] have employed the channel attention module proposed in ref. [42], to distinguish the importance of different features and adjust their weights according to different CSI scenarios. Based on feature importance and the complexity of image, the authors in ref. [43] have proposed an adaptive CSI feedback scheme for precoding, which improves the effectiveness by adjusting the feedback overhead.

References

  1. 1. Ye H, Li GY, Juang B-H. Power of deep learning for channel estimation and signal detection in OFDM systems. IEEE Wireless Communications Letters. 2018;7(1):114-117
  2. 2. Huang H, Yang J, Huang H, Song Y, Gui G. Deep learning for super-resolution channel estimation and DOA estimation based massive MIMO system. IEEE Transactions on Vehicular Technology. 2018;67(9):8549-8560
  3. 3. Wen C-K, Shih W-T, Jin S. Deep learning for massive MIMO CSI feedback. IEEE Wireless Communications Letters. 2018;7(5):748-751
  4. 4. Wang T, Wen C-K, Jin S, Li GY. Deep learning-based CSI feedback approach for time-varying massive MIMO channels. IEEE Wireless Communications Letters. 2019;8(2):416-419
  5. 5. He H, Wen C-K, Jin S, Li GY. Deep learning-based channel estimation for beamspace mmwave massive MIMO systems. IEEE Wireless Communications Letters. 2018;7(5):852-855
  6. 6. Hu Q, Shi S, Cai Y, Yu G. DDPG-driven deep-unfolding with adaptive depth for channel estimation with sparse bayesian learning. IEEE Transactions on Signal Processing. 2022;70:4665-4680
  7. 7. Ma X, Gao Z, Gao F, Di Renzo M. Model-driven deep learning based channel estimation and feedback for millimeter-wave massive hybrid MIMO systems. IEEE Journal on Selected Areas in Communications. 2021;39(8):2388-2406
  8. 8. Huang H, Song Y, Yang J, Gui G, Adachi F. Deep-learning-based millimeter-wave massive MIMO for hybrid precoding. IEEE Transactions on Vehicular Technology. 2019;68(3):3027-3032
  9. 9. Sohrabi F, Attiah KM, Yu W. Deep learning for distributed channel feedback and multiuser precoding in FDD massive MIMO. IEEE Transactions on Wireless Communications. 2021;20(7):4044-4057
  10. 10. Huang C, Yang Z, Alexandropoulos GC, Xiong K, Wei L, Yuen C, et al. Multi-hop RIS-empowered terahertz communications: A DRL-based hybrid beamforming design. IEEE Journal on Selected Areas in Communications. 2021;39(6):1663-1677
  11. 11. Wang Q, Feng K, Li X, Jin S. PrecoderNet: Hybrid beamforming for millimeter wave systems with deep reinforcement learning. IEEE Wireless Communications Letters. 2020;9(10):1677-1681
  12. 12. Hu Q, Cai Y, Shi Q, Xu K, Yu G, Ding Z. Iterative algorithm induced deep-unfolding neural networks: Precoding design for multiuser MIMO systems. IEEE Transactions on Wireless Communications. 2021;20(2):1394-1410
  13. 13. Liu Y, Hu Q, Cai Y, Yu G, Li GY. Deep-unfolding beamforming for intelligent reflecting surface assisted full-duplex systems. IEEE Transactions on Wireless Communications. 2022;21(7):4784-4800
  14. 14. Shi S, Cai Y, Hu Q, Champagne B, Hanzo L. Deep-unfolding neural-network aided hybrid beamforming based on symbol-error probability minimization. IEEE Transactions on Vehicular Technology. 2023;72(1):529-545
  15. 15. Hua H, Wang X, Xu Y. Signal detection in uplink pilot-assisted multi-user MIMO systems with deep learning. In: Computing, Communications and IoT Applications. 2019. pp. 369–373
  16. 16. Yi X, Zhong C. Deep learning for joint channel estimation and signal detection in OFDM systems. IEEE Communications Letters. 2020;24(12):2780-2784
  17. 17. He H, Wen C-K, Jin S, Li GY. Model-driven deep learning for MIMO detection. IEEE Transactions on Signal Processing. 2020;68:1702-1715
  18. 18. Khobahi S, Shlezinger N, Soltanalian M, Eldar YC. LoRD-net: Unfolded deep detection network with low-resolution receivers. IEEE Transactions on Signal Processing. 2021;69:5651-5664
  19. 19. Samuel N, Diskin T, Wiesel A. Learning to detect. IEEE Transactions on Signal Processing. 2019;67(10):2554-2564
  20. 20. Zhang Y, Sun J, Xue J, Li GY, Xu Z. Deep expectation-maximization for joint MIMO channel estimation and signal detection. IEEE Transactions on Signal Processing. 2022;70:4483-4497
  21. 21. Gruber T, Cammerer S, Hoydis J, Brink ST. On deep learning-based channel decoding. In: Annual Conference on Information Sciences and System. 2017. pp. 1–6
  22. 22. Kang J-M, Kim I-M, Chun C-J. Deep learning-based MIMO-NOMA with imperfect SIC decoding. IEEE Systems Journal. 2020;14(3):3414-3417
  23. 23. Cao C, Li D, Fair I. Deep learning-based decoding of constrained sequence codes. IEEE Journal on Selected Areas in Communications. 2019;37(11):2532-2543
  24. 24. Nachmani E, Be’ery Y, Burshtein D. Learning to decode linear codes using deep learning. In: Annual Allerton Conference on Communication, Control, and Computing. 2016. pp. 341–346
  25. 25. Lugosch L. Gross WJ. Neural offset min-sum decoding. In: IEEE International Symposium on Information Theory. 2017. pp. 1361–1365
  26. 26. Wang Q, Wang S, Fang H, Chen L, Chen L, Guo Y. A model driven deep learning method for normalized min-sum LDPC decoding. In: IEEE International Conference on Communications Workshops. 2020. pp. 1–6
  27. 27. Ye H, Li GY, Juang B-HF, Sivanesan K. Channel Agnostic End-to-End Learning Based Communication Systems with Conditional GAN. AbuDhabi, UAE: Proc. IEEE Global Commun. Conf. Workshops; 2018. pp. 1-5
  28. 28. Hu Q, Cai Y, Kang K, Yu G, Hoydis J, Eldar YC. Two-timescale end-to-end learning for channel acquisition and hybrid precoding. IEEE Journal on Selected Areas in Communications. 2022;40(1):163-181
  29. 29. Dorner S, Cammerer S, Hoydis J, Brink ST. Deep learning based communication over the air. IEEE Journal of Selected Topics Signal Process. 2018;12(1):132-143
  30. 30. Aoudia FA, Hoydis J. End-to-end learning of communications systems without a channel model. Asilomar Conference on Signals, Systems, and Computers. 2018:298-303
  31. 31. Kang K, Hu Q, Cai Y, Yu G, Hoydis J, Eldar YC. Mixed-timescale deep-unfolding for joint channel estimation and hybrid beamforming. IEEE Journal on Selected Areas in Communications. 2022;40(9):2510-2528
  32. 32. Hu Q, Liu Y, Cai Y, Yu G, Ding Z. Joint deep reinforcement learning and unfolding: Beam selection and precoding for mmwave multiuser MIMO with lens arrays. IEEE Journal on Selected Areas in Communications. 2021;39(8):2289-2304
  33. 33. Zhang G, Hu Q, Qin Z, Cai Y, Yu G. A unifified multi-task semantic communication system with domain adaptation. In: IEEE Global Communications Conference. 2022. pp. 3971–3976
  34. 34. Niu K, Dai J, Yao S, Wang S, Si Z, Qin X, et al. A paradigm shift toward semantic communications. IEEE Communications Magazine. 2022;60(11):113-119
  35. 35. Hu Q, Zhang G, Qin Z, Cai Y, Yu G, Li GY. Robust semantic communications with masked VQ-VAE enabled codebook. IEEE Transactions Wireless Communications, early access. 2023
  36. 36. Liu C, Guo C, Yang Y, Jiang N. Adaptable semantic compression and resource allocation for task-oriented communications. arXiv preprint arXiv:2204.08910. 2022
  37. 37. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-CAM: Visual explanations from deep networks via gradient-based localization. In: Proc. IEEE Int. Conf. Comput. Vis. (ICCV). Venice, Italy: IEEE; 2017
  38. 38. Balle J, Chou PA, Minnen D, Singh S, Johnston N, Agustsson E, et al. Nonlinear transform coding. IEEE Journal of Selected Topics Signal Process. 2021;15(2):339-353
  39. 39. Yao S, Wang S, Dai J, Niu K, Zhang P. Versatile semantic coded transmission over MIMO fading channels. arXiv preprint arXiv:2210.16741. 2022
  40. 40. Wu H, Shao Y, Mikolajczyk K, Gündüz D. Channel-adaptive wireless image transmission with OFDM. IEEE Wireless Communications Letters. 2022;11(11):2400-2404
  41. 41. Bian C, Shao Y, Wu H, Gunduz D. Space-time design for deep joint source channel coding of images over MIMO channels. arXiv preprint arXiv:2210.16985. 2022
  42. 42. Xu J, Ai B, Chen W, Yang A, Sun P, Rodrigues M. Wireless image transmission using deep source channel coding with attention modules. IEEE Transactions on Circuits and Systems for Video Technology. 2022;32(4):2315-2328
  43. 43. Zhang G, Hu Q, Cai Y, Yu G. Adaptive CSI feedback for deep learning-enabled image transmission. arXiv preprint arXiv:2302.13477. 2023

Written By

Yunlong Cai, Qiyu Hu, Guangyi Zhang and Kai Kang

Reviewed: 01 June 2023 Published: 24 June 2023