Accelerating Iteratively Linear Detectors in Multi-User (ELAA-)MIMO Systems with UW-SVD

Jiuyu Liu, Yi Ma, Jinfei Wang, and Rahim Tafazolli

Jiuyu Liu, Yi Ma (corresponding author), Jinfei Wang, and Rahim Tafazolli are with the 5GIC and 6GIC, Institute for Communication Systems (ICS), University of Surrey, Guildford, United Kingdom, GU2 7XH, e-mails: (jiuyu.liu, y.ma, jinfei.wang, r.tafazolli)@surrey.ac.uk. This work was partially supported by the UK Department for Science, Innovation and Technology under the Future Open Networks Research Challenge project TUDOR (Towards Ubiquitous 3D Open Resilient Network). This work has been partially presented in SPAWC’2023, Shanghai [1].

Abstract

Current iterative multiple-input multiple-output (MIMO) detectors suffer from slow convergence when the wireless channel is ill-conditioned. The ill-conditioning is mainly caused by spatial correlation between channel columns corresponding to the same user equipment, known as intra-user interference. In addition, in the emerging MIMO systems using an extremely large aperture array (ELAA), spatial non-stationarity can make the channel even more ill-conditioned. In this paper, user-wise singular value decomposition (UW-SVD) is proposed to accelerate the convergence of iterative MIMO detectors. Its basic principle is to perform SVD on each user’s sub-channel matrix to eliminate intra-user interference. Then, the MIMO signal model is effectively transformed into an equivalent signal (e-signal) model, comprising an e-channel matrix and an e-signal vector. Existing iterative algorithms can be used to recover the e-signal vector, which undergoes post-processing to obtain the signal vector. It is proven that the e-channel matrix is better conditioned than the original MIMO channel for spatially correlated (ELAA-)MIMO channels. This implies that UW-SVD can accelerate current iterative algorithms, which is confirmed by our simulation results. Specifically, it can speed up convergence by up to $10$ times in both uncoded and coded systems.

Index Terms:

Linear MIMO detectors, extremely large aperture array (ELAA), user-wise singular value decomposition (UW-SVD), channel ill-conditioning, fast convergence.

I Introduction

The primary focus of this paper is low-complexity signal detection for multi-user multiple-input multiple-output (MIMO) systems, particularly those deployed with extremely-large aperture arrays (ELAA). ELAA-MIMO systems can increase spectral efficiency by more than tenfold over current massive-MIMO systems [2]. This is because users are typically located in the near-field of the ELAA; and the near-field channels can provide higher spatial resolution compared to the far-field massive-MIMO channels [3, 4, 5]. For instance, under strong line-of-sight (LoS) conditions, ELAA-MIMO can support multiple data streams from a user equipment (UE) equipped with multiple antennas, while massive-MIMO channels can only support a single data stream per UE [6, 7]. In massive-MIMO systems, the wireless channel can become ill-conditioned due to high spatial correlation among channel columns [8]. However, ELAA channels can be even more ill-conditioned due to both channel spatial correlation and non-stationarity [9]. This makes the design of low-complexity MIMO detectors challenging, particularly those iterative algorithms with square-order complexity [10].

The maximum-likelihood (ML) detector, while achieving the optimal detection performance, is computationally impractical due to its exponentially growing complexity [11]. Linear detectors, such as zero-forcing (ZF) and linear minimum mean square error (LMMSE), offer a more computationally efficient alternative, providing near-optimal detection performance [12]. However, they both require a Gram matrix inverse with cubic-order complexity, which limits their application in large-scale MIMO systems [13]. Instead, iterative algorithms achieve ZF/LMMSE detection performance with square-order complexity by bypassing the matrix inverse. Conventional algorithms with simple structures, such as Richardson iteration (RI) [14] and Neumann series [15], can offer fast convergence for well-conditioned channel matrices. Conversely, they may diverge when applied to ill-conditioned channel matrices [13]. This challenge motivates more advanced algorithms that aim to achieve fast convergence for such channel matrices.

I-A Relevant Prior Arts

Current iterative algorithms can be classified into three main categories [16, 17, 18]: 1) gradient methods, 2) belief propagation, and 3) matrix-splitting (MS) based methods.

Gradient methods can achieve global convergence in solving the problem of linear MIMO detection [19]. For instance, the steepest descent (SD) method updates in the same direction as RI, but converges faster than RI because it optimizes the step size in each iteration [20]; the conjugate gradient (CG) method leverages the Hermitian nature of the Gram channel matrix to determine a more efficient update direction, which further accelerates convergence compared to SD [21, 22]. In addition, quasi-Newton (QN) methods, such as symmetric rank 1 (SR1) [23] and Broyden-Fletcher-Goldfarb-Shanno (BFGS) [19], represent an important branch of gradient methods. Due to the iterative approximation of the Gram matrix inverse, QN methods typically exhibit cubic-order complexity. Recently, the application of limited-memory BFGS (L-BFGS) to linear MIMO detection has demonstrated its ability to achieve convergence equivalent to that of BFGS while requiring only square-order complexity [24].

Belief propagation refers to iterative message passing (MP) algorithms. Among these algorithms, approximate MP (AMP) was initially proposed for compressive sensing and has been applied for MIMO detection in recent years [25, 26]. The complexity of AMP is comparable to that of L-BFGS, and both algorithms exhibit similar convergence rate in massive-MIMO systems. However, AMP diverges in ELAA-MIMO systems due to the channel non-stationarity [27]. AMP variants such as orthogonal AMP (OAMP) and vector AMP (VAMP) [28, 29] have been proposed to address this problem, and their detection performance is even slightly better than that of LMMSE [30]. However, they both introduce computational overhead due to the requirement for matrix inverse or singular value decomposition (SVD), resulting in cubic-order complexity.

In MS-based methods, the Gram channel matrix is divided into the sum of several individually invertible sub-matrices, whose inverses are used to accelerate the convergence [17]. Typically, they divide the Gram matrix into its diagonal part and its upper and lower triangular parts. MS-based methods include Jacobi iteration (JI) [31], the Gauss-Seidel (GS) method [32], and symmetric successive over-relaxation (SSOR) [33]. Specifically, the inverses of the diagonal and lower triangular matrices are used for the JI and GS methods, respectively. Furthermore, SSOR converges faster than the JI and GS methods because it uses the inverses of both the upper and lower triangular matrices. The triangular matrix inverse has square-order complexity, making it scalable for large-scale MIMO systems [17].

I-B Motivation of This Paper

Based on recent theoretical advancements [34, 35, 36, 37, 38] and empirical field measurements [39, 40, 41, 42], it has been observed that intra-user interference is much stronger than inter-user interference in ELAA-MIMO systems. This phenomenon also holds true for conventional massive-MIMO systems when spatial correlations are taken into account (see [8] and our discussion in Section IV-B for more details). However, current iterative algorithms typically use a generalized approach to tackle the problem of channel ill-conditioning, disregarding the distinctive features of multi-user MIMO channels. As a result, when spatial correlation is considered, current algorithms still require tens of iterations to converge in (ELAA-)MIMO systems [1]. This motivates the rest of this paper.

I-C Contributions of This Paper

In this paper, we propose to utilize user-wise SVD (UW-SVD) to accelerate the convergence of current iterative algorithms in multi-user (ELAA-)MIMO systems. The concept of UW-SVD is to perform SVD on the sub-channel matrix corresponding to each UE¹, thereby eliminating the intra-user interference. The MIMO signal model can then be transformed to an equivalent signal (e-signal) model containing an e-channel matrix and a corresponding e-signal vector. The major differences between current iterative algorithms and UW-SVD-assisted iterative algorithms are illustrated in Fig. 1. It can be observed that current algorithms applied to the MIMO signal model converge directly to the estimation of the transmitted signal. In contrast, UW-SVD-based algorithms first converge to an estimation of the e-signal vector and then convert it back to the transmitted signal through a post-processing step. An e-signal model-based ZF detector, termed e-ZF, was developed in our previous work [1]. It is proven that, after post-processing, the e-ZF detector provides an estimate equivalent to that of the ZF detector.

¹ The authors are aware that this option is used in some prior works, e.g., [43, 44, 45], but they all focus on optimizing power allocation strategies using the singular value matrices. In contrast, our study focuses on the left unitary matrices to accelerate the convergence of current iterative algorithms.

Figure 1: Illustration of the major differences between the current iterative algorithms and UW-SVD-assisted iterative algorithms.

In addition to [1], an LMMSE detector for the e-signal model, termed e-LMMSE, is developed in this paper. Also, it is proven to provide equivalent detection performance to the LMMSE detector. Furthermore, it is demonstrated that the e-channel matrix exhibits a lower condition number compared to the original channel matrix, particularly in ELAA-MIMO systems. Considering an ELAA-MIMO system under LoS conditions as an example, when the spatial correlation of small-scale fading is not accounted for, the condition number of the MIMO channel matrix is approximately $60$ , while the condition number of the e-channel matrix is significantly lower at approximately $5$ . Moreover, when the spatial correlation is considered, the condition number of the MIMO channel matrix increases substantially to approximately $700$ . However, even in this scenario, the condition number of the e-channel matrix remains significantly lower at approximately $7$ . A lower condition number indicates that a matrix is less sensitive to perturbations, which means that iterative algorithms can converge to the correct solution more quickly. Therefore, the proposed UW-SVD can significantly accelerate the convergence of current iterative algorithms to achieve ZF/LMMSE performance. This is evident in our computer simulations. For example, the UW-SVD-assisted SSOR converges ten times faster than SSOR in both uncoded and coded ELAA-MIMO systems. Finally, it is worth noting that UW-SVD can also speed up the convergence in conventional massive-MIMO channels when the spatial correlation is taken into account.

I-D Organization and Notations

The rest of this paper is organized as follows. Section II presents the system model, preliminaries, and problem statement. Section III describes the principle of UW-SVD and its application in accelerating the convergence of current iterative algorithms. Section IV presents the convergence analysis. Section V presents the numerical and simulation results. Finally, the conclusion is presented in Section VI.

Notations

Regular letters, lower-case bold letters, and capital bold letters represent scalars, vectors, and matrices, respectively. The notations $[\cdot]^{H}$, $[\cdot]^{-1}$, $\|\cdot\|$, $\mathbb{E}\{\cdot\}$ and $\mathrm{cond}(\cdot)$ represent the Hermitian, inverse, Euclidean norm, expectation, and condition number of a matrix (a vector or a scalar if appropriate), respectively. $\mathbb{D}(\cdot)$ and $\mathbb{L}(\cdot)$ denote a matrix formed by the diagonal and lower-triangular part of a matrix, respectively. $\lambda_{\text{max}}(\cdot)$ and $\lambda_{\text{min}}(\cdot)$ denote the maximum and minimum eigenvalues of a matrix, respectively. $\mathrm{diag}(\cdot)$ constructs the input matrices in a block diagonal form. $\mathbf{I}$ and $\mathbf{0}$ denote identity and zero matrices with compatible dimensions.

II System Model, Preliminaries, and Problem Statement

This section begins by introducing the system model. Next, it presents linear MIMO detectors and low-complexity iterative algorithms. Finally, it discusses the challenges these algorithms face in achieving ZF/LMMSE detection performance when the channel matrix is ill-conditioned.

II-A System Model

Let $M$ and $N$ denote the number of service antennas and user antennas, respectively. For ELAA-MIMO and massive-MIMO systems, their signal models share the same mathematical form and can be expressed as follows

\mathbf{y}=\mathbf{H}\mathbf{x}+\mathbf{z},

(1)

where $\mathbf{y}\in\mathbb{C}^{M\times 1}$ denotes the received signal vector, $\mathbf{x}\in\mathbb{C}^{N\times 1}$ the transmitted signal vector, $\mathbf{z}\sim\mathcal{CN}(0,\sigma_{z}^{2}\mathbf{I})$ the additive white Gaussian noise (AWGN), $\sigma_{z}^{2}$ the noise variance, and $\mathbf{I}$ represents an identity matrix with compatible dimensions. Each element of $\mathbf{x}$ is drawn from a finite alphabet-set with equal probability and fulfills: $\mathbb{E}\{\mathbf{x}\}=\mathbf{0}$ and $\mathbb{E}\{\mathbf{x}\mathbf{x}^{H}\}=\sigma_{x}^{2}\mathbf{I}$ . Note that the random channel matrix $\mathbf{H}\in\mathbb{C}^{M\times N}$ has different distributions in massive-MIMO and ELAA-MIMO systems. In Section V-A, we consider four random distributions of $\mathbf{H}$ for computer simulations.

In ELAA-MIMO and conventional massive-MIMO systems, the performance can be significantly degraded by spatial correlation between user antennas. This spatial correlation leads to two types of interference: 1) intra-user interference, and 2) inter-user interference. Intra-user interference occurs when signals transmitted from different antennas of the same user interfere with each other due to spatial correlation. Conversely, inter-user interference is caused by signals transmitted by other users. Typically, intra-user interference is much stronger than inter-user interference. The reason for this is that the distance between antennas of the same user is usually smaller than the distance between antennas of different users. Section IV-B provides a mathematical justification for this phenomenon. Consequently, the primary objective of this work is to mitigate the intra-user interference for both ELAA-MIMO and massive-MIMO systems.

II-B Preliminaries

The two most classical linear MIMO detectors are ZF and LMMSE, which can be expressed as follows [17]

\widehat{\mathbf{x}}=\mathbf{A}^{-1}\mathbf{b},

(2)

where $\mathbf{b}=\mathbf{H}^{H}\mathbf{y}$ represents the matched filter vector, $\widehat{\mathbf{x}}$ the estimation of $\mathbf{x}$ , and $\mathbf{A}$ is a Gram filter matrix. For ZF and LMMSE detectors, $\mathbf{A}$ can be expressed as follows

\mathbf{A}=\left\{\begin{array}{l}\mathbf{A}_{\textsc{zf}}\triangleq\mathbf{H}^{H}\mathbf{H};\\ \mathbf{A}_{\textsc{lmmse}}\triangleq\mathbf{H}^{H}\mathbf{H}+\rho^{-1}\mathbf{I},\end{array}\right.

(3)

where $\rho=\sigma_{x}^{2}/\sigma_{z}^{2}$ denotes the signal-to-noise ratio (SNR). However, both ZF and LMMSE detectors require the inverse calculation of $\mathbf{A}$ , which is computationally prohibitive for large MIMO sizes.
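The direct detectors in (2) and (3) can be sketched in a few lines of NumPy. This is an illustrative sketch only: the function name `linear_detect`, the i.i.d. Gaussian channel, the QPSK alphabet, and the sizes `M`, `N` are assumptions made for the example, not choices taken from the paper.

```python
import numpy as np

def linear_detect(H, y, rho=None):
    """Direct ZF (rho=None) or LMMSE (rho given) detection following (2)-(3)."""
    N = H.shape[1]
    A = H.conj().T @ H                    # Gram matrix H^H H
    if rho is not None:
        A = A + np.eye(N) / rho           # LMMSE term rho^{-1} I
    b = H.conj().T @ y                    # matched-filter vector b = H^H y
    return np.linalg.solve(A, b)          # solves A x = b (cubic-order cost)

# Hypothetical toy setup: i.i.d. Gaussian channel and QPSK symbols.
rng = np.random.default_rng(0)
M, N = 64, 8
H = (rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N))) / np.sqrt(2)
x = (rng.choice([-1, 1], N) + 1j * rng.choice([-1, 1], N)) / np.sqrt(2)
y_noiseless = H @ x

x_zf = linear_detect(H, y_noiseless)               # recovers x without noise
x_lmmse = linear_detect(H, y_noiseless, rho=100.0) # slightly shrunk estimate
```

The call to `np.linalg.solve` is the cubic-order step that the iterative algorithms discussed next aim to avoid.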

A number of iterative algorithms have been proposed to efficiently solve the problem in (2) bypassing $\mathbf{A}$ inverse [16, 17, 18]. Their general form can be expressed as follows

\mathbf{x}_{t+1}=f(\mathbf{x}_{t};\mathbf{A},\mathbf{b}),

(4)

where $\mathbf{x}_{t}$ represents the $t^{th}$ estimation of $\mathbf{x}$ and $f(\cdot)$ is a linear function that varies depending on the specific algorithm employed. Taking RI as an example, $f_{\textsc{ri}}(\cdot)$ is given by [14]

f_{\textsc{ri}}(\mathbf{x}_{t};\mathbf{A},\mathbf{b})=\mathbf{x}_{t}+(\mathbf{b}-\mathbf{A}\mathbf{x}_{t}),

(5)

which has a simple structure, but it diverges when $\mathbf{A}$ is ill-conditioned [12].
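A minimal sketch of the RI update in (5) is given below. It relies on a standard fact about this iteration: the error evolves as $(\mathbf{I}-\mathbf{A})^{t}$, so for a Hermitian positive-definite $\mathbf{A}$ it converges only when all eigenvalues lie in $(0,2)$. The two diagonal test matrices are hypothetical and chosen only to show both behaviours.

```python
import numpy as np

def richardson(A, b, num_iters):
    """Richardson iteration (5): x_{t+1} = x_t + (b - A x_t), starting from zero."""
    x = np.zeros_like(b)
    for _ in range(num_iters):
        x = x + (b - A @ x)
    return x

b = np.array([1.0, 1.0])
A_good = np.diag([0.5, 1.5])   # eigenvalues inside (0, 2): error shrinks by 0.5 per step
A_bad = np.diag([0.5, 2.5])    # eigenvalue 2.5 > 2: error grows by 1.5 per step

x_good = richardson(A_good, b, 60)
x_bad = richardson(A_bad, b, 60)
```

In this toy case `x_good` matches the exact solution while `x_bad` blows up, which is the divergence behaviour that motivates preconditioned or adaptive updates.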

To address this issue, more advanced algorithms have been proposed to achieve faster convergence. For instance, the iterative process of MS-based methods is given by [1]

f_{\textsc{ms}}(\mathbf{x}_{t};\mathbf{A},\mathbf{b})=\mathbf{x}_{t}+\mathbf{M}^{-1}(\mathbf{b}-\mathbf{A}\mathbf{x}_{t}),

(6)

where $\mathbf{M}$ represents the preconditioning matrix, and it is constructed based on the following matrix splitting

\mathbf{A}=\mathbb{L}(\mathbf{A})+\mathbb{L}(\mathbf{A})^{H}-\mathbb{D}(\mathbf{A}),

(7)

where $\mathbb{L}(\cdot)$ and $\mathbb{D}(\cdot)$ represent matrices formed by the lower triangular and diagonal parts of the input matrix, respectively. Since $\mathbf{A}_{\textsc{zf}}$ and $\mathbf{A}_{\textsc{lmmse}}$ are both Hermitian matrices, $\mathbb{L}(\mathbf{A})^{H}$ in (7) actually represents the upper triangular part of $\mathbf{A}$. For JI, GS and SSOR methods, the preconditioning matrices are defined as follows: $\mathbf{M}_{\textsc{ji}}=\mathbb{D}(\mathbf{A})$, $\mathbf{M}_{\textsc{gs}}=\mathbb{L}(\mathbf{A})$ [32], and $\mathbf{M}_{\textsc{ssor}}=\mathbb{L}(\mathbf{A})\mathbb{D}(\mathbf{A})^{-1}\mathbb{L}(\mathbf{A})^{H}$ [33], respectively. MS-based methods generally have faster convergence than RI due to the use of $\mathbf{M}$, which enables a more efficient update direction. In addition, the inversion of triangular matrices exhibits square-order complexity, making it computationally efficient.
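The three preconditioners above can be written down directly from (6) and (7). The sketch below is illustrative: the function names are hypothetical, and `np.linalg.solve` is used for brevity, whereas a practical implementation would exploit the triangular structure through forward and backward substitution to stay at square-order cost.

```python
import numpy as np

def ms_preconditioner(A, method):
    """Build M for JI, GS, or SSOR from the splitting in (7)."""
    D = np.diag(np.diag(A))              # D(A): diagonal part
    L = np.tril(A)                       # L(A): lower-triangular part (with diagonal)
    if method == "ji":
        return D
    if method == "gs":
        return L
    if method == "ssor":
        return L @ np.linalg.inv(D) @ L.conj().T
    raise ValueError(f"unknown method: {method}")

def ms_iterate(A, b, method, num_iters):
    """MS-based update (6): x_{t+1} = x_t + M^{-1} (b - A x_t)."""
    M = ms_preconditioner(A, method)
    x = np.zeros_like(b)
    for _ in range(num_iters):
        x = x + np.linalg.solve(M, b - A @ x)
    return x
```

For a Hermitian positive-definite Gram matrix, `ms_iterate(A, b, "gs", T)` and `ms_iterate(A, b, "ssor", T)` approach `np.linalg.solve(A, b)` as `T` grows.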

Furthermore, gradient methods can also provide accelerated convergence over RI by using adaptive optimization of the step size, update direction, or both. To avoid redundancy within this paper, a detailed discussion of gradient methods will be presented in Section III-D.

II-C Problem Statement

The convergence rate of the iterative algorithms described in (4) is significantly influenced by the condition number of $\mathbf{A}$ [17]. Specifically, for a given iterative function, it converges faster when the condition number is smaller. In this paper, the condition number is defined as follows

\mathrm{cond}(\mathbf{A})\triangleq\dfrac{\lambda_{\text{max}}(\mathbf{A})}{\lambda_{\text{min}}(\mathbf{A})},

(8)

where $\lambda_{\text{max}}(\cdot)$ and $\lambda_{\text{min}}(\cdot)$ represent the maximum and minimum eigenvalues of the input matrix, respectively. In massive-MIMO systems without spatial correlation, current algorithms can offer fast convergence, since $\mathbf{A}$ is well-conditioned. However, they all demonstrate slow convergence in ELAA-MIMO systems, particularly in scenarios dominated by LoS links [1]. The reason is that ELAA channel matrices can be very ill-conditioned, meaning $\mathrm{cond}(\mathbf{A})\gg 1$ [46]. As discussed in Section I-B, the main cause of (ELAA-)MIMO channel ill-conditioning is the strong intra-user interference. Therefore, the objective of this paper is to efficiently eliminate the impact of intra-user interference on the iterative process, which motivates the following sections.
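The definition in (8) can be evaluated numerically as follows. This is a small sketch under assumed toy data: `cond_gram` is a hypothetical helper, and the two channel matrices are constructed artificially (orthonormal columns versus two nearly parallel columns) to mimic the effect of strong correlation between columns.

```python
import numpy as np

def cond_gram(H, rho=None):
    """Condition number (8) of A_zf (rho=None) or A_lmmse (rho given)."""
    A = H.conj().T @ H
    if rho is not None:
        A = A + np.eye(A.shape[0]) / rho
    eig = np.linalg.eigvalsh(A)           # real eigenvalues in ascending order
    return eig[-1] / eig[0]

rng = np.random.default_rng(2)
G = rng.standard_normal((16, 4)) + 1j * rng.standard_normal((16, 4))
Q, _ = np.linalg.qr(G)                    # orthonormal columns: ideal channel
H_corr = Q.copy()
H_corr[:, 1] = Q[:, 0] + 1e-3 * Q[:, 1]   # two almost identical columns

c_ideal = cond_gram(Q)                    # close to 1
c_corr = cond_gram(H_corr)                # very large
c_corr_lmmse = cond_gram(H_corr, rho=100.0)
```

The LMMSE regularization always reduces the ratio in (8), since adding $\rho^{-1}\mathbf{I}$ shifts both extreme eigenvalues by the same positive amount.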

III UW-SVD-Assisted Iterative Algorithms

This section introduces the concept of UW-SVD and its role in transforming the MIMO signal model into an e-signal model. Subsequently, the derivations of e-ZF and e-LMMSE detectors for the e-signal model are presented. Furthermore, existing iterative algorithms are employed to estimate the e-signal vector, which is then transformed back to the estimation of $\mathbf{x}$ through the post-processing step.

III-A The Concept of UW-SVD

Suppose there are $K$ UEs deployed in the MIMO system, and the $k^{th}$ UE is equipped with $N_{k}$ antennas. The system configuration satisfies: $\sum_{k=1}^{K}N_{k}=N$ . Then, the complete channel matrix can be represented in a concatenated format as follows

\mathbf{H}=[\mathbf{H}_{1},...,\mathbf{H}_{K}],

(9)

where $\mathbf{H}_{k}\in\mathbb{C}^{M\times N_{k}}$ represents the sub-channel matrix corresponding to the $k^{th}$ UE. To eliminate intra-user interference, we apply the economy-size SVD (a variant of SVD that computes only the necessary components for tall matrices, enhancing computational efficiency [47]) to each user’s sub-channel matrix as follows

\mathbf{H}_{k}=\mathbf{U}_{k}\mathbf{\Sigma}_{k}\mathbf{V}^{H}_{k},

(10)

where $\mathbf{U}_{k}\in\mathbb{C}^{M\times N_{k}}$ represents the left unitary matrix, $\mathbf{\Sigma}_{k}\in\mathbb{R}^{N_{k}\times N_{k}}$ the diagonal matrix containing the singular values, and $\mathbf{V}_{k}\in\mathbb{C}^{N_{k}\times N_{k}}$ represents the right unitary matrix. This step is the so-called UW-SVD ¹.

Substituting (10) into (9) and rearranging, $\mathbf{H}$ can be decomposed into the product of the following three matrices

\mathbf{H}=\mathbf{\Psi}\mathbf{\Sigma}\mathbf{V}^{H},

(11)

where $\mathbf{\Psi}\triangleq[\mathbf{U}_{1},...,\mathbf{U}_{K}]$ represents the concatenation of $\mathbf{U}_{k}$. $\mathbf{\Sigma}\triangleq\mathrm{diag}(\mathbf{\Sigma}_{1},\dots,\mathbf{\Sigma}_{K})$ and $\mathbf{V}\triangleq\mathrm{diag}(\mathbf{V}_{1},\dots,\mathbf{V}_{K})$ are block diagonal matrices containing the singular value and right-unitary matrices, respectively. The notation $\mathrm{diag}(\cdot)$ represents the construction of the input matrices in a block diagonal manner. It is obvious that $\mathbf{\Sigma}$ is a positive-real diagonal matrix and $\mathbf{V}$ is a unitary matrix, i.e.,

\mathbf{V}^{H}\mathbf{V}=\mathbf{V}\mathbf{V}^{H}=\mathbf{I}.

(12)

However, it is worth noting that $\mathbf{\Psi}$ is not a unitary matrix in practical MIMO systems. This is because the left-unitary matrices are tall matrices, and $\mathbf{U}_{k}$ for different UEs may not necessarily be orthogonal to each other. Next, we will explore the transformation from the MIMO signal model to e-signal model using UW-SVD.
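The decomposition in (9)-(12) can be sketched with NumPy's economy-size SVD. The function name `uw_svd`, the argument `antennas_per_ue` (a list of the $N_{k}$), and the random test channel are assumptions introduced for illustration.

```python
import numpy as np

def uw_svd(H, antennas_per_ue):
    """User-wise economy-size SVD: returns Psi, the diagonal of Sigma, and V of (11)."""
    N = H.shape[1]
    assert sum(antennas_per_ue) == N
    Psi = np.zeros(H.shape, dtype=complex)   # [U_1, ..., U_K]
    sigma = np.zeros(N)                      # diagonal of Sigma
    V = np.zeros((N, N), dtype=complex)      # block-diagonal right-unitary matrix
    start = 0
    for Nk in antennas_per_ue:
        cols = slice(start, start + Nk)
        U_k, s_k, Vh_k = np.linalg.svd(H[:, cols], full_matrices=False)
        Psi[:, cols] = U_k
        sigma[cols] = s_k
        V[cols, cols] = Vh_k.conj().T
        start += Nk
    return Psi, sigma, V

rng = np.random.default_rng(3)
H = rng.standard_normal((16, 6)) + 1j * rng.standard_normal((16, 6))
Psi, sigma, V = uw_svd(H, [2, 2, 2])
gram_psi = Psi.conj().T @ Psi   # unit diagonal, but generally not the identity
```

Here `Psi @ np.diag(sigma) @ V.conj().T` reproduces `H`, and `V` is unitary, while `gram_psi` retains non-zero off-diagonal entries between different users, in line with the remark that $\mathbf{\Psi}$ is not unitary.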

III-B The e-Signal Model

According to the UW-SVD in (11), the MIMO signal model in (1) can be transformed into an e-signal model as follows

\mathbf{y}=\mathbf{\Psi}\mathbf{s}+\mathbf{z},

(13)

where $\mathbf{s}\triangleq\mathbf{\Sigma}\mathbf{V}^{H}\mathbf{x}$ represents the e-signal vector, and $\mathbf{\Psi}$ represents the e-channel matrix. Therefore, $\mathbf{\Psi}$ and $\mathbf{s}$ are the linear representations of $\mathbf{H}$ and $\mathbf{x}$ , respectively.

Remark 1

With this e-signal model, it is important to understand the properties of $\mathbf{s}$ and $\mathbf{\Psi}$ . Taking $\mathbf{s}$ as an example, its expectation and covariance can be expressed as follows

\mathbb{E}\{\mathbf{s}\}=\mathbf{0};

(14)

\mathbb{E}\{\mathbf{s}\mathbf{s}^{H}\}=\sigma_{x}^{2}\mathbf{\Sigma}^{2},

(15)

which can be easily obtained from $\mathbb{E}\{\mathbf{x}\}=\mathbf{0}$ and $\mathbb{E}\{\mathbf{x}\mathbf{x}^{H}\}=\sigma_{x}^{2}\mathbf{I}$ . This indicates that distinct e-signal data streams are orthogonal to each other and they exhibit different transmission powers. Moreover, we have the following

\mathbb{D}\big{(}\mathbf{\Psi}^{H}\mathbf{\Psi}\big{)}=\mathbf{I},

(16)

which indicates that the complexity of computing $\mathbb{D}\big(\mathbf{\Psi}^{H}\mathbf{\Psi}\big)$ can be ignored in certain iterative algorithms, such as JI and L-BFGS methods.
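For completeness, the moments in (14) and (15) follow in one line each from $\mathbf{s}=\mathbf{\Sigma}\mathbf{V}^{H}\mathbf{x}$. The worked steps below treat the channel realization (and hence $\mathbf{\Sigma}$ and $\mathbf{V}$) as given, which is an assumption made explicit here, and use the fact that $\mathbf{\Sigma}$ is real and diagonal together with (12).

```latex
\begin{aligned}
\mathbb{E}\{\mathbf{s}\}
  &= \mathbf{\Sigma}\mathbf{V}^{H}\,\mathbb{E}\{\mathbf{x}\} = \mathbf{0},\\
\mathbb{E}\{\mathbf{s}\mathbf{s}^{H}\}
  &= \mathbf{\Sigma}\mathbf{V}^{H}\,\mathbb{E}\{\mathbf{x}\mathbf{x}^{H}\}\,\mathbf{V}\mathbf{\Sigma}
   = \sigma_{x}^{2}\,\mathbf{\Sigma}\mathbf{V}^{H}\mathbf{V}\mathbf{\Sigma}
   = \sigma_{x}^{2}\,\mathbf{\Sigma}^{2}.
\end{aligned}
```

Similarly, (16) holds because every column of each $\mathbf{U}_{k}$ has unit norm, so each diagonal entry of $\mathbf{\Psi}^{H}\mathbf{\Psi}$ equals one.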

Note that the condition number of $\mathbf{\Psi}$ is crucial for this paper because it significantly affects the convergence of the UW-SVD-assisted algorithms. This property will be examined in Section IV, where we present a comprehensive convergence analysis. Before that, we focus on the development of linear detectors for the e-signal model in the next subsection.

III-C The e-ZF and e-LMMSE Detectors

In this subsection, we develop the e-ZF and e-LMMSE detectors for the e-signal model. Additionally, it is demonstrated that they can achieve the same detection performance as the corresponding ZF or LMMSE detector after a low-complexity post-processing step.

Given that the e-signal model is a linear representation of the MIMO signal model, its two linear detectors (i.e., e-ZF and e-LMMSE) can be expressed in the following general form

\widehat{\mathbf{s}}=\mathbf{\Phi}^{-1}\mbox{\boldmath$\delta$},

(17)

where $\mbox{\boldmath$\delta$}=\mathbf{\Psi}^{H}\mathbf{y}$ denotes the matched filter vector for the e-signal model. Similar to that in (3), $\mathbf{\Phi}$ for e-ZF and e-LMMSE can be expressed as follows

\mathbf{\Phi}=\left\{\begin{array}{l}\mathbf{\Phi}_{\textsc{zf}}\triangleq\mathbf{\Psi}^{H}\mathbf{\Psi};\\ \mathbf{\Phi}_{\textsc{lmmse}}\triangleq\mathbf{\Psi}^{H}\mathbf{\Psi}+\rho^{-1}\mathbf{\Sigma}^{-2}.\end{array}\right.

(18)

It is obvious that (17) and (2) share the same mathematical structure. Hence, any iterative algorithm designed to determine $\widehat{\mathbf{x}}_{\textsc{zf}}$ or $\widehat{\mathbf{x}}_{\textsc{lmmse}}$ can be directly applied to determine $\widehat{\mathbf{s}}_{\textsc{zf}}$ or $\widehat{\mathbf{s}}_{\textsc{lmmse}}$ , respectively. Since the objective of MIMO signal detection is to reconstruct the transmitted signal vector $\mathbf{x}$ , a post-processing step is required to convert $\widehat{\mathbf{s}}$ back to $\widehat{\mathbf{x}}$ .

Post-Processing Step: Consistent with the definition of $\mathbf{s}$, we propose to recover $\widehat{\mathbf{x}}$ from $\widehat{\mathbf{s}}$ as follows

\widehat{\mathbf{x}}=\mathbf{V}\mathbf{\Sigma}^{-1}\widehat{\mathbf{s}},

(19)

where $\mathbf{\Sigma}$ is a diagonal matrix, so that computing $\mathbf{\Sigma}^{-1}$ requires only linear computational complexity. Plugging (17) and (18) into (19) and simplifying, we have the following

\widehat{\mathbf{x}}_{\textsc{zf}}=\mathbf{V}\mathbf{\Sigma}^{-1}\widehat{\mathbf{s}}_{\textsc{zf}};

(20)

\widehat{\mathbf{x}}_{\textsc{lmmse}}=\mathbf{V}\mathbf{\Sigma}^{-1}\widehat{\mathbf{s}}_{\textsc{lmmse}}.

(21)
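The equivalence stated in (20) and (21) can be checked with a short derivation. The steps below are a sketch that assumes every $\mathbf{H}_{k}$ has full column rank, so that $\mathbf{\Sigma}$ is invertible; they use only (2), (3), (11), (12), (17), and (18).

```latex
\begin{aligned}
\mathbf{A}_{\textsc{lmmse}}
  &= \mathbf{V}\mathbf{\Sigma}\mathbf{\Psi}^{H}\mathbf{\Psi}\mathbf{\Sigma}\mathbf{V}^{H}
     + \rho^{-1}\mathbf{V}\mathbf{\Sigma}\,\mathbf{\Sigma}^{-2}\,\mathbf{\Sigma}\mathbf{V}^{H}
   = \mathbf{V}\mathbf{\Sigma}\,\mathbf{\Phi}_{\textsc{lmmse}}\,\mathbf{\Sigma}\mathbf{V}^{H},\\
\mathbf{b}
  &= \mathbf{H}^{H}\mathbf{y}
   = \mathbf{V}\mathbf{\Sigma}\mathbf{\Psi}^{H}\mathbf{y}
   = \mathbf{V}\mathbf{\Sigma}\,\boldsymbol{\delta},\\
\widehat{\mathbf{x}}_{\textsc{lmmse}}
  &= \mathbf{A}_{\textsc{lmmse}}^{-1}\mathbf{b}
   = \mathbf{V}\mathbf{\Sigma}^{-1}\mathbf{\Phi}_{\textsc{lmmse}}^{-1}\mathbf{\Sigma}^{-1}\mathbf{V}^{H}
     \mathbf{V}\mathbf{\Sigma}\,\boldsymbol{\delta}
   = \mathbf{V}\mathbf{\Sigma}^{-1}\widehat{\mathbf{s}}_{\textsc{lmmse}}.
\end{aligned}
```

Dropping the $\rho^{-1}$ terms gives the ZF case in (20) by the same argument.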

With the post-processing step, the e-ZF and e-LMMSE detectors can provide the same detection performance as the ZF and LMMSE detectors, respectively. This implies that any iterative algorithm that converges to $\widehat{\mathbf{s}}$ can provide ZF or LMMSE detection performance. The specific steps of the UW-SVD-assisted algorithms are discussed in the next subsection.
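The equivalence can also be verified numerically. The sketch below compares the direct detectors of (2) with the e-signal detectors of (17) followed by the post-processing in (19); the helper names, the random channel, and the user configuration are hypothetical.

```python
import numpy as np

def uw_svd(H, antennas_per_ue):
    """User-wise economy-size SVD, as in (9)-(11)."""
    N = H.shape[1]
    Psi = np.zeros(H.shape, dtype=complex)
    sigma = np.zeros(N)
    V = np.zeros((N, N), dtype=complex)
    start = 0
    for Nk in antennas_per_ue:
        cols = slice(start, start + Nk)
        U_k, s_k, Vh_k = np.linalg.svd(H[:, cols], full_matrices=False)
        Psi[:, cols] = U_k
        sigma[cols] = s_k
        V[cols, cols] = Vh_k.conj().T
        start += Nk
    return Psi, sigma, V

def direct_detect(H, y, rho=None):
    """ZF (rho=None) or LMMSE estimate from (2)-(3)."""
    A = H.conj().T @ H
    if rho is not None:
        A = A + np.eye(A.shape[0]) / rho
    return np.linalg.solve(A, H.conj().T @ y)

def e_detect(H, y, antennas_per_ue, rho=None):
    """e-ZF (rho=None) or e-LMMSE estimate from (17)-(18), then post-processing (19)."""
    Psi, sigma, V = uw_svd(H, antennas_per_ue)
    Phi = Psi.conj().T @ Psi
    if rho is not None:
        Phi = Phi + np.diag(1.0 / (rho * sigma**2))   # rho^{-1} Sigma^{-2}
    s_hat = np.linalg.solve(Phi, Psi.conj().T @ y)
    return V @ (s_hat / sigma)                        # x_hat = V Sigma^{-1} s_hat

rng = np.random.default_rng(4)
H = rng.standard_normal((32, 8)) + 1j * rng.standard_normal((32, 8))
y = rng.standard_normal(32) + 1j * rng.standard_normal(32)
ue = [2, 2, 2, 2]
```

With these definitions, `e_detect(H, y, ue)` agrees with `direct_detect(H, y)` and `e_detect(H, y, ue, rho)` agrees with `direct_detect(H, y, rho)` up to floating-point error.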

III-D UW-SVD Assisted Iterative Algorithms

As discussed in Section III-C, all the MS-based methods and gradient methods can be directly applied to estimate $\mathbf{s}$. (AMP and its variants require further modifications to determine $\mathbf{s}$ due to their structures; this exploration is beyond the scope of this paper. Additionally, using MS-based methods and L-BFGS is sufficient to demonstrate the advantage of the proposed UW-SVD method.) These methods share the same iterative structure as (4), except that the specific parameters are adjusted as follows

\mathbf{s}_{t+1}=f(\mathbf{s}_{t};\mathbf{\Phi},\mbox{\boldmath$\delta$}),

(22)

which defines the so-called UW-SVD-assisted iterative algorithms. In the case where $\mathbf{\Phi}=\mathbf{\Phi}_{\textsc{zf}}$, $\mathbf{s}_{t}$ will converge to the e-ZF solution. Conversely, when $\mathbf{\Phi}=\mathbf{\Phi}_{\textsc{lmmse}}$, $\mathbf{s}_{t}$ will converge to the e-LMMSE solution. It is worth noting that the convergence rate of UW-SVD-assisted algorithms is dominated by $\mathrm{cond}(\mathbf{\Phi})$, rather than $\mathrm{cond}(\mathbf{A})$. The comparison between $\mathrm{cond}(\mathbf{\Phi})$ and $\mathrm{cond}(\mathbf{A})$ will be comprehensively explored in Section V.

Similar to (6), MS-based methods assisted by UW-SVD can be expressed as follows

f_{\textsc{ms}}(\mathbf{s}_{t};\mathbf{\Phi},\mbox{\boldmath$\delta$})=\mathbf{s}_{t}+\mathbf{M}^{-1}(\mbox{\boldmath$\delta$}-\mathbf{\Phi}\mathbf{s}_{t}),

(23)

where the preconditioning matrix $\mathbf{M}$ should be constructed based on the matrix splitting of $\mathbf{\Phi}$. For the case $\mathbf{\Phi}=\mathbf{\Phi}_{\textsc{zf}}$, JI and SSOR methods can be further simplified. Specifically, JI and RI are equivalent to each other because $\mathbf{M}_{\textsc{ji}}=\mathbb{D}(\mathbf{\Phi}_{\textsc{zf}})$ is an identity matrix; and $\mathbf{M}$ for SSOR can be simplified to $\mathbf{M}_{\textsc{ssor}}=\mathbb{L}(\mathbf{\Phi}_{\textsc{zf}})\mathbb{L}(\mathbf{\Phi}_{\textsc{zf}})^{H}$.
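The simplified SSOR preconditioner for the e-ZF case can be sketched as two triangular solves per iteration. In the example below, the e-channel is a hypothetical stand-in built by concatenating per-user blocks with orthonormal columns (obtained via QR), which has the same unit-diagonal Gram structure as $\mathbf{\Psi}$ in (16); `np.linalg.solve` is used for brevity instead of dedicated forward and backward substitution.

```python
import numpy as np

def ssor_unit_diag(Phi, delta, num_iters):
    """Update (23) with M_ssor = L(Phi) L(Phi)^H, valid when D(Phi) = I."""
    L = np.tril(Phi)
    Lh = L.conj().T
    s = np.zeros_like(delta)
    for _ in range(num_iters):
        r = delta - Phi @ s                                   # residual
        s = s + np.linalg.solve(Lh, np.linalg.solve(L, r))    # M^{-1} r
    return s

# Hypothetical e-channel: K users, each with N_ue orthonormal columns.
rng = np.random.default_rng(5)
M_rx, K, N_ue = 32, 3, 2
blocks = []
for _ in range(K):
    G = rng.standard_normal((M_rx, N_ue)) + 1j * rng.standard_normal((M_rx, N_ue))
    Q, R = np.linalg.qr(G)
    blocks.append(Q)
Psi = np.hstack(blocks)
Phi = Psi.conj().T @ Psi
delta = Psi.conj().T @ (rng.standard_normal(M_rx) + 1j * rng.standard_normal(M_rx))
s_hat = ssor_unit_diag(Phi, delta, 100)
```

Because the diagonal of `Phi` is all ones, the general form $\mathbb{L}\mathbb{D}^{-1}\mathbb{L}^{H}$ coincides with $\mathbb{L}\mathbb{L}^{H}$, and the JI preconditioner reduces to the identity.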

Gradient methods can also be employed to address the problem in (17), such as SD, CG and L-BFGS. It is demonstrated that L-BFGS converges faster than SD while maintaining similar square-order complexity [24]. Moreover, it is proven that L-BFGS and CG are equivalent when solving the convex MIMO detection problem [48]. Therefore, we consider L-BFGS as an example of gradient methods in this paper. Its iterative process is given by [19]

f_{\textsc{lbfgs}}(\mathbf{s}_{t};\mathbf{\Phi},\mbox{\boldmath$\delta$})=\mathbf{s}_{t}+\xi_{t}\mathbf{d}_{t},

(24)

where $\xi_{t}$ denotes the step size as follows

\xi_{t}=-\dfrac{\mathbf{g}_{t}^{H}\mathbf{d}_{t}}{\mathbf{d}_{t}^{H}\mathbf{\Phi}\mathbf{d}_{t}},

(25)

and $\mathbf{d}_{t}$ denotes the update direction as follows

\mathbf{d}_{t}=\mathbf{\Theta}_{t}\mathbf{g}_{t},

(26)

where $\mathbf{g}_{t}\triangleq\mathbf{\Phi}\mathbf{s}_{t}-\mbox{\boldmath$\delta$}$ denotes the gradient direction. $\mathbf{\Theta}_{t}$ represents the approximation of the inverse Hessian matrix (up to sign, which is compensated by the step size in (25)), and it can be expressed as follows

\mathbf{\Theta}_{t}=\bigg(\dfrac{(\mathbf{s}_{t}-\mathbf{s}_{t-1})(\mathbf{g}_{t}-\mathbf{g}_{t-1})^{H}}{(\mathbf{s}_{t}-\mathbf{s}_{t-1})^{H}(\mathbf{g}_{t}-\mathbf{g}_{t-1})}-\mathbf{I}\bigg)\mathbf{\Theta}_{0},

(27)

where $\mathbf{\Theta}_{0}$ represents the initial approximation. Typically, $\mathbf{\Theta}_{0}$ is set as $\mathbb{D}(\mathbf{\Phi})^{-1}$ . For e-ZF detector, the term $\mathbf{\Theta}_{0}$ in (27) can be omitted, since $\mathbb{D}(\mathbf{\Phi}_{\textsc{zf}})^{-1}=\mathbf{I}$ .
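The updates (24)-(27) can be sketched as follows. This is an illustrative reading of the equations, not a reference implementation: the function name, the stopping tolerance, and the choice to store $\mathbf{\Theta}_{0}=\mathbb{D}(\mathbf{\Phi})^{-1}$ as a vector are assumptions, and $\mathbf{\Theta}_{t}\mathbf{g}_{t}$ is evaluated without ever forming $\mathbf{\Theta}_{t}$ explicitly.

```python
import numpy as np

def lbfgs_solve(Phi, delta, num_iters, tol=1e-10):
    """Iterate (24)-(27) from s_0 = 0 to approximate the solution of Phi s = delta."""
    theta0 = 1.0 / np.real(np.diag(Phi))     # Theta_0 = D(Phi)^{-1}, kept as a vector
    s = np.zeros_like(delta)
    g = Phi @ s - delta                      # gradient g_t
    d = theta0 * g                           # d_0 = Theta_0 g_0
    for _ in range(num_iters):
        if np.linalg.norm(g) <= tol * np.linalg.norm(delta):
            break                            # converged; also avoids a 0/0 below
        xi = -np.vdot(g, d) / np.vdot(d, Phi @ d)    # step size (25); vdot conjugates g
        s_new = s + xi * d                           # update (24)
        g_new = Phi @ s_new - delta
        ds, dg = s_new - s, g_new - g
        w = theta0 * g_new                           # Theta_0 g_{t+1}
        # (26)-(27): d = ds * (dg^H w) / (ds^H dg) - w
        d = ds * (np.vdot(dg, w) / np.vdot(ds, dg)) - w
        s, g = s_new, g_new
    return s
```

For a Hermitian positive-definite `Phi`, the iterate returned by `lbfgs_solve(Phi, delta, T)` approaches `np.linalg.solve(Phi, delta)` within a number of iterations on the order of the matrix dimension.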

Algorithm UW-SVD assisted L-BFGS algorithm

Input: $\mathbf{y}$: received signal vector; $\mathbf{H}$: MIMO channel matrix; $\rho$: SNR; $T$: number of iterations; $\mathbf{s}_{0}=\mathbf{0}$: the initialization vector.

Output: $\widehat{\mathbf{x}}$: the estimation of $\mathbf{x}$;

1: let $t=0$; call (11) to compute $\mathbf{\Psi}$, $\mathbf{\Sigma}$, and $\mathbf{V}$;

2: let $\mbox{\boldmath$\delta$}=\mathbf{\Psi}^{H}\mathbf{y}$; let $\mathbf{\Phi}=\mathbf{\Psi}^{H}\mathbf{\Psi}+\rho^{-1}\mathbf{\Sigma}^{-2}$;

3: call (24) to compute $\mathbf{s}_{t+1}$; then $t\leftarrow t+1$;

4: repeat step 3 until $t=T$;

5: call (19) to compute $\widehat{\mathbf{x}}$;

Pseudocode: The UW-SVD assisted L-BFGS algorithm is presented in the Algorithm. It can provide the LMMSE detection performance, since the filter matrix $\mathbf{\Phi}$ is set to be $\mathbf{\Phi}_{\textsc{lmmse}}$. By setting $\mathbf{\Phi}=\mathbf{\Psi}^{H}\mathbf{\Psi}$ in step $2$, the algorithm will provide ZF detection performance. In step $3$, (24) is the iterative function of L-BFGS. Therefore, if it is replaced by (23), the Algorithm becomes a UW-SVD assisted MS-based method. Step $5$ is the post-processing step, which recovers $\widehat{\mathbf{x}}$ from $\widehat{\mathbf{s}}$. Additionally, UW-SVD can accelerate the convergence of numerous other iterative algorithms, such as SD and CG. The proposed UW-SVD method has a simple structure, which facilitates its application in accelerating the convergence of various existing algorithms.
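The five steps of the Algorithm can be sketched end to end with a pluggable iterative solver. To illustrate the point that step 3 can be swapped, the sketch below uses the MS-based update (23) with the GS choice $\mathbf{M}=\mathbb{L}(\mathbf{\Phi})$ rather than L-BFGS; all function names, the `solver` argument, and the toy channel are hypothetical.

```python
import numpy as np

def uw_svd(H, antennas_per_ue):
    """User-wise economy-size SVD, as in (9)-(11)."""
    N = H.shape[1]
    Psi = np.zeros(H.shape, dtype=complex)
    sigma = np.zeros(N)
    V = np.zeros((N, N), dtype=complex)
    start = 0
    for Nk in antennas_per_ue:
        cols = slice(start, start + Nk)
        U_k, s_k, Vh_k = np.linalg.svd(H[:, cols], full_matrices=False)
        Psi[:, cols] = U_k
        sigma[cols] = s_k
        V[cols, cols] = Vh_k.conj().T
        start += Nk
    return Psi, sigma, V

def gs_solver(Phi, delta, T):
    """T iterations of (23) with the Gauss-Seidel preconditioner M = L(Phi)."""
    L = np.tril(Phi)
    s = np.zeros_like(delta)                       # s_0 = 0
    for _ in range(T):
        s = s + np.linalg.solve(L, delta - Phi @ s)
    return s

def uw_svd_detect(H, y, rho, antennas_per_ue, T, solver=gs_solver):
    Psi, sigma, V = uw_svd(H, antennas_per_ue)     # step 1: UW-SVD (11)
    delta = Psi.conj().T @ y                       # step 2: matched filter
    Phi = Psi.conj().T @ Psi + np.diag(1.0 / (rho * sigma**2))  # Phi_lmmse (18)
    s_hat = solver(Phi, delta, T)                  # steps 3-4: T iterations
    return V @ (s_hat / sigma)                     # step 5: post-processing (19)

rng = np.random.default_rng(7)
H = rng.standard_normal((32, 8)) + 1j * rng.standard_normal((32, 8))
y = rng.standard_normal(32) + 1j * rng.standard_normal(32)
x_hat = uw_svd_detect(H, y, rho=10.0, antennas_per_ue=[2, 2, 2, 2], T=100)
```

Any function with the signature `(Phi, delta, T)` can be passed as `solver`, so an L-BFGS, SD, or CG routine fits into the same pipeline unchanged.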

III-E Complexity Analysis

TABLE I: Complexity Analysis of UW-SVD and Various Iterative MIMO Detectors

Algorithms	Calculation of $\mathbf{A}$ or $\mathbf{\Phi}$	Matrix Inverse	Per Iteration	UW-SVD
ZF/LMMSE	$MN^{2}$	$N^{3}$	0	$N_{\textsc{ue}}MN+N_{\textsc{ue}}N+2N$
RI	$0$	$0$	$2MN$
JI	$0$	$0$	$2MN+N$
GS	$MN^{2}$	$N^{2}$	$1.5N^{2}$
SSOR	$MN^{2}$	$N^{2}$	$2N^{2}+N$
L-BFGS	$0$	$0$	$4MN+N^{2}+5N$

The objective of this section is to demonstrate that the proposed UW-SVD method has low computational complexity. To simplify and clarify the complexity analysis, we adopt a common assumption in multi-user MIMO systems with $K$ users, where each user has $N_{\textsc{ue}}$ antennas, i.e., $KN_{\textsc{ue}}=N$ . The computational complexity of UW-SVD assisted iterative algorithms can be divided into three main parts: UW-SVD, post-processing, and the iterative process. We start the complexity analysis with the UW-SVD and post-processing steps.

Performing SVD on $\mathbf{H}_{k}$ has a complexity of $MN_{\textsc{ue}}^{2}$ [49], resulting in a total complexity of $KMN_{\textsc{ue}}^{2}$ for all the users. Since $KN_{\textsc{ue}}=N$ , this can equivalently be expressed as $N_{\textsc{ue}}MN$ . In the post-processing step, computing $\mathbf{\Sigma}^{-1}$ has a complexity of $N$ , and computing $[\mathbf{\Sigma}^{-1}\mathbf{s}_{t}]$ has the same complexity of $N$ . Moreover, given that $\mathbf{V}$ is a block diagonal matrix, the complexity of calculating $\mathbf{V}[\mathbf{\Sigma}^{-1}\mathbf{s}_{t}]$ is $KN_{\textsc{ue}}^{2}=N_{\textsc{ue}}N$ . Therefore, the overall complexity of the post-processing step is $N_{\textsc{ue}}N+2N$ , and the total complexity of UW-SVD together with the post-processing step is $N_{\textsc{ue}}MN+N_{\textsc{ue}}N+2N$ . In MIMO systems, the number of antennas per UE is usually small, typically $N_{\textsc{ue}}=2$ or $4$ . Hence, the complexity of the UW-SVD method remains of quadratic order, i.e., it scales with $MN$ .

It is worth noting that not all the iterative algorithms require the computation of $\mathbf{A}$ or $\mathbf{\Phi}$ ; the RI, JI, and L-BFGS methods do not. Taking RI as an example, if we substitute $\mathbf{\Phi}_{\textsc{zf}}=\mathbf{\Psi}^{H}\mathbf{\Psi}$ into (23), its iterative process can be expressed as follows

f_{\textsc{ri}}(\mathbf{s}_{t};\mathbf{\Phi},\mbox{\boldmath$\delta$})=\mathbf{s}_{t}+(\mbox{\boldmath$\delta$}-\mathbf{\Psi}^{H}\mathbf{\Psi}\mathbf{s}_{t}),

(28)

where we can first compute $[\mathbf{\Psi}\mathbf{s}_{t}]$ with a complexity of $MN$ , and then compute $\mathbf{\Psi}^{H}[\mathbf{\Psi}\mathbf{s}_{t}]$ with another complexity of $MN$ . In this successive manner, the calculation of $\mathbf{\Phi}$ can be avoided. This can also be applied to all the other iterative methods, such as the calculation of $\xi_{t}$ in (25) in L-BFGS method. In addition, a similar complexity can be obtained by replacing $\mathbf{\Phi}_{\textsc{zf}}$ with $\mathbf{\Phi}_{\textsc{lmmse}}$ in (28). The complexity of computing $\mathbf{\Sigma}^{-1}$ is only $N$ , since it is a diagonal matrix. Furthermore, the calculation of $\mathbb{D}(\mathbf{\Psi}^{H}\mathbf{\Psi})^{-1}$ in JI and L-BFGS methods can be ignored because it is an identity matrix according to (16).
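The successive evaluation described above can be sketched as follows. This is a minimal illustration, assuming NumPy arrays; the function name `ri_step` and the random test data are hypothetical. It performs one RI update (28) using two matrix-vector products and never forms $\mathbf{\Psi}^{H}\mathbf{\Psi}$ explicitly.

```python
import numpy as np

def ri_step(s_t, Psi, delta):
    """One RI update (28) without forming Phi_zf = Psi^H Psi."""
    Psi_s = Psi @ s_t                              # first product, about M*N multiplications
    return s_t + (delta - Psi.conj().T @ Psi_s)    # second product, another M*N

# Hypothetical data, used only to compare against the explicit-matrix form.
rng = np.random.default_rng(0)
M, N = 32, 8
Psi = rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N))
delta = rng.standard_normal(N) + 1j * rng.standard_normal(N)
s0 = rng.standard_normal(N) + 1j * rng.standard_normal(N)

s1_matrix_free = ri_step(s0, Psi, delta)
s1_explicit = s0 + (delta - (Psi.conj().T @ Psi) @ s0)   # costs M*N^2 to form Phi
```

Both expressions give the same update; the matrix-free form simply avoids the $MN^{2}$ cost of building $\mathbf{\Phi}$ .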

The authors are aware that certain iterative algorithms, such as the GS and SSOR methods, do require the calculation of $\mathbf{A}$ or $\mathbf{\Phi}$ . This is because they need to compute $\mathbb{L}(\mathbf{A})^{-1}$ or $\mathbb{L}(\mathbf{\Phi})^{-1}$ , whose complexity is $N^{2}$ due to the triangular structure. In short, the complexity of these iterative algorithms remains essentially the same whether UW-SVD is applied or not. Moreover, ZF and LMMSE detectors require the computation of $\mathbf{A}$ with a cubic-order complexity of $MN^{2}$ . They also necessitate the calculation of $\mathbf{A}^{-1}$ with a cubic-order complexity of $N^{3}$ . TABLE I summarizes the complexity of UW-SVD and various MIMO detectors. The matrix inverse operation is the primary reason why ZF/LMMSE is impractical for real-time signal processing, since it has cubic-order complexity and must be computed serially [17]. This motivates traditional iterative methods to (partially) circumvent the need for matrix inversion.

Discussion of complexity reduction: Our simulation results demonstrate that the proposed UW-SVD method can reduce the computational complexity of SSOR and L-BFGS methods by up to $90\%$ and $67\%$ , respectively (see Figs. 5(d) and 7). This substantial reduction in complexity is primarily achieved by decreasing the number of iterations required, despite the additional complexity introduced by UW-SVD. As shown in TABLE I, the extra computational burden imposed by UW-SVD is equivalent to about $16$ SSOR iterations or a single L-BFGS iteration. However, the advantages of UW-SVD far outweigh its cost. For instance, the result in Fig. 5(d) shows that UW-SVD can save up to approximately $240$ SSOR iterations. Moreover, our simulation results in Fig. 7 demonstrate that UW-SVD can save up to $13$ L-BFGS iterations. This reduction in the number of iterations more than offsets the additional processing cost of UW-SVD, thus significantly reducing the overall computational complexity.
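The overhead comparison can be reproduced from the entries of TABLE I with a few lines of arithmetic. The sketch below assumes the configuration of Experiment 1 ( $M=256$ , $K=8$ , $N_{\textsc{ue}}=4$ , hence $N=32$ ); the text does not state which dimensions underlie the quoted equivalence, so this choice is an assumption, and the function names are hypothetical.

```python
def uw_svd_cost(M, N, n_ue):
    """UW-SVD plus post-processing cost from TABLE I."""
    return n_ue * M * N + n_ue * N + 2 * N

def ssor_per_iter(N):
    """Per-iteration cost of SSOR from TABLE I."""
    return 2 * N**2 + N

def lbfgs_per_iter(M, N):
    """Per-iteration cost of L-BFGS from TABLE I."""
    return 4 * M * N + N**2 + 5 * N

# Assumed dimensions (Experiment 1 configuration).
M, K, n_ue = 256, 8, 4
N = K * n_ue

overhead = uw_svd_cost(M, N, n_ue)
ssor_equiv = overhead / ssor_per_iter(N)        # overhead in units of SSOR iterations
lbfgs_equiv = overhead / lbfgs_per_iter(M, N)   # overhead in units of L-BFGS iterations
print(f"UW-SVD overhead: {overhead} operations")
print(f"equivalent SSOR iterations:   {ssor_equiv:.2f}")
print(f"equivalent L-BFGS iterations: {lbfgs_equiv:.2f}")
```

Under these assumed dimensions, the overhead rounds to $16$ SSOR iterations and to one L-BFGS iteration.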

IV Convergence Analysis

In this section, the objective is to compare $\mathrm{cond}(\mathbf{A})$ and $\mathrm{cond}(\mathbf{\Phi})$ in both massive-MIMO and ELAA-MIMO systems. The next two subsections provide detailed results of the comparison in each system, respectively.

IV-A Massive-MIMO with i.i.d. Rayleigh Fading Channels

To better understand the relationship between $\mathrm{cond}(\mathbf{A})$ and $\mathrm{cond}(\mathbf{\Phi})$ , we first introduce the following concept of favorable propagation in massive-MIMO systems.

Property 1 (Favorable Propagation [50])

Suppose that the elements of $\mathbf{H}$ follow independent and identically distributed (i.i.d.) Rayleigh fading as in (40). Given $N_{k},\forall k$ , as $M$ tends to infinity, we have the following

\lim\limits_{M\rightarrow\infty}\mathbf{H}_{k}^{H}\mathbf{H}_{k}=\mathbf{I},\quad\forall k.

(29)

Theorem 1

Suppose that every element of $\mathbf{H}$ obeys the i.i.d. Rayleigh distribution in (40). Given $N_{k},\forall k$ , as $M$ tends to infinity, we have the following

\lim\limits_{M\rightarrow\infty}\mathrm{cond}(\mathbf{\Phi}_{\textsc{zf}})=\mathrm{cond}(\mathbf{A}_{\textsc{zf}});

(30)

\lim\limits_{M\rightarrow\infty}\mathrm{cond}(\mathbf{\Phi}_{\textsc{lmmse}})=\mathrm{cond}(\mathbf{A}_{\textsc{lmmse}}).

(31)

Proof:

See Appendix A ∎

Theorem 1 implies that, in i.i.d. Rayleigh fading channels, the UW-SVD-assisted algorithm has a convergence rate comparable to that of the existing algorithm. The reason is that the intra-user interference in such channels is very weak, which limits the gain of UW-SVD. This theoretical finding is verified by the numerical results of Experiment 1 in Section V-C. On the contrary, intra-user interference is strong in spatially correlated (ELAA-)MIMO systems, especially in the presence of LoS links. In the next subsection, we will show that $\mathrm{cond}(\mathbf{\Phi})<\mathrm{cond}(\mathbf{A})$ in such channels.
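The trend stated in Theorem 1 can be observed numerically. The following sketch, with hypothetical dimensions and a hypothetical helper `cond_pair`, draws i.i.d. Rayleigh channels according to (40) for increasing $M$ and records $\mathrm{cond}(\mathbf{A}_{\textsc{zf}})$ and $\mathrm{cond}(\mathbf{\Phi}_{\textsc{zf}})$ . It illustrates the asymptotic statement for a single channel draw and is not a substitute for the proof.

```python
import numpy as np

def cond_pair(H, n_ue):
    """Return (cond(A_zf), cond(Phi_zf)) for a channel with n_ue columns per user."""
    # Psi stacks the left singular vectors of every user's sub-channel matrix.
    Psi = np.hstack([
        np.linalg.svd(H[:, k:k + n_ue], full_matrices=False)[0]
        for k in range(0, H.shape[1], n_ue)
    ])
    A_zf = H.conj().T @ H
    Phi_zf = Psi.conj().T @ Psi
    return np.linalg.cond(A_zf), np.linalg.cond(Phi_zf)

rng = np.random.default_rng(1)
K, n_ue = 4, 2                 # assumed toy configuration
N = K * n_ue
results = {}
for M in (64, 512, 4096):
    # i.i.d. Rayleigh fading with variance 1/M per element, as in (40).
    H = (rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N))) / np.sqrt(2 * M)
    results[M] = cond_pair(H, n_ue)
    print(M, results[M])
```

As $M$ grows, the two condition numbers are expected to approach each other (and both approach $1$ ), which is why UW-SVD offers little gain in this channel model.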

IV-B Spatially Correlated (ELAA-)MIMO Channels

Given that UW-SVD aims to address intra-user interference, our focus lies primarily on understanding the user-side spatial correlation (footnote 4: this choice facilitates the theoretical analysis; in our simulations, we adopt the Kronecker model in (44), which considers both user-side and BS-side spatial correlations), which is defined as follows [8]

\mathbf{R}_{\textsc{ue}}\triangleq\mathbb{E}\{\mathbf{H}^{H}\mathbf{H}\},

(32)

where $\mathbf{R}_{\textsc{ue}}$ is usually described by an exponential correlation matrix in conventional massive-MIMO systems. For example, if two user antennas are situated at a distance of $d$ , the correlation between these two antennas can be expressed as follows [8]

r(d)=\exp(-d/\mu),

(33)

where $\mu\geq 0$ represents the scaling factor. In multi-user MIMO systems, the distance between different users is typically much greater than the distance between antennas belonging to the same UE. Therefore, we have the following assumption:

\textit{A1):}\quad\mathbf{R}_{\textsc{ue}}^{k,j}=\mathbf{0},\quad\forall k\neq j,

(34)

where $\mathbf{R}_{\textsc{ue}}^{k,j}\in\mathbb{R}^{N_{k}\times N_{j}}$ denotes a block of $\mathbf{R}_{\textsc{ue}}$ representing the correlation between user $k$ and user $j$ .

Let us take an example to validate this assumption. Suppose we have a UE equipped with two antennas spaced apart by half the carrier wavelength. Assuming the carrier frequency is $3.5$ $\mathrm{GHz}$ and the parameter $\mu$ equals $0.2$ , the correlation between the two intra-user antennas is $r(0.0429)\approx 0.8$ . This suggests a significant spatial correlation between the two intra-user antennas. In contrast, when considering two distinct users separated by one meter, their correlation $r(1)=\exp(-5)\approx 6.7\times 10^{-3}$ implies that their channels are nearly uncorrelated. In real-world scenarios, user distances typically exceed one meter. Therefore, the assumption A1) in (34) is reasonable for practical MIMO systems.
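The two correlation values in this example follow directly from (33) and can be evaluated with a short script. The sketch below assumes the speed of light $c=299{,}792{,}458$ m/s to obtain the wavelength; the function name `exp_correlation` is hypothetical.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0   # m/s

def exp_correlation(d, mu):
    """Exponential correlation r(d) = exp(-d / mu) from (33)."""
    return math.exp(-d / mu)

carrier_hz = 3.5e9
mu = 0.2
wavelength = SPEED_OF_LIGHT / carrier_hz          # carrier wavelength in meters

r_intra = exp_correlation(wavelength / 2, mu)     # two antennas of the same UE
r_inter = exp_correlation(1.0, mu)                # two users one meter apart

print(f"half wavelength: {wavelength / 2:.4f} m")
print(f"intra-user correlation: {r_intra:.3f}")
print(f"inter-user correlation: {r_inter:.2e}")
```

The gap of roughly two orders of magnitude between the two values is what motivates treating $\mathbf{R}_{\textsc{ue}}$ as block diagonal in A1.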

Lemma 1

Given A1, suppose that the MIMO channel is $\mathbf{H}=\mathbf{\Omega}\sqrt{\mathbf{R}_{\textsc{ue}}}$ , where each element of $\mathbf{\Omega}$ follows an i.i.d. Rayleigh distribution. Then, we have the following

\lim\limits_{M\rightarrow\infty}\mathbf{H}_{k}^{H}\mathbf{H}_{j}=\mathbf{0},\quad\forall k\neq j,

(35)

where $\mathbf{R}_{\textsc{ue}}^{k,k}\in\mathbb{R}^{N_{k}\times N_{k}}$ is a block of $\mathbf{R}_{\textsc{ue}}$ representing the correlation between the antenna elements of user $k$ .

Proof:

See Appendix B. ∎

In massive-MIMO systems, the number of service-antennas can be in the hundreds or even thousands. Consequently, the condition in (35) can be approximated as follows

\textit{A2):}\quad\mathbf{H}_{k}^{H}\mathbf{H}_{j}=\mathbf{0},\quad\forall k\neq j

(36)

Moreover, as discussed in section I-B, in ELAA-MIMO systems, intra-user interference is much stronger than inter-user interference. This is because the dominant power of different users can be received by different service antennas. This phenomenon is known as spatial orthogonality [51, 6]. Therefore, it can be assumed that A2 also holds in ELAA-MIMO systems. With assumption A2, our focus shifts to the comparison between $\mathrm{cond}(\mathbf{A})$ and $\mathrm{cond}(\mathbf{\Phi})$ for ZF and LMMSE detectors, respectively.

Theorem 2

Given A2, it can be obtained that the condition number of $\mathbf{A}_{\textsc{zf}}$ is larger than that of $\mathbf{\Phi}_{\textsc{zf}}$ , i.e.,

\mathrm{cond}(\mathbf{\Phi}_{\textsc{zf}})<\mathrm{cond}(\mathbf{A}_{\textsc{zf}}).

(37)

Proof:

See Appendix C ∎
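Theorem 2 can be illustrated with a synthetic channel that satisfies A2 exactly. The sketch below is a toy construction and not one of the channel models of Section V: the user subspaces are taken from a QR factorization so that $\mathbf{H}_{k}^{H}\mathbf{H}_{j}=\mathbf{0}$ for $k\neq j$ , and a hypothetical $2\times 2$ mixing matrix `mix` introduces strong intra-user correlation.

```python
import numpy as np

rng = np.random.default_rng(2)
M, K, n_ue = 64, 4, 2              # assumed toy dimensions
N = K * n_ue

# Orthonormal columns; each block of n_ue columns spans one user's subspace,
# so different users are mutually orthogonal (assumption A2).
Q, _ = np.linalg.qr(rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N)))

# Hypothetical mixing matrix that correlates the two columns of every user.
mix = np.array([[1.0, 0.9],
                [0.9, 1.0]])

H = np.empty((M, N), dtype=complex)
for k in range(K):
    blk = slice(k * n_ue, (k + 1) * n_ue)
    H[:, blk] = Q[:, blk] @ mix

# User-wise SVD: Psi collects the left singular vectors of each H_k.
Psi = np.hstack([
    np.linalg.svd(H[:, k * n_ue:(k + 1) * n_ue], full_matrices=False)[0]
    for k in range(K)
])

cond_A = np.linalg.cond(H.conj().T @ H)        # dominated by intra-user correlation
cond_Phi = np.linalg.cond(Psi.conj().T @ Psi)  # identity under A2
print(f"cond(A_zf)   = {cond_A:.1f}")
print(f"cond(Phi_zf) = {cond_Phi:.6f}")
```

In this construction each diagonal block of $\mathbf{A}_{\textsc{zf}}$ equals the square of `mix`, whose eigenvalues are $1.9^{2}$ and $0.1^{2}$ , so $\mathrm{cond}(\mathbf{A}_{\textsc{zf}})=361$ while $\mathrm{cond}(\mathbf{\Phi}_{\textsc{zf}})=1$ up to rounding.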

Theorem 3

Given A2, suppose that the transmitted power is normalized to $1$ , i.e., $\sigma_{x}^{2}=1$ , we have the following

\mathrm{cond}(\mathbf{\Phi}_{\textsc{lmmse}})<\mathrm{cond}(\mathbf{A}_{\textsc{lmmse}}),

(38)

when

\sqrt{\mathrm{cond}(\mathbf{A}_{\textsc{zf}})}\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})>\sigma_{z}^{2}.

(39)

Proof:

See Appendix D ∎

Remark 2

Theorem 2 implies that UW-SVD-assisted algorithms can converge to the ZF detection performance faster than current algorithms. Theorem 3 suggests a similar conclusion for LMMSE detectors, but with the constraint that the SNR should be greater than a threshold. Note that in any MIMO system for multiplexing transmission, it is typically necessary for the minimum eigenvalue of the channel gain to exceed the noise power, i.e., $\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})>\sigma_{z}^{2}$ . Also, the condition number is at least $1$ by definition, so that $\sqrt{\mathrm{cond}(\mathbf{A}_{\textsc{zf}})}\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})\geq\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})$ . Therefore, the inequality in (39) will be satisfied in practical MIMO systems if we aim for acceptable detection performance using multiplexing techniques.

Remark 3

Theorem 2 and Theorem 3 rely on A2, which implicitly requires $M$ to be much greater than $N$ or to approach infinity. However, deploying such a large number of service antennas is economically impractical in real-world scenarios. This necessitates the determination of specific, implementable ranges for $M$ and $N$ . The stochastic nature of (ELAA-)MIMO channels makes it intractable to derive an exact formula for the $M/N$ ratio at which UW-SVD outperforms conventional methods. To address this, we turn to experimental results for insights into practical $M/N$ ratios. Our experiments, detailed in Section V, reveal that UW-SVD achieves significant gains when the $M/N$ ratio is $4$ or $8$ . These ratios are not only feasible but also commonly found in MIMO systems, which supports the applicability of our methods to practical implementations.

V Numerical and Simulation Results

In this section, the objectives are 1) to compare $\mathrm{cond}(\mathbf{\Phi})$ and $\mathrm{cond}(\mathbf{A})$ in various types of (ELAA-)MIMO channels; 2) to demonstrate that UW-SVD accelerates the convergence of current algorithms; and 3) to establish that the advantages observed in uncoded MIMO systems also apply to coded MIMO systems. This motivates the following three subsections.

V-A Channel Models

Model 1: In massive-MIMO systems, each element of $\mathbf{H}$ is usually assumed to obey i.i.d. Rayleigh fading as follows

H_{m,n}=\omega_{m,n}\sim\mathcal{CN}(0,1/M),

(40)

where $1/M$ denotes the normalized variance of each channel element. This indicates that the propagation environment is in the non-LoS (NLoS) state. (Footnote 5: In the LoS state, massive-MIMO channels can also be described by i.i.d. Rician fading. However, the far-field Rician channel cannot support multiple data streams per UE [37, 38] and is therefore outside the scope of this paper.)

Model 2: The spherical wavefront should be taken into account for ELAA channel modeling. In the NLoS state, (40) should be extended to independent and non-identically distributed (i.n.d.) Rayleigh fading as follows [52]

H_{m,n}=H_{m,n}^{(0)}\triangleq\Bigg(\dfrac{\beta^{(0)}}{d_{m,n}^{\gamma^{(0)}}}\Bigg)\omega_{m,n},

(41)

where $d_{m,n}$ denotes the distance between the $m^{th}$ service-antenna and the $n^{th}$ user antenna; $\beta^{(0)}$ and $\gamma^{(0)}$ represent the NLoS path-loss coefficient and exponent, respectively.

Model 3: Similarly, the ELAA channel in the LoS state is modeled as i.n.d. Rician fading as follows [35]

H_{m,n}=H_{m,n}^{(1)}\triangleq\dfrac{\beta^{(1)}}{d_{m,n}^{\gamma^{(1)}}}\Bigg(\sqrt{\dfrac{\kappa}{\kappa+1}}\varphi_{m,n}+\sqrt{\dfrac{1}{\kappa+1}}\omega_{m,n}\Bigg),

(42)

where $\beta^{(1)}$ and $\gamma^{(1)}$ represent the LoS path-loss coefficient and exponent, respectively; $\kappa$ denotes the Rician K-factor; $\varphi_{m,n}=\exp(-j\frac{2\pi}{\vartheta}d_{m,n})$ denotes the phase of the direct LoS link; and $\vartheta$ denotes the wavelength of the carrier wave.

Model 4: The ELAA channel could allow a mixture of LoS and NLoS links due to the large aperture [36]. Each element of $\mathbf{H}$ in this case can be expressed as follows

H_{m,n}=\epsilon_{m,n}^{(\eta_{m,n})}H_{m,n}^{(\eta_{m,n})},

(43)

where $\eta_{m,n}\in\{0,1\}$ is a binary random variable: $\eta_{m,n}=0$ indicates the NLoS state, in which $H_{m,n}^{(\eta_{m,n})}$ becomes $H_{m,n}^{(0)}$ in (41), whereas $\eta_{m,n}=1$ indicates the LoS state, in which $H_{m,n}^{(\eta_{m,n})}$ becomes $H_{m,n}^{(1)}$ in (42); $\epsilon_{m,n}^{(\eta_{m,n})}$ denotes the shadowing effects. The spatial correlations of LoS/NLoS states and shadowing effects are described by an exponentially decaying window [36]. This channel model can yield computer-simulated data that fit well with real-world measurement data, e.g., [39, 40]. Therefore, we employ this ELAA channel model to conduct computer simulations.

Kronecker Model: Let $\mathbf{\Omega}\in\mathbb{C}^{M\times N}$ be an i.i.d. complex Gaussian matrix, where its $(m,n)$ -th element is denoted by $\omega_{m,n}$ , as defined in (40). The four channel models above can be converted to their spatially correlated versions by replacing $\mathbf{\Omega}$ with $\mathbf{\Omega}_{\text{kron}}$ , as follows [8]

\mathbf{\Omega}_{\text{kron}}=\sqrt{\mathbf{R}_{\textsc{bs}}}\mathbf{\Omega}\sqrt{\mathbf{R}_{\textsc{ue}}},

(44)

where $\mathbf{R}_{\textsc{bs}}\in\mathbb{R}^{M\times M}$ and $\mathbf{R}_{\textsc{ue}}\in\mathbb{R}^{N\times N}$ are both exponential correlation matrices representing the BS and UE side correlations, respectively. In MIMO systems, the minimum distance between two antennas should be $\vartheta/2$ . Therefore, we define the following

\varrho\triangleq r(\vartheta/2),

(45)

where $\varrho$ is the spatial correlation between the two closest antennas. Additionally, when $\varrho=0$ , it means that the small-scale fading of distinct user-to-service antenna links is generated independently.
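The Kronecker construction in (44) can be sketched as follows. Several choices here are assumptions made only for illustration: antennas are uniformly spaced at $\vartheta/2$ so that the exponential correlation between antennas $i$ and $j$ reduces to $\varrho^{|i-j|}$ , the user-side matrix is taken to be block diagonal in line with A1, and the function names are hypothetical.

```python
import numpy as np

def exp_corr_matrix(n, rho_corr):
    """Exponential correlation matrix with entries rho_corr ** |i - j|."""
    idx = np.arange(n)
    return rho_corr ** np.abs(idx[:, None] - idx[None, :])

def psd_sqrt(R):
    """Symmetric square root of a real positive semi-definite matrix."""
    w, Q = np.linalg.eigh(R)
    return (Q * np.sqrt(np.clip(w, 0.0, None))) @ Q.T

def kronecker_fading(M, K, n_ue, rho_corr, rng):
    """Spatially correlated small-scale fading matrix following (44)."""
    N = K * n_ue
    # i.i.d. complex Gaussian matrix with variance 1/M per element, as in (40).
    Omega = (rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N))) / np.sqrt(2 * M)
    R_bs = exp_corr_matrix(M, rho_corr)
    # Assumption: users are uncorrelated with each other (A1), so R_ue is block diagonal.
    R_ue = np.kron(np.eye(K), exp_corr_matrix(n_ue, rho_corr))
    return psd_sqrt(R_bs) @ Omega @ psd_sqrt(R_ue)

Omega_kron = kronecker_fading(M=16, K=4, n_ue=2, rho_corr=0.5, rng=np.random.default_rng(3))
```

With `rho_corr=0.0` both correlation matrices reduce to the identity, which recovers the independent case described above.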

V-B Baselines

The following iterative algorithms are set as baselines for our simulations: GS, SSOR, and L-BFGS. AMP is not used as a baseline because it converges slower than the L-BFGS method and diverges in the ELAA channel. Additionally, it must be modified further to recover the e-signal vector. Due to page limitations, we cannot demonstrate all the iterative algorithms that have been proposed in the last sixty years [16]. However, the baselines in this section are sufficient to demonstrate the advantages of the proposed UW-SVD method.

V-C System Setup and Experiments

The carrier frequency is set to be $3.5$ $\mathrm{GHz}$ . The service array is configured as a uniform linear array (ULA) with spacing at half the wavelength (footnote 6: an exception is Fig. 7, where the service antenna array is configured as a uniform planar array (UPA) with $M=16\times 16$ antennas). The users are deployed parallel to the ULA at a perpendicular distance of $15$ meters. Each user is equipped with $N_{\textsc{ue}}$ antennas spaced at half the wavelength. The maximum distance between two users is set to be $30$ meters. To ensure a fair comparison of different types of channel models, we normalize the channel gain for each UE, i.e., $\|\mathbf{H}_{k}\|^{2}=N_{\textsc{ue}},{\forall k}$ . This normalization does not change intra-user interference, which is the primary focus of this paper. The wireless environment is assumed to be urban-micro street canyon, and the propagation parameters are determined according to the 3rd Generation Partnership Project (3GPP) technical report [53], as follows: $\beta^{(0)}=0.020$ , $\gamma^{(0)}=1.765$ , $\beta^{(1)}=0.007$ , $\gamma^{(1)}=1.050$ , $\kappa=9$ $\mathrm{dB}$ for Model 3, $\kappa\sim\mathcal{LN}(9\ \mathrm{dB},10\ \mathrm{dB})$ for Model 4. The objectives of this section motivate the following three experiments.

Experiment 1: The objective is to demonstrate that the relationship between $\mathrm{cond}(\mathbf{A})$ and $\mathrm{cond}(\mathbf{\Phi})$ is consistent with the theoretical analysis presented in Section IV. The cumulative distribution functions (CDFs) of the condition numbers are shown in Figs. 2 and 3. In these two figures, there are $M=256$ service antennas and $K=8$ UEs, each equipped with $N_{\textsc{ue}}=4$ antennas. In Fig. 2(a), where the channel elements are generated independently, it can be seen that $\mathrm{cond}(\mathbf{\Phi})$ is only slightly smaller than $\mathrm{cond}(\mathbf{A})$ in both Model 1 and Model 2. Note that all the condition numbers in this figure are relatively small, which means that numerous iterative algorithms can achieve fast convergence. However, it is more practical to consider spatial correlations and such results are shown in Figs. 2(b) and 2(c). By comparing these two figures, it can be observed that $\mathbf{\Phi}$ is better conditioned than $\mathbf{A}$ , especially in highly correlated MIMO channels. This implies that the advantage of the proposed UW-SVD method will be more evident when the correlation increases.

Fig. 3 shows the results in the presence of LoS links, which can make the wireless channel more ill-conditioned. In Model 3 ( $\varrho=0$ ), it can be observed that $\mathrm{cond}(\mathbf{A}_{\textsc{zf}})$ is approximately $60$ , meaning that $\mathbf{A}_{\textsc{zf}}$ is ill-conditioned. Moreover, it will become even worse as the spatial correlation becomes higher, e.g., $\mathrm{cond}(\mathbf{A}_{\textsc{zf}})\approx 600$ when $\varrho=0.8$ . In addition, $\mathrm{cond}(\mathbf{A}_{\textsc{lmmse}})$ is smaller than $\mathrm{cond}(\mathbf{A}_{\textsc{zf}})$ due to the regularization term. Similar observations can also be found in Model 4. Moreover, $\mathrm{cond}(\mathbf{A})$ in Model 4 can be well-conditioned with a probability of about $0.2$ . This is because this model allows the mixture of LoS/NLoS links, and the randomly generated channel matrix could be in a fully NLoS state with a certain probability. This also leads to higher CDF fluctuations for $\mathrm{cond}(\mathbf{A})$ in Model 4 than in Model 3. In contrast, the fluctuations of $\mathrm{cond}(\mathbf{\Phi})$ are very small, and the value of $\mathrm{cond}(\mathbf{\Phi})$ is close to that of i.i.d. Rayleigh fading channels. This implies that UW-SVD-assisted iterative methods can maintain consistently fast convergence even in the presence of increased intra-user interference.

Experiment 2: The objective is to demonstrate that the proposed UW-SVD-assisted iterative algorithms converge faster than corresponding existing algorithms in (ELAA-)MIMO systems. In this experiment, four figures (i.e., Fig. 4 - Fig. 7) are presented to highlight the advantages of the proposed UW-SVD method from different perspectives. In Fig. 4, the convergence comparison between different iterative algorithms at high SNRs is shown. It shows the average symbol error rate (SER) over the iterations in Model 1, considering three correlation levels. For the case $\varrho=0$ , it can be seen that the proposed UW-SVD method can slightly accelerate the convergence of existing algorithms. However, it is worth noting that the advantage of the proposed UW-SVD method becomes more apparent as the correlation becomes larger. This is consistent with the numerical results in Experiment 1, and this figure indicates that the proposed UW-SVD method can accelerate the current iterative algorithm in conventional massive MIMO channels.

In Fig. 5, we aim to demonstrate the advantages of UW-SVD at different SNRs, by using SSOR that converges to LMMSE detection performance as an example. Two ELAA channels (i.e., Model 2 and Model 4) are considered in the figure, each with two correlation factors (i.e., $\varrho=0.5$ and $\varrho=0.8$ ). As can be seen in each sub-figure, the advantage of UW-SVD diminishes with decreasing SNR. This is consistent with our theoretical analysis in section IV. However, it is worth noting that UW-SVD-assisted SSOR still converges significantly faster than the original SSOR method even at lower SNRs. For instance, in Fig. 5(b), when the SNR is $16$ $\mathrm{dB}$ , the original SSOR method requires approximately $20$ iterations to converge in Model 2 using $16$ QAM. However, with the assistance of UW-SVD, convergence is achieved in just $4$ iterations under the same system configuration. As shown in Figs. 5(c) and 5(d), the original SSOR method requires tens or even hundreds of iterations to achieve the LMMSE detection performance, even in low SNR scenarios. In contrast, the UW-SVD-assisted SSOR method only requires fewer than $10$ iterations to converge. This figure implies that the proposed UW-SVD method can accelerate the convergence of iterative algorithms at different SNR levels.

In Fig. 6, the objective is to demonstrate the robustness of the proposed UW-SVD method when channel estimation error is considered. Let us consider the conventional LS channel estimation approach, and the estimated channel matrix is given by [54]

\widehat{\mathbf{H}}=\mathbf{H}+\mathbf{Z},

(46)

where $\widehat{\mathbf{H}}$ denotes the estimated channel matrix, and $\mathbf{Z}$ is the AWGN matrix. The ratio (denoted by $\varpi$ ) between the power of the channel and noise elements is set to $10$ , $15$ , and $20$ $\mathrm{dB}$ for the three sub-figures, respectively. The MIMO channel is set to Model 2 with $\varrho=0.2$ , and the modulation scheme is $16$ QAM. It can be observed that the LMMSE detector with channel estimation error can only provide sub-optimal detection performance. Therefore, all the iterative algorithms can only converge to this sub-optimal detection performance. The proposed UW-SVD method consistently accelerates the convergence of the L-BFGS method by a factor of about two, irrespective of the level of channel estimation error. More specifically, the UW-SVD-assisted L-BFGS method converges within $10$ , $7$ , and $5$ iterations for $\varpi=20$ $\mathrm{dB}$ , $15$ $\mathrm{dB}$ , and $10$ $\mathrm{dB}$ , respectively, whereas the original L-BFGS method requires $20$ , $14$ , and $10$ iterations for the same respective levels of $\varpi$ .

In Fig. 7, the objective is to show that UW-SVD can accelerate current iterative algorithms in another type of ELAA, i.e., UPA. The UPA is configured with $M=16\times 16$ antennas. The simulation results depicted in Fig. 7 suggest a performance degradation for the LMMSE detector in UPA compared to its performance in ULA. The reason for this is that UPA antennas are more tightly distributed, resulting in higher spatial correlations. In this figure, we utilize the SSOR method that converges to the LMMSE detection performance to demonstrate the advantages of the proposed UW-SVD method. It is noteworthy that the UW-SVD-assisted SSOR method converges faster than the original SSOR method across different SNR levels. For instance, the original SSOR method necessitates over $50$ iterations to converge, and it requires more than $20$ iterations even at relatively low SNR. Conversely, the SSOR method assisted by UW-SVD achieves convergence in only $4$ iterations at all SNR levels.

Experiment 3: The objective of this experiment is to demonstrate that, with channel coding, the UW-SVD can still significantly accelerate the convergence of current algorithms. Two coding schemes are considered: $1/2$ convolutional code with a codeword length of $200$ bits, and $1/4$ polar code with a codeword length of $1,024$ bits. The decoding schemes are Viterbi decoder and successive cancellation list for convolutional code and polar code, respectively. The modulation schemes are $16$ QAM and $64$ QAM for convolutional and polar codes, respectively. In addition, the performance metric is set to block error rate (BLER) versus Eb/No. As shown in Fig. 8, the performance gap between uncoded and coded systems is approximately $6$ $\mathrm{dB}$ for both channel models. In Fig. 8(a), the UW-SVD-assisted SSOR method converges to the LMMSE detection performance in only $2$ iterations, while the SSOR method requires over $15$ iterations to achieve the same level of convergence. In coded MIMO systems, the improvements achieved by UW-SVD for SSOR remain comparable to those observed in uncoded MIMO systems. Moreover, as shown in Fig. 8(b), UW-SVD-assisted L-BFGS methods can achieve ZF detection performance within three iterations, while the standard L-BFGS algorithm requires over $15$ iterations to achieve the same level of performance. Together with the results in Experiment 2, it can be claimed that UW-SVD can significantly accelerate the convergence of current algorithms by up to ten times, in both uncoded and coded MIMO systems.

VI Conclusion

In this paper, we propose the UW-SVD method to accelerate the convergence of current iterative algorithms for spatially correlated (ELAA-)MIMO channels. The results demonstrate that the UW-SVD-assisted algorithms converge up to ten times faster than the corresponding current algorithms in both coded and uncoded systems. The core principle is to perform SVD on each user’s sub-channel matrix, transforming the original MIMO signal model into an e-signal model. For this e-signal model, we develop e-ZF and e-LMMSE detectors with detection performance proven to be equivalent to ZF and LMMSE detectors for the original model. Crucially, it is shown that the e-channel matrix is significantly better conditioned than the original MIMO channel matrix when channel spatial correlation, non-stationarity, or both are considered. By applying current iterative algorithms to iteratively invert the better-conditioned e-channel matrix, followed by a post-processing step to recover the transmitted signals, remarkable convergence acceleration is achieved.

Appendix A Proof of Theorem 1

According to Property 1, it is straightforward that

\lim\limits_{M\rightarrow\infty}\mathbf{\Sigma}_{k}=\mathbf{I},\quad\forall k.

(47)

Hence, we have

\lim\limits_{M\rightarrow\infty}\mathbf{\Sigma}=\mathrm{diag}(\mathbf{\Sigma}_{1},\ldots,\mathbf{\Sigma}_{K})=\mathbf{I}.

(48)

Plugging (11) into $\mathbf{A}_{\textsc{zf}}$ yields

\mathbf{A}_{\textsc{zf}}=\mathbf{V}\mathbf{\Sigma}\mathbf{\Psi}^{H}\mathbf{\Psi}\mathbf{\Sigma}\mathbf{V}^{H}.

(49)

Plugging (48) into (49) yields

\lim\limits_{M\rightarrow\infty}\mathbf{A}_{\textsc{zf}}=\mathbf{V}\mathbf{\Psi}^{H}\mathbf{\Psi}\mathbf{V}^{H}.

(50)

Plugging $\mathbf{\Phi}_{\textsc{zf}}=\mathbf{\Psi}^{H}\mathbf{\Psi}$ into (50) yields

\lim\limits_{M\rightarrow\infty}\mathbf{A}_{\textsc{zf}}=\mathbf{V}\mathbf{\Phi}_{\textsc{zf}}\mathbf{V}^{H}.

(51)

Given that $\mathbf{V}$ is a unitary matrix, it does not change the condition number of the matrix being multiplied. Hence, (30) in Theorem 1 is proved. Similarly, plugging (11) and (48) into $\mathbf{A}_{\textsc{lmmse}}$ yields

\lim\limits_{M\rightarrow\infty}\mathbf{A}_{\textsc{lmmse}}=\mathbf{V}\mathbf{\Psi}^{H}\mathbf{\Psi}\mathbf{V}^{H}+\rho^{-1}\mathbf{I}=\mathbf{V}(\mathbf{\Psi}^{H}\mathbf{\Psi}+\rho^{-1}\mathbf{I})\mathbf{V}^{H}.

(52)

According to (48), $\mathbf{\Phi}_{\textsc{lmmse}}$ in (18) can be expressed as follows

\lim\limits_{M\rightarrow\infty}\mathbf{\Phi}_{\textsc{lmmse}}=\mathbf{\Psi}^{H}\mathbf{\Psi}+\rho^{-1}\mathbf{I}.

(53)

Plugging (53) into (52) yields

\lim\limits_{M\rightarrow\infty}\mathbf{A}_{\textsc{lmmse}}=\mathbf{V}\mathbf{\Phi}_{\textsc{lmmse}}\mathbf{V}^{H}.

(54)

Together with (51), Theorem 1 is proved.

Appendix B Proof of Lemma 1

According to the assumption A1, the correlation matrix $\mathbf{R}_{\textsc{ue}}$ is a block diagonal matrix. This indicates that $\sqrt{\mathbf{R}_{\textsc{ue}}}$ is also a block diagonal matrix, and it can be expressed as follows

\sqrt{\mathbf{R}_{\textsc{ue}}}=\mathrm{diag}\bigg(\sqrt{\mathbf{R}_{\textsc{ue}}^{1,1}},\dots,\sqrt{\mathbf{R}_{\textsc{ue}}^{K,K}}\bigg).

(55)

Hence, the sub-channel matrix of the $k^{th}$ user can be expressed by $\mathbf{H}_{k}=\mathbf{\Omega}_{k}\sqrt{\mathbf{R}_{\textsc{ue}}^{k,k}}$ , resulting in

\mathbf{H}_{k}^{H}\mathbf{H}_{j}=\sqrt{\mathbf{R}_{\textsc{ue}}^{k,k}}\mathbf{\Omega}_{k}^{H}\mathbf{\Omega}_{j}\sqrt{\mathbf{R}_{\textsc{ue}}^{j,j}},

(56)

where $\mathbf{\Omega}_{k}\in\mathbb{C}^{M\times N_{k}}$ represents the i.i.d. Rayleigh distributed matrix. According to Property 1, we have the following

\lim\limits_{M\rightarrow\infty}\mathbf{\Omega}_{k}^{H}\mathbf{\Omega}_{j}=\mathbf{0},\quad\forall k\neq j.

(57)

Substituting (57) into (56) yields (35), which proves Lemma 1.

Appendix C Proof of Theorem 2

Plugging (9) into $\mathbf{A}_{\textsc{zf}}$ yields

\mathbf{A}_{\textsc{zf}}=[\mathbf{H}_{1}^{H},\dots,\mathbf{H}_{K}^{H}]^{T}[\mathbf{H}_{1},\dots,\mathbf{H}_{K}].

(58)

According to A2, it can be found that all the non-diagonal parts of $\mathbf{A}_{\textsc{zf}}$ are $\mathbf{0}$ . Hence, we have the following

\mathbf{A}_{\textsc{zf}}=\mathrm{diag}(\mathbf{H}_{1}^{H}\mathbf{H}_{1},\dots,% \mathbf{H}_{K}^{H}\mathbf{H}_{K}),

(59)

which indicates that $\mathbf{A}_{\textsc{zf}}$ is a block diagonal matrix. Since the eigenvalues of a block diagonal matrix are the union of the eigenvalues of its blocks, $\mathrm{cond}(\mathbf{A}_{\textsc{zf}})$ cannot be smaller than the condition number of any of its blocks, i.e.,

\mathrm{cond}(\mathbf{A}_{\textsc{zf}})\geq\max_{k}\,\mathrm{cond}(\mathbf{H}_{k}^{H}\mathbf{H}_{k}).

(60)

Since the intra-user channel columns are correlated, it is clear that $\mathrm{cond}(\mathbf{H}_{k}^{H}\mathbf{H}_{k})>1$ , and we have the following

\mathrm{cond}(\mathbf{A}_{\textsc{zf}})>1.

(61)

Also, given A1, performing SVD on $\mathbf{H}_{k}$ and $\mathbf{H}_{j}$ yields

\mathbf{H}_{k}^{H}\mathbf{H}_{j}=\mathbf{V}_{k}\mathbf{\Sigma}_{k}\mathbf{U}_{k}^{H}\mathbf{U}_{j}\mathbf{\Sigma}_{j}\mathbf{V}_{j}^{H}=\mathbf{0},\quad\forall k\neq j.

(62)

Left-multiplying (62) by $\mathbf{\Sigma}_{k}^{-1}\mathbf{V}_{k}^{H}$ and right-multiplying it by $\mathbf{V}_{j}\mathbf{\Sigma}_{j}^{-1}$ yields

\mathbf{U}_{k}^{H}\mathbf{U}_{j}=\mathbf{0},\quad\forall k\neq j.

(63)

Following the same steps as for $\mathbf{A}_{\textsc{zf}}$ in (59), $\mathbf{\Phi}_{\textsc{zf}}$ can be expressed as follows

\mathbf{\Phi}_{\textsc{zf}}=\mathrm{diag}(\mathbf{U}_{1}^{H}\mathbf{U}_{1},\dots,\mathbf{U}_{K}^{H}\mathbf{U}_{K}),

(64)

which indicates that $\mathbf{\Phi}_{\textsc{zf}}=\mathbf{I}$ with condition number $1$, since $\mathbf{U}_{k}^{H}\mathbf{U}_{k}=\mathbf{I}$ for every $k$. Together with (61), (37) in Theorem 2 is therefore obtained.
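The argument above can be checked on a small synthetic example. The sketch below is an illustration under stated assumptions rather than the authors' setup: the user sub-channels are built on mutually orthogonal subspaces so that $\mathbf{H}_{k}^{H}\mathbf{H}_{j}=\mathbf{0}$ holds exactly, intra-user correlation follows an exponential model with a hypothetical coefficient of $0.9$, and $\mathbf{\Phi}_{\textsc{zf}}$ is formed as $\mathbf{U}^{H}\mathbf{U}$ with $\mathbf{U}=[\mathbf{U}_{1},\dots,\mathbf{U}_{K}]$, in line with (64).

```python
import numpy as np

rng = np.random.default_rng(1)
M, K, N = 64, 4, 4  # hypothetical sizes: antennas, users, antennas per user

# Orthonormal basis of C^M; disjoint column groups give orthogonal user
# subspaces, so H_k^H H_j = 0 for k != j holds exactly (idealized A2).
Q, _ = np.linalg.qr(rng.standard_normal((M, M)) + 1j * rng.standard_normal((M, M)))

# Exponential correlation matrix used to correlate the columns of each user.
R = 0.9 ** np.abs(np.subtract.outer(np.arange(N), np.arange(N)))
L = np.linalg.cholesky(R)

H_blocks, U_blocks = [], []
for k in range(K):
    B_k = Q[:, k * N:(k + 1) * N]                      # basis of user k's subspace
    G_k = rng.standard_normal((N, N)) + 1j * rng.standard_normal((N, N))
    H_k = B_k @ G_k @ L.conj().T                       # correlated intra-user columns
    U_k, _, _ = np.linalg.svd(H_k, full_matrices=False)  # user-wise SVD
    H_blocks.append(H_k)
    U_blocks.append(U_k)

H = np.hstack(H_blocks)
U = np.hstack(U_blocks)
A_zf = H.conj().T @ H
Phi_zf = U.conj().T @ U

cond_A = np.linalg.cond(A_zf)
cond_Phi = np.linalg.cond(Phi_zf)
print(f"cond(A_zf)   = {cond_A:.2f}")
print(f"cond(Phi_zf) = {cond_Phi:.6f}")
```

In this idealized setting $\mathbf{\Phi}_{\textsc{zf}}$ equals the identity up to rounding error, while $\mathbf{A}_{\textsc{zf}}$ stays ill-conditioned because of the intra-user correlation. With finite $M$ and random user subspaces, A2 holds only approximately, so the gap would be smaller.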

Appendix D Proof of Theorem 3

According to (3), $\mathbf{A}_{\textsc{lmmse}}$ can be expressed as follows

\mathbf{A}_{\textsc{lmmse}}=\mathbf{A}_{\textsc{zf}}+\rho^{-1}\mathbf{I}.

(65)

Therefore, $\mathrm{cond}(\mathbf{A}_{\textsc{lmmse}})$ can be expressed as follows

\mathrm{cond}(\mathbf{A}_{\textsc{lmmse}})=\Bigg{(}\dfrac{\lambda_{\text{max}}(\mathbf{A}_{\textsc{zf}})+\rho^{-1}}{\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})+\rho^{-1}}\Bigg{)}.

(66)

According to (18), $\mathbf{\Phi}_{\textsc{lmmse}}$ can be expressed as follows

\mathbf{\Phi}_{\textsc{lmmse}}=\mathbf{\Phi}_{\textsc{zf}}+\rho^{-1}\mathbf{\Sigma}^{-2}.

(67)

Since (64) reduces to $\mathbf{\Phi}_{\textsc{zf}}=\mathbf{I}$ and $\mathbf{\Sigma}^{-2}$ is diagonal, we have the following

\mathrm{cond}(\mathbf{\Phi}_{\textsc{lmmse}})=\Bigg{(}\dfrac{1+\rho^{-1}\lambda_{\text{max}}(\mathbf{\Sigma}^{-2})}{1+\rho^{-1}\lambda_{\text{min}}(\mathbf{\Sigma}^{-2})}\Bigg{)}.

(68)

According to (59), $\mathbf{A}_{\textsc{zf}}$ is a block diagonal matrix. Moreover, $\mathbf{\Sigma}$ contains the singular values of $\mathbf{H}_{k},\forall k$ , so that $\mathbf{\Sigma}^{2}$ contains the eigenvalues of every block in $\mathbf{A}_{\textsc{zf}}$ . Hence, we have the following

\lambda_{\text{max}}(\mathbf{\Sigma}^{-2})=\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})^{-1};

(69)

\lambda_{\text{min}}(\mathbf{\Sigma}^{-2})=\lambda_{\text{max}}(\mathbf{A}_{\textsc{zf}})^{-1}.

(70)

Plugging (69) and (70) into (68) and simplifying yields

\mathrm{cond}(\mathbf{\Phi}_{\textsc{lmmse}})=\Bigg{(}\dfrac{\lambda_{\text{max}}(\mathbf{A}_{\textsc{zf}})}{\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})}\Bigg{)}\Bigg{(}\dfrac{\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})+\rho^{-1}}{\lambda_{\text{max}}(\mathbf{A}_{\textsc{zf}})+\rho^{-1}}\Bigg{)}.

(71)

The first factor in (71) is $\mathrm{cond}(\mathbf{A}_{\textsc{zf}})$, and according to (66) the second factor is the reciprocal of $\mathrm{cond}(\mathbf{A}_{\textsc{lmmse}})$. Hence, (71) can be expressed as follows

\mathrm{cond}(\mathbf{\Phi}_{\textsc{lmmse}})=\dfrac{\mathrm{cond}(\mathbf{A}_{\textsc{zf}})}{\mathrm{cond}(\mathbf{A}_{\textsc{lmmse}})}.

(72)

To obtain the condition under which (38) in Theorem 3 holds, we plug (72) into (38), which yields

\mathrm{cond}(\mathbf{A}_{\textsc{zf}})<\mathrm{cond}^{2}(\mathbf{A}_{\textsc{lmmse}}).

(73)

Plugging (66) into (73) yields

\Bigg{(}\dfrac{\lambda_{\text{max}}(\mathbf{A}_{\textsc{zf}})}{\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})}\Bigg{)}<\Bigg{(}\dfrac{\lambda_{\text{max}}(\mathbf{A}_{\textsc{zf}})+\rho^{-1}}{\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})+\rho^{-1}}\Bigg{)}^{2}.

(74)

After simplification, this inequality holds if

\rho>\dfrac{1}{\sqrt{\lambda_{\text{max}}(\mathbf{A}_{\textsc{zf}})\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})}}.

(75)
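The simplification from (74) to (75) can be spelled out as follows. The shorthand $a=\lambda_{\text{max}}(\mathbf{A}_{\textsc{zf}})$, $b=\lambda_{\text{min}}(\mathbf{A}_{\textsc{zf}})$ and $c=\rho^{-1}$ is introduced here only for readability, and $a>b>0$, $c>0$ is assumed (the strict inequality $a>b$ follows from (61)).

```latex
\begin{align*}
\frac{a}{b} < \frac{(a+c)^{2}}{(b+c)^{2}}
&\iff a(b+c)^{2} < b(a+c)^{2} \\
&\iff ab^{2} + ac^{2} < a^{2}b + bc^{2} \\
&\iff (a-b)\,c^{2} < ab\,(a-b) \\
&\iff c^{2} < ab \\
&\iff \rho > \frac{1}{\sqrt{ab}}.
\end{align*}
```

The term $2abc$ cancels on both sides in the second step, and dividing by $a-b>0$ in the fourth step preserves the direction of the inequality.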

Given $\sigma_{x}^{2}=1$, we have $\rho=1/\sigma_{z}^{2}$. Plugging this into (75) yields (39) in Theorem 3.
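The identity (72) and the threshold (75) can also be verified numerically. The sketch below is a minimal check under the block-diagonal idealization of (59), not a reproduction of the paper's simulations: the eigenvalues of $\mathbf{A}_{\textsc{zf}}$ are hypothetical values chosen for illustration, and the helper name `condition_numbers` is introduced here.

```python
import numpy as np

def condition_numbers(eigs_zf, rho):
    """Return cond(A_zf), cond(A_lmmse), cond(Phi_lmmse) from the eigenvalues of A_zf."""
    lam_max, lam_min = np.max(eigs_zf), np.min(eigs_zf)
    cond_zf = lam_max / lam_min
    # (66): adding rho^{-1} I shifts every eigenvalue of A_zf by rho^{-1}.
    cond_lmmse = (lam_max + 1.0 / rho) / (lam_min + 1.0 / rho)
    # (68) with (69) and (70): eigenvalues of Phi_lmmse are 1 + rho^{-1} / lambda.
    cond_phi = (1.0 + 1.0 / (rho * lam_min)) / (1.0 + 1.0 / (rho * lam_max))
    return cond_zf, cond_lmmse, cond_phi

eigs = np.array([0.05, 0.4, 1.3, 2.0])  # hypothetical eigenvalues of A_zf
rho_threshold = 1.0 / np.sqrt(eigs.max() * eigs.min())  # right-hand side of (75)

results = {}
for factor in (0.5, 2.0, 100.0):
    rho = factor * rho_threshold
    cond_zf, cond_lmmse, cond_phi = condition_numbers(eigs, rho)
    results[factor] = (rho, cond_zf, cond_lmmse, cond_phi)
    print(f"rho = {rho:8.3f}: cond(A_lmmse) = {cond_lmmse:7.3f}, "
          f"cond(Phi_lmmse) = {cond_phi:7.3f}")
```

Below the threshold, the e-channel matrix is the worse conditioned of the two; above it, the ordering flips, which is the regime covered by Theorem 3.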

References

[1] J. Liu, Y. Ma, and R. Tafazolli, “Leveraging user-wise SVD for accelerated convergence in iterative ELAA-MIMO detections,” in Proc. IEEE 24th Int. Workshop Signal Process. Advances Wireless Commun. (SPAWC), 2023.
[2] E. Björnson, J. Hoydis, and L. Sanguinetti, “Massive MIMO networks: Spectral, energy, and hardware efficiency,” Foundations and Trends® Signal Process., vol. 11, no. 3-4, pp. 154–655, Nov. 2017.
[3] R. Ji, S. Chen, C. Huang, J. Yang, W. E. I. Sha, Z. Zhang, C. Yuen, and M. Debbah, “Extra DoF of near-field holographic MIMO communications leveraging evanescent waves,” IEEE Wireless Commun. Lett., vol. 12, no. 4, pp. 580–584, Jan. 2023.
[4] M. Cui, Z. Wu, Y. Lu, X. Wei, and L. Dai, “Near-field MIMO communications for 6G: Fundamentals, challenges, potentials, and future directions,” IEEE Commun. Mag., vol. 61, no. 1, pp. 40–46, Jan. 2023.
[5] C. Ouyang, Y. Liu, X. Zhang, and L. Hanzo, “Near-field communications: A degree-of-freedom perspective,” arXiv: 2308.00362, Aug. 2023.
[6] Z. Wu and L. Dai, “Multiple access for near-field communications: SDMA or LDMA?” IEEE J. Sel. Areas Commun., vol. 41, no. 6, pp. 1918–1935, Jun. 2023.
[7] D. Dardari, “Communicating with large intelligent surfaces: Fundamental limits and models,” IEEE J. Sel. Areas Commun., vol. 38, no. 11, pp. 2526–2537, Jul. 2020.
[8] S. Loyka, “Channel capacity of MIMO architecture using the exponential correlation matrix,” IEEE Commun. Lett., vol. 5, no. 9, pp. 369–371, Sep. 2001.
[9] A. Elzanaty, J. Liu, A. Guerra, F. Guidi, Y. Ma, and R. Tafazolli, “Near and far field model mismatch: Implications on 6G communications, localization, and sensing,” arXiv:2310.06604, Oct. 2023.
[10] E. Björnson, L. Sanguinetti, H. Wymeersch, J. Hoydis, and T. L. Marzetta, “Massive MIMO is a reality–What is next? Five promising research directions for antenna arrays,” Digit. Signal Process., vol. 94, pp. 3–20, Nov. 2019.
[11] M.-X. Chang and W.-Y. Chang, “Maximum-likelihood detection for MIMO systems based on differential metrics,” IEEE Trans. Signal Process., vol. 65, no. 14, pp. 3718–3732, Jul. 2017.
[12] Z. Wang, R. M. Gower, Y. Xia, L. He, and Y. Huang, “Randomized iterative methods for low-complexity large-scale MIMO detection,” IEEE Trans. Signal Process., vol. 70, pp. 2934–2949, Jun. 2022.
[13] Z. Wang, W. Xu, Y. Xia, Q. Shi, and Y. Huang, “A new randomized iterative detection algorithm for uplink large-scale MIMO systems,” IEEE Trans. Commun., vol. 71, no. 9, pp. 5093–5107, Apr. 2023.
[14] X. Gao, L. Dai, Y. Ma, and Z. Wang, “Low-complexity near-optimal signal detection for uplink large-scale MIMO systems,” Electron. Lett., vol. 50, no. 18, pp. 1326–1328, Aug. 2014.
[15] D. Zhu, B. Li, and P. Liang, “On the matrix inversion approximation based on Neumann series in massive MIMO systems,” in Proc. IEEE Int. Conf. Commun. (ICC), 2015, pp. 1763–1769.
[16] S. Yang and L. Hanzo, “Fifty years of MIMO detection: The road to large-scale MIMOs,” IEEE Commun. Surveys Tuts., vol. 17, no. 4, pp. 1941–1988, 4th Quart. 2015.
[17] M. A. Albreem, M. Juntti, and S. Shahabuddin, “Massive MIMO detection techniques: A survey,” IEEE Commun. Surveys Tuts., vol. 21, no. 4, pp. 3109–3132, 4th Quart. 2019.
[18] O. Axelsson, “A survey of preconditioned iterative methods for linear systems of algebraic equations,” BIT Numer. Math., vol. 25, pp. 165–187, Mar. 1985.
[19] L. Li and J. Hu, “An efficient linear detection scheme based on L-BFGS method for massive MIMO systems,” IEEE Commun. Lett., vol. 26, no. 1, pp. 138–142, Oct. 2022.
[20] X. Qin, Z. Yan, and G. He, “A near-optimal detection scheme based on joint steepest descent and Jacobi method for uplink massive MIMO systems,” IEEE Commun. Lett., vol. 20, no. 2, pp. 276–279, Feb. 2016.
[21] B. Yin, M. Wu, J. R. Cavallaro, and C. Studer, “Conjugate gradient-based soft-output detection and precoding in massive MIMO systems,” in Proc. IEEE Global Commun. Conf. (GLOBECOM), 2014, pp. 3696–3701.
[22] L. Liu, G. Peng, P. Wang, S. Zhou, Q. Wei, S. Yin, and S. Wei, “Energy- and area-efficient recursive-conjugate-gradient-based MMSE detector for massive MIMO systems,” IEEE Trans. Signal Process., vol. 68, pp. 573–588, Jan. 2020.
[23] J. Wang, Y. Ma, N. Yi, and R. Tafazolli, “Sherman-Morrison regularization for ELAA iterative linear precoding,” in Proc. IEEE Int. Conf. Commun. (ICC), 2023, pp. 3546–3552.
[24] L. Li and J. Hu, “Fast-converging and low-complexity linear massive MIMO detection with L-BFGS method,” IEEE Trans. Veh. Technol., vol. 71, no. 10, pp. 10 656–10 665, Oct. 2022.
[25] D. L. Donoho, A. Maleki, and A. Montanari, “Message-passing algorithms for compressed sensing,” Proc. Natl. Acad. Sci., vol. 106, no. 45, pp. 18 914–18 919, 2009.
[26] S. Lyu and C. Ling, “Hybrid vector perturbation precoding: The blessing of approximate message passing,” IEEE Trans. Signal Process., vol. 67, no. 1, pp. 178–193, Oct. 2019.
[27] J. Liu, Y. Ma, and R. Tafazolli, “Alternative normalized-preconditioning for scalable iterative large-MIMO detection,” in Proc. IEEE Global Commun. Conf. (GLOBECOM), 2023, pp. 2924–2929.
[28] J. Ma and L. Ping, “Orthogonal AMP,” IEEE Access, vol. 5, pp. 2020–2033, Mar. 2017.
[29] S. Rangan, P. Schniter, and A. K. Fletcher, “Vector approximate message passing,” IEEE Trans. Inf. Theory, vol. 65, no. 10, pp. 6664–6684, May 2019.
[30] H. He, C.-K. Wen, S. Jin, and G. Y. Li, “A model-driven deep learning network for MIMO detection,” in Proc. IEEE Global Conf. Signal Inf. Process. (GlobalSIP), 2018, pp. 584–588.
[31] B. Y. Kong and I.-C. Park, “Low-complexity symbol detection for massive MIMO uplink based on Jacobi method,” in Proc. IEEE 27th Annu. Int. Symp. Personal, Indoor, Mobile Radio Commun. (PIMRC), 2016, pp. 1–5.
[32] C. Zhang, Z. Wu, C. Studer, Z. Zhang, and X. You, “Efficient soft-output Gauss–Seidel data detector for massive MIMO systems,” IEEE Trans. Circuits Syst. I, vol. 68, no. 12, pp. 5049–5060, Dec. 2021.
[33] T. Xie, L. Dai, X. Gao, X. Dai, and Y. Zhao, “Low-complexity SSOR-based precoding for massive MIMO systems,” IEEE Commun. Lett., vol. 20, no. 4, pp. 744–747, Apr. 2016.
[34] J. Nam, G. Caire, and J. Ha, “On the role of transmit correlation diversity in multiuser MIMO systems,” IEEE Trans. Inf. Theory, vol. 63, no. 1, pp. 336–354, Oct. 2017.
[35] J. Liu, Y. Ma, J. Wang, N. Yi, R. Tafazolli, S. Xue, and F. Wang, “A non-stationary channel model with correlated NLoS/LoS states for ELAA-mMIMO,” in Proc. IEEE Global Commun. Conf. (GLOBECOM), 2021, pp. 1–6.
[36] J. Liu, Y. Ma, and R. Tafazolli, “A spatially non-stationary fading channel model for simulation and (semi-) analytical study of ELAA-MIMO,” IEEE Trans. Wireless Commun., vol. 23, no. 5, pp. 5203–5218, May 2024.
[37] Y. Zhang, C. You, L. Chen, and B. Zheng, “Mixed near- and far-field communications for extremely large-scale array: An interference perspective,” IEEE Commun. Lett., vol. 27, no. 9, pp. 2496–2500, Jul. 2023.
[38] Y. Lu and L. Dai, “Near-field channel estimation in mixed LoS/NLoS environments for extremely large-scale MIMO systems,” IEEE Trans. Commun., vol. 71, no. 6, pp. 3694–3707, Jun. 2023.
[39] Á. O. Martínez, E. De Carvalho, and J. Ø. Nielsen, “Towards very large aperture massive MIMO: A measurement based study,” in Proc. IEEE GLOBECOM Workshops (GC Wkshps), 2014, pp. 281–286.
[40] P. Harris, S. Zhang, M. A. Beach, E. Mellios, A. R. Nix, S. Armour, A. Doufexi, K. Nieman, and N. Kundargi, “LOS throughput measurements in real-time with a 128-antenna massive MIMO testbed,” in Proc. IEEE Global Commun. Conf. (GLOBECOM), 2016, pp. 1–7.
[41] Y. Yuan, C. Wang, C. Li, Z. Zhong, W. Han, and C.-X. Wang, “Spatial correlations of measured MIMO channels with an extremely large aperture array (ELAA),” in Proc. IEEE 95th Veh. Technol. Conf. (VTC), 2022, pp. 1–5.
[42] A. Karstensen, J. O. Nielsen, P. C. F. Eggers, E. De Carvalho, G. F. Pedersen, M. Alm, and G. Steinböck, “Multiuser spatial consistency analysis of outdoor massive-MIMO measurements,” IEEE Trans. Antennas Propag., vol. 70, no. 1, pp. 680–691, Jan. 2022.
[43] H. Lou, M. Ghosh, P. Xia, and R. Olesen, “A comparison of implicit and explicit channel feedback methods for MU-MIMO WLAN systems,” in Proc. IEEE 24th Annu. Int. Symp. Personal, Indoor, Mobile Radio Commun. (PIMRC), 2013, pp. 419–424.
[44] A. Li and C. Masouros, “Hybrid precoding and combining design for millimeter-wave multi-user MIMO based on SVD,” in Proc. IEEE Int. Conf. Commun. (ICC), 2017, pp. 1–6.
[45] L. Khamidullina, A. L. F. de Almeida, and M. Haardt, “Multilinear generalized singular value decomposition (ML-GSVD) and its application to multiuser MIMO systems,” IEEE Trans. Signal Process., vol. 70, pp. 2783–2797, May 2022.
[46] J. Liu, Y. Ma, A. Elzanaty, and R. Tafazolli, “Near-field fading channel modeling for ELAAs: From communication to ISAC,” arXiv:2401.17014, Jan. 2024.
[47] Z. H. Shaik and E. G. Larsson, “Distributed signal processing for out-of-system interference suppression in cell-free massive MIMO,” in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2023, pp. 1–5.
[48] L. Nazareth, “A relationship between the BFGS and conjugate gradient algorithms and its implications for new algorithms,” SIAM J. Numer. Anal., vol. 16, no. 5, pp. 794–800, 1979.
[49] G. H. Golub and C. F. Van Loan, Matrix Computations, 3rd ed. The Johns Hopkins University Press, 1996.
[50] H. Q. Ngo, E. G. Larsson, and T. L. Marzetta, “Aspects of favorable propagation in massive MIMO,” in Proc. 22nd Eur. Signal Process. Conf. (EUSIPCO), 2014, pp. 76–80.
[51] L. Sanguinetti, E. Björnson, and J. Hoydis, “Toward massive MIMO 2.0: Understanding spatial correlation, interference suppression, and pilot contamination,” IEEE Trans. Commun., vol. 68, no. 1, pp. 232–257, Jan. 2020.
[52] A. Amiri, M. Angjelichinoski, E. De Carvalho, and R. W. Heath, “Extremely large aperture massive MIMO: Low complexity receiver architectures,” in Proc. IEEE GLOBECOM Workshops (GC Wkshps), 2018, pp. 1–6.
[53] 3GPP, “Study on channel model for frequencies from 0.5 to 100 GHz,” 3rd Generation Partnership Project (3GPP), Technical Report (TR) 38.901, Mar. 2022, version 17.0.0.
[54] D. Tse and P. Viswanath, Fundamentals of Wireless Communication. Cambridge University Press, 2015.