Abstract

In this article, principal component analysis, a method widely applied to image compression and feature extraction, is introduced to reduce the dimension of the input feature variables of support vector regression, and a method for the joint estimation of near-field angle and range based on principal component analysis dimension reduction is proposed. Signal-to-noise ratio and computational load are the decisive factors affecting the performance of the algorithm. Principal component analysis fuses the main characteristics of the training data and discards redundant information, so the signal-to-noise ratio is improved and the computational load is reduced accordingly. When support vector regression is used to model the signal, the upper triangular elements of the signal covariance matrix are usually used as input features. Because the covariance matrix has many upper triangular elements, using all of them as input features slows training to some extent. Here, principal component analysis is used to reduce the dimensionality of the upper triangular elements of the covariance matrix of the known signals, and the reduced features are used as the input of a multi-output support vector regression machine to construct the near-field parameter estimation model, from which the parameters of unknown signals are estimated. Simulation results show that this method has high estimation accuracy and training speed, adapts well to low signal-to-noise ratios, and performs better than the back-propagation neural network algorithm and the two-step multiple signal classification algorithm.

Keywords

Dimensionality reduction, principal component analysis, support vector regression machine, near-field source

Introduction

The estimation of direction of arrival (DOA) has been widely used in many research fields, such as passive location, sonar array direction finding, seismic and geological resource detection, and mobile communication. The traditional DOA estimation algorithms include the maximum likelihood method,¹ the propagation operator method,² the multiple signal classification (MUSIC) algorithm,³ the estimating signal parameter via rotational invariance techniques (ESPRIT) algorithm,⁴ and other algorithms.^5,6

The traditional spatial spectrum estimation algorithm usually assumes that the signal source is located in the far field of the array, that is, the source is far enough from the array that the spherical wavefront of the radiated signal can be approximated as a plane wavefront at the receiving array. However, when the source is closer to the array, the curvature of the wavefront across the array aperture cannot be ignored. In this case, the location of the near-field source needs to be described jointly by the DOA and range parameters. Therefore, high-resolution direction-finding algorithms based on the far-field hypothesis cannot be directly applied to the near-field case. Near-field source parameter estimation has become a hot issue because of its broad application and engineering applicability, and many DOA estimation algorithms suitable for near-field source scenarios have been proposed.^7–9 In terms of the MUSIC algorithm, Huang and Barkat¹⁰ extended the traditional far-field MUSIC method to the near-field source; this method requires singular value decomposition of a multidimensional matrix and a spectral peak search, so it is very computationally intensive. Starer and Nehorai¹¹ proposed an improved MUSIC algorithm based on the path tracking method, in which the 2-dimensional (2-D) search problem of the near-field MUSIC algorithm is converted into a 1-dimensional (1-D) search problem and the 2-D parameters of the source are estimated iteratively. Challa and Shamsunder¹² proposed a method based on high-order cumulants for locating near-field signal sources, which showed superior performance; however, this kind of method needs to calculate the cumulants. Zhang et al.¹³ proposed a reduced-dimension MUSIC algorithm based on the directional matrix split method, in which the estimation problem is converted into the optimization of a reduced-dimension spectrum function and the spectral search is only involved in the angle domain. The advantage of the traditional MUSIC algorithm is its easy implementation and high resolution at a high signal-to-noise ratio (SNR), but its disadvantage is that it is inefficient and computationally intensive at a low SNR.

In recent years, intelligent algorithms have been developed, such as the neural network algorithm,^14–16 the support vector machine (SVM) algorithm,^17,18 and other artificial intelligence algorithms.^19,20 The SVM is a machine learning method developed by Vapnik,²¹ which is based on the Vapnik–Chervonenkis (VC) dimension theory of statistical learning theory and the structural risk minimization principle. The SVM has been successfully applied in the design of spread spectrum receivers, speech recognition, image processing, regression problems, and other fields. Machine learning establishes the relationship between the input and output of the model through training data, so it is not affected by array error and other factors and has good robustness. Studies on the application of SVM to near-field sources have rarely been reported so far, and compressed sensing and other methods adopt step-by-step estimation because of the computational complexity.

The near-field source signal received by a linear array depends on two parameters, range and angle, so the computational load of direct parameter estimation is too large. Most near-field subspace-based parameter estimation methods need to decouple range and angle under a specific array arrangement and certain approximate conditions. The support vector regression (SVR) parameter estimation algorithm is usually a one-parameter regression. When multiple near-field source signals are incident on the receiving array at the same time, SVM regression becomes the regression of multiple 2-D parameters, and the algorithm becomes more complex. Therefore, research on SVM regression algorithms for near-field sources is rarely reported. The estimation accuracy of the parameters and the generalization performance of the algorithm depend strongly on the training process. To improve both, a very large training data set is needed, which greatly increases the amount of data in the training process. There is therefore an urgent need to find a way to reduce the amount of computation. Principal component analysis (PCA) is well suited to this purpose. As a basic mathematical analysis method, PCA is often used in face recognition, image compression, feature extraction, and other fields. The advantages of the PCA method are data compression, dimensionality reduction of multidimensional data, and noise reduction, which reduce redundancy and overfitting, and the operation is simple and free of parameter restrictions. In this article, the PCA method is introduced into the SVM regression algorithm for near-field parameter estimation, and PCA is used to reduce the number of input features of the SVM regression algorithm and to reduce noise.

This article uses an SVR method to model the signal. In most of the literature, the upper triangular elements of the covariance matrix of the received signal are used as the input to the SVR machine. However, the number of upper triangular elements in the covariance matrix is large, resulting in too large an input feature dimension, too long training and testing times, and too high algorithm complexity. In this article, a method of near-field acoustic source localization based on PCA and SVR is presented. The upper triangular elements of the covariance matrix of the received data are extracted first, feature extraction and dimension reduction are then realized through PCA, and fewer features than in the original feature set are obtained. The dimension-reduced features are used as the input features of SVR, the incident angle and range are used as the output for training, and the function relating the input and output is thus obtained. Simulation results show that this method has high estimation accuracy and practical computational speed, adapts well to low SNR, and achieves better prediction precision than the traditional method.

Array structure and receiving data model

In this article, the scalar sound pressure sensor array is adopted as the data-receiving model of the near-field sound source. The array element is a spatial omnidirectional sensor uniformly distributed on the x-axis. The number of array elements is M, and the spacing between adjacent elements is d. The schematic diagram of array structure is shown in Figure 1.

Figure 1.

Schematic diagram of array structure.

Suppose that K narrow-band, non-Gaussian, independent stationary signals impinge on the above array from the near field. Let the array element at the origin of coordinates (0,0) be the array reference element, and let $r_{k}$ and $θ_{k}$ denote the range and the elevation angle of the kth source with respect to the reference array element, respectively.

At the sampling time t, the output of the mth element on the uniform linear array can be expressed as

$x_{m} (t) = \sum_{k = 1}^{K} s_{k} (t) e^{\tilde{j} τ_{mk}} + n_{m} (t)$ (1)

where $s_{k} (t)$ is the kth source, $n_{m} (t)$ is the noise on the mth element, $τ_{mk}$ represents the propagation phase difference for the kth signal arriving at the mth element with respect to the reference element of the array, $\tilde{j}$ denotes the imaginary unit, $1 \leq m \leq M$ .

In the near-field case, there exists an approximate relationship

$\cos θ_{mk} \approx \cos θ_{k}, r_{mk} \approx r_{k}$ (2)

After Fresnel approximation is adopted, $τ_{mk}$ can be expressed as⁸

$τ_{mk} = 2 π \frac{r_{mk} - r_{k}}{λ_{k}} \approx μ_{k} m + φ_{k} m^{2}$ (3)

Thus, the received signal on the mth element can be obtained as follows

$x_{m} (t) = \sum_{k = 1}^{K} s_{k} (t) e^{\tilde{j} (μ_{k} m + m^{2} φ_{k})} + n_{m} (t)$ (4)

where $μ_{k} = - 2 π d \sin θ_{k} / λ_{k}$, $φ_{k} = π d^{2} \cos^{2} θ_{k} / (λ_{k} r_{k})$, $λ_{k}$ is the kth signal wavelength, $θ_{mk}$ represents the angle between the kth signal source incident on the mth element and the y-axis, and $r_{mk}$ is the range from the kth signal to the mth element.
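For reference, the Fresnel approximation in equation (3) can be recovered from the array geometry. The following is a sketch under two assumptions that the text does not write out explicitly: the mth element sits at position $md$ on the x-axis, and $θ_{k}$ is measured from the y-axis, so that the law of cosines gives the exact range

$r_{mk} = \sqrt{r_{k}^{2} + (md)^{2} - 2 r_{k} m d \sin θ_{k}}$

Expanding the square root to second order in $md / r_{k}$ gives

$r_{mk} - r_{k} \approx - m d \sin θ_{k} + \frac{(md)^{2} \cos^{2} θ_{k}}{2 r_{k}}$

and multiplying by $2 π / λ_{k}$ yields

$τ_{mk} \approx \left( - \frac{2 π d \sin θ_{k}}{λ_{k}} \right) m + \left( \frac{π d^{2} \cos^{2} θ_{k}}{λ_{k} r_{k}} \right) m^{2}$

which matches the linear coefficient $μ_{k}$ and the quadratic coefficient $φ_{k}$ of equation (3).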

Equation (4) can be written in matrix form as

$X (t) = AS (t) + N (t)$ (5)

where $X (t) = {[x_{1} (t), x_{2} (t), \dots, x_{m} (t), \dots, x_{M} (t)]}^{T}$ , $A = [a_{1} (θ_{1}, r_{1}), \dots, a_{k} (θ_{k}, r_{k}), \dots, a_{K} (θ_{K}, r_{K})]$ , $a_{k} (θ_{k}, r_{k}) = {[1, e^{\tilde{j} (μ_{k} + φ_{k})}, \dots, e^{\tilde{j} (m μ_{k} + m^{2} φ_{k})}, \dots, e^{\tilde{j} (M μ_{k} + M^{2} φ_{k})}]}^{T}$ , $S (t) = {[s_{1} (t), \dots, s_{k} (t), \dots, s_{K} (t)]}^{T}$ , $N (t) = {[n_{1} (t), \dots, n_{m} (t), \dots, n_{M} (t)]}^{T}$ .

The signal is sampled, and the number of snapshots is N. The covariance matrix of the signal is

$R_{s} = \frac{1}{N} \sum_{n = 1}^{N} X (n) X^{H} (n)$ (6)

where $X^{H}$ is the conjugate transpose of the matrix $X$ .
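The data model of equations (4) to (6) can be illustrated with a short NumPy sketch. It is only a minimal illustration, not the authors' simulation code: the function names, the element indexing from 0 (reference element first), the random-phase source model, and the example parameter values are all assumptions made here.

```python
import numpy as np

def steering_vector(theta, r, wavelength, d, M):
    """Near-field steering vector using the Fresnel phase of equation (3)."""
    # Assumption: element index m runs 0..M-1, with m = 0 the reference element.
    m = np.arange(M)
    mu = -2.0 * np.pi * d * np.sin(theta) / wavelength
    phi = np.pi * d**2 * np.cos(theta) ** 2 / (wavelength * r)
    return np.exp(1j * (mu * m + phi * m**2))

def simulate_snapshots(thetas, ranges, wavelengths, d, M, N, snr_db, rng):
    """Generate X = A S + noise as in equation (5); returns an (M, N) array."""
    A = np.column_stack([
        steering_vector(t, r, lam, d, M)
        for t, r, lam in zip(thetas, ranges, wavelengths)
    ])
    K = A.shape[1]
    # Assumption: unit-power sources with random phase (a simple non-Gaussian choice).
    S = np.exp(2j * np.pi * rng.random((K, N)))
    noise_power = 10.0 ** (-snr_db / 10.0)
    noise = np.sqrt(noise_power / 2.0) * (
        rng.standard_normal((M, N)) + 1j * rng.standard_normal((M, N))
    )
    return A @ S + noise

def sample_covariance(X):
    """Sample covariance of equation (6), averaged over the N snapshots (columns)."""
    return X @ X.conj().T / X.shape[1]

# Hypothetical example: two sources, eight elements, shortest wavelength normalised to 1.
rng = np.random.default_rng(0)
X = simulate_snapshots(
    thetas=np.deg2rad([-20.0, 35.0]), ranges=[2.5, 2.8], wavelengths=[1.0, 1.25],
    d=0.25, M=8, N=1024, snr_db=10.0, rng=rng,
)
R_s = sample_covariance(X)
```

The resulting `R_s` is an 8 × 8 Hermitian matrix, which is the quantity whose upper triangular elements serve as regression features later in the article.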

Principle of PCA-MSVR algorithm

Multi-output support vector regression algorithm

In order to solve the problem of regression estimation of multiple variables, multi-output support vector regression (MSVR) proposed by Pérez-Cruz et al.²² is the promotion of standard SVR. This article is just a brief introduction to MSVR; for more details, please refer to the literature.^23–25

In this article, the upper triangular elements of the covariance matrix $R_{s}$ are used to construct the matrix $R$, and the sample input feature $R$ and $y = [y_{1}, \dots, y_{i}, \dots, y_{Z}]$ are used as the input of the SVR machine, where $y$ is the set of arrival angles and ranges of the near-field source signals. Assume that the given sample data are $D_{R''', y} = \{(R_{i}, y_{i}) \mid i = 1, 2, \dots, Z\}$, $R_{i} \in R^{M}$, $y_{i} \in R^{N}$, where Z is the number of samples and $y_{i} = [θ_{i1}, r_{i1}, θ_{i2}, r_{i2}, \dots, θ_{ik}, r_{ik}, \dots, θ_{iK}, r_{iK}]$. The input data $R_{i}$ are mapped to a high-dimensional space, giving the mapped data $ϕ (R_{i}) \in R^{f}$. The signal model built by MSVR is ${\tilde{y}}_{i} = ϕ (R_{i}) W + b$. The problem MSVR needs to solve is how to select $W, b$ to minimize the error of the regression result ${\tilde{y}}_{i}$, where $W = [w^{1}, w^{2}, \dots, w^{j}, \dots, w^{N}]$ is an $f \times N$ matrix, $w^{j}$ is a column vector, $b = [b^{1}, b^{2}, \dots, b^{j}, \dots, b^{N}]$ is a row vector, $b^{j}$ is a constant, and $1 \leq j \leq N$. In Sanchez-Fernandez et al.,²⁴ the minimization objective function is constructed as follows

$L_{p} (W, b) = \frac{1}{2} \sum_{j = 1}^{N} {‖ w^{j} ‖}^{2} + C \sum_{i = 1}^{Z} L (u_{i})$ (7)

The $ε$-insensitive loss function can be extended to a multidimensional form in which the L₂ norm of the output error vector replaces the original 1-D error. It is defined as

$L (u_{i}) = \begin{cases} 0, & u_{i} < ε \\ u_{i}^{2} - 2 u_{i} ε + ε^{2}, & u_{i} \geq ε \end{cases}$ (8)

where $u_{i} = ‖ e_{i} ‖ = \sqrt{{e_{i}}^{T} e_{i}}$ , $e_{i} = y_{i} - ϕ (R_{i}) W - b = y_{i} - {\tilde{y}}_{i}$ , $ε$ is the allowable deviation, C is a hyperparameter used to determine the trade-off between regularization and reduction of error terms.
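As a small illustration of equation (8), the loss can be evaluated directly from a matrix of residuals. The function name `msvr_loss` and the row-per-sample layout are assumptions made for this sketch; note that $u_{i}^{2} - 2 u_{i} ε + ε^{2}$ is simply $(u_{i} - ε)^{2}$.

```python
import numpy as np

def msvr_loss(E, eps):
    """Per-sample multi-output epsilon-insensitive loss of equation (8).

    E   : (Z, N) array whose ith row is the residual vector e_i.
    eps : allowable deviation (radius of the insensitive zone).
    """
    u = np.linalg.norm(E, axis=1)          # u_i = ||e_i||
    # Zero inside the insensitive zone, quadratic (u_i - eps)^2 outside it.
    return np.where(u < eps, 0.0, (u - eps) ** 2)
```

A residual with norm below `eps` contributes nothing, so all outputs of a sample are penalized jointly through a single hypersphere rather than one tube per output.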

The optimization problem is solved iteratively: each iteration uses the previous solution ( $W^{k}$ and $b^{k}$ ) to obtain the next one, until the optimal solution is reached. To optimize equation (7), the iteratively reweighted least squares (IRWLS) procedure is used.

The first-order Taylor expansion of the target function (7) is shown as follows

$L'_{p} = \frac{1}{2} \sum_{j = 1}^{N} ‖ w^{j} ‖^{2} + C \sum_{i = 1}^{Z} \left( L (u_{i}^{k}) + \left. \frac{dL (u_{i})}{d u_{i}} \right|_{u_{i}^{k}} \frac{(e_{i}^{k})^{T}}{u_{i}^{k}} [e_{i} - e_{i}^{k}] \right)$ (9)

where $u_{i}^{k} = ‖ e_{i}^{k} ‖ = \sqrt{(e_{i}^{k})^{T} e_{i}^{k}}$, $(e_{i}^{k})^{T} = y_{i}^{T} - ϕ^{T} (R_{i}) W^{k} - (b^{k})^{T}$.

Furthermore, the second-order Taylor expansion is obtained from equation (7)

$L''_{p} = \frac{1}{2} \sum_{j = 1}^{N} ‖ w^{j} ‖^{2} + \frac{1}{2} \sum_{i = 1}^{Z} a_{i} u_{i}^{2} + CT$ (10)

where $a_{i} = \frac{C}{u_{i}^{k}} \left. \frac{dL (u_{i})}{d u_{i}} \right|_{u_{i}^{k}} = \begin{cases} 0, & u_{i}^{k} < ε \\ \frac{2 C (u_{i}^{k} - ε)}{u_{i}^{k}}, & u_{i}^{k} \geq ε \end{cases}$ and CT is a constant independent of $W$ and $b$. Taking the partial derivatives of $L''_{p}$ with respect to $w^{j}$ and $b^{j}$ and setting them equal to 0 gives

$\frac{\partial L''_{p}}{\partial w^{j}} = w^{j} - \sum_{i = 1}^{Z} ϕ (R_{i}) a_{i} (y_{ij} - ϕ^{T} (R_{i}) w^{j} - b^{j}) = 0$ (11)

$\frac{\partial L''_{p}}{\partial b^{j}} = - \sum_{i = 1}^{Z} a_{i} (y_{ij} - ϕ^{T} (R_{i}) w^{j} - b^{j}) = 0$ (12)

Equations (11) and (12) can be written in matrix form as

$[\begin{matrix} Φ^{T} D_{a} Φ + I & Φ^{T} α \\ α^{T} Φ & 1^{T} α \end{matrix}] [\begin{matrix} w^{j} \\ b^{j} \end{matrix}] = [\begin{matrix} Φ^{T} D_{a} y^{j} \\ α^{T} y^{j} \end{matrix}]$ (13)

where $Φ = [ϕ (R_{1}), \dots, ϕ (R_{Z})]^{T}$, $α = [a_{1}, \dots, a_{Z}]^{T}$, $(D_{a})_{ij} = a_{i} δ (i - j)$, $y^{j} = [y_{1 j}, \dots, y_{Zj}]^{T}$.

The inner product kernel function $κ (x_{i}, x_{j}) = ϕ^{T} (x_{i}) ϕ (x_{j})$ is usually used in place of the explicit nonlinear mapping. By the representer theorem, it can be shown, under fairly general conditions, that the best solution of the learning problem can be expressed as a linear combination of the training samples in the feature space, $w^{j} = \sum_{i} ϕ (R_{i}) β_{i}^{j} = Φ^{T} β^{j}$. Substituting this expression into equations (11) and (12) yields

$[\begin{matrix} K + {D_{a}}^{- 1} & 1 \\ α^{T} K & 1^{T} α \end{matrix}] [\begin{matrix} β^{j} \\ b^{j} \end{matrix}] = [\begin{matrix} y^{j} \\ α^{T} y^{j} \end{matrix}]$ (14)

where $(K)_{ij} = κ (x_{i}, x_{j})$ is known as the kernel matrix.

The IRWLS procedure can be summarized as the following steps

1. Initialization: Set $k = 0$, $β^{k} = 0$, $b^{k} = 0$, and compute $u_{i}^{k}$ and $a_{i}$.

2. Compute the solution to equation (14), labeled $β^{s}$ and $b^{s}$, and define the descent direction as $P^{k} = [\begin{matrix} W^{s} - W^{k} \\ (b^{s} - b^{k})^{T} \end{matrix}]$, where $W^{s} = Φ^{T} β^{s}$ follows from the expansion $w^{j} = Φ^{T} β^{j}$.

3. Determine the search step size $η^{k}$ by a heuristic method; the solution for the next iteration is $[\begin{matrix} W^{k + 1} \\ (b^{k + 1})^{T} \end{matrix}] = [\begin{matrix} W^{k} \\ (b^{k})^{T} \end{matrix}] + η^{k} P^{k}$.

4. Compute $u_{i}^{k + 1}$ and $a_{i}$, set $k = k + 1$, and return to step 2; repeat until $L_{p}$ no longer decreases.

The convergence proof of the above algorithm is given in Sanchez-Fernandez et al.²⁴ For each new vector $R$, we can calculate the jth output as $y^{j} = ϕ^{T} (R) Φ^{T} β^{j}$. Now, we define the matrix $β = [β^{1}, β^{2}, \dots, β^{N}]$, and the N outputs can be expressed as

$y = ϕ^{T} (x) Φ^{T} β = K_{x} β$ (15)
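The IRWLS procedure and the prediction rule of equation (15) can be sketched in NumPy as follows. This is an illustrative sketch only, not the authors' implementation: the RBF kernel, the step-halving line search (standing in for the unspecified heuristic step size), the restriction of equation (14) to samples with $a_{i} > 0$ (so that $D_{a}^{-1}$ exists), the default hyperparameters, and all function names are assumptions. The prediction also adds the bias $b$, which equation (15) leaves implicit.

```python
import numpy as np

def rbf_kernel(X1, X2, gamma):
    """Gaussian kernel matrix (an assumed kernel choice)."""
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-gamma * d2)

def _objective(K, Y, beta, b, C, eps):
    """Objective of equation (7) in kernel form, plus the residual norms u_i."""
    u = np.linalg.norm(Y - K @ beta - b, axis=1)
    loss = np.where(u < eps, 0.0, (u - eps) ** 2)
    return 0.5 * np.trace(beta.T @ K @ beta) + C * loss.sum(), u

def msvr_fit(K, Y, C=10.0, eps=0.1, max_iter=100, tol=1e-8):
    """IRWLS-style training; K is the (Z, Z) kernel matrix, Y the (Z, N) targets."""
    Z, n_out = Y.shape
    beta, b = np.zeros((Z, n_out)), np.zeros(n_out)      # step 1: initialization
    Lp, u = _objective(K, Y, beta, b, C, eps)
    for _ in range(max_iter):
        S = np.flatnonzero(u > eps)                       # samples with a_i > 0
        if S.size == 0:
            break
        a = 2.0 * C * (u[S] - eps) / u[S]
        Kss, n = K[np.ix_(S, S)], S.size
        # Step 2: assemble and solve equation (14) on the active samples.
        lhs = np.zeros((n + 1, n + 1))
        lhs[:n, :n] = Kss + np.diag(1.0 / a)
        lhs[:n, n] = 1.0
        lhs[n, :n] = a @ Kss
        lhs[n, n] = a.sum()
        rhs = np.vstack([Y[S], a @ Y[S]])
        sol = np.linalg.lstsq(lhs, rhs, rcond=None)[0]
        beta_s = np.zeros_like(beta)
        beta_s[S] = sol[:n]
        b_s = sol[n]
        # Step 3: step-halving line search along the direction (beta_s, b_s) - (beta, b).
        eta = 1.0
        while eta > 1e-10:
            beta_new = beta + eta * (beta_s - beta)
            b_new = b + eta * (b_s - b)
            Lp_new, u_new = _objective(K, Y, beta_new, b_new, C, eps)
            if Lp_new < Lp:
                break
            eta *= 0.5
        else:
            break                                         # no step decreases L_p
        # Step 4: accept the step and stop once L_p no longer decreases noticeably.
        converged = Lp - Lp_new < tol * max(Lp, 1.0)
        beta, b, Lp, u = beta_new, b_new, Lp_new, u_new
        if converged:
            break
    return beta, b

def msvr_predict(K_new, beta, b):
    """Equation (15) with the bias added; K_new holds kernels between new and training inputs."""
    return K_new @ beta + b
```

In use, `K = rbf_kernel(X_train, X_train, gamma)` is passed to `msvr_fit`, and predictions for new inputs are obtained from `msvr_predict(rbf_kernel(X_new, X_train, gamma), beta, b)`.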

Since the covariance matrix of the signal $R_{s}$ has many upper triangular elements, using them all as training data increases the complexity of the algorithm. With more sample data, the training time increases and the training speed becomes slower. Therefore, a PCA algorithm is introduced to reduce the dimension of $R$, and hence the number of features of the input samples, without affecting the estimation performance; the complexity of the algorithm is thereby reduced. For example, for an array of eight elements, the upper triangle of the covariance matrix (including the diagonal) has 8 × 9 / 2 = 36 elements; after dimensionality reduction by PCA, the number of features can be reduced to about 8, and the complexity of the algorithm is reduced to about a quarter of the original. This algorithm is described in detail below.
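The feature count quoted above can be checked with a minimal sketch of the extraction step. The helper name `upper_triangle_features` is an assumption made here; the text does not specify how the complex-valued entries are passed to the regressor.

```python
import numpy as np

def upper_triangle_features(R):
    """Return the upper triangular entries of R (diagonal included) as a 1-D vector."""
    rows, cols = np.triu_indices(R.shape[0])
    return R[rows, cols]

# For an M x M covariance matrix this yields M (M + 1) / 2 features.
R_demo = np.arange(64).reshape(8, 8)
features = upper_triangle_features(R_demo)
```

For an 8 × 8 matrix the resulting vector has 36 entries, ordered row by row.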

PCA algorithm

PCA is a statistical method that replaces a set of correlated original indicators with a new set of mutually uncorrelated comprehensive indicators obtained by recombining the original ones.²⁶ The main idea is to establish a feature mapping from a high-dimensional space to a low-dimensional space, so that the original complex features are reduced to a few main features that retain as much of the original feature information as possible and are not correlated with each other. The set of linearly independent features obtained by this orthogonal transformation is called the principal components. After the dimensionality of the upper triangular elements of the signal covariance matrix is reduced by PCA, the redundant information is discarded, which increases the sampling density of the samples. At the same time, when the data are affected by noise, the eigenvectors corresponding to the smallest eigenvalues are often related to noise, so discarding them can remove noise to a certain extent.²⁷

In recent years, PCA has been widely used in various fields, and good results have been achieved. In this article, the PCA method is applied to the dimension reduction and noise reduction of the support vector input features. The principal components of the input feature variables are extracted to reduce data redundancy, imperfection, and overfitting, and thereby to reduce the dimension and computational complexity of the regression model matrix $W^{k}$. The specific steps are as follows

1. The covariance matrix of each of the Z groups of sample signals is calculated, the upper triangular elements of each are extracted to form a row vector, and the Z row vectors are stacked to form the signal feature matrix $R'$.

2. The signal feature matrix $R'$ is normalized to obtain the sample feature matrix $R''$.

3. The covariance matrix $C_{R''}$ (of size $A \times A$) of the normalized sample feature matrix $R''$ is calculated. Eigenvalue decomposition of $C_{R''}$ is performed to obtain its eigenvalues $λ_{i}$ and the corresponding eigenvectors $q_{i}$

$C_{R''} = (R'')^{H} R'' / E$ (16)

where $E$ is the number of training samples.

4. Compute the contribution rate of each eigenvalue, sort the eigenvalues in descending order, and retain the $a$ largest eigenvalues such that

$\frac{\sum_{i = 1}^{a} λ_{i}}{\sum_{i = 1}^{A} λ_{i}} \geq G, (a < A)$ (17)

In the above expression, G is the cumulative contribution rate; a value greater than 85% is generally considered to retain the vast majority of the information.²⁸ In this article, the cumulative contribution rate is set to 95%.

5. The eigenvectors corresponding to the $a$ selected eigenvalues are assembled into a matrix $Q = [q_{1}, q_{2}, \dots, q_{a}]$ of size $A \times a$, and the sample input feature $R'''$ of MSVR is obtained by the linear transformation $R''' = Q^{T} R''$.
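Steps 2 to 5 above can be sketched as follows. This is a minimal illustration under stated assumptions rather than the authors' code: the feature matrix holds one sample per row (so the projection is written as a right-multiplication by $Q$), normalization is taken to be z-scoring of each feature column (the text says only "normalized"), and the function name `pca_reduce` is hypothetical.

```python
import numpy as np

def pca_reduce(R_feat, G=0.95):
    """Reduce a (Z, A) feature matrix to its leading principal components.

    Returns the (Z, a) projected features, the (A, a) eigenvector matrix Q,
    and the cumulative contribution rates of all A components.
    """
    # Step 2: normalization (assumed here to be per-column z-scoring).
    mean = R_feat.mean(axis=0)
    std = R_feat.std(axis=0)
    std[std == 0] = 1.0                      # guard against constant columns
    Rn = (R_feat - mean) / std

    # Step 3: covariance matrix of equation (16) and its eigendecomposition.
    C = Rn.conj().T @ Rn / Rn.shape[0]
    lam, Q = np.linalg.eigh(C)               # eigh returns ascending eigenvalues
    order = np.argsort(lam)[::-1]            # re-sort in descending order
    lam, Q = lam[order], Q[:, order]

    # Step 4: smallest a whose cumulative contribution rate reaches G, equation (17).
    ratio = np.cumsum(lam) / np.sum(lam)
    a = min(int(np.searchsorted(ratio, G)) + 1, lam.size)

    # Step 5: project each sample (row) onto the a leading eigenvectors.
    Qa = Q[:, :a]
    return Rn @ Qa, Qa, ratio
```

New test samples would need to be normalized with the training `mean` and `std` and projected with the same `Qa`; returning those statistics as well is a straightforward extension.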

After PCA dimensionality reduction, $R'''$ replaces $R$ as the input feature of MSVR, and the model is trained on these data using MSVR. For convenience, the algorithm described above is called the PCA-MSVR method. Its steps are as follows; its performance is discussed with the simulation results.

1. Data preparation. The covariance matrix of each training sample signal is obtained, and its upper triangular elements are extracted to form the feature matrix of the signal.

2. Dimensionality reduction. PCA is used to reduce the dimension of the signal's feature matrix.

3. Model training. MSVR is trained on the training data to obtain a predictive model of the signal.

4. Performance estimation. The test data are used for prediction and to evaluate performance.

With almost the same estimation accuracy, the PCA-MSVR algorithm retains the characteristic information of the signal with as little data as possible. The PCA-MSVR algorithm also does not need eigen-decomposition and peak search, and can be realized quickly.

Simulation results and performance analysis

Two near-field, narrow-band, non-Gaussian, stationary sound source signals are incident on the uniform linear sensor array shown in Figure 1. The receiving array is composed of eight elements. The inter-element spacing is $d = λ_{min} / 4$, the signal frequencies are set as $[f_{s} / 8, f_{s} / 10]$, where $f_{s}$ is the sampling frequency, the number of snapshots is 1024, and the noise is Gaussian white noise. The angular spacing of the training sample data is set as $Δ θ = 8 °$ and the range interval is $Δ r = 0.05 λ_{min}$, where $λ_{min}$ is the wavelength of the signal corresponding to frequency $f_{s} / 8$. The training angle range is $[- π / 2, π / 2]$, and the training range between the two sources and the reference array element is $[2.1 λ_{min}, 3.1 λ_{min}]$. The number of sample data is 180, divided into two equal parts, one for training and the other for testing. Then, the two untrained signals are taken for prediction, and the simulation results are shown in Figures 2–7.

Figure 2.

Contribution rate of each principal component.

Figure 3.

Cumulative contribution rate of principal components.

Figure 4.

PCA-MSVR DOA estimation.

Figure 5.

PCA-MSVR range estimation.

Figure 6.

RMSE of DOA at different SNR.

Figure 7.

Training time comparison of different algorithms.

Figure 2 shows the contribution rate of each principal component to the signal characteristics. It can be seen from Figure 2 that the contribution rates of the first few principal components are higher. Figure 3 shows the cumulative contribution rate of the principal components. From Figure 3, it can be seen that the cumulative contribution rate of the first eight principal components is 99%. In this article, the required cumulative contribution rate is 95%. Therefore, the first eight principal components are selected.

Figures 4 and 5 show the fit of the DOA and range estimated by the PCA-MSVR method for different test samples. From Figures 4 and 5, it can be seen that the estimated angle and range parameters of the near-field source fit the actual values well, so the proposed PCA-MSVR algorithm can accurately estimate the angle and range parameters of the near-field source. These simulation results show the effectiveness of the proposed algorithm.

Figure 6 plots the root mean square error (RMSE) of the DOA estimates obtained by the two-step MUSIC, back propagation (BP), MSVR, general regression neural network (GRNN), and proposed PCA-MSVR algorithms at various SNR levels. As can be seen from Figure 6, at SNRs at or above −10 dB, the RMSE of the proposed PCA-MSVR algorithm is nearly the same as that of the MSVR algorithm, and the DOA estimation precision of the PCA-MSVR and MSVR algorithms is higher than that of the BP and two-step MUSIC algorithms. When the SNR is within the range of −5 to 20 dB, the RMSE of the proposed PCA-MSVR algorithm is nearly the same as that of the MSVR and GRNN algorithms, and the DOA estimation precision of the PCA-MSVR, MSVR, and GRNN algorithms is higher than that of the BP and two-step MUSIC algorithms. Although the accuracy is the same, the computational load of the proposed PCA-MSVR algorithm is significantly less than that of the MSVR and GRNN algorithms.
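The article does not write out the RMSE formula used for Figure 6; the sketch below assumes the standard definition, the square root of the mean squared difference between estimates and true values over all trials. The function name `rmse` is chosen here for illustration.

```python
import numpy as np

def rmse(estimates, truth):
    """Root mean square error between estimated and true parameter values."""
    err = np.asarray(estimates, dtype=float) - np.asarray(truth, dtype=float)
    return float(np.sqrt(np.mean(err ** 2)))
```

Applied to the DOA estimates of each algorithm at each SNR level, this gives one point per algorithm and SNR on a curve such as those in Figure 6.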

The complexity of the training process is determined by the convergence performance of the algorithm, so it is difficult to give a quantitative analysis of the complexity with a mathematical formula. The training time of the PCA-MSVR algorithm is compared with that of the MSVR algorithm in Figure 7: the training time of PCA-MSVR is 0.06 s, whereas that of MSVR is about 0.08 s. Compared with the MSVR algorithm, the complexity of the PCA-MSVR algorithm is significantly reduced.

For further illustration, the frequency condition in the simulation experiment is reset to $[f_{s} / 8, f_{s} / 8]$, so that the incoherent signals become coherent signals, while the other conditions remain unchanged. The proposed method is then used to estimate the parameters of the coherent near-field sources. The simulation results are as follows.

Figures 8 and 9 are the scatter diagrams of the DOA and range estimates for two near-field coherent signals. It can be seen in these two figures that the estimated DOA and range values of the near-field signals fit the actual ones well, so the proposed algorithm can estimate the parameters of coherent near-field sources.

Figure 8.

Scatter diagram of DOA estimation.

Figure 9.

Scatter diagram of range estimation.

Conclusion

In this article, an SVR method for jointly estimating elevation and range is implemented with PCA dimensionality reduction. Performance is maintained while the computational complexity is hardly increased. First, the upper triangular elements of the covariance matrix of the received signal are extracted from the sample data, and their dimensionality is reduced through PCA. Second, the reduced-dimension matrix is taken as the input feature of the MSVR machine. Finally, the multi-output SVR algorithm is used for modeling to obtain the parameter model for near-field estimation. PCA greatly reduces the dimension of the input features of the SVM, which reduces the complexity of data processing and shortens the training time accordingly. At the same time, the noise is suppressed without losing the essential information of the original data, and the SNR is improved; as a result, the estimation accuracy is improved. The proposed method performs better than the BP and GRNN algorithms at low SNR. This method has no special requirements for the array structure and is suitable for both uniform and non-uniform linear arrays. Because the model parameters are obtained through data training, array error does not affect the accuracy of parameter estimation. Simulation results show that the proposed method based on PCA dimension reduction and multiple-output SVR has high estimation accuracy.

Footnotes

The authors would like to thank the anonymous reviewers and the associate editor for their valuable comments and suggestions that improved the clarity of this article.

Handling Editor: Antonio Lazaro

Data availability statement

All data included in this study are available upon request by contact with the corresponding author.

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work is supported by the National Natural Science Foundation of China under Contract (Grant No. 61772398, 61972239); the Key Research and Development Program Projects of Shaanxi Province (Grant No. 2019SF-257); the Special Scientific Research Project of Shaanxi Provincial Education Department (Grant No. 18JK0144); the Opening Foundation of Shaanxi University of Technology Shaanxi Key Laboratory of Industrial Automation (Grant No. SLGPT2019KF01-15); and the Science and Technology Program of Hantai District (Grant No. 2019KX-21).

ORCID iD

Lanmei Wang

References

1. Liu, Xing, Liu, et al. Maximum likelihood angle estimation of target in the presence of chaff centroid jamming. IEEE Access 2018; 6: 74416–74428.

2. Chen, Cao, et al. Fast algorithm for DOA estimation with partial covariance matrix and without eigendecomposition. J Signal Infor Pr 2011; 2: 266–269.

3. Wong, Zoltowski MD. Root-MUSIC-based direction-finding and polarization estimation using diversely polarized possibly collocated antennas. IEEE Antenn Wirel Pr 2004; 3(1): 129–132.

4. Wang, Zhao, et al. DOA and polarization estimation algorithm based on the virtual multiple baseline theory. Progr Electromagn Res C 2016; 65: 45–56.

5. Abeywickrama, Zhang, et al. Low-complexity portable passive drone surveillance via SDR-based signal processing. IEEE Commun Mag 2018; 56(4): 112–118.

6. Abeywickrama, Yuen, et al. A robust phase-ambiguity-immune DOA estimation scheme for antenna array. IEEE T Veh Technol 2019; 68(7): 6686–6696.

7. Kuang, Wang, Xie, et al. Real-valued near-field localization of partially polarized noncircular sources with a cross-dipole array. IEEE Access 2019; 7: 36623–36632.

8. Yang, Jie, Liu BS. A new near-field source localization algorithm based on generalized esprit. In: Proceedings of the 2009 4th IEEE conference on industrial electronics and applications, Xi’an, China, 25–27 May 2009, pp.1115–1120. New York: IEEE.

9. Zheng, Jiang, et al. Localization of mixed far-field and near-field sources via cumulant matrix reconstruction. IEEE Sens J 2018; 18(18): 7671–7680.

10. Huang, Barkat. Near-field multiple source localization by passive sensor array. IEEE T Antenn Propag 1991; 39(7): 968–975.

11. Starer, Nehorai. Passive localization of near-field sources by path following. IEEE T Signal Pr 1994; 42(3): 677–680.

12. Challa, Shamsunder. High-order subspace-based algorithms for passive localization of near-field sources. In: Proceedings of the IEEE conference on signals, systems & computers, Pacific Grove, CA, 30 October–1 November 1995. New York: IEEE.

13. Zhang, Chen, Zheng, et al. Localization of near-field sources: a reduced-dimension MUSIC algorithm. IEEE Commun Lett 2018; 22(7): 1422–1425.

14. Gotsis, Siakavara, Sahalos JN. On the direction of arrival (DoA) estimation for a switched-beam antenna system using neural networks. IEEE T Antenn Propag 2009; 57(5): 1399–1411.

15. Terabayashi, Natsuaki, Hirose. Ultrawideband direction-of-arrival estimation using complex-valued spatiotemporal neural networks. IEEE T Neur Net Lear 2014; 25(9): 1727–1732.

16. Sun, Chen, Yuen, et al. Indoor sound source localization with probabilistic neural network. IEEE T Ind Electron 2018; 65(8): 6403–6413.

17. Gonnouni, Martinez-Ramon, Rojo-Alvarez, et al. A support vector machine MUSIC algorithm. IEEE T Antenn Propag 2012; 60(10): 4901–4910.

18. Tarkowski, Kulas. RSS-based DoA estimation for ESPAR antennas using support vector machine. IEEE Antenn Wirel Pr 2019; 18(4): 561–565.

19. Liu, Zhang, PS. Direction-of-arrival estimation based on deep neural networks with robustness to array imperfections. IEEE T Antenn Propag 2018; 66(12): 7315–7327.

20. Chakrabarty, Habets EAP. Multi-speaker DOA estimation using deep convolutional networks trained with noise signals. IEEE J Select Topic Signal Pr 2019; 13(1): 8–21.

21. Vapnik. Statistical learning theory. New York: Wiley, 1998.

22. Pérez-Cruz, Camps-Valls, Soria-Olivas, et al. Multi-dimensional function approximation and regression estimation. In: Proceedings of the international conference on artificial neural networks, Madrid, 28–30 August 2002, pp.757–762. Berlin: Springer.

23. Shilton, Lai DTH, Palaniswami. A division algebraic framework for multidimensional support vector regression. IEEE T Syst Man Cyb B 2010; 40(2): 517–528.

24. Sanchez-Fernandez, De-Prado-Cumplido, Arenas-Garcia, et al. SVM multiregression for nonlinear channel estimation in multiple-input multiple-output systems. IEEE T Signal Pr 2004; 52(8): 2298–2307.

25. Xiong, Bao. Multiple-output support vector regression with a firefly algorithm for interval-valued stock price index forecasting. Knowl Based Syst 2014; 55: 87–100.

26. Suykens, Gestel, Vandewalle, et al. A support vector machine formulation to PCA analysis and its kernel version. IEEE T Neural Netw 2003; 14(2): 447–450.

27. Zhang, Tian, Zhou JJ. A novel wideband direction-of-arrival estimation algorithm based on improved GRNN and PCA. J Optoelectron Laser 2012; 23(4): 692–696.

28. Xin, Wang. An adaptive network intrusion detection method based on PCA and support vector machines. In: Proceedings of the international conference on advanced data mining & applications, Wuhan, China, 22–24 July 2005, pp.696–703. New York: IEEE.