Abstract
Keywords
Introduction
Fault feature extraction is the basis of fault diagnosis, and the clear and definite fault characteristic quantity has a decisive influence on improving the accuracy and rapidity of fault diagnosis.1,2 The quality of fault feature is the key factor to the fault diagnosis effect. 3 Fuzzy fault information increases the time of fault diagnosis and the rate of misdiagnosis. 4 The present equipment tends to be complex precision. The interference of information is also diversified, and the difficulty of fault diagnosis is obviously increased. For several decades, clarity and significance have been the focus of extraction of the fault characteristic quantity. 5 Recently, some effective methods have been reported to extract the fault features, for example, multiscale clustered gray infogram 6 and criterion fusion for spectral segmentation. 7
Time domain signal, 8 as the signal processing and fault diagnosis of the original information, contains a wealth of information, and it is easy to be accepted. However, the time domain signal fault information is easy to be disturbed by noise and interference, and that the fault information is not obvious.
Frequency domain analysis 9 method is widely used in modern fault diagnosis and can reflect the fault of the equipment. For example, different fault types of rolling bearing fault characteristic frequency can appear different. But single-frequency domain signal cannot complete the description of the non-stationary and non-linear information of the machine.
At present, there are limitations in the common time-frequency analysis methods, 10 such as the Wigner distribution 11 which produces the cross-term interference in the analysis of multi-component signals and causes the ambiguous phenomenon of time-frequency signal feature. 12 The time-frequency window of the short-time Fourier transform 13 is fixed, and the time-frequency resolution is not high. Wavelet transform depends on lack of adaptability. The empirical mode decomposition (EMD) is a signal based on adaptive decomposition method, where the selection of wavelet bases is avoided, and it has the problem of mode mixing and endpoint effect. 14 As a result, the ensemble empirical mode decomposition (EEMD) method, which is an improved version of EMD, 15 solves the modal aliasing problem.16–24
An adaptive denoising fault feature extraction method based on EEMD and the correlation coefficient is proposed in this article. The correlation coefficient of the intrinsic mode function (IMF) component selection is the main fault information included in this method. It highlights the fault information and realizes the adaptive denoising effect at the same time. The separability of the input is strong with low dimension. Small fault diagnosis of the back-propagation (BP) neural network model and network robustness effectively improve the work efficiency of the method and the accuracy of the rolling bearing fault diagnosis. In order to verify the superiority of this method, the neural network based on EEMD method is compared with the network based on EMD or wavelet packet analysis method. Experiments show that the EEMD neural network–based method has higher ability of fault identification.
The remainder of this article is organized as follows. In section “Adaptive denoising theory based on EEMD,” the method is introduced. The process of method is described in section “Rolling bearing fault diagnosis based on EEMD and neural network.” Section “Experimental demonstration for fault diagnosis of rolling bearing” describes the contrast experiment and main results. Finally, we conclude the article in section “Conclusion.”
Adaptive denoising theory based on EEMD
EMD theory
EMD decomposes the complicated signal into a set of complete IMF25,26 based on the local characteristic time scales of a signal. EMD is a kind of method that can be used to deal with non-stationary signals. Different scale fluctuations or trends of the signal are decomposed by a series of different characteristic scale IMFs gradually. Generally speaking, the IMF component meets the following conditions: 27
In the whole data set, the number of extremum and the number of zero crossings must either be equal or differ at most by one;
At any point, the mean values of the upper envelope and lower envelope are zero.
The decomposition principle of EMD method is described as follows:
Determine all the local extreme points of a signal to be processed; the local maxima and minima are connected by two cubic splines to form an upper and a lower envelope, and signals of all data points are covered between the two envelopes.
The mean values of the upper and the lower two envelope lines are, respectively,
Subtract the basic mode component
The IMF component of
Through the above four steps, a signal can be decomposed into a basic mode component of
Figure 1(a) shows the fault signal of rolling bearings. The signal is decomposed by EMD to obtain 11 IMF and 1 residual component as shown in Figure 1(b).

(a) Fault signal of rolling bearings and (b) signal decomposed by EMD.
EEMD theory
The process of EMD decomposition has the problem of mode mixing, 28 in which single IMF contains different frequency components of the signal or the same frequency components are decomposed into different IMFs. EEMD was developed from EMD by Wu and Huang to solve the problem of mode mixing. 29 The essence of EEMD algorithm is the superposition of Gaussian white noise on the original signal. The mean of the IMF component with multiple EMD decomposition is the final result. The algorithm makes use of the statistical property of Gaussian white noise, so that the signal plus white noise can have continuity at different frequencies, and the mode-mixing problem is solved effectively. Based on the characteristics of EMD, the principle of EEMD is described as follows:
Add a white noise to the original signal;
After decomposing the signal, the IMF component is obtained; 30
Repeat steps 1 and 2, but the white noise is different each time;
The mean value of the IMF component obtained by multiple decomposition is the final result. 31
The amplitude of the collective number and the added Gaussian white noise are the two important parameters of EEMD. Wu and Huang suggest that in most cases, the standard deviation of the noise amplitude is 0.2 times the standard deviation of the signal.
The simulation signal and its decomposition results are shown in Figure 2. The EMD in the decomposition process has the problem of mode mixing. Sine signal is decomposed into two IMF components, and decomposition results produced serious distortion; thus, the fund for the EMD loses physical meaning. The simulation signal is processed by EEMD, and the sine signal is decomposed into two IMF components.

(a) Simulation signal, (b) decomposition results by EMD, and (c) decomposition results by EEMD.
The correlation coefficient
EMD denoising method and kurtosis threshold have been widely applied to fault diagnosis. The method mainly depends on the subjective experience and the large number of attempts. At the same time, the disadvantage of this method is lack of adaptability. An adaptive denoising method based on the correlation number is proposed in this article.
32
First, the original signal is processed by EEMD. Then, the correlation coefficients
It emphasizes the fault feature information and inhibits the normal information of the normal state in the original signal; the standard cross-correlation coefficients
where
Rolling bearing fault diagnosis based on EEMD and neural network
The fault diagnosis method based on EEMD and neural network is shown as the block diagram of rolling bearing in Figure 3.
The three states of rolling bearings (inner race fault, outer-ring fault, and normal) are sampled;
The original signal is processed by EEMD and the IMF component is obtained;
Calculate the failure correlation coefficient of each IMF component and analyze the IMF component of the main fault information from the large-to-small selection of the first eight standard cross-correlation coefficients. The change in the energy of frequency band is characterized by the rolling bearing fault, so the energy of IMF is applied as the feature vector of the network to recognize the bearing state
IMF component energy,
Considering the energy value, the normalized treatment is convenient for the following analysis
The eigenvector
The eigenvector
The number of neurons in the output of the neural network is determined by the number of faults. Therefore, the output matrices of status coding are inner-ring fault bearing [0, 0, 1], outer-ring fault bearing [0, 1, 0], and normal bearing [1, 0, 0]. Based on the BP network training, the test sample is tested and identified by the trained network. The characteristic parameters of the test sample are the network input, and the status of the test sample is determined for the network output.

Fault diagnosis block diagram of rolling bearing based on EEMD and neural network.
Experimental demonstration for fault diagnosis of rolling bearing
A large number of experimental data are provided for the typical failure of the rolling bearing from the data center of the Western Reserve Case University. In this article, the experimental data of the rolling bearing from the data center of the Western Reserve Case University is used to verify the effectiveness of the proposed method. The bearing type of the experiment is SKF 6325 of rolling bearing, experimental rotational frequency is 20 Hz, and the sampling frequency is 2048 Hz. The vibration signal is picked up by the acceleration sensor mounted on the bearing base.
The original vibration signal is decomposed by EEMD, and the correlation coefficient
The EMD is similar to the EEMD process in which the raw signal is decomposed, the correlation coefficient is gained to select eight IMF components, and the fault parameters are determined. The 30 training samples are trained through the BP network, the 15 test samples are classified, and the accuracy of the network identification is 83%.
The vibration signal with three layers of wavelet packets has been decomposed in this article. 33 The third layer’s eight-band wavelet packet decomposition coefficients can be obtained. The third–layer’s eight-band wavelet package decomposition coefficients were reconstructed to form eight new time series. 34 According to formulas (2), (3), and (4), the common points are extracted from eight sequence reconstruction of energy as feature vectors, and the network is trained. By using the trained neural network to classify 15 test samples, the network identification accuracy is 77% for each pattern of five samples.
Table 1 presents the eigenvector and results of the test samples (corresponding to three patterns) based on three methods. The energy distribution is shown in Figure 4; the range of the energy characteristics is given by the wavelet packet analysis, and only the normal state of the first IMF is clearly distinguished from the internal and external fault states. The other IMFs cross each other, which is the cause of the subsequent failure identification rate not being high. The characteristics of EMD and EEMD extraction are different from other states relatively. Figure 5 shows the evaluation of the performance of the three methods. Wavelet analysis and EMD diagnostic results are scattered and confused, which is also shown in Figure 4 that the difference is not obvious. In EEMD, same fault data test results are stable, and the difference in different fault modes is obvious.
Three diagnostic results.
EMD: empirical mode decomposition; EEMD: ensemble empirical mode decomposition.

Three methods to extract the energy distribution: (a) characteristics of wavelet packet analysis extraction in three states, (b) characteristics of EMD extraction in three states, and (c) characteristics of EEMD extraction in three states.

Test results of the three pre-processing network.
The method based on EMD or wavelet packet analysis of vibration signal pre-processing can identify fault bearing as well. The recognition capability based on EEMD extraction energy character parameters of network method is higher than a network method based on EMD and wavelet packet analysis of network. The wavelet packet decomposition is non-adaptive, 35 the data test results are scattered and of poor stability, and the distances are far from the expected output. EMD decomposition has the mode-mixing problem, 36 normal status identification rate is high, the inner-ring fault and the outer-ring fault have intersection, and the resolution is low. The EEMD decomposition is performed according to the information of the signal’s adaptive decomposition; the dependency of the decomposition process changes in the signal itself which contains information and solves the EMD mode-mixing problem, mutual correlation coefficient effective denoising, and stable data test results. The difference in fault modes is obvious and the fault information is more sensitive.
Conclusion
The problem of the fault characteristics has been disturbed easily. An adaptive denoising fault feature extraction method based on EEMD and the correlation coefficient is proposed in this article:
EEMD is used to extract the feature of the fault signal, which can solve the non-linear problem of the traditional processing method and the mixed problem of the EMD signal;
Then, the main fault information is selected by correlation coefficient, which makes the fault information become more prominent and obvious;
Theoretical analysis and experimental results show that the fault diagnosis method based on EEMD can identify the fault types of rolling bearing effectively, and it has a higher recognition capability than the network based on EMD and wavelet packet analysis;
The method is precise and feasible to realize the recognition of the servo system in the fault location and fault type, so as to realize the fault diagnosis quickly. The contrast of three ways to use the same data model (input, output, feature, and layer) using the neural network as a fault identification method proves the superiority of EEMD.
Finally, due to the superiority of EEMD in signal processing and classification performance, the proposed method has strong generalization ability.
