Abstract
Keywords
Introduction
Rolling bearings as a main component have been widespread used in various types of rotating machines. Bearing failure is one of the foremost causes of failure and breakdowns in rotating machinery, resulting in significant economic loss.1–3 In order to keep machinery running with high reliability and reduce economic losses, it is essential to develop a reliable and effective bearing fault diagnosis method. Among different bearing fault diagnosis methods, vibration-based methods have attracted extensive attention during the last few decades. 4
Bearing vibration signals contain abundant information about machine health status; therefore, it is possible to obtain dominant characteristics from vibration signals by signal processing techniques. 5 A great number of signal processing methods have been used for rolling bearing fault detection and diagnosis. However, due to the nonlinear factors, for example, stiffness, friction and clearance, bearing vibration signals in particular under faulty conditions, will behave nonlinear and nonstationary characteristics. 6 Besides, the measured vibration signals contain not only the working condition information related to the rolling bearing itself, but also a wealth of information of other moving parts and structures of machinery and equipment, which belongs to the background noise compared with the former. 7 Since background noise is often relatively large, the slight rolling bearing fault information may easily be buried in the background noise and become difficult to extract. As a result, the conventional time and frequency domain methods aiming mainly at linear vibration signals, and even advanced signal processing methods (e.g., wavelet transform) may not make an accurate assessment of the rolling bearing health status. 8
With the development of nonlinear dynamics theory, a great deal of nonlinear analysis techniques has been proposed aiming at recognizing and predicting the complex nonlinear dynamic behavior of bearings.4,9 One of the most typical ways is to extract and refine the fault features from vibration signals over the combination of some advanced signal processing methods, for example, wavelet package transform (WPT),8,10 Hilbert transform (HT),10,11 empirical mode decomposition (EMD), 11 and higher order spectra (HOS), 12 to recognize the fault frequency and compare to the theoretical value with involvement of expert’s empirical judgment. With the development of artificial intelligence, the procedure of rolling bearing fault diagnosis is more and more treated as a procedure of pattern recognition, 13 and its effectiveness and reliability mainly depend on the selection of dominant characteristic vector of the fault features. 13 Recently, some entropy-based methods, for example, approximate entropy (ApEn),14,15 sample entropy (SampEn), 16 fuzzy entropy (FuzzyEn),16,17 hierarchical entropy (HE),13,18 and hierarchical fuzzy entropy (HFE), 13 were developed to extract dominant characteristic vector of the fault features from the bearing vibration signals and obtained evident effect. Here, we exploit a fractal theory-based method, that is, an improved generalized fractal box-counting dimension algorithm, to extract dominant characteristic vector of the fault features from the bearing vibration signals. Fractal theory is one of the most important branches for the contemporary nonlinear science, and it is in particular suitable for processing all kinds of complex nonlinear and nonstationary phenomenon 19 and thus may also suitable for fault feature extraction from bearing vibration signals.
Normally, after feature extraction, a pattern recognition technique is needed to achieve the rolling element bearing fault diagnosis automatically. 13 Various pattern recognition methods have been developed for mechanical fault diagnosis nowadays, among which the most commonly used ones are artificial neural networks (ANNs)20–22 and support vector machines (SVMs). 23 The training of ANNs needs a large number of samples which is hard or even impossible to achieve in practical applications, in particular the fault ones. SVMs based on statistical learning theory, which is of specialties for a smaller number of samples, have better generalization than ANNs and ensure that the local and global optimal solution are exactly the same. 24 However, the accuracy of a SVM classifier is decided by the selection of optimal parameters for SVMs.24,25 In order to ensure the diagnostic accuracy, an optimization algorithm24,25 or/and complex multi-class concept13,26 have to be used complementally to improve the effectiveness of SVMs. In this article, in order to solve the issue of generality versus accuracy, an adaptive gray relation algorithm (AGRA), in which the concept of weight coefficient and adaptive distinguishing coefficient was introduced into the calculation of the relation degree, was developed to achieve accurate pattern recognition based on a small number of samples.
In summary, a novel approach for rolling element bearing fault diagnosis is proposed based on improved fractal box-counting dimension and AGRA. At first, the fault features from the bearing vibration signals, which offer more useful and distinguishing information imaging different bearing health status, are extracted by improved generalized fractal box-counting dimension algorithm. And then, the fault types of rolling element bearings as well as various levels of severity are recognized by the outputs of the AGRA.
The rest of this article is organized as follows. A review of fractal box-counting dimension and its improved version are introduced in sections “Traditional fractal box-counting dimension” and “Improved generalized fractal box-counting dimension,” respectively. The gray relation theory for fault classification is given briefly in section “Gray relation algorithm,” and the AGRA is illustrated in section “Adaptive gray relation algorithm,” followed by section “Proposed approach” which presents the proposed rolling bearing fault diagnosis approach. Next, experimental validation of the proposed approach is given in section “Experimental validation.” Finally, conclusions are presented in section “Conclusion.”
Fractal box-counting dimension
Fractal theory is one of the most important branches for the contemporary nonlinear science, which is in particular suitable for processing all kinds of complex nonlinear and nonstationary phenomenon, and it may be also suitable for fault feature extraction from bearing vibration signals.
Traditional fractal box-counting dimension
Suppose
For the practical vibration discrete signal, due to the existence of the sampling frequency, the sampling interval
Considering a sequence of vibration discrete signal
Use approximate method to make the minimum side length of the boxes covering the sequence of vibration discrete signal
where
Select a good linearity of the fitting curve
where
As usual, the least square method is used to calculate the slope of the current fitting curve which is the fractal box dimension
Improved generalized fractal box-counting dimension
The traditional fractal box-counting dimension algorithm has been introduced in section “Traditional fractal box-counting dimension.” It has been widely used in strictly self-similar signals such as the signals in the electromagnetic fault diagnosis, image analysis, and biological medicine. However, for the common bearing vibration signals, they do not satisfy the self-similar structure of fractal theory to some degree. Therefore, when using the traditional fractal box-counting dimension algorithm to calculate box-counting dimension of the vibration signals, the curve fitting often does not have good linear structure. Aiming at this issue, here an improved generalized fractal box-counting dimension algorithm was developed to overcome the defect of the traditional fractal box-counting dimension.
First, resample the vibration discrete signals and then properly add the sampling points, to reduce the value of
Consider the vibration discrete signal
The derivative process of the number of boxes covering the sequence of vibration discrete signal
When
When
When
When
After reconstruct the phase space of the sequence of vibration discrete signal
Gray relation theory
Normally, after feature extraction, a pattern recognition technique is needed to complete the rolling element bearing fault diagnosis automatically. The research of gray relation theory is the foundation of gray system theory, mainly based on the basic theory of space mathematics, to calculate relation coefficient and relation degree between reference characteristic vector and each comparative characteristic vector. 27 Gray relation theory has a good potential to be used in fault classification with four reasons:28–31
It has good tolerance to measurement noise;
It has the ability to assist the selection of characteristic parameters for classification;
It can solve the learning problem with a small number of samples;
Its algorithm is simple and can solve the issue of generality versus accuracy.
Gray relation algorithm
The dominant characteristic vectors (i.e. the improved generalized fractal box-counting dimension) of the fault features extracted from bearing vibration signals are classified as follows
where
The knowledge base from a small number of samples between fault signatures (i.e. the characteristic vectors) and fault patterns (i.e. the fault types of rolling element bearings as well as various levels of severity) is as follows
where
For
where
And then
Adaptive gray relation algorithm
In order to enhance its tolerance to measurement noise and the ability to assist the selection of characteristic values for fault classification, the information theory was introduced into the calculation of the relation degree.
First, process the distance of characteristic parameter
where
Define the entropy value as follows
And the maximum entropy value is as follows
Then, the relative entropy value is obtained as follows
Referred to the concept of surplus degree in information theory, the definition of surplus degree for
The essential meaning of surplus degree is to remove the difference between the entropy value of the
Finally, calculate the weight coefficient
where
Then, obtain relation degree by weight coefficient multiplying with the corresponding relation coefficient, as follows
And finally
As the selection of distinguishing coefficient value
Process distinguishing coefficient in equation (10) as follows: Denote
Denote
Then, the distinguishing coefficient
if
else if
And the value of distinguishing coefficient
if
else if
When the adaptive distinguishing coefficient
Proposed approach
Based on the integration of the advantages of improved generalized fractal box-counting dimension algorithm and AGRA, a novel fault diagnosis method for rolling element bearings is put forward as follows:
The bearing vibration signals under different working conditions are acquired as samples which are divided into two subsets, the samples for knowledge base and the samples for testing.
The dominant characteristic vectors of fault features from the bearing vibration signals, which can provide more useful information reflecting bearing health status, were extracted by improved generalized fractal box-counting dimension algorithm.
Establish knowledge base according to the relationship between fault signatures (i.e. the characteristic vectors) and fault patterns (i.e. the fault types of rolling element bearings as well as different levels of severity) from the base samples for AGRA model.
The feature vectors of the testing samples are input into the AGRA model, and various bearing working conditions can be recognized by the output of the model.
Experimental validation
Experimental rig
All the rolling element bearing vibration signals analyzed in the article were downloaded from the Case Western Reserve University Bearing Data Center. 32 The whole experiment system is made up of a torque transducer, a two horsepower, a dynamometer and a three-phase induction motor as shown in Figure 1. The horsepower and speed data are collected over the transducer. The desired torque load levels could be achieved by controlling the dynamometer. The motor shaft at the drive end is supported by the test bearing, and single-point faults were introduced into the test bearing using electro-discharge machining, with fault diameters of 7, 14, 21, and 28 mils. Bearing faults under consideration include inner race fault, ball fault and outer race fault, and an accelerometer with a bandwidth up to 5000 Hz was mounted on the motor housing at the drive end of the motor, and then bearing vibration data under various working conditions were collected using a recorder with a sampling frequency of 12 kHz. The deep groove balling bearing 6205-2RS JEM SKF was used in the experiment.

Experimental setup.
Application and analysis
The experimental vibration data used for the analyses are from the tests which were conducted at the load of 0 horsepower and at the motor speed of 1797 r/min. Normal and three fault types of vibration data as well as those with various severities for each fault type are analyzed, and the detailed description of the data is shown in Table 1. Considering various fault categories and severities, the rolling element bearing fault diagnosis turns out to be an 11-class classification problem. The data set consisted of totally 550 data samples, where each data sample is truncated into a 2048-point time series from the original vibration data and there is no overlap between any two of them. Among these 550 data samples, 110 samples are selected at random as samples for knowledge base, and the rest 440 samples are automatically treated as testing data. Here, the testing data are four times the amount of the training data, since in the practical applications, the bearing vibration data under faulty conditions are hard to achieve and are usually of small sample size.
Description of experimental data set.
The fault features extracted from bearing normal condition and different fault conditions with fault size of 7 mils by traditional box-counting dimension are shown in Table 2 and by improved generalized box-counting dimension are shown in Figure 2. And the fault features extracted from bearing inner race fault condition with different levels of severity by traditional box-counting dimension are shown in Table 3 and by improved generalized box-counting dimension are shown in Figure 3. Here, due to that each data sample is truncated into a 2048-point time series from the original vibration data, which conforms to the calculating rule of box-counting dimension; thus, there is no need to do a resampling again. The side length of box sequentially increases from 20, 21, 22, until the side length reaches 210, and calculate the total number of boxes covering each sample. Then, the fitting curves can be obtained according to the method introduced in section “Improved generalized fractal box-counting dimension,” and the improved generalized box-counting dimension can also be obtained, to be used as dominant characteristic vectors of fault features to achieve the purpose of pattern recognition.
Traditional box-counting dimension of a random chosen sample from bearing normal condition and different fault conditions with fault size of 7 mils.

Improved generalized box-counting dimension of a random chosen sample from bearing normal condition and different fault conditions with fault size of 7 mils.
Traditional box-counting dimension of a random chosen sample from bearing inner race fault condition with different levels of severity.

Improved generalized box-counting dimension of a random chosen sample from bearing inner race fault condition with different levels of severity.
Fault features extracted from rolling bearing vibration signals have great effect on the successful diagnosis. Evident differences of fault features among different rolling bearing health status are beneficial for classification by a pattern recognition technique. From Tables 2 and 3, it can be seen that the fault features extracted by traditional box-counting dimension method are limited, and the distances between different fault types of rolling element bearings as well as different levels of severity are close, which are not easy to be classified by a pattern recognition technique. From Figures 2 and 3, it is interesting to see that the dominant characteristic vectors of fault features extracted from the rolling element bearing vibration signals with different fault types as well as different levels of severity by improved generalized fractal box-counting dimension algorithm show apparent differences.
After establishing knowledge base according to the relationship between fault signatures (i.e. the characteristic vectors) and fault patterns (i.e. the fault types of rolling element bearings as well as different levels of severity) from the base samples, the feature vectors of the testing samples are input into the GRA model, and various bearing working conditions were recognized by the output of the GRA model shown in Table 4.
The identification result with traditional fractal box-counting dimension and improved generalized fractal box-counting dimension by GRA.
GRA: gray relation algorithm.
From Table 4, the identification result shows that the recognition effect with traditional fractal box-counting dimension is poor and misleading due to the limited fault features extracted, while the recognition effect with improved generalized fractal box-counting dimension appears much better, and the total success rate can reach more than 96% and also shows that the proposed fault feature extraction algorithm can be applied to the practical rolling element bearing fault diagnosis.
The feature vectors of the testing samples are also input into the AGRA model, and various bearing health status were recognized by the output of the AGRA model as shown in Table 5. One of the merits of AGRA algorithm introduced in section “Adaptive Gray relation algorithm” is no need to design complex computational framework and set concrete parameters, which can well solve the issue of generality versus accuracy.
The identification result with improved generalized fractal box-counting dimension by GRA and AGRA.
AGRA: adaptive gray relation algorithm; GRA: gray relation algorithm.
From Table 5, the identification result shows that the recognition effect with AGRA shows certain improvement, and the total success rate can reach more than 97%. In order to illustrate the effectiveness of the proposed approach, two other most commonly used intelligent methods (e.g. back-propagation (BP) and SVM 24 ) were used as comparative algorithm in the article, as shown in Table 6.
The identification result with improved generalized fractal box-counting dimension by AGRA, BP, and SVM.
AGRA: adaptive gray relation algorithm; SVM: support vector machine; BP: back-propagation.
From Table 6, we can see all these three methods can accurately recognize the rolling bearing health status between healthy and faulty conditions. However, due to the small number of training samples, the extrapolation and interpolation performance of ANN (BP) are assessed to be poor and its recognition effect is unacceptable. It is also shown that the proposed approach in the article has superior diagnostic performance to the existing feed-forward BP neural network and SVM.
Further discussion
In order to further analyze the effectiveness of the proposed approach, an extreme experimental validation, in which only one set of random base sample for these 11 classifications was used for establishing knowledge base for AGRA model, was carried out. The feature vectors of the testing samples are input into the AGRA model, and various bearing working conditions were recognized by the output of the model shown in Table 7.
The identification result with improved generalized fractal box-counting dimension by GRA and AGRA.
AGRA: adaptive gray relation algorithm; GRA: gray relation algorithm.
From Table 7, it is encouraging to see that the proposed approach shows good robust in rolling bearing fault diagnosis and the total success rate can still reach more than 92% by AGRA. And the related comparison with other two existing intelligent methods (i.e. BP and SVM 24 ) is shown in Table 8.
The identification result with improved generalized fractal box-counting dimension by AGRA, BP, and SVM.
AGRA: adaptive gray relation algorithm; SVM: support vector machine; BP: back-propagation.
From Table 8, we can see that compared with existing fault intelligent diagnostic methods (i.e. BP and SVM) of a rolling bearing, the proposed approach can effectively solve the learning problem with an extremely small number of samples for identifying different bearing fault types as well as severities.
Conclusion
In this article, a novel approach for rolling element bearing fault diagnosis is proposed based on an improved generalized fractal box-counting dimension and AGRA. First, the fault features from the bearing vibration signals, which can provide more useful information reflecting bearing health status, were extracted by improved generalized fractal box-counting dimension algorithm. And then the fault types of rolling element bearings as well as various levels of severity were recognized by the outputs of the AGRA. The experimental results demonstrate that the proposed approach can effectively and accurately identify the different bearing fault types as well as severities. And some other meaningful conclusions can be obtained as follows:
The proposed improved generalized fractal box-counting dimension algorithm is very suitable for rolling bearing fault feature extraction, to offer more useful and distinguishing information imaging different bearing health status.
GRA as a pattern recognition technique is very suitable for achieving the rolling element bearing fault diagnosis automatically, in particular for condition of a small number of samples.
With the introduction of concept of weight coefficient and adaptive distinguishing coefficient to calculate the relation degree, the proposed AGRA appears a certain improvement.
In the future research, in order to continually improve the diagnostic accuracy for the proposed approach, some advanced signal processing methods, for example, WPT, HT, EMD, and HOS, may be explored to be integrated into the improved generalized fractal box-counting dimension to more effectively extract dominant characteristic vectors.
