Abstract
Keywords
Introduction
Considered as an essential bridge connecting with the physical world and human society, wireless sensor networks (WSNs) have been widely applied in medical, space exploration, military applications, smart home and environmental monitoring. However, the energy-efficiency problems of the sensor nodes, namely, the limited processing energy and the highly constrained energy resources, have always been a bottleneck hindering the further development of the network. Therefore, reducing the energy consumption and prolonging the lifetime expectation become challenges for the researchers in designing WSNs.1,2
In WSNs, traditional data gathering employed multi-hop to forward the raw data to the sink node.3,4 Thus, large number of redundant transmissions will be picked, leading to tremendous amount of energy waste. More explicitly, nodes which are closer to the sink take more forward tasks than the others, resulting in a rather faster energy consuming speed as well as the reduction of the whole network lifetime. To solve the problem, a promising technique called compressed sensing (CS) theory has brought a revolutionary breakthrough to the information processing field. This theory pointed out that for a compressible signal, a small collection of the linear projections is sufficient for the near perfect reconstruction.5–8 Rabbat et al. 9 introduced CS into the single-hop wireless network and compressed data successfully. In the study of Luo et al. 10 and Wang et al., 11 CS was applied in large-scale multi-hop WSNs, efficiently decreasing the communication cost and balancing the energy consumption among nodes. Compared with the existing traditional distributed source coding techniques such as Huffman coding, CS brings the benefit of simple compression at sensor nodes without excessive computational and control overheads, which is more feasible for the computation-limiting and energy-limiting nodes. In order to obtain deeper compressed data, researchers combine CS with routing protocol. Combining CS with power-efficient gathering in sensor information systems (PEGASIS) is one of the most popular routing protocols, 12 to reduce the energy consumption and evenly distribute the energy consumption loads, leading to an increase in network lifetime. However, compared with tree-typed routing, such as the minimum spanning tree, PEGASIS-based schemes minimized communication cost of each hop, rather than that of the whole link, resulting in a suboptimal performance. Furthermore, PEGASIS-based schemes suffer from poor robustness and fairly long latency of data gathering. CS combined with tree-type routing was investigated in the study of Luo et al.,10,13 for the sake of minimizing the total forwarding energy consumption. But simply applying CS could not improve the throughput of the network. On the contrary, it increases the communication cost of the leaf nodes, as well as the intermediate nodes which are closer to the leaves. For this reason, hybrid CS scheme was proposed, 14 in which only parent nodes with high communication load carried out compressing operation. However, the above studies focused only on plain network, when the network is large, conceiving hierarchical network structure via clustering would be more suitable for network management. Relying on the thought of Leach, 15 CS-based data gathering routing scheme suitable for clustering structure was studied. The author first formulated an energy consumption model to obtain the optimal number of clusters and then designed an efficient deterministic dynamic clustering scheme, to guarantee all cluster heads uniformly distributed approximately. The proposed algorithm cluster-based compressive sensing data collection (CCS) in the study of Nguyen et al. 16 combined CS and clustering utilizing block diagonal matrices (BDMs) as the measurement matrices. CCS discussed the optimal number of clusters for reaching the minimum power consumption and the effect of different sparsifying bases on the CS performance. Xie and Jia 17 proposed a clustering method that used hybrid CS, the literature first proposed an analytical model that studied the relationship between the size of clusters and number of transmissions in the hybrid CS method. Nguyen 18 combined random walk (RW) routing and CS to save energy and achieve longer network lifetime. The above-mentioned studies are constructive; however, it should be noted that all those algorithms neglect the event sources in WSNs. In fact, the event sources would deeply influence the data correlation. In addition, little attention has been devoted to the spatial correlation between sensor readings and the impact of event source on data correlation. Researchers15–17 used the clustering routing, and they all discussed the optimal number of clusters and kept the fixed cluster number. However, if we take the event source into consideration, the optimal cluster number could be changed because of the impact caused by event sources. They all uniformly reconstructed data at sink finally although they used different gathering routing.
The performance of WSNs is significantly affected by spatial correlation.19–22 Furthermore, there are some interested event sources impacting on the local spatial correlation of sensor readings, for example, temperature monitoring scenario, where sensors around an ignition point will have more correlated readings, while those nodes far away from the same point would have less correlated readings. Duarte et al. 23 presented two simple jointly sparse signals models, namely, Joint Sparse Model-1 (JSM-1) and Joint Sparse Model-2 (JSM-2), then they designed algorithms to recover multiple signals jointly. Based on JSM model, 24 the impact of event sources on data correlation was analysed, and the global factors is capable of increasing the common sparsity was further argued while decreasing the unique sparsity, which would result in a decrease in total sparsity and measurements. The authors clustered nodes via spatial correlation and optimal distance. Of particular note is that in the study of Wang et al., 24 interested event sources were supposed to be uniformly distributed in the network which disagrees with the practice. A more reasonable way is to dynamically cluster the networks according to the location of event sources, as well as the compressing sensor readings within a cluster.
Tackling the above-mentioned challenges, in this article we propose an compressed sensing–based dynamic clustering algorithm centred on event source (CS-DCES) algorithm. The main contributions in this work are summarized as follows:
We focus on WSNs with event sources, which cause the different correlation of raw readings.
We analyse and model the impact of event source on spatial correlation through JSM-1. Distance-based attenuation coefficient matrix is proposed as sparse matrix.
We cluster the sensors centred on the event source. The location of event source could be calculated in each round of reconstruction so as to dynamically re-cluster.
We reconstruct signal within cluster so as to increase the correlations within cluster, while decreasing the measurements needed for accurate reconstruction of original signals.
The main challenges for CS-DCES are (1) how to obtain the location of event sources and (2) how to model the impact of event source on spatial correlation.
The rest of this article is organized as follows. Section ‘System model’ devotes to the system model. A cluster scheme based on spatial correlation model is given in section ‘Cluster scheme based on spatial correlation model’. In section ‘CS-DCES’, we present the CS-DCES algorithm. Simulation results and performance analysis are presented in section ‘Simulation and performance analysis’. Finally, we give our concluding remarks in section ‘Conclusion’.
System model
It is assumed that the WSNs are deployed in a square area with the boundary length of
where
where
In WSN, each sensor reading is the summation of signal intensity of event sources, which can be expressed as
where
If there is an event source at node
where
In most of the practical WSNs application, monitoring areas are inaccessible because of their terrains and complex environments, such as the volcano monitoring and forest fire monitoring. It is difficult to regularly deploy network. Assuming that randomly deployed
Rows and columns of the matrix no longer denote the coordinates of nodes; here, the node can obtain its location via global positioning system (GPS) or other GPS-relative position algorithms.25,26 Vector
When there is an event source which is closest to node
If several equivalent nodes are apart from a same event source with the same shortest distance, randomly select node
where
where
Then, according to equation (3), the whole sensor readings
Cluster scheme based on spatial correlation model
In large-scale WSNs, there is spatial correlation between sensor readings. We model and analyse the sensor readings according to the Joint Sparsity Model-1(JSM-1) of Gupta et al.
21
We assume that there are
where
where
However, because of the complex network environment, there are some independent event sources which affect the spatial correlation of sensors readings in different areas. For example, consider the situation where a group of sensors measure the temperatures of outdoor locations. Global factors (such as the sun and prevailing winds) affect
For a small region, event source is a global factor which affects the surrounding sensors readings. The closer a node to the event source is, the more powerful its impact will be. Therefore, we propose a rule that nodes in the neighbourhood of an event source should be classified into one cluster. Assume that
where
Of particular note in equation (19) that
CS-DCES
For WSNs with event sources we interest, a compressive sensing–based dynamic clustering algorithm is proposed, which centres on the event source according to system model and spatial correlation model. The CS-DCES algorithm is presented in Figure 1.

Flow chart of CS-DCES.
The main part of CS-DCES will be detailed below:

Network clustering diagram.
Algorithm 1.
Algorithm 2.
Simulation and performance analysis
In this section, we evaluate the performance of CS-DCES algorithm, and the simulation environment is MATLAB 2012b, 2.1 GHz CPU and 4G RAM. The simulation parameters are set as follows: 400 sensor nodes are deployed in the monitoring region with the boundary length of 20 m and
Traditional CS-based data gathering algorithm, such as compressive data gathering (CDG) of Luo et al. 10 and efficient centralized dynamic clustering (ECDC) method of Wu et al., 15 view the sensor readings of the whole network as a single signal and completely reconstruct them at sink, while our proposed CS-DCES algorithm groups the nodes with high correlation and reconstructs sensor readings within a cluster individually. For convenience, we name the traditional algorithm as unified restructuring algorithm (CS-URA). The algorithm DCCS in the study of Nguyen et al. 16 combined CS and clustering utilizing BDMs as the measurement matrices. The member nodes send measurements to cluster head directly, and the event sources were neglected in DCCS. The spatial-correlation-based compressive sensing routing (SCSR) algorithm of Wang et al. 24 considers the impact of event sources on the data spatial correlation but does not pay attention to the event source or its accurate location nor does its incidence. All those four algorithms, namely, CS-DCES, CS-URA, DCCS and SCSR, will be compared in detail.
The signal-to-noise (SNR) and network lifetime are adopted to evaluate the performance of these algorithms. The SNR can be defined as
where
The energy consumption model is defined as 15
where
Parameter setting.
Performance comparison
It is assumed that there are two event sources at location (x = 15,

The relationship between reconstruction SNR and the number of measurements.
Figure 4 shows the relationship between the number of data gathering rounds

Network lifetime.
Locate the event source
Getting the location of event source plays a deterministic role on the efficiency of the CS-DCES algorithm. Both CS-URA and CS-DCES reconstruct the location of event sources under system models; however, CS-DCES is capable of making the same accuracy but using fewer measurements. We plot their performances in Figure 5 and the event source distribution map reconstructed by CS-DCES in Figure 6, respectively. It can be concluded that our proposed CS-DCES algorithm can accurately locate the event source and efficiently distinguish the change of locations, which guaranteeing the effectiveness of our algorithm.

Reconstruction of event source position.

Event source distribution map: (a) distribution of original event sources and (b) distribution of restructuring event sources.
Performance analysis
In our proposed CS-DCES scheme, there are three factors affecting the performance of the algorithm, namely, attenuation coefficient of event source
We carried out simulations with different attenuation coefficient

Attenuation coefficient impact on performance.
We further compare the CS-URA algorithm with our CS-DCES when

Comparison of CS-DCES and CS-URA.
Second, we investigate how the distance

Event source distance impact on performance.
Finally, we evaluate the impact of the number of event sources

Number of event sources impact on performance.
Randomly deployed networks
As a matter of fact, it is difficult to regularly deploy nodes in practice. Therefore, we add the CS-DCES simulation under random deployment. As shown in Figure 11, although the performance of CS-DCES becomes worse compared with regularly deploying, the performance curves of CS-DCES are still superior to the CS-URA ones with lower measurements, and they finally maintain consistent when the measurements increase. Furthermore, the number of measurements is 27 and 92 for CS-DCES and CS-URA when SNR = 24, respectively. Figure 12 indicates that the first sensor node is dead after gathering 242 rounds, while the whole nodes are dead after 320 rounds in CS-URA. However, in CS-DCES, the first sensor node is dead after gathering 1702 rounds and the whole nodes are dead after 1991 rounds, with effectively prolonging the lifetime.

The relationship between reconstruction SNR and the number of measurements under random deployment.

Network lifetime under random deployment.
Conclusion
In this article, a CS-DCES algorithm was proposed for data gathering in WSNs, which takes the event sources using compressive sensing and clustering strategy into consideration. To increase the data spatial correlation of a cluster, our proposed CS-DCES algorithm obtains the event sources locations and the dynamically cluster nodes from the location information, leading to a decrease in the unique sparsity, an increase in common sparsity and a reduction in the total sparsity and measurements of each data gathering. More explicitly, we employed the cluster head rotation mechanism to distribute the traffic load more evenly. Simulation results and analysis indicate that the proposed CS-DCES algorithm is capable of minimizing the network communication cost and achieving better balanced consumption throughout the network, while preserving relatively the same reconstruction accuracy under both regularly and randomly deploying nodes. The proposed scheme effectively prolongs the networks lifetime. Finally, we analysed three performance factors, namely,
In current scenario, we consider the wireless channel in WSNs completely reliable, which is assumed by many existing researches. However, the unreliable links in WSNs is common and the performance of CS-based data gathering scheme is sensitive to unreliable links. As to the future research, it is worthy investigating the application of CS to data gathering with unreliable link.
