Abstract
Keywords
Introduction
Wireless sensor network (WSN) has paved a gateway to connect back to the wired world. It is a data-centric network that routes the required data to their destination, irrespective of the location from where it is passed. The sensor nodes are randomly deployed in these networks such that the position of these nodes is non-predetermined as given by Akyildiz et al.1,2 These lightweight sensors have restricted the amount of energy and are responsible for continuously monitoring the environment that they are deployed in. They are capable of sensing physical parameters such as temperature and humidity in the surrounding and send these data to the base station (BS) located either outside or inside the network. As the number of nodes sensing a given region increases, the fault tolerance of the network increases. It also leads to precision in the observations made by the sensors in the network. Thus, the distribution of nodes and their density plays a significant role in the scalability of the network.
As each node contains a restricted amount of resources, data fusion and aggregation becomes an indispensable aspect. This forms a base for many algorithms that are structured. In data gathering, the efficiency of a sensor node is measured by decreasing the overheads involved with the data packet transmission and distributing the same load equally among all nodes. Therefore, an efficient data gathering approach is required to reduce the burden imposed on the master node. Data dissemination involves distributing the data in the network in such a way that it reaches the BS by consuming a minimum amount of energy available for the nodes. This forms a basis for formulating different clustering and routing protocols. Also, due to the high-density node deployment, the problem of data collision is very prominent in these routing algorithms. Hence, WSN, when integrated with cognitive radio (CR) technology, helps to overcome these limitations, by providing an opportunistic manner of accessing a shared spectrum by multiple users.
A unique capability of the CR network is to vary its transmitting and receiving parameters, by estimating the adjacent radio conditions in collaboration with other nodes. 3 Cognitive radio–based sensor node (CRSN) is best suited for the wide range of application-specific requirements offering several potential benefits. It has been proposed as one of the most promising technologies to address spectrum access and utilization challenges in cognitive radio wireless sensor network (CRWSN). 3 Hence, cognition technique when properly incorporated in WSN could bring about a lot of improvement in their performance.
Performance evaluation, by comparison, reveals that the proposed work is improved in terms of network lifetime, stability, and energy efficiency. The rest of the article is organized as follows. Section “Related works” gives an idea about the various algorithms that exist with their merits and the demerits. The next section “Description of the proposed algorithm” describes the newly proposed work, and their simulation results for different scenarios are given in the section “Performance evaluation and result analysis.” The last section “Conclusion” explains the inferences that could be drawn out of the proposed work and the future enhancement which can be made.
Related works
CR technology is an imaginative radio design logic that builds range usage by taking advantage of unused and under-used ranges in progressively changing conditions. 4 It is stated by Joshi et al. 5 that the cognitive method is the way toward knowing through observation, arranging, thinking, acting, and persistently refreshing and updating with a background marked by learning. This distinctive capability of sensing the environmental parameters helps to enhance the lifetime of the network. In a CR communication environment, there are two types of users—the primary users, which are given access to the licensed spectrum, and the secondary users, which are given access to the unlicensed spectrum. The spectrum sensing is carried over by the CR nodes and data transmission takes place efficiently over the spectrum in a more opportunistic manner.
Akan et al. 6 state that the primary unit which differs between a classical sensor node and CRSN node is the CR transceiver unit as shown in Figure 1. Generally, in any sensor network, all the events are sensed by a sensor node and are communicated to nearby nodes that act as a sink for the sensed information. By this way, all nodes communicate data to the BS through multi-hop routing. This intermediate node which helps to transmit data from a cluster head (CH) to BS consumes more energy due to transmission and reception and dies out fast. Hence, the distance between the source and the BS plays a vital role. Also, while transmitting, the data could undergo collision and thus packet retransmission has to be done often. Hence, an enhanced clustering and routing protocol along with a defined allocation of the spectrum band avoids all the above-mentioned issues. This is accomplished when a CR-based WSN network is formed. As explained by Akyildiz et al.1,2 and Cook and Das, 7 the CR-based WSN nodes change their working parameters for channel access so as to expand the lifetime of the network. Earlier, a conceptual design for CR-based WSN was given where, for the same transmission power, the maximum communication range in a CR channel was almost doubled. 8 Hence, the throughput of the network was also increased.

Architecture of cognitive radio–based sensor node.
Grouping of the nodes in the network into a set of small clusters with one node appointed as a CH is called clustering. 9 Although clustering is an old concept, choosing an intelligent strategy that could optimize the size of the cluster and can select the most eligible CH is still being researched profoundly. Clustering is helpful in increasing the scalability of the network by reducing the direct communication to the BS from the nodes that are far away. It also helps to accomplish stability and network scalability, as well as to sustain shared tasks, such as channel detecting and channel access, which are the basics of CR operations. 10 The process of electing a CH depends on various factors such as residual energy, number of rounds to which a given node has not become the CH, and finally the optimum number of CHs that are elected by the network for each round. A conceptual design for CR-based WSN was given by Cavalcanti et al., where the author stated that for the same transmit power, the maximum communication range in CR channel is almost twice. 8 Hence, the throughput of the network could also be increased.
There have been many algorithms designed for node clustering and routing in WSN. These clustering protocols cannot be used in CR-WSNs because the procedure to elect the CHs varies in CR-WSN algorithms as compared to the conventional WSNs. LEACH (Low-Energy Adaptive Clustering Hierarchy) algorithm 9 is one of the basic homogeneous cluster-based energy optimization algorithms used for improving the lifetime of the nodes in WSN. Even though they work well for small-sized networks, they became less stable when the size of the network was increased causing load imbalance. Also, it did not take into account the residual energy of the node, which was taken into consideration in the DEEC (Distributed Energy-Efficient Clustering) algorithm. 11
Later, SEP (Stable Election Protocol) algorithm 12 was proposed, with three-level node heterogeneity (as normal nodes, advanced nodes, and super nodes). This approach helped to improve the lifetime of the network with the same CH election technique used in LEACH. Only one difference should be cited was the modified probability function for becoming the CH and the threshold function. One of the limitations of the LEACH algorithm is its homogeneity, where the nodes that are selected as CHs have to perform more functions than the normal nodes. This caused the energy of those nodes to drain out faster. The performance evaluation of SEP showed an enhanced network lifetime and increased stability as compared to LEACH, but on increasing the network size, the nodes far away from BS started dying out faster. Later, another three-level heterogeneous network was introduced in EDEEC (Enhanced Distributed Energy-Efficient Clustering) algorithm. 13 Here, the transmission of data packets incurs energy dissipation depending upon the distance between the sender and the recipient. This process continues until all the nodes exhaust their energy and die. These algorithms increased the network lifetime but did not provide good network stability.
An advanced version of a cluster-based heterogeneous network called MCR (Multi-hop Clustering and Routing) algorithm 14 was found. Here, the elected CHs do not transmit the data directly to the BS. Instead, each CH transmits its data to a nearby CH until it reaches the BS. This strategy helped in increasing the scalability of the network as only a small number of CHs were interacting with the BS and therefore energy dissipation was reduced. Also, the value of heterogeneity was considered to be two which led to an enhanced performance.
In MCR, the election of CH was based on the weighted election probability (WEP) of the nodes where one can make a high-energy node as a CH more frequently. This reduces the burden on the low-power nodes and therefore helped in increasing the network lifetime. Also, by taking high initial energy for the network, throughput and network lifetime increased as it took more time for 50% of nodes to die. As compared to the chain-based algorithms such as PEGASIS (Power-Efficient Gathering in Sensor Information Systems), 15 the reliability increased as the information was passed to the CH through a limited number of hops. Although the MCR algorithm performs well on the grounds of network lifetime and reliability, it fails on the grounds of network stability. Despite these demerits, MCR was able to outperform SEP and LEACH protocols.
Later, another protocol called EEMHR (Energy-Efficient Multilevel Heterogeneous Routing) algorithm 16 was proposed. The main variation in this algorithm was that the heterogeneity of the network was not restricted to two, as was in the case of MCR algorithm. Moreover, the value of the threshold was dependent on the total number of nodes alive and the total number of nodes present initially. The main problem with this algorithm is the increased number of CH at the later stage of the network, which greatly increases the packet loss due to the collision. Also, the stability of the network was poor compared to other similar algorithm, EEMDC (Energy-Efficient Multilevel and Distance Aware Clustering).
The EEMDC algorithm 17 is the first algorithm to consider the factor of distance in assigning the CH probabilities. In this, the complete network was divided into three layers with heterogeneous nodes deployed. These different layers and nodes in these layers had different hop counts. It also incorporated average energy calculation while electing a node as a CH. This strategy helped in optimizing the size of the cluster and the nodes that are nearer to the BS as a preferred option for being the CH. Even though the given algorithm was highly stable and the network lifetime was prolonged, it increased the channel contention caused due to many complex processes involved in selecting the eligible CH and the optimum path to transfer the data. Hence, packet loss due to the collision was very high compared to EEMHR.
When compared with the MCR algorithm, the number of overheads was high in EEMHR and EEMDC. Hence, in order to improve the performance of the MCR algorithm in terms of scalability, stability, throughput, and network lifetime, Tyagi et al. 18 formulated an algorithm called DBMCR (Distance-Based Multi-hop Clustering and Routing). The only difference between MCR and DBMCR is that in order to avoid collision of data packets, the concept of spectrum allocation is introduced. This is one of the major characteristics of CR network which helps to overcome the problems faced by bursty communication of a normal sensor network. By exploiting the spectrum in an opportunistic fashion, CR enables users to sense which portions of the spectrum are available, select the best available channel, and coordinate spectrum access with other users. 19
In this, the WEP was formulated on the basis of the distance of the nodes from the BS. The whole network is divided into three regions based on the distance threshold as shown in Figure 2. Different spectra are allocated to the first two regions. Flat routing protocol is used in the first and second regions and the third region uses the distance-based clustering technique. The region which is near the BS used direct communication for transmission. First-order radio model was used and the rest of all the design considerations were followed similar to the MCR algorithm. By allocating a spectrum to each region, it was found that the time required for the first node to die was higher as compared to the MCR algorithm and SEP algorithm. A similar trend was observed in the case of the number of rounds before half the number of node dies. In case of the number of alive nodes, it was observed that after certain rounds, DBMCR equals MCR. Later, MCR starts declining faster than DBMCR.

Division of the network regions in DBMCR.
Due to a large number of applications associated with the WSNs, it is necessary to formulate an algorithm that could efficiently deal with the problem of routing and clustering along with reducing data loss due to channel contention. Although existing algorithms have taken care of either network lifetime or effective spectrum allocation among the nodes, the approaches having both the requirements are rare to find. Therefore, a new algorithm that can outperform the existing above algorithms on the basis of stability, scalability, network lifetime and solve the problem of packet loss occurring due to channel contention in the network is proposed. In order to make the proposed algorithm energy efficient and the process of clustering more effective, the concepts discussed in the above-said algorithms were amalgamated with CR spectrum allocation technique.
Description of the proposed algorithm
In the proposed algorithm, the advantages of distance-based clustering and multi-hop routing have been amalgamated with effective spectrum allocation strategy, in a heterogeneous CR sensor network environment. Deploying heterogeneous nodes, partitioning the region for better spectrum allocation, electing CH based on the average distance coverage range, and assigning cluster member (CM) based on average threshold distance are the major features that have been introduced in this algorithm. In order to mathematically analyze the proposed model, the nomenclatures used are given in Table 1.
Abbreviations used.
CH: cluster head; BS: base station.
Consider the network area of size
Energy consumption model
The implementation of proposed algorithm was done using first-order radio model similar to the one proposed in LEACH, 9 SEP, 12 MCR, 14 and EEMHR 16 algorithms and is shown in Figure 3
Each node will use the energies calculated by equations (1)–(3) for transmitting and receiving a “

Radio energy model.
Division into regions
A network of area

Division of the network region with different spectra in the proposed algorithm.
The radius of region 1
Whether the location of the BS is at the center or at the corner, for the above condition to hold true
By considering that the areas of region
Nodes of regions
Spectrum allocation in regions
After dividing
Heterogeneity
In order to define the heterogeneous model of the network, consider
The percentage of advanced nodes is taken as
The total number of normal nodes is
Out of the total number of advanced nodes,
Energy associated with each super nodes is
Therefore, the heterogeneity of the network comes out to be
And, the total increment in the initial energy of the entire network is
Calculation of optimal probability to elect a CH

Scenario when the BS is at the corner.

Scenario when the BS is at the center.
First, the optimal value for a multipath environment which is given by equation (6) is considered
Initially, consider the case when the BS is located at a corner, as shown in Figure 5.
Scenario 1
Location of the BS: corner;
Path loss factor
Expected value for
The value for multipath environment was found to be given in equation (7)
Scenario 2
Location of the BS: corner;
Path loss factor
Expected value for
The value for multipath environment was found to be given in equation (8)
Next, consider the case when the BS is located at the center, as shown in Figure 6.
Scenario 3
Location of the BS: center;
Path loss factor
Expected value for
The value for multipath environment was found to be given in equation (9)
Scenario 4
Location of the BS: center;
Path loss factor
Expected value for
The value for multipath environment was found to be given in equation (10)
On the basis of the above formula, the value of optimal probability could be calculated as given by equation (11)
Calculation of WEP
After the calculation of optimal probability, it is necessary to find the election probability associated with each type of node, that is, election probability associated with normal nodes, advanced nodes, and super nodes. Since the weight of each type of node is multiplied with the election probability, the complete product is known as WEP. Hence, in order to make the algorithm more efficient, a factor of the distance of nodes from the BS called
Initially, calculate the average distance of all the nodes which are participating in cluster-based communication using equation (12)
where
After average distance calculation, nodes are assigned their WEPs according to the following rule:
If
Else, if
where
Threshold calculation
The final step toward a CH election is the calculation of threshold for each type of node. As the threshold value is decided, each node generates a random number which is then compared with the value of threshold of that type of node. If the value of random number is less than the value of the threshold, that node is selected as a CH.
Correspondingly, the value of threshold is calculated below as in equation (19).
If
Otherwise
On the basis of equations (19) and (20), the new threshold is calculated. The clusters are formed and the transmission is initiated.
Choosing a minimum path for the CHs
After the election of the CHs, data are transmitted from each of the CM from their respective CH. These CHs transmit the received data to the BS. If direct transmission fails, then the CH seeks for another CH present at a minimum distance from it to transmit the data. Initially, the location of each CH is broadcasted by the BS to all the sensor nodes. With this information, the processor inside each node helps to calculate this minimum distant CH.
Cluster range
One of the important modifications that is being incorporated in this algorithm is the restriction of a CH’s range on the basis of the average distance of the region. Using equation (21), the range of the CH can be decided
Now, the region with a high value of average distance will have smaller cluster ranges as the CH’s location could easily cover far away nodes present in the network. This avoids the connectivity loss in the network among nodes.
Routing
Routing is done by allocating a time slot to each node under the time-division multiple access (TDMA) scheduling and each node waits for its turn to transmit its data, as shown in Figure 7. The CHs which are far away transmit data through a multi-hop transmission, as shown in Figure 8, and the nodes which cannot fall under any cluster transmit the data through a chain process, as shown in Figure 9.

Clustering and routing of data.

Multihop routing.

Scenario with no CH formed.
Simulation parameters
MATLAB software20,21 is used in order to do the extensive simulation of the proposed algorithm, and Table 2 gives the parametric setting used for the network analysis.
Simulation parameters.
Flow chart of the algorithm
The steps involved in creating and simulating the proposed algorithm are given as a flow chart in Figure 10.

Flow chart for the proposed algorithm.
Performance evaluation and result analysis
Energy efficiency
Residual energy and throughput are the two main parameters which aid in analyzing the energy efficiency of any network.
Residual energy
The residual energy analysis gives a comparative study on the amount of energy consumed by the nodes for transmitting and receiving data in a network. This helps in distributing the load among all nodes equally and thereby helps in increasing the network lifetime. The analysis from Figures 11 to 14 proves that the proposed algorithm has better residual energy as compared to SEP, MCR, and DBMCR algorithms.

Residual energy of the network for an initial energy

Residual energy of the network for an initial energy

Residual energy of the network for an initial energy

Residual energy of the network for an initial energy
Throughput
The amount of successful data transmission from a node to the BS is measured from the throughput analysis. It is calculated in terms of the amount of data bits transmitted per time slot. This helps to detect the unsuccessful transmission and request the node for retransmission, which mainly contributes for the energy decrease among the nodes. Figures 15–18 give a comparative analysis and prove that the proposed algorithm has improved throughput compared to SEP, MCR, and DBMCR algorithms.

Throughput for an initial energy of

Throughput for an initial energy of

Throughput for an initial energy of

Throughput for an initial energy of
Improvement of energy efficiency
From Table 3, the percentage increase in residual energy for the proposed algorithm can be calculated. Similarly, the simulation results of residual energy along with the throughput analysis also prove that the proposed algorithm has increased energy efficiency as compared to SEP, MCR, and DBMCR algorithms. Due to this, the amount of successful data transmission without any collision has increased. Also, the introduction of heterogeneity and assigning spectrum to different regions helps to distribute the load evenly in the network. This consequently improved the energy efficiency of the proposed algorithm.
Analysis of residual energy for proposed algorithm.
SEP: Stable Election Protocol; MCR: Multi-hop Clustering and Routing; DBMCR: Distance-Based Multi-hop Clustering and Routing.
Stability
First node death
It is the measure of the death of the first node of a network. Figures 19–22 give a relative analysis of the rounds for which the first node death (FND) happens for the proposed algorithm in comparison with SEP, MCR, and DBMCR algorithms.

First node death (FND) with initial energy

First node death (FND) with initial energy

First node death (FND) with initial energy

First node death (FND) with initial energy
Improvement of network stability
In order to assess the overall network performance, the percentage increase in the stability is given in Table 4. From this analysis, it is found that the proposed algorithm has delayed the FND and has kept the nodes alive for a longer duration compared to others. This delay in the FND signifies that the proposed algorithm has better stability. Dividing the regions and clustering the nodes based on distance, along with the transmission of data through the allocated spectrum, has greatly avoided the collision of data transmitted and has retained the energy of the nodes. Also, the nodes that are lying outside the average distance circle in the given region will have least probability of becoming the CH. Therefore, by reducing their chances of becoming a CH, the number of rounds before which the first node’s death occurs is increased. Hence, these factors have improved the stability of the network as compared to SEP, MCR, and DBMCR algorithms.
Analysis of stability for proposed algorithm.
SEP: Stable Election Protocol; MCR: Multi-hop Clustering and Routing; DBMCR: Distance-Based Multi-hop Clustering and Routing.
Scalability
Scalability of a network is measured by calculating the number of rounds till 80% of nodes die from FND. From Figures 23 to 26, the rounds at which 80% of nodes die are analyzed. By comparing both the simulation results, the FND and 80% of nodes death, it was found that the proposed algorithm was able to prolong the death of 80% of nodes for a longer duration than FND. Hence, the scalability of the proposed algorithm is found to be improved as compared to SEP, MCR, and DBMCR algorithms.

Rounds for 80% node death with an initial energy of

Rounds for 80% node death with an initial energy of

Rounds for 80% node death with an initial energy of

Rounds for 80% node death with an initial energy of
Improvement of network scalability
As stated earlier, FND and 80% node death measure the scalability of the network. Table 5 gives the percentage improvement of the proposed algorithm in comparison with SEP, MCR, and DBMCR. Also, with the simulation result of 80% node death in comparison with FND, it is quite evident that the proposed method is able to keep the network alive for more rounds. This proves that the network is highly scalable and has the ability to keep the nodes alive for a longer duration.
Analysis of FND for proposed algorithm.
SEP: Stable Election Protocol; MCR: Multi-hop Clustering and Routing; DBMCR: Distance-Based Multi-hop Clustering and Routing; FND: first node death.
Network lifetime
To verify the proposed algorithm on the basis of network lifetime, the simulation results for the number of nodes alive are given in Figures 27–30.

Number of nodes alive for initial energy of

Number of nodes alive for initial energy of

Number of nodes alive for initial energy of

Number of nodes alive for initial energy of
Improvement of network lifetime
The study of Table 6 shows that the round for which the half node death (HND) occurs is high compared to other algorithms. Since the FND also plays a significant role, its analysis is also considered. It is also evident that after the FND, the energy of all the other members takes a gradual decline in the proposed method. The distribution of load equally among all nodes and allocation of different spectra have retained the energy and have delayed their FND and HND.
Analysis of HND for proposed algorithm.
SEP: Stable Election Protocol; MCR: Multi-hop Clustering and Routing; DBMCR: Distance-Based Multi-hop Clustering and Routing; HND: half node death.
From Table 7, it is observed that the proposed algorithm was able to give a better percentage improvement of the network lifetime as compared to SEP, MCR, and DBMCR algorithms by a huge extent. The energy-efficient approach of finding an average distance for CH formations and calculating a threshold distance for electing cluster members has greatly helped in reducing the energy used by the nodes for transmitting data to longer distance. Along with this, the division of regions into different parts and assigning the available licensed spectrum has limited the communication of nodes present in those regions to their respective range without any collision.
Analysis of network lifetime for proposed algorithm.
SEP: Stable Election Protocol; MCR: Multi-hop Clustering and Routing; DBMCR: Distance-Based Multi-hop Clustering and Routing.
Conclusion
The proposed algorithm works more efficiently and improves the network stability by a significant proportion with respect to SEP, MCR, and DBMCR. By apportioning the considered area into different regions and parts and by varying the intra-cluster distance range of each CH, the energy dissipation due to data aggregation faced by each CH was reduced significantly. Evidently, using the proposed algorithm, the lifetime of the network has been extended by an average of 74.13% with SEP, 52.9% with MCR, and 33.9% with DBMCR algorithms. The proposed work could be further extended by considering the clustering process with additional parameters such as channel availability and CH selection time along with simulation of the algorithm on an uneven three-dimensional (3D) terrain, which could fetch more realistic results.
