Abstract
Introduction
Underwater acoustic networks (underwater wireless sensor networks (UWSNs)) consist of abundant low-cost sensor nodes tied to underwater vehicles, and the nodes are deployed to monitor the underwater environment collaboratively over the interest area. 1 In order to explore the underwater world, UWSNs have attracted wide attention, and many specific applications have emerged, such as environmental monitoring, natural disaster prevention, and distributed tactical surveillance, where the node localization is always very significant. 2 If each sensor node cannot provide its accurate coordinate, the data collected by sensor nodes may give wrong interpretations for the physical events. 3 However, the underwater environment is more complex than the terrestrial environment, and the underwater characteristics bring several new challenges as follows. (1) The nodes with limited batteries are more prone to be exhausted, so they should be recharged timely. Unfortunately, it is very hard to access underwater nodes. 4 (2) Radio wave is not feasible underwater, because it requires a large antenna and a high transmission power, and thus the acoustic communication becomes the typical physical layer technology in UWSNs. Nevertheless, the acoustic channel is characterized by its limited bandwidth, high bit error rate, path loss, motion-induced Doppler shift, and so on. 5 (3) Underwater sensor nodes are liable to move or be damaged6,7 due to the water current caused by external forces such as earthquake, tide, wind velocity, underwater creature touch, or strong electromagnetic interference, which will lead to a dynamic network topology. 8 All above unique characteristics are possible to result in the damage or inaccurate localization of beacon nodes.
Generally, a typical architecture for three-dimensional (3D) UWSNs is shown in Figure 1, where there are three types of nodes: surface buoys, beacon nodes, and ordinary sensor nodes.
9
The surface buoys can get the coordinates from their equipped global positioning system (GPS). The beacon nodes are submerged underwater, and thus GPS is not feasible for beacon nodes. They should communicate with the surface buoys to obtain their

A typical architecture of UWSNs.
Several error beacon nodes which cannot provide accurate references for ordinary nodes should be filtered out to avoid the aggravation of localization error. However, it is very difficult to find the error beacon nodes because we usually have no pre-knowledge about the error ones, and thus the error beacon nodes can be filtered out according to the mutual localization results among all beacon nodes. This work is the extension of our early work, 10 the main differences between the two papers are as follows: (1) the algorithm is given more illustrations, (2) the theoretical analysis of algorithm has been improved and extended, and (3) more simulations have been done and supplied.
Related works
Localization schemes have been extensively investigated in wireless sensor networks or UWSNs, and these schemes can be divided into two groups: anchor-based schemes and anchor-free schemes. 11 In the anchor-based schemes, the beacon nodes get their coordinates in advance through carrying GPS receiver or even they are artificially pre-configured. The beacon nodes broadcast periodically their coordinate information. Subsequently, the ordinary nodes estimate their coordinates by calculating the distances or angles to the nearest beacon nodes, especially, some measurement techniques such as received signal strength indicator (RSSI), time of arrival (TOA), and time difference of arrival (TDOA) are usually utilized in the process. Zhang et al. 12 proposed a multi-anchor nodes collaborative localization (MANCL) algorithm. First, the well-localized nodes within one hop are prone to become the reference nodes if the ordinary nodes cannot receive four beacon signals, and the selection criterion is related to the energy, trust value, and distance. Then, an improved Euclidean distance estimation method is adopted to localize the ordinary nodes. Finally, the remaining un-localized ordinary nodes complete their localizations with the help of two-hop anchor nodes.
The anchor-free schemes 13 determine the ordinary nodes’ coordinates through exploiting the connectivity or distance information among nodes, and thus the assistances of beacon nodes are unnecessary. The anchor-free schemes are especially suitable for the networks where nodes are hardly deployed, such as the battlefield environment or special warfare environment. Generally, the network protocol without beacon nodes is more complex than that with beacon nodes.
In addition, the anchor-based schemes can further be classified into the static beacon node localization and mobile beacon node localization. In Cheng et al., 14 an underwater positioning scheme (UPS) is proposed, where ordinary nodes record the receiving time of beacon messages, and then the time difference is transformed into the range distance after receiving four beacon messages. Finally, the ordinary nodes apply the trilateration method to estimate their own coordinates. UPS reduces the communication overhead and does not require the time synchronization, so the cost of UPS is relatively low. In Rahman et al., 15 with the help of a mobile beacon node, Cayley–Menger is used to determine the node coordinates. The distance between nodes is measured through combining the radio and acoustic signals which are free from the phenomenon of multi-path fading. In Zhang and Liang, 16 the distance between nodes is calculated by a new ranging method named round-trip time of flight (RTOF), and then the ordinary nodes complete localizations using an improved particle swarm optimization (PSO) algorithm, which adds a Gaussian decreasing inertia weight and a kind of competition mechanism. This scheme can improve the localization accuracy and localization efficiency with less beacon nodes. Nonetheless, the above literatures do not take into account the mobility of underwater nodes, which are only applicable in static underwater networks.
The mobility issue in node localization has also been reviewed. Ojha and Misra 17 used spatially correlated mobility pattern of UWSNs to estimate the node coordinates. In the initial stage of localization, there are only three beacon nodes. If an ordinary node cannot get enough information, it will assume that the node moves according to some specific rules, and the future coordinates can be easily predicted. When the ordinary nodes can communicate with at least three beacon nodes at the original and predicted positions, and then the coordinates of ordinary nodes can be determined. The outstanding advantage of this algorithm is that it is energy efficient as a result of the “silent localization.” In Zhu et al., 18 a localization scheme based on mobility prediction for UWSNs is introduced. The localization process is divided into two parts: the beacon nodes utilize the modified covariance algorithm to estimate their prediction models to reduce the position error, while the ordinary nodes choose the well-localized reference nodes to get their positions and speed by a node-selection strategy. The algorithm increases the localization coverage and decreases the localization error compared with scalable localization scheme with mobility prediction (SLMP) algorithm. 19
But the prediction gives a poor accuracy especially when the underwater environment is hostile. A multi-hop location (MLA) in UWSNs is also proposed in Zhu et al., 20 where the routing nodes are introduced to solve the problem of isolated nodes. First, the shortest paths from beacon nodes to ordinary nodes are found through a greedy approach. Subsequently, the shortest paths are fitted into a straight distance using the cosine method. Finally, the trilateration is repeatedly performed to localize the ordinary nodes. This algorithm has much higher localization accuracy than determined maximum likelihood (DML) algorithm. 21
Some researchers also take notice of the measurement errors in localization process. Liu et al. 22 combined the time synchronization and the node localization, which corrects the bias in the range estimation and improves the propagation delay in estimation when the stratification effect of underwater medium is considered. In addition, in order to further increase the localization accuracy, an advanced tracking algorithm interacting multiple model (IMM) is employed to handle the mobile case. Wu and Li 23 proposed an improved underwater acoustic network localization algorithm, which considers the measurement error caused by the sound velocity distortion and signal refraction. It uses an improved linear difference method to correct the measurement offset, which improves the localization accuracy. Simultaneously, a strategy similar to the greedy algorithm reduces the redundancy of the calculation results.
However, none of these works take the issue of error beacons into consideration. If beacon nodes move, its coordinate information will become obsolete or even wrong. Therefore, the ordinary nodes will be positioned more inaccurately under the assistance of these error beacons. To deal with the error beacon problem, this article proposes the EBFA based on
EBFA based on K -means clustering
Suppose that plenty of sensors nodes are deployed in a 3D underwater space
Algorithm description
The EBFA based on
where
Moreover, each beacon should reserve a variable
The following example describes the EBFA algorithm briefly. Suppose that there are 10 nodes

Example diagram of node localization.
The diagram of beacons classification.
The diagram of value of
As is shown in Table 2, the value of
Time complexity of EBFA
The time complexity of EBFA is mainly contributed by the Step 1 to Step 4. The time complexity of Step 1 is
Mathematical analysis
In general, the deployment of sensor nodes tossed from the air to the ground obeys the normal distribution. Let
First, the real coordinate of the localized node is calculated from the following equation set
where
where
where
The error from trilateration algorithm executions should be taken into account. Set
Moreover, the localization error is expressed by the scalar
Formula (6) transforms the localization error into a scalar, and then the error can be analyzed from each axis. Let
Then the localization error expressed by the scalar
Furthermore,
Therefore,
Simulations
EBFA is evaluated by observing the performance variation when adopting different model parameters (such as the number of beacon nodes and the number of error beacons) and by comparing EBFA with other algorithms. Table 3 shows the values of the parameters.
Simulation parameters.
The accurate discovery of the error beacons is extremely critical to the localization accuracies of ordinary nodes. This simulation measures the number of found error beacons with different number of beacon nodes. As shown in Figure 3, three plots (the proportion is assigned as 20%, 30%, and 40%, respectively) are observed. The plot with a larger

Number of found error beacons versus rates of error beacons.
In Figure 4, the variance metric denotes the stability of the number of found error beacons. The variance is expressed as

Variance versus number of beacons.
In EBFA, the error beacons are filtered out according to a distance threshold, that is, the beacons with larger localization error are excluded. Therefore, the value of threshold has a significant influence on the performance of EBFA. Both Figures 5 and 6 illustrate the impacts of the threshold when rate is set 20%. In Figure 5, it can be found that when the threshold is set 10, the number of found error beacons is approximately equal to the number of real error beacons because the proper setting of threshold helps EBFA to find the error beacon nodes accurately. Nevertheless, when the threshold is too small, some accurate beacons are also mistakenly labeled as the error beacons. Moreover, when the threshold is too large, most of the error beacons are not found because EBFA cannot differentiate the accurate beacons and error beacons by a large threshold. Consequently, the proper setting of the threshold is important to the EBFA performance.

Found error beacons versus distance threshold.

Variance versus different thresholds.
As depicted in Figure 6, the threshold has an obvious impact on the variance number of found error nodes as well. In general, with the increase in the number of beacon nodes, the plots of variance continue to rise up. When the threshold is 5, the variance number is larger than the others, this is because the number of found error beacons is larger, and thus the deployment of error beacons becomes more random accordingly.
Figure 7 compares the number of found beacons of EBFA, Centroid, and Trilateration. Apparently, EBFA overcomes Centroid and Trilateration absolutely. In Centroid, almost all nodes are labeled invalid. This is because Centroid has a stricter requirement for the node distribution and it assumes that the nodes obey the uniform distribution, which does not tally with the random deployment in our simulations. Therefore, it is hard to find error beacons exactly. In Trilateration, all nodes involved in localization will be marked as error beacons provided that this localization result is wrong. Hence, some of the error beacons cannot be found. In EBFA, most of the error beacons can be found in an iterative way, and probabilities of falsely marking the beacon nodes are very small, but the found error beacons are usually a bit more than the real ones. The reason is that some accurate beacons served for the localizations of the neighboring error beacons are also prone to be regarded as the error beacons.

Comparisons of the number of found beacons and found error beacons: (a) number of found beacons and (b) number of found error beacons.
As shown in Figure 8, the localization error denotes the coordinate derivations of ordinary nodes from the beacon localizations. The results indicate that EBFA can achieve the lowest localization error after effectively filtering most of the error beacons. Moreover, the scalability of EBFA is also better than those of the other two algorithms, this is because the proportion of the number of found error beacons stays the same approximately, as shown in Figure 3.

Localization error comparisons.
In summary, when the proper distance threshold is set, EBFA can accurately filter out most of the error beacons, which effectively improve the localization accuracy of ordinary nodes. Moreover, EBFA performs a favorable scalability with the number of beacon nodes. Moreover, there may be some gaps from theory algorithms to practical application, and EBFA should be improved the practicality according to the test feedbacks in actual UWSN systems.
Conclusion
This article explores the problem of error beacons filtering. The error beacons will be found in an iterative way based on an improved trilateration and the
This work is based on the assumption that the number of error beacon nodes is fewer compared with the number of all beacon nodes. The probabilities of finding the error beacon nodes will become smaller when the proportion of error beacon nodes becomes larger. Our future work will focus on the issue that how to revise the coordinate of error beacons such that they can still be available for coordinate references.
