Abstract
Introduction
With the rapid development of Internet of things (IoT), the demand for positioning and navigation is becoming more and more popular. Some mature solutions for outdoor localization, such as Global Positioning System (GPS) and BeiDou Navigation Satellite System (BDS), are unsuitable for indoor localization because of the poor penetration of the satellite signal. Accordingly, a number of commercial systems and research prototypes have been proposed for indoor localization.
The time-based indoor localization system, using ultra wideband (UWB) radio, is the most accurate method. But high costs and complexity limit its spread. Another solution is based on the geomagnetic field. By utilizing a magnetic sensor, building’s anomalies in the Earth’s magnetic field can be detected to track a person’s location indoors. This technology is mainly used in the smart phone. The drawback is that it must collect the magnetic field of the building before locating. Some other solutions aim at mobile wireless sensor networks (MWSNs), 1 which has focused on outdoor localization with low-cost and low-power consumption partly based on GPS.
RSSI-based algorithm is a promising approach when the IoT application demands low cost and low complexity. In an ideal situation, the log-normal distance path loss (LDPL) model 2 can exactly describe the relationship between distance and RSSI, then trilateration algorithm can calculate coordinates. However, LDPL model has poor performance in real environments. Because RSSI would be easily affected by many interferences like multipath or obstacles attenuation, it makes the RSSI unreliable and leads to low position accuracy. As a result, most RSSI-based research works have focused on the idea of “fingerprinting” to improve the accuracy in recent years, which has been getting many valuable results. However, it demands prior knowledge of environments, which means a limitation for application in changeable environments like construction. In this case, trilateration and LDPL model should have been more suitable. Nevertheless, few studies have paid attention in this direction for the unreliable relationship between RSSI and distance, especially in harsh environments. To address the problem, this article proposes a novel filter for distance from LDPL model, and modifies the trilateration algorithm to calculate coordinates.
The main points are as follows. First, we calculate the distances between beacon nodes (BNs) and target nodes through LDPL model. Second, based on analyzing targets behavior, we use curve fitting (CF) technique to smooth raw data in a series of overlapping time slices (CFOTS). Based on the characteristics of CF, smoothed data will become a series continuous curve. We fused them to one distance curve using the weighted average algorithm on the overlapping area. And then, an algorithm, modified trilateration named distance ratio location (DRL), is proposed to estimate targets approximate location based on these distance curves. Finally, the locations will be smoothed by Kalman filter (KF). This method has been applied in WuHan Metro construction. It is an extremely harsh environment. Large equipment and construction operations can lead to a very complex multipath, electromagnetic interference, data-packet loss sometimes. Moreover, the environment will change with the construction schedule. Compared with other localization algorithms, the method can achieve acceptable results.
The contributions of this work can be summarized as follows:
Traditional filter has always tried to smooth the noise from raw RSSI data, but it is too difficult for the weak relationship between distance and RSSI. Therefore, we start analyzing in targets’ behaviors and summarize them into three movement modes.
Based on the analysis, we propose a filter using polynomial CF to smooth the distance between target node (TN) and BN. With the characteristic of CF, discrete distance points can be smoothed into a continuous curve.
We modify trilateration to a new algorithm and name it as DRL. Instead of calculating the absolute accurate coordinates, DRL tries to estimate the approximate location. We applied the system in metro construction, and the results have shown a significant improvement compared to other existing works.
The rest of this article is organized as follows. Section “Related work” gives a short summary of the related work for indoor localization. We describe their drawbacks and present the advantages of the method proposed in this article. Section “System model” describes the system model used in this article. In section “Localization method design,” we first give a high level description on the problem studied. Then, the distance filter and DRL algorithm are introduced in detail. Section “Practice results and evaluation” presents the evaluation results and section “Conclusion” concludes the article.
Related work
The physical measurements used in most indoor localization approaches can be classified into three categories: 3 time, angle, and power features. Some other hydra indoor location systems, using ancillary features, need the extra hardware, which means increasing cost and implementation difficulty.
Time features employed in ranging are usually in two forms: time of arrival (TOA) and time difference of arrival (TDOA). The accuracy of time-based ranging depends on the bandwidth of communication. The best resolution of wireless communication physical layer for time feature is the UWB. There are so many research prototypes focused on this.3–6 The main directions of research aim to resolve multipath and penetrate obstacles.4–7 On the other hand, some mature commercial localizations have emerged, such as PLUS and Ubisense. They are both based on UWB and have a very high precision (approximately 10–15 cm). The Ubisense system uses both TDOA and AOA to estimate the location of specific tags, and the PLUS system uses the TDOA mechanism. Their main drawback is the very high cost, 8 Ubisense and PLUS’s costs all exceed US$7000 for just four BNs. Furthermore, obstacles, especially metal, will seriously interfere with the PLUS system after running our tests.
Angle features are mainly used in angle of arrival (AOA) or direction of arrival (DOA) algorithms. According to the difference in signal forms, they can be further divided into two subclasses: (1) using the antennas amplitude response and (2) using the receiver antennas phase response. The first category is named beam forming and the other is phase interferometry. Directional antennas can be used to estimate both angle and distance, and then the location of target can be located.9,10 Single access point-based indoor localization (SAIL) 9 combines physical layer (PHY) information and human motion to compute the propagation delay of the direct path by itself, eliminating the adverse effect of multipath and yielding submeter distance estimation accuracy. Each BN, equipped with a switched-beam antenna, can estimate both the azimuth and the elevation of the incoming signals to locate the target. 10 Self-designed three-element antenna arrays are used to localize radio frequency identification tag (RFID) tags with off-the-shelf device in Azzouzi et al. 11 Array track uses multiple-input and multiple-output (MIMO)-based techniques to track wireless clients at a real time.12–15 Actually, obtaining the angle features generally requires RSSI. A drawback of this kind of localization method is that multiple antennas are required, which makes the localization system more complicated and difficult to implement. Therefore, using off-the-shelf platforms like Yang et al., 16 apparently, is not suitable for construction environment.
Power features in indoor localization systems are generally divided into two forms: Channel State Information (CSI) and RSSI. First, CSI can leverage the finer grained power feature and channel response to discriminate multipath characteristics. 17 CSI works in the PHY layer. Accordingly, professional equipment is necessary. 18 Although channel response can be partially extracted from off-the-shelf frequency division multiplexing (OFDM) receivers in the format of CSI since 802.11 a/g/n standard, 19 CSI is also difficult to use on other platforms. Second, although some articles 20 question RSSI’s accuracy, it is still widely used in many indoor localization systems for its low cost and simplicity to implement. RSSI can be used in both geometric mapping and “fingerprinting” algorithm, but the “fingerprinting”-based approach is more popular21,22 for its higher accuracy. RADAR is probably one of the first examples of an indoor localization system using signal strength. 23 RADAR employs an empirical model to resolve the errors caused by multipath, metal reflection, and other noises. It can achieve an accuracy of 2–3 m. But there will be a sharp decline in accuracy when significant changes in the environment occur, 23 and this is also a major drawback for all “fingerprinting”-based localization algorithm. Some approaches try to solve this problem. For instance, Wang et al. 24 use a particle swarm optimization (PSO)-based back propagation (BP) neural network (PSO-BP) method to determine the relationship between RSSI and distance. But they need plenty of reference tags deployed as equilateral triangles, which is impossible in some harsh environments. Blumrosen et al., 25 using the channel path-loss model, performs a set of offline tests of the system that attempt to match system parameters to the experiment’s environment using different RSSI measurements at predetermined locations. But it improves little in the metro construction environment. Researches 26 on geometric mapping generally focus on smoothing RSSI and position results in order to obtain improved location information. However, geometric algorithm-based localization system still cannot obtain an acceptable accuracy in harsh environments. Some other approaches tried to solve the problem by fusing multisensors. For example, Chen et al. 27 present a weight loss path of WiFi with pedestrian dead reckoning (PDR) to achieve high accuracy. But some sensors, like accelerometer, magnetometer, and gyroscope, have a large power consumption.
High accuracy and low cost are the two main goals in designing localization systems. Unfortunately, we cannot take them both in most cases.
System model
A target can be a person, a vehicle, or other mobile object, and we consider it as a TN in this article. Accordingly, we consider the nodes with known coordinates as BNs.
Each TN wears an electronic tag. The tag transmits signals periodically, and each BN records the RSSI when receiving the signal. Localization algorithm will run in a central server. Accordingly, necessary information for algorithm, including the BN network topology, LDPL model, and so on is stored in the server.
The LDPL mode describes the relationship between RSSI and distance as logarithmic, which means that the more the distance between TN and BN, the lower the sensitivity of RSSI is. In other words, the RSSI error, recorded by hardware, will be too large to be acceptable when the distance is beyond a certain extent, even if the environment is ideal. Therefore, we should deploy the BNs closely as possible. CC1101 is the radio frequency chip used in the application. According to its measurement accuracy, the distance between the adjacent BN should be less than

BN deployed in part floor plan of the construction field.
DRL algorithm, calculating the position from the distance curve, demands that the quadrilateral made by four adjacent nodes must be a parallelogram, and a rectangle will be better. In other words, in Figure 1, distance
Localization method design
LDPL model
The distance between TN and BN can be calculated by LDPL through RSSI, and then TN’s coordinate will be calculated by localization algorithm based on BN network topology.
The LDPL model is the theoretical basis in this algorithm. It is illustrated as equation (1), and then equation (2) can be deduced, which indicates the intuitive relationship between RSSI and distance as 2
In equation (1), where
Behavior analysis
Theoretically, in the ideal state, there is only one factor—–distance—-can influence RSSI, and the relationship between distance and RSSI is positive correlated. Therefore, three basic patterns of TN behavior will influence the RSSI from one single BN: (1) approaching the BN means the distance decreasing, (2) being away from the BN means distance decreasing and (3) keeping a constant distance.
Through experimental observation, a TN will be of uniform motion or stationary in most cases. The speed can change, but it only occurs in a very short period of time in most cases. For example, we illustrate some typical cases of a TN movement in Figure 2 and Figure 3 shows the distance variation curves between the TN and BN. It is easy to find out that the TN movement consists of three modes mentioned above: Pattern (1) means rising, Pattern (2) means decline, and Pattern (3) means a straight line. We subdivide the time into a series of small time slices, and then the distance curve will be divided into a series of small fragments. In Figure 3, the distance curves are divided by longitudinal dotted lines to fragments in every 5 s. In each fragment, we can find that the changing behavior mode occurs at once, all curves are similar to parts of a parabola curve or a straight line.

Some typical cases of a TN movement: (a) pedestrians walking in a certain direction, (b) turn a small bend while walking, (c) turn back, and (d) pedestrians move around in a small area.

Four curves of typical TN movement cases.
Based on the analysis above, we can divide the distance variation curves by time slot for easy filtering.
Filter using CF in overlapping time slices for distance
The distance, calculated from RSSI by LDPL, is unreliable for interference. A filter algorithm is proposed as follows.
We subdivided the time into a series of slices of the same length. According to the analysis of TN behavior above, distance points in each time slices are similar to parts of a parabola curve or a straight line. Generally, the time slices are short and harsh environment leads to packet loss, which causes a small number of distance points. Therefore, there will be over fitting using high-order polynomial. Quadratic polynomial of equation (3) is considered as the aim of CF method in this article. We fit the discrete distance points using the least squares principle. Raw distance points, captured from WuHan metro construction, are illustrated in Figure 4, and it is very hard to find out the TN trajectory. After CF in each time slice, the distance points become a continuous curve and have better performance.

Comparison of real distance curve with raw distance curve and smoothed distance curve.
But there is a limitation for properties of polynomial curve fitting: central region of the curve will have a higher similarity with real data than the edge regions. To address this problem, we use a weighted fusion algorithm in multiple time slices overlapping area. Details are described as follows.
We define a basic time unit
where
where

Time slices overlapping areas
Gaussian elimination method can be used to solve
where
In the
where
This algorithm can smooth the discrete distance points and make them to be continuous curves. Furthermore, it can fill some missed RSSI data caused by packet loss.
DRL algorithm
Distance curve, smoothed by CFOTS, is still not enough to achieve acceptable accuracy by the traditional localization algorithm. To address this problem, instead of trying to calculate the exact location with trilateration, the DRL algorithm estimates a series of approximate locations with the ratio of distances.
According to the description in section “System model,” we deploy the four adjacent BNs as a regular rectangle; in other words, any two adjacent BNs will have the same

A typical plan of deployed BN.
First, we calculate the
where
Now if we assume that
where
Estimating
Requirement (2) of the algorithm makes real coordinates of TN to not exceed the rectangular. From the equation above, we know that
Now we analyze the influence for ignoring
There are four equations in (14), and each of them can calculate an

Coordinate offset distribution from DRL. (a)
KF
The location curve obtained from the DRL algorithm is rough and serrate. KF provides an optimal estimation for a linear state space. KF requires a discretization of the continuous location curve. We make a series of discrete points in every 0.1 s from the curve.
The state of a TN is described as
where
The observation model is defined as
where
The filter smooth location points by a recursion, which operate on 15 and 18. They are written as follows.
First, we make a prediction of time
In time
Second, we calculate the optimal Kalman gain as
Finally, we update the states in time
where
where
Practice results and evaluation
We have applied the algorithm in WuHan metro construction for tracking workers and vehicles. The whole project consists of two parts in localization: (1) one-dimensional localization in the tunnel and (2) two-dimensional localization in the station. We just test the two-dimensional part in this article. TN and BN adopts the TI CC1101 as radio frequency (RF) chip, which work on the frequency of 433 MHz. Figure 8 shows some examples. The real localization environments are illustrated in Figure 9. Figure 9(a) shows part of the construction plan. The BNs are implemented in the station like Figure 9(b), and the distance between neighbor BNs was

Product of TN and BN. (a) PCB of TN, (b) TN, and (c) BN.

Localization environment of metro construction in WuHan ChangQing Station. (a) System working on ChangQing Station in Wuhan metro construction, (b) station construction environment, and (c) tunnel construction environment.
Instead of trying to smooth the noise from raw RSSI data completely like traditional filter, CFOTS start by analyzing target’s movements and trying to obtain an approximation of the distance curve. As a comparison, mean filter (MF) is used to smooth raw data. In Figure 10(a), it can be seen that MF can improve the RSSI to a certain extent. Then we calculate the distance. The results, in Figure 10(b), show that it is worse than the distance curve filtered by CFOTS. Furthermore, after filtering the distance by CFOTS, we also tried to smooth it by MF. But the results improve very little.

MF and CFOTS algorithms are compared in the results of filtering raw RSSI and distance. (a) Raw RSSI data and smoothed by mean filter and (b) smoothed by CFOTS and mean filter.
We extracted some data from the project to test other localization algorithms. As a comparison, traditional maximum likelihood estimation (MLE) localization method based on trilateration is used. MLE is a very popular “geometric mapping”-based algorithm, but it has poor performance based on the data. The main reason is that MLE was trying to calculate coordinates accurately on extreme unreliable RSSI data. As a comparison, Figure 11 shows the results by the MLE and DRL algorithm; in addition, KF is applied in both. Table 1 shows some key performance indicators. Obviously, the accuracy of track from the DRL is far higher than MLE. According to our statistics, the maximum error of DRL was limited to about

The results of three localization in a real subway construction environment.
The comparison of algorithms.
MLE: maximum likelihood estimation; CFOTS: curve fitting in a series of overlapping time slices; KF: Kalman filter; DRL: distance ratio location.
Conclusion
At low power consumption and low cost, a new RSSI-based indoor localization and tracking algorithm has been proposed for harsh environments like construction. Although the popular fingerprints-based algorithm can obtain more accuracy, it cannot be applied for changeable environments. So we exploited geometric mapping-based algorithm by modifying trilateration. CFOTS is used to smooth the discrete distance points to a continuous curve. Moreover, it can make up the lost points from packet loss. DRL algorithm can estimate the continuous location curve. Then, the curve will be filtered by KF. The practice results prove that the accuracy can be limited to
