Abstract
Keywords
Introduction
Benefit from the increasing advances in sensing techniques and computer science, multi-target tracking has found its potential applications in various disciplines, such as autonomous robotics, intelligence surveillance, remote sensing and even biomedical research.1–3 The goal of MTT is to estimate the number of targets together with their individual states from a given sequence of measurements, which is contaminated by a number of uncertain sources. The problem of tracking a group of targets was traditionally resolved by many data association-based methods, mainly including nearest neighbor (NN), 4 the joint integrated probabilistic data association (JIPDA), 5 multiple hypothesis tracking (MHT), 6 and multiple-target particle filter. 7 However, due to its combinatorial nature, theses tracking algorithms could be computationally intensive when the target number grows.
The random finite sets (RFS) theory was first introduced into the MTT by Mahler, 8 where the explicit data association was avoided by representing the target states and noisy measurements as two individual RFS. Consequently, the MTT problem was solved within the Bayesian filtering framework utilizing the RFS formulation. In addition, the classical PHD filter 8 and its several typical extensions, such as multi-Bernoulli filters9,10 and cardinalized PHD,11,12 were proposed to approximate the state posterior by propagating the first-order statistical moment or the intensity of the RFS. However, the recursion of various PHD filters is still intractable in general as it involves the calculation of multiple set integrals. Two closed-form implementations of the PHD filter were proposed, including Gaussian mixture PHD (GM-PHD) filter 13 and particle PHD filter 14 which are based on Gaussian mixture (GM) strategy and sequential Monte Carlo (SMC), respectively. Generally, the GM-PHD filter is more efficient and reliable than particle PHD filter as it obviates the expensive clustering process in the particle-based approach. Besides, the standard Kalman filter (KF) and its nonlinear variations could be easily adopted to estimate the individual mixture components under the Gaussian model assumption.
The statistics of the measurement noise statistics, especially the noise deviation and covariance, may not always be a given premise in some practical applications. In the cases that the actual measurement noise statistics deviate from the assumed ones, those standard Gaussian filters will suffer degraded performance or even divergence. In order to realize the state estimation problem involving unknown noise statistics, several adaptive filters were introduced within the framework of Bayesian estimation. For instance, to improve the estimation accuracy for vehicle navigation with noise uncertainty, the fuzzy interacting multiple model (IMM) is employed in sensor fusion to determine the bounds of the process noise covariance. 15 Zhao et al. 16 presented a general filtering scheme based on the IMM algorithm for nonlinear systems with unknown and random nuisance parameters. Furthermore, the linear minimum mean square error (LMMSE) filter 17 was embedded into the above IMM based filtering scheme to handle ballistic target tracking with unknown ballistic coefficient efficiently. By introducing an equivalent covariance matrix into the estimator, the robustness of the standard KF is enhanced against the uncertainty in the process noise. 18 As an example for MTT, the H infinite optimality criterion is integrated with the GM-PHD filter to handle the noise parameter uncertainties. 19 However, the filter could yield a conservative result since it minimizes the possible worst-case estimation errors by treating the noise statistics as bounded quantities. In addition to the above methods, the variational Bayesian (VB) approximation is introduced into Kalman filtering to reclusively estimate the dynamic state and time-varying variance of the measurement noise for linear Gaussian models. 19 For the nonlinear system model along with time-varying measurement noise covariance, a robust unscented KF using VB approximation is proposed to improve its adaptivity. 20 The VB based method has advantages over other aforementioned methods that it deals with posterior inference analytically at a low computational cost, since the posterior is assumed in the same functional form as the prior based on Bayesian conjugacy. 21 As a result, some adaptive multi-target tracking filters based on the VB approximation has been further addressed, and an inverse-Wishart or a product of inverse-Gamma distribution was defined as a conjugate prior for the measurement noise covariance.22–25 However, the mean of the Gaussian measurement noise was assumed to be known as a zero vector. To deal with the cases that the mean of the noises are also unavailable, Özkan et al. 26 proposed a marginalized particle filter (PF) to update the noise statistics by employing finite dimensional sufficient statistics of Normal-inverse-Wishart distributions for each particle.
When referring to the MTT scenario with unknown mean and covariance of the noise, Li et al. 27 introduced a measurement preprocessing procedure prior to the multi-sensor PHD updating step to cope with the unknown statistical properties of the sensors. The measurement noise bias was handled by a Monte Carlo sampling-based debiasing approach with the multi-sensor network. 28 Here, a new adaptive PHD filter is proposed to recursively estimate both the unknown mean and covariance of the measurement noise in the process of target tracking. In the proposed filter, the multi-target state is represented by a Gaussian distribution, while the statistics of the measurement noise is modeled with a Normal-inverse-Wishart distribution. The measurement equation is reformed with the residual vector, and the rule for updating the hyperparameters of the measurement noise statistics is then derived using two different methods: the VB approximation and the Bayesian conjugate prior heuristics. On this basis, the hyperparameters of the model are determined sequentially and iteratively at each measurement updating step. The effectiveness of the proposed filter is evaluated by two multi-target tracking simulations where both the mean and covariance of the measurement noise are unknown.
The organization of the paper is assigned as follows. In section 2, the model formulation of target tracking with unknown measurement noise statistics is introduced. In section 3, the updating rules for the hyperparameters of the measurement noise statistics are derived, then the details of the proposed filter are presented. Section 4 illustrates the effectiveness of the proposed adaptive PHD filter with simulations. The main conclusions are given in the last section.
Problem formulation
Consider the following discrete-time state-space model for each target
where
Since the exact information about the measurement noise statistics is unavailable, not only the multi-target state but also the measurement noise statistics have to be estimated simultaneously. For this purpose, the measurement residual
For multivariate Gaussian distribution of the random variable
where
To track multiple targets with a time-varying number, RFSs are used to represent both the set of the target states and the measurements. The multi-target state RFS in the state space
where
According to the finite set statistics (FISST) theory, the estimation of multi-target state can be approximated by the PHD filter which is sub-optimal but computation tractable. Since the knowledge of the measurement noise statistics is unknown, the standard PHD filter has to be extended by augmenting the target state with the measurement noise statistics. Assume that at time
where
It is noted that the multi-target state is coupled with the unknown noise statistics in the measurement likelihood
Proposed adaptive PHD filter
Joint estimation of state and measurement noise statistics
According to the definition of the transformed measurement model, the posterior of the measurement mean and covariance is reformed as
where the quantities
Assume that the predicted density of the measurement noise statistic is in the same form as the posterior, i.e.,
Here, the VB approximation 29 is employed to compute the estimate of the posterior density:
where
The solution to this optimization problem is obtained by minimizing KLD with respect to the
Proof: For the proof see appendix A.
In what follows, the measurement residuals are treated as samples to derive the updating rule for the hyperparameters of the measurement noise statistics. The Normal-inverse-Wishart posterior can be calculated from its Bayesian conjugate prior with samples of measurement data as follows 30 :
where
The above general recursive equations are specialized for the update rule of the hyperparameters of the measurement noise heuristically. Considering that the measurements are processed sequentially in the update step of the filter, the sample mean is equal to the current measurement
Note that the above heuristic updating rules are essentially the same as the equations (17) and (19). When the hyperparameters at time step
with center at
where
To make full use of the Bayesian conjugate prior, both the joint prior and predicted densities of
where
In the measurement update step, it is intractable to directly derive an exact value of the joint posterior density because the target state is coupled with the measurement noise statistics. Therefore, an iterative method is used to estimate the target state and measurement noise statistics sequentially. Specifically, the measurement noise statistics are first estimated with the updated hyperparameters, which are computed according to the updating rule presented above:
The updated mean and covariance of the target state are estimated by standard update equations of Gaussian filters with the known measurement noise statistics. By substituting the updated mean of the target state into the measurement model, one can obtain the new measurement residual, which is used to update the hyperparameters of the measurement noise statistics again. These iterative steps are performed multiple times until the result is converged or the preset maximum iteration limit is reached.
Adaptive GM-PHD recursion
In this section, the standard GM-PHD recursion is extended in a sense that not only the target state is estimated, but also the measurement noise statistics are adapted at the same time. Assume that all targets follow the same state-space model defined in section 2, thus the state transition and likelihood functions are both Gaussian distributed, i.e.,
where
where
By combining (7), (8), and (25) together, the proposed adaptive PHD recursions are detailed as follows.
The joint predicted intensity is computed by summing over the two intensities with regard to existing targets and spontaneous birthed targets, that is,
where
The predicted mean and covariance matrix of the target states is estimated by
And the predicted hyperparameters of the Normal-inverse-Wishart distribution are computed in accordance with the equation (21), that is,
Then the joint posterior intensity is computed by
where the detected target intensity for each measurement
The fixed point iteration method is employed to compute the above model parameters since it is of asymptotic optimal convergence properties.
31
For each new measurement
where the superscript
The estimated measurement noise statistics are updated with estimates of the hyperparameters from the previous iteration as
The estimated multi-target state is updated by employing standard Gaussian filters with known measurement noise statistics as
The estimated hyperparameters are updated by
where the measurement residual is computed by
Suppose that the algorithm converges after
where the updated weights of the posterior mixture components are calculated by
Note that if nonlinear functions are used as dynamic and measurement models, then the general Gaussian integrals in equations (29) and (35) can be approximated by utilizing commonly used variants of non-linear Gaussian filters, such as the extended KF, the unscented KF, and the cubature KF.
Algorithm summarization
Additional steps including parameter initialization, mixture reduction, and state extraction have to be implemented when realizing the entire tracking procedure. The number of the mixture components after each PHD recursion will increase exponentially as time progresses. Consequently, similarly as in Granström and Orguner, 32 mixture reduction process is performed to ensure the tractable computational cost. This is done by first removing the components that have weights below a truncation threshold and then merging the components with reasonable Mahalanobis distance from the largest component. The multi-target state is finally extracted from the reduced mixture components that are corresponding to the peaks of the posterior cardinality. In conclusion, the pseudo-code of the proposed filter is summarized in Algorithm 1.
Simulation results
Simulation configuration
In this simulation, the performance of the proposed filter is evaluated in a 2-dimensional scenario, where the number of targets is time-varying. As shown in Figure 1, the targets are observed in a surveillance region with the scale
where the scanning period
where the ground truth of the measurement noise statistics are set as
where the weight

True trajectories of the targets moving with a constant velocity model.
Description of the targets.
Other parameters used in the simulations.
General tracking performance
In Figure 2, the resulting target positions estimated by the AGM-PHD filter in one typical trial are presented. It can be easily seen that almost all the positions of the targets are estimated accurately in both

Position estimates of AGM-PHD filter in

Weighted average of estimated measurement noise means.

Weighted average of estimated measurement noise standard deviations.
Comparisons with benchmark filters
To demonstrate the advantages of the AGM-PHD filter, three different variations of GM-PHD filters are also implemented and evaluated in the simulations. The highlights of these filters are described as follows.
GMTS-PHD: the standard GM-PHD filter with the ground truth of the measurement noise statistics, that is, the means and standard deviations of the measurement noise are chosen as 9.0 and 1.0, respectively.
GMTM-PHD: the standard GM-PHD filter with the ground truth of the measurement noise mean only, while the standard deviations of the measurement noise are chosen as 2.0 to match with the initial standard deviation of AGM-PHD.
GM-VBPHD: the adaptive PHD filter proposed in Wu et al. 22 where only the covariance of the measurement noise is assumed unknown, while the mean of the measurement noise is chosen as 8.0 to allow a reasonable distance from the ground truth.
In all the simulations, the estimated accuracy of the target position is evaluated quantitatively by the optimal sub-pattern Assignment (OSPA).
33
Similar to the root mean square error (RMSE) metric used to represent the differences between two vectors, the OSPA metric can be used to measure the differences between two random finite sets. In each simulation run, two hundred independent Monte Carlo trials are conducted. For the OSPA metric, the cut off factor

Comparison of OSPA distances for different filters.

Comparison of average cardinalities for different filters.
To compare the computational costs of these filters, the average running time of Monte Carlo trail of different filters are listed in Table 3. It is noted that the running time required for each filter are measured relative to that of the GMTS-PHD filter. As can be seen from the Table, the GMTM-PHD filter demands almost the same running time as the GMTS-PHD filter. The two VB based filters take more than 25% longer running time. This is mainly because that an additional iteration step has to be carried out to estimate the unknown model hyperparameters during the measurement update stage. The computational cost of the AGM-PHD filter is slightly higher than the GM-VBPHD filter since it involves more number of hyperparameters that need to be estimated in the AGM-PHD filter.
Comparison of computational costs for different filters.
The impacts of different average clutter rate on the tracking performance are also evaluated. All the filters are carried out under six different levels of clutter rate whose value is taken from the interval [5, 30] with step 5.0. For each level of clutter rate, the average OSPA metric with regard to different filters are plotted in Figure 7. It can be observed that the estimation accuracy concerning all the filters degrades as the level of clutter rate increases. However, the differences of the OSPA distance between the AGM-PHD filter and the GMTS-PHD filter remain below 3.0 at all clutter levels. Meanwhile, the increment of the OSPA distance of the GMTM-PHD filter and GM-VBPHD filter are larger than 12.0, which is nearly 1.5 times of the other two filters. This might due to the reason that more spurious measurements are treated as targets when the nominal statistics are larger than the corresponding true values.

Comparison of average OSPA metric under different average clutter rate.
Evaluation with a constant turn model
To demonstrate the performance of the AGM-PHD filter for tracking targets with variable parameters of motion model, the targets are assumed to move within the same surveillance region following a nearly constant turn (CT) model 34 with known but varying turn rate. The tracking setup of the simulation keeps the same except the dynamic motion model, which is given as follows:
where the transition matrix is given as a function of the turn rate:
the turn rate
In Figure 8, the ground truth trajectory of each target together with its start and end positions are presented. All the targets travel along nearly straight lines at the time intervals [1 s, 40 s] and [60 s, 100 s], while making two converse turns at time 40 s and 50 s. During the turning periods, tracking the targets correctly becomes more difficult as the targets’ trajectories change significantly. Same as in the first simulation, 200 independent Monte Carlo trials are conducted for all four PHD filters. Consequently, the result in Figure 9 shows that more erroneous estimates appear around the turning points. Nevertheless, the proposed adaptive PHD filter still exhibit acceptable tracking performance as a whole. This implies that the AGM-PHD filter gains certain robustness against sudden changes in the control noise of non-linear dynamic models. The performance comparison between the filters is presented in Figures 10 and 11. The OSPA distance rises during the periods when the targets make the wide turns. It is also observed in the results, the AGM-PHD filter achieves smaller OSPA distance and more accurate cardinality estimates than the other two counterparts.

True trajectories of the targets moving with a constant turn model.

Target’s position estimates with a constant turn model.

Comparison of OSPA metric between different filters with constant turn model.

Comparison of average cardinalities between different filters with constant turn model.
Conclusion
In conclusion, the classical PHD filter was devised to improve the adaptivity for MTT applications where the measurement noise statistics are unavailable. During the process of the target state estimation, the measurement noise statistics are also estimated and refined iteratively according to the derived updating rule for the hyperparameters of Normal-inverse-Wishart distribution. The updating rule can be obtained based on the VB approximation or the Bayesian conjugate prior heuristics. Simulation results show the proposed adaptive PHD filter achieves comparable tracking performance with the ideal benchmark filter under different cluttering settings. Moreover, the proposed method has a certain degree of robustness even when the motion trajectories of the targets change abruptly.
