Sage Journals: Discover world-class research

Abstract

Gene networks in biological systems are not only nonlinear but also stochastic due to noise corruption. How to accurately estimate the internal states of the noisy gene networks is an attractive issue to researchers. However, the internal states of biological systems are mostly inaccessible by direct measurement. This paper intends to develop a robust extended Kalman filter for state and parameter estimation of a class of gene network systems with uncertain process noises. Quantitative analysis of the estimation performance is conducted and some representative examples are provided for demonstration.

Keywords

stability robustness stochastic model estimation

Introduction

Biochemical networks^1,2 involving metabolic networks, gene regulatory networks and signal transduction networks in biological systems play important roles in diagnosis of disease such as cancer and auto-immunity. Systems biology^2–4 is quite different from traditional biology; it has been developed recently to understand biological systems from system level. Researchers have been devoted to design and construct of biological models by engineering methods and molecular biology techniques. The microarray technology using high-throughput method to measure a large number of gene expression states has also been attempted in the recent decade. According to the measured data it is possible to reconstruct the structure of biochemical networks, quantitatively analyze and systematically design and simulate the system behavior in silico.

In the literature,^1,5,6 some mathematical models suitable for describing behaviors of the biological systems have been proposed. That kind of models can be classified into two major categories: logical model in the discrete-time domain and differential equation set in the continuous-time domain. Different from the deterministic case, biochemical networks of the real biological systems are generally non-ideal and are invariably noisy. For modeling accuracy, the influence resulting from noise contamination should not be ignored. In general, molecular noises involve intrinsic noises resulting from molecular birth and death and extrinsic noises due to environmental perturbations.⁷ The stochastic model was developed to characterize the biological systems with intrinsic and extrinsic molecular noises. In,⁸ the authors have presented a method for measuring performance robustness and presented two mechanisms to cope with the noise and process uncertainties.

With regard to control of biological systems, nonlinear feedback control strategies were applied to regulate the steady state of biological systems in.^9–11 Other issues received increasing attention for noisy biological systems are development of control strategies while ensuring robust stability and filtering ability. In,^12,13 a robust filtering circuit design has been developed by regulating parameters for gene networks to reduce intrinsic and extrinsic molecular noises. An adaptive control design method was also proposed in.¹⁴

Before performing feedback control, the system states should all be available. However, the internal states of most biological systems can only be observed partially. Under the situation, a state estimator is appropriate to reconstruct the full state in the noisy environment. The Kalman filter (KF)¹⁵ is the one for the purpose in the filed of engineering which has been well applied from system and control to signal processing for decades. However, applications of the extended KF (EKF) in state estimation of biochemical networks are rarely found.¹⁶ Until recently, the EKF has been attempted to estimate parameters of the gene regulatory networks.¹⁷ Moreover, a state observer was actually established using the EKF, based on the fluorescence probe model, a dynamic state model of the plant cell bioreactor, and online green fluorescent protein fluorescence measurement.¹⁸

While there were a few papers dealing with the issue of state estimation for biological networks, most of the approaches were based on the traditional Kalman filtering theory which assumed that the noise covariances including the process noise and measurement noise have been precisely known as a priori.^16–18 The KF is shown to be the optimal state estimator against noise with Gaussian distributions. However, in the biological systems, the noise distribution may not be Gaussian, its autocorrelations may not be known exactly, or even uneasily to be precisely modeled.¹⁹ In this paper, an EKF for robustly filtering the states and parameters of the noisy gene networks is introduced. Quantitative error analysis for that kind of systems is presented in details. On the basis of the results obtained one would be able to identify effect of the filtering gain and the sizes of noise uncertainties to estimation performance. Two numerical examples have been conducted to verify the proposed design.

To clarify the notation, throughout this paper, let the vector norm of the real vector x ∊ℝⁿ, denoted by ∥x∥, be defined as $‖ x ‖ = \sqrt{E (x^{T} x)}$ with E(·) denoting the operation of expectation.

Lemma 1: The induced matrix norm ∥A∥ corresponding to the vector norm of $‖ x ‖ = \sqrt{E (x^{T} x)}$ with x ∊ ℝⁿ is given by

‖ A ‖ = σ_{1}

where σ₁ is the maximum singular value of A, ie, $‖ A ‖ = \sqrt{λ_{\max} (A^{T} A)}$ .

Proof: By definition, the induced norm of the operator A is defined by

‖ A ‖ = \sup_{‖ x ‖ = 1} ‖ A x ‖

Let the singular value decomposition of A is given by A = UΣV^T where Σ = diag(σ₁, σ₂, σ_n) with σ₁ ≥ σ₂ ≥ … ≥ 0, the unitary matrices U = [u₁ … u_n] and V = [ν₁ … ν_n]. Then

\begin{matrix} ‖ A x ‖ = \sqrt{E (x^{T} A^{T} A x)} \\ = \sqrt{E (x^{T} \sum_{i = 1}^{n} σ_{i}^{2} {(u_{i} v_{i}^{T})}^{T} u_{i} v_{i}^{T} x)} \\ = \sqrt{E (x^{T} \sum_{i = 1}^{n} σ_{i}^{2} v_{i}^{T} v_{i} x)} \\ \leq \sqrt{E (σ_{1}^{2} x^{T} \sum_{i = 1}^{n} v_{i}^{T} v_{i} x)} \\ = \sqrt{σ_{1}^{2} E (x^{T} x)} \\ = σ_{1} ‖ x ‖ \end{matrix}

That is

\sup_{‖ x ‖ = 1} ‖ A x ‖ \leq σ_{1}

(1)

Alternately, if one chooses x′ = e₁ = [1 0 … 0]^T which obviously satisfies ∥x′∥ = 1, then

\sup_{‖ x ‖ = 1, \forall x \in ℝ^{n}} ‖ A x ‖ \geq σ_{1} ‖ A x' ‖ = \sqrt{E (σ_{1}^{2} x'^{T} x')} = σ_{1} \sqrt{E (x'^{T} x')} = σ_{1}

(2)

From (1) and (2), it can be concluded that ∥A∥ = σ₁.

Problem Formulation

In real biological systems, biochemical networks such as signal transduction networks, gene regulatory networks, and metabolic networks are invariably noisy. As in engineering area of research, the system dynamic behavior can be mathematically described by stochastic models, which could be further used as a basis for the purpose of analysis and control.

System Modeling

S-system is a type of power-law formalism and is based on a particular type of ordinary differential equations in which the component processes are characterized by power-law functions:^1,5,6 ${\dot{x}}_{i} = α_{i} \prod_{j = 1}^{\tilde{n} + \tilde{m}} x_{j}^{g_{i j}} - β_{i} \prod_{j = 1}^{\tilde{n} + \tilde{m}} x_{j}^{h_{i j}}, i = 1, 2, \dots, \tilde{n}$ (3) where x_j (j = 1, …,ñ) are dependent variables such as intermediate metabolites and products, x_j $(j = \tilde{n} + 1, \dots, \tilde{n} + \tilde{m})$ are independent variables such as substrates and enzymes, α_i ≥ 0 and β_i ≥ 0 are rate constants which denote, respectively, production and degradation effects for dependent and independent variables; g_ij and h_ij are kinetic orders. Gene j will active gene i when the values of kinetic orders are positive and gene j will inhibitive gene i when the values of kinetic orders are negative. Zero values of kinetic orders represent gene j won't affect gene i.

The system model can be expressed in the following generalized nonlinear biochemical dynamics with the stoichiometric equation described by

\dot{x} (t) = S V (x (t), p)

(4)

where x(t) ∊ℝ_n is a state vector which denotes the concentration of metabolite, mRNA or protein, p = p(α_i, β_i,g_ij, h_ij)∊ ℝ^mis the parameter vector which include rate constants and kinetic orders, S denotes the stoichiometric matrix, and V(·) is a nonlinear function of the reaction rate. It can be expressed in a more general form as

\dot{x} (t) = f (x (t), p)

(5)

where f(·) ∊ ℝⁿ is a generalized nonlinear function vector.

For biochemical reactions, suppose that there are M intrinsic noise sources and an extrinsic noise, the nonlinear stochastic dynamical system can be described by¹³

\dot{x} (t) = f (x (t), p) + \sum_{i = 1}^{M} g_{i} (x (t)) n_{i} (t) + w (t)

(6)

where x(t) ∊ ℝⁿ is the state which may denote a vector of protein concentrations of n genes, f(·) and g(·) are nonlinear function vectors, n_i(t) is an intrinsic noise source which is the white noise with zero mean and standard deviation σ_i, w(t) is an extrinsic noise source with zero mean and standard deviation σ_w Equation (6) implies that the system is suffered from intrinsic noise corruption due to M kinetic parameter fluctuations. In gene regulatory networks, the external and internal noises occur independently and they appear randomly.¹⁹ More precisely, it has been proposed that the pattern of protein concentration growth is stochastic, exhibiting short bursts of variable numbers of proteins at varying time intervals.

Remark: The system with intrinsic noises in (6) can also be rewritten as the following Ito stochastic equation:

d x (t) = f (x (t), p) d t + \sum_{i = 1}^{M} g_{i} (x (t), p) d N_{i} (t) + d W (t)

(7)

where N_i and W are standard Wiener processes or Brownian motions with dN_i(t) = n_i(t)dt and dW(t) = w(t)dt with the property $E [| N_{i} (t) - N_{i} (τ) |^{2}] = σ_{i}^{2} | t - τ |$ and $E [| W (t) - W (τ) |^{2}] = σ_{w}^{2} | t - τ |$ . The formulation is widely applicable to the general nonlinear gene network with n genes.

After the stochastic differential system in (6) or (7) is modeled to mimic the realistic behaviors of the object, some design specifications can be proposed for robust design of the system so that the desired behaviors or products can be achieved in spite of intrinsic parameter fluctuations and environmental disturbances.

Estimator Design

In practice, the internal states of biological systems may not be directly accessible. Biochemical process for gene regulatory networks is DNA to mRNA by transcription, mRNA to protein by translation and the generated protein regulates the gene. Not all protein concentrations are directly measurable,¹⁸ however, one could access the status of individual protein by utilizing an appropriate state estimator. It is proposed here an EKF to estimate the internal states as well as the parameters of concern.

Fundamental Kalman Filter

The KF uses measurements for a dynamic system observed over time that contain noise, and produce estimated values that tend to be closer to the true values of the measurements.

A stochastic linear time-invariant system with measurement can be described by

\dot{x} (t) = A x (t) + w_{x} (t)

(8)

and

y (t) = C x (t) + v (t)

(9)

where x(t) is the state vector, y(t) is the output, A and C are, respectively, system and output matrices, and w_x(t) and ν(t) are, respectively, process and measurement noises which are zero mean Gaussian noises with covariance matrices Q and R, respectively.

With regard to the system (8)−(9), which x(t) is to be estimated, the corresponding KF is given by¹⁵

\dot{\hat{x}} (t) = A \hat{x} (t) + K [y (t) - C \hat{x} (t)]

(10)

where $\hat{x} (t)$ is the estimated state and K is the optimal Kalman gain given by

K = P C^{T} R^{- 1}

(11)

where P = P^T > 0 satisfies the following Riccati matrix equation:

\begin{matrix} \dot{P} = A P + P A^{T} + Q - P C^{T} R^{- 1} C P, \\ P_{0} = E ({\tilde{x}}_{0} {\tilde{x}}_{0}^{T}) \end{matrix}

(12)

with the estimation error state $\tilde{x} (t) = x (t) - \hat{x} (t)$ .

Extended Kalman Filter

For estimation of the stochastic nonlinear systems, the commonly applied filtering mechanism is EKF which evaluates the partial derivatives at the estimated state value and uses nonlinear functions on the estimate itself.

Suppose that the nonlinear system with measurement is given by

\dot{x} (t) = f (x (t)) + w_{x} (t)

(13)

and

y (t) = h (x (t)) + v (t)

(14)

where f(·)and h(·) are nonlinear function vectors with appropriate dimensions.

The EKF is given by¹⁵

\dot{\hat{x}} (t) = f (\hat{x} (t)) + K (t) [y (t) - h (\hat{x} (t))]

(15)

where the Kalman gain is

K (t) = P H^{T} R^{- 1}

(16)

with P = P^T > 0 satisfying

\begin{matrix} \dot{P} = F P + P F^{T} + Q - P H^{T} R^{- 1} H P, \\ P_{0} = E ({\tilde{x}}_{0} {\tilde{x}}_{0}^{T}) \end{matrix}

(17)

with

{F = \frac{\partial f (x (t))}{\partial x (t)} |}_{x (t) = \hat{x} (t)}, {H = \frac{\partial h (x (t))}{\partial x (t)} |}_{x (t) = \hat{x} (t)}

Estimating states and parameters by EKF

In more general, we consider a generalized representation of the stochastic nonlinear dynamical model described by

\begin{matrix} \dot{x} (t) = f (x (t), p (t)) + w_{x} (t), x_{0} = E [x (0)] \\ \dot{p} (t) = w_{p} (t), p_{0} = E [p (0)] \\ y (t) = h (x (t), p (t)) + v (t) \end{matrix}

(18)

where P_x0 = E[(x(0) − x₀)(x(0) − x₀)T], P_p0 = E[(p(0) − p₀) (p(0) − p₀)T], p(t) ℝ^m denotes the aggregated parameter vector, y(t) ∊ ℝ^r is the measurement output, f(·) and h(·) are nonlinear function vectors. Suppose that the uncorrelated extrinsic noise w_x(t), parameter noise w_p(t) and measurement noise ν(t) are white noises, uncorrelated and satisfy the following properties:

\begin{array}{l} E [w_{x} (t)] = E [w_{p} (t)] = E [v (t)] = 0, \\ E [w_{x} (t) w_{x}^{T} (τ)] = Q δ (t - τ), \\ E [w_{p} (t) w_{p}^{T} (τ)] = Q_{p} δ (t - τ), \\ E [v (t) v^{T} (τ)] = R δ (t - τ) \end{array}

(19)

where the noise uncertainties satisfy

‖ Q - Q_{0} ‖ \leq ε_{1}, ‖ R - R_{0} ‖ \leq ε_{2}, ‖ Q_{p} - Q_{p 0} ‖ \leq ε_{3}

(20)

where $Q = Q^{T} > 0, Q_{P} = Q_{P}^{T} > 0$ and R = R^T > 0 with Q₀, Q_p0 and R₀ being their corresponding nominal parts, and ε₁, ε₃ and ε₂ are finite constants characterizing the respective upper bound of the noise covariance.

For compactness, the matrix format of (18) can next be written in the state-space representation as follows

[\begin{matrix} \dot{x} (t) \\ \dot{p} (t) \end{matrix}] = [\begin{matrix} f (x (t), p (t)) \\ 0_{m \times 1} \end{matrix}] + η (t), y (t) = h (x (t), p (t)) + v (t)

(21)

where $η (t) = {[\begin{matrix} w_{x}^{T} (t) & w_{p}^{T} (t) \end{matrix}]}^{T}$ . The idea of the EKF is that it operates by propagating the mean and error covariance of the state through time. The EKF for (18) in the matrix form can be represented as

[\begin{matrix} \dot{\hat{x}} (t) \\ \dot{\hat{p}} (t) \end{matrix}] = [\begin{matrix} f (\hat{x} (t), \hat{p} (t)) \\ 0_{m \times 1} \end{matrix}] + K [y (t) - \hat{y} (t)], \hat{y} (t) = h (\hat{x} (t), \hat{p} (t))

(22)

where $\hat{x} (t)$ is the estimated state, $\hat{p}$ is the estimated parameter, ŷ(t) is the estimator output, and K is the estimator gain.

Equations (21) and (22) can be further written as

\begin{matrix} \dot{z} (t) = g (z (t)) + η (t), z_{0} = {[x^{T} (0) p^{T} (0)]}^{T} \\ y (t) = h (z (t)) + v (t) \end{matrix}

(23)

and the state estimate equation

\begin{matrix} \dot{\hat{z}} (t) = g (\hat{z} (t)) + K [y (t) - \hat{y} (t)], \\ \hat{y} (t) = h (\hat{z} (t)) \end{matrix}

(24)

where

z (t) = {[x^{T} (t) p^{T} (t)]}^{T}, \hat{z} (t) = {[{\hat{x}}^{T} (t) {\hat{p}}^{T} (t)]}^{T}, g (z (t)) = [\begin{matrix} f (x (t), p (t)) \\ 0_{m \times 1} \end{matrix}], g (\hat{z} (t)) = [\begin{matrix} f (\hat{x} (t), \hat{p} (t)) \\ 0_{m \times 1} \end{matrix}]

(25)

The maximal covariances of the extrinsic, parameter and measurement noises can be specified by Q₀ + ε₁I_n, Q_p0 + ε₃I_m and R₀ + ε₂I respectively Define the estimation error vector $\tilde{z} (t) = z (t) - \hat{z} (t)$ and the error covariance matrix $P (t) = E [\tilde{z} (t) {\tilde{z}}^{T} (t)]$ . It can be proved that the error covariance matrix P and Kalman gain K satisfy, respectively,²⁰

\dot{P} = G P + P G^{T} + L - P H^{T} {(R_{0} + ε_{2} I_{n + m})}^{- 1} H P, P_{0} = E ({\tilde{z}}_{0} {\tilde{z}}_{0}^{T})

(26)

K = P H^{T} {(R_{0} + ε_{2} I_{n + m})}^{- 1}

(27)

where G and H are, respectively, the linearized system and output matrices given by

\begin{matrix} G = {\frac{\partial g (x (t), p (t))}{\partial z (t)} |}_{x (t) = \hat{x} (t), p (t) = \hat{p} (t)} = {[\begin{matrix} \frac{\partial f}{\partial x^{T}} & \frac{\partial f}{\partial p^{T}} \\ 0 & 0 \end{matrix}] |}_{x (t) = \hat{x} (t), p (t) = \hat{p} (t)} \\ H = {\frac{\partial h (x (t), p (t))}{\partial z (t)} |}_{x (t) = \hat{x} (t), p (t) = \hat{p} (t)} = {[\begin{matrix} \frac{\partial h}{\partial x^{T}} & \frac{\partial h}{\partial p^{T}} \end{matrix}] |}_{x (t) = \hat{x} (t), p (t) = \hat{p} (t)} \end{matrix}

and

L = [\begin{matrix} Q_{0} + ε^{1} I_{m} & 0_{m \times n} \\ 0_{m \times n} & Q_{p 0} + ε_{3} I_{m} \end{matrix}]

The linearization is performed around the estimated state $\hat{x} (t)$ and parameter $\hat{p} (t)$ respectively.

As it can be observed from (26) that the magnitude of the Kalman gain K is closely related to the amount of measurement noise reflected by the size of R₀ and the extent of the uncertain noise covariance specified by ε₂. The term L accounts for the increase of extrinsic noise and parameter noise and the term −PH^T(R₀ + ε₂I_n+m)^–1 HP reflects the decrease of uncertainty as a result of measurement.

When there is in the absence of extrinsic noise and a priori information in the biological system, by using the following matrix identity

\frac{d}{d t} P^{- 1} = - P^{- 1} \dot{P} P^{- 1}

the continuous Riccati matrix equation (26) can be written in the linear equation in P⁻¹ as

{\dot{P}}^{- 1} = - P^{- 1} G - G^{T} P^{- 1} + H^{T} {(R_{0} + ε_{2} I_{n + m})}^{- 1} H, P^{- 1} (0) = P_{0}^{- 1}

(28)

Clearly, either large measurement error (large R₀) or large uncertainty of the measurement covariance (large ε₂) cause the error covariance P to increase considerably whenever a measurement is utilized. This also results in a smaller Kalman gain for small state estimate errors and ease off the updating speed of state.

Performance Analysis

It follows from (23) and (24) that the estimation error dynamics is given by

\begin{matrix} \dot{\tilde{z}} (t) = g (z (t)) + η (t) - g (\hat{z} (t)) - K h (z (t)) - K v (t) + K h (\hat{z} (t)) \\ = (G - K H) \tilde{z} (t) + Δ G (z, \hat{z}) - K Δ H (z, \hat{z}) + η (t) - K v (t) \end{matrix}

(29)

where ΔG(z,ẑ) = Δg(ẑ) − Δg(z) and ΔH(z,ẑ) = Δh(ẑ) − Δh(z) with Δg(·) = g(·) − G and Δh(·) = h(·) − H. It is reasonable to have

\begin{matrix} ‖ Δ G (z, \hat{z}) ‖ \leq ρ_{1} ‖ \tilde{z} (t) ‖, \forall \tilde{z} (t) \in ℝ^{n + m}, t \geq 0 \\ ‖ Δ H (z, \hat{z}) ‖ \leq ρ_{2} ‖ \tilde{z} (t) ‖, \forall \tilde{z} (t) \in ℝ^{n + m}, t \geq 0 \end{matrix}

where ρ₁ and ρ₂ are finite positive constants and

\begin{array}{l} ‖ η (t) ‖ \leq ‖ w_{x} (t) ‖ + ‖ w_{p} (t) ‖ \leq \sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})}, \forall t \geq 0, \\ ‖ v (t) ‖ \leq \sqrt{r ε_{2} + tr (R_{0})}, \forall t \geq 0, \end{array}

(30)

with tr(·) denoting the trace operation.

It can be observed from the estimation error dynamics (29) that its solution is

\tilde{z} (t) = Φ (t) {\tilde{z}}_{0} + \int_{0}^{t} Φ (t - τ) [Δ G (z (τ), \hat{z} (τ)) - K Δ H (z (τ), \hat{z} (τ))] d τ + \int_{0}^{t} Φ (t - τ) [η (τ) - K v (τ)] d τ

(31)

where Φ(t) = e^(G−KH)t. Or equivalently,

\tilde{z} (t) = Φ (t) {\tilde{z}}_{0} + \int_{0}^{t} Φ (t - τ) [I - K] [\begin{matrix} Δ G (z (τ), \hat{z} (τ)) \\ Δ H (z (τ), \hat{z} (τ)) \end{matrix}] d τ + \int_{0}^{t} Φ (t - τ) [I - K] υ (τ) d τ

where $υ (t) = [\begin{matrix} η (t) \\ v (t) \end{matrix}]$ .

On the basis of the Kalman filtering theory, the matrix G − KH will be asymptotically stable if (G, H) is detectable around all estimated states ǩ(t). Therefore, there exist positive constants m, m₁ and β such that

‖ Φ (t) ‖ \leq m e^{- β t}, \forall t

(32)

and

‖ Φ (t) [I - K] ‖ \leq m_{1} e^{- β t}, \forall t

(33)

with the induced norms specified by Lemma 1, where β can be chosen to be $\min_{i} | Re λ_{i} (G - K H) |$ where

K = K(∊_1,∊_1,∊₁).

To proceed, by taking norms on both sides of (31) gives

‖ \tilde{z} (t) ‖ \leq ‖ Φ (t) ‖ ‖ {\tilde{z}}_{0} ‖ + (ρ_{1} + ρ_{2}) \int_{0}^{t} ‖ Φ (t - τ) [I - K] ‖ ‖ \tilde{z} (τ) ‖ d τ + \int_{0}^{t} ‖ Φ (t - τ) [I - K] ‖ (‖ η (τ) ‖ + ‖ ν (t) ‖) d τ

(34)

Or equivalently

\begin{array}{l} ‖ \tilde{z} (t) ‖ e^{β t} & \leq m ‖ {\tilde{z}}_{0} ‖ + m_{1} (ρ_{1} + ρ_{2}) \int_{0}^{t} ‖ \tilde{z} (τ) ‖ e^{β τ} d τ + m_{1} \int_{0}^{t} ‖ υ (τ) ‖ e^{β τ} d τ \\ \leq m ‖ {\tilde{z}}_{0} ‖ + m_{1} (ρ_{1} + ρ_{2}) \int_{0}^{t} ‖ \tilde{z} (τ) ‖ e^{β τ} d τ \\ + m_{1} \int_{0}^{t} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})}) e^{β τ} d τ \\ = m ‖ {\tilde{z}}_{0} ‖ - \frac{m_{1}}{β} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})}) + γ (t) \\ + m_{1} (ρ_{1} + ρ_{2}) \int_{0}^{t} ‖ \tilde{z} (τ) ‖ e^{β τ} d τ \end{array}

where

γ (t) = \frac{m_{1}}{β} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})}) e^{β τ}

Applying the Bellman-Gronwall inequality further yields

‖ \tilde{z} (t) ‖ e^{β t} \leq [m ‖ {\tilde{z}}_{0} ‖ - \frac{m_{1}}{β} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})})] e^{m_{1} (ρ_{1} + ρ_{2}) t} + γ (t) + m_{1} (ρ_{1} + ρ_{2}) \int_{0}^{t} γ (τ) e^{m_{1} (ρ_{1} + ρ_{2}) (t - τ)} d τ

‖ \tilde{z} (t) ‖ \leq [m ‖ {\tilde{z}}_{0} ‖ - \frac{m_{1}}{β} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})})] e^{- [β - m_{1} (ρ_{1} + ρ_{2})] t} + γ (t) e^{- β t} + m_{1} (ρ_{1} + ρ_{2}) e^{- [β - m_{1} (ρ_{1} + ρ_{2})] t} \int_{0}^{t} γ (τ) e^{m_{1} (ρ_{1} + ρ_{2}) (t - τ)} d τ

Clearly, when

β > m_{1} (ρ_{1} + ρ_{2})

(35)

then

‖ \tilde{z} (t) ‖ \leq \frac{m_{1}}{β} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})} + + m_{1} (ρ_{1} + ρ_{2}) \frac{m_{1}}{β} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})}) e^{- [β - m_{1} (ρ_{1} + ρ_{2})] t}

\int_{0}^{t} e^{[β - m_{1} (ρ_{1} + ρ_{2})] τ} d τ = \frac{m_{1}}{β} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})}) + m_{1} (ρ_{1} + ρ_{2}) \frac{m_{1}}{β} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})}) \frac{1}{β - m_{1} (ρ_{1} + ρ_{2})}

That implies that $‖ \tilde{z} (t) ‖$ wouldn't be diverged with its upper bound specified by

\sup_{t} ‖ \tilde{z} (t) ‖ \leq \frac{m_{1} (\sqrt{n ε_{1} + tr (Q_{0})} + \sqrt{m ε_{3} + tr (Q_{p 0})} + \sqrt{r ε_{2} + tr (R_{0})})}{β - m_{1} (ρ_{1} + ρ_{2})}

(36)

Equations (32) and (35) specifies a crucial condition for determining convergence of the noisily corrupted system which is determined by the Kalman gain K and the values of ρ₁ and ρ₂. The condition is significantly affected by the amount of linearization and noise covariances. Inspection of the equations describing the behavior of the error covariance matrix reveals several observations which confirm engineers' intuition about the operation of the KF. As it can be observed from (36) that the estimation error is closely related to the upper bounds of the unreduced certain process and parameter noises and uncertain measurement noise. The larger the statistical parameters of the disturbances as reflected in the sizes of Q and Q _p , and the more pronounced effect of the disturbances as reflected in the size of R, the more rapidly the error covariance increases.

Larger Kalman gains will expedite the convergence of the estimation error. However, the estimation error increases considerably whenever there are larger noise uncertainties specified by larger ∊ _i, i = 1, 2, 3 and the linearization errors characterized by larger ρ₁ and ρ₂.

When there are process and measurement noises and uncertainties of noise covariances, from the observation of (26), (28) and (36), one should increase the magnitude of K for a larger stability margin β so as to assure convergence of the estimation error. However, as it was shown in (28), large measurement errors and the small error convariance P result in a small K. Thus, there is always a compromise between the optimal state estimation and stability robustness while designing the state estimator.

As for the implementation issue, possibility of the practical implementation of the estimator can be referred, for example, to,¹⁸ which utilized the green fluorescent protein (GFP) as a reporter for real-time bioprocess sensing and GFP concentration and other important states in bioreactor culture of transgenic tobacco cells were successfully estimated. Application of the idea to the current estimator design deserves more attention and is worthy of further investigation.

Simulation Study

For demonstration, two examples for a class of noisy gene regulatory networks are illustrated.

Example 1 : Consider first a two-order system model for a real gene regulatory network given as follows:⁶

\begin{array}{l} [\begin{matrix} {\dot{x}}_{1} (t) \\ {\dot{x}}_{2} (t) \end{matrix}] = [\begin{matrix} α_{1} x_{1}^{g_{11}} (t) x_{2}^{g_{12}} (t) - β_{1} x_{1}^{h_{11}} (t) x_{2}^{h_{12}} (t) \\ α_{2} x_{1}^{g_{21}} (t) x_{2}^{g_{22}} (t) - β_{2} x_{1}^{h_{21}} (t) x_{2}^{h_{22}} (t) \end{matrix}] + w_{x} (t), \\ {[\begin{matrix} x_{10} & x_{20} \end{matrix}]}^{T} = {[\begin{matrix} 1 & 1.5 \end{matrix}]}^{T}, \\ α_{i} = β_{i} = 1, i - 1, 2, \\ g_{11} = 0.268, g_{12} = - 2.26, g_{21} = 2.739, g_{22} = 0.155, \\ h_{11} = 0.465, h_{12} = 0.359, h_{21} = 0.197, h_{22} = 0.281 \end{array}

and

y = x_{2} + ν (t)

where w_x(t) ~ (0, 0.1), w_p(t) ~ (0, 0.1) and ν(t) ~ (0, 0.2). States and parameters are both estimated. The parameters including 4 rate constants and 8 kinetic orders are treated as the states. Thus the state variables are extended from 2 to 14. In this network, for the first term of the first differential equation (the rate of change of x₁), ie, X^0.268₁ (t)x₂^−2.26 (t) with unit rate constant α₁, shows accumulation of gene product 1. Since the variable x₂ is raised to the power of the kinetic parameter −2.26 which reveals gene 2 will inhibit product of gene 1. On the other hand, for the second differential equation, the first term x₁^2.739(t) x₂^0.155(t) with unit rate constant α₂ reflects accumulation of gene product 2. Since the variable x₁ is raised to the power of 2.739 which reveals that gene product 1 will activate gene 2. The second terms −x₁^0.469 (t)x₂^0.359 (t) and −x₁^0.197 (t)x₂^0.281 (t) with unit rate constants β₁ and β₂ in the first and second differential equations reflecting degradation effect on gene products 1 and 2 respectively. Figure 1 illustrates the branch pathway of the two-dimensional S-system network.

The linearized system matrix ∂f/∂x^T based on the state estimation is

{\frac{\partial f}{\partial x^{T}} |}_{x = \hat{x}} = [\begin{matrix} 0.268 {\hat{x}}_{1}^{- 0.732} {\hat{x}}_{2}^{- 2.26} - 0.469 {\hat{x}}_{1}^{- 0.531} {\hat{x}}_{2}^{0.359} & - 2.26 {\hat{x}}_{1}^{- 0.268} {\hat{x}}_{2}^{- 3.26} - 0.359 {\hat{x}}_{1}^{0.46} {\hat{x}}_{2}^{- 0.641} \\ 0.739 {\hat{x}}_{1}^{1.739} {\hat{x}}_{2}^{0.155} - 0.197 {\hat{x}}_{1}^{- 0.803} {\hat{x}}_{2}^{0.281} & 0.155 {\hat{x}}_{1}^{2.739} {\hat{x}}_{2}^{- 0.845} - 0.281 {\hat{x}}_{1}^{0.197} {\hat{x}}_{2}^{- 0.719} \end{matrix}],

and

H = [\begin{matrix} 0 & 1 & 0_{1 \times 12} \end{matrix}]

Figure 1.

The gene regulatory network for Example 1.

For the initial error covariance P(0) = I₁₄, the results of dynamic simulation of the noise-free and estimated states and parameters for the noisy gene regulatory network using the proposed robust EKF given by (26) and (27) are shown as in Figure 2.

Figure 2.

Dynamic simulation of the noise-free and estimated gene states and parameters with P(0) = I₁₄; A) gene states; y-axis is concentration, x-axis is time,B–D) parameters.

Consider next the existence of uncertainties of the extrinsic noise and measurement noise with ∊₁ = 0.05, ∊₃ = 0.05 and ∊₂ = 0.1. The state and parameter responses are shown in Figure 3. The root mean square error (RMSE) was used to quantify the filtering performance with

R M S E = \sqrt{\frac{1}{T_{2} - T_{1}} \int_{T_{1}}^{T_{2}} {({\hat{x}}_{i} (t) - x_{f i} (t))}^{2} d t}

where T₁ ≤ t ≤ T₂, ${\hat{x}}_{i} (t)$ (t) is the i-th estimation state and x_fi(t) is the corresponding noise-free state. The RMSE values for the noise-free and estimated states and parameters in 0 ≤ t ≤ 5 are listed in Table 1 which shows that the estimator is able to filter the extrinsic and measurement noises to retrieve the real state and parameter values.

Figure 3.

Dynamic simulation of the noise-free and estimated gene states and parameters with noise uncertainties (∊ = 0.05, ∊₃ = 0.05 and ∊₂ = 0.1);A) gene states; y-axis is concentration, x-axis is time,B–D) parameters.

Table 1.

Comparison of RMSE values for the system of Example 1 with and without noise uncertainties.

State	with noise uncertainties (∊₁ = ∊₃ = 0.05, ∊₂ = 0.1)	without noise uncertainties
1	0.065	0.043
2	0.053	0.035
3	0.04	0.028
4	0.03	0.017
5	0.032	0.021
6	0.034	0.021
7	0.018	0.01
8	0.022	0.014
9	0.04	0.027
10	0.032	0.021
11	0.032	0.016
12	0.026	0.017
13	0.009	0.007
14	0.05	0.03

Example 2 : Consider a nonlinear gene regulatory of four genes described and shown in Figure 4:¹³

\begin{array}{l} [\begin{matrix} {\dot{x}}_{1} (t) \\ {\dot{x}}_{2} (t) \\ {\dot{x}}_{3} (t) \\ {\dot{x}}_{4} (t) \end{matrix}] = [\begin{matrix} α_{1} x_{2}^{g_{12}} (t) - β_{1} x_{1}^{h_{11}} (t) \\ α_{2} x_{3}^{g_{23}} (t) x_{4}^{g_{24}} (t) - β_{2} x_{2}^{h_{22}} (t) \\ α_{3} x_{1}^{g_{31}} (t) x_{2}^{g_{32}} (t) - β_{3} x_{3}^{h_{33}} (t) \\ α_{4} x_{3}^{g_{43}} (t) - β_{4} x_{4}^{h_{44}} (t) \end{matrix}] + w_{x} (t), \\ {[\begin{matrix} x_{10} & x_{20} & x_{30} & x_{40} \end{matrix}]}^{T} = 0.5 {[\begin{matrix} 1 & 1 & 1 & 1 \end{matrix}]}^{T} \\ α_{1} = 1, α_{2} = 2, α_{3} = 1.5, α_{4} = 3.5, β_{1} = 1, β_{2} = 2, β_{3} = 1.5, β_{4} = 3.5, \\ g_{12} = 1, g_{23} = - 0.5, g_{24} = 2, g_{31} = 4, g_{32} = 1, g_{43} = - 0.5, \\ h_{11} = 1, h_{22} = 1, h_{33} = 2, h_{44} = 2 \end{array}

where w_x ~ (0, 0.2) and w_p ~ (0, 0.2). The state variables are extended from 4 to 22 when 18 parameters including rate constants and kinetic orders are all treated as the state variables. The measurement model is given as

y (t) = x_{2} (t) + x_{3} (t) + ν (t)

where y(t) is the measurement output and the measurement noise ν(t) ~ N(0, 0.5). For this gene regulatory network, gene product 1 activates gene 3, gene product 2 activates genes 1 and 3, gene product 3 represses genes 2 and 4, and gene product 4 activates gene 2.

Figure 4.

Another example of the gene regulatory network.

The linearized system matrix ∂f/∂x^T based on the state estimation for the EKF design can be obtained as

{\frac{\partial f}{\partial x^{T}} |}_{x = \hat{x}} = [\begin{matrix} - 1 & 1 & 0 & 0 \\ 0 & - 2 & - {\hat{x}}_{3}^{- 1.5} (t) {\hat{x}}_{4}^{2} (t) & 4 {\hat{x}}_{3}^{- 0.5} (t) {\hat{x}}_{4} (t) \\ 6 {\hat{x}}_{1}^{3} (t) {\hat{x}}_{2} (t) & 1.5 {\hat{x}}_{14}^{4} (t) & - 3 {\hat{x}}_{3} (t) & 0 \\ 0 & 0 & - 17.5 {\hat{x}}_{3}^{- 1.5} (t) & - 7 {\hat{x}}_{4} (t) \end{matrix}],

and

H = [\begin{matrix} 0 & 1 & 1 & 0_{1 \times 19} \end{matrix}]

For the initial covariance matrix P(0) = I₂₂, the results of dynamic simulation of the noise-free gene states and the estimated states of the noisy gene network are shown as in Figure 5. As it can be seen that the estimator tracked the noise-free case well while there were extrinsic noise and measurement noise.

Figure 5.

Dynamic simulation of the noise-free and estimated gene states and parameters with P(0) = I₂₂; A) gene states, y-concentration, x-axis is time,B–E) parameters.

Consider next the existence of uncertainties of the extrinsic noise and measurement noise with ∊₁ = 0.1, ∊₃ = 0.1 and ∊₂ = 0.25. The gene responses are shown in Figure 6. As in the previous example, the results exhibits larger estimation errors due to added noise uncertainties, however, the deviation is not significant while compared with magnitudes of the nominal state or parameter responses.

Figure 6.

Dynamic simulation of the noise-free and estimated gene states and parameters with noise uncertainties (∊₁ = 0.1, ∊₃ = 0.1 and ∊₂ = 0.25);A) gene states, y-axis is concentration, x-axis is time,B–E) parameters.

Conclusions

This paper proposes a continuous EKF to estimate internal states and parameters of a class of gene networks while there are extrinsic and intrinsic noises and parametric fluctuations. Quantitative performance analysis for state estimation of the EKF is presented. Numerical simulations have confirmed possibility of the proposed method in designing robust EKFs. This shows potential of the presented design method in bridging the engineering approach to solve for the estimation problem in biological systems.

Footnotes

Acknowledgement

This research was sponsored by National Science Council,Taiwan,ROC under the Grant NSC-98-2221-E-005–087.MY3 and partly sponsored by Chung-Shan Institute of Science and Technology,Taiwan,ROC under the Grant XB99086.

Disclosures

This manuscript has been read and approved by all authors. This paper is unique and is not under consideration by any other publication and has not been published elsewhere. The authors and peer reviewers of this paper report no conflicts of interest. The authors confirm that they have permission to reproduce any copyrighted material.

References

Voit

E.O.

Computational Analysis of Biochemical Systems: A Practical Guide for Biochemists and Molecular Biologists. Cambridge University Press, New York; 2000.

Klipp

, Herwig

, Kowald

, Wierling

, Lehrach

Systems Biology in Practice: Concepts. Implementation and Application. Wiley, Berlin; 2005.

Palsson

B.Q.

Systems Biology: Properties of Reconstructed Networks. Cambridge University Press, New York; 2006.

Kitano

Systems biology: a brief overview. Science. 2002; 295: 1662–4.

Karlebach

, Shamir

Modelling and analysis of gene regulatory networks. Nature Reviews Molecular Cell Biology. 2008; 9: 770–80.

Wang

, Qian

, Dougherty

Inference of gene regulatory networks using S-system: a unified approach, IET Systems Biology. 2010; 4: 145–56.

Paulsson

Summing up the noise in gene networks. Nature. 2004; 427: 415–8.

Chen

B.S.

, Wang

Y.C.

, Wu

W.S.

, Li

W.H.

A new measure of the robustness of biochemical networks. Bioinformatics. 2005; 21: 2698–705.

Ervadi-Radhakrishnan

, Voit

E.O.

Controllability of nonlinear biochemical systems. Mathematical Biosciences. 2005; 196: 99–123.

10.

Rao

C.V.

, Wolf

D.M.

, Arkin

A.P.

Control, exploitation and tolerance of intracellular noise. Nature. 2002; 420: 231–6.

11.

Lin

C.L.

, Liu

Y.W.

, Chuang

C.H.

Control design for signal transduction networks. Bioinformatics and Biology Insights. 2009; 3: 1–14.

12.

Chen

B.S.

, Wu

W.S.

, Wang

Y.C.

, Li

W.H.

On the robust circuit design schemes of biochemical networks: steady-state approach. IEEE Transactions on Biomedical Circuits and Systems. 2007; 1: 91–104.

13.

Chen

B.S.

, Wu

W.S.

Robust filtering circuit design for stochastic gene networks under intrinsic and extrinsic molecular noises. Mathematical Biosciences. 2007; 211: 342–55.

14.

Jin

, Lindsey

Stability analysis of genetic regulatory network with additive noises. BMC Genomics. 2008; 9: S21.

15.

Grewal

M.S.

, Andrews

A.P.

Kalman Filtering: Theory and Practice Using MATLAB. Wiley, New York; 2001.

16.

Lillacci

, Valigi

State estimation for a model of gene expression. Proceedings of IEEE International Symposium on Circuits and Systems. Seattle 2008: 2046–9.

17.

Wang

, Liu

, Liang

, Vinciotti

An extended Kalman filtering approach to modeling nonlinear dynamic gene regulatory networks via short gene expression time series. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2009; 6: 410–9.

18.

W.W.

, Liu

, Lu

W.B.

, Xu

N.S.

, Du

G.C.

, Tan

J.L.

Observer-based online compensation of inner filter effect in monitoring fluorescence of GFP-expressing plant cell cultures. Biotechnology and Bioengineering. 2005; 91: 213–26.

19.

McAdams

H.H.

, Arkin

A.P.

Stochastic mechanisms in gene expression. Proceedings of the National Academy of Sciences of the USA. 1997: 814–9.

20.

Poor

, Looze

D.P.

Minimax state estimation for linear stochastic systems with noise uncertainty. IEEE Transactions on Automatic Control. 1981; 26: 902–6.