Bellman filter

 


Recursive state estimator for state-space models derived via dynamic programming

The Bellman filter is a recursive algorithm for estimating a sequence of unobserved (latent) states in a state-space model from noisy observations. It is typically formulated for models with a linear–Gaussian state transition and a possibly nonlinear and/or non-Gaussian observation density, and it updates the state by solving a per-time-step optimisation problem involving <math>\log p(y_t\mid x_t)</math>. Under linear–Gaussian observation models, it reduces to the standard Kalman filter update.[1][2][3]

In a discrete-time state-space model, an unobserved state vector evolves over time and generates observations. The linear–Gaussian case admits an optimal recursive solution via the Kalman filter, and standard econometric treatments include the monographs by Harvey and by Durbin & Koopman.[4][5]

For nonlinear and/or non-Gaussian observation models, exact filtering generally becomes intractable and practical methods rely on approximation (e.g. extended/iterated Kalman filtering) or simulation (e.g. particle filters). The Bellman filter is often presented as a “filtering-by-optimisation” alternative that tracks a posterior mode at each time step.[1]

== Model and notation ==

Although the approach is more generally applicable, a commonly used specification keeps the classic linear–Gaussian state transition while allowing a general observation density. For <math>t=1,\dots,n</math>:[1]

* '''Observation equation:'''

<math>y_t \sim p(y_t\mid x_t)</math>

* '''State-transition equation:'''

<math>x_t = c + T x_{t-1} + R \eta_t,</math>

where <math>\eta_t \sim \text{i.i.d. } \mathcal{N}(0, Q)</math>.

The filter maintains predicted quantities <math>\hat{x}_{t\mid t-1}</math>, <math>P_{t\mid t-1}</math> and filtered quantities <math>\hat{x}_{t\mid t}</math>, <math>P_{t\mid t}</math>.[1]

For the observation model, the Fisher information is defined as:[1]

<math>I(x) := \int \left[ -\nabla^2 \log p(y\mid x) \right] p(y\mid x)\, dy.</math>
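As an illustration of this definition, consider a hypothetical stochastic-volatility-style observation model <math>y \sim \mathcal{N}(0, e^x)</math>, not taken from the cited sources, for which the Fisher information works out to the constant <math>1/2</math>. A short Monte Carlo sketch confirms the expectation numerically:

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical observation model (illustration only): y ~ N(0, exp(x)),
# where x is a log-variance state. Then
#   log p(y|x) = -x/2 - y^2 * exp(-x)/2 + const,
# so the negative Hessian is y^2 * exp(-x)/2, and since E[y^2] = exp(x),
# the Fisher information is the constant I(x) = 1/2 for every x.
def neg_hessian(y, x):
    return y**2 * np.exp(-x) / 2.0

x = 0.3
y = rng.normal(0.0, np.sqrt(np.exp(x)), size=200_000)
I_mc = neg_hessian(y, x).mean()   # Monte Carlo estimate of I(x)

assert abs(I_mc - 0.5) < 0.01
```

The Monte Carlo average approximates the integral in the definition; for this particular model the answer is available in closed form, which makes the check easy to verify.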

The prediction step propagates the previous filtered estimates through the state transition:[1]

<math>\hat{x}_{t\mid t-1} = c + T \hat{x}_{t-1\mid t-1},</math>

<math>P_{t\mid t-1} = T P_{t-1\mid t-1} T^\top + R Q R^\top.</math>

The update step obtains the filtered state as the maximiser of the observation log-likelihood penalised by deviation from the prediction:[1]

<math>\hat{x}_{t\mid t} = \arg\max_{x\in\mathbb{R}^d} \left\{ \log p(y_t\mid x) - \tfrac{1}{2} (x - \hat{x}_{t\mid t-1})^\top P_{t\mid t-1}^{-1} (x - \hat{x}_{t\mid t-1}) \right\}.</math>

The filtered covariance is then updated using the Fisher information evaluated at the new estimate:[1]

<math>P_{t\mid t} = \left( P_{t\mid t-1}^{-1} + I(\hat{x}_{t\mid t}) \right)^{-1}.</math>
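The recursions above can be sketched in code. The following Python sketch assumes a hypothetical scalar model with Poisson counts <math>y_t \sim \text{Poisson}(e^{x_t})</math>, for which the Fisher information is <math>e^x</math>, and solves the per-step optimisation with Newton's method; the parameter values are illustrative, not taken from the cited sources.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical scalar model (illustration only): linear-Gaussian state
# transition x_t = c + T x_{t-1} + eta_t, eta_t ~ N(0, Q), with Poisson
# counts y_t ~ Poisson(exp(x_t)).
c, T, Q = 0.0, 0.95, 0.05

def simulate(n):
    x = np.zeros(n)
    y = np.zeros(n, dtype=int)
    x_prev = 0.0
    for t in range(n):
        x[t] = c + T * x_prev + rng.normal(0.0, np.sqrt(Q))
        y[t] = rng.poisson(np.exp(x[t]))
        x_prev = x[t]
    return x, y

def bellman_filter(y, x0=0.0, P0=1.0, newton_steps=20):
    n = len(y)
    x_filt = np.zeros(n)
    P_filt = np.zeros(n)
    x_prev, P_prev = x0, P0
    for t in range(n):
        # Prediction step (linear-Gaussian transition).
        x_pred = c + T * x_prev
        P_pred = T * P_prev * T + Q
        # Update step: maximise log p(y_t | x) minus the quadratic
        # penalty, via Newton's method. For Poisson(exp(x)):
        #   d/dx log p = y - exp(x),   d^2/dx^2 log p = -exp(x),
        # so the penalised objective is strictly concave.
        x = x_pred
        for _ in range(newton_steps):
            grad = (y[t] - np.exp(x)) - (x - x_pred) / P_pred
            hess = -np.exp(x) - 1.0 / P_pred
            x -= grad / hess
        # Covariance update with Fisher information I(x) = exp(x).
        P = 1.0 / (1.0 / P_pred + np.exp(x))
        x_filt[t], P_filt[t] = x, P
        x_prev, P_prev = x, P
    return x_filt, P_filt

x_true, y = simulate(200)
x_filt, P_filt = bellman_filter(y)
```

In this scalar setting the Newton iteration converges quickly because the objective is concave; in multivariate models the same update is a small <math>d</math>-dimensional optimisation per time step.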

== Special cases and relationships ==

=== Linear–Gaussian observation equation ===

If <math>y_t = d + Z x_t + \varepsilon_t</math>, where <math>\varepsilon_t \sim \text{i.i.d. } \mathcal{N}(0, H)</math>, then the update reduces to the Kalman filter update.[1][3]
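Because the objective in the update step is exactly quadratic under a linear–Gaussian observation equation, a single closed-form maximisation reproduces the Kalman update. A scalar numerical check, with made-up parameter values, illustrates the equivalence:

```python
import numpy as np

# Scalar illustration (hypothetical numbers): with a linear-Gaussian
# observation y = d + Z*x + eps, eps ~ N(0, H), the Bellman update
# objective is an exact quadratic, so its maximiser coincides with
# the Kalman filter update.
d, Z, H = 0.5, 2.0, 0.25
x_pred, P_pred = 1.0, 0.8
y = 3.1

# Bellman update: maximise log N(y; d + Z*x, H) - (x - x_pred)^2 / (2*P_pred).
# Setting the gradient to zero gives a closed form.
x_bellman = (Z * (y - d) / H + x_pred / P_pred) / (Z * Z / H + 1.0 / P_pred)
# Fisher information of the Gaussian observation model: I = Z^2 / H.
P_bellman = 1.0 / (1.0 / P_pred + Z * Z / H)

# Standard Kalman update for comparison.
S = Z * P_pred * Z + H            # innovation variance
K = P_pred * Z / S                # Kalman gain
x_kalman = x_pred + K * (y - d - Z * x_pred)
P_kalman = (1.0 - K * Z) * P_pred

assert np.isclose(x_bellman, x_kalman)
assert np.isclose(P_bellman, P_kalman)
```

The covariance comparison is the usual identity between the information form and the covariance form of the Kalman update.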

=== Iterated (extended) Kalman filtering ===

A Gauss–Newton interpretation of iterated Kalman updates appears in Bell and Cathey.[6]

== Smoothing ==

With a linear–Gaussian state transition, standard Rauch–Tung–Striebel (RTS) fixed-interval smoothing recursions can be used to obtain smoothed estimates <math>\hat{x}_{t\mid n}</math> and <math>P_{t\mid n}</math> from the filtered output.[1][7]
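A minimal scalar version of the RTS backward recursion, assuming a linear–Gaussian transition and illustrative filtered values not taken from the sources, can be sketched as:

```python
import numpy as np

# Hypothetical scalar transition (illustration only):
# x_t = c + T x_{t-1} + eta_t, eta_t ~ N(0, Q).
c, T, Q = 0.0, 0.9, 0.1

def rts_smoother(x_filt, P_filt):
    """Backward RTS pass: turn filtered means/variances into smoothed ones."""
    n = len(x_filt)
    x_smooth = np.array(x_filt, dtype=float)
    P_smooth = np.array(P_filt, dtype=float)
    for t in range(n - 2, -1, -1):
        x_pred = c + T * x_filt[t]          # one-step prediction from t
        P_pred = T * P_filt[t] * T + Q
        G = P_filt[t] * T / P_pred          # smoother gain
        x_smooth[t] = x_filt[t] + G * (x_smooth[t + 1] - x_pred)
        P_smooth[t] = P_filt[t] + G * (P_smooth[t + 1] - P_pred) * G
    return x_smooth, P_smooth

# Example filtered output (illustrative values, not from a real run).
x_f = [0.2, 0.5, 0.4]
P_f = [0.3, 0.25, 0.2]
x_s, P_s = rts_smoother(x_f, P_f)
```

The recursion only needs the filtered means and variances plus the transition parameters, which is why it can be applied to the Bellman filter's output exactly as it is to the Kalman filter's.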

== References ==

  1. ^ a b c d e f g h i j Lange, Rutger-Jan (2024). "Short and simple introduction to Bellman filtering and smoothing". arXiv:2405.12668 [stat.ME].
  2. ^ Lange, Rutger-Jan (2024). "Bellman filtering and smoothing for state–space models". Journal of Econometrics. 238 (2): 105632. doi:10.1016/j.jeconom.2023.105632.
  3. ^ a b Kalman, R. E. (1960). "A New Approach to Linear Filtering and Prediction Problems". Journal of Basic Engineering. 82 (1): 35–45. doi:10.1115/1.3662552.
  4. ^ Harvey, A. C. (1990). Forecasting, Structural Time Series Models and the Kalman Filter. Cambridge University Press. ISBN 978-0521405737.
  5. ^ Durbin, J.; Koopman, S. J. (2012). Time Series Analysis by State Space Methods (2nd ed.). Oxford University Press. ISBN 978-0199641178.
  6. ^ Bell, B. M.; Cathey, F. W. (1993). "The iterated Kalman filter update as a Gauss–Newton method". IEEE Transactions on Automatic Control. 38 (2): 294–297. doi:10.1109/9.250476.
  7. ^ Rauch, H. E.; Tung, F.; Striebel, C. T. (1965). "Maximum likelihood estimates of linear dynamic systems". AIAA Journal. 3 (8): 1445–1450. doi:10.2514/3.3166.
