アンサンブルカルマンフィルターによる大気海洋結合モデルへのデータ同化...

アンサンブルカルマンフィルターによる大気海洋結合モデルへのデータ同化On-line estimation of observation error covariance for ensemble-based filters

Genta UenoThe Institute of Statistical Mathematics

Covariance matrix in DA

,1fx x G vt t t tt

y h x wt t tt

State space model

N Qvt t

1 1 1, |1 2 : 1 11: 2 2 2

1 12 1

TJ x B xy Qx v x x v vT t tT t

y yh x h xRt t t ttt tt

fx x Gt

vt t t

Cost function

Filtered estimates with different θ

Large Qlargeh 大 )

Large R(large ) Which one should be

chosen?

Ensemble approx. of distribution

Ensemble Kalman filter (EnKF),Particle filter(PF)

Non-Gaussian dist.

Ensemble approx./ Particle approx.

xt V jtx jtN |,|

x jt |

V jt |xt

Gaussian dist.

Exactly represented

Kalman filter (KF)

RtH tV ttH tH tV ttK t 1|

GtQtGtF tV ttF tV tt

x ttF tx tt

Kalman filter (KF)

V ttH tK tIV tt

x ttH tytK tx ttx tt

V ttx ttN 1|,1| V ttx ttN |,| V ttx ttN 1|1,1|1

x tt 1|1 x tt 1| x tt |

V tt 1| V tt 1|1 V tt |

xt 1 xt xt

an gain

Simulation

Filtered dist. at t-1 Predicted dist. at t Filtered dist. at

EnKF and PF

x tt)1(

1|1 x tt

x tt)1(|R

esampling

x tt)1(

1|1 x tt

x tt)1(|

Approx. K

alman gain

x nttytp )(

EnKF PF

xt 1 xt xt

| | 1 | 1n n n n

yx x w xHK tt ttt t t t t t

| 1 | 1 | 1V H H V H RK t t t t t t t t t t

Likelihood

Which is the most likely distribution that produces observation yobs ?

Likelihood L() = p(yobs|θ)

In this example, 3 is most likely.

p y |2

p y |3

yobs yobs yobs

, , , |1 2

, , ,| | , | , , | ,1 21 2 1 3 1 2 1

| ,1: 11

L p y y yT

yy yp p p py y y y y y y N TT

p y yt tt

Likelihood of time series

Find θ that maximizes L(θ).In practice, log-likelihood is easy to handle:

y yt t

Likelihood of time series

| ,1: 1

| , | ,1

Tp y y

p y x x y dxt t t

Observation model

Predicted dist.

,H xt t t

N R,f fx

N ~ 0,w

y H wt t t

N Rt t

Non-Gaussian dist.[due to nonlinear model]

If it were Gaussian,

likelihood

Estimation of covariance matrix

Minimizing innovation [predicted error]

Bayes estimation

• Naive• Ensemble mean and covariance of state• Adjustment according to cost function• Matcing with innovation covariance

1. With assumption of Gaussian dist. of state

Maximum likelihood

• Ensemble mean of likelihood2. Without assumption of Gaussian dist. of state

This study

Covariance matching Ueno et al., Q. J. R. Met. Soc. (2010)

Ensemble approx. of likelihood

y h x wt t t

• Find θ that maximizes the ensemble approx. log-likelihood.

dim 1log 2 log

1log exp |

| ,1: 1

| , | ,1: 1

y yt t

p y x x y dxt t t t t

N np y x x x dx

t t t t t

tT yt Rt

y H tt

tN nN n

p y xt t tN n

1 log1 | 11

N n nNyx xR H tt t t t

Observation model

Ensemble mean of likelihood of each member xt|t-1

Regularization of Rt

ttH tytRtx nttH tyt

1explog

Sample covariance(singular due to n<<p)

Regularization withGaussian graphical model

12 neighborhood

Maximum likelihood

ttH tytRtx nttH tytRt

1exploglog

x jxihqijQ

0.1, 0.2, 0.5,1, 2, 5,10

4, 8, 20, 40

1, 2, 5,10

1, 2, 5,10, 20, 50,100,200,500

Data and Model

longitude

The color shows SSH anomalies.

Filtered estimates with different θ

Large Qlargeh 大 )

Large R(large ) Which one should be

chosen?

System noise: magnitude

System noise: zonal correlation length

System noise: meridional correlation length

Observation noise: magnitude

Estimates with MLE

2m, 20deg, 5deg, 20MLE

magnitude = (5.95cm)2, correlation lengths= (2.38, 2.52deg)

Filtered estimate Smoothed estimate

longitude

Summary for the first half

2m, 20 , 5 , 20MLE

• Maximum likelihood estimation can be carried out even for non-Gaussian state distribution with ensemble approximation

• Applicable for ensemble-based filters such as EnKF and PF

• Estimated parameters:

• … Tractable for just four parameters?

Ueno et al., Q. J. R. Met. Soc. (2010)

Motivation for the second half

• The output of DA (i.e. “analysis”) varies with prescribed parameter θ, where θ = (B, Q1:T, R1:T)

B: covariance matrix of the initial state (i.e. V0|0)Qt: covariance matrix of system noiseRt: covariance matrix of observation noise

• My interest is how to construct optimal θ for a fixed dynamic model• Only four parameters so far …

• We allow more degree of freedom on R1:T

• (dim yt)2/2 elements at maximum

Likelihood of Rt

Current assumption , , ,1: 1 2

R R R RT T

, ,1 :

R RT t

Log-likelihood

,1: 1t t

dim 1log 2 log

p | 1 | 121

N n ny yh x h xRt ttt tt t t t

and are fixed1:

Estimation design

• Use ℓt(R1:t) for estimating Rt only• It is of course that R1:t-1 are parameters of ℓt(R1:t)• But they are assumed to have been estimated with former log-likelihood,

ℓ1(R1), …, ℓt-1(R1:t-1) , and to be fixed at current time step t.

• Rt is estimated at each time step t.

Bad news:• The estimated Rt may vary significantly between different time steps.• A time-constant R cannot be estimated within the present framework.

,1: 1t t

dim 1log 2 log

1 1log exp | 1 | 12

N n ny yh x h xRt ttt tt t t t

Experiment

case 1: 20 (control)

case 3 : , ,1

case 4

R rt m

diag r

•　 Assumed structure of Rt

Data and Model

longitude

The color shows SSH anomalies.

Estimate of Rt (Temporal mean)

20Rt R

t t , ,

1diag rR r

t m R I

varcov

•Case t similar output for •Case diagonal: large variance near equator, small variance for off-equator•Case tuniform variance with intermediate value

Estimate of Rt (Spatial mean)

20Rt R

t t , ,

1diag rR r

t m R I

• Case t: small variance for first half, large for second half• Case diagonal: large variance around 1998• Case t: similar for the diagonal case

1992- year -2002

Filtered estimates20R

t t , ,

1diag rR r

t m R I

•Case t: false positive anomalies in the east

•Case t: negative anomalies in the east, but the equatorial Kelvin waves unclear •Case diagonal: negative anomalies and equatorial Kelvin reproduced

Iteration times

• Only 2-4 times• Small number of parameters requires large iteration numbers

diag rR rt m

R It t

Summary of the second half

• An on-line and iterative algorithm for estimating observation error covariance matrix Rt.• The optimality condition of Rt leads a condition of Rt in a closed form.•Application to a coupled atmosphere-ocean model•Only 4-5 iterations are necessary•A diagonal matrix with independent elements produces more likely estimatesthan those of scalar multiplication of fixed matrices ( or I).

アンサンブルカルマンフィルターによる大気海洋結合モデルへのデータ同化...

Documents

Transcript of アンサンブルカルマンフィルターによる大気海洋結合モデルへのデータ同化...

データ解析 第十二回「一般化加法モデル」ibis.t.u-tokyo.ac.jp/suzuki/lecture/2015/dataanalysis/L...データ解析 第十二回「一般化加法モデル」 鈴木

glm TypeI TypeII および TypeIII の計算例 - SAS...3 データとモデル 今回使用するデータおよびモデル式について述べる． 3.1 データ 以下の表1

CST PCB STUDIO / CST BOARDCHECK プリント基 …Cadence 、Mentor Graphics、図研のCADデータや ODB++フォーマットに対応 3D PEECモデル、2D伝送線路モデル、3D

データ共用型（プラットフォーム型）契約モデル規約 - METI...i データ共用型（プラットフォーム型）契約モデル規約 に関する報告書 第1

CMIP3気候モデルにおける北太平洋 10年規模変動の再現性 · cmip3気候モデルにおける北太平洋 10年規模変動の再現性 大島和裕，谷本陽一

LbL-3D生体組織の構築...SkinEthic, MatTex 肝モデル: Hepergen 血液脳関門モデル: ファーマコセル 東洋合成, Scivax, 日立テクノ, 住友べ, サイフューズ他

emis2cmaq jstream version 1...1. 排出量データ変換ツール emis2cmaq_jstreamの概要 排出量データ変換ツール emis2cmaq_jstream は、大気質モデルへの入力用排出量データ

3次元計測データのモデル化及び 利活用のための調 …archive.sokugikyo.or.jp/pdf/apa109_2017_11/109-07.pdf42 先端測量技術 109号 3次元計測データのモデル化及び

MethokenR: 樹木モデルによる言語データ解析―RのmvpartとrandomForestを用いて

交通行動の分析とモデリング 6.5 複数データに基づ …bin.t.u-tokyo.ac.jp › kaken › pdf › 2010_start_omura2.pdf6.5 複数データに基づくモデル推定

中解像度版 大気海洋結合モデルによる

5 海洋数値モデルの計算手法...第5回:海洋数値モデルの計算手法 沿岸海洋モデルPOM シグマ座標系座標変換 シグマ座標系の方程式 自由表面を解くことの困難（CFL条件）

Title SiBUC Mannual 利用編 ver1.0 -Part1: モデル入力データ …...SiBUC Mannual 利用編 ver1.0 ―Part1: モデル入力データの作成と陸面過程解析の方法―

とRedmine を用いた気象研究所共用海洋モデルkaiyo-gakkai.jp/jos/uminokenkyu/vol27/27-5/27-5-01Sakamoto.pdf · 気象研究所共用海洋モデルの開発管理 177

統計的モデル選択 - データが選ぶ良いモデルとは？ · 2009-04-22 · 2つのモデルと推定精度 • 同じサンプル数のとき、モデル1と2では？

2 モデル選択 2-1. ベイズ式のモデル比較tohhiro/...ベイズファクター：観測データに対して2つのモデルを 較 相対的なモデルの良さを 較にするには、モデルの適切性も同時に

MSSG-Aのモデル開発...MSSG-Aのモデル開発 （独）海洋研究開発機構(JAMSTEC) 地球シミュレータセンター(ESC) マルチスケールモデリング研究グループ(MSSG)

公共 NGS データから非モデル生物のデータをより …nakazato/presentation/ngsfield...公共NGS データから非モデル生物のデータをより簡単に得るための検索

大気海洋海氷結合モデルによる 水惑星の気候シミュレーション · 大気海洋海氷結合モデルによる 水惑星の気候シミュレーション 河合佑太

データ構造とアルゴリズム①qma/education/data/ALGO1...5 なぜデータ構造？（2/5） • 理由1：モデルとデータ表現に適切なデータ構造を用いる

データ解析第十二回「一般化加法モデル」ibis.t.u-tokyo.ac.jp/suzuki/lecture/2015/dataanalysis/L...データ解析第十二回「一般化加法モデル」鈴木

glm TypeI TypeII および TypeIII の計算例 - SAS...3 データとモデル今回使用するデータおよびモデル式について述べる． 3.1 データ以下の表1

データ共用型（プラットフォーム型）契約モデル規約 - METI...i データ共用型（プラットフォーム型）契約モデル規約に関する報告書第1

CMIP3気候モデルにおける北太平洋 10年規模変動の再現性 · cmip3気候モデルにおける北太平洋 10年規模変動の再現性大島和裕，谷本陽一

LbL-3D生体組織の構築...SkinEthic, MatTex 肝モデル: Hepergen 血液脳関門モデル: ファーマコセル東洋合成, Scivax, 日立テクノ, 住友べ, サイフューズ他

emis2cmaq jstream version 1...1. 排出量データ変換ツール emis2cmaq_jstreamの概要排出量データ変換ツール emis2cmaq_jstream は、大気質モデルへの入力用排出量データ

3次元計測データのモデル化及び利活用のための調 …archive.sokugikyo.or.jp/pdf/apa109_2017_11/109-07.pdf42 先端測量技術 109号 3次元計測データのモデル化及び

中解像度版大気海洋結合モデルによる

5 海洋数値モデルの計算手法...第5回:海洋数値モデルの計算手法沿岸海洋モデルPOM シグマ座標系座標変換シグマ座標系の方程式自由表面を解くことの困難（CFL条件）

2 モデル選択 2-1. ベイズ式のモデル比較tohhiro/...ベイズファクター：観測データに対して2つのモデルを較相対的なモデルの良さを較にするには、モデルの適切性も同時に

MSSG-Aのモデル開発...MSSG-Aのモデル開発（独）海洋研究開発機構(JAMSTEC) 地球シミュレータセンター(ESC) マルチスケールモデリング研究グループ(MSSG)

大気海洋海氷結合モデルによる水惑星の気候シミュレーション · 大気海洋海氷結合モデルによる水惑星の気候シミュレーション河合佑太