Lecture 4 Estimating the Covariance Functionweb.stanford.edu/class/stats253/lectures/lect4.pdf ·...

Lecture 4Estimating the Covariance Function

Dennis SunStanford University

Stats 253

June 26, 2015

1 Review

2 TheHack

3 Model-Based Approach

4 Course Logistics

1 Review

2 TheHack

4 Course Logistics

Generalized Least Squares

We saw that if the model isy = Xβ + ε,

whereE[ε|X] = 0 andVar[ε|X] = Σ, then the best estimator isβ̂GLS = (XTΣ−1X)−1XTΣ−1y.

Covariance Functions

• Σ is usually unknown, and there is no hope of estimating itfrom just n observations.

• So we parametrizeΣ, i.e.,Σ = Σθ .• There is often a covariance functionΣθ(s, s′) on the spacewhere the observations lie, and we obtain (Σθ)ij = Σθ(si, sj).

• Examples include• Σθ(s, s′) = max(θ1 − θ2d(s, s′), 0)• Σθ(s, s′) = θ1 exp{θ2d(s, s′)}.

• We estimate θ from the data.• Wewill learn twoways: (1) the HackTM and (2) model-based.

1 Review

2 TheHack

4 Course Logistics

“The Chicken or the Egg” Problem

• To calculate β̂GLS , we need an estimate ofΣ.• To estimateΣ = Var(ε), we need an estimate of the error

ε̂ = y −Xβ̂GLS ,

so we need β̂GLS .Solution: Use β̂OLS as a “preliminary” estimate and estimateΣusing

ε̂ = y −Xβ̂OLS .

Estimating the Covariance• Now that we have ε̂, how dowe estimate the covariance?• Σij = Σ(d(si, sj)) is assumed to be a function of the distancebetween the points. So we need to estimate the covariancefunctionΣ(h).

• If data is regularly spaced, then estimateΣ(h) by thecovariance of the observations spaced h apart:

Σ̂(h) =

∑(i,j)∈Sh ε̂iε̂j

where Sh = {(i, j) : d(si, sj) = h}.• If data is irregularly spaced, then we look in a window aroundh.

Σ̂(h) =

∑(i,j)∈Sh,δ ε̂iε̂j

|Sh,δ|,

where Sh,δ = {(i, j) : d(si, sj) ∈ [h− δ, h+ δ]}.

California Ozone ExampleOzone levels measured across California

California Ozone ExampleResiduals ε̂ after regressing out latitude and longitude

California Ozone ExamplePairs of observations where 0 ≤ d(si, sj) < .2

Σ̂(.1) = 2.7× 10−4

California Ozone ExamplePairs of observations where .2 ≤ d(si, sj) < .4

Σ̂(.3) = 1.5× 10−4

California Ozone ExamplePairs of observations where .4 ≤ d(si, sj) < .6

Σ̂(.5) = 1.2× 10−4

California Ozone ExamplePlot of Σ̂(h)

● ●

●●

●● ●

●●

● ●

Distance

California Ozone ExampleCalculate SE = SD/

√n to add error bars:

● ●

●●

●● ●

●●

● ●

Distance

Estimating θ

• Canwe just connect the dots and call that the covariancefunction?

• No! Not guaranteed to be positive definite.• Parametrizing the covariance asΣθ(h) helps ensure that thecovariance is positive definite.

• Choose θ to minimize the difference between the observedand theoretical covariance function:

θ̂ = argminθ

wh(Σ̂(h)− Σθ(h))2.

• Thewh are weights. Wemay want to downweight bins forwhich we have less data, i.e.,wh ∝ Nh.

California Ozone ExampleFitted covariance functions

● ●

●●

Distance

ExponentialGaussian

California Ozone Example

Now that we’ve estimated the covariance function asΣ̂(h) = .00025e−2.496h

we can go back and fit generalized least squares!

1 Review

2 TheHack

4 Course Logistics

Model-Based Approach

• TheHack suffers from two drawbacks:• Weused β̂OLS to get a preliminary estimate of εwhenwereally should be using β̂GLS .

• The final fitting of the covariance function to the data requiresmanual tuning of parameters: bin size, weightswh, etc.

• Another approach is to assume a parametric model andestimate the parameters by maximum likelihood.

TheModel

Themodel is stilly = Xβ + ε,

withE[ε|X] = 0 andVar[ε|X] = Σθ , except nowwe furtherassume that ε is normal.We can nowwrite down a likelihood for our data:

(2π)n/2(det Σθ)1/2exp

2(y −Xβ)TΣ−1θ (y −Xβ)

which we can optimize overβ and θ simultaneously.

Calculating theMLE1

(2π)n/2(det Σθ)1/2exp

2(y −Xβ)TΣ−1θ (y −Xβ)

The log-likelihood is−1

2log det Σθ −

2(y −Xβ)TΣ−1θ (y −Xβ).

Let’s first optimize overβ for any fixed θ. That is, what is β̂(θ)?Why, it’s the GLS estimator! β̂(θ) = (XTΣ−1θ X)−1XTΣ−1θ y.Let’s plug this into the log-likelihood:

2log det Σθ −

2(y −Xβ̂(θ))TΣ−1θ (y −Xβ̂(θ)).

Nowwe “just” have to optimize this over θ.

Calculating theMLE

2log det Σθ −

2(y −Xβ̂(θ))TΣ−1θ (y −Xβ̂(θ))

= −1

2log det Σθ −

2yTΣ−1θ (I −X(XTΣ−1θ X)−1XTΣ−1θ )y

To optimize this, you’ll need to:• compute derivative with respect to θ• solve a highly non-convex problem.

However, the dimensionality of θ is usually small, so you can do abrute-force grid search.

Summary

• TheHack is simple and fairly robust. We don’t make anydistributional assumptions.

• Themodel-based approach is more principled. But it may bedifficult or impossible to compute theMLE in practice, and it issensitive to the assumption of normality.

1 Review

2 TheHack

4 Course Logistics

Logistics

• Homework 1 will be released tonight and due next Friday.• It is a data analysis assignment that involves implementingsome of the methods we’ve discussed.

• It is also a prediction competition. There will be prizes for thewinners.

• I will provide starter code in R andmaybe Python.• JingshuWang, the TA for this course, will have office hoursThursdays 2-4pm.

Lecture 4 Estimating the Covariance Functionweb.stanford.edu/class/stats253/lectures/lect4.pdf ·...

Documents

Transcript of Lecture 4 Estimating the Covariance Functionweb.stanford.edu/class/stats253/lectures/lect4.pdf ·...

RAKVERE MUUSIKAKOOLI KLARNETI ÕPPEKAVA · Põhipill - klarnet 2 2 2 2 2 2 2 1 5.2. Solfedžo 2 2 2 2 2 2 2 5.3. Muusikalugu 1 1 1 1 1 5.4.Ansambel 0,5 0,5 1 1 1 5.5. Üldklaver 0,5

images1.loopnet.com...2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1/2 1. 2. 4. Except hotel uses which shall not exceed 12 stories

. 2. 1. 1. 1. 1. 2. 1. 1. 1. 2. 2. 2. 2. 2. 2. 2. 2. 2. 2. 2. 2. 1. 2. 2. 2. 2. 1. 1. 1. 1. 1. 1. 1. 1. 2. 1. 2. 2. 2. 2. 2. 1. 1. 1. 1. 2. 2. 2. 2. 2. 2. 2. 1. 2. 2. 2. 2. 2. 2

DIE DHL EXPRESS WELT ZIELE, ZEITEN, PREISE 2018 · Saudi Arabien 9 2 2 2 0 2 2 2 2 Schweden 4 1 1 1 EU 1 2 1 1 1 2 Schweiz 5 1 1 1 0 1 2 1 1 1 2 Senegal 10 2 2 0 2 2 2 2 Serbien 6

Δίκτυα Υπολογιστών Icse.uoi.gr/~epap/MYY703/downloads/lect4.pdf · 2019. 10. 23. · 1 Εισαγωγή-Βασικές έννοιες ... η επέκταση του

Falling with Drag (again) Introduction to Special Relativityphysics.ucsc.edu/~joel/Phys5I/11Phys5I-Lect4.pdf · Introduction to Special Relativity Falling with Drag (again) Fdrag

355 2012 lect4 - GitHub Pages · 2020-06-18 · In each ciphertext, letters in positions 1 & 4 are the same letter encrypted under the day key ! With 2 months of day keys and Enigma

HSRE Lect4 Weiner

Heptatopix - Joachim Gohlke · IV. 1/2 1/2 1/2 1/2 1/2 1 1/2 2 b9 bb11 bbbb13 V. 1/2 1/2 1/2 1/2 1 1/2 2 1/2 mmaj7 b9 bb11 bb13 VI. 1/2 1/2 1/2 1 1/2 2 1/2 1/2 mmaj7b5 b9 bb11 #13

Vedrò con mio diletto - FiBO - Kotisivu · 2018. 10. 8. · 1(2) 1(4) 2(4) 2(4) 3(0) 3(1) 3(0) 2(4) 2(3) 2(2) 2(1) 3(1) 2(0) 2(1) 2(1) 2(0) 1(4) 1(3) 1(2) 1(1) 2(1) 3(1) 3(1) 2(1)

Harmonic Oscillatorohsean/417/Lecture Notes/lect4-1.pdf · Title: Harmonic Oscillator.jnt Author: oh Created Date: 9/20/2010 5:51:01 PM

Handout Matlab Lect4

2 1 2 1 2 1 1 2 1 2 1 2 1 2 1 2 1 2 1 0 2 1 2 1 1 1 ... - INHA

CAMPEONATO NACIONAL DE MATEMÁTICAluduscience.pt/regras/catalogo.pdf · Alquerque 3/Sec 1/2 Sec 1 1/2/3 T 1/2 1/2 1/2 1/2 1 1 1 1/2 1/2/3 1/2 1/2 1/2 1/2 1/2 1 Sec 1/2 T T T 2 T 1/2

강구조의개요 - KOCWelearning.kocw.net/contents4/document/lec/2012/SeoulTech/YoonSungwon/2.pdf · 초과 3 1 1/2 3 2 2 3 2 1 이하 2 1 1/2 2 1 1/2 1 1/2 2 2 1/2 4／20 이하

UE13 lect4 Tsft-robots - Olivier White, Ir, PhDolivierwhite.weebly.com/uploads/2/5/4/0/2540941/ue13_lect4_tsft-robots.pdf6. Introduction sur le transfert 7. Intérêt de la robotique

Pejzazna arhitektura - ul-gov.me · zso zso zso zso zso zso zso zso zso zso zso zso 1 1 2 1 2 1 1 2 1 1 1 2 1 1 2 1 2 1 2 1 1 2 90 91 2867 1 1 1 2 3 2 1 1 1 1 2 1 1 3 2 2 1 1 2 1

Lecture 4: Regression ctd and multiple classesaz/lectures/ml/lect4.pdf · 2015. 1. 27. · 2 4 6 8 10-1-0.5 0 0.5 1 1.5 Least Squares 2 4 6 8 10-1-0.5 0 0.5 1 1.5 Ridge Regression

ホーム | 世田谷区ホームページ · 2- 1 1- 1- 1- 2- 1- 1- 1- 1 1- 2- 5- 5- 6- 1- 1- 1- 1- 1- 1- 1- 2- 2 2- 2 2- 2- 2- 3- 4 5- fi2- 2- 3- 2020. 1. 1 (3427)8148 (3425)5579

ÍndiceInformación técnica - gfinder.findernet.com · Grados de protección IP XIII 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 2 2 1 1 1 ... los relés Finder