Chapter 6. Point Estimation

Weiqi Luo (骆伟祺)
School of Software, Sun Yat-Sen University
Email: weiqi.luo@yahoo.com
Office: # A313


6.1. Some General Concepts of Point Estimation

6.2. Methods of Point Estimation


6.1 Some General Concepts of Point Estimation

To learn about population characteristics, statistical inference obtains sample data from the population under study; conclusions are then based on the computed values of various sample quantities (statistics).

Typically, we will use the Greek letter θ for the parameter of interest. The objective of point estimation is to select a single number, based on sample data (a statistic), that represents a sensible value for θ.


Point Estimation

A point estimate of a parameter θ is a single number that can be regarded as a sensible value for θ.

A point estimate is obtained by selecting a suitable statistic and computing its value from the given sample data. The selected statistic is called the point estimator of θ.

[Diagram: Population (parameters θ) → a sample → a function of the sample (a statistic) → a quantity used for estimating θ.]

Here, the type of population under study is usually known, while the parameters are unknown.

Q #1: How do we obtain candidate estimators based on the population?

Q #2: How do we measure the quality of the candidate estimators?

Example 6.1

A manufacturer has used a bumper design in a sequence of 25 controlled crashes against a wall, each at 10 mph, using one of its compact car models. Let X = the number of crashes that result in no visible damage to the automobile. What is a sensible estimate of the parameter p = the proportion of all such crashes that result in no damage?

If X is observed to be x = 15, the most reasonable estimator and estimate are

$$\text{estimator: } \hat{p} = \frac{X}{n}, \qquad \text{estimate: } \hat{p} = \frac{x}{n} = \frac{15}{25} = 0.60$$

Example 6.2

Reconsider the accompanying 20 observations on dielectric breakdown voltage for pieces of epoxy resin, first introduced in Example 4.29 (p. 193):

24.46 25.61 26.25 26.42 26.66 27.15 27.31 27.54 27.74 27.94

27.98 28.04 28.28 28.49 28.50 28.87 29.11 29.13 29.50 30.88

The pattern in the normal probability plot given there is quite straight, so we now assume that the distribution of breakdown voltage is normal with mean value μ. Because normal distributions are symmetric, μ is also the median of the distribution. The given observations are then assumed to be the result of a random sample X1, X2, …, X20 from this normal distribution.

Example 6.2 (Cont'd)

Consider the following estimators and resulting estimates for μ:

a. Estimator = $\bar{X}$ (the sample mean), estimate = $\bar{x} = \sum x_i / n = 555.86/20 = 27.793$

b. Estimator = $\tilde{X}$ (the sample median), estimate = $\tilde{x} = (27.94 + 27.98)/2 = 27.960$

c. Estimator = $[\min(X_i) + \max(X_i)]/2$, the average of the two extreme observations; estimate = $[\min(x_i) + \max(x_i)]/2 = (24.46 + 30.88)/2 = 27.670$

d. Estimator = $\bar{X}_{tr(10)}$, the 10% trimmed mean (discard the smallest and largest 10% of the sample and then average); estimate = $\bar{x}_{tr(10)} = (555.86 - 24.46 - 25.61 - 29.50 - 30.88)/16 = 27.838$
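The four estimates in Example 6.2 can be checked numerically. The following is a minimal sketch using only the Python standard library; the helper name `trimmed_mean` and the variable names are illustrative choices, not part of the lecture.

```python
import statistics

# The 20 breakdown-voltage observations from Example 6.2.
voltages = [24.46, 25.61, 26.25, 26.42, 26.66, 27.15, 27.31, 27.54, 27.74, 27.94,
            27.98, 28.04, 28.28, 28.49, 28.50, 28.87, 29.11, 29.13, 29.50, 30.88]

def trimmed_mean(data, fraction):
    """Drop the smallest and largest `fraction` of the data, then average the rest."""
    ordered = sorted(data)
    k = int(len(ordered) * fraction)  # number of points removed from each end
    return statistics.mean(ordered[k:len(ordered) - k])

mean_est = statistics.mean(voltages)                # (a) sample mean
median_est = statistics.median(voltages)            # (b) sample median
midrange_est = (min(voltages) + max(voltages)) / 2  # (c) average of the two extremes
trimmed_est = trimmed_mean(voltages, 0.10)          # (d) 10% trimmed mean

print(round(mean_est, 3), round(median_est, 3),
      round(midrange_est, 3), round(trimmed_est, 3))
```

The printed values should agree with the estimates 27.793, 27.960, 27.670, and 27.838 computed by hand above.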

Example 6.3

In the near future there will be increasing interest in developing low-cost Mg-based alloys for various casting processes. It is therefore important to have practical ways of determining various mechanical properties of such alloys. Assume that the observations are a random sample X1, X2, …, X8 from the population distribution of elastic modulus under such circumstances. We want to estimate the population variance σ².

Method #1: the sample variance

$$\hat{\sigma}^2 = S^2 = \frac{\sum (X_i - \bar{X})^2}{n-1} = \frac{\sum X_i^2 - (\sum X_i)^2/n}{n-1}$$

$$\hat{\sigma}^2 = s^2 = \frac{\sum x_i^2 - (\sum x_i)^2/8}{7} = 0.2517$$

Method #2: divide by n rather than n − 1

$$\hat{\sigma}^2 = \frac{\sum (X_i - \bar{X})^2}{n} = \frac{\sum X_i^2 - (\sum X_i)^2/n}{n}$$

$$\hat{\sigma}^2 = \frac{\sum x_i^2 - (\sum x_i)^2/8}{8} = 0.2208$$

Estimation Error Analysis

Note that $\hat{\theta}$ is a function of the sample $X_i$'s, so it is a random variable:

$$\hat{\theta} = \theta + \text{error of estimation}$$

Therefore, an accurate estimator is one that results in small estimation errors, so that estimated values will be near the true (unknown) value θ.

A good estimator should have two properties:

1. Unbiasedness (i.e., the average error should be zero)

2. Minimum variance (i.e., the variance of the error should be small)

Unbiased Estimator

A point estimator $\hat{\theta}$ is said to be an unbiased estimator of θ if $E(\hat{\theta}) = \theta$ for every possible value of θ.

If $\hat{\theta}$ is not unbiased, the difference $E(\hat{\theta}) - \theta$ is called the bias of $\hat{\theta}$.

[Figure: the pdfs of two estimators $\hat{\theta}_1$ and $\hat{\theta}_2$ plotted against θ, with the bias of $\hat{\theta}_1$ marked.]

Note: "centered" here means that the expected value, not the median, of the distribution of $\hat{\theta}$ is equal to θ.

Proposition

When X is a binomial rv with parameters n and p, the sample proportion $\hat{p} = X/n$ is an unbiased estimator of p.

In Example 6.1, the sample proportion X/n was used as an estimator of p, where X, the number of sample successes, had a binomial distribution with parameters n and p. Thus

$$E(\hat{p}) = E\left(\frac{X}{n}\right) = \frac{1}{n}E(X) = \frac{1}{n}(np) = p$$

Example 6.4

Suppose that X, the reaction time to a certain stimulus, has a uniform distribution on the interval from 0 to an unknown upper limit θ. It is desired to estimate θ on the basis of a random sample X1, X2, …, Xn of reaction times. Since θ is the largest possible time in the entire population of reaction times, consider as a first estimator the largest sample reaction time:

$$\hat{\theta}_1 = \max(X_1, X_2, \ldots, X_n)$$

Since (refer to Ex. 32, p. 279)

$$E(\hat{\theta}_1) = \frac{n}{n+1}\,\theta \ne \theta$$

$\hat{\theta}_1$ is a biased estimator (why?).

Another estimator:

$$\hat{\theta}_2 = \frac{n+1}{n}\max(X_1, X_2, \ldots, X_n)$$

$$E(\hat{\theta}_2) = \frac{n+1}{n}\cdot\frac{n}{n+1}\,\theta = \theta$$

so $\hat{\theta}_2$ is an unbiased estimator.
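The bias of the sample maximum in Example 6.4 can be illustrated by simulation. This is a sketch under assumed values (θ = 10, n = 5, 20000 replications, a fixed seed), none of which come from the lecture; the function name is hypothetical.

```python
import random
import statistics

def simulate_max_estimators(theta, n, reps, seed=0):
    """Average max(X_i) and (n+1)/n * max(X_i) over `reps` uniform(0, theta) samples."""
    rng = random.Random(seed)
    est1, est2 = [], []
    for _ in range(reps):
        sample_max = max(rng.uniform(0, theta) for _ in range(n))
        est1.append(sample_max)                # first estimator: the sample maximum
        est2.append((n + 1) / n * sample_max)  # second estimator: rescaled maximum
    return statistics.mean(est1), statistics.mean(est2)

avg1, avg2 = simulate_max_estimators(theta=10.0, n=5, reps=20000)
print(avg1, avg2)
```

With these settings the first average should land near nθ/(n+1) = 8.33, below θ, while the second should land near θ = 10, consistent with the expectations derived in the example.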

Proposition

Let X1, X2, …, Xn be a random sample from a distribution with mean μ and variance σ². Then the estimator

$$\hat{\sigma}^2 = S^2 = \frac{\sum (X_i - \bar{X})^2}{n-1}$$

is an unbiased estimator of σ², namely

$$E(S^2) = \sigma^2$$

Refer to p. 259 for the proof.

However, dividing by n instead gives a biased estimator:

$$E\left[\frac{\sum (X_i - \bar{X})^2}{n}\right] = \frac{n-1}{n}E(S^2) = \frac{n-1}{n}\sigma^2 \ne \sigma^2$$
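The effect of dividing by n − 1 versus n can be seen in a small simulation. The sketch below assumes normal data with σ = 2 and n = 8 (illustrative values only) and averages both variance estimators over many samples.

```python
import random

def average_variance_estimates(sigma, n, reps, seed=1):
    """Average the (n - 1)-divisor and n-divisor variance estimators over many samples."""
    rng = random.Random(seed)
    total_unbiased = 0.0
    total_divided_by_n = 0.0
    for _ in range(reps):
        sample = [rng.gauss(0.0, sigma) for _ in range(n)]
        x_bar = sum(sample) / n
        ss = sum((x - x_bar) ** 2 for x in sample)  # sum of squared deviations
        total_unbiased += ss / (n - 1)              # S^2
        total_divided_by_n += ss / n                # divisor n
    return total_unbiased / reps, total_divided_by_n / reps

avg_s2, avg_n = average_variance_estimates(sigma=2.0, n=8, reps=20000)
print(avg_s2, avg_n)
```

Here σ² = 4, so the first average should be close to 4 and the second close to (n − 1)/n · σ² = 3.5, matching the proposition.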

Proposition

If X1, X2, …, Xn is a random sample from a distribution with mean μ, then $\bar{X}$ is an unbiased estimator of μ. If in addition the distribution is continuous and symmetric, then $\tilde{X}$ and any trimmed mean are also unbiased estimators of μ.

Refer to the estimators in Example 6.2.

Estimators with Minimum Variance

[Figure: the pdf of $\hat{\theta}_1$, an unbiased estimator, and the pdf of $\hat{\theta}_2$, another unbiased estimator, both plotted against θ.]

Both estimators are unbiased, but the estimator $\hat{\theta}_1$ is better than $\hat{\theta}_2$ in this example because it has the smaller variance.

Example 6.5 (Ex. 6.4 Cont'd)

When X1, X2, …, Xn is a random sample from a uniform distribution on [0, θ], the estimator

$$\hat{\theta} = \frac{n+1}{n}\max(X_1, X_2, \ldots, X_n)$$

is unbiased for θ.

It can also be shown that this $\hat{\theta}$ is the MVUE of θ.

Theorem

Let X1, X2, …, Xn be a random sample from a normal distribution with parameters μ and σ. Then the estimator $\hat{\mu} = \bar{X}$ is the MVUE for μ.

What about non-normal distributions?

Estimator Selection

When choosing among several different estimators of θ, select one that is unbiased.

Among all estimators of θ that are unbiased, choose the one that has minimum variance. The resulting $\hat{\theta}$ is called the minimum variance unbiased estimator (MVUE) of θ.

[Figure: the pdf of $\hat{\theta}_1$, a biased estimator, and the pdf of $\hat{\theta}_2$, the MVUE, both plotted against θ.]

In some cases, a biased estimator is preferable to the MVUE.

Example 6.6

Suppose we wish to estimate the thermal conductivity μ of a certain material. We will obtain a random sample X1, X2, …, Xn of n thermal conductivity measurements. Let's assume that the population distribution is a member of one of the following three families:

Gaussian distribution:

$$f(x) = \frac{1}{\sqrt{2\pi\sigma^2}}\, e^{-(x-\mu)^2/(2\sigma^2)}, \quad -\infty < x < \infty$$

Cauchy distribution:

$$f(x) = \frac{1}{\pi[1 + (x-\mu)^2]}, \quad -\infty < x < \infty$$

Uniform distribution:

$$f(x) = \begin{cases} \dfrac{1}{2c} & -c \le x - \mu \le c \\ 0 & \text{otherwise} \end{cases}$$

1. If the random sample comes from a normal distribution, then $\bar{X}$ is the best of the four estimators, since it is the MVUE.

2. If the random sample comes from a Cauchy distribution, then $\bar{X}$ and $\bar{X}_e$ (the average of the two extreme observations) are terrible estimators for μ, whereas $\tilde{X}$ is quite good; $\bar{X}$ is bad because it is very sensitive to outlying observations, and the heavy tails of the Cauchy distribution make a few such observations likely to appear in any sample.

3. If the underlying distribution is uniform, the best estimator is $\bar{X}_e$; this estimator is greatly influenced by outlying observations, but the lack of tails makes such observations impossible.

4. The trimmed mean is best in none of these three situations, but works reasonably well in all three. That is, $\bar{X}_{tr(10)}$ does not suffer too much in any of the three situations: it is a robust estimator.
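The comparison in Example 6.6 can be explored by simulation. The sketch below is an illustration under assumed settings (μ = 0, σ = 1 for the normal family, c = 1 for the uniform family, n = 20, 2000 replications, a fixed seed); it estimates the mean squared error of the sample mean, the median, the average of the two extremes, and the 10% trimmed mean for each family. All function and variable names are hypothetical.

```python
import math
import random
import statistics

def estimators(sample):
    """Return the four estimates of the center of the distribution."""
    ordered = sorted(sample)
    k = int(0.10 * len(ordered))  # points trimmed from each end
    return {
        "mean": statistics.fmean(ordered),
        "median": statistics.median(ordered),
        "midrange": (ordered[0] + ordered[-1]) / 2,
        "trimmed10": statistics.fmean(ordered[k:len(ordered) - k]),
    }

def mean_squared_errors(draw, mu, n=20, reps=2000, seed=2):
    """Estimate the MSE of each estimator when single observations come from `draw`."""
    rng = random.Random(seed)
    totals = {}
    for _ in range(reps):
        sample = [draw(rng) for _ in range(n)]
        for name, value in estimators(sample).items():
            totals[name] = totals.get(name, 0.0) + (value - mu) ** 2
    return {name: total / reps for name, total in totals.items()}

mu = 0.0
families = {
    "normal": lambda rng: rng.gauss(mu, 1.0),
    # Cauchy draw by inverting the cdf: mu + tan(pi * (U - 1/2))
    "cauchy": lambda rng: mu + math.tan(math.pi * (rng.random() - 0.5)),
    "uniform": lambda rng: rng.uniform(mu - 1.0, mu + 1.0),
}
results = {name: mean_squared_errors(draw, mu) for name, draw in families.items()}
for name, mse in results.items():
    print(name, {k: round(v, 4) for k, v in mse.items()})
```

In runs of this kind one would expect the mean to do best for normal data, the median to beat the mean by a wide margin for Cauchy data, the average of the extremes to do best for uniform data, and the trimmed mean to avoid the worst case in all three families.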

The Standard Error

The standard error of an estimator $\hat{\theta}$ is its standard deviation $\sigma_{\hat{\theta}} = \sqrt{V(\hat{\theta})}$.

If the standard error itself involves unknown parameters whose values can be estimated, substitution of these estimates into $\sigma_{\hat{\theta}}$ yields the estimated standard error (estimated standard deviation) of the estimator. The estimated standard error can be denoted either by $\hat{\sigma}_{\hat{\theta}}$ or by $s_{\hat{\theta}}$.

Example 6.8

Assuming that breakdown voltage is normally distributed, $\hat{\mu} = \bar{X}$ is the best estimator of μ. If the value of σ is known to be 1.5, the standard error of $\bar{X}$ is

$$\sigma_{\bar{X}} = \sigma/\sqrt{n} = 1.5/\sqrt{20} = 0.335$$

If, as is usually the case, the value of σ is unknown, the estimate $\hat{\sigma} = s = 1.462$ is substituted into $\sigma_{\bar{X}}$ to obtain the estimated standard error

$$\hat{\sigma}_{\bar{X}} = s_{\bar{X}} = s/\sqrt{n} = 1.462/\sqrt{20} = 0.327$$
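The two standard errors in Example 6.8 can be recomputed from the breakdown-voltage data of Example 6.2. This is a minimal sketch with the standard library; the variable names are illustrative.

```python
import math
import statistics

# Breakdown-voltage observations from Example 6.2.
voltages = [24.46, 25.61, 26.25, 26.42, 26.66, 27.15, 27.31, 27.54, 27.74, 27.94,
            27.98, 28.04, 28.28, 28.49, 28.50, 28.87, 29.11, 29.13, 29.50, 30.88]

n = len(voltages)
se_known = 1.5 / math.sqrt(n)    # standard error when sigma is known to be 1.5
s = statistics.stdev(voltages)   # sample standard deviation (divisor n - 1)
se_estimated = s / math.sqrt(n)  # estimated standard error when sigma is unknown

print(round(se_known, 3), round(s, 3), round(se_estimated, 3))
```

The rounded results should match the values 0.335, 1.462, and 0.327 quoted in the example.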


Homework

Ex. 1, Ex. 8, Ex. 9, Ex. 13

6.2 Methods of Point Estimation

Two "constructive" methods for obtaining point estimators:

Method of Moments

Maximum Likelihood Estimation

Moments

Let X1, X2, …, Xn be a random sample from a pmf or pdf f(x). For k = 1, 2, 3, …, the kth population moment, or kth moment of the distribution f(x), is $E(X^k)$. The kth sample moment is $\frac{1}{n}\sum_{i=1}^{n} X_i^k$.

Moment Estimator

Let X1, X2, …, Xn be a random sample from a distribution with pmf or pdf f(x; θ1, …, θm), where θ1, …, θm are parameters whose values are unknown. Then the moment estimators $\hat{\theta}_1, \ldots, \hat{\theta}_m$ are obtained by equating the first m sample moments to the corresponding first m population moments and solving for θ1, …, θm.

When n is large, the kth sample moment (computed from the given sample $x_i$) approximates the kth population moment (which involves the unknown $\theta_i$):

$$\frac{1}{n}\sum_{i=1}^{n} X_i^k \approx E(X^k)$$

General Algorithm:

1. Write the first m population moments as functions of the parameters:

$$\mu_l = \mu_l(\theta_1, \theta_2, \ldots, \theta_m), \quad l = 1, 2, \ldots, m$$

2. Solve these equations for the parameters:

$$\theta_l = \theta_l(\mu_1, \mu_2, \ldots, \mu_m), \quad l = 1, 2, \ldots, m$$

3. Use the first m sample moments

$$A_l = \frac{1}{n}\sum_{i=1}^{n} X_i^l, \quad l = 1, 2, \ldots, m$$

to represent the population moments $\mu_l$, which gives the moment estimators

$$\hat{\theta}_i = \theta_i(A_1, A_2, \ldots, A_m), \quad i = 1, \ldots, m$$

Example 6.11

Let X1, X2, …, Xn represent a random sample of service times of n customers at a certain facility, where the underlying distribution is assumed exponential with parameter λ. How do we estimate λ by the method of moments?

Step #1: The 1st population moment is E(X) = 1/λ, so λ = 1/E(X).

Step #2: Use the 1st sample moment $\bar{X}$ to represent the 1st population moment E(X), which gives the estimator

$$\hat{\lambda} = 1/\bar{X}$$
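The two steps of Example 6.11 amount to a one-line computation. In the sketch below the service times are hypothetical values invented for illustration, and the function name is an assumption as well.

```python
def exponential_moment_estimate(sample):
    """Method-of-moments estimate of lambda: the reciprocal of the sample mean."""
    x_bar = sum(sample) / len(sample)  # 1st sample moment
    return 1.0 / x_bar

# Hypothetical service times (not data from the lecture).
service_times = [2.1, 0.7, 3.4, 1.2, 5.6, 0.9, 2.8, 1.5]
lambda_hat = exponential_moment_estimate(service_times)
print(lambda_hat)
```

For these made-up times the sample mean is 2.275, so the estimate is 1/2.275.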

Example 6.12

Let X1, …, Xn be a random sample from a gamma distribution with parameters α and β. Its pdf is

$$f(x; \alpha, \beta) = \begin{cases} \dfrac{1}{\beta^{\alpha}\Gamma(\alpha)}\, x^{\alpha-1} e^{-x/\beta} & x \ge 0 \\ 0 & \text{otherwise} \end{cases}$$

There are two parameters to be estimated, so consider the first two moments.

Step #1: Express the parameters in terms of the population moments.

$$E(X) = \alpha\beta, \qquad E(X^2) = V(X) + [E(X)]^2 = \alpha\beta^2 + \alpha^2\beta^2 = \alpha\beta^2(1+\alpha)$$

$$\alpha = \frac{[E(X)]^2}{E(X^2) - [E(X)]^2}, \qquad \beta = \frac{E(X^2) - [E(X)]^2}{E(X)}$$

Step #2: Replace the population moments by the sample moments, $\bar{X}$ for $E(X)$ and $\frac{1}{n}\sum X_i^2$ for $E(X^2)$:

$$\hat{\alpha} = \frac{\bar{X}^2}{\frac{1}{n}\sum X_i^2 - \bar{X}^2}, \qquad \hat{\beta} = \frac{\frac{1}{n}\sum X_i^2 - \bar{X}^2}{\bar{X}}$$
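The gamma moment estimators can be tried on simulated data. The sketch below assumes true values α = 2 and β = 3 (arbitrary choices for illustration) and uses `random.gammavariate`, whose second argument is a scale parameter, matching the role of β in the pdf above.

```python
import random

def gamma_moment_estimates(sample):
    """Method-of-moments estimates (alpha_hat, beta_hat) for a gamma sample."""
    n = len(sample)
    m1 = sum(sample) / n                 # 1st sample moment
    m2 = sum(x * x for x in sample) / n  # 2nd sample moment
    spread = m2 - m1 ** 2                # plays the role of E(X^2) - [E(X)]^2
    return m1 ** 2 / spread, spread / m1

rng = random.Random(3)
sample = [rng.gammavariate(2.0, 3.0) for _ in range(5000)]  # assumed alpha=2, beta=3
alpha_hat, beta_hat = gamma_moment_estimates(sample)
print(alpha_hat, beta_hat)
```

By construction the product of the two estimates equals the sample mean, mirroring E(X) = αβ; with 5000 observations both estimates should fall reasonably close to the assumed true values.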

Example 6.13

Let X1, …, Xn be a random sample from a generalized negative binomial distribution with parameters r and p. Its pmf is

$$nb(x; r, p) = \binom{x+r-1}{r-1} p^r (1-p)^x, \quad x = 0, 1, 2, \ldots$$

Determine the moment estimators of the parameters r and p.

Note: There are two parameters to be estimated, so the first two moments are considered.

Step #1: Express the parameters in terms of the population moments.

$$E(X) = \frac{r(1-p)}{p}, \qquad E(X^2) = \frac{r(1-p)(r - rp + 1)}{p^2}$$

$$p = \frac{E(X)}{E(X^2) - [E(X)]^2}, \qquad r = \frac{[E(X)]^2}{E(X^2) - [E(X)]^2 - E(X)}$$

Step #2: Replace the population moments by the sample moments, $\bar{X}$ for $E(X)$ and $\frac{1}{n}\sum X_i^2$ for $E(X^2)$:

$$\hat{p} = \frac{\bar{X}}{\frac{1}{n}\sum X_i^2 - \bar{X}^2}, \qquad \hat{r} = \frac{\bar{X}^2}{\frac{1}{n}\sum X_i^2 - \bar{X}^2 - \bar{X}}$$
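The negative binomial moment estimators can be computed the same way. In this sketch the counts are hypothetical, and the guard against a non-positive denominator is an added assumption: the formulas only give sensible values when the n-divisor sample variance exceeds the sample mean.

```python
def negative_binomial_moment_estimates(sample):
    """Method-of-moments estimates (r_hat, p_hat) for a negative binomial sample."""
    n = len(sample)
    m1 = sum(sample) / n                 # 1st sample moment
    m2 = sum(x * x for x in sample) / n  # 2nd sample moment
    spread = m2 - m1 ** 2                # plays the role of E(X^2) - [E(X)]^2
    if spread <= m1:
        raise ValueError("estimates require sample variance greater than sample mean")
    p_hat = m1 / spread
    r_hat = m1 ** 2 / (spread - m1)
    return r_hat, p_hat

# Hypothetical counts (not data from the lecture).
counts = [0, 1, 2, 3, 5, 8, 0, 4, 10, 7]
r_hat, p_hat = negative_binomial_moment_estimates(counts)
print(r_hat, p_hat)
```

As a sanity check, plugging the estimates back into r(1 − p)/p reproduces the sample mean of the counts.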

Maximum Likelihood Estimation (Basic Idea)

[Figure: two boxes of balls, Box 1 and Box 2.]

Experiment: We first choose a box at random, and then choose a ball at random from that box.

Q: If we get a white ball, which box is most likely to have been chosen?

$$P(W \mid \text{Box 1}) = 1/15, \qquad P(W \mid \text{Box 2}) = 14/15$$

Maximum Likelihood Estimation (Basic Idea)

Q: What is the probability p of hitting the target?

$$f(p) = p^3(1-p)^{5-3} = p^3(1-p)^2$$

$$f(0.2) = 0.0051, \quad f(0.4) = 0.0230, \quad f(0.6) = 0.0346, \quad f(0.8) = 0.0205, \ \ldots$$

Among these four options, p = 0.6 is the best one.
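The four likelihood values can be reproduced directly from f(p) = p³(1 − p)². The sketch below reads the exponents as 3 hits in 5 shots, which is an interpretation of the formula rather than something stated on the slide; the names are illustrative.

```python
def likelihood(p, hits=3, shots=5):
    """Likelihood p^hits * (1 - p)^(shots - hits) of the observed outcome."""
    return p ** hits * (1 - p) ** (shots - hits)

candidates = [0.2, 0.4, 0.6, 0.8]
values = {p: likelihood(p) for p in candidates}
best = max(candidates, key=likelihood)  # candidate with the largest likelihood

for p in candidates:
    print(p, round(values[p], 4))
print("best candidate:", best)
```

Of the four candidates, p = 0.6 gives the largest likelihood, in line with the values listed above.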

Example 6.14

A sample of ten new bike helmets manufactured by a certain company is obtained. Upon testing, it is found that the first, third, and tenth helmets are flawed, whereas the others are not. Let p = P(flawed helmet) and define X1, …, X10 by Xi = 1 if the ith helmet is flawed and zero otherwise. Then the observed xi's are 1, 0, 1, 0, 0, 0, 0, 0, 0, 1.

The joint pmf of the sample is

$$f(x_1, x_2, \ldots, x_{10}; p) = p(1-p)p \cdots p = p^3(1-p)^7$$

For what value of p is the observed sample most likely to have occurred? Or, equivalently, what value of the parameter p should be taken so that the joint pmf of the sample is maximized?

Example 6.14 (Cont'd)

$$\ln[f(x_1, x_2, \ldots, x_{10}; p)] = 3\ln(p) + 7\ln(1-p)$$

Equating the derivative of the logarithm of the pmf to zero gives the maximizing value (why?):

$$\frac{d}{dp}\ln[f(x_1, x_2, \ldots, x_{10}; p)] = \frac{3}{p} - \frac{7}{1-p} = 0 \ \Rightarrow\ p = \frac{3}{10} = \frac{x}{n}$$

where x is the observed number of successes (flawed helmets). The estimate of p is now $\hat{p} = 3/10$. It is called the maximum likelihood estimate because, for fixed x1, …, x10, it is the parameter value that maximizes the likelihood of the observed sample.
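The maximization in Example 6.14 can be checked numerically by evaluating the log-likelihood on a grid of p values and comparing with the closed-form answer x/n. The grid spacing of 0.001 is an arbitrary choice for this sketch.

```python
import math

# Observed flaw indicators from Example 6.14 (1 = flawed helmet).
observed = [1, 0, 1, 0, 0, 0, 0, 0, 0, 1]

def log_likelihood(p, data):
    """ln of the joint pmf p^x * (1 - p)^(n - x) for Bernoulli observations."""
    successes = sum(data)
    failures = len(data) - successes
    return successes * math.log(p) + failures * math.log(1 - p)

grid = [i / 1000 for i in range(1, 1000)]  # candidate values 0.001, ..., 0.999
p_grid = max(grid, key=lambda p: log_likelihood(p, observed))
p_closed = sum(observed) / len(observed)   # closed-form mle x / n

print(p_grid, p_closed)
```

Both approaches give 0.3, the value obtained above by setting the derivative to zero.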

Maximum Likelihood Estimation

Let X1, X2, …, Xn have joint pmf or pdf

$$f(x_1, x_2, \ldots, x_n; \theta_1, \ldots, \theta_m)$$

where the parameters θ1, …, θm have unknown values. When x1, …, xn are the observed sample values and f is regarded as a function of θ1, …, θm, it is called the likelihood function.

The maximum likelihood estimates (mle's) $\hat{\theta}_1, \ldots, \hat{\theta}_m$ are those values of the θi's that maximize the likelihood function, so that

$$f(x_1, \ldots, x_n; \hat{\theta}_1, \ldots, \hat{\theta}_m) \ge f(x_1, \ldots, x_n; \theta_1, \ldots, \theta_m) \quad \text{for all } \theta_1, \ldots, \theta_m$$

When the Xi's are substituted in place of the xi's, the maximum likelihood estimators result.

Example 6.15

Suppose X1, X2, …, Xn is a random sample from an exponential distribution with the unknown parameter λ. Determine the maximum likelihood estimator of λ.

By independence, the joint pdf is

$$f(x_1, \ldots, x_n; \lambda) = (\lambda e^{-\lambda x_1}) \cdots (\lambda e^{-\lambda x_n}) = \lambda^n e^{-\lambda \sum x_i}$$

The ln(likelihood) is

$$\ln[f(x_1, \ldots, x_n; \lambda)] = n\ln(\lambda) - \lambda \sum x_i$$

Equating to zero the derivative w.r.t. λ:

$$\frac{d}{d\lambda}\ln[f(x_1, \ldots, x_n; \lambda)] = \frac{n}{\lambda} - \sum x_i = 0 \ \Rightarrow\ \lambda = \frac{n}{\sum x_i} = \frac{1}{\bar{x}}$$

The estimator is $\hat{\lambda} = 1/\bar{X}$.

Example 6.16

Let X1, X2, …, Xn be a random sample from a normal distribution N(μ, σ²). Determine the maximum likelihood estimators of μ and σ².

The joint pdf is

$$f(x_1, \ldots, x_n; \mu, \sigma^2) = \frac{1}{\sqrt{2\pi\sigma^2}} e^{-(x_1-\mu)^2/(2\sigma^2)} \cdots \frac{1}{\sqrt{2\pi\sigma^2}} e^{-(x_n-\mu)^2/(2\sigma^2)} = \left(\frac{1}{2\pi\sigma^2}\right)^{n/2} e^{-\sum (x_i-\mu)^2/(2\sigma^2)}$$

$$\ln[f(x_1, \ldots, x_n; \mu, \sigma^2)] = -\frac{n}{2}\ln(2\pi\sigma^2) - \frac{1}{2\sigma^2}\sum (x_i - \mu)^2$$

Equating to 0 the partial derivatives w.r.t. μ and σ², we finally have

$$\hat{\mu} = \bar{X}, \qquad \hat{\sigma}^2 = \frac{\sum (X_i - \bar{X})^2}{n}$$

Here the mle of σ² is not the unbiased estimator.
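The normal mle's can be computed for the breakdown-voltage data of Example 6.2, where normality was assumed. This is a minimal sketch; the function name `normal_mle` is hypothetical. It also compares the mle of σ² (divisor n) with the unbiased sample variance (divisor n − 1).

```python
import statistics

# Breakdown-voltage observations from Example 6.2.
voltages = [24.46, 25.61, 26.25, 26.42, 26.66, 27.15, 27.31, 27.54, 27.74, 27.94,
            27.98, 28.04, 28.28, 28.49, 28.50, 28.87, 29.11, 29.13, 29.50, 30.88]

def normal_mle(sample):
    """Return the mle's (mu_hat, sigma2_hat) for a normal random sample."""
    mu_hat = statistics.fmean(sample)
    sigma2_hat = sum((x - mu_hat) ** 2 for x in sample) / len(sample)  # divisor n
    return mu_hat, sigma2_hat

mu_hat, sigma2_hat = normal_mle(voltages)
s2 = statistics.variance(voltages)  # unbiased estimator, divisor n - 1

print(mu_hat, sigma2_hat, s2)
```

The mle of σ² is smaller than S² by the factor (n − 1)/n, which is why it is not unbiased.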

Three steps

1. Write the joint pmf/pdf (i.e., the likelihood function):

$$f(x_1, x_2, \ldots, x_n; \theta_1, \ldots, \theta_m) = \prod_{i=1}^{n} f(x_i; \theta_1, \ldots, \theta_m)$$

2. Take the ln(likelihood) (if necessary):

$$\ln[f(x_1, x_2, \ldots, x_n; \theta_1, \ldots, \theta_m)] = \sum_{i=1}^{n} \ln(f(x_i; \theta_1, \ldots, \theta_m))$$

3. Take the partial derivatives of ln(f) with respect to each θi, set them equal to 0, and solve the resulting m equations:

$$\frac{\partial}{\partial \theta_i} \ln[f(x_1, x_2, \ldots, x_n; \theta_1, \ldots, \theta_m)] = 0, \quad i = 1, \ldots, m$$

Estimating Functions of Parameters

The Invariance Principle

Let $\hat{\theta}_1, \hat{\theta}_2, \ldots, \hat{\theta}_m$ be the mle's of the parameters θ1, …, θm. Then the mle of any function h(θ1, …, θm) of these parameters is the function $h(\hat{\theta}_1, \hat{\theta}_2, \ldots, \hat{\theta}_m)$ of the mle's.

Example 6.19 (Ex. 6.16 Cont'd)

In the normal case, the mle's of μ and σ² are $\hat{\mu} = \bar{X}$ and $\hat{\sigma}^2 = \sum (X_i - \bar{X})^2 / n$.

To obtain the mle of the function

$$h(\mu, \sigma^2) = \sqrt{\sigma^2} = \sigma$$

substitute the mle's into the function:

$$\hat{\sigma} = \sqrt{\hat{\sigma}^2} = \left[\frac{1}{n}\sum (X_i - \bar{X})^2\right]^{1/2}$$

Large Sample Behavior of the MLE

Under very general conditions on the joint distribution of the sample, when the sample size n is large, the maximum likelihood estimator $\hat{\theta}$ of any parameter θ is approximately unbiased [$E(\hat{\theta}) \approx \theta$] and has variance that is nearly as small as can be achieved by any estimator. Stated another way, the mle is approximately the MVUE of θ.

Maximum likelihood estimators are generally preferable to moment estimators because of the above efficiency properties.

However, the mle's often require significantly more computation than do moment estimators. Also, they require that the underlying distribution be specified.

Example 6.21

Suppose my waiting time for a bus is uniformly distributed on [0, θ] and the results x1, …, xn of a random sample from this distribution have been observed. Since f(x; θ) = 1/θ for 0 ≤ x ≤ θ and 0 otherwise,

$$f(x_1, \ldots, x_n; \theta) = \begin{cases} \dfrac{1}{\theta^n} & 0 \le x_1 \le \theta, \ldots, 0 \le x_n \le \theta \\ 0 & \text{otherwise} \end{cases}$$

As long as max(xi) ≤ θ, the likelihood is 1/θⁿ, which is positive, but as soon as θ < max(xi), the likelihood drops to 0. Equivalently,

$$f(x_1, \ldots, x_n; \theta) = \begin{cases} \dfrac{1}{\theta^n} & 0 \le \max(x_i) \le \theta \\ 0 & \text{otherwise} \end{cases}$$

Calculus will not work because the maximum of the likelihood occurs at a point of discontinuity.

Example 6.21 (Cont'd)

[Figure: the likelihood plotted against θ; it is 0 for θ < max(xi), jumps to its maximum at θ = max(xi), and decreases for larger θ.]

The figure shows that $\hat{\theta} = \max(X_i)$. Thus, if my waiting times are 2.3, 3.7, 1.5, 0.4, and 3.2, then the mle is $\hat{\theta} = 3.7$.
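The shape of the likelihood in Example 6.21 can be confirmed by evaluating it on either side of max(xi). This is a minimal sketch; the function name `uniform_likelihood` is illustrative.

```python
# Waiting times from Example 6.21.
waiting_times = [2.3, 3.7, 1.5, 0.4, 3.2]

def uniform_likelihood(theta, data):
    """Likelihood of a uniform(0, theta) sample: theta^(-n) if all data fit, else 0."""
    if theta <= 0 or min(data) < 0 or max(data) > theta:
        return 0.0
    return theta ** (-len(data))

theta_hat = max(waiting_times)  # the mle is the largest observation

for theta in (3.6, 3.7, 4.0, 5.0):
    print(theta, uniform_likelihood(theta, waiting_times))
```

The likelihood is 0 for θ below 3.7 and decreases as θ grows beyond 3.7, so the maximum sits exactly at the largest observation.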


Homework

Ex. 20, Ex. 21, Ex. 29, Ex. 32
