Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a...

29
Estimation of α Lower bounds for α Estimating of Fs and its density fs Estimation of a Two-component Mixture Model Bodhisattva Sen 1, 2 University of Cambridge, Cambridge, UK Columbia University, New York, USA Indian Statistical Institute, Kolkata, India 6 August, 2012 1 Joint work with Rohit Patra, Columbia University, USA 2 Supported by NSF grants DMS-0906597, AST-1107373, DMS-1150435 (CAREER)

Transcript of Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a...

Page 1: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Estimation of a Two-component Mixture Model

Bodhisattva Sen1,2

University of Cambridge, Cambridge, UKColumbia University, New York, USA

Indian Statistical Institute, Kolkata, India

6 August, 2012

1Joint work with Rohit Patra, Columbia University, USA

2Supported by NSF grants DMS-0906597, AST-1107373, DMS-1150435 (CAREER)

Page 2: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Mixture model with two components

F (x) = αFs(x) + (1− α)Fb(x) (1)

Fb is a known distribution function (DF).

Unknowns: mixing proportion α ∈ (0, 1) and DF Fs (6= Fb).

Problem: Given a random sample from X1,X2, . . . ,Xnind.∼ F , we

wish to (nonparametrically) estimate Fs and the parameter α.

Previous work

Most of the previous work on this problem assume some constraint(parametric models) on the form of Fs ; see e.g., Lindsay (AoS, 1983),Lindsay and Basak (JASA, 1993).

Bordes et al. (AoS, 2006) assume that both the components belong to anunknown symmetric location-shift family.

In the multiple testing setup, this problem has been addressed mostly toestimate α under suitable assumptions; see e.g., Storey (JRSSB, 2002),Genevese & Wasserman (AoS, 2004), Meinshausen & Rice (AoS, 2006).

Page 3: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Applications

In multiple testing problems, e.g., microarray analysis, neuroimaging(fMRI).

Any situation where numerous independent hypotheses tests areperformed.

The p-values are uniformly distributed on [0,1], under H0, while theirdistribution associated with H1 is unknown; see e.g., Efron (2010).

Estimate the proportion of false null hypotheses α; also needed tocontrol multiple error rates, such as the FDR of Benjamini &Hochberg (JRSSB, 1995).

In contamination problems – application in astronomy.

Page 4: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

An application in astronomy

We analyse the radial velocity (RV; line of sight velocity)distribution of stars in Carina, a dwarf spheroidal (dSph) galaxy.

The data have been obtained by Magellan and MMT telescopes andconsist of radial velocity measurements for n = 1215 stars fromCarina, contaminated with Milky Way stars in the field of view.

We would like to understand the distribution of the line of sightvelocity of stars in Carina.

For the contaminating stars from the Milky Way in the field of view,we assume a non-Gaussian velocity distribution Fb that is knownfrom the Besancon Milky Way model (Robin et. al, 2003),calculated along the line of sight to Carina.

Here α is the proportion of stars from Carina.

Page 5: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Outline

1 Estimation of α

2 Lower bounds for α

3 Estimating of Fs and its density fs

Page 6: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

When α is known

We observe an i.i.d. sample from F = αFs + (1− α)Fb.

A naive estimator of Fs would be

Fαs,n =Fn − (1− α)Fb

α,

where Fn is the empirical DF of the observed sample.

Fαs,n is not a valid DF: need not be non-decreasing or lie in [0, 1].

Find the closest DF: impose the known shape constraint ofmonotonicity, accomplished by minimizing∫

{W (x)− Fαs,n(x)}2 dFn(x) ≡ 1

n

n∑i=1

{W (Xi )− Fαs,n(Xi )}2

over all DFs W .

Page 7: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Isotonized estimator

Let

Fαs,n = arg minW DF

1

n

n∑i=1

{W (Xi )− Fαs,n(Xi )}2.

Fαs,n is the isotonized estimator; uniquely defined at the data pointsXi , for all i = 1, . . . , n.

0.0 0.2 0.4 0.6 0.8 1.0

0.00.2

0.40.6

0.81.0

1.21.4

CDF Plot

x

CDF

Naive CDF EstimateCDF EstimateTrue CDF

0.0 0.2 0.4 0.6 0.8 1.0

0.00.5

1.01.5

CDF Plot

x

CDF

Naive CDF EstimateCDF EstimateTrue CDF

Plot of F αns,n, F αn

s,n and Fs for n = 300 and 500, when α = 0.3.

Page 8: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Computation

Recall

Fαs,n = arg minW↑ DF

1

n

n∑i=1

{Fαs,n(X(i))−W (X(i))}2.

The above optimization problem is the same as minimizing

‖V − θ‖2 over θ = (θ1, . . . , θn) ∈ Θinc

where Θinc = {θ ∈ Rn : 0 ≤ θ1 ≤ θ2 ≤ . . . ≤ θn ≤ 1},V = (V1,V2, . . . ,Vn), Vi := Fαs,n(X(i)), i = 1, 2, . . . , n.

The estimator θ is uniquely defined by the projection theorem; it isthe L2 projection of V on a closed convex cone in Rn.

Can easily be computed using the pool-adjacent-violators algorithm(PAVA); see Section 1.2 of Robertson et al. (1988).

Page 9: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Identifiability

When α is unknown, the problem is non-identifiable.

If F = αFs + (1− α)Fb for some Fb (known) and α (unknown),then the mixture model can be re-written as

F = (α + γ)

α + γFs +

γ

α + γFb

)+ (1− α− γ)Fb,

for 0 ≤ γ ≤ 1− α, and the term (αFs + γFb)/(α + γ) can bethought of as the nonparametric component.

A trivial solution occurs when we take α + γ = 1, in which caseF 1

s,n = F 1s,n = Fn.

Hence, α is not uniquely defined.

Page 10: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Identifiability

We redefine the mixing proportion as

α0 := inf

{γ ∈ (0, 1) :

F − (1− γ)Fb

γis a valid DF

}.

Intuitively, this definition makes sure that the “signal” distribution Fs

does not include any contribution from the known “background” Fb.

We consider the estimation of α0 as defined above.

Lemma

Suppose that Fs and Fb are absolutely continuous, i.e., they havedensities fs and fb, respectively. Then α0 < α iff there exists c > 0 suchthat fs(x) ≥ cfb(x), for all x ∈ R.

If the support of Fs is strictly contained in that of Fb, then theproblem is identifiable.

Page 11: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Estimation of α0

When γ = 1, F γs,n = Fn = F γs,n.

When γ is much smaller than α0, the regularization of F γs,n modifies

it, and thus F γs,n and F γs,n are quite different.

Measure distance by dn – the L2(Fn) distance, i.e., if g , h : R→ Rare two functions, then dn(g , h) =

√∫{g(x)− h(x)}2 dFn(x).

0 0.05 0.1 0.15 0.2 0.250

0.005

0.01

0.015

0.02

0.025

0.03

0.035

0.04

0.045

0.05

γ

0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.20

0.005

0.01

0.015

0.02

0.025

0.03

0.035

0.04

0.045

γ

Plot of γ dn(F γs,n, Fγs,n) (in solid blue) when α0 = 0.1 and n = 5000.

Page 12: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

We will study γ dn(F γs,n, Fγs,n) = dn(Fn, γF γs,n + (1− γ)Fb).

Lemma

γ dn(F γs,n, Fγs,n) is a non-increasing convex function of γ in (0, 1).

Lemma

For 1 ≥ γ ≥ α0, γ dn(F γs,n, Fγs,n) ≤ dn(F ,Fn). Thus,

γ dn(F γs,n, Fγs,n)

a.s.→{

0, γ ≥ α0,> 0, γ < α0.

Note, γ dn(F γs,n, Fγs,n) ≤ γ dn

(F γs,n,

F−(1−γ)Fb

γ

)from which

γ dn

(Fn − (1− γ)Fb

γ,

F − (1− γ)Fb

γ

)= dn(F ,Fn).

Page 13: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

0 0.05 0.1 0.15 0.2 0.250

0.005

0.01

0.015

0.02

0.025

0.03

0.035

0.04

0.045

0.05

γ

0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.20

0.005

0.01

0.015

0.02

0.025

0.03

0.035

0.04

0.045

γ

Estimation of α0

We would like to compare F γs,n and F γs,n, and choose the smallest γfor which their distance is still small.

Let

αn = inf

{γ ∈ (0 , 1 ] : γ dn(F γs,n, F

γs,n) ≤ cn√

n

}. (2)

Theorem (Consistency of αn)

If cn/√

n→ 0 and cn →∞, then αnP→ α0.

Page 14: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Lower bound for α0

Construct a finite sample (honest) lower confidence bound αn withthe property

P(α0 ≥ αn) ≥ 1 − β, for all n, (3)

for a specified confidence level (1− β), 0 < β < 1.

Would allow one to assert, with a specified level of confidence, thatthe proportion of “signal” is at least αn.

It can also be used to test the hypothesis that there is no “signal” atlevel β by rejecting when αn > 0.

Genovese & Wasserman (AoS, 2004) and Meinshausen & Rice (AoS,2006) usually yield approximate conservative lower bounds.

Page 15: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Theorem

Let Hn be the DF of√

ndn(Fn,F ) :=√

n∫{Fn(x)− F (x)}2 dFn(x). Let

αn be defined as in (2) with cn defined as the (1− β)-quantile of Hn.Then

P(α0 ≥ αn) ≥ 1 − β, for all n.

Proof of Theorem

P(α0 < αn) = P

(α0 dn(Fα0

s,n, Fα0s,n) ≥ cn√

n

)≤ P

(α0 dn(Fα0

s,n,Fα0s ) ≥ cn√

n

)= P

(√n dn(Fn,F ) ≥ cn

)= 1− Hn(cn)

= β,

as α0dn(Fα0s,n,F

α0s ) = α0dn

(Fn−(1−α0)Fb

α0, F−(1−α0)Fb

α0

)= dn(Fn,F ).

Page 16: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Hn is distribution-free and can be easily simulated.

Requires no tuning parameters.

Lower bound holds for all n.

For moderately large n (e.g., n ≥ 500) the distribution Hn can bevery well approximated by that of the (square-root of) Cramer-vonMises statistic, defined as√

nd(Fn,F ) :=√

n∫{Fn(x)− F (x)}2dF (x).

Theorem

Letting Gn to be the DF of√

nd(Fn,F ), we have

supx∈R|Hn(x)− Gn(x)| → 0 as n→∞.

Page 17: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

0 0.05 0.1 0.15 0.2 0.250

0.005

0.01

0.015

0.02

0.025

0.03

0.035

0.04

0.045

0.05

γ

0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.18 0.20

0.005

0.01

0.015

0.02

0.025

0.03

0.035

0.04

0.045

γ

Plot of γdn(F γs,n, Fγs,n) (in solid blue) overlaid with its (scaled)

second derivative (in dashed red) for α0 = 0.1 and n = 5000.

A tuning-parameter-free estimator of α0

We can use the elbow of γ dn(F γs,n, Fγs,n) to estimate α0.

It is the point that has the maximum curvature, i.e., the point wherethe second derivative is maximum.

Page 18: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Estimation of Fs

Assume now that the model (1) is identifiable, i.e., α = α0, and αn

be an estimator of α0 (αn can be αn).

A natural nonparametric estimator of Fs is F αns,n.

0.0 0.1 0.2 0.3 0.4

0.0

0.2

0.4

0.6

0.8

1.0

x

CD

F

0.0 0.1 0.2 0.3 0.4

0.0

0.2

0.4

0.6

0.8

1.0

x

CD

F

Plot of F αns,n (in dotted red), Fs (in dashed black) and F †s,n (in blue).

Theorem

Suppose that αnP→ α0. Then, supx∈R |F αn

s,n(x)− Fs(x)| P→ 0 .

Page 19: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Estimating the density fs

Suppose now that Fs has a non-increasing density fs (w.l.o.g., weassume that fs is non-increasing on [0,∞)).

Natural assumption in many situations, see e.g., Genovese &Wasserman (AoS, 2004).

Define F †s,n := LCM[F αns,n], where for a bounded function

g : [0,∞)→ R, let us represent the least concave majorant (LCM)of g by LCM[g ].

F †s,n is a valid DF with a density f †s,n.

We can now estimate fs by f †s,n = [F †s,n]′.

Page 20: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10

1

2

3

4

5

6

7

8

9

10

x

Den

sit

y

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.50

1

2

3

4

5

6

7

8

9

10

x

Density

Plot of the estimate f †s,n (in solid red) and fs (in solid blue).

Theorem

Assume Fs has non-increasing density fs on [0,∞). If αnP→ α0, then,

supx∈R|F †s,n(x)− Fs(x)| P→ 0.

Further, if for any x > 0, fs(x) is continuous at x , then,

f †s,n(x)P→ fs(x).

Page 21: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Multiple testing

Estimate the proportion of genes that are differentially expressed inDNA microarray experiments.

We wish to test n null hypothesis H01,H02, . . . ,H0n using p-valuesX1,X2, . . . ,Xn.

FWER = Prob (# of false rejections ≥ 1), the probability ofcommitting at least one type I error.

Benjamini and Hochberg (1995) proposed FDR = E{

VR 1(R > 0)

},

where V is the number of false rejections and R is the number oftotal rejections.

The BH method guarantees FDR ≤ βα0; an estimate of α0 can beused to yield a procedure with FDR approximately equal to β andthus will result in an increased power.

Page 22: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Application to multiple testing

Our method can be used to yield an estimator of α0.

We can also obtain a completely nonparametric estimator of Fs , thedistribution of the p-values arising from the alternative hypotheses.

The local false discovery rate (LFDR) is defined as the functionl : (0, 1)→ [0,∞), such where

l(x) = P(Hi = 0|Xi = x) =(1− α0)fb(x)

f (x).

The LFDR method can help detect “interesting” cases (e.g.,l(x) ≤ 0.20); see Section 5 of Efron (2010).

If fs is a assumed to have a non-increasing density, we have a naturaltuning-parameter-free estimator l of the LFDR:

l(x) = (1−αn)fb(x)

αnf †s,n(x)+(1−αn)fb(x), for x ∈ (0, 1).

Page 23: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Simulation: lower bounds

Compare our method with that of Genevose & Wasserman (AoS,2004), and Meinshausen & Rice (AoS, 2006).

The method in Meinshausen & Rice (AoS, 2006) are extensions ofthose proposed in Genevose & Wasserman (AoS, 2004).

Table: Coverage probabilities of nominal 95% lower confidence bounds for thethree methods when n = 1000.

αSetting I Setting II

α0L αGW

L αMRL α0

L αGWL αMR

L

0 0.95 0.98 0.93 0.95 0.98 0.930.01 0.97 0.98 0.99 0.97 0.97 0.990.03 0.98 0.98 0.99 0.98 0.98 0.990.05 0.98 0.98 0.99 0.98 0.98 0.990.10 0.99 0.99 1.00 0.99 0.98 0.99

Page 24: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Real example: Prostate data

Genetic expression levels for n = 6033 genes for m1 = 50 normalcontrol subjects and m2 = 52 prostate cancer patients.

Goal: To discover a small number of “interesting” genes whoseexpression levels differ between the cancer and control patients.

Such genes, once identified, might be further investigated for acausal link to prostate cancer development.

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10

50

100

150

200

250

p values

Fre

qu

en

cy

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.40

0.005

0.01

0.015

0.02

0.025

0.03

0.035

α

Page 25: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

The two-sample t-statistic for testing significance of gene i is

ti =xi (1)− xi (2)

si∼ t100 ( under H0),

where si is an estimate of the standard error of xi (1)− xi (2), i.e.,

s2i = (1/50 + 1/52)[

∑50j=1{xij − xi (1)}2 +

∑102j=51{xij − xi (2)}2]/100.

0.0 0.1 0.2 0.3 0.4 0.5 0.6

0.0

0.2

0.4

0.6

0.8

1.0

x

CD

F

0.0 0.2 0.4 0.6 0.8 1.0

02

46

81

0

x

De

nsity

Fαns,n (in dotted red) and F

†s,n (in solid blue). The density f

†s,n .

αn = 0.0877. The lower bound for α0 is 0.0512.

Page 26: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

An application in astronomy

We analyse the radial velocity (RV; line of sight velocity)distribution of stars in Carina, a dwarf spheroidal (dSph) galaxy.

The data have been obtained by Magellan and MMT telescopes andconsist of radial velocity measurements for n = 1215 stars fromCarina, contaminated with Milky Way stars in the field of view.

We would like to understand the distribution of the line of sightvelocity of stars in Carina.

For the contaminating stars from the Milky Way in the field of view,we assume a non-Gaussian velocity distribution Fb that is knownfrom the Besancon Milky Way model (Robin et. al, 2003),calculated along the line of sight to Carina.

Page 27: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

−50 0 50 100 150 200 250 300 3500

0.005

0.01

0.015

0.02

0.025

Radial Velocity (RV)

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.80

0.02

0.04

0.06

0.08

0.1

0.12

0.14

0.16

0.18

0.2

γ

In the left panel we have the histogram of the radial velocity of thecontaminating stars overlaid with the (scaled) kernel densityestimator of the Carina data set.

The right panel shows the plot of γdn(F γs,n, Fγs,n) (in solid blue)

overlaid with its (scaled) second derivative (in dashed red).

Our estimate αn of α0 for this data set turns out to be 0.356, whilethe lower bound for α0 is found to be 0.322.

Page 28: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Astronomers usually assume the distribution of the radial velocitiesfor these dSph galaxies to be Gaussian in nature.

The figure below shows F αns,n (in dashed red) overlaid with the closest

(in terms of minimising the L2(F αns,n) distance) Gaussian distribution

(in solid blue).

Indeed, we see that F αns,n is close to a normal distribution (with mean

222.9 and standard deviation 7.51).

190 200 210 220 230 240 250 2600

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Line of sight velocity

CDF

Page 29: Estimation of a Two-component Mixture Modelbodhi/Talks/2ComponentMixModel.pdf · Estimation of a Two-component Mixture Model Bodhisattva Sen1;2 University of Cambridge, Cambridge,

Estimation of αLower bounds for α

Estimating of Fs and its density fs

Summary

Consistent estimation of a two-parameter mixture model usingtechniques from shape-restricted function estimation.

Avoids the need to specify tuning parameters.

Such shape constraints arise naturally in many contexts.

References

Benjamini, Y. & Hochberg, Y. (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing.J. R. Statist. Soc. B, 57, 289–300.

Efron, B. (2010). Large-Scale Inference: Empirical Bayes Methods for Estimation, Testing, and Prediction. IMS Monographs.

Genovese, C. & Wasserman, L. (2004). A stochastic process approach to false discovery control. Ann. Statist., 32, 1035–1061.

Meinshausen, N. & Rice, J. (2006). Estimating the proportion of false null hypotheses among a large number of independentlytested hypothesis. Ann. Statist., 34, 373–393.

Robertson,T., Wright, F. T. & Dykstra, R.L. (1988). Order restricted statistical inference. Wiley, New York.

Thank You!

Questions?