Lecture 8: More Hypothesis Testingfaculty.nps.edu/rdfricke/Business_Stats/lecture8.pdf · 3 The...

Business Statistics

Lecture 8: More Hypothesis

Testing

Goals for this Lecture

• Review of t-tests

• Additional hypothesis tests

• Two-sample tests

• Paired tests

The Basic Idea of Hypothesis Testing

• Start with a theory or hypothesis

• For example, m = 814.3

• Collect some data

• Ask: How unusual is it to see this data if the null hypothesis is true?

• If it’s unusual, reject the null hypothesis

• If not, fail to reject the null

• Remember, determine the hypothesis to be tested before looking before looking at the data

It All Ties Back to the Empirical Rule

• If we hypothesize that the data come from a N(0,1)

distribution, how unusual an observation must we see to

reject our hypothesis?

It depends on the alternative hypothesis…

-4 -3 -2 -1 0 1 2 3 4

For Example, a Two-sided Test

-4 -3 -2 -1 0 1 2 3 4

Null: The mean is equal to zero (H0: m = 0)

Alternative: The mean is not equal to zero (Ha: m ≠ 0)

If the rejection criterion is p-value < 0.05, we reject if our

observation is greater than 1.96 or less than -1.96:

In JMP

• JMP computes the probability of seeing

data as extreme or more extreme under

various alternate hypotheses

• You have to choose the appropriate p-value

• Then compare the JMP p-value to 0.05

• Smaller: reject the null

• Larger: fail to reject the null

• Output is in terms of rescaled “t-scores”

• Using t distribution comes from using s to

estimate s

Conducting the Test in JMP

• With one continuous variable, Analyze >

Distribution > red triangle > Test Mean

• Type in the mean to be tested (“Specify

Hypothesized Mean”)

• If population (“true”) standard deviation

known, enter it

• This will be a z-test

• If you leave it blank, JMP does a t-test

• It uses s to estimate s

Back to the Paint Case (primer.jmp)

• A More Complicated Question:

• Suppose we are less interested in the value of 1.2 and more interested in whether processes “a” and “b” have the same mean

• Null hypothesis

• Means are the same: ma- mb = 0

• Alternative hypothesis

• Means are different: ma- mb 0

Solution: Two-sample t-test

Process “a”Process “b”

Mean = mx

SD = sx

Mean = my

SD = sy

• Two sample t-test assumes Xs

and Ys are independent

X1, X2, …, XnY1, Y2, …, Ym

Random Samples

• What do you think the test statistic is?

• How should we rescale the test statistic?

• What does the p-value represent?

Results of Two Sample t-test

• Null Hypothesis: mx- my = 0

• Test Statistic:

• Fact: since and are independent:

• So

)()()( YVarXVarYXVar

( )yxSE X Y

Two-sample t-test

• Test statistic:

• Estimated standard error:

• Rescaled test statistic:

Rescaled Test Statistic

• For some test statistic T where m and s

are not known, compute

• m * is the hypothesized true value

• sT is the sample standard error of the

statistic T

Remember: Rescaling

• In a one-sample test of, choose m*

• Then T = , so the test statistic is

• In a two-sample test, you’re often

testing whether the means are equal

• T = , and the test statistic is

One-sample and Two-sample Tests

. .( ) . .( )

s d T s e X

* ( ) 0 ( )

. .( ) . .( ) . .( )

T X Y X Yt

s d T s e X Y s e X Y

• We must estimate sx and sy

• If sx = sy then we can get a better

estimate

• Remember: Sample variance for a

single sample:

j j xxn

22 )(1

Sample mean

Deviations from sample meanAverage squared deviation

from the mean

Equal Variances?

• Remember, SD is

calculated using

differences from

the mean

• Each group can

have very different

mean but standard

deviations can be

similar

Different Means But Similar SD

• Pooled estimate of sample variance:

1 12( ) ( )

( 1) ( 1)

j jj j

x x y ys

Sample mean for process a

Sample mean for process b

Used two degrees of freedom, n+m-2 left over

• Pooled estimate buys you more df

• Weighted average of and 2

Average squared deviation from different means

More Bang for the Buck

Conducting the Test in JMP

• Need two variables: one continuous and one

categorical (denoting group)

• Then: Analyze > Fit Y by X (continuous

variable is the Y and categorical the X) > red

triangle > Means/Anova/Pooled t

• See the “t Test” part of the output

Case: Taste Testing Teas

• Small taste test of teas (taste.jmp)

• 16 panelists in a focus group

• Each tasted two formulations of a

prepackaged iced tea

• Rated them on a scale of 1 (excellent) to 7

(really bad)

• Company wants to know if there is a

difference in ratings between the two

formulations

• Two-sample t-test on taste.jmp:

• Is there a

significant

difference?

An Initial Evaluation

Taste Case: Any Difference?

• Unless SD’s vastly different (factor of 2), the

equal variance assumption no big deal

Independence Assumption

Very Important

• Independence assumption for two

sample t-test is violated

• Good news: there is an alternate test

that can do even better

• Paired t-test assumes two observations

taken for each unit in the sample

• Observations on the same unit likely to be

more similar than obs’ns on different units

• Here same person tasted each formulation

Paired t-test Looks at Differences

x1-y1=d1

x2-y2=d2• .

xn-yn=dn

• Calculate differences for

each observation

• Calculate sample mean and

SD of differences

• Do a one sample t-test for

differences:

• H0: mean difference is zero

• Ha: mean difference is not 0

Paired t-test in JMP

• Use Analyze >

Matched Pairs

• Two variables,

paired by row:

Results: Paired t-test in JMP

Mean Difference is same as two sample test

SE is smaller –why??

• Heuristic:

• When xj and yj “vary together” then yj will

be big when xj is big

• Since xj & yj tend to be close together, xj-yj

is smaller than when X and Y independent

Why Pairing Helps

• Math:

• When and are not independent thenX Y

( ) ( ) ( ) 2 ( , )Var X Y Var X Var Y Cov X Y

• Cov or “covariance” measures linear

dependence between two variables

It Helps in this Case Because…

• People first have a like or dislike for tea

• Their ratings of the formulations are relative to

this overall opinion of tea

• Taking the difference removes the “person

effect”

0 1 2 3 4 5 6 7 8

Taste 2

Tend to

dislike tea

Tend to

like tea

-3 -2 -1 0 1 2 3

• xj-yj is horizontal distance to the y=x line

• xj-yj is smaller (typically) in the right hand plot

Independence vs. Dependence

Case: Sales Force Comparison

• Newly merged pharmaceutical company

(PharmSal.jmp)

• Two sales forces (“BW” & “GL”), one from

each of the merged companies

• 20 sales districts are the same

• Sales reps divided into these districts

• Sell essentially the same drugs

• Management wants to know if one sales

force outperforms the other

Sales by Division

Division

Minimum

215.25

197.75

Median

409.75

Maximum

Quantiles

Two-Sample t-test ResultsS

Division

• Under the independence

assumption, we conclude

that there is no difference

in the means

• But are they

independent?

The Sales Forces Are Dependent

• Dependence occurs by sales district:

Paired t-test Comparison

• Which

sales force

is doing

better?

More Complicated Tests

• There are even more complicated tests

you can do

• E.G., test for equal variance

• You’re never going to remember all the

steps for each test anyway

• Let the computer do it for you

Terminology

• One-sided vs. two-sided

• Comes from the statement of the alternative hypothesis

• Are you calculating the p-value using one tail or two?

• One-sample vs. two-sample

• Comes from the type of data and the question you are answering

• Are you testing a mean or a difference between means?

Which Test?

• How many populations are sampled?• One: one-sample test

• Two: read on

• Are observations in first sample independent of observations in second sample?• Yes: two-sample t-test

• No: paired t-test

• Big Clue:• Paired t-test needs two observations from each

unit• Unequal sample sizes 2 sample test

• Equal sample sizes you have to decide

Hypothesis Tests in the Computer Age

• Know the null and alternative

hypotheses

• Have some idea of what test statistics

you would look at

• Let the computer figure out how to

rescale them

• Let the computer figure out the p-value

• p-values are always interpreted the

same way

What we have learned so far…

• Descriptive Statistics

• Probability

• Inference for a population mean

• Confidence intervals

• Hypothesis testing

• One-sample test of the mean

• Two-sample tests

• Paired tests

Lecture 8: More Hypothesis Testingfaculty.nps.edu/rdfricke/Business_Stats/lecture8.pdf · 3 The...

Documents

Transcript of Lecture 8: More Hypothesis Testingfaculty.nps.edu/rdfricke/Business_Stats/lecture8.pdf · 3 The...

MethyleneBlueModulates -Secretase,ReversesCerebral ...hypothesis), abnormal Tau phosphorylation (Tau hypothesis), aberrant glial cytokine cycle (neuroinflammation hypothesis), increased

IE241: Introduction to Hypothesis Testing. Topic Slide Hypothesis testing………………………………………..3 Light bulb example………………………………………..4

Lecture8 nervous system

Lecture 5: Hypothesis Testingeckel/biostat2/slides/lecture5.pdfA note about approaches to two-sided hypothesis testing ... hypothesis testing If the null hypothesis value is ... of

Hypothesis-Testing Model-Complexity. Hypothesis Testing …..

MPO1: Quantitative Research Methods Session 2: Random ... · Example to illustrate conditional probability distributions, and hypothesis tests A rare disease infects 1 person in a

Lecture8 IDS

Hypothesis Testing

Lecture8 Memory Sys

Microsoft PowerPoint - LECTURE8

Introdução à Astronomia Prof. Antônio Kanaan Aula 8 – 21 ...kanaan/intastro/lecture8/lec8.pdf · [For example, the massive star Eta Carina has immense stellar winds. Hubble

DC Lecture8

Riemann hypothesis

Test Hypothesis

Pl lecture8

Lecture8 OB Teams

Non-parametric Hypothesis Testing Procedureshaalshraideh/Courses/IE347/Non...Non-parametric Hypothesis Testing Procedures Hypothesis Testing General Procedure for Hypothesis Tests

Hypothesis canvas

Statistical Hypothesis

Biz Forecasting Lecture8