ch10-solns-all_skuce_2e

download ch10-solns-all_skuce_2e

of 41

Transcript of ch10-solns-all_skuce_2e

  • 8/12/2019 ch10-solns-all_skuce_2e

    1/41

    Instructors Solutions Manual - Chapter 10

    Chapter 10 Solutions

    Develop Your Skills 10.11. Call the defects on the night shift population 1, and the defects on the day shift

    population 2.

    H0: 1- 2= 0H1: 1- 2> 0= 0.05

    1x = 35.4, 2x = 27.8, s1= 15.3, s2= 7.9, n1= 45, n2= 50

    We are told that the population distributions of errors are normal.

    2.992

    50

    9.7

    45

    3.15

    0)8.274.35(

    )(

    22

    2

    2

    2

    1

    2

    1

    2121

    n

    s

    n

    s

    xxt

    Degrees of freedom: minimum of (n1 1) and (n2 1), so minimum (44, 49) = 44.

    Closest row in the table is for 45 degrees of freedom.p-value < 0.005

    Reject H0. There is sufficient evidence to infer that the number of defects is higheron the night shift than on the day shift, on average.

    Using the Excel template, we find a more exact p-value of 0.00196.

    MakingDecisionsAbouttheDifferencein

    PopulationMeanswithTwoIndependent

    Samples

    Dothesampledataappeartobenormally

    distributed? yes

    Sample1StandardDeviation 15.3

    Sample2StandardDeviation 7.9

    Sample1Mean 35.4

    Sample2Mean 27.8

    Sample1Size 45

    Sample2Size 50

    HypotheticalDifferenceinPopulationMeans 0

    tScore 2.99245

    OneTailedpValue 0.00196

    TwoTailedpValue 0.00393

    Copyright 2011 Pearson Canada Inc. 234

  • 8/12/2019 ch10-solns-all_skuce_2e

    2/41

    Instructors Solutions Manual - Chapter 10

    2. Call the population of purchases by females population 1, and the purchases by

    males population 2.

    H0: 1- 2= 0

    H1: 1- 2> 0= 0.025

    We start by creating histograms of the sample data to check for normality.

    0

    1

    2

    3

    4

    5

    6

    7

    Fre

    quency

    ValueofPurchase

    DrugstorePurchasesbyFemales

    0

    1

    2

    3

    4

    5

    6

    NumberofPurchases

    ValueofPurchase

    DrugstorePurchasesbyMales

    Neither histogram is perfectly normal, with the purchases by females in particular

    showing some skewness to the right. Note that these histograms were not designedfor comparison (classes are different for each), but to assess normality. We will

    proceed, but with some caution.

    Copyright 2011 Pearson Canada Inc. 235

  • 8/12/2019 ch10-solns-all_skuce_2e

    3/41

    Instructors Solutions Manual - Chapter 10

    We have the data in Excel, so can use the Data Analysis tool for the calculations.

    tTest:TwoSampleAssumingUnequalVariances

    PurchasesbyMales

    PurchasesbyFemales

    Mean 32.42933333 27.82428571

    Variance 85.32606381 23.95908791

    Observations 15 14

    HypothesizedMeanDifference 0

    df 22

    tStat 1.692875747

    P(T

  • 8/12/2019 ch10-solns-all_skuce_2e

    4/41

    Instructors Solutions Manual - Chapter 10

    3. Use the Excel template to construct the confidence interval. Of course, this can also

    be done manually with the formula 2

    2

    2

    1

    2

    1

    21n

    s

    n

    sscoretxx .

    Confidence

    Interval

    Estimate

    for

    the

    DifferenceinPopulationMeans

    Dothesampledataappeartobe

    normallydistributed? yes

    Sample1StandardDeviation 9.23721

    Sample2StandardDeviation 4.8948

    Sample1Mean $32.43

    Sample2Mean $27.82

    Sample1Size 15

    Sample2Size 14

    ConfidenceLevel

    (decimal

    form) 0.95

    UpperConfidenceLimit 10.2621

    LowerConfidenceLimit 1.052

    With 95% confidence, we estimate that the interval (-1.05, 10.26) contains the trueaverage difference in the purchase of females, compared to males, at this drugstore.

    We expect this interval to contain zero, since we failed to reject the hypothesis that

    the difference was zero in Exercise 2.

    Copyright 2011 Pearson Canada Inc. 237

  • 8/12/2019 ch10-solns-all_skuce_2e

    5/41

    Instructors Solutions Manual - Chapter 10

    4. Call the population of daily sales last year population 1, and the daily sales this year

    population 2.

    H0: 1- 2= 0

    H1: 1- 2< 0= 0.03

    We start by creating histograms of the sample data to check for normality.

    0

    2

    4

    6

    8

    10

    12

    Frequency

    DailySales

    SampleofHotDogVendor'sDaily

    Sales,LastYear

    0

    2

    4

    6

    8

    10

    12

    14

    Frequency

    DailySales

    SampleofHotdogVendorDailySales,

    ThisYear

    While neither histogram is perfectly normal, they are approximately normal, and

    sample sizes are fairly large, at 30 and 35.

    Copyright 2011 Pearson Canada Inc. 238

  • 8/12/2019 ch10-solns-all_skuce_2e

    6/41

    Instructors Solutions Manual - Chapter 10

    Again, using Excel, we obtain the following results.

    tTest:TwoSampleAssumingUnequalVariances

    LastYear'sDailySales ThisYear'sDailySalesMean 298.2857143 356.0666667Variance 3368.915966 593.9954023

    Observations 35 30

    HypothesizedMeanDifference 0

    df 47

    tStat 5.36356487

    P(T

  • 8/12/2019 ch10-solns-all_skuce_2e

    7/41

    Instructors Solutions Manual - Chapter 10

    Develop Your Skills 10.26. H0: there is no difference in the locations of the populations of ratings for trainees #1

    and #2

    H1: there is a difference in the locations of the populations of ratings for trainees #1

    and #2= 0.025

    Since these are ranked data, we must use the Wilcoxon Rank Sum Test. First, we

    have to see if the distributions are similar in shape and spread. With so few datapoints, this can be difficult to see. Below are two quick dot plots (you could also do

    a histogram with Excel) that reveal similarities in shape and spread.

    Ratings for Trainee #1

    *

    * * *

    * * * * *

    1 2 3 4 5

    Ratings for Trainee #2

    *

    *

    * *

    * * * *

    1 2 3 4 5 6

    The table below illustrates the ranking process.

    Performance Ratings

    Trainee #1 ordered

    ratings

    ranks Trainee #2 ordered

    ratings

    ranks

    1 1 1 4 2 2.5

    5 2 2.5 5 4 8

    3 3 4.5 4 4 8

    2 3 4.5 5 5 13.5

    3 4 8 6 5 13.5

    4 4 8 2 5 13.5

    4 4 8 5 5 13.5

    5 5 13.5 5 6 17

    4 5 13.5

    rank sum 63.5 rank sum 89.5

    Copyright 2011 Pearson Canada Inc. 240

  • 8/12/2019 ch10-solns-all_skuce_2e

    8/41

    Instructors Solutions Manual - Chapter 10

    We need only calculate the rank sum for the smallest sample, which is the ratings forTrainee #2, but both are shown here. Note that the tables are set up to for W1to be

    calculated from the smallestsample, so W1= 89.5 here (even though these are the

    ratings for Trainee #2). So, n1= 8, and n2= 9.

    Because sample sizes are below 10, we must use the tables to estimate the p-value.

    We see from the table that

    0.046 < P(W 89.5) < 0.057Since this is a two-tailed test,

    0.046 2 < p-value < 0.057 20.092 < p-value < 0.114

    Fail to reject H0. There is insufficient evidence to infer there is a difference in thelocations of the ratings of Trainee #1 and Trainee #2. Note the implications of this

    result. If you simply looked at the ratings, you probably would have concluded that

    the ratings for Trainee #2 were higher. However, they are not significantly higher,and the difference could just be a result of sampling variability. This means that

    Trainee #2 should not be promoted over Trainee #2 on the basis of these ratings.

    Some other criteria will have to be used to decide which trainee to promote.

    Copyright 2011 Pearson Canada Inc. 241

  • 8/12/2019 ch10-solns-all_skuce_2e

    9/41

    Instructors Solutions Manual - Chapter 10

    7. H0: there is no difference in the locations of the populations of distances travelled by

    the current best-selling golf ball and the new golf ballH1: the population of distances travelled by the current best-selling golf ball is to the

    left of the population of distances travelled by the new golf ball

    = 0.05

    First, we must check the histograms for normality.

    0

    1

    2

    3

    4

    5

    Frequency

    Metres

    DistancesTravelledbyCurrentBestSelling

    GolfBall

    0

    1

    2

    3

    4

    Frequency

    Metres

    DistancesTravelledbyNewGolfBall

    Both histograms are non-normal, but they are similar in shape and spread, so weproceed with the Wilcoxon Rank Sum Test.

    Copyright 2011 Pearson Canada Inc. 242

  • 8/12/2019 ch10-solns-all_skuce_2e

    10/41

    Instructors Solutions Manual - Chapter 10

    We will use Excel to analyze these data.

    WilcoxonRankSum TestCalculations

    sample1size 12

    sample

    2size

    15

    W1 221

    W2 157

    We will use the template for the Wilcoxon Rank Sum Test for independent samples.

    MakingDecisionsAboutTwoPopulation

    Locations,TwoIndependentSamplesofNon

    NormalQuantitativeData orRankedData

    (WRST)

    Sample1Size 12

    Sample2Size 15

    Arebothsamplesizesatleast10? yes

    Arethesamplehistogramssimilarinshape

    andspread? yes

    W1 221

    W2 157

    zScore(basedonW1) 2.58613519

    OneTailedpValue 0.00485294

    TwoTailed

    p

    Value 0.00970589

    Again, for consistency, we have selected Sample 1 as the smallest sample, so that we

    assign W1as the rank sum of the smallest sample, which contains the distances

    travelled by the new golf ball. We see that n1= 12, and n2= 15. Since both are 10,

    we can use the normal approximation to the sampling distribution of W1.

    If the distances travelled by the new golf ball are longer, we would expect W1to behigh.

    p-value = P(W1> 221) = 0.005

    The p-value < = 0.05. Reject H0. There is sufficient evidence to suggest that thepopulation of distances travelled by the current best-selling ball is to the left of the

    population of distances travelled by the new golf ball. Note this is equivalent to

    saying the distances travelled by the new golf ball are to the right of the distances

    travelled by the current best-selling ball.

    Copyright 2011 Pearson Canada Inc. 243

  • 8/12/2019 ch10-solns-all_skuce_2e

    11/41

    Instructors Solutions Manual - Chapter 10

    8. H0: there is no difference in the locations of the flight delays before takeoff before

    and after the redesign of the airportH1: the location of the population of flight delays before takeoff before the redesign

    of the airport is to the right of the population of flight delays before takeoff after

    the redesign of the airport

    = 0.05

    These are quantitative data, so we must check histograms.

    0

    2

    46

    8

    10

    12

    14

    Freq

    uency

    Minutes

    FlightDelaysBeforeTakeoff,Before

    AirportRedesign

    0

    2

    4

    6

    8

    10

    12

    14

    16

    Frequency

    Minutes

    FlightDelaysBeforeTakeoff,After

    AirportRedesign

    The histogram for the flight delays after the airport redesign appears non-normal.

    The data sets are not that similar in shape and spread. We will use the WilcoxonRank Sum Test, but we will be cautious about drawing conclusions about location

    only.

    Copyright 2011 Pearson Canada Inc. 244

  • 8/12/2019 ch10-solns-all_skuce_2e

    12/41

    Instructors Solutions Manual - Chapter 10

    Since the data are available in Excel, we can use the Wilcoxon Rank Sum Test

    Calculations tool to get the rank sums, and then use the template to calculate the p-value. The results are shown below.

    Wilcoxon Rank Sum Test Calculations

    sample 1 size 35 sample 2 size 35

    W1 1352.5

    W2 1132.5

    MakingDecisionsAboutTwoPopulation

    Locations,TwoIndependentSamplesofNon

    NormalQuantitativeData orRankedData

    (WRST)

    Sample1Size 35

    Sample2Size 35

    Arebothsamplesizesatleast10? yes

    Arethesamplehistogramssimilarinshape

    andspread? no

    W1 1352.5

    W2 1132.5

    zScore(basedonW1) 1.29207014

    OneTailedpValue 0.09816643

    TwoTailedpValue 0.19633286

    This is a one-tailed test, so p-value = 0.098 > = 0.05. There is insufficientevidence to infer that the population of flight delays before takeoff before the airport

    redesign is to the left of the population of flight delays before takeoff after the airport

    redesign.

    Copyright 2011 Pearson Canada Inc. 245

  • 8/12/2019 ch10-solns-all_skuce_2e

    13/41

    Instructors Solutions Manual - Chapter 10

    9. H0: there is no difference in the locations of the weight losses for young women aged

    18-25 who take the diet pill, compared with those who do not take the diet pillH1: the location of the population of weight losses for the young women aged 18-25

    who take the diet pill is to the right of the population of weight losses of the

    young women who do not take the diet pill

    = 0.04

    We are told the distributions of weight-loss are non-normal, and that both are skewed

    to the right, so there is similarity in shape of the distributions. No indication is givenof the spread of the data, so we will assume similar spreads, noting that our

    conclusions may not be valid if this is not the case.

    We are given the rank sums, and can proceed manually, or use the Excel template.

    The completed Excel template is shown below. Of course, you could also do this

    calculation with the formulas.

    MakingDecisionsAboutTwoPopulation

    Locations,TwoIndependentSamplesofNon

    NormalQuantitativeData orRankedData

    (WRST)

    Sample1Size 25

    Sample2Size 25

    Arebothsamplesizesatleast10? yes

    Arethesamplehistogramssimilarinshape

    andspread? yes

    W1 700W2 575

    zScore(basedonW1) 1.21267813

    OneTailedpValue 0.11262645

    TwoTailedpValue 0.22525291

    If the weight losses with the diet pill are higher, we would expect W 1to be high. Thep-value for a one-tailed test is 0.113, which is > = 0.04.

    Fail to reject H0. There is insufficient evidence to infer that the population of weightlosses of the young women aged 18-25 who took the diet pill is to the right of thepopulation of weight losses for those who did not take the diet pill.

    Copyright 2011 Pearson Canada Inc. 246

  • 8/12/2019 ch10-solns-all_skuce_2e

    14/41

    Instructors Solutions Manual - Chapter 10

    10. H0: there is no difference in the locations of the populations of food ratings by

    weeknight and weekend diners at a restaurantH1: there is a difference in the locations of the populations of food ratings by

    weeknight and weekend diners at a restaurant

    = 0.05

    These are ranked data, so we must examine the distributions for similarity in shape

    and spread. The dot plots below (created simply in Excel) illustrate.

    Dot Plot for Ratings of Weeknight Diners

    *

    * *

    * *

    * * * *

    1

    2

    3

    4

    5

    Dot Plot for Ratings of Weekend Diners

    *

    * *

    * * *

    * * *

    1 2 3 4 5

    There is some similarity in shape, as both dot plots are skewed to the right. However,

    there is much less variability in the ratings of the weekend diners. We will proceed

    with the Wilcoxon Rank Sum Test, but we must be cautious about makingconclusions about location.

    Copyright 2011 Pearson Canada Inc. 247

  • 8/12/2019 ch10-solns-all_skuce_2e

    15/41

    Instructors Solutions Manual - Chapter 10

    The assignment of ranks is illustrated in the following table.

    Rating by Weeknight

    Diners

    Ordered

    RatingsRank

    Ratings by

    Weekend

    Diners

    Ordered

    RatingsRank

    4 1 4.5 1 1 4.5

    5 1 4.5 3 1 4.5

    1 1 4.5 2 1 4.5

    2 1 4.5 1 1 4.5

    1 2 11 1 2 11

    2 2 11 1 2 11

    2 2 11 3 3 15

    1 4 17 2 3 15

    1 5 18 3 3 15

    86 85

    If there was a difference in the food ratings by weeknight and weekend diners at the

    restaurant, we would expect W1and W2to be different. They are very similar here.

    p-value = 2 P(W1> 86)

    Since both samples are of size 9, we must use the tables to approximate the p-value.

    The closest value in the table to 86 is 104, so we can be sure P(W1> 86) > 0.057.

    This means the p-value > 2 0.057 = 0.114.

    Fail to reject H0. There is not enough evidence to conclude there is a difference in the

    food ratings by weekend and weeknight diners at the restaurant.

    Chapter Review ExercisesThroughout these exercises, it is often possible to do the calculations manually, or withExcel. Manual calculations are sometimes illustrated, and when they are not, the results

    should be close to the Excel output.

    1. It is preferable to use the t-test, if the necessary conditions are met, because it is

    harder to reject the null hypothesis with the Wilcoxon Rank Sum Test. The t-testuses all of the information available from the sample data, while the WRST uses the

    ranks, not the actual values. Any time we can use the actual values to make a

    decision, we should.

    Copyright 2011 Pearson Canada Inc. 248

  • 8/12/2019 ch10-solns-all_skuce_2e

    16/41

    Instructors Solutions Manual - Chapter 10

    2. If population 1 was to the right of population 2, we would expect the values in

    sample 1 to be higher than the values in sample 2. As a result, the rank sum forsample 1 should be larger than for sample 2. However, when there are 10

    observations in each sample (so 20 values have to be ranked), the ranks have to add

    up to 210. This means that the rank sum for sample 2 has to equal 210 78 = 132.

    This tells us that the values in sample 1 are generally smaller and to the left of thevalues in sample 2. Therefore, there is no evidence that population 1 is to the right of

    population 2. The p-value here would be 1 P(W178) = 1 0.022 = 0.978. Be sure

    that you think about what the rank sums are telling you. This sample result would behighly unexpected, but if you didn't think about it, you might slip and draw exactly

    the wrong conclusion!

    3. When samples are different sizes, they will tend to have different rank sums, even if

    they come from equivalent populations. The smaller sample will have a smaller rank

    sum, simply because there are fewer data points. So, when comparing rank sums, wehave to take this into consideration. The table is based on the rank sum being

    calculated from the smallest of the two samples. The conclusions could be wrong ifyou mistakenly calculate W from the larger sample.

    4. The unequal-variances version is preferred because:

    i.The unequal-variances version of the t-test will lead to the right decision, even if thevariances are in fact equal (with very few exceptions).

    ii.It can be hard to determine if variances are in fact equal, especially with smallsample sizes. Really, you should do another sample to test for equal variances.

    Remember, the more times you skate across the same frozen lake, the more likelyyou are to observe a rare eventfalling in!and the greater the chance of a Type

    I error.iii.If you mistakenly assume that variances are equal when they are not, results will be

    unreliable, particularly when sample sizes are unequal (and especially when the

    smaller sample has the larger variance).

    5. The Excel template is preferred because the t-score will be more accurate than the

    one used for the manual calculation.

    6. Call the times managers spent on email in the past population 1, and the times

    managers spend on emails after the new procedures have been implemented

    population 2.

    H0: 1- 2= 0

    H1: 1- 2> 0= 0.05

    1x = 49.2, 2x = 39.6, s1= 22.3, s2= 10.6, n1= 27, n2= 25

    We are told that the population distributions of errors are normal.

    Copyright 2011 Pearson Canada Inc. 249

  • 8/12/2019 ch10-solns-all_skuce_2e

    17/41

    Instructors Solutions Manual - Chapter 10

    006.2

    256.10

    273.22

    0)6.392.49(

    )(

    22

    2

    2

    2

    1

    2

    1

    2121

    n

    s

    n

    s

    xxt

    Degrees of freedom: minimum of (n1 1) and (n2 1), so minimum (26, 24) =24.

    Using the table for 24 degrees of freedom, we see

    0.025 < p-value < 0.05Reject H0. There is sufficient evidence to infer that the average time spent by

    managers on email was lower after the new procedures were implemented.

    7. With 90% confidence, we estimate that the interval (1.52 minutes, 17.68 minutes)

    contains the true reduction in the average amount of time managers spend on email

    after the new procedures. The completed Excel template is shown below. Of course,

    this could also be done manually, using the formula 2

    2

    2

    1

    2

    1

    21n

    s

    n

    sscoretxx .

    ConfidenceIntervalEstimateforthe

    DifferenceinPopulationMeans

    Dothe

    sample

    data

    appear

    to

    be

    normallydistributed? yes

    Sample1StandardDeviation 22.3

    Sample2StandardDeviation 10.6

    Sample1Mean 49.2

    Sample2Mean 39.6

    Sample1Size 27

    Sample2Size 25

    ConfidenceLevel(decimalform) 0.9

    UpperConfidenceLimit 17.6756

    LowerConfidence

    Limit 1.52438

    Copyright 2011 Pearson Canada Inc. 250

  • 8/12/2019 ch10-solns-all_skuce_2e

    18/41

    Instructors Solutions Manual - Chapter 10

    8. Call the hours spent doing unpaid work around the home by men in 2000 population

    1, and the hours spent doing such work in 2009 population 2. Since it does notappear that the samemen were involved in the surveys, we will treat these as

    independent samples.

    H0: 1- 2= 0

    H1: 1- 2< 0= 0.025

    1x = 2.2, 2x = 2.6, s1= 0.6, s2= 1.3, n1= 55, n2= 55

    We are told that the samples appear normally distributed, and so will assume the

    population distributions are.

    We can proceed to do the calculations manually, or with the Excel template. The

    completed Excel template is shown below.

    MakingDecisionsAbouttheDifferencein

    PopulationMeanswithTwoIndependent

    Samples

    Dothesampledataappeartobenormally

    distributed? yes

    Sample1StandardDeviation 0.6

    Sample2StandardDeviation 1.3

    Sample1Mean 2.2

    Sample2Mean 2.6

    Sample1Size

    55

    Sample2Size 55

    HypotheticalDifferenceinPopulationMeans 0

    tScore 2.0719

    OneTailedpValue 0.02083

    TwoTailedpValue 0.04167

    This is a one-tailed test, so the p-value is 0.021 < = 0.025.

    Reject H0. There is sufficient evidence to infer that the average number of hours

    men spend doing unpaid work around the home has increased in 2009, comparedwith 2000.

    Copyright 2011 Pearson Canada Inc. 251

  • 8/12/2019 ch10-solns-all_skuce_2e

    19/41

    Instructors Solutions Manual - Chapter 10

    9. This question can be done manually with the formula, or with the Excel template.

    The completed template is shown below.

    ConfidenceInterval

    Estimate

    for

    the

    DifferenceinPopulationMeans

    Dothesampledataappeartobe

    normallydistributed? yes

    Sample1StandardDeviation 0.6

    Sample2StandardDeviation 1.3

    Sample1Mean 2.2

    Sample2Mean 2.6

    Sample1Size 55

    Sample2Size 55

    ConfidenceLevel

    (decimal

    form) 0.95

    UpperConfidenceLimit 0.0155

    LowerConfidenceLimit 0.7845

    We have 95% confidence that the interval (-0.78 hours, -0.02 hours) contains thechange in the average amount of time men spend doing unpaid work around the

    house in 2000, compared with 2009. This means that (0.02 hours, 0.78 hours)

    contains the increase in the average amount of time spend doing unpaid work around

    the house in 2009 compared with 2000.

    10. H0: there is no difference in the locations of the population of ratings of the

    appearance of the grocery storeH1: the location of the population of ratings of the appearance of the grocery store

    six months ago is different from the location of the population of current ratings

    of the grocery store

    = 0.05

    These are ranked data, so we must examine the distributions for similarity in shapeand spread. The diagrams below illustrate.

    Ratings for Grocery Store Appearance Six Months Ago

    * *

    * *

    * * *

    * * * * *

    1 2 3 4 5

    Copyright 2011 Pearson Canada Inc. 252

  • 8/12/2019 ch10-solns-all_skuce_2e

    20/41

    Instructors Solutions Manual - Chapter 10

    Current Ratings for Grocery Store Appearance

    * *

    *

    ** * *

    * * * *

    1 2 3 4 5

    The ratings appear to be similar in shape and spread. The assignment of ranks is

    illustrated below.

    Appearance Ratings SixMonths Ago

    OrderedRatings

    RanksCurrent Appearance

    RatingsOrdered

    RatingsRanks

    5 1 1.5 1 1 1.5

    4 2 3 5 3 5.5

    5 3 5.5 4 3 5.5

    4 3 5.5 5 4 11.5

    2 4 11.5 4 4 11.5

    1 4 11.5 4 4 11.5

    4 4 11.5 5 4 11.5

    4 4 11.5 3 5 19.53 5 19.5 5 5 19.5

    3 5 19.5 4 5 19.5

    5 5 19.5 3 5 19.5

    5 5 19.5 W1 136.5

    W2 139.5

    As usual, we focus on the rank sum of the smallest sample, which contains the

    current ratings for grocery store appearance.

    n1= 11, n2= 12, W1= 136.5

    Since both sample sizes are larger than 10, we can use the normal approximation tothe sampling distribution of W1.

    Copyright 2011 Pearson Canada Inc. 253

  • 8/12/2019 ch10-solns-all_skuce_2e

    21/41

    Instructors Solutions Manual - Chapter 10

    24807681.16

    12)11211)(12(11

    12

    )1nn(nn 2121

    W1

    132

    2

    )11211(11

    2

    )1nn(n 211

    W1

    28.0

    24807681.16

    1325.136

    1

    11

    W

    WW

    z

    This is a two-tailed test. If the ratings are different, then W1would be high. The p-

    value will be 2 P(W1136.5) = 2 P(z 0.28) = 2 (1 0.6103) = 0.7794.Fail to reject H0. There is insufficient evidence to infer that there is a difference

    between the locations of the populations of grocery store ratings for appearance sixmonths ago and currently.

    Copyright 2011 Pearson Canada Inc. 254

  • 8/12/2019 ch10-solns-all_skuce_2e

    22/41

    Instructors Solutions Manual - Chapter 10

    Of course, you could also use the Excel template to do these calculations. It is shown

    below.

    MakingDecisionsAboutTwoPopulation

    Locations,Two

    Independent

    Samples

    of

    Non

    NormalQuantitativeData orRankedData

    (WRST)

    Sample1Size 11

    Sample2Size 12

    Arebothsamplesizesatleast10? yes

    Arethesamplehistogramssimilarinshape

    andspread? yes

    W1 136.5

    W2 139.5

    zScore

    (based

    on

    W1) 0.27695585

    OneTailedpValue 0.390907

    TwoTailedpValue 0.781814

    11. First, realize these are matched-pairs data. Prices are for the same book each year.(Remember to think about whether you have independent or matched-pairs samples,

    because the techniques for each are different.)

    Next check to see if the differences are normally distributed. One possible histogramof differences is shown below.

    0

    1

    2

    3

    4

    5

    6

    7

    8

    9

    Fre

    quency

    (BookPriceLastYear)(BookPriceThisYear)

    BookPriceComparison

    Copyright 2011 Pearson Canada Inc. 255

  • 8/12/2019 ch10-solns-all_skuce_2e

    23/41

    Instructors Solutions Manual - Chapter 10

    The histogram is skewed to the right, but somewhat normal in shape.

    The results of the Data Analysis tool for the t-test are shown below.

    t

    Test:

    Paired

    Two

    Sample

    for

    Means

    BookPriceLastYear BookPriceThisYear

    Mean 14.54 12.426

    Variance 30.37956842 30.895162

    Observations 20 20

    PearsonCorrelation 0.879011986

    HypothesizedMeanDifference 0

    df 19

    tStat 3.471780487

    P(T

  • 8/12/2019 ch10-solns-all_skuce_2e

    24/41

    Instructors Solutions Manual - Chapter 10

    12. We are provided with summary data, and so can proceed either manually or with the

    Excel template. We are told that the sample data are normally distributed, so the t-test of the difference in means is appropriate.

    We will refer to the population of the number of exercises required to master the

    topic, according to professors, as population 1. The population of the number ofexercises required, according to the students experience, as population 2. We are

    asked if the professors have unrealistic expectation of the number of exercises that

    students need to master the topic. We interpret this to mean unrealistically high.

    In this case, the alternative hypothesis will be that 1- 2 > 0.

    H0: 1- 2= 0

    H1: 1- 2> 0= 0.01

    1x = 19.2, 2x = 12.3, s1= 5.2, s2= 3.6, n1= 15, n2= 20

    We are told the sample data appear normally distributed.The completed Excel template is shown below.

    MakingDecisionsAbouttheDifferencein

    PopulationMeanswithTwoIndependent

    Samples

    Dothesampledataappeartobenormally

    distributed? yes

    Sample1StandardDeviation 5.2

    Sample2Standard

    Deviation 3.6

    Sample1Mean 19.2

    Sample2Mean 12.3

    Sample1Size 15

    Sample2Size 20

    HypotheticalDifferenceinPopulationMeans 0

    tScore 4.40765

    OneTailedpValue 0.0001

    TwoTailedpValue 0.0002

    The one tailed p-value is 0.0001, which is less than 1%. Reject H0. There issufficient evidence to infer that professors have unrealistically high expectations of

    the number of exercises that students need to do to master this topic.

    Copyright 2011 Pearson Canada Inc. 257

  • 8/12/2019 ch10-solns-all_skuce_2e

    25/41

    Instructors Solutions Manual - Chapter 10

    13. The completed Excel template is shown below.

    ConfidenceIntervalEstimateforthe

    Differencein

    Population

    Means

    Dothesampledataappeartobe

    normallydistributed? yes

    Sample1StandardDeviation 5.2

    Sample2StandardDeviation 3.6

    Sample1Mean 19.2

    Sample2Mean 12.3

    Sample1Size 15

    Sample2Size 20

    ConfidenceLevel(decimalform) 0.99

    UpperConfidence

    Limit 11.2948

    LowerConfidenceLimit 2.50523

    We have 99% confidence that the interval (2.5, 11.3) contains the true

    overestimation of the number of exercises required to master this topic, compared to

    the actual experience of students. We would not particularly expect this interval tocontain zero, since the hypothesis test in Exercise 12 concluded that professors have

    higher expectations about the number of exercises required to master a topic,

    compared with students. The 99% confidence interval is wider than the interval thatdirectly corresponds to the hypothesis test in exercise 12 (the tail area there would be

    1%; for a 99% confidence interval, there is only % in each tail). However, even

    the wider interval does not contain zero.

    Copyright 2011 Pearson Canada Inc. 258

  • 8/12/2019 ch10-solns-all_skuce_2e

    26/41

    Instructors Solutions Manual - Chapter 10

    14. H0: A- B= 0

    H1: A- B0

    = 0.05

    Ax = 862, Bx = 731, sA= 362, sB= 223, nA= 31, nB= 25

    We are told the sample data appear normally distributed.The completed Excel template is shown below.

    MakingDecisionsAbouttheDifferencein

    PopulationMeanswithTwoIndependent

    Samples

    Dothesampledataappeartobenormally

    distributed? yes

    Sample1StandardDeviation 362

    Sample2Standard

    Deviation 223

    Sample1Mean 862

    Sample2Mean 731

    Sample1Size 31

    Sample2Size 25

    HypotheticalDifferenceinPopulationMeans 0

    tScore 1.66151

    OneTailedpValue 0.05143

    TwoTailedpValue 0.10287

    The two-tailed p-value is 0.103 > . Fail to reject H0. There is insufficient evidence

    to infer there is a difference in the number of pages produced by the two brands of

    cartridges, under these conditions.

    Copyright 2011 Pearson Canada Inc. 259

  • 8/12/2019 ch10-solns-all_skuce_2e

    27/41

    Instructors Solutions Manual - Chapter 10

    15. The completed Excel template is shown below.

    ConfidenceIntervalEstimateforthe

    Differencein

    Population

    Means

    Dothesampledataappeartobe

    normallydistributed? yes

    Sample1StandardDeviation 362

    Sample2StandardDeviation 223

    Sample1Mean 862

    Sample2Mean 731

    Sample1Size 31

    Sample2Size 25

    ConfidenceLevel(decimalform) 0.9

    UpperConfidence

    Limit 263.135

    LowerConfidenceLimit 1.1352

    We have 90% confidence that the interval (-1.1, 263.19) contains the difference in

    the number of pages produced by the two brands of printer cartridge, under these

    conditions.

    Copyright 2011 Pearson Canada Inc. 260

  • 8/12/2019 ch10-solns-all_skuce_2e

    28/41

    Instructors Solutions Manual - Chapter 10

    16. We will refer to the population of wait times for ITM support as population 1, and

    the population of wait times for Dull support as population 2.

    H0: 1- 2= 0

    H1: 1- 20

    = 0.051x = 8.5, 2x = 6.5, s1= 2.6, s2= 1.9, n1= 34, n2= 36

    We are told the sample data appear normally distributed.The completed Excel template is shown below.

    MakingDecisionsAbouttheDifferencein

    PopulationMeanswithTwoIndependent

    Samples

    Dothe

    sample

    data

    appear

    to

    be

    normally

    distributed? yes

    Sample1StandardDeviation 2.6

    Sample2StandardDeviation 1.9

    Sample1Mean 8.5

    Sample2Mean 6.5

    Sample1Size 34

    Sample2Size 36

    HypotheticalDifferenceinPopulationMeans 0

    tScore 3.65697

    OneTailed

    p

    Value 0.00027

    TwoTailedpValue 0.00054

    The two-tailed p-value is 0.00054 < . Reject H0. There is sufficient evidence toinfer there is a difference in average wait times for support between the ITM and

    Dull computers.

    Copyright 2011 Pearson Canada Inc. 261

  • 8/12/2019 ch10-solns-all_skuce_2e

    29/41

    Instructors Solutions Manual - Chapter 10

    17. The completed Excel template is shown below.

    ConfidenceIntervalEstimateforthe

    Differencein

    Population

    Means

    Dothesampledataappeartobe

    normallydistributed? yes

    Sample1StandardDeviation 2.6

    Sample2StandardDeviation 1.9

    Sample1Mean 8.5

    Sample2Mean 6.5

    Sample1Size 34

    Sample2Size 36

    ConfidenceLevel(decimalform) 0.95

    UpperConfidence

    Limit 3.09397

    LowerConfidenceLimit 0.90603

    We have 95% confidence that the interval (0.91 minutes, 3.09 minutes) contains the

    true extra average wait time for support for the ITM computers, compared with the

    Dull computers.

    This confidence interval corresponds directly to the two-tailed hypothesis test in

    Exercise 16. Since the null hypothesis of no difference was rejected there, we wouldnot expect this confidence interval to contain zero (and it does not).

    Copyright 2011 Pearson Canada Inc. 262

  • 8/12/2019 ch10-solns-all_skuce_2e

    30/41

  • 8/12/2019 ch10-solns-all_skuce_2e

    31/41

    Instructors Solutions Manual - Chapter 10

    The data are available on Excel, so it seems reasonable to use Excel to do the t-test.

    t-Test: Two-Sample Assuming Unequal Variances

    MinutesSpent with

    Each Client

    Last January

    MinutesSpent with

    Each Client

    This January

    Mean 50.63333333 59

    Variance 281.7574713 597.8823529

    Observations 30 35

    Hypothesized Mean Difference 0

    df 60

    t Stat -1.62607466

    P(T

  • 8/12/2019 ch10-solns-all_skuce_2e

    32/41

  • 8/12/2019 ch10-solns-all_skuce_2e

    33/41

    Instructors Solutions Manual - Chapter 10

    20. Call the amount of time spent by sales reps in a two week period with the old

    software population 1, and the amount of time spent by sales reps with the newsoftware population 2.

    H0: 1- 2= 0

    H1: 1- 2> 0= 0.04

    First we must examine the sample data. Histograms indicate the data are

    approximately normal, although the sample data for the old software are skewed tothe right. Also, sample sizes, at 30 and 35, are fairly large.

    0

    2

    4

    6

    8

    10

    12

    Frequency

    MinutesinaTwoWeekPeriod

    MinutesSpentbySalesRepsOn

    Computer,OldSoftware

    0

    2

    4

    6

    8

    10

    12

    F

    requency

    MinutesinaTwoWeekPeriod

    MinutesSpentbySalesRepsOn

    Computer,NewSoftware

    Copyright 2011 Pearson Canada Inc. 266

  • 8/12/2019 ch10-solns-all_skuce_2e

    34/41

    Instructors Solutions Manual - Chapter 10

    The Excel output for the t-test is shown below.

    tTest:TwoSampleAssumingUnequalVariances

    OldSoftware

    NewSoftware

    Mean 799.8 608

    Variance 172199.5 68901.88

    Observations 30 35

    HypothesizedMeanDifference 0

    df 48

    tStat 2.184543

    P(T

  • 8/12/2019 ch10-solns-all_skuce_2e

    35/41

    Instructors Solutions Manual - Chapter 10

    21. The completed Excel template is shown below.

    ConfidenceIntervalEstimateforthe

    DifferenceinPopulationMeans

    Dothe

    sample

    data

    appear

    to

    be

    normallydistributed? yes

    Sample1StandardDeviation 414.969

    Sample2StandardDeviation 262.492

    Sample1Mean 799.8

    Sample2Mean 608

    Sample1Size 30

    Sample2Size 35

    ConfidenceLevel(decimalform) 0.96

    UpperConfidenceLimit 377.26

    LowerConfidence

    Limit 6.34049

    At a 96% confidence level, it is estimated that the interval (6.3 minutes, 377.3

    minutes) contains the reduction in the amount of time that sales reps would spendover a two-week period, if the new software was adopted.

    Copyright 2011 Pearson Canada Inc. 268

  • 8/12/2019 ch10-solns-all_skuce_2e

    36/41

    Instructors Solutions Manual - Chapter 10

    22. The data are ranked, so the Wilcoxon Rank Sum Test will be used to make the

    comparisons.

    H0: there is no difference in the locations of the populations of ratings of high-speed

    Internet service for the cable TV company and the telephone company

    H1: there is a difference in the locations of the populations of ratings of high-speedInternet service for the cable TV company and the telephone company

    = 0.025

    Before we use the Wilcoxon Rank Sum Test, we must examine the data to see we can

    reasonably assume that the populations are similar in shape and spread.

    Two possible bar graphs of the data are shown below.

    0

    2

    4

    6

    8

    10

    12

    14

    1 2 3 4 5

    Frequency

    1=VerySatisfied,5=VeryDissatisfied

    Ratingsof

    Internet

    Service

    Provided

    by

    theTelephoneCompany

    0

    2

    4

    6

    8

    10

    12

    14

    1 2 3 4 5

    Frequency

    1=VerySatisfied,5=VeryDissatisfied

    Ratingsof InternetServiceProvidedby

    theCableTVCompany

    The distributions appear similar in shape and spread.

    Copyright 2011 Pearson Canada Inc. 269

  • 8/12/2019 ch10-solns-all_skuce_2e

    37/41

    Instructors Solutions Manual - Chapter 10

    Since the data are available in an Excel file, it seems appropriate to do the

    calculations in Excel. The output of the Wilcoxon Rank Sum Test Calculations isshown below.

    Wilcoxon Rank Sum Test Calculations

    sample 1 size 35

    sample 2 size 35

    W1 1349

    W2 1136

    The relevant template is shown below.

    MakingDecisionsAboutTwoPopulation

    Locations,

    Two

    Independent

    Samples

    of

    Non

    NormalQuantitativeData orRankedData

    (WRST)

    Sample1Size 35

    Sample2Size 35

    Arebothsamplesizesatleast10? yes

    Arethesamplehistogramssimilarinshape

    andspread? yes

    W1 1349

    W2 1136

    zScore

    (based

    on

    W1) 1.25095882

    OneTailedpValue 0.10547475

    TwoTailedpValue 0.2109495

    The two-tailed p-value is 0.21095. Fail to reject H0. There is insufficient evidence

    to infer there is a difference in the locations of the populations of ratings of Internet

    service by the cable TV company and the telephone company.

    Copyright 2011 Pearson Canada Inc. 270

  • 8/12/2019 ch10-solns-all_skuce_2e

    38/41

    Instructors Solutions Manual - Chapter 10

    23.

    a. The data are ranked, so we consider the Wilcoxon Rank Sum Test.

    H0: there is no difference in the locations of the populations of ratings of the old

    instructions and the new instructions for lawnmower assembly

    H1: the location of the population of ratings of the old instructions is to the right of thepopulation of ratings of the new instructions for lawnmower assembly (a higher-

    numbered rating means greater difficulty)

    = 0.05

    The requirement is that the distributions are similar in shape and spread. Two graphs

    of the data are shown below.

    0

    5

    10

    15

    1 2 3 4 5

    Frequency

    1=VeryEasytoReadandFollow,5=Very

    Difficultto

    Read

    and

    Follow

    RatingsforOldLawnmower

    AssemblyInstructions

    0

    5

    10

    15

    20

    1 2 3 4 5

    Fre

    quency

    1=VeryEasytoReadandFollow,5=Very

    DifficulttoReadandFollow

    RatingsforNewLawnmower

    AssemblyInstructions

    Copyright 2011 Pearson Canada Inc. 271

  • 8/12/2019 ch10-solns-all_skuce_2e

    39/41

    Instructors Solutions Manual - Chapter 10

    The distributions are similar in spread, but not in shape. Any conclusion we make

    from the Wilcoxon Rank Sum Test will be weaker, as a result.

    The ranking could of course be completed manually. The output from the WilcoxonRank Sum Test Calculations is shown below.

    WilcoxonRankSumTestCalculations

    sample1size 37

    sample2size 42

    W1 1697

    W2 1463

    Since both sample sizes are more than 10, we will use the Excel template to estimate

    p-value.

    MakingDecisionsAboutTwoPopulation

    Locations,TwoIndependentSamplesofNon

    NormalQuantitativeData orRankedData

    (WRST)

    Sample1Size 37

    Sample

    2

    Size 42Arebothsamplesizesatleast10? yes

    Arethesamplehistogramssimilarinshape

    andspread? no

    W1 1697

    W2 1463

    zScore(basedonW1) 2.13196395

    OneTailedpValue 0.01650491

    TwoTailedpValue 0.03300981

    p-value = 0.016

    Reject H0. There is sufficient evidence to infer that population distributions ofratings for the old and new instructions are different. We have seen that there is a

    difference in shape, but given the marked differences in the frequencies of the "1"

    and "5" ratings, we are probably safe to conclude that customers find the newinstructions easier to read and follow.

    Copyright 2011 Pearson Canada Inc. 272

  • 8/12/2019 ch10-solns-all_skuce_2e

    40/41

    Instructors Solutions Manual - Chapter 10

    b. These data are quantitative, and the samples are independent. We must check fornormality before proceeding. Two possible histograms for the sample data are

    shown below.

    0

    2

    4

    6

    8

    10

    12

    14

    Frequency

    Minutes

    LawnmowerAssemblyTimeswithOld

    Instructions

    0

    2

    4

    6

    8

    10

    12

    14

    16

    F

    requency

    Minutes

    LawnmowerAssemblyTimeswithNew

    Instructions

    Both distributions seem approximately normal. The distribution of times for the new

    instructions is somewhat skewed to the right. However, samples sizes (37 and 42) are

    fairly large, and we will proceed with the t-test.

    H0: 1- 2= 0H1: 1- 2> 0

    = 0.04

    Copyright 2011 Pearson Canada Inc. 273

  • 8/12/2019 ch10-solns-all_skuce_2e

    41/41

    Instructors Solutions Manual - Chapter 10

    The Excel output for this data set is shown below.

    tTest:TwoSampleAssumingUnequalVariances

    AssemblyTimes

    (Minutes)WithOld

    Instructions

    AssemblyTimes

    (Minutes)WithNew

    Instructions

    Mean 64.43243243 47.5952381

    Variance 379.9189189 216.4907085

    Observations 37 42

    HypothesizedMeanDifference 0

    df 67

    tStat 4.287366909

    P(T