Statistics in Research
-
Upload
kent-kawashima -
Category
Education
-
view
113 -
download
0
description
Transcript of Statistics in Research
![Page 1: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/1.jpg)
Coverage• Measures of Central Tendency– Mean– Median– Mode
• Measures of Variability and Dispersion– Range– Average deviation– Variance– Standard deviation
![Page 2: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/2.jpg)
Introduction to Notations
If variable X is the variable of interest, and that n measurements are taken;
then the notation X1, X2, X3, … , Xn will be used to represent n observations.
![Page 3: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/3.jpg)
Sigma
Indicates “summation of”
Σ
![Page 4: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/4.jpg)
Summation Notation
If variable X is the variable of interest, and that n measurements are taken;
the sum of n observations can be written as
Σ Xi = X1+X2+ … +Xn
n
i=1
![Page 5: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/5.jpg)
Summation Notation
Σ Xi = X1+X2+ … +Xn
n
i=1
Upper limit of summation
Lower limit of summation
Greek letter Sigma
![Page 6: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/6.jpg)
Rules of Summation
summation of the sum of variables is
Σ (Xi+Yi) = Σ Xi+ Σ Yi n
i=1
n
i=1
n
i=1
the sum of their summations
![Page 7: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/7.jpg)
Σ (ai+bi+…+zi) =
Σ ai+ Σ bi + … + Σ zi
n
i=1
n
i=1
n
i=1
n
i=1
The summation of the sum of variables is…
the sum of their summations.
![Page 8: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/8.jpg)
Rules of Summation
Σ cXi = c Σ Xi
= c(X1+X2+ … +Xn)
n
i=1
n
i=1
If c is a constant, then…
![Page 9: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/9.jpg)
Rules of Summation
Σ c = ncn
i=1
The summation of a constant is the product of upper limit of summation n and constant c.
![Page 10: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/10.jpg)
MEASURES OF CENTRAL TENDENCYStatistics in Research
![Page 11: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/11.jpg)
Mean
• The sum of all values of the observations divided by the total number of observations
• The sum of all scores divided by the total frequency
![Page 12: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/12.jpg)
Population mean μ = Σ Xi
N
i=1
N
Sample mean x = Σ Xi
n
i=1
n
![Page 13: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/13.jpg)
Σ fiXi
n
i=1
n
Mean in an Ungrouped Frequency
= (f1X1+f2X2+ … +fnXn)
where f is the frequencyof the occurring score
n
![Page 14: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/14.jpg)
Properties - Mean
• The most stable measure of central tendency• Can be affected by extreme values• Its value may not be an actual value in the
data set• If a constant c is added/substracted to all
values, the new mean will increase/decrease by the same amount c
![Page 15: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/15.jpg)
Median
• Positional middle of an array of data• Divides ranked values into halves with 50%
larger than and 50% smaller than the median value.
![Page 16: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/16.jpg)
If n is odd:
Md = X(n+1)/2
If n is even:
Md = Xn/2 + X(n/2)+1
2
![Page 17: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/17.jpg)
Properties - Median
• The median is a positional measure• Can be determined only if arranged in order• Its value may not be an actual value in the
data set• It is affected by the position of items in the
series but not by the value of each item• Affected less by extreme values
![Page 18: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/18.jpg)
Mode
• Value that occurs most frequently in the data set
• Locates the point where scores occur with the greatest density
• Less popular compared to mean and median measures
![Page 19: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/19.jpg)
Properties - Mode
• It may not exist, or if it does, it may not be unique
• Not affected by extreme values• Applicable for both qualitative and
quantitative data
![Page 20: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/20.jpg)
MEASURES OF VARIABILITY AND DISPERSION
Statistics in Research
![Page 21: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/21.jpg)
Range
• Measure of distance along the number line over where data exists
• Exclusive and inclusive range– Exclusive range = largest score - smallest score– Inclusive range = upper limit - lower limit
![Page 22: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/22.jpg)
Properties - Range
• Rough and general measure of dispersion• Largest and smallest extreme values
determine the range• Does not describe distribution of values within
the upper and lower extremes• Does not depend on number of data
![Page 23: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/23.jpg)
Absolute Deviation
Average of absolute deviations of scores from the mean (Mean Deviation) or the median (Median Absolute
Deviation)
![Page 24: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/24.jpg)
MD = Σ | Xi - X |n
i=1
n
MAD = Σ | Xi - Md |n
i=1
n
![Page 25: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/25.jpg)
Properties – Absolute Deviation
• Measures variability of values in the data set• Indicates how compact the group is on a
certain measure
![Page 26: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/26.jpg)
Variance
• Average of the square of deviations measured from the mean
• Population variance (σ2) and sample variance (s2)
![Page 27: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/27.jpg)
σ2 = Σ ( Xi - μ )2N
i=1
N
s2 = Σ ( Xi – X )2n
i=1
n -1
![Page 28: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/28.jpg)
s2 = nΣ Xi2 - (Σ Xi )2
n
i=1
n(n -1)
n
i=1
![Page 29: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/29.jpg)
Properties – Variance
• Addition/subtraction of a constant c to each score will not change the variance of the scores
• Multiplying each score by a constant c changes the variance, resulting in a new variance multiplied by c2
![Page 30: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/30.jpg)
Standard Deviation
• Square root of the average of the square of deviations measured from the mean – square root of the variance
• Population standard deviation (σ) and sample standard deviation (s)
![Page 31: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/31.jpg)
σ = Σ ( Xi - μ )2N
i=1
N
s = Σ ( Xi – X )2n
i=1
n -1
![Page 32: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/32.jpg)
Why n-1?• Degrees of freedom– Measure of how much precision an estimate of
variation has– General rule is that the degrees of freedom
decrease as more parameters have to be estimated
• Xbar estimates μ• Using an estimated mean to find the standard
deviation causes the loss of ONE degree of freedom
![Page 33: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/33.jpg)
Properties – Standard Deviation
• Most used measure of variability• Affected by every value of every observation• Less affected by fluctuations and extreme
values
![Page 34: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/34.jpg)
Properties – Standard Deviation
• Addition/subtraction of a constant c to each score will not change the standard of the scores
• Multiplying each score by a constant c changes the standard deviation, resulting in a new standard deviation multiplied by c
![Page 35: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/35.jpg)
Choosing a measure
• Range– Data are too little or scattered to justify more
precise and laborious measures– Need to know only the total spread of scores
• Absolute Deviation– Find and weigh deviations from the mean/median– Extreme values unduly skews the standard
deviation
![Page 36: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/36.jpg)
Choosing a measure
• Standard Deviation– Need a measure with the best stability– Effect of extreme values have been deemed
acceptable– Compare and correlate with other data sets
![Page 37: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/37.jpg)
FREQUENCY DISTRIBUTIONStatistics in Research
![Page 38: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/38.jpg)
74 79 69 72 53 76 62 82 84 87 9672 79 68 71 50 75 60 81 84 86 9172 77 66 69 50 75 59 80 82 85 8872 77 66 69 50 75 60 81 83 85 8973 78 68 70 50 75 60 81 83 86 8973 59 65 69 50 75 77 80 82 84 8773 79 68 71 51 76 62 81 84 87 9273 79 68 71 52 76 62 82 84 87 9474 79 68 71 53 76 62 82 84 87 9450 57 63 69 72 74 77 80 82 84 87
Raw data
![Page 39: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/39.jpg)
50 57 63 69 72 74 77 80 82 84 8750 59 65 69 72 75 77 80 82 84 8750 59 66 69 72 75 77 80 82 85 8850 60 66 69 72 75 77 81 83 85 8950 60 68 70 73 75 78 81 83 86 8950 60 68 71 73 75 79 81 84 86 9151 62 68 71 73 76 79 81 84 87 9252 62 68 71 73 76 79 82 84 87 9453 62 68 71 74 76 79 82 84 87 9453 62 69 72 74 76 79 82 84 87 96
Array
![Page 40: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/40.jpg)
Frequency Distribution Table
• Class Frequency– Number of observations within a class, f
• Class Limits– End numbers of the class
• Class Interval– Interval between the upper and lower class limits,
ie: [Xupper limit , Xlower limit ]
![Page 41: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/41.jpg)
Frequency Distribution Table• Class Boundaries– True limits of the class, halfway between class limit of
the current class and that of the preceding/succeeding class, LCB and UCB
• Class Size– Difference between UCB and LCB,
ie: XUCB - XLCB
• Class Mark– Midpoint of the class interval, average value of the
upper and lower class limits, ie. Xupper limit - Xlower limit
![Page 42: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/42.jpg)
Constructing an FDT
• Determine number of classes– Sturges Formula, K = 1 + 3.322 log n– Square Root, K = sqrt(n)
• Determine the approximate class size, C’ = R/K• Round off C’ to a more convenient number C
![Page 43: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/43.jpg)
Constructing an FDT
• Determine lower class limit– Lowest class should not be empty, must contain
the lowest value in the data set
• Determine succeeding lower class limits by adding class size C to the current lower class limit
• Tally frequencies
![Page 44: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/44.jpg)
50 57 63 69 72 74 77 80 82 84 8750 59 65 69 72 75 77 80 82 84 8750 59 66 69 72 75 77 80 82 85 8850 60 66 69 72 75 77 81 83 85 8950 60 68 70 73 75 78 81 83 86 8950 60 68 71 73 75 79 81 84 86 9151 62 68 71 73 76 79 81 84 87 9252 62 68 71 73 76 79 82 84 87 9453 62 68 71 74 76 79 82 84 87 9453 62 69 72 74 76 79 82 84 87 96
Array
![Page 45: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/45.jpg)
Frequency Distribution TableClass Frequency LCB UCB RF <CF >CF
50-54 10 49.5 54.5 0.09 10 11055-59 3 54.5 59.5 0.03 13 10060-64 8 59.5 64.5 0.07 21 9765-69 13 64.5 69.5 0.12 34 8970-74 17 69.5 74.5 0.15 51 7675-79 19 74.5 79.5 0.17 70 5980-84 22 79.5 84.5 0.20 92 4085-89 13 84.5 89.5 0.12 105 1890-94 4 89.5 94.5 0.04 109 595-99 1 94.5 99.5 0.01 110 1
![Page 46: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/46.jpg)
Other Terms• Relative frequency, RF– Class frequency divided by number of
observations, ie. RF = fi / n• Relative Frequency Percentage, RFP– RF = (fi / n) x 100%
• Cummulative frequency– Shows accumulated frequencies of successive
classes, either from the beginning (less than CF) or end (greater than CF) of the FDT
![Page 47: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/47.jpg)
Mean from an FD
X = Σ fiXi
K
i=1
Σ fi
K
i=1
where Xi = class mark of the ith class
![Page 48: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/48.jpg)
Median from an FD
Md = LCBMd + C n/2 - <CFMd-1
where LCBMd = lower class boundary of median class <CFMd-1 = less than cumulative frequency preceeding the
median class
fMd
![Page 49: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/49.jpg)
Mode from an FD
Mo = LCBMo + C fMo - fMo-1
where LCBMo = lower class boundary of modal classfMo, fMo-1, fMo+1 = frequency of modal class, class preceding and
class succeeding the modal class
2fMo - fMo-1 - fMo+1
![Page 50: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/50.jpg)
MD = Σ fi |Xi - X|n
i=1
n
Mean Deviation from an FD
where Xi = class mark of the ith class n = total number of observations; total
frequency, ie. n = Σ fi
![Page 51: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/51.jpg)
s2 = Σ fi(Xi - X)2n
i=1
(n -1)
Variance from an FD
where Xi = class mark of the ith class n = total number of observations; total
frequency, ie. n = Σ fi
![Page 52: Statistics in Research](https://reader036.fdocument.pub/reader036/viewer/2022081414/54c6b0674a7959526c8b45e5/html5/thumbnails/52.jpg)
s2 = nΣ fiXi2 - (Σ fiXi )2
n
i=1
n(n -1)
Variance from an FD
n
i=1
where Xi = class mark of the ith class n = total number of observations; total
frequency, ie. n = Σ fi