Data lecture

23

Transcript of Data lecture

Page 1: Data lecture
Page 2: Data lecture

DEFINITION

Data can be defined as a collection of facts or information

from which conclusions may be drawn.

Page 3: Data lecture

TYPES OF DATAQUANTITATIVE QUALITATIVE

Page 4: Data lecture

QUANTITATIVE DATA

• Quantitative or numerical data arise when the observations are counts

or measurements.

• The resulting data are a set of numbers

Eg:

• Cholesterol level in mg/dl

• Height in cms

• Blood sugar in mg/dl

• Number of children in family (numbers)

• Number of diarrheal episodes (numbers)

Page 5: Data lecture

QUANTITATIVE DATA CLASSIFICATION

Page 6: Data lecture

Quantitative data . . .

• Discrete: The data are said to be discrete if the measurements are integers

• Only whole numbers are possible

• There are gaps between numbers

• Eg:

• Number of children in family

• Number of cigarettes smoked per day

Page 7: Data lecture

• Continuous: if the measurements can take on any value, usually within some range

• Theoretically, no gaps between possible values

• Eg:

• Cholesterol (mg/dl)

• Weight (Kgs)

• Blood sugar fasting (mg/dl)

• Most biological variables are continuous.

Quantitative data . . .

Page 8: Data lecture

QUALITATIVE DATA

• The objects being studied are grouped into categories based on some qualitative trait.

• Data that is not given numerically

• Eg:• Religion

• Blood pressure levels

• Smoking status

Page 12: Data lecture

Assessment

Identify the type of data

Gender

Exact age

Number of fillings in tooth

Self reported level of pain

Stages of cancer

Height (in cms)

Qualitative nominal

Quantitative discrete

Quantitative discrete

Qualitative ordinal

Qualitative ordinal

Quantitative continuous

QUANTITATIVE

OR

QUALITATIVE

DISCRETE /CONTINUOUS

OR

NOMINAL/ORDINAL

Page 14: Data lecture

Measurement

• Process of assigning numbers to various aspects of

objects/events according to a rule.

• The aim of measurement is to provide accurate, objective,

sensitive and communicable descriptions of event.

Page 15: Data lecture

Levels of measurement

• Proposed by S. S. Stevens (1946)

• Each scale of measurement satisfies atleast one of the four properties:

• Identity – each value has a meaning

• Magnitude – values have an ordered relationship to other

• Equal intervals – values are spaced equally

• Absolute zero – scale has a true zero point

NOMINAL ORDINAL INTERVAL RATIO

Page 16: Data lecture

Nominal scale

• This scale only satisfies the identity property.

• Values assigned to variables represent a descriptive category, but have no inherent numerical value with respect to magnitude.

• They are usually coded

• Eg: Gender, religion, political affiliation

Page 17: Data lecture

Ordinal scale

• Has both identity and magnitude

• Ordinal data codes can be ranked

• Distance between codes is not meaningful

• Eg: Results of a running race, Staging of cancer

Page 18: Data lecture

Interval scale

• Has the properties of identity, magnitude, and equal intervals.

• Data can not only be ranked, but also have meaningful intervals between scale points

• Eg: Temperature in celsius and farhenheit

Page 19: Data lecture

Ratio

•Satisfies all properties of measurement

•Eg: Weight of an object, temperature in kelvin

•Minimum value is zero (cannot be negative)

Page 20: Data lecture
Page 21: Data lecture

Which scale is appropriate?

• To classify and categorize subjects – NOMINAL

• To rank people according to any characteristic – ORDINAL

• To quantify a trait where distance between rank is same – INTERVAL

• To quantify a trait which has an absolute zero - RATIO

Page 22: Data lecture

Assessment

Ordinal

Identify scale of measurement

Tooth mobility

Temperature in kelvin

Height

Temperature in celsius

Blood group

Number in sports jersey

Horse racing

Time

Ratio

Ratio

Interval

Nominal

Nominal

Ordinal

Ratio

Page 23: Data lecture

WHY DATA IS IMPORTANT ?

Qualitative and quantitative data behave differently and therefore studied

differently

The level of measurement helps you decide how to interpret data from that

variable

The level of measurement determines the type of statistics that is appropriate

for its analysis

Statistical analysis should be planned along with data collection procedures

so that they match