Descriptive Statistics Graphing Techniques

Содержание

Слайд 2

Points and grades from examination

Points and grades from examination

Слайд 3

Sample size n=30
Data sorting → Frequency table
both for quantitative and qualitative data

Sample size n=30 Data sorting → Frequency table both for quantitative and qualitative data

Слайд 4

Exam grade

Exam grade

Слайд 5

Notation
Frequency … ni
Relative frequency … fi
Cumulative Frequency … Ni
Cumulative Percent … Fi

Notation Frequency … ni Relative frequency … fi Cumulative Frequency … Ni Cumulative Percent … Fi

Слайд 6

Points from class test

Points from class test

Слайд 7

Quantitative variables
Grouping into class intervals

Quantitative variables Grouping into class intervals

Слайд 8

How to select the intervals

Number of intervals → in order to describe

How to select the intervals Number of intervals → in order to
the characteristics of the data
Simple reccommendation
intervals of the same width

k … number of intervals
n … sample size

Слайд 9

…then

h … width of interval
R … Range=xmax-xmin
k … number of intervals

Our example:
n=30
R=20-2=18

…then h … width of interval R … Range=xmax-xmin k … number

Слайд 10

Points from class test

Points from class test

Слайд 11

Measures of Central Tendency

Measures that represent with a proper value the tendency

Measures of Central Tendency Measures that represent with a proper value the
of most data to gather around this value
Number of different measures of central tendency
the arithmetic mean
the median
the mode

Слайд 12

The arithmetic mean

Notation

arithmetic mean ……

the sum of the values of a

The arithmetic mean Notation arithmetic mean …… the sum of the values
variable divided by the number of scores (by the sample size)

Слайд 13

Properties of the arithmetic mean

it is expressed in the same unit of

Properties of the arithmetic mean it is expressed in the same unit
measure as the observed variable
it is the point in a distribution of measurements about which the sum of deviations are equal to zero
Note: deviation explains the distance and direction from a reference point – here the arithmetic mean, it is positive when the value is greater than the mean and negative when lower than the mean

3. the mean is very sensitive to extreme values

Слайд 14

Personal income (thousands CZK)

thousands CZK

Personal income (thousands CZK) thousands CZK

Слайд 15

12 of 16 values are below the arithmetic mean, because of

12 of 16 values are below the arithmetic mean, because of the
the highest value x16=120,5 (directors income)
personal income is a commonly studied variable in which other measure of central tendency is preferred

Слайд 16

Other measures of central tendency

The median….

The value above and below which one-half

Other measures of central tendency The median…. The value above and below
of the frequencies fall
n…odd number
median case number=(n+1)/2
n…even number
the arithmetic mean of the two middle values
Properties: Insensitive to extreme values

Слайд 17

Other measures of central tendency

The mode….

The value that occurs with greatest frequency
for

Other measures of central tendency The mode…. The value that occurs with
qualitative (nominal and ordinal) and quantitative discrete data
from a statistical perspective it is also the most probable value

Слайд 18

Personal income (thousands CZK)

n=16… even number

the median the mode

Personal income (thousands CZK) n=16… even number the median the mode

Слайд 19

Personal income (thousands CZK)

n=16… even number

the median the mode

Personal income (thousands CZK) n=16… even number the median the mode

Слайд 20

Use of mean, median and mode

The arithmetic mean
member of mathematical system in

Use of mean, median and mode The arithmetic mean member of mathematical
advanced statistical analysis
preferred measure of central tendency if the distribution is not skewed
The median
when the distribution is skewed
The mode
whenever a quick, rough estimate of central tendency is desired

Слайд 21

The mean, median, mode and skewness

The mean, median, mode and skewness

Слайд 22

to describe the spread of the data, its variation around a central

to describe the spread of the data, its variation around a central
value
we want to express the distance along the scale of values

Measures of Dispersion

Слайд 24

The Range….R

it is the distance between the largest and the smallest value
R=xmax-xmin
it

The Range….R it is the distance between the largest and the smallest
does not explain the variability inside the range !
very simple and straightforward measure of dispersion

Слайд 25

The Variance…s2

it is an average squared deviation of each value from the

The Variance…s2 it is an average squared deviation of each value from
mean
it is the sum of the squared deviations from the mean divided by n
when computing the variation based on sample we correct the calculation

Слайд 26

Working formulas

For easier computation

Formula 1

Formula 2

Working formulas For easier computation Formula 1 Formula 2

Слайд 27

the variance explains both
the variability of the values around the arithmetic

the variance explains both the variability of the values around the arithmetic
mean
the variability among the values
difficult interpretation
(it is expressed in the squares of the unit of measure)

Слайд 28

The Standard Deviation…s

it is the square root of variance
when computing the variation

The Standard Deviation…s it is the square root of variance when computing
based on sample

Слайд 29

it is expressed in the same unit of measure as the observed

it is expressed in the same unit of measure as the observed
variable
the size of the standard deviation is related to the variability in the values
the more homogeneous values, the smaller SD
the heterogeneous values, the larger SD
member of mathematical system in advanced statistical analysis (like the arthmetic mean)

Properties of the standard deviation

Слайд 30

Two data sets with the same arithmetic mean and different SD

Two data sets with the same arithmetic mean and different SD

Слайд 31

Example – Personal income (thousands CZK)

thousands CZK

Example – Personal income (thousands CZK) thousands CZK

Слайд 32

Coefficient of Variation…V

the ratio of the standard deviation to the mean
often reported

Coefficient of Variation…V the ratio of the standard deviation to the mean
as a percentage (%) by multiplying by 100

Слайд 33

it is a relative measure of dispersion
used when comparing two data sets

it is a relative measure of dispersion used when comparing two data
with different units or widely different means
values higher than 50% indicate large variability

Слайд 34

Example – Personal income (thousands CZK)

Example – Personal income (thousands CZK)

Слайд 35

Percentiles (Centiles)

value below which a certain percent of observations fall
scale of percentile

Percentiles (Centiles) value below which a certain percent of observations fall scale
ranks is comprised of 100 units
insensitive to extreme values

Слайд 36

Deciles

divides a distribution into 10 equal parts
there are 9 deciles
D1 – 1st

Deciles divides a distribution into 10 equal parts there are 9 deciles
decile
- 10 percent of values fall below it
D9 – 9th decile
- 90 percent of values fall below it

Слайд 37

divides a distribution into 4 equal parts
Q1 - 25 percent of values

divides a distribution into 4 equal parts Q1 - 25 percent of
fall below it
- 25th centile
Q2 - 50 percent of values fall below it
- 50th centile
Q3 – 75 percent fall below it
- 75th centile

Quartiles

Слайд 39

Graphing Techniques

Graphing Techniques

Слайд 40

Constructing graphs – Bar graph

x – axis: labels of categories
y – axis:

Constructing graphs – Bar graph x – axis: labels of categories y
frequency (relative frequency)
The height of each rectangle is the category`s frequency or relative frequency.

Слайд 41

Arranging the graph

nominal variables – we can arrange the categories in any

Arranging the graph nominal variables – we can arrange the categories in
order:alphabetically, decreasing/increasing order of frequency
ordinal variables – the categories should be placed in their naturally occuring order

Слайд 45

Constructing graphs – Pie graph

Pie chart – a circle divided into sectors

Constructing graphs – Pie graph Pie chart – a circle divided into

each sector represents a category of data
the area of each sector is proportional to the frequency of the category

Слайд 47

Constructing graphs – Histogram

bar graph for quantitative data
values are grouped into intervals

Constructing graphs – Histogram bar graph for quantitative data values are grouped
(classes)
constructed by drawing rectangles for each class of data
the height of each rectangle is the frequency of the class
the width of each rectangle is the same

Слайд 49

Histogram

Histogram

Слайд 51

Constructing graphs – Boxplot

box-and-whisker diagram
five number summary

Constructing graphs – Boxplot box-and-whisker diagram five number summary

Слайд 52

Boxplot

Q3

Q1

Q2

Boxplot Q3 Q1 Q2