STUDYING PHENOMENA AND PROCESSES

Содержание

Слайд 2

DATA SETS CLASSIFICATION

By number of variables there are for each elementary unit

DATA SETS CLASSIFICATION By number of variables there are for each elementary
(=people, companies, countries, cities, etc.)
By the kind of measurement (numbers of categories in each case)
Whether there is a time sequence
Newly created or was previously created by someone else

Слайд 3

Univariate data – just one piece of information for each item.
We

Univariate data – just one piece of information for each item. We
can summarize basic properties
Bivariate data – two pieces of
information for each item.
+Relationship can be measured
Multivariate – many pieces of
information for each item.
+look at the interrelationships among
all the items

Слайд 4

LEVELS OF MEASUREMENT

Nominal-level variable has values that show difference that subjects have

LEVELS OF MEASUREMENT Nominal-level variable has values that show difference that subjects
on the characteristic being measured. Simply put, there is no inherent order of categories.
I.e. religion: Protestant, Catholic, Jewish, other;
Ordinal-level variable has values that show relative differences between subjects on the characteristic being measured. Simply put, there is a meaningful order of categories but we cannot measure the difference between categories.
I.e. Support for abortion values: oppose, neutral, support.
** Nominal + Ordinal =Qualitative data
Interval-level variable has values that communicate exact differences between subjects on the measured characteristic. We can measure both the order and the difference.
I.e. age: 18, 24, 30
***Interval=Quantitative

Слайд 5

QUANTITATIVE DATA (NUMBERS)

Discrete quantitative data can assume values only from a list

QUANTITATIVE DATA (NUMBERS) Discrete quantitative data can assume values only from a
of specific numbers.
I.e. gender of students, coded 0=male, 1=female;
Number of kids in household: 1=1kid, 2=2 kids, 3=3 kids, 4=4 or more kids;
Equipment breakdowns on a factory in the past 24 hours, out of 20 working machines
Continuous quantitative data – all positive numbers, all numbers, all values between 0% and 100%.

Слайд 6

Grouping statistical data

A data class is group of data which is related

Grouping statistical data A data class is group of data which is
by some user defined property.
For example, if you were collecting the ages of the people you met as you walked down the street, you could group them into classes as those in their teens, twenties, thirties, forties and so on. Each of those groups is called a class.
Each of those classes is of a certain width and this is referred to as the Class Interval or Class Size. This class interval is very important when it comes to drawing Histograms and Frequency diagrams. All the classes may have the same class size or they may have different classes sizes depending on how you group your data. The class interval is always a whole number.
Number of intervals: n=1+3,322*lgN (N – number of set values)
Interval size for equal intervals:
(Highest Value-Lowest Value)/no of classes
Size should be a whole number. I.e., if you get 2,7 – your class size is 3

Слайд 7

Grouping in Excel

I.e. you have a raw set of data in excel.

Grouping in Excel I.e. you have a raw set of data in
You have numbers of different people’s ages. Eg. 28 years old -50 ppl, 60 years old – 10 ppl, 14 years old – 10 ppl, etc.. You can sort the data by people’s ages and then, using the formula, count the length of an interval. Put the intervals and using the sum function, organize the data into intervals.
Or function =SUMMPRODUCT

Слайд 8

SPSS data grouping

We want to group income by less than 25, 25-49,

SPSS data grouping We want to group income by less than 25,
50-74, 75 and more
Go Transform-Visual Binning

Слайд 9

We can select scale or ordinal variable to bin them. Binning=take two or

We can select scale or ordinal variable to bin them. Binning=take two
more contiguous values and group them into a category Press “make cutpoints”

Слайд 10

We can select “Equal with Intervals” and put in First Cutpoint Location,

We can select “Equal with Intervals” and put in First Cutpoint Location,
Number of Cutpoints and Width .

Слайд 11

We can make labels if we want and choose to exclude or

We can make labels if we want and choose to exclude or
include the last number in interval. In the end, we get:

Слайд 12

DESCRIPTIVE STATISTICS

Measures of central tendency - identify the most typical value or

DESCRIPTIVE STATISTICS Measures of central tendency - identify the most typical value
best representative of a set of empirical data
Measures of dispersion- amount of variation around the most representative value

Слайд 13

Descriptive statistics: Nominal level

Central Tendency – mode (the most common value of

Descriptive statistics: Nominal level Central Tendency – mode (the most common value
the variable)
Dispersion – variation ratio

Слайд 14

Descriptive statistics: Ordinal level
Central Tendency – mode + median=the one in the

Descriptive statistics: Ordinal level Central Tendency – mode + median=the one in
middle=half the cases with values below the median and half above . Put the data in order and find the middle value.
Median is the value, which rank is [(N+1)/2] with odd variables, or if variables are even – N/2 and (N+1)/2.
Dispersion – range [(highest score-lowest score)+1]

Слайд 15

Descriptive statistics: Interval level
Central tendency: mean [=average]
+Weighed Average
Dispersion – Standard Deviation Sx=
Variation

Descriptive statistics: Interval level Central tendency: mean [=average] +Weighed Average Dispersion –
ratio for interval level=
=Standard Deviation/Average

Слайд 17

Average in Excel: Insert-Function-Average-Enter
Weighed average for named columns, where a is weights

Average in Excel: Insert-Function-Average-Enter Weighed average for named columns, where a is
column: SUMPRODUCT (a;b)/Summ(a).
Median in Excel: =MEDIAN(A1:AN)
Mode in Excel: =MODE(A1:AN)
Standard Deviation: =STDEV(A1:AN)
Or 1) Calculate Avg. 2) Calculate each value’s difference from Avg. 3) Square each one, sum the squared ones and divide by N-1 4) square root it all.
Or, much, much easier: Service-Data Analysis-Descriptive Statistics – and you get everything you need in one click.
(If there is no data analysis option – add it in excel properties)

Слайд 18

SPSS Summary measures for categorical data Go Analyze – Descriptive Statistics - Frequencies

SPSS Summary measures for categorical data Go Analyze – Descriptive Statistics - Frequencies

Слайд 19

Choose variables, press ok and you get your frequency table

Choose variables, press ok and you get your frequency table

Слайд 20

To graphically display press Charts and select the ones you like

To graphically display press Charts and select the ones you like

Слайд 21

Summary measures for scale variables in SPSS * Go Analyze-Descriptive Statistics-Frequencies * Choose variables *

Summary measures for scale variables in SPSS * Go Analyze-Descriptive Statistics-Frequencies *
Click “statistics”, select the ones you need.

Слайд 22

* You get the data in the viewer window * Go back to

* You get the data in the viewer window * Go back
frequencies dialog, click charts and choose the ones you want

Слайд 23

Summary with grouping variables in SPSS

Analyze – Reports- Case Summaries

Summary with grouping variables in SPSS Analyze – Reports- Case Summaries

Слайд 24

* SELECT THE VARIABLE TO BE SUMMARIZED AND A GROUPING VARIABLE, *

* SELECT THE VARIABLE TO BE SUMMARIZED AND A GROUPING VARIABLE, *
DESELECT “DISPLAY CASES” AND PRESS “STATISTICS”

Слайд 25

SELECT MEAN, MEDIAN, MINIMUM, MAXIMUM (or any other you might need) * CLICK

SELECT MEAN, MEDIAN, MINIMUM, MAXIMUM (or any other you might need) *
CONTINUE AND CLICK OPTIONS * IN AN OPTION WINDOW, YOU CAN GIVE A TITLE AND A CAPTION

Слайд 26

* YOU GET THE DATA GROUPED BY THE VARIABLE. * ALL THE

* YOU GET THE DATA GROUPED BY THE VARIABLE. * ALL THE
DESCRIPTIVES ARE GIVEN FOR EACH VARIABLE, AS WELL AS FOR “TOTAL”

Слайд 27

* YOU CAN ALSO LAYER YOUR DATA USING SEVERAL VARIABLES: * ANALYZE-COMPARE MEANS

* YOU CAN ALSO LAYER YOUR DATA USING SEVERAL VARIABLES: * ANALYZE-COMPARE MEANS - MEANS
- MEANS

Слайд 28

* ADD THE VARIABLE YOU WANT TO EXAMINE TO “DEPENDENT LIST” * ADD

* ADD THE VARIABLE YOU WANT TO EXAMINE TO “DEPENDENT LIST” *
THE VARIABLES YOU WANT TO GROUP BY TO “INDEPENDENT LIST” * IF YOU WANT TO HAVE MORE THAN ONE, PRESS NEXT AND ADD THEM

Слайд 29

WHAT YOU GET

Data, grouped by two variables. You get info bout hourly

WHAT YOU GET Data, grouped by two variables. You get info bout
salary, grouped by “years of experience” and “nurse type”.
What you can see is that nursing salaries seem to take into account both experience and type of work…

Слайд 30

You can also select certain cases that follow the rule you choose

You can also select certain cases that follow the rule you choose
(using if=, if> and any functions), as well as sort your data.
(Data-Select Cases; Data-Sort)

Слайд 31

One-way ANOVA (Means comparison) as a Bivariate Descriptive Statistic

Data-Weight Cases
Analyze-Compare Means-Means
Choose Statistics

One-way ANOVA (Means comparison) as a Bivariate Descriptive Statistic Data-Weight Cases Analyze-Compare
you want and check “Anova Table and eta”, as well as “Test for linearity”.
Anova table shows tests for linear, nonlinear and combined relationship.
Significance is lower than 0,05 – there is
linear relationship.

Слайд 32

Graphical visualization in Excel and SPSS Column charts - used to show amounts or

Graphical visualization in Excel and SPSS Column charts - used to show
the number of times a value occurs.

Grouping Histograms - compare values in each category
Cumulative Histograms – show the parts of total and their relation to total. They are useful when “totals” are important for us.
Standardized Cumulative Histograms. Compare percent that each category contribute to total. Useful for three or more variables and when we want to see each variable’s percent of contribution to the total.
3d Histograms compare different values. Can be used to compare data both in categories and in sets.
Cone, pyramid and cylinder can be used analogically

Слайд 33

Graphs

Can show continuous change of values over time on the same scale.

Graphs Can show continuous change of values over time on the same
Are perfect for trend visualization
Graphs and graphs with markers
Cumulative graphs– to show dynamics in contribution of each category to the total.
Normalized cumulative graphs – to demonstrate dynamics in percent contribution of each category to the total
3d graph

Слайд 34

Pie-charts

They are used to chart only one variable at a time. As

Pie-charts They are used to chart only one variable at a time.
a result, it can only be used to show percentages.
The circle of pie charts represents 100%. The circle is subdivided into slices representing data values. The size of each slice shows what part of the 100% it represents.
All the values should not be lower than 0. There should be no more than 7 categories.

Слайд 35

Secondary pie chart and secondary histogram show data regarding one of the

Secondary pie chart and secondary histogram show data regarding one of the
sectors of a pie chart.
Exploded pie chart concentrates on each value

Слайд 36

Bar charts

Are almost the same as histograms, they illustrate comparison of different

Bar charts Are almost the same as histograms, they illustrate comparison of
elements
These are useful when the axis lables are long, yet we want to see the difference between values.
The types are the same as in histograms.

Слайд 37

Area chart

Area charts are much like line charts, but they display different

Area chart Area charts are much like line charts, but they display
colors in the areas below the lines. This colorful and visual display distinguishes the data more clearly.
Area charts emphasize the magnitude of change over time and can be used to draw attention to the total value across a trend.
2-D area and 3-D area charts   display the trend of values over time or other category As a general rule, you should consider using a line chart instead of a nonstacked area chart, because data from one series can be obscured by data from another series.
Stacked area charts display the trend of the contribution of each value over time or other category data.
100% stacked area charts display the trend of the percentage that each value contributes over time or other category data.

Слайд 38

XY (scatter) charts

Scatter charts show the relationships among the numeric values in

XY (scatter) charts Scatter charts show the relationships among the numeric values
several data series, or plots two groups of numbers as one series of xy coordinates.
Scatter with only markers to compare pairs of values.
Scatter with smooth lines and scatter with smooth lines and markers
Scatter with straight lines and scatter with straight lines and markers  

Слайд 39

Stock charts

Most often used to illustrate the fluctuation of stock prices. However,

Stock charts Most often used to illustrate the fluctuation of stock prices.
this chart may also be used for scientific data. For example, you could use a stock chart to indicate the fluctuation of daily or annual temperatures. You must organize your data in the correct order to create stock charts.
High-low-close   The high-low-close stock chart is often used to illustrate stock prices. It requires three series of values in the following order: high, low, and then close.
Open-high-low-close   This type of stock chart requires four series of values in the correct order (open, high, low, and then close).
Volume-high-low-close   This type of stock chart requires four series of values in the correct order (volume, high, low, and then close). It measures volume by using two value axes: one for the columns that measure volume, and the other for the stock prices.
Volume-open-high-low-close   This type of stock chart requires five series of values in the correct order (volume, open, high, low, and then close).

Слайд 40

Surface charts

A surface chart is useful when you want to find optimum

Surface charts A surface chart is useful when you want to find
combinations between two sets of data. As in a topographic map, colors and patterns indicate areas that are in the same range of values.
You can use a surface chart when both categories and data series are numeric values.
Color bands in a surface chart do not represent the data series; they represent the distinction between the values. This chart shows a 3-D view of the data, which can be imagined as a rubber sheet stretched over a 3-D column chart. It is typically used to show relationships between large amounts of data that may otherwise be difficult to see.

Слайд 41

Doughnut charts

Like a pie chart, a doughnut chart shows the relationship of

Doughnut charts Like a pie chart, a doughnut chart shows the relationship
parts to a whole, but it can contain more than one data series.

Слайд 42

Bubble chart

Bubble or bubble with 3-D effect   Both bubble chart types compare sets

Bubble chart Bubble or bubble with 3-D effect Both bubble chart types
of three values instead of two. The third value determines the size of the bubble marker. You can choose to display bubbles in 2-D format or with a 3-D effect.

Слайд 43

Radar chart

Radar charts compare the aggregate values of several data series. Radar

Radar chart Radar charts compare the aggregate values of several data series.
charts display changes in values relative to a center point.

Слайд 44

SPSS has the same graphical visualization types plus a boxplot option.
A boxplot

SPSS has the same graphical visualization types plus a boxplot option. A
shows the five statistics: min, 1st quartile, median, 3rd quartile, maximum.
It is useful for displaying the distribution of a scale variable and pinpoiting outliers (unusual data values).

Слайд 45

Graphical visualization in SPSS

Graphical visualization in SPSS

Слайд 46

* Add variables, dragging them from the variables list to the canvas.

* Add variables, dragging them from the variables list to the canvas.
* Choose graph type below

Слайд 47

I.e. you drag Job Satisfaction to X axis, Household Income to Y

I.e. you drag Job Satisfaction to X axis, Household Income to Y
axis, choose the type as Bar. That’s what you get in the output window:
Имя файла: STUDYING-PHENOMENA-AND-PROCESSES-.pptx
Количество просмотров: 199
Количество скачиваний: 0