Slide 2: Why do a project?
to be able to prepare data, analyze data, and provide meaningful answers
to given questions based on the results of the statistical analysis
Slide 3: Parts of the Project
data collection
data understanding
data preparation
data description
data analysis (hypothesis testing)
Slide 4: 1. Data Collection
Kaggle
https://www.kaggle.com/
own research
UCI Machine Learning Repository
https://archive.ics.uci.edu/ml/index.php
Slide 5: 2. Data Understanding
one of the most critical parts
necessary for further data analysis and hypothesis testing
both qualitative and quantitative variables need to be included
Slide 6: 3. Data Preparation
variables coding
value labels
data transformation/recoding
missing values analysis
extremes and outliers
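The preparation steps listed on this slide can be illustrated with a minimal sketch. It uses only the Python standard library; the records, the field names (`gender`, `hours`), the numeric codes, and the 1.5 × IQR outlier rule are all assumptions made for illustration, not something the slides prescribe.

```python
import statistics

# Hypothetical raw records; None marks a missing value.
raw = [
    {"gender": "m", "hours": 2.5},
    {"gender": "f", "hours": 3.0},
    {"gender": "f", "hours": None},
    {"gender": "m", "hours": 2.0},
    {"gender": "f", "hours": 3.5},
    {"gender": "m", "hours": 40.0},  # suspiciously large value
    {"gender": "f", "hours": 1.5},
    {"gender": "m", "hours": 2.8},
]

# Variable coding and value labels (assumed codes, for illustration only).
GENDER_CODES = {"m": 1, "f": 2}
GENDER_LABELS = {1: "male", 2: "female"}


def count_missing(records, field):
    """Count the records in which `field` has no value."""
    return sum(1 for r in records if r[field] is None)


def iqr_outliers(values):
    """Return the values outside [Q1 - 1.5*IQR, Q3 + 1.5*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if v < low or v > high]


# Recode the text categories into numeric codes.
coded = [{"gender": GENDER_CODES[r["gender"]], "hours": r["hours"]} for r in raw]

# Missing values analysis, then outlier screening on the observed values.
missing_hours = count_missing(coded, "hours")
observed_hours = [r["hours"] for r in coded if r["hours"] is not None]
outliers = iqr_outliers(observed_hours)

print("missing:", missing_hours)
print("outliers:", outliers)
```

Whether a flagged value is dropped, corrected, or kept is a judgment call that depends on the data set; the sketch only identifies candidates.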
Slide 7: 4. Data Description
descriptive statistics
graphs
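As a rough sketch of what the description step might look like, the example below summarizes one quantitative variable and one qualitative variable using only the Python standard library. The data, the variable names, and the text-based bar chart are assumptions for illustration; a real project would more likely use a statistics package or a plotting library for the graphs.

```python
import statistics
from collections import Counter

# Hypothetical data: one quantitative and one qualitative variable.
scores = [62, 75, 75, 81, 58, 90, 67, 75]
regions = ["north", "south", "south", "east", "north", "south", "east", "north"]


def describe(values):
    """Basic descriptive statistics for a quantitative variable."""
    return {
        "n": len(values),
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
        "min": min(values),
        "max": max(values),
    }


def frequency_table(categories):
    """Absolute frequencies for a qualitative variable."""
    return dict(Counter(categories))


def text_bar_chart(freq):
    """Very simple stand-in for a bar chart: one row of '#' per category."""
    return "\n".join(
        f"{label:<6} {'#' * count}" for label, count in sorted(freq.items())
    )


summary = describe(scores)
freq = frequency_table(regions)

print(summary)
print(text_bar_chart(freq))
```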
Slide 8: 5. Data Analysis (hypothesis testing)
comparing an average value between male/female, age categories, regions
test scores between male/female
expenditures between families from different regions
time spent on social networks between age categories
minimum of 2 hypotheses (team of 1 member)
minimum of 3 hypotheses (teams of 2 or 3 members)
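One way to test a hypothesis of the "average value between two groups" kind is sketched below. The slides do not say which test to use; this example uses a two-sided permutation test because it needs nothing beyond the Python standard library, whereas a course project would often use a t-test from a statistics package instead. The function name, the scores, and the number of permutations are assumptions for illustration.

```python
import random


def mean(values):
    """Arithmetic mean of a non-empty sequence."""
    return sum(values) / len(values)


def permutation_test(group_a, group_b, n_permutations=10_000, seed=0):
    """Two-sided permutation test for a difference in group means.

    H0: both groups come from the same distribution (no difference in means).
    Returns the observed difference (mean_a - mean_b) and an approximate p-value.
    """
    rng = random.Random(seed)  # fixed seed so repeated runs give the same result
    observed = mean(group_a) - mean(group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)

    extreme = 0
    for _ in range(n_permutations):
        # Randomly reassign the pooled values to the two groups.
        rng.shuffle(pooled)
        diff = mean(pooled[:n_a]) - mean(pooled[n_a:])
        if abs(diff) >= abs(observed):
            extreme += 1

    # Add-one correction keeps the estimated p-value strictly above zero.
    p_value = (extreme + 1) / (n_permutations + 1)
    return observed, p_value


# Hypothetical test scores for the "test scores between male/female" example.
male_scores = [72, 65, 80, 58, 77, 69]
female_scores = [75, 82, 68, 90, 79, 85]

difference, p_value = permutation_test(male_scores, female_scores)
print(f"difference in means: {difference:.2f}, p-value: {p_value:.4f}")
```

If the p-value falls below the chosen significance level (commonly 0.05), the null hypothesis of equal means is rejected. The same function can be applied pairwise to the region or age-category examples, although comparing more than two groups at once usually calls for a different test, such as ANOVA.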