exploratory data analysis

2024. 10. 24. 18:58Statistics

Stem and leaf plot:
The center of the distribution (median number) can be known.
The overall shape of the distribution can be seen.

Frequency distribution table:
This is a table that classifies the observations of the collected quantitative data into each class and organizes the frequency of observations included in the section by class.

Histogram:
A graph showing the state of the frequency distribution in a column shape using the rank and frequency of the frequency distribution table. It is used to represent the form of continuous data, and the rank of continuous variables is displayed on the x-axis and the frequency is displayed on the y-axis.

Both the width and height of each pillar in the histogram have information.

The total area of the histogram is not 1, but the total area of the relative frequency density (probability) histogram is 1.

Accumulated frequency polygon (Ogive):
A graph that appears when the top midpoints of the column corresponding to the cumulative frequency of each class section are connected in a straight line.

Bar graph:
If the histogram is used to represent the form of continuous data, the bar graph is used to represent categorical data. The horizontal axis displays categories of categorical variables, and the longitudinal axis displays frequencies.

Box and whisker plot:
Instead of drawing a graph using the given data as it is, draw it using the five numerical summaries (minimum, quartile1, median, quartile3, maximum), which are statistics obtained from the data.



'Statistics' 카테고리의 다른 글

t test  (0) 2024.10.25
Test statistics  (0) 2024.10.25
Hypothesis testing procedure  (0) 2024.10.25
Test method according to null hypothesis  (0) 2024.10.25
Analysis methods according to data type  (0) 2024.10.25