Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 2551 |
Duplicate rows (%) | 25.5% |
Total size in memory | 498.0 KiB |
Average record size in memory | 51.0 B |
Variable types
Categorical | 3 |
---|---|
Numeric | 2 |
Dataset
Description | 학생표본 신체(키) 검사 rawdata |
---|---|
Author | 교육부 |
URL | https://www.data.go.kr/data/15051016/fileData.do |
학년도 has constant value "" | Constant |
Dataset has 2551 (25.5%) duplicate rows | Duplicates |
키 is highly overall correlated with 학교급별 | High correlation |
학교급별 is highly overall correlated with 키 | High correlation |
Reproduction
Analysis started | 2023-12-11 23:44:16.615631 |
---|---|
Analysis finished | 2023-12-11 23:44:17.528567 |
Duration | 0.91 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
학년도
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2016 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2016 |
---|---|
2nd row | 2016 |
3rd row | 2016 |
4th row | 2016 |
5th row | 2016 |
Common Values
Value | Count | Frequency (%) |
2016 | 10000 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2016 | 10000 |
학교급별
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
초 | |
---|---|
고 | |
중 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 고 |
---|---|
2nd row | 고 |
3rd row | 고 |
4th row | 초 |
5th row | 고 |
Common Values
Value | Count | Frequency (%) |
초 | 4014 | |
고 | 3235 | |
중 | 2751 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
초 | 4014 | |
고 | 3235 | |
중 | 2751 |
학년
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.587 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 3 |
95-th percentile | 6 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.4447975 |
---|---|
Coefficient of variation (CV) | 0.55848376 |
Kurtosis | 0.029889682 |
Mean | 2.587 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.87008432 |
Sum | 25870 |
Variance | 2.0874397 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 2709 | |
1 | 2670 | |
3 | 2669 | |
6 | 668 | 6.7% |
4 | 653 | 6.5% |
5 | 631 | 6.3% |
Value | Count | Frequency (%) |
1 | 2670 | |
2 | 2709 | |
3 | 2669 | |
4 | 653 | 6.5% |
5 | 631 | 6.3% |
6 | 668 | 6.7% |
Value | Count | Frequency (%) |
6 | 668 | 6.7% |
5 | 631 | 6.3% |
4 | 653 | 6.5% |
3 | 2669 | |
2 | 2709 | |
1 | 2670 |
성별
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
남 | |
---|---|
여 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 여 |
---|---|
2nd row | 남 |
3rd row | 남 |
4th row | 남 |
5th row | 여 |
Common Values
Value | Count | Frequency (%) |
남 | 5033 | |
여 | 4967 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
남 | 5033 | |
여 | 4967 |
키
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 765 |
---|---|
Distinct (%) | 7.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 152.95533 |
Minimum | 95.8 |
---|---|
Maximum | 193.7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 95.8 |
---|---|
5-th percentile | 121.395 |
Q1 | 140 |
median | 157 |
Q3 | 166 |
95-th percentile | 176.3 |
Maximum | 193.7 |
Range | 97.9 |
Interquartile range (IQR) | 26 |
Descriptive statistics
Standard deviation | 17.201766 |
---|---|
Coefficient of variation (CV) | 0.11246268 |
Kurtosis | -0.68669697 |
Mean | 152.95533 |
Median Absolute Deviation (MAD) | 11.4 |
Skewness | -0.50410706 |
Sum | 1529553.3 |
Variance | 295.90076 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
160.0 | 69 | 0.7% |
156.0 | 68 | 0.7% |
165.0 | 58 | 0.6% |
170.0 | 53 | 0.5% |
159.0 | 52 | 0.5% |
157.0 | 52 | 0.5% |
158.0 | 51 | 0.5% |
163.0 | 48 | 0.5% |
162.0 | 48 | 0.5% |
164.0 | 48 | 0.5% |
Other values (755) | 9453 |
Value | Count | Frequency (%) |
95.8 | 1 | |
103.4 | 1 | |
105.4 | 1 | |
108.5 | 1 | |
108.6 | 1 | |
109.0 | 1 | |
109.2 | 1 | |
109.3 | 1 | |
109.6 | 1 | |
110.0 | 1 |
Value | Count | Frequency (%) |
193.7 | 1 | |
190.5 | 1 | |
189.7 | 1 | |
189.0 | 1 | |
188.6 | 1 | |
188.2 | 1 | |
188.1 | 1 | |
187.5 | 2 | |
187.4 | 1 | |
187.2 | 2 |
학교급별 | 학년 | 성별 | 키 | |
---|---|---|---|---|
학교급별 | 1.000 | 0.747 | 0.017 | 0.753 |
학년 | 0.747 | 1.000 | 0.020 | 0.556 |
성별 | 0.017 | 0.020 | 1.000 | 0.558 |
키 | 0.753 | 0.556 | 0.558 | 1.000 |
성별 | 학교급별 | |
---|---|---|
성별 | 1.000 | 0.028 |
학교급별 | 0.028 | 1.000 |
학년 | 키 | 학교급별 | 성별 | |
---|---|---|---|---|
학년 | 1.000 | -0.119 | 0.425 | 0.014 |
키 | -0.119 | 1.000 | 0.625 | 0.431 |
학교급별 | 0.425 | 0.625 | 1.000 | 0.028 |
성별 | 0.014 | 0.431 | 0.028 | 1.000 |
학년도 | 학교급별 | 학년 | 성별 | 키 | |
---|---|---|---|---|---|
76828 | 2016 | 고 | 3 | 여 | 151.8 |
79417 | 2016 | 고 | 1 | 남 | 174.5 |
76173 | 2016 | 고 | 3 | 남 | 182.5 |
25527 | 2016 | 초 | 5 | 남 | 145.6 |
61860 | 2016 | 고 | 1 | 여 | 167.8 |
36217 | 2016 | 중 | 3 | 남 | 169.0 |
16261 | 2016 | 초 | 2 | 남 | 128.8 |
48893 | 2016 | 중 | 3 | 여 | 167.4 |
67790 | 2016 | 고 | 3 | 여 | 163.8 |
60289 | 2016 | 고 | 2 | 남 | 168.0 |
학년도 | 학교급별 | 학년 | 성별 | 키 | |
---|---|---|---|---|---|
65552 | 2016 | 고 | 1 | 남 | 182.5 |
51089 | 2016 | 중 | 3 | 남 | 170.8 |
50805 | 2016 | 중 | 1 | 남 | 150.2 |
79527 | 2016 | 고 | 1 | 남 | 173.3 |
50703 | 2016 | 중 | 3 | 여 | 169.5 |
32988 | 2016 | 초 | 3 | 남 | 141.1 |
41105 | 2016 | 중 | 3 | 남 | 170.9 |
45839 | 2016 | 중 | 1 | 남 | 167.6 |
70239 | 2016 | 고 | 1 | 여 | 154.4 |
18533 | 2016 | 초 | 3 | 여 | 129.3 |
Most frequently occurring
학년도 | 학교급별 | 학년 | 성별 | 키 | # duplicates | |
---|---|---|---|---|---|---|
569 | 2016 | 고 | 3 | 남 | 170.0 | 14 |
207 | 2016 | 고 | 1 | 여 | 161.0 | 13 |
1194 | 2016 | 중 | 2 | 여 | 160.0 | 13 |
225 | 2016 | 고 | 1 | 여 | 163.0 | 12 |
454 | 2016 | 고 | 2 | 여 | 160.0 | 12 |
592 | 2016 | 고 | 3 | 남 | 173.0 | 12 |
691 | 2016 | 고 | 3 | 여 | 158.0 | 12 |
934 | 2016 | 중 | 1 | 여 | 156.0 | 12 |
313 | 2016 | 고 | 2 | 남 | 170.0 | 11 |
344 | 2016 | 고 | 2 | 남 | 174.0 | 11 |