Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.9 KiB |
Average record size in memory | 70.3 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 5 |
Text | 1 |
Dataset
Description | Sample data (샘플 데이터) |
---|---|
Author | Kangwon National University (강원대학교) |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=8b23faa0-4883-11ed-aa8b-eb79d2b882ad |
월 has constant value "1" | Constant |
시도코드 has constant value "11" | Constant |
시도명 has constant value "서울" | Constant |
시군구코드 is highly overall correlated with 아이디 and 1 other fields | High correlation |
시군구명 is highly overall correlated with 아이디 and 1 other fields | High correlation |
아이디 is highly overall correlated with 시군구코드 and 1 other fields | High correlation |
아이디 has unique values | Unique |
격자번호 has unique values | Unique |
생활 has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 11:37:44.285826 |
---|---|
Analysis finished | 2023-12-10 11:37:45.621584 |
Duration | 1.34 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
아이디
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.5 |
Minimum | 1 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5.95 |
Q1 | 25.75 |
median | 50.5 |
Q3 | 75.25 |
95-th percentile | 95.05 |
Maximum | 100 |
Range | 99 |
Interquartile range (IQR) | 49.5 |
Descriptive statistics
Standard deviation | 29.011492 |
---|---|
Coefficient of variation (CV) | 0.57448499 |
Kurtosis | -1.2 |
Mean | 50.5 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 0 |
Sum | 5050 |
Variance | 841.66667 |
Monotonicity | Strictly increasing |
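The figures above are consistent with 아이디 being simply the integers 1 through 100 (100 distinct values, minimum 1, maximum 100, sum 5050, strictly increasing). The following is a minimal sketch that recomputes them with Python's standard library only. It assumes the sample (n − 1) variance and linearly interpolated ("inclusive") quantiles, which is what matches the tables; the variable names are illustrative.

```python
import statistics

# Assumed contents of the 아이디 column: the integers 1..100.
ids = list(range(1, 101))

mean = statistics.mean(ids)
variance = statistics.variance(ids)   # sample variance, divides by n - 1
std = statistics.stdev(ids)           # square root of the sample variance
cv = std / mean                       # coefficient of variation
median = statistics.median(ids)
# Median absolute deviation: median of |x - median|.
mad = statistics.median([abs(x - median) for x in ids])

# "inclusive" quantiles interpolate linearly between the min and max.
q1, q2, q3 = statistics.quantiles(ids, n=4, method="inclusive")
twentieths = statistics.quantiles(ids, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]
iqr = q3 - q1

print(f"sum={sum(ids)} mean={mean} variance={variance:.5f} std={std:.6f}")
print(f"cv={cv:.8f} mad={mad} q1={q1} median={q2} q3={q3} iqr={iqr}")
print(f"p5={p5} p95={p95}")
```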
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
65 | 1 | 1.0% |
75 | 1 | 1.0% |
74 | 1 | 1.0% |
73 | 1 | 1.0% |
72 | 1 | 1.0% |
71 | 1 | 1.0% |
70 | 1 | 1.0% |
69 | 1 | 1.0% |
68 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) |
1 | 1 | 1.0% |
2 | 1 | 1.0% |
3 | 1 | 1.0% |
4 | 1 | 1.0% |
5 | 1 | 1.0% |
6 | 1 | 1.0% |
7 | 1 | 1.0% |
8 | 1 | 1.0% |
9 | 1 | 1.0% |
10 | 1 | 1.0% |
Value | Count | Frequency (%) |
100 | 1 | 1.0% |
99 | 1 | 1.0% |
98 | 1 | 1.0% |
97 | 1 | 1.0% |
96 | 1 | 1.0% |
95 | 1 | 1.0% |
94 | 1 | 1.0% |
93 | 1 | 1.0% |
92 | 1 | 1.0% |
91 | 1 | 1.0% |
월
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 100 | 100.0% |
격자번호
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
다사5955 | 1 | 1.0% |
다사6058 | 1 | 1.0% |
다사5060 | 1 | 1.0% |
다사4954 | 1 | 1.0% |
다사4860 | 1 | 1.0% |
다사5258 | 1 | 1.0% |
다사4758 | 1 | 1.0% |
다사4654 | 1 | 1.0% |
다사4856 | 1 | 1.0% |
다사5159 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Most occurring characters
Value | Count | Frequency (%) |
다 | 100 | 16.7% |
사 | 100 | 16.7% |
4 | 92 | 15.3% |
5 | 79 | 13.2% |
6 | 63 | 10.5% |
0 | 33 | 5.5% |
9 | 28 | 4.7% |
1 | 26 | 4.3% |
2 | 22 | 3.7% |
3 | 22 | 3.7% |
Other values (2) | 35 | 5.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 400 | 66.7% |
Other Letter | 200 | 33.3% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
4 | 92 | |
5 | 79 | |
6 | 63 | |
0 | 33 | 8.2% |
9 | 28 | 7.0% |
1 | 26 | 6.5% |
2 | 22 | 5.5% |
3 | 22 | 5.5% |
8 | 18 | 4.5% |
7 | 17 | 4.2% |
Other Letter
Value | Count | Frequency (%) |
다 | 100 | 50.0% |
사 | 100 | 50.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 400 | 66.7% |
Hangul | 200 | 33.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
4 | 92 | |
5 | 79 | |
6 | 63 | |
0 | 33 | 8.2% |
9 | 28 | 7.0% |
1 | 26 | 6.5% |
2 | 22 | 5.5% |
3 | 22 | 5.5% |
8 | 18 | 4.5% |
7 | 17 | 4.2% |
Hangul
Value | Count | Frequency (%) |
다 | 100 | 50.0% |
사 | 100 | 50.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 400 | 66.7% |
Hangul | 200 | 33.3% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
다 | 100 | 50.0% |
사 | 100 | 50.0% |
ASCII
Value | Count | Frequency (%) |
4 | 92 | |
5 | 79 | |
6 | 63 | |
0 | 33 | 8.2% |
9 | 28 | 7.0% |
1 | 26 | 6.5% |
2 | 22 | 5.5% |
3 | 22 | 5.5% |
8 | 18 | 4.5% |
7 | 17 | 4.2% |
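The character, category, and script tables above can be approximated with Python's standard library. This is a minimal sketch, not the profiler's own implementation: it uses only the ten 격자번호 values listed in the value table of this section (not all 100), and relies on `unicodedata.category`, which returns the two-letter Unicode general category (`Lo` for "Other Letter", `Nd` for "Decimal Number").

```python
import unicodedata
from collections import Counter

# The ten grid IDs shown in the report's value table (a subset of the data).
grid_ids = [
    "다사5955", "다사6058", "다사5060", "다사4954", "다사4860",
    "다사5258", "다사4758", "다사4654", "다사4856", "다사5159",
]

# Count every character across all IDs.
char_counts = Counter("".join(grid_ids))

# Count characters by Unicode general category.
category_counts = Counter(
    unicodedata.category(ch) for gid in grid_ids for ch in gid
)

print(char_counts.most_common(3))
print(dict(category_counts))
```

Each ID consists of two Hangul letters followed by four digits, so the category split is 1:2, the same ratio as the 200 "Other Letter" and 400 "Decimal Number" characters reported for the full column.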
시도코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11 |
---|---|
2nd row | 11 |
3rd row | 11 |
4th row | 11 |
5th row | 11 |
Common Values
Value | Count | Frequency (%) |
11 | 100 | 100.0% |
시도명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
서울 | 100 | 100.0% |
시군구코드
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 11230 |
---|---|
2nd row | 11470 |
3rd row | 11470 |
4th row | 11470 |
5th row | 11470 |
Common Values
Value | Count | Frequency (%) |
11380 | 31 | 31.0% |
11350 | 29 | 29.0% |
11470 | 23 | 23.0% |
11500 | 16 | 16.0% |
11230 | 1 | 1.0% |
시군구명
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 4 |
---|---|
Median length | 3 |
Mean length | 3.01 |
Min length | 3 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 동대문구 |
---|---|
2nd row | 양천구 |
3rd row | 양천구 |
4th row | 양천구 |
5th row | 양천구 |
Common Values
Value | Count | Frequency (%) |
은평구 | 31 | 31.0% |
노원구 | 29 | 29.0% |
양천구 | 23 | 23.0% |
강서구 | 16 | 16.0% |
동대문구 | 1 | 1.0% |
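The frequency percentages, the mean length of 3.01, and the single unique value reported for 시군구명 all follow directly from the counts in the Common Values table above. A short sketch of that arithmetic, using only the counts listed in this report (the variable names are illustrative):

```python
# Counts taken from the Common Values table for 시군구명.
counts = {"은평구": 31, "노원구": 29, "양천구": 23, "강서구": 16, "동대문구": 1}

total = sum(counts.values())

# Frequency (%) of each district name.
freq_pct = {name: round(100 * c / total, 1) for name, c in counts.items()}

# Mean string length, weighted by how often each name occurs.
mean_length = sum(len(name) * c for name, c in counts.items()) / total

# "Unique" counts the values that occur exactly once.
unique_values = sum(1 for c in counts.values() if c == 1)

print(freq_pct)
print(mean_length, unique_values)
```

Only 동대문구 has four characters and it appears once, which is why the mean length is just above 3 while the median length stays at 3.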
생활
Real number (ℝ)
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19202.991 |
Minimum | 181.69273 |
---|---|
Maximum | 56013.644 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 181.69273 |
---|---|
5-th percentile | 789.57422 |
Q1 | 5103.7947 |
median | 16896.295 |
Q3 | 30428.953 |
95-th percentile | 44516.339 |
Maximum | 56013.644 |
Range | 55831.951 |
Interquartile range (IQR) | 25325.158 |
Descriptive statistics
Standard deviation | 14908.527 |
---|---|
Coefficient of variation (CV) | 0.77636484 |
Kurtosis | -0.75368576 |
Mean | 19202.991 |
Median Absolute Deviation (MAD) | 12290.078 |
Skewness | 0.51059981 |
Sum | 1920299.1 |
Variance | 2.2226418 × 10^8 |
Monotonicity | Not monotonic |
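Several of the derived statistics for 생활 can be cross-checked from the other reported values: the coefficient of variation is the standard deviation divided by the mean, the variance is the standard deviation squared, the sum is the mean times the number of observations, and the range and IQR are simple differences. The sketch below does that arithmetic on the rounded figures printed in this report, so small discrepancies in the last digits are expected.

```python
# Rounded values copied from the 생활 tables in this report.
n = 100
mean = 19202.991
std = 14908.527
minimum, maximum = 181.69273, 56013.644
q1, q3 = 5103.7947, 30428.953

cv = std / mean                  # reported: 0.77636484
variance = std ** 2              # reported: 2.2226418 x 10^8
total = mean * n                 # reported: 1920299.1
value_range = maximum - minimum  # reported: 55831.951
iqr = q3 - q1                    # reported: 25325.158

print(f"cv={cv:.6f} variance={variance:.7e} sum={total:.1f}")
print(f"range={value_range:.3f} iqr={iqr:.3f}")
```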
Value | Count | Frequency (%) |
25721.31535754092 | 1 | 1.0% |
19609.399717242875 | 1 | 1.0% |
7490.16451255872 | 1 | 1.0% |
4442.20153736908 | 1 | 1.0% |
14866.96624496108 | 1 | 1.0% |
17252.29416215763 | 1 | 1.0% |
707.762576316128 | 1 | 1.0% |
1648.2540008263811 | 1 | 1.0% |
5915.142302889724 | 1 | 1.0% |
46106.79784633226 | 1 | 1.0% |
Other values (90) | 90 | 90.0% |
Value | Count | Frequency (%) |
181.69273278350448 | 1 | 1.0% |
580.2682424093317 | 1 | 1.0% |
631.5210653552972 | 1 | 1.0% |
707.762576316128 | 1 | 1.0% |
720.9234435742649 | 1 | 1.0% |
793.187422757501 | 1 | 1.0% |
854.0882543697588 | 1 | 1.0% |
915.7423252128028 | 1 | 1.0% |
1025.207814435979 | 1 | 1.0% |
1084.0191628314158 | 1 | 1.0% |
Value | Count | Frequency (%) |
56013.64406394056 | 1 | 1.0% |
54994.25719577911 | 1 | 1.0% |
47956.50516746902 | 1 | 1.0% |
46106.79784633226 | 1 | 1.0% |
44527.55405723026 | 1 | 1.0% |
44515.74919376682 | 1 | 1.0% |
44189.25420017929 | 1 | 1.0% |
43589.11984188452 | 1 | 1.0% |
42605.47832265973 | 1 | 1.0% |
42329.76223715043 | 1 | 1.0% |
 | 아이디 | 격자번호 | 시군구코드 | 시군구명 | 생활 |
---|---|---|---|---|---|
아이디 | 1.000 | 1.000 | 0.937 | 0.937 | 0.000 |
격자번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
시군구코드 | 0.937 | 1.000 | 1.000 | 1.000 | 0.330 |
시군구명 | 0.937 | 1.000 | 1.000 | 1.000 | 0.330 |
생활 | 0.000 | 1.000 | 0.330 | 0.330 | 1.000 |
 | 시군구코드 | 시군구명 |
---|---|---|
시군구코드 | 1.000 | 1.000 |
시군구명 | 1.000 | 1.000 |
 | 아이디 | 생활 | 시군구코드 | 시군구명 |
---|---|---|---|---|
아이디 | 1.000 | -0.240 | 0.644 | 0.644 |
생활 | -0.240 | 1.000 | 0.132 | 0.132 |
시군구코드 | 0.644 | 0.132 | 1.000 | 1.000 |
시군구명 | 0.644 | 0.132 | 1.000 | 1.000 |
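The 1.000 entries between 시군구코드 and 시군구명 reflect that each district code maps to exactly one district name. The report does not say which coefficient each matrix uses, so the following is only an illustrative sketch: it computes the uncorrected Cramér's V, a common association measure for two categorical columns, with the standard library. The input is the 20 (code, name) pairs visible in the first and last sample rows of this report, not the full 100-row dataset, and `cramers_v` is a hypothetical helper name.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for x, rx in row_totals.items():
        for y, cy in col_totals.items():
            expected = rx * cy / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else float("nan")


# (시군구코드, 시군구명) pairs from the 20 sample rows shown in the report.
pairs = (
    [("11230", "동대문구")]
    + [("11470", "양천구")] * 6
    + [("11350", "노원구")] * 8
    + [("11380", "은평구")] * 5
)
codes = [code for code, _ in pairs]
names = [name for _, name in pairs]

v = cramers_v(codes, names)
print(round(v, 3))
```

Because the mapping between code and name is one-to-one, the statistic reaches its maximum, matching the 1.000 shown in the matrices.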
 | 아이디 | 월 | 격자번호 | 시도코드 | 시도명 | 시군구코드 | 시군구명 | 생활 |
---|---|---|---|---|---|---|---|---|
0 | 1 | 1 | 다사5955 | 11 | 서울 | 11230 | 동대문구 | 25721.315358 |
1 | 2 | 1 | 다사4350 | 11 | 서울 | 11470 | 양천구 | 30229.420884 |
2 | 3 | 1 | 다사4245 | 11 | 서울 | 11470 | 양천구 | 23239.812655 |
3 | 4 | 1 | 다사4246 | 11 | 서울 | 11470 | 양천구 | 23091.562188 |
4 | 5 | 1 | 다사4450 | 11 | 서울 | 11470 | 양천구 | 33078.495522 |
5 | 6 | 1 | 다사4048 | 11 | 서울 | 11470 | 양천구 | 22940.474798 |
6 | 7 | 1 | 다사4345 | 11 | 서울 | 11470 | 양천구 | 32490.488504 |
7 | 8 | 1 | 다사6264 | 11 | 서울 | 11350 | 노원구 | 1733.857395 |
8 | 9 | 1 | 다사6059 | 11 | 서울 | 11350 | 노원구 | 9855.377281 |
9 | 10 | 1 | 다사6057 | 11 | 서울 | 11350 | 노원구 | 16488.024304 |
 | 아이디 | 월 | 격자번호 | 시도코드 | 시도명 | 시군구코드 | 시군구명 | 생활 |
---|---|---|---|---|---|---|---|---|
90 | 91 | 1 | 다사4554 | 11 | 서울 | 11380 | 은평구 | 10363.019004 |
91 | 92 | 1 | 다사5062 | 11 | 서울 | 11380 | 은평구 | 580.268242 |
92 | 93 | 1 | 다사4955 | 11 | 서울 | 11380 | 은평구 | 25810.493667 |
93 | 94 | 1 | 다사5156 | 11 | 서울 | 11380 | 은평구 | 5158.80745 |
94 | 95 | 1 | 다사4959 | 11 | 서울 | 11380 | 은평구 | 15563.65756 |
95 | 96 | 1 | 다사6164 | 11 | 서울 | 11350 | 노원구 | 3771.479464 |
96 | 97 | 1 | 다사6560 | 11 | 서울 | 11350 | 노원구 | 6816.149376 |
97 | 98 | 1 | 다사6160 | 11 | 서울 | 11350 | 노원구 | 42329.762237 |
98 | 99 | 1 | 다사6364 | 11 | 서울 | 11350 | 노원구 | 720.923444 |
99 | 100 | 1 | 다사6061 | 11 | 서울 | 11350 | 노원구 | 33912.143883 |