Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 27.1 KiB |
Average record size in memory | 69.3 B |
Variable types
DateTime | 1 |
---|---|
Categorical | 5 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | ㈜케이티 (KT Corporation) |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KT1DMWKPLDSM00000001 |
Alerts
ETL일시 has constant value "2020-02-10 00:12:32" | Constant |
기준년월일 has constant value "20200201" | Constant |
행정동코드 has constant value "11110560" | Constant |
ELT날짜 has constant value "20200201" | Constant |
24시간대구분코드 is highly overall correlated with 인구수 | High correlation |
인구수 is highly overall correlated with 24시간대구분코드 | High correlation |
24시간대구분코드 has 28 (7.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 06:39:28.271344 |
---|---|
Analysis finished | 2023-12-10 06:39:29.525410 |
Duration | 1.25 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
ETL일시
Date
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Minimum | 2020-02-10 00:12:32 |
---|---|
Maximum | 2020-02-10 00:12:32 |
기준년월일
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20200201 |
---|---|
2nd row | 20200201 |
3rd row | 20200201 |
4th row | 20200201 |
5th row | 20200201 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
20200201 | 400 | 100.0% |
24시간대구분코드
Real number (ℝ)
HIGH CORRELATION, ZEROS
Distinct | 16 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.0725 |
Minimum | 0 |
---|---|
Maximum | 15 |
Zeros | 28 |
Zeros (%) | 7.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 3 |
median | 7 |
Q3 | 11 |
95-th percentile | 14 |
Maximum | 15 |
Range | 15 |
Interquartile range (IQR) | 8 |
Descriptive statistics
Standard deviation | 4.3892377 |
---|---|
Coefficient of variation (CV) | 0.62060624 |
Kurtosis | -1.2240289 |
Mean | 7.0725 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -0.015163499 |
Sum | 2829 |
Variance | 19.265407 |
Monotonicity | Increasing |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 28 | 7.0% |
13 | 28 | 7.0% |
9 | 28 | 7.0% |
2 | 27 | 6.8% |
14 | 27 | 6.8% |
10 | 27 | 6.8% |
12 | 27 | 6.8% |
3 | 26 | 6.5% |
4 | 26 | 6.5% |
6 | 26 | 6.5% |
Other values (6) | 130 | 32.5% |
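The Frequency (%) column is each count divided by the 400 observations, shown to one decimal place. The following is a minimal Python sketch of that arithmetic, assuming plain one-decimal string formatting; the helper name `frequency_pct` is illustrative and is not part of ydata-profiling.

```python
N_OBSERVATIONS = 400  # "Number of observations" from the dataset statistics


def frequency_pct(count: int, total: int = N_OBSERVATIONS) -> str:
    """Format a count as a share of all rows, to one decimal place."""
    return f"{100 * count / total:.1f}%"


# Counts taken from the table above for values 0, 2 and 3.
for value, count in [(0, 28), (2, 27), (3, 26)]:
    print(value, count, frequency_pct(count))

# Share of the rows grouped under "Other values (6)".
print("Other values (6)", 130, frequency_pct(130))
```

With this formatting, 27 of 400 rows (exactly 6.75%) is displayed as 6.8%, which agrees with the figures the report shows.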
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 28 | 7.0% |
1 | 26 | 6.5% |
2 | 27 | 6.8% |
3 | 26 | 6.5% |
4 | 26 | 6.5% |
5 | 25 | 6.2% |
6 | 26 | 6.5% |
7 | 26 | 6.5% |
8 | 25 | 6.2% |
9 | 28 | 7.0% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
15 | 2 | 0.5% |
14 | 27 | 6.8% |
13 | 28 | 7.0% |
12 | 27 | 6.8% |
11 | 26 | 6.5% |
10 | 27 | 6.8% |
9 | 28 | 7.0% |
8 | 25 | 6.2% |
7 | 26 | 6.5% |
6 | 26 | 6.5% |
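Because the two extreme-value tables together list all 16 distinct values of 24시간대구분코드, the descriptive statistics can be rebuilt from the counts alone. The sketch below does that in plain Python. It assumes the sample variance (division by n - 1), an assumption that happens to agree with the reported variance of 19.265407.

```python
import math

# Value -> count for 24시간대구분코드, taken from the two tables above.
counts = {0: 28, 1: 26, 2: 27, 3: 26, 4: 26, 5: 25, 6: 26, 7: 26,
          8: 25, 9: 28, 10: 27, 11: 26, 12: 27, 13: 28, 14: 27, 15: 2}

n = sum(counts.values())                        # number of observations
total = sum(v * c for v, c in counts.items())   # "Sum" in the report
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1
# (assumed convention, chosen because it matches the reported figure).
variance = sum(c * (v - mean) ** 2 for v, c in counts.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(n, total, mean)
print(round(variance, 4), round(std, 4), round(cv, 4))
```

The rebuilt values can be compared with the Sum, Mean, Variance, Standard deviation and Coefficient of variation (CV) rows reported for this variable.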
성별구분코드
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | F |
---|---|
2nd row | F |
3rd row | F |
4th row | F |
5th row | F |
Common Values
Value | Count | Frequency (%) |
---|---|---|
F | 200 | 50.0% |
M | 200 | 50.0% |
연령대구분코드
Categorical
Distinct | 14 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | age_00 |
---|---|
2nd row | age_10 |
3rd row | age_15 |
4th row | age_20 |
5th row | age_25 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
age_15 | 30 | 7.5% |
age_20 | 30 | 7.5% |
age_25 | 30 | 7.5% |
age_30 | 30 | 7.5% |
age_35 | 30 | 7.5% |
age_40 | 30 | 7.5% |
age_45 | 30 | 7.5% |
age_50 | 30 | 7.5% |
age_55 | 30 | 7.5% |
age_60 | 30 | 7.5% |
Other values (4) | 100 | 25.0% |
행정동코드
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11110560 |
---|---|
2nd row | 11110560 |
3rd row | 11110560 |
4th row | 11110560 |
5th row | 11110560 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
11110560 | 400 | 100.0% |
인구수
Real number (ℝ)
HIGH CORRELATION
Distinct | 72 |
---|---|
Distinct (%) | 18.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.5625 |
Minimum | 1 |
---|---|
Maximum | 110 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 6 |
median | 11.5 |
Q3 | 21 |
95-th percentile | 59.15 |
Maximum | 110 |
Range | 109 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 18.745388 |
---|---|
Coefficient of variation (CV) | 1.067353 |
Kurtosis | 5.925484 |
Mean | 17.5625 |
Median Absolute Deviation (MAD) | 6.5 |
Skewness | 2.3062844 |
Sum | 7025 |
Variance | 351.38957 |
Monotonicity | Not monotonic |
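Several of the figures reported for 인구수 are derived from others: the mean from the sum, the variance from the standard deviation, and the range and IQR from the quantiles. The sketch below is a small consistency check over the reported numbers only. The dictionary keys are illustrative names, not fields of the report.

```python
import math

# Figures reported above for 인구수 (400 observations).
n = 400
reported = {
    "sum": 7025, "mean": 17.5625, "std": 18.745388, "variance": 351.38957,
    "cv": 1.067353, "min": 1, "max": 110, "q1": 6, "q3": 21,
    "range": 109, "iqr": 15,
}

# Each derived statistic recomputed from the more basic reported ones.
derived = {
    "mean": reported["sum"] / n,
    "variance": reported["std"] ** 2,
    "cv": reported["std"] / reported["mean"],
    "range": reported["max"] - reported["min"],
    "iqr": reported["q3"] - reported["q1"],
}

# True where the recomputed value agrees with the reported one.
checks = {
    name: math.isclose(value, reported[name], rel_tol=1e-6)
    for name, value in derived.items()
}
for name, ok in checks.items():
    print(f"{name}: derived {derived[name]:.6f}, reported {reported[name]}, match={ok}")
```

The mean (17.5625) lies well above the median (11.5), which is in line with the positive skewness of 2.3062844.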
Common Values
Value | Count | Frequency (%) |
---|---|---|
5 | 25 | 6.2% |
7 | 25 | 6.2% |
6 | 24 | 6.0% |
1 | 21 | 5.2% |
4 | 19 | 4.8% |
10 | 18 | 4.5% |
3 | 17 | 4.2% |
9 | 17 | 4.2% |
8 | 13 | 3.2% |
14 | 12 | 3.0% |
Other values (62) | 209 | 52.2% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
1 | 21 | 5.2% |
2 | 12 | 3.0% |
3 | 17 | 4.2% |
4 | 19 | 4.8% |
5 | 25 | 6.2% |
6 | 24 | 6.0% |
7 | 25 | 6.2% |
8 | 13 | 3.2% |
9 | 17 | 4.2% |
10 | 18 | 4.5% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
110 | 1 | 0.2% |
99 | 1 | 0.2% |
96 | 1 | 0.2% |
95 | 1 | 0.2% |
93 | 2 | 0.5% |
89 | 1 | 0.2% |
83 | 1 | 0.2% |
81 | 1 | 0.2% |
79 | 1 | 0.2% |
78 | 1 | 0.2% |
ELT날짜
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20200201 |
---|---|
2nd row | 20200201 |
3rd row | 20200201 |
4th row | 20200201 |
5th row | 20200201 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
20200201 | 400 | 100.0% |
Correlations
Variable | 24시간대구분코드 | 성별구분코드 | 연령대구분코드 | 인구수 |
---|---|---|---|---|
24시간대구분코드 | 1.000 | 0.000 | 0.000 | 0.631 |
성별구분코드 | 0.000 | 1.000 | 0.000 | 0.109 |
연령대구분코드 | 0.000 | 0.000 | 1.000 | 0.405 |
인구수 | 0.631 | 0.109 | 0.405 | 1.000 |
Variable | 연령대구분코드 | 성별구분코드 |
---|---|---|
연령대구분코드 | 1.000 | 0.000 |
성별구분코드 | 0.000 | 1.000 |
Variable | 24시간대구분코드 | 인구수 | 성별구분코드 | 연령대구분코드 |
---|---|---|---|---|
24시간대구분코드 | 1.000 | 0.512 | 0.000 | 0.000 |
인구수 | 0.512 | 1.000 | 0.083 | 0.175 |
성별구분코드 | 0.000 | 0.083 | 1.000 | 0.000 |
연령대구분코드 | 0.000 | 0.175 | 0.000 | 1.000 |
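The High correlation alerts name only the pair 24시간대구분코드 and 인구수. The sketch below shows one way such pairs could be picked out of the first matrix above. The cut-off of 0.5 is an assumption, chosen because it separates the flagged coefficient (0.631) from the unflagged ones (0.405 and below); it was not read from config.json. `high_pairs` is a hypothetical helper, not a ydata-profiling function.

```python
from itertools import combinations

HIGH_CORRELATION_THRESHOLD = 0.5  # assumed cut-off, not read from config.json

names = ["24시간대구분코드", "성별구분코드", "연령대구분코드", "인구수"]
# First correlation matrix above; rows and columns follow the order of `names`.
matrix = [
    [1.000, 0.000, 0.000, 0.631],
    [0.000, 1.000, 0.000, 0.109],
    [0.000, 0.000, 1.000, 0.405],
    [0.631, 0.109, 0.405, 1.000],
]


def high_pairs(names, matrix, threshold=HIGH_CORRELATION_THRESHOLD):
    """Return (name_a, name_b, coefficient) for each pair at or above threshold."""
    return [
        (names[i], names[j], matrix[i][j])
        for i, j in combinations(range(len(names)), 2)  # upper triangle only
        if abs(matrix[i][j]) >= threshold
    ]


flagged = high_pairs(names, matrix)
print(flagged)
```

Under the same assumed cut-off, the third matrix yields the same single pair, with a coefficient of 0.512.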
First rows
Row | ETL일시 | 기준년월일 | 24시간대구분코드 | 성별구분코드 | 연령대구분코드 | 행정동코드 | 인구수 | ELT날짜 |
---|---|---|---|---|---|---|---|---|
0 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_00 | 11110560 | 6 | 20200201 |
1 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_10 | 11110560 | 12 | 20200201 |
2 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_15 | 11110560 | 17 | 20200201 |
3 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_20 | 11110560 | 34 | 20200201 |
4 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_25 | 11110560 | 18 | 20200201 |
5 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_30 | 11110560 | 15 | 20200201 |
6 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_35 | 11110560 | 24 | 20200201 |
7 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_40 | 11110560 | 20 | 20200201 |
8 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_45 | 11110560 | 29 | 20200201 |
9 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_50 | 11110560 | 30 | 20200201 |
Last rows
Row | ETL일시 | 기준년월일 | 24시간대구분코드 | 성별구분코드 | 연령대구분코드 | 행정동코드 | 인구수 | ELT날짜 |
---|---|---|---|---|---|---|---|---|
390 | 2020-02-10 00:12:32.0 | 20200201 | 14 | M | age_35 | 11110560 | 25 | 20200201 |
391 | 2020-02-10 00:12:32.0 | 20200201 | 14 | M | age_40 | 11110560 | 43 | 20200201 |
392 | 2020-02-10 00:12:32.0 | 20200201 | 14 | M | age_45 | 11110560 | 58 | 20200201 |
393 | 2020-02-10 00:12:32.0 | 20200201 | 14 | M | age_50 | 11110560 | 59 | 20200201 |
394 | 2020-02-10 00:12:32.0 | 20200201 | 14 | M | age_55 | 11110560 | 75 | 20200201 |
395 | 2020-02-10 00:12:32.0 | 20200201 | 14 | M | age_60 | 11110560 | 64 | 20200201 |
396 | 2020-02-10 00:12:32.0 | 20200201 | 14 | M | age_65 | 11110560 | 26 | 20200201 |
397 | 2020-02-10 00:12:32.0 | 20200201 | 14 | M | age_70 | 11110560 | 25 | 20200201 |
398 | 2020-02-10 00:12:32.0 | 20200201 | 15 | F | age_00 | 11110560 | 2 | 20200201 |
399 | 2020-02-10 00:12:32.0 | 20200201 | 15 | F | age_10 | 11110560 | 10 | 20200201 |
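The report types 기준년월일 and ELT날짜 as Categorical, although the sample rows show them holding eight-digit dates. The sketch below parses one of the rows above into typed fields. The `Record` class and its English field names are illustrative glosses of the Korean column names, not part of the dataset, and the `YYYYMMDD` reading of the date columns is an assumption based on the sample values.

```python
from dataclasses import dataclass
from datetime import date, datetime


@dataclass
class Record:
    etl_timestamp: datetime  # ETL일시
    base_date: date          # 기준년월일 (assumed YYYYMMDD)
    hour_code: int           # 24시간대구분코드
    sex: str                 # 성별구분코드
    age_group: str           # 연령대구분코드
    district_code: str       # 행정동코드 (kept as text, it is an identifier)
    population: int          # 인구수
    elt_date: date           # ELT날짜 (assumed YYYYMMDD)


def parse_row(line: str) -> Record:
    """Parse one pipe-delimited sample row; the leading row index is ignored."""
    cells = [cell.strip() for cell in line.split("|")]
    _, etl, base, hour, sex, age, district, pop, elt = cells[:9]
    return Record(
        etl_timestamp=datetime.strptime(etl, "%Y-%m-%d %H:%M:%S.%f"),
        base_date=datetime.strptime(base, "%Y%m%d").date(),
        hour_code=int(hour),
        sex=sex,
        age_group=age,
        district_code=district,
        population=int(pop),
        elt_date=datetime.strptime(elt, "%Y%m%d").date(),
    )


row = parse_row(
    "3 | 2020-02-10 00:12:32.0 | 20200201 | 0 | F | age_20 | 11110560 | 34 | 20200201 |"
)
print(row)
```

Keeping 행정동코드 as a string avoids treating an identifier as a quantity, which matches the Categorical type the report assigns to it.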