Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 27.1 KiB |
Average record size in memory | 69.3 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | ㈜케이티 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KT1DMFPOPLDSM0000001 |
ETL일시 has constant value "" | Constant |
기준년월일 has constant value "" | Constant |
행정동코드 has constant value "" | Constant |
ETL날짜 has constant value "" | Constant |
24시간대구분코드 is highly overall correlated with 인구수 | High correlation |
인구수 is highly overall correlated with 24시간대구분코드 | High correlation |
24시간대구분코드 has 28 (7.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 06:23:14.267821 |
---|---|
Analysis finished | 2023-12-10 06:23:15.549732 |
Duration | 1.28 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
ETL일시
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
2020-02-10 00:12:43.0 |
---|
Length
Max length | 21 |
---|---|
Median length | 21 |
Mean length | 21 |
Min length | 21 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-02-10 00:12:43.0 |
---|---|
2nd row | 2020-02-10 00:12:43.0 |
3rd row | 2020-02-10 00:12:43.0 |
4th row | 2020-02-10 00:12:43.0 |
5th row | 2020-02-10 00:12:43.0 |
Common Values
Value | Count | Frequency (%) |
2020-02-10 00:12:43.0 | 400 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-02-10 | 400 | |
00:12:43.0 | 400 |
기준년월일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
20200201 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20200201 |
---|---|
2nd row | 20200201 |
3rd row | 20200201 |
4th row | 20200201 |
5th row | 20200201 |
Common Values
Value | Count | Frequency (%) |
20200201 | 400 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20200201 | 400 |
24시간대구분코드
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 15 |
---|---|
Distinct (%) | 3.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.7925 |
Minimum | 0 |
---|---|
Maximum | 14 |
Zeros | 28 |
Zeros (%) | 7.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 3 |
median | 7 |
Q3 | 10 |
95-th percentile | 13 |
Maximum | 14 |
Range | 14 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 4.2129343 |
---|---|
Coefficient of variation (CV) | 0.62023325 |
Kurtosis | -1.2031045 |
Mean | 6.7925 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -0.0055588647 |
Sum | 2717 |
Variance | 17.748816 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
0 | 28 | 7.0% |
1 | 28 | 7.0% |
8 | 28 | 7.0% |
9 | 28 | 7.0% |
10 | 28 | 7.0% |
11 | 28 | 7.0% |
12 | 28 | 7.0% |
13 | 28 | 7.0% |
2 | 27 | 6.8% |
3 | 27 | 6.8% |
Other values (5) | 122 |
Value | Count | Frequency (%) |
0 | 28 | |
1 | 28 | |
2 | 27 | |
3 | 27 | |
4 | 27 | |
5 | 27 | |
6 | 27 | |
7 | 27 | |
8 | 28 | |
9 | 28 |
Value | Count | Frequency (%) |
14 | 14 | |
13 | 28 | |
12 | 28 | |
11 | 28 | |
10 | 28 | |
9 | 28 | |
8 | 28 | |
7 | 27 | |
6 | 27 | |
5 | 27 |
성별구분코드
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
F | |
---|---|
M |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | F |
---|---|
2nd row | F |
3rd row | F |
4th row | F |
5th row | F |
Common Values
Value | Count | Frequency (%) |
F | 207 | |
M | 193 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
f | 207 | |
m | 193 |
연령대구분코드
Categorical
Distinct | 14 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
age_10 | |
---|---|
age_15 | |
age_20 | |
age_25 | |
age_30 | |
Other values (9) |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | age_00 |
---|---|
2nd row | age_10 |
3rd row | age_15 |
4th row | age_20 |
5th row | age_25 |
Common Values
Value | Count | Frequency (%) |
age_10 | 29 | 7.2% |
age_15 | 29 | 7.2% |
age_20 | 29 | 7.2% |
age_25 | 29 | 7.2% |
age_30 | 29 | 7.2% |
age_35 | 29 | 7.2% |
age_40 | 29 | 7.2% |
age_45 | 29 | 7.2% |
age_50 | 29 | 7.2% |
age_55 | 29 | 7.2% |
Other values (4) | 110 |
Length
Value | Count | Frequency (%) |
age_10 | 29 | 7.2% |
age_15 | 29 | 7.2% |
age_20 | 29 | 7.2% |
age_25 | 29 | 7.2% |
age_30 | 29 | 7.2% |
age_35 | 29 | 7.2% |
age_40 | 29 | 7.2% |
age_45 | 29 | 7.2% |
age_50 | 29 | 7.2% |
age_55 | 29 | 7.2% |
Other values (4) | 110 |
행정동코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
11110560 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11110560 |
---|---|
2nd row | 11110560 |
3rd row | 11110560 |
4th row | 11110560 |
5th row | 11110560 |
Common Values
Value | Count | Frequency (%) |
11110560 | 400 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
11110560 | 400 |
인구수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 225 |
---|---|
Distinct (%) | 56.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 129.4775 |
Minimum | 1 |
---|---|
Maximum | 566 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 6 |
Q1 | 37 |
median | 85 |
Q3 | 199.25 |
95-th percentile | 376 |
Maximum | 566 |
Range | 565 |
Interquartile range (IQR) | 162.25 |
Descriptive statistics
Standard deviation | 121.08784 |
---|---|
Coefficient of variation (CV) | 0.93520373 |
Kurtosis | 0.85193175 |
Mean | 129.4775 |
Median Absolute Deviation (MAD) | 61 |
Skewness | 1.2148892 |
Sum | 51791 |
Variance | 14662.265 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
41 | 6 | 1.5% |
55 | 6 | 1.5% |
27 | 6 | 1.5% |
6 | 5 | 1.2% |
24 | 5 | 1.2% |
14 | 5 | 1.2% |
1 | 5 | 1.2% |
3 | 5 | 1.2% |
16 | 5 | 1.2% |
89 | 4 | 1.0% |
Other values (215) | 348 |
Value | Count | Frequency (%) |
1 | 5 | |
2 | 3 | |
3 | 5 | |
4 | 3 | |
5 | 3 | |
6 | 5 | |
7 | 2 | 0.5% |
8 | 4 | |
9 | 3 | |
11 | 1 | 0.2% |
Value | Count | Frequency (%) |
566 | 1 | |
526 | 1 | |
513 | 1 | |
511 | 1 | |
485 | 1 | |
476 | 1 | |
464 | 1 | |
463 | 1 | |
459 | 1 | |
457 | 1 |
ETL날짜
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
20200201 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20200201 |
---|---|
2nd row | 20200201 |
3rd row | 20200201 |
4th row | 20200201 |
5th row | 20200201 |
Common Values
Value | Count | Frequency (%) |
20200201 | 400 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20200201 | 400 |
24시간대구분코드 | 성별구분코드 | 연령대구분코드 | 인구수 | |
---|---|---|---|---|
24시간대구분코드 | 1.000 | 0.000 | 0.000 | 0.578 |
성별구분코드 | 0.000 | 1.000 | 0.000 | 0.319 |
연령대구분코드 | 0.000 | 0.000 | 1.000 | 0.516 |
인구수 | 0.578 | 0.319 | 0.516 | 1.000 |
성별구분코드 | 연령대구분코드 | |
---|---|---|
성별구분코드 | 1.000 | 0.000 |
연령대구분코드 | 0.000 | 1.000 |
24시간대구분코드 | 인구수 | 성별구분코드 | 연령대구분코드 | |
---|---|---|---|---|
24시간대구분코드 | 1.000 | 0.532 | 0.000 | 0.000 |
인구수 | 0.532 | 1.000 | 0.239 | 0.235 |
성별구분코드 | 0.000 | 0.239 | 1.000 | 0.000 |
연령대구분코드 | 0.000 | 0.235 | 0.000 | 1.000 |
ETL일시 | 기준년월일 | 24시간대구분코드 | 성별구분코드 | 연령대구분코드 | 행정동코드 | 인구수 | ETL날짜 | |
---|---|---|---|---|---|---|---|---|
0 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_00 | 11110560 | 7 | 20200201 |
1 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_10 | 11110560 | 13 | 20200201 |
2 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_15 | 11110560 | 45 | 20200201 |
3 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_20 | 11110560 | 120 | 20200201 |
4 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_25 | 11110560 | 134 | 20200201 |
5 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_30 | 11110560 | 93 | 20200201 |
6 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_35 | 11110560 | 82 | 20200201 |
7 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_40 | 11110560 | 69 | 20200201 |
8 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_45 | 11110560 | 105 | 20200201 |
9 | 2020-02-10 00:12:43.0 | 20200201 | 0 | F | age_50 | 11110560 | 98 | 20200201 |
ETL일시 | 기준년월일 | 24시간대구분코드 | 성별구분코드 | 연령대구분코드 | 행정동코드 | 인구수 | ETL날짜 | |
---|---|---|---|---|---|---|---|---|
390 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_25 | 11110560 | 235 | 20200201 |
391 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_30 | 11110560 | 202 | 20200201 |
392 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_35 | 11110560 | 254 | 20200201 |
393 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_40 | 11110560 | 193 | 20200201 |
394 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_45 | 11110560 | 303 | 20200201 |
395 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_50 | 11110560 | 425 | 20200201 |
396 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_55 | 11110560 | 360 | 20200201 |
397 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_60 | 11110560 | 255 | 20200201 |
398 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_65 | 11110560 | 131 | 20200201 |
399 | 2020-02-10 00:12:43.0 | 20200201 | 14 | F | age_70 | 11110560 | 125 | 20200201 |