Dataset statistics
Number of variables | 10 |
---|---|
Number of observations | 400 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 33.7 KiB |
Average record size in memory | 86.3 B |
Variable types
DateTime | 1 |
---|---|
Categorical | 6 |
Numeric | 3 |
Dataset
Description | Sample |
---|---|
Author | ㈜케이티 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=KT1DMLFPLDSM00000001 |
ETL일시 has constant value "2020-02-10 00:14:58" | Constant |
원천테이블 has constant value "_" | Constant |
행정동코드 has constant value "11110560" | Constant |
기준년월일 has constant value "20200201" | Constant |
내국인수 is highly overall correlated with 성별구분코드 and 1 other fields | High correlation |
성별구분코드 is highly overall correlated with 내국인수 and 1 other fields | High correlation |
연령대구분코드 is highly overall correlated with 내국인수 and 1 other fields | High correlation |
단기외국인수 is highly imbalanced (86.6%) | Imbalance |
24시간대구분코드 has 30 (7.5%) zeros | Zeros |
내국인수 has 28 (7.0%) zeros | Zeros |
장기외국인수 has 386 (96.5%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 06:18:25.786834 |
---|---|
Analysis finished | 2023-12-10 06:18:28.540172 |
Duration | 2.75 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
ETL일시
Date
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Minimum | 2020-02-10 00:14:58 |
---|---|
Maximum | 2020-02-10 00:14:58 |
원천테이블
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | _ |
---|---|
2nd row | _ |
3rd row | _ |
4th row | _ |
5th row | _ |
Common Values
Value | Count | Frequency (%) |
---|---|---|
_ | 400 | 100.0% |
24시간대구분코드
Real number (ℝ)
ZEROS
 
Distinct | 14 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.175 |
Minimum | 0 |
---|---|
Maximum | 13 |
Zeros | 30 |
Zeros (%) | 7.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 3 |
median | 6 |
Q3 | 9.25 |
95-th percentile | 12 |
Maximum | 13 |
Range | 13 |
Interquartile range (IQR) | 6.25 |
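The quartiles above can be rechecked from the value counts this report lists for 24시간대구분코드 (codes 0 to 12 occur 30 times each, code 13 occurs 10 times). The following is a minimal sketch, assuming the profiler interpolates linearly between order statistics, which is what Python's `statistics.quantiles` does with `method="inclusive"`. The variable names are illustrative only.

```python
from statistics import quantiles

# Value counts as listed in this report: codes 0..12 appear 30 times, code 13 appears 10 times.
counts = {code: 30 for code in range(13)}
counts[13] = 10

# Expand the counts back into the 400 sorted observations.
values = [code for code, n in sorted(counts.items()) for _ in range(n)]

# "inclusive" interpolates linearly at position (n - 1) * p in the sorted data.
q1, median, q3 = quantiles(values, n=4, method="inclusive")
iqr = q3 - q1
print(q1, median, q3, iqr)
```

Q3 falls at position 0.75 × 399 = 299.25 in the sorted data, between a 9 and a 10, which gives the reported 9.25.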
Descriptive statistics
Standard deviation | 3.857665 |
---|---|
Coefficient of variation (CV) | 0.62472307 |
Kurtosis | -1.1937208 |
Mean | 6.175 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0.013625252 |
Sum | 2470 |
Variance | 14.881579 |
Monotonicity | Increasing |
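The descriptive statistics above follow directly from the value counts of 24시간대구분코드 (30 rows for each code from 0 to 12, 10 rows for code 13). The sketch below recomputes them in plain Python. It assumes the variance is the sample variance (divisor n − 1), an assumption that happens to match the reported figures. The variable names are illustrative.

```python
from math import sqrt

# Value counts as listed in this report.
counts = {code: 30 for code in range(13)}
counts[13] = 10

n = sum(counts.values())                               # number of observations
total = sum(code * k for code, k in counts.items())    # "Sum"
mean = total / n                                       # "Mean"

# Sample variance (divides by n - 1); assumed, not stated in the report.
variance = sum(k * (code - mean) ** 2 for code, k in counts.items()) / (n - 1)
std = sqrt(variance)                                   # "Standard deviation"
cv = std / mean                                        # "Coefficient of variation (CV)"

print(n, total, mean, round(variance, 6), round(std, 6), round(cv, 6))
```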
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 30 | 7.5% |
1 | 30 | 7.5% |
2 | 30 | 7.5% |
3 | 30 | 7.5% |
4 | 30 | 7.5% |
5 | 30 | 7.5% |
6 | 30 | 7.5% |
7 | 30 | 7.5% |
8 | 30 | 7.5% |
9 | 30 | 7.5% |
Other values (4) | 100 | 25.0% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 30 | 7.5% |
1 | 30 | 7.5% |
2 | 30 | 7.5% |
3 | 30 | 7.5% |
4 | 30 | 7.5% |
5 | 30 | 7.5% |
6 | 30 | 7.5% |
7 | 30 | 7.5% |
8 | 30 | 7.5% |
9 | 30 | 7.5% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
13 | 10 | 2.5% |
12 | 30 | 7.5% |
11 | 30 | 7.5% |
10 | 30 | 7.5% |
9 | 30 | 7.5% |
8 | 30 | 7.5% |
7 | 30 | 7.5% |
6 | 30 | 7.5% |
5 | 30 | 7.5% |
4 | 30 | 7.5% |
성별구분코드
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | - |
---|---|
2nd row | - |
3rd row | F |
4th row | F |
5th row | F |
Common Values
Value | Count | Frequency (%) |
---|---|---|
F | 190 | 47.5% |
M | 182 | 45.5% |
- | 28 | 7.0% |
연령대구분코드
Categorical
HIGH CORRELATION
 
Distinct | 14 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 5.65 |
Min length | 1 |
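The mean length of 5.65 is consistent with the value counts reported for this variable: 28 rows hold the one-character placeholder `_`, and the remaining 372 rows hold labels of the form `age_NN`. A small arithmetic check, assuming every age label is exactly six characters long (only some labels are visible in this report):

```python
# String lengths implied by the value counts: 28 rows of "_" (1 character),
# 372 rows of labels such as "age_10" (assumed to be 6 characters each).
length_counts = {1: 28, 6: 372}

n = sum(length_counts.values())
mean_length = sum(length * k for length, k in length_counts.items()) / n
print(n, mean_length)
```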
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | _ |
---|---|
2nd row | _ |
3rd row | age_10 |
4th row | age_15 |
5th row | age_20 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
age_70 | 52 | 13.0% |
_ | 28 | 7.0% |
age_10 | 27 | 6.8% |
age_15 | 27 | 6.8% |
age_20 | 27 | 6.8% |
age_25 | 27 | 6.8% |
age_30 | 27 | 6.8% |
age_35 | 27 | 6.8% |
age_40 | 27 | 6.8% |
age_45 | 27 | 6.8% |
Other values (4) | 104 | 26.0% |
행정동코드
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 11110560 |
---|---|
2nd row | 11110560 |
3rd row | 11110560 |
4th row | 11110560 |
5th row | 11110560 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
11110560 | 400 | 100.0% |
내국인수
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 142 |
---|---|
Distinct (%) | 35.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 103.875 |
Minimum | 0 |
---|---|
Maximum | 235 |
Zeros | 28 |
Zeros (%) | 7.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 68 |
median | 107 |
Q3 | 142 |
95-th percentile | 187 |
Maximum | 235 |
Range | 235 |
Interquartile range (IQR) | 74 |
Descriptive statistics
Standard deviation | 52.13059 |
---|---|
Coefficient of variation (CV) | 0.50185886 |
Kurtosis | -0.55042537 |
Mean | 103.875 |
Median Absolute Deviation (MAD) | 37 |
Skewness | -0.16790319 |
Sum | 41550 |
Variance | 2717.5984 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 28 | 7.0% |
40 | 12 | 3.0% |
160 | 11 | 2.8% |
87 | 8 | 2.0% |
86 | 7 | 1.8% |
68 | 7 | 1.8% |
185 | 6 | 1.5% |
116 | 6 | 1.5% |
41 | 6 | 1.5% |
83 | 6 | 1.5% |
Other values (132) | 303 | 75.8% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 28 | 7.0% |
35 | 1 | 0.2% |
36 | 5 | 1.2% |
37 | 1 | 0.2% |
38 | 3 | 0.8% |
39 | 4 | 1.0% |
40 | 12 | 3.0% |
41 | 6 | 1.5% |
42 | 4 | 1.0% |
43 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
235 | 1 | 0.2% |
220 | 1 | 0.2% |
209 | 2 | 0.5% |
208 | 1 | 0.2% |
205 | 1 | 0.2% |
204 | 1 | 0.2% |
199 | 1 | 0.2% |
195 | 2 | 0.5% |
194 | 1 | 0.2% |
192 | 3 | 0.8% |
장기외국인수
Real number (ℝ)
ZEROS
 
Distinct | 10 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.905 |
Minimum | 0 |
---|---|
Maximum | 77 |
Zeros | 386 |
Zeros (%) | 96.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 0 |
Maximum | 77 |
Range | 77 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 10.210122 |
---|---|
Coefficient of variation (CV) | 5.3596441 |
Kurtosis | 28.551386 |
Mean | 1.905 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.4018192 |
Sum | 762 |
Variance | 104.24659 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
---|---|---|
0 | 386 | 96.5% |
43 | 3 | 0.8% |
44 | 2 | 0.5% |
57 | 2 | 0.5% |
61 | 2 | 0.5% |
49 | 1 | 0.2% |
50 | 1 | 0.2% |
69 | 1 | 0.2% |
64 | 1 | 0.2% |
77 | 1 | 0.2% |
Minimum 10 values
Value | Count | Frequency (%) |
---|---|---|
0 | 386 | 96.5% |
43 | 3 | 0.8% |
44 | 2 | 0.5% |
49 | 1 | 0.2% |
50 | 1 | 0.2% |
57 | 2 | 0.5% |
61 | 2 | 0.5% |
64 | 1 | 0.2% |
69 | 1 | 0.2% |
77 | 1 | 0.2% |
Maximum 10 values
Value | Count | Frequency (%) |
---|---|---|
77 | 1 | 0.2% |
69 | 1 | 0.2% |
64 | 1 | 0.2% |
61 | 2 | 0.5% |
57 | 2 | 0.5% |
50 | 1 | 0.2% |
49 | 1 | 0.2% |
44 | 2 | 0.5% |
43 | 3 | 0.8% |
0 | 386 | 96.5% |
단기외국인수
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 7 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
0 | 386 | 96.5% |
8 | 8 | 2.0% |
7 | 4 | 1.0% |
9 | 2 | 0.5% |
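The 86.6% imbalance figure reported for 단기외국인수 in the alerts can be reproduced from these four counts. The sketch below assumes the score is one minus the Shannon entropy of the class frequencies, normalized by the maximum possible entropy for that number of classes. The report does not state this definition; it is an assumption that agrees with the reported value.

```python
from math import log2

# Value counts of 단기외국인수 as listed in this report.
counts = {"0": 386, "8": 8, "7": 4, "9": 2}

n = sum(counts.values())
probs = [k / n for k in counts.values()]

# Shannon entropy of the class distribution, in bits.
entropy = -sum(p * log2(p) for p in probs)

# Assumed definition: 1 - entropy / log2(number of classes).
# 0 would mean perfectly balanced classes, 1 a single class only.
imbalance = 1 - entropy / log2(len(counts))
print(f"{imbalance:.1%}")
```

With 386 of the 400 rows in a single class, the entropy is only about 0.27 bits out of a possible 2, which is why the score is so close to 1.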
기준년월일
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.3 KiB |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20200201 |
---|---|
2nd row | 20200201 |
3rd row | 20200201 |
4th row | 20200201 |
5th row | 20200201 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
20200201 | 400 | 100.0% |
Correlations
Variable | 24시간대구분코드 | 성별구분코드 | 연령대구분코드 | 내국인수 | 장기외국인수 | 단기외국인수 |
---|---|---|---|---|---|---|
24시간대구분코드 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
성별구분코드 | 0.000 | 1.000 | 0.832 | 0.819 | 0.800 | 0.487 |
연령대구분코드 | 0.000 | 0.832 | 1.000 | 0.911 | 0.476 | 0.575 |
내국인수 | 0.000 | 0.819 | 0.911 | 1.000 | 0.477 | 0.567 |
장기외국인수 | 0.000 | 0.800 | 0.476 | 0.477 | 1.000 | 0.000 |
단기외국인수 | 0.000 | 0.487 | 0.575 | 0.567 | 0.000 | 1.000 |
Variable | 연령대구분코드 | 성별구분코드 | 단기외국인수 |
---|---|---|---|
연령대구분코드 | 1.000 | 0.686 | 0.359 |
성별구분코드 | 0.686 | 1.000 | 0.484 |
단기외국인수 | 0.359 | 0.484 | 1.000 |
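The report does not say which association measure fills this categorical-only matrix. One common choice for pairs of categorical variables is Cramér's V. The sketch below shows the plain, uncorrected form on made-up contingency tables, purely as an illustration. The function name `cramers_v` and the example tables are hypothetical, and the profiler may apply a bias correction or a different measure altogether, so these numbers are not expected to reproduce the values above.

```python
from math import sqrt

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows.

    Assumes every row and every column has a non-zero total.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    # Pearson chi-squared statistic: observed vs. expected under independence.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    # Normalize by the largest value chi2 can take for this table shape.
    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k))

# Hypothetical tables: one perfectly associated, one independent.
perfect = [[10, 0], [0, 10]]
independent = [[5, 5], [5, 5]]
print(cramers_v(perfect), cramers_v(independent))
```

The statistic ranges from 0 (no association) to 1 (one variable fully determines the other), which matches the range of the entries in the matrix above.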
Variable | 24시간대구분코드 | 내국인수 | 장기외국인수 | 성별구분코드 | 연령대구분코드 | 단기외국인수 |
---|---|---|---|---|---|---|
24시간대구분코드 | 1.000 | 0.054 | 0.019 | 0.000 | 0.000 | 0.000 |
내국인수 | 0.054 | 1.000 | -0.307 | 0.712 | 0.674 | 0.373 |
장기외국인수 | 0.019 | -0.307 | 1.000 | 0.479 | 0.254 | 0.000 |
성별구분코드 | 0.000 | 0.712 | 0.479 | 1.000 | 0.686 | 0.484 |
연령대구분코드 | 0.000 | 0.674 | 0.254 | 0.686 | 1.000 | 0.359 |
단기외국인수 | 0.000 | 0.373 | 0.000 | 0.484 | 0.359 | 1.000 |
First rows
Row | ETL일시 | 원천테이블 | 24시간대구분코드 | 성별구분코드 | 연령대구분코드 | 행정동코드 | 내국인수 | 장기외국인수 | 단기외국인수 | 기준년월일 |
---|---|---|---|---|---|---|---|---|---|---|
0 | 2020-02-10 00:14:58.0 | _ | 0 | - | _ | 11110560 | 0 | 43 | 0 | 20200201 |
1 | 2020-02-10 00:14:58.0 | _ | 0 | - | _ | 11110560 | 0 | 0 | 7 | 20200201 |
2 | 2020-02-10 00:14:58.0 | _ | 0 | F | age_10 | 11110560 | 35 | 0 | 0 | 20200201 |
3 | 2020-02-10 00:14:58.0 | _ | 0 | F | age_15 | 11110560 | 74 | 0 | 0 | 20200201 |
4 | 2020-02-10 00:14:58.0 | _ | 0 | F | age_20 | 11110560 | 137 | 0 | 0 | 20200201 |
5 | 2020-02-10 00:14:58.0 | _ | 0 | F | age_25 | 11110560 | 98 | 0 | 0 | 20200201 |
6 | 2020-02-10 00:14:58.0 | _ | 0 | F | age_30 | 11110560 | 87 | 0 | 0 | 20200201 |
7 | 2020-02-10 00:14:58.0 | _ | 0 | F | age_35 | 11110560 | 124 | 0 | 0 | 20200201 |
8 | 2020-02-10 00:14:58.0 | _ | 0 | F | age_40 | 11110560 | 121 | 0 | 0 | 20200201 |
9 | 2020-02-10 00:14:58.0 | _ | 0 | F | age_45 | 11110560 | 158 | 0 | 0 | 20200201 |
Last rows
Row | ETL일시 | 원천테이블 | 24시간대구분코드 | 성별구분코드 | 연령대구분코드 | 행정동코드 | 내국인수 | 장기외국인수 | 단기외국인수 | 기준년월일 |
---|---|---|---|---|---|---|---|---|---|---|
390 | 2020-02-10 00:14:58.0 | _ | 13 | - | _ | 11110560 | 0 | 77 | 0 | 20200201 |
391 | 2020-02-10 00:14:58.0 | _ | 13 | - | _ | 11110560 | 0 | 0 | 8 | 20200201 |
392 | 2020-02-10 00:14:58.0 | _ | 13 | F | age_10 | 11110560 | 44 | 0 | 0 | 20200201 |
393 | 2020-02-10 00:14:58.0 | _ | 13 | F | age_15 | 11110560 | 75 | 0 | 0 | 20200201 |
394 | 2020-02-10 00:14:58.0 | _ | 13 | F | age_20 | 11110560 | 110 | 0 | 0 | 20200201 |
395 | 2020-02-10 00:14:58.0 | _ | 13 | F | age_25 | 11110560 | 110 | 0 | 0 | 20200201 |
396 | 2020-02-10 00:14:58.0 | _ | 13 | F | age_30 | 11110560 | 93 | 0 | 0 | 20200201 |
397 | 2020-02-10 00:14:58.0 | _ | 13 | F | age_35 | 11110560 | 135 | 0 | 0 | 20200201 |
398 | 2020-02-10 00:14:58.0 | _ | 13 | F | age_40 | 11110560 | 116 | 0 | 0 | 20200201 |
399 | 2020-02-10 00:14:58.0 | _ | 13 | F | age_45 | 11110560 | 172 | 0 | 0 | 20200201 |