Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.7 KiB |
Average record size in memory | 68.3 B |
Variable types
Categorical | 6 |
---|---|
Numeric | 2 |
Dataset
Description | Sample data |
---|---|
Author | 데이터웨이 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=0a3372e0-2f04-11ea-bccd-b704c648ae09 |
시도명 has constant value "서울특별시" | Constant |
시군구명 has constant value "종로구" | Constant |
행정동명 is highly overall correlated with 기준일자 and 1 other fields | High correlation |
행정동코드 is highly overall correlated with 기준일자 and 1 other fields | High correlation |
기준일자 is highly overall correlated with 행정동코드 and 1 other fields | High correlation |
성별 is highly overall correlated with 연령대 | High correlation |
연령대 is highly overall correlated with 성별 | High correlation |
행정동코드 is highly imbalanced (89.8%) | Imbalance |
행정동명 is highly imbalanced (89.8%) | Imbalance |
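The 89.8% imbalance figure can be reproduced from the value counts reported for 행정동코드 (98, 1, 1). The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum possible value. That assumption matches the reported number, but the function name `imbalance_score` is illustrative and does not come from the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H(p) / log2(k) for k classes: 0 is balanced, 1 is fully skewed."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: treat a single class as having no imbalance score
    # Shannon entropy in bits; zero counts contribute nothing
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Value counts for 행정동코드 taken from the Common Values table
score = imbalance_score([98, 1, 1])
print(f"{score:.1%}")  # 89.8%
```

A perfectly balanced column scores 0 under this definition, so a value near 90% indicates that one category dominates almost every row.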
Reproduction
Analysis started | 2023-12-10 13:37:59.191910 |
---|---|
Analysis finished | 2023-12-10 13:38:00.201098 |
Duration | 1.01 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
행정동코드
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
1111061500 | 98 |
---|---|
1111055000 | 1 |
1111058000 | 1 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 1111055000 |
---|---|
2nd row | 1111058000 |
3rd row | 1111061500 |
4th row | 1111061500 |
5th row | 1111061500 |
Common Values
Value | Count | Frequency (%) |
1111061500 | 98 | 98.0% |
1111055000 | 1 | 1.0% |
1111058000 | 1 | 1.0% |
시도명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
서울특별시 | 100 |
---|---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울특별시 |
---|---|
2nd row | 서울특별시 |
3rd row | 서울특별시 |
4th row | 서울특별시 |
5th row | 서울특별시 |
Common Values
Value | Count | Frequency (%) |
서울특별시 | 100 | 100.0% |
시군구명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
종로구 | 100 |
---|---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 종로구 |
---|---|
2nd row | 종로구 |
3rd row | 종로구 |
4th row | 종로구 |
5th row | 종로구 |
Common Values
Value | Count | Frequency (%) |
종로구 | 100 | 100.0% |
행정동명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
종로1.2.3.4가동 | 98 |
---|---|
부암동 | 1 |
교남동 | 1 |
Length
Max length | 11 |
---|---|
Median length | 11 |
Mean length | 10.84 |
Min length | 3 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 부암동 |
---|---|
2nd row | 교남동 |
3rd row | 종로1.2.3.4가동 |
4th row | 종로1.2.3.4가동 |
5th row | 종로1.2.3.4가동 |
Common Values
Value | Count | Frequency (%) |
종로1.2.3.4가동 | 98 | 98.0% |
부암동 | 1 | 1.0% |
교남동 | 1 | 1.0% |
기준일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 20 |
---|---|
Distinct (%) | 20.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20200320 |
Minimum | 20200302 |
---|---|
Maximum | 20200525 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20200302 |
---|---|
5-th percentile | 20200309 |
Q1 | 20200310 |
median | 20200316 |
Q3 | 20200323 |
95-th percentile | 20200324 |
Maximum | 20200525 |
Range | 223 |
Interquartile range (IQR) | 13 |
Descriptive statistics
Standard deviation | 28.901244 |
---|---|
Coefficient of variation (CV) | 1.430732 × 10^-6 |
Kurtosis | 43.333351 |
Mean | 20200320 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 6.4945584 |
Sum | 2.020032 × 10^9 |
Variance | 835.28192 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20200309 | 17 | 17.0% |
20200324 | 12 | 12.0% |
20200310 | 12 | 12.0% |
20200323 | 11 | 11.0% |
20200320 | 9 | 9.0% |
20200312 | 5 | 5.0% |
20200311 | 5 | 5.0% |
20200318 | 4 | 4.0% |
20200319 | 4 | 4.0% |
20200317 | 4 | 4.0% |
Other values (10) | 17 | 17.0% |
Value | Count | Frequency (%) |
20200302 | 1 | 1.0% |
20200303 | 2 | 2.0% |
20200306 | 1 | 1.0% |
20200309 | 17 | 17.0% |
20200310 | 12 | 12.0% |
20200311 | 5 | 5.0% |
20200312 | 5 | 5.0% |
20200313 | 3 | 3.0% |
20200314 | 2 | 2.0% |
20200316 | 3 | 3.0% |
Value | Count | Frequency (%) |
20200525 | 1 | 1.0% |
20200507 | 1 | 1.0% |
20200325 | 1 | 1.0% |
20200324 | 12 | 12.0% |
20200323 | 11 | 11.0% |
20200321 | 2 | 2.0% |
20200320 | 9 | 9.0% |
20200319 | 4 | 4.0% |
20200318 | 4 | 4.0% |
20200317 | 4 | 4.0% |
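기준일자 holds dates encoded as YYYYMMDD integers, yet the report types it as a real number, so figures such as Range (223) and the coefficient of variation are computed on the raw integers rather than on calendar time. The following is a minimal sketch, assuming the YYYYMMDD encoding suggested by the sample values, of how the reported minimum and maximum translate into an actual day span. The helper name `parse_yyyymmdd` is illustrative.

```python
from datetime import datetime


def parse_yyyymmdd(value):
    """Convert an integer such as 20200302 into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Minimum and maximum of 기준일자 as reported in the quantile statistics
first = parse_yyyymmdd(20200302)
last = parse_yyyymmdd(20200525)

integer_range = 20200525 - 20200302   # what the report calls "Range"
day_span = (last - first).days        # elapsed calendar days

print(integer_range, day_span)  # 223 84
```

Parsing the column as a date before profiling would likely give more meaningful range and spread statistics than the integer arithmetic shown in the tables above.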
성별
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
F | 55 |
---|---|
M | 42 |
X | 3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | F |
---|---|
2nd row | F |
3rd row | X |
4th row | F |
5th row | M |
Common Values
Value | Count | Frequency (%) |
F | 55 | 55.0% |
M | 42 | 42.0% |
X | 3 | 3.0% |
연령대
Categorical
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
20 | 25 |
---|---|
50 | 20 |
25 | 17 |
45 | 12 |
55 | 7 |
Other values (6) | 19 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 45 |
---|---|
2nd row | 45 |
3rd row | xx |
4th row | 25 |
5th row | 50 |
Common Values
Value | Count | Frequency (%) |
20 | 25 | 25.0% |
50 | 20 | 20.0% |
25 | 17 | 17.0% |
45 | 12 | 12.0% |
55 | 7 | 7.0% |
30 | 6 | 6.0% |
60 | 4 | 4.0% |
xx | 3 | 3.0% |
40 | 3 | 3.0% |
35 | 2 | 2.0% |
소비인구(명)
Real number (ℝ)
Distinct | 15 |
---|---|
Distinct (%) | 15.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 47.932797 |
Minimum | 22.861429 |
---|---|
Maximum | 365.78287 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 22.861429 |
---|---|
5-th percentile | 22.861429 |
Q1 | 22.861429 |
median | 30.481906 |
Q3 | 53.343335 |
95-th percentile | 107.44872 |
Maximum | 365.78287 |
Range | 342.92144 |
Interquartile range (IQR) | 30.481906 |
Descriptive statistics
Standard deviation | 44.858933 |
---|---|
Coefficient of variation (CV) | 0.93587138 |
Kurtosis | 26.922152 |
Mean | 47.932797 |
Median Absolute Deviation (MAD) | 7.6204765 |
Skewness | 4.5042285 |
Sum | 4793.2797 |
Variance | 2012.3239 |
Monotonicity | Not monotonic |
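Several rows in the tables above are derived from others: Range is maximum minus minimum, IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below recomputes them from the reported figures as a consistency check. The variable names are illustrative, and small differences in the last digits are expected because the inputs are already rounded.

```python
# Figures copied from the 소비인구(명) tables above
minimum, maximum = 22.861429, 365.78287
q1, q3 = 22.861429, 53.343335
mean, std = 47.932797, 44.858933

value_range = maximum - minimum   # reported as 342.92144
iqr = q3 - q1                     # reported as 30.481906
cv = std / mean                   # reported as 0.93587138
variance = std ** 2               # reported as 2012.3239

print(round(value_range, 5), round(iqr, 6), round(cv, 6), round(variance, 2))
```

A coefficient of variation close to 1, together with a maximum roughly seven times the 95th percentile, points to a strongly right-skewed column, in line with the reported skewness of about 4.5.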
Value | Count | Frequency (%) |
22.86142941 | 33 | 33.0% |
30.48190588 | 18 | 18.0% |
45.72285882 | 11 | 11.0% |
53.34333529 | 10 | 10.0% |
38.10238235 | 9 | 9.0% |
60.96381176 | 5 | 5.0% |
76.2047647 | 3 | 3.0% |
106.68667058 | 3 | 3.0% |
137.16857646 | 2 | 2.0% |
365.78287056 | 1 | 1.0% |
Other values (5) | 5 | 5.0% |
Value | Count | Frequency (%) |
22.86142941 | 33 | 33.0% |
30.48190588 | 18 | 18.0% |
38.10238235 | 9 | 9.0% |
45.72285882 | 11 | 11.0% |
53.34333529 | 10 | 10.0% |
60.96381176 | 5 | 5.0% |
76.2047647 | 3 | 3.0% |
83.82524117 | 1 | 1.0% |
91.44571764 | 1 | 1.0% |
99.06619411 | 1 | 1.0% |
Value | Count | Frequency (%) |
365.78287056 | 1 | 1.0% |
220.99381763 | 1 | 1.0% |
137.16857646 | 2 | 2.0% |
121.92762352 | 1 | 1.0% |
106.68667058 | 3 | 3.0% |
99.06619411 | 1 | 1.0% |
91.44571764 | 1 | 1.0% |
83.82524117 | 1 | 1.0% |
76.2047647 | 3 | 3.0% |
60.96381176 | 5 | 5.0% |
Variable | 행정동코드 | 행정동명 | 기준일자 | 성별 | 연령대 | 소비인구(명) |
---|---|---|---|---|---|---|
행정동코드 | 1.000 | 1.000 | 0.940 | 0.000 | 0.000 | 0.000 |
행정동명 | 1.000 | 1.000 | 0.940 | 0.000 | 0.000 | 0.000 |
기준일자 | 0.940 | 0.940 | 1.000 | 0.000 | 0.000 | 0.000 |
성별 | 0.000 | 0.000 | 0.000 | 1.000 | 0.816 | 0.000 |
연령대 | 0.000 | 0.000 | 0.000 | 0.816 | 1.000 | 0.000 |
소비인구(명) | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
Variable | 행정동명 | 성별 | 행정동코드 | 연령대 |
---|---|---|---|---|
행정동명 | 1.000 | 0.000 | 1.000 | 0.000 |
성별 | 0.000 | 1.000 | 0.000 | 0.670 |
행정동코드 | 1.000 | 0.000 | 1.000 | 0.000 |
연령대 | 0.000 | 0.670 | 0.000 | 1.000 |
Variable | 기준일자 | 소비인구(명) | 행정동코드 | 행정동명 | 성별 | 연령대 |
---|---|---|---|---|---|---|
기준일자 | 1.000 | -0.238 | 0.700 | 0.700 | 0.000 | 0.000 |
소비인구(명) | -0.238 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 |
행정동코드 | 0.700 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 |
행정동명 | 0.700 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 |
성별 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.670 |
연령대 | 0.000 | 0.000 | 0.000 | 0.000 | 0.670 | 1.000 |
Index | 행정동코드 | 시도명 | 시군구명 | 행정동명 | 기준일자 | 성별 | 연령대 | 소비인구(명) |
---|---|---|---|---|---|---|---|---|
0 | 1111055000 | 서울특별시 | 종로구 | 부암동 | 20200507 | F | 45 | 22.861429 |
1 | 1111058000 | 서울특별시 | 종로구 | 교남동 | 20200525 | F | 45 | 22.861429 |
2 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200324 | X | xx | 22.861429 |
3 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200303 | F | 25 | 22.861429 |
4 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200303 | M | 50 | 30.481906 |
5 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200306 | F | 20 | 22.861429 |
6 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200309 | F | 20 | 365.782871 |
7 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200309 | M | 25 | 91.445718 |
8 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200309 | F | 25 | 99.066194 |
9 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200309 | F | 30 | 45.722859 |
Index | 행정동코드 | 시도명 | 시군구명 | 행정동명 | 기준일자 | 성별 | 연령대 | 소비인구(명) |
---|---|---|---|---|---|---|---|---|
90 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200324 | F | 30 | 22.861429 |
91 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200324 | F | 45 | 22.861429 |
92 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200324 | F | 50 | 45.722859 |
93 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200324 | F | 55 | 30.481906 |
94 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200302 | F | 50 | 30.481906 |
95 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200324 | M | 25 | 30.481906 |
96 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200324 | M | 35 | 22.861429 |
97 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200324 | M | 50 | 30.481906 |
98 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200324 | M | 55 | 30.481906 |
99 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200325 | F | 20 | 45.722859 |