Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.7 KiB |
Average record size in memory | 68.3 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 5 |
Dataset
Description | Sample data |
---|---|
Author | 데이터웨이 |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=2afcb2c0-2f04-11ea-bccd-b704c648ae09 |
시도명 has constant value "서울특별시" | Constant |
연령대 is highly overall correlated with 성별 | High correlation |
시군구명 is highly overall correlated with 행정동코드 and 1 other fields | High correlation |
행정동명 is highly overall correlated with 행정동코드 and 1 other fields | High correlation |
성별 is highly overall correlated with 연령대 | High correlation |
행정동코드 is highly overall correlated with 시군구명 and 1 other fields | High correlation |
시군구명 is highly imbalanced (67.3%) | Imbalance |
연령대 is highly imbalanced (63.2%) | Imbalance |
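The two imbalance percentages above can be cross-checked from the category counts listed later in this report. The sketch below assumes the score is one minus the Shannon entropy of the category frequencies, normalized by the log of the number of categories. That formula is an assumption rather than something this report states, though it does reproduce both reported values. The function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the category frequencies and k is the number of categories.
    Assumed definition; not taken from the report itself."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: a single category gets a score of 0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Category counts copied from the Common Values tables in this report.
sigungu_counts = [94, 6]                  # 시군구명
age_counts = [83, 4, 4, 3, 2, 2, 1, 1]    # 연령대

print(f"{imbalance_score(sigungu_counts):.1%}")  # 67.3%
print(f"{imbalance_score(age_counts):.1%}")      # 63.2%
```

Under the same assumed formula, 성별 (83 vs. 17) scores about 34%, which is consistent with it not being flagged here.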
Reproduction
Analysis started | 2023-12-10 11:22:57.780449 |
---|---|
Analysis finished | 2023-12-10 11:23:00.204218 |
Duration | 2.42 seconds |
Software version | ydata-profiling v4.5.1 |
행정동코드
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1112369 × 10^9 |
Minimum | 1.1110515 × 10^9 |
---|---|
Maximum | 1.114057 × 10^9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1.1110515 × 10^9 |
---|---|
5-th percentile | 1.111053 × 10^9 |
Q1 | 1.111053 × 10^9 |
median | 1.1110555 × 10^9 |
Q3 | 1.1110615 × 10^9 |
95-th percentile | 1.114055 × 10^9 |
Maximum | 1.114057 × 10^9 |
Range | 3005500 |
Interquartile range (IQR) | 8500 |
Descriptive statistics
Standard deviation | 715824.97 |
---|---|
Coefficient of variation (CV) | 0.00064416953 |
Kurtosis | 12.400393 |
Mean | 1.1112369 × 10^9 |
Median Absolute Deviation (MAD) | 4000 |
Skewness | 3.7619356 |
Sum | 1.1112369 × 10^11 |
Variance | 5.1240538 × 10^11 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
---|---|---|
1111061500 | 42 | 42.0% |
1111053000 | 37 | 37.0% |
1111055000 | 9 | 9.0% |
1111051500 | 4 | 4.0% |
1114055000 | 3 | 3.0% |
1114057000 | 3 | 3.0% |
1111056000 | 2 | 2.0% |
Value | Count | Frequency (%) |
---|---|---|
1111051500 | 4 | 4.0% |
1111053000 | 37 | 37.0% |
1111055000 | 9 | 9.0% |
1111056000 | 2 | 2.0% |
1111061500 | 42 | 42.0% |
1114055000 | 3 | 3.0% |
1114057000 | 3 | 3.0% |
Value | Count | Frequency (%) |
---|---|---|
1114057000 | 3 | 3.0% |
1114055000 | 3 | 3.0% |
1111061500 | 42 | 42.0% |
1111056000 | 2 | 2.0% |
1111055000 | 9 | 9.0% |
1111053000 | 37 | 37.0% |
1111051500 | 4 | 4.0% |
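The summary statistics reported for 행정동코드 can be recomputed from its value counts alone. The following is a minimal sketch using only the Python standard library. It assumes the variance uses the sample (n - 1) denominator, which is the choice that matches the reported figure, and it takes MAD to be the median of absolute deviations from the median.

```python
import statistics

# Value -> count, copied from the frequency table for 행정동코드.
value_counts = {
    1111061500: 42,
    1111053000: 37,
    1111055000: 9,
    1111051500: 4,
    1114055000: 3,
    1114057000: 3,
    1111056000: 2,
}

# Expand the table back into the 100 individual observations.
values = sorted(v for v, n in value_counts.items() for _ in range(n))

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
value_range = max(values) - min(values)

print(len(values))              # report: 100 observations
print(sum(values))              # report: 1.1112369 × 10^11
print(f"{mean:.0f}")            # report: 1.1112369 × 10^9
print(f"{variance:.7e}")        # report: 5.1240538 × 10^11
print(f"{std_dev:.2f}")         # report: 715824.97
print(f"{std_dev / mean:.8g}")  # report: 0.00064416953 (CV)
print(f"{median:.0f}")          # report: 1.1110555 × 10^9
print(f"{mad:.0f}")             # report: 4000
print(value_range)              # report: 3005500
```

Skewness and kurtosis are left out because the report does not say which estimator it uses for them.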
시도명
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
서울특별시 | 100 |
---|---|
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울특별시 |
---|---|
2nd row | 서울특별시 |
3rd row | 서울특별시 |
4th row | 서울특별시 |
5th row | 서울특별시 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
서울특별시 | 100 | 100.0% |
시군구명
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
종로구 | 94 |
---|---|
중구 | 6 |
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 2.94 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 종로구 |
---|---|
2nd row | 종로구 |
3rd row | 종로구 |
4th row | 종로구 |
5th row | 종로구 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
종로구 | 94 | 94.0% |
중구 | 6 | 6.0% |
행정동명
Categorical
HIGH CORRELATION
 
Distinct | 7 |
---|---|
Distinct (%) | 7.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
종로1.2.3.4가동 | 42 |
---|---|
사직동 | 37 |
부암동 | 9 |
청운효자동 | 4 |
명동 | 3 |
Other values (2) | 5 |
Length
Max length | 11 |
---|---|
Median length | 5 |
Mean length | 6.38 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 청운효자동 |
---|---|
2nd row | 청운효자동 |
3rd row | 청운효자동 |
4th row | 청운효자동 |
5th row | 사직동 |
Common Values
Value | Count | Frequency (%) |
---|---|---|
종로1.2.3.4가동 | 42 | 42.0% |
사직동 | 37 | 37.0% |
부암동 | 9 | 9.0% |
청운효자동 | 4 | 4.0% |
명동 | 3 | 3.0% |
필동 | 3 | 3.0% |
평창동 | 2 | 2.0% |
기준일자
Real number (ℝ)
Distinct | 60 |
---|---|
Distinct (%) | 60.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20200919 |
Minimum | 20200803 |
---|---|
Maximum | 20201030 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20200803 |
---|---|
5-th percentile | 20200807 |
Q1 | 20200826 |
median | 20200916 |
Q3 | 20201012 |
95-th percentile | 20201026 |
Maximum | 20201030 |
Range | 227 |
Interquartile range (IQR) | 186 |
Descriptive statistics
Standard deviation | 83.420572 |
---|---|
Coefficient of variation (CV) | 4.1295435 × 10^-6 |
Kurtosis | -1.5201713 |
Mean | 20200919 |
Median Absolute Deviation (MAD) | 92 |
Skewness | -0.008459194 |
Sum | 2.0200919 × 10^9 |
Variance | 6958.9918 |
Monotonicity | Not monotonic |
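Because 기준일자 is profiled as a plain number, the Range (227) and IQR (186) above are differences between YYYYMMDD integers, not day counts. The sketch below parses the reported quantile values as dates to show the calendar spans instead. The YYYYMMDD reading of the column is an assumption based on the values shown, and the helper name `parse_yyyymmdd` is hypothetical.

```python
from datetime import datetime

def parse_yyyymmdd(value):
    """Convert an integer such as 20200803 into a datetime.date."""
    return datetime.strptime(str(value), "%Y%m%d").date()

# Values copied from the quantile statistics for 기준일자.
minimum, q1, q3, maximum = 20200803, 20200826, 20201012, 20201030

# Integer differences, as listed in the report.
print(maximum - minimum)   # 227
print(q3 - q1)             # 186

# Calendar differences in days.
print((parse_yyyymmdd(maximum) - parse_yyyymmdd(minimum)).days)  # 88
print((parse_yyyymmdd(q3) - parse_yyyymmdd(q1)).days)            # 47
```

The same caveat applies to the mean, standard deviation and variance of this column, which are also computed on the integer encoding.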
Value | Count | Frequency (%) |
---|---|---|
20201023 | 4 | 4.0% |
20200904 | 4 | 4.0% |
20200828 | 4 | 4.0% |
20201016 | 3 | 3.0% |
20200807 | 3 | 3.0% |
20200820 | 3 | 3.0% |
20200825 | 3 | 3.0% |
20200901 | 3 | 3.0% |
20201012 | 3 | 3.0% |
20200929 | 3 | 3.0% |
Other values (50) | 67 | 67.0% |
Value | Count | Frequency (%) |
---|---|---|
20200803 | 1 | 1.0% |
20200804 | 1 | 1.0% |
20200805 | 1 | 1.0% |
20200806 | 2 | 2.0% |
20200807 | 3 | 3.0% |
20200808 | 1 | 1.0% |
20200810 | 1 | 1.0% |
20200811 | 1 | 1.0% |
20200812 | 1 | 1.0% |
20200813 | 1 | 1.0% |
Value | Count | Frequency (%) |
---|---|---|
20201030 | 2 | 2.0% |
20201029 | 2 | 2.0% |
20201028 | 1 | 1.0% |
20201026 | 2 | 2.0% |
20201024 | 1 | 1.0% |
20201023 | 4 | 4.0% |
20201022 | 1 | 1.0% |
20201021 | 1 | 1.0% |
20201020 | 1 | 1.0% |
20201019 | 2 | 2.0% |
성별
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
X | 83 |
---|---|
M | 17 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | X |
---|---|
2nd row | X |
3rd row | X |
4th row | X |
5th row | X |
Common Values
Value | Count | Frequency (%) |
---|---|---|
X | 83 | 83.0% |
M | 17 | 17.0% |
연령대
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 8 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
xx | 83 |
---|---|
55 | 4 |
50 | 4 |
35 | 3 |
60 | 2 |
Other values (3) | 4 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | xx |
---|---|
2nd row | xx |
3rd row | xx |
4th row | xx |
5th row | xx |
Common Values
Value | Count | Frequency (%) |
---|---|---|
xx | 83 | 83.0% |
55 | 4 | 4.0% |
50 | 4 | 4.0% |
35 | 3 | 3.0% |
60 | 2 | 2.0% |
30 | 2 | 2.0% |
45 | 1 | 1.0% |
70 | 1 | 1.0% |
소비인구(명)
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 28.023923 |
Minimum | 22.479082 |
---|---|
Maximum | 67.437247 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 22.479082 |
---|---|
5-th percentile | 22.479082 |
Q1 | 22.479082 |
median | 22.479082 |
Q3 | 29.97211 |
95-th percentile | 44.958165 |
Maximum | 67.437247 |
Range | 44.958165 |
Interquartile range (IQR) | 7.4930275 |
Descriptive statistics
Standard deviation | 8.4947687 |
---|---|
Coefficient of variation (CV) | 0.30312561 |
Kurtosis | 4.9003901 |
Mean | 28.023923 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.0160821 |
Sum | 2802.3923 |
Variance | 72.161095 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
---|---|---|
22.479082395 | 58 | 58.0% |
29.97210986 | 23 | 23.0% |
37.465137325 | 12 | 12.0% |
44.95816479 | 3 | 3.0% |
52.451192255 | 3 | 3.0% |
67.437247185 | 1 | 1.0% |
Value | Count | Frequency (%) |
---|---|---|
22.479082395 | 58 | 58.0% |
29.97210986 | 23 | 23.0% |
37.465137325 | 12 | 12.0% |
44.95816479 | 3 | 3.0% |
52.451192255 | 3 | 3.0% |
67.437247185 | 1 | 1.0% |
Value | Count | Frequency (%) |
---|---|---|
67.437247185 | 1 | 1.0% |
52.451192255 | 3 | 3.0% |
44.95816479 | 3 | 3.0% |
37.465137325 | 12 | 12.0% |
29.97210986 | 23 | 23.0% |
22.479082395 | 58 | 58.0% |
Variable | 행정동코드 | 시군구명 | 행정동명 | 기준일자 | 성별 | 연령대 | 소비인구(명) |
---|---|---|---|---|---|---|---|
행정동코드 | 1.000 | 0.990 | 1.000 | 0.117 | 0.000 | 0.239 | 0.000 |
시군구명 | 0.990 | 1.000 | 1.000 | 0.109 | 0.000 | 0.252 | 0.000 |
행정동명 | 1.000 | 1.000 | 1.000 | 0.203 | 0.303 | 0.302 | 0.000 |
기준일자 | 0.117 | 0.109 | 0.203 | 1.000 | 0.294 | 0.224 | 0.000 |
성별 | 0.000 | 0.000 | 0.303 | 0.294 | 1.000 | 1.000 | 0.446 |
연령대 | 0.239 | 0.252 | 0.302 | 0.224 | 1.000 | 1.000 | 0.000 |
소비인구(명) | 0.000 | 0.000 | 0.000 | 0.000 | 0.446 | 0.000 | 1.000 |
Variable | 연령대 | 시군구명 | 행정동명 | 성별 |
---|---|---|---|---|
연령대 | 1.000 | 0.182 | 0.164 | 0.969 |
시군구명 | 0.182 | 1.000 | 0.974 | 0.000 |
행정동명 | 0.164 | 0.974 | 1.000 | 0.315 |
성별 | 0.969 | 0.000 | 0.315 | 1.000 |
Variable | 행정동코드 | 기준일자 | 소비인구(명) | 시군구명 | 행정동명 | 성별 | 연령대 |
---|---|---|---|---|---|---|---|
행정동코드 | 1.000 | 0.094 | 0.347 | 0.910 | 0.974 | 0.000 | 0.182 |
기준일자 | 0.094 | 1.000 | 0.162 | 0.080 | 0.121 | 0.191 | 0.116 |
소비인구(명) | 0.347 | 0.162 | 1.000 | 0.000 | 0.000 | 0.314 | 0.000 |
시군구명 | 0.910 | 0.080 | 0.000 | 1.000 | 0.974 | 0.000 | 0.182 |
행정동명 | 0.974 | 0.121 | 0.000 | 0.974 | 1.000 | 0.315 | 0.164 |
성별 | 0.000 | 0.191 | 0.314 | 0.000 | 0.315 | 1.000 | 0.969 |
연령대 | 0.182 | 0.116 | 0.000 | 0.182 | 0.164 | 0.969 | 1.000 |
Index | 행정동코드 | 시도명 | 시군구명 | 행정동명 | 기준일자 | 성별 | 연령대 | 소비인구(명) |
---|---|---|---|---|---|---|---|---|
0 | 1111051500 | 서울특별시 | 종로구 | 청운효자동 | 20200903 | X | xx | 22.479082 |
1 | 1111051500 | 서울특별시 | 종로구 | 청운효자동 | 20200923 | X | xx | 22.479082 |
2 | 1111051500 | 서울특별시 | 종로구 | 청운효자동 | 20200924 | X | xx | 22.479082 |
3 | 1111051500 | 서울특별시 | 종로구 | 청운효자동 | 20200929 | X | xx | 22.479082 |
4 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20201007 | X | xx | 22.479082 |
5 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20201012 | M | 60 | 22.479082 |
6 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20201009 | M | 55 | 22.479082 |
7 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200807 | M | 30 | 22.479082 |
8 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200829 | X | xx | 22.479082 |
9 | 1111053000 | 서울특별시 | 종로구 | 사직동 | 20200929 | X | xx | 29.97211 |
Index | 행정동코드 | 시도명 | 시군구명 | 행정동명 | 기준일자 | 성별 | 연령대 | 소비인구(명) |
---|---|---|---|---|---|---|---|---|
90 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20201028 | M | 70 | 22.479082 |
91 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20201008 | X | xx | 44.958165 |
92 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200923 | X | xx | 22.479082 |
93 | 1111061500 | 서울특별시 | 종로구 | 종로1.2.3.4가동 | 20200909 | X | xx | 52.451192 |
94 | 1114055000 | 서울특별시 | 중구 | 명동 | 20201023 | X | xx | 22.479082 |
95 | 1114055000 | 서울특별시 | 중구 | 명동 | 20200925 | X | xx | 22.479082 |
96 | 1114055000 | 서울특별시 | 중구 | 명동 | 20200904 | X | xx | 29.97211 |
97 | 1114057000 | 서울특별시 | 중구 | 필동 | 20201007 | M | 55 | 22.479082 |
98 | 1114057000 | 서울특별시 | 중구 | 필동 | 20201023 | X | xx | 22.479082 |
99 | 1114057000 | 서울특별시 | 중구 | 필동 | 20200904 | M | 30 | 22.479082 |