Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.9 KiB |
Average record size in memory | 70.3 B |
Variable types
Numeric | 5 |
---|---|
Categorical | 3 |
Dataset
Description | Sample data |
---|---|
Author | 지디에스컨설팅그룹 (GDS Consulting Group) |
URL | https://www.bigdata-environment.kr/user/data_market/detail.do?id=d5a6d330-2dfc-11ea-9c1b-71bfb969ab02 |
Alerts
법정동코드 is highly overall correlated with 업종코드 and 1 other field | High correlation |
업종코드 is highly overall correlated with 법정동코드 and 3 other fields | High correlation |
금액 is highly overall correlated with 업종코드 | High correlation |
업종명 is highly overall correlated with 법정동코드 and 2 other fields | High correlation |
성별 is highly overall correlated with 업종코드 and 2 other fields | High correlation |
연령 is highly overall correlated with 성별 | High correlation |
Reproduction
Analysis started | 2023-12-10 13:14:27.810347 |
---|---|
Analysis finished | 2023-12-10 13:14:32.253625 |
Duration | 4.44 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
법정동코드 (legal district code)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11110109 |
Minimum | 11110106 |
---|---|
Maximum | 11110117 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 11110106 |
---|---|
5-th percentile | 11110107 |
Q1 | 11110107 |
median | 11110108 |
Q3 | 11110112 |
95-th percentile | 11110114 |
Maximum | 11110117 |
Range | 11 |
Interquartile range (IQR) | 5 |
Descriptive statistics
Standard deviation | 3.0158504 |
---|---|
Coefficient of variation (CV) | 2.7145101 × 10⁻⁷ |
Kurtosis | -0.13549702 |
Mean | 11110109 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.9715702 |
Sum | 1.1110109 × 10⁹ |
Variance | 9.0953535 |
Monotonicity | Increasing |
Common values
Value | Count | Frequency (%) |
11110107 | 41 | 41.0% |
11110112 | 19 | 19.0% |
11110108 | 17 | 17.0% |
11110113 | 8 | 8.0% |
11110117 | 5 | 5.0% |
11110106 | 4 | 4.0% |
11110114 | 3 | 3.0% |
11110109 | 2 | 2.0% |
11110110 | 1 | 1.0% |
Minimum 10 values
Value | Count | Frequency (%) |
11110106 | 4 | 4.0% |
11110107 | 41 | 41.0% |
11110108 | 17 | 17.0% |
11110109 | 2 | 2.0% |
11110110 | 1 | 1.0% |
11110112 | 19 | 19.0% |
11110113 | 8 | 8.0% |
11110114 | 3 | 3.0% |
11110117 | 5 | 5.0% |
Maximum 10 values
Value | Count | Frequency (%) |
11110117 | 5 | 5.0% |
11110114 | 3 | 3.0% |
11110113 | 8 | 8.0% |
11110112 | 19 | 19.0% |
11110110 | 1 | 1.0% |
11110109 | 2 | 2.0% |
11110108 | 17 | 17.0% |
11110107 | 41 | 41.0% |
11110106 | 4 | 4.0% |
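The descriptive statistics reported for 법정동코드 can be cross-checked from its frequency table alone. The following is a minimal sketch using only the Python standard library; it assumes the variance is the sample variance (denominator n - 1), which is consistent with the reported figures but is not stated in the report itself.

```python
import statistics

# Value -> count, copied from the 법정동코드 frequency table in this report.
counts = {
    11110106: 4, 11110107: 41, 11110108: 17, 11110109: 2, 11110110: 1,
    11110112: 19, 11110113: 8, 11110114: 3, 11110117: 5,
}

# Expand the table back into the 100 individual observations.
values = [v for v, c in sorted(counts.items()) for _ in range(c)]

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator (assumed)
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation

print(len(values), total)
print(mean, variance, std, cv)
```

The coefficient of variation is tiny because the codes differ only in their last two digits while the mean is about 1.1 × 10⁷; for an identifier-like column such as this one, the statistic carries little meaning.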
년월 (year-month)
Real number (ℝ)
Distinct | 12 |
---|---|
Distinct (%) | 12.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1884.98 |
Minimum | 1810 |
---|---|
Maximum | 1909 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 1810 |
---|---|
5-th percentile | 1810 |
Q1 | 1901 |
median | 1905 |
Q3 | 1908 |
95-th percentile | 1909 |
Maximum | 1909 |
Range | 99 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 39.743618 |
---|---|
Coefficient of variation (CV) | 0.021084371 |
Kurtosis | -0.13037503 |
Mean | 1884.98 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -1.3617937 |
Sum | 188498 |
Variance | 1579.5552 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
1909 | 16 | 16.0% |
1906 | 13 | 13.0% |
1810 | 12 | 12.0% |
1908 | 10 | 10.0% |
1907 | 9 | 9.0% |
1904 | 8 | 8.0% |
1905 | 8 | 8.0% |
1811 | 6 | 6.0% |
1901 | 5 | 5.0% |
1902 | 5 | 5.0% |
Other values (2) | 8 | 8.0% |
Minimum 10 values
Value | Count | Frequency (%) |
1810 | 12 | 12.0% |
1811 | 6 | 6.0% |
1812 | 4 | 4.0% |
1901 | 5 | 5.0% |
1902 | 5 | 5.0% |
1903 | 4 | 4.0% |
1904 | 8 | 8.0% |
1905 | 8 | 8.0% |
1906 | 13 | 13.0% |
1907 | 9 | 9.0% |
Maximum 10 values
Value | Count | Frequency (%) |
1909 | 16 | 16.0% |
1908 | 10 | 10.0% |
1907 | 9 | 9.0% |
1906 | 13 | 13.0% |
1905 | 8 | 8.0% |
1904 | 8 | 8.0% |
1903 | 4 | 4.0% |
1902 | 5 | 5.0% |
1901 | 5 | 5.0% |
1812 | 4 | 4.0% |
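The profiler treats 년월 as a real number, so statistics such as the range (99) and standard deviation are computed on the raw integers. The twelve distinct values (1810 to 1812, 1901 to 1909) look like YYMM codes; the sketch below assumes that encoding and a 2000s century, neither of which is confirmed by the report, and shows how the numeric range differs from the span in calendar months. The helper names are hypothetical.

```python
# The 12 distinct 년월 values that appear in the report's frequency tables.
codes = [1810, 1811, 1812, 1901, 1902, 1903, 1904, 1905, 1906, 1907, 1908, 1909]


def decode_yymm(code):
    """Split a YYMM integer into (year, month). Assumes years 2000-2099."""
    yy, mm = divmod(code, 100)
    if not 1 <= mm <= 12:
        raise ValueError(f"{code} is not a valid YYMM code")
    return 2000 + yy, mm


def month_index(code):
    """Number of months since year 0, so differences are calendar-month gaps."""
    year, month = decode_yymm(code)
    return year * 12 + (month - 1)


numeric_range = max(codes) - min(codes)                          # what the report calls Range
month_span = month_index(max(codes)) - month_index(min(codes))   # gap in calendar months

print(decode_yymm(min(codes)), decode_yymm(max(codes)))
print(numeric_range, month_span)
```

Under that assumption the data covers twelve consecutive months, and the jump from 1812 to 1901 is a single month rather than 89 units, which is why the numeric summary statistics for this column should be read with care.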
업종코드 (business category code)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4403.88 |
Minimum | 2002 |
---|---|
Maximum | 8201 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 2002 |
---|---|
5-th percentile | 2004 |
Q1 | 2104 |
median | 3401 |
Q3 | 8201 |
95-th percentile | 8201 |
Maximum | 8201 |
Range | 6199 |
Interquartile range (IQR) | 6097 |
Descriptive statistics
Standard deviation | 2571.7261 |
---|---|
Coefficient of variation (CV) | 0.58396825 |
Kurtosis | -1.4026567 |
Mean | 4403.88 |
Median Absolute Deviation (MAD) | 1297 |
Skewness | 0.57061428 |
Sum | 440388 |
Variance | 6613775 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
2104 | 30 | 30.0% |
8201 | 27 | 27.0% |
5610 | 11 | 11.0% |
2499 | 11 | 11.0% |
4112 | 9 | 9.0% |
2004 | 7 | 7.0% |
3401 | 4 | 4.0% |
2002 | 1 | 1.0% |
Minimum 10 values
Value | Count | Frequency (%) |
2002 | 1 | 1.0% |
2004 | 7 | 7.0% |
2104 | 30 | 30.0% |
2499 | 11 | 11.0% |
3401 | 4 | 4.0% |
4112 | 9 | 9.0% |
5610 | 11 | 11.0% |
8201 | 27 | 27.0% |
Maximum 10 values
Value | Count | Frequency (%) |
8201 | 27 | 27.0% |
5610 | 11 | 11.0% |
4112 | 9 | 9.0% |
3401 | 4 | 4.0% |
2499 | 11 | 11.0% |
2104 | 30 | 30.0% |
2004 | 7 | 7.0% |
2002 | 1 | 1.0% |
업종명 (business category name)
Categorical
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 8.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
한식 | 30 |
---|---|
용역서비스업 | 27 |
주차장 | 11 |
기타 식품 | 11 |
편의점 | 9 |
Other values (3) | 12 |
Length
Max length | 6 |
---|---|
Median length | 5 |
Mean length | 3.93 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 한식 |
---|---|
2nd row | 한식 |
3rd row | 한식 |
4th row | 한식 |
5th row | 용역서비스업 |
Common Values
Value | Count | Frequency (%) |
한식 | 30 | 30.0% |
용역서비스업 | 27 | 27.0% |
주차장 | 11 | 11.0% |
기타 식품 | 11 | 11.0% |
편의점 | 9 | 9.0% |
커피전문점 | 7 | 7.0% |
건축자재 | 4 | 4.0% |
휴게음식점 | 1 | 1.0% |
Words
Value | Count | Frequency (%) |
한식 | 30 | 27.0% |
용역서비스업 | 27 | 24.3% |
주차장 | 11 | 9.9% |
기타 | 11 | 9.9% |
식품 | 11 | 9.9% |
편의점 | 9 | 8.1% |
커피전문점 | 7 | 6.3% |
건축자재 | 4 | 3.6% |
휴게음식점 | 1 | 0.9% |
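In the word-level table above, 기타 and 식품 appear as separate entries and the percentages are taken over 111 words rather than 100 rows. The sketch below reproduces those figures from the category counts, assuming the profiler splits values on whitespace (an assumption that matches the numbers but is not stated in the report). It also recomputes the reported mean length of 3.93 characters.

```python
from collections import Counter

# Category -> row count, copied from the 업종명 "Common Values" table.
category_counts = {
    "한식": 30, "용역서비스업": 27, "주차장": 11, "기타 식품": 11,
    "편의점": 9, "커피전문점": 7, "건축자재": 4, "휴게음식점": 1,
}

# Count each whitespace-separated word once per row it occurs in (assumed tokenization).
word_counts = Counter()
for name, n in category_counts.items():
    for word in name.split():
        word_counts[word] += n

total_words = sum(word_counts.values())
for word, n in word_counts.most_common():
    print(word, n, f"{100 * n / total_words:.1f}%")

# Mean string length over all rows, spaces included.
total_rows = sum(category_counts.values())
mean_length = sum(len(name) * n for name, n in category_counts.items()) / total_rows
print(total_words, mean_length)
```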
성별 (gender)
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2.여성 | 51 |
---|---|
1.남성 | 43 |
0.법인 | 6 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2.여성 |
---|---|
2nd row | 2.여성 |
3rd row | 1.남성 |
4th row | 2.여성 |
5th row | 2.여성 |
Common Values
Value | Count | Frequency (%) |
2.여성 | 51 | 51.0% |
1.남성 | 43 | 43.0% |
0.법인 | 6 | 6.0% |
Words
Value | Count | Frequency (%) |
2.여성 | 51 | 51.0% |
1.남성 | 43 | 43.0% |
0.법인 | 6 | 6.0% |
연령 (age group)
Categorical
HIGH CORRELATION
 
Distinct | 11 |
---|---|
Distinct (%) | 11.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
09.50세미만 | 27 |
---|---|
10.55세미만 | 16 |
08.45세미만 | 14 |
11.60세미만 | 11 |
07.40세미만 | 11 |
Other values (6) | 21 |
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 7.82 |
Min length | 5 |
Unique
Unique | 1 |
---|---|
Unique (%) | 1.0% |
Sample
1st row | 10.55세미만 |
---|---|
2nd row | 08.45세미만 |
3rd row | 12.65세미만 |
4th row | 10.55세미만 |
5th row | 11.60세미만 |
Common Values
Value | Count | Frequency (%) |
09.50세미만 | 27 | 27.0% |
10.55세미만 | 16 | 16.0% |
08.45세미만 | 14 | 14.0% |
11.60세미만 | 11 | 11.0% |
07.40세미만 | 11 | 11.0% |
12.65세미만 | 6 | 6.0% |
99.기타 | 6 | 6.0% |
06.35세미만 | 4 | 4.0% |
05.30세미만 | 2 | 2.0% |
13.70세미만 | 2 | 2.0% |
Words
Value | Count | Frequency (%) |
09.50세미만 | 27 | 27.0% |
10.55세미만 | 16 | 16.0% |
08.45세미만 | 14 | 14.0% |
11.60세미만 | 11 | 11.0% |
07.40세미만 | 11 | 11.0% |
12.65세미만 | 6 | 6.0% |
99.기타 | 6 | 6.0% |
06.35세미만 | 4 | 4.0% |
05.30세미만 | 2 | 2.0% |
13.70세미만 | 2 | 2.0% |
금액 (amount)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 89 |
---|---|
Distinct (%) | 89.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 62038.56 |
Minimum | 4700 |
---|---|
Maximum | 296000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 4700 |
---|---|
5-th percentile | 10355 |
Q1 | 19775 |
median | 36305 |
Q3 | 81530 |
95-th percentile | 203400 |
Maximum | 296000 |
Range | 291300 |
Interquartile range (IQR) | 61755 |
Descriptive statistics
Standard deviation | 63713.476 |
---|---|
Coefficient of variation (CV) | 1.026998 |
Kurtosis | 2.320859 |
Mean | 62038.56 |
Median Absolute Deviation (MAD) | 19305 |
Skewness | 1.7129248 |
Sum | 6203856 |
Variance | 4.059407 × 10⁹ |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
25000 | 3 | 3.0% |
20000 | 3 | 3.0% |
16400 | 2 | 2.0% |
22800 | 2 | 2.0% |
18500 | 2 | 2.0% |
50000 | 2 | 2.0% |
19000 | 2 | 2.0% |
12500 | 2 | 2.0% |
89000 | 2 | 2.0% |
83000 | 1 | 1.0% |
Other values (79) | 79 | 79.0% |
Minimum 10 values
Value | Count | Frequency (%) |
4700 | 1 | 1.0% |
6500 | 1 | 1.0% |
7900 | 1 | 1.0% |
8160 | 1 | 1.0% |
9500 | 1 | 1.0% |
10400 | 1 | 1.0% |
11000 | 1 | 1.0% |
11400 | 1 | 1.0% |
12500 | 2 | 2.0% |
13800 | 1 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
296000 | 1 | 1.0% |
260000 | 1 | 1.0% |
221000 | 1 | 1.0% |
219500 | 1 | 1.0% |
211000 | 1 | 1.0% |
203000 | 1 | 1.0% |
201000 | 1 | 1.0% |
200000 | 1 | 1.0% |
193000 | 1 | 1.0% |
162400 | 1 | 1.0% |
이용건수 (number of transactions)
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | 9.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.52 |
Minimum | 3 |
---|---|
Maximum | 13 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 3 |
---|---|
5-th percentile | 3 |
Q1 | 3 |
median | 4 |
Q3 | 5 |
95-th percentile | 10.05 |
Maximum | 13 |
Range | 10 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 2.2449944 |
---|---|
Coefficient of variation (CV) | 0.49668018 |
Kurtosis | 4.3187808 |
Mean | 4.52 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.0856684 |
Sum | 452 |
Variance | 5.04 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
3 | 45 | 45.0% |
4 | 23 | 23.0% |
5 | 12 | 12.0% |
6 | 6 | 6.0% |
7 | 4 | 4.0% |
8 | 4 | 4.0% |
11 | 3 | 3.0% |
13 | 2 | 2.0% |
10 | 1 | 1.0% |
Minimum 10 values
Value | Count | Frequency (%) |
3 | 45 | 45.0% |
4 | 23 | 23.0% |
5 | 12 | 12.0% |
6 | 6 | 6.0% |
7 | 4 | 4.0% |
8 | 4 | 4.0% |
10 | 1 | 1.0% |
11 | 3 | 3.0% |
13 | 2 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
13 | 2 | 2.0% |
11 | 3 | 3.0% |
10 | 1 | 1.0% |
8 | 4 | 4.0% |
7 | 4 | 4.0% |
6 | 6 | 6.0% |
5 | 12 | 12.0% |
4 | 23 | 23.0% |
3 | 45 | 45.0% |
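The non-integer 95-th percentile of 이용건수 (10.05) comes from interpolating between two neighbouring observations. The sketch below rebuilds the 100 observations from the frequency table and applies linear interpolation between order statistics; that this is the method the profiler uses is an assumption, although it reproduces every quantile reported for this variable. The `percentile` helper is hypothetical.

```python
# Value -> count, copied from the 이용건수 frequency table in this report.
counts = {3: 45, 4: 23, 5: 12, 6: 6, 7: 4, 8: 4, 10: 1, 11: 3, 13: 2}

# Rebuild the sorted list of 100 observations.
values = sorted(v for v, c in counts.items() for _ in range(c))


def percentile(sorted_values, q):
    """Percentile by linear interpolation between order statistics, q in [0, 1]."""
    pos = q * (len(sorted_values) - 1)   # fractional index into the sorted data
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


q1 = percentile(values, 0.25)
median = percentile(values, 0.50)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)

print(q1, median, q3, p95, q3 - q1)
```

For the 95-th percentile the fractional index is 0.95 × 99 = 94.05, which falls between the single observation equal to 10 (index 94) and the first observation equal to 11 (index 95), giving 10 + 0.05 × 1 = 10.05.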
Correlations
Variable | 법정동코드 | 년월 | 업종코드 | 업종명 | 성별 | 연령 | 금액 | 이용건수 |
---|---|---|---|---|---|---|---|---|
법정동코드 | 1.000 | 0.000 | 0.756 | 0.927 | 0.515 | 0.424 | 0.443 | 0.429 |
년월 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.300 | 0.026 |
업종코드 | 0.756 | 0.000 | 1.000 | 1.000 | 0.656 | 0.653 | 0.796 | 0.153 |
업종명 | 0.927 | 0.000 | 1.000 | 1.000 | 0.748 | 0.626 | 0.590 | 0.479 |
성별 | 0.515 | 0.000 | 0.656 | 0.748 | 1.000 | 0.848 | 0.643 | 0.288 |
연령 | 0.424 | 0.000 | 0.653 | 0.626 | 0.848 | 1.000 | 0.511 | 0.000 |
금액 | 0.443 | 0.300 | 0.796 | 0.590 | 0.643 | 0.511 | 1.000 | 0.417 |
이용건수 | 0.429 | 0.026 | 0.153 | 0.479 | 0.288 | 0.000 | 0.417 | 1.000 |
Variable | 성별 | 연령 | 업종명 |
---|---|---|---|
성별 | 1.000 | 0.717 | 0.633 |
연령 | 0.717 | 1.000 | 0.350 |
업종명 | 0.633 | 0.350 | 1.000 |
Variable | 법정동코드 | 년월 | 업종코드 | 금액 | 이용건수 | 업종명 | 성별 | 연령 |
---|---|---|---|---|---|---|---|---|
법정동코드 | 1.000 | 0.151 | -0.613 | 0.387 | 0.027 | 0.565 | 0.355 | 0.213 |
년월 | 0.151 | 1.000 | -0.189 | 0.067 | 0.068 | 0.033 | 0.101 | 0.000 |
업종코드 | -0.613 | -0.189 | 1.000 | -0.606 | 0.036 | 0.984 | 0.619 | 0.417 |
금액 | 0.387 | 0.067 | -0.606 | 1.000 | 0.200 | 0.328 | 0.474 | 0.239 |
이용건수 | 0.027 | 0.068 | 0.036 | 0.200 | 1.000 | 0.265 | 0.145 | 0.000 |
업종명 | 0.565 | 0.033 | 0.984 | 0.328 | 0.265 | 1.000 | 0.633 | 0.350 |
성별 | 0.355 | 0.101 | 0.619 | 0.474 | 0.145 | 0.633 | 1.000 | 0.717 |
연령 | 0.213 | 0.000 | 0.417 | 0.239 | 0.000 | 0.350 | 0.717 | 1.000 |
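The "High correlation" alerts listed near the top of the report can be reproduced from the matrix directly above. The sketch below flags every pair whose absolute coefficient is at least 0.5; the 0.5 cut-off is an assumption chosen because it matches the alerts, not a value stated in the report, and the `high_correlations` helper is hypothetical.

```python
# Variable order and coefficients copied from the correlation matrix directly above.
names = ["법정동코드", "년월", "업종코드", "금액", "이용건수", "업종명", "성별", "연령"]
matrix = [
    [1.000, 0.151, -0.613, 0.387, 0.027, 0.565, 0.355, 0.213],
    [0.151, 1.000, -0.189, 0.067, 0.068, 0.033, 0.101, 0.000],
    [-0.613, -0.189, 1.000, -0.606, 0.036, 0.984, 0.619, 0.417],
    [0.387, 0.067, -0.606, 1.000, 0.200, 0.328, 0.474, 0.239],
    [0.027, 0.068, 0.036, 0.200, 1.000, 0.265, 0.145, 0.000],
    [0.565, 0.033, 0.984, 0.328, 0.265, 1.000, 0.633, 0.350],
    [0.355, 0.101, 0.619, 0.474, 0.145, 0.633, 1.000, 0.717],
    [0.213, 0.000, 0.417, 0.239, 0.000, 0.350, 0.717, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in the report


def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    result = {}
    for i, name in enumerate(names):
        partners = [
            names[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            result[name] = partners
    return result


alerts = high_correlations(names, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(name, "->", ", ".join(partners))
```

With this cut-off, 년월 and 이용건수 produce no alert, and the partner counts for the other six variables agree with the alert text (for example, 업종코드 is flagged against four other fields).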
First rows
Index | 법정동코드 | 년월 | 업종코드 | 업종명 | 성별 | 연령 | 금액 | 이용건수 |
---|---|---|---|---|---|---|---|---|
0 | 11110106 | 1909 | 2104 | 한식 | 2.여성 | 10.55세미만 | 83000 | 5 |
1 | 11110106 | 1907 | 2104 | 한식 | 2.여성 | 08.45세미만 | 18500 | 3 |
2 | 11110106 | 1903 | 2104 | 한식 | 1.남성 | 12.65세미만 | 84000 | 3 |
3 | 11110106 | 1810 | 2104 | 한식 | 2.여성 | 10.55세미만 | 50500 | 3 |
4 | 11110107 | 1906 | 8201 | 용역서비스업 | 2.여성 | 11.60세미만 | 11400 | 3 |
5 | 11110107 | 1903 | 8201 | 용역서비스업 | 1.남성 | 08.45세미만 | 25000 | 3 |
6 | 11110107 | 1904 | 5610 | 주차장 | 1.남성 | 11.60세미만 | 4700 | 3 |
7 | 11110107 | 1909 | 8201 | 용역서비스업 | 2.여성 | 07.40세미만 | 15000 | 3 |
8 | 11110107 | 1907 | 8201 | 용역서비스업 | 1.남성 | 11.60세미만 | 6500 | 3 |
9 | 11110107 | 1901 | 5610 | 주차장 | 1.남성 | 10.55세미만 | 10400 | 4 |
Last rows
Index | 법정동코드 | 년월 | 업종코드 | 업종명 | 성별 | 연령 | 금액 | 이용건수 |
---|---|---|---|---|---|---|---|---|
90 | 11110113 | 1810 | 4112 | 편의점 | 2.여성 | 10.55세미만 | 19000 | 3 |
91 | 11110113 | 1909 | 4112 | 편의점 | 2.여성 | 10.55세미만 | 20420 | 3 |
92 | 11110114 | 1908 | 2004 | 커피전문점 | 2.여성 | 06.35세미만 | 21900 | 3 |
93 | 11110114 | 1909 | 2002 | 휴게음식점 | 2.여성 | 12.65세미만 | 19000 | 3 |
94 | 11110114 | 1909 | 2104 | 한식 | 2.여성 | 10.55세미만 | 200000 | 4 |
95 | 11110117 | 1811 | 2104 | 한식 | 2.여성 | 05.30세미만 | 49500 | 4 |
96 | 11110117 | 1907 | 4112 | 편의점 | 1.남성 | 09.50세미만 | 44550 | 11 |
97 | 11110117 | 1907 | 2104 | 한식 | 2.여성 | 06.35세미만 | 41000 | 4 |
98 | 11110117 | 1907 | 2104 | 한식 | 1.남성 | 11.60세미만 | 138000 | 4 |
99 | 11110117 | 1906 | 4112 | 편의점 | 1.남성 | 09.50세미만 | 22800 | 4 |