Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 761.7 KiB |
Average record size in memory | 78.0 B |
Variable types
Categorical | 2 |
---|---|
Numeric | 4 |
Text | 2 |
Dataset
Description | Estimated sales of developed alley commercial districts in Gyeonggi Province (경기도 발달 골목 상권 추정 매출 현황) |
---|---|
Author | 경기도시장상권진흥원 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=B7BIA8NM4VIXSPPQQV0132089283&infSeq=1 |
Reproduction
Analysis started | 2024-03-12 23:35:50.717621 |
---|---|
Analysis finished | 2024-03-12 23:35:52.825239 |
Duration | 2.11 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
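A report like this one can be regenerated with ydata-profiling, the tool named in the Reproduction table. The following is a minimal sketch, not the configuration actually used (that is in config.json): the CSV path, the output file name, and the report title are hypothetical, and the file is assumed to load with pandas' default settings.

```python
# Dataset metadata copied from the "Dataset" table of this report.
DATASET_META = {
    "description": "경기도 발달 골목 상권 추정 매출 현황",
    "author": "경기도시장상권진흥원",
    "url": (
        "https://data.gg.go.kr/portal/data/service/selectServicePage.do"
        "?&infId=B7BIA8NM4VIXSPPQQV0132089283&infSeq=1"
    ),
}


def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a local CSV export and write the HTML report to out_path."""
    # Imported lazily so this module loads even without the profiling stack.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # csv_path is a hypothetical local export of the dataset; an explicit
    # encoding argument may be needed depending on how the file was saved.
    df = pd.read_csv(csv_path)
    report = ProfileReport(df, title="Profile report", dataset=DATASET_META)
    report.to_file(out_path)
```

Calling `build_report("sales.csv")` (a hypothetical file name) would write `report.html` next to the script.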
기준연도
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2023 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2023 |
---|---|
2nd row | 2023 |
3rd row | 2023 |
4th row | 2023 |
5th row | 2023 |
Common Values
Value | Count | Frequency (%) |
2023 | 10000 | 100.0% |
기준분기
Categorical
CONSTANT
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
1 |
---|
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 1 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
1 | 10000 | 100.0% |
상권ID
Real number (ℝ)
Distinct | 1623 |
---|---|
Distinct (%) | 16.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 804.9223 |
Minimum | 1 |
---|---|
Maximum | 1865 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 67 |
Q1 | 343 |
median | 704 |
Q3 | 1269 |
95-th percentile | 1735 |
Maximum | 1865 |
Range | 1864 |
Interquartile range (IQR) | 926 |
Descriptive statistics
Standard deviation | 543.07291 |
---|---|
Coefficient of variation (CV) | 0.67468986 |
Kurtosis | -1.125771 |
Mean | 804.9223 |
Median Absolute Deviation (MAD) | 442 |
Skewness | 0.35083869 |
Sum | 8049223 |
Variance | 294928.19 |
Monotonicity | Not monotonic |
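Several of the figures above are derived from others in the same tables, so they can be cross-checked by hand. The sketch below recomputes the range, IQR, coefficient of variation, and variance from the reported minimum, maximum, quartiles, mean, and standard deviation; the variable names are illustrative, and small differences are expected because the inputs are already rounded.

```python
import math

# Values copied from the 상권ID tables above.
minimum, maximum = 1, 1865
q1, q3 = 343, 1269
mean, std = 804.9223, 543.07291

value_range = maximum - minimum  # reported Range: 1864
iqr = q3 - q1                    # reported IQR: 926
cv = std / mean                  # reported CV: 0.67468986
variance = std ** 2              # reported Variance: 294928.19

# Compare against the reported values, allowing for input rounding.
print(value_range == 1864, iqr == 926)
print(math.isclose(cv, 0.67468986, abs_tol=1e-6))
print(math.isclose(variance, 294928.19, abs_tol=0.05))
```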
Common values
Value | Count | Frequency (%) |
34 | 22 | 0.2% |
124 | 21 | 0.2% |
888 | 19 | 0.2% |
143 | 19 | 0.2% |
93 | 19 | 0.2% |
767 | 19 | 0.2% |
366 | 19 | 0.2% |
363 | 18 | 0.2% |
82 | 18 | 0.2% |
489 | 18 | 0.2% |
Other values (1613) | 9808 | 98.1% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 9 | 0.1% |
3 | 3 | < 0.1% |
4 | 10 | 0.1% |
5 | 7 | 0.1% |
6 | 13 | 0.1% |
7 | 9 | 0.1% |
8 | 1 | < 0.1% |
9 | 16 | 0.2% |
10 | 11 | 0.1% |
11 | 6 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
1865 | 14 | 0.1% |
1864 | 5 | 0.1% |
1863 | 8 | 0.1% |
1862 | 8 | 0.1% |
1861 | 8 | 0.1% |
1860 | 6 | 0.1% |
1858 | 3 | < 0.1% |
1857 | 4 | < 0.1% |
1856 | 1 | < 0.1% |
1854 | 4 | < 0.1% |
상권명
Text
Distinct | 1544 |
---|---|
Distinct (%) | 15.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
중앙로 | 42 | 0.4% |
중앙로_2 | 27 | 0.3% |
사강장사강시장 | 22 | 0.2% |
산성대로 | 21 | 0.2% |
경안동주민센터 | 21 | 0.2% |
광명로 | 21 | 0.2% |
중앙로_1 | 21 | 0.2% |
영통로 | 20 | 0.2% |
매화로 | 19 | 0.2% |
엘에스로_2 | 19 | 0.2% |
Other values (1534) | 9767 | 97.7% |
Most occurring characters
Value | Count | Frequency (%) |
로 | 4912 | 7.8% |
길 | 2427 | 3.9% |
번 | 2407 | 3.8% |
1 | 2389 | 3.8% |
_ | 1785 | 2.8% |
2 | 1549 | 2.5% |
동 | 1423 | 2.3% |
터 | 1136 | 1.8% |
역 | 1104 | 1.8% |
주 | 957 | 1.5% |
Other values (357) | 42610 | 68.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 52867 | 84.3% |
Decimal Number | 7852 | 12.5% |
Connector Punctuation | 1785 | 2.8% |
Uppercase Letter | 195 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 4912 | 9.3% |
길 | 2427 | 4.6% |
번 | 2407 | 4.6% |
동 | 1423 | 2.7% |
터 | 1136 | 2.1% |
역 | 1104 | 2.1% |
주 | 957 | 1.8% |
구 | 885 | 1.7% |
대 | 843 | 1.6% |
센 | 815 | 1.5% |
Other values (341) | 35958 | 68.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 2389 | 30.4% |
2 | 1549 | 19.7% |
3 | 875 | 11.1% |
4 | 576 | 7.3% |
5 | 505 | 6.4% |
6 | 468 | 6.0% |
9 | 431 | 5.5% |
7 | 412 | 5.2% |
8 | 330 | 4.2% |
0 | 317 | 4.0% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 69 | 35.4% |
V | 51 | 26.2% |
G | 51 | 26.2% |
N | 18 | 9.2% |
I | 6 | 3.1% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1785 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 52867 | 84.3% |
Common | 9637 | 15.4% |
Latin | 195 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 4912 | 9.3% |
길 | 2427 | 4.6% |
번 | 2407 | 4.6% |
동 | 1423 | 2.7% |
터 | 1136 | 2.1% |
역 | 1104 | 2.1% |
주 | 957 | 1.8% |
구 | 885 | 1.7% |
대 | 843 | 1.6% |
센 | 815 | 1.5% |
Other values (341) | 35958 | 68.0% |
Common
Value | Count | Frequency (%) |
1 | 2389 | 24.8% |
_ | 1785 | 18.5% |
2 | 1549 | 16.1% |
3 | 875 | 9.1% |
4 | 576 | 6.0% |
5 | 505 | 5.2% |
6 | 468 | 4.9% |
9 | 431 | 4.5% |
7 | 412 | 4.3% |
8 | 330 | 3.4% |
Latin
Value | Count | Frequency (%) |
C | 69 | 35.4% |
V | 51 | 26.2% |
G | 51 | 26.2% |
N | 18 | 9.2% |
I | 6 | 3.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 52867 | 84.3% |
ASCII | 9832 | 15.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
로 | 4912 | 9.3% |
길 | 2427 | 4.6% |
번 | 2407 | 4.6% |
동 | 1423 | 2.7% |
터 | 1136 | 2.1% |
역 | 1104 | 2.1% |
주 | 957 | 1.8% |
구 | 885 | 1.7% |
대 | 843 | 1.6% |
센 | 815 | 1.5% |
Other values (341) | 35958 | 68.0% |
ASCII
Value | Count | Frequency (%) |
1 | 2389 | 24.3% |
_ | 1785 | 18.2% |
2 | 1549 | 15.8% |
3 | 875 | 8.9% |
4 | 576 | 5.9% |
5 | 505 | 5.1% |
6 | 468 | 4.8% |
9 | 431 | 4.4% |
7 | 412 | 4.2% |
8 | 330 | 3.4% |
Other values (6) | 512 | 5.2% |
산업분류코드
Real number (ℝ)
Distinct | 68 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 64140.843 |
Minimum | 47121 |
---|---|
Maximum | 96912 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 47121 |
---|---|
5-th percentile | 47122 |
Q1 | 47511 |
median | 56111 |
Q3 | 86202 |
95-th percentile | 96113 |
Maximum | 96912 |
Range | 49791 |
Interquartile range (IQR) | 38691 |
Descriptive statistics
Standard deviation | 19809.472 |
---|---|
Coefficient of variation (CV) | 0.30884334 |
Kurtosis | -1.39729 |
Mean | 64140.843 |
Median Absolute Deviation (MAD) | 8799 |
Skewness | 0.65631651 |
Sum | 6.4140843 × 10^8 |
Variance | 3.9241519 × 10^8 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
56123 | 407 | 4.1% |
56111 | 399 | 4.0% |
96112 | 356 | 3.6% |
47122 | 329 | 3.3% |
47219 | 319 | 3.2% |
56194 | 314 | 3.1% |
96113 | 259 | 2.6% |
47811 | 253 | 2.5% |
56219 | 248 | 2.5% |
91223 | 241 | 2.4% |
Other values (58) | 6875 | 68.8% |
Minimum 10 values
Value | Count | Frequency (%) |
47121 | 224 | 2.2% |
47122 | 329 | 3.3% |
47129 | 164 | 1.6% |
47211 | 44 | 0.4% |
47212 | 205 | 2.1% |
47217 | 131 | 1.3% |
47219 | 319 | 3.2% |
47311 | 159 | 1.6% |
47312 | 143 | 1.4% |
47320 | 133 | 1.3% |
Maximum 10 values
Value | Count | Frequency (%) |
96912 | 222 | 2.2% |
96119 | 102 | 1.0% |
96113 | 259 | 2.6% |
96112 | 356 | 3.6% |
95310 | 112 | 1.1% |
95213 | 96 | 1.0% |
95212 | 215 | 2.1% |
91223 | 241 | 2.4% |
91222 | 185 | 1.8% |
91136 | 81 | 0.8% |
산업분류코드명
Text
Distinct | 68 |
---|---|
Distinct (%) | 0.7% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 23 |
---|---|
Median length | 18 |
Mean length | 9.6644 |
Min length | 3 |
Characters and Unicode
Total characters | 96644 |
---|---|
Distinct characters | 160 |
Distinct categories | 3 |
Distinct scripts | 2 |
Distinct blocks | 3 |
Unique
Unique | 2 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | 슈퍼마켓 |
---|---|
2nd row | 의약품 및 의료용품 소매업 |
3rd row | 중식 음식점업 |
4th row | 기타 식료품 소매업 |
5th row | 한식 일반 음식점업 |
Value | Count | Frequency (%) |
소매업 | 3873 | 13.6% |
및 | 2509 | 8.8% |
기타 | 1745 | 6.1% |
음식점업 | 1543 | 5.4% |
운영업 | 936 | 3.3% |
미용업 | 717 | 2.5% |
일반 | 579 | 2.0% |
가정용 | 466 | 1.6% |
서양식 | 407 | 1.4% |
한식 | 402 | 1.4% |
Other values (115) | 15258 | 53.7% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 18435 | 19.1% |
업 | 8394 | 8.7% |
소 | 4032 | 4.2% |
매 | 3873 | 4.0% |
식 | 3545 | 3.7% |
용 | 2731 | 2.8% |
품 | 2632 | 2.7% |
및 | 2509 | 2.6% |
점 | 2346 | 2.4% |
기 | 2328 | 2.4% |
Other values (150) | 45819 | 47.4% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 77470 | 80.2% |
Space Separator | 18435 | 19.1% |
Other Punctuation | 739 | 0.8% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
업 | 8394 | 10.8% |
소 | 4032 | 5.2% |
매 | 3873 | 5.0% |
식 | 3545 | 4.6% |
용 | 2731 | 3.5% |
품 | 2632 | 3.4% |
및 | 2509 | 3.2% |
점 | 2346 | 3.0% |
기 | 2328 | 3.0% |
타 | 1745 | 2.3% |
Other values (148) | 43335 | 55.9% |
Space Separator
Value | Count | Frequency (%) |
(space) | 18435 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 739 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 77470 | 80.2% |
Common | 19174 | 19.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
업 | 8394 | 10.8% |
소 | 4032 | 5.2% |
매 | 3873 | 5.0% |
식 | 3545 | 4.6% |
용 | 2731 | 3.5% |
품 | 2632 | 3.4% |
및 | 2509 | 3.2% |
점 | 2346 | 3.0% |
기 | 2328 | 3.0% |
타 | 1745 | 2.3% |
Other values (148) | 43335 | 55.9% |
Common
Value | Count | Frequency (%) |
(space) | 18435 | 96.1% |
, | 739 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 77306 | 80.0% |
ASCII | 19174 | 19.8% |
Compat Jamo | 164 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 18435 | 96.1% |
, | 739 | 3.9% |
Hangul
Value | Count | Frequency (%) |
업 | 8394 | 10.9% |
소 | 4032 | 5.2% |
매 | 3873 | 5.0% |
식 | 3545 | 4.6% |
용 | 2731 | 3.5% |
품 | 2632 | 3.4% |
및 | 2509 | 3.2% |
점 | 2346 | 3.0% |
기 | 2328 | 3.0% |
타 | 1745 | 2.3% |
Other values (147) | 43171 | 55.8% |
Compat Jamo
Value | Count | Frequency (%) |
ㆍ | 164 | 100.0% |
매출금액
Real number (ℝ)
HIGH CORRELATION
SKEWED
Distinct | 9997 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.2381335 × 10^8 |
Minimum | 6 |
---|---|
Maximum | 1.9401094 × 10^10 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 6 |
---|---|
5-th percentile | 611005.85 |
Q1 | 7039251.5 |
median | 32974990 |
Q3 | 1.1827653 × 10^8 |
95-th percentile | 5.0755985 × 10^8 |
Maximum | 1.9401094 × 10^10 |
Range | 1.9401094 × 10^10 |
Interquartile range (IQR) | 1.1123728 × 10^8 |
Descriptive statistics
Standard deviation | 3.4606227 × 10^8 |
---|---|
Coefficient of variation (CV) | 2.7950319 |
Kurtosis | 1018.3753 |
Mean | 1.2381335 × 10^8 |
Median Absolute Deviation (MAD) | 30719238 |
Skewness | 21.87634 |
Sum | 1.2381335 × 10^12 |
Variance | 1.1975909 × 10^17 |
Monotonicity | Not monotonic |
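The SKEWED alert on 매출금액 can be illustrated from the quantiles alone. The sketch below recomputes the IQR, compares the mean with the median, and evaluates an upper outlier fence with the common 1.5 × IQR rule. That rule is an assumption brought in for illustration; the report itself does not apply it.

```python
# Values copied from the 매출금액 tables above.
q1, median, q3 = 7039251.5, 32974990, 1.1827653e8
p95, mean = 5.0755985e8, 1.2381335e8

iqr = q3 - q1                    # reported IQR: about 1.1123728e8
mean_to_median = mean / median   # well above 1 for a right-skewed variable
upper_fence = q3 + 1.5 * iqr     # conventional Tukey fence (assumed rule)

print(f"IQR: {iqr:.4e}")
print(f"mean / median: {mean_to_median:.2f}")
print(f"upper fence: {upper_fence:.4e}")
# The 95th percentile lies above the fence, so more than 5% of the rows
# would be flagged as high outliers under this rule.
print(f"95th percentile above fence: {p95 > upper_fence}")
```

With this much of the distribution beyond the fence, a log scale or rank-based summary is probably more informative than the raw mean and standard deviation.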
Common values
Value | Count | Frequency (%) |
285122 | 2 | < 0.1% |
291529 | 2 | < 0.1% |
3850020 | 2 | < 0.1% |
7797062 | 1 | < 0.1% |
129795844 | 1 | < 0.1% |
1000646855 | 1 | < 0.1% |
881782 | 1 | < 0.1% |
23233393 | 1 | < 0.1% |
104922135 | 1 | < 0.1% |
306050663 | 1 | < 0.1% |
Other values (9987) | 9987 | 99.9% |
Minimum 10 values
Value | Count | Frequency (%) |
6 | 1 | < 0.1% |
27 | 1 | < 0.1% |
1277 | 1 | < 0.1% |
1398 | 1 | < 0.1% |
2039 | 1 | < 0.1% |
2836 | 1 | < 0.1% |
2960 | 1 | < 0.1% |
5509 | 1 | < 0.1% |
5673 | 1 | < 0.1% |
6244 | 1 | < 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
19401094394 | 1 | < 0.1% |
8243625581 | 1 | < 0.1% |
5005607514 | 1 | < 0.1% |
4951757501 | 1 | < 0.1% |
4591174666 | 1 | < 0.1% |
4043437183 | 1 | < 0.1% |
3758113640 | 1 | < 0.1% |
3620629784 | 1 | < 0.1% |
3594047276 | 1 | < 0.1% |
3376282422 | 1 | < 0.1% |
매출건수
Real number (ℝ)
HIGH CORRELATION
Distinct | 2954 |
---|---|
Distinct (%) | 29.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1543.8451 |
Minimum | 1 |
---|---|
Maximum | 112783 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 30 |
median | 164 |
Q3 | 943.25 |
95-th percentile | 7867.05 |
Maximum | 112783 |
Range | 112782 |
Interquartile range (IQR) | 913.25 |
Descriptive statistics
Standard deviation | 4658.6147 |
---|---|
Coefficient of variation (CV) | 3.0175402 |
Kurtosis | 109.29248 |
Mean | 1543.8451 |
Median Absolute Deviation (MAD) | 157 |
Skewness | 8.3454143 |
Sum | 15438451 |
Variance | 21702691 |
Monotonicity | Not monotonic |
Common values
Value | Count | Frequency (%) |
2 | 217 | 2.2% |
4 | 161 | 1.6% |
1 | 150 | 1.5% |
3 | 143 | 1.4% |
6 | 128 | 1.3% |
9 | 123 | 1.2% |
5 | 115 | 1.1% |
7 | 107 | 1.1% |
8 | 100 | 1.0% |
10 | 97 | 1.0% |
Other values (2944) | 8659 | 86.6% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 150 | 1.5% |
2 | 217 | 2.2% |
3 | 143 | 1.4% |
4 | 161 | 1.6% |
5 | 115 | 1.1% |
6 | 128 | 1.3% |
7 | 107 | 1.1% |
8 | 100 | 1.0% |
9 | 123 | 1.2% |
10 | 97 | 1.0% |
Maximum 10 values
Value | Count | Frequency (%) |
112783 | 1 | < 0.1% |
82058 | 1 | < 0.1% |
79086 | 1 | < 0.1% |
78446 | 1 | < 0.1% |
77147 | 1 | < 0.1% |
75061 | 1 | < 0.1% |
74420 | 1 | < 0.1% |
66105 | 1 | < 0.1% |
64369 | 1 | < 0.1% |
60610 | 1 | < 0.1% |
Variable | 상권ID | 산업분류코드 | 산업분류코드명 | 매출금액 | 매출건수 |
---|---|---|---|---|---|
상권ID | 1.000 | 0.058 | 0.149 | 0.055 | 0.061 |
산업분류코드 | 0.058 | 1.000 | 1.000 | 0.032 | 0.077 |
산업분류코드명 | 0.149 | 1.000 | 1.000 | 0.098 | 0.300 |
매출금액 | 0.055 | 0.032 | 0.098 | 1.000 | 0.250 |
매출건수 | 0.061 | 0.077 | 0.300 | 0.250 | 1.000 |
Variable | 상권ID | 산업분류코드 | 매출금액 | 매출건수 |
---|---|---|---|---|
상권ID | 1.000 | -0.013 | -0.159 | -0.118 |
산업분류코드 | -0.013 | 1.000 | 0.005 | -0.069 |
매출금액 | -0.159 | 0.005 | 1.000 | 0.640 |
매출건수 | -0.118 | -0.069 | 0.640 | 1.000 |
Row index | 기준연도 | 기준분기 | 상권ID | 상권명 | 산업분류코드 | 산업분류코드명 | 매출금액 | 매출건수 |
---|---|---|---|---|---|---|---|---|
15382 | 2023 | 1 | 1409 | 김포대로319번길 | 47121 | 슈퍼마켓 | 7797062 | 659 |
2666 | 2023 | 1 | 426 | 인덕원역_2번출구 | 47811 | 의약품 및 의료용품 소매업 | 105005999 | 5851 |
31273 | 2023 | 1 | 217 | 보정동주민센터 | 56121 | 중식 음식점업 | 195053566 | 2139 |
11528 | 2023 | 1 | 1754 | 서정마을2로7번길 | 47219 | 기타 식료품 소매업 | 64563357 | 1582 |
7468 | 2023 | 1 | 1148 | 역전로 | 56111 | 한식 일반 음식점업 | 70797234 | 664 |
9702 | 2023 | 1 | 1537 | 경의로 | 55901 | 기숙사 및 고시원 운영업 | 1348382 | 2 |
39100 | 2023 | 1 | 833 | 철산역_4번출구 | 85629 | 기타 예술학원 | 10076797 | 185 |
6027 | 2023 | 1 | 1081 | 부부로2길 | 47813 | 화장품, 비누 및 방향제 소매업 | 1919764 | 12 |
26134 | 2023 | 1 | 1664 | 평화로 | 56121 | 중식 음식점업 | 4750743 | 47 |
30909 | 2023 | 1 | 569 | 오리로_3 | 56194 | 김밥 및 기타 간이 음식점업 | 9329038 | 226 |
Row index | 기준연도 | 기준분기 | 상권ID | 상권명 | 산업분류코드 | 산업분류코드명 | 매출금액 | 매출건수 |
---|---|---|---|---|---|---|---|---|
4851 | 2023 | 1 | 1015 | 율마로438번길 | 56123 | 서양식 음식점업 | 3168337 | 578 |
14506 | 2023 | 1 | 1478 | 다리간2길 | 47411 | 남자용 겉옷 소매업 | 6108894 | 5 |
23735 | 2023 | 1 | 151 | 광덕4로_1 | 56111 | 한식 일반 음식점업 | 356329413 | 5247 |
37911 | 2023 | 1 | 830 | 쇠재안길 | 47311 | 컴퓨터 및 주변장치, 소프트웨어 소매업 | 4880563 | 37 |
1745 | 2023 | 1 | 416 | 시흥정왕동우체국 | 47122 | 체인화 편의점 | 1126925391 | 66105 |
4399 | 2023 | 1 | 655 | 포천우체국 | 47429 | 섬유 원단, 실 및 기타 섬유제품 소매업 | 10619213 | 10 |
7613 | 2023 | 1 | 1091 | 수목원로468번길 | 91135 | 당구장 운영업 | 15000780 | 140 |
6841 | 2023 | 1 | 1043 | 한글세계평화지도전시관 | 91135 | 당구장 운영업 | 14465193 | 136 |
15586 | 2023 | 1 | 1213 | 직행고속버스정류소 | 56111 | 한식 일반 음식점업 | 33522523 | 1231 |
4964 | 2023 | 1 | 858 | 대화역_4번출구 | 47312 | 통신기기 소매업 | 18496666 | 6 |