Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 4407 |
Missing cells | 1664 |
Missing cells (%) | 7.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 180.9 KiB |
Average record size in memory | 42.0 B |
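The summary figures above can be cross-checked against each other. The sketch below assumes that the missing-cell percentage is missing cells divided by (variables × observations) and that 1 KiB is 1024 bytes. Both are assumptions about how the profiler derives these numbers, although they agree with the values shown.

```python
# Cross-check the dataset statistics reported above.
n_variables = 5
n_observations = 4407
missing_cells = 1664
total_size_kib = 180.9

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

The 1664 missing cells are also the sum of the per-column missing counts listed in the alerts (219 + 1445).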
Variable types
Categorical | 1 |
---|---|
Text | 2 |
Numeric | 2 |
Dataset
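Description | This file lists the floor area, size, and location of the shops in the Garak, Gangseo, and Yanggok wholesale markets and the eco-friendly distribution center (친환경유통센터). From the shop information you can determine, for each market, how many shops can be occupied and how much shop floor area is provided. |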
---|---|
Author | 서울특별시농수산식품공사 (Seoul Agro-Fisheries & Food Corporation) |
URL | https://www.data.go.kr/data/15123533/fileData.do |
전용면적 is highly overall correlated with 공용면적 | High correlation |
공용면적 is highly overall correlated with 전용면적 | High correlation |
시장구분 is highly imbalanced (58.6%) | Imbalance |
전용면적 has 219 (5.0%) missing values | Missing |
공용면적 has 1445 (32.8%) missing values | Missing |
공용면적 is highly skewed (γ1 = 27.10600015) | Skewed |
시설번호 has unique values | Unique |
공용면적 has 413 (9.4%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 12:43:04.652863 |
---|---|
Analysis finished | 2023-12-12 12:43:05.996260 |
Duration | 1.34 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
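A report of this kind can be regenerated with ydata-profiling. The sketch below is a minimal outline, not the configuration actually used here: the function name `build_report`, the file paths, the report title, and the `cp949` encoding (common for Korean public-data CSV files) are all assumptions.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV file and write the result as an HTML report.

    Hypothetical helper: paths, title, and default encoding are assumptions.
    """
    # Imported lazily so the function can be defined even when the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
```

Calling `build_report("shops.csv", "report.html")` with a locally downloaded copy of the file would produce an HTML page with the same sections as this report; exact numbers may differ with the library version and settings.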
시장구분
Categorical
IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 34.6 KiB |
Length
Max length | 3 |
---|---|
Median length | 2 |
Mean length | 2.0036306 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 가락 |
---|---|
2nd row | 가락 |
3rd row | 가락 |
4th row | 가락 |
5th row | 가락 |
Common Values
Value | Count | Frequency (%) |
가락 | 3648 | 82.8% |
강서 | 546 | 12.4% |
양곡 | 197 | 4.5% |
친환경 | 16 | 0.4% |
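The 58.6% imbalance figure in the alerts can be reproduced from the counts above with a normalized-entropy score: one minus the Shannon entropy of the class distribution divided by the maximum possible entropy for that number of classes. That this is the profiler's exact formula is an assumption; the sketch only shows that this definition matches the reported value.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no distribution to be imbalanced
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(k)

# Counts taken from the Common Values table for 시장구분.
market_counts = {"가락": 3648, "강서": 546, "양곡": 197, "친환경": 16}
score = imbalance_score(list(market_counts.values()))
print(f"imbalance: {score:.1%}")
```

A perfectly even split across the four markets would score 0, so the high score here reflects that 가락 alone accounts for most rows.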
시설번호
Text
UNIQUE
 
Distinct | 4407 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 34.6 KiB |
Value | Count | Frequency (%) |
1113181 | 1 | < 0.1% |
g1ah694 | 1 | < 0.1% |
g1ah671 | 1 | < 0.1% |
g1ah681 | 1 | < 0.1% |
g1ah691 | 1 | < 0.1% |
g1ah692 | 1 | < 0.1% |
g1ah693 | 1 | < 0.1% |
g1ahb21 | 1 | < 0.1% |
g1ah751 | 1 | < 0.1% |
g1ah051 | 1 | < 0.1% |
Other values (4397) | 4397 | 99.8% |
Most occurring characters
Value | Count | Frequency (%) |
1 | 12101 | 39.2% |
0 | 3362 | 10.9% |
2 | 3252 | 10.5% |
3 | 1528 | 5.0% |
G | 1446 | 4.7% |
8 | 1167 | 3.8% |
5 | 1116 | 3.6% |
4 | 1101 | 3.6% |
6 | 879 | 2.8% |
7 | 831 | 2.7% |
Other values (32) | 4066 | 13.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26142 | 84.7% |
Uppercase Letter | 3354 | 10.9% |
Lowercase Letter | 1353 | 4.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
a | 685 | 50.6% |
b | 390 | 28.8% |
j | 141 | 10.4% |
c | 43 | 3.2% |
d | 29 | 2.1% |
e | 17 | 1.3% |
f | 13 | 1.0% |
g | 11 | 0.8% |
h | 7 | 0.5% |
i | 4 | 0.3% |
Other values (11) | 13 | 1.0% |
Uppercase Letter
Value | Count | Frequency (%) |
G | 1446 | 43.1% |
K | 531 | 15.8% |
B | 314 | 9.4% |
F | 274 | 8.2% |
A | 226 | 6.7% |
D | 220 | 6.6% |
C | 169 | 5.0% |
H | 81 | 2.4% |
E | 44 | 1.3% |
S | 28 | 0.8% |
Decimal Number
Value | Count | Frequency (%) |
1 | 12101 | 46.3% |
0 | 3362 | 12.9% |
2 | 3252 | 12.4% |
3 | 1528 | 5.8% |
8 | 1167 | 4.5% |
5 | 1116 | 4.3% |
4 | 1101 | 4.2% |
6 | 879 | 3.4% |
7 | 831 | 3.2% |
9 | 805 | 3.1% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 26142 | 84.7% |
Latin | 4707 | 15.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
G | 1446 | 30.7% |
a | 685 | 14.6% |
K | 531 | 11.3% |
b | 390 | 8.3% |
B | 314 | 6.7% |
F | 274 | 5.8% |
A | 226 | 4.8% |
D | 220 | 4.7% |
C | 169 | 3.6% |
j | 141 | 3.0% |
Other values (22) | 311 | 6.6% |
Common
Value | Count | Frequency (%) |
1 | 12101 | 46.3% |
0 | 3362 | 12.9% |
2 | 3252 | 12.4% |
3 | 1528 | 5.8% |
8 | 1167 | 4.5% |
5 | 1116 | 4.3% |
4 | 1101 | 4.2% |
6 | 879 | 3.4% |
7 | 831 | 3.2% |
9 | 805 | 3.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 30849 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 12101 | 39.2% |
0 | 3362 | 10.9% |
2 | 3252 | 10.5% |
3 | 1528 | 5.0% |
G | 1446 | 4.7% |
8 | 1167 | 3.8% |
5 | 1116 | 3.6% |
4 | 1101 | 3.6% |
6 | 879 | 2.8% |
7 | 831 | 2.7% |
Other values (32) | 4066 | 13.2% |
시설명
Text
Distinct | 4405 |
---|---|
Distinct (%) | > 99.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 34.6 KiB |
Length
Max length | 31 |
---|---|
Median length | 29 |
Mean length | 18.540277 |
Min length | 5 |
Characters and Unicode
Total characters | 81707 |
---|---|
Distinct characters | 192 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 4403 |
---|---|
Unique (%) | 99.9% |
Sample
1st row | 청과물시장동 1층 318-1호 |
---|---|
2nd row | 청과물시장동 1층 319-1호 |
3rd row | 청과물시장동 1층 320-1호 |
4th row | 청과물시장동 1층 321-1호 |
5th row | 청과물시장동 1층 322-1호 |
Value | Count | Frequency (%) |
1층 | 2559 | 14.8% |
가락몰 | 1664 | 9.6% |
판매동 | 1374 | 7.9% |
지하1층 | 742 | 4.3% |
청과물시장동 | 687 | 4.0% |
청과부류 | 683 | 3.9% |
채소시장 | 502 | 2.9% |
수산물시장동 | 463 | 2.7% |
1동 | 371 | 2.1% |
지하2층 | 354 | 2.0% |
Other values (2104) | 7894 | 45.6% |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 12889 | 15.8% |
1 | 8887 | 10.9% |
호 | 4357 | 5.3% |
층 | 4306 | 5.3% |
동 | 4203 | 5.1% |
0 | 3732 | 4.6% |
- | 3534 | 4.3% |
2 | 2720 | 3.3% |
시 | 2347 | 2.9% |
장 | 2154 | 2.6% |
Other values (182) | 32578 | 39.9% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 43004 | 52.6% |
Decimal Number | 21295 | 26.1% |
Space Separator | 12889 | 15.8% |
Dash Punctuation | 3534 | 4.3% |
Uppercase Letter | 828 | 1.0% |
Open Punctuation | 77 | 0.1% |
Close Punctuation | 77 | 0.1% |
Other Punctuation | 2 | < 0.1% |
Other Symbol | 1 | < 0.1% |
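The category names in this table correspond to Unicode general categories. As a rough illustration of how such a breakdown can be computed, the sketch below counts categories for the first sample value of 시설명 using Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and this is not necessarily how the profiler implements it.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by two-letter Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# Codes used below: Lo = Other Letter, Nd = Decimal Number,
# Zs = Space Separator, Pd = Dash Punctuation.
sample = "청과물시장동 1층 318-1호"
counts = category_counts(sample)
for code, n in counts.most_common():
    print(code, n)
```

Summing such counters over every row of the column would give the totals shown in the table, with Hangul syllables counted as Other Letter.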
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
호 | 4357 | 10.1% |
층 | 4306 | 10.0% |
동 | 4203 | 9.8% |
시 | 2347 | 5.5% |
장 | 2154 | 5.0% |
가 | 1802 | 4.2% |
매 | 1706 | 4.0% |
청 | 1693 | 3.9% |
과 | 1692 | 3.9% |
락 | 1679 | 3.9% |
Other values (153) | 17065 | 39.7% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 181 | 21.9% |
C | 138 | 16.7% |
D | 115 | 13.9% |
B | 105 | 12.7% |
F | 84 | 10.1% |
H | 81 | 9.8% |
G | 63 | 7.6% |
E | 35 | 4.2% |
S | 12 | 1.4% |
K | 12 | 1.4% |
Decimal Number
Value | Count | Frequency (%) |
1 | 8887 | 41.7% |
0 | 3732 | 17.5% |
2 | 2720 | 12.8% |
3 | 1219 | 5.7% |
4 | 973 | 4.6% |
5 | 926 | 4.3% |
6 | 752 | 3.5% |
7 | 720 | 3.4% |
8 | 689 | 3.2% |
9 | 677 | 3.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 76 | 98.7% |
[ | 1 | 1.3% |
Close Punctuation
Value | Count | Frequency (%) |
) | 76 | 98.7% |
] | 1 | 1.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 12889 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3534 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 2 | 100.0% |
Other Symbol
Value | Count | Frequency (%) |
㈜ | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 43005 | 52.6% |
Common | 37874 | 46.4% |
Latin | 828 | 1.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
호 | 4357 | 10.1% |
층 | 4306 | 10.0% |
동 | 4203 | 9.8% |
시 | 2347 | 5.5% |
장 | 2154 | 5.0% |
가 | 1802 | 4.2% |
매 | 1706 | 4.0% |
청 | 1693 | 3.9% |
과 | 1692 | 3.9% |
락 | 1679 | 3.9% |
Other values (154) | 17066 | 39.7% |
Common
Value | Count | Frequency (%) |
(space) | 12889 | 34.0% |
1 | 8887 | 23.5% |
0 | 3732 | 9.9% |
- | 3534 | 9.3% |
2 | 2720 | 7.2% |
3 | 1219 | 3.2% |
4 | 973 | 2.6% |
5 | 926 | 2.4% |
6 | 752 | 2.0% |
7 | 720 | 1.9% |
Other values (7) | 1522 | 4.0% |
Latin
Value | Count | Frequency (%) |
A | 181 | 21.9% |
C | 138 | 16.7% |
D | 115 | 13.9% |
B | 105 | 12.7% |
F | 84 | 10.1% |
H | 81 | 9.8% |
G | 63 | 7.6% |
E | 35 | 4.2% |
S | 12 | 1.4% |
K | 12 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 43004 | 52.6% |
ASCII | 38702 | 47.4% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 12889 | 33.3% |
1 | 8887 | 23.0% |
0 | 3732 | 9.6% |
- | 3534 | 9.1% |
2 | 2720 | 7.0% |
3 | 1219 | 3.1% |
4 | 973 | 2.5% |
5 | 926 | 2.4% |
6 | 752 | 1.9% |
7 | 720 | 1.9% |
Other values (18) | 2350 | 6.1% |
Hangul
Value | Count | Frequency (%) |
호 | 4357 | 10.1% |
층 | 4306 | 10.0% |
동 | 4203 | 9.8% |
시 | 2347 | 5.5% |
장 | 2154 | 5.0% |
가 | 1802 | 4.2% |
매 | 1706 | 4.0% |
청 | 1693 | 3.9% |
과 | 1692 | 3.9% |
락 | 1679 | 3.9% |
Other values (153) | 17065 | 39.7% |
None
Value | Count | Frequency (%) |
㈜ | 1 | 100.0% |
전용면적
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 688 |
---|---|
Distinct (%) | 16.4% |
Missing | 219 |
Missing (%) | 5.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.713926 |
Minimum | 0 |
---|---|
Maximum | 3372 |
Zeros | 25 |
Zeros (%) | 0.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 38.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 6.5 |
Q1 | 12.63 |
median | 22.5 |
Q3 | 40.4 |
95-th percentile | 107.9895 |
Maximum | 3372 |
Range | 3372 |
Interquartile range (IQR) | 27.77 |
Descriptive statistics
Standard deviation | 101.38543 |
---|---|
Coefficient of variation (CV) | 2.5528936 |
Kurtosis | 377.26846 |
Mean | 39.713926 |
Median Absolute Deviation (MAD) | 13.74 |
Skewness | 15.837035 |
Sum | 166321.92 |
Variance | 10279.005 |
Monotonicity | Not monotonic |
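Several of the statistics above are derived from one another, which makes a quick consistency check possible. The sketch below recomputes them from the reported mean, standard deviation, and quartiles, assuming that the mean and sum are taken over non-missing values only (an assumption that the numbers support).

```python
import math

# Values copied from the 전용면적 tables above.
n_rows, n_missing = 4407, 219
mean, std = 39.713926, 101.38543
q1, q3 = 12.63, 40.4
minimum, maximum = 0.0, 3372.0

n_valid = n_rows - n_missing      # rows that actually carry a value
variance = std ** 2               # variance is the squared standard deviation
cv = std / mean                   # coefficient of variation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum
total = mean * n_valid            # sum over non-missing values

print(f"variance={variance:.3f} cv={cv:.7f} iqr={iqr:.2f}")
print(f"range={value_range:.0f} sum={total:.2f}")
```

A coefficient of variation above 2.5 together with a maximum of 3372 against a median of 22.5 indicates that a handful of very large units dominate the spread.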
Value | Count | Frequency (%) |
40.4 | 332 | 7.5% |
12.63 | 228 | 5.2% |
6.7 | 170 | 3.9% |
22.5 | 169 | 3.8% |
42.14 | 142 | 3.2% |
6.5 | 135 | 3.1% |
24.0 | 133 | 3.0% |
18.9 | 128 | 2.9% |
13.4 | 112 | 2.5% |
21.07 | 92 | 2.1% |
Other values (678) | 2547 | 57.8% |
(Missing) | 219 | 5.0% |
Value | Count | Frequency (%) |
0.0 | 25 | 0.6% |
1.25 | 1 | < 0.1% |
1.46 | 1 | < 0.1% |
1.69 | 1 | < 0.1% |
2.12 | 1 | < 0.1% |
2.23 | 1 | < 0.1% |
2.3 | 2 | < 0.1% |
3.0 | 1 | < 0.1% |
3.3 | 2 | < 0.1% |
3.5 | 1 | < 0.1% |
Value | Count | Frequency (%) |
3372.0 | 1 | < 0.1% |
1972.84 | 1 | < 0.1% |
1790.0 | 1 | < 0.1% |
1620.0 | 1 | < 0.1% |
1450.0 | 1 | < 0.1% |
1350.0 | 1 | < 0.1% |
1227.8 | 1 | < 0.1% |
1132.09 | 1 | < 0.1% |
912.0 | 1 | < 0.1% |
874.0 | 1 | < 0.1% |
공용면적
Real number (ℝ)
HIGH CORRELATION
  MISSING
  SKEWED
  ZEROS
 
Distinct | 326 |
---|---|
Distinct (%) | 11.0% |
Missing | 1445 |
Missing (%) | 32.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15.387355 |
Minimum | 0 |
---|---|
Maximum | 1333.45 |
Zeros | 413 |
Zeros (%) | 9.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 38.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 5.44 |
median | 10.67 |
Q3 | 22.73 |
95-th percentile | 38.10295 |
Maximum | 1333.45 |
Range | 1333.45 |
Interquartile range (IQR) | 17.29 |
Descriptive statistics
Standard deviation | 31.330845 |
---|---|
Coefficient of variation (CV) | 2.0361423 |
Kurtosis | 1072.1726 |
Mean | 15.387355 |
Median Absolute Deviation (MAD) | 8.65 |
Skewness | 27.106 |
Sum | 45577.346 |
Variance | 981.62187 |
Monotonicity | Not monotonic |
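The skewness of 27.106 says the distribution has a long right tail. As an illustration of what such a coefficient measures, here is a sketch of the bias-adjusted Fisher-Pearson sample skewness. Whether the profiler uses exactly this estimator is an assumption, and the input values in the example are toy data, not the real column.

```python
import math

def adjusted_skewness(values):
    """Bias-adjusted Fisher-Pearson skewness: n/((n-1)(n-2)) * sum(z**3)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(values) / n
    # Sample standard deviation (divides by n - 1).
    s = math.sqrt(sum((x - mean) ** 2 for x in values) / (n - 1))
    if s == 0:
        return 0.0  # constant data has no asymmetry
    z_cubed = sum(((x - mean) / s) ** 3 for x in values)
    return n / ((n - 1) * (n - 2)) * z_cubed

# Toy data only: symmetric values versus values with one large outlier.
print(adjusted_skewness([1, 2, 3, 4, 5]))
print(adjusted_skewness([1, 1, 1, 10]))
```

Symmetric data gives a value at or near zero, while a single large value, like the 1333.45 maximum in this column, pulls the coefficient strongly positive.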
Value | Count | Frequency (%) |
0.0 | 413 | 9.4% |
22.73 | 332 | 7.5% |
5.44 | 228 | 5.2% |
9.7 | 167 | 3.8% |
18.43 | 141 | 3.2% |
1.66 | 128 | 2.9% |
10.67 | 126 | 2.9% |
9.215 | 85 | 1.9% |
8.84 | 65 | 1.5% |
12.36 | 54 | 1.2% |
Other values (316) | 1223 | 27.8% |
(Missing) | 1445 | 32.8% |
Value | Count | Frequency (%) |
0.0 | 413 | 9.4% |
0.87 | 6 | 0.1% |
1.66 | 128 | 2.9% |
2.3 | 24 | 0.5% |
2.33 | 1 | < 0.1% |
2.34 | 15 | 0.3% |
2.45 | 16 | 0.4% |
2.5 | 14 | 0.3% |
2.67 | 9 | 0.2% |
2.79 | 2 | < 0.1% |
Value | Count | Frequency (%) |
1333.45 | 1 | < 0.1% |
285.97 | 1 | < 0.1% |
282.32 | 5 | 0.1% |
277.24 | 2 | < 0.1% |
179.77 | 1 | < 0.1% |
168.84 | 1 | < 0.1% |
152.861 | 1 | < 0.1% |
151.23 | 1 | < 0.1% |
144.0 | 1 | < 0.1% |
130.0 | 1 | < 0.1% |
Variable | 시장구분 | 전용면적 | 공용면적 |
---|---|---|---|
시장구분 | 1.000 | 0.041 | 0.076 |
전용면적 | 0.041 | 1.000 | 0.843 |
공용면적 | 0.076 | 0.843 | 1.000 |
Variable | 전용면적 | 공용면적 | 시장구분 |
---|---|---|---|
전용면적 | 1.000 | 0.863 | 0.028 |
공용면적 | 0.863 | 1.000 | 0.030 |
시장구분 | 0.028 | 0.030 | 1.000 |
Row | 시장구분 | 시설번호 | 시설명 | 전용면적 | 공용면적 |
---|---|---|---|---|---|
0 | 가락 | 1113181 | 청과물시장동 1층 318-1호 | 40.91 | 23.01 |
1 | 가락 | 1113191 | 청과물시장동 1층 319-1호 | 40.4 | 22.73 |
2 | 가락 | 1113201 | 청과물시장동 1층 320-1호 | 40.4 | 22.73 |
3 | 가락 | 1113211 | 청과물시장동 1층 321-1호 | 40.4 | 22.73 |
4 | 가락 | 1113221 | 청과물시장동 1층 322-1호 | 40.4 | 22.73 |
5 | 가락 | 1113231 | 청과물시장동 1층 323-1호 | 40.4 | 22.73 |
6 | 가락 | 1113241 | 청과물시장동 1층 324-1호 | 40.4 | 22.73 |
7 | 가락 | 1113251 | 청과물시장동 1층 325-1호 | 40.4 | 22.73 |
8 | 가락 | 1113261 | 청과물시장동 1층 326-1호 | 41.41 | 23.29 |
9 | 가락 | 1113271 | 청과물시장동 1층 327-1호 | 41.41 | 23.29 |
Row | 시장구분 | 시설번호 | 시설명 | 전용면적 | 공용면적 |
---|---|---|---|---|---|
4397 | 강서 | K116131 | 청과동 1층 613-1호 | 21.07 | 9.215 |
4398 | 강서 | K116132 | 청과동 1층 613-2호 | 21.07 | 9.215 |
4399 | 강서 | K116141 | 청과동 1층 614-1호 | 42.14 | 18.43 |
4400 | 강서 | K117011 | 청과동 1층 701-1호 | 21.07 | 9.215 |
4401 | 강서 | K117012 | 청과동 1층 701-2호 | 21.07 | 9.215 |
4402 | 강서 | K117021 | 청과동 1층 702-1호 | 42.14 | 18.43 |
4403 | 강서 | K117031 | 청과동 1층 703-1호 | 42.14 | 18.43 |
4404 | 강서 | K117041 | 청과동 1층 704-1호 | 42.14 | 18.43 |
4405 | 강서 | K117051 | 청과동 1층 705-1호 | 42.14 | 18.43 |
4406 | 강서 | K117061 | 청과동 1층 706-1호 | 21.07 | 9.215 |
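The high-correlation alert between 전용면적 and 공용면적 is easy to see in the last rows: each 공용면적 value is the same fixed fraction of the corresponding 전용면적. The sketch below computes a plain Pearson coefficient on those ten rows. The report does not state which coefficient its matrices use, so Pearson is chosen here purely for illustration, and the English variable names are hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Rows 4397 to 4406 from the table above.
exclusive_area = [21.07, 21.07, 42.14, 21.07, 21.07, 42.14, 42.14, 42.14, 42.14, 21.07]
shared_area = [9.215, 9.215, 18.43, 9.215, 9.215, 18.43, 18.43, 18.43, 18.43, 9.215]

r = pearson(exclusive_area, shared_area)
print(f"r = {r:.3f}")
```

On the full column the relationship is weaker than in this slice because different buildings use different ratios and many 공용면적 values are zero or missing.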