Overview

Dataset statistics

Number of variables33
Number of observations3041
Missing cells39216
Missing cells (%)39.1%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory846.5 KiB
Average record size in memory285.0 B

Variable types

Categorical11
Text5
DateTime2
Unsupported6
Numeric8
Boolean1

Dataset

Description식품판매업(기타) 현황
Author행정안전부
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=UGJX67M0QC8AN6GJW1BU13386287&infSeq=1

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
영업상태구분코드 is highly imbalanced (60.3%)Imbalance
위생업태명 is highly imbalanced (98.9%)Imbalance
남성종사자수 is highly imbalanced (61.9%)Imbalance
여성종사자수 is highly imbalanced (61.9%)Imbalance
본사종업원수 is highly imbalanced (70.1%)Imbalance
공장생산직종업원수 is highly imbalanced (76.1%)Imbalance
보증금액 is highly imbalanced (80.1%)Imbalance
월세금액 is highly imbalanced (80.1%)Imbalance
다중이용업소여부 is highly imbalanced (99.6%)Imbalance
인허가취소일자 has 3041 (100.0%) missing valuesMissing
폐업일자 has 1766 (58.1%) missing valuesMissing
소재지시설전화번호 has 2764 (90.9%) missing valuesMissing
소재지면적정보 has 2677 (88.0%) missing valuesMissing
도로명우편번호 has 2673 (87.9%) missing valuesMissing
소재지도로명주소 has 167 (5.5%) missing valuesMissing
WGS84위도 has 54 (1.8%) missing valuesMissing
WGS84경도 has 54 (1.8%) missing valuesMissing
X좌표값 has 2675 (88.0%) missing valuesMissing
Y좌표값 has 2675 (88.0%) missing valuesMissing
영업장주변구분명 has 3041 (100.0%) missing valuesMissing
등급구분명 has 3041 (100.0%) missing valuesMissing
공장사무직종업원수 has 2732 (89.8%) missing valuesMissing
공장판매직종업원수 has 2730 (89.8%) missing valuesMissing
시설총규모 has 3041 (100.0%) missing valuesMissing
전통업소지정번호 has 3041 (100.0%) missing valuesMissing
전통업소음식 has 3041 (100.0%) missing valuesMissing
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
영업장주변구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
등급구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
시설총규모 is an unsupported type, check if it needs cleaning or further analysisUnsupported
전통업소지정번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
전통업소음식 is an unsupported type, check if it needs cleaning or further analysisUnsupported
공장사무직종업원수 has 298 (9.8%) zerosZeros
공장판매직종업원수 has 294 (9.7%) zerosZeros

Reproduction

Analysis started2023-12-10 21:29:23.041257
Analysis finished2023-12-10 21:29:24.419406
Duration1.38 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

Distinct31
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
수원시
252 
고양시
248 
용인시
242 
남양주시
197 
안산시
 
182
Other values (26)
1920 

Length

Max length4
Median length3
Mean length3.1114765
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
수원시 252
 
8.3%
고양시 248
 
8.2%
용인시 242
 
8.0%
남양주시 197
 
6.5%
안산시 182
 
6.0%
화성시 176
 
5.8%
성남시 135
 
4.4%
평택시 130
 
4.3%
부천시 128
 
4.2%
의정부시 115
 
3.8%
Other values (21) 1236
40.6%

Length

2023-12-11T06:29:24.497123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 252
 
8.3%
고양시 248
 
8.2%
용인시 242
 
8.0%
남양주시 197
 
6.5%
안산시 182
 
6.0%
화성시 176
 
5.8%
성남시 135
 
4.4%
평택시 130
 
4.3%
부천시 128
 
4.2%
의정부시 115
 
3.8%
Other values (21) 1236
40.6%
Distinct2463
Distinct (%)81.0%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
2023-12-11T06:29:24.787883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length23
Mean length8.92634
Min length2

Characters and Unicode

Total characters27145
Distinct characters502
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2153 ?
Unique (%)70.8%

Sample

1st row가평여행마트
2nd row주식회사조은마트
3rd row홈마트
4th row가평군농협하나로마트 설악점
5th row가평군농협하나로마트 청평점
ValueCountFrequency (%)
주)이마트에브리데이 82
 
2.0%
주식회사 68
 
1.7%
노브랜드 51
 
1.3%
하나로마트 49
 
1.2%
롯데쇼핑(주)롯데슈퍼 46
 
1.1%
진로마트 38
 
0.9%
롯데슈퍼 37
 
0.9%
주)지에스리테일 36
 
0.9%
gs수퍼 32
 
0.8%
홈마트 29
 
0.7%
Other values (2518) 3533
88.3%
2023-12-11T06:29:25.224479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1959
 
7.2%
1911
 
7.0%
1148
 
4.2%
1130
 
4.2%
) 1097
 
4.0%
( 1080
 
4.0%
964
 
3.6%
553
 
2.0%
544
 
2.0%
519
 
1.9%
Other values (492) 16240
59.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23380
86.1%
Close Punctuation 1098
 
4.0%
Open Punctuation 1081
 
4.0%
Space Separator 964
 
3.6%
Uppercase Letter 379
 
1.4%
Decimal Number 147
 
0.5%
Lowercase Letter 56
 
0.2%
Dash Punctuation 29
 
0.1%
Other Punctuation 9
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1959
 
8.4%
1911
 
8.2%
1148
 
4.9%
1130
 
4.8%
553
 
2.4%
544
 
2.3%
519
 
2.2%
421
 
1.8%
371
 
1.6%
357
 
1.5%
Other values (436) 14467
61.9%
Uppercase Letter
ValueCountFrequency (%)
S 92
24.3%
G 85
22.4%
K 33
 
8.7%
A 20
 
5.3%
C 18
 
4.7%
M 17
 
4.5%
L 13
 
3.4%
O 13
 
3.4%
N 12
 
3.2%
D 11
 
2.9%
Other values (14) 65
17.2%
Lowercase Letter
ValueCountFrequency (%)
r 8
14.3%
o 7
12.5%
a 7
12.5%
t 7
12.5%
e 6
10.7%
m 6
10.7%
p 6
10.7%
s 3
 
5.4%
k 1
 
1.8%
i 1
 
1.8%
Other values (4) 4
7.1%
Decimal Number
ValueCountFrequency (%)
2 45
30.6%
0 32
21.8%
1 27
18.4%
3 13
 
8.8%
4 9
 
6.1%
5 9
 
6.1%
6 7
 
4.8%
7 3
 
2.0%
8 2
 
1.4%
Close Punctuation
ValueCountFrequency (%)
) 1097
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 1080
99.9%
[ 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
. 8
88.9%
& 1
 
11.1%
Space Separator
ValueCountFrequency (%)
964
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23381
86.1%
Common 3328
 
12.3%
Latin 435
 
1.6%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1959
 
8.4%
1911
 
8.2%
1148
 
4.9%
1130
 
4.8%
553
 
2.4%
544
 
2.3%
519
 
2.2%
421
 
1.8%
371
 
1.6%
357
 
1.5%
Other values (436) 14468
61.9%
Latin
ValueCountFrequency (%)
S 92
21.1%
G 85
19.5%
K 33
 
7.6%
A 20
 
4.6%
C 18
 
4.1%
M 17
 
3.9%
L 13
 
3.0%
O 13
 
3.0%
N 12
 
2.8%
D 11
 
2.5%
Other values (28) 121
27.8%
Common
ValueCountFrequency (%)
) 1097
33.0%
( 1080
32.5%
964
29.0%
2 45
 
1.4%
0 32
 
1.0%
- 29
 
0.9%
1 27
 
0.8%
3 13
 
0.4%
4 9
 
0.3%
5 9
 
0.3%
Other values (7) 23
 
0.7%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23379
86.1%
ASCII 3763
 
13.9%
None 2
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1959
 
8.4%
1911
 
8.2%
1148
 
4.9%
1130
 
4.8%
553
 
2.4%
544
 
2.3%
519
 
2.2%
421
 
1.8%
371
 
1.6%
357
 
1.5%
Other values (435) 14466
61.9%
ASCII
ValueCountFrequency (%)
) 1097
29.2%
( 1080
28.7%
964
25.6%
S 92
 
2.4%
G 85
 
2.3%
2 45
 
1.2%
K 33
 
0.9%
0 32
 
0.9%
- 29
 
0.8%
1 27
 
0.7%
Other values (45) 279
 
7.4%
None
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct2177
Distinct (%)71.6%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
Minimum1986-07-15 00:00:00
Maximum2023-12-01 00:00:00
2023-12-11T06:29:25.359717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:29:25.502488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3041
Missing (%)100.0%
Memory size26.9 KiB

영업상태구분코드
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
<NA>
2665 
1
300 
2
 
76

Length

Max length4
Median length4
Mean length3.6290694
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
<NA> 2665
87.6%
1 300
 
9.9%
2 76
 
2.5%

Length

2023-12-11T06:29:25.652201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:25.773340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2665
87.6%
1 300
 
9.9%
2 76
 
2.5%

영업상태명
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
운영중
1466 
폐업 등
1199 
영업
300 
폐업
 
76

Length

Max length4
Median length3
Mean length3.2706347
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row영업
3rd row영업
4th row영업
5th row영업

Common Values

ValueCountFrequency (%)
운영중 1466
48.2%
폐업 등 1199
39.4%
영업 300
 
9.9%
폐업 76
 
2.5%

Length

2023-12-11T06:29:25.884005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:26.017575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
운영중 1466
34.6%
폐업 1275
30.1%
1199
28.3%
영업 300
 
7.1%

폐업일자
Date

MISSING 

Distinct1010
Distinct (%)79.2%
Missing1766
Missing (%)58.1%
Memory size23.9 KiB
Minimum1996-12-24 00:00:00
Maximum2023-12-05 00:00:00
2023-12-11T06:29:26.356603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:29:26.502100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct272
Distinct (%)98.2%
Missing2764
Missing (%)90.9%
Memory size23.9 KiB
2023-12-11T06:29:26.796404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.458484
Min length7

Characters and Unicode

Total characters3174
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique267 ?
Unique (%)96.4%

Sample

1st row031 582 7900
2nd row031 585 8135
3rd row031 5843346
4th row031 5812390
5th row031 585 7750
ValueCountFrequency (%)
031 223
32.1%
02 20
 
2.9%
032 9
 
1.3%
5601 4
 
0.6%
404 3
 
0.4%
380 3
 
0.4%
634 3
 
0.4%
574 3
 
0.4%
378 3
 
0.4%
925 3
 
0.4%
Other values (385) 421
60.6%
2023-12-11T06:29:27.266904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 478
15.1%
3 439
13.8%
432
13.6%
1 411
12.9%
2 235
7.4%
8 235
7.4%
5 208
6.6%
6 197
6.2%
9 192
6.0%
7 187
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2742
86.4%
Space Separator 432
 
13.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 478
17.4%
3 439
16.0%
1 411
15.0%
2 235
8.6%
8 235
8.6%
5 208
7.6%
6 197
7.2%
9 192
7.0%
7 187
 
6.8%
4 160
 
5.8%
Space Separator
ValueCountFrequency (%)
432
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3174
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 478
15.1%
3 439
13.8%
432
13.6%
1 411
12.9%
2 235
7.4%
8 235
7.4%
5 208
6.6%
6 197
6.2%
9 192
6.0%
7 187
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3174
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 478
15.1%
3 439
13.8%
432
13.6%
1 411
12.9%
2 235
7.4%
8 235
7.4%
5 208
6.6%
6 197
6.2%
9 192
6.0%
7 187
 
5.9%

소재지면적정보
Real number (ℝ)

MISSING 

Distinct355
Distinct (%)97.5%
Missing2677
Missing (%)88.0%
Infinite0
Infinite (%)0.0%
Mean968.37712
Minimum271.7
Maximum12339.8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2023-12-11T06:29:27.446407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum271.7
5-th percentile330.12
Q1480.475
median678.485
Q31002.175
95-th percentile2594.4795
Maximum12339.8
Range12068.1
Interquartile range (IQR)521.7

Descriptive statistics

Standard deviation1131.9616
Coefficient of variation (CV)1.1689265
Kurtosis46.074665
Mean968.37712
Median Absolute Deviation (MAD)253.775
Skewness5.9113116
Sum352489.27
Variance1281337.1
MonotonicityNot monotonic
2023-12-11T06:29:27.589002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
797.8 2
 
0.1%
1320.0 2
 
0.1%
531.6 2
 
0.1%
546.24 2
 
0.1%
815.48 2
 
0.1%
821.22 2
 
0.1%
374.75 2
 
0.1%
312.73 2
 
0.1%
1086.02 2
 
0.1%
2644.0 1
 
< 0.1%
Other values (345) 345
 
11.3%
(Missing) 2677
88.0%
ValueCountFrequency (%)
271.7 1
< 0.1%
289.3 1
< 0.1%
295.48 1
< 0.1%
306.9 1
< 0.1%
308.6 1
< 0.1%
310.2 1
< 0.1%
312.73 2
0.1%
320.44 1
< 0.1%
321.22 1
< 0.1%
322.2 1
< 0.1%
ValueCountFrequency (%)
12339.8 1
< 0.1%
9717.0 1
< 0.1%
9294.78 1
< 0.1%
5630.0 1
< 0.1%
5025.03 1
< 0.1%
4855.64 1
< 0.1%
3918.23 1
< 0.1%
3913.0 1
< 0.1%
3794.0 1
< 0.1%
3492.84 1
< 0.1%

도로명우편번호
Real number (ℝ)

MISSING 

Distinct314
Distinct (%)85.3%
Missing2673
Missing (%)87.9%
Infinite0
Infinite (%)0.0%
Mean14247.367
Minimum10019
Maximum18608
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2023-12-11T06:29:27.739074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10019
5-th percentile10388.05
Q111746.5
median14061
Q316826.75
95-th percentile18316
Maximum18608
Range8589
Interquartile range (IQR)5080.25

Descriptive statistics

Standard deviation2684.7911
Coefficient of variation (CV)0.18844121
Kurtosis-1.4478835
Mean14247.367
Median Absolute Deviation (MAD)2546.5
Skewness0.08046095
Sum5243031
Variance7208103.2
MonotonicityNot monotonic
2023-12-11T06:29:27.875260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11117 4
 
0.1%
11486 3
 
0.1%
10551 3
 
0.1%
14327 3
 
0.1%
10500 3
 
0.1%
10366 3
 
0.1%
11477 3
 
0.1%
12084 2
 
0.1%
16688 2
 
0.1%
16484 2
 
0.1%
Other values (304) 340
 
11.2%
(Missing) 2673
87.9%
ValueCountFrequency (%)
10019 1
< 0.1%
10040 2
0.1%
10077 1
< 0.1%
10078 1
< 0.1%
10101 1
< 0.1%
10208 1
< 0.1%
10239 1
< 0.1%
10241 1
< 0.1%
10270 1
< 0.1%
10293 1
< 0.1%
ValueCountFrequency (%)
18608 1
< 0.1%
18527 1
< 0.1%
18517 1
< 0.1%
18501 1
< 0.1%
18497 1
< 0.1%
18490 2
0.1%
18476 2
0.1%
18473 1
< 0.1%
18472 1
< 0.1%
18453 1
< 0.1%
Distinct2487
Distinct (%)86.5%
Missing167
Missing (%)5.5%
Memory size23.9 KiB
2023-12-11T06:29:28.182810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length88
Median length63
Mean length28.206333
Min length13

Characters and Unicode

Total characters81065
Distinct characters511
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2140 ?
Unique (%)74.5%

Sample

1st row경기도 가평군 가평읍 가화로 26-6, A동, 1층
2nd row경기도 가평군 가평읍 가화로 42, A,B동
3rd row경기도 가평군 설악면 유명로 1608, 1층
4th row경기도 가평군 설악면 신천중앙로 112
5th row경기도 가평군 청평면 구청평로 88
ValueCountFrequency (%)
경기도 2874
 
16.4%
1층 569
 
3.3%
수원시 245
 
1.4%
고양시 241
 
1.4%
용인시 228
 
1.3%
남양주시 194
 
1.1%
화성시 174
 
1.0%
지하1층 147
 
0.8%
안산시 146
 
0.8%
성남시 132
 
0.8%
Other values (3676) 12544
71.7%
2023-12-11T06:29:28.804745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14627
 
18.0%
1 3987
 
4.9%
3022
 
3.7%
3012
 
3.7%
3002
 
3.7%
2974
 
3.7%
2712
 
3.3%
2376
 
2.9%
, 1985
 
2.4%
( 1982
 
2.4%
Other values (501) 41386
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46895
57.8%
Space Separator 14627
 
18.0%
Decimal Number 12754
 
15.7%
Other Punctuation 2005
 
2.5%
Open Punctuation 1982
 
2.4%
Close Punctuation 1982
 
2.4%
Dash Punctuation 366
 
0.5%
Uppercase Letter 308
 
0.4%
Math Symbol 128
 
0.2%
Lowercase Letter 16
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3022
 
6.4%
3012
 
6.4%
3002
 
6.4%
2974
 
6.3%
2712
 
5.8%
2376
 
5.1%
1199
 
2.6%
1135
 
2.4%
984
 
2.1%
919
 
2.0%
Other values (445) 25560
54.5%
Uppercase Letter
ValueCountFrequency (%)
B 181
58.8%
A 29
 
9.4%
C 15
 
4.9%
S 11
 
3.6%
P 8
 
2.6%
M 8
 
2.6%
H 7
 
2.3%
K 6
 
1.9%
T 5
 
1.6%
E 5
 
1.6%
Other values (13) 33
 
10.7%
Decimal Number
ValueCountFrequency (%)
1 3987
31.3%
2 1670
13.1%
3 1173
 
9.2%
0 1152
 
9.0%
4 943
 
7.4%
5 930
 
7.3%
6 793
 
6.2%
7 777
 
6.1%
8 687
 
5.4%
9 642
 
5.0%
Lowercase Letter
ValueCountFrequency (%)
e 5
31.2%
a 4
25.0%
h 1
 
6.2%
c 1
 
6.2%
z 1
 
6.2%
l 1
 
6.2%
b 1
 
6.2%
t 1
 
6.2%
s 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 1985
99.0%
. 15
 
0.7%
/ 2
 
0.1%
@ 2
 
0.1%
& 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 126
98.4%
+ 1
 
0.8%
1
 
0.8%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
14627
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1982
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1982
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 366
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46895
57.8%
Common 33844
41.7%
Latin 326
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3022
 
6.4%
3012
 
6.4%
3002
 
6.4%
2974
 
6.3%
2712
 
5.8%
2376
 
5.1%
1199
 
2.6%
1135
 
2.4%
984
 
2.1%
919
 
2.0%
Other values (445) 25560
54.5%
Latin
ValueCountFrequency (%)
B 181
55.5%
A 29
 
8.9%
C 15
 
4.6%
S 11
 
3.4%
P 8
 
2.5%
M 8
 
2.5%
H 7
 
2.1%
K 6
 
1.8%
T 5
 
1.5%
E 5
 
1.5%
Other values (24) 51
 
15.6%
Common
ValueCountFrequency (%)
14627
43.2%
1 3987
 
11.8%
, 1985
 
5.9%
( 1982
 
5.9%
) 1982
 
5.9%
2 1670
 
4.9%
3 1173
 
3.5%
0 1152
 
3.4%
4 943
 
2.8%
5 930
 
2.7%
Other values (12) 3413
 
10.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46895
57.8%
ASCII 34167
42.1%
Number Forms 2
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14627
42.8%
1 3987
 
11.7%
, 1985
 
5.8%
( 1982
 
5.8%
) 1982
 
5.8%
2 1670
 
4.9%
3 1173
 
3.4%
0 1152
 
3.4%
4 943
 
2.8%
5 930
 
2.7%
Other values (43) 3736
 
10.9%
Hangul
ValueCountFrequency (%)
3022
 
6.4%
3012
 
6.4%
3002
 
6.4%
2974
 
6.3%
2712
 
5.8%
2376
 
5.1%
1199
 
2.6%
1135
 
2.4%
984
 
2.1%
919
 
2.0%
Other values (445) 25560
54.5%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Distinct2873
Distinct (%)94.5%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
2023-12-11T06:29:29.143133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length120
Median length63
Mean length26.852351
Min length15

Characters and Unicode

Total characters81658
Distinct characters469
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2728 ?
Unique (%)89.7%

Sample

1st row경기도 가평군 가평읍 대곡리 202-3 A동, 1층
2nd row경기도 가평군 가평읍 대곡리 221 A,B동
3rd row경기도 가평군 설악면 신천리 665-2
4th row경기도 가평군 설악면 신천리 432-4 외 1필지
5th row경기도 가평군 청평면 청평리 619-2 외2필지
ValueCountFrequency (%)
경기도 3041
 
17.7%
1층 426
 
2.5%
수원시 252
 
1.5%
고양시 248
 
1.4%
용인시 242
 
1.4%
남양주시 197
 
1.1%
안산시 182
 
1.1%
화성시 176
 
1.0%
지하1층 165
 
1.0%
성남시 135
 
0.8%
Other values (4155) 12074
70.5%
2023-12-11T06:29:29.724131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14465
 
17.7%
1 4514
 
5.5%
3758
 
4.6%
3169
 
3.9%
3142
 
3.8%
3114
 
3.8%
3063
 
3.8%
2868
 
3.5%
2680
 
3.3%
- 2264
 
2.8%
Other values (459) 38621
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47635
58.3%
Decimal Number 15902
 
19.5%
Space Separator 14465
 
17.7%
Dash Punctuation 2264
 
2.8%
Other Punctuation 511
 
0.6%
Uppercase Letter 313
 
0.4%
Close Punctuation 224
 
0.3%
Open Punctuation 222
 
0.3%
Math Symbol 109
 
0.1%
Lowercase Letter 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3758
 
7.9%
3169
 
6.7%
3142
 
6.6%
3114
 
6.5%
3063
 
6.4%
2868
 
6.0%
2680
 
5.6%
1264
 
2.7%
1004
 
2.1%
995
 
2.1%
Other values (409) 22578
47.4%
Uppercase Letter
ValueCountFrequency (%)
B 163
52.1%
A 39
 
12.5%
C 16
 
5.1%
S 12
 
3.8%
P 9
 
2.9%
H 8
 
2.6%
M 8
 
2.6%
E 7
 
2.2%
K 6
 
1.9%
L 6
 
1.9%
Other values (12) 39
 
12.5%
Decimal Number
ValueCountFrequency (%)
1 4514
28.4%
2 1862
11.7%
3 1518
 
9.5%
0 1401
 
8.8%
4 1286
 
8.1%
5 1240
 
7.8%
6 1146
 
7.2%
7 1127
 
7.1%
8 979
 
6.2%
9 829
 
5.2%
Lowercase Letter
ValueCountFrequency (%)
e 6
46.2%
a 2
 
15.4%
h 1
 
7.7%
c 1
 
7.7%
y 1
 
7.7%
b 1
 
7.7%
s 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 483
94.5%
. 22
 
4.3%
@ 3
 
0.6%
/ 2
 
0.4%
& 1
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 108
99.1%
+ 1
 
0.9%
Space Separator
ValueCountFrequency (%)
14465
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2264
100.0%
Close Punctuation
ValueCountFrequency (%)
) 224
100.0%
Open Punctuation
ValueCountFrequency (%)
( 222
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47635
58.3%
Common 33697
41.3%
Latin 326
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3758
 
7.9%
3169
 
6.7%
3142
 
6.6%
3114
 
6.5%
3063
 
6.4%
2868
 
6.0%
2680
 
5.6%
1264
 
2.7%
1004
 
2.1%
995
 
2.1%
Other values (409) 22578
47.4%
Latin
ValueCountFrequency (%)
B 163
50.0%
A 39
 
12.0%
C 16
 
4.9%
S 12
 
3.7%
P 9
 
2.8%
H 8
 
2.5%
M 8
 
2.5%
E 7
 
2.1%
K 6
 
1.8%
e 6
 
1.8%
Other values (19) 52
 
16.0%
Common
ValueCountFrequency (%)
14465
42.9%
1 4514
 
13.4%
- 2264
 
6.7%
2 1862
 
5.5%
3 1518
 
4.5%
0 1401
 
4.2%
4 1286
 
3.8%
5 1240
 
3.7%
6 1146
 
3.4%
7 1127
 
3.3%
Other values (11) 2874
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47635
58.3%
ASCII 34023
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
14465
42.5%
1 4514
 
13.3%
- 2264
 
6.7%
2 1862
 
5.5%
3 1518
 
4.5%
0 1401
 
4.1%
4 1286
 
3.8%
5 1240
 
3.6%
6 1146
 
3.4%
7 1127
 
3.3%
Other values (40) 3200
 
9.4%
Hangul
ValueCountFrequency (%)
3758
 
7.9%
3169
 
6.7%
3142
 
6.6%
3114
 
6.5%
3063
 
6.4%
2868
 
6.0%
2680
 
5.6%
1264
 
2.7%
1004
 
2.1%
995
 
2.1%
Other values (409) 22578
47.4%
Distinct1291
Distinct (%)42.5%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
2023-12-11T06:29:30.121432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length6.0601776
Min length5

Characters and Unicode

Total characters18429
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique627 ?
Unique (%)20.6%

Sample

1st row477-804
2nd row477-804
3rd row477-853
4th row477-853
5th row477-813
ValueCountFrequency (%)
445160 17
 
0.6%
472901 16
 
0.5%
482060 15
 
0.5%
425831 15
 
0.5%
445360 14
 
0.5%
425140 13
 
0.4%
415809 12
 
0.4%
476802 11
 
0.4%
445320 11
 
0.4%
415060 11
 
0.4%
Other values (1281) 2906
95.6%
2023-12-11T06:29:30.646653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 4513
24.5%
8 2485
13.5%
1 2050
11.1%
0 2008
10.9%
2 1648
 
8.9%
3 1320
 
7.2%
5 1255
 
6.8%
6 1137
 
6.2%
7 955
 
5.2%
9 707
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18078
98.1%
Dash Punctuation 351
 
1.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 4513
25.0%
8 2485
13.7%
1 2050
11.3%
0 2008
11.1%
2 1648
 
9.1%
3 1320
 
7.3%
5 1255
 
6.9%
6 1137
 
6.3%
7 955
 
5.3%
9 707
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 351
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 18429
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 4513
24.5%
8 2485
13.5%
1 2050
11.1%
0 2008
10.9%
2 1648
 
8.9%
3 1320
 
7.2%
5 1255
 
6.8%
6 1137
 
6.2%
7 955
 
5.2%
9 707
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18429
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 4513
24.5%
8 2485
13.5%
1 2050
11.1%
0 2008
10.9%
2 1648
 
8.9%
3 1320
 
7.2%
5 1255
 
6.8%
6 1137
 
6.2%
7 955
 
5.2%
9 707
 
3.8%

WGS84위도
Real number (ℝ)

MISSING 

Distinct2195
Distinct (%)73.5%
Missing54
Missing (%)1.8%
Infinite0
Infinite (%)0.0%
Mean37.446598
Minimum36.959509
Maximum38.156364
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2023-12-11T06:29:30.819378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.959509
5-th percentile37.051558
Q137.28427
median37.394678
Q337.650968
95-th percentile37.828363
Maximum38.156364
Range1.1968549
Interquartile range (IQR)0.3666981

Descriptive statistics

Standard deviation0.23228708
Coefficient of variation (CV)0.0062031558
Kurtosis-0.69999123
Mean37.446598
Median Absolute Deviation (MAD)0.16318867
Skewness0.13311717
Sum111852.99
Variance0.053957288
MonotonicityNot monotonic
2023-12-11T06:29:31.021612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.6465071802 6
 
0.2%
37.2265476022 5
 
0.2%
37.6814913143 5
 
0.2%
37.5055031307 5
 
0.2%
37.5419070721 5
 
0.2%
37.1910783176 5
 
0.2%
37.5277081455 4
 
0.1%
37.661567383 4
 
0.1%
37.5877020631 4
 
0.1%
37.6470962619 4
 
0.1%
Other values (2185) 2940
96.7%
(Missing) 54
 
1.8%
ValueCountFrequency (%)
36.959508969 2
0.1%
36.9627169365 3
0.1%
36.9635550603 2
0.1%
36.9636615555 3
0.1%
36.9638839766 1
 
< 0.1%
36.9642058043 1
 
< 0.1%
36.9643104371 2
0.1%
36.9763331411 1
 
< 0.1%
36.9788993771 1
 
< 0.1%
36.9789601868 1
 
< 0.1%
ValueCountFrequency (%)
38.1563638656 1
< 0.1%
38.0904627858 1
< 0.1%
38.0901656506 1
< 0.1%
38.0898143955 1
< 0.1%
38.0585915159 1
< 0.1%
38.0366703182 1
< 0.1%
38.027602793 1
< 0.1%
38.025153012 2
0.1%
38.0248475787 1
< 0.1%
38.0235102993 1
< 0.1%

WGS84경도
Real number (ℝ)

MISSING 

Distinct2195
Distinct (%)73.5%
Missing54
Missing (%)1.8%
Infinite0
Infinite (%)0.0%
Mean127.01933
Minimum126.55495
Maximum127.7562
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2023-12-11T06:29:31.214269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.55495
5-th percentile126.73473
Q1126.83869
median127.03739
Q3127.13855
95-th percentile127.42206
Maximum127.7562
Range1.2012591
Interquartile range (IQR)0.29986536

Descriptive statistics

Standard deviation0.20481391
Coefficient of variation (CV)0.0016124626
Kurtosis0.29794756
Mean127.01933
Median Absolute Deviation (MAD)0.14635063
Skewness0.52958251
Sum379406.73
Variance0.041948739
MonotonicityNot monotonic
2023-12-11T06:29:31.409933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.6833852938 6
 
0.2%
126.9721330041 5
 
0.2%
126.7808951307 5
 
0.2%
126.7819304398 5
 
0.2%
127.2032174183 5
 
0.2%
127.2063762566 5
 
0.2%
126.8192802531 4
 
0.1%
126.7441037261 4
 
0.1%
127.2130258444 4
 
0.1%
126.8948092677 4
 
0.1%
Other values (2185) 2940
96.7%
(Missing) 54
 
1.8%
ValueCountFrequency (%)
126.5549454228 1
< 0.1%
126.5567385684 1
< 0.1%
126.5704334044 1
< 0.1%
126.5812320799 1
< 0.1%
126.5830135218 2
0.1%
126.584324258 1
< 0.1%
126.5856269206 1
< 0.1%
126.5870054212 1
< 0.1%
126.5971495947 1
< 0.1%
126.5971662321 1
< 0.1%
ValueCountFrequency (%)
127.7562044742 1
< 0.1%
127.7555835694 1
< 0.1%
127.7104484814 1
< 0.1%
127.6812089874 1
< 0.1%
127.6615499846 1
< 0.1%
127.6459893506 1
< 0.1%
127.643769209 1
< 0.1%
127.6417304677 1
< 0.1%
127.6403894965 1
< 0.1%
127.6402566316 1
< 0.1%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
<NA>
2665 
기타식품판매업
376 

Length

Max length7
Median length4
Mean length4.3709306
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타식품판매업
2nd row기타식품판매업
3rd row기타식품판매업
4th row기타식품판매업
5th row기타식품판매업

Common Values

ValueCountFrequency (%)
<NA> 2665
87.6%
기타식품판매업 376
 
12.4%

Length

2023-12-11T06:29:31.561481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:31.674235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2665
87.6%
기타식품판매업 376
 
12.4%

X좌표값
Real number (ℝ)

MISSING 

Distinct342
Distinct (%)93.4%
Missing2675
Missing (%)88.0%
Infinite0
Infinite (%)0.0%
Mean205075.14
Minimum163131.41
Maximum266817.76
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2023-12-11T06:29:31.796946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum163131.41
5-th percentile178526.78
Q1191430.32
median204888.69
Q3213784.03
95-th percentile242979.85
Maximum266817.76
Range103686.34
Interquartile range (IQR)22353.711

Descriptive statistics

Standard deviation18382.311
Coefficient of variation (CV)0.089636952
Kurtosis0.43985666
Mean205075.14
Median Absolute Deviation (MAD)10727.479
Skewness0.53771082
Sum75057502
Variance3.3790935 × 108
MonotonicityNot monotonic
2023-12-11T06:29:31.972220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
185250.616718986 2
 
0.1%
189632.695594889 2
 
0.1%
228838.105478233 2
 
0.1%
194483.06165026 2
 
0.1%
163131.413670876 2
 
0.1%
213800.3432084 2
 
0.1%
216667.476216735 2
 
0.1%
175737.739505791 2
 
0.1%
183603.099389364 2
 
0.1%
186454.86377403 2
 
0.1%
Other values (332) 346
 
11.4%
(Missing) 2675
88.0%
ValueCountFrequency (%)
163131.413670876 2
0.1%
164518.967014235 1
< 0.1%
171746.8055645 1
< 0.1%
171813.68118138 1
< 0.1%
171970.945357311 1
< 0.1%
172220.710118868 1
< 0.1%
173872.74137809 1
< 0.1%
175417.656719379 1
< 0.1%
175502.379214822 1
< 0.1%
175737.739505791 2
0.1%
ValueCountFrequency (%)
266817.756499423 1
< 0.1%
256308.818539595 2
0.1%
256199.409581762 1
< 0.1%
255869.351046222 1
< 0.1%
251950.009768959 1
< 0.1%
251231.434806888 1
< 0.1%
248422.699531906 1
< 0.1%
248230.553929964 1
< 0.1%
247417.137602275 1
< 0.1%
245382.458306151 1
< 0.1%

Y좌표값
Real number (ℝ)

MISSING 

Distinct342
Distinct (%)93.4%
Missing2675
Missing (%)88.0%
Infinite0
Infinite (%)0.0%
Mean439786.44
Minimum384214.03
Maximum509707.6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2023-12-11T06:29:32.111484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum384214.03
5-th percentile389732.33
Q1419904.67
median434757.98
Q3462763.74
95-th percentile480880.31
Maximum509707.6
Range125493.56
Interquartile range (IQR)42859.064

Descriptive statistics

Standard deviation27500.713
Coefficient of variation (CV)0.062531972
Kurtosis-0.89781088
Mean439786.44
Median Absolute Deviation (MAD)21870.623
Skewness0.076816321
Sum1.6096184 × 108
Variance7.5628924 × 108
MonotonicityNot monotonic
2023-12-11T06:29:32.296782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
459091.442294794 2
 
0.1%
437032.2561501 2
 
0.1%
428431.22290219 2
 
0.1%
427000.44916979 2
 
0.1%
460934.886475638 2
 
0.1%
456185.774822758 2
 
0.1%
469094.610885102 2
 
0.1%
430574.649620404 2
 
0.1%
424201.372984139 2
 
0.1%
420215.054655998 2
 
0.1%
Other values (332) 346
 
11.4%
(Missing) 2675
88.0%
ValueCountFrequency (%)
384214.034353206 1
< 0.1%
384575.637814855 1
< 0.1%
386097.672886033 1
< 0.1%
387025.979345035 1
< 0.1%
387674.877266816 1
< 0.1%
387798.75161755 1
< 0.1%
388325.881328789 1
< 0.1%
388559.044134104 1
< 0.1%
388576.191243011 1
< 0.1%
388621.580123975 1
< 0.1%
ValueCountFrequency (%)
509707.595151058 1
< 0.1%
495346.139228329 1
< 0.1%
495167.132130244 1
< 0.1%
494724.537320769 2
0.1%
489231.382036444 1
< 0.1%
488583.056118126 1
< 0.1%
488322.116635734 1
< 0.1%
488245.277180172 1
< 0.1%
487975.621391369 1
< 0.1%
487569.737354782 1
< 0.1%

위생업태명
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
기타식품판매업
3038 
<NA>
 
3

Length

Max length7
Median length7
Mean length6.9970404
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타식품판매업
2nd row기타식품판매업
3rd row기타식품판매업
4th row기타식품판매업
5th row기타식품판매업

Common Values

ValueCountFrequency (%)
기타식품판매업 3038
99.9%
<NA> 3
 
0.1%

Length

2023-12-11T06:29:32.445579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:32.561910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타식품판매업 3038
99.9%
na 3
 
0.1%

남성종사자수
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
<NA>
2816 
0
 
225

Length

Max length4
Median length4
Mean length3.7780335
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
<NA> 2816
92.6%
0 225
 
7.4%

Length

2023-12-11T06:29:32.681714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:32.825435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2816
92.6%
0 225
 
7.4%

여성종사자수
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
<NA>
2816 
0
 
225

Length

Max length4
Median length4
Mean length3.7780335
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
<NA> 2816
92.6%
0 225
 
7.4%

Length

2023-12-11T06:29:32.961521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:33.076435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2816
92.6%
0 225
 
7.4%

영업장주변구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3041
Missing (%)100.0%
Memory size26.9 KiB

등급구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3041
Missing (%)100.0%
Memory size26.9 KiB

본사종업원수
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
<NA>
2735 
0
305 
13
 
1

Length

Max length4
Median length4
Mean length3.6984545
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
<NA> 2735
89.9%
0 305
 
10.0%
13 1
 
< 0.1%

Length

2023-12-11T06:29:33.212467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:33.337204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2735
89.9%
0 305
 
10.0%
13 1
 
< 0.1%

공장사무직종업원수
Real number (ℝ)

MISSING  ZEROS 

Distinct6
Distinct (%)1.9%
Missing2732
Missing (%)89.8%
Infinite0
Infinite (%)0.0%
Mean0.11650485
Minimum0
Maximum13
Zeros298
Zeros (%)9.8%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2023-12-11T06:29:33.455236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum13
Range13
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.86378401
Coefficient of variation (CV)7.4141461
Kurtosis165.39834
Mean0.11650485
Median Absolute Deviation (MAD)0
Skewness11.850812
Sum36
Variance0.74612281
MonotonicityNot monotonic
2023-12-11T06:29:33.583333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 298
 
9.8%
2 5
 
0.2%
1 2
 
0.1%
3 2
 
0.1%
5 1
 
< 0.1%
13 1
 
< 0.1%
(Missing) 2732
89.8%
ValueCountFrequency (%)
0 298
9.8%
1 2
 
0.1%
2 5
 
0.2%
3 2
 
0.1%
5 1
 
< 0.1%
13 1
 
< 0.1%
ValueCountFrequency (%)
13 1
 
< 0.1%
5 1
 
< 0.1%
3 2
 
0.1%
2 5
 
0.2%
1 2
 
0.1%
0 298
9.8%

공장판매직종업원수
Real number (ℝ)

MISSING  ZEROS 

Distinct13
Distinct (%)4.2%
Missing2730
Missing (%)89.8%
Infinite0
Infinite (%)0.0%
Mean0.69453376
Minimum0
Maximum42
Zeros294
Zeros (%)9.7%
Negative0
Negative (%)0.0%
Memory size26.9 KiB
2023-12-11T06:29:33.719237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2.5
Maximum42
Range42
Interquartile range (IQR)0

Descriptive statistics

Standard deviation4.1790494
Coefficient of variation (CV)6.0170572
Kurtosis70.836384
Mean0.69453376
Median Absolute Deviation (MAD)0
Skewness8.1063573
Sum216
Variance17.464454
MonotonicityNot monotonic
2023-12-11T06:29:34.116313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
0 294
 
9.7%
3 3
 
0.1%
10 2
 
0.1%
8 2
 
0.1%
5 2
 
0.1%
20 1
 
< 0.1%
6 1
 
< 0.1%
35 1
 
< 0.1%
9 1
 
< 0.1%
42 1
 
< 0.1%
Other values (3) 3
 
0.1%
(Missing) 2730
89.8%
ValueCountFrequency (%)
0 294
9.7%
2 1
 
< 0.1%
3 3
 
0.1%
5 2
 
0.1%
6 1
 
< 0.1%
7 1
 
< 0.1%
8 2
 
0.1%
9 1
 
< 0.1%
10 2
 
0.1%
20 1
 
< 0.1%
ValueCountFrequency (%)
42 1
< 0.1%
40 1
< 0.1%
35 1
< 0.1%
20 1
< 0.1%
10 2
0.1%
9 1
< 0.1%
8 2
0.1%
7 1
< 0.1%
6 1
< 0.1%
5 2
0.1%

공장생산직종업원수
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
<NA>
2734 
0
305 
3
 
1
4
 
1

Length

Max length4
Median length4
Mean length3.6971391
Min length1

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row0
2nd row0
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
<NA> 2734
89.9%
0 305
 
10.0%
3 1
 
< 0.1%
4 1
 
< 0.1%

Length

2023-12-11T06:29:34.304957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:34.422023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2734
89.9%
0 305
 
10.0%
3 1
 
< 0.1%
4 1
 
< 0.1%

보증금액
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
<NA>
2807 
0
 
232
600000000
 
1
60000000
 
1

Length

Max length9
Median length4
Mean length3.7740875
Min length1

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row0
2nd row<NA>
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
<NA> 2807
92.3%
0 232
 
7.6%
600000000 1
 
< 0.1%
60000000 1
 
< 0.1%

Length

2023-12-11T06:29:34.552819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:34.674276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2807
92.3%
0 232
 
7.6%
600000000 1
 
< 0.1%
60000000 1
 
< 0.1%

월세금액
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size23.9 KiB
<NA>
2807 
0
 
232
20000000
 
1
5500000
 
1

Length

Max length8
Median length4
Mean length3.7734298
Min length1

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row0
2nd row<NA>
3rd row0
4th row<NA>
5th row0

Common Values

ValueCountFrequency (%)
<NA> 2807
92.3%
0 232
 
7.6%
20000000 1
 
< 0.1%
5500000 1
 
< 0.1%

Length

2023-12-11T06:29:34.804091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:29:34.946620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2807
92.3%
0 232
 
7.6%
20000000 1
 
< 0.1%
5500000 1
 
< 0.1%

다중이용업소여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing3
Missing (%)0.1%
Memory size6.1 KiB
False
3037 
True
 
1
(Missing)
 
3
ValueCountFrequency (%)
False 3037
99.9%
True 1
 
< 0.1%
(Missing) 3
 
0.1%
2023-12-11T06:29:35.035342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

시설총규모
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3041
Missing (%)100.0%
Memory size26.9 KiB

전통업소지정번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3041
Missing (%)100.0%
Memory size26.9 KiB

전통업소음식
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3041
Missing (%)100.0%
Memory size26.9 KiB

Sample

시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값위생업태명남성종사자수여성종사자수영업장주변구분명등급구분명본사종업원수공장사무직종업원수공장판매직종업원수공장생산직종업원수보증금액월세금액다중이용업소여부시설총규모전통업소지정번호전통업소음식
0가평군가평여행마트2023-08-31<NA>1영업<NA><NA>325.7312421경기도 가평군 가평읍 가화로 26-6, A동, 1층경기도 가평군 가평읍 대곡리 202-3 A동, 1층477-80437.821757127.516271기타식품판매업245370.744036480027.5913기타식품판매업00<NA><NA>000000N<NA><NA><NA>
1가평군주식회사조은마트2014-08-07<NA>1영업<NA>031 582 79001008.712419경기도 가평군 가평읍 가화로 42, A,B동경기도 가평군 가평읍 대곡리 221 A,B동477-80437.823965127.516317기타식품판매업245382.458306480282.31208기타식품판매업<NA><NA><NA><NA>0000<NA><NA>N<NA><NA><NA>
2가평군홈마트2023-09-12<NA>1영업<NA><NA>810.912467경기도 가평군 설악면 유명로 1608, 1층경기도 가평군 설악면 신천리 665-2477-85337.673209127.487481기타식품판매업242901.331182463542.72268기타식품판매업00<NA><NA>000000N<NA><NA><NA>
3가평군가평군농협하나로마트 설악점2008-08-27<NA>1영업<NA>031 585 8135568.6212465경기도 가평군 설악면 신천중앙로 112경기도 가평군 설악면 신천리 432-4 외 1필지477-85337.677762127.491619기타식품판매업243287.051486464032.393013기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
4가평군가평군농협하나로마트 청평점2006-05-15<NA>1영업<NA>031 58433461129.512453경기도 가평군 청평면 구청평로 88경기도 가평군 청평면 청평리 619-2 외2필지477-81337.735074127.415256기타식품판매업236536.915896470364.566268기타식품판매업00<NA><NA>000000N<NA><NA><NA>
5가평군가평군농협하나로마트2000-02-16<NA>1영업<NA>031 58123901105.6712419경기도 가평군 가평읍 가화로 120경기도 가평군 가평읍 읍내리 472 외3필지477-80137.830176127.514182기타식품판매업245168.059874480991.243378기타식품판매업<NA><NA><NA><NA>0000<NA><NA>N<NA><NA><NA>
6가평군가평군농협하나로마트 조종점2008-07-04<NA>1영업<NA>031 585 7750746.212438경기도 가평군 조종면 조종희망로 4경기도 가평군 조종면 현리 410-11243837.817785127.348831기타식품판매업230648.078253479520.634086기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
7가평군가평군농협 하나로마트 자라섬점2014-03-24<NA>1영업<NA>031 5829720573.9812422경기도 가평군 가평읍 호반로 2562, 1,3층경기도 가평군 가평읍 달전리 452-1 1,3층477-80437.814429127.515099기타식품판매업245286.319396479221.184496기타식품판매업<NA><NA><NA><NA>0000<NA><NA>N<NA><NA><NA>
8가평군가평군농협하나로마트 북면점2006-04-26<NA>1영업<NA>031 5822590379.1812403경기도 가평군 북면 가화로 992경기도 가평군 북면 목동리 820-1477-84237.883884127.549241기타식품판매업248230.55393486955.045296기타식품판매업<NA><NA><NA><NA>0000<NA><NA>N<NA><NA><NA>
9가평군가평군농협하나로마트 청평점20060515<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 청평면 구청평로 88경기도 가평군 청평면 청평리 619-2번지 외2필지47781337.735074127.415256<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값위생업태명남성종사자수여성종사자수영업장주변구분명등급구분명본사종업원수공장사무직종업원수공장판매직종업원수공장생산직종업원수보증금액월세금액다중이용업소여부시설총규모전통업소지정번호전통업소음식
3031화성시진로마트20121112<NA><NA>폐업 등20151030<NA><NA><NA>경기도 화성시 봉담읍 동화새터길 62 (외1필지)경기도 화성시 봉담읍 동화리 438-9번지 외1필지44589337.220078126.956748<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3032화성시매송마트20040911<NA><NA>폐업 등20060612<NA><NA><NA>경기도 화성시 매송면 화성로 2385경기도 화성시 매송면 원평리 62번지44583237.24943126.926577<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3033화성시엘마트20011214<NA><NA>폐업 등20180129<NA><NA><NA>경기도 화성시 효행로 995 (진안동)경기도 화성시 진안동 507-2번지44539037.212106127.036485<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3034화성시럭키마트19981210<NA><NA>폐업 등20170124<NA><NA><NA>경기도 화성시 정남면 만년로 575경기도 화성시 정남면 괘랑리 918-1번지44596537.172594126.983232<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3035화성시홈마트20050128<NA><NA>폐업 등20060626<NA><NA><NA>경기도 화성시 효행로 237경기도 화성시 기안동 335-8번지44531037.226548126.972133<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3036화성시롯데쇼핑(주) 롯데슈퍼 화성점20051004<NA><NA>폐업 등20160415<NA><NA><NA>경기도 화성시 효행로 287 (기안동,외 4필지(보보스프라자 1층))경기도 화성시 기안동 371-2번지 외 4필지(보보스프라자 1층)44531037.222736126.974346<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3037화성시쿨 마 트20060711<NA><NA>폐업 등20130313<NA><NA><NA>경기도 화성시 효행로265번길 10경기도 화성시 기안동 355-8번지44531037.224498126.973353<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3038화성시농민할인마트20140627<NA><NA>폐업 등20150529<NA><NA><NA>경기도 화성시 병점2로 22 (병점동, 외 3필지 2층 전부)경기도 화성시 병점동 432번지 외 3필지 2층 전부44536037.205691127.039443<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3039화성시제로마트20061116<NA><NA>폐업 등20160816<NA><NA><NA>경기도 화성시 우정읍 조암서로22번길 22경기도 화성시 우정읍 조암리 353번지44595537.083111126.817583<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3040화성시700마켓 진안점20061213<NA><NA>폐업 등20071120<NA><NA><NA>경기도 화성시 병점중앙로 185경기도 화성시 진안동 431-5번지44539037.213835127.036114<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>

Duplicate rows

Most frequently occurring

시군명사업장명인허가일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값위생업태명남성종사자수여성종사자수본사종업원수공장사무직종업원수공장판매직종업원수공장생산직종업원수보증금액월세금액다중이용업소여부# duplicates
0화성시쿨마트20060711<NA>폐업 등20130313<NA><NA><NA>경기도 화성시 효행로 237 (기안동)경기도 화성시 기안동 335-8번지44531037.226548126.972133<NA><NA><NA>기타식품판매업<NA><NA><NA><NA><NA><NA><NA><NA>N2