Overview

Dataset statistics

Number of variables33
Number of observations7421
Missing cells90987
Missing cells (%)37.2%
Duplicate rows2
Duplicate rows (%)< 0.1%
Total size in memory2.0 MiB
Average record size in memory285.0 B

Variable types

Categorical11
Text5
DateTime2
Unsupported6
Numeric8
Boolean1

Alerts

Dataset has 2 (< 0.1%) duplicate rowsDuplicates
위생업태명 is highly imbalanced (91.5%)Imbalance
본사종업원수 is highly imbalanced (52.3%)Imbalance
공장사무직종업원수 is highly imbalanced (69.6%)Imbalance
공장생산직종업원수 is highly imbalanced (70.2%)Imbalance
보증금액 is highly imbalanced (71.1%)Imbalance
다중이용업소여부 is highly imbalanced (99.8%)Imbalance
인허가취소일자 has 7421 (100.0%) missing valuesMissing
폐업일자 has 4652 (62.7%) missing valuesMissing
소재지시설전화번호 has 6720 (90.6%) missing valuesMissing
소재지면적정보 has 5835 (78.6%) missing valuesMissing
도로명우편번호 has 5676 (76.5%) missing valuesMissing
소재지도로명주소 has 235 (3.2%) missing valuesMissing
X좌표값 has 5720 (77.1%) missing valuesMissing
Y좌표값 has 5720 (77.1%) missing valuesMissing
영업장주변구분명 has 7421 (100.0%) missing valuesMissing
등급구분명 has 7421 (100.0%) missing valuesMissing
공장판매직종업원수 has 5809 (78.3%) missing valuesMissing
월세금액 has 5863 (79.0%) missing valuesMissing
다중이용업소여부 has 79 (1.1%) missing valuesMissing
시설총규모 has 7421 (100.0%) missing valuesMissing
전통업소지정번호 has 7421 (100.0%) missing valuesMissing
전통업소음식 has 7421 (100.0%) missing valuesMissing
월세금액 is highly skewed (γ1 = 21.00013619)Skewed
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported
영업장주변구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
등급구분명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
시설총규모 is an unsupported type, check if it needs cleaning or further analysisUnsupported
전통업소지정번호 is an unsupported type, check if it needs cleaning or further analysisUnsupported
전통업소음식 is an unsupported type, check if it needs cleaning or further analysisUnsupported
공장판매직종업원수 has 1595 (21.5%) zerosZeros
월세금액 has 1553 (20.9%) zerosZeros

Reproduction

Analysis started2023-12-10 21:58:57.825531
Analysis finished2023-12-10 21:58:59.984589
Duration2.16 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

Distinct32
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
성남시
677 
용인시
663 
고양시
650 
수원시
 
422
화성시
 
396
Other values (27)
4613 

Length

Max length4
Median length3
Mean length3.0677806
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
성남시 677
 
9.1%
용인시 663
 
8.9%
고양시 650
 
8.8%
수원시 422
 
5.7%
화성시 396
 
5.3%
남양주시 380
 
5.1%
광주시 359
 
4.8%
안양시 351
 
4.7%
부천시 336
 
4.5%
파주시 323
 
4.4%
Other values (22) 2864
38.6%

Length

2023-12-11T06:59:00.048573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성남시 677
 
9.1%
용인시 663
 
8.9%
고양시 650
 
8.8%
수원시 422
 
5.7%
화성시 396
 
5.3%
남양주시 380
 
5.1%
광주시 359
 
4.8%
안양시 351
 
4.7%
부천시 336
 
4.5%
파주시 323
 
4.4%
Other values (22) 2864
38.6%
Distinct6701
Distinct (%)90.3%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
2023-12-11T06:59:00.403751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length27
Mean length7.2387818
Min length2

Characters and Unicode

Total characters53719
Distinct characters900
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6062 ?
Unique (%)81.7%

Sample

1st row주식회사 청리움
2nd row수제식품
3rd row일심
4th row운악농산
5th row비오팜
ValueCountFrequency (%)
주식회사 731
 
8.3%
농업회사법인 90
 
1.0%
34
 
0.4%
영농조합법인 14
 
0.2%
유한회사 12
 
0.1%
푸드 9
 
0.1%
f&b 8
 
0.1%
식품 7
 
0.1%
코퍼레이션 7
 
0.1%
코리아 6
 
0.1%
Other values (6956) 7846
89.5%
2023-12-11T06:59:00.946977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3970
 
7.4%
) 3159
 
5.9%
( 3117
 
5.8%
1606
 
3.0%
1580
 
2.9%
1423
 
2.6%
1345
 
2.5%
1218
 
2.3%
1180
 
2.2%
1009
 
1.9%
Other values (890) 34112
63.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 44374
82.6%
Close Punctuation 3159
 
5.9%
Open Punctuation 3117
 
5.8%
Space Separator 1345
 
2.5%
Uppercase Letter 1000
 
1.9%
Lowercase Letter 457
 
0.9%
Other Punctuation 131
 
0.2%
Decimal Number 125
 
0.2%
Dash Punctuation 10
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3970
 
8.9%
1606
 
3.6%
1580
 
3.6%
1423
 
3.2%
1218
 
2.7%
1180
 
2.7%
1009
 
2.3%
993
 
2.2%
826
 
1.9%
742
 
1.7%
Other values (820) 29827
67.2%
Uppercase Letter
ValueCountFrequency (%)
F 135
13.5%
B 83
 
8.3%
S 74
 
7.4%
C 63
 
6.3%
A 63
 
6.3%
N 58
 
5.8%
O 55
 
5.5%
E 54
 
5.4%
T 44
 
4.4%
M 43
 
4.3%
Other values (15) 328
32.8%
Lowercase Letter
ValueCountFrequency (%)
o 64
14.0%
e 52
11.4%
i 35
 
7.7%
n 35
 
7.7%
t 35
 
7.7%
r 30
 
6.6%
a 30
 
6.6%
s 26
 
5.7%
d 24
 
5.3%
l 19
 
4.2%
Other values (13) 107
23.4%
Decimal Number
ValueCountFrequency (%)
2 26
20.8%
1 26
20.8%
0 16
12.8%
3 13
10.4%
8 10
 
8.0%
9 10
 
8.0%
5 9
 
7.2%
7 8
 
6.4%
6 4
 
3.2%
4 3
 
2.4%
Other Punctuation
ValueCountFrequency (%)
& 77
58.8%
. 34
26.0%
' 6
 
4.6%
, 6
 
4.6%
/ 5
 
3.8%
2
 
1.5%
· 1
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 3159
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3117
100.0%
Space Separator
ValueCountFrequency (%)
1345
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 44374
82.6%
Common 7887
 
14.7%
Latin 1457
 
2.7%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3970
 
8.9%
1606
 
3.6%
1580
 
3.6%
1423
 
3.2%
1218
 
2.7%
1180
 
2.7%
1009
 
2.3%
993
 
2.2%
826
 
1.9%
742
 
1.7%
Other values (820) 29827
67.2%
Latin
ValueCountFrequency (%)
F 135
 
9.3%
B 83
 
5.7%
S 74
 
5.1%
o 64
 
4.4%
C 63
 
4.3%
A 63
 
4.3%
N 58
 
4.0%
O 55
 
3.8%
E 54
 
3.7%
e 52
 
3.6%
Other values (38) 756
51.9%
Common
ValueCountFrequency (%)
) 3159
40.1%
( 3117
39.5%
1345
17.1%
& 77
 
1.0%
. 34
 
0.4%
2 26
 
0.3%
1 26
 
0.3%
0 16
 
0.2%
3 13
 
0.2%
- 10
 
0.1%
Other values (11) 64
 
0.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 44373
82.6%
ASCII 9341
 
17.4%
None 4
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3970
 
8.9%
1606
 
3.6%
1580
 
3.6%
1423
 
3.2%
1218
 
2.7%
1180
 
2.7%
1009
 
2.3%
993
 
2.2%
826
 
1.9%
742
 
1.7%
Other values (819) 29826
67.2%
ASCII
ValueCountFrequency (%)
) 3159
33.8%
( 3117
33.4%
1345
14.4%
F 135
 
1.4%
B 83
 
0.9%
& 77
 
0.8%
S 74
 
0.8%
o 64
 
0.7%
C 63
 
0.7%
A 63
 
0.7%
Other values (57) 1161
 
12.4%
None
ValueCountFrequency (%)
2
50.0%
1
25.0%
· 1
25.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct3699
Distinct (%)49.8%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
Minimum1992-04-30 00:00:00
Maximum2023-12-05 00:00:00
2023-12-11T06:59:01.125470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:59:01.292531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7421
Missing (%)100.0%
Memory size65.4 KiB
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
<NA>
5666 
1
1389 
2
 
366

Length

Max length4
Median length4
Mean length3.2905269
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5666
76.4%
1 1389
 
18.7%
2 366
 
4.9%

Length

2023-12-11T06:59:01.429089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:01.534980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5666
76.4%
1 1389
 
18.7%
2 366
 
4.9%

영업상태명
Categorical

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
운영중
3263 
폐업 등
2403 
영업
1389 
폐업
366 

Length

Max length4
Median length3
Mean length3.0873198
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업
2nd row운영중
3rd row운영중
4th row운영중
5th row운영중

Common Values

ValueCountFrequency (%)
운영중 3263
44.0%
폐업 등 2403
32.4%
영업 1389
18.7%
폐업 366
 
4.9%

Length

2023-12-11T06:59:01.681942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:01.809348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
운영중 3263
33.2%
폐업 2769
28.2%
2403
24.5%
영업 1389
14.1%

폐업일자
Date

MISSING 

Distinct1804
Distinct (%)65.1%
Missing4652
Missing (%)62.7%
Memory size58.1 KiB
Minimum1997-08-28 00:00:00
Maximum2023-12-05 00:00:00
2023-12-11T06:59:01.916142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:59:02.082123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct666
Distinct (%)95.0%
Missing6720
Missing (%)90.6%
Memory size58.1 KiB
2023-12-11T06:59:02.320885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.399429
Min length7

Characters and Unicode

Total characters7991
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique636 ?
Unique (%)90.7%

Sample

1st row070 78424682
2nd row26630527
3rd row031 9698181
4th row02 3963177
5th row031 813 4818
ValueCountFrequency (%)
031 394
 
24.4%
070 90
 
5.6%
02 60
 
3.7%
032 11
 
0.7%
322 7
 
0.4%
593 4
 
0.2%
281 4
 
0.2%
0110 4
 
0.2%
949 3
 
0.2%
339 3
 
0.2%
Other values (921) 1034
64.1%
2023-12-11T06:59:02.659590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1314
16.4%
3 983
12.3%
1 958
12.0%
955
12.0%
7 716
9.0%
2 587
7.3%
8 581
7.3%
5 490
 
6.1%
6 478
 
6.0%
9 469
 
5.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7036
88.0%
Space Separator 955
 
12.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1314
18.7%
3 983
14.0%
1 958
13.6%
7 716
10.2%
2 587
8.3%
8 581
8.3%
5 490
 
7.0%
6 478
 
6.8%
9 469
 
6.7%
4 460
 
6.5%
Space Separator
ValueCountFrequency (%)
955
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7991
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1314
16.4%
3 983
12.3%
1 958
12.0%
955
12.0%
7 716
9.0%
2 587
7.3%
8 581
7.3%
5 490
 
6.1%
6 478
 
6.0%
9 469
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7991
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1314
16.4%
3 983
12.3%
1 958
12.0%
955
12.0%
7 716
9.0%
2 587
7.3%
8 581
7.3%
5 490
 
6.1%
6 478
 
6.0%
9 469
 
5.9%

소재지면적정보
Real number (ℝ)

MISSING 

Distinct940
Distinct (%)59.3%
Missing5835
Missing (%)78.6%
Infinite0
Infinite (%)0.0%
Mean77.89652
Minimum0
Maximum5533.5
Zeros58
Zeros (%)0.8%
Negative0
Negative (%)0.0%
Memory size65.4 KiB
2023-12-11T06:59:02.785318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q19.25
median32
Q378.575
95-th percentile276.0775
Maximum5533.5
Range5533.5
Interquartile range (IQR)69.325

Descriptive statistics

Standard deviation212.78878
Coefficient of variation (CV)2.7316853
Kurtosis315.63378
Mean77.89652
Median Absolute Deviation (MAD)26.57
Skewness14.686725
Sum123543.88
Variance45279.065
MonotonicityNot monotonic
2023-12-11T06:59:02.909853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3.3 126
 
1.7%
0.0 58
 
0.8%
10.0 46
 
0.6%
33.0 38
 
0.5%
6.6 30
 
0.4%
20.0 17
 
0.2%
16.5 16
 
0.2%
15.0 15
 
0.2%
6.0 15
 
0.2%
198.0 12
 
0.2%
Other values (930) 1213
 
16.3%
(Missing) 5835
78.6%
ValueCountFrequency (%)
0.0 58
0.8%
1.0 1
 
< 0.1%
1.2 8
 
0.1%
1.3 1
 
< 0.1%
1.5 1
 
< 0.1%
1.65 3
 
< 0.1%
1.8 1
 
< 0.1%
1.92 1
 
< 0.1%
2.0 10
 
0.1%
2.36 1
 
< 0.1%
ValueCountFrequency (%)
5533.5 1
< 0.1%
3138.0 1
< 0.1%
2572.35 1
< 0.1%
1727.25 1
< 0.1%
1377.18 1
< 0.1%
1309.0 1
< 0.1%
1214.8 1
< 0.1%
1100.04 1
< 0.1%
939.5 1
< 0.1%
930.0 1
< 0.1%

도로명우편번호
Real number (ℝ)

MISSING 

Distinct1094
Distinct (%)62.7%
Missing5676
Missing (%)76.5%
Infinite0
Infinite (%)0.0%
Mean13986.77
Minimum3374
Maximum30128
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size65.4 KiB
2023-12-11T06:59:03.028286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3374
5-th percentile10111.4
Q111485
median13558
Q316650
95-th percentile18409.8
Maximum30128
Range26754
Interquartile range (IQR)5165

Descriptive statistics

Standard deviation2770.0765
Coefficient of variation (CV)0.19804976
Kurtosis-0.54433185
Mean13986.77
Median Absolute Deviation (MAD)2654
Skewness0.17413972
Sum24406914
Variance7673323.6
MonotonicityNot monotonic
2023-12-11T06:59:03.164745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10071 18
 
0.2%
18469 14
 
0.2%
10402 12
 
0.2%
10401 12
 
0.2%
14544 11
 
0.1%
10908 10
 
0.1%
17084 10
 
0.1%
10449 10
 
0.1%
12113 9
 
0.1%
10545 9
 
0.1%
Other values (1084) 1630
 
22.0%
(Missing) 5676
76.5%
ValueCountFrequency (%)
3374 2
< 0.1%
10005 1
 
< 0.1%
10009 2
< 0.1%
10011 1
 
< 0.1%
10013 1
 
< 0.1%
10014 1
 
< 0.1%
10016 3
< 0.1%
10017 1
 
< 0.1%
10019 1
 
< 0.1%
10020 1
 
< 0.1%
ValueCountFrequency (%)
30128 1
 
< 0.1%
18632 1
 
< 0.1%
18629 2
< 0.1%
18626 1
 
< 0.1%
18624 2
< 0.1%
18622 1
 
< 0.1%
18614 1
 
< 0.1%
18608 2
< 0.1%
18606 3
< 0.1%
18593 1
 
< 0.1%
Distinct6787
Distinct (%)94.4%
Missing235
Missing (%)3.2%
Memory size58.1 KiB
2023-12-11T06:59:03.500087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length52
Mean length31.653075
Min length13

Characters and Unicode

Total characters227459
Distinct characters650
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6436 ?
Unique (%)89.6%

Sample

1st row경기도 가평군 설악면 한서로 534, 3층
2nd row경기도 가평군 청평면 경춘로 961, 1층
3rd row경기도 가평군 청평면 머내길 176
4th row경기도 가평군 청평면 행자골길 1
5th row경기도 가평군 설악면 미사리로540번길 4-9, 2층
ValueCountFrequency (%)
경기도 7183
 
15.0%
1층 1201
 
2.5%
성남시 668
 
1.4%
용인시 645
 
1.3%
고양시 643
 
1.3%
2층 576
 
1.2%
수원시 411
 
0.9%
일부 392
 
0.8%
화성시 380
 
0.8%
일부호 380
 
0.8%
Other values (9555) 35434
74.0%
2023-12-11T06:59:03.992699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
40751
 
17.9%
1 9711
 
4.3%
7645
 
3.4%
7550
 
3.3%
7499
 
3.3%
7460
 
3.3%
6907
 
3.0%
6432
 
2.8%
2 5952
 
2.6%
, 5807
 
2.6%
Other values (640) 121745
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 127341
56.0%
Space Separator 40751
 
17.9%
Decimal Number 39820
 
17.5%
Other Punctuation 5848
 
2.6%
Close Punctuation 5123
 
2.3%
Open Punctuation 5123
 
2.3%
Dash Punctuation 2273
 
1.0%
Uppercase Letter 1069
 
0.5%
Lowercase Letter 71
 
< 0.1%
Math Symbol 24
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7645
 
6.0%
7550
 
5.9%
7499
 
5.9%
7460
 
5.9%
6907
 
5.4%
6432
 
5.1%
3699
 
2.9%
3379
 
2.7%
3273
 
2.6%
2938
 
2.3%
Other values (572) 70559
55.4%
Uppercase Letter
ValueCountFrequency (%)
B 233
21.8%
A 207
19.4%
C 74
 
6.9%
I 72
 
6.7%
T 59
 
5.5%
E 47
 
4.4%
D 43
 
4.0%
S 36
 
3.4%
K 35
 
3.3%
L 32
 
3.0%
Other values (16) 231
21.6%
Lowercase Letter
ValueCountFrequency (%)
e 18
25.4%
s 11
15.5%
t 10
14.1%
k 7
 
9.9%
n 7
 
9.9%
a 7
 
9.9%
l 2
 
2.8%
c 2
 
2.8%
r 2
 
2.8%
w 1
 
1.4%
Other values (4) 4
 
5.6%
Decimal Number
ValueCountFrequency (%)
1 9711
24.4%
2 5952
14.9%
3 4178
10.5%
0 3965
10.0%
4 3459
 
8.7%
5 3039
 
7.6%
6 2774
 
7.0%
7 2461
 
6.2%
8 2171
 
5.5%
9 2110
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 5807
99.3%
. 23
 
0.4%
& 8
 
0.1%
: 4
 
0.1%
@ 2
 
< 0.1%
' 1
 
< 0.1%
/ 1
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
8
50.0%
5
31.2%
3
 
18.8%
Math Symbol
ValueCountFrequency (%)
~ 23
95.8%
1
 
4.2%
Space Separator
ValueCountFrequency (%)
40751
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5123
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5123
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2273
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 127341
56.0%
Common 98962
43.5%
Latin 1156
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7645
 
6.0%
7550
 
5.9%
7499
 
5.9%
7460
 
5.9%
6907
 
5.4%
6432
 
5.1%
3699
 
2.9%
3379
 
2.7%
3273
 
2.6%
2938
 
2.3%
Other values (572) 70559
55.4%
Latin
ValueCountFrequency (%)
B 233
20.2%
A 207
17.9%
C 74
 
6.4%
I 72
 
6.2%
T 59
 
5.1%
E 47
 
4.1%
D 43
 
3.7%
S 36
 
3.1%
K 35
 
3.0%
L 32
 
2.8%
Other values (33) 318
27.5%
Common
ValueCountFrequency (%)
40751
41.2%
1 9711
 
9.8%
2 5952
 
6.0%
, 5807
 
5.9%
) 5123
 
5.2%
( 5123
 
5.2%
3 4178
 
4.2%
0 3965
 
4.0%
4 3459
 
3.5%
5 3039
 
3.1%
Other values (15) 11854
 
12.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 127341
56.0%
ASCII 100099
44.0%
Number Forms 16
 
< 0.1%
Math Operators 1
 
< 0.1%
None 1
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
40751
40.7%
1 9711
 
9.7%
2 5952
 
5.9%
, 5807
 
5.8%
) 5123
 
5.1%
( 5123
 
5.1%
3 4178
 
4.2%
0 3965
 
4.0%
4 3459
 
3.5%
5 3039
 
3.0%
Other values (52) 12991
 
13.0%
Hangul
ValueCountFrequency (%)
7645
 
6.0%
7550
 
5.9%
7499
 
5.9%
7460
 
5.9%
6907
 
5.4%
6432
 
5.1%
3699
 
2.9%
3379
 
2.7%
3273
 
2.6%
2938
 
2.3%
Other values (572) 70559
55.4%
Number Forms
ValueCountFrequency (%)
8
50.0%
5
31.2%
3
 
18.8%
Math Operators
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%
Distinct7131
Distinct (%)96.1%
Missing2
Missing (%)< 0.1%
Memory size58.1 KiB
2023-12-11T06:59:04.321450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length50
Mean length27.160669
Min length14

Characters and Unicode

Total characters201505
Distinct characters605
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6898 ?
Unique (%)93.0%

Sample

1st row경기도 가평군 설악면 위곡리 7-2 3층
2nd row경기도 가평군 청평면 하천리 489-6번지
3rd row경기도 가평군 청평면 대성리 243-4번지 외1필지
4th row경기도 가평군 상면 원흥리 466번지
5th row경기도 가평군 청평면 상천리 1139-1번지
ValueCountFrequency (%)
경기도 7416
 
17.2%
1층 792
 
1.8%
성남시 676
 
1.6%
용인시 664
 
1.5%
고양시 650
 
1.5%
수원시 421
 
1.0%
화성시 396
 
0.9%
남양주시 380
 
0.9%
2층 370
 
0.9%
분당구 367
 
0.9%
Other values (9676) 30988
71.9%
2023-12-11T06:59:04.791708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
37218
 
18.5%
1 8761
 
4.3%
7759
 
3.9%
7705
 
3.8%
7695
 
3.8%
7456
 
3.7%
7264
 
3.6%
6753
 
3.4%
- 5882
 
2.9%
5690
 
2.8%
Other values (595) 99322
49.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 117234
58.2%
Decimal Number 38630
 
19.2%
Space Separator 37218
 
18.5%
Dash Punctuation 5882
 
2.9%
Uppercase Letter 893
 
0.4%
Open Punctuation 550
 
0.3%
Close Punctuation 548
 
0.3%
Other Punctuation 463
 
0.2%
Lowercase Letter 57
 
< 0.1%
Letter Number 16
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7759
 
6.6%
7705
 
6.6%
7695
 
6.6%
7456
 
6.4%
7264
 
6.2%
6753
 
5.8%
5690
 
4.9%
3377
 
2.9%
2573
 
2.2%
2422
 
2.1%
Other values (529) 58540
49.9%
Uppercase Letter
ValueCountFrequency (%)
B 189
21.2%
A 158
17.7%
I 75
 
8.4%
C 62
 
6.9%
T 52
 
5.8%
E 45
 
5.0%
D 39
 
4.4%
S 34
 
3.8%
K 31
 
3.5%
L 28
 
3.1%
Other values (16) 180
20.2%
Lowercase Letter
ValueCountFrequency (%)
e 17
29.8%
k 8
14.0%
n 6
 
10.5%
a 6
 
10.5%
s 5
 
8.8%
t 4
 
7.0%
c 3
 
5.3%
l 2
 
3.5%
r 2
 
3.5%
y 1
 
1.8%
Other values (3) 3
 
5.3%
Decimal Number
ValueCountFrequency (%)
1 8761
22.7%
2 5315
13.8%
3 4230
11.0%
0 3622
9.4%
4 3474
 
9.0%
5 3192
 
8.3%
6 2924
 
7.6%
7 2565
 
6.6%
8 2410
 
6.2%
9 2137
 
5.5%
Other Punctuation
ValueCountFrequency (%)
, 418
90.3%
. 25
 
5.4%
& 9
 
1.9%
: 3
 
0.6%
@ 3
 
0.6%
/ 3
 
0.6%
· 1
 
0.2%
' 1
 
0.2%
Letter Number
ValueCountFrequency (%)
8
50.0%
5
31.2%
3
 
18.8%
Math Symbol
ValueCountFrequency (%)
~ 13
92.9%
1
 
7.1%
Space Separator
ValueCountFrequency (%)
37218
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5882
100.0%
Open Punctuation
ValueCountFrequency (%)
( 550
100.0%
Close Punctuation
ValueCountFrequency (%)
) 548
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 117234
58.2%
Common 83305
41.3%
Latin 966
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7759
 
6.6%
7705
 
6.6%
7695
 
6.6%
7456
 
6.4%
7264
 
6.2%
6753
 
5.8%
5690
 
4.9%
3377
 
2.9%
2573
 
2.2%
2422
 
2.1%
Other values (529) 58540
49.9%
Latin
ValueCountFrequency (%)
B 189
19.6%
A 158
16.4%
I 75
 
7.8%
C 62
 
6.4%
T 52
 
5.4%
E 45
 
4.7%
D 39
 
4.0%
S 34
 
3.5%
K 31
 
3.2%
L 28
 
2.9%
Other values (32) 253
26.2%
Common
ValueCountFrequency (%)
37218
44.7%
1 8761
 
10.5%
- 5882
 
7.1%
2 5315
 
6.4%
3 4230
 
5.1%
0 3622
 
4.3%
4 3474
 
4.2%
5 3192
 
3.8%
6 2924
 
3.5%
7 2565
 
3.1%
Other values (14) 6122
 
7.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 117233
58.2%
ASCII 84253
41.8%
Number Forms 16
 
< 0.1%
None 1
 
< 0.1%
Math Operators 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
37218
44.2%
1 8761
 
10.4%
- 5882
 
7.0%
2 5315
 
6.3%
3 4230
 
5.0%
0 3622
 
4.3%
4 3474
 
4.1%
5 3192
 
3.8%
6 2924
 
3.5%
7 2565
 
3.0%
Other values (51) 7070
 
8.4%
Hangul
ValueCountFrequency (%)
7759
 
6.6%
7705
 
6.6%
7695
 
6.6%
7456
 
6.4%
7264
 
6.2%
6753
 
5.8%
5690
 
4.9%
3377
 
2.9%
2573
 
2.2%
2422
 
2.1%
Other values (528) 58539
49.9%
Number Forms
ValueCountFrequency (%)
8
50.0%
5
31.2%
3
 
18.8%
None
ValueCountFrequency (%)
· 1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct2505
Distinct (%)33.8%
Missing6
Missing (%)0.1%
Memory size58.1 KiB
2023-12-11T06:59:05.181166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length6.15118
Min length5

Characters and Unicode

Total characters45611
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1127 ?
Unique (%)15.2%

Sample

1st row477-855
2nd row477816
3rd row477812
4th row12442
5th row477814
ValueCountFrequency (%)
410837 88
 
1.2%
462807 60
 
0.8%
410835 46
 
0.6%
449853 32
 
0.4%
410-837 31
 
0.4%
431815 30
 
0.4%
14544 26
 
0.4%
464894 25
 
0.3%
472501 25
 
0.3%
472861 24
 
0.3%
Other values (2495) 7028
94.8%
2023-12-11T06:59:05.697382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 10711
23.5%
8 5746
12.6%
1 4961
10.9%
0 4392
9.6%
2 3831
 
8.4%
3 3591
 
7.9%
6 3336
 
7.3%
5 3233
 
7.1%
7 2405
 
5.3%
9 1800
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 44006
96.5%
Dash Punctuation 1605
 
3.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 10711
24.3%
8 5746
13.1%
1 4961
11.3%
0 4392
10.0%
2 3831
 
8.7%
3 3591
 
8.2%
6 3336
 
7.6%
5 3233
 
7.3%
7 2405
 
5.5%
9 1800
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 1605
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 45611
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 10711
23.5%
8 5746
12.6%
1 4961
10.9%
0 4392
9.6%
2 3831
 
8.4%
3 3591
 
7.9%
6 3336
 
7.3%
5 3233
 
7.1%
7 2405
 
5.3%
9 1800
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 45611
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 10711
23.5%
8 5746
12.6%
1 4961
10.9%
0 4392
9.6%
2 3831
 
8.4%
3 3591
 
7.9%
6 3336
 
7.3%
5 3233
 
7.1%
7 2405
 
5.3%
9 1800
 
3.9%

WGS84위도
Real number (ℝ)

Distinct5957
Distinct (%)81.1%
Missing72
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean37.444993
Minimum36.491092
Maximum38.185224
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size65.4 KiB
2023-12-11T06:59:05.880230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.491092
5-th percentile37.104031
Q137.295464
median37.404047
Q337.634993
95-th percentile37.803369
Maximum38.185224
Range1.6941318
Interquartile range (IQR)0.33952934

Descriptive statistics

Standard deviation0.21311835
Coefficient of variation (CV)0.0056915045
Kurtosis-0.43051732
Mean37.444993
Median Absolute Deviation (MAD)0.14095305
Skewness0.14923346
Sum275183.25
Variance0.04541943
MonotonicityNot monotonic
2023-12-11T06:59:06.017917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.3923317778 16
 
0.2%
37.6103315074 13
 
0.2%
37.3443549674 13
 
0.2%
37.3889401776 12
 
0.2%
37.3716127805 11
 
0.1%
37.2378906598 10
 
0.1%
37.438418567 9
 
0.1%
37.2748617644 9
 
0.1%
37.436719428 9
 
0.1%
37.370051815 8
 
0.1%
Other values (5947) 7239
97.5%
(Missing) 72
 
1.0%
ValueCountFrequency (%)
36.491091941 1
< 0.1%
36.9165803501 1
< 0.1%
36.9168961792 1
< 0.1%
36.938720379 1
< 0.1%
36.9402870569 2
< 0.1%
36.9439296295 1
< 0.1%
36.9443129561 1
< 0.1%
36.9448447964 1
< 0.1%
36.9463540043 1
< 0.1%
36.9469764639 1
< 0.1%
ValueCountFrequency (%)
38.1852237833 1
< 0.1%
38.1121625152 1
< 0.1%
38.1044947446 1
< 0.1%
38.099152433 1
< 0.1%
38.0857792983 1
< 0.1%
38.0683427258 2
< 0.1%
38.0570591423 1
< 0.1%
38.0516167324 1
< 0.1%
38.0489618246 1
< 0.1%
38.040968574 1
< 0.1%

WGS84경도
Real number (ℝ)

Distinct5957
Distinct (%)81.1%
Missing72
Missing (%)1.0%
Infinite0
Infinite (%)0.0%
Mean127.02284
Minimum126.52556
Maximum127.7557
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size65.4 KiB
2023-12-11T06:59:06.154893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.52556
5-th percentile126.71741
Q1126.8355
median127.04111
Q3127.16542
95-th percentile127.37685
Maximum127.7557
Range1.2301413
Interquartile range (IQR)0.32992087

Descriptive statistics

Standard deviation0.21133728
Coefficient of variation (CV)0.0016637738
Kurtosis-0.21110699
Mean127.02284
Median Absolute Deviation (MAD)0.15574843
Skewness0.23289646
Sum933490.87
Variance0.044663445
MonotonicityNot monotonic
2023-12-11T06:59:06.342198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.9565123356 16
 
0.2%
127.1454453107 13
 
0.2%
127.10510721 13
 
0.2%
127.122552825 12
 
0.2%
126.9514327046 11
 
0.1%
127.1103081978 10
 
0.1%
127.1780452538 9
 
0.1%
127.0800687639 9
 
0.1%
127.1698572002 9
 
0.1%
126.9530364977 8
 
0.1%
Other values (5947) 7239
97.5%
(Missing) 72
 
1.0%
ValueCountFrequency (%)
126.5255574103 1
< 0.1%
126.5392340859 1
< 0.1%
126.5431591094 2
< 0.1%
126.5444458283 1
< 0.1%
126.5448899178 1
< 0.1%
126.5472916706 1
< 0.1%
126.5497387546 1
< 0.1%
126.5534964268 1
< 0.1%
126.5538438955 1
< 0.1%
126.553924773 1
< 0.1%
ValueCountFrequency (%)
127.7556986623 1
< 0.1%
127.7410826858 1
< 0.1%
127.7390928435 1
< 0.1%
127.7322701087 1
< 0.1%
127.7174001724 1
< 0.1%
127.7117391788 2
< 0.1%
127.7102109036 1
< 0.1%
127.7052478804 1
< 0.1%
127.7033737721 2
< 0.1%
127.6787410152 1
< 0.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
<NA>
5666 
유통전문판매업
1755 

Length

Max length7
Median length4
Mean length4.7094731
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유통전문판매업
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5666
76.4%
유통전문판매업 1755
 
23.6%

Length

2023-12-11T06:59:06.480078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:06.599692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5666
76.4%
유통전문판매업 1755
 
23.6%

X좌표값
Real number (ℝ)

MISSING 

Distinct1476
Distinct (%)86.8%
Missing5720
Missing (%)77.1%
Infinite0
Infinite (%)0.0%
Mean201233.21
Minimum159314.41
Maximum263503.33
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size65.4 KiB
2023-12-11T06:59:06.739741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum159314.41
5-th percentile173618.26
Q1184396.67
median204053.16
Q3212259.83
95-th percentile232066.44
Maximum263503.33
Range104188.92
Interquartile range (IQR)27863.167

Descriptive statistics

Standard deviation18736.143
Coefficient of variation (CV)0.093106616
Kurtosis-0.0511922
Mean201233.21
Median Absolute Deviation (MAD)12438.709
Skewness0.21131426
Sum3.4229769 × 108
Variance3.5104307 × 108
MonotonicityNot monotonic
2023-12-11T06:59:06.891268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
209714.027222914 10
 
0.1%
213491.170141499 7
 
0.1%
166542.983302429 7
 
0.1%
207902.474796614 7
 
0.1%
211047.581577928 6
 
0.1%
178038.654941912 6
 
0.1%
178804.224364993 5
 
0.1%
190327.674570207 4
 
0.1%
190124.458858418 4
 
0.1%
179849.580666893 4
 
0.1%
Other values (1466) 1641
 
22.1%
(Missing) 5720
77.1%
ValueCountFrequency (%)
159314.412542039 1
< 0.1%
160005.834801456 1
< 0.1%
160782.488155845 1
< 0.1%
160999.5126943 1
< 0.1%
161171.526286835 1
< 0.1%
162245.357956494 1
< 0.1%
162258.097455238 1
< 0.1%
162322.488198526 1
< 0.1%
162809.708386591 1
< 0.1%
163400.673329392 1
< 0.1%
ValueCountFrequency (%)
263503.334786381 1
< 0.1%
260103.749908482 1
< 0.1%
258987.97299564 1
< 0.1%
258508.583739121 1
< 0.1%
258244.697456528 1
< 0.1%
258231.881573701 1
< 0.1%
258179.530853098 1
< 0.1%
257543.596351006 1
< 0.1%
257500.617431035 1
< 0.1%
257070.620588812 1
< 0.1%

Y좌표값
Real number (ℝ)

MISSING 

Distinct1476
Distinct (%)86.8%
Missing5720
Missing (%)77.1%
Infinite0
Infinite (%)0.0%
Mean439697.6
Minimum332261.72
Maximum510706.88
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size65.4 KiB
2023-12-11T06:59:07.032995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum332261.72
5-th percentile404077.82
Q1419888.95
median435332.56
Q3460894.74
95-th percentile479225.3
Maximum510706.88
Range178445.16
Interquartile range (IQR)41005.788

Descriptive statistics

Standard deviation24429.488
Coefficient of variation (CV)0.055559748
Kurtosis-0.59634177
Mean439697.6
Median Absolute Deviation (MAD)19806.594
Skewness0.10659711
Sum7.4792562 × 108
Variance5.9679989 × 108
MonotonicityNot monotonic
2023-12-11T06:59:07.163365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
415112.443559383 10
 
0.1%
418778.755712447 7
 
0.1%
460146.42097319 7
 
0.1%
412102.059482355 7
 
0.1%
460377.706809703 6
 
0.1%
444806.206937757 6
 
0.1%
468269.565890856 5
 
0.1%
455360.926623372 4
 
0.1%
435332.556961729 4
 
0.1%
461692.59774274 4
 
0.1%
Other values (1466) 1641
 
22.1%
(Missing) 5720
77.1%
ValueCountFrequency (%)
332261.720946312 1
< 0.1%
382496.944194433 1
< 0.1%
383353.711740691 1
< 0.1%
383398.020242234 1
< 0.1%
386801.395629022 1
< 0.1%
387336.981684963 1
< 0.1%
387358.575437611 1
< 0.1%
387457.28616239 1
< 0.1%
387466.269788875 1
< 0.1%
387559.389577978 1
< 0.1%
ValueCountFrequency (%)
510706.876236034 1
< 0.1%
509193.042846029 1
< 0.1%
507339.645685303 1
< 0.1%
506026.952448755 1
< 0.1%
505420.338446721 1
< 0.1%
505114.726157185 1
< 0.1%
502946.983088466 1
< 0.1%
499598.94226441 1
< 0.1%
499481.510906398 1
< 0.1%
495427.901018012 1
< 0.1%

위생업태명
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
유통전문판매업
7342 
<NA>
 
79

Length

Max length7
Median length7
Mean length6.9680636
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유통전문판매업
2nd row유통전문판매업
3rd row유통전문판매업
4th row유통전문판매업
5th row유통전문판매업

Common Values

ValueCountFrequency (%)
유통전문판매업 7342
98.9%
<NA> 79
 
1.1%

Length

2023-12-11T06:59:07.322098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:07.685615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유통전문판매업 7342
98.9%
na 79
 
1.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
<NA>
5868 
0
1553 

Length

Max length4
Median length4
Mean length3.372187
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5868
79.1%
0 1553
 
20.9%

Length

2023-12-11T06:59:07.787935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:07.893842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5868
79.1%
0 1553
 
20.9%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
<NA>
5868 
0
1553 

Length

Max length4
Median length4
Mean length3.372187
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5868
79.1%
0 1553
 
20.9%

Length

2023-12-11T06:59:08.013016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:08.109551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5868
79.1%
0 1553
 
20.9%

영업장주변구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7421
Missing (%)100.0%
Memory size65.4 KiB

등급구분명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7421
Missing (%)100.0%
Memory size65.4 KiB

본사종업원수
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
<NA>
5809 
0
1611 
1
 
1

Length

Max length4
Median length4
Mean length3.3483358
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5809
78.3%
0 1611
 
21.7%
1 1
 
< 0.1%

Length

2023-12-11T06:59:08.210929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:08.310459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5809
78.3%
0 1611
 
21.7%
1 1
 
< 0.1%

공장사무직종업원수
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
<NA>
5807 
0
1590 
1
 
12
2
 
6
3
 
5

Length

Max length4
Median length4
Mean length3.3475273
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5807
78.3%
0 1590
 
21.4%
1 12
 
0.2%
2 6
 
0.1%
3 5
 
0.1%
4 1
 
< 0.1%

Length

2023-12-11T06:59:08.412523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:08.519896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5807
78.3%
0 1590
 
21.4%
1 12
 
0.2%
2 6
 
0.1%
3 5
 
0.1%
4 1
 
< 0.1%

공장판매직종업원수
Real number (ℝ)

MISSING  ZEROS 

Distinct6
Distinct (%)0.4%
Missing5809
Missing (%)78.3%
Infinite0
Infinite (%)0.0%
Mean0.021712159
Minimum0
Maximum5
Zeros1595
Zeros (%)21.5%
Negative0
Negative (%)0.0%
Memory size65.4 KiB
2023-12-11T06:59:08.606529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.25192009
Coefficient of variation (CV)11.60272
Kurtosis250.92394
Mean0.021712159
Median Absolute Deviation (MAD)0
Skewness14.862323
Sum35
Variance0.063463733
MonotonicityNot monotonic
2023-12-11T06:59:08.703116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 1595
 
21.5%
1 8
 
0.1%
2 5
 
0.1%
5 2
 
< 0.1%
3 1
 
< 0.1%
4 1
 
< 0.1%
(Missing) 5809
78.3%
ValueCountFrequency (%)
0 1595
21.5%
1 8
 
0.1%
2 5
 
0.1%
3 1
 
< 0.1%
4 1
 
< 0.1%
5 2
 
< 0.1%
ValueCountFrequency (%)
5 2
 
< 0.1%
4 1
 
< 0.1%
3 1
 
< 0.1%
2 5
 
0.1%
1 8
 
0.1%
0 1595
21.5%

공장생산직종업원수
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
<NA>
5808 
0
1602 
1
 
4
2
 
3
3
 
3

Length

Max length4
Median length4
Mean length3.3479315
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5808
78.3%
0 1602
 
21.6%
1 4
 
0.1%
2 3
 
< 0.1%
3 3
 
< 0.1%
6 1
 
< 0.1%

Length

2023-12-11T06:59:08.825266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:08.951166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5808
78.3%
0 1602
 
21.6%
1 4
 
0.1%
2 3
 
< 0.1%
3 3
 
< 0.1%
6 1
 
< 0.1%

보증금액
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size58.1 KiB
<NA>
5863 
0
1554 
20000000
 
1
10000000
 
1
5000000
 
1

Length

Max length8
Median length4
Mean length3.3736693
Min length1

Unique

Unique4 ?
Unique (%)0.1%

Sample

1st row0
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 5863
79.0%
0 1554
 
20.9%
20000000 1
 
< 0.1%
10000000 1
 
< 0.1%
5000000 1
 
< 0.1%
1000000 1
 
< 0.1%

Length

2023-12-11T06:59:09.088155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:59:09.201626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 5863
79.0%
0 1554
 
20.9%
20000000 1
 
< 0.1%
10000000 1
 
< 0.1%
5000000 1
 
< 0.1%
1000000 1
 
< 0.1%

월세금액
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct6
Distinct (%)0.4%
Missing5863
Missing (%)79.0%
Infinite0
Infinite (%)0.0%
Mean2086.0077
Minimum0
Maximum1000000
Zeros1553
Zeros (%)20.9%
Negative0
Negative (%)0.0%
Memory size65.4 KiB
2023-12-11T06:59:09.291587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum1000000
Range1000000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation40196.375
Coefficient of variation (CV)19.269524
Kurtosis461.52369
Mean2086.0077
Median Absolute Deviation (MAD)0
Skewness21.000136
Sum3250000
Variance1.6157485 × 109
MonotonicityNot monotonic
2023-12-11T06:59:09.398828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 1553
 
20.9%
1000000 1
 
< 0.1%
900000 1
 
< 0.1%
500000 1
 
< 0.1%
650000 1
 
< 0.1%
200000 1
 
< 0.1%
(Missing) 5863
79.0%
ValueCountFrequency (%)
0 1553
20.9%
200000 1
 
< 0.1%
500000 1
 
< 0.1%
650000 1
 
< 0.1%
900000 1
 
< 0.1%
1000000 1
 
< 0.1%
ValueCountFrequency (%)
1000000 1
 
< 0.1%
900000 1
 
< 0.1%
650000 1
 
< 0.1%
500000 1
 
< 0.1%
200000 1
 
< 0.1%
0 1553
20.9%

다중이용업소여부
Boolean

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing79
Missing (%)1.1%
Memory size14.6 KiB
False
7341 
True
 
1
(Missing)
 
79
ValueCountFrequency (%)
False 7341
98.9%
True 1
 
< 0.1%
(Missing) 79
 
1.1%
2023-12-11T06:59:09.487970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

시설총규모
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7421
Missing (%)100.0%
Memory size65.4 KiB

전통업소지정번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7421
Missing (%)100.0%
Memory size65.4 KiB

전통업소음식
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7421
Missing (%)100.0%
Memory size65.4 KiB

Sample

시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값위생업태명남성종사자수여성종사자수영업장주변구분명등급구분명본사종업원수공장사무직종업원수공장판매직종업원수공장생산직종업원수보증금액월세금액다중이용업소여부시설총규모전통업소지정번호전통업소음식
0가평군주식회사 청리움2022-07-29<NA>1영업<NA><NA>10.012463경기도 가평군 설악면 한서로 534, 3층경기도 가평군 설악면 위곡리 7-2 3층477-85537.661818127.543062유통전문판매업247822.828445462315.042126유통전문판매업00<NA><NA>000000N<NA><NA><NA>
1가평군수제식품20170814<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 청평면 경춘로 961, 1층경기도 가평군 청평면 하천리 489-6번지47781637.747792127.428988<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
2가평군일심20180404<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 청평면 머내길 176경기도 가평군 청평면 대성리 243-4번지 외1필지47781237.718505127.380509<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
3가평군운악농산20010403<NA><NA>운영중<NA><NA><NA><NA><NA>경기도 가평군 상면 원흥리 466번지1244237.81548127.323732<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
4가평군비오팜20170926<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 청평면 행자골길 1경기도 가평군 청평면 상천리 1139-1번지47781437.766153127.446886<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
5가평군주식회사 더킹스20180320<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 설악면 미사리로540번길 4-9, 2층경기도 가평군 설악면 미사리 266-2번지 2층47785337.698693127.540644<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
6가평군유진인터내셔널20151013<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 청평면 경춘로 866, 2층경기도 가평군 청평면 청평리 319-46번지 2층 1.2동47781337.740533127.423092<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
7가평군백품푸드케어20180504<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 청평면 경춘로 1225경기도 가평군 청평면 상천리 1148-1번지47781437.766513127.446364<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
8가평군(주)티마드20091215<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 청평면 톳골길 53, 1층경기도 가평군 청평면 청평리 312-32번지 외 1필지, 1층47781337.740711127.420527<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
9가평군가평잣한과&에덴떡집20060811<NA><NA>운영중<NA><NA><NA><NA>경기도 가평군 청평면 상천고갯길 2-1 (외1필지)경기도 가평군 청평면 상천리 254-1번지 외1필지47781437.780112127.465375<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
시군명사업장명인허가일자인허가취소일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값위생업태명남성종사자수여성종사자수영업장주변구분명등급구분명본사종업원수공장사무직종업원수공장판매직종업원수공장생산직종업원수보증금액월세금액다중이용업소여부시설총규모전통업소지정번호전통업소음식
7411화성시(주)중외제약19940929<NA><NA>폐업 등20071018<NA><NA><NA>경기도 화성시 안녕남로 181경기도 화성시 안녕동 146-141번지44538037.198136126.999172<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
7412화성시유푸드20050929<NA><NA>폐업 등20140311<NA><NA><NA><NA>경기도 화성시 봉담읍 왕림리 60-5번지44590637.201796126.945097<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
7413화성시토속식품(주)20050913<NA><NA>폐업 등20090908<NA><NA><NA>경기도 화성시 우정읍 버들로 76경기도 화성시 우정읍 조암리 492-1번지44595537.080031126.807204<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
7414화성시다드림푸드20050802<NA><NA>폐업 등20090901<NA><NA><NA>경기도 화성시 비봉면 주석로485번길 19경기도 화성시 비봉면 자안리 680번지 외2필지44584337.199592126.875075<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
7415화성시MR푸드20050425<NA><NA>폐업 등20140311<NA><NA><NA>경기도 화성시 팔탄면 온천로 429-1 (2층)경기도 화성시 팔탄면 율암리 838-3번지 2층44591337.153394126.880039<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
7416화성시(주)백두산유통20050509<NA><NA>폐업 등20070601<NA><NA><NA>경기도 화성시 태안로 89경기도 화성시 병점동 520-2번지 한일타운 상가 201호(일부)44536037.200068127.037697<NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
7417화성시(주)연자방19981019<NA><NA>폐업 등20051019<NA><NA><NA><NA>경기도 화성시 청계동 510-68번지445140<NA><NA><NA><NA><NA>유통전문판매업<NA><NA><NA><NA><NA><NA><NA><NA><NA><NA>N<NA><NA><NA>
7418<NA>위라클2022-08-17<NA>1영업<NA>070 7626000110.030128세종특별자치시 나성북1로 22, 디펠리체 8층 803-42호 (나성동)세종특별자치시 나성동 790 디펠리체339-00336.491092127.258769유통전문판매업223110.250959332261.720946유통전문판매업00<NA><NA>000000N<NA><NA><NA>
7419<NA>네이처스템2021-11-01<NA>2폐업2023-11-0802 355435515.03374서울특별시 은평구 통일로 590, 2동 902호 (녹번동, 대림아파트)서울특별시 은평구 녹번동 276 대림아파트 2동 902호122-77337.599876126.936881유통전문판매업194367.970248455314.710894유통전문판매업00<NA><NA>001000N<NA><NA><NA>
7420<NA>네이처스템2021-11-01<NA>2폐업2023-10-2602 355435515.03374서울특별시 은평구 통일로 590, 2동 902호 (녹번동, 대림아파트)서울특별시 은평구 녹번동 276 대림아파트 2동 902호122-77337.599876126.936881유통전문판매업194367.970248455314.710894유통전문판매업00<NA><NA>001000N<NA><NA><NA>

Duplicate rows

Most frequently occurring

시군명사업장명인허가일자영업상태구분코드영업상태명폐업일자소재지시설전화번호소재지면적정보도로명우편번호소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도업태구분명정보X좌표값Y좌표값위생업태명남성종사자수여성종사자수본사종업원수공장사무직종업원수공장판매직종업원수공장생산직종업원수보증금액월세금액다중이용업소여부# duplicates
1오산시광고장수2021-04-222폐업2023-05-10<NA>71.6918102경기도 오산시 독산성로270번길 141, 1층 (세교동)경기도 오산시 세교동 483-2 1층447-24037.183894127.036883유통전문판매업203202.885073409110.100407유통전문판매업00000000N3
0고양시(주)테일러팜스2010-04-052폐업2023-08-28901 965593.610442경기도 고양시 일산동구 일산로 138 (백석동,일산테크노타운 부대동 401호일부)경기도 고양시 일산동구 백석동 1141-1 일산테크노타운 부대동 401호일부410-83537.650785126.794625유통전문판매업181835.358339460894.742344유통전문판매업00000000N2