Overview

Dataset statistics

Number of variables12
Number of observations7080
Missing cells3184
Missing cells (%)3.7%
Duplicate rows107
Duplicate rows (%)1.5%
Total size in memory677.7 KiB
Average record size in memory98.0 B

Variable types

Categorical2
Text5
DateTime3
Numeric2

Dataset

Description경상남도 거제시 식품위생업소 현황(업종명, 업소명, 인허가일자, 영업자시작일, 주소, 전화번호, 급수시설, 업태명, 좌표, 데이터기준일)등에 대한 정보를 제공합니다.
Author경상남도 거제시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3079230

Alerts

기준일자 has constant value ""Constant
Dataset has 107 (1.5%) duplicate rowsDuplicates
급수시설 is highly imbalanced (70.3%)Imbalance
소재지전화 has 3137 (44.3%) missing valuesMissing

Reproduction

Analysis started2023-12-10 23:51:23.782326
Analysis finished2023-12-10 23:51:26.215489
Duration2.43 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct17
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size55.4 KiB
일반음식점
4157 
휴게음식점
1030 
건강기능식품일반판매업
 
366
즉석판매제조가공업
 
342
유흥주점영업
 
336
Other values (12)
849 

Length

Max length13
Median length5
Mean length5.7032486
Min length4

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 4157
58.7%
휴게음식점 1030
 
14.5%
건강기능식품일반판매업 366
 
5.2%
즉석판매제조가공업 342
 
4.8%
유흥주점영업 336
 
4.7%
집단급식소 244
 
3.4%
식품자동판매기영업 158
 
2.2%
제과점영업 90
 
1.3%
위탁급식영업 77
 
1.1%
식품제조가공업 75
 
1.1%
Other values (7) 205
 
2.9%

Length

2023-12-11T08:51:26.281584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반음식점 4157
58.6%
휴게음식점 1030
 
14.5%
건강기능식품일반판매업 366
 
5.2%
즉석판매제조가공업 342
 
4.8%
유흥주점영업 336
 
4.7%
집단급식소 263
 
3.7%
식품자동판매기영업 158
 
2.2%
제과점영업 90
 
1.3%
위탁급식영업 77
 
1.1%
식품제조가공업 75
 
1.1%
Other values (7) 205
 
2.9%
Distinct6279
Distinct (%)88.7%
Missing0
Missing (%)0.0%
Memory size55.4 KiB
2023-12-11T08:51:26.556220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length28
Mean length7.264548
Min length1

Characters and Unicode

Total characters51433
Distinct characters998
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5548 ?
Unique (%)78.4%

Sample

1st row성영식당
2nd row맥심경양식
3rd row인덕식당
4th row털보네분식
5th row도밍고레스토랑
ValueCountFrequency (%)
거제고현점 102
 
1.2%
세븐일레븐 84
 
0.9%
거제점 72
 
0.8%
고현점 68
 
0.8%
씨유 63
 
0.7%
거제아주점 56
 
0.6%
아주점 56
 
0.6%
옥포점 48
 
0.5%
장평점 45
 
0.5%
gs25 34
 
0.4%
Other values (6391) 8225
92.9%
2023-12-11T08:51:26.960158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1960
 
3.8%
1780
 
3.5%
1301
 
2.5%
1215
 
2.4%
( 1058
 
2.1%
) 1056
 
2.1%
999
 
1.9%
781
 
1.5%
716
 
1.4%
619
 
1.2%
Other values (988) 39948
77.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 44043
85.6%
Space Separator 1780
 
3.5%
Uppercase Letter 1499
 
2.9%
Open Punctuation 1058
 
2.1%
Close Punctuation 1056
 
2.1%
Lowercase Letter 988
 
1.9%
Decimal Number 747
 
1.5%
Other Punctuation 237
 
0.5%
Dash Punctuation 20
 
< 0.1%
Math Symbol 2
 
< 0.1%
Other values (3) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1960
 
4.5%
1301
 
3.0%
1215
 
2.8%
999
 
2.3%
781
 
1.8%
716
 
1.6%
619
 
1.4%
601
 
1.4%
592
 
1.3%
557
 
1.3%
Other values (907) 34702
78.8%
Uppercase Letter
ValueCountFrequency (%)
C 143
 
9.5%
A 127
 
8.5%
S 121
 
8.1%
E 117
 
7.8%
B 116
 
7.7%
G 93
 
6.2%
O 81
 
5.4%
H 72
 
4.8%
D 61
 
4.1%
L 61
 
4.1%
Other values (16) 507
33.8%
Lowercase Letter
ValueCountFrequency (%)
e 133
13.5%
a 123
12.4%
o 83
 
8.4%
r 77
 
7.8%
i 58
 
5.9%
n 54
 
5.5%
l 50
 
5.1%
f 48
 
4.9%
u 40
 
4.0%
t 39
 
3.9%
Other values (15) 283
28.6%
Other Punctuation
ValueCountFrequency (%)
& 99
41.8%
. 77
32.5%
31
 
13.1%
# 8
 
3.4%
' 7
 
3.0%
: 5
 
2.1%
/ 3
 
1.3%
· 3
 
1.3%
@ 2
 
0.8%
1
 
0.4%
Decimal Number
ValueCountFrequency (%)
2 219
29.3%
5 131
17.5%
1 106
14.2%
0 72
 
9.6%
4 48
 
6.4%
9 48
 
6.4%
6 33
 
4.4%
8 32
 
4.3%
3 30
 
4.0%
7 28
 
3.7%
Math Symbol
ValueCountFrequency (%)
~ 1
50.0%
+ 1
50.0%
Space Separator
ValueCountFrequency (%)
1780
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1058
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1056
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Other Number
ValueCountFrequency (%)
³ 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
˚ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 44016
85.6%
Common 4903
 
9.5%
Latin 2487
 
4.8%
Han 27
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1960
 
4.5%
1301
 
3.0%
1215
 
2.8%
999
 
2.3%
781
 
1.8%
716
 
1.6%
619
 
1.4%
601
 
1.4%
592
 
1.3%
557
 
1.3%
Other values (886) 34675
78.8%
Latin
ValueCountFrequency (%)
C 143
 
5.7%
e 133
 
5.3%
A 127
 
5.1%
a 123
 
4.9%
S 121
 
4.9%
E 117
 
4.7%
B 116
 
4.7%
G 93
 
3.7%
o 83
 
3.3%
O 81
 
3.3%
Other values (41) 1350
54.3%
Common
ValueCountFrequency (%)
1780
36.3%
( 1058
21.6%
) 1056
21.5%
2 219
 
4.5%
5 131
 
2.7%
1 106
 
2.2%
& 99
 
2.0%
. 77
 
1.6%
0 72
 
1.5%
4 48
 
1.0%
Other values (20) 257
 
5.2%
Han
ValueCountFrequency (%)
3
 
11.1%
2
 
7.4%
2
 
7.4%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Other values (11) 11
40.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 44016
85.6%
ASCII 7352
 
14.3%
None 36
 
0.1%
CJK 25
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%
Modifier Letters 1
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1960
 
4.5%
1301
 
3.0%
1215
 
2.8%
999
 
2.3%
781
 
1.8%
716
 
1.6%
619
 
1.4%
601
 
1.4%
592
 
1.3%
557
 
1.3%
Other values (886) 34675
78.8%
ASCII
ValueCountFrequency (%)
1780
24.2%
( 1058
14.4%
) 1056
14.4%
2 219
 
3.0%
C 143
 
1.9%
e 133
 
1.8%
5 131
 
1.8%
A 127
 
1.7%
a 123
 
1.7%
S 121
 
1.6%
Other values (65) 2461
33.5%
None
ValueCountFrequency (%)
31
86.1%
· 3
 
8.3%
³ 1
 
2.8%
1
 
2.8%
CJK
ValueCountFrequency (%)
3
 
12.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (9) 9
36.0%
Modifier Letters
ValueCountFrequency (%)
˚ 1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%
Distinct3741
Distinct (%)52.8%
Missing0
Missing (%)0.0%
Memory size55.4 KiB
Minimum1964-10-31 00:00:00
Maximum2022-08-17 00:00:00
2023-12-11T08:51:27.132687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:51:27.290422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2370
Distinct (%)33.5%
Missing0
Missing (%)0.0%
Memory size55.4 KiB
Minimum1970-09-08 00:00:00
Maximum2022-08-17 00:00:00
2023-12-11T08:51:27.522014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:51:27.916651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct5817
Distinct (%)82.6%
Missing41
Missing (%)0.6%
Memory size55.4 KiB
2023-12-11T08:51:28.238488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length57
Mean length29.050859
Min length18

Characters and Unicode

Total characters204489
Distinct characters430
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4896 ?
Unique (%)69.6%

Sample

1st row경상남도 거제시 장승포로 49 (장승포동,132.133.134호)
2nd row경상남도 거제시 하청면 하청로 17
3rd row경상남도 거제시 서문로 80 (고현동,중앙빌딩 104호)
4th row경상남도 거제시 옥포로 186 (옥포동)
5th row경상남도 거제시 문동폭포길 12 (문동동)
ValueCountFrequency (%)
경상남도 7039
 
18.3%
거제시 7039
 
18.3%
고현동 1137
 
3.0%
1층 890
 
2.3%
옥포동 617
 
1.6%
장평동 501
 
1.3%
아주동 484
 
1.3%
일운면 393
 
1.0%
거제중앙로 350
 
0.9%
거제대로 312
 
0.8%
Other values (4308) 19654
51.2%
2023-12-11T08:51:28.764665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31382
 
15.3%
1 12242
 
6.0%
9001
 
4.4%
8938
 
4.4%
7925
 
3.9%
7393
 
3.6%
7291
 
3.6%
7176
 
3.5%
7129
 
3.5%
7064
 
3.5%
Other values (420) 98948
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 116708
57.1%
Decimal Number 34562
 
16.9%
Space Separator 31382
 
15.3%
Open Punctuation 6622
 
3.2%
Close Punctuation 6622
 
3.2%
Other Punctuation 6457
 
3.2%
Dash Punctuation 1607
 
0.8%
Uppercase Letter 297
 
0.1%
Math Symbol 174
 
0.1%
Lowercase Letter 57
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9001
 
7.7%
8938
 
7.7%
7925
 
6.8%
7393
 
6.3%
7291
 
6.2%
7176
 
6.1%
7129
 
6.1%
7064
 
6.1%
6071
 
5.2%
5290
 
4.5%
Other values (375) 43430
37.2%
Uppercase Letter
ValueCountFrequency (%)
B 91
30.6%
A 66
22.2%
K 23
 
7.7%
C 19
 
6.4%
R 19
 
6.4%
P 18
 
6.1%
I 17
 
5.7%
S 10
 
3.4%
D 8
 
2.7%
J 5
 
1.7%
Other values (8) 21
 
7.1%
Decimal Number
ValueCountFrequency (%)
1 12242
35.4%
2 5191
15.0%
3 3354
 
9.7%
0 2944
 
8.5%
4 2544
 
7.4%
5 2142
 
6.2%
6 1796
 
5.2%
7 1576
 
4.6%
8 1488
 
4.3%
9 1285
 
3.7%
Lowercase Letter
ValueCountFrequency (%)
e 41
71.9%
o 6
 
10.5%
p 2
 
3.5%
k 2
 
3.5%
r 2
 
3.5%
j 2
 
3.5%
t 2
 
3.5%
Other Punctuation
ValueCountFrequency (%)
6433
99.6%
. 20
 
0.3%
/ 2
 
< 0.1%
@ 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
31382
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6622
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6622
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1607
100.0%
Math Symbol
ValueCountFrequency (%)
~ 174
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 116706
57.1%
Common 87426
42.8%
Latin 355
 
0.2%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9001
 
7.7%
8938
 
7.7%
7925
 
6.8%
7393
 
6.3%
7291
 
6.2%
7176
 
6.1%
7129
 
6.1%
7064
 
6.1%
6071
 
5.2%
5290
 
4.5%
Other values (374) 43428
37.2%
Latin
ValueCountFrequency (%)
B 91
25.6%
A 66
18.6%
e 41
11.5%
K 23
 
6.5%
C 19
 
5.4%
R 19
 
5.4%
P 18
 
5.1%
I 17
 
4.8%
S 10
 
2.8%
D 8
 
2.3%
Other values (16) 43
12.1%
Common
ValueCountFrequency (%)
31382
35.9%
1 12242
 
14.0%
( 6622
 
7.6%
) 6622
 
7.6%
6433
 
7.4%
2 5191
 
5.9%
3 3354
 
3.8%
0 2944
 
3.4%
4 2544
 
2.9%
5 2142
 
2.5%
Other values (9) 7950
 
9.1%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 116706
57.1%
ASCII 81347
39.8%
None 6433
 
3.1%
CJK 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
31382
38.6%
1 12242
 
15.0%
( 6622
 
8.1%
) 6622
 
8.1%
2 5191
 
6.4%
3 3354
 
4.1%
0 2944
 
3.6%
4 2544
 
3.1%
5 2142
 
2.6%
6 1796
 
2.2%
Other values (33) 6508
 
8.0%
Hangul
ValueCountFrequency (%)
9001
 
7.7%
8938
 
7.7%
7925
 
6.8%
7393
 
6.3%
7291
 
6.2%
7176
 
6.1%
7129
 
6.1%
7064
 
6.1%
6071
 
5.2%
5290
 
4.5%
Other values (374) 43428
37.2%
None
ValueCountFrequency (%)
6433
100.0%
CJK
ValueCountFrequency (%)
2
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct5307
Distinct (%)75.0%
Missing2
Missing (%)< 0.1%
Memory size55.4 KiB
2023-12-11T08:51:29.130828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length47
Mean length23.384007
Min length15

Characters and Unicode

Total characters165512
Distinct characters399
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4179 ?
Unique (%)59.0%

Sample

1st row경상남도 거제시 장승포동 283-99 132.133.134호
2nd row경상남도 거제시 옥포동 322
3rd row경상남도 거제시 하청면 하청리 658-4
4th row경상남도 거제시 고현동 961-178 중앙빌딩 104호
5th row경상남도 거제시 옥포동 544-4
ValueCountFrequency (%)
거제시 7073
20.6%
경상남도 7070
20.6%
고현동 1582
 
4.6%
1층 1125
 
3.3%
옥포동 1054
 
3.1%
장평동 758
 
2.2%
아주동 628
 
1.8%
일운면 394
 
1.1%
상동동 337
 
1.0%
장승포동 294
 
0.9%
Other values (4820) 14039
40.9%
2023-12-11T08:51:29.576839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33937
20.5%
1 9072
 
5.5%
7687
 
4.6%
7547
 
4.6%
7541
 
4.6%
7256
 
4.4%
7211
 
4.4%
7161
 
4.3%
7102
 
4.3%
6527
 
3.9%
Other values (389) 64471
39.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 88312
53.4%
Decimal Number 34634
 
20.9%
Space Separator 33937
 
20.5%
Dash Punctuation 5661
 
3.4%
Close Punctuation 1263
 
0.8%
Open Punctuation 1262
 
0.8%
Uppercase Letter 195
 
0.1%
Other Punctuation 160
 
0.1%
Math Symbol 55
 
< 0.1%
Lowercase Letter 31
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7687
 
8.7%
7547
 
8.5%
7541
 
8.5%
7256
 
8.2%
7211
 
8.2%
7161
 
8.1%
7102
 
8.0%
6527
 
7.4%
1999
 
2.3%
1852
 
2.1%
Other values (350) 26429
29.9%
Uppercase Letter
ValueCountFrequency (%)
B 46
23.6%
A 30
15.4%
K 21
10.8%
P 19
9.7%
I 18
 
9.2%
R 17
 
8.7%
C 13
 
6.7%
D 6
 
3.1%
L 5
 
2.6%
S 5
 
2.6%
Other values (8) 15
 
7.7%
Decimal Number
ValueCountFrequency (%)
1 9072
26.2%
2 4114
11.9%
3 3124
 
9.0%
9 3113
 
9.0%
0 2962
 
8.6%
5 2806
 
8.1%
4 2612
 
7.5%
6 2571
 
7.4%
7 2195
 
6.3%
8 2065
 
6.0%
Other Punctuation
ValueCountFrequency (%)
139
86.9%
. 19
 
11.9%
@ 2
 
1.2%
Space Separator
ValueCountFrequency (%)
33937
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5661
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1263
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1262
100.0%
Math Symbol
ValueCountFrequency (%)
~ 55
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 31
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 88311
53.4%
Common 76973
46.5%
Latin 227
 
0.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7687
 
8.7%
7547
 
8.5%
7541
 
8.5%
7256
 
8.2%
7211
 
8.2%
7161
 
8.1%
7102
 
8.0%
6527
 
7.4%
1999
 
2.3%
1852
 
2.1%
Other values (349) 26428
29.9%
Latin
ValueCountFrequency (%)
B 46
20.3%
e 31
13.7%
A 30
13.2%
K 21
9.3%
P 19
8.4%
I 18
 
7.9%
R 17
 
7.5%
C 13
 
5.7%
D 6
 
2.6%
L 5
 
2.2%
Other values (10) 21
9.3%
Common
ValueCountFrequency (%)
33937
44.1%
1 9072
 
11.8%
- 5661
 
7.4%
2 4114
 
5.3%
3 3124
 
4.1%
9 3113
 
4.0%
0 2962
 
3.8%
5 2806
 
3.6%
4 2612
 
3.4%
6 2571
 
3.3%
Other values (9) 7001
 
9.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 88311
53.4%
ASCII 77060
46.6%
None 139
 
0.1%
Number Forms 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
33937
44.0%
1 9072
 
11.8%
- 5661
 
7.3%
2 4114
 
5.3%
3 3124
 
4.1%
9 3113
 
4.0%
0 2962
 
3.8%
5 2806
 
3.6%
4 2612
 
3.4%
6 2571
 
3.3%
Other values (27) 7088
 
9.2%
Hangul
ValueCountFrequency (%)
7687
 
8.7%
7547
 
8.5%
7541
 
8.5%
7256
 
8.2%
7211
 
8.2%
7161
 
8.1%
7102
 
8.0%
6527
 
7.4%
1999
 
2.3%
1852
 
2.1%
Other values (349) 26428
29.9%
None
ValueCountFrequency (%)
139
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

소재지전화
Text

MISSING 

Distinct3386
Distinct (%)85.9%
Missing3137
Missing (%)44.3%
Memory size55.4 KiB
2023-12-11T08:51:29.796917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.996703
Min length9

Characters and Unicode

Total characters47303
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2950 ?
Unique (%)74.8%

Sample

1st row055-681-9548
2nd row055-687-4095
3rd row055-636-3912
4th row055-687-6161
5th row055-635-3114
ValueCountFrequency (%)
055-630-6789 18
 
0.5%
055-682-3561 10
 
0.3%
055-630-3080 9
 
0.2%
055-680-6113 8
 
0.2%
055-634-3001 8
 
0.2%
055-641-2500 8
 
0.2%
055-733-7202 7
 
0.2%
055-733-7201 6
 
0.2%
055-631-2114 6
 
0.2%
055-730-5000 5
 
0.1%
Other values (3376) 3858
97.8%
2023-12-11T08:51:30.173582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 9884
20.9%
- 7880
16.7%
0 6184
13.1%
6 5619
11.9%
3 4396
9.3%
8 3436
 
7.3%
2 2605
 
5.5%
7 2212
 
4.7%
1 2185
 
4.6%
9 1498
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 39423
83.3%
Dash Punctuation 7880
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 9884
25.1%
0 6184
15.7%
6 5619
14.3%
3 4396
11.2%
8 3436
 
8.7%
2 2605
 
6.6%
7 2212
 
5.6%
1 2185
 
5.5%
9 1498
 
3.8%
4 1404
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 7880
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 47303
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 9884
20.9%
- 7880
16.7%
0 6184
13.1%
6 5619
11.9%
3 4396
9.3%
8 3436
 
7.3%
2 2605
 
5.5%
7 2212
 
4.7%
1 2185
 
4.6%
9 1498
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 47303
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 9884
20.9%
- 7880
16.7%
0 6184
13.1%
6 5619
11.9%
3 4396
9.3%
8 3436
 
7.3%
2 2605
 
5.5%
7 2212
 
4.7%
1 2185
 
4.6%
9 1498
 
3.2%

급수시설
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size55.4 KiB
상수도전용
6255 
<NA>
 
494
지하수전용
 
212
상수도(음용)지하수(주방용)겸용
 
90
간이상수도
 
29

Length

Max length17
Median length5
Mean length5.0827684
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상수도전용
2nd row상수도전용
3rd row상수도전용
4th row상수도전용
5th row상수도전용

Common Values

ValueCountFrequency (%)
상수도전용 6255
88.3%
<NA> 494
 
7.0%
지하수전용 212
 
3.0%
상수도(음용)지하수(주방용)겸용 90
 
1.3%
간이상수도 29
 
0.4%

Length

2023-12-11T08:51:30.309977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:51:30.410840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상수도전용 6255
88.3%
na 494
 
7.0%
지하수전용 212
 
3.0%
상수도(음용)지하수(주방용)겸용 90
 
1.3%
간이상수도 29
 
0.4%
Distinct66
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size55.4 KiB
2023-12-11T08:51:30.593856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length4.6799435
Min length2

Characters and Unicode

Total characters33134
Distinct characters152
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.1%

Sample

1st row한식
2nd row경양식
3rd row한식
4th row분식
5th row경양식
ValueCountFrequency (%)
한식 1309
18.1%
식육(숯불구이 542
 
7.5%
기타 456
 
6.3%
커피숍 452
 
6.2%
호프/통닭 441
 
6.1%
정종/대포집/소주방 354
 
4.9%
즉석판매제조가공업 342
 
4.7%
횟집 320
 
4.4%
분식 260
 
3.6%
룸살롱 232
 
3.2%
Other values (55) 2531
35.0%
2023-12-11T08:51:30.909292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3082
 
9.3%
1309
 
4.0%
1255
 
3.8%
/ 1149
 
3.5%
1096
 
3.3%
1096
 
3.3%
) 875
 
2.6%
( 875
 
2.6%
780
 
2.4%
673
 
2.0%
Other values (142) 20944
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30058
90.7%
Other Punctuation 1167
 
3.5%
Close Punctuation 875
 
2.6%
Open Punctuation 875
 
2.6%
Space Separator 159
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3082
 
10.3%
1309
 
4.4%
1255
 
4.2%
1096
 
3.6%
1096
 
3.6%
780
 
2.6%
673
 
2.2%
669
 
2.2%
633
 
2.1%
554
 
1.8%
Other values (137) 18911
62.9%
Other Punctuation
ValueCountFrequency (%)
/ 1149
98.5%
, 18
 
1.5%
Close Punctuation
ValueCountFrequency (%)
) 875
100.0%
Open Punctuation
ValueCountFrequency (%)
( 875
100.0%
Space Separator
ValueCountFrequency (%)
159
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30058
90.7%
Common 3076
 
9.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3082
 
10.3%
1309
 
4.4%
1255
 
4.2%
1096
 
3.6%
1096
 
3.6%
780
 
2.6%
673
 
2.2%
669
 
2.2%
633
 
2.1%
554
 
1.8%
Other values (137) 18911
62.9%
Common
ValueCountFrequency (%)
/ 1149
37.4%
) 875
28.4%
( 875
28.4%
159
 
5.2%
, 18
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30058
90.7%
ASCII 3076
 
9.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3082
 
10.3%
1309
 
4.4%
1255
 
4.2%
1096
 
3.6%
1096
 
3.6%
780
 
2.6%
673
 
2.2%
669
 
2.2%
633
 
2.1%
554
 
1.8%
Other values (137) 18911
62.9%
ASCII
ValueCountFrequency (%)
/ 1149
37.4%
) 875
28.4%
( 875
28.4%
159
 
5.2%
, 18
 
0.6%

위도
Real number (ℝ)

Distinct3947
Distinct (%)55.8%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean34.883797
Minimum34.708696
Maximum35.258754
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size62.4 KiB
2023-12-11T08:51:31.029844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum34.708696
5-th percentile34.827785
Q134.868424
median34.888838
Q334.893545
95-th percentile34.956349
Maximum35.258754
Range0.55005859
Interquartile range (IQR)0.02512174

Descriptive statistics

Standard deviation0.039212652
Coefficient of variation (CV)0.0011240935
Kurtosis12.874678
Mean34.883797
Median Absolute Deviation (MAD)0.00829351
Skewness0.75359266
Sum246907.52
Variance0.001537632
MonotonicityNot monotonic
2023-12-11T08:51:31.165050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
34.89713105 64
 
0.9%
34.8910509 58
 
0.8%
34.86858834 50
 
0.7%
34.88166837 40
 
0.6%
35.0080169 29
 
0.4%
34.84368572 23
 
0.3%
34.88625409 20
 
0.3%
34.87426144 19
 
0.3%
34.86691347 17
 
0.2%
34.89699685 17
 
0.2%
Other values (3937) 6741
95.2%
ValueCountFrequency (%)
34.70869586 1
< 0.1%
34.71458607 1
< 0.1%
34.71474423 1
< 0.1%
34.7152561 1
< 0.1%
34.71994371 1
< 0.1%
34.72082431 2
< 0.1%
34.72097717 1
< 0.1%
34.7215224 2
< 0.1%
34.72236245 2
< 0.1%
34.72550735 1
< 0.1%
ValueCountFrequency (%)
35.25875445 8
0.1%
35.030926 1
 
< 0.1%
35.03044495 1
 
< 0.1%
35.03011881 1
 
< 0.1%
35.02940595 1
 
< 0.1%
35.02644172 1
 
< 0.1%
35.0226368 1
 
< 0.1%
35.02166258 2
 
< 0.1%
35.01753311 1
 
< 0.1%
35.01649319 4
0.1%

경도
Real number (ℝ)

Distinct3936
Distinct (%)55.6%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean128.65079
Minimum128.29866
Maximum128.74839
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size62.4 KiB
2023-12-11T08:51:31.302688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum128.29866
5-th percentile128.57242
Q1128.62258
median128.63981
Q3128.69287
95-th percentile128.72667
Maximum128.74839
Range0.4497261
Interquartile range (IQR)0.0702981

Descriptive statistics

Standard deviation0.051525331
Coefficient of variation (CV)0.00040050534
Kurtosis2.7809506
Mean128.65079
Median Absolute Deviation (MAD)0.03708955
Skewness-0.88577556
Sum910590.32
Variance0.0026548597
MonotonicityNot monotonic
2023-12-11T08:51:31.447009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.6102142 64
 
0.9%
128.6164053 58
 
0.8%
128.6881305 50
 
0.7%
128.7253131 40
 
0.6%
128.7114627 29
 
0.4%
128.7030011 23
 
0.3%
128.6236554 20
 
0.3%
128.6329608 19
 
0.3%
128.6276757 17
 
0.2%
128.6501964 17
 
0.2%
Other values (3926) 6741
95.2%
ValueCountFrequency (%)
128.2986614 8
0.1%
128.4727813 1
 
< 0.1%
128.4728484 1
 
< 0.1%
128.4742779 1
 
< 0.1%
128.4746722 1
 
< 0.1%
128.4758429 1
 
< 0.1%
128.4761788 1
 
< 0.1%
128.4763796 1
 
< 0.1%
128.4768179 1
 
< 0.1%
128.4768956 2
 
< 0.1%
ValueCountFrequency (%)
128.7483875 1
 
< 0.1%
128.7402924 1
 
< 0.1%
128.7374552 1
 
< 0.1%
128.7373868 6
0.1%
128.7371217 1
 
< 0.1%
128.7370418 2
 
< 0.1%
128.7370211 1
 
< 0.1%
128.7369898 1
 
< 0.1%
128.7369813 1
 
< 0.1%
128.7369583 1
 
< 0.1%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size55.4 KiB
Minimum2022-08-30 00:00:00
Maximum2022-08-30 00:00:00
2023-12-11T08:51:31.558570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:51:31.638778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-11T08:51:25.622607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:51:25.412295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:51:25.713933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:51:25.521199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:51:31.699761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명급수시설업태명위도경도
업종명1.0000.1821.0000.1490.197
급수시설0.1821.0000.2710.0280.058
업태명1.0000.2711.0000.2060.232
위도0.1490.0280.2061.0000.313
경도0.1970.0580.2320.3131.000
2023-12-11T08:51:31.785162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명급수시설
업종명1.0000.086
급수시설0.0861.000
2023-12-11T08:51:31.861226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
위도경도업종명급수시설
위도1.000-0.1310.0670.019
경도-0.1311.0000.0830.038
업종명0.0670.0831.0000.086
급수시설0.0190.0380.0861.000

Missing values

2023-12-11T08:51:25.833858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:51:26.005383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T08:51:26.141144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업종명업소명인허가일자영업자시작일소재지(도로명)소재지(지번)소재지전화급수시설업태명위도경도기준일자
0일반음식점성영식당1982-08-171982-08-17경상남도 거제시 장승포로 49 (장승포동,132.133.134호)경상남도 거제시 장승포동 283-99 132.133.134호055-681-9548상수도전용한식34.867724128.7295442022-08-30
1일반음식점맥심경양식1984-03-081984-03-08<NA>경상남도 거제시 옥포동 322055-687-4095상수도전용경양식34.889573128.6884042022-08-30
2일반음식점인덕식당1987-06-111987-06-11경상남도 거제시 하청면 하청로 17경상남도 거제시 하청면 하청리 658-4055-636-3912상수도전용한식34.955991128.6544372022-08-30
3일반음식점털보네분식1988-11-301988-11-30경상남도 거제시 서문로 80 (고현동,중앙빌딩 104호)경상남도 거제시 고현동 961-178 중앙빌딩 104호<NA>상수도전용분식34.88999128.6246252022-08-30
4일반음식점도밍고레스토랑1990-12-181990-12-18경상남도 거제시 옥포로 186 (옥포동)경상남도 거제시 옥포동 544-4055-687-6161상수도전용경양식34.891519128.6917142022-08-30
5일반음식점우성가든1992-07-041992-07-04경상남도 거제시 문동폭포길 12 (문동동)경상남도 거제시 문동동 140-2055-635-3114상수도전용한식34.85509128.6511422022-08-30
6일반음식점제일식당1992-07-271992-07-27경상남도 거제시 남부면 해금강3길 7-9경상남도 거제시 남부면 갈곶리 65055-632-3539상수도전용횟집34.737434128.6750052022-08-30
7일반음식점은하수횟집1993-06-301993-06-30경상남도 거제시 남부면 근포1길 68경상남도 거제시 남부면 저구리 477-1055-633-1438상수도전용횟집34.720824128.5873172022-08-30
8일반음식점은하수횟집1993-06-301993-06-30경상남도 거제시 남부면 근포1길 68경상남도 거제시 남부면 저구리 477-1055-633-1438상수도전용횟집34.720824128.5873172022-08-30
9일반음식점농장삼계탕1995-03-171995-03-17경상남도 거제시 장평1로1길 23 (장평동)경상남도 거제시 장평동 31-4055-633-6332상수도전용탕류(보신용)34.891663128.6147692022-08-30
업종명업소명인허가일자영업자시작일소재지(도로명)소재지(지번)소재지전화급수시설업태명위도경도기준일자
7070즉석판매제조가공업크크밥2022-08-172022-08-17경상남도 거제시 거제중앙로15길 5,1층 (고현동)경상남도 거제시 고현동 154-1<NA>상수도전용즉석판매제조가공업34.889568128.6243592022-08-30
7071식품소분업롯데쇼핑(주)롯데마트거제점2015-02-112022-04-29경상남도 거제시 서간도길 9-9,상가동 지하1층 (옥포동,거제엘크루랜드마크)경상남도 거제시 옥포동 241-1 외17필지 거제엘크루랜드마크 지하1층 상가동055-641-2500상수도전용식품소분업34.869095128.6279232022-08-30
7072식품소분업이땀당2022-02-252022-02-25경상남도 거제시 고현로11길 24,삼덕빌딩 4층 402호 (고현동)경상남도 거제시 고현동 961-55 삼덕빌딩<NA>상수도전용식품소분업34.912716128.6554612022-08-30
7073식품자동판매기영업와와블럭카페2022-06-302022-06-30경상남도 거제시 상동7길 30-1,대동다숲아파트 상가동 3층 301호 (상동동)경상남도 거제시 상동동 735 대동다숲아파트<NA>상수도전용식품자동판매기영업34.890993128.6398072022-08-30
7074유통전문판매업연초농협2022-01-272022-01-27경상남도 거제시 연초면 거제북로 24-10,1층경상남도 거제시 연초면 죽토리 340-3055-636-5551상수도전용유통전문판매업34.874261128.6329612022-08-30
7075제과점영업정항우케익2022-03-042022-03-04경상남도 거제시 수양로 473,신현농협하나로마트 가동 1층 (양정동)경상남도 거제시 양정동 962-2 신현농협하나로마트<NA>상수도전용제과점영업34.911874128.5411062022-08-30
7076건강기능식품일반판매업거제축산업협동조합2021-07-302021-07-30경상남도 거제시 거제중앙로 1726,거제농수산물 종합유통센터 지하1층 (상동동)경상남도 거제시 상동동 590-1055-639-1275<NA>영업장판매34.896369128.6340942022-08-30
7077건강기능식품일반판매업온통로2022-02-242022-02-24경상남도 거제시 사등면 금포2길 31,1층경상남도 거제시 사등면 사등리 2010-1031-366-9274<NA>전자상거래(통신판매업)34.866519128.6379762022-08-30
7078건강기능식품일반판매업지니상점2022-05-042022-05-04경상남도 거제시 중곡로 42,119동 2층 206호 (고현동,덕산베스트타운)경상남도 거제시 고현동 1039 덕산베스트타운 119동 206호<NA><NA>전자상거래(통신판매업)<NA><NA>2022-08-30
7079건강기능식품일반판매업이담2022-06-232022-06-23경상남도 거제시 상동3길 15,105동 22층 2201호 (상동동,힐스테이트 거제)경상남도 거제시 상동동 1080 힐스테이트 거제<NA><NA>전자상거래(통신판매업)<NA><NA>2022-08-30

Duplicate rows

Most frequently occurring

업종명업소명인허가일자영업자시작일소재지(도로명)소재지(지번)소재지전화급수시설업태명위도경도기준일자# duplicates
0건강기능식품일반판매업홈플러스 거제점이자녹스2011-07-082011-07-08경상남도 거제시 장평로 12 (장평동,홈플러스 이자녹스화장품)경상남도 거제시 장평동 59 홈플러스 이자녹스화장품055-680-0966<NA>영업장판매34.891051128.6164052022-08-302
1식품소분업봉순이식품2014-03-202014-03-20경상남도 거제시 사등면 거제대로 6244,1층경상남도 거제시 사등면 오량리 951-12 (1층)055-635-9087간이상수도식품소분업34.887484128.4869232022-08-302
2식품자동판매기영업친구낚시편의점2009-04-292009-04-29경상남도 거제시 사등면 성포로3길 1 (친구낚시편의점 앞)경상남도 거제시 사등면 성포리 351-15 친구낚시편의점 앞055-633-9997상수도전용식품자동판매기영업34.921139128.5231082022-08-302
3위탁급식영업삼성웰스토리(주)삼성중공업A식당2008-03-252021-01-14경상남도 거제시 장평3로 80 (장평동)경상남도 거제시 장평동 530055-630-6789상수도전용위탁급식영업34.897131128.6102142022-08-302
4위탁급식영업삼성웰스토리(주)삼성중공업E식당2008-01-022021-01-14경상남도 거제시 장평3로 80 (장평동)경상남도 거제시 장평동 530055-630-6789상수도전용위탁급식영업34.897131128.6102142022-08-302
5위탁급식영업삼성웰스토리(주)삼성중공업피솔식당2008-06-202021-01-14경상남도 거제시 장평3로 80 (장평동)경상남도 거제시 장평동 530055-630-6789상수도전용위탁급식영업34.897131128.6102142022-08-302
6유흥주점영업꿈의궁전1985-03-252015-06-18경상남도 거제시 옥수로10길 35 (능포동,38호(2층))경상남도 거제시 능포동 655-7 38호(2층)055-682-2951상수도전용룸살롱34.876232128.7313472022-08-302
7유흥주점영업베테랑1985-10-152009-12-01경상남도 거제시 옥포대첩로4길 35 (옥포동,1호)경상남도 거제시 옥포동 355-7 1호055-687-0223상수도전용룸살롱34.890647128.6928552022-08-302
8유흥주점영업이프로1985-12-182012-06-14경상남도 거제시 옥포대첩로4길 22 (옥포동)경상남도 거제시 옥포동 544-15055-687-2178상수도전용룸살롱34.89201128.6926592022-08-302
9유흥주점영업제일노래주점2001-10-042010-12-22경상남도 거제시 장평로8길 15 (장평동,(지층))경상남도 거제시 장평동 370-3 (지층)055-637-4405상수도(음용)지하수(주방용)겸용룸살롱34.891991128.6106142022-08-302