Overview

Dataset statistics

Number of variables6
Number of observations1187
Missing cells502
Missing cells (%)7.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory56.9 KiB
Average record size in memory49.1 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description경상남도 제과점 현황에 관한 데이터로, 인허가관할기관, 업소명, 업종, 업태, 업소주소에 관한 정보를 제공합니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15069263

Alerts

업종 has constant value ""Constant
연번 is highly overall correlated with 인허가관할기관High correlation
인허가관할기관 is highly overall correlated with 연번High correlation
업소전화번호 has 502 (42.3%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 23:59:53.037968
Analysis finished2023-12-10 23:59:53.834354
Duration0.8 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1187
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean594
Minimum1
Maximum1187
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.6 KiB
2023-12-11T08:59:53.904675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile60.3
Q1297.5
median594
Q3890.5
95-th percentile1127.7
Maximum1187
Range1186
Interquartile range (IQR)593

Descriptive statistics

Standard deviation342.80169
Coefficient of variation (CV)0.57710723
Kurtosis-1.2
Mean594
Median Absolute Deviation (MAD)297
Skewness0
Sum705078
Variance117513
MonotonicityNot monotonic
2023-12-11T08:59:54.041112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
799 1
 
0.1%
797 1
 
0.1%
796 1
 
0.1%
795 1
 
0.1%
794 1
 
0.1%
793 1
 
0.1%
792 1
 
0.1%
791 1
 
0.1%
790 1
 
0.1%
Other values (1177) 1177
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1187 1
0.1%
1186 1
0.1%
1185 1
0.1%
1184 1
0.1%
1183 1
0.1%
1182 1
0.1%
1181 1
0.1%
1180 1
0.1%
1179 1
0.1%
1178 1
0.1%

인허가관할기관
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
김해시
189 
진주시
154 
양산시
114 
창원시 성산구
92 
창원시 의창구
88 
Other values (17)
550 

Length

Max length10
Median length4
Mean length5.474305
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row 거제시
2nd row 거제시
3rd row 거제시
4th row 거제시
5th row 거제시

Common Values

ValueCountFrequency (%)
김해시 189
15.9%
진주시 154
13.0%
양산시 114
9.6%
창원시 성산구 92
7.8%
창원시 의창구 88
7.4%
거제시 83
 
7.0%
창원시 진해구 82
 
6.9%
창원시 마산회원구 61
 
5.1%
창원시 마산합포구 56
 
4.7%
통영시 50
 
4.2%
Other values (12) 218
18.4%

Length

2023-12-11T08:59:54.166193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
창원시 379
24.2%
김해시 189
12.1%
진주시 154
9.8%
양산시 114
 
7.3%
성산구 92
 
5.9%
의창구 88
 
5.6%
거제시 83
 
5.3%
진해구 82
 
5.2%
마산회원구 61
 
3.9%
마산합포구 56
 
3.6%
Other values (13) 268
17.1%
Distinct1072
Distinct (%)90.3%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-11T08:59:54.401651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length27
Mean length7.8963774
Min length1

Characters and Unicode

Total characters9373
Distinct characters549
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1020 ?
Unique (%)85.9%

Sample

1st row(주)외도보타니아제과점
2nd row경성빵집
3rd row고려당
4th row그녀의케익
5th row꿈꾸는쉐프
ValueCountFrequency (%)
파리바게뜨 60
 
4.0%
뚜레쥬르 38
 
2.5%
파리바게트 29
 
1.9%
베이커리 27
 
1.8%
탑스베이커리 17
 
1.1%
몽블랑제 12
 
0.8%
탑베이커리 10
 
0.7%
하나로베이커리 10
 
0.7%
빵굽는마을 7
 
0.5%
김해점 6
 
0.4%
Other values (1150) 1296
85.7%
2023-12-11T08:59:54.797642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
549
 
5.9%
458
 
4.9%
326
 
3.5%
293
 
3.1%
213
 
2.3%
212
 
2.3%
205
 
2.2%
195
 
2.1%
192
 
2.0%
161
 
1.7%
Other values (539) 6569
70.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8302
88.6%
Space Separator 326
 
3.5%
Lowercase Letter 222
 
2.4%
Uppercase Letter 156
 
1.7%
Close Punctuation 132
 
1.4%
Open Punctuation 132
 
1.4%
Decimal Number 81
 
0.9%
Other Punctuation 18
 
0.2%
Dash Punctuation 3
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
549
 
6.6%
458
 
5.5%
293
 
3.5%
213
 
2.6%
212
 
2.6%
205
 
2.5%
195
 
2.3%
192
 
2.3%
161
 
1.9%
160
 
1.9%
Other values (474) 5664
68.2%
Lowercase Letter
ValueCountFrequency (%)
e 39
17.6%
a 31
14.0%
r 19
 
8.6%
o 16
 
7.2%
n 14
 
6.3%
d 12
 
5.4%
l 11
 
5.0%
i 9
 
4.1%
c 9
 
4.1%
k 8
 
3.6%
Other values (13) 54
24.3%
Uppercase Letter
ValueCountFrequency (%)
A 22
14.1%
B 20
12.8%
D 10
 
6.4%
S 10
 
6.4%
C 9
 
5.8%
E 9
 
5.8%
R 9
 
5.8%
O 8
 
5.1%
M 8
 
5.1%
N 7
 
4.5%
Other values (12) 44
28.2%
Decimal Number
ValueCountFrequency (%)
2 20
24.7%
1 18
22.2%
0 10
12.3%
9 9
11.1%
3 5
 
6.2%
5 5
 
6.2%
4 4
 
4.9%
6 4
 
4.9%
7 4
 
4.9%
8 2
 
2.5%
Other Punctuation
ValueCountFrequency (%)
& 7
38.9%
. 6
33.3%
, 2
 
11.1%
# 2
 
11.1%
' 1
 
5.6%
Space Separator
ValueCountFrequency (%)
326
100.0%
Close Punctuation
ValueCountFrequency (%)
) 132
100.0%
Open Punctuation
ValueCountFrequency (%)
( 132
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8302
88.6%
Common 692
 
7.4%
Latin 379
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
549
 
6.6%
458
 
5.5%
293
 
3.5%
213
 
2.6%
212
 
2.6%
205
 
2.5%
195
 
2.3%
192
 
2.3%
161
 
1.9%
160
 
1.9%
Other values (474) 5664
68.2%
Latin
ValueCountFrequency (%)
e 39
 
10.3%
a 31
 
8.2%
A 22
 
5.8%
B 20
 
5.3%
r 19
 
5.0%
o 16
 
4.2%
n 14
 
3.7%
d 12
 
3.2%
l 11
 
2.9%
D 10
 
2.6%
Other values (36) 185
48.8%
Common
ValueCountFrequency (%)
326
47.1%
) 132
19.1%
( 132
19.1%
2 20
 
2.9%
1 18
 
2.6%
0 10
 
1.4%
9 9
 
1.3%
& 7
 
1.0%
. 6
 
0.9%
3 5
 
0.7%
Other values (9) 27
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8302
88.6%
ASCII 1070
 
11.4%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
549
 
6.6%
458
 
5.5%
293
 
3.5%
213
 
2.6%
212
 
2.6%
205
 
2.5%
195
 
2.3%
192
 
2.3%
161
 
1.9%
160
 
1.9%
Other values (474) 5664
68.2%
ASCII
ValueCountFrequency (%)
326
30.5%
) 132
12.3%
( 132
12.3%
e 39
 
3.6%
a 31
 
2.9%
A 22
 
2.1%
B 20
 
1.9%
2 20
 
1.9%
r 19
 
1.8%
1 18
 
1.7%
Other values (54) 311
29.1%
Number Forms
ValueCountFrequency (%)
1
100.0%

업종
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
제과점영업
1187 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제과점영업
2nd row제과점영업
3rd row제과점영업
4th row제과점영업
5th row제과점영업

Common Values

ValueCountFrequency (%)
제과점영업 1187
100.0%

Length

2023-12-11T08:59:54.919955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:59:55.005616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제과점영업 1187
100.0%
Distinct1181
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size9.4 KiB
2023-12-11T08:59:55.263775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length50
Mean length30.973041
Min length18

Characters and Unicode

Total characters36765
Distinct characters407
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1175 ?
Unique (%)99.0%

Sample

1st row경상남도 거제시 일운면 외도길 17(2층)
2nd row경상남도 거제시 능포로 136-1(1층 능포동)
3rd row경상남도 거제시 거제면 읍내로2길 36-1
4th row경상남도 거제시 장평로6길 25(106동 106호 장평동, 대한아파트)
5th row경상남도 거제시 사등면 두동로 16(지1층)
ValueCountFrequency (%)
경상남도 1187
 
17.4%
창원시 379
 
5.6%
김해시 189
 
2.8%
진주시 154
 
2.3%
1층 120
 
1.8%
양산시 114
 
1.7%
성산구 92
 
1.3%
의창구 88
 
1.3%
거제시 83
 
1.2%
진해구 82
 
1.2%
Other values (2339) 4329
63.5%
2023-12-11T08:59:55.754416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5643
 
15.3%
1 2153
 
5.9%
1413
 
3.8%
1363
 
3.7%
1244
 
3.4%
1220
 
3.3%
1199
 
3.3%
1091
 
3.0%
1049
 
2.9%
( 1030
 
2.8%
Other values (397) 19360
52.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22197
60.4%
Decimal Number 6065
 
16.5%
Space Separator 5643
 
15.3%
Open Punctuation 1033
 
2.8%
Close Punctuation 1033
 
2.8%
Other Punctuation 509
 
1.4%
Dash Punctuation 215
 
0.6%
Uppercase Letter 61
 
0.2%
Math Symbol 7
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1413
 
6.4%
1363
 
6.1%
1244
 
5.6%
1220
 
5.5%
1199
 
5.4%
1091
 
4.9%
1049
 
4.7%
640
 
2.9%
592
 
2.7%
573
 
2.6%
Other values (358) 11813
53.2%
Uppercase Letter
ValueCountFrequency (%)
A 19
31.1%
S 7
 
11.5%
B 7
 
11.5%
D 5
 
8.2%
K 5
 
8.2%
G 4
 
6.6%
E 3
 
4.9%
C 3
 
4.9%
Y 2
 
3.3%
J 2
 
3.3%
Other values (4) 4
 
6.6%
Decimal Number
ValueCountFrequency (%)
1 2153
35.5%
2 736
 
12.1%
0 641
 
10.6%
3 552
 
9.1%
4 400
 
6.6%
5 392
 
6.5%
6 341
 
5.6%
7 316
 
5.2%
8 288
 
4.7%
9 246
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 485
95.3%
· 15
 
2.9%
* 3
 
0.6%
@ 3
 
0.6%
. 2
 
0.4%
& 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 1030
99.7%
[ 3
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 1030
99.7%
] 3
 
0.3%
Space Separator
ValueCountFrequency (%)
5643
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 215
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22197
60.4%
Common 14506
39.5%
Latin 62
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1413
 
6.4%
1363
 
6.1%
1244
 
5.6%
1220
 
5.5%
1199
 
5.4%
1091
 
4.9%
1049
 
4.7%
640
 
2.9%
592
 
2.7%
573
 
2.6%
Other values (358) 11813
53.2%
Common
ValueCountFrequency (%)
5643
38.9%
1 2153
 
14.8%
( 1030
 
7.1%
) 1030
 
7.1%
2 736
 
5.1%
0 641
 
4.4%
3 552
 
3.8%
, 485
 
3.3%
4 400
 
2.8%
5 392
 
2.7%
Other values (14) 1444
 
10.0%
Latin
ValueCountFrequency (%)
A 19
30.6%
S 7
 
11.3%
B 7
 
11.3%
D 5
 
8.1%
K 5
 
8.1%
G 4
 
6.5%
E 3
 
4.8%
C 3
 
4.8%
Y 2
 
3.2%
J 2
 
3.2%
Other values (5) 5
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22197
60.4%
ASCII 14553
39.6%
None 15
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5643
38.8%
1 2153
 
14.8%
( 1030
 
7.1%
) 1030
 
7.1%
2 736
 
5.1%
0 641
 
4.4%
3 552
 
3.8%
, 485
 
3.3%
4 400
 
2.7%
5 392
 
2.7%
Other values (28) 1491
 
10.2%
Hangul
ValueCountFrequency (%)
1413
 
6.4%
1363
 
6.1%
1244
 
5.6%
1220
 
5.5%
1199
 
5.4%
1091
 
4.9%
1049
 
4.7%
640
 
2.9%
592
 
2.7%
573
 
2.6%
Other values (358) 11813
53.2%
None
ValueCountFrequency (%)
· 15
100.0%

업소전화번호
Text

MISSING 

Distinct675
Distinct (%)98.5%
Missing502
Missing (%)42.3%
Memory size9.4 KiB
2023-12-11T08:59:56.100097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.294891
Min length7

Characters and Unicode

Total characters7737
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique665 ?
Unique (%)97.1%

Sample

1st row055 681 4541
2nd row055 6882628
3rd row055 634 3062
4th row055 637 1572
5th row055 682 4995
ValueCountFrequency (%)
055 579
34.0%
747 9
 
0.5%
070 8
 
0.5%
051 7
 
0.4%
761 7
 
0.4%
251 6
 
0.4%
649 6
 
0.4%
645 6
 
0.4%
682 6
 
0.4%
312 6
 
0.4%
Other values (814) 1064
62.4%
2023-12-11T08:59:56.603784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 1750
22.6%
0 1111
14.4%
1027
13.3%
3 664
 
8.6%
2 629
 
8.1%
8 545
 
7.0%
6 471
 
6.1%
7 445
 
5.8%
4 432
 
5.6%
1 356
 
4.6%
Other values (2) 307
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6698
86.6%
Space Separator 1027
 
13.3%
Dash Punctuation 12
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 1750
26.1%
0 1111
16.6%
3 664
 
9.9%
2 629
 
9.4%
8 545
 
8.1%
6 471
 
7.0%
7 445
 
6.6%
4 432
 
6.4%
1 356
 
5.3%
9 295
 
4.4%
Space Separator
ValueCountFrequency (%)
1027
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7737
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 1750
22.6%
0 1111
14.4%
1027
13.3%
3 664
 
8.6%
2 629
 
8.1%
8 545
 
7.0%
6 471
 
6.1%
7 445
 
5.8%
4 432
 
5.6%
1 356
 
4.6%
Other values (2) 307
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7737
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 1750
22.6%
0 1111
14.4%
1027
13.3%
3 664
 
8.6%
2 629
 
8.1%
8 545
 
7.0%
6 471
 
6.1%
7 445
 
5.8%
4 432
 
5.6%
1 356
 
4.6%
Other values (2) 307
 
4.0%

Interactions

2023-12-11T08:59:53.560959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:59:56.703684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번인허가관할기관
연번1.0000.969
인허가관할기관0.9691.000
2023-12-11T08:59:56.782278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번인허가관할기관
연번1.0000.830
인허가관할기관0.8301.000

Missing values

2023-12-11T08:59:53.681847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:59:53.795101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번인허가관할기관업소명업종업소주소업소전화번호
01거제시(주)외도보타니아제과점제과점영업경상남도 거제시 일운면 외도길 17(2층)055 681 4541
12거제시경성빵집제과점영업경상남도 거제시 능포로 136-1(1층 능포동)<NA>
23거제시고려당제과점영업경상남도 거제시 거제면 읍내로2길 36-1055 6882628
34거제시그녀의케익제과점영업경상남도 거제시 장평로6길 25(106동 106호 장평동, 대한아파트)<NA>
45거제시꿈꾸는쉐프제과점영업경상남도 거제시 사등면 두동로 16(지1층)055 634 3062
56거제시농협베이커리제과점영업경상남도 거제시 장평로 65(1층 장평동)055 637 1572
67거제시달인 아주점제과점영업경상남도 거제시 아주1로4길 31(2동 1층 아주동, 하나로마트아주점)055 682 4995
78거제시델리카페피솔제과점영업경상남도 거제시 장평3로 80(11층 장평동, 피솔복합관)055 630 6011
89거제시디맥스베이커리제과점영업경상남도 거제시 옥포중앙로 43(옥포동,1층)055 6880020
910거제시뚜레쥬르 거제사등점제과점영업경상남도 거제시 사등면 두동로 65(2층)055 636 0264
연번인허가관할기관업소명업종업소주소업소전화번호
11771178합천군갓구운 농협베이커리제과점영업경상남도 합천군 합천읍 대야로 904(합천중부농협 하나로마트 내)<NA>
11781179합천군뚜레쥬르 경남합천점제과점영업경상남도 합천군 합천읍 옥산로 126055 931 0000
11791180합천군빠띠셰베이커리제과점영업경상남도 합천군 합천읍 충효로 39<NA>
11801181합천군빵굽는 마을제과점영업경상남도 합천군 합천읍 동서로 70-15509312446
11811182합천군오세요 수제디저트제과점영업경상남도 합천군 합천읍 동서로 113(범한빌딩 101호)<NA>
11821183합천군왕후쉼터제과점영업경상남도 합천군 합천읍 충효로 91<NA>
11831184합천군파리바게트 경남합천점제과점영업경상남도 합천군 합천읍 동서로 90-1055 931 8211
11841185합천군풀베이커리제과점영업경상남도 합천군 초계면 초계중앙로 118<NA>
11851186합천군합천제과점제과점영업경상남도 합천군 합천읍 충효로 77(1층)5509330151
11861187합천군호두명가제과점영업경상남도 합천군 합천읍 충효로 63055 982 8585