Overview

Dataset statistics

Number of variables: 6
Number of observations: 1242
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 59.6 KiB
Average record size in memory: 49.1 B
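
As a quick sanity check on the figures above, the average record size should equal the total memory footprint divided by the number of observations. The sketch below assumes 1 KiB = 1024 bytes; the function name is illustrative and is not part of ydata-profiling.

```python
def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Average bytes per record, assuming 1 KiB = 1024 bytes."""
    return total_kib * 1024 / n_rows


# Figures copied from the dataset statistics above.
avg_record_size = average_record_size_bytes(total_kib=59.6, n_rows=1242)
print(f"{avg_record_size:.1f} B")
```

The result agrees with the reported 49.1 B to one decimal place.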

Variable types

Numeric: 1
Categorical: 3
Text: 2

Dataset

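Description: Data on the status of bakeries in Gyeongsangnam-do, providing the licensing authority (인허가관할기관), business name (업소명), business category (업종), business type (업태), and business address (업소주소).
Author: Gyeongsangnam-do (경상남도)
URL: https://www.data.go.kr/data/15069263/fileData.do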

Alerts

업종 has constant value "제과점영업" (Constant)
연번 is highly overall correlated with 인허가관할기관 (High correlation)
인허가관할기관 is highly overall correlated with 연번 (High correlation)
업태 is highly imbalanced (99.1%) (Imbalance)
연번 has unique values (Unique)
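
The 99.1% imbalance figure for 업태 can be reproduced from the class counts reported later in this report (1241 and 1). The sketch below assumes the score is one minus the Shannon entropy normalized by log2 of the number of classes; the report itself does not state the formula, so treat this as one plausible reading that happens to match the number. The function name is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for class counts (assumed definition).

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    k = len(counts)
    if k < 2:
        raise ValueError("need at least two classes")
    total = sum(counts)
    # Shannon entropy in bits, skipping empty classes.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# 업태 counts from this report: 제과점영업 = 1241, 푸드트럭 = 1.
score = imbalance_score([1241, 1])
print(f"{score:.1%}")
```

A perfectly balanced two-class column would score 0 under this definition, which is why a single 푸드트럭 row among 1242 is enough to trigger the alert.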

Reproduction

Analysis started: 2023-12-12 22:26:14.915334
Analysis finished: 2023-12-12 22:26:15.728274
Duration: 0.81 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 1242
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 621.5
Minimum: 1
Maximum: 1242
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 11.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 63.05
Q1: 311.25
Median: 621.5
Q3: 931.75
95-th percentile: 1179.95
Maximum: 1242
Range: 1241
Interquartile range (IQR): 620.5
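
These quantiles are what one would expect if 연번 is simply the sequence 1, 2, ..., 1242, which the distinct count, minimum, and maximum suggest. The sketch below makes that assumption explicit and uses linear interpolation between the smallest and largest values (`method="inclusive"` in Python's `statistics` module); the interpolation rule used by the profiling tool is not stated in the report.

```python
import statistics

# Assumption: 연번 is exactly the row counter 1..1242.
values = range(1, 1243)

# Quartiles with linear interpolation between min and max.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

# 5th and 95th percentiles are the first and last of the 19 cut points for n=20.
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(values) - min(values)

print(p5, q1, median, q3, p95, iqr, value_range)
```

Under that assumption every value matches the table above, including the IQR of 620.5 and the range of 1241.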

Descriptive statistics

Standard deviation: 358.67883
Coefficient of variation (CV): 0.57711798
Kurtosis: -1.2
Mean: 621.5
Median Absolute Deviation (MAD): 310.5
Skewness: 0
Sum: 771903
Variance: 128650.5
Monotonicity: Strictly increasing
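
The descriptive statistics are likewise consistent with a plain row counter. The sketch below assumes 연번 is 1..1242 and that the variance is the sample variance (divisor n - 1); both are assumptions, chosen because they reproduce the reported numbers.

```python
import statistics

# Assumption: 연번 is exactly the row counter 1..1242.
values = list(range(1, 1243))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, divisor n - 1
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation
total = sum(values)

# Median absolute deviation: median of |x - median(x)|.
center = statistics.median(values)
mad = statistics.median(abs(v - center) for v in values)

# Strictly increasing means every value exceeds its predecessor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, variance, round(std, 5), round(cv, 8), total, mad, strictly_increasing)
```

A zero skewness follows from the symmetry of the sequence around its mean; kurtosis is not recomputed here.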
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.1%
827 | 1 | 0.1%
834 | 1 | 0.1%
833 | 1 | 0.1%
832 | 1 | 0.1%
831 | 1 | 0.1%
830 | 1 | 0.1%
829 | 1 | 0.1%
828 | 1 | 0.1%
826 | 1 | 0.1%
Other values (1232) | 1232 | 99.2%

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Value | Count | Frequency (%)
1242 | 1 | 0.1%
1241 | 1 | 0.1%
1240 | 1 | 0.1%
1239 | 1 | 0.1%
1238 | 1 | 0.1%
1237 | 1 | 0.1%
1236 | 1 | 0.1%
1235 | 1 | 0.1%
1234 | 1 | 0.1%
1233 | 1 | 0.1%

인허가관할기관
Categorical

HIGH CORRELATION 

Distinct: 22
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 9.8 KiB

김해시: 198
진주시: 174
양산시: 127
창원시 성산구: 109
거제시: 90
Other values (17): 544

Length

Max length: 9
Median length: 3
Mean length: 4.394525
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 거제시
2nd row: 거제시
3rd row: 거제시
4th row: 거제시
5th row: 거제시

Common Values

Value | Count | Frequency (%)
김해시 | 198 | 15.9%
진주시 | 174 | 14.0%
양산시 | 127 | 10.2%
창원시 성산구 | 109 | 8.8%
거제시 | 90 | 7.2%
창원시 진해구 | 85 | 6.8%
창원시 의창구 | 71 | 5.7%
통영시 | 58 | 4.7%
창원시 마산합포구 | 57 | 4.6%
창원시 마산회원구 | 55 | 4.4%
Other values (12) | 218 | 17.6%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
창원시 | 377 | 23.3%
김해시 | 198 | 12.2%
진주시 | 174 | 10.7%
양산시 | 127 | 7.8%
성산구 | 109 | 6.7%
거제시 | 90 | 5.6%
진해구 | 85 | 5.3%
의창구 | 71 | 4.4%
통영시 | 58 | 3.6%
마산합포구 | 57 | 3.5%
Other values (13) | 273 | 16.9%
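
The table directly above appears to count whitespace-separated words rather than whole values, which would explain why 창원시 shows up with 377 occurrences although no row has 창원시 alone as its value. The sketch below splits the ten most common values from the Common Values table and re-counts them by word. It assumes the twelve values not listed there are single words, which is consistent with the word total but cannot be confirmed from the report alone.

```python
from collections import Counter

# Counts per whole value, copied from the Common Values table of 인허가관할기관.
value_counts = {
    "김해시": 198,
    "진주시": 174,
    "양산시": 127,
    "창원시 성산구": 109,
    "거제시": 90,
    "창원시 진해구": 85,
    "창원시 의창구": 71,
    "통영시": 58,
    "창원시 마산합포구": 57,
    "창원시 마산회원구": 55,
}

# Each row contributes one count to every word in its value.
word_counts = Counter()
for value, count in value_counts.items():
    for word in value.split():
        word_counts[word] += count

# Assumption: the remaining rows are single-word values, so the total
# number of words is one per row plus one extra for each 창원시 district row.
n_rows = 1242
total_words = n_rows + word_counts["창원시"]
share = 100 * word_counts["창원시"] / total_words

print(word_counts["창원시"], total_words, f"{share:.1f}%")
```

The five 창원시 districts sum to 377, and dividing by the resulting word total gives the 23.3% shown in the table.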

업소명
Text

Distinct: 1142
Distinct (%): 91.9%
Missing: 0
Missing (%): 0.0%
Memory size: 9.8 KiB

Length

Max length: 36
Median length: 23
Mean length: 8.0692432
Min length: 1

Characters and Unicode

Total characters: 10022
Distinct characters: 571
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
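
To illustrate what these character properties look like in practice, the sketch below classifies each character of one business name from this dataset using Python's standard `unicodedata` module. The two-letter codes are the Unicode general categories; the long names in the mapping are the labels this report uses for them. Only the general category is shown, since the standard library does not expose script or block.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this example.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
}

# A business name that appears in this report's sample rows.
name = "cafe here(카페..여기)"

# Count characters per Unicode general category.
category_counts = Counter(unicodedata.category(ch) for ch in name)

for code, count in category_counts.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```

Hangul syllables fall under "Other Letter" because Hangul has no upper or lower case, which is why that category dominates the tables that follow.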

Unique

Unique: 1094
Unique (%): 88.1%

Sample

1st row: 외도널서리제과
2nd row: 외도파티쉐리
3rd row: 원베이커리
4th row: 원베이커리
5th row: 장목하나로 베이커리

Value | Count | Frequency (%)
파리바게뜨 | 65 | 4.0%
뚜레쥬르 | 40 | 2.5%
베이커리 | 32 | 2.0%
파리바게트 | 31 | 1.9%
탑스베이커리 | 22 | 1.4%
몽블랑제 | 10 | 0.6%
bakery | 8 | 0.5%
탑베이커리 | 8 | 0.5%
cake | 7 | 0.4%
빵굽는마을 | 7 | 0.4%
Other values (1237) | 1393 | 85.8%

Most occurring characters

ValueCountFrequency (%)
528
 
5.3%
493
 
4.9%
381
 
3.8%
342
 
3.4%
235
 
2.3%
220
 
2.2%
218
 
2.2%
196
 
2.0%
196
 
2.0%
) 163
 
1.6%
Other values (561) 7050
70.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8480 | 84.6%
Lowercase Letter | 403 | 4.0%
Space Separator | 381 | 3.8%
Uppercase Letter | 309 | 3.1%
Close Punctuation | 164 | 1.6%
Open Punctuation | 164 | 1.6%
Decimal Number | 98 | 1.0%
Other Punctuation | 20 | 0.2%
Dash Punctuation | 1 | < 0.1%
Letter Number | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
528
 
6.2%
493
 
5.8%
342
 
4.0%
235
 
2.8%
220
 
2.6%
218
 
2.6%
196
 
2.3%
196
 
2.3%
160
 
1.9%
159
 
1.9%
Other values (491) 5733
67.6%
Lowercase Letter
ValueCountFrequency (%)
e 61
15.1%
a 51
12.7%
o 42
10.4%
r 29
 
7.2%
n 28
 
6.9%
l 24
 
6.0%
k 20
 
5.0%
c 18
 
4.5%
i 16
 
4.0%
h 13
 
3.2%
Other values (14) 101
25.1%
Uppercase Letter
ValueCountFrequency (%)
A 41
13.3%
B 33
 
10.7%
E 30
 
9.7%
R 18
 
5.8%
S 18
 
5.8%
K 18
 
5.8%
D 16
 
5.2%
O 15
 
4.9%
I 13
 
4.2%
N 13
 
4.2%
Other values (13) 94
30.4%
Decimal Number
ValueCountFrequency (%)
2 24
24.5%
1 16
16.3%
9 13
13.3%
0 12
12.2%
5 9
 
9.2%
3 8
 
8.2%
4 5
 
5.1%
8 5
 
5.1%
7 4
 
4.1%
6 2
 
2.0%
Other Punctuation
ValueCountFrequency (%)
& 6
30.0%
. 6
30.0%
' 3
15.0%
, 3
15.0%
# 2
 
10.0%
Close Punctuation
ValueCountFrequency (%)
) 163
99.4%
] 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 163
99.4%
[ 1
 
0.6%
Space Separator
ValueCountFrequency (%)
381
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%
Other Number
ValueCountFrequency (%)
³ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8479 | 84.6%
Common | 829 | 8.3%
Latin | 713 | 7.1%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
528
 
6.2%
493
 
5.8%
342
 
4.0%
235
 
2.8%
220
 
2.6%
218
 
2.6%
196
 
2.3%
196
 
2.3%
160
 
1.9%
159
 
1.9%
Other values (490) 5732
67.6%
Latin
ValueCountFrequency (%)
e 61
 
8.6%
a 51
 
7.2%
o 42
 
5.9%
A 41
 
5.8%
B 33
 
4.6%
E 30
 
4.2%
r 29
 
4.1%
n 28
 
3.9%
l 24
 
3.4%
k 20
 
2.8%
Other values (38) 354
49.6%
Common
ValueCountFrequency (%)
381
46.0%
) 163
19.7%
( 163
19.7%
2 24
 
2.9%
1 16
 
1.9%
9 13
 
1.6%
0 12
 
1.4%
5 9
 
1.1%
3 8
 
1.0%
& 6
 
0.7%
Other values (12) 34
 
4.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8479 | 84.6%
ASCII | 1540 | 15.4%
CJK | 1 | < 0.1%
Number Forms | 1 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
528
 
6.2%
493
 
5.8%
342
 
4.0%
235
 
2.8%
220
 
2.6%
218
 
2.6%
196
 
2.3%
196
 
2.3%
160
 
1.9%
159
 
1.9%
Other values (490) 5732
67.6%
ASCII
ValueCountFrequency (%)
381
24.7%
) 163
 
10.6%
( 163
 
10.6%
e 61
 
4.0%
a 51
 
3.3%
o 42
 
2.7%
A 41
 
2.7%
B 33
 
2.1%
E 30
 
1.9%
r 29
 
1.9%
Other values (58) 546
35.5%
CJK
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
³ 1
100.0%

업종
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 9.8 KiB

제과점영업: 1242

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제과점영업
2nd row: 제과점영업
3rd row: 제과점영업
4th row: 제과점영업
5th row: 제과점영업

Common Values

Value | Count | Frequency (%)
제과점영업 | 1242 | 100.0%

Length

Histogram of lengths of the category


업태
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 9.8 KiB

제과점영업: 1241
푸드트럭: 1

Length

Max length: 5
Median length: 5
Mean length: 4.9991948
Min length: 4
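
The mean length is a count-weighted average of the string lengths of the two values. The sketch below assumes length is measured in Unicode code points, which is what Python's `len` returns for a string; the variable names are illustrative.

```python
# 업태 value counts from this report.
value_counts = {"제과점영업": 1241, "푸드트럭": 1}

n_rows = sum(value_counts.values())

# Weight each value's length (in code points) by how often it occurs.
mean_length = sum(len(value) * count for value, count in value_counts.items()) / n_rows

print(min(map(len, value_counts)), max(map(len, value_counts)), round(mean_length, 7))
```

The single four-character value 푸드트럭 is what pulls the mean just below 5.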

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 제과점영업
2nd row: 제과점영업
3rd row: 제과점영업
4th row: 제과점영업
5th row: 제과점영업

Common Values

Value | Count | Frequency (%)
제과점영업 | 1241 | 99.9%
푸드트럭 | 1 | 0.1%

Length

Histogram of lengths of the category


업소주소
Text

Distinct: 1240
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Memory size: 9.8 KiB

Length

Max length: 64
Median length: 53
Mean length: 31.256039
Min length: 18

Characters and Unicode

Total characters: 38820
Distinct characters: 429
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1238
Unique (%): 99.7%

Sample

1st row: 경상남도 거제시 일운면 구조라로4길 21(주1동 1층)
2nd row: 경상남도 거제시 일운면 구조라로2길 23-3(1~2층)
3rd row: 경상남도 거제시 장승로 48(1층 장승포동)
4th row: 경상남도 거제시 능포로2길 38(상가동 1층 110호 능포동, 옥명대우아파트)
5th row: 경상남도 거제시 장목면 거제북로 1210(장목농협 하나로마트 1동 1층)

Value | Count | Frequency (%)
경상남도 | 1242 | 17.2%
창원시 | 377 | 5.2%
김해시 | 198 | 2.7%
진주시 | 174 | 2.4%
1층 | 151 | 2.1%
양산시 | 127 | 1.8%
성산구 | 96 | 1.3%
거제시 | 90 | 1.2%
진해구 | 85 | 1.2%
의창구 | 84 | 1.2%
Other values (2451) | 4614 | 63.7%

Most occurring characters

ValueCountFrequency (%)
6009
 
15.5%
1 2285
 
5.9%
1476
 
3.8%
1434
 
3.7%
1305
 
3.4%
1272
 
3.3%
1262
 
3.3%
1142
 
2.9%
) 1095
 
2.8%
( 1095
 
2.8%
Other values (419) 20445
52.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 23389 | 60.2%
Decimal Number | 6439 | 16.6%
Space Separator | 6009 | 15.5%
Close Punctuation | 1098 | 2.8%
Open Punctuation | 1098 | 2.8%
Other Punctuation | 468 | 1.2%
Dash Punctuation | 234 | 0.6%
Uppercase Letter | 69 | 0.2%
Lowercase Letter | 8 | < 0.1%
Math Symbol | 7 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1476
 
6.3%
1434
 
6.1%
1305
 
5.6%
1272
 
5.4%
1262
 
5.4%
1142
 
4.9%
1079
 
4.6%
717
 
3.1%
587
 
2.5%
580
 
2.5%
Other values (371) 12535
53.6%
Uppercase Letter
ValueCountFrequency (%)
A 16
23.2%
B 10
14.5%
S 6
 
8.7%
D 6
 
8.7%
G 5
 
7.2%
K 5
 
7.2%
C 4
 
5.8%
O 2
 
2.9%
R 2
 
2.9%
M 2
 
2.9%
Other values (9) 11
15.9%
Decimal Number
ValueCountFrequency (%)
1 2285
35.5%
2 789
 
12.3%
0 684
 
10.6%
3 586
 
9.1%
5 440
 
6.8%
4 434
 
6.7%
6 356
 
5.5%
7 328
 
5.1%
8 291
 
4.5%
9 246
 
3.8%
Lowercase Letter
ValueCountFrequency (%)
e 3
37.5%
a 1
 
12.5%
h 1
 
12.5%
l 1
 
12.5%
u 1
 
12.5%
s 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
, 447
95.5%
· 13
 
2.8%
@ 3
 
0.6%
* 3
 
0.6%
. 2
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 1095
99.7%
] 3
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 1095
99.7%
[ 3
 
0.3%
Space Separator
ValueCountFrequency (%)
6009
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 234
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 23389 | 60.2%
Common | 15354 | 39.6%
Latin | 77 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1476
 
6.3%
1434
 
6.1%
1305
 
5.6%
1272
 
5.4%
1262
 
5.4%
1142
 
4.9%
1079
 
4.6%
717
 
3.1%
587
 
2.5%
580
 
2.5%
Other values (371) 12535
53.6%
Latin
ValueCountFrequency (%)
A 16
20.8%
B 10
13.0%
S 6
 
7.8%
D 6
 
7.8%
G 5
 
6.5%
K 5
 
6.5%
C 4
 
5.2%
e 3
 
3.9%
O 2
 
2.6%
R 2
 
2.6%
Other values (15) 18
23.4%
Common
ValueCountFrequency (%)
6009
39.1%
1 2285
 
14.9%
) 1095
 
7.1%
( 1095
 
7.1%
2 789
 
5.1%
0 684
 
4.5%
3 586
 
3.8%
, 447
 
2.9%
5 440
 
2.9%
4 434
 
2.8%
Other values (13) 1490
 
9.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 23389 | 60.2%
ASCII | 15418 | 39.7%
None | 13 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6009
39.0%
1 2285
 
14.8%
) 1095
 
7.1%
( 1095
 
7.1%
2 789
 
5.1%
0 684
 
4.4%
3 586
 
3.8%
, 447
 
2.9%
5 440
 
2.9%
4 434
 
2.8%
Other values (37) 1554
 
10.1%
Hangul
ValueCountFrequency (%)
1476
 
6.3%
1434
 
6.1%
1305
 
5.6%
1272
 
5.4%
1262
 
5.4%
1142
 
4.9%
1079
 
4.6%
717
 
3.1%
587
 
2.5%
580
 
2.5%
Other values (371) 12535
53.6%
None
ValueCountFrequency (%)
· 13
100.0%

Interactions

[Figure: interactions plot]

Correlations

 | 연번 | 인허가관할기관 | 업태
연번 | 1.000 | 0.971 | 0.006
인허가관할기관 | 0.971 | 1.000 | 0.143
업태 | 0.006 | 0.143 | 1.000

 | 인허가관할기관 | 업태
인허가관할기관 | 1.000 | 0.112
업태 | 0.112 | 1.000

 | 연번 | 인허가관할기관 | 업태
연번 | 1.000 | 0.838 | 0.004
인허가관할기관 | 0.838 | 1.000 | 0.112
업태 | 0.004 | 0.112 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

 | 연번 | 인허가관할기관 | 업소명 | 업종 | 업태 | 업소주소
0 | 1 | 거제시 | 외도널서리제과 | 제과점영업 | 제과점영업 | 경상남도 거제시 일운면 구조라로4길 21(주1동 1층)
1 | 2 | 거제시 | 외도파티쉐리 | 제과점영업 | 제과점영업 | 경상남도 거제시 일운면 구조라로2길 23-3(1~2층)
2 | 3 | 거제시 | 원베이커리 | 제과점영업 | 제과점영업 | 경상남도 거제시 장승로 48(1층 장승포동)
3 | 4 | 거제시 | 원베이커리 | 제과점영업 | 제과점영업 | 경상남도 거제시 능포로2길 38(상가동 1층 110호 능포동, 옥명대우아파트)
4 | 5 | 거제시 | 장목하나로 베이커리 | 제과점영업 | 제과점영업 | 경상남도 거제시 장목면 거제북로 1210(장목농협 하나로마트 1동 1층)
5 | 6 | 거제시 | 샹뜨레베이커리 고현수협점 | 제과점영업 | 제과점영업 | 경상남도 거제시 고현천로 52(수협마트 1층 고현동)
6 | 7 | 거제시 | 샹뜨레베이커리 | 제과점영업 | 제과점영업 | 경상남도 거제시 옥포대첩로 57(1층 옥포동)
7 | 8 | 거제시 | 샹뜨레베이커리 | 제과점영업 | 제과점영업 | 경상남도 거제시 성산로1길 2(1층 옥포동)
8 | 9 | 거제시 | 샹뜨레 베이커리 | 제과점영업 | 제과점영업 | 경상남도 거제시 사등면 두동로 16(지1층)
9 | 10 | 거제시 | 샹뜨레 베이커리 | 제과점영업 | 제과점영업 | 경상남도 거제시 사등면 성포로 133(1층)

 | 연번 | 인허가관할기관 | 업소명 | 업종 | 업태 | 업소주소
1232 | 1233 | 합천군 | 합천제과점 | 제과점영업 | 제과점영업 | 경상남도 합천군 합천읍 충효로 77(1층)
1233 | 1234 | 합천군 | 풀베이커리 | 제과점영업 | 제과점영업 | 경상남도 합천군 초계면 초계중앙로 118
1234 | 1235 | 합천군 | 파리바게트 경남합천점 | 제과점영업 | 제과점영업 | 경상남도 합천군 합천읍 동서로 90-1
1235 | 1236 | 합천군 | 오세요 수제디저트 | 제과점영업 | 제과점영업 | 경상남도 합천군 합천읍 동서로 113(범한빌딩 101호)
1236 | 1237 | 합천군 | 오두막산골빵집 | 제과점영업 | 제과점영업 | 경상남도 합천군 쌍백면 평구3길 6
1237 | 1238 | 합천군 | 빵굽는 마을 | 제과점영업 | 제과점영업 | 경상남도 합천군 합천읍 동서로 70-1
1238 | 1239 | 합천군 | 빠띠셰베이커리 | 제과점영업 | 제과점영업 | 경상남도 합천군 합천읍 충효로 39
1239 | 1240 | 합천군 | 뚜레쥬르 경남합천점 | 제과점영업 | 제과점영업 | 경상남도 합천군 합천읍 옥산로 126
1240 | 1241 | 합천군 | 갓구운 농협베이커리 | 제과점영업 | 제과점영업 | 경상남도 합천군 합천읍 대야로 904(합천중부농협 하나로마트 내)
1241 | 1242 | 합천군 | cafe here(카페..여기) | 제과점영업 | 제과점영업 | 경상남도 합천군 대양면 동부로 32-2(1층)