Overview

Dataset statistics

Number of variables4
Number of observations319
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.4 KiB
Average record size in memory33.4 B

Variable types

Categorical1
Text2
Numeric1

Dataset

Description법정소독의무대상으로 소독을 실시하여야 하는 식품접객업 업소 중 연면적 300제곱미터 이상의 업소에 대한 설명입니다.
URLhttps://www.data.go.kr/data/15048868/fileData.do

Alerts

업소명 has unique valuesUnique
영업장면적(내부및외부) has 72 (22.6%) zerosZeros

Reproduction

Analysis started2023-12-12 03:02:05.171240
Analysis finished2023-12-12 03:02:05.893961
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct7
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
집단급식소
180 
일반음식점
82 
위탁급식영업
22 
휴게음식점
21 
유흥주점영업
 
11
Other values (2)
 
3

Length

Max length6
Median length5
Mean length5.1003135
Min length4

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
집단급식소 180
56.4%
일반음식점 82
25.7%
위탁급식영업 22
 
6.9%
휴게음식점 21
 
6.6%
유흥주점영업 11
 
3.4%
제과점영업 2
 
0.6%
단란주점 1
 
0.3%

Length

2023-12-12T12:02:05.993404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:02:06.151226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
집단급식소 180
56.4%
일반음식점 82
25.7%
위탁급식영업 22
 
6.9%
휴게음식점 21
 
6.6%
유흥주점영업 11
 
3.4%
제과점영업 2
 
0.6%
단란주점 1
 
0.3%

업소명
Text

UNIQUE 

Distinct319
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T12:02:06.445960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length19
Mean length8.5047022
Min length2

Characters and Unicode

Total characters2713
Distinct characters377
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique319 ?
Unique (%)100.0%

Sample

1st row북경장
2nd row더하우스갑을
3rd row호텔동방블라썸
4th row호텔동방스카이라운지
5th row포시즌뷔페
ValueCountFrequency (%)
경상국립대학교 9
 
2.4%
진주점 3
 
0.8%
학생생활관식당 2
 
0.5%
돌잔치전문점 2
 
0.5%
진주신안점 2
 
0.5%
스타벅스 2
 
0.5%
한국산업기술시험원 2
 
0.5%
주)아이비푸드 2
 
0.5%
주)이마트진주점 1
 
0.3%
진주혜광학교 1
 
0.3%
Other values (344) 344
93.0%
2023-12-12T12:02:06.962219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
122
 
4.5%
104
 
3.8%
100
 
3.7%
94
 
3.5%
66
 
2.4%
58
 
2.1%
52
 
1.9%
51
 
1.9%
) 48
 
1.8%
( 48
 
1.8%
Other values (367) 1970
72.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2500
92.1%
Space Separator 51
 
1.9%
Close Punctuation 48
 
1.8%
Open Punctuation 48
 
1.8%
Uppercase Letter 37
 
1.4%
Lowercase Letter 22
 
0.8%
Decimal Number 4
 
0.1%
Other Punctuation 2
 
0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
122
 
4.9%
104
 
4.2%
100
 
4.0%
94
 
3.8%
66
 
2.6%
58
 
2.3%
52
 
2.1%
48
 
1.9%
45
 
1.8%
43
 
1.7%
Other values (327) 1768
70.7%
Uppercase Letter
ValueCountFrequency (%)
T 4
10.8%
D 4
10.8%
A 4
10.8%
N 3
 
8.1%
E 3
 
8.1%
H 2
 
5.4%
S 2
 
5.4%
L 2
 
5.4%
U 2
 
5.4%
O 2
 
5.4%
Other values (8) 9
24.3%
Lowercase Letter
ValueCountFrequency (%)
o 4
18.2%
e 4
18.2%
n 3
13.6%
d 2
9.1%
i 2
9.1%
t 1
 
4.5%
a 1
 
4.5%
k 1
 
4.5%
s 1
 
4.5%
c 1
 
4.5%
Other values (2) 2
9.1%
Decimal Number
ValueCountFrequency (%)
2 1
25.0%
5 1
25.0%
3 1
25.0%
7 1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
& 1
50.0%
Space Separator
ValueCountFrequency (%)
51
100.0%
Close Punctuation
ValueCountFrequency (%)
) 48
100.0%
Open Punctuation
ValueCountFrequency (%)
( 48
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2500
92.1%
Common 153
 
5.6%
Latin 60
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
122
 
4.9%
104
 
4.2%
100
 
4.0%
94
 
3.8%
66
 
2.6%
58
 
2.3%
52
 
2.1%
48
 
1.9%
45
 
1.8%
43
 
1.7%
Other values (327) 1768
70.7%
Latin
ValueCountFrequency (%)
T 4
 
6.7%
D 4
 
6.7%
o 4
 
6.7%
A 4
 
6.7%
e 4
 
6.7%
N 3
 
5.0%
E 3
 
5.0%
n 3
 
5.0%
d 2
 
3.3%
i 2
 
3.3%
Other values (21) 27
45.0%
Common
ValueCountFrequency (%)
51
33.3%
) 48
31.4%
( 48
31.4%
. 1
 
0.7%
2 1
 
0.7%
& 1
 
0.7%
5 1
 
0.7%
3 1
 
0.7%
7 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2500
92.1%
ASCII 212
 
7.8%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
122
 
4.9%
104
 
4.2%
100
 
4.0%
94
 
3.8%
66
 
2.6%
58
 
2.3%
52
 
2.1%
48
 
1.9%
45
 
1.8%
43
 
1.7%
Other values (327) 1768
70.7%
ASCII
ValueCountFrequency (%)
51
24.1%
) 48
22.6%
( 48
22.6%
T 4
 
1.9%
D 4
 
1.9%
o 4
 
1.9%
A 4
 
1.9%
e 4
 
1.9%
N 3
 
1.4%
E 3
 
1.4%
Other values (29) 39
18.4%
Number Forms
ValueCountFrequency (%)
1
100.0%
Distinct316
Distinct (%)99.4%
Missing1
Missing (%)0.3%
Memory size2.6 KiB
2023-12-12T12:02:07.398473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length43
Mean length29.150943
Min length18

Characters and Unicode

Total characters9270
Distinct characters256
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique315 ?
Unique (%)99.1%

Sample

1st row경상남도 진주시 동성동 13-15 (1.2.3층)
2nd row경상남도 진주시 신안동 24-5 외1필지 갑을가든 A동 1층일부,2층일부
3rd row경상남도 진주시 옥봉동 803-4 동방호텔 3층일부,4층일부
4th row경상남도 진주시 옥봉동 803-4 동방호텔 10층 일부
5th row경상남도 진주시 칠암동 496-22 fourseasons
ValueCountFrequency (%)
경상남도 318
 
17.3%
진주시 318
 
17.3%
충무공동 43
 
2.3%
1층 34
 
1.8%
1층일부 29
 
1.6%
일부 28
 
1.5%
평거동 22
 
1.2%
2층 20
 
1.1%
가좌동 18
 
1.0%
칠암동 15
 
0.8%
Other values (619) 993
54.0%
2023-12-12T12:02:07.964085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1663
 
17.9%
1 427
 
4.6%
390
 
4.2%
378
 
4.1%
359
 
3.9%
345
 
3.7%
336
 
3.6%
323
 
3.5%
322
 
3.5%
317
 
3.4%
Other values (246) 4410
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5261
56.8%
Space Separator 1663
 
17.9%
Decimal Number 1608
 
17.3%
Close Punctuation 206
 
2.2%
Open Punctuation 206
 
2.2%
Other Punctuation 154
 
1.7%
Dash Punctuation 137
 
1.5%
Uppercase Letter 19
 
0.2%
Lowercase Letter 11
 
0.1%
Math Symbol 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
390
 
7.4%
378
 
7.2%
359
 
6.8%
345
 
6.6%
336
 
6.4%
323
 
6.1%
322
 
6.1%
317
 
6.0%
221
 
4.2%
159
 
3.0%
Other values (212) 2111
40.1%
Decimal Number
ValueCountFrequency (%)
1 427
26.6%
2 266
16.5%
3 143
 
8.9%
0 135
 
8.4%
5 123
 
7.6%
9 114
 
7.1%
7 109
 
6.8%
4 106
 
6.6%
6 93
 
5.8%
8 92
 
5.7%
Uppercase Letter
ValueCountFrequency (%)
A 6
31.6%
B 3
15.8%
G 2
 
10.5%
S 2
 
10.5%
D 2
 
10.5%
C 1
 
5.3%
Y 1
 
5.3%
M 1
 
5.3%
E 1
 
5.3%
Lowercase Letter
ValueCountFrequency (%)
s 3
27.3%
o 2
18.2%
n 1
 
9.1%
e 1
 
9.1%
a 1
 
9.1%
r 1
 
9.1%
u 1
 
9.1%
f 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
, 140
90.9%
. 14
 
9.1%
Space Separator
ValueCountFrequency (%)
1663
100.0%
Close Punctuation
ValueCountFrequency (%)
) 206
100.0%
Open Punctuation
ValueCountFrequency (%)
( 206
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 137
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5261
56.8%
Common 3979
42.9%
Latin 30
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
390
 
7.4%
378
 
7.2%
359
 
6.8%
345
 
6.6%
336
 
6.4%
323
 
6.1%
322
 
6.1%
317
 
6.0%
221
 
4.2%
159
 
3.0%
Other values (212) 2111
40.1%
Common
ValueCountFrequency (%)
1663
41.8%
1 427
 
10.7%
2 266
 
6.7%
) 206
 
5.2%
( 206
 
5.2%
3 143
 
3.6%
, 140
 
3.5%
- 137
 
3.4%
0 135
 
3.4%
5 123
 
3.1%
Other values (7) 533
 
13.4%
Latin
ValueCountFrequency (%)
A 6
20.0%
B 3
10.0%
s 3
10.0%
G 2
 
6.7%
S 2
 
6.7%
D 2
 
6.7%
o 2
 
6.7%
n 1
 
3.3%
e 1
 
3.3%
a 1
 
3.3%
Other values (7) 7
23.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5261
56.8%
ASCII 4009
43.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1663
41.5%
1 427
 
10.7%
2 266
 
6.6%
) 206
 
5.1%
( 206
 
5.1%
3 143
 
3.6%
, 140
 
3.5%
- 137
 
3.4%
0 135
 
3.4%
5 123
 
3.1%
Other values (24) 563
 
14.0%
Hangul
ValueCountFrequency (%)
390
 
7.4%
378
 
7.2%
359
 
6.8%
345
 
6.6%
336
 
6.4%
323
 
6.1%
322
 
6.1%
317
 
6.0%
221
 
4.2%
159
 
3.0%
Other values (212) 2111
40.1%

영업장면적(내부및외부)
Real number (ℝ)

ZEROS 

Distinct233
Distinct (%)73.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean391.50304
Minimum0
Maximum4565.62
Zeros72
Zeros (%)22.6%
Negative0
Negative (%)0.0%
Memory size2.9 KiB
2023-12-12T12:02:08.154619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q119.25
median336.6
Q3486.75
95-th percentile1075.119
Maximum4565.62
Range4565.62
Interquartile range (IQR)467.5

Descriptive statistics

Standard deviation494.91732
Coefficient of variation (CV)1.2641468
Kurtosis23.773359
Mean391.50304
Median Absolute Deviation (MAD)252.6
Skewness3.8920647
Sum124889.47
Variance244943.16
MonotonicityNot monotonic
2023-12-12T12:02:08.323729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 72
 
22.6%
1586.12 2
 
0.6%
948.0 2
 
0.6%
337.16 2
 
0.6%
480.33 2
 
0.6%
455.37 2
 
0.6%
486.75 2
 
0.6%
509.9 2
 
0.6%
685.0 2
 
0.6%
514.89 2
 
0.6%
Other values (223) 229
71.8%
ValueCountFrequency (%)
0.0 72
22.6%
12.36 1
 
0.3%
13.68 1
 
0.3%
14.4 1
 
0.3%
15.5 1
 
0.3%
16.5 1
 
0.3%
17.1 1
 
0.3%
17.22 1
 
0.3%
18.1 1
 
0.3%
20.4 1
 
0.3%
ValueCountFrequency (%)
4565.62 1
0.3%
3364.19 1
0.3%
3229.05 1
0.3%
2802.14 1
0.3%
1648.33 1
0.3%
1624.98 1
0.3%
1586.12 2
0.6%
1555.0 1
0.3%
1532.0 1
0.3%
1511.17 1
0.3%

Interactions

2023-12-12T12:02:05.535661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:02:08.445753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명영업장면적(내부및외부)
업종명1.0000.242
영업장면적(내부및외부)0.2421.000
2023-12-12T12:02:08.548999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
영업장면적(내부및외부)업종명
영업장면적(내부및외부)1.0000.086
업종명0.0861.000

Missing values

2023-12-12T12:02:05.704263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:02:05.848354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(지번)영업장면적(내부및외부)
0일반음식점북경장경상남도 진주시 동성동 13-15 (1.2.3층)302.46
1일반음식점더하우스갑을경상남도 진주시 신안동 24-5 외1필지 갑을가든 A동 1층일부,2층일부936.0
2일반음식점호텔동방블라썸경상남도 진주시 옥봉동 803-4 동방호텔 3층일부,4층일부1423.73
3일반음식점호텔동방스카이라운지경상남도 진주시 옥봉동 803-4 동방호텔 10층 일부781.5
4일반음식점포시즌뷔페경상남도 진주시 칠암동 496-22 fourseasons1440.85
5일반음식점진양호밀면경상남도 진주시 평거동 398-10 외1필지 2층322.47
6일반음식점레이크사이드 컨벤션(Lake side convention)경상남도 진주시 판문동 469-5 번지(1층.2층)421.3
7일반음식점목넴기숯불갈비식육식당경상남도 진주시 강남동 199-3 번지(1.2층)313.72
8일반음식점양가돈경상남도 진주시 평거동 786-1 2층332.49
9일반음식점망경횟집경상남도 진주시 망경동 96-5396.65
업종명업소명소재지(지번)영업장면적(내부및외부)
309집단급식소(공립단설)진주누리유치원경상남도 진주시 진산로358번길 13, 1층 일부 (장재동)181.89
310집단급식소진주불교대학총동문회 관음의집경상남도 진주시 망경로275번길 2, 1층 (망경동)76.24
311집단급식소주식회사코렛경상남도 진주시 에나로128번길 26, 6층 601호 일부 (충무공동)92.18
312집단급식소충무공초등학교경상남도 진주시 사들로 70, 1층 일부 (충무공동)757.38
313집단급식소한일병원경상남도 진주시 범골로 17, 한일병원 주1동 지하1층 일부 (충무공동)514.89
314집단급식소대곡중학교경상남도 진주시 소호로 8, 1층 (충무공동)722.0
315집단급식소프라임병원경상남도 진주시 명석면 나불로 305, 8층일부174.13
316집단급식소한화갤러리아(주) 진주점경상남도 진주시 진주대로 1095, 한화갤러리아백화점 지하2층 (평안동)468.18
317집단급식소진주갈릴리교회경상남도 진주시 천수로 141, 진주갈릴리교회 지하1층 (망경동)362.99
318집단급식소은하수초등학교경상남도 진주시 진주역로73번길 7, 은하수초등학교 (가좌동)666.15