Overview

Dataset statistics

Number of variables7
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.7 KiB
Average record size in memory58.3 B

Variable types

Numeric1
Categorical3
Text3

Dataset

Description위생관리등급평가 결과(1회성)
Author경상북도 영천시
URLhttps://www.data.go.kr/data/15004625/fileData.do

Alerts

평가등급 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
비 고 is highly overall correlated with 평가등급High correlation
연번 is highly overall correlated with 평가등급High correlation
업종명 is highly imbalanced (67.3%)Imbalance
연번 has unique valuesUnique
업소명 has unique valuesUnique
소재지(도로명) has unique valuesUnique

Reproduction

Analysis started2023-12-12 15:39:01.545581
Analysis finished2023-12-12 15:39:02.376344
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-13T00:39:02.463444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-13T00:39:02.622124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

업종명
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
식품제조가공업
94 
식품첨가물제조업
 
6

Length

Max length8
Median length7
Mean length7.06
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식품제조가공업
2nd row식품제조가공업
3rd row식품제조가공업
4th row식품제조가공업
5th row식품제조가공업

Common Values

ValueCountFrequency (%)
식품제조가공업 94
94.0%
식품첨가물제조업 6
 
6.0%

Length

2023-12-13T00:39:02.778911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:39:02.872268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식품제조가공업 94
94.0%
식품첨가물제조업 6
 
6.0%

업소명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-13T00:39:03.094478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length7.9
Min length2

Characters and Unicode

Total characters790
Distinct characters226
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row농업회사법인(주)하비랑
2nd row농업회사법인(주)삼홍
3rd row권민성과채즙
4th row주식회사생초록농원
5th row티앤
ValueCountFrequency (%)
주식회사 8
 
6.6%
농업회사법인 6
 
4.9%
주)한방세상 1
 
0.8%
버섯마을 1
 
0.8%
삼송식품 1
 
0.8%
별빛촌산삼배양영농조합법인 1
 
0.8%
영천곤충영농조합법인 1
 
0.8%
주)우성식품 1
 
0.8%
주)오케이바이오랜드 1
 
0.8%
쌍별에프엔비협동조합 1
 
0.8%
Other values (100) 100
82.0%
2023-12-13T00:39:03.506086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
40
 
5.1%
35
 
4.4%
35
 
4.4%
31
 
3.9%
28
 
3.5%
25
 
3.2%
24
 
3.0%
22
 
2.8%
21
 
2.7%
) 18
 
2.3%
Other values (216) 511
64.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 714
90.4%
Space Separator 22
 
2.8%
Close Punctuation 18
 
2.3%
Open Punctuation 17
 
2.2%
Uppercase Letter 12
 
1.5%
Lowercase Letter 3
 
0.4%
Other Symbol 2
 
0.3%
Decimal Number 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
5.6%
35
 
4.9%
35
 
4.9%
31
 
4.3%
28
 
3.9%
25
 
3.5%
24
 
3.4%
21
 
2.9%
17
 
2.4%
16
 
2.2%
Other values (197) 442
61.9%
Uppercase Letter
ValueCountFrequency (%)
N 2
16.7%
L 2
16.7%
S 1
8.3%
O 1
8.3%
I 1
8.3%
E 1
8.3%
A 1
8.3%
H 1
8.3%
K 1
8.3%
W 1
8.3%
Lowercase Letter
ValueCountFrequency (%)
t 1
33.3%
l 1
33.3%
a 1
33.3%
Decimal Number
ValueCountFrequency (%)
3 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 716
90.6%
Common 59
 
7.5%
Latin 15
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
5.6%
35
 
4.9%
35
 
4.9%
31
 
4.3%
28
 
3.9%
25
 
3.5%
24
 
3.4%
21
 
2.9%
17
 
2.4%
16
 
2.2%
Other values (198) 444
62.0%
Latin
ValueCountFrequency (%)
N 2
13.3%
L 2
13.3%
t 1
 
6.7%
l 1
 
6.7%
a 1
 
6.7%
S 1
 
6.7%
O 1
 
6.7%
I 1
 
6.7%
E 1
 
6.7%
A 1
 
6.7%
Other values (3) 3
20.0%
Common
ValueCountFrequency (%)
22
37.3%
) 18
30.5%
( 17
28.8%
3 1
 
1.7%
2 1
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 714
90.4%
ASCII 74
 
9.4%
None 2
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
40
 
5.6%
35
 
4.9%
35
 
4.9%
31
 
4.3%
28
 
3.9%
25
 
3.5%
24
 
3.4%
21
 
2.9%
17
 
2.4%
16
 
2.2%
Other values (197) 442
61.9%
ASCII
ValueCountFrequency (%)
22
29.7%
) 18
24.3%
( 17
23.0%
N 2
 
2.7%
L 2
 
2.7%
3 1
 
1.4%
t 1
 
1.4%
l 1
 
1.4%
a 1
 
1.4%
S 1
 
1.4%
Other values (8) 8
 
10.8%
None
ValueCountFrequency (%)
2
100.0%
Distinct97
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-13T00:39:03.836308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.17
Min length3

Characters and Unicode

Total characters317
Distinct characters110
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)94.0%

Sample

1st row황광주
2nd row최재혁
3rd row권민성
4th row최성구
5th row이희찬
ValueCountFrequency (%)
김재혈 2
 
1.9%
최재혁 2
 
1.9%
도기식 2
 
1.9%
1명 2
 
1.9%
2
 
1.9%
권주영 1
 
1.0%
강동영 1
 
1.0%
박동환 1
 
1.0%
한민아 1
 
1.0%
오세창 1
 
1.0%
Other values (90) 90
85.7%
2023-12-13T00:39:04.307447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
5.0%
14
 
4.4%
12
 
3.8%
9
 
2.8%
9
 
2.8%
7
 
2.2%
7
 
2.2%
7
 
2.2%
7
 
2.2%
6
 
1.9%
Other values (100) 223
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 308
97.2%
Space Separator 5
 
1.6%
Decimal Number 2
 
0.6%
Open Punctuation 1
 
0.3%
Close Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
5.2%
14
 
4.5%
12
 
3.9%
9
 
2.9%
9
 
2.9%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
1.9%
Other values (96) 214
69.5%
Space Separator
ValueCountFrequency (%)
5
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 308
97.2%
Common 9
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
5.2%
14
 
4.5%
12
 
3.9%
9
 
2.9%
9
 
2.9%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
1.9%
Other values (96) 214
69.5%
Common
ValueCountFrequency (%)
5
55.6%
1 2
 
22.2%
( 1
 
11.1%
) 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 308
97.2%
ASCII 9
 
2.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
5.2%
14
 
4.5%
12
 
3.9%
9
 
2.9%
9
 
2.9%
7
 
2.3%
7
 
2.3%
7
 
2.3%
7
 
2.3%
6
 
1.9%
Other values (96) 214
69.5%
ASCII
ValueCountFrequency (%)
5
55.6%
1 2
 
22.2%
( 1
 
11.1%
) 1
 
11.1%
Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-13T00:39:04.627846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length26
Mean length17.99
Min length12

Characters and Unicode

Total characters1799
Distinct characters129
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row영천시 고경면 상계로 178-7
2nd row영천시 대창면 강회3길 46 주1동
3rd row영천시 화남면 신선로 66
4th row영천시 청통면 사일로 113-31
5th row영천시 임고면 운주로 267-36
ValueCountFrequency (%)
영천시 100
 
23.5%
청통면 14
 
3.3%
금호읍 11
 
2.6%
도동 8
 
1.9%
임고면 8
 
1.9%
고경면 7
 
1.6%
한방로 6
 
1.4%
18 6
 
1.4%
화남면 6
 
1.4%
천문로 6
 
1.4%
Other values (190) 254
59.6%
2023-12-13T00:39:05.170875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
339
18.8%
117
 
6.5%
108
 
6.0%
106
 
5.9%
1 78
 
4.3%
60
 
3.3%
56
 
3.1%
2 46
 
2.6%
45
 
2.5%
- 43
 
2.4%
Other values (119) 801
44.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 987
54.9%
Decimal Number 355
 
19.7%
Space Separator 339
 
18.8%
Dash Punctuation 43
 
2.4%
Open Punctuation 33
 
1.8%
Close Punctuation 33
 
1.8%
Uppercase Letter 9
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
117
 
11.9%
108
 
10.9%
106
 
10.7%
60
 
6.1%
56
 
5.7%
45
 
4.6%
41
 
4.2%
25
 
2.5%
18
 
1.8%
18
 
1.8%
Other values (100) 393
39.8%
Decimal Number
ValueCountFrequency (%)
1 78
22.0%
2 46
13.0%
3 39
11.0%
6 36
10.1%
4 30
 
8.5%
8 29
 
8.2%
9 28
 
7.9%
0 26
 
7.3%
7 23
 
6.5%
5 20
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
A 3
33.3%
E 2
22.2%
B 2
22.2%
D 1
 
11.1%
C 1
 
11.1%
Space Separator
ValueCountFrequency (%)
339
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 43
100.0%
Open Punctuation
ValueCountFrequency (%)
( 33
100.0%
Close Punctuation
ValueCountFrequency (%)
) 33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 987
54.9%
Common 803
44.6%
Latin 9
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
117
 
11.9%
108
 
10.9%
106
 
10.7%
60
 
6.1%
56
 
5.7%
45
 
4.6%
41
 
4.2%
25
 
2.5%
18
 
1.8%
18
 
1.8%
Other values (100) 393
39.8%
Common
ValueCountFrequency (%)
339
42.2%
1 78
 
9.7%
2 46
 
5.7%
- 43
 
5.4%
3 39
 
4.9%
6 36
 
4.5%
( 33
 
4.1%
) 33
 
4.1%
4 30
 
3.7%
8 29
 
3.6%
Other values (4) 97
 
12.1%
Latin
ValueCountFrequency (%)
A 3
33.3%
E 2
22.2%
B 2
22.2%
D 1
 
11.1%
C 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 987
54.9%
ASCII 812
45.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
339
41.7%
1 78
 
9.6%
2 46
 
5.7%
- 43
 
5.3%
3 39
 
4.8%
6 36
 
4.4%
( 33
 
4.1%
) 33
 
4.1%
4 30
 
3.7%
8 29
 
3.6%
Other values (9) 106
 
13.1%
Hangul
ValueCountFrequency (%)
117
 
11.9%
108
 
10.9%
106
 
10.7%
60
 
6.1%
56
 
5.7%
45
 
4.6%
41
 
4.2%
25
 
2.5%
18
 
1.8%
18
 
1.8%
Other values (100) 393
39.8%

평가등급
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
평가불가
50 
일반관리
39 
자율관리
11 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row자율관리
2nd row자율관리
3rd row자율관리
4th row자율관리
5th row자율관리

Common Values

ValueCountFrequency (%)
평가불가 50
50.0%
일반관리 39
39.0%
자율관리 11
 
11.0%

Length

2023-12-13T00:39:05.341856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:39:05.457080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
평가불가 50
50.0%
일반관리 39
39.0%
자율관리 11
 
11.0%

비 고
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
<NA>
50 
폐문
34 
haccp
10 
업종변경
 
2
폐업
 
2
Other values (2)
 
2

Length

Max length5
Median length4
Mean length3.36
Min length2

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 50
50.0%
폐문 34
34.0%
haccp 10
 
10.0%
업종변경 2
 
2.0%
폐업 2
 
2.0%
2공장 1
 
1.0%
3공장 1
 
1.0%

Length

2023-12-13T00:39:05.570479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T00:39:05.720998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 50
50.0%
폐문 34
34.0%
haccp 10
 
10.0%
업종변경 2
 
2.0%
폐업 2
 
2.0%
2공장 1
 
1.0%
3공장 1
 
1.0%

Interactions

2023-12-13T00:39:02.080373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T00:39:05.822016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명업소명영업자소재지(도로명)평가등급비 고
연번1.0000.5521.0000.7991.0000.9660.600
업종명0.5521.0001.0001.0001.0000.0340.000
업소명1.0001.0001.0001.0001.0001.0001.000
영업자0.7991.0001.0001.0001.0000.0001.000
소재지(도로명)1.0001.0001.0001.0001.0001.0001.000
평가등급0.9660.0341.0000.0001.0001.000NaN
비 고0.6000.0001.0001.0001.000NaN1.000
2023-12-13T00:39:05.942846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
평가등급비 고업종명
평가등급1.0001.0000.054
비 고1.0001.0000.000
업종명0.0540.0001.000
2023-12-13T00:39:06.048168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종명평가등급비 고
연번1.0000.4080.9350.453
업종명0.4081.0000.0540.000
평가등급0.9350.0541.0001.000
비 고0.4530.0001.0001.000

Missing values

2023-12-13T00:39:02.197244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T00:39:02.330228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종명업소명영업자소재지(도로명)평가등급비 고
01식품제조가공업농업회사법인(주)하비랑황광주영천시 고경면 상계로 178-7자율관리<NA>
12식품제조가공업농업회사법인(주)삼홍최재혁영천시 대창면 강회3길 46 주1동자율관리<NA>
23식품제조가공업권민성과채즙권민성영천시 화남면 신선로 66자율관리<NA>
34식품제조가공업주식회사생초록농원최성구영천시 청통면 사일로 113-31자율관리<NA>
45식품제조가공업티앤이희찬영천시 임고면 운주로 267-36자율관리<NA>
56식품제조가공업주식회사 태극인김상진영천시 화산면 장수로 781자율관리<NA>
67식품제조가공업농업회사법인(주)원광정용호영천시 임고면 황강공단길 15자율관리<NA>
78식품제조가공업농업회사법인 한조주식회사신현수영천시 천문로 692-60 (오미동)자율관리<NA>
89식품제조가공업신성에프엔비(주) 영천지사최영환영천시 천문로 692-38 주1동 (오미동)자율관리<NA>
910식품제조가공업주)자두푸드시스템곽혁태영천시 청통면 계포1길 3-2자율관리<NA>
연번업종명업소명영업자소재지(도로명)평가등급비 고
9091식품제조가공업희망마을영농조합법인김준호영천시 북안면 운북로 1195평가불가폐문
9192식품제조가공업별다믄영농조합법인조영운영천시 임고면 포은로 650평가불가폐문
9293식품제조가공업농업회사법인 주식회사 홀그레인호밀농장류한욱영천시 화산면 대암길 64평가불가폐문
9394식품제조가공업광명농산강인숙영천시 서당길 25-75 (도남동)평가불가폐문
9495식품제조가공업건강을쓰담배윤경영천시 화남면 월령길 194평가불가폐문
9596식품제조가공업대한한약협동조합한상일영천시 화남면 한천길 131-13 부2평가불가폐문
9697식품첨가물제조업덕수산업이준호영천시 임고면 운주로 189-17평가불가폐문
9798식품제조가공업보강곤충농장식품안연정영천시 야사시장길 43 A동 110호 (야사동)평가불가폐문
9899식품제조가공업용수농원안홍석영천시 고경면 창방우길 46평가불가폐업
99100식품제조가공업떡마루김민엽영천시 임고면 덕연길 6평가불가폐업