Overview

Dataset statistics

Number of variables4
Number of observations313
Missing cells3
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.9 KiB
Average record size in memory32.4 B

Variable types

Categorical2
Text2

Dataset

Description전라남도 보건환경연구원에서 검사하는 미생물, 식품 약품 분석 관련 각종 검사들에 대한 수수료 내역을 정리한 파일입니다.
Author전라남도
URLhttps://www.data.go.kr/data/15041954/fileData.do

Alerts

구분 is highly overall correlated with 항목High correlation
항목 is highly overall correlated with 구분High correlation
항목 is highly imbalanced (79.6%)Imbalance

Reproduction

Analysis started2023-12-12 00:16:46.231918
Analysis finished2023-12-12 00:16:46.704102
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

항목
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
식품약품분석관련
303 
미생물 관련
 
10

Length

Max length8
Median length8
Mean length7.9361022
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row미생물 관련
2nd row미생물 관련
3rd row미생물 관련
4th row미생물 관련
5th row미생물 관련

Common Values

ValueCountFrequency (%)
식품약품분석관련 303
96.8%
미생물 관련 10
 
3.2%

Length

2023-12-12T09:16:46.792667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:16:47.021868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식품약품분석관련 303
93.8%
미생물 10
 
3.1%
관련 10
 
3.1%

구분
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
기구 및 용기포장의 시험, 검사
49 
식품(가공식품, 농임산물, 수산물, 축산물 포함)의 시험, 검사 식품별 규격 확인 시험법
47 
건강기능식품의 시험, 검사 개별성분시험법
32 
식품(가공식품, 농임산물, 수산물, 축산물 포함)의 시험, 검사 성분시험법
25 
위생용품의 시험, 검사
24 
Other values (16)
136 

Length

Max length49
Median length42
Mean length25.530351
Min length5

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row미생물검사
2nd row미생물검사
3rd row미생물검사
4th row미생물검사
5th row미생물검사

Common Values

ValueCountFrequency (%)
기구 및 용기포장의 시험, 검사 49
15.7%
식품(가공식품, 농임산물, 수산물, 축산물 포함)의 시험, 검사 식품별 규격 확인 시험법 47
15.0%
건강기능식품의 시험, 검사 개별성분시험법 32
10.2%
식품(가공식품, 농임산물, 수산물, 축산물 포함)의 시험, 검사 성분시험법 25
8.0%
위생용품의 시험, 검사 24
7.7%
의약외품의 시험, 검사 22
7.0%
식품첨가물의 시험, 검사 20
 
6.4%
화장품의 시험, 검사 15
 
4.8%
미생물학적 검사 13
 
4.2%
식품(가공식품, 농임산물, 수산물, 축산물 포함)의 시험, 검사 유해물질 13
 
4.2%
Other values (11) 53
16.9%

Length

2023-12-12T09:16:47.198569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
검사 294
16.7%
시험 284
16.2%
포함)의 116
 
6.6%
식품(가공식품 116
 
6.6%
농임산물 116
 
6.6%
수산물 116
 
6.6%
축산물 116
 
6.6%
시험법 51
 
2.9%
49
 
2.8%
기구 49
 
2.8%
Other values (26) 451
25.7%
Distinct300
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-12T09:16:47.551651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length178
Median length120
Mean length17.942492
Min length1

Characters and Unicode

Total characters5616
Distinct characters392
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique288 ?
Unique (%)92.0%

Sample

1st row대장균군검사, 일반세균수검사, 유산균수검사, 진균수검사(1항목당)
2nd row식중독균 검사(살모넬라검사, 대장균 O157:H7 검사, 장염비브리오, 황색포도상구균, 리스테리아 모노사이토제네스)(1항목당)
3rd row여시니아 엔테로코리티카, 바실러스 세레우스(1항목당)
4th row클로스트리디움 퍼프린젠스, 클로스트리디움 보툴리늄(1항목당)
5th row캠필로박터 제주니
ValueCountFrequency (%)
46
 
4.9%
시험 26
 
2.8%
경우에는 16
 
1.7%
기준 15
 
1.6%
추가함 14
 
1.5%
규격에서 12
 
1.3%
정한 12
 
1.3%
1항목 12
 
1.3%
따라 12
 
1.3%
식중독균 11
 
1.2%
Other values (525) 768
81.4%
2023-12-12T09:16:48.097217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
635
 
11.3%
( 210
 
3.7%
) 210
 
3.7%
145
 
2.6%
, 126
 
2.2%
1 106
 
1.9%
97
 
1.7%
88
 
1.6%
82
 
1.5%
81
 
1.4%
Other values (382) 3836
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4040
71.9%
Space Separator 635
 
11.3%
Decimal Number 217
 
3.9%
Open Punctuation 210
 
3.7%
Close Punctuation 210
 
3.7%
Other Punctuation 136
 
2.4%
Uppercase Letter 85
 
1.5%
Connector Punctuation 41
 
0.7%
Lowercase Letter 33
 
0.6%
Math Symbol 9
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
145
 
3.6%
97
 
2.4%
88
 
2.2%
82
 
2.0%
81
 
2.0%
81
 
2.0%
76
 
1.9%
75
 
1.9%
68
 
1.7%
65
 
1.6%
Other values (335) 3182
78.8%
Uppercase Letter
ValueCountFrequency (%)
A 17
20.0%
H 12
14.1%
B 11
12.9%
C 7
8.2%
P 7
8.2%
D 6
 
7.1%
E 6
 
7.1%
I 6
 
7.1%
G 3
 
3.5%
V 3
 
3.5%
Other values (6) 7
8.2%
Lowercase Letter
ValueCountFrequency (%)
n 10
30.3%
s 5
15.2%
e 3
 
9.1%
p 3
 
9.1%
a 2
 
6.1%
l 2
 
6.1%
g 2
 
6.1%
r 2
 
6.1%
t 1
 
3.0%
b 1
 
3.0%
Other values (2) 2
 
6.1%
Decimal Number
ValueCountFrequency (%)
1 106
48.8%
0 54
24.9%
5 22
 
10.1%
2 10
 
4.6%
4 10
 
4.6%
6 7
 
3.2%
7 5
 
2.3%
3 2
 
0.9%
9 1
 
0.5%
Other Punctuation
ValueCountFrequency (%)
, 126
92.6%
· 5
 
3.7%
" 2
 
1.5%
. 2
 
1.5%
: 1
 
0.7%
Space Separator
ValueCountFrequency (%)
635
100.0%
Open Punctuation
ValueCountFrequency (%)
( 210
100.0%
Close Punctuation
ValueCountFrequency (%)
) 210
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 41
100.0%
Math Symbol
ValueCountFrequency (%)
= 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4040
71.9%
Common 1458
 
26.0%
Latin 118
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
145
 
3.6%
97
 
2.4%
88
 
2.2%
82
 
2.0%
81
 
2.0%
81
 
2.0%
76
 
1.9%
75
 
1.9%
68
 
1.7%
65
 
1.6%
Other values (335) 3182
78.8%
Latin
ValueCountFrequency (%)
A 17
14.4%
H 12
 
10.2%
B 11
 
9.3%
n 10
 
8.5%
C 7
 
5.9%
P 7
 
5.9%
D 6
 
5.1%
E 6
 
5.1%
I 6
 
5.1%
s 5
 
4.2%
Other values (18) 31
26.3%
Common
ValueCountFrequency (%)
635
43.6%
( 210
 
14.4%
) 210
 
14.4%
, 126
 
8.6%
1 106
 
7.3%
0 54
 
3.7%
_ 41
 
2.8%
5 22
 
1.5%
2 10
 
0.7%
4 10
 
0.7%
Other values (9) 34
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4040
71.9%
ASCII 1571
 
28.0%
None 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
635
40.4%
( 210
 
13.4%
) 210
 
13.4%
, 126
 
8.0%
1 106
 
6.7%
0 54
 
3.4%
_ 41
 
2.6%
5 22
 
1.4%
A 17
 
1.1%
H 12
 
0.8%
Other values (36) 138
 
8.8%
Hangul
ValueCountFrequency (%)
145
 
3.6%
97
 
2.4%
88
 
2.2%
82
 
2.0%
81
 
2.0%
81
 
2.0%
76
 
1.9%
75
 
1.9%
68
 
1.7%
65
 
1.6%
Other values (335) 3182
78.8%
None
ValueCountFrequency (%)
· 5
100.0%
Distinct209
Distinct (%)67.4%
Missing3
Missing (%)1.0%
Memory size2.6 KiB
2023-12-12T09:16:48.791961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length5
Mean length5.0322581
Min length3

Characters and Unicode

Total characters1560
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique159 ?
Unique (%)51.3%

Sample

1st row13000
2nd row15000
3rd row13000
4th row20000
5th row22000
ValueCountFrequency (%)
42000 11
 
3.4%
8600 8
 
2.5%
30000 7
 
2.2%
시험항목에 6
 
1.9%
36000 6
 
1.9%
26000 6
 
1.9%
유사 6
 
1.9%
준함 5
 
1.6%
35000 5
 
1.6%
45000 4
 
1.2%
Other values (201) 258
80.1%
2023-12-12T09:16:49.360984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 768
49.2%
3 112
 
7.2%
1 108
 
6.9%
2 97
 
6.2%
5 91
 
5.8%
4 78
 
5.0%
6 76
 
4.9%
7 68
 
4.4%
8 58
 
3.7%
9 39
 
2.5%
Other values (10) 65
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1495
95.8%
Other Letter 53
 
3.4%
Space Separator 12
 
0.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 768
51.4%
3 112
 
7.5%
1 108
 
7.2%
2 97
 
6.5%
5 91
 
6.1%
4 78
 
5.2%
6 76
 
5.1%
7 68
 
4.5%
8 58
 
3.9%
9 39
 
2.6%
Other Letter
ValueCountFrequency (%)
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
5
9.4%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1507
96.6%
Hangul 53
 
3.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 768
51.0%
3 112
 
7.4%
1 108
 
7.2%
2 97
 
6.4%
5 91
 
6.0%
4 78
 
5.2%
6 76
 
5.0%
7 68
 
4.5%
8 58
 
3.8%
9 39
 
2.6%
Hangul
ValueCountFrequency (%)
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
5
9.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1507
96.6%
Hangul 53
 
3.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 768
51.0%
3 112
 
7.4%
1 108
 
7.2%
2 97
 
6.4%
5 91
 
6.0%
4 78
 
5.2%
6 76
 
5.0%
7 68
 
4.5%
8 58
 
3.8%
9 39
 
2.6%
Hangul
ValueCountFrequency (%)
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
6
11.3%
5
9.4%

Correlations

2023-12-12T09:16:49.470212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
항목구분
항목1.0001.000
구분1.0001.000
2023-12-12T09:16:49.566418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분항목
구분1.0000.969
항목0.9691.000
2023-12-12T09:16:49.653953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
항목구분
항목1.0000.969
구분0.9691.000

Missing values

2023-12-12T09:16:46.548778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:16:46.659858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

항목구분검사명수수료
0미생물 관련미생물검사대장균군검사, 일반세균수검사, 유산균수검사, 진균수검사(1항목당)13000
1미생물 관련미생물검사식중독균 검사(살모넬라검사, 대장균 O157:H7 검사, 장염비브리오, 황색포도상구균, 리스테리아 모노사이토제네스)(1항목당)15000
2미생물 관련미생물검사여시니아 엔테로코리티카, 바실러스 세레우스(1항목당)13000
3미생물 관련미생물검사클로스트리디움 퍼프린젠스, 클로스트리디움 보툴리늄(1항목당)20000
4미생물 관련미생물검사캠필로박터 제주니22000
5미생물 관련미생물검사엔테로박터 사카자키44000
6미생물 관련미생물검사바실러스 세레우스 정량검사90000
7미생물 관련미생물검사클로스트리디움 퍼프린젠스 정량검사80000
8미생물 관련미생물검사세균발육 시험12000
9미생물 관련미생물검사레지오넬라균검사(환경가검물)40000
항목구분검사명수수료
303식품약품분석관련위생용품의 시험, 검사면체와 축의 접착강도5100
304식품약품분석관련위생용품의 시험, 검사축의 강도5100
305식품약품분석관련위생용품의 시험, 검사염소화페놀류 오염화석탄(PCP)45100
306식품약품분석관련위생용품의 시험, 검사염소화페놀류 사염화석탄산(TeCP)43600
307식품약품분석관련위생용품의 시험, 검사아조염료126400
308식품약품분석관련위생용품의 시험, 검사프탈레이트계 가소제77300
309식품약품분석관련위생용품의 시험, 검사유해원소 용출87500
310식품약품분석관련위생용품의 시험, 검사유해원소 함유량35000
311식품약품분석관련위생용품의 시험, 검사기타 시험(1항목당) 기기분석30000
312식품약품분석관련위생용품의 시험, 검사기타 시험(1항목당) 상기외의 시험유사 시험항목에 준함