Overview

Dataset statistics

Number of variables: 5
Number of observations: 22
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.0 KiB
Average record size in memory: 47.0 B

Variable types

Categorical: 3
Text: 1
Numeric: 1

Dataset

Description: Safety inspections of food ingredients supplied by school-meal vendors in Daejeon Metropolitan City, carried out to help ensure safe food for students.
URL: https://www.data.go.kr/data/15083626/fileData.do

Alerts

Constant: 검사결과 (inspection result) has constant value "이상없음" (no abnormality)
High correlation: 검사항목 (test items) is highly overall correlated with 검사건수 (number of inspections) and 1 other field
High correlation: 구분 (category) is highly overall correlated with 검사건수 and 1 other field
High correlation: 검사건수 is highly overall correlated with 구분 and 1 other field

Reproduction

Analysis started: 2023-12-12 17:47:51.331478
Analysis finished: 2023-12-12 17:47:51.837130
Duration: 0.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

구분 (category)
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): 18.2%
Missing: 0
Missing (%): 0.0%
Memory size: 308.0 B

Top values: 곡류 (14); 김치류 (6); 육류(한우) (1); 육류(돼지고기) (1)

Length

Max length: 8
Median length: 2
Mean length: 2.7272727
Min length: 2

Unique

Unique: 2
Unique (%): 9.1%

Sample

1st row: 김치류
2nd row: 김치류
3rd row: 김치류
4th row: 김치류
5th row: 김치류

Common Values

Value | Count | Frequency (%)
곡류 (grains) | 14 | 63.6%
김치류 (kimchi) | 6 | 27.3%
육류(한우) (meat, Korean beef) | 1 | 4.5%
육류(돼지고기) (meat, pork) | 1 | 4.5%
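
The Distinct, Unique, and Frequency figures for 구분 can be cross-checked from the value counts alone. The sketch below assumes only the counts listed in the Common Values table; the variable names are illustrative, and this is not necessarily how ydata-profiling computes the figures internally.

```python
from collections import Counter

# Value counts for 구분 as listed in the Common Values table.
counts = Counter({"곡류": 14, "김치류": 6, "육류(한우)": 1, "육류(돼지고기)": 1})

n_rows = sum(counts.values())                        # number of observations
distinct = len(counts)                               # values that occur at least once
unique = sum(1 for c in counts.values() if c == 1)   # values that occur exactly once

# Relative frequency of each value in percent, rounded to one decimal place.
frequency = {value: round(100 * c / n_rows, 1) for value, c in counts.items()}

print(n_rows, distinct, unique)
print(frequency)
```

Note the difference between the two headline numbers: Distinct counts every value that appears, while Unique counts only the values that appear exactly once.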

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
곡류 | 14 | 63.6%
김치류 | 6 | 27.3%
육류(한우 | 1 | 4.5%
육류(돼지고기 | 1 | 4.5%

검사품목 (inspected item)
Text

Distinct: 21
Distinct (%): 95.5%
Missing: 0
Missing (%): 0.0%
Memory size: 308.0 B

Length

Max length: 7
Median length: 4
Mean length: 3.3181818
Min length: 2

Characters and Unicode

Total characters: 73
Distinct characters: 40
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
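
As an illustration of how such properties are read, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below assumes the 22 검사품목 values are the ones listed in this report's sample and value-count tables (찰보리쌀 occurring twice); it is a cross-check, not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

# 검사품목 values as they appear in the report; order is not significant here.
items = [
    "깍두기", "총각김치", "배추김치", "열무김치", "백김치", "석박지",
    "청차조", "찹쌀", "찰수수", "찰기장", "현미", "찰보리쌀",
    "서리태", "백미", "현미찹쌀", "찰현미", "수수", "찰보리쌀",
    "찰흑미", "쌀보리", "한우2등급이상", "무항생제",
]

chars = "".join(items)
total_characters = len(chars)
distinct_characters = len(set(chars))

# General category per character: "Lo" is Other Letter, "Nd" is Decimal Number.
categories = Counter(unicodedata.category(ch) for ch in chars)

print(total_characters, distinct_characters, dict(categories))
```

The single Decimal Number character is the "2" in 한우2등급이상; every other character is a Hangul syllable.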

Unique

Unique: 20
Unique (%): 90.9%

Sample

1st row: 깍두기
2nd row: 총각김치
3rd row: 배추김치
4th row: 열무김치
5th row: 백김치

Value | Count | Frequency (%)
찰보리쌀 | 2 | 9.1%
현미 | 1 | 4.5%
한우2등급이상 | 1 | 4.5%
쌀보리 | 1 | 4.5%
찰흑미 | 1 | 4.5%
수수 | 1 | 4.5%
찰현미 | 1 | 4.5%
현미찹쌀 | 1 | 4.5%
백미 | 1 | 4.5%
서리태 | 1 | 4.5%
Other values (11) | 11 | 50.0%

Most occurring characters

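Value | Count | Frequency (%)
(missing) | 6 | 8.2%
(missing) | 5 | 6.8%
(missing) | 5 | 6.8%
(missing) | 4 | 5.5%
(missing) | 4 | 5.5%
(missing) | 4 | 5.5%
(missing) | 4 | 5.5%
(missing) | 3 | 4.1%
(missing) | 3 | 4.1%
(missing) | 2 | 2.7%
Other values (30) | 33 | 45.2%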

Most occurring categories

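Value | Count | Frequency (%)
Other Letter | 72 | 98.6%
Decimal Number | 1 | 1.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(missing) | 6 | 8.3%
(missing) | 5 | 6.9%
(missing) | 5 | 6.9%
(missing) | 4 | 5.6%
(missing) | 4 | 5.6%
(missing) | 4 | 5.6%
(missing) | 4 | 5.6%
(missing) | 3 | 4.2%
(missing) | 3 | 4.2%
(missing) | 2 | 2.8%
Other values (29) | 32 | 44.4%

Decimal Number
Value | Count | Frequency (%)
2 | 1 | 100.0%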

Most occurring scripts

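Value | Count | Frequency (%)
Hangul | 72 | 98.6%
Common | 1 | 1.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(missing) | 6 | 8.3%
(missing) | 5 | 6.9%
(missing) | 5 | 6.9%
(missing) | 4 | 5.6%
(missing) | 4 | 5.6%
(missing) | 4 | 5.6%
(missing) | 4 | 5.6%
(missing) | 3 | 4.2%
(missing) | 3 | 4.2%
(missing) | 2 | 2.8%
Other values (29) | 32 | 44.4%

Common
Value | Count | Frequency (%)
2 | 1 | 100.0%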

Most occurring blocks

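Value | Count | Frequency (%)
Hangul | 72 | 98.6%
ASCII | 1 | 1.4%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(missing) | 6 | 8.3%
(missing) | 5 | 6.9%
(missing) | 5 | 6.9%
(missing) | 4 | 5.6%
(missing) | 4 | 5.6%
(missing) | 4 | 5.6%
(missing) | 4 | 5.6%
(missing) | 3 | 4.2%
(missing) | 3 | 4.2%
(missing) | 2 | 2.8%
Other values (29) | 32 | 44.4%

ASCII
Value | Count | Frequency (%)
2 | 1 | 100.0%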

검사건수 (number of inspections)
Real number (ℝ)

HIGH CORRELATION

Distinct: 12
Distinct (%): 54.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.8636364
Minimum: 1
Maximum: 42
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 330.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1.05
Q1: 2.25
Median: 6.5
Q3: 10
95-th percentile: 34.75
Maximum: 42
Range: 41
Interquartile range (IQR): 7.75
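
These quantiles are consistent with linear interpolation between order statistics. The sketch below assumes the 22 검사건수 values implied by this report's value-count tables; `percentile` is a hypothetical helper written for this example, not a ydata-profiling function.

```python
import math

# 검사건수 values expanded from the report's value counts (22 observations).
values = sorted(
    [2] * 4 + [11] * 3 + [10] * 2 + [7] * 2 + [3] * 2 + [6] * 2 + [1] * 2
    + [9, 8, 5, 42, 36]
)

def percentile(sorted_data, p):
    """Return the p-th percentile (0-100) using linear interpolation."""
    position = (len(sorted_data) - 1) * p / 100
    lower = math.floor(position)
    upper = math.ceil(position)
    weight = position - lower
    return sorted_data[lower] * (1 - weight) + sorted_data[upper] * weight

p5 = percentile(values, 5)
q1, median, q3 = (percentile(values, p) for p in (25, 50, 75))
p95 = percentile(values, 95)
iqr = q3 - q1

print(p5, q1, median, q3, p95, iqr)
```

The 95th percentile falls between the third 11 and the 36, which is why it lands at 34.75 even though no row holds that value.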

Descriptive statistics

Standard deviation: 10.398239
Coefficient of variation (CV): 1.1731347
Kurtosis: 6.1036853
Mean: 8.8636364
Median Absolute Deviation (MAD): 4
Skewness: 2.4977874
Sum: 195
Variance: 108.12338
Monotonicity: Not monotonic
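
Most of these figures can be recomputed with Python's `statistics` module. The sketch below assumes the 22 values implied by the report's value-count tables and uses the sample (n - 1) variance, which is the convention that reproduces the reported numbers.

```python
import statistics

# 검사건수 values expanded from the report's value counts (22 observations).
values = (
    [2] * 4 + [11] * 3 + [10] * 2 + [7] * 2 + [3] * 2 + [6] * 2 + [1] * 2
    + [9, 8, 5, 42, 36]
)

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divides by n - 1
std_dev = statistics.stdev(values)
cv = std_dev / mean                       # coefficient of variation

median = statistics.median(values)
# Median absolute deviation: the median of |x - median|.
mad = statistics.median([abs(x - median) for x in values])

print(total, mean, variance, std_dev, cv, mad)
```

A CV above 1 indicates that the spread exceeds the mean, driven here by the two meat rows (36 and 42) sitting far above the other 20 values.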
Histogram with fixed size bins (bins=12)
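
Common values

Value | Count | Frequency (%)
2 | 4 | 18.2%
11 | 3 | 13.6%
10 | 2 | 9.1%
7 | 2 | 9.1%
3 | 2 | 9.1%
6 | 2 | 9.1%
1 | 2 | 9.1%
9 | 1 | 4.5%
8 | 1 | 4.5%
5 | 1 | 4.5%
Other values (2) | 2 | 9.1%

Minimum 10 values

Value | Count | Frequency (%)
1 | 2 | 9.1%
2 | 4 | 18.2%
3 | 2 | 9.1%
5 | 1 | 4.5%
6 | 2 | 9.1%
7 | 2 | 9.1%
8 | 1 | 4.5%
9 | 1 | 4.5%
10 | 2 | 9.1%
11 | 3 | 13.6%

Maximum 10 values

Value | Count | Frequency (%)
42 | 1 | 4.5%
36 | 1 | 4.5%
11 | 3 | 13.6%
10 | 2 | 9.1%
9 | 1 | 4.5%
8 | 1 | 4.5%
7 | 2 | 9.1%
6 | 2 | 9.1%
5 | 1 | 4.5%
3 | 2 | 9.1%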

검사항목 (test items)
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): 18.2%
Missing: 0
Missing (%): 0.0%
Memory size: 308.0 B

Top values: 납,카드뮴 (14); 납,카드뮴,보존료,타르색소 (6); 일반세균, 대장균, 한우유전자 (1); 일반세균, 대장균, 유해잔류물질 (1)

Length

Max length: 17
Median length: 5
Mean length: 8.5
Min length: 5

Unique

Unique: 2
Unique (%): 9.1%

Sample

1st row: 납,카드뮴,보존료,타르색소
2nd row: 납,카드뮴,보존료,타르색소
3rd row: 납,카드뮴,보존료,타르색소
4th row: 납,카드뮴,보존료,타르색소
5th row: 납,카드뮴,보존료,타르색소

Common Values

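Value | Count | Frequency (%)
납,카드뮴 (lead, cadmium) | 14 | 63.6%
납,카드뮴,보존료,타르색소 (lead, cadmium, preservatives, tar dyes) | 6 | 27.3%
일반세균, 대장균, 한우유전자 (general bacteria, E. coli, Hanwoo beef DNA) | 1 | 4.5%
일반세균, 대장균, 유해잔류물질 (general bacteria, E. coli, harmful residues) | 1 | 4.5%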

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
납,카드뮴 | 14 | 53.8%
납,카드뮴,보존료,타르색소 | 6 | 23.1%
일반세균 | 2 | 7.7%
대장균 | 2 | 7.7%
한우유전자 | 1 | 3.8%
유해잔류물질 | 1 | 3.8%
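
The counts in this last table sum to 26 rather than 22, which is what you would expect if each value is split into whitespace-separated tokens before counting. The sketch below works under that assumption: splitting on whitespace and stripping surrounding punctuation is a guess at the tokenization that happens to reproduce the table, not documented ydata-profiling behavior.

```python
import string
from collections import Counter

# 검사항목 values and their row counts from the Common Values table.
value_counts = {
    "납,카드뮴": 14,
    "납,카드뮴,보존료,타르색소": 6,
    "일반세균, 대장균, 한우유전자": 1,
    "일반세균, 대장균, 유해잔류물질": 1,
}

tokens = Counter()
for value, n in value_counts.items():
    for token in value.split():                        # split on whitespace only
        tokens[token.strip(string.punctuation)] += n   # drop trailing commas

total = sum(tokens.values())
shares = {t: round(100 * c / total, 1) for t, c in tokens.items()}

print(total)
print(shares)
```

Values written without a space after the comma (납,카드뮴) stay whole, while the two meat rows break into three tokens each, so the percentages are taken over 26 tokens instead of 22 rows.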

검사결과 (inspection result)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 4.5%
Missing: 0
Missing (%): 0.0%
Memory size: 308.0 B

Top values: 이상없음 (22)

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 이상없음
2nd row: 이상없음
3rd row: 이상없음
4th row: 이상없음
5th row: 이상없음

Common Values

Value | Count | Frequency (%)
이상없음 (no abnormality) | 22 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
이상없음 | 22 | 100.0%

Interactions


Correlations

Variable | 구분 | 검사품목 | 검사건수 | 검사항목
구분 | 1.000 | 1.000 | 0.937 | 1.000
검사품목 | 1.000 | 1.000 | 1.000 | 1.000
검사건수 | 0.937 | 1.000 | 1.000 | 0.937
검사항목 | 1.000 | 1.000 | 0.937 | 1.000

Variable | 검사항목 | 구분
검사항목 | 1.000 | 1.000
구분 | 1.000 | 1.000

Variable | 검사건수 | 구분 | 검사항목
검사건수 | 1.000 | 0.929 | 0.929
구분 | 0.929 | 1.000 | 1.000
검사항목 | 0.929 | 1.000 | 1.000
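
The 1.000 between 구분 and 검사항목 reflects a one-to-one mapping: every 구분 value pairs with exactly one 검사항목 value. The sketch below illustrates this with the plain (uncorrected) Cramér's V. Which coefficient each matrix above reports is not stated in this extract, so the choice of statistic is an assumption, and `cramers_v` is a hypothetical helper written for this example.

```python
import math
from collections import Counter

# (구분, 검사항목) pairs with their row counts, per the value tables and sample.
pairs = Counter({
    ("김치류", "납,카드뮴,보존료,타르색소"): 6,
    ("곡류", "납,카드뮴"): 14,
    ("육류(한우)", "일반세균, 대장균, 한우유전자"): 1,
    ("육류(돼지고기)", "일반세균, 대장균, 유해잔류물질"): 1,
})

def cramers_v(joint):
    """Uncorrected Cramér's V from a mapping of (row, column) -> count."""
    n = sum(joint.values())
    row_totals, col_totals = Counter(), Counter()
    for (r, c), count in joint.items():
        row_totals[r] += count
        col_totals[c] += count
    # Pearson chi-squared over the full contingency table, empty cells included.
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals))
    return math.sqrt(chi2 / (n * (k - 1)))

v = cramers_v(pairs)
print(round(v, 3))
```

Because the two columns determine each other completely, one of them carries no extra information for modelling purposes.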

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

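First rows

Index | 구분 | 검사품목 | 검사건수 | 검사항목 | 검사결과
0 | 김치류 | 깍두기 | 11 | 납,카드뮴,보존료,타르색소 | 이상없음
1 | 김치류 | 총각김치 | 11 | 납,카드뮴,보존료,타르색소 | 이상없음
2 | 김치류 | 배추김치 | 11 | 납,카드뮴,보존료,타르색소 | 이상없음
3 | 김치류 | 열무김치 | 10 | 납,카드뮴,보존료,타르색소 | 이상없음
4 | 김치류 | 백김치 | 10 | 납,카드뮴,보존료,타르색소 | 이상없음
5 | 김치류 | 석박지 | 2 | 납,카드뮴,보존료,타르색소 | 이상없음
6 | 곡류 | 청차조 | 7 | 납,카드뮴 | 이상없음
7 | 곡류 | 찹쌀 | 9 | 납,카드뮴 | 이상없음
8 | 곡류 | 찰수수 | 7 | 납,카드뮴 | 이상없음
9 | 곡류 | 찰기장 | 8 | 납,카드뮴 | 이상없음

Last rows

Index | 구분 | 검사품목 | 검사건수 | 검사항목 | 검사결과
12 | 곡류 | 서리태 | 3 | 납,카드뮴 | 이상없음
13 | 곡류 | 백미 | 6 | 납,카드뮴 | 이상없음
14 | 곡류 | 현미찹쌀 | 5 | 납,카드뮴 | 이상없음
15 | 곡류 | 찰현미 | 2 | 납,카드뮴 | 이상없음
16 | 곡류 | 수수 | 1 | 납,카드뮴 | 이상없음
17 | 곡류 | 찰보리쌀 | 2 | 납,카드뮴 | 이상없음
18 | 곡류 | 찰흑미 | 1 | 납,카드뮴 | 이상없음
19 | 곡류 | 쌀보리 | 2 | 납,카드뮴 | 이상없음
20 | 육류(한우) | 한우2등급이상 | 36 | 일반세균, 대장균, 한우유전자 | 이상없음
21 | 육류(돼지고기) | 무항생제 | 42 | 일반세균, 대장균, 유해잔류물질 | 이상없음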