Overview

Dataset statistics

Number of variables6
Number of observations31
Missing cells63
Missing cells (%)33.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory57.3 B

Variable types

Unsupported2
Text1
Numeric2
Categorical1

Dataset

Description유통식품및자가품질검사현황2017년상반기
Author전라북도
URLhttps://www.bigdatahub.go.kr/opendata/dataSet/detail.nm?contentId=37&rlik=49451aebf056b486&serviceId=202925

Alerts

자가품질건수 is highly overall correlated with 적합 and 1 other fieldsHigh correlation
적합 is highly overall correlated with 자가품질건수 and 1 other fieldsHigh correlation
부적합 is highly overall correlated with 자가품질건수 and 1 other fieldsHigh correlation
부적합 is highly imbalanced (54.5%)Imbalance
Unnamed: 0 has 31 (100.0%) missing valuesMissing
Unnamed: 1 has 31 (100.0%) missing valuesMissing
식품 유형 has 1 (3.2%) missing valuesMissing
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
자가품질건수 has 4 (12.9%) zerosZeros
적합 has 4 (12.9%) zerosZeros

Reproduction

Analysis started2024-03-14 01:52:51.816897
Analysis finished2024-03-14 01:52:52.367197
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Unnamed: 0
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing31
Missing (%)100.0%
Memory size411.0 B

식품 유형
Text

MISSING 

Distinct30
Distinct (%)100.0%
Missing1
Missing (%)3.2%
Memory size380.0 B
2024-03-14T10:52:52.488504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length8
Mean length4.4333333
Min length2

Characters and Unicode

Total characters133
Distinct characters71
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row과자류
2nd row빵또는 떡류
3rd row코코아가공품 및 초콜릿류
4th row잼류
5th row올리고당류
ValueCountFrequency (%)
과자류 1
 
2.9%
규격외일반가공품 1
 
2.9%
젓갈류 1
 
2.9%
절임식품 1
 
2.9%
조림식품 1
 
2.9%
주류 1
 
2.9%
건포류 1
 
2.9%
기타식품류 1
 
2.9%
장기보존식품 1
 
2.9%
드레싱류 1
 
2.9%
Other values (25) 25
71.4%
2024-03-14T10:52:52.772501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
 
14.3%
11
 
8.3%
8
 
6.0%
5
 
3.8%
5
 
3.8%
4
 
3.0%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (61) 69
51.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 128
96.2%
Space Separator 5
 
3.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
14.8%
11
 
8.6%
8
 
6.2%
5
 
3.9%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.6%
Other values (60) 67
52.3%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 128
96.2%
Common 5
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
14.8%
11
 
8.6%
8
 
6.2%
5
 
3.9%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.6%
Other values (60) 67
52.3%
Common
ValueCountFrequency (%)
5
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 128
96.2%
ASCII 5
 
3.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
19
 
14.8%
11
 
8.6%
8
 
6.2%
5
 
3.9%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
2
 
1.6%
Other values (60) 67
52.3%
ASCII
ValueCountFrequency (%)
5
100.0%

자가품질건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct24
Distinct (%)77.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.677419
Minimum0
Maximum739
Zeros4
Zeros (%)12.9%
Negative0
Negative (%)0.0%
Memory size411.0 B
2024-03-14T10:52:52.884756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median9
Q326.5
95-th percentile157
Maximum739
Range739
Interquartile range (IQR)24.5

Descriptive statistics

Standard deviation135.71671
Coefficient of variation (CV)2.8465616
Kurtosis24.228008
Mean47.677419
Median Absolute Deviation (MAD)8
Skewness4.7709373
Sum1478
Variance18419.026
MonotonicityNot monotonic
2024-03-14T10:52:52.982686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
0 4
 
12.9%
2 3
 
9.7%
7 2
 
6.5%
1 2
 
6.5%
45 1
 
3.2%
74 1
 
3.2%
739 1
 
3.2%
86 1
 
3.2%
11 1
 
3.2%
18 1
 
3.2%
Other values (14) 14
45.2%
ValueCountFrequency (%)
0 4
12.9%
1 2
6.5%
2 3
9.7%
4 1
 
3.2%
5 1
 
3.2%
6 1
 
3.2%
7 2
6.5%
8 1
 
3.2%
9 1
 
3.2%
10 1
 
3.2%
ValueCountFrequency (%)
739 1
3.2%
228 1
3.2%
86 1
3.2%
75 1
3.2%
74 1
3.2%
45 1
3.2%
36 1
3.2%
29 1
3.2%
24 1
3.2%
19 1
3.2%

적합
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct22
Distinct (%)71.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.032258
Minimum0
Maximum729
Zeros4
Zeros (%)12.9%
Negative0
Negative (%)0.0%
Memory size411.0 B
2024-03-14T10:52:53.095874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12
median9
Q326.5
95-th percentile154.5
Maximum729
Range729
Interquartile range (IQR)24.5

Descriptive statistics

Standard deviation133.82463
Coefficient of variation (CV)2.8453797
Kurtosis24.268527
Mean47.032258
Median Absolute Deviation (MAD)8
Skewness4.774219
Sum1458
Variance17909.032
MonotonicityNot monotonic
2024-03-14T10:52:53.236237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
0 4
 
12.9%
2 3
 
9.7%
74 2
 
6.5%
5 2
 
6.5%
1 2
 
6.5%
7 2
 
6.5%
45 1
 
3.2%
729 1
 
3.2%
86 1
 
3.2%
11 1
 
3.2%
Other values (12) 12
38.7%
ValueCountFrequency (%)
0 4
12.9%
1 2
6.5%
2 3
9.7%
4 1
 
3.2%
5 2
6.5%
6 1
 
3.2%
7 2
6.5%
9 1
 
3.2%
10 1
 
3.2%
11 1
 
3.2%
ValueCountFrequency (%)
729 1
3.2%
223 1
3.2%
86 1
3.2%
74 2
6.5%
45 1
3.2%
35 1
3.2%
29 1
3.2%
24 1
3.2%
19 1
3.2%
18 1
3.2%

부적합
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)16.1%
Missing0
Missing (%)0.0%
Memory size380.0 B
0
25 
1
2
 
1
5
 
1
10
 
1

Length

Max length2
Median length1
Mean length1.0322581
Min length1

Unique

Unique3 ?
Unique (%)9.7%

Sample

1st row0
2nd row1
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 25
80.6%
1 3
 
9.7%
2 1
 
3.2%
5 1
 
3.2%
10 1
 
3.2%

Length

2024-03-14T10:52:53.384915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T10:52:53.475815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 25
80.6%
1 3
 
9.7%
2 1
 
3.2%
5 1
 
3.2%
10 1
 
3.2%

Interactions

2024-03-14T10:52:52.085091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T10:52:51.934610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T10:52:52.167309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T10:52:52.011064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T10:52:53.543884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
식품 유형자가품질건수적합부적합
식품 유형1.0001.0001.0001.000
자가품질건수1.0001.0001.0000.831
적합1.0001.0001.0000.831
부적합1.0000.8310.8311.000
2024-03-14T10:52:53.640043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자가품질건수적합부적합
자가품질건수1.0000.9990.786
적합0.9991.0000.786
부적합0.7860.7861.000

Missing values

2024-03-14T10:52:52.257674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T10:52:52.335246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Unnamed: 0Unnamed: 1식품 유형자가품질건수적합부적합
0<NA><NA>과자류45450
1<NA><NA>빵또는 떡류36351
2<NA><NA>코코아가공품 및 초콜릿류220
3<NA><NA>잼류220
4<NA><NA>올리고당류000
5<NA><NA>두부류 또는 묵류550
6<NA><NA>식용유지류660
7<NA><NA>면류990
8<NA><NA>다류19190
9<NA><NA>커피13130
Unnamed: 0Unnamed: 1식품 유형자가품질건수적합부적합
21<NA><NA>규격외일반가공품75741
22<NA><NA>장기보존식품110
23<NA><NA>건강기능식품000
24<NA><NA>식품첨가물29290
25<NA><NA>기구및용기포장18180
26<NA><NA>위생용품11110
27<NA><NA>농산물770
28<NA><NA>서류가공품110
29<NA><NA>수산물86860
30<NA><NA><NA>73972910