Overview

Dataset statistics

Number of variables4
Number of observations62
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory37.1 B

Variable types

Numeric3
Categorical1

Dataset

Description국립농산물품질관리원에서 관리하는 농축산물 원산지표시 업태별 적발현황(연도, 업종, 거짓표시 적발실적, 미표시 적발실적)
Author국립농산물품질관리원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220613000000002102

Alerts

거짓표시 적발실적(개소) is highly overall correlated with 미표시 적발실적(개소)High correlation
미표시 적발실적(개소) is highly overall correlated with 거짓표시 적발실적(개소)High correlation

Reproduction

Analysis started2024-03-23 07:21:55.657662
Analysis finished2024-03-23 07:22:00.231658
Duration4.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct6
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2020.4355
Minimum2018
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size690.0 B
2024-03-23T07:22:00.674144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2018
5-th percentile2018
Q12019
median2020
Q32022
95-th percentile2023
Maximum2023
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7331954
Coefficient of variation (CV)0.00085783257
Kurtosis-1.2941004
Mean2020.4355
Median Absolute Deviation (MAD)1.5
Skewness0.053629018
Sum125267
Variance3.0039662
MonotonicityIncreasing
2024-03-23T07:22:01.297324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2018 11
17.7%
2019 11
17.7%
2020 10
16.1%
2021 10
16.1%
2022 10
16.1%
2023 10
16.1%
ValueCountFrequency (%)
2018 11
17.7%
2019 11
17.7%
2020 10
16.1%
2021 10
16.1%
2022 10
16.1%
2023 10
16.1%
ValueCountFrequency (%)
2023 10
16.1%
2022 10
16.1%
2021 10
16.1%
2020 10
16.1%
2019 11
17.7%
2018 11
17.7%

업종
Categorical

Distinct14
Distinct (%)22.6%
Missing0
Missing (%)0.0%
Memory size628.0 B
일반음식점
식육판매업
가공업체
통신판매업체
노점상
Other values (9)
32 

Length

Max length9
Median length6
Mean length4.4516129
Min length2

Unique

Unique1 ?
Unique (%)1.6%

Sample

1st row일반음식점
2nd row식육판매업
3rd row가공업체
4th row통신판매업체
5th row집단급식소

Common Values

ValueCountFrequency (%)
일반음식점 6
9.7%
식육판매업 6
9.7%
가공업체 6
9.7%
통신판매업체 6
9.7%
노점상 6
9.7%
휴게음식점 6
9.7%
슈퍼 5
8.1%
식품유통업 4
6.5%
도매상 4
6.5%
제과점영업 4
6.5%
Other values (4) 9
14.5%

Length

2024-03-23T07:22:01.851679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반음식점 6
9.7%
식육판매업 6
9.7%
가공업체 6
9.7%
통신판매업체 6
9.7%
노점상 6
9.7%
휴게음식점 6
9.7%
슈퍼 5
8.1%
식품유통업 4
6.5%
도매상 4
6.5%
제과점영업 4
6.5%
Other values (4) 9
14.5%

거짓표시 적발실적(개소)
Real number (ℝ)

HIGH CORRELATION 

Distinct47
Distinct (%)75.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean159.95161
Minimum1
Maximum1633
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size690.0 B
2024-03-23T07:22:02.223894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.1
Q119
median30.5
Q3147.25
95-th percentile968.3
Maximum1633
Range1632
Interquartile range (IQR)128.25

Descriptive statistics

Standard deviation335.65588
Coefficient of variation (CV)2.0984838
Kurtosis11.252363
Mean159.95161
Median Absolute Deviation (MAD)23
Skewness3.3595381
Sum9917
Variance112664.87
MonotonicityNot monotonic
2024-03-23T07:22:02.891957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
21 3
 
4.8%
37 3
 
4.8%
15 3
 
4.8%
20 2
 
3.2%
9 2
 
3.2%
176 2
 
3.2%
4 2
 
3.2%
29 2
 
3.2%
27 2
 
3.2%
28 2
 
3.2%
Other values (37) 39
62.9%
ValueCountFrequency (%)
1 1
 
1.6%
2 1
 
1.6%
4 2
3.2%
6 1
 
1.6%
7 1
 
1.6%
8 1
 
1.6%
9 2
3.2%
10 1
 
1.6%
13 1
 
1.6%
15 3
4.8%
ValueCountFrequency (%)
1633 1
1.6%
1594 1
1.6%
991 1
1.6%
974 1
1.6%
860 1
1.6%
283 1
1.6%
265 1
1.6%
244 1
1.6%
217 1
1.6%
197 1
1.6%

미표시 적발실적(개소)
Real number (ℝ)

HIGH CORRELATION 

Distinct50
Distinct (%)80.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean119.46774
Minimum6
Maximum806
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size690.0 B
2024-03-23T07:22:03.793612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6
5-th percentile12.05
Q124
median46
Q3121.5
95-th percentile600.8
Maximum806
Range800
Interquartile range (IQR)97.5

Descriptive statistics

Standard deviation180.38917
Coefficient of variation (CV)1.5099404
Kurtosis6.2941587
Mean119.46774
Median Absolute Deviation (MAD)28.5
Skewness2.5982186
Sum7407
Variance32540.253
MonotonicityNot monotonic
2024-03-23T07:22:04.589989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
46 3
 
4.8%
33 3
 
4.8%
13 3
 
4.8%
23 3
 
4.8%
24 3
 
4.8%
20 2
 
3.2%
39 2
 
3.2%
652 1
 
1.6%
296 1
 
1.6%
92 1
 
1.6%
Other values (40) 40
64.5%
ValueCountFrequency (%)
6 1
 
1.6%
8 1
 
1.6%
9 1
 
1.6%
12 1
 
1.6%
13 3
4.8%
16 1
 
1.6%
19 1
 
1.6%
20 2
3.2%
21 1
 
1.6%
23 3
4.8%
ValueCountFrequency (%)
806 1
1.6%
746 1
1.6%
652 1
1.6%
603 1
1.6%
559 1
1.6%
296 1
1.6%
234 1
1.6%
233 1
1.6%
227 1
1.6%
212 1
1.6%

Interactions

2024-03-23T07:21:58.251397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:21:55.946109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:21:57.082980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:21:58.669500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:21:56.252529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:21:57.397524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:21:59.177871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:21:56.605595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:21:57.761889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T07:22:04.938853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도업종거짓표시 적발실적(개소)미표시 적발실적(개소)
연도1.0000.0000.0000.000
업종0.0001.0000.6270.661
거짓표시 적발실적(개소)0.0000.6271.0000.909
미표시 적발실적(개소)0.0000.6610.9091.000
2024-03-23T07:22:05.203514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도거짓표시 적발실적(개소)미표시 적발실적(개소)업종
연도1.000-0.377-0.3240.000
거짓표시 적발실적(개소)-0.3771.0000.8680.353
미표시 적발실적(개소)-0.3240.8681.0000.339
업종0.0000.3530.3391.000

Missing values

2024-03-23T07:21:59.701088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T07:22:00.122356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도업종거짓표시 적발실적(개소)미표시 적발실적(개소)
02018일반음식점1633652
12018식육판매업244152
22018가공업체197212
32018통신판매업체5929
42018집단급식소3733
52018노점상3762
62018슈퍼2851
72018휴게음식점2828
82018식품유통업2723
92018도매상2123
연도업종거짓표시 적발실적(개소)미표시 적발실적(개소)
522023일반음식점265227
532023가공업체5246
542023식육판매업3746
552023통신판매업체158
562023노점상423
572023휴게음식점713
582023식육즉석판매가공업49
592023도매상913
602023슈퍼213
612023제과점영업16