Overview

Dataset statistics

Number of variables4
Number of observations62
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory36.1 B

Variable types

Numeric2
Categorical1
Text1

Dataset

Description국립농산물품질관리원에서 관리하는 농축산물 원산지표시 업태별 적발현황(연도, 업종, 거짓표시 적발실적, 미표시 적발실적)
Author국립농산물품질관리원
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220613000000002102

Reproduction

Analysis started2024-03-23 07:22:15.661905
Analysis finished2024-03-23 07:22:18.030505
Duration2.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Real number (ℝ)

Distinct6
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2020.4355
Minimum2018
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size690.0 B
2024-03-23T07:22:18.189992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2018
5-th percentile2018
Q12019
median2020
Q32022
95-th percentile2023
Maximum2023
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.7331954
Coefficient of variation (CV)0.00085783257
Kurtosis-1.2941004
Mean2020.4355
Median Absolute Deviation (MAD)1.5
Skewness0.053629018
Sum125267
Variance3.0039662
MonotonicityIncreasing
2024-03-23T07:22:18.631840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2018 11
17.7%
2019 11
17.7%
2020 10
16.1%
2021 10
16.1%
2022 10
16.1%
2023 10
16.1%
ValueCountFrequency (%)
2018 11
17.7%
2019 11
17.7%
2020 10
16.1%
2021 10
16.1%
2022 10
16.1%
2023 10
16.1%
ValueCountFrequency (%)
2023 10
16.1%
2022 10
16.1%
2021 10
16.1%
2020 10
16.1%
2019 11
17.7%
2018 11
17.7%

업종
Categorical

Distinct14
Distinct (%)22.6%
Missing0
Missing (%)0.0%
Memory size628.0 B
일반음식점
식육판매업
가공업체
통신판매업체
노점상
Other values (9)
32 

Length

Max length9
Median length6
Mean length4.4516129
Min length2

Unique

Unique1 ?
Unique (%)1.6%

Sample

1st row일반음식점
2nd row식육판매업
3rd row가공업체
4th row통신판매업체
5th row집단급식소

Common Values

ValueCountFrequency (%)
일반음식점 6
9.7%
식육판매업 6
9.7%
가공업체 6
9.7%
통신판매업체 6
9.7%
노점상 6
9.7%
휴게음식점 6
9.7%
슈퍼 5
8.1%
식품유통업 4
6.5%
도매상 4
6.5%
제과점영업 4
6.5%
Other values (4) 9
14.5%

Length

2024-03-23T07:22:19.222782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반음식점 6
9.7%
식육판매업 6
9.7%
가공업체 6
9.7%
통신판매업체 6
9.7%
노점상 6
9.7%
휴게음식점 6
9.7%
슈퍼 5
8.1%
식품유통업 4
6.5%
도매상 4
6.5%
제과점영업 4
6.5%
Other values (4) 9
14.5%
Distinct48
Distinct (%)77.4%
Missing0
Missing (%)0.0%
Memory size628.0 B
2024-03-23T07:22:19.864231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length2
Mean length2.3225806
Min length1

Characters and Unicode

Total characters144
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)59.7%

Sample

1st row1633
2nd row244
3rd row197
4th row59
5th row37
ValueCountFrequency (%)
20 3
 
4.8%
21 3
 
4.8%
30 3
 
4.8%
29 2
 
3.2%
15 2
 
3.2%
176 2
 
3.2%
19 2
 
3.2%
27 2
 
3.2%
37 2
 
3.2%
28 2
 
3.2%
Other values (38) 39
62.9%
2024-03-23T07:22:21.278959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 32
22.2%
2 20
13.9%
7 15
10.4%
9 15
10.4%
3 13
9.0%
6 11
 
7.6%
4 11
 
7.6%
0 10
 
6.9%
5 9
 
6.2%
8 7
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 143
99.3%
Other Punctuation 1
 
0.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 32
22.4%
2 20
14.0%
7 15
10.5%
9 15
10.5%
3 13
9.1%
6 11
 
7.7%
4 11
 
7.7%
0 10
 
7.0%
5 9
 
6.3%
8 7
 
4.9%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 144
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 32
22.2%
2 20
13.9%
7 15
10.4%
9 15
10.4%
3 13
9.0%
6 11
 
7.6%
4 11
 
7.6%
0 10
 
6.9%
5 9
 
6.2%
8 7
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 144
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 32
22.2%
2 20
13.9%
7 15
10.4%
9 15
10.4%
3 13
9.0%
6 11
 
7.6%
4 11
 
7.6%
0 10
 
6.9%
5 9
 
6.2%
8 7
 
4.9%
Distinct52
Distinct (%)83.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean132.74194
Minimum9
Maximum806
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size690.0 B
2024-03-23T07:22:21.785652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9
5-th percentile18
Q127.25
median50
Q3128.5
95-th percentile649.55
Maximum806
Range797
Interquartile range (IQR)101.25

Descriptive statistics

Standard deviation197.91917
Coefficient of variation (CV)1.4910071
Kurtosis5.0998189
Mean132.74194
Median Absolute Deviation (MAD)29.5
Skewness2.4408429
Sum8230
Variance39171.998
MonotonicityNot monotonic
2024-03-23T07:22:22.510192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33 4
 
6.5%
24 3
 
4.8%
39 2
 
3.2%
18 2
 
3.2%
49 2
 
3.2%
23 2
 
3.2%
20 2
 
3.2%
652 1
 
1.6%
92 1
 
1.6%
78 1
 
1.6%
Other values (42) 42
67.7%
ValueCountFrequency (%)
9 1
 
1.6%
12 1
 
1.6%
16 1
 
1.6%
18 2
3.2%
19 1
 
1.6%
20 2
3.2%
21 1
 
1.6%
22 1
 
1.6%
23 2
3.2%
24 3
4.8%
ValueCountFrequency (%)
806 1
1.6%
796 1
1.6%
746 1
1.6%
652 1
1.6%
603 1
1.6%
559 1
1.6%
296 1
1.6%
234 1
1.6%
233 1
1.6%
212 1
1.6%

Interactions

2024-03-23T07:22:16.862036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:22:16.041583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:22:17.202717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-23T07:22:16.276205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-23T07:22:22.902034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도업종거짓표시 적발실적(개소)미표시 적발실적(개소)
연도1.0000.0000.5480.000
업종0.0001.0000.6880.770
거짓표시 적발실적(개소)0.5480.6881.0000.999
미표시 적발실적(개소)0.0000.7700.9991.000
2024-03-23T07:22:23.133940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연도미표시 적발실적(개소)업종
연도1.000-0.1760.000
미표시 적발실적(개소)-0.1761.0000.448
업종0.0000.4481.000

Missing values

2024-03-23T07:22:17.588407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T07:22:17.914410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도업종거짓표시 적발실적(개소)미표시 적발실적(개소)
02018일반음식점1633652
12018식육판매업244152
22018가공업체197212
32018통신판매업체5929
42018집단급식소3733
52018노점상3762
62018슈퍼2851
72018휴게음식점2828
82018식품유통업2723
92018도매상2123
연도업종거짓표시 적발실적(개소)미표시 적발실적(개소)
522023일반음식점909796
532023가공업체158130
542023식육판매업113110
552023통신판매업체4422
562023노점상1749
572023휴게음식점3042
582023식육즉석판매가공업1418
592023도매상2033
602023슈퍼718
612023제과점영업79