Overview

Dataset statistics

Number of variables: 8
Number of observations: 2674
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 182.9 KiB
Average record size in memory: 70.0 B
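
The average record size can be checked against the other two figures. This is a minimal sketch, assuming 1 KiB is 1024 bytes and that the average is simply the total size divided by the number of observations; the variable names are illustrative and not taken from the report.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 182.9
n_observations = 2674

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B")  # prints "70.0 B"
```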

Variable types

Text: 1
Categorical: 6
Numeric: 1

Dataset

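Description: Enforcement data on suppliers of rice for processing, collected in connection with agricultural product distribution. It covers use outside the designated purpose, country-of-origin labeling, keeping of the management ledger, and related checks (enforcement year-month, province/city name, number of inspections, number of violating businesses, number of cases of use outside the designated purpose, number of labeling violations, number of cases where the management ledger was not kept, other).
Author: National Agricultural Products Quality Management Service (국립농산물품질관리원)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20170912000000000790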

Alerts

위반업 건수 has constant value "0" (Constant)
위반업 건수 - 지정용도 외 양곡사용 처분건 has constant value "0" (Constant)
위반업 건수 - 관리대장미비치 has constant value "0" (Constant)
위반업 건수 - 표시 위반 건수 has constant value "0" (Constant)
위반업 건수 - 기타 위반 건수 has constant value "0" (Constant)
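
A constant-value alert means that a column holds a single distinct value across all rows. The sketch below shows one way to detect that with the standard library only. The function name `constant_columns` and the three rows are illustrative assumptions, shaped like the sample rows at the end of this report; this is not how ydata-profiling itself implements the check.

```python
def constant_columns(rows):
    """Return the column names whose values are identical in every row."""
    if not rows:
        return []
    return [
        name
        for name in rows[0]
        if len({row[name] for row in rows}) == 1
    ]


# Three illustrative rows (hypothetical subset of the columns).
rows = [
    {"시도명": "전라남도", "조사건수": 1, "위반업 건수": "0"},
    {"시도명": "충청북도", "조사건수": 1, "위반업 건수": "0"},
    {"시도명": "경기도", "조사건수": 7, "위반업 건수": "0"},
]

print(constant_columns(rows))  # prints ['위반업 건수']
```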

Reproduction

Analysis started: 2024-03-23 07:23:29.898718
Analysis finished: 2024-03-23 07:23:30.763490
Duration: 0.86 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
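
The reported duration is the difference between the two timestamps. A minimal sketch using only the standard library, with the timestamps copied from this section:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-03-23 07:23:29.898718")
finished = datetime.fromisoformat("2024-03-23 07:23:30.763490")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # prints "0.86 seconds"
```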

Variables

단속년월
Text

Distinct: 245
Distinct (%): 9.2%
Missing: 0
Missing (%): 0.0%
Memory size: 21.0 KiB

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Characters and Unicode

Total characters: 16044
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
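
As an illustration of such character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. This is an assumption for illustration, not necessarily the library the report used. The function name `category_counts` and the third input value (which contains a space) are hypothetical. `Nd` is the code for Decimal Number and `Zs` for Space Separator.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over all characters in `values`."""
    return Counter(
        unicodedata.category(ch) for value in values for ch in value
    )


# The last value is a hypothetical entry that contains a space.
values = ["199912", "200009", "20010 "]
counts = category_counts(values)

print(counts["Nd"], counts["Zs"])  # prints "17 1"
```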

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: 199912
2nd row: 200009
3rd row: 200009
4th row: 200101
5th row: 200101
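
The sample values are six characters long and look like a year followed by a month, which matches the "enforcement year-month" field in the dataset description. The character statistics for this variable also report one Space Separator character, so at least one value is not six digits. The sketch below validates and splits such strings. The `YYYYMM` layout is an assumption, and the function name and the malformed input are hypothetical.

```python
import re

# Assumed layout: four-digit year followed by a two-digit month (01-12).
YYYYMM = re.compile(r"(\d{4})(0[1-9]|1[0-2])")


def parse_year_month(value):
    """Return (year, month) for a 'YYYYMM' string, or None if it does not match."""
    match = YYYYMM.fullmatch(value)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


print(parse_year_month("199912"))  # prints (1999, 12)
print(parse_year_month("20010 "))  # prints None (hypothetical malformed value)
```

Returning `None` instead of raising keeps malformed entries easy to count before deciding how to handle them.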
Value | Count | Frequency (%)
201009 | 17 | 0.6%
201104 | 17 | 0.6%
201101 | 17 | 0.6%
201301 | 17 | 0.6%
201302 | 17 | 0.6%
200503 | 16 | 0.6%
201208 | 16 | 0.6%
200507 | 16 | 0.6%
201401 | 16 | 0.6%
201109 | 16 | 0.6%
Other values (236) | 2510 | 93.8%

Most occurring characters

Value | Count | Frequency (%)
0 | 6333 | 39.5%
2 | 3509 | 21.9%
1 | 2732 | 17.0%
5 | 537 | 3.3%
9 | 520 | 3.2%
6 | 517 | 3.2%
8 | 514 | 3.2%
7 | 512 | 3.2%
4 | 442 | 2.8%
3 | 427 | 2.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 16043 | > 99.9%
Space Separator | 1 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 6333 | 39.5%
2 | 3509 | 21.9%
1 | 2732 | 17.0%
5 | 537 | 3.3%
9 | 520 | 3.2%
6 | 517 | 3.2%
8 | 514 | 3.2%
7 | 512 | 3.2%
4 | 442 | 2.8%
3 | 427 | 2.7%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 16044 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 6333 | 39.5%
2 | 3509 | 21.9%
1 | 2732 | 17.0%
5 | 537 | 3.3%
9 | 520 | 3.2%
6 | 517 | 3.2%
8 | 514 | 3.2%
7 | 512 | 3.2%
4 | 442 | 2.8%
3 | 427 | 2.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 16044 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 6333 | 39.5%
2 | 3509 | 21.9%
1 | 2732 | 17.0%
5 | 537 | 3.3%
9 | 520 | 3.2%
6 | 517 | 3.2%
8 | 514 | 3.2%
7 | 512 | 3.2%
4 | 442 | 2.8%
3 | 427 | 2.7%

시도명
Categorical

Distinct: 18
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 21.0 KiB

Value | Count
경기도 | 241
강원도 | 234
충청남도 | 228
경상남도 | 218
전라남도 | 204
Other values (13) | 1549

Length

Max length: 7
Median length: 5
Mean length: 4.2524308
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전라남도
2nd row: 충청북도
3rd row: 전라남도
4th row: 인천광역시
5th row: 경기도

Common Values

Value | Count | Frequency (%)
경기도 | 241 | 9.0%
강원도 | 234 | 8.8%
충청남도 | 228 | 8.5%
경상남도 | 218 | 8.2%
전라남도 | 204 | 7.6%
경상북도 | 202 | 7.6%
충청북도 | 195 | 7.3%
전라북도 | 190 | 7.1%
인천광역시 | 167 | 6.2%
서울특별시 | 147 | 5.5%
Other values (8) | 648 | 24.2%
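
Each frequency is the value's count divided by the number of observations (2674), shown to one decimal place. A minimal sketch using three of the counts above; the function name `frequency_percent` is illustrative.

```python
def frequency_percent(count, total):
    """Return the share of `count` in `total` as a percentage."""
    return 100 * count / total


n_observations = 2674
counts = {"경기도": 241, "강원도": 234, "충청남도": 228}

for name, count in counts.items():
    print(f"{name}: {frequency_percent(count, n_observations):.1f}%")
```

The printed shares (9.0%, 8.8%, 8.5%) agree with the table.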

Length

[Figure: Histogram of lengths of the category]

조사건수
Real number (ℝ)

Distinct: 115
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18.580404
Minimum: 1
Maximum: 229
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 23.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 5
Median: 13
Q3: 24
95-th percentile: 57
Maximum: 229
Range: 228
Interquartile range (IQR): 19

Descriptive statistics

Standard deviation: 20.387099
Coefficient of variation (CV): 1.0972366
Kurtosis: 12.571209
Mean: 18.580404
Median Absolute Deviation (MAD): 9
Skewness: 2.7949401
Sum: 49684
Variance: 415.63382
Monotonicity: Not monotonic
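
Several of these statistics are derived from one another, so the reported figures can be cross-checked. The sketch below tests the standard relationships (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum) against the values in this report. The tolerance is an assumption chosen to absorb the rounding in the printed figures.

```python
import math

# Values copied from the report for this variable.
n = 2674
total = 49684
mean = 18.580404
std = 20.387099
variance = 415.63382
cv = 1.0972366
q1, q3 = 5, 24
minimum, maximum = 1, 229

# Assumed tolerance for the rounded figures.
REL_TOL = 1e-6

checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=REL_TOL),
    "variance = std ** 2": math.isclose(std ** 2, variance, rel_tol=REL_TOL),
    "cv = std / mean": math.isclose(std / mean, cv, rel_tol=REL_TOL),
    "iqr = q3 - q1": q3 - q1 == 19,
    "range = max - min": maximum - minimum == 228,
}

for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")
```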
[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
1 | 190 | 7.1%
3 | 147 | 5.5%
4 | 135 | 5.0%
2 | 123 | 4.6%
5 | 109 | 4.1%
7 | 103 | 3.9%
6 | 102 | 3.8%
13 | 98 | 3.7%
10 | 89 | 3.3%
12 | 87 | 3.3%
Other values (105) | 1491 | 55.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 190 | 7.1%
2 | 123 | 4.6%
3 | 147 | 5.5%
4 | 135 | 5.0%
5 | 109 | 4.1%
6 | 102 | 3.8%
7 | 103 | 3.9%
8 | 78 | 2.9%
9 | 77 | 2.9%
10 | 89 | 3.3%

Maximum 10 values

Value | Count | Frequency (%)
229 | 1 | < 0.1%
178 | 1 | < 0.1%
169 | 1 | < 0.1%
147 | 1 | < 0.1%
141 | 2 | 0.1%
132 | 1 | < 0.1%
125 | 2 | 0.1%
124 | 1 | < 0.1%
122 | 2 | 0.1%
121 | 1 | < 0.1%

위반업 건수
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 21.0 KiB

Value | Count
0 | 2674

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 2674 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

위반업 건수 - 지정용도 외 양곡사용 처분건
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 21.0 KiB

Value | Count
0 | 2674

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 2674 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

위반업 건수 - 관리대장미비치
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 21.0 KiB

Value | Count
0 | 2674

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 2674 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

위반업 건수 - 표시 위반 건수
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 21.0 KiB

Value | Count
0 | 2674

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 2674 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

위반업 건수 - 기타 위반 건수
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 21.0 KiB

Value | Count
0 | 2674

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 2674 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation plot]
Variable | 시도명 | 조사건수
시도명 | 1.000 | 0.465
조사건수 | 0.465 | 1.000

[Figure: correlation plot]
Variable | 조사건수 | 시도명
조사건수 | 1.000 | 0.172
시도명 | 0.172 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

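First rows

# | 단속년월 | 시도명 | 조사건수 | 위반업 건수 | 위반업 건수 - 지정용도 외 양곡사용 처분건 | 위반업 건수 - 관리대장미비치 | 위반업 건수 - 표시 위반 건수 | 위반업 건수 - 기타 위반 건수
0 | 199912 | 전라남도 | 1 | 0 | 0 | 0 | 0 | 0
1 | 200009 | 충청북도 | 1 | 0 | 0 | 0 | 0 | 0
2 | 200009 | 전라남도 | 1 | 0 | 0 | 0 | 0 | 0
3 | 200101 | 인천광역시 | 1 | 0 | 0 | 0 | 0 | 0
4 | 200101 | 경기도 | 7 | 0 | 0 | 0 | 0 | 0
5 | 200101 | 강원도 | 9 | 0 | 0 | 0 | 0 | 0
6 | 200101 | 충청북도 | 22 | 0 | 0 | 0 | 0 | 0
7 | 200101 | 충청남도 | 4 | 0 | 0 | 0 | 0 | 0
8 | 200101 | 전라남도 | 2 | 0 | 0 | 0 | 0 | 0
9 | 200101 | 경상북도 | 12 | 0 | 0 | 0 | 0 | 0

Last rows

# | 단속년월 | 시도명 | 조사건수 | 위반업 건수 | 위반업 건수 - 지정용도 외 양곡사용 처분건 | 위반업 건수 - 관리대장미비치 | 위반업 건수 - 표시 위반 건수 | 위반업 건수 - 기타 위반 건수
2664 | 202102 | 세종특별자치시 | 3 | 0 | 0 | 0 | 0 | 0
2665 | 202102 | 경기도 | 18 | 0 | 0 | 0 | 0 | 0
2666 | 202102 | 강원도 | 21 | 0 | 0 | 0 | 0 | 0
2667 | 202102 | 충청북도 | 21 | 0 | 0 | 0 | 0 | 0
2668 | 202102 | 충청남도 | 10 | 0 | 0 | 0 | 0 | 0
2669 | 202102 | 전라북도 | 18 | 0 | 0 | 0 | 0 | 0
2670 | 202102 | 전라남도 | 12 | 0 | 0 | 0 | 0 | 0
2671 | 202102 | 경상북도 | 19 | 0 | 0 | 0 | 0 | 0
2672 | 202102 | 경상남도 | 9 | 0 | 0 | 0 | 0 | 0
2673 | 202102 | 제주특별자치도 | 3 | 0 | 0 | 0 | 0 | 0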