Overview

Dataset statistics

Number of variables: 7
Number of observations: 72
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.1 KiB
Average record size in memory: 58.8 B

Variable types

Text: 3
Categorical: 3
Numeric: 1

Dataset

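Description: Item code information by classification, provided by the Forest Policy Project Management System (산림정책사업관리시스템)
Author: Korea Forest Service (산림청)
URL: https://www.data.go.kr/data/15071644/fileData.do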

Alerts

High correlation: 대분류코드명 is highly overall correlated with 대분류코드값
High correlation: 대분류코드값 is highly overall correlated with 대분류코드명
Unique: 품목코드 has unique values
Unique: 중분류코드명 has unique values
Unique: 소분류코드명 has unique values
Zeros: 소분류코드값 has 44 (61.1%) zeros

Reproduction

Analysis started: 2023-12-11 22:44:50.970264
Analysis finished: 2023-12-11 22:44:52.840547
Duration: 1.87 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

품목코드
Text

UNIQUE 

Distinct: 72
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 6
Median length: 6
Mean length: 5.875
Min length: 5

Characters and Unicode

Total characters: 423
Distinct characters: 14
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
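
As a rough illustration of how such per-character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` and the five-value sample are assumptions made for illustration (the values are taken from the Sample table of this report); the profiler's own implementation may differ.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            # 'Nd' is Decimal Number, 'Lu' is Uppercase Letter
            counts[unicodedata.category(ch)] += 1
    return counts


# Illustrative subset of 품목코드 values copied from the Sample table
sample_codes = ["70900", "140200", "191200", "19AB00", "19V000"]
counts = category_counts(sample_codes)
print(dict(counts))
```

On the full column the same tally would be expected to yield the two categories reported here, Decimal Number and Uppercase Letter.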

Unique

Unique: 72
Unique (%): 100.0%

Sample

1st row: 70900
2nd row: 140200
3rd row: 191200
4th row: 191300
5th row: 191400

Value | Count | Frequency (%)
70900 | 1 | 1.4%
140200 | 1 | 1.4%
193400 | 1 | 1.4%
170800 | 1 | 1.4%
170400 | 1 | 1.4%
70200 | 1 | 1.4%
141200 | 1 | 1.4%
140900 | 1 | 1.4%
70300 | 1 | 1.4%
140700 | 1 | 1.4%
Other values (62) | 62 | 86.1%

Most occurring characters

Value | Count | Frequency (%)
0 | 148 | 35.0%
1 | 93 | 22.0%
X | 56 | 13.2%
7 | 26 | 6.1%
9 | 25 | 5.9%
2 | 21 | 5.0%
4 | 20 | 4.7%
3 | 11 | 2.6%
8 | 9 | 2.1%
5 | 5 | 1.2%
Other values (4) | 9 | 2.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 363 | 85.8%
Uppercase Letter | 60 | 14.2%

Most frequent character per category

Decimal Number

Value | Count | Frequency (%)
0 | 148 | 40.8%
1 | 93 | 25.6%
7 | 26 | 7.2%
9 | 25 | 6.9%
2 | 21 | 5.8%
4 | 20 | 5.5%
3 | 11 | 3.0%
8 | 9 | 2.5%
5 | 5 | 1.4%
6 | 5 | 1.4%

Uppercase Letter

Value | Count | Frequency (%)
X | 56 | 93.3%
B | 2 | 3.3%
A | 1 | 1.7%
V | 1 | 1.7%

Most occurring scripts

Value | Count | Frequency (%)
Common | 363 | 85.8%
Latin | 60 | 14.2%

Most frequent character per script

Common

Value | Count | Frequency (%)
0 | 148 | 40.8%
1 | 93 | 25.6%
7 | 26 | 7.2%
9 | 25 | 6.9%
2 | 21 | 5.8%
4 | 20 | 5.5%
3 | 11 | 3.0%
8 | 9 | 2.5%
5 | 5 | 1.4%
6 | 5 | 1.4%

Latin

Value | Count | Frequency (%)
X | 56 | 93.3%
B | 2 | 3.3%
A | 1 | 1.7%
V | 1 | 1.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 423 | 100.0%

Most frequent character per block

ASCII

Value | Count | Frequency (%)
0 | 148 | 35.0%
1 | 93 | 22.0%
X | 56 | 13.2%
7 | 26 | 6.1%
9 | 25 | 5.9%
2 | 21 | 5.0%
4 | 20 | 4.7%
3 | 11 | 2.6%
8 | 9 | 2.1%
5 | 5 | 1.2%
Other values (4) | 9 | 2.1%

대분류코드명
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 5
Median length: 5
Mean length: 4.0555556
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수실류
2nd row: 산채류
3rd row: 약용작물류
4th row: 약용작물류
5th row: 약용작물류

Common Values

Value | Count | Frequency (%)
약용작물류 | 38 | 52.8%
수실류 | 14 | 19.4%
산채류 | 12 | 16.7%
버섯류 | 8 | 11.1%
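
The frequency and length figures for this variable can be cross-checked from the counts alone. The following is a minimal sketch that rebuilds the column from the Common Values counts and recomputes the percentages and length statistics with the standard library; the variable names are illustrative and not part of the report.

```python
from collections import Counter
from statistics import mean, median

# Counts copied from the Common Values table; the column is rebuilt from them.
value_counts = {"약용작물류": 38, "수실류": 14, "산채류": 12, "버섯류": 8}
column = [value for value, n in value_counts.items() for _ in range(n)]

counts = Counter(column)
total = len(column)

# Frequency (%) is the share of rows per value, rounded to one decimal place
frequencies = {v: round(100 * n / total, 1) for v, n in counts.most_common()}

# Length statistics are taken over the string length of every row
lengths = [len(value) for value in column]
length_stats = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": round(mean(lengths), 7),
    "min": min(lengths),
}
print(frequencies)
print(length_stats)
```

Under these counts the recomputed values agree with the reported frequencies and with the Length block (max 5, median 5, mean 4.0555556, min 3).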

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
약용작물류 | 38 | 52.8%
수실류 | 14 | 19.4%
산채류 | 12 | 16.7%
버섯류 | 8 | 11.1%

대분류코드값
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 6.9%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 2
Median length: 2
Mean length: 1.875
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 7
2nd row: 14
3rd row: 19
4th row: 19
5th row: 19

Common Values

Value | Count | Frequency (%)
XX | 28 | 38.9%
19 | 19 | 26.4%
14 | 10 | 13.9%
7 | 9 | 12.5%
17 | 6 | 8.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
XX | 28 | 38.9%
19 | 19 | 26.4%
14 | 10 | 13.9%
7 | 9 | 12.5%
17 | 6 | 8.3%

중분류코드명
Text

UNIQUE 

Distinct: 72
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 10
Median length: 6
Mean length: 2.9444444
Min length: 1

Characters and Unicode

Total characters: 212
Distinct characters: 115
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 72
Unique (%): 100.0%

Sample

1st row: 다래
2nd row: 고사리
3rd row: 시호
4th row: 목단
5th row: 작약

Value | Count | Frequency (%)
다래 | 1 | 1.4%
고사리 | 1 | 1.4%
두충 | 1 | 1.4%
석이 | 1 | 1.4%
표고버섯 | 1 | 1.4%
대추 | 1 | 1.4%
산마늘 | 1 | 1.4%
더덕 | 1 | 1.4%
 | 1 | 1.4%
도라지 | 1 | 1.4%
Other values (62) | 62 | 86.1%

Most occurring characters

ValueCountFrequency (%)
14
 
6.6%
11
 
5.2%
7
 
3.3%
6
 
2.8%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (105) 148
69.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 208 | 98.1%
Close Punctuation | 2 | 0.9%
Open Punctuation | 2 | 0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
6.7%
11
 
5.3%
7
 
3.4%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (103) 144
69.2%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 208 | 98.1%
Common | 4 | 1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
6.7%
11
 
5.3%
7
 
3.4%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (103) 144
69.2%
Common
ValueCountFrequency (%)
) 2
50.0%
( 2
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 208 | 98.1%
ASCII | 4 | 1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
 
6.7%
11
 
5.3%
7
 
3.4%
6
 
2.9%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (103) 144
69.2%
ASCII
ValueCountFrequency (%)
) 2
50.0%
( 2
50.0%

품목중분류코드값
Categorical

Distinct: 30
Distinct (%): 41.7%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 2
Median length: 1
Mean length: 1.3611111
Min length: 1

Unique

Unique: 18
Unique (%): 25.0%

Sample

1st row: 9
2nd row: 2
3rd row: 12
4th row: 13
5th row: 14

Common Values

Value | Count | Frequency (%)
1 | 30 | 41.7%
13 | 3 | 4.2%
7 | 3 | 4.2%
2 | 2 | 2.8%
12 | 2 | 2.8%
11 | 2 | 2.8%
21 | 2 | 2.8%
3 | 2 | 2.8%
6 | 2 | 2.8%
4 | 2 | 2.8%
Other values (20) | 22 | 30.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1 | 30 | 41.7%
7 | 3 | 4.2%
13 | 3 | 4.2%
3 | 2 | 2.8%
9 | 2 | 2.8%
4 | 2 | 2.8%
6 | 2 | 2.8%
8 | 2 | 2.8%
21 | 2 | 2.8%
11 | 2 | 2.8%
Other values (20) | 22 | 30.6%

소분류코드명
Text

UNIQUE 

Distinct: 72
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 10
Median length: 6
Mean length: 2.9583333
Min length: 1

Characters and Unicode

Total characters: 213
Distinct characters: 117
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 72
Unique (%): 100.0%

Sample

1st row: 다래
2nd row: 고사리
3rd row: 시호
4th row: 목단
5th row: 작약

Value | Count | Frequency (%)
다래 | 1 | 1.4%
고사리 | 1 | 1.4%
두충 | 1 | 1.4%
석이 | 1 | 1.4%
표고버섯 | 1 | 1.4%
대추 | 1 | 1.4%
산마늘 | 1 | 1.4%
더덕 | 1 | 1.4%
 | 1 | 1.4%
도라지 | 1 | 1.4%
Other values (62) | 62 | 86.1%

Most occurring characters

ValueCountFrequency (%)
13
 
6.1%
11
 
5.2%
7
 
3.3%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (107) 149
70.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 209 | 98.1%
Close Punctuation | 2 | 0.9%
Open Punctuation | 2 | 0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
6.2%
11
 
5.3%
7
 
3.3%
6
 
2.9%
5
 
2.4%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (105) 145
69.4%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 209 | 98.1%
Common | 4 | 1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
6.2%
11
 
5.3%
7
 
3.3%
6
 
2.9%
5
 
2.4%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (105) 145
69.4%
Common
ValueCountFrequency (%)
) 2
50.0%
( 2
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 209 | 98.1%
ASCII | 4 | 1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
6.2%
11
 
5.3%
7
 
3.3%
6
 
2.9%
5
 
2.4%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (105) 145
69.4%
ASCII
ValueCountFrequency (%)
) 2
50.0%
( 2
50.0%

소분류코드값
Real number (ℝ)

ZEROS 

Distinct: 29
Distinct (%): 40.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.6388889
Minimum: 0
Maximum: 28
Zeros: 44
Zeros (%): 61.1%
Negative: 0
Negative (%): 0.0%
Memory size: 780.0 B

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 10.25
95-th percentile: 24.45
Maximum: 28
Range: 28
Interquartile range (IQR): 10.25

Descriptive statistics

Standard deviation: 8.7408794
Coefficient of variation (CV): 1.5501067
Kurtosis: 0.26137875
Mean: 5.6388889
Median Absolute Deviation (MAD): 0
Skewness: 1.3050253
Sum: 406
Variance: 76.402973
Monotonicity: Not monotonic
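
Several of these figures can be reproduced with Python's `statistics` module. The sketch below assumes the column consists of 44 zeros plus each integer from 1 to 28 exactly once; that reconstruction is an assumption, although it is consistent with the reported count (72), distinct values (29), and sum (406). It also assumes sample variance (n - 1 denominator) and linearly interpolated quantiles, which is what the reported values appear to match.

```python
from statistics import mean, median, quantiles, stdev, variance

# Assumption: 44 zeros plus the integers 1..28 once each (72 values, sum 406)
values = [0] * 44 + list(range(1, 29))

n = len(values)
avg = mean(values)
var = variance(values)  # sample variance, n - 1 denominator
sd = stdev(values)
cv = sd / avg           # coefficient of variation

# Inclusive quantiles interpolate linearly between order statistics
q1, q2, q3 = quantiles(values, n=4, method="inclusive")
p95 = quantiles(values, n=20, method="inclusive")[-1]
iqr = q3 - q1

# Median absolute deviation around the median
mad = median(abs(v - q2) for v in values)

print(n, sum(values), avg, var, sd, cv)
print(q1, q2, q3, p95, iqr, mad)
```

Skewness and kurtosis are left out because the standard library has no direct function for them.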
Histogram with fixed size bins (bins=29)

Value | Count | Frequency (%)
0 | 44 | 61.1%
1 | 1 | 1.4%
28 | 1 | 1.4%
27 | 1 | 1.4%
26 | 1 | 1.4%
25 | 1 | 1.4%
24 | 1 | 1.4%
23 | 1 | 1.4%
22 | 1 | 1.4%
21 | 1 | 1.4%
Other values (19) | 19 | 26.4%

Value | Count | Frequency (%)
0 | 44 | 61.1%
1 | 1 | 1.4%
2 | 1 | 1.4%
3 | 1 | 1.4%
4 | 1 | 1.4%
5 | 1 | 1.4%
6 | 1 | 1.4%
7 | 1 | 1.4%
8 | 1 | 1.4%
9 | 1 | 1.4%

Value | Count | Frequency (%)
28 | 1 | 1.4%
27 | 1 | 1.4%
26 | 1 | 1.4%
25 | 1 | 1.4%
24 | 1 | 1.4%
23 | 1 | 1.4%
22 | 1 | 1.4%
21 | 1 | 1.4%
20 | 1 | 1.4%
19 | 1 | 1.4%

Interactions


Correlations

Variable | 품목코드 | 대분류코드명 | 대분류코드값 | 중분류코드명 | 품목중분류코드값 | 소분류코드명 | 소분류코드값
품목코드 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
대분류코드명 | 1.000 | 1.000 | 0.857 | 1.000 | 0.000 | 1.000 | 0.454
대분류코드값 | 1.000 | 0.857 | 1.000 | 1.000 | 0.830 | 1.000 | 0.660
중분류코드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
품목중분류코드값 | 1.000 | 0.000 | 0.830 | 1.000 | 1.000 | 1.000 | 0.000
소분류코드명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소분류코드값 | 1.000 | 0.454 | 0.660 | 1.000 | 0.000 | 1.000 | 1.000

Variable | 대분류코드명 | 품목중분류코드값 | 대분류코드값
대분류코드명 | 1.000 | 0.000 | 0.832
품목중분류코드값 | 0.000 | 1.000 | 0.410
대분류코드값 | 0.832 | 0.410 | 1.000

Variable | 소분류코드값 | 대분류코드명 | 대분류코드값 | 품목중분류코드값
소분류코드값 | 1.000 | 0.272 | 0.318 | 0.000
대분류코드명 | 0.272 | 1.000 | 0.832 | 0.000
대분류코드값 | 0.318 | 0.832 | 1.000 | 0.410
품목중분류코드값 | 0.000 | 0.000 | 0.410 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

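First rows

Index | 품목코드 | 대분류코드명 | 대분류코드값 | 중분류코드명 | 품목중분류코드값 | 소분류코드명 | 소분류코드값
0 | 70900 | 수실류 | 7 | 다래 | 9 | 다래 | 0
1 | 140200 | 산채류 | 14 | 고사리 | 2 | 고사리 | 0
2 | 191200 | 약용작물류 | 19 | 시호 | 12 | 시호 | 0
3 | 191300 | 약용작물류 | 19 | 목단 | 13 | 목단 | 0
4 | 191400 | 약용작물류 | 19 | 작약 | 14 | 작약 | 0
5 | 191700 | 약용작물류 | 19 | 천궁 | 17 | 천궁 | 0
6 | 192700 | 약용작물류 | 19 | 결명자 | 27 | 결명자 | 0
7 | 192800 | 약용작물류 | 19 | 구기자 | 28 | 구기자 | 0
8 | 193000 | 약용작물류 | 19 | 오미자 | 30 | 오미자 | 0
9 | 19AB00 | 약용작물류 | 19 | 산양삼 | AB | 산양삼 | 0

Last rows

Index | 품목코드 | 대분류코드명 | 대분류코드값 | 중분류코드명 | 품목중분류코드값 | 소분류코드명 | 소분류코드값
62 | 192100 | 약용작물류 | 19 | 하수오 | 21 | 하수오 | 0
63 | 192900 | 약용작물류 | 19 | 산수유 | 29 | 산수유 | 0
64 | 193800 | 약용작물류 | 19 | 구절초 | 38 | 구절초 | 0
65 | 194700 | 약용작물류 | 19 | 삼지구엽초 | 47 | 삼지구엽초 | 0
66 | 194800 | 약용작물류 | 19 |  | 48 |  | 0
67 | 195100 | 약용작물류 | 19 | 독활 | 51 | 독활 | 0
68 | 197700 | 약용작물류 | 19 | 산초 | 77 | 산초 | 0
69 | 19B400 | 약용작물류 | 19 | 감초 | B4 | 감초 | 0
70 | 70600 | 수실류 | 7 | 도토리 | 6 | 도토리 | 0
71 | 19V000 | 약용작물류 | 19 | 헛개나무 | V0 | 헛개나무 | 0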
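
In the rows shown above, 품목코드 looks like a concatenation of 대분류코드값, a two-character 품목중분류코드값, and a two-digit 소분류코드값. The sketch below encodes that pattern. It is an inference from these sample rows only, not a documented rule of the source system: `compose_item_code` is a hypothetical helper, and because every row shown has 소분류코드값 0, the two-digit padding of that field is a guess.

```python
def compose_item_code(major: str, middle: str, minor: int) -> str:
    """Join category codes into an item code (hypothetical helper).

    Assumed layout, inferred from the sample rows only:
    major code as-is + middle code left-padded to 2 chars + minor code as 2 digits.
    """
    return f"{major}{middle.zfill(2)}{minor:02d}"


# (대분류코드값, 품목중분류코드값, 소분류코드값) triples copied from the sample rows
rows = [("7", "9", 0), ("14", "2", 0), ("19", "AB", 0), ("19", "V0", 0)]
codes = [compose_item_code(*row) for row in rows]
print(codes)
```

Rows outside the sample, such as those with 대분류코드값 XX or a non-zero 소분류코드값, are not visible here and may not follow the same layout.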