Overview

Dataset statistics

Number of variables3
Number of observations334
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.3 KiB
Average record size in memory25.4 B

Variable types

Numeric1
Text1
Categorical1

Dataset

Description산림청 국립산림품종관리센터에서 실시한 특성조사요령(Test Guideline) 발간 목록입니다. 번호, 종명, 발행년월 등의 정보를 포함합니다.
URLhttps://www.data.go.kr/data/15064353/fileData.do

Alerts

번호 is highly overall correlated with 발행년월High correlation
발행년월 is highly overall correlated with 번호High correlation
번호 has unique valuesUnique
종명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:38:03.144488
Analysis finished2023-12-12 06:38:03.745841
Duration0.6 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct334
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean167.5
Minimum1
Maximum334
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.1 KiB
2023-12-12T15:38:03.819632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile17.65
Q184.25
median167.5
Q3250.75
95-th percentile317.35
Maximum334
Range333
Interquartile range (IQR)166.5

Descriptive statistics

Standard deviation96.561725
Coefficient of variation (CV)0.57648791
Kurtosis-1.2
Mean167.5
Median Absolute Deviation (MAD)83.5
Skewness0
Sum55945
Variance9324.1667
MonotonicityStrictly increasing
2023-12-12T15:38:04.031268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
231 1
 
0.3%
229 1
 
0.3%
228 1
 
0.3%
227 1
 
0.3%
226 1
 
0.3%
225 1
 
0.3%
224 1
 
0.3%
223 1
 
0.3%
222 1
 
0.3%
Other values (324) 324
97.0%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
334 1
0.3%
333 1
0.3%
332 1
0.3%
331 1
0.3%
330 1
0.3%
329 1
0.3%
328 1
0.3%
327 1
0.3%
326 1
0.3%
325 1
0.3%

종명
Text

UNIQUE 

Distinct334
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-12T15:38:04.360188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length3.7694611
Min length1

Characters and Unicode

Total characters1259
Distinct characters324
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique334 ?
Unique (%)100.0%

Sample

1st row표고버섯
2nd row밤나무
3rd row백운풀
4th row황해쑥
5th row천마
ValueCountFrequency (%)
표고버섯 1
 
0.3%
멀꿀 1
 
0.3%
개비자나무 1
 
0.3%
청미래덩굴 1
 
0.3%
다릅나무 1
 
0.3%
회양목 1
 
0.3%
시무나무 1
 
0.3%
비수리 1
 
0.3%
등칡 1
 
0.3%
1
 
0.3%
Other values (324) 324
97.0%
2023-12-12T15:38:04.859804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
134
 
10.6%
122
 
9.7%
38
 
3.0%
28
 
2.2%
25
 
2.0%
19
 
1.5%
19
 
1.5%
17
 
1.4%
15
 
1.2%
15
 
1.2%
Other values (314) 827
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1258
99.9%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
134
 
10.7%
122
 
9.7%
38
 
3.0%
28
 
2.2%
25
 
2.0%
19
 
1.5%
19
 
1.5%
17
 
1.4%
15
 
1.2%
15
 
1.2%
Other values (313) 826
65.7%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1258
99.9%
Common 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
134
 
10.7%
122
 
9.7%
38
 
3.0%
28
 
2.2%
25
 
2.0%
19
 
1.5%
19
 
1.5%
17
 
1.4%
15
 
1.2%
15
 
1.2%
Other values (313) 826
65.7%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1258
99.9%
ASCII 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
134
 
10.7%
122
 
9.7%
38
 
3.0%
28
 
2.2%
25
 
2.0%
19
 
1.5%
19
 
1.5%
17
 
1.4%
15
 
1.2%
15
 
1.2%
Other values (313) 826
65.7%
ASCII
ValueCountFrequency (%)
1
100.0%

발행년월
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2016-11
53 
2014-10
32 
2013-06
25 
2012-06
21 
2021-10
 
18
Other values (19)
185 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique2 ?
Unique (%)0.6%

Sample

1st row2008-06
2nd row2009-02
3rd row2009-05
4th row2009-05
5th row2009-05

Common Values

ValueCountFrequency (%)
2016-11 53
15.9%
2014-10 32
 
9.6%
2013-06 25
 
7.5%
2012-06 21
 
6.3%
2021-10 18
 
5.4%
2020-12 18
 
5.4%
2018-06 18
 
5.4%
2022-11 16
 
4.8%
2010-03 14
 
4.2%
2017-06 14
 
4.2%
Other values (14) 105
31.4%

Length

2023-12-12T15:38:05.012972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2016-11 53
15.9%
2014-10 32
 
9.6%
2013-06 25
 
7.5%
2012-06 21
 
6.3%
2021-10 18
 
5.4%
2020-12 18
 
5.4%
2018-06 18
 
5.4%
2022-11 16
 
4.8%
2010-03 14
 
4.2%
2017-06 14
 
4.2%
Other values (14) 105
31.4%

Interactions

2023-12-12T15:38:03.457354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:38:05.110789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발행년월
번호1.0000.977
발행년월0.9771.000
2023-12-12T15:38:05.229430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호발행년월
번호1.0000.850
발행년월0.8501.000

Missing values

2023-12-12T15:38:03.621230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:38:03.709668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호종명발행년월
01표고버섯2008-06
12밤나무2009-02
23백운풀2009-05
34황해쑥2009-05
45천마2009-05
56울릉산마늘2009-05
67참나물2009-05
78곰취2009-05
89벌개미취2009-05
910기린초2009-05
번호종명발행년월
324325매듭풀2022-11
325326긴병꽃풀2022-11
326327넉줄고사리2022-11
327328개별꽃2022-11
328329실거리나무2022-11
329330예덕나무2022-11
330331노간주나무2022-11
3313322022-11
332333복자기2022-11
333334미역줄나무2022-11