Overview

Dataset statistics

Number of variables8
Number of observations65
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.3 KiB
Average record size in memory68.0 B

Variable types

Numeric2
Categorical2
Text2
DateTime2

Dataset

Description임업분야 생명자원 분양 자료(연번, 분양기관, 신청일자, 승인일자, 분양자원(종), 분양자원(점),유형)
URLhttps://www.data.go.kr/data/15024948/fileData.do

Alerts

분양기관 has constant value ""Constant
분양자원(점) is highly overall correlated with 유형High correlation
유형 is highly overall correlated with 분양자원(점)High correlation
유형 is highly imbalanced (85.6%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:31:43.566172
Analysis finished2023-12-12 01:31:45.115805
Duration1.55 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct65
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33
Minimum1
Maximum65
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2023-12-12T10:31:45.224264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.2
Q117
median33
Q349
95-th percentile61.8
Maximum65
Range64
Interquartile range (IQR)32

Descriptive statistics

Standard deviation18.90767
Coefficient of variation (CV)0.57295971
Kurtosis-1.2
Mean33
Median Absolute Deviation (MAD)16
Skewness0
Sum2145
Variance357.5
MonotonicityStrictly increasing
2023-12-12T10:31:45.705492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.5%
50 1
 
1.5%
36 1
 
1.5%
37 1
 
1.5%
38 1
 
1.5%
39 1
 
1.5%
40 1
 
1.5%
41 1
 
1.5%
42 1
 
1.5%
43 1
 
1.5%
Other values (55) 55
84.6%
ValueCountFrequency (%)
1 1
1.5%
2 1
1.5%
3 1
1.5%
4 1
1.5%
5 1
1.5%
6 1
1.5%
7 1
1.5%
8 1
1.5%
9 1
1.5%
10 1
1.5%
ValueCountFrequency (%)
65 1
1.5%
64 1
1.5%
63 1
1.5%
62 1
1.5%
61 1
1.5%
60 1
1.5%
59 1
1.5%
58 1
1.5%
57 1
1.5%
56 1
1.5%

분양기관
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size652.0 B
국립산림품종관리센터
65 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국립산림품종관리센터
2nd row국립산림품종관리센터
3rd row국립산림품종관리센터
4th row국립산림품종관리센터
5th row국립산림품종관리센터

Common Values

ValueCountFrequency (%)
국립산림품종관리센터 65
100.0%

Length

2023-12-12T10:31:45.858689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:31:45.992312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국립산림품종관리센터 65
100.0%
Distinct44
Distinct (%)67.7%
Missing0
Missing (%)0.0%
Memory size652.0 B
2023-12-12T10:31:46.222187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length10.061538
Min length4

Characters and Unicode

Total characters654
Distinct characters96
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)49.2%

Sample

1st row경기도보건환경연구원
2nd row전남대학교
3rd row국민대학교
4th row서울시립대학교
5th row㈜생명의나무
ValueCountFrequency (%)
국립산림과학원 10
 
10.8%
서울대학교 7
 
7.5%
전남대학교 5
 
5.4%
충북대학교 5
 
5.4%
산림자원학과 4
 
4.3%
산림생명자원연구부 3
 
3.2%
산림과학부 3
 
3.2%
㈜브레인트리 3
 
3.2%
농업생명과학대학 3
 
3.2%
국민대학교 2
 
2.2%
Other values (42) 48
51.6%
2023-12-12T10:31:46.696973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
75
 
11.5%
42
 
6.4%
40
 
6.1%
37
 
5.7%
37
 
5.7%
36
 
5.5%
34
 
5.2%
28
 
4.3%
22
 
3.4%
20
 
3.1%
Other values (86) 283
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 605
92.5%
Space Separator 28
 
4.3%
Open Punctuation 7
 
1.1%
Close Punctuation 7
 
1.1%
Other Symbol 4
 
0.6%
Uppercase Letter 2
 
0.3%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
12.4%
42
 
6.9%
40
 
6.6%
37
 
6.1%
37
 
6.1%
36
 
6.0%
34
 
5.6%
22
 
3.6%
20
 
3.3%
15
 
2.5%
Other values (79) 247
40.8%
Uppercase Letter
ValueCountFrequency (%)
G 1
50.0%
F 1
50.0%
Space Separator
ValueCountFrequency (%)
28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 609
93.1%
Common 43
 
6.6%
Latin 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
12.3%
42
 
6.9%
40
 
6.6%
37
 
6.1%
37
 
6.1%
36
 
5.9%
34
 
5.6%
22
 
3.6%
20
 
3.3%
15
 
2.5%
Other values (80) 251
41.2%
Common
ValueCountFrequency (%)
28
65.1%
( 7
 
16.3%
) 7
 
16.3%
& 1
 
2.3%
Latin
ValueCountFrequency (%)
G 1
50.0%
F 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 605
92.5%
ASCII 45
 
6.9%
None 4
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
75
 
12.4%
42
 
6.9%
40
 
6.6%
37
 
6.1%
37
 
6.1%
36
 
6.0%
34
 
5.6%
22
 
3.6%
20
 
3.3%
15
 
2.5%
Other values (79) 247
40.8%
ASCII
ValueCountFrequency (%)
28
62.2%
( 7
 
15.6%
) 7
 
15.6%
& 1
 
2.2%
G 1
 
2.2%
F 1
 
2.2%
None
ValueCountFrequency (%)
4
100.0%
Distinct64
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size652.0 B
Minimum2013-02-22 00:00:00
Maximum2022-07-27 00:00:00
2023-12-12T10:31:46.845942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:31:46.978781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct63
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size652.0 B
Minimum2013-03-04 00:00:00
Maximum2022-08-02 00:00:00
2023-12-12T10:31:47.129303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:31:47.285495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct39
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Memory size652.0 B
2023-12-12T10:31:47.497522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length7
Mean length5.6
Min length2

Characters and Unicode

Total characters364
Distinct characters62
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)41.5%

Sample

1st row잣나무
2nd row소나무 등 3종
3rd row소나무 등 5종
4th row소나무 등 9종
5th row리기다소나무
ValueCountFrequency (%)
24
20.9%
소나무 19
16.5%
낙엽송 7
 
6.1%
2종 6
 
5.2%
3종 5
 
4.3%
독일가문비 4
 
3.5%
일본잎갈나무 4
 
3.5%
편백 4
 
3.5%
4종 3
 
2.6%
잣나무 3
 
2.6%
Other values (26) 36
31.3%
2023-12-12T10:31:47.875559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
50
13.7%
47
 
12.9%
47
 
12.9%
25
 
6.9%
24
 
6.6%
21
 
5.8%
8
 
2.2%
2 8
 
2.2%
7
 
1.9%
7
 
1.9%
Other values (52) 120
33.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 287
78.8%
Space Separator 50
 
13.7%
Decimal Number 27
 
7.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
16.4%
47
16.4%
25
 
8.7%
24
 
8.4%
21
 
7.3%
8
 
2.8%
7
 
2.4%
7
 
2.4%
7
 
2.4%
5
 
1.7%
Other values (42) 89
31.0%
Decimal Number
ValueCountFrequency (%)
2 8
29.6%
3 5
18.5%
4 4
14.8%
5 3
 
11.1%
6 2
 
7.4%
9 2
 
7.4%
7 1
 
3.7%
8 1
 
3.7%
1 1
 
3.7%
Space Separator
ValueCountFrequency (%)
50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 287
78.8%
Common 77
 
21.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
16.4%
47
16.4%
25
 
8.7%
24
 
8.4%
21
 
7.3%
8
 
2.8%
7
 
2.4%
7
 
2.4%
7
 
2.4%
5
 
1.7%
Other values (42) 89
31.0%
Common
ValueCountFrequency (%)
50
64.9%
2 8
 
10.4%
3 5
 
6.5%
4 4
 
5.2%
5 3
 
3.9%
6 2
 
2.6%
9 2
 
2.6%
7 1
 
1.3%
8 1
 
1.3%
1 1
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 287
78.8%
ASCII 77
 
21.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
50
64.9%
2 8
 
10.4%
3 5
 
6.5%
4 4
 
5.2%
5 3
 
3.9%
6 2
 
2.6%
9 2
 
2.6%
7 1
 
1.3%
8 1
 
1.3%
1 1
 
1.3%
Hangul
ValueCountFrequency (%)
47
16.4%
47
16.4%
25
 
8.7%
24
 
8.4%
21
 
7.3%
8
 
2.8%
7
 
2.4%
7
 
2.4%
7
 
2.4%
5
 
1.7%
Other values (42) 89
31.0%

분양자원(점)
Real number (ℝ)

HIGH CORRELATION 

Distinct21
Distinct (%)32.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9.2307692
Minimum1
Maximum137
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2023-12-12T10:31:48.031330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q38
95-th percentile35.4
Maximum137
Range136
Interquartile range (IQR)7

Descriptive statistics

Standard deviation19.114596
Coefficient of variation (CV)2.0707479
Kurtosis31.679536
Mean9.2307692
Median Absolute Deviation (MAD)2
Skewness5.0851848
Sum600
Variance365.36779
MonotonicityNot monotonic
2023-12-12T10:31:48.261982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
1 23
35.4%
2 8
 
12.3%
4 6
 
9.2%
5 4
 
6.2%
3 3
 
4.6%
9 2
 
3.1%
24 2
 
3.1%
6 2
 
3.1%
8 2
 
3.1%
20 2
 
3.1%
Other values (11) 11
16.9%
ValueCountFrequency (%)
1 23
35.4%
2 8
 
12.3%
3 3
 
4.6%
4 6
 
9.2%
5 4
 
6.2%
6 2
 
3.1%
7 1
 
1.5%
8 2
 
3.1%
9 2
 
3.1%
10 1
 
1.5%
ValueCountFrequency (%)
137 1
1.5%
46 1
1.5%
44 1
1.5%
36 1
1.5%
33 1
1.5%
24 2
3.1%
20 2
3.1%
19 1
1.5%
18 1
1.5%
13 1
1.5%

유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)4.6%
Missing0
Missing (%)0.0%
Memory size652.0 B
종자
63 
엽시료
 
1
DNA
 
1

Length

Max length3
Median length2
Mean length2.0307692
Min length2

Unique

Unique2 ?
Unique (%)3.1%

Sample

1st row종자
2nd row종자
3rd row종자
4th row종자
5th row종자

Common Values

ValueCountFrequency (%)
종자 63
96.9%
엽시료 1
 
1.5%
DNA 1
 
1.5%

Length

2023-12-12T10:31:48.435983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:31:48.557396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
종자 63
96.9%
엽시료 1
 
1.5%
dna 1
 
1.5%

Interactions

2023-12-12T10:31:44.569870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:31:44.301450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:31:44.695719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:31:44.437576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:31:48.654921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분양신청자신청일자승인일자분양자원(종)분양자원(점)유형
연번1.0000.8501.0001.0000.0000.3010.097
분양신청자0.8501.0000.9970.9080.9110.0000.000
신청일자1.0000.9971.0000.9930.9901.0001.000
승인일자1.0000.9080.9931.0000.9621.0001.000
분양자원(종)0.0000.9110.9900.9621.0000.0000.859
분양자원(점)0.3010.0001.0001.0000.0001.0000.702
유형0.0970.0001.0001.0000.8590.7021.000
2023-12-12T10:31:48.787003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분양자원(점)유형
연번1.000-0.3270.027
분양자원(점)-0.3271.0000.673
유형0.0270.6731.000

Missing values

2023-12-12T10:31:44.859547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:31:45.038243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번분양기관분양신청자신청일자승인일자분양자원(종)분양자원(점)유형
01국립산림품종관리센터경기도보건환경연구원2013-02-222013-03-04잣나무13종자
12국립산림품종관리센터전남대학교2013-03-062013-03-13소나무 등 3종3종자
23국립산림품종관리센터국민대학교2013-03-212013-04-01소나무 등 5종5종자
34국립산림품종관리센터서울시립대학교2013-03-182013-04-01소나무 등 9종11종자
45국립산림품종관리센터㈜생명의나무2013-06-102013-06-20리기다소나무2종자
56국립산림품종관리센터경상대학교2013-11-112013-11-22소나무 등 42종44종자
67국립산림품종관리센터(사)한국시설양묘연구회2013-12-112013-12-23낙엽송4종자
78국립산림품종관리센터영남대학교2014-02-032014-02-12소나무 등 12종18종자
89국립산림품종관리센터㈜브레인트리2014-03-282014-04-14소나무1종자
910국립산림품종관리센터국립산림과학원2014-07-292014-08-12낙엽송 등 2종2종자
연번분양기관분양신청자신청일자승인일자분양자원(종)분양자원(점)유형
5556국립산림품종관리센터국립산림과학원(산림생명정보연구과)2021-08-042021-08-09독일가문비1종자
5657국립산림품종관리센터국립산림과학원(산림생명정보연구과)2021-08-102021-08-19소나무 등 3종19종자
5758국립산림품종관리센터전남대학교 산림자원학과2021-11-052021-11-18구상나무1종자
5859국립산림품종관리센터서울대학교 농업생명과학대학 산림과학부2022-03-232022-04-04편백2종자
5960국립산림품종관리센터충북 산림환경연구소 산림바이오센터2022-04-152022-04-27광나무 외 3종4종자
6061국립산림품종관리센터전북대학교 조경학과2022-04-282022-05-09굴거리나무1종자
6162국립산림품종관리센터한국교통대학교 전자공학과2022-04-272022-05-09소나무1종자
6263국립산림품종관리센터전남대학교 농업생명과학대학 산림자원학과2022-05-112022-05-18독일가문비1종자
6364국립산림품종관리센터가톨릭대학교 의생명과학과2022-07-042022-07-11백합나무1종자
6465국립산림품종관리센터서울대학교 농림생물자원학부2022-07-272022-08-02일본잎갈나무1종자