Overview

Dataset statistics

Number of variables7
Number of observations6315
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory351.6 KiB
Average record size in memory57.0 B

Variable types

Numeric1
DateTime1
Text3
Categorical2

Dataset

Description국립종자원 종자저장고 출고 정보로 번호,출고일자,품종관리번호,작물명,품종명,출고구분 등의 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15066481/fileData.do

Alerts

데이터추출일자 has constant value ""Constant

Reproduction

Analysis started2023-12-12 12:56:21.478002
Analysis finished2023-12-12 12:56:22.791743
Duration1.31 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

출고번호
Real number (ℝ)

Distinct6314
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20895.679
Minimum17490
Maximum25364
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size55.6 KiB
2023-12-12T21:56:22.892854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17490
5-th percentile17851.7
Q119210.5
median20864
Q322552.5
95-th percentile23959.3
Maximum25364
Range7874
Interquartile range (IQR)3342

Descriptive statistics

Standard deviation1957.7608
Coefficient of variation (CV)0.093692134
Kurtosis-1.1707625
Mean20895.679
Median Absolute Deviation (MAD)1671
Skewness0.043166921
Sum1.3195622 × 108
Variance3832827.4
MonotonicityNot monotonic
2023-12-12T21:56:23.096725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
23222 2
 
< 0.1%
17490 1
 
< 0.1%
21940 1
 
< 0.1%
21949 1
 
< 0.1%
21948 1
 
< 0.1%
21947 1
 
< 0.1%
21946 1
 
< 0.1%
21945 1
 
< 0.1%
21944 1
 
< 0.1%
21943 1
 
< 0.1%
Other values (6304) 6304
99.8%
ValueCountFrequency (%)
17490 1
< 0.1%
17491 1
< 0.1%
17492 1
< 0.1%
17493 1
< 0.1%
17494 1
< 0.1%
17495 1
< 0.1%
17496 1
< 0.1%
17497 1
< 0.1%
17498 1
< 0.1%
17499 1
< 0.1%
ValueCountFrequency (%)
25364 1
< 0.1%
25363 1
< 0.1%
24940 1
< 0.1%
24939 1
< 0.1%
24938 1
< 0.1%
24937 1
< 0.1%
24936 1
< 0.1%
24935 1
< 0.1%
24934 1
< 0.1%
24933 1
< 0.1%
Distinct263
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size49.5 KiB
Minimum2020-01-02 00:00:00
Maximum2022-12-30 00:00:00
2023-12-12T21:56:23.280981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:56:23.459387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct5263
Distinct (%)83.3%
Missing0
Missing (%)0.0%
Memory size49.5 KiB
2023-12-12T21:56:23.751306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length13
Mean length13
Min length13

Characters and Unicode

Total characters82095
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4366 ?
Unique (%)69.1%

Sample

1st row110110-000234
2nd row241120-000452
3rd row110110-000235
4th row110110-000236
5th row110110-000237
ValueCountFrequency (%)
110810-001788 5
 
0.1%
111110-003394 5
 
0.1%
111310-004319 5
 
0.1%
111410-004690 5
 
0.1%
111810-006966 5
 
0.1%
111810-006967 5
 
0.1%
110010-000153 5
 
0.1%
111810-007139 4
 
0.1%
112010-008266 4
 
0.1%
111810-007140 4
 
0.1%
Other values (5253) 6268
99.3%
2023-12-12T21:56:24.210485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 23672
28.8%
1 18298
22.3%
2 12449
15.2%
- 6315
 
7.7%
3 5463
 
6.7%
4 3293
 
4.0%
8 2969
 
3.6%
9 2663
 
3.2%
7 2434
 
3.0%
6 2333
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 75780
92.3%
Dash Punctuation 6315
 
7.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 23672
31.2%
1 18298
24.1%
2 12449
16.4%
3 5463
 
7.2%
4 3293
 
4.3%
8 2969
 
3.9%
9 2663
 
3.5%
7 2434
 
3.2%
6 2333
 
3.1%
5 2206
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 6315
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 82095
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 23672
28.8%
1 18298
22.3%
2 12449
15.2%
- 6315
 
7.7%
3 5463
 
6.7%
4 3293
 
4.0%
8 2969
 
3.6%
9 2663
 
3.2%
7 2434
 
3.0%
6 2333
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 82095
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 23672
28.8%
1 18298
22.3%
2 12449
15.2%
- 6315
 
7.7%
3 5463
 
6.7%
4 3293
 
4.0%
8 2969
 
3.6%
9 2663
 
3.2%
7 2434
 
3.0%
6 2333
 
2.8%
Distinct142
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size49.5 KiB
2023-12-12T21:56:24.544746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length2
Mean length2.5338084
Min length1

Characters and Unicode

Total characters16001
Distinct characters208
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)0.4%

Sample

1st row참외
2nd row참외
3rd row참외
4th row참외
5th row참외
ValueCountFrequency (%)
고추 1040
16.4%
토마토 607
 
9.6%
391
 
6.2%
오이 354
 
5.6%
314
 
4.9%
배추 312
 
4.9%
멜론 273
 
4.3%
참외 258
 
4.1%
수박 250
 
3.9%
양파 247
 
3.9%
Other values (133) 2301
36.3%
2023-12-12T21:56:25.057190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1914
 
12.0%
1221
 
7.6%
1134
 
7.1%
663
 
4.1%
651
 
4.1%
623
 
3.9%
554
 
3.5%
446
 
2.8%
444
 
2.8%
420
 
2.6%
Other values (198) 7931
49.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15162
94.8%
Close Punctuation 331
 
2.1%
Open Punctuation 331
 
2.1%
Other Punctuation 83
 
0.5%
Uppercase Letter 53
 
0.3%
Space Separator 34
 
0.2%
Math Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1914
 
12.6%
1221
 
8.1%
1134
 
7.5%
663
 
4.4%
651
 
4.3%
623
 
4.1%
554
 
3.7%
446
 
2.9%
444
 
2.9%
420
 
2.8%
Other values (191) 7092
46.8%
Math Symbol
ValueCountFrequency (%)
+ 6
85.7%
× 1
 
14.3%
Close Punctuation
ValueCountFrequency (%)
) 331
100.0%
Open Punctuation
ValueCountFrequency (%)
( 331
100.0%
Other Punctuation
ValueCountFrequency (%)
, 83
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 53
100.0%
Space Separator
ValueCountFrequency (%)
34
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15162
94.8%
Common 786
 
4.9%
Latin 53
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1914
 
12.6%
1221
 
8.1%
1134
 
7.5%
663
 
4.4%
651
 
4.3%
623
 
4.1%
554
 
3.7%
446
 
2.9%
444
 
2.9%
420
 
2.8%
Other values (191) 7092
46.8%
Common
ValueCountFrequency (%)
) 331
42.1%
( 331
42.1%
, 83
 
10.6%
34
 
4.3%
+ 6
 
0.8%
× 1
 
0.1%
Latin
ValueCountFrequency (%)
X 53
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15162
94.8%
ASCII 838
 
5.2%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1914
 
12.6%
1221
 
8.1%
1134
 
7.5%
663
 
4.4%
651
 
4.3%
623
 
4.1%
554
 
3.7%
446
 
2.9%
444
 
2.9%
420
 
2.8%
Other values (191) 7092
46.8%
ASCII
ValueCountFrequency (%)
) 331
39.5%
( 331
39.5%
, 83
 
9.9%
X 53
 
6.3%
34
 
4.1%
+ 6
 
0.7%
None
ValueCountFrequency (%)
× 1
100.0%
Distinct4374
Distinct (%)69.3%
Missing0
Missing (%)0.0%
Memory size49.5 KiB
2023-12-12T21:56:25.343940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length4.4535234
Min length1

Characters and Unicode

Total characters28124
Distinct characters710
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3153 ?
Unique (%)49.9%

Sample

1st row부산906호
2nd row금싸라기은천
3rd row부산907호
4th row부산909호
5th row수로왕
ValueCountFrequency (%)
일반종 214
 
3.3%
재래종 53
 
0.8%
이루미 7
 
0.1%
온누리2호 6
 
0.1%
골든벨 6
 
0.1%
아름 6
 
0.1%
금강밀 6
 
0.1%
라피드 6
 
0.1%
슈퍼스타 6
 
0.1%
슈퍼플러스 6
 
0.1%
Other values (4414) 6099
95.1%
2023-12-12T21:56:25.808707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1183
 
4.2%
937
 
3.3%
1 787
 
2.8%
690
 
2.5%
2 674
 
2.4%
0 616
 
2.2%
463
 
1.6%
443
 
1.6%
410
 
1.5%
381
 
1.4%
Other values (700) 21540
76.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23987
85.3%
Decimal Number 3677
 
13.1%
Dash Punctuation 214
 
0.8%
Space Separator 120
 
0.4%
Uppercase Letter 97
 
0.3%
Close Punctuation 12
 
< 0.1%
Open Punctuation 12
 
< 0.1%
Lowercase Letter 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1183
 
4.9%
937
 
3.9%
690
 
2.9%
463
 
1.9%
443
 
1.8%
410
 
1.7%
381
 
1.6%
380
 
1.6%
353
 
1.5%
326
 
1.4%
Other values (666) 18421
76.8%
Uppercase Letter
ValueCountFrequency (%)
R 28
28.9%
C 13
13.4%
G 9
 
9.3%
S 9
 
9.3%
K 6
 
6.2%
B 5
 
5.2%
Y 5
 
5.2%
P 5
 
5.2%
W 4
 
4.1%
J 4
 
4.1%
Other values (6) 9
 
9.3%
Decimal Number
ValueCountFrequency (%)
1 787
21.4%
2 674
18.3%
0 616
16.8%
3 300
 
8.2%
5 288
 
7.8%
4 231
 
6.3%
7 210
 
5.7%
6 202
 
5.5%
9 202
 
5.5%
8 167
 
4.5%
Lowercase Letter
ValueCountFrequency (%)
r 2
40.0%
c 1
20.0%
h 1
20.0%
m 1
20.0%
Dash Punctuation
ValueCountFrequency (%)
- 214
100.0%
Space Separator
ValueCountFrequency (%)
120
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23987
85.3%
Common 4035
 
14.3%
Latin 102
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1183
 
4.9%
937
 
3.9%
690
 
2.9%
463
 
1.9%
443
 
1.8%
410
 
1.7%
381
 
1.6%
380
 
1.6%
353
 
1.5%
326
 
1.4%
Other values (666) 18421
76.8%
Latin
ValueCountFrequency (%)
R 28
27.5%
C 13
12.7%
G 9
 
8.8%
S 9
 
8.8%
K 6
 
5.9%
B 5
 
4.9%
Y 5
 
4.9%
P 5
 
4.9%
W 4
 
3.9%
J 4
 
3.9%
Other values (10) 14
13.7%
Common
ValueCountFrequency (%)
1 787
19.5%
2 674
16.7%
0 616
15.3%
3 300
 
7.4%
5 288
 
7.1%
4 231
 
5.7%
- 214
 
5.3%
7 210
 
5.2%
6 202
 
5.0%
9 202
 
5.0%
Other values (4) 311
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23987
85.3%
ASCII 4137
 
14.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1183
 
4.9%
937
 
3.9%
690
 
2.9%
463
 
1.9%
443
 
1.8%
410
 
1.7%
381
 
1.6%
380
 
1.6%
353
 
1.5%
326
 
1.4%
Other values (666) 18421
76.8%
ASCII
ValueCountFrequency (%)
1 787
19.0%
2 674
16.3%
0 616
14.9%
3 300
 
7.3%
5 288
 
7.0%
4 231
 
5.6%
- 214
 
5.2%
7 210
 
5.1%
6 202
 
4.9%
9 202
 
4.9%
Other values (24) 413
10.0%

출고구분
Categorical

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size49.5 KiB
정상출고(발아시험)
3077 
기타
1710 
DB구축용
768 
정상출고(분산)
588 
분쟁종자대비시험용
 
148
Other values (3)
 
24

Length

Max length10
Median length9
Mean length7.0071259
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타
2nd row기타
3rd row기타
4th row기타
5th row기타

Common Values

ValueCountFrequency (%)
정상출고(발아시험) 3077
48.7%
기타 1710
27.1%
DB구축용 768
 
12.2%
정상출고(분산) 588
 
9.3%
분쟁종자대비시험용 148
 
2.3%
품종진위성검정용 12
 
0.2%
품종보호재배시험용 8
 
0.1%
<NA> 4
 
0.1%

Length

2023-12-12T21:56:25.975385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:56:26.123650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상출고(발아시험 3077
48.7%
기타 1710
27.1%
db구축용 768
 
12.2%
정상출고(분산 588
 
9.3%
분쟁종자대비시험용 148
 
2.3%
품종진위성검정용 12
 
0.2%
품종보호재배시험용 8
 
0.1%
na 4
 
0.1%

데이터추출일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size49.5 KiB
2023-07-24
6315 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-24
2nd row2023-07-24
3rd row2023-07-24
4th row2023-07-24
5th row2023-07-24

Common Values

ValueCountFrequency (%)
2023-07-24 6315
100.0%

Length

2023-12-12T21:56:26.295316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:56:26.393883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-24 6315
100.0%

Interactions

2023-12-12T21:56:22.423360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:56:26.448984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출고번호출고구분
출고번호1.0000.647
출고구분0.6471.000
2023-12-12T21:56:26.532244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출고번호출고구분
출고번호1.0000.399
출고구분0.3991.000

Missing values

2023-12-12T21:56:22.585803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:56:22.727954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

출고번호출고일품종관리번호작물명품종명출고구분데이터추출일자
0174902020-01-02110110-000234참외부산906호기타2023-07-24
1174912020-01-02241120-000452참외금싸라기은천기타2023-07-24
2174922020-01-02110110-000235참외부산907호기타2023-07-24
3174932020-01-02110110-000236참외부산909호기타2023-07-24
4174942020-01-02110110-000237참외수로왕기타2023-07-24
5174952020-01-02110110-000238참외금항아리기타2023-07-24
6174962020-01-02240321-000463참외대황기타2023-07-24
7174972020-01-02110310-000447참외참왕기타2023-07-24
8174982020-01-02240421-001009참외가야꿀기타2023-07-24
9174992020-01-02240421-001010참외007꿀기타2023-07-24
출고번호출고일품종관리번호작물명품종명출고구분데이터추출일자
6305243142022-12-08112210-009210엠에스158-2정상출고(발아시험)2023-07-24
6306243152022-12-26112210-009210엠에스158-2정상출고(발아시험)2023-07-24
6307244672022-12-12132220-002481들깨농협기름정상출고(발아시험)2023-07-24
6308244712022-12-21132220-002783단삼일반종정상출고(발아시험)2023-07-24
6309245312022-12-21132220-002919왜당귀일반종정상출고(발아시험)2023-07-24
6310245342022-12-21132220-002920참당귀일반종정상출고(발아시험)2023-07-24
6311249272022-03-11132220-000412호박(서양계)에스피28정상출고(발아시험)2023-07-24
6312249282022-03-11132220-000388토마토에스브이티지6219정상출고(발아시험)2023-07-24
6313253632022-07-22132220-012142왜당귀일반종정상출고(발아시험)2023-07-24
6314253642022-08-26132220-012142왜당귀일반종정상출고(발아시험)2023-07-24