Overview

Dataset statistics

Number of variables4
Number of observations27
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1023.0 B
Average record size in memory37.9 B

Variable types

Text2
Categorical1
Numeric1

Dataset

Description경상북도농업기술원 시험분석에 관한 조례에 의거한 농작물 시험경비 입니다. 시험이란 농작물 등과 관련된 물질 또는 물품의 성질, 능력, 변화 및 그 물질이 다른 물질에 미치는 영향을 장기간의 실험 연구를 통하여 밝혀내는 것을 말합니다.
Author경상북도
URLhttps://www.data.go.kr/data/15062974/fileData.do

Alerts

기준금액(원) is highly overall correlated with 산출기준단위High correlation
산출기준단위 is highly overall correlated with 기준금액(원)High correlation
시험구분 has unique valuesUnique
기준금액(원) has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:54:20.124949
Analysis finished2023-12-12 10:54:20.896602
Duration0.77 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시험구분
Text

UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-12T19:54:21.223392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length16.851852
Min length9

Characters and Unicode

Total characters455
Distinct characters82
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)100.0%

Sample

1st row1. 수도육묘시험(일반)
2nd row2. 수도육묘시험(기계이양)
3rd row3. 수도본답재배시험(이양재배)
4th row4. 수도본답재배시험(직파재배)
5th row5. 전작물재배시험
ValueCountFrequency (%)
약효 8
 
9.5%
농약의 8
 
9.5%
2
 
2.4%
1 1
 
1.2%
20 1
 
1.2%
약해시험(전작 1
 
1.2%
21 1
 
1.2%
1
 
1.2%
약해시험(수도작 1
 
1.2%
성능시험 1
 
1.2%
Other values (59) 59
70.2%
2023-12-12T19:54:21.916913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
58
 
12.7%
31
 
6.8%
. 27
 
5.9%
26
 
5.7%
24
 
5.3%
19
 
4.2%
( 18
 
4.0%
18
 
4.0%
) 18
 
4.0%
1 13
 
2.9%
Other values (72) 203
44.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 289
63.5%
Space Separator 58
 
12.7%
Decimal Number 45
 
9.9%
Other Punctuation 27
 
5.9%
Open Punctuation 18
 
4.0%
Close Punctuation 18
 
4.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
10.7%
26
 
9.0%
24
 
8.3%
19
 
6.6%
18
 
6.2%
10
 
3.5%
9
 
3.1%
8
 
2.8%
8
 
2.8%
8
 
2.8%
Other values (58) 128
44.3%
Decimal Number
ValueCountFrequency (%)
1 13
28.9%
2 11
24.4%
7 3
 
6.7%
4 3
 
6.7%
6 3
 
6.7%
5 3
 
6.7%
3 3
 
6.7%
0 2
 
4.4%
9 2
 
4.4%
8 2
 
4.4%
Space Separator
ValueCountFrequency (%)
58
100.0%
Other Punctuation
ValueCountFrequency (%)
. 27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 289
63.5%
Common 166
36.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
10.7%
26
 
9.0%
24
 
8.3%
19
 
6.6%
18
 
6.2%
10
 
3.5%
9
 
3.1%
8
 
2.8%
8
 
2.8%
8
 
2.8%
Other values (58) 128
44.3%
Common
ValueCountFrequency (%)
58
34.9%
. 27
16.3%
( 18
 
10.8%
) 18
 
10.8%
1 13
 
7.8%
2 11
 
6.6%
7 3
 
1.8%
4 3
 
1.8%
6 3
 
1.8%
5 3
 
1.8%
Other values (4) 9
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 289
63.5%
ASCII 166
36.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
58
34.9%
. 27
16.3%
( 18
 
10.8%
) 18
 
10.8%
1 13
 
7.8%
2 11
 
6.6%
7 3
 
1.8%
4 3
 
1.8%
6 3
 
1.8%
5 3
 
1.8%
Other values (4) 9
 
5.4%
Hangul
ValueCountFrequency (%)
31
 
10.7%
26
 
9.0%
24
 
8.3%
19
 
6.6%
18
 
6.2%
10
 
3.5%
9
 
3.1%
8
 
2.8%
8
 
2.8%
8
 
2.8%
Other values (58) 128
44.3%
Distinct19
Distinct (%)70.4%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-12T19:54:22.187987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length40
Mean length30.407407
Min length7

Characters and Unicode

Total characters821
Distinct characters58
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)51.9%

Sample

1st row3이상 처리, 3이상 반복, 구당면적 5제곱미터
2nd row3이상 처리, 5이상 반복, 구당면적 1상자
3rd row3이상 처리, 3이상 반복, 구당면적 33제곱미터
4th row3이상 처리, 3이상 반복, 구당면적 33제곱미터
5th row3이상 처리, 4이상 반복, 구당면적 36제곱미터
ValueCountFrequency (%)
3이상 40
21.4%
처리 25
13.4%
반복 25
13.4%
구당면적 23
12.3%
대조약제 8
 
4.3%
이하 8
 
4.3%
2약제 8
 
4.3%
30제곱미터 6
 
3.2%
4이상 5
 
2.7%
2이상 4
 
2.1%
Other values (25) 35
18.7%
2023-12-12T19:54:22.803929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
160
19.5%
, 61
 
7.4%
60
 
7.3%
3 54
 
6.6%
53
 
6.5%
35
 
4.3%
27
 
3.3%
27
 
3.3%
25
 
3.0%
25
 
3.0%
Other values (48) 294
35.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 490
59.7%
Space Separator 160
 
19.5%
Decimal Number 107
 
13.0%
Other Punctuation 64
 
7.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
60
 
12.2%
53
 
10.8%
35
 
7.1%
27
 
5.5%
27
 
5.5%
25
 
5.1%
25
 
5.1%
25
 
5.1%
23
 
4.7%
23
 
4.7%
Other values (37) 167
34.1%
Decimal Number
ValueCountFrequency (%)
3 54
50.5%
2 17
 
15.9%
5 9
 
8.4%
0 9
 
8.4%
1 7
 
6.5%
4 6
 
5.6%
6 3
 
2.8%
7 2
 
1.9%
Other Punctuation
ValueCountFrequency (%)
, 61
95.3%
. 3
 
4.7%
Space Separator
ValueCountFrequency (%)
160
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 490
59.7%
Common 331
40.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
60
 
12.2%
53
 
10.8%
35
 
7.1%
27
 
5.5%
27
 
5.5%
25
 
5.1%
25
 
5.1%
25
 
5.1%
23
 
4.7%
23
 
4.7%
Other values (37) 167
34.1%
Common
ValueCountFrequency (%)
160
48.3%
, 61
 
18.4%
3 54
 
16.3%
2 17
 
5.1%
5 9
 
2.7%
0 9
 
2.7%
1 7
 
2.1%
4 6
 
1.8%
. 3
 
0.9%
6 3
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 490
59.7%
ASCII 331
40.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
160
48.3%
, 61
 
18.4%
3 54
 
16.3%
2 17
 
5.1%
5 9
 
2.7%
0 9
 
2.7%
1 7
 
2.1%
4 6
 
1.8%
. 3
 
0.9%
6 3
 
0.9%
Hangul
ValueCountFrequency (%)
60
 
12.2%
53
 
10.8%
35
 
7.1%
27
 
5.5%
27
 
5.5%
25
 
5.1%
25
 
5.1%
25
 
5.1%
23
 
4.7%
23
 
4.7%
Other values (37) 167
34.1%

산출기준단위
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)33.3%
Missing0
Missing (%)0.0%
Memory size348.0 B
100제곱미터
19 
15상자
 
1
180키로그램
 
1
90키로그램
 
1
60제곱미터
 
1
Other values (4)

Length

Max length7
Median length7
Mean length6.2592593
Min length2

Unique

Unique8 ?
Unique (%)29.6%

Sample

1st row100제곱미터
2nd row15상자
3rd row100제곱미터
4th row100제곱미터
5th row100제곱미터

Common Values

ValueCountFrequency (%)
100제곱미터 19
70.4%
15상자 1
 
3.7%
180키로그램 1
 
3.7%
90키로그램 1
 
3.7%
60제곱미터 1
 
3.7%
1점 1
 
3.7%
1장치 1
 
3.7%
1식 1
 
3.7%
50제곱미터 1
 
3.7%

Length

2023-12-12T19:54:23.060852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:54:23.284847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
100제곱미터 19
70.4%
15상자 1
 
3.7%
180키로그램 1
 
3.7%
90키로그램 1
 
3.7%
60제곱미터 1
 
3.7%
1점 1
 
3.7%
1장치 1
 
3.7%
1식 1
 
3.7%
50제곱미터 1
 
3.7%

기준금액(원)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct27
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1276329.6
Minimum67900
Maximum4660700
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size375.0 B
2023-12-12T19:54:23.497944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum67900
5-th percentile159100
Q1938950
median1110000
Q31385500
95-th percentile2576400
Maximum4660700
Range4592800
Interquartile range (IQR)446550

Descriptive statistics

Standard deviation897303.6
Coefficient of variation (CV)0.70303437
Kurtosis7.2499146
Mean1276329.6
Median Absolute Deviation (MAD)172000
Skewness2.1927933
Sum34460900
Variance8.0515375 × 1011
MonotonicityNot monotonic
2023-12-12T19:54:23.715589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%)
955200 1
 
3.7%
977200 1
 
3.7%
1120000 1
 
3.7%
1980000 1
 
3.7%
1870000 1
 
3.7%
1110000 1
 
3.7%
1590000 1
 
3.7%
1020000 1
 
3.7%
938000 1
 
3.7%
976600 1
 
3.7%
Other values (17) 17
63.0%
ValueCountFrequency (%)
67900 1
3.7%
129400 1
3.7%
228400 1
3.7%
682000 1
3.7%
683900 1
3.7%
825400 1
3.7%
938000 1
3.7%
939900 1
3.7%
955200 1
3.7%
976600 1
3.7%
ValueCountFrequency (%)
4660700 1
3.7%
2718000 1
3.7%
2246000 1
3.7%
1980000 1
3.7%
1870000 1
3.7%
1590000 1
3.7%
1495000 1
3.7%
1276000 1
3.7%
1267100 1
3.7%
1240300 1
3.7%

Interactions

2023-12-12T19:54:20.425956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:54:23.844003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시험구분시험조건산출기준단위기준금액(원)
시험구분1.0001.0001.0001.000
시험조건1.0001.0000.9540.868
산출기준단위1.0000.9541.0000.895
기준금액(원)1.0000.8680.8951.000
2023-12-12T19:54:23.991309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준금액(원)산출기준단위
기준금액(원)1.0000.696
산출기준단위0.6961.000

Missing values

2023-12-12T19:54:20.629354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:54:20.826048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시험구분시험조건산출기준단위기준금액(원)
01. 수도육묘시험(일반)3이상 처리, 3이상 반복, 구당면적 5제곱미터100제곱미터955200
12. 수도육묘시험(기계이양)3이상 처리, 5이상 반복, 구당면적 1상자15상자977200
23. 수도본답재배시험(이양재배)3이상 처리, 3이상 반복, 구당면적 33제곱미터100제곱미터1267100
34. 수도본답재배시험(직파재배)3이상 처리, 3이상 반복, 구당면적 33제곱미터100제곱미터1069000
45. 전작물재배시험3이상 처리, 4이상 반복, 구당면적 36제곱미터100제곱미터683900
56. 특용작물재배시험3이상 처리, 4이상 반복, 구당면적 24제곱미터100제곱미터825400
67. 채소재배시험(노지재배)3이상 처리, 3이상 반복, 구당면적 30제곱미터100제곱미터939900
78. 채소재배시험(시설재배)3이상 처리, 3이상 반복, 구당면적 15제곱미터100제곱미터1223900
89. 채소육묘시험2이상 처리, 3이상 반복, 구당면적 7.5제곱미터100제곱미터682000
910. 과수재배시험(노지재배)3이상 처리, 4이상 반복, 구당면적 2주이상100제곱미터1240300
시험구분시험조건산출기준단위기준금액(원)
1718. 곡류저장용기 및 자재효능시험2이상 처리, 3이상 반복, 구당면적 3개월1장치228400
1819. 곡류가공기계 및 저장시설 성능시험가공기계 및 저장시설에 한함1식129400
1920. 농약의 약효 약해시험(수도작 벼)대조약제 2약제 이하, 3이상 처리, 3이상 반복, 구당면적 30제곱미터100제곱미터976600
2021. 농약의 약효 약해시험(전작 콩)대조약제 2약제 이하, 3이상 처리, 3이상 반복, 구당면적 30제곱미터100제곱미터938000
2122. 농약의 약효 약해시험(특작 땅콩)대조약제 2약제 이하, 3이상 처리, 3이상 반복, 구당면적 30제곱미터100제곱미터1020000
2223. 농약의 약효 약해시험(과수 사과)대조약제 2약제 이하, 3이상 처리, 3이상 반복, 구당면적 20제곱미터, 1주100제곱미터1590000
2324. 농약의 약효 약해시험(노지채소 수박)대조약제 2약제 이하, 3이상 처리, 3이상 반복, 구당면적 30제곱미터100제곱미터1110000
2425. 농약의 약효 약해시험(시설채소 고추)대조약제 2약제 이하, 3이상 처리, 3이상 반복, 구당면적 20제곱미터100제곱미터1870000
2526. 농약의 약효 약해시험(화훼 카네이션)대조약제 2약제 이하, 3이상 처리, 3이상 반복, 구당면적 10제곱미터50제곱미터1980000
2627. 농약의 약효 약해시험(잔디)대조약제 2약제 이하, 3이상 처리, 3이상 반복, 구당면적 30제곱미터100제곱미터1120000