Overview

Dataset statistics

Number of variables3
Number of observations22
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory682.0 B
Average record size in memory31.0 B

Variable types

Text1
Categorical1
Numeric1

Dataset

Description경상북도농업기술원 시험 분석에 관한 조례에 의건한 농작물 시험분석 수수료 정보입니다. 분석이란 그 물질 또는 물품을 구성하고 있는 성분을 단기간의 검사를 통하여 밝혀내는 것을 말합니다.
Author경상북도
URLhttps://www.data.go.kr/data/15062973/fileData.do

Alerts

분석항목 has unique valuesUnique

Reproduction

Analysis started2023-12-12 07:09:08.416760
Analysis finished2023-12-12 07:09:08.838684
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분석항목
Text

UNIQUE 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-12T16:09:09.057790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length71
Median length25.5
Mean length19.318182
Min length2

Characters and Unicode

Total characters425
Distinct characters110
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row비료 주성분(질소, 인산, 가리, 고토, 망간, 아연, 구리, 붕소, 철, 규산, 몰리브덴, 알카리분, 유기물, CEC, 칼슘)
2nd row유해성분(카드뮴, 비소, 납, 수은, 크롬, 염산불용해물, 니켈, 염분, 염소)
3rd row기타성분(유기물 질소의비)
4th row기타성분(구연산칼슘)
5th row기타성분(수용성질소)
ValueCountFrequency (%)
cl 2
 
2.7%
염분 2
 
2.7%
k2o 2
 
2.7%
p2o5 2
 
2.7%
무기성분(n 1
 
1.4%
작물 1
 
1.4%
기타 1
 
1.4%
cao 1
 
1.4%
mgo 1
 
1.4%
ph 1
 
1.4%
Other values (59) 59
80.8%
2023-12-12T16:09:09.523426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51
 
12.0%
, 43
 
10.1%
22
 
5.2%
( 21
 
4.9%
) 21
 
4.9%
17
 
4.0%
10
 
2.4%
O 9
 
2.1%
9
 
2.1%
C 9
 
2.1%
Other values (100) 213
50.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 233
54.8%
Space Separator 51
 
12.0%
Other Punctuation 43
 
10.1%
Uppercase Letter 37
 
8.7%
Open Punctuation 21
 
4.9%
Close Punctuation 21
 
4.9%
Lowercase Letter 11
 
2.6%
Decimal Number 8
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
9.4%
17
 
7.3%
10
 
4.3%
9
 
3.9%
8
 
3.4%
8
 
3.4%
8
 
3.4%
7
 
3.0%
7
 
3.0%
7
 
3.0%
Other values (73) 130
55.8%
Uppercase Letter
ValueCountFrequency (%)
O 9
24.3%
C 9
24.3%
N 4
10.8%
S 3
 
8.1%
P 3
 
8.1%
K 2
 
5.4%
E 2
 
5.4%
F 1
 
2.7%
D 1
 
2.7%
A 1
 
2.7%
Other values (2) 2
 
5.4%
Lowercase Letter
ValueCountFrequency (%)
a 3
27.3%
l 2
18.2%
s 1
 
9.1%
b 1
 
9.1%
d 1
 
9.1%
p 1
 
9.1%
g 1
 
9.1%
i 1
 
9.1%
Decimal Number
ValueCountFrequency (%)
2 5
62.5%
5 2
 
25.0%
4 1
 
12.5%
Space Separator
ValueCountFrequency (%)
51
100.0%
Other Punctuation
ValueCountFrequency (%)
, 43
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 233
54.8%
Common 144
33.9%
Latin 48
 
11.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
9.4%
17
 
7.3%
10
 
4.3%
9
 
3.9%
8
 
3.4%
8
 
3.4%
8
 
3.4%
7
 
3.0%
7
 
3.0%
7
 
3.0%
Other values (73) 130
55.8%
Latin
ValueCountFrequency (%)
O 9
18.8%
C 9
18.8%
N 4
 
8.3%
S 3
 
6.2%
P 3
 
6.2%
a 3
 
6.2%
l 2
 
4.2%
K 2
 
4.2%
E 2
 
4.2%
F 1
 
2.1%
Other values (10) 10
20.8%
Common
ValueCountFrequency (%)
51
35.4%
, 43
29.9%
( 21
14.6%
) 21
14.6%
2 5
 
3.5%
5 2
 
1.4%
4 1
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 233
54.8%
ASCII 192
45.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
51
26.6%
, 43
22.4%
( 21
10.9%
) 21
10.9%
O 9
 
4.7%
C 9
 
4.7%
2 5
 
2.6%
N 4
 
2.1%
S 3
 
1.6%
P 3
 
1.6%
Other values (17) 23
12.0%
Hangul
ValueCountFrequency (%)
22
 
9.4%
17
 
7.3%
10
 
4.3%
9
 
3.9%
8
 
3.4%
8
 
3.4%
8
 
3.4%
7
 
3.0%
7
 
3.0%
7
 
3.0%
Other values (73) 130
55.8%

분석기준
Categorical

Distinct3
Distinct (%)13.6%
Missing0
Missing (%)0.0%
Memory size308.0 B
1점
11 
1점 1성분
1점

Length

Max length6
Median length4.5
Mean length3.7272727
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1점 1성분
2nd row1점 1성분
3rd row1점
4th row1점
5th row1점 1성분

Common Values

ValueCountFrequency (%)
1점 11
50.0%
1점 1성분 9
40.9%
1점 2
 
9.1%

Length

2023-12-12T16:09:09.689044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:09:09.825079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1점 22
71.0%
1성분 9
29.0%

기준금액(원)
Real number (ℝ)

Distinct21
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27831.818
Minimum7500
Maximum91100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size330.0 B
2023-12-12T16:09:09.936469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7500
5-th percentile8555
Q111700
median18750
Q330350
95-th percentile81260
Maximum91100
Range83600
Interquartile range (IQR)18650

Descriptive statistics

Standard deviation24658.59
Coefficient of variation (CV)0.88598561
Kurtosis1.9774418
Mean27831.818
Median Absolute Deviation (MAD)9000
Skewness1.7184769
Sum612300
Variance6.0804608 × 108
MonotonicityNot monotonic
2023-12-12T16:09:10.038044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
11700 2
 
9.1%
14300 1
 
4.5%
16000 1
 
4.5%
10500 1
 
4.5%
8500 1
 
4.5%
22900 1
 
4.5%
20500 1
 
4.5%
30400 1
 
4.5%
17000 1
 
4.5%
30200 1
 
4.5%
Other values (11) 11
50.0%
ValueCountFrequency (%)
7500 1
4.5%
8500 1
4.5%
9600 1
4.5%
9900 1
4.5%
10500 1
4.5%
11700 2
9.1%
13500 1
4.5%
14300 1
4.5%
16000 1
4.5%
17000 1
4.5%
ValueCountFrequency (%)
91100 1
4.5%
81400 1
4.5%
78600 1
4.5%
43800 1
4.5%
33000 1
4.5%
30400 1
4.5%
30200 1
4.5%
28600 1
4.5%
22900 1
4.5%
21600 1
4.5%

Interactions

2023-12-12T16:09:08.619779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T16:09:10.112878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분석항목분석기준기준금액(원)
분석항목1.0001.0001.000
분석기준1.0001.0000.616
기준금액(원)1.0000.6161.000
2023-12-12T16:09:10.200141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준금액(원)분석기준
기준금액(원)1.0000.381
분석기준0.3811.000

Missing values

2023-12-12T16:09:08.717222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:09:08.804036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분석항목분석기준기준금액(원)
0비료 주성분(질소, 인산, 가리, 고토, 망간, 아연, 구리, 붕소, 철, 규산, 몰리브덴, 알카리분, 유기물, CEC, 칼슘)1점 1성분14300
1유해성분(카드뮴, 비소, 납, 수은, 크롬, 염산불용해물, 니켈, 염분, 염소)1점 1성분16000
2기타성분(유기물 질소의비)1점28600
3기타성분(구연산칼슘)1점78600
4기타성분(수용성질소)1점 1성분33000
5수분1점7500
6물리성(분말도)1점9600
7부숙도(콤백법)1점43800
8부숙도(솔비타)1점81400
9부숙도(종자발아법)1점91100
분석항목분석기준기준금액(원)
12토양화학성분(유기물)1점21600
13토양화학성분(유효규산)1점30200
14토양화학성분(석회요구량)1점17000
15토양화학성분(양이온치환용량)1점30400
16토양화학성분(암모늄태 질소)1점20500
17토양화학성분(질산태질소)1점22900
18작물 오염분석(S, F, Cl, 중금속)1점 1성분11700
19수질 오염분석(N, COD, SO4, Na, Cl)1점 1성분8500
20토양 오염분석(As, Ca, Pb, Cd)1점 1성분11700
21식물체 무기성분(N, P2O5, SiO2, K2O, 기타) 분석1점 1성분10500