Overview

Dataset statistics

Number of variables6
Number of observations36
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory53.7 B

Variable types

Numeric2
Categorical3
Text1

Dataset

Description경상남도 농작물진단처방 기타항목 데이터입니다.
Author경상남도
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15049543

Alerts

분석기준 is highly overall correlated with 분류코드(소) and 3 other fieldsHigh correlation
분류코드(대) is highly overall correlated with 분류코드(소) and 3 other fieldsHigh correlation
분류코드(소) is highly overall correlated with 비용 and 2 other fieldsHigh correlation
비용 is highly overall correlated with 분류코드(소) and 3 other fieldsHigh correlation
단위 is highly overall correlated with 비용 and 2 other fieldsHigh correlation
분류코드(소) has unique valuesUnique

Reproduction

Analysis started2023-12-10 23:28:45.673108
Analysis finished2023-12-10 23:28:46.751665
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분류코드(소)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct36
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.638889
Minimum1
Maximum38
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size456.0 B
2023-12-11T08:28:46.835360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.75
Q19.75
median19.5
Q329.25
95-th percentile36.25
Maximum38
Range37
Interquartile range (IQR)19.5

Descriptive statistics

Standard deviation11.319144
Coefficient of variation (CV)0.57636376
Kurtosis-1.2442776
Mean19.638889
Median Absolute Deviation (MAD)10
Skewness-0.027061858
Sum707
Variance128.12302
MonotonicityStrictly increasing
2023-12-11T08:28:46.977751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
1 1
 
2.8%
21 1
 
2.8%
24 1
 
2.8%
25 1
 
2.8%
26 1
 
2.8%
27 1
 
2.8%
28 1
 
2.8%
29 1
 
2.8%
30 1
 
2.8%
31 1
 
2.8%
Other values (26) 26
72.2%
ValueCountFrequency (%)
1 1
2.8%
2 1
2.8%
3 1
2.8%
4 1
2.8%
5 1
2.8%
6 1
2.8%
7 1
2.8%
8 1
2.8%
9 1
2.8%
10 1
2.8%
ValueCountFrequency (%)
38 1
2.8%
37 1
2.8%
36 1
2.8%
35 1
2.8%
34 1
2.8%
33 1
2.8%
32 1
2.8%
31 1
2.8%
30 1
2.8%
29 1
2.8%

분류코드(대)
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size420.0 B
E
15 
A
11 
C
10 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA
2nd rowA
3rd rowA
4th rowA
5th rowA

Common Values

ValueCountFrequency (%)
E 15
41.7%
A 11
30.6%
C 10
27.8%

Length

2023-12-11T08:28:47.122990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:28:47.249282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
e 15
41.7%
a 11
30.6%
c 10
27.8%
Distinct33
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size420.0 B
2023-12-11T08:28:47.420967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length3.3055556
Min length1

Characters and Unicode

Total characters119
Distinct characters67
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)83.3%

Sample

1st row규소(Si)
2nd row철(Fe)
3rd row망간(Mn)
4th row아연(Zn)
5th row비소
ValueCountFrequency (%)
질소 3
 
7.7%
망간 2
 
5.1%
크롬 2
 
5.1%
2
 
5.1%
규소(si 1
 
2.6%
석회 1
 
2.6%
염산불용해물 1
 
2.6%
인산 1
 
2.6%
칼리 1
 
2.6%
고토 1
 
2.6%
Other values (24) 24
61.5%
2023-12-11T08:28:47.774356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
6.7%
6
 
5.0%
5
 
4.2%
( 4
 
3.4%
) 4
 
3.4%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
3
 
2.5%
Other values (57) 77
64.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 100
84.0%
Open Punctuation 4
 
3.4%
Close Punctuation 4
 
3.4%
Lowercase Letter 4
 
3.4%
Uppercase Letter 4
 
3.4%
Space Separator 3
 
2.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
8.0%
6
 
6.0%
5
 
5.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
Other values (47) 61
61.0%
Uppercase Letter
ValueCountFrequency (%)
Z 1
25.0%
S 1
25.0%
M 1
25.0%
F 1
25.0%
Lowercase Letter
ValueCountFrequency (%)
n 2
50.0%
i 1
25.0%
e 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 100
84.0%
Common 11
 
9.2%
Latin 8
 
6.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
8.0%
6
 
6.0%
5
 
5.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
Other values (47) 61
61.0%
Latin
ValueCountFrequency (%)
n 2
25.0%
Z 1
12.5%
S 1
12.5%
i 1
12.5%
M 1
12.5%
F 1
12.5%
e 1
12.5%
Common
ValueCountFrequency (%)
( 4
36.4%
) 4
36.4%
3
27.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 100
84.0%
ASCII 19
 
16.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
8.0%
6
 
6.0%
5
 
5.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
3
 
3.0%
2
 
2.0%
Other values (47) 61
61.0%
ASCII
ValueCountFrequency (%)
( 4
21.1%
) 4
21.1%
3
15.8%
n 2
10.5%
Z 1
 
5.3%
S 1
 
5.3%
i 1
 
5.3%
M 1
 
5.3%
F 1
 
5.3%
e 1
 
5.3%

비용
Real number (ℝ)

HIGH CORRELATION 

Distinct13
Distinct (%)36.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13833.333
Minimum8500
Maximum30400
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size456.0 B
2023-12-11T08:28:47.906269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8500
5-th percentile8500
Q18500
median13000
Q314300
95-th percentile29000
Maximum30400
Range21900
Interquartile range (IQR)5800

Descriptive statistics

Standard deviation6442.892
Coefficient of variation (CV)0.46575123
Kurtosis1.2894203
Mean13833.333
Median Absolute Deviation (MAD)3700
Skewness1.4375862
Sum498000
Variance41510857
MonotonicityNot monotonic
2023-12-11T08:28:48.020869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
8500 11
30.6%
14300 10
27.8%
9900 4
 
11.1%
9600 2
 
5.6%
30200 1
 
2.8%
17000 1
 
2.8%
30400 1
 
2.8%
20500 1
 
2.8%
22900 1
 
2.8%
16000 1
 
2.8%
Other values (3) 3
 
8.3%
ValueCountFrequency (%)
8500 11
30.6%
9600 2
 
5.6%
9900 4
 
11.1%
11700 1
 
2.8%
14300 10
27.8%
16000 1
 
2.8%
17000 1
 
2.8%
20500 1
 
2.8%
22900 1
 
2.8%
25400 1
 
2.8%
ValueCountFrequency (%)
30400 1
 
2.8%
30200 1
 
2.8%
28600 1
 
2.8%
25400 1
 
2.8%
22900 1
 
2.8%
20500 1
 
2.8%
17000 1
 
2.8%
16000 1
 
2.8%
14300 10
27.8%
11700 1
 
2.8%

분석기준
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size420.0 B
농촌진흥청고시 제2017-19
15 
수질오염공정시험기준
11 
농촌진흥청토양및식물체분석법
토양오염공정시험기준
 
1

Length

Max length16
Median length14
Mean length13.5
Min length10

Unique

Unique1 ?
Unique (%)2.8%

Sample

1st row수질오염공정시험기준
2nd row수질오염공정시험기준
3rd row수질오염공정시험기준
4th row수질오염공정시험기준
5th row수질오염공정시험기준

Common Values

ValueCountFrequency (%)
농촌진흥청고시 제2017-19 15
41.7%
수질오염공정시험기준 11
30.6%
농촌진흥청토양및식물체분석법 9
25.0%
토양오염공정시험기준 1
 
2.8%

Length

2023-12-11T08:28:48.150238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:28:48.268150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
농촌진흥청고시 15
29.4%
제2017-19 15
29.4%
수질오염공정시험기준 11
21.6%
농촌진흥청토양및식물체분석법 9
17.6%
토양오염공정시험기준 1
 
2.0%

단위
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)16.7%
Missing0
Missing (%)0.0%
Memory size420.0 B
%
13 
mg/L
11 
mg/kg
kg/10a
 
1
cmolc/kg
 
1

Length

Max length8
Median length6
Mean length3.3333333
Min length1

Unique

Unique3 ?
Unique (%)8.3%

Sample

1st rowmg/L
2nd rowmg/L
3rd rowmg/L
4th rowmg/L
5th rowmg/L

Common Values

ValueCountFrequency (%)
% 13
36.1%
mg/L 11
30.6%
mg/kg 9
25.0%
kg/10a 1
 
2.8%
cmolc/kg 1
 
2.8%
<NA> 1
 
2.8%

Length

2023-12-11T08:28:48.400173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:28:48.531014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
13
36.1%
mg/l 11
30.6%
mg/kg 9
25.0%
kg/10a 1
 
2.8%
cmolc/kg 1
 
2.8%
na 1
 
2.8%

Interactions

2023-12-11T08:28:46.124519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:28:45.951444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:28:46.207436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:28:46.038553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:28:48.615763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류코드(소)분류코드(대)항목명비용분석기준단위
분류코드(소)1.0000.9360.7250.6440.8930.874
분류코드(대)0.9361.0000.0000.8291.0000.838
항목명0.7250.0001.0000.9640.0000.973
비용0.6440.8290.9641.0000.8690.793
분석기준0.8931.0000.0000.8691.0000.746
단위0.8740.8380.9730.7930.7461.000
2023-12-11T08:28:48.711206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단위분석기준분류코드(대)
단위1.0000.6780.844
분석기준0.6781.0000.985
분류코드(대)0.8440.9851.000
2023-12-11T08:28:48.812940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류코드(소)비용분류코드(대)분석기준단위
분류코드(소)1.0000.6520.8140.6910.494
비용0.6521.0000.5780.7100.523
분류코드(대)0.8140.5781.0000.9850.844
분석기준0.6910.7100.9851.0000.678
단위0.4940.5230.8440.6781.000

Missing values

2023-12-11T08:28:46.605965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:28:46.712467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분류코드(소)분류코드(대)항목명비용분석기준단위
01A규소(Si)8500수질오염공정시험기준mg/L
12A철(Fe)8500수질오염공정시험기준mg/L
23A망간(Mn)8500수질오염공정시험기준mg/L
34A아연(Zn)8500수질오염공정시험기준mg/L
45A비소8500수질오염공정시험기준mg/L
56A니켈8500수질오염공정시험기준mg/L
67A카드뮴8500수질오염공정시험기준mg/L
78A구리8500수질오염공정시험기준mg/L
89A크롬8500수질오염공정시험기준mg/L
910A수은8500수질오염공정시험기준mg/L
분류코드(소)분류코드(대)항목명비용분석기준단위
2629E규산14300농촌진흥청고시 제2017-19%
2730E붕소14300농촌진흥청고시 제2017-19%
2831E망간14300농촌진흥청고시 제2017-19mg/kg
2932E14300농촌진흥청고시 제2017-19mg/kg
3033E알카리분14300농촌진흥청고시 제2017-19%
3134E분말도9600농촌진흥청고시 제2017-19%
3235E입상9600농촌진흥청고시 제2017-19%
3336E붕괴도25400농촌진흥청고시 제2017-19%
3437E유기물대질소비28600농촌진흥청고시 제2017-19<NA>
3538C크롬11700토양오염공정시험기준mg/kg