Overview

Dataset statistics

Number of variables9
Number of observations5227
Missing cells30
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory388.1 KiB
Average record size in memory76.0 B

Variable types

Numeric4
Categorical3
Boolean1
Text1

Dataset

Description충청북도 농업기술원 농가경영기록장(농가의 소득을 증진시킬 수 있는 회원전용 농가경영 관리 프로그램)의 수입지출관련 이용자 접속기록, 거래, 거래처 등의 관리시스템으로 일련번호, 품목일련번호, 수입/지출코드, 등록일시, 수정일시, 상태, 세부항목, 품목재배일련번호, 순번등을 제공합니다.
Author충청북도
URLhttps://www.data.go.kr/data/15050325/fileData.do

Alerts

등록일시 has constant value ""Constant
수정일시 has constant value ""Constant
상태 has constant value ""Constant
일련번호 is highly overall correlated with 품목재배일련번호High correlation
품목재배일련번호 is highly overall correlated with 일련번호High correlation
일련번호 has unique valuesUnique
순번 has 714 (13.7%) zerosZeros

Reproduction

Analysis started2023-12-12 17:10:25.573179
Analysis finished2023-12-12 17:10:28.405117
Duration2.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct5227
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3211.9413
Minimum1
Maximum6360
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size46.1 KiB
2023-12-13T02:10:28.482701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile356.2
Q11624.5
median3223
Q34813.5
95-th percentile6058.7
Maximum6360
Range6359
Interquartile range (IQR)3189

Descriptive statistics

Standard deviation1836.8226
Coefficient of variation (CV)0.57187302
Kurtosis-1.2128263
Mean3211.9413
Median Absolute Deviation (MAD)1594
Skewness-0.014772304
Sum16788817
Variance3373917.1
MonotonicityStrictly increasing
2023-12-13T02:10:28.631362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
4273 1
 
< 0.1%
4297 1
 
< 0.1%
4296 1
 
< 0.1%
4295 1
 
< 0.1%
4294 1
 
< 0.1%
4293 1
 
< 0.1%
4292 1
 
< 0.1%
4290 1
 
< 0.1%
4288 1
 
< 0.1%
Other values (5217) 5217
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
14 1
< 0.1%
15 1
< 0.1%
17 1
< 0.1%
19 1
< 0.1%
ValueCountFrequency (%)
6360 1
< 0.1%
6359 1
< 0.1%
6357 1
< 0.1%
6353 1
< 0.1%
6352 1
< 0.1%
6351 1
< 0.1%
6350 1
< 0.1%
6349 1
< 0.1%
6348 1
< 0.1%
6347 1
< 0.1%

품목일련번호
Real number (ℝ)

Distinct227
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean107.03405
Minimum1
Maximum548
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size46.1 KiB
2023-12-13T02:10:28.756788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10
Q135
median62
Q3145
95-th percentile334
Maximum548
Range547
Interquartile range (IQR)110

Descriptive statistics

Standard deviation112.56471
Coefficient of variation (CV)1.0516719
Kurtosis2.2310301
Mean107.03405
Median Absolute Deviation (MAD)39
Skewness1.6655154
Sum559467
Variance12670.813
MonotonicityNot monotonic
2023-12-13T02:10:28.911540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35 363
 
6.9%
66 235
 
4.5%
13 208
 
4.0%
22 197
 
3.8%
29 144
 
2.8%
39 138
 
2.6%
185 137
 
2.6%
312 124
 
2.4%
42 108
 
2.1%
334 105
 
2.0%
Other values (217) 3468
66.3%
ValueCountFrequency (%)
1 5
 
0.1%
4 37
 
0.7%
5 53
1.0%
6 1
 
< 0.1%
7 96
1.8%
8 20
 
0.4%
9 6
 
0.1%
10 87
1.7%
11 13
 
0.2%
12 1
 
< 0.1%
ValueCountFrequency (%)
548 4
 
0.1%
515 24
0.5%
498 12
 
0.2%
476 1
 
< 0.1%
474 25
0.5%
472 23
0.4%
471 3
 
0.1%
470 3
 
0.1%
469 2
 
< 0.1%
468 53
1.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size41.0 KiB
O
3877 
I
1350 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowI
2nd rowI
3rd rowI
4th rowO
5th rowI

Common Values

ValueCountFrequency (%)
O 3877
74.2%
I 1350
 
25.8%

Length

2023-12-13T02:10:29.050048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:10:29.467489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
o 3877
74.2%
i 1350
 
25.8%

등록일시
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size41.0 KiB
1900-01-01
5227 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1900-01-01
2nd row1900-01-01
3rd row1900-01-01
4th row1900-01-01
5th row1900-01-01

Common Values

ValueCountFrequency (%)
1900-01-01 5227
100.0%

Length

2023-12-13T02:10:29.563242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:10:29.652801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1900-01-01 5227
100.0%

수정일시
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size41.0 KiB
1900-01-01
5227 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1900-01-01
2nd row1900-01-01
3rd row1900-01-01
4th row1900-01-01
5th row1900-01-01

Common Values

ValueCountFrequency (%)
1900-01-01 5227
100.0%

Length

2023-12-13T02:10:29.740629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:10:29.839295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1900-01-01 5227
100.0%

상태
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
False
5227 
ValueCountFrequency (%)
False 5227
100.0%
2023-12-13T02:10:29.906940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct3746
Distinct (%)72.1%
Missing30
Missing (%)0.6%
Memory size41.0 KiB
2023-12-13T02:10:30.165253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length45
Mean length5.8495286
Min length1

Characters and Unicode

Total characters30400
Distinct characters805
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3124 ?
Unique (%)60.1%

Sample

1st row배추
2nd row절임배추
3rd row예금이자
4th row아파트임대
5th row멜론
ValueCountFrequency (%)
판매 40
 
0.6%
박스 29
 
0.5%
살충제 27
 
0.4%
25
 
0.4%
제초제 24
 
0.4%
휘발유 24
 
0.4%
비닐 24
 
0.4%
유류비 23
 
0.4%
인건비 22
 
0.3%
퇴비 20
 
0.3%
Other values (4025) 6180
96.0%
2023-12-13T02:10:30.617010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3586
 
11.8%
1430
 
4.7%
816
 
2.7%
1 762
 
2.5%
5 538
 
1.8%
2 537
 
1.8%
417
 
1.4%
414
 
1.4%
381
 
1.3%
366
 
1.2%
Other values (795) 21153
69.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20154
66.3%
Decimal Number 6587
 
21.7%
Space Separator 1430
 
4.7%
Lowercase Letter 662
 
2.2%
Other Punctuation 557
 
1.8%
Open Punctuation 358
 
1.2%
Close Punctuation 357
 
1.2%
Uppercase Letter 116
 
0.4%
Dash Punctuation 92
 
0.3%
Math Symbol 86
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
816
 
4.0%
417
 
2.1%
414
 
2.1%
381
 
1.9%
366
 
1.8%
346
 
1.7%
319
 
1.6%
288
 
1.4%
284
 
1.4%
281
 
1.4%
Other values (719) 16242
80.6%
Lowercase Letter
ValueCountFrequency (%)
g 233
35.2%
k 177
26.7%
m 105
15.9%
l 26
 
3.9%
c 16
 
2.4%
a 13
 
2.0%
x 13
 
2.0%
e 12
 
1.8%
s 11
 
1.7%
o 10
 
1.5%
Other values (13) 46
 
6.9%
Uppercase Letter
ValueCountFrequency (%)
G 21
18.1%
K 11
9.5%
L 9
 
7.8%
B 8
 
6.9%
P 8
 
6.9%
A 8
 
6.9%
C 7
 
6.0%
R 7
 
6.0%
X 7
 
6.0%
M 6
 
5.2%
Other values (13) 24
20.7%
Other Punctuation
ValueCountFrequency (%)
, 242
43.4%
. 214
38.4%
/ 65
 
11.7%
: 13
 
2.3%
* 12
 
2.2%
@ 3
 
0.5%
& 2
 
0.4%
\ 2
 
0.4%
; 2
 
0.4%
# 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 3586
54.4%
1 762
 
11.6%
5 538
 
8.2%
2 537
 
8.2%
3 276
 
4.2%
4 269
 
4.1%
6 184
 
2.8%
8 183
 
2.8%
7 158
 
2.4%
9 94
 
1.4%
Math Symbol
ValueCountFrequency (%)
~ 34
39.5%
× 31
36.0%
= 17
19.8%
+ 4
 
4.7%
Space Separator
ValueCountFrequency (%)
1430
100.0%
Open Punctuation
ValueCountFrequency (%)
( 358
100.0%
Close Punctuation
ValueCountFrequency (%)
) 357
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%
Currency Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20154
66.3%
Common 9468
31.1%
Latin 778
 
2.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
816
 
4.0%
417
 
2.1%
414
 
2.1%
381
 
1.9%
366
 
1.8%
346
 
1.7%
319
 
1.6%
288
 
1.4%
284
 
1.4%
281
 
1.4%
Other values (719) 16242
80.6%
Latin
ValueCountFrequency (%)
g 233
29.9%
k 177
22.8%
m 105
13.5%
l 26
 
3.3%
G 21
 
2.7%
c 16
 
2.1%
a 13
 
1.7%
x 13
 
1.7%
e 12
 
1.5%
K 11
 
1.4%
Other values (36) 151
19.4%
Common
ValueCountFrequency (%)
0 3586
37.9%
1430
 
15.1%
1 762
 
8.0%
5 538
 
5.7%
2 537
 
5.7%
( 358
 
3.8%
) 357
 
3.8%
3 276
 
2.9%
4 269
 
2.8%
, 242
 
2.6%
Other values (20) 1113
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20124
66.2%
ASCII 10214
33.6%
None 32
 
0.1%
Compat Jamo 30
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3586
35.1%
1430
 
14.0%
1 762
 
7.5%
5 538
 
5.3%
2 537
 
5.3%
( 358
 
3.5%
) 357
 
3.5%
3 276
 
2.7%
4 269
 
2.6%
, 242
 
2.4%
Other values (64) 1859
18.2%
Hangul
ValueCountFrequency (%)
816
 
4.1%
417
 
2.1%
414
 
2.1%
381
 
1.9%
366
 
1.8%
346
 
1.7%
319
 
1.6%
288
 
1.4%
284
 
1.4%
281
 
1.4%
Other values (705) 16212
80.6%
None
ValueCountFrequency (%)
× 31
96.9%
1
 
3.1%
Compat Jamo
ValueCountFrequency (%)
5
16.7%
5
16.7%
3
10.0%
3
10.0%
3
10.0%
2
 
6.7%
2
 
6.7%
1
 
3.3%
1
 
3.3%
1
 
3.3%
Other values (4) 4
13.3%

품목재배일련번호
Real number (ℝ)

HIGH CORRELATION 

Distinct1176
Distinct (%)22.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7961.3499
Minimum2
Maximum15607
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size46.1 KiB
2023-12-13T02:10:30.754657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile824
Q14658
median8101
Q311385
95-th percentile14715
Maximum15607
Range15605
Interquartile range (IQR)6727

Descriptive statistics

Standard deviation4092.2378
Coefficient of variation (CV)0.51401306
Kurtosis-0.92133443
Mean7961.3499
Median Absolute Deviation (MAD)3416
Skewness-0.02819011
Sum41613976
Variance16746410
MonotonicityNot monotonic
2023-12-13T02:10:30.887848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9885 102
 
2.0%
2753 88
 
1.7%
4560 78
 
1.5%
11823 71
 
1.4%
8585 60
 
1.1%
5509 52
 
1.0%
2906 51
 
1.0%
8740 48
 
0.9%
4930 48
 
0.9%
13738 47
 
0.9%
Other values (1166) 4582
87.7%
ValueCountFrequency (%)
2 7
0.1%
3 2
 
< 0.1%
8 1
 
< 0.1%
14 1
 
< 0.1%
28 14
0.3%
32 6
 
0.1%
46 15
0.3%
47 5
 
0.1%
71 1
 
< 0.1%
83 2
 
< 0.1%
ValueCountFrequency (%)
15607 2
 
< 0.1%
15586 1
 
< 0.1%
15585 2
 
< 0.1%
15582 5
0.1%
15577 1
 
< 0.1%
15567 1
 
< 0.1%
15459 3
 
0.1%
15457 4
 
0.1%
15449 10
0.2%
15430 5
0.1%

순번
Real number (ℝ)

ZEROS 

Distinct46
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.875837
Minimum0
Maximum96
Zeros714
Zeros (%)13.7%
Negative0
Negative (%)0.0%
Memory size46.1 KiB
2023-12-13T02:10:31.018988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q39
95-th percentile25
Maximum96
Range96
Interquartile range (IQR)8

Descriptive statistics

Standard deviation10.184148
Coefficient of variation (CV)1.4811504
Kurtosis17.604001
Mean6.875837
Median Absolute Deviation (MAD)2
Skewness3.357548
Sum35940
Variance103.71688
MonotonicityNot monotonic
2023-12-13T02:10:31.161258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)
ValueCountFrequency (%)
1 1213
23.2%
2 958
18.3%
0 714
13.7%
8 474
 
9.1%
9 277
 
5.3%
11 212
 
4.1%
10 195
 
3.7%
6 187
 
3.6%
3 159
 
3.0%
14 152
 
2.9%
Other values (36) 686
13.1%
ValueCountFrequency (%)
0 714
13.7%
1 1213
23.2%
2 958
18.3%
3 159
 
3.0%
4 12
 
0.2%
5 3
 
0.1%
6 187
 
3.6%
7 8
 
0.2%
8 474
 
9.1%
9 277
 
5.3%
ValueCountFrequency (%)
96 3
 
0.1%
95 4
 
0.1%
92 2
 
< 0.1%
89 1
 
< 0.1%
88 5
 
0.1%
84 2
 
< 0.1%
70 1
 
< 0.1%
57 1
 
< 0.1%
51 1
 
< 0.1%
45 38
0.7%

Interactions

2023-12-13T02:10:27.802185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:26.459352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:26.922884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:27.361356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:27.891688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:26.575011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:27.026893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:27.493533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:27.991102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:26.686974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:27.132004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:27.611184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:28.105660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:26.807866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:27.240065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T02:10:27.705870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:10:31.286228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호품목일련번호수입/지출코드품목재배일련번호순번
일련번호1.0000.5650.2230.9290.181
품목일련번호0.5651.0000.1250.6610.327
수입/지출코드0.2230.1251.0000.2060.319
품목재배일련번호0.9290.6610.2061.0000.164
순번0.1810.3270.3190.1641.000
2023-12-13T02:10:31.399284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호품목일련번호품목재배일련번호순번수입/지출코드
일련번호1.0000.2230.8200.0620.171
품목일련번호0.2231.0000.2340.0980.096
품목재배일련번호0.8200.2341.0000.0700.158
순번0.0620.0980.0701.0000.318
수입/지출코드0.1710.0960.1580.3181.000

Missing values

2023-12-13T02:10:28.217202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:10:28.351797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호품목일련번호수입/지출코드등록일시수정일시상태세부항목품목재배일련번호순번
015I1900-01-011900-01-01N배추21
125I1900-01-011900-01-01N절임배추21
255I1900-01-011900-01-01N예금이자22
365O1900-01-011900-01-01N아파트임대25
4767I1900-01-011900-01-01N멜론3281
5867O1900-01-011900-01-01N자가3286
6145I1900-01-011900-01-01N<NA>22
7155I1900-01-011900-01-01N가나21
8175O1900-01-011900-01-01N살충제24
91975I1900-01-011900-01-01N17941
일련번호품목일련번호수입/지출코드등록일시수정일시상태세부항목품목재배일련번호순번
5217634723O1900-01-011900-01-01N발근력155822
5218634823O1900-01-011900-01-01N요소1558210
5219634923O1900-01-011900-01-01N155828
5220635023O1900-01-011900-01-01N농기계수리155820
5221635113O1900-01-011900-01-01N퇴비구입155850
5222635213O1900-01-011900-01-01N퇴비구입 20kg 100포 소똥 1차155858
52236353140O1900-01-011900-01-01N아로니아1년생 80주155861
5224635762O1900-01-011900-01-01N네오 유황, 한버네71892
52256359148O1900-01-011900-01-01N아리기계유제 18리터156072
52266360148O1900-01-011900-01-01N베푸란(동방아그로)156072