Overview

Dataset statistics

Number of variables: 23
Number of observations: 10000
Missing cells: 68569
Missing cells (%): 29.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.0 MiB
Average record size in memory: 207.0 B
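
The derived figures in this block can be re-computed from the raw counts. The sketch below is only a consistency check in plain Python; it assumes the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is the total memory divided by the number of rows. The constant names are illustrative.

```python
# Figures copied from the "Dataset statistics" block.
N_VARIABLES = 23
N_OBSERVATIONS = 10_000
MISSING_CELLS = 68_569
AVG_RECORD_BYTES = 207.0

# Assumption: the percentage is relative to every cell in the table.
total_cells = N_VARIABLES * N_OBSERVATIONS
missing_pct = 100 * MISSING_CELLS / total_cells

# Assumption: average record size = total memory / number of rows.
total_mib = AVG_RECORD_BYTES * N_OBSERVATIONS / 2**20

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Total size in memory: {total_mib:.1f} MiB")
```

Both values round to the figures shown in the report (29.8% and 2.0 MiB).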

Variable types

Numeric: 8
Categorical: 4
Text: 3
DateTime: 2
Boolean: 1
Unsupported: 5

Dataset

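Description: Data from the income and expenditure module of the farm management ledger (농가경영기록장) run by the Chungcheongbuk-do Agricultural Research and Extension Services (충청북도 농업기술원), a members-only farm management program intended to help raise farm income. The source system manages user access records, transactions, and trading partners. The dataset provides the following fields: 일련번호 (serial number), 수입/지출코드 (income/expense code), 거래일자 (transaction date), 등록일시 (registration timestamp), 수정일시 (modification timestamp), 상태 (status), 품목일련번호 (item serial number), 종류계정과목 (account category), 등급구분 (grade class), 포장단위금액 (packaging unit amount), 포장단위구분 (packaging unit type), 수량 (quantity), 단가 (unit price), 수입/지출구분코드 (income/expense classification code), 차변전표번호 (debit slip number), 대변천표번호 (credit slip number), 적요 (remarks), 세부항목 (detail item), 세부항목명(세부항목테이블에없는경우) (detail item name, used when the item is not in the detail item table), 반대분개일자 (reversing entry date), 반대분개구분(현금/통장) (reversing entry type, cash or bank account), 반대분개차변전표번호 (reversing entry debit slip number), and 반대분개대변전표번호 (reversing entry credit slip number).
Author: 충청북도 (Chungcheongbuk-do)
URL: https://www.data.go.kr/data/15050324/fileData.do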

Alerts

상태 has constant value "" [Constant]
등급구분 is highly imbalanced (62.2%) [Imbalance]
수입/지출구분코드 is highly imbalanced (72.0%) [Imbalance]
포장단위구분 has 235 (2.4%) missing values [Missing]
수량 has 191 (1.9%) missing values [Missing]
단가 has 177 (1.8%) missing values [Missing]
적요 has 9410 (94.1%) missing values [Missing]
세부항목 has 9278 (92.8%) missing values [Missing]
세부항목명(세부항목테이블에없는경우) has 9278 (92.8%) missing values [Missing]
반대분개일자 has 10000 (100.0%) missing values [Missing]
반대분개구분(현금/통장) has 10000 (100.0%) missing values [Missing]
반대분개차변전표번호 has 10000 (100.0%) missing values [Missing]
반대분개대변전표번호 has 10000 (100.0%) missing values [Missing]
수량 is highly skewed (γ1 = 38.65718105) [Skewed]
단가 is highly skewed (γ1 = 82.57404186) [Skewed]
일련번호 has unique values [Unique]
차변전표번호 has unique values [Unique]
대변천표번호 has unique values [Unique]
포장단위구분 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
반대분개일자 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
반대분개구분(현금/통장) is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
반대분개차변전표번호 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
반대분개대변전표번호 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
수량 has 7922 (79.2%) zeros [Zeros]
단가 has 8443 (84.4%) zeros [Zeros]
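
The per-column missing counts in these alerts add up to the dataset-level "Missing cells" figure. The following sketch re-adds them in plain Python; the dictionary is transcribed by hand from the alerts, and the variable names are illustrative.

```python
# Missing-value counts transcribed from the [Missing] alerts.
missing_by_column = {
    "포장단위구분": 235,
    "수량": 191,
    "단가": 177,
    "적요": 9410,
    "세부항목": 9278,
    "세부항목명(세부항목테이블에없는경우)": 9278,
    "반대분개일자": 10000,
    "반대분개구분(현금/통장)": 10000,
    "반대분개차변전표번호": 10000,
    "반대분개대변전표번호": 10000,
}
N_OBSERVATIONS = 10_000

total_missing = sum(missing_by_column.values())

# Columns with no observed value at all; they carry no information here.
fully_missing = [
    name for name, count in missing_by_column.items()
    if count == N_OBSERVATIONS
]

print(total_missing)
print(fully_missing)
```

The total equals the 68569 missing cells reported in the dataset statistics, and the four 반대분개 columns account for 40000 of them.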

Reproduction

Analysis started: 2023-12-12 17:18:41.058393
Analysis finished: 2023-12-12 17:18:42.092302
Duration: 1.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

일련번호
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 50046.353
Minimum: 22
Maximum: 99593
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 22
5-th percentile: 5286.75
Q1: 25432.75
Median: 50128.5
Q3: 74478.75
95-th percentile: 94326.55
Maximum: 99593
Range: 99571
Interquartile range (IQR): 49046

Descriptive statistics

Standard deviation: 28503.538
Coefficient of variation (CV): 0.56954275
Kurtosis: -1.1820008
Mean: 50046.353
Median Absolute Deviation (MAD): 24489.5
Skewness: -0.018669721
Sum: 5.0046354 × 10^8
Variance: 8.1245166 × 10^8
Monotonicity: Not monotonic
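
Several of the statistics above are simple functions of the others. As a rough sketch (plain Python, values copied from this block, variable names illustrative), the range, interquartile range, and coefficient of variation can be recovered like this:

```python
# Values copied from the 일련번호 statistics.
minimum, maximum = 22, 99593
q1, q3 = 25432.75, 74478.75
mean, std = 50046.353, 28503.538

value_range = maximum - minimum   # Range = Maximum - Minimum
iqr = q3 - q1                     # IQR = Q3 - Q1
cv = std / mean                   # CV = standard deviation / mean

print(value_range, iqr, round(cv, 6))
```

The same relations hold for the other numeric variables in this report.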
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
61233 1
 
< 0.1%
32117 1
 
< 0.1%
84755 1
 
< 0.1%
35982 1
 
< 0.1%
4758 1
 
< 0.1%
62906 1
 
< 0.1%
76299 1
 
< 0.1%
87849 1
 
< 0.1%
99478 1
 
< 0.1%
94881 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
22 1
< 0.1%
30 1
< 0.1%
78 1
< 0.1%
94 1
< 0.1%
99 1
< 0.1%
122 1
< 0.1%
135 1
< 0.1%
143 1
< 0.1%
178 1
< 0.1%
184 1
< 0.1%
ValueCountFrequency (%)
99593 1
< 0.1%
99585 1
< 0.1%
99574 1
< 0.1%
99566 1
< 0.1%
99565 1
< 0.1%
99560 1
< 0.1%
99535 1
< 0.1%
99530 1
< 0.1%
99509 1
< 0.1%
99478 1
< 0.1%

수입/지출코드
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
O
5605 
I
4395 

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: I
2nd row: O
3rd row: I
4th row: I
5th row: O

Common Values

ValueCountFrequency (%)
O 5605
56.0%
I 4395
44.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
o 5605
56.0%
i 4395
44.0%

거래일자
Text

Distinct: 1872
Distinct (%): 18.7%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 10
Median length: 10
Mean length: 9.9998
Min length: 8

Characters and Unicode

Total characters: 99998
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
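
As a small illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The helper name `category_counts` is hypothetical, and the input is just the five sample values shown for this variable, not the full column.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over all characters in `values`."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts


# The five sample rows listed for this variable.
sample_dates = ["2016-06-19", "2016-03-06", "2017-09-16", "2017-11-03", "2015-04-22"]
counts = category_counts(sample_dates)

# "Nd" is Decimal Number and "Pd" is Dash Punctuation, the two
# categories the report lists for this column.
print(dict(counts))
```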

Unique

Unique: 199
Unique (%): 2.0%

Sample

1st row: 2016-06-19
2nd row: 2016-03-06
3rd row: 2017-09-16
4th row: 2017-11-03
5th row: 2015-04-22
ValueCountFrequency (%)
2015-11-24 27
 
0.3%
2016-06-27 20
 
0.2%
2017-06-19 19
 
0.2%
2016-05-23 18
 
0.2%
2016-05-02 18
 
0.2%
2017-05-29 18
 
0.2%
2015-06-23 17
 
0.2%
2016-04-06 17
 
0.2%
2017-08-25 17
 
0.2%
2016-02-16 17
 
0.2%
Other values (1862) 9812
98.1%

Most occurring characters

ValueCountFrequency (%)
0 22435
22.4%
- 19998
20.0%
1 18306
18.3%
2 15580
15.6%
6 4577
 
4.6%
7 4484
 
4.5%
5 4160
 
4.2%
4 3403
 
3.4%
3 3291
 
3.3%
8 1951
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 80000
80.0%
Dash Punctuation 19998
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 22435
28.0%
1 18306
22.9%
2 15580
19.5%
6 4577
 
5.7%
7 4484
 
5.6%
5 4160
 
5.2%
4 3403
 
4.3%
3 3291
 
4.1%
8 1951
 
2.4%
9 1813
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 19998
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99998
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 22435
22.4%
- 19998
20.0%
1 18306
18.3%
2 15580
15.6%
6 4577
 
4.6%
7 4484
 
4.5%
5 4160
 
4.2%
4 3403
 
3.4%
3 3291
 
3.3%
8 1951
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99998
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 22435
22.4%
- 19998
20.0%
1 18306
18.3%
2 15580
15.6%
6 4577
 
4.6%
7 4484
 
4.5%
5 4160
 
4.2%
4 3403
 
3.4%
3 3291
 
3.3%
8 1951
 
2.0%
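
The character counts for this date column (99998 characters, 80000 digits, 19998 dashes, minimum length 8) suggest that nearly every value has the form YYYY-MM-DD and that one value is written as eight digits without dashes. That reading is an inference from the totals, not something the report states. Under that assumption, a tolerant parser could look like the sketch below; `parse_transaction_date` is a hypothetical helper name.

```python
from datetime import date, datetime


def parse_transaction_date(text):
    """Parse 'YYYY-MM-DD' or 'YYYYMMDD'; return None if neither fits."""
    text = text.strip()
    for fmt in ("%Y-%m-%d", "%Y%m%d"):
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    return None


print(parse_transaction_date("2016-06-19"))
print(parse_transaction_date("20160619"))
```

Returning `None` instead of raising keeps unexpected values visible for later inspection rather than stopping the load.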

등록일시
DateTime

Distinct: 2393
Distinct (%): 23.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 1900-01-01 00:00:00
Maximum: 2018-02-01 14:56:00
Histogram with fixed size bins (bins=50)

수정일시
DateTime

Distinct: 2391
Distinct (%): 23.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 1900-01-01 00:00:00
Maximum: 2019-06-26 15:08:00
Histogram with fixed size bins (bins=50)
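
Both timestamp variables report a minimum of 1900-01-01 00:00:00 and no missing values, and many sample rows carry exactly that value. It may be a placeholder for "not recorded", although the report does not say so. Assuming that interpretation, a cleaning step could map the placeholder to a missing value before analysis; `clean_timestamp` and `SENTINEL` are hypothetical names, and the format string matches how the sample rows are displayed.

```python
from datetime import datetime

# Assumption: this value is a placeholder, not a real timestamp.
SENTINEL = datetime(1900, 1, 1)


def clean_timestamp(text, fmt="%Y-%m-%d %H:%M"):
    """Parse a timestamp string and return None for the placeholder value."""
    parsed = datetime.strptime(text, fmt)
    return None if parsed == SENTINEL else parsed


print(clean_timestamp("1900-01-01 00:00"))
print(clean_timestamp("2017-09-19 04:44"))
```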

상태
Boolean

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 87.9 KiB
False
10000 
ValueCountFrequency (%)
False 10000
100.0%

품목일련번호
Real number (ℝ)

Distinct: 277
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 105.5959
Minimum: 0
Maximum: 572
Zeros: 61
Zeros (%): 0.6%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 13
Q1: 29
Median: 62
Q3: 140
95-th percentile: 402
Maximum: 572
Range: 572
Interquartile range (IQR): 111

Descriptive statistics

Standard deviation: 117.15769
Coefficient of variation (CV): 1.1094909
Kurtosis: 2.4173272
Mean: 105.5959
Median Absolute Deviation (MAD): 36
Skewness: 1.7694584
Sum: 1055959
Variance: 13725.924
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
22 608
 
6.1%
35 515
 
5.1%
69 423
 
4.2%
29 341
 
3.4%
13 321
 
3.2%
66 304
 
3.0%
140 296
 
3.0%
26 260
 
2.6%
223 254
 
2.5%
34 234
 
2.3%
Other values (267) 6444
64.4%
ValueCountFrequency (%)
0 61
0.6%
1 11
 
0.1%
4 25
 
0.2%
5 80
0.8%
6 21
 
0.2%
7 112
1.1%
8 12
 
0.1%
9 3
 
< 0.1%
10 64
0.6%
11 1
 
< 0.1%
ValueCountFrequency (%)
572 2
 
< 0.1%
560 1
 
< 0.1%
559 1
 
< 0.1%
546 2
 
< 0.1%
542 1
 
< 0.1%
540 8
 
0.1%
535 21
0.2%
522 1
 
< 0.1%
520 2
 
< 0.1%
519 13
0.1%

종류계정과목
Real number (ℝ)

Distinct: 54
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 464.8317
Minimum: 402
Maximum: 556
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 402
5-th percentile: 406
Q1: 412
Median: 495
Q3: 503
95-th percentile: 506
Maximum: 556
Range: 154
Interquartile range (IQR): 91

Descriptive statistics

Standard deviation: 44.848914
Coefficient of variation (CV): 0.096484199
Kurtosis: -1.4342332
Mean: 464.8317
Median Absolute Deviation (MAD): 29
Skewness: -0.14237566
Sum: 4648317
Variance: 2011.4251
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
503 3767
37.7%
412 1058
 
10.6%
411 810
 
8.1%
495 628
 
6.3%
407 595
 
5.9%
453 446
 
4.5%
555 287
 
2.9%
402 284
 
2.8%
464 242
 
2.4%
413 230
 
2.3%
Other values (44) 1653
16.5%
ValueCountFrequency (%)
402 284
 
2.8%
403 6
 
0.1%
404 8
 
0.1%
405 188
 
1.9%
406 37
 
0.4%
407 595
5.9%
408 9
 
0.1%
409 5
 
0.1%
410 168
 
1.7%
411 810
8.1%
ValueCountFrequency (%)
556 10
 
0.1%
555 287
 
2.9%
554 7
 
0.1%
553 1
 
< 0.1%
552 4
 
< 0.1%
506 229
 
2.3%
505 85
 
0.9%
504 5
 
0.1%
503 3767
37.7%
495 628
 
6.3%

등급구분
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
<NA>
7786 
0
1826 
1
 
197
2
 
110
4
 
47

Length

Max length: 4
Median length: 4
Mean length: 3.3358
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 3
4th row: 0
5th row: <NA>

Common Values

ValueCountFrequency (%)
<NA> 7786
77.9%
0 1826
 
18.3%
1 197
 
2.0%
2 110
 
1.1%
4 47
 
0.5%
3 34
 
0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
na 7786
77.9%
0 1826
 
18.3%
1 197
 
2.0%
2 110
 
1.1%
4 47
 
0.5%
3 34
 
0.3%

포장단위금액
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
1
7563 
<NA>
2437 

Length

Max length: 4
Median length: 1
Mean length: 1.7311
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: <NA>
4th row: <NA>
5th row: 1

Common Values

ValueCountFrequency (%)
1 7563
75.6%
<NA> 2437
 
24.4%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
1 7563
75.6%
na 2437
 
24.4%

포장단위구분
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 235
Missing (%): 2.4%
Memory size: 156.2 KiB

수량
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct: 279
Distinct (%): 2.8%
Missing: 191
Missing (%): 1.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 38.056051
Minimum: 0
Maximum: 38000
Zeros: 7922
Zeros (%): 79.2%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 54.6
Maximum: 38000
Range: 38000
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 618.3393
Coefficient of variation (CV): 16.248121
Kurtosis: 1872.9
Mean: 38.056051
Median Absolute Deviation (MAD): 0
Skewness: 38.657181
Sum: 373291.8
Variance: 382343.49
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 7922
79.2%
1.0 127
 
1.3%
5.0 125
 
1.2%
10.0 105
 
1.1%
2.0 103
 
1.0%
4.0 76
 
0.8%
20.0 68
 
0.7%
6.0 60
 
0.6%
7.0 56
 
0.6%
15.0 54
 
0.5%
Other values (269) 1113
 
11.1%
(Missing) 191
 
1.9%
ValueCountFrequency (%)
0.0 7922
79.2%
0.1 2
 
< 0.1%
0.2 1
 
< 0.1%
0.5 1
 
< 0.1%
0.8 1
 
< 0.1%
1.0 127
 
1.3%
1.4 1
 
< 0.1%
1.9 1
 
< 0.1%
2.0 103
 
1.0%
3.0 53
 
0.5%
ValueCountFrequency (%)
38000.0 1
 
< 0.1%
20000.0 3
< 0.1%
15900.0 1
 
< 0.1%
12500.0 1
 
< 0.1%
12000.0 1
 
< 0.1%
10000.0 1
 
< 0.1%
9000.0 2
< 0.1%
6378.0 1
 
< 0.1%
5100.0 1
 
< 0.1%
4200.0 2
< 0.1%

단가
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct: 289
Distinct (%): 2.9%
Missing: 177
Missing (%): 1.8%
Infinite: 0
Infinite (%): 0.0%
Mean: 6145.1008
Minimum: 0
Maximum: 24000000
Zeros: 8443
Zeros (%): 84.4%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 10000
Maximum: 24000000
Range: 24000000
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 263381.54
Coefficient of variation (CV): 42.86041
Kurtosis: 7224.8103
Mean: 6145.1008
Median Absolute Deviation (MAD): 0
Skewness: 82.574042
Sum: 60363325
Variance: 6.9369834 × 10^10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 8443
84.4%
10000 150
 
1.5%
7000 55
 
0.5%
2000 52
 
0.5%
15000 43
 
0.4%
20000 43
 
0.4%
5000 39
 
0.4%
6000 37
 
0.4%
3000 37
 
0.4%
12000 34
 
0.3%
Other values (279) 890
 
8.9%
(Missing) 177
 
1.8%
ValueCountFrequency (%)
0 8443
84.4%
1 24
 
0.2%
2 13
 
0.1%
3 1
 
< 0.1%
4 4
 
< 0.1%
6 2
 
< 0.1%
7 1
 
< 0.1%
8 1
 
< 0.1%
10 1
 
< 0.1%
11 1
 
< 0.1%
ValueCountFrequency (%)
24000000 1
< 0.1%
10000000 1
< 0.1%
1200000 1
< 0.1%
800000 2
< 0.1%
590000 1
< 0.1%
582500 1
< 0.1%
549000 1
< 0.1%
500000 1
< 0.1%
463800 1
< 0.1%
450000 1
< 0.1%
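
For 수량 and 단가, the reported means are consistent with dividing the sum by the number of non-missing rows rather than by all 10000 observations. The sketch below checks this with the figures from the two variable blocks; it is a consistency check only, and the helper name `mean_over_observed` is hypothetical.

```python
N_OBSERVATIONS = 10_000


def mean_over_observed(total, n_missing, n_rows=N_OBSERVATIONS):
    """Mean computed over rows that actually have a value."""
    return total / (n_rows - n_missing)


# (sum, missing count) copied from the 수량 and 단가 statistics.
quantity_mean = mean_over_observed(373291.8, 191)
unit_price_mean = mean_over_observed(60363325, 177)

print(round(quantity_mean, 4), round(unit_price_mean, 4))
```

With about 80% zeros in each column, the median stays at 0 while a handful of very large values pull the mean upward, which matches the high skewness reported for both variables.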

수입/지출구분코드
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
1
9284 
2
 
389
3
 
327

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

ValueCountFrequency (%)
1 9284
92.8%
2 389
 
3.9%
3 327
 
3.3%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
1 9284
92.8%
2 389
 
3.9%
3 327
 
3.3%
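
The imbalance percentages in the alerts (72.0% for 수입/지출구분코드, 62.2% for 등급구분) are consistent with one minus the Shannon entropy of the class frequencies, normalised by the maximum entropy for that number of classes. The sketch below assumes that definition; `imbalance_score` is an illustrative name, not a claim about the profiling library's internals.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for k observed classes (0 = balanced, 1 = one class)."""
    counts = [c for c in counts if c > 0]
    if len(counts) <= 1:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts)
    return 1 - entropy / math.log2(len(counts))


# Class counts copied from the "Common Values" tables.
income_expense_class = imbalance_score([9284, 389, 327])
grade_class = imbalance_score([7786, 1826, 197, 110, 47, 34])

print(f"{100 * income_expense_class:.1f}%  {100 * grade_class:.1f}%")
```

Note that the 등급구분 counts include the literal `<NA>` category, which the report treats as a value rather than as missing.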

차변전표번호
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 664841.14
Minimum: 568494
Maximum: 764340
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 568494
5-th percentile: 578615.5
Q1: 616657.5
Median: 664021
Q3: 711637.5
95-th percentile: 753321.1
Maximum: 764340
Range: 195846
Interquartile range (IQR): 94980

Descriptive statistics

Standard deviation: 55771.33
Coefficient of variation (CV): 0.083886702
Kurtosis: -1.1700978
Mean: 664841.14
Median Absolute Deviation (MAD): 47437
Skewness: 0.02886503
Sum: 6.6484114 × 10^9
Variance: 3.1104413 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
685798 1
 
< 0.1%
629310 1
 
< 0.1%
733579 1
 
< 0.1%
636668 1
 
< 0.1%
577600 1
 
< 0.1%
688966 1
 
< 0.1%
715394 1
 
< 0.1%
739847 1
 
< 0.1%
764083 1
 
< 0.1%
754466 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
568494 1
< 0.1%
568510 1
< 0.1%
568594 1
< 0.1%
568618 1
< 0.1%
568628 1
< 0.1%
568664 1
< 0.1%
568690 1
< 0.1%
568704 1
< 0.1%
568770 1
< 0.1%
568782 1
< 0.1%
ValueCountFrequency (%)
764340 1
< 0.1%
764319 1
< 0.1%
764297 1
< 0.1%
764271 1
< 0.1%
764269 1
< 0.1%
764259 1
< 0.1%
764207 1
< 0.1%
764197 1
< 0.1%
764155 1
< 0.1%
764083 1
< 0.1%

대변천표번호
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 664842.14
Minimum: 568495
Maximum: 764341
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 568495
5-th percentile: 578616.5
Q1: 616658.5
Median: 664022
Q3: 711638.5
95-th percentile: 753322.1
Maximum: 764341
Range: 195846
Interquartile range (IQR): 94980

Descriptive statistics

Standard deviation: 55771.33
Coefficient of variation (CV): 0.083886576
Kurtosis: -1.1700978
Mean: 664842.14
Median Absolute Deviation (MAD): 47437
Skewness: 0.02886503
Sum: 6.6484214 × 10^9
Variance: 3.1104413 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
685799 1
 
< 0.1%
629311 1
 
< 0.1%
733580 1
 
< 0.1%
636669 1
 
< 0.1%
577601 1
 
< 0.1%
688967 1
 
< 0.1%
715395 1
 
< 0.1%
739848 1
 
< 0.1%
764084 1
 
< 0.1%
754467 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
568495 1
< 0.1%
568511 1
< 0.1%
568595 1
< 0.1%
568619 1
< 0.1%
568629 1
< 0.1%
568665 1
< 0.1%
568691 1
< 0.1%
568705 1
< 0.1%
568771 1
< 0.1%
568783 1
< 0.1%
ValueCountFrequency (%)
764341 1
< 0.1%
764320 1
< 0.1%
764298 1
< 0.1%
764272 1
< 0.1%
764270 1
< 0.1%
764260 1
< 0.1%
764208 1
< 0.1%
764198 1
< 0.1%
764156 1
< 0.1%
764084 1
< 0.1%
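
Every location statistic of 대변천표번호 is exactly 1 greater than the corresponding statistic of 차변전표번호, while the spread statistics are identical. This suggests, but does not prove, that each row's credit slip number is its debit slip number plus one. The sketch below compares the reported statistics and a few value pairs taken from the frequency tables; `is_consecutive` is a hypothetical helper for checking raw rows.

```python
# Location statistics copied from the two variable blocks.
debit_stats = {"min": 568494, "q1": 616657.5, "median": 664021,
               "q3": 711637.5, "max": 764340, "mean": 664841.14}
credit_stats = {"min": 568495, "q1": 616658.5, "median": 664022,
                "q3": 711638.5, "max": 764341, "mean": 664842.14}

offsets = {key: round(credit_stats[key] - debit_stats[key], 2) for key in debit_stats}
all_offset_by_one = all(value == 1 for value in offsets.values())


def is_consecutive(pairs):
    """True if every (debit, credit) pair satisfies credit == debit + 1."""
    return all(credit == debit + 1 for debit, credit in pairs)


# First entries of the two frequency tables, paired by position.
listed_pairs = [(685798, 685799), (629310, 629311), (733579, 733580)]

print(all_offset_by_one, is_consecutive(listed_pairs))
```

If the relation holds for all rows, one of the two columns is redundant for modelling purposes.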

적요
Text

MISSING 

Distinct: 559
Distinct (%): 94.7%
Missing: 9410
Missing (%): 94.1%
Memory size: 156.2 KiB

Length

Max length: 155
Median length: 71
Mean length: 22.19322
Min length: 1

Characters and Unicode

Total characters: 13094
Distinct characters: 562
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 541
Unique (%): 91.7%

Sample

1st row: 각관 6개등
2nd row: 0.042361111
3rd row: 식대
4th row: 세면기 외부풒
5th row: [Web발신][일시불]13,230원NH농협카드(8*2*)박*규 님06/25 10:50누계 1,914,143원옥션_Smile Pay
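
For this sparsely populated text column, the Distinct (%), Unique (%), and Mean length figures are consistent with using the 590 non-missing rows as the denominator rather than all 10000 observations. The sketch below reproduces them from the reported counts; it is a reading of the numbers, and the variable names are illustrative.

```python
N_OBSERVATIONS = 10_000
MISSING = 9_410

# Counts copied from the 적요 statistics.
distinct, unique, total_characters = 559, 541, 13_094

non_missing = N_OBSERVATIONS - MISSING
distinct_pct = 100 * distinct / non_missing
unique_pct = 100 * unique / non_missing
mean_length = total_characters / non_missing

print(non_missing, round(distinct_pct, 1), round(unique_pct, 1), round(mean_length, 5))
```

Read this way, a Distinct (%) of 94.7% means that almost every remark that was actually entered is different from the others.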
ValueCountFrequency (%)
web발신]농협 50
 
3.0%
413099-56-****00 40
 
2.4%
21
 
1.3%
m 16
 
1.0%
web발신]현대카드 15
 
0.9%
web발신]로컬푸드 13
 
0.8%
판매금액 13
 
0.8%
옥션옥션 12
 
0.7%
356-****-5545-13 12
 
0.7%
택배 9
 
0.5%
Other values (1266) 1439
87.7%

Most occurring characters

ValueCountFrequency (%)
1267
 
9.7%
0 1203
 
9.2%
1 586
 
4.5%
* 394
 
3.0%
2 394
 
3.0%
, 389
 
3.0%
5 351
 
2.7%
4 322
 
2.5%
3 301
 
2.3%
289
 
2.2%
Other values (552) 7598
58.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5351
40.9%
Decimal Number 4013
30.6%
Space Separator 1267
 
9.7%
Other Punctuation 1108
 
8.5%
Lowercase Letter 382
 
2.9%
Uppercase Letter 247
 
1.9%
Open Punctuation 222
 
1.7%
Close Punctuation 221
 
1.7%
Dash Punctuation 213
 
1.6%
Math Symbol 62
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
289
 
5.4%
163
 
3.0%
146
 
2.7%
113
 
2.1%
100
 
1.9%
90
 
1.7%
89
 
1.7%
79
 
1.5%
79
 
1.5%
74
 
1.4%
Other values (489) 4129
77.2%
Lowercase Letter
ValueCountFrequency (%)
e 140
36.6%
b 135
35.3%
k 18
 
4.7%
g 15
 
3.9%
m 14
 
3.7%
x 13
 
3.4%
a 9
 
2.4%
l 8
 
2.1%
i 7
 
1.8%
y 6
 
1.6%
Other values (7) 17
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
W 136
55.1%
N 28
 
11.3%
H 27
 
10.9%
M 18
 
7.3%
S 9
 
3.6%
L 6
 
2.4%
P 6
 
2.4%
K 5
 
2.0%
B 3
 
1.2%
A 3
 
1.2%
Other values (3) 6
 
2.4%
Other Punctuation
ValueCountFrequency (%)
* 394
35.6%
, 389
35.1%
/ 136
 
12.3%
: 121
 
10.9%
. 57
 
5.1%
@ 5
 
0.5%
2
 
0.2%
% 1
 
0.1%
# 1
 
0.1%
& 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
0 1203
30.0%
1 586
14.6%
2 394
 
9.8%
5 351
 
8.7%
4 322
 
8.0%
3 301
 
7.5%
6 244
 
6.1%
9 242
 
6.0%
8 196
 
4.9%
7 174
 
4.3%
Math Symbol
ValueCountFrequency (%)
× 25
40.3%
> 21
33.9%
= 9
 
14.5%
~ 4
 
6.5%
+ 3
 
4.8%
Open Punctuation
ValueCountFrequency (%)
[ 155
69.8%
( 67
30.2%
Close Punctuation
ValueCountFrequency (%)
] 155
70.1%
) 66
29.9%
Space Separator
ValueCountFrequency (%)
1267
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 213
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7114
54.3%
Hangul 5351
40.9%
Latin 629
 
4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
289
 
5.4%
163
 
3.0%
146
 
2.7%
113
 
2.1%
100
 
1.9%
90
 
1.7%
89
 
1.7%
79
 
1.5%
79
 
1.5%
74
 
1.4%
Other values (489) 4129
77.2%
Common
ValueCountFrequency (%)
1267
17.8%
0 1203
16.9%
1 586
 
8.2%
* 394
 
5.5%
2 394
 
5.5%
, 389
 
5.5%
5 351
 
4.9%
4 322
 
4.5%
3 301
 
4.2%
6 244
 
3.4%
Other values (23) 1663
23.4%
Latin
ValueCountFrequency (%)
e 140
22.3%
W 136
21.6%
b 135
21.5%
N 28
 
4.5%
H 27
 
4.3%
k 18
 
2.9%
M 18
 
2.9%
g 15
 
2.4%
m 14
 
2.2%
x 13
 
2.1%
Other values (20) 85
13.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7716
58.9%
Hangul 5344
40.8%
None 27
 
0.2%
Compat Jamo 7
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1267
16.4%
0 1203
15.6%
1 586
 
7.6%
* 394
 
5.1%
2 394
 
5.1%
, 389
 
5.0%
5 351
 
4.5%
4 322
 
4.2%
3 301
 
3.9%
6 244
 
3.2%
Other values (51) 2265
29.4%
Hangul
ValueCountFrequency (%)
289
 
5.4%
163
 
3.1%
146
 
2.7%
113
 
2.1%
100
 
1.9%
90
 
1.7%
89
 
1.7%
79
 
1.5%
79
 
1.5%
74
 
1.4%
Other values (487) 4122
77.1%
None
ValueCountFrequency (%)
× 25
92.6%
2
 
7.4%
Compat Jamo
ValueCountFrequency (%)
6
85.7%
1
 
14.3%

세부항목
Real number (ℝ)

MISSING 

Distinct: 288
Distinct (%): 39.9%
Missing: 9278
Missing (%): 92.8%
Infinite: 0
Infinite (%): 0.0%
Mean: 3266.205
Minimum: 25
Maximum: 6357
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 25
5-th percentile: 343.1
Q1: 1806
Median: 3782
Q3: 4639
95-th percentile: 5561.45
Maximum: 6357
Range: 6332
Interquartile range (IQR): 2833

Descriptive statistics

Standard deviation: 1625.135
Coefficient of variation (CV): 0.49756063
Kurtosis: -1.0264399
Mean: 3266.205
Median Absolute Deviation (MAD): 956
Skewness: -0.38176255
Sum: 2358200
Variance: 2641063.8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4639 176
 
1.8%
1806 47
 
0.5%
2468 15
 
0.1%
2825 14
 
0.1%
2826 10
 
0.1%
62 10
 
0.1%
1665 9
 
0.1%
1568 8
 
0.1%
2827 7
 
0.1%
1253 6
 
0.1%
Other values (278) 420
 
4.2%
(Missing) 9278
92.8%
ValueCountFrequency (%)
25 1
 
< 0.1%
51 5
0.1%
62 10
0.1%
78 2
 
< 0.1%
82 1
 
< 0.1%
85 1
 
< 0.1%
87 1
 
< 0.1%
90 1
 
< 0.1%
91 1
 
< 0.1%
129 1
 
< 0.1%
ValueCountFrequency (%)
6357 1
< 0.1%
6320 1
< 0.1%
6304 1
< 0.1%
6301 1
< 0.1%
6149 1
< 0.1%
6148 1
< 0.1%
6139 2
< 0.1%
6047 1
< 0.1%
6035 1
< 0.1%
6034 1
< 0.1%

세부항목명(세부항목테이블에없는경우)
Text

Distinct: 277
Distinct (%): 38.4%
Missing: 9278
Missing (%): 92.8%
Memory size: 156.2 KiB

Length

Max length: 35
Median length: 20
Mean length: 4.966759
Min length: 1

Characters and Unicode

Total characters: 3586
Distinct characters: 324
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 197
Unique (%): 27.3%

Sample

1st row: 친지(이웃) 제공
2nd row: 인터넷 및 통신비
3rd row: 무농약건대추
4th row: 생표고 중/10000원
5th row: 건고추 판매(화근)
ValueCountFrequency (%)
오디판매 176
 
19.8%
소매 47
 
5.3%
생표고 16
 
1.8%
개인용돈 15
 
1.7%
유류비 14
 
1.6%
와송분말 14
 
1.6%
판매 11
 
1.2%
10
 
1.1%
사과판매 10
 
1.1%
10
 
1.1%
Other values (333) 564
63.6%

Most occurring characters

ValueCountFrequency (%)
282
 
7.9%
236
 
6.6%
197
 
5.5%
190
 
5.3%
183
 
5.1%
0 109
 
3.0%
75
 
2.1%
61
 
1.7%
55
 
1.5%
( 51
 
1.4%
Other values (314) 2147
59.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2910
81.1%
Decimal Number 242
 
6.7%
Space Separator 183
 
5.1%
Lowercase Letter 77
 
2.1%
Open Punctuation 51
 
1.4%
Close Punctuation 51
 
1.4%
Other Punctuation 39
 
1.1%
Uppercase Letter 19
 
0.5%
Dash Punctuation 10
 
0.3%
Math Symbol 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
282
 
9.7%
236
 
8.1%
197
 
6.8%
190
 
6.5%
75
 
2.6%
61
 
2.1%
55
 
1.9%
50
 
1.7%
49
 
1.7%
45
 
1.5%
Other values (280) 1670
57.4%
Decimal Number
ValueCountFrequency (%)
0 109
45.0%
1 46
19.0%
2 23
 
9.5%
5 23
 
9.5%
7 16
 
6.6%
3 9
 
3.7%
4 9
 
3.7%
6 5
 
2.1%
8 2
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
A 5
26.3%
B 4
21.1%
H 3
15.8%
R 2
 
10.5%
K 1
 
5.3%
M 1
 
5.3%
P 1
 
5.3%
V 1
 
5.3%
F 1
 
5.3%
Lowercase Letter
ValueCountFrequency (%)
g 35
45.5%
k 21
27.3%
m 13
 
16.9%
e 5
 
6.5%
y 1
 
1.3%
l 1
 
1.3%
c 1
 
1.3%
Other Punctuation
ValueCountFrequency (%)
, 17
43.6%
/ 14
35.9%
. 8
20.5%
Math Symbol
ValueCountFrequency (%)
~ 3
75.0%
× 1
 
25.0%
Space Separator
ValueCountFrequency (%)
183
100.0%
Open Punctuation
ValueCountFrequency (%)
( 51
100.0%
Close Punctuation
ValueCountFrequency (%)
) 51
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2910
81.1%
Common 580
 
16.2%
Latin 96
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
282
 
9.7%
236
 
8.1%
197
 
6.8%
190
 
6.5%
75
 
2.6%
61
 
2.1%
55
 
1.9%
50
 
1.7%
49
 
1.7%
45
 
1.5%
Other values (280) 1670
57.4%
Common
ValueCountFrequency (%)
183
31.6%
0 109
18.8%
( 51
 
8.8%
) 51
 
8.8%
1 46
 
7.9%
2 23
 
4.0%
5 23
 
4.0%
, 17
 
2.9%
7 16
 
2.8%
/ 14
 
2.4%
Other values (8) 47
 
8.1%
Latin
ValueCountFrequency (%)
g 35
36.5%
k 21
21.9%
m 13
 
13.5%
A 5
 
5.2%
e 5
 
5.2%
B 4
 
4.2%
H 3
 
3.1%
R 2
 
2.1%
K 1
 
1.0%
M 1
 
1.0%
Other values (6) 6
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2910
81.1%
ASCII 675
 
18.8%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
282
 
9.7%
236
 
8.1%
197
 
6.8%
190
 
6.5%
75
 
2.6%
61
 
2.1%
55
 
1.9%
50
 
1.7%
49
 
1.7%
45
 
1.5%
Other values (280) 1670
57.4%
ASCII
ValueCountFrequency (%)
183
27.1%
0 109
16.1%
( 51
 
7.6%
) 51
 
7.6%
1 46
 
6.8%
g 35
 
5.2%
2 23
 
3.4%
5 23
 
3.4%
k 21
 
3.1%
, 17
 
2.5%
Other values (23) 116
17.2%
None
ValueCountFrequency (%)
× 1
100.0%

반대분개일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

반대분개구분(현금/통장)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

반대분개차변전표번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

반대분개대변전표번호
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 10000
Missing (%): 100.0%
Memory size: 166.0 KiB

Sample

일련번호 | 수입/지출코드 | 거래일자 | 등록일시 | 수정일시 | 상태 | 품목일련번호 | 종류계정과목 | 등급구분 | 포장단위금액 | 포장단위구분 | 수량 | 단가 | 수입/지출구분코드 | 차변전표번호 | 대변천표번호 | 적요 | 세부항목 | 세부항목명(세부항목테이블에없는경우) | 반대분개일자 | 반대분개구분(현금/통장) | 반대분개차변전표번호 | 반대분개대변전표번호
5842661233I2016-06-191900-01-01 00:001900-01-01 00:00N140503<NA>110.001685798685799<NA><NA><NA><NA><NA><NA><NA>
4842451011O2016-03-061900-01-01 00:001900-01-01 00:00N140411<NA>110.001665750665751<NA><NA><NA><NA><NA><NA><NA>
8710590924I2017-09-162017-09-19 04:442017-09-19 04:44N695043<NA>120.0150001746246746247<NA><NA><NA><NA><NA><NA><NA>
8996293965I2017-11-032017-11-07 08:182017-11-07 08:18N45030<NA>77.010001752456752457<NA><NA><NA><NA><NA><NA><NA>
2688728502O2015-04-221900-01-01 00:001900-01-01 00:00N119411<NA>110.001622360622361<NA><NA><NA><NA><NA><NA><NA>
1537316288I2014-06-041900-01-01 00:001900-01-01 00:00N45555<NA>110.001599268599269<NA>1707친지(이웃) 제공<NA><NA><NA><NA>
3900941303O2015-11-281900-01-01 00:001900-01-01 00:00N69495<NA>110.001646790646791<NA>2628인터넷 및 통신비<NA><NA><NA><NA>
6180164734I2016-09-211900-01-01 00:001900-01-01 00:00N62503<NA>110.001692562692563<NA><NA><NA><NA><NA><NA><NA>
1187212505I2014-01-201900-01-01 00:001900-01-01 00:00N76503<NA>110.001592232592233<NA>219무농약건대추<NA><NA><NA><NA>
1845519507I2014-08-311900-01-01 00:001900-01-01 00:00N56503<NA>110.001605460605461<NA><NA><NA><NA><NA><NA><NA>
일련번호 | 수입/지출코드 | 거래일자 | 등록일시 | 수정일시 | 상태 | 품목일련번호 | 종류계정과목 | 등급구분 | 포장단위금액 | 포장단위구분 | 수량 | 단가 | 수입/지출구분코드 | 차변전표번호 | 대변천표번호 | 적요 | 세부항목 | 세부항목명(세부항목테이블에없는경우) | 반대분개일자 | 반대분개구분(현금/통장) | 반대분개차변전표번호 | 반대분개대변전표번호
3488236913I2015-09-161900-01-01 00:001900-01-01 00:00N29503<NA>110.001638392638393<NA><NA><NA><NA><NA><NA><NA>
1112611711O2014-02-041900-01-01 00:001900-01-01 00:00N124495<NA>110.001590734590735<NA><NA><NA><NA><NA><NA><NA>
8757391409O2017-09-252017-09-26 08:342017-09-26 08:34N254120<NA>10.001747224747225동네분 여자 1명<NA><NA><NA><NA><NA><NA>
3553237593O2015-10-191900-01-01 00:001900-01-01 00:00N22453<NA>110.001639692639693<NA><NA><NA><NA><NA><NA><NA>
7282176071O2017-03-122017-03-21 08:402017-03-21 08:40N304020<NA>10.001714868714869<NA><NA><NA><NA><NA><NA><NA>
3134833164O2015-07-151900-01-01 00:001900-01-01 00:00N56411<NA>110.001631288631289<NA><NA><NA><NA><NA><NA><NA>
1202812679O2014-03-081900-01-01 00:001900-01-01 00:00N53407<NA>110.001592544592545<NA><NA><NA><NA><NA><NA><NA>
4751150072I2016-03-011900-01-01 00:001900-01-01 00:00N21503<NA>119.090003663912663913<NA><NA><NA><NA><NA><NA><NA>
59406198O2013-07-281900-01-01 00:001900-01-01 00:00N39462<NA>110.001580348580349<NA><NA><NA><NA><NA><NA><NA>
4200044407O2015-09-201900-01-01 00:001900-01-01 00:00N45495<NA>110.003652798652799<NA><NA><NA><NA><NA><NA><NA>