Overview

Dataset statistics

Number of variables5
Number of observations6052
Missing cells6054
Missing cells (%)20.0%
Duplicate rows1131
Duplicate rows (%)18.7%
Total size in memory242.4 KiB
Average record size in memory41.0 B

Variable types

DateTime3
Unsupported1
Text1

Dataset

Description충청남도 청양군의 농기계임대사업관리시스템의 수리정보에 관한 데이터로 시작일자, 만료일자, 작업명, 시작위치, 만료위치에 관한 데이터를 나타냅니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=330&beforeMenuCd=DOM_000000201001001000&publicdatapk=15089545

Alerts

Dataset has 1131 (18.7%) duplicate rowsDuplicates
만료일자 has 6052 (100.0%) missing valuesMissing
만료일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-01-09 19:49:54.230018
Analysis finished2024-01-09 19:49:54.603880
Duration0.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1475
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size47.4 KiB
Minimum2011-02-10 00:00:00
Maximum2022-09-16 00:00:00
2024-01-10T04:49:54.656311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:49:54.760702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

만료일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing6052
Missing (%)100.0%
Memory size53.3 KiB
Distinct484
Distinct (%)8.0%
Missing2
Missing (%)< 0.1%
Memory size47.4 KiB
2024-01-10T04:49:54.980677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length47
Median length5
Mean length5.4530579
Min length2

Characters and Unicode

Total characters32991
Distinct characters311
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique366 ?
Unique (%)6.0%

Sample

1st row[임대보류]
2nd row[임대보류]
3rd row[임대보류]
4th row[임대보류]
5th row[임대보류]
ValueCountFrequency (%)
운용중 2544
37.1%
수리중 2533
37.0%
마모 178
 
2.6%
파손 101
 
1.5%
폐기대상 74
 
1.1%
교체 69
 
1.0%
칼날마모 59
 
0.9%
칼날 49
 
0.7%
29
 
0.4%
불량 28
 
0.4%
Other values (489) 1185
17.3%
2024-01-10T04:49:55.309850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
[ 5292
16.0%
] 5292
16.0%
5174
15.7%
2719
8.2%
2626
8.0%
2595
7.9%
2584
7.8%
812
 
2.5%
373
 
1.1%
361
 
1.1%
Other values (301) 5163
15.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21388
64.8%
Open Punctuation 5309
 
16.1%
Close Punctuation 5309
 
16.1%
Space Separator 812
 
2.5%
Other Punctuation 91
 
0.3%
Decimal Number 64
 
0.2%
Uppercase Letter 10
 
< 0.1%
Lowercase Letter 7
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5174
24.2%
2719
12.7%
2626
12.3%
2595
12.1%
2584
12.1%
373
 
1.7%
361
 
1.7%
307
 
1.4%
197
 
0.9%
188
 
0.9%
Other values (272) 4264
19.9%
Decimal Number
ValueCountFrequency (%)
4 14
21.9%
1 14
21.9%
2 11
17.2%
3 7
10.9%
7 7
10.9%
8 3
 
4.7%
6 3
 
4.7%
0 3
 
4.7%
5 2
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
L 2
20.0%
V 2
20.0%
B 2
20.0%
T 1
10.0%
O 1
10.0%
P 1
10.0%
A 1
10.0%
Lowercase Letter
ValueCountFrequency (%)
v 4
57.1%
o 1
 
14.3%
t 1
 
14.3%
p 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
, 75
82.4%
. 11
 
12.1%
/ 5
 
5.5%
Open Punctuation
ValueCountFrequency (%)
[ 5292
99.7%
( 17
 
0.3%
Close Punctuation
ValueCountFrequency (%)
] 5292
99.7%
) 17
 
0.3%
Space Separator
ValueCountFrequency (%)
812
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21388
64.8%
Common 11586
35.1%
Latin 17
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5174
24.2%
2719
12.7%
2626
12.3%
2595
12.1%
2584
12.1%
373
 
1.7%
361
 
1.7%
307
 
1.4%
197
 
0.9%
188
 
0.9%
Other values (272) 4264
19.9%
Common
ValueCountFrequency (%)
[ 5292
45.7%
] 5292
45.7%
812
 
7.0%
, 75
 
0.6%
) 17
 
0.1%
( 17
 
0.1%
4 14
 
0.1%
1 14
 
0.1%
2 11
 
0.1%
. 11
 
0.1%
Other values (8) 31
 
0.3%
Latin
ValueCountFrequency (%)
v 4
23.5%
L 2
11.8%
V 2
11.8%
B 2
11.8%
T 1
 
5.9%
O 1
 
5.9%
P 1
 
5.9%
o 1
 
5.9%
t 1
 
5.9%
p 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21388
64.8%
ASCII 11603
35.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
[ 5292
45.6%
] 5292
45.6%
812
 
7.0%
, 75
 
0.6%
) 17
 
0.1%
( 17
 
0.1%
4 14
 
0.1%
1 14
 
0.1%
2 11
 
0.1%
. 11
 
0.1%
Other values (19) 48
 
0.4%
Hangul
ValueCountFrequency (%)
5174
24.2%
2719
12.7%
2626
12.3%
2595
12.1%
2584
12.1%
373
 
1.7%
361
 
1.7%
307
 
1.4%
197
 
0.9%
188
 
0.9%
Other values (272) 4264
19.9%
Distinct1475
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size47.4 KiB
Minimum2011-02-10 00:00:00
Maximum2022-09-16 00:00:00
2024-01-10T04:49:55.417817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:49:55.524004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1475
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size47.4 KiB
Minimum2011-02-10 00:00:00
Maximum2022-09-16 00:00:00
2024-01-10T04:49:55.627297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:49:55.729318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2024-01-10T04:49:54.495754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T04:49:54.570430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시작일자만료일자작업명시작위치만료위치
02011-02-10<NA>[임대보류]2011-02-102011-02-10
12011-02-10<NA>[임대보류]2011-02-102011-02-10
22011-02-10<NA>[임대보류]2011-02-102011-02-10
32011-02-10<NA>[임대보류]2011-02-102011-02-10
42011-02-10<NA>[임대보류]2011-02-102011-02-10
52011-02-10<NA>[임대보류]2011-02-102011-02-10
62011-02-10<NA>[임대보류]2011-02-102011-02-10
72011-02-10<NA>[임대보류]2011-02-102011-02-10
82011-02-10<NA>[임대보류]2011-02-102011-02-10
92011-04-14<NA>[운용중]2011-04-142011-04-14
시작일자만료일자작업명시작위치만료위치
60422022-08-25<NA>[수리중] 450LA 밸트 2개 교체2022-08-252022-08-25
60432022-08-25<NA>[운용중]2022-08-252022-08-25
60442022-08-26<NA>[수리중]2022-08-262022-08-26
60452022-09-06<NA>[운용중]2022-09-062022-09-06
60462022-09-08<NA>[수리중]2022-09-082022-09-08
60472022-09-08<NA>[수리중]2022-09-082022-09-08
60482022-09-08<NA>[운용중]2022-09-082022-09-08
60492022-09-14<NA>[운용중]2022-09-142022-09-14
60502022-09-15<NA>[수리중]2022-09-152022-09-15
60512022-09-16<NA>[운용중]2022-09-162022-09-16

Duplicate rows

Most frequently occurring

시작일자작업명시작위치만료위치# duplicates
1642014-12-19[수리중]2014-12-192014-12-1979
2702015-12-29[운용중]2015-12-292015-12-2974
10402021-09-07[폐기대상]2021-09-072021-09-0773
1892015-03-02[운용중]2015-03-022015-03-0250
10812022-03-11[운용중]2022-03-112022-03-1150
3652016-06-20[수리중]2016-06-202016-06-2039
10492021-10-18[운용중]2021-10-182021-10-1828
10422021-09-27[운용중]2021-09-272021-09-2727
302012-09-20[운용중]2012-09-202012-09-2025
8722019-05-17[수리중]2019-05-172019-05-1725