Overview

Dataset statistics

Number of variables5
Number of observations2381
Missing cells0
Missing cells (%)0.0%
Duplicate rows376
Duplicate rows (%)15.8%
Total size in memory95.5 KiB
Average record size in memory41.1 B

Variable types

DateTime1
Text2
Numeric1
Categorical1

Dataset

Description시흥도시공사에서 보유한 자산현황입니다. 공기구를 비롯한 자산으로 등록된 각종 물품내역이 존재합니다.각 자산의 취득일자, 자산명, 품목, 취득원가, 잔존가액, 처분현황이 기재되어 있습니다.
Author시흥도시공사
URLhttps://www.data.go.kr/data/15098885/fileData.do

Alerts

Dataset has 376 (15.8%) duplicate rowsDuplicates
처분현황 is highly imbalanced (61.4%)Imbalance

Reproduction

Analysis started2024-03-15 00:00:45.450979
Analysis finished2024-03-15 00:00:47.043202
Duration1.59 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct425
Distinct (%)17.8%
Missing0
Missing (%)0.0%
Memory size18.7 KiB
Minimum2005-10-01 00:00:00
Maximum2023-12-06 00:00:00
2024-03-15T09:00:47.234423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T09:00:47.929440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct417
Distinct (%)17.5%
Missing0
Missing (%)0.0%
Memory size18.7 KiB
2024-03-15T09:00:49.004930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length21
Mean length6.1045779
Min length2

Characters and Unicode

Total characters14535
Distinct characters422
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique190 ?
Unique (%)8.0%

Sample

1st row쓰레기수거용트럭
2nd row라미네이터
3rd row자바라컨베이어
4th row책상
5th row책상
ValueCountFrequency (%)
lcd패널또는모니터 131
 
5.0%
데스크톱컴퓨터 124
 
4.7%
작업용의자 102
 
3.9%
모니터 89
 
3.4%
의자 74
 
2.8%
차량용운행기록계 73
 
2.8%
책상 57
 
2.2%
냉난방기 56
 
2.1%
방화벽장치 50
 
1.9%
태블릿컴퓨터 48
 
1.8%
Other values (445) 1827
69.4%
2024-03-15T09:00:50.517882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
613
 
4.2%
509
 
3.5%
450
 
3.1%
339
 
2.3%
329
 
2.3%
315
 
2.2%
288
 
2.0%
258
 
1.8%
250
 
1.7%
231
 
1.6%
Other values (412) 10953
75.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13270
91.3%
Uppercase Letter 580
 
4.0%
Space Separator 250
 
1.7%
Decimal Number 157
 
1.1%
Lowercase Letter 105
 
0.7%
Open Punctuation 56
 
0.4%
Close Punctuation 56
 
0.4%
Dash Punctuation 38
 
0.3%
Other Punctuation 23
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
613
 
4.6%
509
 
3.8%
450
 
3.4%
339
 
2.6%
329
 
2.5%
315
 
2.4%
288
 
2.2%
258
 
1.9%
231
 
1.7%
216
 
1.6%
Other values (361) 9722
73.3%
Uppercase Letter
ValueCountFrequency (%)
C 153
26.4%
D 138
23.8%
L 131
22.6%
T 52
 
9.0%
P 21
 
3.6%
V 17
 
2.9%
F 14
 
2.4%
R 14
 
2.4%
S 12
 
2.1%
I 5
 
0.9%
Other values (9) 23
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
a 23
21.9%
c 15
14.3%
t 14
13.3%
p 12
11.4%
d 11
10.5%
e 6
 
5.7%
i 5
 
4.8%
v 3
 
2.9%
n 2
 
1.9%
l 2
 
1.9%
Other values (9) 12
11.4%
Decimal Number
ValueCountFrequency (%)
0 41
26.1%
6 39
24.8%
2 36
22.9%
3 16
 
10.2%
1 13
 
8.3%
5 8
 
5.1%
4 4
 
2.5%
Other Punctuation
ValueCountFrequency (%)
, 22
95.7%
/ 1
 
4.3%
Space Separator
ValueCountFrequency (%)
250
100.0%
Open Punctuation
ValueCountFrequency (%)
( 56
100.0%
Close Punctuation
ValueCountFrequency (%)
) 56
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13267
91.3%
Latin 685
 
4.7%
Common 580
 
4.0%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
613
 
4.6%
509
 
3.8%
450
 
3.4%
339
 
2.6%
329
 
2.5%
315
 
2.4%
288
 
2.2%
258
 
1.9%
231
 
1.7%
216
 
1.6%
Other values (358) 9719
73.3%
Latin
ValueCountFrequency (%)
C 153
22.3%
D 138
20.1%
L 131
19.1%
T 52
 
7.6%
a 23
 
3.4%
P 21
 
3.1%
V 17
 
2.5%
c 15
 
2.2%
F 14
 
2.0%
R 14
 
2.0%
Other values (28) 107
15.6%
Common
ValueCountFrequency (%)
250
43.1%
( 56
 
9.7%
) 56
 
9.7%
0 41
 
7.1%
6 39
 
6.7%
- 38
 
6.6%
2 36
 
6.2%
, 22
 
3.8%
3 16
 
2.8%
1 13
 
2.2%
Other values (3) 13
 
2.2%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13267
91.3%
ASCII 1265
 
8.7%
CJK 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
613
 
4.6%
509
 
3.8%
450
 
3.4%
339
 
2.6%
329
 
2.5%
315
 
2.4%
288
 
2.2%
258
 
1.9%
231
 
1.7%
216
 
1.6%
Other values (358) 9719
73.3%
ASCII
ValueCountFrequency (%)
250
19.8%
C 153
12.1%
D 138
10.9%
L 131
10.4%
( 56
 
4.4%
) 56
 
4.4%
T 52
 
4.1%
0 41
 
3.2%
6 39
 
3.1%
- 38
 
3.0%
Other values (41) 311
24.6%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

품목
Text

Distinct865
Distinct (%)36.3%
Missing0
Missing (%)0.0%
Memory size18.7 KiB
2024-03-15T09:00:51.598845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length58
Mean length39.167577
Min length5

Characters and Unicode

Total characters93258
Distinct characters552
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique500 ?
Unique (%)21.0%

Sample

1st row쓰레기수거용트럭, 현대자동차, 메가트럭 5톤, 신압착진개차 DLX(2007년형)
2nd row래미네이터, 레세전자, CN/LS-2421, 4롤러, 210mm
3rd row롤러컨베이어, 일진콘베이어, 기폭600mm, 기장1~6m
4th row책상, 동양하이테크, DH-24, 1800×1200×720mm
5th row책상, 동양하이테크, DH-20, 1200×800×720mm
ValueCountFrequency (%)
시흥도시공사주문제작 319
 
2.8%
액정모니터 228
 
2.0%
작업용의자 210
 
1.9%
intel 199
 
1.8%
core 193
 
1.7%
레드스톤시스템 178
 
1.6%
데스크톱컴퓨터 175
 
1.6%
삼성전자 142
 
1.3%
i5 118
 
1.0%
엘지전자 101
 
0.9%
Other values (2415) 9396
83.5%
2024-03-15T09:00:53.311826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8891
 
9.5%
, 7576
 
8.1%
0 5954
 
6.4%
1 2609
 
2.8%
m 2229
 
2.4%
2 2117
 
2.3%
5 1910
 
2.0%
× 1605
 
1.7%
- 1380
 
1.5%
1242
 
1.3%
Other values (542) 57745
61.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32895
35.3%
Decimal Number 18753
20.1%
Uppercase Letter 11895
 
12.8%
Other Punctuation 8981
 
9.6%
Space Separator 8891
 
9.5%
Lowercase Letter 7950
 
8.5%
Math Symbol 1684
 
1.8%
Dash Punctuation 1380
 
1.5%
Open Punctuation 400
 
0.4%
Close Punctuation 400
 
0.4%
Other values (3) 29
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1242
 
3.8%
1234
 
3.8%
861
 
2.6%
841
 
2.6%
797
 
2.4%
740
 
2.2%
680
 
2.1%
659
 
2.0%
553
 
1.7%
492
 
1.5%
Other values (461) 24796
75.4%
Uppercase Letter
ValueCountFrequency (%)
S 1012
 
8.5%
C 1000
 
8.4%
H 811
 
6.8%
D 658
 
5.5%
M 655
 
5.5%
A 624
 
5.2%
G 607
 
5.1%
N 601
 
5.1%
W 572
 
4.8%
I 564
 
4.7%
Other values (17) 4791
40.3%
Lowercase Letter
ValueCountFrequency (%)
m 2229
28.0%
e 819
 
10.3%
i 523
 
6.6%
r 507
 
6.4%
o 496
 
6.2%
l 446
 
5.6%
n 430
 
5.4%
c 392
 
4.9%
a 363
 
4.6%
t 361
 
4.5%
Other values (16) 1384
17.4%
Decimal Number
ValueCountFrequency (%)
0 5954
31.7%
1 2609
13.9%
2 2117
 
11.3%
5 1910
 
10.2%
3 1233
 
6.6%
6 1221
 
6.5%
4 1094
 
5.8%
7 1077
 
5.7%
8 1024
 
5.5%
9 514
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 7576
84.4%
. 738
 
8.2%
/ 660
 
7.3%
* 5
 
0.1%
: 1
 
< 0.1%
· 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
× 1605
95.3%
~ 70
 
4.2%
+ 9
 
0.5%
Other Symbol
ValueCountFrequency (%)
18
69.2%
7
 
26.9%
1
 
3.8%
Space Separator
ValueCountFrequency (%)
8891
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1380
100.0%
Open Punctuation
ValueCountFrequency (%)
( 400
100.0%
Close Punctuation
ValueCountFrequency (%)
) 400
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 40516
43.4%
Hangul 32895
35.3%
Latin 19830
21.3%
Greek 17
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1242
 
3.8%
1234
 
3.8%
861
 
2.6%
841
 
2.6%
797
 
2.4%
740
 
2.2%
680
 
2.1%
659
 
2.0%
553
 
1.7%
492
 
1.5%
Other values (461) 24796
75.4%
Latin
ValueCountFrequency (%)
m 2229
 
11.2%
S 1012
 
5.1%
C 1000
 
5.0%
e 819
 
4.1%
H 811
 
4.1%
D 658
 
3.3%
M 655
 
3.3%
A 624
 
3.1%
G 607
 
3.1%
N 601
 
3.0%
Other values (43) 10814
54.5%
Common
ValueCountFrequency (%)
8891
21.9%
, 7576
18.7%
0 5954
14.7%
1 2609
 
6.4%
2 2117
 
5.2%
5 1910
 
4.7%
× 1605
 
4.0%
- 1380
 
3.4%
3 1233
 
3.0%
6 1221
 
3.0%
Other values (17) 6020
14.9%
Greek
ValueCountFrequency (%)
Φ 17
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 58712
63.0%
Hangul 32895
35.3%
None 1623
 
1.7%
CJK Compat 25
 
< 0.1%
Number Forms 2
 
< 0.1%
Letterlike Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8891
 
15.1%
, 7576
 
12.9%
0 5954
 
10.1%
1 2609
 
4.4%
m 2229
 
3.8%
2 2117
 
3.6%
5 1910
 
3.3%
- 1380
 
2.4%
3 1233
 
2.1%
6 1221
 
2.1%
Other values (64) 23592
40.2%
None
ValueCountFrequency (%)
× 1605
98.9%
Φ 17
 
1.0%
· 1
 
0.1%
Hangul
ValueCountFrequency (%)
1242
 
3.8%
1234
 
3.8%
861
 
2.6%
841
 
2.6%
797
 
2.4%
740
 
2.2%
680
 
2.1%
659
 
2.0%
553
 
1.7%
492
 
1.5%
Other values (461) 24796
75.4%
CJK Compat
ValueCountFrequency (%)
18
72.0%
7
 
28.0%
Number Forms
ValueCountFrequency (%)
2
100.0%
Letterlike Symbols
ValueCountFrequency (%)
1
100.0%

취득원가
Real number (ℝ)

Distinct893
Distinct (%)37.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2005995
Minimum1000
Maximum2.32553 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.1 KiB
2024-03-15T09:00:53.732670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1000
5-th percentile84700
Q1183050
median363000
Q3943000
95-th percentile4895000
Maximum2.32553 × 108
Range2.32552 × 108
Interquartile range (IQR)759950

Descriptive statistics

Standard deviation11222024
Coefficient of variation (CV)5.5942431
Kurtosis225.47645
Mean2005995
Median Absolute Deviation (MAD)236000
Skewness13.860382
Sum4.7762741 × 109
Variance1.2593382 × 1014
MonotonicityNot monotonic
2024-03-15T09:00:54.177083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
209000 43
 
1.8%
110000 34
 
1.4%
154000 28
 
1.2%
280000 27
 
1.1%
121000 27
 
1.1%
230000 26
 
1.1%
210000 23
 
1.0%
132000 23
 
1.0%
330000 20
 
0.8%
165000 19
 
0.8%
Other values (883) 2111
88.7%
ValueCountFrequency (%)
1000 1
 
< 0.1%
21500 1
 
< 0.1%
32000 2
 
0.1%
32490 1
 
< 0.1%
34100 1
 
< 0.1%
38500 8
0.3%
44000 6
0.3%
48000 4
0.2%
48500 1
 
< 0.1%
49400 2
 
0.1%
ValueCountFrequency (%)
232553000 1
< 0.1%
230597060 1
< 0.1%
180370320 1
< 0.1%
144051885 2
0.1%
132384533 2
0.1%
124042930 1
< 0.1%
118966200 1
< 0.1%
106000000 1
< 0.1%
79360000 1
< 0.1%
67977090 1
< 0.1%

처분현황
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size18.7 KiB
취득 조달구입
1820 
취득/자체구입
515 
취득/무상관리전환
 
40
처분/폐기
 
5
이동/사용전환
 
1

Length

Max length9
Median length7
Mean length7.0293994
Min length5

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row취득 조달구입
2nd row취득 조달구입
3rd row취득 조달구입
4th row취득 조달구입
5th row취득 조달구입

Common Values

ValueCountFrequency (%)
취득 조달구입 1820
76.4%
취득/자체구입 515
 
21.6%
취득/무상관리전환 40
 
1.7%
처분/폐기 5
 
0.2%
이동/사용전환 1
 
< 0.1%

Length

2024-03-15T09:00:54.622557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T09:00:54.947640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
취득 1820
43.3%
조달구입 1820
43.3%
취득/자체구입 515
 
12.3%
취득/무상관리전환 40
 
1.0%
처분/폐기 5
 
0.1%
이동/사용전환 1
 
< 0.1%

Interactions

2024-03-15T09:00:46.223560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T09:00:55.075723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
취득원가처분현황
취득원가1.0000.000
처분현황0.0001.000
2024-03-15T09:00:55.317487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
취득원가처분현황
취득원가1.0000.000
처분현황0.0001.000

Missing values

2024-03-15T09:00:46.556421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T09:00:46.887476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

취득일자자산명품목취득원가처분현황
02005-10-01쓰레기수거용트럭쓰레기수거용트럭, 현대자동차, 메가트럭 5톤, 신압착진개차 DLX(2007년형)67977090취득 조달구입
12006-01-01라미네이터래미네이터, 레세전자, CN/LS-2421, 4롤러, 210mm495000취득 조달구입
22007-02-01자바라컨베이어롤러컨베이어, 일진콘베이어, 기폭600mm, 기장1~6m1430000취득 조달구입
32007-04-01책상책상, 동양하이테크, DH-24, 1800×1200×720mm627000취득 조달구입
42007-12-01책상책상, 동양하이테크, DH-20, 1200×800×720mm407000취득 조달구입
52008-01-01책상책상, 동양하이테크, DH-20, 1200×800×720mm1155000취득 조달구입
62008-02-01LCD패널또는모니터액정모니터, 삼성전자, CN/S22B350T, 55.88cm780000취득 조달구입
72008-03-01PET압축기가이드롤러테이퍼롤러, 한국NSK, A2047, Φ12×10.785mm396000취득 조달구입
82008-11-01쓰레기수거용트럭쓰레기수거용트럭, 현대자동차, 뉴파워트럭, 11톤CNG암롤트럭 PRO(2008년형)-박스미포함79360000취득 조달구입
92008-12-01보안용카메라영상감시장치, 씨큐프라임, SSC-100, 방범감시시스템7000000취득 조달구입
취득일자자산명품목취득원가처분현황
23712023-12-05회의용탁자회의용탁자, 디자인가구, 시흥도시공사주문제작, 2400×1100×720mm, 사각형407000취득/자체구입
23722023-12-05보조책상보조책상, 동양사무용가구, 시흥도시공사주문제작, 600×1200×720mm187000취득/자체구입
23732023-12-05보조책상보조책상, 동양사무용가구, 시흥도시공사주문제작, 600×1200×720mm187000취득/자체구입
23742023-12-05보조책상보조책상, 동양사무용가구, 시흥도시공사주문제작, 600×1200×720mm187000취득/자체구입
23752023-12-05보조책상보조책상, 동양사무용가구, 시흥도시공사주문제작, 600×1200×720mm187000취득/자체구입
23762023-12-05책상책상, 동양사무용가구, 시흥도시공사주문제작, 1400×1200×720mm, 1인용198000취득/자체구입
23772023-12-05책상책상, 동양사무용가구, 시흥도시공사주문제작, 1400×1200×720mm, 1인용198000취득/자체구입
23782023-12-05책상책상, 동양사무용가구, 시흥도시공사주문제작, 1400×1200×720mm, 1인용198000취득/자체구입
23792023-12-05책상책상, 동양사무용가구, 시흥도시공사주문제작, 1400×1200×720mm, 1인용198000취득/자체구입
23802023-12-06난로(등유)난로, 가야, KJH-173, 19.8kW, 등유955130취득/자체구입

Duplicate rows

Most frequently occurring

취득일자자산명품목취득원가처분현황# duplicates
1702020-02-25태블릿컴퓨터태블릿컴퓨터, 삼성전자, SHW-M380S, DualCore(1GHz), 갤럭시탭10.1/Wi-Fi/32GB209000취득/무상관리전환32
2152020-11-24차량용운행기록계차량용운행기록계, 지넷시스템, X2B230000취득/자체구입26
2142020-11-24자동차안정성컨트롤시스템자동차안정성컨트롤시스템, 동하비전, DH-01S, 후방카메라110000취득/자체구입24
1782020-03-2760T블럭2단사무실칸막이, 신영아트팬스, 시흥도시공사주문제작, 1100×10×1200mm134000취득/자체구입18
1672020-02-14무전기무선송수신기, 유니모테크놀로지, UDR-400I, 디지털무전기262108취득 조달구입17
482016-05-13카드단말기 및 통신모뎀스마트카드단말기, 리버타스, LHT-101, RF카드단말기385000취득 조달구입16
3012022-06-16무정전 전원 축전지밀폐고정형납축전지, 에이블아이앤씨, 시흥도시공사주문제작, 12V, 100Ah154000취득 조달구입16
752017-03-28야영용텐트야영용텐트, 비젼코베아, CN/에덴2, VKTT-601-78ED, 7~8인용279000취득 조달구입15
2552021-04-20보조책상보조책상, 예스퍼니처, 시흥도시공사주문제작, 600×1200×720mm144580취득 조달구입15
1792020-03-2760T블럭2단사무실칸막이, 신영아트팬스, 시흥도시공사주문제작, 1100×10×700mm132000취득/자체구입12