Overview

Dataset statistics

Number of variables11
Number of observations4980
Missing cells9960
Missing cells (%)18.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory471.9 KiB
Average record size in memory97.0 B

Variable types

Categorical5
Text1
Numeric3
Unsupported2

Dataset

Description파일 다운로드
Author강서구
URLhttps://data.seoul.go.kr/dataList/OA-21828/F/1/datasetView.do

Alerts

승인건수 has constant value ""Constant
완료건수 has constant value ""Constant
발생건수 is highly overall correlated with 취소건수 and 1 other fieldsHigh correlation
취소건수 is highly overall correlated with 발생건수 and 1 other fieldsHigh correlation
푸시건수 is highly overall correlated with 발생건수 and 1 other fieldsHigh correlation
영치건수 has 4980 (100.0%) missing valuesMissing
영치금액 has 4980 (100.0%) missing valuesMissing
영치건수 is an unsupported type, check if it needs cleaning or further analysisUnsupported
영치금액 is an unsupported type, check if it needs cleaning or further analysisUnsupported
발생건수 has 3120 (62.7%) zerosZeros
취소건수 has 3130 (62.9%) zerosZeros
푸시건수 has 3130 (62.9%) zerosZeros

Reproduction

Analysis started2024-04-20 18:55:41.789866
Analysis finished2024-04-20 18:55:45.238480
Duration3.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

통계월
Categorical

Distinct10
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
2020-06
506 
2020-07
506 
2020-08
506 
2020-09
506 
2020-10
506 
Other values (5)
2450 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-01
2nd row2020-01
3rd row2020-01
4th row2020-01
5th row2020-01

Common Values

ValueCountFrequency (%)
2020-06 506
10.2%
2020-07 506
10.2%
2020-08 506
10.2%
2020-09 506
10.2%
2020-10 506
10.2%
2020-03 496
10.0%
2020-04 496
10.0%
2020-05 496
10.0%
2020-02 486
9.8%
2020-01 476
9.6%

Length

2024-04-21T03:55:45.450184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:55:45.797954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-06 506
10.2%
2020-07 506
10.2%
2020-08 506
10.2%
2020-09 506
10.2%
2020-10 506
10.2%
2020-03 496
10.0%
2020-04 496
10.0%
2020-05 496
10.0%
2020-02 486
9.8%
2020-01 476
9.6%
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
3
1630 
5
1550 
30
1500 
20
300 

Length

Max length2
Median length1
Mean length1.3614458
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row3
3rd row3
4th row3
5th row3

Common Values

ValueCountFrequency (%)
3 1630
32.7%
5 1550
31.1%
30 1500
30.1%
20 300
 
6.0%

Length

2024-04-21T03:55:46.233195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:55:46.561606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 1630
32.7%
5 1550
31.1%
30 1500
30.1%
20 300
 
6.0%

촉탁구분명
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
0
2860 
1
2120 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 2860
57.4%
1 2120
42.6%

Length

2024-04-21T03:55:46.898423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:55:47.193586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 2860
57.4%
1 2120
42.6%
Distinct132
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
2024-04-21T03:55:47.887348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length21
Mean length20.911044
Min length20

Characters and Unicode

Total characters104137
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row11500CTVGTV100000484
2nd row11500CTVGTV100000485
3rd row11500CTVGTV100000486
4th row11500CTVGTV1000005531
5th row11500CTVGTV1000005561
ValueCountFrequency (%)
11500ctvgtv1000023960 70
 
1.4%
11500ctvgtv1000026775 70
 
1.4%
11500ctvgtv1000016266 70
 
1.4%
11500ctvgtv1000022249 70
 
1.4%
11500ctvgtv1000014557 70
 
1.4%
11500ctvgtv100000485 70
 
1.4%
11500ctvgtv1000012848 70
 
1.4%
11500ctvgtv1000011136 70
 
1.4%
11500ctvgtv1000020540 70
 
1.4%
11500ctvgtv1000017975 70
 
1.4%
Other values (122) 4280
85.9%
2024-04-21T03:55:49.040728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 32013
30.7%
1 19582
18.8%
T 9960
 
9.6%
V 9960
 
9.6%
5 6685
 
6.4%
C 4980
 
4.8%
G 4980
 
4.8%
2 3929
 
3.8%
6 2507
 
2.4%
4 2367
 
2.3%
Other values (4) 7174
 
6.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 74257
71.3%
Uppercase Letter 29880
28.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 32013
43.1%
1 19582
26.4%
5 6685
 
9.0%
2 3929
 
5.3%
6 2507
 
3.4%
4 2367
 
3.2%
7 2356
 
3.2%
8 1664
 
2.2%
3 1661
 
2.2%
9 1493
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
T 9960
33.3%
V 9960
33.3%
C 4980
16.7%
G 4980
16.7%

Most occurring scripts

ValueCountFrequency (%)
Common 74257
71.3%
Latin 29880
28.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 32013
43.1%
1 19582
26.4%
5 6685
 
9.0%
2 3929
 
5.3%
6 2507
 
3.4%
4 2367
 
3.2%
7 2356
 
3.2%
8 1664
 
2.2%
3 1661
 
2.2%
9 1493
 
2.0%
Latin
ValueCountFrequency (%)
T 9960
33.3%
V 9960
33.3%
C 4980
16.7%
G 4980
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 104137
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 32013
30.7%
1 19582
18.8%
T 9960
 
9.6%
V 9960
 
9.6%
5 6685
 
6.4%
C 4980
 
4.8%
G 4980
 
4.8%
2 3929
 
3.8%
6 2507
 
2.4%
4 2367
 
2.3%
Other values (4) 7174
 
6.9%

발생건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct65
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.164659
Minimum0
Maximum286
Zeros3120
Zeros (%)62.7%
Negative0
Negative (%)0.0%
Memory size43.9 KiB
2024-04-21T03:55:49.445614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33
95-th percentile66
Maximum286
Range286
Interquartile range (IQR)3

Descriptive statistics

Standard deviation30.496873
Coefficient of variation (CV)3.000285
Kurtosis28.255065
Mean10.164659
Median Absolute Deviation (MAD)0
Skewness4.8205258
Sum50620
Variance930.05924
MonotonicityNot monotonic
2024-04-21T03:55:49.872997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3120
62.7%
1 360
 
7.2%
2 150
 
3.0%
3 120
 
2.4%
5 100
 
2.0%
10 80
 
1.6%
4 70
 
1.4%
14 50
 
1.0%
6 50
 
1.0%
9 50
 
1.0%
Other values (55) 830
 
16.7%
ValueCountFrequency (%)
0 3120
62.7%
1 360
 
7.2%
2 150
 
3.0%
3 120
 
2.4%
4 70
 
1.4%
5 100
 
2.0%
6 50
 
1.0%
7 30
 
0.6%
8 10
 
0.2%
9 50
 
1.0%
ValueCountFrequency (%)
286 10
0.2%
241 10
0.2%
196 10
0.2%
183 10
0.2%
176 10
0.2%
148 10
0.2%
146 10
0.2%
125 10
0.2%
119 10
0.2%
117 10
0.2%

취소건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct52
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.1646586
Minimum0
Maximum134
Zeros3130
Zeros (%)62.9%
Negative0
Negative (%)0.0%
Memory size43.9 KiB
2024-04-21T03:55:50.286620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33
95-th percentile35
Maximum134
Range134
Interquartile range (IQR)3

Descriptive statistics

Standard deviation15.923377
Coefficient of variation (CV)2.5830103
Kurtosis19.916195
Mean6.1646586
Median Absolute Deviation (MAD)0
Skewness4.0588697
Sum30700
Variance253.55392
MonotonicityNot monotonic
2024-04-21T03:55:50.749449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3130
62.9%
1 370
 
7.4%
2 140
 
2.8%
3 130
 
2.6%
4 100
 
2.0%
9 80
 
1.6%
10 80
 
1.6%
5 80
 
1.6%
18 60
 
1.2%
6 50
 
1.0%
Other values (42) 760
 
15.3%
ValueCountFrequency (%)
0 3130
62.9%
1 370
 
7.4%
2 140
 
2.8%
3 130
 
2.6%
4 100
 
2.0%
5 80
 
1.6%
6 50
 
1.0%
7 50
 
1.0%
8 20
 
0.4%
9 80
 
1.6%
ValueCountFrequency (%)
134 10
0.2%
115 10
0.2%
100 10
0.2%
89 10
0.2%
88 10
0.2%
76 20
0.4%
69 10
0.2%
66 10
0.2%
65 10
0.2%
63 10
0.2%

승인건수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
0
4980 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 4980
100.0%

Length

2024-04-21T03:55:51.166889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:55:51.447788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 4980
100.0%

완료건수
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size39.0 KiB
0
4980 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 4980
100.0%

Length

2024-04-21T03:55:51.746228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T03:55:52.027024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 4980
100.0%

영치건수
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4980
Missing (%)100.0%
Memory size43.9 KiB

영치금액
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4980
Missing (%)100.0%
Memory size43.9 KiB

푸시건수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct100
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52.375502
Minimum0
Maximum1258
Zeros3130
Zeros (%)62.9%
Negative0
Negative (%)0.0%
Memory size43.9 KiB
2024-04-21T03:55:52.347678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q324
95-th percentile288
Maximum1258
Range1258
Interquartile range (IQR)24

Descriptive statistics

Standard deviation143.10629
Coefficient of variation (CV)2.7323135
Kurtosis25.149656
Mean52.375502
Median Absolute Deviation (MAD)0
Skewness4.5130684
Sum260830
Variance20479.411
MonotonicityNot monotonic
2024-04-21T03:55:52.800333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 3130
62.9%
6 320
 
6.4%
12 100
 
2.0%
2 60
 
1.2%
60 50
 
1.0%
50 50
 
1.0%
48 40
 
0.8%
38 40
 
0.8%
14 40
 
0.8%
28 40
 
0.8%
Other values (90) 1110
 
22.3%
ValueCountFrequency (%)
0 3130
62.9%
2 60
 
1.2%
4 10
 
0.2%
6 320
 
6.4%
8 10
 
0.2%
12 100
 
2.0%
14 40
 
0.8%
16 30
 
0.6%
18 30
 
0.6%
24 20
 
0.4%
ValueCountFrequency (%)
1258 10
0.2%
1150 10
0.2%
922 10
0.2%
856 10
0.2%
812 10
0.2%
742 10
0.2%
724 10
0.2%
688 10
0.2%
550 10
0.2%
516 10
0.2%

Interactions

2024-04-21T03:55:43.766465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:55:42.350788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:55:43.148881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:55:44.021037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:55:42.612534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:55:43.367330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:55:44.299134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:55:42.885795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T03:55:43.578467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T03:55:53.072416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
통계월관심차량구분명촉탁구분명발생건수취소건수푸시건수
통계월1.0000.0000.0000.2860.4270.291
관심차량구분명0.0001.0000.6480.4060.5060.461
촉탁구분명0.0000.6481.0000.3560.3730.291
발생건수0.2860.4060.3561.0000.9310.966
취소건수0.4270.5060.3730.9311.0000.935
푸시건수0.2910.4610.2910.9660.9351.000
2024-04-21T03:55:53.345337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
촉탁구분명통계월관심차량구분명
촉탁구분명1.0000.0000.453
통계월0.0001.0000.000
관심차량구분명0.4530.0001.000
2024-04-21T03:55:53.600571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발생건수취소건수푸시건수통계월관심차량구분명촉탁구분명
발생건수1.0000.9970.9960.1340.2710.355
취소건수0.9971.0000.9980.1430.3260.286
푸시건수0.9960.9981.0000.1360.3130.291
통계월0.1340.1430.1361.0000.0000.000
관심차량구분명0.2710.3260.3130.0001.0000.453
촉탁구분명0.3550.2860.2910.0000.4531.000

Missing values

2024-04-21T03:55:44.673658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T03:55:45.050451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

통계월관심차량구분명촉탁구분명시설물분류명발생건수취소건수승인건수완료건수영치건수영치금액푸시건수
02020-013011500CTVGTV100000484232300<NA><NA>146
12020-013011500CTVGTV1000004857700<NA><NA>54
22020-013011500CTVGTV100000486141400<NA><NA>148
32020-013011500CTVGTV10000055310000<NA><NA>0
42020-013011500CTVGTV10000055610000<NA><NA>0
52020-013011500CTVGTV1000007064400<NA><NA>8
62020-013011500CTVGTV1000008741111100<NA><NA>82
72020-013011500CTVGTV1000010323111100<NA><NA>82
82020-013011500CTVGTV10000108344400<NA><NA>8
92020-013011500CTVGTV100001091151500<NA><NA>92
통계월관심차량구분명촉탁구분명시설물분류명발생건수취소건수승인건수완료건수영치건수영치금액푸시건수
49702020-1030111500CTVGTV10000175980000<NA><NA>0
49712020-1030111500CTVGTV10000179750000<NA><NA>0
49722020-1030111500CTVGTV10000201630000<NA><NA>0
49732020-1030111500CTVGTV10000205400000<NA><NA>0
49742020-1030111500CTVGTV10000218720000<NA><NA>0
49752020-1030111500CTVGTV10000222490000<NA><NA>0
49762020-1030111500CTVGTV10000235840000<NA><NA>0
49772020-1030111500CTVGTV10000239600000<NA><NA>0
49782020-1030111500CTVGTV10000263990000<NA><NA>0
49792020-1030111500CTVGTV10000267750000<NA><NA>0