Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory722.7 KiB
Average record size in memory74.0 B

Variable types

Numeric2
DateTime2
Categorical2
Text2

Dataset

Description대구시설공단 가로등관리시스템 고장등관리 데이터입니다.
Author대구시설공단
URLhttps://www.data.go.kr/data/15088101/fileData.do

Alerts

관리번호 is highly overall correlated with 구청High correlation
구청 is highly overall correlated with 관리번호High correlation
접수구분 is highly imbalanced (82.0%)Imbalance
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:52:07.271775
Analysis finished2023-12-12 09:52:08.714694
Duration1.44 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8872.1552
Minimum1
Maximum17761
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:52:08.794168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile860.95
Q14366.5
median8867.5
Q313344.75
95-th percentile16952.05
Maximum17761
Range17760
Interquartile range (IQR)8978.25

Descriptive statistics

Standard deviation5167.4652
Coefficient of variation (CV)0.5824363
Kurtosis-1.2065126
Mean8872.1552
Median Absolute Deviation (MAD)4490.5
Skewness0.0034280156
Sum88721552
Variance26702697
MonotonicityNot monotonic
2023-12-12T18:52:08.944095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15458 1
 
< 0.1%
1712 1
 
< 0.1%
11232 1
 
< 0.1%
8538 1
 
< 0.1%
6664 1
 
< 0.1%
396 1
 
< 0.1%
6236 1
 
< 0.1%
1569 1
 
< 0.1%
17726 1
 
< 0.1%
1452 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
7 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
16 1
< 0.1%
19 1
< 0.1%
20 1
< 0.1%
ValueCountFrequency (%)
17761 1
< 0.1%
17758 1
< 0.1%
17755 1
< 0.1%
17754 1
< 0.1%
17753 1
< 0.1%
17751 1
< 0.1%
17750 1
< 0.1%
17749 1
< 0.1%
17748 1
< 0.1%
17747 1
< 0.1%
Distinct273
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-01-02 00:00:00
Maximum2020-12-31 00:00:00
2023-12-12T18:52:09.224251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:52:09.490273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

접수구분
Categorical

IMBALANCE 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일상점검
9298 
민원신고
 
462
직원신고
 
144
시정견문
 
66
안전공사
 
17

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일상점검
2nd row일상점검
3rd row일상점검
4th row일상점검
5th row일상점검

Common Values

ValueCountFrequency (%)
일상점검 9298
93.0%
민원신고 462
 
4.6%
직원신고 144
 
1.4%
시정견문 66
 
0.7%
안전공사 17
 
0.2%
견문정보 13
 
0.1%

Length

2023-12-12T18:52:09.729211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:52:09.876519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일상점검 9298
93.0%
민원신고 462
 
4.6%
직원신고 144
 
1.4%
시정견문 66
 
0.7%
안전공사 17
 
0.2%
견문정보 13
 
0.1%

관리번호
Real number (ℝ)

HIGH CORRELATION 

Distinct8138
Distinct (%)81.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5465388.6
Minimum1001000
Maximum9999999
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T18:52:10.032261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1001000
5-th percentile1131001.9
Q13143021.8
median6036503.5
Q37256010.2
95-th percentile8423009.2
Maximum9999999
Range8998999
Interquartile range (IQR)4112988.5

Descriptive statistics

Standard deviation2272730.7
Coefficient of variation (CV)0.41584063
Kurtosis-0.99105649
Mean5465388.6
Median Absolute Deviation (MAD)1411502.5
Skewness-0.40411626
Sum5.4653886 × 1010
Variance5.1653046 × 1012
MonotonicityNot monotonic
2023-12-12T18:52:10.198210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9999999 36
 
0.4%
3045000 11
 
0.1%
7420000 9
 
0.1%
5355000 9
 
0.1%
8601000 8
 
0.1%
9999000 8
 
0.1%
2114000 8
 
0.1%
2024000 7
 
0.1%
5264000 7
 
0.1%
5053000 7
 
0.1%
Other values (8128) 9890
98.9%
ValueCountFrequency (%)
1001000 2
< 0.1%
1001009 1
< 0.1%
1002005 1
< 0.1%
1002007 1
< 0.1%
1002012 1
< 0.1%
1003000 1
< 0.1%
1003003 2
< 0.1%
1003004 1
< 0.1%
1004009 1
< 0.1%
1004013 1
< 0.1%
ValueCountFrequency (%)
9999999 36
0.4%
9999000 8
 
0.1%
8647000 1
 
< 0.1%
8646002 2
 
< 0.1%
8637010 1
 
< 0.1%
8637009 1
 
< 0.1%
8637008 2
 
< 0.1%
8636000 1
 
< 0.1%
8604000 1
 
< 0.1%
8603000 1
 
< 0.1%

구청
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
달성군
1807 
북구
1800 
달서구
1716 
수성구
1609 
동구
1483 
Other values (4)
1585 

Length

Max length3
Median length3
Mean length2.5132
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동구
2nd row북구
3rd row북구
4th row북구
5th row달성군

Common Values

ValueCountFrequency (%)
달성군 1807
18.1%
북구 1800
18.0%
달서구 1716
17.2%
수성구 1609
16.1%
동구 1483
14.8%
서구 568
 
5.7%
중구 529
 
5.3%
남구 444
 
4.4%
기타 44
 
0.4%

Length

2023-12-12T18:52:10.340505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:52:10.462220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
달성군 1807
18.1%
북구 1800
18.0%
달서구 1716
17.2%
수성구 1609
16.1%
동구 1483
14.8%
서구 568
 
5.7%
중구 529
 
5.3%
남구 444
 
4.4%
기타 44
 
0.4%
Distinct365
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:52:10.795637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length4.1682
Min length2

Characters and Unicode

Total characters41682
Distinct characters203
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)0.4%

Sample

1st row동대구로
2nd row복현로
3rd row신암로
4th row구암로
5th row테크노폴리스
ValueCountFrequency (%)
테크노폴리스 721
 
7.2%
달구벌대로 474
 
4.7%
기타 430
 
4.3%
신천대로 189
 
1.9%
혁신도시 181
 
1.8%
신천동로 159
 
1.6%
국채보상로 158
 
1.6%
앞산순환로 156
 
1.6%
이시아폴리스 154
 
1.5%
칠곡3지구 117
 
1.2%
Other values (355) 7261
72.6%
2023-12-12T18:52:11.259448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6688
 
16.0%
1529
 
3.7%
1454
 
3.5%
1319
 
3.2%
966
 
2.3%
936
 
2.2%
934
 
2.2%
900
 
2.2%
900
 
2.2%
878
 
2.1%
Other values (193) 25178
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40426
97.0%
Decimal Number 1216
 
2.9%
Open Punctuation 20
 
< 0.1%
Close Punctuation 20
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6688
 
16.5%
1529
 
3.8%
1454
 
3.6%
1319
 
3.3%
966
 
2.4%
936
 
2.3%
934
 
2.3%
900
 
2.2%
900
 
2.2%
878
 
2.2%
Other values (181) 23922
59.2%
Decimal Number
ValueCountFrequency (%)
2 329
27.1%
5 304
25.0%
3 180
14.8%
1 134
11.0%
4 102
 
8.4%
6 57
 
4.7%
9 35
 
2.9%
0 34
 
2.8%
8 24
 
2.0%
7 17
 
1.4%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40426
97.0%
Common 1256
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6688
 
16.5%
1529
 
3.8%
1454
 
3.6%
1319
 
3.3%
966
 
2.4%
936
 
2.3%
934
 
2.3%
900
 
2.2%
900
 
2.2%
878
 
2.2%
Other values (181) 23922
59.2%
Common
ValueCountFrequency (%)
2 329
26.2%
5 304
24.2%
3 180
14.3%
1 134
10.7%
4 102
 
8.1%
6 57
 
4.5%
9 35
 
2.8%
0 34
 
2.7%
8 24
 
1.9%
( 20
 
1.6%
Other values (2) 37
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40426
97.0%
ASCII 1256
 
3.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6688
 
16.5%
1529
 
3.8%
1454
 
3.6%
1319
 
3.3%
966
 
2.4%
936
 
2.3%
934
 
2.3%
900
 
2.2%
900
 
2.2%
878
 
2.2%
Other values (181) 23922
59.2%
ASCII
ValueCountFrequency (%)
2 329
26.2%
5 304
24.2%
3 180
14.3%
1 134
10.7%
4 102
 
8.1%
6 57
 
4.5%
9 35
 
2.8%
0 34
 
2.7%
8 24
 
1.9%
( 20
 
1.6%
Other values (2) 37
 
2.9%

구간
Text

Distinct2334
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T18:52:11.529592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length12.9843
Min length2

Characters and Unicode

Total characters129843
Distinct characters486
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique568 ?
Unique (%)5.7%

Sample

1st row귀빈예식장 - 동대구역네거리
2nd row성화여고 - 서한이다움
3rd row동인네거리 - 칠성교
4th row동천교 - 칠곡네거리
5th row테크노폴리스 LP-2-22
ValueCountFrequency (%)
7512
27.3%
테크노폴리스 719
 
2.6%
혁신도시 196
 
0.7%
진입로 165
 
0.6%
이시아폴리스 154
 
0.6%
2공구 110
 
0.4%
대구국가산업단지 106
 
0.4%
봉산육거리 91
 
0.3%
대구스타디움 76
 
0.3%
계산오거리 67
 
0.2%
Other values (2154) 18332
66.6%
2023-12-12T18:52:11.917604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17535
 
13.5%
- 9960
 
7.7%
6260
 
4.8%
4656
 
3.6%
3421
 
2.6%
3409
 
2.6%
2254
 
1.7%
2106
 
1.6%
1928
 
1.5%
1877
 
1.4%
Other values (476) 76437
58.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 92539
71.3%
Space Separator 17535
 
13.5%
Dash Punctuation 9960
 
7.7%
Uppercase Letter 4653
 
3.6%
Decimal Number 4396
 
3.4%
Close Punctuation 328
 
0.3%
Open Punctuation 328
 
0.3%
Other Punctuation 63
 
< 0.1%
Lowercase Letter 41
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6260
 
6.8%
4656
 
5.0%
3421
 
3.7%
3409
 
3.7%
2254
 
2.4%
2106
 
2.3%
1928
 
2.1%
1877
 
2.0%
1799
 
1.9%
1642
 
1.8%
Other values (436) 63187
68.3%
Uppercase Letter
ValueCountFrequency (%)
P 1638
35.2%
L 1625
34.9%
C 394
 
8.5%
I 334
 
7.2%
A 156
 
3.4%
B 115
 
2.5%
E 73
 
1.6%
T 69
 
1.5%
M 61
 
1.3%
G 53
 
1.1%
Other values (10) 135
 
2.9%
Decimal Number
ValueCountFrequency (%)
1 1248
28.4%
2 1100
25.0%
3 657
14.9%
4 338
 
7.7%
5 232
 
5.3%
8 195
 
4.4%
0 194
 
4.4%
6 192
 
4.4%
7 130
 
3.0%
9 110
 
2.5%
Other Punctuation
ValueCountFrequency (%)
. 44
69.8%
# 10
 
15.9%
, 4
 
6.3%
& 3
 
4.8%
· 2
 
3.2%
Space Separator
ValueCountFrequency (%)
17535
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9960
100.0%
Close Punctuation
ValueCountFrequency (%)
) 328
100.0%
Open Punctuation
ValueCountFrequency (%)
( 328
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 41
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 92539
71.3%
Common 32610
 
25.1%
Latin 4694
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6260
 
6.8%
4656
 
5.0%
3421
 
3.7%
3409
 
3.7%
2254
 
2.4%
2106
 
2.3%
1928
 
2.1%
1877
 
2.0%
1799
 
1.9%
1642
 
1.8%
Other values (436) 63187
68.3%
Latin
ValueCountFrequency (%)
P 1638
34.9%
L 1625
34.6%
C 394
 
8.4%
I 334
 
7.1%
A 156
 
3.3%
B 115
 
2.4%
E 73
 
1.6%
T 69
 
1.5%
M 61
 
1.3%
G 53
 
1.1%
Other values (11) 176
 
3.7%
Common
ValueCountFrequency (%)
17535
53.8%
- 9960
30.5%
1 1248
 
3.8%
2 1100
 
3.4%
3 657
 
2.0%
4 338
 
1.0%
) 328
 
1.0%
( 328
 
1.0%
5 232
 
0.7%
8 195
 
0.6%
Other values (9) 689
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 92539
71.3%
ASCII 37302
28.7%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17535
47.0%
- 9960
26.7%
P 1638
 
4.4%
L 1625
 
4.4%
1 1248
 
3.3%
2 1100
 
2.9%
3 657
 
1.8%
C 394
 
1.1%
4 338
 
0.9%
I 334
 
0.9%
Other values (29) 2473
 
6.6%
Hangul
ValueCountFrequency (%)
6260
 
6.8%
4656
 
5.0%
3421
 
3.7%
3409
 
3.7%
2254
 
2.4%
2106
 
2.3%
1928
 
2.1%
1877
 
2.0%
1799
 
1.9%
1642
 
1.8%
Other values (436) 63187
68.3%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct8629
Distinct (%)86.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-01-02 00:00:00
Maximum2021-01-12 16:27:00
2023-12-12T18:52:12.162873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:52:12.368878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T18:52:08.316937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:52:08.030456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:52:08.412824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:52:08.185725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:52:12.457990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번접수구분관리번호구청
순번1.0000.1290.1900.127
접수구분0.1291.0000.1820.196
관리번호0.1900.1821.0000.993
구청0.1270.1960.9931.000
2023-12-12T18:52:12.549573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
접수구분구청
접수구분1.0000.099
구청0.0991.000
2023-12-12T18:52:12.650800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번관리번호접수구분구청
순번1.0000.0580.0680.058
관리번호0.0581.0000.0960.979
접수구분0.0680.0961.0000.099
구청0.0580.9790.0991.000

Missing values

2023-12-12T18:52:08.529040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:52:08.654134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번접수일자접수구분관리번호구청가로명구간처리일
15457154582020-10-29 00:00:00일상점검2110000동구동대구로귀빈예식장 - 동대구역네거리2020-10-29 16:00:00
677167722020-05-11 00:00:00일상점검5437000북구복현로성화여고 - 서한이다움2020-05-11 16:39:00
17754177552020-12-31 00:00:00일상점검5161004북구신암로동인네거리 - 칠성교2020-12-31 15:47:00
104510462020-01-23 00:00:00일상점검5430000북구구암로동천교 - 칠곡네거리2020-01-23 15:25:00
15226152272020-10-23 00:00:00일상점검8387006달성군테크노폴리스테크노폴리스 LP-2-222020-10-23 15:16:00
11125111262020-08-01 00:00:00일상점검7026000달서구달서대로경농 - 서부수질사업소2020-08-01 00:49:00
545754582020-04-10 00:00:00일상점검6233000수성구들안로들안길네거리 - 들안길삼거리2020-04-10 00:00:00
742674272020-05-21 00:00:00일상점검5240024북구칠곡3지구거동교 - 우방아파트2020-05-28 12:31:00
11195111962020-08-03 00:00:00일상점검5241000북구칠곡3지구원종장 - 학남초교2020-08-03 16:40:00
169316942020-02-06 00:00:00일상점검3169000서구문화로평리1동주민센터 - 오스카주차장2020-02-07 13:57:00
순번접수일자접수구분관리번호구청가로명구간처리일
282128222020-02-26 00:00:00일상점검2170001동구안심로송정삼거리 - 안심차량기지2020-02-26 17:04:00
380738082020-03-12 00:00:00일상점검5047005북구고성북로태평전화국 - 대구여중2020-03-13 16:00:00
11579115802020-08-11 00:00:00일상점검7457008달서구기타월암초교 - 월배 e편한세상2020-08-11 17:41:00
13224132252020-09-07 00:00:00일상점검7006009달서구성서공단북로아시아시멘트 - 금복주2020-09-23 15:34:00
15497154982020-10-29 00:00:00일상점검2367004동구신평로오리온제과 - 신덕마을 입구2020-10-29 15:55:00
4904912020-01-10 00:00:00일상점검5170000북구침산로침산네거리 - 북침산네거리2020-01-10 17:52:00
3373382020-01-08 00:00:00일상점검8529000달성군국가산업단지대구국가산업단지 LP-1-5B2020-01-08 00:00:00
304230432020-03-02 00:00:00일상점검2480000동구혁신도시혁신도시 3공구2020-03-02 15:48:00
141114122020-01-31 00:00:00일상점검5206000북구복현로검단네거리 - 복현육교2020-01-31 17:33:00
117211732020-01-29 00:00:00일상점검1074013중구달구벌대로계산오거리 - 반월당2020-01-29 17:24:00