Overview

Dataset statistics

Number of variables: 5
Number of observations: 58
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.5 KiB
Average record size in memory: 43.3 B

Variable types

Numeric: 1
Categorical: 2
DateTime: 1
Text: 1

Dataset

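Description: Streetlight construction records from the streetlight management system of 대구공공시설관리공단 (Daegu Public Facilities Management Corporation, formerly 대구시설공단). The columns are 순번 (sequence number), 공사구분 (construction category), 입력일 (entry date), 공사명 (construction name), and 공사진행상태 (construction progress status).
URL: https://www.data.go.kr/data/15120488/fileData.do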

Alerts

공사진행상태 is highly imbalanced (63.8%) [Imbalance]
순번 has unique values [Unique]
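
The 63.8% figure in the imbalance alert is consistent with one minus the normalized Shannon entropy of the class counts of 공사진행상태 (54 and 4 out of 58 rows). The sketch below assumes that definition. The function name `imbalance_score` is illustrative and does not come from the report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts.

    0.0 means perfectly balanced classes, 1.0 means a single dominant class.
    Assumption: this mirrors the score behind the "Imbalance" alert.
    """
    total = sum(counts)
    if len(counts) < 2 or total == 0:
        return 0.0
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(len(counts))

# Class counts of 공사진행상태 as listed in the report: 공사완료 = 54, 설계완료 = 4.
score = imbalance_score([54, 4])
print(f"{score:.1%}")  # prints 63.8%
```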

Reproduction

Analysis started: 2023-12-12 22:24:32.656894
Analysis finished: 2023-12-12 22:24:33.162630
Duration: 0.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
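
A report of this kind could be regenerated roughly as follows. This is a minimal sketch, assuming pandas and ydata-profiling are installed. The function name `build_report`, the file paths, the report title, and the CSV encoding are hypothetical, since the report does not record how the source file was loaded.

```python
def build_report(csv_path, output_html="report.html", encoding="utf-8"):
    """Profile the CSV at csv_path and write an HTML report to output_html.

    csv_path, output_html and encoding are placeholders; check the actual
    encoding of the file downloaded from data.go.kr before relying on them.
    """
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Parse 입력일 as a date so that it is profiled as a DateTime variable.
    df = pd.read_csv(csv_path, encoding=encoding, parse_dates=["입력일"])
    report = ProfileReport(
        df,
        title="Streetlight construction records",  # hypothetical title
        dataset={"url": "https://www.data.go.kr/data/15120488/fileData.do"},
    )
    report.to_file(output_html)
    return report
```

Timings and memory figures will differ between runs and library versions, so only the statistics themselves should be expected to match.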

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 58
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29.5
Minimum: 1
Maximum: 58
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 654.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3.85
Q1: 15.25
Median: 29.5
Q3: 43.75
95-th percentile: 55.15
Maximum: 58
Range: 57
Interquartile range (IQR): 28.5
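
The quantile values above match linear interpolation between neighbouring ranks. The sketch below assumes that 순번 holds the integers 1 through 58, which is consistent with the reported minimum, maximum, distinct count, and sum. The helper name `quantile` is illustrative.

```python
def quantile(sorted_values, q):
    """Quantile of pre-sorted data, interpolating linearly between ranks."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 59))  # assumed contents of 순번: 1..58 with no gaps

p05 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1

print(round(p05, 2), q1, median, q3, round(p95, 2), iqr)
```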

Descriptive statistics

Standard deviation: 16.886879
Coefficient of variation (CV): 0.57243656
Kurtosis: -1.2
Mean: 29.5
Median Absolute Deviation (MAD): 14.5
Skewness: 0
Sum: 1711
Variance: 285.16667
Monotonicity: Strictly increasing
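
These descriptive statistics can be checked with the standard library alone. The sketch below again assumes that 순번 is the sequence 1 through 58. It uses the sample variance (n - 1 denominator) and the bias-corrected excess kurtosis, which is an assumption about how the reported values were computed, although it reproduces them.

```python
import math
import statistics

values = list(range(1, 59))  # assumed contents of 순번: 1..58
n = len(values)

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Bias-corrected excess kurtosis, built from the central moments m2 and m4.
m2 = sum((v - mean) ** 2 for v in values) / n
m4 = sum((v - mean) ** 4 for v in values) / n
g2 = m4 / m2 ** 2 - 3
kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * g2 + 6)

# Monotonicity: every value is larger than the one before it.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, round(variance, 5), round(std, 6), round(cv, 8), mad, round(kurtosis, 4))
```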
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 1.7%
45 | 1 | 1.7%
33 | 1 | 1.7%
34 | 1 | 1.7%
35 | 1 | 1.7%
36 | 1 | 1.7%
37 | 1 | 1.7%
38 | 1 | 1.7%
39 | 1 | 1.7%
40 | 1 | 1.7%
Other values (48) | 48 | 82.8%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 1.7%
2 | 1 | 1.7%
3 | 1 | 1.7%
4 | 1 | 1.7%
5 | 1 | 1.7%
6 | 1 | 1.7%
7 | 1 | 1.7%
8 | 1 | 1.7%
9 | 1 | 1.7%
10 | 1 | 1.7%

Maximum 10 values
Value | Count | Frequency (%)
58 | 1 | 1.7%
57 | 1 | 1.7%
56 | 1 | 1.7%
55 | 1 | 1.7%
54 | 1 | 1.7%
53 | 1 | 1.7%
52 | 1 | 1.7%
51 | 1 | 1.7%
50 | 1 | 1.7%
49 | 1 | 1.7%

공사구분
Categorical

Distinct: 5
Distinct (%): 8.6%
Missing: 0
Missing (%): 0.0%
Memory size: 596.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: G0001
2nd row: G0001
3rd row: G0001
4th row: G0001
5th row: G0001

Common Values

Value | Count | Frequency (%)
G0001 | 25 | 43.1%
G0008 | 20 | 34.5%
G0009 | 7 | 12.1%
G0004 | 4 | 6.9%
G0007 | 2 | 3.4%
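
Each frequency is the category count divided by the 58 observations, rounded to one decimal place. A minimal sketch, using the counts listed above rather than the raw data:

```python
from collections import Counter

# Category counts for 공사구분 as listed in the report (58 rows in total).
counts = Counter({"G0001": 25, "G0008": 20, "G0009": 7, "G0004": 4, "G0007": 2})
total = sum(counts.values())

# Percentage share of each category, most frequent first.
frequency = {
    value: round(100 * count / total, 1)
    for value, count in counts.most_common()
}

for value, pct in frequency.items():
    print(f"{value}\t{counts[value]}\t{pct}%")
```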

Length

Histogram of lengths of the category

Common Values (Plot)

입력일
DateTime

Distinct: 45
Distinct (%): 77.6%
Missing: 0
Missing (%): 0.0%
Memory size: 596.0 B
Minimum: 2020-01-13 00:00:00
Maximum: 2021-06-15 00:00:00
Histogram with fixed size bins (bins=45)

공사명
Text

Distinct: 57
Distinct (%): 98.3%
Missing: 0
Missing (%): 0.0%
Memory size: 596.0 B

Length

Max length: 40
Median length: 35
Mean length: 28.224138
Min length: 15

Characters and Unicode

Total characters: 1637
Distinct characters: 143
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
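
As an illustration of those character properties, the sketch below counts the Unicode general categories in the first sample row of 공사명 using Python's `unicodedata` module. The two-letter codes map onto the category names used in this report: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, and `Pd` is Dash Punctuation. The helper name `category_counts` is illustrative.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (e.g. 'Lo', 'Nd', 'Zs')."""
    # Normalize to NFC so each Hangul syllable is one precomposed code point.
    normalized = unicodedata.normalize("NFC", text)
    return Counter(unicodedata.category(ch) for ch in normalized)

# First sample row of 공사명 from the report.
sample = "달서구 299-10번주 외 2개소 손괴 가로등주 교체공사"
counts = category_counts(sample)

for category, count in counts.most_common():
    print(category, count)
```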

Unique

Unique: 56
Unique (%): 96.6%

Sample

1st row: 달서구 299-10번주 외 2개소 손괴 가로등주 교체공사
2nd row: 동구 83-10번주 외 3개소 손괴 가로등주 교체공사
3rd row: 달성군 447-3번주 외 1개소 손괴 가로등주 교체공사
4th row: 동구 145-44번주 외 5개소 손괴 가로등주 교체공사
5th row: 수성구 109-11번주 외 4개소 손괴 가로등주 교체공사

Value | Count | Frequency (%)
가로등주 | 34 | 10.0%
손괴 | 29 | 8.5%
교체공사 | 22 | 6.5%
연간단가 | 21 | 6.2%
 | 19 | 5.6%
2020년 | 19 | 5.6%
기성 | 13 | 3.8%
공사 | 9 | 2.6%
교체공사(3권역 | 8 | 2.4%
교체공사(2권역 | 7 | 2.1%
Other values (90) | 159 | 46.8%

Most occurring characters

Value | Count | Frequency (%)
(space) | 282 | 17.2%
 | 63 | 3.8%
 | 59 | 3.6%
2 | 59 | 3.6%
 | 57 | 3.5%
 | 52 | 3.2%
 | 52 | 3.2%
0 | 44 | 2.7%
 | 44 | 2.7%
 | 43 | 2.6%
Other values (133) | 882 | 53.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1072 | 65.5%
Space Separator | 282 | 17.2%
Decimal Number | 187 | 11.4%
Uppercase Letter | 30 | 1.8%
Open Punctuation | 26 | 1.6%
Close Punctuation | 25 | 1.5%
Dash Punctuation | 11 | 0.7%
Math Symbol | 4 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 63 | 5.9%
 | 59 | 5.5%
 | 57 | 5.3%
 | 52 | 4.9%
 | 52 | 4.9%
 | 44 | 4.1%
 | 43 | 4.0%
 | 42 | 3.9%
 | 29 | 2.7%
 | 29 | 2.7%
Other values (112) | 602 | 56.2%

Decimal Number
Value | Count | Frequency (%)
2 | 59 | 31.6%
0 | 44 | 23.5%
1 | 26 | 13.9%
3 | 20 | 10.7%
4 | 12 | 6.4%
5 | 9 | 4.8%
6 | 6 | 3.2%
7 | 6 | 3.2%
9 | 3 | 1.6%
8 | 2 | 1.1%

Uppercase Letter
Value | Count | Frequency (%)
D | 9 | 30.0%
L | 9 | 30.0%
E | 9 | 30.0%
A | 1 | 3.3%
C | 1 | 3.3%
B | 1 | 3.3%

Space Separator
Value | Count | Frequency (%)
(space) | 282 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 26 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 25 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 11 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1072 | 65.5%
Common | 535 | 32.7%
Latin | 30 | 1.8%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 63 | 5.9%
 | 59 | 5.5%
 | 57 | 5.3%
 | 52 | 4.9%
 | 52 | 4.9%
 | 44 | 4.1%
 | 43 | 4.0%
 | 42 | 3.9%
 | 29 | 2.7%
 | 29 | 2.7%
Other values (112) | 602 | 56.2%

Common
Value | Count | Frequency (%)
(space) | 282 | 52.7%
2 | 59 | 11.0%
0 | 44 | 8.2%
( | 26 | 4.9%
1 | 26 | 4.9%
) | 25 | 4.7%
3 | 20 | 3.7%
4 | 12 | 2.2%
- | 11 | 2.1%
5 | 9 | 1.7%
Other values (5) | 21 | 3.9%

Latin
Value | Count | Frequency (%)
D | 9 | 30.0%
L | 9 | 30.0%
E | 9 | 30.0%
A | 1 | 3.3%
C | 1 | 3.3%
B | 1 | 3.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1072 | 65.5%
ASCII | 565 | 34.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 282 | 49.9%
2 | 59 | 10.4%
0 | 44 | 7.8%
( | 26 | 4.6%
1 | 26 | 4.6%
) | 25 | 4.4%
3 | 20 | 3.5%
4 | 12 | 2.1%
- | 11 | 1.9%
5 | 9 | 1.6%
Other values (11) | 51 | 9.0%

Hangul
Value | Count | Frequency (%)
 | 63 | 5.9%
 | 59 | 5.5%
 | 57 | 5.3%
 | 52 | 4.9%
 | 52 | 4.9%
 | 44 | 4.1%
 | 43 | 4.0%
 | 42 | 3.9%
 | 29 | 2.7%
 | 29 | 2.7%
Other values (112) | 602 | 56.2%

공사진행상태
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Memory size: 596.0 B

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공사완료
2nd row: 공사완료
3rd row: 공사완료
4th row: 공사완료
5th row: 공사완료

Common Values

Value | Count | Frequency (%)
공사완료 (construction completed) | 54 | 93.1%
설계완료 (design completed) | 4 | 6.9%

Length

Histogram of lengths of the category

Common Values (Plot)


Interactions


Correlations

 | 순번 | 공사구분 | 입력일 | 공사명 | 공사진행상태
순번 | 1.000 | 0.651 | 0.996 | 1.000 | 0.281
공사구분 | 0.651 | 1.000 | 0.887 | 1.000 | 0.000
입력일 | 0.996 | 0.887 | 1.000 | 1.000 | 0.772
공사명 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000
공사진행상태 | 0.281 | 0.000 | 0.772 | 0.000 | 1.000

 | 공사진행상태 | 공사구분
공사진행상태 | 1.000 | 0.000
공사구분 | 0.000 | 1.000

 | 순번 | 공사구분 | 공사진행상태
순번 | 1.000 | 0.306 | 0.193
공사구분 | 0.306 | 1.000 | 0.000
공사진행상태 | 0.193 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
index | 순번 | 공사구분 | 입력일 | 공사명 | 공사진행상태
0 | 1 | G0001 | 2020-01-13 | 달서구 299-10번주 외 2개소 손괴 가로등주 교체공사 | 공사완료
1 | 2 | G0001 | 2020-01-15 | 동구 83-10번주 외 3개소 손괴 가로등주 교체공사 | 공사완료
2 | 3 | G0001 | 2020-01-28 | 달성군 447-3번주 외 1개소 손괴 가로등주 교체공사 | 공사완료
3 | 4 | G0001 | 2020-02-04 | 동구 145-44번주 외 5개소 손괴 가로등주 교체공사 | 공사완료
4 | 5 | G0001 | 2020-02-27 | 수성구 109-11번주 외 4개소 손괴 가로등주 교체공사 | 공사완료
5 | 6 | G0001 | 2020-02-28 | 중구 56-7번주 외 3개소 손괴 가로등주 교체공사 | 공사완료
6 | 7 | G0001 | 2020-03-02 | 달서구 273-1번주 외 6개소 손괴 가로등주 교체공사 | 공사완료
7 | 8 | G0001 | 2020-03-02 | 동구 138-3번주 외 3개소 손괴 가로등주 교체공사 | 공사완료
8 | 9 | G0009 | 2020-03-06 | 국우터널 인입 수전설비 변경공사 설계 용역 | 공사완료
9 | 10 | G0008 | 2020-03-20 | 수성지하차도 LED조명등 교체공사 | 공사완료

Last rows
index | 순번 | 공사구분 | 입력일 | 공사명 | 공사진행상태
48 | 49 | G0008 | 2021-03-19 | 공산터널 수전설비 인입 변경 공사 | 공사완료
49 | 50 | G0008 | 2021-03-22 | 성서택지 등 14개소 LED 보행등 설치 공사 | 공사완료
50 | 51 | G0001 | 2021-03-22 | 2020년 연간단가 손괴 가로등주 교체공사(1권역) 6회차 기성 | 공사완료
51 | 52 | G0008 | 2021-03-22 | 팔조령터널 입출구부 조도개선 공사 | 공사완료
52 | 53 | G0008 | 2021-03-31 | 청수로 외 2개소 LED조명등 교체공사 | 공사완료
53 | 54 | G0008 | 2021-04-05 | 도시고속도로 외 2개소 LED 조명등 교체공사 | 공사완료
54 | 55 | G0008 | 2021-05-25 | 월성로 등 2개소 LED조명등 교체공사 | 공사완료
55 | 56 | G0007 | 2021-05-26 | 지산택지 외 2개소 퇴색 가로등주 도색 공사 | 공사완료
56 | 57 | G0001 | 2021-06-10 | 2021년 연간단가 손괴 가로등주 교체공사(3권역) 1회차 | 공사완료
57 | 58 | G0001 | 2021-06-15 | 2021년 연간단가 손괴 가로등주 교체공사(2권역) 1회차 | 설계완료