Overview

Dataset statistics

Number of variables12
Number of observations136
Missing cells64
Missing cells (%)3.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.0 KiB
Average record size in memory98.0 B

Variable types

Categorical3
Text6
DateTime2
Numeric1

Dataset

Description국토교통부(국토관리청)에서 발주한 도로 공사의 계약정보로 발주청, 공사명, 공사구분, 주요사업지, 사업개요, 노선명, 공사위치, 시공사, 현장주소, 시작일, 준공일, 도급액(원) 항목의 데이터를 제공합니다.
Author국토교통부
URLhttps://www.data.go.kr/data/15052289/fileData.do

Alerts

공사구분 has constant value ""Constant
발주청 is highly overall correlated with 노선명High correlation
노선명 is highly overall correlated with 발주청High correlation
공사위치 has 36 (26.5%) missing valuesMissing
시작일 has 14 (10.3%) missing valuesMissing
준공일 has 14 (10.3%) missing valuesMissing
공사명 has unique valuesUnique
도급액(원) has 25 (18.4%) zerosZeros

Reproduction

Analysis started2023-12-16 15:09:50.689391
Analysis finished2023-12-16 15:09:56.233345
Duration5.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

발주청
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
부산지방국토관리청
51 
익산지방국토관리청
31 
대전지방국토관리청
28 
서울지방국토관리청
14 
원주지방국토관리청
12 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울지방국토관리청
2nd row서울지방국토관리청
3rd row서울지방국토관리청
4th row서울지방국토관리청
5th row서울지방국토관리청

Common Values

ValueCountFrequency (%)
부산지방국토관리청 51
37.5%
익산지방국토관리청 31
22.8%
대전지방국토관리청 28
20.6%
서울지방국토관리청 14
 
10.3%
원주지방국토관리청 12
 
8.8%

Length

2023-12-16T15:09:56.582132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T15:09:57.076306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산지방국토관리청 51
37.5%
익산지방국토관리청 31
22.8%
대전지방국토관리청 28
20.6%
서울지방국토관리청 14
 
10.3%
원주지방국토관리청 12
 
8.8%

공사명
Text

UNIQUE 

Distinct136
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-16T15:09:58.123138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length35
Mean length20.830882
Min length11

Characters and Unicode

Total characters2833
Distinct characters212
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique136 ?
Unique (%)100.0%

Sample

1st row공도-대덕 도로확장공사
2nd row국도38호선 평택 오성-고덕 도로확장공사
3rd row국도3호선대체우회도로(민락2지구) 소음저감시설 설치공사
4th row국도3호선 동이천IC 도로건설공사
5th row국도3호선 연천-신탄리2 도로건설공사
ValueCountFrequency (%)
도로건설공사 43
 
10.1%
국도건설공사 23
 
5.4%
건설공사 15
 
3.5%
도로시설개량공사 15
 
3.5%
11
 
2.6%
2023년도 8
 
1.9%
구축 8
 
1.9%
공사 8
 
1.9%
도로확장공사 8
 
1.9%
국도its 8
 
1.9%
Other values (232) 278
65.4%
2023-12-16T15:10:00.152359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
289
 
10.2%
189
 
6.7%
167
 
5.9%
142
 
5.0%
113
 
4.0%
101
 
3.6%
- 94
 
3.3%
94
 
3.3%
83
 
2.9%
( 63
 
2.2%
Other values (202) 1498
52.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2090
73.8%
Space Separator 289
 
10.2%
Decimal Number 163
 
5.8%
Dash Punctuation 94
 
3.3%
Open Punctuation 63
 
2.2%
Close Punctuation 63
 
2.2%
Uppercase Letter 45
 
1.6%
Math Symbol 25
 
0.9%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
189
 
9.0%
167
 
8.0%
142
 
6.8%
113
 
5.4%
101
 
4.8%
94
 
4.5%
83
 
4.0%
60
 
2.9%
38
 
1.8%
32
 
1.5%
Other values (180) 1071
51.2%
Decimal Number
ValueCountFrequency (%)
2 52
31.9%
1 43
26.4%
3 23
14.1%
0 12
 
7.4%
5 11
 
6.7%
4 6
 
3.7%
6 6
 
3.7%
7 4
 
2.5%
8 3
 
1.8%
9 3
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
I 17
37.8%
S 9
20.0%
C 9
20.0%
T 8
17.8%
W 1
 
2.2%
B 1
 
2.2%
Space Separator
ValueCountFrequency (%)
289
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 94
100.0%
Open Punctuation
ValueCountFrequency (%)
( 63
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Math Symbol
ValueCountFrequency (%)
~ 25
100.0%
Other Punctuation
ValueCountFrequency (%)
· 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2090
73.8%
Common 698
 
24.6%
Latin 45
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
189
 
9.0%
167
 
8.0%
142
 
6.8%
113
 
5.4%
101
 
4.8%
94
 
4.5%
83
 
4.0%
60
 
2.9%
38
 
1.8%
32
 
1.5%
Other values (180) 1071
51.2%
Common
ValueCountFrequency (%)
289
41.4%
- 94
 
13.5%
( 63
 
9.0%
) 63
 
9.0%
2 52
 
7.4%
1 43
 
6.2%
~ 25
 
3.6%
3 23
 
3.3%
0 12
 
1.7%
5 11
 
1.6%
Other values (6) 23
 
3.3%
Latin
ValueCountFrequency (%)
I 17
37.8%
S 9
20.0%
C 9
20.0%
T 8
17.8%
W 1
 
2.2%
B 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2090
73.8%
ASCII 742
 
26.2%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
289
38.9%
- 94
 
12.7%
( 63
 
8.5%
) 63
 
8.5%
2 52
 
7.0%
1 43
 
5.8%
~ 25
 
3.4%
3 23
 
3.1%
I 17
 
2.3%
0 12
 
1.6%
Other values (11) 61
 
8.2%
Hangul
ValueCountFrequency (%)
189
 
9.0%
167
 
8.0%
142
 
6.8%
113
 
5.4%
101
 
4.8%
94
 
4.5%
83
 
4.0%
60
 
2.9%
38
 
1.8%
32
 
1.5%
Other values (180) 1071
51.2%
None
ValueCountFrequency (%)
· 1
100.0%

공사구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
도로
136 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row도로
2nd row도로
3rd row도로
4th row도로
5th row도로

Common Values

ValueCountFrequency (%)
도로 136
100.0%

Length

2023-12-16T15:10:01.058878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T15:10:01.560225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도로 136
100.0%
Distinct117
Distinct (%)86.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-16T15:10:02.723939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length27
Mean length15.492647
Min length2

Characters and Unicode

Total characters2107
Distinct characters187
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)75.7%

Sample

1st row안성시 공도읍, 안성시 대덕면
2nd row평택시 고덕면 동고리 일원
3rd row의정부시 민락2지구
4th row경기 이천, 여주
5th row경기도 연천군
ValueCountFrequency (%)
31
 
5.6%
경북 17
 
3.1%
일원 16
 
2.9%
전라남도 9
 
1.6%
영남권 8
 
1.4%
전라북도 8
 
1.4%
안동시 7
 
1.3%
충청남도 7
 
1.3%
충북 6
 
1.1%
경남 5
 
0.9%
Other values (317) 442
79.5%
2023-12-16T15:10:05.110583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
423
 
20.1%
98
 
4.7%
87
 
4.1%
82
 
3.9%
78
 
3.7%
60
 
2.8%
53
 
2.5%
52
 
2.5%
49
 
2.3%
~ 49
 
2.3%
Other values (177) 1076
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1582
75.1%
Space Separator 423
 
20.1%
Math Symbol 51
 
2.4%
Other Punctuation 30
 
1.4%
Dash Punctuation 10
 
0.5%
Decimal Number 5
 
0.2%
Close Punctuation 3
 
0.1%
Open Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
98
 
6.2%
87
 
5.5%
82
 
5.2%
78
 
4.9%
60
 
3.8%
53
 
3.4%
52
 
3.3%
49
 
3.1%
37
 
2.3%
37
 
2.3%
Other values (167) 949
60.0%
Math Symbol
ValueCountFrequency (%)
~ 49
96.1%
2
 
3.9%
Other Punctuation
ValueCountFrequency (%)
, 28
93.3%
/ 2
 
6.7%
Decimal Number
ValueCountFrequency (%)
1 3
60.0%
2 2
40.0%
Space Separator
ValueCountFrequency (%)
423
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1582
75.1%
Common 525
 
24.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
98
 
6.2%
87
 
5.5%
82
 
5.2%
78
 
4.9%
60
 
3.8%
53
 
3.4%
52
 
3.3%
49
 
3.1%
37
 
2.3%
37
 
2.3%
Other values (167) 949
60.0%
Common
ValueCountFrequency (%)
423
80.6%
~ 49
 
9.3%
, 28
 
5.3%
- 10
 
1.9%
) 3
 
0.6%
( 3
 
0.6%
1 3
 
0.6%
2
 
0.4%
/ 2
 
0.4%
2 2
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1582
75.1%
ASCII 523
 
24.8%
Math Operators 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
423
80.9%
~ 49
 
9.4%
, 28
 
5.4%
- 10
 
1.9%
) 3
 
0.6%
( 3
 
0.6%
1 3
 
0.6%
/ 2
 
0.4%
2 2
 
0.4%
Hangul
ValueCountFrequency (%)
98
 
6.2%
87
 
5.5%
82
 
5.2%
78
 
4.9%
60
 
3.8%
53
 
3.4%
52
 
3.3%
49
 
3.1%
37
 
2.3%
37
 
2.3%
Other values (167) 949
60.0%
Math Operators
ValueCountFrequency (%)
2
100.0%
Distinct118
Distinct (%)86.8%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-16T15:10:07.451100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length77
Median length50
Mean length20.220588
Min length5

Characters and Unicode

Total characters2750
Distinct characters156
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)79.4%

Sample

1st rowL=3.7km, 차로수(4→6차로 확장), 교량1개소
2nd row평택 오성-고덕간 도로확장공사
3rd row소음저감시설 설치
4th row동이천IC 신설
5th row2차로 시설개량 L=6.9km
ValueCountFrequency (%)
교량 14
 
3.0%
연장 14
 
3.0%
13
 
2.8%
교차로 12
 
2.6%
2차로 11
 
2.4%
9
 
1.9%
its 8
 
1.7%
설치 8
 
1.7%
시설물 8
 
1.7%
시설개량 7
 
1.5%
Other values (255) 364
77.8%
2023-12-16T15:10:10.538084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
333
 
12.1%
m 151
 
5.5%
. 148
 
5.4%
1 131
 
4.8%
2 107
 
3.9%
= 101
 
3.7%
100
 
3.6%
, 94
 
3.4%
k 92
 
3.3%
5 79
 
2.9%
Other values (146) 1414
51.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 960
34.9%
Decimal Number 585
21.3%
Space Separator 333
 
12.1%
Other Punctuation 274
 
10.0%
Lowercase Letter 247
 
9.0%
Uppercase Letter 140
 
5.1%
Math Symbol 117
 
4.3%
Close Punctuation 42
 
1.5%
Open Punctuation 40
 
1.5%
Dash Punctuation 11
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
100
 
10.4%
68
 
7.1%
64
 
6.7%
56
 
5.8%
50
 
5.2%
46
 
4.8%
45
 
4.7%
45
 
4.7%
38
 
4.0%
32
 
3.3%
Other values (106) 416
43.3%
Decimal Number
ValueCountFrequency (%)
1 131
22.4%
2 107
18.3%
5 79
13.5%
0 66
11.3%
4 50
 
8.5%
3 40
 
6.8%
9 32
 
5.5%
8 28
 
4.8%
6 28
 
4.8%
7 24
 
4.1%
Uppercase Letter
ValueCountFrequency (%)
L 70
50.0%
B 31
22.1%
I 9
 
6.4%
S 8
 
5.7%
T 8
 
5.7%
K 6
 
4.3%
M 3
 
2.1%
R 2
 
1.4%
V 2
 
1.4%
C 1
 
0.7%
Lowercase Letter
ValueCountFrequency (%)
m 151
61.1%
k 92
37.2%
h 1
 
0.4%
r 1
 
0.4%
p 1
 
0.4%
a 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
= 101
86.3%
~ 11
 
9.4%
> 2
 
1.7%
2
 
1.7%
1
 
0.9%
Other Punctuation
ValueCountFrequency (%)
. 148
54.0%
, 94
34.3%
/ 17
 
6.2%
: 15
 
5.5%
Space Separator
ValueCountFrequency (%)
333
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1403
51.0%
Hangul 960
34.9%
Latin 387
 
14.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
100
 
10.4%
68
 
7.1%
64
 
6.7%
56
 
5.8%
50
 
5.2%
46
 
4.8%
45
 
4.7%
45
 
4.7%
38
 
4.0%
32
 
3.3%
Other values (106) 416
43.3%
Common
ValueCountFrequency (%)
333
23.7%
. 148
10.5%
1 131
 
9.3%
2 107
 
7.6%
= 101
 
7.2%
, 94
 
6.7%
5 79
 
5.6%
0 66
 
4.7%
4 50
 
3.6%
) 42
 
3.0%
Other values (14) 252
18.0%
Latin
ValueCountFrequency (%)
m 151
39.0%
k 92
23.8%
L 70
18.1%
B 31
 
8.0%
I 9
 
2.3%
S 8
 
2.1%
T 8
 
2.1%
K 6
 
1.6%
M 3
 
0.8%
R 2
 
0.5%
Other values (6) 7
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1786
64.9%
Hangul 960
34.9%
Arrows 2
 
0.1%
CJK Compat 1
 
< 0.1%
Math Operators 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
333
18.6%
m 151
 
8.5%
. 148
 
8.3%
1 131
 
7.3%
2 107
 
6.0%
= 101
 
5.7%
, 94
 
5.3%
k 92
 
5.2%
5 79
 
4.4%
L 70
 
3.9%
Other values (27) 480
26.9%
Hangul
ValueCountFrequency (%)
100
 
10.4%
68
 
7.1%
64
 
6.7%
56
 
5.8%
50
 
5.2%
46
 
4.8%
45
 
4.7%
45
 
4.7%
38
 
4.0%
32
 
3.3%
Other values (106) 416
43.3%
Arrows
ValueCountFrequency (%)
2
100.0%
CJK Compat
ValueCountFrequency (%)
1
100.0%
Math Operators
ValueCountFrequency (%)
1
100.0%

노선명
Categorical

HIGH CORRELATION 

Distinct41
Distinct (%)30.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
기타
19 
<NA>
16 
국도 77호선(부산 ~ 인천)
 
7
국도 1호선(목포 ~ 신의주(판문점))
 
7
국도 3호선(남해 ~ 초산 (철원))
 
6
Other values (36)
81 

Length

Max length21
Median length16
Mean length13.080882
Min length2

Unique

Unique14 ?
Unique (%)10.3%

Sample

1st row국도 38호선(서산 ~ 동해)
2nd row국도 38호선(서산 ~ 동해)
3rd row국도 3호선(남해 ~ 초산 (철원))
4th row국도 3호선(남해 ~ 초산 (철원))
5th row국도 3호선(남해 ~ 초산 (철원))

Common Values

ValueCountFrequency (%)
기타 19
 
14.0%
<NA> 16
 
11.8%
국도 77호선(부산 ~ 인천) 7
 
5.1%
국도 1호선(목포 ~ 신의주(판문점)) 7
 
5.1%
국도 3호선(남해 ~ 초산 (철원)) 6
 
4.4%
국도 59호선(광양 ~ 하동) 5
 
3.7%
국도 19호선(남해 ~ 원주) 5
 
3.7%
국도 20호선(산청 ~ 포항) 5
 
3.7%
국도 36호선(대천 ~ 울진) 4
 
2.9%
국도 35호선(부산 ~ 강릉) 4
 
2.9%
Other values (31) 58
42.6%

Length

2023-12-16T15:10:11.661638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
국도 101
22.8%
99
22.3%
기타 19
 
4.3%
na 16
 
3.6%
포항 7
 
1.6%
77호선(부산 7
 
1.6%
인천 7
 
1.6%
1호선(목포 7
 
1.6%
신의주(판문점 7
 
1.6%
철원 6
 
1.4%
Other values (69) 167
37.7%

공사위치
Text

MISSING 

Distinct99
Distinct (%)99.0%
Missing36
Missing (%)26.5%
Memory size1.2 KiB
2023-12-16T15:10:13.058743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length95
Median length46.5
Mean length34.7
Min length5

Characters and Unicode

Total characters3470
Distinct characters205
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)98.0%

Sample

1st row안성시 공도읍 만정리~대덕면 마정리
2nd row경기도 연천군 신서면 도신리 426-1~경기도 연천군 도신로 49-2
3rd row경기도 이천시 부발읍 응암리 582-27~경기도 여주시 가남읍 은봉리 736
4th row여주시 가남읍 은봉리~이천시 장호원읍 풍계리
5th row경기도 평택시 청북면 현곡리~경기도 화성시 양감면 요당리
ValueCountFrequency (%)
경상북도 12
 
1.6%
강원도 10
 
1.3%
전라북도 9
 
1.2%
전라남도 9
 
1.2%
안동시 9
 
1.2%
여수시 7
 
0.9%
충청북도 7
 
0.9%
전남 6
 
0.8%
6
 
0.8%
신안군 5
 
0.7%
Other values (480) 664
89.2%
2023-12-16T15:10:15.158294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
646
 
18.6%
154
 
4.4%
145
 
4.2%
118
 
3.4%
106
 
3.1%
~ 100
 
2.9%
96
 
2.8%
96
 
2.8%
80
 
2.3%
62
 
1.8%
Other values (195) 1867
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2376
68.5%
Space Separator 646
 
18.6%
Decimal Number 284
 
8.2%
Math Symbol 100
 
2.9%
Dash Punctuation 48
 
1.4%
Other Punctuation 5
 
0.1%
Open Punctuation 5
 
0.1%
Close Punctuation 5
 
0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
154
 
6.5%
145
 
6.1%
118
 
5.0%
106
 
4.5%
96
 
4.0%
96
 
4.0%
80
 
3.4%
62
 
2.6%
61
 
2.6%
59
 
2.5%
Other values (178) 1399
58.9%
Decimal Number
ValueCountFrequency (%)
1 57
20.1%
2 43
15.1%
4 35
12.3%
9 28
9.9%
3 23
8.1%
5 23
8.1%
6 21
 
7.4%
0 20
 
7.0%
7 18
 
6.3%
8 16
 
5.6%
Space Separator
ValueCountFrequency (%)
646
100.0%
Math Symbol
ValueCountFrequency (%)
~ 100
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Uppercase Letter
ValueCountFrequency (%)
U 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2376
68.5%
Common 1093
31.5%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
154
 
6.5%
145
 
6.1%
118
 
5.0%
106
 
4.5%
96
 
4.0%
96
 
4.0%
80
 
3.4%
62
 
2.6%
61
 
2.6%
59
 
2.5%
Other values (178) 1399
58.9%
Common
ValueCountFrequency (%)
646
59.1%
~ 100
 
9.1%
1 57
 
5.2%
- 48
 
4.4%
2 43
 
3.9%
4 35
 
3.2%
9 28
 
2.6%
3 23
 
2.1%
5 23
 
2.1%
6 21
 
1.9%
Other values (6) 69
 
6.3%
Latin
ValueCountFrequency (%)
U 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2376
68.5%
ASCII 1094
31.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
646
59.0%
~ 100
 
9.1%
1 57
 
5.2%
- 48
 
4.4%
2 43
 
3.9%
4 35
 
3.2%
9 28
 
2.6%
3 23
 
2.1%
5 23
 
2.1%
6 21
 
1.9%
Other values (7) 70
 
6.4%
Hangul
ValueCountFrequency (%)
154
 
6.5%
145
 
6.1%
118
 
5.0%
106
 
4.5%
96
 
4.0%
96
 
4.0%
80
 
3.4%
62
 
2.6%
61
 
2.6%
59
 
2.5%
Other values (178) 1399
58.9%
Distinct119
Distinct (%)87.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-16T15:10:16.430188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length8.8382353
Min length4

Characters and Unicode

Total characters1202
Distinct characters141
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique106 ?
Unique (%)77.9%

Sample

1st row은파건설(주)
2nd row주식회사 시티
3rd row다스코 주식회사
4th row보구건설㈜
5th row지오종합건설 주식회사
ValueCountFrequency (%)
주식회사 40
 
17.2%
19
 
8.2%
8
 
3.4%
7
 
3.0%
5
 
2.1%
계룡건설산업(주 4
 
1.7%
3
 
1.3%
3
 
1.3%
3
 
1.3%
3
 
1.3%
Other values (123) 138
59.2%
2023-12-16T15:10:18.350364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
131
 
10.9%
105
 
8.7%
92
 
7.7%
81
 
6.7%
( 79
 
6.6%
) 79
 
6.6%
55
 
4.6%
52
 
4.3%
52
 
4.3%
21
 
1.7%
Other values (131) 455
37.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 932
77.5%
Space Separator 105
 
8.7%
Open Punctuation 79
 
6.6%
Close Punctuation 79
 
6.6%
Uppercase Letter 4
 
0.3%
Other Symbol 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
131
 
14.1%
92
 
9.9%
81
 
8.7%
55
 
5.9%
52
 
5.6%
52
 
5.6%
21
 
2.3%
20
 
2.1%
20
 
2.1%
17
 
1.8%
Other values (125) 391
42.0%
Uppercase Letter
ValueCountFrequency (%)
L 2
50.0%
D 2
50.0%
Space Separator
ValueCountFrequency (%)
105
100.0%
Open Punctuation
ValueCountFrequency (%)
( 79
100.0%
Close Punctuation
ValueCountFrequency (%)
) 79
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 935
77.8%
Common 263
 
21.9%
Latin 4
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
131
 
14.0%
92
 
9.8%
81
 
8.7%
55
 
5.9%
52
 
5.6%
52
 
5.6%
21
 
2.2%
20
 
2.1%
20
 
2.1%
17
 
1.8%
Other values (126) 394
42.1%
Common
ValueCountFrequency (%)
105
39.9%
( 79
30.0%
) 79
30.0%
Latin
ValueCountFrequency (%)
L 2
50.0%
D 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 932
77.5%
ASCII 267
 
22.2%
None 3
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
131
 
14.1%
92
 
9.9%
81
 
8.7%
55
 
5.9%
52
 
5.6%
52
 
5.6%
21
 
2.3%
20
 
2.1%
20
 
2.1%
17
 
1.8%
Other values (125) 391
42.0%
ASCII
ValueCountFrequency (%)
105
39.3%
( 79
29.6%
) 79
29.6%
L 2
 
0.7%
D 2
 
0.7%
None
ValueCountFrequency (%)
3
100.0%
Distinct135
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-16T15:10:19.699447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length36
Mean length22.830882
Min length15

Characters and Unicode

Total characters3105
Distinct characters219
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique134 ?
Unique (%)98.5%

Sample

1st row경기도 안성시 대림동산길 76
2nd row경기도 평택시 고덕면 방축리 793-4
3rd row전라남도 화순군 동면 운농리 1039-1
4th row경기도 이천시 부발읍 가산리 30-4
5th row경기도 연천군 신서면 대광리 601-3
ValueCountFrequency (%)
경상북도 25
 
3.5%
전라북도 20
 
2.8%
전라남도 17
 
2.4%
충청북도 15
 
2.1%
경상남도 15
 
2.1%
경기도 15
 
2.1%
강원도 11
 
1.5%
충청남도 10
 
1.4%
전주시 4
 
0.6%
화순군 4
 
0.6%
Other values (480) 575
80.9%
2023-12-16T15:10:21.694024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
577
 
18.6%
139
 
4.5%
1 108
 
3.5%
80
 
2.6%
71
 
2.3%
3 70
 
2.3%
68
 
2.2%
61
 
2.0%
60
 
1.9%
60
 
1.9%
Other values (209) 1811
58.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1908
61.4%
Space Separator 577
 
18.6%
Decimal Number 503
 
16.2%
Dash Punctuation 59
 
1.9%
Other Punctuation 21
 
0.7%
Open Punctuation 19
 
0.6%
Close Punctuation 18
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
139
 
7.3%
80
 
4.2%
71
 
3.7%
68
 
3.6%
61
 
3.2%
60
 
3.1%
60
 
3.1%
55
 
2.9%
50
 
2.6%
48
 
2.5%
Other values (194) 1216
63.7%
Decimal Number
ValueCountFrequency (%)
1 108
21.5%
3 70
13.9%
2 56
11.1%
4 53
10.5%
0 41
 
8.2%
6 38
 
7.6%
5 36
 
7.2%
8 35
 
7.0%
7 34
 
6.8%
9 32
 
6.4%
Space Separator
ValueCountFrequency (%)
577
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 59
100.0%
Other Punctuation
ValueCountFrequency (%)
, 21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1908
61.4%
Common 1197
38.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
139
 
7.3%
80
 
4.2%
71
 
3.7%
68
 
3.6%
61
 
3.2%
60
 
3.1%
60
 
3.1%
55
 
2.9%
50
 
2.6%
48
 
2.5%
Other values (194) 1216
63.7%
Common
ValueCountFrequency (%)
577
48.2%
1 108
 
9.0%
3 70
 
5.8%
- 59
 
4.9%
2 56
 
4.7%
4 53
 
4.4%
0 41
 
3.4%
6 38
 
3.2%
5 36
 
3.0%
8 35
 
2.9%
Other values (5) 124
 
10.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1908
61.4%
ASCII 1197
38.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
577
48.2%
1 108
 
9.0%
3 70
 
5.8%
- 59
 
4.9%
2 56
 
4.7%
4 53
 
4.4%
0 41
 
3.4%
6 38
 
3.2%
5 36
 
3.0%
8 35
 
2.9%
Other values (5) 124
 
10.4%
Hangul
ValueCountFrequency (%)
139
 
7.3%
80
 
4.2%
71
 
3.7%
68
 
3.6%
61
 
3.2%
60
 
3.1%
60
 
3.1%
55
 
2.9%
50
 
2.6%
48
 
2.5%
Other values (194) 1216
63.7%

시작일
Date

MISSING 

Distinct105
Distinct (%)86.1%
Missing14
Missing (%)10.3%
Memory size1.2 KiB
Minimum2008-03-24 00:00:00
Maximum2023-07-26 00:00:00
2023-12-16T15:10:22.291973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:10:23.199254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

준공일
Date

MISSING 

Distinct107
Distinct (%)87.7%
Missing14
Missing (%)10.3%
Memory size1.2 KiB
Minimum2023-09-24 00:00:00
Maximum2031-10-25 00:00:00
2023-12-16T15:10:24.030358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-16T15:10:24.732161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

도급액(원)
Real number (ℝ)

ZEROS 

Distinct110
Distinct (%)80.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3845026 × 1010
Minimum0
Maximum6.72 × 1011
Zeros25
Zeros (%)18.4%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-16T15:10:25.588653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q19.2189102 × 109
median3.0536369 × 1010
Q38.2334547 × 1010
95-th percentile1.3975 × 1011
Maximum6.72 × 1011
Range6.72 × 1011
Interquartile range (IQR)7.3115637 × 1010

Descriptive statistics

Standard deviation7.5708252 × 1010
Coefficient of variation (CV)1.4060399
Kurtosis33.221
Mean5.3845026 × 1010
Median Absolute Deviation (MAD)3.0536369 × 1010
Skewness4.6703818
Sum7.3229235 × 1012
Variance5.7317395 × 1021
MonotonicityNot monotonic
2023-12-16T15:10:26.771701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 25
 
18.4%
106000000000 2
 
1.5%
102000000000 2
 
1.5%
224000000000 1
 
0.7%
32858370800 1
 
0.7%
13841800000 1
 
0.7%
4400759000 1
 
0.7%
123000000000 1
 
0.7%
21563000000 1
 
0.7%
89407000000 1
 
0.7%
Other values (100) 100
73.5%
ValueCountFrequency (%)
0 25
18.4%
33286935 1
 
0.7%
4271453000 1
 
0.7%
4400759000 1
 
0.7%
6488987000 1
 
0.7%
7866890000 1
 
0.7%
7901300000 1
 
0.7%
8515700000 1
 
0.7%
8815441020 1
 
0.7%
9077818780 1
 
0.7%
ValueCountFrequency (%)
672000000000 1
0.7%
303000000000 1
0.7%
257000000000 1
0.7%
224000000000 1
0.7%
220000000000 1
0.7%
167000000000 1
0.7%
145000000000 1
0.7%
138000000000 1
0.7%
127000000000 1
0.7%
123000000000 1
0.7%

Interactions

2023-12-16T15:09:53.444293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-16T15:10:27.332098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발주청노선명공사위치도급액(원)
발주청1.0000.9501.0000.000
노선명0.9501.0001.0000.739
공사위치1.0001.0001.0001.000
도급액(원)0.0000.7391.0001.000
2023-12-16T15:10:27.944109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
노선명발주청
노선명1.0000.619
발주청0.6191.000
2023-12-16T15:10:28.513114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도급액(원)발주청노선명
도급액(원)1.0000.0000.357
발주청0.0001.0000.619
노선명0.3570.6191.000

Missing values

2023-12-16T15:09:54.256106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T15:09:55.262440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-16T15:09:55.980148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

발주청공사명공사구분주요사업지사업개요노선명공사위치시공사현장주소시작일준공일도급액(원)
0서울지방국토관리청공도-대덕 도로확장공사도로안성시 공도읍, 안성시 대덕면L=3.7km, 차로수(4→6차로 확장), 교량1개소국도 38호선(서산 ~ 동해)안성시 공도읍 만정리~대덕면 마정리은파건설(주)경기도 안성시 대림동산길 762023-07-262023-12-220
1서울지방국토관리청국도38호선 평택 오성-고덕 도로확장공사도로평택시 고덕면 동고리 일원평택 오성-고덕간 도로확장공사국도 38호선(서산 ~ 동해)<NA>주식회사 시티경기도 평택시 고덕면 방축리 793-42021-05-312027-04-2980528856916
2서울지방국토관리청국도3호선대체우회도로(민락2지구) 소음저감시설 설치공사도로의정부시 민락2지구소음저감시설 설치국도 3호선(남해 ~ 초산 (철원))<NA>다스코 주식회사전라남도 화순군 동면 운농리 1039-12021-10-082024-10-0641832000000
3서울지방국토관리청국도3호선 동이천IC 도로건설공사도로경기 이천, 여주동이천IC 신설국도 3호선(남해 ~ 초산 (철원))<NA>보구건설㈜경기도 이천시 부발읍 가산리 30-42020-10-072025-09-1021655081000
4서울지방국토관리청국도3호선 연천-신탄리2 도로건설공사도로경기도 연천군2차로 시설개량 L=6.9km국도 3호선(남해 ~ 초산 (철원))경기도 연천군 신서면 도신리 426-1~경기도 연천군 도신로 49-2지오종합건설 주식회사경기도 연천군 신서면 대광리 601-32020-02-252025-02-2223385993000
5서울지방국토관리청국도46호선 화도녹촌IC 병목지점 개선공사도로남양주시 화도읍 녹촌리 일원신설 교차로 연결로 2개소 및 접속도로 1식, Ramp 교량 3개소/455.0m국도 46호선(인천 ~ 고성)<NA>진지건설(주)경기도 남양주시 화도읍 녹촌로 105-10, 현장사무실2022-05-102025-04-2311305627000
6서울지방국토관리청국도48호선 한강시네폴리스 도로확장공사도로경기도 김포시도로확장 등국도 48호선(강화 ~ 서울)<NA>(주)부강건설전라북도 전주시 완산구 천잠로 559, 효자동3가 (효자동3가)2023-07-202025-08-220
7서울지방국토관리청도로인프라 국가성능시험장 구축사업도로연천군도로인프라 국가성능시험장 구축기타<NA>대신종합건설(주)경기도 연천군 연천읍 평화로 1709<NA><NA>0
8서울지방국토관리청성남-장호원 도로건설공사(6-1공구)도로이천시, 여주시도로 신설국도 3호선(남해 ~ 초산 (철원))경기도 이천시 부발읍 응암리 582-27~경기도 여주시 가남읍 은봉리 736고운시티아이(주)경기도 이천시 대월면 장평리 2372021-07-082027-06-0646555335000
9서울지방국토관리청성남-장호원(6-2공구) 도로건설공사도로이천시, 여주시국도 신설국도 3호선(남해 ~ 초산 (철원))여주시 가남읍 은봉리~이천시 장호원읍 풍계리강산건설(주)경기도 이천시 장호원읍 이풍로 2132022-06-132028-05-1185603187470
발주청공사명공사구분주요사업지사업개요노선명공사위치시공사현장주소시작일준공일도급액(원)
126부산지방국토관리청창원 양곡 양곡교차로 교차로개선사업도로창원시 성산구교량2개소국도 2호선(목포 ~ 부산)<NA>영진건설㈜전라남도 화순군 화순읍 진각로 144, 5층 404호<NA><NA>0
127부산지방국토관리청창원시관내 국도대체우회도로(귀곡-행암)건설공사도로경남 창원 귀곡실시설계 용역국도 2호선(목포 ~ 부산)창원시 성산구 양곡동~창원시 진해구 석동지에스건설(주)경상남도 창원시 진해구 조천북로 832013-06-032023-11-30145000000000
128부산지방국토관리청창원시관내 국도대체우회도로(제2안민터널) 건설공사도로창원L-3.85km기타창원시 진해구 석동~창원시 성산구 천선동DL이앤씨 주식회사경상남도 창원시 진해구 진해대로 943-12016-04-222024-03-31121000000000
129부산지방국토관리청청량-옥동 국도건설공사(WBS)도로울산 울주군 청량읍 문죽리~남구 옥동L=1.6km, B=20.0m(4차로), 교량 3개소, 교차로 2개소<NA>울산광역시 울주군 청량읍 문죽리~울산광역시 남구 옥동(주) 제일종합건설전라북도 익산시 익산대로 124 제일건설2018-11-192023-10-230
130부산지방국토관리청청량-옥동 국도건설공사도로울산 울주군 청량읍 문죽리~남구 옥동L=1.6km, B=20.0m(4차로), 교량 3개소, 교차로 2개소<NA>울산광역시 울주군 청량읍 문죽리~울산광역시 남구 옥동(주) 제일종합건설울산광역시 청량읍 문죽리 337-22018-11-192023-10-2332658121000
131부산지방국토관리청포항-안동1-1(1공구) 국도건설공사도로포항시포항-안동1-1(1공구)국도 31호선(부산 ~ 신고산)경상북도 포항시 북구 기계면 인비리 826-1~경상북도 포항시 북구 죽장면 일광리 산 114-2(주)영무토건경상북도 포항시 북구 기계면 구지리 132020-12-302027-11-23102000000000
132부산지방국토관리청포항-안동1-1(2공구) 국도건설공사도로포항, 청송포항-안동1-1(2공구)국도 31호선(부산 ~ 신고산)경상북도 포항시 북구 죽장면 일광리~경상북도 청송군 현동면 눌인리(주)케이알산업경상북도 포항시 북구 죽장면 죽장로 2012021-02-052027-12-3091511500000
133부산지방국토관리청포항-안동2(1공구) 국도건설공사도로경북 청송군 현서면 두현리 ∼ 경북 안동시 길안면 송사리L=9.5km, B=19.5m국도 35호선(부산 ~ 강릉)경상북도 청송군 현서면 두현리~경상북도 안동시 길안면 송사리현대아산 주식회사경상북도 안동시 길안면 송사리 1232019-01-292025-12-2272818000000
134부산지방국토관리청포항-안동2(2공구) 국도건설공사도로경북 안동시 길안면 송사리 ∼ 경북 안동시 길안면 천지리L=10.1km, B=19.5m국도 35호선(부산 ~ 강릉)경북 안동시 길안면 송사리 산2임~경북 안동시 길안면 천지리 798전(주) 태 영 건 설경상북도 안동시 길안면 만음리 7112019-02-112026-01-0464389302000
135부산지방국토관리청함안 군북-가야 국도건설공사도로경남 함안군 군북면 사도리L=8.30km, B=11.50m(2차로), 평면교차로 15개소<NA>경상남도 함안군 군북면 함마대로 454~경상남도 함안군 가야읍 함마대로 1445죽암건설주식회사전라남도 순천시 연향동 1437-22021-01-182026-01-1632503686000