Overview

Dataset statistics

Number of variables15
Number of observations60
Missing cells68
Missing cells (%)7.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.2 KiB
Average record size in memory122.2 B

Variable types

Unsupported3
Categorical7
Text5

Dataset

Description부산도시공사의 연간 발주계획입니다. 사업명, 발주금액, 담당부서 등에 대한 데이터가 있습니다. ※ 발주계획은 계획으로서 변경될 수 있습니다.
URLhttps://www.data.go.kr/data/15109850/fileData.do

Alerts

Unnamed: 1 is highly overall correlated with Unnamed: 2 and 5 other fieldsHigh correlation
Unnamed: 13 is highly overall correlated with Unnamed: 1 and 5 other fieldsHigh correlation
Unnamed: 2 is highly overall correlated with Unnamed: 1 and 5 other fieldsHigh correlation
Unnamed: 3 is highly overall correlated with Unnamed: 1 and 5 other fieldsHigh correlation
Unnamed: 5 is highly overall correlated with Unnamed: 1 and 5 other fieldsHigh correlation
Unnamed: 9 is highly overall correlated with Unnamed: 1 and 5 other fieldsHigh correlation
Unnamed: 10 is highly overall correlated with Unnamed: 1 and 5 other fieldsHigh correlation
Unnamed: 1 is highly imbalanced (77.3%)Imbalance
Unnamed: 2 is highly imbalanced (61.5%)Imbalance
2023년도 공사계약 발주 계획 has 1 (1.7%) missing valuesMissing
Unnamed: 4 has 2 (3.3%) missing valuesMissing
Unnamed: 6 has 2 (3.3%) missing valuesMissing
Unnamed: 7 has 2 (3.3%) missing valuesMissing
Unnamed: 8 has 2 (3.3%) missing valuesMissing
Unnamed: 11 has 2 (3.3%) missing valuesMissing
Unnamed: 12 has 2 (3.3%) missing valuesMissing
Unnamed: 14 has 55 (91.7%) missing valuesMissing
2023년도 공사계약 발주 계획 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 23:22:56.665142
Analysis finished2023-12-12 23:22:58.176405
Duration1.51 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

2023년도 공사계약 발주 계획
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1
Missing (%)1.7%
Memory size612.0 B

Unnamed: 1
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
공사
56 
<NA>
 
2
분 류
 
1
공사 (예시)
 
1

Length

Max length7
Median length2
Mean length2.1666667
Min length2

Unique

Unique2 ?
Unique (%)3.3%

Sample

1st row<NA>
2nd row<NA>
3rd row분 류
4th row공사 (예시)
5th row공사

Common Values

ValueCountFrequency (%)
공사 56
93.3%
<NA> 2
 
3.3%
분 류 1
 
1.7%
공사 (예시) 1
 
1.7%

Length

2023-12-13T08:22:58.259744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:22:58.393771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공사 57
91.9%
na 2
 
3.2%
1
 
1.6%
1
 
1.6%
예시 1
 
1.6%

Unnamed: 2
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Memory size612.0 B
자체조달
51 
<NA>
 
3
-
 
3
중앙조달
 
2
조달방식
 
1

Length

Max length4
Median length4
Mean length3.85
Min length1

Unique

Unique1 ?
Unique (%)1.7%

Sample

1st row<NA>
2nd row<NA>
3rd row조달방식
4th row자체조달
5th row<NA>

Common Values

ValueCountFrequency (%)
자체조달 51
85.0%
<NA> 3
 
5.0%
- 3
 
5.0%
중앙조달 2
 
3.3%
조달방식 1
 
1.7%

Length

2023-12-13T08:22:58.574920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:22:58.688585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자체조달 51
85.0%
na 3
 
5.0%
3
 
5.0%
중앙조달 2
 
3.3%
조달방식 1
 
1.7%

Unnamed: 3
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)11.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
제한경쟁
30 
수의계약
16 
제한경쟁 (지역제한)
민간참여 사업 (협약)
 
3
<NA>
 
2
Other values (2)
 
3

Length

Max length12
Median length4
Mean length5.1
Min length4

Unique

Unique1 ?
Unique (%)1.7%

Sample

1st row<NA>
2nd row<NA>
3rd row계약방법
4th row일반경쟁
5th row수의계약

Common Values

ValueCountFrequency (%)
제한경쟁 30
50.0%
수의계약 16
26.7%
제한경쟁 (지역제한) 6
 
10.0%
민간참여 사업 (협약) 3
 
5.0%
<NA> 2
 
3.3%
일반경쟁 2
 
3.3%
계약방법 1
 
1.7%

Length

2023-12-13T08:22:58.834331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:22:58.965592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제한경쟁 36
50.0%
수의계약 16
22.2%
지역제한 6
 
8.3%
민간참여 3
 
4.2%
사업 3
 
4.2%
협약 3
 
4.2%
na 2
 
2.8%
일반경쟁 2
 
2.8%
계약방법 1
 
1.4%

Unnamed: 4
Text

MISSING 

Distinct58
Distinct (%)100.0%
Missing2
Missing (%)3.3%
Memory size612.0 B
2023-12-13T08:22:59.306990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length27.5
Mean length23.5
Min length6

Characters and Unicode

Total characters1363
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)100.0%

Sample

1st row사 업 명
2nd row○○○○○ 정보통신공사
3rd row사옥 1층 로비 환경개선공사
4th row센텀2지구 도시첨단산업단지 조성공사
5th row범방마을 연결도로 개설공사
ValueCountFrequency (%)
2023년 12
 
4.9%
임대시설물 9
 
3.7%
부산 8
 
3.3%
2공구 8
 
3.3%
에코델타시티 7
 
2.9%
교체 6
 
2.5%
보수공사 5
 
2.1%
에코델타시티(2단계 5
 
2.1%
단가계약 5
 
2.1%
정보통신공사 5
 
2.1%
Other values (124) 173
71.2%
2023-12-13T08:22:59.836674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
182
 
13.4%
72
 
5.3%
52
 
3.8%
2 49
 
3.6%
34
 
2.5%
) 29
 
2.1%
( 29
 
2.1%
27
 
2.0%
23
 
1.7%
23
 
1.7%
Other values (172) 843
61.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 985
72.3%
Space Separator 182
 
13.4%
Decimal Number 95
 
7.0%
Close Punctuation 29
 
2.1%
Open Punctuation 29
 
2.1%
Other Punctuation 24
 
1.8%
Control 8
 
0.6%
Uppercase Letter 6
 
0.4%
Other Symbol 5
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
 
7.3%
52
 
5.3%
34
 
3.5%
27
 
2.7%
23
 
2.3%
23
 
2.3%
20
 
2.0%
20
 
2.0%
19
 
1.9%
18
 
1.8%
Other values (157) 677
68.7%
Decimal Number
ValueCountFrequency (%)
2 49
51.6%
3 16
 
16.8%
0 12
 
12.6%
1 11
 
11.6%
4 4
 
4.2%
5 3
 
3.2%
Other Punctuation
ValueCountFrequency (%)
, 13
54.2%
· 11
45.8%
Uppercase Letter
ValueCountFrequency (%)
L 3
50.0%
B 3
50.0%
Space Separator
ValueCountFrequency (%)
182
100.0%
Close Punctuation
ValueCountFrequency (%)
) 29
100.0%
Open Punctuation
ValueCountFrequency (%)
( 29
100.0%
Control
ValueCountFrequency (%)
8
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 985
72.3%
Common 372
 
27.3%
Latin 6
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
 
7.3%
52
 
5.3%
34
 
3.5%
27
 
2.7%
23
 
2.3%
23
 
2.3%
20
 
2.0%
20
 
2.0%
19
 
1.9%
18
 
1.8%
Other values (157) 677
68.7%
Common
ValueCountFrequency (%)
182
48.9%
2 49
 
13.2%
) 29
 
7.8%
( 29
 
7.8%
3 16
 
4.3%
, 13
 
3.5%
0 12
 
3.2%
· 11
 
3.0%
1 11
 
3.0%
8
 
2.2%
Other values (3) 12
 
3.2%
Latin
ValueCountFrequency (%)
L 3
50.0%
B 3
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 985
72.3%
ASCII 362
 
26.6%
None 11
 
0.8%
Geometric Shapes 5
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
182
50.3%
2 49
 
13.5%
) 29
 
8.0%
( 29
 
8.0%
3 16
 
4.4%
, 13
 
3.6%
0 12
 
3.3%
1 11
 
3.0%
8
 
2.2%
4 4
 
1.1%
Other values (3) 9
 
2.5%
Hangul
ValueCountFrequency (%)
72
 
7.3%
52
 
5.3%
34
 
3.5%
27
 
2.7%
23
 
2.3%
23
 
2.3%
20
 
2.0%
20
 
2.0%
19
 
1.9%
18
 
1.8%
Other values (157) 677
68.7%
None
ValueCountFrequency (%)
· 11
100.0%
Geometric Shapes
ValueCountFrequency (%)
5
100.0%

Unnamed: 5
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size612.0 B
전문
12 
통신
전기
건축
조경
Other values (9)
23 

Length

Max length8
Median length2
Mean length2.35
Min length2

Unique

Unique3 ?
Unique (%)5.0%

Sample

1st row<NA>
2nd row<NA>
3rd row공종
4th row통신
5th row건축

Common Values

ValueCountFrequency (%)
전문 12
20.0%
통신 7
11.7%
전기 7
11.7%
건축 6
10.0%
조경 5
8.3%
소방 5
8.3%
토목 4
 
6.7%
토건 4
 
6.7%
전공종 3
 
5.0%
<NA> 2
 
3.3%
Other values (4) 5
8.3%

Length

2023-12-13T08:23:00.005504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
전문 12
18.8%
건축 9
14.1%
통신 7
10.9%
전기 7
10.9%
조경 5
7.8%
소방 5
7.8%
토목 5
7.8%
토건 4
 
6.2%
전공종 3
 
4.7%
기계 3
 
4.7%
Other values (3) 4
 
6.2%

Unnamed: 6
Text

MISSING 

Distinct33
Distinct (%)56.9%
Missing2
Missing (%)3.3%
Memory size612.0 B
2023-12-13T08:23:00.192550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length19
Mean length18.051724
Min length4

Characters and Unicode

Total characters1047
Distinct characters17
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)36.2%

Sample

1st row사업기간
2nd row2023.01. ~ 2023.03.
3rd row2023.02. ~ 2023.03
4th row2023.12. ~ 2026.12.
5th row2023.07. ~ 2023.09.
ValueCountFrequency (%)
55
32.5%
2023.04 14
 
8.3%
2023.03 11
 
6.5%
2025.03 10
 
5.9%
2023.11 10
 
5.9%
2023.07 9
 
5.3%
2023.10 8
 
4.7%
2024.06 7
 
4.1%
2023.02 7
 
4.1%
2023.05 5
 
3.0%
Other values (17) 33
19.5%
2023-12-13T08:23:00.866626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 242
23.1%
. 211
20.2%
0 198
18.9%
111
10.6%
3 107
10.2%
~ 56
 
5.3%
1 44
 
4.2%
4 28
 
2.7%
5 17
 
1.6%
7 13
 
1.2%
Other values (7) 20
 
1.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 665
63.5%
Other Punctuation 211
 
20.2%
Space Separator 111
 
10.6%
Math Symbol 56
 
5.3%
Other Letter 4
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 242
36.4%
0 198
29.8%
3 107
16.1%
1 44
 
6.6%
4 28
 
4.2%
5 17
 
2.6%
7 13
 
2.0%
6 10
 
1.5%
8 3
 
0.5%
9 3
 
0.5%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 211
100.0%
Space Separator
ValueCountFrequency (%)
111
100.0%
Math Symbol
ValueCountFrequency (%)
~ 56
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1043
99.6%
Hangul 4
 
0.4%

Most frequent character per script

Common
ValueCountFrequency (%)
2 242
23.2%
. 211
20.2%
0 198
19.0%
111
10.6%
3 107
10.3%
~ 56
 
5.4%
1 44
 
4.2%
4 28
 
2.7%
5 17
 
1.6%
7 13
 
1.2%
Other values (3) 16
 
1.5%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1043
99.6%
Hangul 4
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 242
23.2%
. 211
20.2%
0 198
19.0%
111
10.6%
3 107
10.3%
~ 56
 
5.4%
1 44
 
4.2%
4 28
 
2.7%
5 17
 
1.6%
7 13
 
1.2%
Other values (3) 16
 
1.5%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)3.3%
Memory size612.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)3.3%
Memory size612.0 B

Unnamed: 9
Categorical

HIGH CORRELATION 

Distinct15
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size612.0 B
시설관리처 시설관리2부
15 
복지사업처 아르피나사업부
13 
토목사업처 조경사업부
주택사업처 주택사업1부
기전사업처 전기사업부
Other values (10)
19 

Length

Max length13
Median length12.5
Mean length11.65
Min length4

Unique

Unique5 ?
Unique (%)8.3%

Sample

1st row<NA>
2nd row<NA>
3rd row발주부서
4th row재무관리처 계약부
5th row경영지원실 사옥TF팀

Common Values

ValueCountFrequency (%)
시설관리처 시설관리2부 15
25.0%
복지사업처 아르피나사업부 13
21.7%
토목사업처 조경사업부 5
 
8.3%
주택사업처 주택사업1부 4
 
6.7%
기전사업처 전기사업부 4
 
6.7%
기전사업처 스마트사업부 4
 
6.7%
시설관리처 시설관리1부 4
 
6.7%
<NA> 2
 
3.3%
토목사업처 단지사업1부 2
 
3.3%
맞춤임대처 맞춤임대관리부 2
 
3.3%
Other values (5) 5
 
8.3%

Length

2023-12-13T08:23:01.140165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
시설관리처 19
16.2%
시설관리2부 15
12.8%
복지사업처 13
11.1%
아르피나사업부 13
11.1%
토목사업처 8
 
6.8%
기전사업처 8
 
6.8%
조경사업부 5
 
4.3%
주택사업처 4
 
3.4%
주택사업1부 4
 
3.4%
전기사업부 4
 
3.4%
Other values (14) 24
20.5%

Unnamed: 10
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)43.3%
Missing0
Missing (%)0.0%
Memory size612.0 B
박철홍
13 
김우종
기웅석
김성주
임영훈
Other values (21)
28 

Length

Max length11
Median length3
Mean length3.3833333
Min length3

Unique

Unique15 ?
Unique (%)25.0%

Sample

1st row<NA>
2nd row<NA>
3rd row담당자
4th row김민우
5th row최흥식

Common Values

ValueCountFrequency (%)
박철홍 13
21.7%
김우종 7
11.7%
기웅석 5
 
8.3%
김성주 4
 
6.7%
임영훈 3
 
5.0%
김학준 3
 
5.0%
이동헌 2
 
3.3%
허재영,기웅석,송호림 2
 
3.3%
최수종 2
 
3.3%
박일룡 2
 
3.3%
Other values (16) 17
28.3%

Length

2023-12-13T08:23:01.457256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
박철홍 13
21.3%
김우종 7
 
11.5%
기웅석 5
 
8.2%
김성주 4
 
6.6%
임영훈 3
 
4.9%
김학준 3
 
4.9%
최수종 2
 
3.3%
na 2
 
3.3%
박일룡 2
 
3.3%
허재영,기웅석,송호림 2
 
3.3%
Other values (17) 18
29.5%

Unnamed: 11
Text

MISSING 

Distinct37
Distinct (%)63.8%
Missing2
Missing (%)3.3%
Memory size612.0 B
2023-12-13T08:23:01.755688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length12
Mean length12.086207
Min length4

Characters and Unicode

Total characters701
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)48.3%

Sample

1st row전화번호
2nd row051-810-1285
3rd row051-810-8596
4th row051-810-1492
5th row051-810-1496
ValueCountFrequency (%)
051-810-1339 7
 
11.9%
051-810-1394 5
 
8.5%
051-810-1354 4
 
6.8%
051-810-1469 3
 
5.1%
051-810-1427 3
 
5.1%
051-810-1396 3
 
5.1%
051-810-1375 2
 
3.4%
051-810-8564 2
 
3.4%
051-810-1336 2
 
3.4%
051-740-3293 1
 
1.7%
Other values (27) 27
45.8%
2023-12-13T08:23:02.169858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 146
20.8%
0 123
17.5%
- 116
16.5%
5 70
10.0%
3 65
9.3%
8 51
 
7.3%
4 38
 
5.4%
9 31
 
4.4%
7 22
 
3.1%
2 18
 
2.6%
Other values (6) 21
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 580
82.7%
Dash Punctuation 116
 
16.5%
Other Letter 4
 
0.6%
Control 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 146
25.2%
0 123
21.2%
5 70
12.1%
3 65
11.2%
8 51
 
8.8%
4 38
 
6.6%
9 31
 
5.3%
7 22
 
3.8%
2 18
 
3.1%
6 16
 
2.8%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 116
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 697
99.4%
Hangul 4
 
0.6%

Most frequent character per script

Common
ValueCountFrequency (%)
1 146
20.9%
0 123
17.6%
- 116
16.6%
5 70
10.0%
3 65
9.3%
8 51
 
7.3%
4 38
 
5.5%
9 31
 
4.4%
7 22
 
3.2%
2 18
 
2.6%
Other values (2) 17
 
2.4%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 697
99.4%
Hangul 4
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 146
20.9%
0 123
17.6%
- 116
16.6%
5 70
10.0%
3 65
9.3%
8 51
 
7.3%
4 38
 
5.5%
9 31
 
4.4%
7 22
 
3.2%
2 18
 
2.6%
Other values (2) 17
 
2.4%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 12
Text

MISSING 

Distinct43
Distinct (%)74.1%
Missing2
Missing (%)3.3%
Memory size612.0 B
2023-12-13T08:23:02.524743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length77
Median length32.5
Mean length20.637931
Min length7

Characters and Unicode

Total characters1197
Distinct characters179
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)60.3%

Sample

1st row발주물량 또는 규모
2nd row(예시) 복도 창호 7개설치 전기공사 1식 공원 등 150본 / 6개소 그린리모델링 대상세대(110세대)
3rd row사옥1층 로비 환경개선 1식
4th rowA=275천㎡(1단계) 단지조성공사 1식
5th row범방마을 연결도로 개설공사
ValueCountFrequency (%)
1식 32
 
13.7%
보수 10
 
4.3%
설비공사 8
 
3.4%
임대아파트 7
 
3.0%
7
 
3.0%
교체 6
 
2.6%
전기공사 5
 
2.1%
그린리모델링 4
 
1.7%
대상세대(110세대 4
 
1.7%
건축·토목분야 4
 
1.7%
Other values (118) 147
62.8%
2023-12-13T08:23:03.002030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
169
 
14.1%
1 72
 
6.0%
36
 
3.0%
35
 
2.9%
35
 
2.9%
27
 
2.3%
22
 
1.8%
( 21
 
1.8%
21
 
1.8%
) 21
 
1.8%
Other values (169) 738
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 734
61.3%
Space Separator 169
 
14.1%
Decimal Number 167
 
14.0%
Other Punctuation 34
 
2.8%
Open Punctuation 21
 
1.8%
Close Punctuation 21
 
1.8%
Math Symbol 15
 
1.3%
Uppercase Letter 13
 
1.1%
Control 9
 
0.8%
Other Symbol 8
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
4.9%
35
 
4.8%
35
 
4.8%
27
 
3.7%
22
 
3.0%
21
 
2.9%
19
 
2.6%
18
 
2.5%
18
 
2.5%
17
 
2.3%
Other values (142) 486
66.2%
Decimal Number
ValueCountFrequency (%)
1 72
43.1%
2 19
 
11.4%
0 17
 
10.2%
4 13
 
7.8%
7 11
 
6.6%
5 10
 
6.0%
6 9
 
5.4%
8 7
 
4.2%
9 5
 
3.0%
3 4
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
B 3
23.1%
L 3
23.1%
A 3
23.1%
C 2
15.4%
T 1
 
7.7%
V 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 21
61.8%
/ 9
26.5%
· 4
 
11.8%
Math Symbol
ValueCountFrequency (%)
= 9
60.0%
~ 6
40.0%
Space Separator
ValueCountFrequency (%)
169
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Control
ValueCountFrequency (%)
9
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Lowercase Letter
ValueCountFrequency (%)
m 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 734
61.3%
Common 444
37.1%
Latin 19
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
4.9%
35
 
4.8%
35
 
4.8%
27
 
3.7%
22
 
3.0%
21
 
2.9%
19
 
2.6%
18
 
2.5%
18
 
2.5%
17
 
2.3%
Other values (142) 486
66.2%
Common
ValueCountFrequency (%)
169
38.1%
1 72
16.2%
( 21
 
4.7%
) 21
 
4.7%
, 21
 
4.7%
2 19
 
4.3%
0 17
 
3.8%
4 13
 
2.9%
7 11
 
2.5%
5 10
 
2.3%
Other values (10) 70
15.8%
Latin
ValueCountFrequency (%)
m 6
31.6%
B 3
15.8%
L 3
15.8%
A 3
15.8%
C 2
 
10.5%
T 1
 
5.3%
V 1
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 734
61.3%
ASCII 451
37.7%
CJK Compat 8
 
0.7%
None 4
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
169
37.5%
1 72
16.0%
( 21
 
4.7%
) 21
 
4.7%
, 21
 
4.7%
2 19
 
4.2%
0 17
 
3.8%
4 13
 
2.9%
7 11
 
2.4%
5 10
 
2.2%
Other values (15) 77
17.1%
Hangul
ValueCountFrequency (%)
36
 
4.9%
35
 
4.8%
35
 
4.8%
27
 
3.7%
22
 
3.0%
21
 
2.9%
19
 
2.6%
18
 
2.5%
18
 
2.5%
17
 
2.3%
Other values (142) 486
66.2%
CJK Compat
ValueCountFrequency (%)
8
100.0%
None
ValueCountFrequency (%)
· 4
100.0%

Unnamed: 13
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)11.7%
Missing0
Missing (%)0.0%
Memory size612.0 B
-
26 
<NA>
16 
추정가격 2천만원 이하 소액공사 등
14 
수의계약 사유
 
1
(예시) 추정가격 2천만원 이하 소액공사 등
 
1
Other values (2)
 
2

Length

Max length24
Median length19
Mean length7
Min length1

Unique

Unique4 ?
Unique (%)6.7%

Sample

1st row<NA>
2nd row<NA>
3rd row수의계약 사유
4th row(예시) 추정가격 2천만원 이하 소액공사 등
5th row추정가격 2천만원 이하 소액공사 등

Common Values

ValueCountFrequency (%)
- 26
43.3%
<NA> 16
26.7%
추정가격 2천만원 이하 소액공사 등 14
23.3%
수의계약 사유 1
 
1.7%
(예시) 추정가격 2천만원 이하 소액공사 등 1
 
1.7%
추정가격 1억원 미만 소액공사 1
 
1.7%
추정가격 2천만원 이하 소액공사 1
 
1.7%

Length

2023-12-13T08:23:03.171243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:23:03.273120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
26
20.3%
추정가격 17
13.3%
소액공사 17
13.3%
na 16
12.5%
2천만원 16
12.5%
이하 16
12.5%
15
11.7%
수의계약 1
 
0.8%
사유 1
 
0.8%
예시 1
 
0.8%
Other values (2) 2
 
1.6%

Unnamed: 14
Text

MISSING 

Distinct3
Distinct (%)60.0%
Missing55
Missing (%)91.7%
Memory size612.0 B
2023-12-13T08:23:03.451076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length31
Mean length20.6
Min length2

Characters and Unicode

Total characters103
Distinct characters30
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)40.0%

Sample

1st row(단위: 천원)
2nd row비고
3rd row사업기간 월까지는 정해지지 않음. - 발주금액 총사업비
4th row사업기간 월까지는 정해지지 않음. - 발주금액 총사업비
5th row사업기간 월까지는 정해지지 않음. - 발주금액 총사업비
ValueCountFrequency (%)
사업기간 3
12.5%
월까지는 3
12.5%
정해지지 3
12.5%
않음 3
12.5%
3
12.5%
발주금액 3
12.5%
총사업비 3
12.5%
단위 1
 
4.2%
천원 1
 
4.2%
비고 1
 
4.2%
2023-12-13T08:23:03.756929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19
18.4%
9
 
8.7%
6
 
5.8%
6
 
5.8%
4
 
3.9%
. 3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
3
 
2.9%
Other values (20) 44
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 72
69.9%
Space Separator 19
 
18.4%
Other Punctuation 4
 
3.9%
Control 3
 
2.9%
Dash Punctuation 3
 
2.9%
Open Punctuation 1
 
1.0%
Close Punctuation 1
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
12.5%
6
 
8.3%
6
 
8.3%
4
 
5.6%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
Other values (13) 29
40.3%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
: 1
 
25.0%
Space Separator
ValueCountFrequency (%)
19
100.0%
Control
ValueCountFrequency (%)
3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 72
69.9%
Common 31
30.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
12.5%
6
 
8.3%
6
 
8.3%
4
 
5.6%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
Other values (13) 29
40.3%
Common
ValueCountFrequency (%)
19
61.3%
. 3
 
9.7%
3
 
9.7%
- 3
 
9.7%
( 1
 
3.2%
: 1
 
3.2%
) 1
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 72
69.9%
ASCII 31
30.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
19
61.3%
. 3
 
9.7%
3
 
9.7%
- 3
 
9.7%
( 1
 
3.2%
: 1
 
3.2%
) 1
 
3.2%
Hangul
ValueCountFrequency (%)
9
 
12.5%
6
 
8.3%
6
 
8.3%
4
 
5.6%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
Other values (13) 29
40.3%

Correlations

2023-12-13T08:23:03.883196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14
Unnamed: 11.0000.6610.9851.0000.8101.0001.0001.0001.0001.0001.0000.000
Unnamed: 20.6611.0000.9151.0000.9380.9430.9201.0001.0001.0000.6950.000
Unnamed: 30.9850.9151.0001.0000.9471.0000.9731.0001.0001.0000.8470.000
Unnamed: 41.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Unnamed: 50.8100.9380.9471.0001.0000.9420.9150.9810.9750.9930.9300.000
Unnamed: 61.0000.9431.0001.0000.9421.0000.9810.9730.9900.9950.9740.000
Unnamed: 91.0000.9200.9731.0000.9150.9811.0001.0001.0000.9950.9290.000
Unnamed: 101.0001.0001.0001.0000.9810.9731.0001.0001.0000.9881.0000.000
Unnamed: 111.0001.0001.0001.0000.9750.9901.0001.0001.0000.9961.0000.000
Unnamed: 121.0001.0001.0001.0000.9930.9950.9950.9880.9961.0001.0001.000
Unnamed: 131.0000.6950.8471.0000.9300.9740.9291.0001.0001.0001.0000.000
Unnamed: 140.0000.0000.0001.0000.0000.0000.0000.0000.0001.0000.0001.000
2023-12-13T08:23:04.391894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 5Unnamed: 2Unnamed: 10Unnamed: 9Unnamed: 1Unnamed: 13Unnamed: 3
Unnamed: 51.0000.7810.7310.6390.6050.5810.779
Unnamed: 20.7811.0000.7890.7440.6810.5110.788
Unnamed: 100.7310.7891.0000.8660.7750.8430.797
Unnamed: 90.6390.7440.8661.0000.8940.7750.842
Unnamed: 10.6050.6810.7750.8941.0000.9630.826
Unnamed: 130.5810.5110.8430.7750.9631.0000.745
Unnamed: 30.7790.7880.7970.8420.8260.7451.000
2023-12-13T08:23:04.516289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 5Unnamed: 9Unnamed: 10Unnamed: 13
Unnamed: 11.0000.6810.8260.6050.8940.7750.963
Unnamed: 20.6811.0000.7880.7810.7440.7890.511
Unnamed: 30.8260.7881.0000.7790.8420.7970.745
Unnamed: 50.6050.7810.7791.0000.6390.7310.581
Unnamed: 90.8940.7440.8420.6391.0000.8660.775
Unnamed: 100.7750.7890.7970.7310.8661.0000.843
Unnamed: 130.9630.5110.7450.5810.7750.8431.000

Missing values

2023-12-13T08:22:57.511619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:22:57.762150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T08:22:57.978720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

2023년도 공사계약 발주 계획Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14
0○ 대상기간 : 2023년 1월~12월<NA><NA><NA><NA><NA><NA>NaNNaN<NA><NA><NA><NA><NA>(단위: 천원)
1※ 작성시 유의사항\n* 사업기간 등 날짜형식, 금액단위 등 준수요망 / 발주물량 또는 규모에 쉼표(반점) 사용 금지\n* 발주금액의 경우 2023년도 예산반영액이 아닌 총 계약금액 기준\n* 공사비의 경우 도급금액으로 작성(지급자재비 제외)\n* 2023년도 예산서에 반영된 신규계약건은 누락없이 작성 요망\n* 조달방식 : 자체조달 / 중앙조달\n* 계약방법 : 일반경쟁 / 제한경쟁 / 지명경쟁 / PQ / 수의계약 등\n* 공종 : 규정된 공종을 선택(토목,건축,토건,조경,전문,전기,통신,소방,기타)\n* 계약방법이 "수의계약"일 경우 사유 기재\n ※ 향후 2023년도 발주계획 나라장터 입력 자료로도 활용할 예정임(별도 확인 공지 12월말 예정)<NA><NA><NA><NA><NA><NA>NaNNaN<NA><NA><NA><NA><NA><NA>
2번호분 류조달방식계약방법사 업 명공종사업기간발주시기발주금액(천원)발주부서담당자전화번호발주물량 또는 규모수의계약 사유비고
3NaN공사 (예시)자체조달일반경쟁○○○○○ 정보통신공사통신2023.01. ~ 2023.03.2023.01.200000재무관리처 계약부김민우051-810-1285(예시) 복도 창호 7개설치 전기공사 1식 공원 등 150본 / 6개소 그린리모델링 대상세대(110세대)(예시) 추정가격 2천만원 이하 소액공사 등<NA>
41공사<NA>수의계약사옥 1층 로비 환경개선공사건축2023.02. ~ 2023.032023.01200000경영지원실 사옥TF팀최흥식051-810-8596사옥1층 로비 환경개선 1식추정가격 2천만원 이하 소액공사 등<NA>
52공사자체조달일반경쟁센텀2지구 도시첨단산업단지 조성공사토목2023.12. ~ 2026.12.2023.11.36200000토목사업처 단지사업1부박현석051-810-1492A=275천㎡(1단계) 단지조성공사 1식-<NA>
63공사자체조달수의계약범방마을 연결도로 개설공사토목2023.07. ~ 2023.09.2023.06.56475토목사업처 단지사업1부손지현051-810-1496범방마을 연결도로 개설공사추정가격 1억원 미만 소액공사<NA>
74공사자체조달제한경쟁 (지역제한)오시리아관광단지내 테마파크·역사 주변 도로확장 및 오수관로 보수 공사토목2023.03. ~ 2024.06.2023.03.3482000토목사업처 단지사업2부김태수051-810-1402도로(B=m, L=110m), 교량1개소(B=12m, L=47m) 신설, 기존 도로확장(B=3m, L=466m 4개소), 오수관로 보수 등<NA><NA>
85공사자체조달제한경쟁 (지역제한)오시리아 공원녹지 유지관리공사조경2023.03. ~ 2023.12.2023.03.1000000토목사업처 조경사업부곽일환051-810-1376오시리아관광단지 내 문화공원, 경관녹지 등 식생유지관리 1식<NA><NA>
96공사자체조달제한경쟁 (지역제한)오시리아 워터프론트파크(백사장 구간) 조경공사조경2023.09. ~ 2024.04.2023.07.1000000토목사업처 조경사업부오시훈051-810-1373문화공원 1개소(A=5,000㎡)<NA><NA>
2023년도 공사계약 발주 계획Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14
5047공사자체조달수의계약물탱크실 급수펌프(공용부, 사우나) 교체전문2023.03. ~ 2023.03.2023.03.10800복지사업처 아르피나사업부박철홍051-740-3295급수펌프 9개 교체 설비공사 1식추정가격 2천만원 이하 소액공사 등<NA>
5148공사자체조달수의계약수목방재 등전문2023.06. ~ 2023.06.2023.06.1700복지사업처 아르피나사업부박철홍051-740-3296수목방재 1회 1식추정가격 2천만원 이하 소액공사 등<NA>
5249공사자체조달수의계약2층 골프장 수평벨트라인 수선전문2023.10. ~ 2023.10.2023.10.20000복지사업처 아르피나사업부박철홍051-740-3297수편라인밸트 교체 설비공사 1식추정가격 2천만원 이하 소액공사 등<NA>
5350공사자체조달수의계약객실내부 조명등 교체전문2023.03. ~ 2023.03.2023.03.3000복지사업처 아르피나사업부박철홍051-740-3298객실내부조명등교체 100실 설비공사 1식추정가격 2천만원 이하 소액공사 등<NA>
5451공사자체조달수의계약소방 작동에 따른 소방시설 보수소방2023.11. ~ 2023.11.2023.11.10000복지사업처 아르피나사업부박철홍051-740-3299소방공사 1식추정가격 2천만원 이하 소액공사 등<NA>
5552공사자체조달수의계약골프장 페인트 도색(2,3,4층)전문2023.02. ~ 2023.02.2023.02.10000복지사업처 아르피나사업부박철홍051-740-3300폐인트도색 3개층 시설공사 1식추정가격 2천만원 이하 소액공사 등<NA>
5653공사자체조달수의계약공조기 액츄에이터 교체전문2023.10. ~ 2023.10.2023.10.2000복지사업처 아르피나사업부박철홍051-740-3301공조기 엑츄에어터 2기 교체 설비공사1식추정가격 2천만원 이하 소액공사 등<NA>
5754공사자체조달수의계약급탕탱크(스팀용) 액츄에이터 교체전문2023.01. ~ 2023.14.2023.01.5000복지사업처 아르피나사업부박철홍051-740-3302급탕탱크용 1기교체 설비공사 1식추정가격 2천만원 이하 소액공사 등<NA>
5855공사자체조달수의계약공조기 급배기 댐퍼 모터교체전문2023.01. ~ 2023.15.2023.01.1000복지사업처 아르피나사업부박철홍051-740-3303모터교체 1기 설비공사 1식추정가격 2천만원 이하 소액공사 등<NA>
5956공사자체조달수의계약각종 노후시설 보수비 등전문2023.04. ~ 2023.7.2023.01.20000복지사업처 아르피나사업부박철홍051-740-3304노후시설보수 4회 시설공사 1식추정가격 2천만원 이하 소액공사 등<NA>