Overview

Dataset statistics

Number of variables: 13
Number of observations: 209
Missing cells: 41
Missing cells (%): 1.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 21.4 KiB
Average record size in memory: 104.6 B
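
The missing-cell percentage can be checked from the other figures in this table. The sketch below assumes the percentage is taken over every cell (variables × observations); the variable names are illustrative only.

```python
# Figures copied from the Dataset statistics table.
n_variables = 13
n_observations = 209
missing_cells = 41

# Assumption: the percentage is computed over all cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

With 2,717 cells in total, 41 missing cells round to the reported 1.5%.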

Variable types

Categorical: 7
Text: 6

Dataset

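Description: Provides data on the construction procurement plan status of 전북특별자치도 장수군 (Jangsu-gun, Jeonbuk Special Self-Governing Province), covering 구분 (category), 사업명 (project name), 발주시기 (procurement timing), 공종 (type of work), 사업비총계 (total project cost), 도급액 (contract amount), 관급자재대 (government-supplied materials cost), 기타 (other), 금차도급액 (contract amount for the current round), 국고보조금 (national subsidy), 부서명 (department), 담당자 (person in charge), and 전화번호 (phone number).
Author: 전북특별자치도 장수군 (Jangsu-gun, Jeonbuk Special Self-Governing Province)
URL: https://www.data.go.kr/data/15099370/fileData.do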

Alerts

구분 is highly imbalanced (95.6%) [Imbalance]
기타 is highly imbalanced (65.1%) [Imbalance]
국고보조금(백만원) is highly imbalanced (62.7%) [Imbalance]
도급액 has 4 (1.9%) missing values [Missing]
금차도급액(백만원) has 37 (17.7%) missing values [Missing]
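
The imbalance percentages in these alerts can be reproduced from the per-variable value counts listed later in the report. The sketch below assumes the score is 1 minus the Shannon entropy of the category counts, normalised by the logarithm of the number of categories. The report does not state its formula, so this is an assumption that happens to match the three reported figures; the function name is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the entropy of the counts and k their number."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single category has no entropy to normalise
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Value counts read from the Common Values tables of each variable.
gubun = [208, 1]                          # 구분: 2 distinct values
gita = [155, 36, 4, 2, 2] + [1] * 10      # 기타: 15 distinct values
subsidy = [149, 31, 3, 2] + [1] * 24      # 국고보조금(백만원): 28 distinct values

for name, counts in [("구분", gubun), ("기타", gita), ("국고보조금(백만원)", subsidy)]:
    print(f"{name}: {imbalance_score(counts):.1%}")
```

A score near 100% means one value dominates almost completely, as with 구분, where 208 of 209 rows share one label.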

Reproduction

Analysis started: 2024-04-06 08:26:56.929007
Analysis finished: 2024-04-06 08:27:00.383828
Duration: 3.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
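
The reported duration follows directly from the two timestamps. A minimal check, assuming both timestamps share the same clock and time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the Reproduction block.
started = datetime.strptime("2024-04-06 08:26:56.929007", FMT)
finished = datetime.strptime("2024-04-06 08:27:00.383828", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```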

Variables

구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
공사: 208
공사 (second, 3-character spelling; the extra character does not render): 1

Length

Max length3
Median length2
Mean length2.0047847
Min length2
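
The mean length is consistent with 208 values of length 2 and a single value of length 3. That split is an inference from the value counts and the min/max lengths, not something the report states directly.

```python
# Assumed distribution: string length -> number of rows with that length.
length_counts = {2: 208, 3: 1}

n_rows = sum(length_counts.values())
mean_length = sum(length * count for length, count in length_counts.items()) / n_rows

print(n_rows, mean_length)
```

This suggests the second distinct value is the same label with one extra character, most likely stray whitespace.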

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row공사
2nd row공사
3rd row공사
4th row공사
5th row공사

Common Values

ValueCountFrequency (%)
공사 208
99.5%
공사 (second, 3-character spelling; the extra character does not render) 1
0.5%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
공사 209
100.0%

사 업 명
Text

Distinct198
Distinct (%)94.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB

Length

Max length31
Median length22
Mean length15.157895
Min length5

Characters and Unicode

Total characters3168
Distinct characters324
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
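
As an illustration of these character properties, the sketch below classifies the characters of one sample value with Python's standard `unicodedata` module. The two-letter codes it returns correspond to the category names used in this report (`Lo` = Other Letter, `Nd` = Decimal Number, `Ps`/`Pe` = Open/Close Punctuation, `Pd` = Dash Punctuation, `Zs` = Space Separator, `Sm` = Math Symbol). This is not necessarily how the profiler computes its tables.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# Second sample row of the project-name column.
sample = "장수교촌로(3-4호) 개설공사"
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)
```

Hangul syllables fall under Other Letter, which is why that category dominates the project-name column.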

Unique

Unique194 ?
Unique (%)92.8%

Sample

1st row장수레드푸드 융복합센터 진입도로(군도12호) 개설공사
2nd row장수교촌로(3-4호) 개설공사
3rd row의암송, 봉덕리느티나무 보호사업
4th row전북형 도시재생 뉴딜사업 스마트 골목길 정비사업
5th row2023년 상수도시설 긴급보수공사(단가계약)
ValueCountFrequency (%)
2023년 21
 
3.5%
17
 
2.8%
조성사업 17
 
2.8%
설치 14
 
2.3%
공사 12
 
2.0%
보수정비 10
 
1.7%
신축공사 9
 
1.5%
장수군 9
 
1.5%
정비사업 7
 
1.2%
소규모 7
 
1.2%
Other values (351) 481
79.6%

Most occurring characters

ValueCountFrequency (%)
397
 
12.5%
169
 
5.3%
108
 
3.4%
102
 
3.2%
91
 
2.9%
88
 
2.8%
56
 
1.8%
55
 
1.7%
54
 
1.7%
2 47
 
1.5%
Other values (314) 2001
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2577
81.3%
Space Separator 397
 
12.5%
Decimal Number 102
 
3.2%
Open Punctuation 39
 
1.2%
Close Punctuation 39
 
1.2%
Other Punctuation 9
 
0.3%
Math Symbol 3
 
0.1%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
169
 
6.6%
108
 
4.2%
102
 
4.0%
91
 
3.5%
88
 
3.4%
56
 
2.2%
55
 
2.1%
54
 
2.1%
46
 
1.8%
41
 
1.6%
Other values (297) 1767
68.6%
Decimal Number
ValueCountFrequency (%)
2 47
46.1%
3 24
23.5%
0 22
21.6%
1 3
 
2.9%
4 3
 
2.9%
6 1
 
1.0%
8 1
 
1.0%
5 1
 
1.0%
Open Punctuation
ValueCountFrequency (%)
( 33
84.6%
[ 6
 
15.4%
Close Punctuation
ValueCountFrequency (%)
) 33
84.6%
] 6
 
15.4%
Other Punctuation
ValueCountFrequency (%)
, 8
88.9%
. 1
 
11.1%
Space Separator
ValueCountFrequency (%)
397
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2577
81.3%
Common 591
 
18.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
169
 
6.6%
108
 
4.2%
102
 
4.0%
91
 
3.5%
88
 
3.4%
56
 
2.2%
55
 
2.1%
54
 
2.1%
46
 
1.8%
41
 
1.6%
Other values (297) 1767
68.6%
Common
ValueCountFrequency (%)
397
67.2%
2 47
 
8.0%
( 33
 
5.6%
) 33
 
5.6%
3 24
 
4.1%
0 22
 
3.7%
, 8
 
1.4%
] 6
 
1.0%
[ 6
 
1.0%
~ 3
 
0.5%
Other values (7) 12
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2577
81.3%
ASCII 591
 
18.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
397
67.2%
2 47
 
8.0%
( 33
 
5.6%
) 33
 
5.6%
3 24
 
4.1%
0 22
 
3.7%
, 8
 
1.4%
] 6
 
1.0%
[ 6
 
1.0%
~ 3
 
0.5%
Other values (7) 12
 
2.0%
Hangul
ValueCountFrequency (%)
169
 
6.6%
108
 
4.2%
102
 
4.0%
91
 
3.5%
88
 
3.4%
56
 
2.2%
55
 
2.1%
54
 
2.1%
46
 
1.8%
41
 
1.6%
Other values (297) 1767
68.6%

발주시기연월
Categorical

Distinct10
Distinct (%)4.8%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2024-03-23
87 
2024-04-23
48 
2024-02-23
24 
2024-05-23
17 
2024-01-23
11 
Other values (5)
22 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row2024-01-23
2nd row2024-01-23
3rd row2024-01-23
4th row2024-01-23
5th row2024-01-23

Common Values

ValueCountFrequency (%)
2024-03-23 87
41.6%
2024-04-23 48
23.0%
2024-02-23 24
 
11.5%
2024-05-23 17
 
8.1%
2024-01-23 11
 
5.3%
2024-06-23 11
 
5.3%
2024-12-23 4
 
1.9%
2024-07-23 3
 
1.4%
2024-09-23 3
 
1.4%
2024-08-23 1
 
0.5%
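
Because this column is profiled as categorical text, any date arithmetic needs an explicit parse first. The sketch below parses the values from the Common Values table and aggregates them by month. Every value carries day 23, which may be an artefact of storing a year-month field as a full date; that reading is an assumption.

```python
from collections import Counter
from datetime import datetime

# Value -> count, copied from the Common Values table.
value_counts = {
    "2024-03-23": 87, "2024-04-23": 48, "2024-02-23": 24, "2024-05-23": 17,
    "2024-01-23": 11, "2024-06-23": 11, "2024-12-23": 4, "2024-07-23": 3,
    "2024-09-23": 3, "2024-08-23": 1,
}

by_month = Counter()
days_seen = set()
for text, count in value_counts.items():
    parsed = datetime.strptime(text, "%Y-%m-%d").date()
    by_month[(parsed.year, parsed.month)] += count
    days_seen.add(parsed.day)

print(by_month.most_common(3))
print(days_seen)
```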

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2024-03-23 87
41.6%
2024-04-23 48
23.0%
2024-02-23 24
 
11.5%
2024-05-23 17
 
8.1%
2024-01-23 11
 
5.3%
2024-06-23 11
 
5.3%
2024-12-23 4
 
1.9%
2024-07-23 3
 
1.4%
2024-09-23 3
 
1.4%
2024-08-23 1
 
0.5%

공종(토건_토목_건축_전문_전기_통신_소방_기타 중 택1)
Categorical

Distinct11
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
토목
79 
기타
37 
전문
26 
건축
24 
전기
13 
Other values (6)
30 

Length

Max length7
Median length2
Mean length2.0956938
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row토목
2nd row토목
3rd row전문
4th row토목
5th row전문(상수도)

Common Values

ValueCountFrequency (%)
토목 79
37.8%
기타 37
17.7%
전문 26
 
12.4%
건축 24
 
11.5%
전기 13
 
6.2%
산림 9
 
4.3%
소방 5
 
2.4%
통신 5
 
2.4%
전문(상수도) 4
 
1.9%
토건 4
 
1.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
토목 79
37.8%
기타 37
17.7%
전문 26
 
12.4%
건축 24
 
11.5%
전기 13
 
6.2%
산림 9
 
4.3%
소방 5
 
2.4%
통신 5
 
2.4%
전문(상수도 4
 
1.9%
토건 4
 
1.9%

사업비총계(백만원)
Text

Distinct108
Distinct (%)51.7%
Missing0
Missing (%)0.0%
Memory size1.8 KiB

Length

Max length6
Median length5
Mean length2.7703349
Min length1

Characters and Unicode

Total characters579
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)36.4%

Sample

1st row750
2nd row500
3rd row18
4th row400
5th row900
ValueCountFrequency (%)
20 14
 
6.7%
10 9
 
4.3%
26 9
 
4.3%
300 9
 
4.3%
500 7
 
3.3%
100 7
 
3.3%
30 6
 
2.9%
200 6
 
2.9%
400 5
 
2.4%
9 4
 
1.9%
Other values (98) 133
63.6%

Most occurring characters

ValueCountFrequency (%)
0 196
33.9%
1 75
 
13.0%
2 67
 
11.6%
5 49
 
8.5%
3 41
 
7.1%
6 26
 
4.5%
4 26
 
4.5%
, 26
 
4.5%
9 25
 
4.3%
8 25
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 553
95.5%
Other Punctuation 26
 
4.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 196
35.4%
1 75
 
13.6%
2 67
 
12.1%
5 49
 
8.9%
3 41
 
7.4%
6 26
 
4.7%
4 26
 
4.7%
9 25
 
4.5%
8 25
 
4.5%
7 23
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 579
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 196
33.9%
1 75
 
13.0%
2 67
 
11.6%
5 49
 
8.5%
3 41
 
7.1%
6 26
 
4.5%
4 26
 
4.5%
, 26
 
4.5%
9 25
 
4.3%
8 25
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 579
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 196
33.9%
1 75
 
13.0%
2 67
 
11.6%
5 49
 
8.5%
3 41
 
7.1%
6 26
 
4.5%
4 26
 
4.5%
, 26
 
4.5%
9 25
 
4.3%
8 25
 
4.3%

도급액
Text

MISSING 

Distinct100
Distinct (%)48.8%
Missing4
Missing (%)1.9%
Memory size1.8 KiB

Length

Max length5
Median length3
Mean length2.5658537
Min length1

Characters and Unicode

Total characters526
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)33.7%
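
For 도급액 the Distinct and Unique percentages appear to be taken over the 205 non-missing rows rather than all 209. The sketch below checks that reading against the reported figures; the helper name is illustrative.

```python
def pct_of_non_missing(count, n_rows, n_missing):
    """Percentage of `count` relative to the rows that actually hold a value."""
    return 100 * count / (n_rows - n_missing)


# 도급액: 209 rows, 4 missing, 100 distinct values, 69 unique values.
distinct_pct = pct_of_non_missing(100, 209, 4)
unique_pct = pct_of_non_missing(69, 209, 4)

print(f"Distinct: {distinct_pct:.1f}%  Unique: {unique_pct:.1f}%")
```

Dividing by all 209 rows would give 47.8% for Distinct instead of the reported 48.8%, which supports the non-missing denominator.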

Sample

1st row550
2nd row350
3rd row18
4th row250
5th row900
ValueCountFrequency (%)
0 15
 
7.3%
20 14
 
6.8%
300 9
 
4.4%
26 9
 
4.4%
100 7
 
3.4%
60 6
 
2.9%
70 6
 
2.9%
30 6
 
2.9%
10 6
 
2.9%
200 6
 
2.9%
Other values (90) 121
59.0%

Most occurring characters

ValueCountFrequency (%)
0 195
37.1%
2 65
 
12.4%
1 57
 
10.8%
5 38
 
7.2%
3 33
 
6.3%
6 32
 
6.1%
4 25
 
4.8%
7 23
 
4.4%
8 22
 
4.2%
9 19
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 509
96.8%
Other Punctuation 17
 
3.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 195
38.3%
2 65
 
12.8%
1 57
 
11.2%
5 38
 
7.5%
3 33
 
6.5%
6 32
 
6.3%
4 25
 
4.9%
7 23
 
4.5%
8 22
 
4.3%
9 19
 
3.7%
Other Punctuation
ValueCountFrequency (%)
, 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 526
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 195
37.1%
2 65
 
12.4%
1 57
 
10.8%
5 38
 
7.2%
3 33
 
6.3%
6 32
 
6.1%
4 25
 
4.8%
7 23
 
4.4%
8 22
 
4.2%
9 19
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 526
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 195
37.1%
2 65
 
12.4%
1 57
 
10.8%
5 38
 
7.2%
3 33
 
6.3%
6 32
 
6.1%
4 25
 
4.8%
7 23
 
4.4%
8 22
 
4.2%
9 19
 
3.6%

관급자재대
Categorical

Distinct45
Distinct (%)21.5%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
94 
<NA>
27 
200
 
8
20
 
8
50
 
6
Other values (40)
66 

Length

Max length5
Median length4
Mean length1.9904306
Min length1

Unique

Unique27 ?
Unique (%)12.9%

Sample

1st row200
2nd row150
3rd row0
4th row150
5th row<NA>

Common Values

ValueCountFrequency (%)
0 94
45.0%
<NA> 27
 
12.9%
200 8
 
3.8%
20 8
 
3.8%
50 6
 
2.9%
150 5
 
2.4%
350 4
 
1.9%
700 4
 
1.9%
2 4
 
1.9%
100 4
 
1.9%
Other values (35) 45
21.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
0 94
45.0%
na 27
 
12.9%
200 8
 
3.8%
20 8
 
3.8%
50 6
 
2.9%
150 5
 
2.4%
350 4
 
1.9%
700 4
 
1.9%
2 4
 
1.9%
100 4
 
1.9%
Other values (35) 45
21.5%

기타
Categorical

IMBALANCE 

Distinct15
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
155 
<NA>
36 
10
 
4
30
 
2
15
 
2
Other values (10)
 
10

Length

Max length6
Median length1
Mean length1.6507177
Min length1

Unique

Unique10 ?
Unique (%)4.8%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row<NA>

Common Values

ValueCountFrequency (%)
0 155
74.2%
<NA> 36
 
17.2%
10 4
 
1.9%
30 2
 
1.0%
15 2
 
1.0%
150 1
 
0.5%
80 1
 
0.5%
50 1
 
0.5%
159 1
 
0.5%
103 1
 
0.5%
Other values (5) 5
 
2.4%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
0 155
74.2%
na 36
 
17.2%
10 4
 
1.9%
30 2
 
1.0%
15 2
 
1.0%
150 1
 
0.5%
80 1
 
0.5%
50 1
 
0.5%
159 1
 
0.5%
103 1
 
0.5%
Other values (5) 5
 
2.4%

금차도급액(백만원)
Text

MISSING 

Distinct82
Distinct (%)47.7%
Missing37
Missing (%)17.7%
Memory size1.8 KiB

Length

Max length6
Median length5
Mean length2.4651163
Min length1

Characters and Unicode

Total characters424
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)31.4%

Sample

1st row550
2nd row350
3rd row18
4th row400
5th row301
ValueCountFrequency (%)
0 23
 
13.4%
300 7
 
4.1%
26 7
 
4.1%
10 6
 
3.5%
100 5
 
2.9%
400 5
 
2.9%
9 5
 
2.9%
70 5
 
2.9%
30 5
 
2.9%
200 4
 
2.3%
Other values (72) 100
58.1%

Most occurring characters

ValueCountFrequency (%)
0 162
38.2%
1 51
 
12.0%
2 39
 
9.2%
5 38
 
9.0%
3 25
 
5.9%
4 22
 
5.2%
6 21
 
5.0%
9 19
 
4.5%
8 17
 
4.0%
7 15
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 409
96.5%
Other Punctuation 15
 
3.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 162
39.6%
1 51
 
12.5%
2 39
 
9.5%
5 38
 
9.3%
3 25
 
6.1%
4 22
 
5.4%
6 21
 
5.1%
9 19
 
4.6%
8 17
 
4.2%
7 15
 
3.7%
Other Punctuation
ValueCountFrequency (%)
, 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 424
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 162
38.2%
1 51
 
12.0%
2 39
 
9.2%
5 38
 
9.0%
3 25
 
5.9%
4 22
 
5.2%
6 21
 
5.0%
9 19
 
4.5%
8 17
 
4.0%
7 15
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 424
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 162
38.2%
1 51
 
12.0%
2 39
 
9.2%
5 38
 
9.0%
3 25
 
5.9%
4 22
 
5.2%
6 21
 
5.0%
9 19
 
4.5%
8 17
 
4.0%
7 15
 
3.5%

국고보조금(백만원)
Categorical

IMBALANCE 

Distinct28
Distinct (%)13.4%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
0
149 
<NA>
31 
300
 
3
1,000
 
2
30
 
1
Other values (23)
23 

Length

Max length5
Median length1
Mean length1.7129187
Min length1

Unique

Unique24 ?
Unique (%)11.5%

Sample

1st row0
2nd row0
3rd row12
4th row0
5th row<NA>

Common Values

ValueCountFrequency (%)
0 149
71.3%
<NA> 31
 
14.8%
300 3
 
1.4%
1,000 2
 
1.0%
30 1
 
0.5%
109 1
 
0.5%
633 1
 
0.5%
368 1
 
0.5%
87 1
 
0.5%
156 1
 
0.5%
Other values (18) 18
 
8.6%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
0 149
71.3%
na 31
 
14.8%
300 3
 
1.4%
1,000 2
 
1.0%
35 1
 
0.5%
165 1
 
0.5%
329 1
 
0.5%
500 1
 
0.5%
1,214 1
 
0.5%
339 1
 
0.5%
Other values (18) 18
 
8.6%

부서명
Categorical

Distinct23
Distinct (%)11.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
산림공원과
37 
문화관광과
34 
건설교통과
25 
재무과
18 
체육맑은물사업소
15 
Other values (18)
80 

Length

Max length8
Median length5
Mean length4.6507177
Min length3

Unique

Unique3 ?
Unique (%)1.4%

Sample

1st row건설교통과
2nd row건설교통과
3rd row문화관광과
4th row민생경제과
5th row체육맑은물사업소

Common Values

ValueCountFrequency (%)
산림공원과 37
17.7%
문화관광과 34
16.3%
건설교통과 25
12.0%
재무과 18
8.6%
체육맑은물사업소 15
 
7.2%
환경위생과 11
 
5.3%
민생경제과 7
 
3.3%
장계면 7
 
3.3%
산림과 7
 
3.3%
장수읍 6
 
2.9%
Other values (13) 42
20.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
산림공원과 37
17.7%
문화관광과 34
16.3%
건설교통과 25
12.0%
재무과 18
8.6%
체육맑은물사업소 15
 
7.2%
환경위생과 11
 
5.3%
민생경제과 7
 
3.3%
장계면 7
 
3.3%
산림과 7
 
3.3%
장수읍 6
 
2.9%
Other values (13) 42
20.1%

담당자
Text

Distinct66
Distinct (%)31.6%
Missing0
Missing (%)0.0%
Memory size1.8 KiB

Length

Max length3
Median length3
Mean length2.9808612
Min length2

Characters and Unicode

Total characters623
Distinct characters79
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)12.9%

Sample

1st row임승현
2nd row양성길
3rd row서상철
4th row심영우
5th row전준형
ValueCountFrequency (%)
장문정 22
 
10.2%
서상철 16
 
7.4%
한정문 10
 
4.6%
안현환 9
 
4.2%
이현석 7
 
3.2%
7
 
3.2%
7
 
3.2%
이경수 7
 
3.2%
정상현 7
 
3.2%
서우혁 6
 
2.8%
Other values (57) 118
54.6%

Most occurring characters

ValueCountFrequency (%)
49
 
7.9%
39
 
6.3%
37
 
5.9%
35
 
5.6%
30
 
4.8%
28
 
4.5%
27
 
4.3%
22
 
3.5%
19
 
3.0%
16
 
2.6%
Other values (69) 321
51.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 616
98.9%
Space Separator 7
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
8.0%
39
 
6.3%
37
 
6.0%
35
 
5.7%
30
 
4.9%
28
 
4.5%
27
 
4.4%
22
 
3.6%
19
 
3.1%
16
 
2.6%
Other values (68) 314
51.0%
Space Separator
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 616
98.9%
Common 7
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
8.0%
39
 
6.3%
37
 
6.0%
35
 
5.7%
30
 
4.9%
28
 
4.5%
27
 
4.4%
22
 
3.6%
19
 
3.1%
16
 
2.6%
Other values (68) 314
51.0%
Common
ValueCountFrequency (%)
7
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 616
98.9%
ASCII 7
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
49
 
8.0%
39
 
6.3%
37
 
6.0%
35
 
5.7%
30
 
4.9%
28
 
4.5%
27
 
4.4%
22
 
3.6%
19
 
3.1%
16
 
2.6%
Other values (68) 314
51.0%
ASCII
ValueCountFrequency (%)
7
100.0%

전화번호
Text

Distinct67
Distinct (%)32.1%
Missing0
Missing (%)0.0%
Memory size1.8 KiB

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters2508
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)12.9%

Sample

1st row063-350-2578
2nd row063-350-2577
3rd row063-350-2326
4th row063-350-2211
5th row063-350-2894
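
The length statistics (every value is 12 characters long) suggest a fixed NNN-NNN-NNNN layout. The sketch below validates the five sample rows against that pattern. The pattern is an assumption drawn from the samples, not a documented schema.

```python
import re

# Assumed layout: three digits, three digits, four digits, separated by hyphens.
PHONE_PATTERN = re.compile(r"\d{3}-\d{3}-\d{4}")

samples = [
    "063-350-2578",
    "063-350-2577",
    "063-350-2326",
    "063-350-2211",
    "063-350-2894",
]

all_valid = all(PHONE_PATTERN.fullmatch(s) is not None for s in samples)
print(all_valid)

# Consistency check against the report: 209 rows of 12 characters, 2 hyphens each.
print(209 * 12, 209 * 2)
```

The totals 2,508 and 418 match the character and dash counts reported for this column.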
ValueCountFrequency (%)
063-350-2247 18
 
8.6%
063-350-2326 16
 
7.7%
063-350-2468 10
 
4.8%
063-350-2472 9
 
4.3%
063-350-2447 7
 
3.3%
063-350-2337 7
 
3.3%
063-350-2327 7
 
3.3%
063-350-1362 7
 
3.3%
063-350-1215 6
 
2.9%
063-350-2328 5
 
2.4%
Other values (57) 117
56.0%

Most occurring characters

ValueCountFrequency (%)
3 501
20.0%
0 427
17.0%
- 418
16.7%
6 274
10.9%
2 271
10.8%
5 268
10.7%
4 100
 
4.0%
1 93
 
3.7%
7 91
 
3.6%
8 44
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2090
83.3%
Dash Punctuation 418
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 501
24.0%
0 427
20.4%
6 274
13.1%
2 271
13.0%
5 268
12.8%
4 100
 
4.8%
1 93
 
4.4%
7 91
 
4.4%
8 44
 
2.1%
9 21
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 418
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2508
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 501
20.0%
0 427
17.0%
- 418
16.7%
6 274
10.9%
2 271
10.8%
5 268
10.7%
4 100
 
4.0%
1 93
 
3.7%
7 91
 
3.6%
8 44
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2508
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 501
20.0%
0 427
17.0%
- 418
16.7%
6 274
10.9%
2 271
10.8%
5 268
10.7%
4 100
 
4.0%
1 93
 
3.7%
7 91
 
3.6%
8 44
 
1.8%

Correlations

Variable | 구분 | 발주시기연월 | 공종(토건_토목_건축_전문_전기_통신_소방_기타 중 택1) | 도급액 | 관급자재대 | 기타 | 금차도급액(백만원) | 국고보조금(백만원) | 부서명 | 담당자 | 전화번호
구분 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.366 | 0.000 | 0.000
발주시기연월 | 0.000 | 1.000 | 0.415 | 0.707 | 0.518 | 0.600 | 0.479 | 0.000 | 0.582 | 0.760 | 0.877
공종(토건_토목_건축_전문_전기_통신_소방_기타 중 택1) | 0.000 | 0.415 | 1.000 | 0.484 | 0.695 | 0.000 | 0.000 | 0.000 | 0.732 | 0.949 | 0.939
도급액 | 0.000 | 0.707 | 0.484 | 1.000 | 0.986 | 0.000 | 0.998 | 0.979 | 0.000 | 0.000 | 0.000
관급자재대 | 0.000 | 0.518 | 0.695 | 0.986 | 1.000 | 0.000 | 0.948 | 0.926 | 0.000 | 0.854 | 0.861
기타 | 0.000 | 0.600 | 0.000 | 0.000 | 0.000 | 1.000 | 0.956 | 0.000 | 0.247 | 0.822 | 0.815
금차도급액(백만원) | 0.000 | 0.479 | 0.000 | 0.998 | 0.948 | 0.956 | 1.000 | 0.981 | 0.364 | 0.910 | 0.910
국고보조금(백만원) | 0.000 | 0.000 | 0.000 | 0.979 | 0.926 | 0.000 | 0.981 | 1.000 | 0.621 | 0.920 | 0.897
부서명 | 0.366 | 0.582 | 0.732 | 0.000 | 0.000 | 0.247 | 0.364 | 0.621 | 1.000 | 1.000 | 1.000
담당자 | 0.000 | 0.760 | 0.949 | 0.000 | 0.854 | 0.822 | 0.910 | 0.920 | 1.000 | 1.000 | 1.000
전화번호 | 0.000 | 0.877 | 0.939 | 0.000 | 0.861 | 0.815 | 0.910 | 0.897 | 1.000 | 1.000 | 1.000

Variable | 공종(토건_토목_건축_전문_전기_통신_소방_기타 중 택1) | 구분 | 기타 | 발주시기연월 | 국고보조금(백만원) | 부서명 | 관급자재대
공종(토건_토목_건축_전문_전기_통신_소방_기타 중 택1) | 1.000 | 0.000 | 0.000 | 0.189 | 0.000 | 0.345 | 0.254
구분 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.301 | 0.000
기타 | 0.000 | 0.000 | 1.000 | 0.287 | 0.000 | 0.080 | 0.000
발주시기연월 | 0.189 | 0.000 | 0.287 | 1.000 | 0.000 | 0.248 | 0.180
국고보조금(백만원) | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.208 | 0.452
부서명 | 0.345 | 0.301 | 0.080 | 0.248 | 0.208 | 1.000 | 0.000
관급자재대 | 0.254 | 0.000 | 0.000 | 0.180 | 0.452 | 0.000 | 1.000

Variable | 구분 | 발주시기연월 | 공종(토건_토목_건축_전문_전기_통신_소방_기타 중 택1) | 관급자재대 | 기타 | 국고보조금(백만원) | 부서명
구분 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.301
발주시기연월 | 0.000 | 1.000 | 0.189 | 0.180 | 0.287 | 0.000 | 0.248
공종(토건_토목_건축_전문_전기_통신_소방_기타 중 택1) | 0.000 | 0.189 | 1.000 | 0.254 | 0.000 | 0.000 | 0.345
관급자재대 | 0.000 | 0.180 | 0.254 | 1.000 | 0.000 | 0.452 | 0.000
기타 | 0.000 | 0.287 | 0.000 | 0.000 | 1.000 | 0.000 | 0.080
국고보조금(백만원) | 0.000 | 0.000 | 0.000 | 0.452 | 0.000 | 1.000 | 0.208
부서명 | 0.301 | 0.248 | 0.345 | 0.000 | 0.080 | 0.208 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
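
One common way to compute nullity correlation is the Pearson correlation between the missing-value indicators of two columns. The sketch below assumes that definition. The toy columns are hypothetical and not taken from the dataset, and the function name is illustrative.

```python
import math


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the missing-value indicators of two columns.

    Returns None when either column is entirely missing or entirely present,
    because the correlation is undefined without variation.
    """
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / math.sqrt(var_a * var_b)


# Hypothetical toy columns; None marks a missing cell.
contract = [550, None, 18, None]
current = [550, None, 18, None]   # missing in exactly the same rows
subsidy = [None, 0, None, 12]     # missing in exactly the opposite rows

same_gaps = nullity_correlation(contract, current)
opposite_gaps = nullity_correlation(contract, subsidy)
print(same_gaps, opposite_gaps)
```

A value of 1 means two columns are always missing together, -1 means one is present exactly when the other is missing, and values near 0 mean the gaps are unrelated.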

Sample

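Row | 구분 | 사 업 명 | 발주시기연월 | 공종(토건_토목_건축_전문_전기_통신_소방_기타 중 택1) | 사업비총계(백만원) | 도급액 | 관급자재대 | 기타 | 금차도급액(백만원) | 국고보조금(백만원) | 부서명 | 담당자 | 전화번호
0 | 공사 | 장수레드푸드 융복합센터 진입도로(군도12호) 개설공사 | 2024-01-23 | 토목 | 750 | 550 | 200 | 0 | 550 | 0 | 건설교통과 | 임승현 | 063-350-2578
1 | 공사 | 장수교촌로(3-4호) 개설공사 | 2024-01-23 | 토목 | 500 | 350 | 150 | 0 | 350 | 0 | 건설교통과 | 양성길 | 063-350-2577
2 | 공사 | 의암송, 봉덕리느티나무 보호사업 | 2024-01-23 | 전문 | 18 | 18 | 0 | 0 | 18 | 12 | 문화관광과 | 서상철 | 063-350-2326
3 | 공사 | 전북형 도시재생 뉴딜사업 스마트 골목길 정비사업 | 2024-01-23 | 토목 | 400 | 250 | 150 | 0 | 400 | 0 | 민생경제과 | 심영우 | 063-350-2211
4 | 공사 | 2023년 상수도시설 긴급보수공사(단가계약) | 2024-01-23 | 전문(상수도) | 900 | 900 | <NA> | <NA> | <NA> | <NA> | 체육맑은물사업소 | 전준형 | 063-350-2894
5 | 공사 | 2023년 장수군 상수도 개인급수공사(단가계약) | 2024-01-23 | 전문(상수도) | 200 | 200 | <NA> | <NA> | <NA> | <NA> | 체육맑은물사업소 | 전준형 | 063-350-2894
6 | 공사 | 계남 중방양돈단지 지장물 철거공사 | 2024-01-23 | 건축 | 301 | 301 | <NA> | <NA> | 301 | <NA> | 환경위생과 | 신철주 | 063-350-2536
7 | 공사 | 계남 중방양돈단지 석면철거공사 | 2024-01-23 | 기타 | 81 | 81 | <NA> | <NA> | 81 | <NA> | 환경위생과 | 신철주 | 063-350-2536
8 | 공사 | 대기오염측정망 신규설치사업 | 2024-01-23 | 기타 | 195 | 195 | <NA> | <NA> | 195 | <NA> | 환경위생과 | 유해수 | 063-350-2547
9 | 공사 | 장수군 소각시설 추가설치 사업 전기공사 | 2024-01-23 | 전기 | 1,317 | 692 | 615 | 10 | 692 | 329 | 환경위생과 | 이일근 | 063-350-2930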
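
Row | 구분 | 사 업 명 | 발주시기연월 | 공종(토건_토목_건축_전문_전기_통신_소방_기타 중 택1) | 사업비총계(백만원) | 도급액 | 관급자재대 | 기타 | 금차도급액(백만원) | 국고보조금(백만원) | 부서명 | 담당자 | 전화번호
199 | 공사 | 기후변화 대응 탄소 측정 장비 설치 공사 | 2024-07-23 | 기타 | 100 | 100 | 0 | 0 | 0 | 0 | 산림공원과 | 김민우 | 063-350-2473
200 | 공사 | 계북면작은목욕탕 신축공사 | 2024-07-23 | 건축 | 760 | 760 | <NA> | 0 | <NA> | 0 | 재무과 | 장문정 | 063-350-2247
201 | 공사 | 방화동가족휴가촌 기능보강사업 | 2024-08-23 | 건축 | 3,800 | 2,600 | 200 | 1,000 | 2,600 | 0 | 산림공원과 | 백지선 | 063-350-2471
202 | 공사 | 장수복합문화시설조성 | 2024-09-23 | 건축 | 4,200 | 0 | 0 | 0 | 4,200 | 0 | 문화관광과 | 백성훈 | 063-350-2317
203 | 공사 | 2023년 와룡자연휴양림 보완사업(토목-상수도) | 2024-09-23 | 토목 | 700 | 500 | 200 | 0 | 0 | 0 | 산림공원과 | 안현환 | 063-350-2472
204 | 공사 | 2023년 방화동자연휴양림 수변전실 개량공사 | 2024-09-23 | 전기 | 300 | 300 | 0 | 0 | 0 | 0 | 산림공원과 | 안현환 | 063-350-2472
205 | 공사 | 천천에놀라온(산악관광센터 증축)사업 건축공사 | 2024-12-23 | 건축 | 3,300 | 2,800 | 500 | 0 | 3,300 | 1,000 | 문화관광과 | 장문정 | 063-350-2346
206 | 공사 | 천천에놀라온(산악관광센터 증축)사업 전기공사 | 2024-12-23 | 전기 | 550 | 200 | 350 | 0 | 550 | 0 | 문화관광과 | 장문정 | 063-350-2346
207 | 공사 | 천천에놀라온(산악관광센터 증축)사업 통신공사 | 2024-12-23 | 통신 | 150 | 150 | 0 | 0 | 150 | 0 | 문화관광과 | 장문정 | 063-350-2346
208 | 공사 | 천천에놀라온(산악관광센터 증축)사업 소방공사 | 2024-12-23 | 소방 | 70 | 70 | 0 | 0 | 70 | 0 | 문화관광과 | 장문정 | 063-350-2346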
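
The amount columns are profiled as Text or Categorical because they contain thousands separators (for example `1,317`) and the literal string `<NA>`. The sketch below shows one way to convert such strings to integers before analysis. The helper name is hypothetical, and the row-9 values are read from the sample rows above under the assumption that the fused digits split as shown.

```python
def parse_amount(text):
    """Convert an amount string such as '1,317' to an int; return None for '<NA>' or blanks."""
    cleaned = text.strip()
    if cleaned in ("", "<NA>"):
        return None
    return int(cleaned.replace(",", ""))


# Row 9 of the sample, amounts in millions of won (백만원).
row9 = {
    "사업비총계(백만원)": "1,317",
    "도급액": "692",
    "관급자재대": "615",
    "기타": "10",
}

parsed = {name: parse_amount(value) for name, value in row9.items()}
components = parsed["도급액"] + parsed["관급자재대"] + parsed["기타"]
print(parsed["사업비총계(백만원)"], components)
```

In this row the total equals the sum of its three components, but that does not hold for every sample row (row 202 lists a total of 4,200 with all three components at 0), so treat it as an observation rather than a rule.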