Overview

Dataset statistics

Number of variables6
Number of observations29
Missing cells29
Missing cells (%)16.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory54.6 B

Variable types

Numeric1
Text4
Unsupported1

Dataset

Description도로점용 및 굴착허가 현황에 대한 데이터로 공사명, 공사위치, 공사기간 및 사업시행자에 대한 현황이 들어가 있는 데이터입니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=335&beforeMenuCd=DOM_000000201001001000&publicdatapk=15084815

Alerts

비고 has 29 (100.0%) missing valuesMissing
연번 has unique valuesUnique
비고 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-01-09 23:18:31.107111
Analysis finished2024-01-09 23:18:31.600408
Duration0.49 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct29
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.62069
Minimum1
Maximum30
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2024-01-10T08:18:31.650858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.4
Q18
median16
Q323
95-th percentile28.6
Maximum30
Range29
Interquartile range (IQR)15

Descriptive statistics

Standard deviation8.9339393
Coefficient of variation (CV)0.57192989
Kurtosis-1.2476493
Mean15.62069
Median Absolute Deviation (MAD)8
Skewness-0.04122564
Sum453
Variance79.815271
MonotonicityStrictly increasing
2024-01-10T08:18:31.752485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=29)
ValueCountFrequency (%)
1 1
 
3.4%
2 1
 
3.4%
30 1
 
3.4%
29 1
 
3.4%
28 1
 
3.4%
27 1
 
3.4%
26 1
 
3.4%
25 1
 
3.4%
24 1
 
3.4%
23 1
 
3.4%
Other values (19) 19
65.5%
ValueCountFrequency (%)
1 1
3.4%
2 1
3.4%
3 1
3.4%
4 1
3.4%
5 1
3.4%
6 1
3.4%
7 1
3.4%
8 1
3.4%
9 1
3.4%
10 1
3.4%
ValueCountFrequency (%)
30 1
3.4%
29 1
3.4%
28 1
3.4%
27 1
3.4%
26 1
3.4%
25 1
3.4%
24 1
3.4%
23 1
3.4%
22 1
3.4%
21 1
3.4%
Distinct28
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size364.0 B
2024-01-10T08:18:31.994437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length23
Mean length19.448276
Min length10

Characters and Unicode

Total characters564
Distinct characters146
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)93.1%

Sample

1st row대호지면 조금지구 밭기반정비사업
2nd row2021년 삽교천 하수관로 정비공사
3rd row우강면 송산리 지방상수도 시설공사
4th row합덕배수지 송수관로 설치공사
5th row농어촌생활용수개발(대호지, 정미) 급수관로 설치공사
ValueCountFrequency (%)
매설공사 5
 
4.5%
하수관로 3
 
2.7%
배관공사 3
 
2.7%
3
 
2.7%
지방상수도 3
 
2.7%
정비공사 3
 
2.7%
설치공사 3
 
2.7%
미래엔서해에너지 2
 
1.8%
도시가스 2
 
1.8%
지중화공사 2
 
1.8%
Other values (78) 83
74.1%
2024-01-10T08:18:32.327921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
84
 
14.9%
32
 
5.7%
29
 
5.1%
21
 
3.7%
20
 
3.5%
18
 
3.2%
16
 
2.8%
15
 
2.7%
9
 
1.6%
9
 
1.6%
Other values (136) 311
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 433
76.8%
Space Separator 84
 
14.9%
Decimal Number 20
 
3.5%
Uppercase Letter 13
 
2.3%
Other Symbol 5
 
0.9%
Close Punctuation 3
 
0.5%
Other Punctuation 3
 
0.5%
Open Punctuation 3
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
7.4%
29
 
6.7%
21
 
4.8%
20
 
4.6%
18
 
4.2%
16
 
3.7%
15
 
3.5%
9
 
2.1%
9
 
2.1%
7
 
1.6%
Other values (117) 257
59.4%
Uppercase Letter
ValueCountFrequency (%)
K 5
38.5%
W 3
23.1%
L 1
 
7.7%
H 1
 
7.7%
A 1
 
7.7%
S 1
 
7.7%
T 1
 
7.7%
Decimal Number
ValueCountFrequency (%)
3 5
25.0%
0 5
25.0%
2 3
15.0%
1 3
15.0%
4 2
 
10.0%
6 1
 
5.0%
5 1
 
5.0%
Space Separator
ValueCountFrequency (%)
84
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 438
77.7%
Common 113
 
20.0%
Latin 13
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
7.3%
29
 
6.6%
21
 
4.8%
20
 
4.6%
18
 
4.1%
16
 
3.7%
15
 
3.4%
9
 
2.1%
9
 
2.1%
7
 
1.6%
Other values (118) 262
59.8%
Common
ValueCountFrequency (%)
84
74.3%
3 5
 
4.4%
0 5
 
4.4%
) 3
 
2.7%
, 3
 
2.7%
( 3
 
2.7%
2 3
 
2.7%
1 3
 
2.7%
4 2
 
1.8%
6 1
 
0.9%
Latin
ValueCountFrequency (%)
K 5
38.5%
W 3
23.1%
L 1
 
7.7%
H 1
 
7.7%
A 1
 
7.7%
S 1
 
7.7%
T 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 433
76.8%
ASCII 126
 
22.3%
None 5
 
0.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
84
66.7%
K 5
 
4.0%
3 5
 
4.0%
0 5
 
4.0%
W 3
 
2.4%
) 3
 
2.4%
, 3
 
2.4%
( 3
 
2.4%
2 3
 
2.4%
1 3
 
2.4%
Other values (8) 9
 
7.1%
Hangul
ValueCountFrequency (%)
32
 
7.4%
29
 
6.7%
21
 
4.8%
20
 
4.6%
18
 
4.2%
16
 
3.7%
15
 
3.5%
9
 
2.1%
9
 
2.1%
7
 
1.6%
Other values (117) 257
59.4%
None
ValueCountFrequency (%)
5
100.0%
Distinct28
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size364.0 B
2024-01-10T08:18:32.520867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length38
Mean length27.689655
Min length16

Characters and Unicode

Total characters803
Distinct characters48
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)93.1%

Sample

1st row당진시 대호지면 일원 지방도 647호선, 도시계획도로 중로3-11호선
2nd row당진시 신평면 일원 도시계획도로 소로2-856호선
3rd row당진시 우강면 일원 지방도 622호선
4th row당진시 우강면 일원 시도 10호선
5th row당진시 정미면 일원 도시계획도로 중로 3-818호선
ValueCountFrequency (%)
당진시 29
19.1%
일원 29
19.1%
도시계획도로 19
12.5%
석문면 9
 
5.9%
지방도 6
 
3.9%
송악읍 5
 
3.3%
신평면 4
 
2.6%
당진1동 3
 
2.0%
633호선 2
 
1.3%
우강면 2
 
1.3%
Other values (42) 44
28.9%
2024-01-10T08:18:32.830772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
123
 
15.3%
52
 
6.5%
52
 
6.5%
46
 
5.7%
32
 
4.0%
32
 
4.0%
32
 
4.0%
31
 
3.9%
- 31
 
3.9%
29
 
3.6%
Other values (38) 343
42.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 507
63.1%
Decimal Number 127
 
15.8%
Space Separator 123
 
15.3%
Dash Punctuation 31
 
3.9%
Other Punctuation 15
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
10.3%
52
 
10.3%
46
 
9.1%
32
 
6.3%
32
 
6.3%
32
 
6.3%
31
 
6.1%
29
 
5.7%
29
 
5.7%
22
 
4.3%
Other values (26) 150
29.6%
Decimal Number
ValueCountFrequency (%)
3 28
22.0%
1 23
18.1%
2 22
17.3%
0 14
11.0%
5 11
 
8.7%
6 11
 
8.7%
7 6
 
4.7%
4 6
 
4.7%
8 6
 
4.7%
Space Separator
ValueCountFrequency (%)
123
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Other Punctuation
ValueCountFrequency (%)
, 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 507
63.1%
Common 296
36.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
10.3%
52
 
10.3%
46
 
9.1%
32
 
6.3%
32
 
6.3%
32
 
6.3%
31
 
6.1%
29
 
5.7%
29
 
5.7%
22
 
4.3%
Other values (26) 150
29.6%
Common
ValueCountFrequency (%)
123
41.6%
- 31
 
10.5%
3 28
 
9.5%
1 23
 
7.8%
2 22
 
7.4%
, 15
 
5.1%
0 14
 
4.7%
5 11
 
3.7%
6 11
 
3.7%
7 6
 
2.0%
Other values (2) 12
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 507
63.1%
ASCII 296
36.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
123
41.6%
- 31
 
10.5%
3 28
 
9.5%
1 23
 
7.8%
2 22
 
7.4%
, 15
 
5.1%
0 14
 
4.7%
5 11
 
3.7%
6 11
 
3.7%
7 6
 
2.0%
Other values (2) 12
 
4.1%
Hangul
ValueCountFrequency (%)
52
 
10.3%
52
 
10.3%
46
 
9.1%
32
 
6.3%
32
 
6.3%
32
 
6.3%
31
 
6.1%
29
 
5.7%
29
 
5.7%
22
 
4.3%
Other values (26) 150
29.6%
Distinct19
Distinct (%)65.5%
Missing0
Missing (%)0.0%
Memory size364.0 B
2024-01-10T08:18:33.005609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters435
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)48.3%

Sample

1st row2021-03~2021-07
2nd row2021-03~2021-06
3rd row2021-03~2021-06
4th row2021-03~2022-12
5th row2021-03~2021-04
ValueCountFrequency (%)
2021-03~2021-06 5
17.2%
2021-03~2022-12 3
 
10.3%
2021-03~2021-12 3
 
10.3%
2021-05~2021-06 2
 
6.9%
2021-06~2021-09 2
 
6.9%
2021-05~2021-07 1
 
3.4%
2021-03~2021-07 1
 
3.4%
2021-07~2021-08 1
 
3.4%
2021-05~2021-12 1
 
3.4%
2021-06~2021-07 1
 
3.4%
Other values (9) 9
31.0%
2024-01-10T08:18:33.262181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 134
30.8%
0 104
23.9%
1 65
14.9%
- 58
13.3%
~ 29
 
6.7%
3 14
 
3.2%
6 14
 
3.2%
5 7
 
1.6%
7 5
 
1.1%
9 2
 
0.5%
Other values (2) 3
 
0.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 348
80.0%
Dash Punctuation 58
 
13.3%
Math Symbol 29
 
6.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 134
38.5%
0 104
29.9%
1 65
18.7%
3 14
 
4.0%
6 14
 
4.0%
5 7
 
2.0%
7 5
 
1.4%
9 2
 
0.6%
4 2
 
0.6%
8 1
 
0.3%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%
Math Symbol
ValueCountFrequency (%)
~ 29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 435
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 134
30.8%
0 104
23.9%
1 65
14.9%
- 58
13.3%
~ 29
 
6.7%
3 14
 
3.2%
6 14
 
3.2%
5 7
 
1.6%
7 5
 
1.1%
9 2
 
0.5%
Other values (2) 3
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 435
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 134
30.8%
0 104
23.9%
1 65
14.9%
- 58
13.3%
~ 29
 
6.7%
3 14
 
3.2%
6 14
 
3.2%
5 7
 
1.6%
7 5
 
1.1%
9 2
 
0.5%
Other values (2) 3
 
0.7%
Distinct17
Distinct (%)58.6%
Missing0
Missing (%)0.0%
Memory size364.0 B
2024-01-10T08:18:33.433603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length9.137931
Min length3

Characters and Unicode

Total characters265
Distinct characters75
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)44.8%

Sample

1st row당진시장(건설과장)
2nd row당진시장(수도과장)
3rd row당진시장(수도과장)
4th row당진시장(수도과장)
5th row당진시장(수도과장)
ValueCountFrequency (%)
당진시장(수도과장 9
25.7%
당진지사장 3
 
8.6%
한국전력공사 3
 
8.6%
미래엔서해에너지 2
 
5.7%
㈜석문에너지 2
 
5.7%
수청1지구 1
 
2.9%
당진시장(건설과장 1
 
2.9%
대아에너지㈜ 1
 
2.9%
한국철도시설공단 1
 
2.9%
㈜지오콘 1
 
2.9%
Other values (11) 11
31.4%
2024-01-10T08:18:33.695484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
 
10.2%
16
 
6.0%
16
 
6.0%
14
 
5.3%
( 12
 
4.5%
) 12
 
4.5%
11
 
4.2%
11
 
4.2%
11
 
4.2%
10
 
3.8%
Other values (65) 125
47.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 222
83.8%
Open Punctuation 12
 
4.5%
Close Punctuation 12
 
4.5%
Other Symbol 7
 
2.6%
Space Separator 6
 
2.3%
Decimal Number 2
 
0.8%
Lowercase Letter 2
 
0.8%
Uppercase Letter 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
12.2%
16
 
7.2%
16
 
7.2%
14
 
6.3%
11
 
5.0%
11
 
5.0%
11
 
5.0%
10
 
4.5%
10
 
4.5%
5
 
2.3%
Other values (55) 91
41.0%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
3 1
50.0%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
t 1
50.0%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
K 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 229
86.4%
Common 32
 
12.1%
Latin 4
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
11.8%
16
 
7.0%
16
 
7.0%
14
 
6.1%
11
 
4.8%
11
 
4.8%
11
 
4.8%
10
 
4.4%
10
 
4.4%
7
 
3.1%
Other values (56) 96
41.9%
Common
ValueCountFrequency (%)
( 12
37.5%
) 12
37.5%
6
18.8%
1 1
 
3.1%
3 1
 
3.1%
Latin
ValueCountFrequency (%)
k 1
25.0%
t 1
25.0%
S 1
25.0%
K 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 222
83.8%
ASCII 36
 
13.6%
None 7
 
2.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
27
 
12.2%
16
 
7.2%
16
 
7.2%
14
 
6.3%
11
 
5.0%
11
 
5.0%
11
 
5.0%
10
 
4.5%
10
 
4.5%
5
 
2.3%
Other values (55) 91
41.0%
ASCII
ValueCountFrequency (%)
( 12
33.3%
) 12
33.3%
6
16.7%
1 1
 
2.8%
k 1
 
2.8%
t 1
 
2.8%
S 1
 
2.8%
K 1
 
2.8%
3 1
 
2.8%
None
ValueCountFrequency (%)
7
100.0%

비고
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing29
Missing (%)100.0%
Memory size393.0 B

Interactions

2024-01-10T08:18:31.390571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T08:18:33.774878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번공사명공사위치공사기간사업시행자
연번1.0000.9291.0000.7930.745
공사명0.9291.0000.9900.9681.000
공사위치1.0000.9901.0000.8701.000
공사기간0.7930.9680.8701.0000.809
사업시행자0.7451.0001.0000.8091.000

Missing values

2024-01-10T08:18:31.485422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T08:18:31.567551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번공사명공사위치공사기간사업시행자비고
01대호지면 조금지구 밭기반정비사업당진시 대호지면 일원 지방도 647호선, 도시계획도로 중로3-11호선2021-03~2021-07당진시장(건설과장)<NA>
122021년 삽교천 하수관로 정비공사당진시 신평면 일원 도시계획도로 소로2-856호선2021-03~2021-06당진시장(수도과장)<NA>
23우강면 송산리 지방상수도 시설공사당진시 우강면 일원 지방도 622호선2021-03~2021-06당진시장(수도과장)<NA>
34합덕배수지 송수관로 설치공사당진시 우강면 일원 시도 10호선2021-03~2022-12당진시장(수도과장)<NA>
45농어촌생활용수개발(대호지, 정미) 급수관로 설치공사당진시 정미면 일원 도시계획도로 중로 3-818호선2021-03~2021-04당진시장(수도과장)<NA>
56중흥 하수관로 정비공사당진시 송악읍 일원 도시계획도로 소로2-178, 중로2-704, 중로3-702호선2021-03~2022-12당진시장(수도과장)<NA>
67당진시농업기술센터 상수도 배관공사당진시 합덕읍 일원 도시계획도로 대로3-302호선2021-03~2021-06당진시장(농업기술센터)<NA>
78미래엔서해에너지 도시가스 배관공사당진시 당진1동 일원 도시계획도로 소로1-103, 소로2-118호선2021-03~2021-06미래엔서해에너지<NA>
89㈜석문에너지 추가사용자 중온수 및 스팀공사당진시 석문면 일원 도시계획도로 대로2-5, 대로2-6호선2021-03~2022-12㈜석문에너지<NA>
910신평면 소재지 주변 지중화공사당진시 신평면 일원 도시계획도로 중로3-504호선2021-03~2021-12한국전력공사 당진지사장<NA>
연번공사명공사위치공사기간사업시행자비고
1921교로3리 노후상수관 개량 및 확장공사당진시 석문면 일원 지방도 615호선2021-06~2021-11당진시장(수도과장)<NA>
2022미래엔서해에너지 도시가스 배관공사당진시 송악읍, 합덕읍,고대면, 면천면 일원 도시계획도로 대로3-2,대로3-6호선, 국지도70호선2021-05~2022-12미래엔서해에너지<NA>
2123고대면 성산리 LH주택용 전력 3,400KW 공사당진시 고대면 일원 도시계획도로 대로3-2호선2021-07~2021-08한국전력공사 당진지사장<NA>
2224대아에너지㈜ 우수관로 매설공사당진시 석문면 일원 도시게획도로 중로1-8호선2021-05~2021-06대아에너지㈜<NA>
2325㈜라군 관로매설공사당진시 석문면 일원 도시계획도로 대로1-2호선2021-05~2021-07㈜라군<NA>
2426국도32호선 수청1지구 진입도로 개설공사당진시 당진1동 일원 국도32호선2021-05~2022-04수청1지구 도시개발사업조합장<NA>
2527석문에너지 추가사용자 중온수 및 스팀공사당진시 석문면 일원 도시계획도로 대로2-20,2-23,1-3호선2021-07~2021-12㈜석문에너지<NA>
2628통정리 자원순환시설 우수관 매설공사당진시 석문면 일원 도시계획도로2-23호선2021-06~2021-07㈜지오콘<NA>
2729철도시설 변전소 상수관로 설치당진시 합덕읍 일원 시도7호선2021-05~2021-12한국철도시설공단<NA>
2830송산공장 우오수관로 매설공사당진시 송산면 일원 도시계획도로 중로2-21호선2021-06~2021-12㈜베바스토코리아홀딩스<NA>