Overview

Dataset statistics

Number of variables8
Number of observations83
Missing cells25
Missing cells (%)3.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.3 KiB
Average record size in memory65.6 B

Variable types

Numeric1
Text1
Categorical5
DateTime1

Dataset

Description전라남도 나주시 관내 지하매설물 공사 등 도로굴착 공사를 위한 점용허가 정보(공사위치, 도로, 허가시작일, 종료일 등) 제공
Author전라남도 나주시
URLhttps://www.data.go.kr/data/15084494/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 용도 and 1 other fieldsHigh correlation
노선명 is highly overall correlated with 용도High correlation
용도 is highly overall correlated with 연번 and 3 other fieldsHigh correlation
만료일 is highly overall correlated with 용도 and 1 other fieldsHigh correlation
신청법인명 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
노선명 is highly imbalanced (68.9%)Imbalance
만료일 is highly imbalanced (65.5%)Imbalance
연번 has 25 (30.1%) missing valuesMissing

Reproduction

Analysis started2023-12-12 10:58:02.421098
Analysis finished2023-12-12 10:58:04.207425
Duration1.79 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct58
Distinct (%)100.0%
Missing25
Missing (%)30.1%
Infinite0
Infinite (%)0.0%
Mean33.5
Minimum1
Maximum66
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size879.0 B
2023-12-12T19:58:04.338905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.85
Q117.25
median32.5
Q349.75
95-th percentile63.15
Maximum66
Range65
Interquartile range (IQR)32.5

Descriptive statistics

Standard deviation19.178982
Coefficient of variation (CV)0.57250691
Kurtosis-1.1905771
Mean33.5
Median Absolute Deviation (MAD)16.5
Skewness0.037645843
Sum1943
Variance367.83333
MonotonicityStrictly increasing
2023-12-12T19:58:04.583362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
51 1
 
1.2%
36 1
 
1.2%
39 1
 
1.2%
40 1
 
1.2%
41 1
 
1.2%
42 1
 
1.2%
43 1
 
1.2%
44 1
 
1.2%
46 1
 
1.2%
47 1
 
1.2%
Other values (48) 48
57.8%
(Missing) 25
30.1%
ValueCountFrequency (%)
1 1
1.2%
3 1
1.2%
4 1
1.2%
5 1
1.2%
6 1
1.2%
7 1
1.2%
8 1
1.2%
9 1
1.2%
10 1
1.2%
11 1
1.2%
ValueCountFrequency (%)
66 1
1.2%
65 1
1.2%
64 1
1.2%
63 1
1.2%
62 1
1.2%
61 1
1.2%
60 1
1.2%
58 1
1.2%
57 1
1.2%
55 1
1.2%
Distinct69
Distinct (%)83.1%
Missing0
Missing (%)0.0%
Memory size796.0 B
2023-12-12T19:58:05.037230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length25
Mean length19.433735
Min length16

Characters and Unicode

Total characters1613
Distinct characters66
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)69.9%

Sample

1st row전라남도 나주시 송월동 1223
2nd row전라남도 나주시 성북동 8-22 일원
3rd row전라남도 나주시 이창동 161-2일원
4th row전라남도 나주시 빛가람동 290
5th row전라남도 나주시 반남면 흥덕리 101-1
ValueCountFrequency (%)
나주시 86
22.3%
전라남도 83
21.6%
일원 37
 
9.6%
빛가람동 21
 
5.5%
송월동 20
 
5.2%
이창동 8
 
2.1%
1223 5
 
1.3%
남평읍 5
 
1.3%
성북동 4
 
1.0%
1222 3
 
0.8%
Other values (91) 113
29.4%
2023-12-12T19:58:05.658895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
303
18.8%
93
 
5.8%
89
 
5.5%
86
 
5.3%
86
 
5.3%
83
 
5.1%
83
 
5.1%
83
 
5.1%
72
 
4.5%
2 67
 
4.2%
Other values (56) 568
35.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 988
61.3%
Space Separator 303
 
18.8%
Decimal Number 293
 
18.2%
Dash Punctuation 29
 
1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
93
 
9.4%
89
 
9.0%
86
 
8.7%
86
 
8.7%
83
 
8.4%
83
 
8.4%
83
 
8.4%
72
 
7.3%
40
 
4.0%
39
 
3.9%
Other values (44) 234
23.7%
Decimal Number
ValueCountFrequency (%)
2 67
22.9%
1 62
21.2%
8 27
9.2%
4 27
9.2%
7 23
 
7.8%
3 21
 
7.2%
9 19
 
6.5%
0 17
 
5.8%
6 17
 
5.8%
5 13
 
4.4%
Space Separator
ValueCountFrequency (%)
303
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 988
61.3%
Common 625
38.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
93
 
9.4%
89
 
9.0%
86
 
8.7%
86
 
8.7%
83
 
8.4%
83
 
8.4%
83
 
8.4%
72
 
7.3%
40
 
4.0%
39
 
3.9%
Other values (44) 234
23.7%
Common
ValueCountFrequency (%)
303
48.5%
2 67
 
10.7%
1 62
 
9.9%
- 29
 
4.6%
8 27
 
4.3%
4 27
 
4.3%
7 23
 
3.7%
3 21
 
3.4%
9 19
 
3.0%
0 17
 
2.7%
Other values (2) 30
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 988
61.3%
ASCII 625
38.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
303
48.5%
2 67
 
10.7%
1 62
 
9.9%
- 29
 
4.6%
8 27
 
4.3%
4 27
 
4.3%
7 23
 
3.7%
3 21
 
3.4%
9 19
 
3.0%
0 17
 
2.7%
Other values (2) 30
 
4.8%
Hangul
ValueCountFrequency (%)
93
 
9.4%
89
 
9.0%
86
 
8.7%
86
 
8.7%
83
 
8.4%
83
 
8.4%
83
 
8.4%
72
 
7.3%
40
 
4.0%
39
 
3.9%
Other values (44) 234
23.7%

노선명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct10
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size796.0 B
도시계획도로
71 
도시계획도로 외
 
3
지방도 818호선
 
2
지방도820
 
1
면도102
 
1
Other values (5)
 
5

Length

Max length9
Median length6
Mean length6.2289157
Min length5

Unique

Unique7 ?
Unique (%)8.4%

Sample

1st row도시계획도로
2nd row도시계획도로
3rd row도시계획도로
4th row도시계획도로
5th row지방도820

Common Values

ValueCountFrequency (%)
도시계획도로 71
85.5%
도시계획도로 외 3
 
3.6%
지방도 818호선 2
 
2.4%
지방도820 1
 
1.2%
면도102 1
 
1.2%
시도 36호선 외 1
 
1.2%
국도 23호선 1
 
1.2%
지방도 820호선 1
 
1.2%
시도 40호선 1
 
1.2%
시도 9호선 1
 
1.2%

Length

2023-12-12T19:58:05.884418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:58:06.106944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도시계획도로 74
78.7%
4
 
4.3%
지방도 3
 
3.2%
시도 3
 
3.2%
818호선 2
 
2.1%
지방도820 1
 
1.1%
면도102 1
 
1.1%
36호선 1
 
1.1%
국도 1
 
1.1%
23호선 1
 
1.1%
Other values (3) 3
 
3.2%

용도
Categorical

HIGH CORRELATION 

Distinct32
Distinct (%)38.6%
Missing0
Missing (%)0.0%
Memory size796.0 B
도시가스 배관 매설
23 
통신관로 매설
우수관로 매설
노후관 교체
 
3
지중화공사
 
3
Other values (27)
38 

Length

Max length15
Median length13
Mean length8
Min length4

Unique

Unique18 ?
Unique (%)21.7%

Sample

1st row우수관로 매설
2nd row통신관로 매설
3rd row통신관로 매설
4th row통신관로 매설
5th row오수관로 매설

Common Values

ValueCountFrequency (%)
도시가스 배관 매설 23
27.7%
통신관로 매설 9
 
10.8%
우수관로 매설 7
 
8.4%
노후관 교체 3
 
3.6%
지중화공사 3
 
3.6%
광통신용 관로 매설 3
 
3.6%
오우수관로 매설 3
 
3.6%
열수송관 매설 2
 
2.4%
오수관로 매설 2
 
2.4%
전력관로 매설 2
 
2.4%
Other values (22) 26
31.3%

Length

2023-12-12T19:58:06.775767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
매설 58
30.7%
도시가스 23
 
12.2%
배관 23
 
12.2%
우수관로 12
 
6.3%
연결 9
 
4.8%
통신관로 9
 
4.8%
관로 4
 
2.1%
오우수관로 4
 
2.1%
오수관로 4
 
2.1%
설치 4
 
2.1%
Other values (24) 39
20.6%
Distinct44
Distinct (%)53.0%
Missing0
Missing (%)0.0%
Memory size796.0 B
Minimum2012-05-22 00:00:00
Maximum2121-01-18 00:00:00
2023-12-12T19:58:06.981303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:58:07.201730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=44)

만료일
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)9.6%
Missing0
Missing (%)0.0%
Memory size796.0 B
2030-12-31
69 
2027-12-31
2029-12-31
 
2
2021-03-21
 
1
2021-12-31
 
1
Other values (3)
 
3

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique5 ?
Unique (%)6.0%

Sample

1st row2027-12-31
2nd row2030-12-31
3rd row2030-12-31
4th row2030-12-31
5th row2021-03-21

Common Values

ValueCountFrequency (%)
2030-12-31 69
83.1%
2027-12-31 7
 
8.4%
2029-12-31 2
 
2.4%
2021-03-21 1
 
1.2%
2021-12-31 1
 
1.2%
2021-05-27 1
 
1.2%
2025-12-31 1
 
1.2%
2022-05-21 1
 
1.2%

Length

2023-12-12T19:58:07.435232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:58:07.620267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2030-12-31 69
83.1%
2027-12-31 7
 
8.4%
2029-12-31 2
 
2.4%
2021-03-21 1
 
1.2%
2021-12-31 1
 
1.2%
2021-05-27 1
 
1.2%
2025-12-31 1
 
1.2%
2022-05-21 1
 
1.2%

신청법인명
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)30.1%
Missing0
Missing (%)0.0%
Memory size796.0 B
㈜해양에너지
23 
<NA>
18 
나주시청
한국전력공사 나주지사
주식회사 케이티
Other values (20)
28 

Length

Max length15
Median length12
Mean length6.9277108
Min length4

Unique

Unique15 ?
Unique (%)18.1%

Sample

1st row<NA>
2nd row주식회사 케이티
3rd row주식회사 케이티
4th rowLG유플러스
5th row하나건설㈜

Common Values

ValueCountFrequency (%)
㈜해양에너지 23
27.7%
<NA> 18
21.7%
나주시청 6
 
7.2%
한국전력공사 나주지사 4
 
4.8%
주식회사 케이티 4
 
4.8%
LG유플러스 3
 
3.6%
에스케이텔레콘 주식회사 3
 
3.6%
한국전력공사광주전남본부 3
 
3.6%
한국지역난방공사 광주전남지사 2
 
2.4%
하나건설㈜ 2
 
2.4%
Other values (15) 15
18.1%

Length

2023-12-12T19:58:07.913640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
㈜해양에너지 23
21.7%
na 18
17.0%
주식회사 11
10.4%
나주시청 6
 
5.7%
유한회사 5
 
4.7%
한국전력공사 4
 
3.8%
나주지사 4
 
3.8%
케이티 4
 
3.8%
lg유플러스 3
 
2.8%
에스케이텔레콘 3
 
2.8%
Other values (20) 25
23.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size796.0 B
2021-07-15
83 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-07-15
2nd row2021-07-15
3rd row2021-07-15
4th row2021-07-15
5th row2021-07-15

Common Values

ValueCountFrequency (%)
2021-07-15 83
100.0%

Length

2023-12-12T19:58:08.167587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:58:08.347793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-07-15 83
100.0%

Interactions

2023-12-12T19:58:03.154714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:58:08.485327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번점용장소노선명용도허가일만료일신청법인명
연번1.0000.6760.0000.0000.8520.0000.108
점용장소0.6761.0000.9210.9630.9820.0000.931
노선명0.0000.9211.0000.9720.9630.7190.875
용도0.0000.9630.9721.0000.9900.9970.994
허가일0.8520.9820.9630.9901.0000.9760.985
만료일0.0000.0000.7190.9970.9761.0000.990
신청법인명0.1080.9310.8750.9940.9850.9901.000
2023-12-12T19:58:08.721942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도노선명만료일신청법인명
용도1.0000.6880.7490.773
노선명0.6881.0000.4430.488
만료일0.7490.4431.0000.719
신청법인명0.7730.4880.7191.000
2023-12-12T19:58:08.899726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번노선명용도만료일신청법인명
연번1.0000.3330.6190.2940.604
노선명0.3331.0000.6880.4430.488
용도0.6190.6881.0000.7490.773
만료일0.2940.4430.7491.0000.719
신청법인명0.6040.4880.7730.7191.000

Missing values

2023-12-12T19:58:03.914010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:58:04.119582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번점용장소노선명용도허가일만료일신청법인명데이터기준일자
01전라남도 나주시 송월동 1223도시계획도로우수관로 매설2018-07-062027-12-31<NA>2021-07-15
12_1전라남도 나주시 성북동 8-22 일원도시계획도로통신관로 매설2021-01-062030-12-31주식회사 케이티2021-07-15
22_2전라남도 나주시 이창동 161-2일원도시계획도로통신관로 매설2021-01-062030-12-31주식회사 케이티2021-07-15
33전라남도 나주시 빛가람동 290도시계획도로통신관로 매설2021-01-112030-12-31LG유플러스2021-07-15
44전라남도 나주시 반남면 흥덕리 101-1지방도820오수관로 매설2018-12-072021-03-21하나건설㈜2021-07-15
55전라남도 나주시 산포면 덕례리 410-1 일원면도102오수관로 매설2121-01-182030-12-31나주시청2021-07-15
66전라남도 나주시 진포동 42-9 일원도시계획도로하수관로 매설2021-01-182030-12-31<NA>2021-07-15
77전라남도 나주시 이창동 815도시계획도로오우수관 설치2021-01-202030-12-31<NA>2021-07-15
88전라남도 나주시 빛가람동 879도시계획도로전력관로 매설2021-01-202030-12-31한국전력공사 나주지사2021-07-15
99전라남도 나주시 빛가람동 886 일원도시계획도로통신관로 매설2021-01-222030-12-31에스케이텔레콘 주식회사2021-07-15
연번점용장소노선명용도허가일만료일신청법인명데이터기준일자
7359_1전라남도 나주시 송월동 1222 일원도시계획도로도시가스 배관 매설2021-06-112030-12-31㈜해양에너지2021-07-15
7459_2전라남도 나주시 대호동 1093 일원도시계획도로도시가스 배관 매설2021-06-112030-12-31㈜해양에너지2021-07-15
7559_3전라남도 나주시 빛가람동 523 일원도시계획도로도시가스 배관 매설2021-06-112030-12-31㈜해양에너지2021-07-15
7660전라남도 나주시 송월동 1223도시계획도로우수관로 매설2018-07-062027-12-31(유)대세, (유)알천2021-07-15
7761전라남도 나주시 송월동 1437도시계획도로우수관로 설치2020-08-062029-12-31디에스개발(유)2021-07-15
7862전라남도 나주시 빛가람동 874도시계획도로생활오수관 연결2021-06-152025-12-31한국에너지공과대학교2021-07-15
7963전라남도 나주시 송월동 437-1 일원도시계획도로송전선로 이설 지중화매설2021-06-222030-12-31<NA>2021-07-15
8064전라남도 나주시 송월동 1112도시계획도로소매점 진출입로 및 오우수관2012-05-222022-05-21㈜호남주택2021-07-15
8165전라남도 나주시 공산면 남창리 산 46-1 일원시도 9호선관로 매설2021-06-252030-12-31한국전력공사 나주지사2021-07-15
8266전라남도 나주시 빛가람동 650 일원도시계획도로우수관 연결2021-06-282030-12-31<NA>2021-07-15