Overview

Dataset statistics

Number of variables8
Number of observations26
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory70.1 B

Variable types

Numeric1
Categorical5
Text2

Dataset

Description경상북도 문경시 도로점용(굴착)공사 허가정보(2022.03.29)에 대한 데이터로 도로굴착 위치, 공사시기, 시행처 등을 제공합니다.
Author경상북도 문경시
URLhttps://www.data.go.kr/data/15099573/fileData.do

Alerts

비 고 is highly overall correlated with 연번 and 4 other fieldsHigh correlation
허가번호 is highly overall correlated with 허가일자 and 2 other fieldsHigh correlation
허가일자 is highly overall correlated with 허가번호 and 2 other fieldsHigh correlation
시 행 처 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
공사시기 is highly overall correlated with 허가번호 and 3 other fieldsHigh correlation
연번 is highly overall correlated with 시 행 처 and 1 other fieldsHigh correlation
허가번호 is highly imbalanced (60.1%)Imbalance
허가일자 is highly imbalanced (60.1%)Imbalance
연번 has unique valuesUnique
굴착위치 has unique valuesUnique

Reproduction

Analysis started2023-12-12 11:57:39.168896
Analysis finished2023-12-12 11:57:40.144838
Duration0.98 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.5
Minimum1
Maximum26
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-12T20:57:40.232265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.25
Q17.25
median13.5
Q319.75
95-th percentile24.75
Maximum26
Range25
Interquartile range (IQR)12.5

Descriptive statistics

Standard deviation7.6485293
Coefficient of variation (CV)0.56655772
Kurtosis-1.2
Mean13.5
Median Absolute Deviation (MAD)6.5
Skewness0
Sum351
Variance58.5
MonotonicityStrictly increasing
2023-12-12T20:57:40.428298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
1 1
 
3.8%
15 1
 
3.8%
26 1
 
3.8%
25 1
 
3.8%
24 1
 
3.8%
23 1
 
3.8%
22 1
 
3.8%
21 1
 
3.8%
20 1
 
3.8%
19 1
 
3.8%
Other values (16) 16
61.5%
ValueCountFrequency (%)
1 1
3.8%
2 1
3.8%
3 1
3.8%
4 1
3.8%
5 1
3.8%
6 1
3.8%
7 1
3.8%
8 1
3.8%
9 1
3.8%
10 1
3.8%
ValueCountFrequency (%)
26 1
3.8%
25 1
3.8%
24 1
3.8%
23 1
3.8%
22 1
3.8%
21 1
3.8%
20 1
3.8%
19 1
3.8%
18 1
3.8%
17 1
3.8%

허가번호
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)19.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
도로관리심의회
22 
2022 01
 
1
2022 02
 
1
2022 03
 
1
2022 04
 
1

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique4 ?
Unique (%)15.4%

Sample

1st row2022 01
2nd row2022 02
3rd row2022 03
4th row2022 04
5th row도로관리심의회

Common Values

ValueCountFrequency (%)
도로관리심의회 22
84.6%
2022 01 1
 
3.8%
2022 02 1
 
3.8%
2022 03 1
 
3.8%
2022 04 1
 
3.8%

Length

2023-12-12T20:57:40.592315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:57:40.741656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도로관리심의회 22
73.3%
2022 4
 
13.3%
01 1
 
3.3%
02 1
 
3.3%
03 1
 
3.3%
04 1
 
3.3%

허가일자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)19.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
심의안건
22 
2022-02-17
 
1
2022-02-25
 
1
2022-03-14
 
1
2022-03-24
 
1

Length

Max length10
Median length4
Mean length4.9230769
Min length4

Unique

Unique4 ?
Unique (%)15.4%

Sample

1st row2022-02-17
2nd row2022-02-25
3rd row2022-03-14
4th row2022-03-24
5th row심의안건

Common Values

ValueCountFrequency (%)
심의안건 22
84.6%
2022-02-17 1
 
3.8%
2022-02-25 1
 
3.8%
2022-03-14 1
 
3.8%
2022-03-24 1
 
3.8%

Length

2023-12-12T20:57:40.920707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:57:41.075372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
심의안건 22
84.6%
2022-02-17 1
 
3.8%
2022-02-25 1
 
3.8%
2022-03-14 1
 
3.8%
2022-03-24 1
 
3.8%
Distinct25
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-12T20:57:41.393028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length32
Mean length26.192308
Min length8

Characters and Unicode

Total characters681
Distinct characters111
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)92.3%

Sample

1st row문경시 지방상수도 현대화사업 블록시스템 구축공사
2nd row우오수 관로매설
3rd row문경시 점촌동 4535-14번지 외 6개소 도시가스 배관 매설
4th row문경시 모전동 1308번지 외 1개소 도시가스 배관 매설
5th row문경파출소 옆 도시계획도로(소로2-19) 개설공사
ValueCountFrequency (%)
지방상수도 5
 
4.3%
배전설비 5
 
4.3%
현대화사업 5
 
4.3%
이천~문경 5
 
4.3%
철도건설(9공구 5
 
4.3%
지장 5
 
4.3%
일원 5
 
4.3%
블록시스템 5
 
4.3%
문경시 3
 
2.6%
점촌4동 3
 
2.6%
Other values (61) 71
60.7%
2023-12-12T20:57:41.901443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
91
 
13.4%
29
 
4.3%
27
 
4.0%
22
 
3.2%
) 20
 
2.9%
20
 
2.9%
( 20
 
2.9%
17
 
2.5%
14
 
2.1%
14
 
2.1%
Other values (101) 407
59.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 481
70.6%
Space Separator 91
 
13.4%
Decimal Number 50
 
7.3%
Close Punctuation 20
 
2.9%
Open Punctuation 20
 
2.9%
Dash Punctuation 7
 
1.0%
Math Symbol 5
 
0.7%
Connector Punctuation 4
 
0.6%
Other Punctuation 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
6.0%
27
 
5.6%
22
 
4.6%
20
 
4.2%
17
 
3.5%
14
 
2.9%
14
 
2.9%
13
 
2.7%
13
 
2.7%
12
 
2.5%
Other values (84) 300
62.4%
Decimal Number
ValueCountFrequency (%)
3 9
18.0%
1 9
18.0%
4 8
16.0%
2 7
14.0%
9 6
12.0%
5 3
 
6.0%
6 3
 
6.0%
8 3
 
6.0%
7 1
 
2.0%
0 1
 
2.0%
Space Separator
ValueCountFrequency (%)
91
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 481
70.6%
Common 200
29.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
6.0%
27
 
5.6%
22
 
4.6%
20
 
4.2%
17
 
3.5%
14
 
2.9%
14
 
2.9%
13
 
2.7%
13
 
2.7%
12
 
2.5%
Other values (84) 300
62.4%
Common
ValueCountFrequency (%)
91
45.5%
) 20
 
10.0%
( 20
 
10.0%
3 9
 
4.5%
1 9
 
4.5%
4 8
 
4.0%
- 7
 
3.5%
2 7
 
3.5%
9 6
 
3.0%
~ 5
 
2.5%
Other values (7) 18
 
9.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 481
70.6%
ASCII 200
29.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
91
45.5%
) 20
 
10.0%
( 20
 
10.0%
3 9
 
4.5%
1 9
 
4.5%
4 8
 
4.0%
- 7
 
3.5%
2 7
 
3.5%
9 6
 
3.0%
~ 5
 
2.5%
Other values (7) 18
 
9.0%
Hangul
ValueCountFrequency (%)
29
 
6.0%
27
 
5.6%
22
 
4.6%
20
 
4.2%
17
 
3.5%
14
 
2.9%
14
 
2.9%
13
 
2.7%
13
 
2.7%
12
 
2.5%
Other values (84) 300
62.4%

굴착위치
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-12T20:57:42.120612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16.5
Mean length13.615385
Min length10

Characters and Unicode

Total characters354
Distinct characters48
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row흥덕동 307 일원
2nd row신기동1139-1 일원
3rd row점촌동 435-14 일원
4th row모전동 1308 일원
5th row문경읍 상리 445-4 일원
ValueCountFrequency (%)
일원 26
29.2%
문경읍 6
 
6.7%
흥덕동 5
 
5.6%
점촌동 3
 
3.4%
마성면 3
 
3.4%
하리 3
 
3.4%
모전동 2
 
2.2%
신기동 2
 
2.2%
마원리 2
 
2.2%
남호리 2
 
2.2%
Other values (35) 35
39.3%
2023-12-12T20:57:42.589399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
63
17.8%
28
 
7.9%
26
 
7.3%
1 26
 
7.3%
- 22
 
6.2%
3 15
 
4.2%
14
 
4.0%
12
 
3.4%
2 12
 
3.4%
8 11
 
3.1%
Other values (38) 125
35.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 162
45.8%
Decimal Number 107
30.2%
Space Separator 63
 
17.8%
Dash Punctuation 22
 
6.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
17.3%
26
16.0%
14
 
8.6%
12
 
7.4%
7
 
4.3%
6
 
3.7%
6
 
3.7%
5
 
3.1%
5
 
3.1%
5
 
3.1%
Other values (26) 48
29.6%
Decimal Number
ValueCountFrequency (%)
1 26
24.3%
3 15
14.0%
2 12
11.2%
8 11
10.3%
4 10
 
9.3%
5 8
 
7.5%
0 8
 
7.5%
6 7
 
6.5%
7 6
 
5.6%
9 4
 
3.7%
Space Separator
ValueCountFrequency (%)
63
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 192
54.2%
Hangul 162
45.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
17.3%
26
16.0%
14
 
8.6%
12
 
7.4%
7
 
4.3%
6
 
3.7%
6
 
3.7%
5
 
3.1%
5
 
3.1%
5
 
3.1%
Other values (26) 48
29.6%
Common
ValueCountFrequency (%)
63
32.8%
1 26
13.5%
- 22
 
11.5%
3 15
 
7.8%
2 12
 
6.2%
8 11
 
5.7%
4 10
 
5.2%
5 8
 
4.2%
0 8
 
4.2%
6 7
 
3.6%
Other values (2) 10
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 192
54.2%
Hangul 162
45.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
63
32.8%
1 26
13.5%
- 22
 
11.5%
3 15
 
7.8%
2 12
 
6.2%
8 11
 
5.7%
4 10
 
5.2%
5 8
 
4.2%
0 8
 
4.2%
6 7
 
3.6%
Other values (2) 10
 
5.2%
Hangul
ValueCountFrequency (%)
28
17.3%
26
16.0%
14
 
8.6%
12
 
7.4%
7
 
4.3%
6
 
3.7%
6
 
3.7%
5
 
3.1%
5
 
3.1%
5
 
3.1%
Other values (26) 48
29.6%

공사시기
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)19.2%
Missing0
Missing (%)0.0%
Memory size340.0 B
~2022-12
18 
~2023-12
~2022-04
~2022-03
 
1
~2022-07
 
1

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique2 ?
Unique (%)7.7%

Sample

1st row~2023-12
2nd row~2022-03
3rd row~2022-04
4th row~2022-04
5th row~2022-12

Common Values

ValueCountFrequency (%)
~2022-12 18
69.2%
~2023-12 4
 
15.4%
~2022-04 2
 
7.7%
~2022-03 1
 
3.8%
~2022-07 1
 
3.8%

Length

2023-12-12T20:57:42.803713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:57:42.975123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-12 18
69.2%
2023-12 4
 
15.4%
2022-04 2
 
7.7%
2022-03 1
 
3.8%
2022-07 1
 
3.8%

시 행 처
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)26.9%
Missing0
Missing (%)0.0%
Memory size340.0 B
문경시장(상수도사업소장)
영남에너지서비스㈜
한국전력공사
문경시장(도시과장)
문경시장(하수도사업소장)
Other values (2)

Length

Max length13
Median length10
Mean length10
Min length6

Unique

Unique2 ?
Unique (%)7.7%

Sample

1st row문경시장(상수도사업소장)
2nd row㈜우리종합건설
3rd row영남에너지서비스㈜
4th row영남에너지서비스㈜
5th row문경시장(도시과장)

Common Values

ValueCountFrequency (%)
문경시장(상수도사업소장) 8
30.8%
영남에너지서비스㈜ 6
23.1%
한국전력공사 5
19.2%
문경시장(도시과장) 3
 
11.5%
문경시장(하수도사업소장) 2
 
7.7%
㈜우리종합건설 1
 
3.8%
SK텔레콤주식회사 1
 
3.8%

Length

2023-12-12T20:57:43.181284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:57:43.380382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
문경시장(상수도사업소장 8
30.8%
영남에너지서비스㈜ 6
23.1%
한국전력공사 5
19.2%
문경시장(도시과장 3
 
11.5%
문경시장(하수도사업소장 2
 
7.7%
㈜우리종합건설 1
 
3.8%
sk텔레콤주식회사 1
 
3.8%

비 고
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size340.0 B
허가예정
22 
<NA>

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row허가예정

Common Values

ValueCountFrequency (%)
허가예정 22
84.6%
<NA> 4
 
15.4%

Length

2023-12-12T20:57:43.556651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:57:43.657780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
허가예정 22
84.6%
na 4
 
15.4%

Interactions

2023-12-12T20:57:39.713396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:57:43.746427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번허가번호허가일자공 사 명굴착위치공사시기시 행 처
연번1.0000.4220.4220.9161.0000.6390.673
허가번호0.4221.0001.0001.0001.0000.9510.530
허가일자0.4221.0001.0001.0001.0000.9510.530
공 사 명0.9161.0001.0001.0001.0000.9401.000
굴착위치1.0001.0001.0001.0001.0001.0001.000
공사시기0.6390.9510.9510.9401.0001.0000.844
시 행 처0.6730.5300.5301.0001.0000.8441.000
2023-12-12T20:57:43.897633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비 고허가번호허가일자시 행 처공사시기
비 고1.0001.0001.0001.0001.000
허가번호1.0001.0001.0000.3440.685
허가일자1.0001.0001.0000.3440.685
시 행 처1.0000.3440.3441.0000.709
공사시기1.0000.6850.6850.7091.000
2023-12-12T20:57:44.020548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번허가번호허가일자공사시기시 행 처비 고
연번1.0000.0000.0000.3080.5181.000
허가번호0.0001.0001.0000.6850.3441.000
허가일자0.0001.0001.0000.6850.3441.000
공사시기0.3080.6850.6851.0000.7091.000
시 행 처0.5180.3440.3440.7091.0001.000
비 고1.0001.0001.0001.0001.0001.000

Missing values

2023-12-12T20:57:39.878640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:57:40.062949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번허가번호허가일자공 사 명굴착위치공사시기시 행 처비 고
012022 012022-02-17문경시 지방상수도 현대화사업 블록시스템 구축공사흥덕동 307 일원~2023-12문경시장(상수도사업소장)<NA>
122022 022022-02-25우오수 관로매설신기동1139-1 일원~2022-03㈜우리종합건설<NA>
232022 032022-03-14문경시 점촌동 4535-14번지 외 6개소 도시가스 배관 매설점촌동 435-14 일원~2022-04영남에너지서비스㈜<NA>
342022 042022-03-24문경시 모전동 1308번지 외 1개소 도시가스 배관 매설모전동 1308 일원~2022-04영남에너지서비스㈜<NA>
45도로관리심의회심의안건문경파출소 옆 도시계획도로(소로2-19) 개설공사문경읍 상리 445-4 일원~2022-12문경시장(도시과장)허가예정
56도로관리심의회심의안건점촌2동 영동할인마트 뒤 도시계획도로 개설공사(소로3-342,343)점촌동 138-8 일원~2022-12문경시장(도시과장)허가예정
67도로관리심의회심의안건점촌4동 점촌북초등학교 도시계획도로 개설공사(소로3-188)유곡동 542-6 일원~2022-12문경시장(도시과장)허가예정
78도로관리심의회심의안건지방상수도 현대화사업 블록시스템 구축공사(동지역)흥덕동 727-3 일원~2022-12문경시장(상수도사업소장)허가예정
89도로관리심의회심의안건지방상수도 현대화사업 블록시스템 구축공사(동지역)흥덕동 716-13 일원~2023-12문경시장(상수도사업소장)허가예정
910도로관리심의회심의안건지방상수도 현대화사업 블록시스템 구축공사(문경읍,중로2-6)문경읍 하리 84-10 일원~2023-12문경시장(상수도사업소장)허가예정
연번허가번호허가일자공 사 명굴착위치공사시기시 행 처비 고
1617도로관리심의회심의안건이천~문경 철도건설(9공구) 지장 배전설비 지중화공사(마원리)문경읍 마원리 338-1 일원~2022-12한국전력공사허가예정
1718도로관리심의회심의안건이천~문경 철도건설(9공구) 지장 배전설비 지중화공사(남호리_1)마성면 남호리 248 일원~2022-12한국전력공사허가예정
1819도로관리심의회심의안건이천~문경 철도건설(9공구) 지장 배전설비 지중화공사(남호리_2)마성면 남호리 212-10 일원~2022-12한국전력공사허가예정
1920도로관리심의회심의안건이천~문경 철도건설(9공구) 지장 배전설비 지중화공사(외어리_1)마성면 외어리 1181-14 일원~2022-12한국전력공사허가예정
2021도로관리심의회심의안건이천~문경 철도건설(9공구) 지장 배전설비 지중화공사(마원리_2)문경읍 마원리 238-8 일원~2022-12한국전력공사허가예정
2122도로관리심의회심의안건흥덕동 흥덕공원 일원 저압배관공사흥덕동 816 일원~2022-12영남에너지서비스㈜허가예정
2223도로관리심의회심의안건모전동 새동네4길 일원 배관공사모전동 119-3 일원~2022-12영남에너지서비스㈜허가예정
2324도로관리심의회심의안건점촌4동 일원 중압배관공사(3차)신기동 252-51 일원~2022-12영남에너지서비스㈜허가예정
2425도로관리심의회심의안건점촌4동 일원 저압배관공사(1차)신기동 990-4 일원~2022-12영남에너지서비스㈜허가예정
2526도로관리심의회심의안건점촌동 156-3번지 일원 문경국사 삼원화 지중화공사점촌동 156-3 일원~2022-07SK텔레콤주식회사허가예정