Overview

Dataset statistics

Number of variables: 10
Number of observations: 25
Missing cells: 30
Missing cells (%): 12.0%
Duplicate rows: 8
Duplicate rows (%): 32.0%
Total size in memory: 2.1 KiB
Average record size in memory: 85.3 B
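The two percentages follow from the counts. A minimal sketch of the arithmetic, assuming missing cells are measured against all cells (variables × observations) and duplicate rows against all observations; the variable names are illustrative only:

```python
# Counts as reported in the dataset statistics.
n_variables = 10
n_observations = 25
missing_cells = 30
duplicate_rows = 8

# Assumed denominators: all cells for missing values, all rows for duplicates.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

Under these assumptions the results match the reported 12.0% and 32.0%.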

Variable types

Text: 1
Categorical: 1
Unsupported: 8

Dataset

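Description: Statistics on water pipes by years since installation. The data give the total length of pipeline that is 1~5, 6~10, 11~15, 16~20, 21~25, 26~30, and 31 or more years past installation, classified by pipe use and pipe material.
URL: https://www.data.go.kr/data/15081149/fileData.do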
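Since the dataset reports total pipe length per age band, each row's total should equal the sum of its seven bands. A minimal sketch of that check for the grand-total row (합 계), using the values as read from the report's Sample section; the column order (total first, then the bands from 1~5 to 31+ years) is an assumption:

```python
from decimal import Decimal

# Row "합 계" (grand total) as read from the report's sample table.
# Assumed column order: total, then the seven age bands from 1-5 to 31+ years.
total = Decimal("3963483.91")
age_bands = [Decimal(v) for v in (
    "392627.72", "500898.38", "613547.69", "606122.07",
    "563349.06", "540986.53", "745952.46",
)]

# Decimal keeps the two-decimal lengths exact, so equality is safe here.
bands_sum = sum(age_bands)
print(bands_sum == total)
```

Decimal is used instead of float so that the comparison is not affected by binary rounding.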

Alerts

Dataset has 8 (32.0%) duplicate rows [Duplicates]
경년별 수도관 has 17 (68.0%) missing values [Missing]
Unnamed: 2 has 2 (8.0%) missing values [Missing]
Unnamed: 3 has 1 (4.0%) missing values [Missing]
Unnamed: 4 has 2 (8.0%) missing values [Missing]
Unnamed: 5 has 2 (8.0%) missing values [Missing]
Unnamed: 6 has 1 (4.0%) missing values [Missing]
Unnamed: 7 has 2 (8.0%) missing values [Missing]
Unnamed: 8 has 2 (8.0%) missing values [Missing]
Unnamed: 9 has 1 (4.0%) missing values [Missing]
Unnamed: 2 is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
Unnamed: 3 is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
Unnamed: 4 is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
Unnamed: 5 is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
Unnamed: 6 is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
Unnamed: 7 is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
Unnamed: 8 is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
Unnamed: 9 is an unsupported type; check if it needs cleaning or further analysis [Unsupported]
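The Unsupported alerts are consistent with columns that mix header text and numbers. This is an assumption based on the Sample section, where rows 0 to 2 hold labels such as 1∼5년 and later rows hold lengths. A minimal cleaning sketch in plain Python; the helper name to_number and the example cells are hypothetical:

```python
from typing import Optional

def to_number(cell: object) -> Optional[float]:
    """Return the cell as a float, or None when it is not numeric."""
    try:
        return float(cell)
    except (TypeError, ValueError):
        # TypeError covers None; ValueError covers label text.
        return None

# Illustrative cells mixing header text and measurements.
column = ["1∼5년", 392627.72, 0, None, "단위 : m"]
cleaned = [to_number(c) for c in column]
print(cleaned)
```

After a conversion like this, label rows become missing values and the column holds a single numeric type.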

Reproduction

Analysis started: 2023-12-12 22:35:17.029499
Analysis finished: 2023-12-12 22:35:17.560680
Duration: 0.53 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

경년별 수도관
Text

MISSING 

Distinct: 5
Distinct (%): 62.5%
Missing: 17
Missing (%): 68.0%
Memory size: 332.0 B

Length

Max length: 10
Median length: 3
Mean length: 4.75
Min length: 3

Characters and Unicode

Total characters: 38
Distinct characters: 10
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
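These character properties can be read with Python's standard unicodedata module. The sketch below rebuilds the eight non-missing values of this variable and counts their general categories. It assumes the two header-like values are two Hangul syllables separated by eight spaces (10 characters each), which is consistent with the length and character statistics in this section:

```python
import unicodedata
from collections import Counter

# Assumption: the two header-like values hold two Hangul syllables separated
# by eight spaces (10 characters), consistent with the reported max length.
values = [
    "구" + " " * 8 + "분",
    "합" + " " * 8 + "계",
    "도수관", "도수관", "배수관", "배수관", "급수관", "급수관",
]

text = "".join(values)
# General category codes: "Lo" is Other Letter, "Zs" is Space Separator.
categories = Counter(unicodedata.category(ch) for ch in text)

print(len(text))         # total characters
print(len(set(text)))    # distinct characters
print(dict(categories))  # characters per general category
```

Under that assumption the counts agree with the report: 38 characters in total, 10 distinct, split into 22 Other Letter and 16 Space Separator.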

Unique

Unique: 2
Unique (%): 25.0%

Sample

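1st row: 구 분 (category)
2nd row: 합 계 (total)
3rd row: 도수관 (conveyance pipe)
4th row: 도수관 (conveyance pipe)
5th row: 배수관 (distribution pipe)

Value | Count | Frequency (%)
도수관 | 2 | 20.0%
배수관 | 2 | 20.0%
급수관 | 2 | 20.0%
구 | 1 | 10.0%
분 | 1 | 10.0%
합 | 1 | 10.0%
계 | 1 | 10.0%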

Most occurring characters

Value | Count | Frequency (%)
(space) | 16 | 42.1%
수 | 6 | 15.8%
관 | 6 | 15.8%
도 | 2 | 5.3%
배 | 2 | 5.3%
급 | 2 | 5.3%
구 | 1 | 2.6%
분 | 1 | 2.6%
합 | 1 | 2.6%
계 | 1 | 2.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 22 | 57.9%
Space Separator | 16 | 42.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
수 | 6 | 27.3%
관 | 6 | 27.3%
도 | 2 | 9.1%
배 | 2 | 9.1%
급 | 2 | 9.1%
구 | 1 | 4.5%
분 | 1 | 4.5%
합 | 1 | 4.5%
계 | 1 | 4.5%

Space Separator
Value | Count | Frequency (%)
(space) | 16 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 22 | 57.9%
Common | 16 | 42.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
수 | 6 | 27.3%
관 | 6 | 27.3%
도 | 2 | 9.1%
배 | 2 | 9.1%
급 | 2 | 9.1%
구 | 1 | 4.5%
분 | 1 | 4.5%
합 | 1 | 4.5%
계 | 1 | 4.5%

Common
Value | Count | Frequency (%)
(space) | 16 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 22 | 57.9%
ASCII | 16 | 42.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 16 | 100.0%

Hangul
Value | Count | Frequency (%)
수 | 6 | 27.3%
관 | 6 | 27.3%
도 | 2 | 9.1%
배 | 2 | 9.1%
급 | 2 | 9.1%
구 | 1 | 4.5%
분 | 1 | 4.5%
합 | 1 | 4.5%
계 | 1 | 4.5%

Unnamed: 1
Categorical

Distinct: 10
Distinct (%): 40.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B

Length

Max length: 12
Median length: 10
Mean length: 6.36
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: 소계

Common Values

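Value | English | Count | Frequency (%)
<NA> | (missing) | 4 | 16.0%
소계 | subtotal | 3 | 12.0%
닥타일주철관 | ductile cast iron pipe | 3 | 12.0%
도복장강관 | coated steel pipe | 3 | 12.0%
RC(철근콘크리트관) | reinforced concrete pipe | 2 | 8.0%
PVC(경화염화비닐관) | rigid PVC pipe | 2 | 8.0%
PE(폴리에틸렌관) | polyethylene pipe | 2 | 8.0%
내충격수도관 | impact-resistant water pipe | 2 | 8.0%
STS(스테인레스관) | stainless steel pipe | 2 | 8.0%
기타 | other | 2 | 8.0%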

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 4 | 16.0%
소계 | 3 | 12.0%
닥타일주철관 | 3 | 12.0%
도복장강관 | 3 | 12.0%
rc(철근콘크리트관 | 2 | 8.0%
pvc(경화염화비닐관 | 2 | 8.0%
pe(폴리에틸렌관 | 2 | 8.0%
내충격수도관 | 2 | 8.0%
sts(스테인레스관 | 2 | 8.0%
기타 | 2 | 8.0%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): 8.0%
Memory size: 332.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 1
Missing (%): 4.0%
Memory size: 332.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): 8.0%
Memory size: 332.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): 8.0%
Memory size: 332.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 1
Missing (%): 4.0%
Memory size: 332.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): 8.0%
Memory size: 332.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): 8.0%
Memory size: 332.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 1
Missing (%): 4.0%
Memory size: 332.0 B

Correlations

Variable | 경년별 수도관 | Unnamed: 1
경년별 수도관 | 1.000 | 0.000
Unnamed: 1 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

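First rows

Row | 경년별 수도관 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9
0 | <NA> | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | 단위 : m
1 | 구 분 | <NA> | 부 설 경 년 별 | NaN | NaN | NaN | NaN | NaN | NaN
2 | <NA> | <NA> | NaN | 1∼5년 | 6∼10년 | 11∼15년 | 16∼20년 | 21~25년 | 26~30년 | 31년이상
3 | 합 계 | <NA> | 3963483.91 | 392627.72 | 500898.38 | 613547.69 | 606122.07 | 563349.06 | 540986.53 | 745952.46
4 | 도수관 | 소계 | 23085.75 | 0 | 0 | 0 | 1360.15 | 1511.28 | 4.8 | 20209.52
5 | 도수관 | 닥타일주철관 | 1533.25 | 0 | 0 | 0 | 0 | 1276.51 | 4.8 | 251.94
6 | <NA> | 도복장강관 | 17700.5 | 0 | 0 | 0 | 1360.15 | 234.77 | 0 | 16105.58
7 | <NA> | RC(철근콘크리트관) | 3852 | 0 | 0 | 0 | 0 | 0 | 0 | 3852
8 | 배수관 | 소계 | 2674925.54 | 279910.04 | 328230.46 | 460676.39 | 381160.1 | 349076.13 | 312459.94 | 563412.48
9 | 배수관 | 닥타일주철관 | 2006856.69 | 239527.42 | 277260.69 | 376016.47 | 269741.81 | 209890.29 | 171549.89 | 462870.12

Last rows

Row | 경년별 수도관 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | Unnamed: 9
15 | <NA> | 기타 | 3104.71 | 3104.71 | 0 | 0 | 0 | 0 | 0 | 0
16 | 급수관 | 소계 | 1265472.62 | 112717.68 | 172667.92 | 152871.3 | 223601.82 | 212761.65 | 228521.79 | 162330.46
17 | 급수관 | 닥타일주철관 | 221.69 | 178.66 | 39.03 | 4 | 0 | 0 | 0 | 0
18 | <NA> | 도복장강관 | 13825.07 | 3553.42 | 6893.8 | 593.35 | 42.23 | 432.91 | 2098.65 | 210.71
19 | <NA> | PVC(경화염화비닐관) | 120010.47 | 415.18 | 183.4 | 127.71 | 41.71 | 887.34 | 4288.93 | 114066.2
20 | <NA> | PE(폴리에틸렌관) | 106999.2 | 513.99 | 124.76 | 1543.6 | 23416.66 | 28730.37 | 33027.93 | 19641.89
21 | <NA> | 내충격수도관 | 3500.03 | 2565.95 | 461.32 | 352.18 | 75.71 | 36 | 8.87 | 0
22 | <NA> | STS(스테인레스관) | 1020748.16 | 105443.48 | 164844.56 | 150750.51 | 199525.51 | 182675.03 | 189097.41 | 28411.66
23 | <NA> | RC(철근콘크리트관) | 155 | 47 | 108 | 0 | 0 | 0 | 0 | 0
24 | <NA> | 기타 | 13 | 0 | 13 | 0 | 0 | 0 | 0 | 0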
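The sample suggests the source sheet used merged cells: 경년별 수도관 is filled on the first rows of each pipe group and is <NA> on the rows that follow. A minimal sketch of forward-filling that label in plain Python, using (경년별 수도관, Unnamed: 1) pairs copied from sample rows 16 to 19; the helper name forward_fill is hypothetical:

```python
from typing import List, Optional, Tuple

Row = Tuple[Optional[str], str]

def forward_fill(rows: List[Row]) -> List[Row]:
    """Replace a missing group label with the most recent non-missing one."""
    filled = []
    last_label = None
    for label, material in rows:
        if label is not None:
            last_label = label
        filled.append((last_label, material))
    return filled

# Pairs from sample rows 16-19; None stands for <NA>.
rows = [
    ("급수관", "소계"),
    ("급수관", "닥타일주철관"),
    (None, "도복장강관"),
    (None, "PVC(경화염화비닐관)"),
]

for label, material in forward_fill(rows):
    print(label, material)
```

With the labels filled in, rows that share a material but belong to different pipe groups no longer carry the same (label, material) pair.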

Duplicate rows

Most frequently occurring

Row | 경년별 수도관 | Unnamed: 1 | # duplicates
6 | <NA> | 도복장강관 | 3
0 | <NA> | PE(폴리에틸렌관) | 2
1 | <NA> | PVC(경화염화비닐관) | 2
2 | <NA> | RC(철근콘크리트관) | 2
3 | <NA> | STS(스테인레스관) | 2
4 | <NA> | 기타 | 2
5 | <NA> | 내충격수도관 | 2
7 | <NA> | <NA> | 2
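The duplicates table lists only 경년별 수도관 and Unnamed: 1, which suggests the check ran on those two columns and left out the eight Unsupported ones (an assumption, not confirmed by the report). A sketch of counting repeated key pairs with collections.Counter, using a few pairs copied from sample rows 6, 7, 18, 19 and 23. This is not the full 25-row dataset, so the counts differ from the table:

```python
from collections import Counter

# (경년별 수도관, Unnamed: 1) pairs from five sample rows; None stands for <NA>.
pairs = [
    (None, "도복장강관"),
    (None, "RC(철근콘크리트관)"),
    (None, "도복장강관"),
    (None, "PVC(경화염화비닐관)"),
    (None, "RC(철근콘크리트관)"),
]

counts = Counter(pairs)
# Keep only key pairs that occur more than once.
duplicates = {pair: n for pair, n in counts.items() if n > 1}

for (label, material), n in duplicates.items():
    print(label, material, n)
```

Rows flagged this way need not be true duplicates: their length columns differ, and they collide only because the group label is missing.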