Overview

Dataset statistics

Number of variables: 9
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 810.5 KiB
Average record size in memory: 83.0 B
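
The average record size follows from the total memory figure and the row count. A minimal check, assuming 1 KiB = 1024 bytes (variable names are illustrative):

```python
# Figures copied from the dataset statistics above.
total_kib = 810.5
n_rows = 10_000

# Convert KiB to bytes, then divide by the number of observations.
bytes_per_record = total_kib * 1024 / n_rows
print(f"{bytes_per_record:.1f} B per record")
```

Rounded to one decimal place, the result agrees with the reported 83.0 B.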

Variable types

Numeric: 3
Categorical: 6

Dataset

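Description: Manhole data for Uijeongbu-si, Gyeonggi-do. The fields are the management number (관리번호), manhole type (맨홀형식), manhole name (맨홀명칭), input date (입력일자), input operator (입력자), last-modified date (최종수정일자), last modifier (최종수정자), longitude (경도), and latitude (위도).
Author: Uijeongbu-si, Gyeonggi-do (경기도 의정부시)
URL: https://www.data.go.kr/data/15062026/fileData.do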

Alerts

High correlation: 맨홀명칭 is highly overall correlated with 관리번호 and 1 other field
High correlation: 맨홀형식 is highly overall correlated with 관리번호 and 1 other field
High correlation: 입력일자 is highly overall correlated with 입력자 and 2 other fields
High correlation: 최종수정자 is highly overall correlated with 입력일자 and 2 other fields
High correlation: 최종수정일자 is highly overall correlated with 입력일자 and 2 other fields
High correlation: 입력자 is highly overall correlated with 입력일자 and 2 other fields
High correlation: 관리번호 is highly overall correlated with 맨홀형식 and 1 other field
Imbalance: 맨홀명칭 is highly imbalanced (68.5%)
Imbalance: 입력일자 is highly imbalanced (78.4%)
Imbalance: 입력자 is highly imbalanced (77.3%)
Imbalance: 최종수정일자 is highly imbalanced (78.4%)
Imbalance: 최종수정자 is highly imbalanced (76.6%)
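
The report does not say how the imbalance percentages are computed. The sketch below assumes the score is one minus the Shannon entropy of the category counts, normalized by the log of the number of categories. That assumption reproduces the alert values from the category counts listed in this report. The function name `imbalance_score` is illustrative.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / log2(number of categories)).

    Assumed formula: 0 means perfectly balanced, 1 means a single category.
    """
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(counts))

# Category counts copied from the Common Values tables in this report.
counts_by_column = {
    "맨홀명칭": [8393, 639, 369, 293, 124, 100, 61, 19, 2],
    "입력일자": [8868, 895, 216, 13, 4, 2, 2],
    "입력자": [8868, 1111, 17, 2, 2],
    "최종수정자": [8868, 895, 216, 17, 2, 2],
}

for column, counts in counts_by_column.items():
    print(f"{column}: {imbalance_score(counts):.1%}")
```

Under this assumption a column dominated by one value, such as 입력일자 with 88.7% of rows holding the same string, scores close to 1.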

Reproduction

Analysis started: 2023-12-12 19:38:49.349694
Analysis finished: 2023-12-12 19:38:51.765066
Duration: 2.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
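
The reported duration is the difference between the two timestamps. A small check using only the Python standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-12 19:38:49.349694")
finished = datetime.fromisoformat("2023-12-12 19:38:51.765066")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```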

Variables

관리번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9994
Distinct (%): 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29707633
Minimum: 2
Maximum: 2.1112901 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2
5-th percentile: 1122.95
Q1: 5278.5
Median: 10650.5
Q3: 15918.25
95-th percentile: 1.9113116 × 10^8
Maximum: 2.1112901 × 10^9
Range: 2.1112901 × 10^9
Interquartile range (IQR): 10639.75

Descriptive statistics

Standard deviation: 1.3025228 × 10^8
Coefficient of variation (CV): 4.3844718
Kurtosis: 191.0117
Mean: 29707633
Median Absolute Deviation (MAD): 5332
Skewness: 12.380471
Sum: 2.9707633 × 10^11
Variance: 1.6965657 × 10^16
Monotonicity: Not monotonic
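
Several of the 관리번호 statistics are derived from others by simple arithmetic. The sketch below recomputes them from the rounded values printed in this report, so small differences in the last digits are expected. Variable names are illustrative.

```python
# Values copied from the 관리번호 statistics in this report (already rounded).
n = 10_000
mean = 29_707_633
std = 1.3025228e8
q1, q3 = 5278.5, 15918.25

iqr = q3 - q1          # interquartile range
cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the squared standard deviation
total = mean * n       # sum of all values

print(f"IQR      = {iqr}")
print(f"CV       = {cv:.7f}")
print(f"Variance = {variance:.7e}")
print(f"Sum      = {total:.7e}")
```

The mean (about 2.97 × 10^7) is far above the median (10650.5), which is consistent with the reported skewness of 12.38: a small share of very large identifiers pulls the mean upward.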
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
181112 | 2 | < 0.1%
181116 | 2 | < 0.1%
181113 | 2 | < 0.1%
181109 | 2 | < 0.1%
180007 | 2 | < 0.1%
181105 | 2 | < 0.1%
4754 | 1 | < 0.1%
14963 | 1 | < 0.1%
5445 | 1 | < 0.1%
5594 | 1 | < 0.1%
Other values (9984) | 9984 | 99.8%

Minimum 10 values

Value | Count | Frequency (%)
2 | 1 | < 0.1%
3 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
12 | 1 | < 0.1%
13 | 1 | < 0.1%
16 | 1 | < 0.1%
20 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
2111290055 | 1 | < 0.1%
2111290053 | 1 | < 0.1%
2111290045 | 1 | < 0.1%
2111290043 | 1 | < 0.1%
2111290042 | 1 | < 0.1%
2111290041 | 1 | < 0.1%
2111290040 | 1 | < 0.1%
2111290036 | 1 | < 0.1%
2111290035 | 1 | < 0.1%
2111290034 | 1 | < 0.1%

맨홀형식
Categorical

HIGH CORRELATION 

Distinct: 14
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

AZB005: 5150
AZB007: 1385
AZB006: 1293
AZB015: 639
AZB004: 387
Other values (9): 1146

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: AZB004
2nd row: AZB005
3rd row: AZB006
4th row: AZB005
5th row: AZB005

Common Values

Value | Count | Frequency (%)
AZB005 | 5150 | 51.5%
AZB007 | 1385 | 13.9%
AZB006 | 1293 | 12.9%
AZB015 | 639 | 6.4%
AZB004 | 387 | 3.9%
AZB999 | 369 | 3.7%
AZB012 | 293 | 2.9%
AZB002 | 149 | 1.5%
AZB013 | 124 | 1.2%
AZB010 | 100 | 1.0%
Other values (4) | 111 | 1.1%

Length

Histogram of lengths of the category

맨홀명칭
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 9
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

공동구맨홀: 8393
통신맨홀: 639
기타맨홀: 369
전기맨홀: 293
하수맨홀: 124
Other values (4): 182

Length

Max length: 6
Median length: 5
Mean length: 4.8431
Min length: 4
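
The length statistics can be recomputed from the category counts listed under Common Values for this variable. A minimal sketch, assuming length is counted in Unicode code points (one per precomposed Hangul syllable):

```python
# Category counts for 맨홀명칭, copied from this report.
name_counts = {
    "공동구맨홀": 8393,
    "통신맨홀": 639,
    "기타맨홀": 369,
    "전기맨홀": 293,
    "하수맨홀": 124,
    "가스맨홀": 100,
    "전화맨홀": 61,
    "지역난방맨홀": 19,
    "상수맨홀": 2,
}

total_rows = sum(name_counts.values())
# Weight each name's character count by how many rows carry that name.
mean_length = sum(len(name) * count for name, count in name_counts.items()) / total_rows
min_length = min(len(name) for name in name_counts)
max_length = max(len(name) for name in name_counts)

print(total_rows, min_length, max_length, round(mean_length, 4))
```

The weighted mean works out to 48431 / 10000, which agrees with the reported mean length of 4.8431.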

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공동구맨홀
2nd row: 공동구맨홀
3rd row: 공동구맨홀
4th row: 공동구맨홀
5th row: 공동구맨홀

Common Values

Value | Count | Frequency (%)
공동구맨홀 | 8393 | 83.9%
통신맨홀 | 639 | 6.4%
기타맨홀 | 369 | 3.7%
전기맨홀 | 293 | 2.9%
하수맨홀 | 124 | 1.2%
가스맨홀 | 100 | 1.0%
전화맨홀 | 61 | 0.6%
지역난방맨홀 | 19 | 0.2%
상수맨홀 | 2 | < 0.1%

Length

Histogram of lengths of the category


입력일자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

확인불가: 8868
2019-12-16: 895
2020-11-13: 216
2021-04-15: 13
2021-03-04: 4
Other values (2): 4

Length

Max length: 10
Median length: 4
Mean length: 4.6792
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 확인불가
2nd row: 확인불가
3rd row: 확인불가
4th row: 확인불가
5th row: 확인불가

Common Values

Value | Count | Frequency (%)
확인불가 | 8868 | 88.7%
2019-12-16 | 895 | 8.9%
2020-11-13 | 216 | 2.2%
2021-04-15 | 13 | 0.1%
2021-03-04 | 4 | < 0.1%
2021-01-18 | 2 | < 0.1%
2020-01-13 | 2 | < 0.1%
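
The report counts no missing cells, yet 88.7% of 입력일자 values are the placeholder string 확인불가 (roughly "cannot be confirmed"), which is also why the column is typed as categorical rather than as a date. The sketch below shows one way to treat the placeholder as missing when parsing. The helper name `parse_input_date` is hypothetical, and ISO `YYYY-MM-DD` format is assumed for every other value, as in the table above.

```python
from datetime import date

SENTINEL = "확인불가"  # placeholder string used in the data for unknown values

def parse_input_date(raw):
    """Return a date for ISO-formatted strings, or None for the placeholder."""
    if raw == SENTINEL:
        return None
    return date.fromisoformat(raw)

# Value counts for 입력일자, copied from this report.
value_counts = {
    "확인불가": 8868,
    "2019-12-16": 895,
    "2020-11-13": 216,
    "2021-04-15": 13,
    "2021-03-04": 4,
    "2021-01-18": 2,
    "2020-01-13": 2,
}

known_rows = sum(c for v, c in value_counts.items() if parse_input_date(v) is not None)
unknown_rows = value_counts[SENTINEL]
print(f"known dates: {known_rows}, unknown: {unknown_rows}")
```

Read this way, only 1132 of the 10000 rows carry a usable input date.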

Length

Histogram of lengths of the category


입력자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 5
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

확인불가: 8868
지오스토리컨소시엄: 1111
(주)유현정보기술: 17
디에스정보기술: 2
제일항업(주): 2

Length

Max length: 9
Median length: 4
Mean length: 4.5652
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 확인불가
2nd row: 확인불가
3rd row: 확인불가
4th row: 확인불가
5th row: 확인불가

Common Values

Value | Count | Frequency (%)
확인불가 | 8868 | 88.7%
지오스토리컨소시엄 | 1111 | 11.1%
(주)유현정보기술 | 17 | 0.2%
디에스정보기술 | 2 | < 0.1%
제일항업(주) | 2 | < 0.1%

Length

Histogram of lengths of the category


최종수정일자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

확인불가: 8868
2019-12-16: 895
2020-11-13: 216
2021-04-15: 13
2021-03-04: 4
Other values (2): 4

Length

Max length: 10
Median length: 4
Mean length: 4.6792
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 확인불가
2nd row: 확인불가
3rd row: 확인불가
4th row: 확인불가
5th row: 확인불가

Common Values

Value | Count | Frequency (%)
확인불가 | 8868 | 88.7%
2019-12-16 | 895 | 8.9%
2020-11-13 | 216 | 2.2%
2021-04-15 | 13 | 0.1%
2021-03-04 | 4 | < 0.1%
2021-01-18 | 2 | < 0.1%
2020-01-13 | 2 | < 0.1%

Length

Histogram of lengths of the category


최종수정자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

확인불가: 8868
컨소시엄: 895
지오스토리컨소시엄: 216
유현정보: 17
디에스정보기술: 2

Length

Max length: 9
Median length: 4
Mean length: 4.1086
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 확인불가
2nd row: 확인불가
3rd row: 확인불가
4th row: 확인불가
5th row: 확인불가

Common Values

Value | Count | Frequency (%)
확인불가 | 8868 | 88.7%
컨소시엄 | 895 | 8.9%
지오스토리컨소시엄 | 216 | 2.2%
유현정보 | 17 | 0.2%
디에스정보기술 | 2 | < 0.1%
제일항업 | 2 | < 0.1%

Length

Histogram of lengths of the category


경도
Real number (ℝ)

Distinct: 9877
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 127.05948
Minimum: 127.00597
Maximum: 127.12002
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 127.00597
5-th percentile: 127.03163
Q1: 127.0425
Median: 127.05234
Q3: 127.07771
95-th percentile: 127.10257
Maximum: 127.12002
Range: 0.1140533
Interquartile range (IQR): 0.035209725

Descriptive statistics

Standard deviation: 0.022788578
Coefficient of variation (CV): 0.00017935363
Kurtosis: -0.48099484
Mean: 127.05948
Median Absolute Deviation (MAD): 0.013539
Skewness: 0.67954737
Sum: 1270594.8
Variance: 0.0005193193
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
127.0415376 | 4 | < 0.1%
127.0462266 | 3 | < 0.1%
127.0398652 | 2 | < 0.1%
127.0462482 | 2 | < 0.1%
127.0565097 | 2 | < 0.1%
127.0352701 | 2 | < 0.1%
127.0561813 | 2 | < 0.1%
127.0570886 | 2 | < 0.1%
127.046258 | 2 | < 0.1%
127.0440091 | 2 | < 0.1%
Other values (9867) | 9977 | 99.8%

Minimum 10 values

Value | Count | Frequency (%)
127.0059703 | 1 | < 0.1%
127.0060036 | 1 | < 0.1%
127.0070145 | 1 | < 0.1%
127.0089528 | 1 | < 0.1%
127.0097411 | 1 | < 0.1%
127.0109067 | 1 | < 0.1%
127.0109129 | 1 | < 0.1%
127.0111009 | 1 | < 0.1%
127.0117759 | 1 | < 0.1%
127.0118456 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
127.1200236 | 1 | < 0.1%
127.1199977 | 1 | < 0.1%
127.119975 | 1 | < 0.1%
127.1199725 | 1 | < 0.1%
127.119954 | 1 | < 0.1%
127.1199534 | 1 | < 0.1%
127.1199367 | 1 | < 0.1%
127.1199219 | 1 | < 0.1%
127.119373 | 1 | < 0.1%
127.1192986 | 1 | < 0.1%

위도
Real number (ℝ)

Distinct: 9947
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.741083
Minimum: 37.687382
Maximum: 37.776857
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 37.687382
5-th percentile: 37.716192
Q1: 37.734015
Median: 37.742978
Q3: 37.749718
95-th percentile: 37.758553
Maximum: 37.776857
Range: 0.08947422
Interquartile range (IQR): 0.015702448

Descriptive statistics

Standard deviation: 0.012465944
Coefficient of variation (CV): 0.0003303017
Kurtosis: 1.331087
Mean: 37.741083
Median Absolute Deviation (MAD): 0.007708395
Skewness: -0.89911051
Sum: 377410.83
Variance: 0.00015539975
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
37.73444566 | 2 | < 0.1%
37.75586062 | 2 | < 0.1%
37.75302819 | 2 | < 0.1%
37.74444956 | 2 | < 0.1%
37.7533412 | 2 | < 0.1%
37.75396176 | 2 | < 0.1%
37.75304104 | 2 | < 0.1%
37.74136836 | 2 | < 0.1%
37.75851239 | 2 | < 0.1%
37.73538433 | 2 | < 0.1%
Other values (9937) | 9980 | 99.8%

Minimum 10 values

Value | Count | Frequency (%)
37.68738249 | 1 | < 0.1%
37.68817949 | 1 | < 0.1%
37.68882172 | 1 | < 0.1%
37.68910498 | 1 | < 0.1%
37.68922069 | 1 | < 0.1%
37.68924439 | 1 | < 0.1%
37.68931929 | 1 | < 0.1%
37.68932726 | 1 | < 0.1%
37.68939934 | 1 | < 0.1%
37.68940206 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
37.77685671 | 1 | < 0.1%
37.77523136 | 1 | < 0.1%
37.77270637 | 1 | < 0.1%
37.77219735 | 1 | < 0.1%
37.77117088 | 1 | < 0.1%
37.77089869 | 1 | < 0.1%
37.77087015 | 1 | < 0.1%
37.77041809 | 1 | < 0.1%
37.76977943 | 1 | < 0.1%
37.76972585 | 1 | < 0.1%

Interactions


Correlations

Variable | 관리번호 | 맨홀형식 | 맨홀명칭 | 입력일자 | 입력자 | 최종수정일자 | 최종수정자 | 경도 | 위도
관리번호 | 1.000 | 0.780 | 0.890 | 0.010 | 0.021 | 0.010 | 0.029 | 0.240 | 0.178
맨홀형식 | 0.780 | 1.000 | 1.000 | 0.746 | 0.699 | 0.746 | 0.672 | 0.491 | 0.263
맨홀명칭 | 0.890 | 1.000 | 1.000 | 0.606 | 0.659 | 0.606 | 0.673 | 0.440 | 0.234
입력일자 | 0.010 | 0.746 | 0.606 | 1.000 | 1.000 | 1.000 | 1.000 | 0.514 | 0.349
입력자 | 0.021 | 0.699 | 0.659 | 1.000 | 1.000 | 1.000 | 1.000 | 0.622 | 0.319
최종수정일자 | 0.010 | 0.746 | 0.606 | 1.000 | 1.000 | 1.000 | 1.000 | 0.514 | 0.349
최종수정자 | 0.029 | 0.672 | 0.673 | 1.000 | 1.000 | 1.000 | 1.000 | 0.537 | 0.364
경도 | 0.240 | 0.491 | 0.440 | 0.514 | 0.622 | 0.514 | 0.537 | 1.000 | 0.589
위도 | 0.178 | 0.263 | 0.234 | 0.349 | 0.319 | 0.349 | 0.364 | 0.589 | 1.000

Variable | 맨홀명칭 | 맨홀형식 | 입력일자 | 최종수정자 | 최종수정일자 | 입력자
맨홀명칭 | 1.000 | 1.000 | 0.376 | 0.411 | 0.376 | 0.458
맨홀형식 | 1.000 | 1.000 | 0.375 | 0.411 | 0.375 | 0.457
입력일자 | 0.376 | 0.375 | 1.000 | 1.000 | 1.000 | 1.000
최종수정자 | 0.411 | 0.411 | 1.000 | 1.000 | 1.000 | 1.000
최종수정일자 | 0.376 | 0.375 | 1.000 | 1.000 | 1.000 | 1.000
입력자 | 0.458 | 0.457 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 관리번호 | 경도 | 위도 | 맨홀형식 | 맨홀명칭 | 입력일자 | 입력자 | 최종수정일자 | 최종수정자
관리번호 | 1.000 | -0.229 | 0.383 | 0.621 | 0.621 | 0.007 | 0.016 | 0.007 | 0.012
경도 | -0.229 | 1.000 | -0.025 | 0.223 | 0.217 | 0.291 | 0.306 | 0.291 | 0.318
위도 | 0.383 | -0.025 | 1.000 | 0.109 | 0.108 | 0.185 | 0.138 | 0.185 | 0.201
맨홀형식 | 0.621 | 0.223 | 0.109 | 1.000 | 1.000 | 0.375 | 0.457 | 0.375 | 0.411
맨홀명칭 | 0.621 | 0.217 | 0.108 | 1.000 | 1.000 | 0.376 | 0.458 | 0.376 | 0.411
입력일자 | 0.007 | 0.291 | 0.185 | 0.375 | 0.376 | 1.000 | 1.000 | 1.000 | 1.000
입력자 | 0.016 | 0.306 | 0.138 | 0.457 | 0.458 | 1.000 | 1.000 | 1.000 | 1.000
최종수정일자 | 0.007 | 0.291 | 0.185 | 0.375 | 0.376 | 1.000 | 1.000 | 1.000 | 1.000
최종수정자 | 0.012 | 0.318 | 0.201 | 0.411 | 0.411 | 1.000 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Row index + 관리번호 (run together in the source) | 맨홀형식 | 맨홀명칭 | 입력일자 | 입력자 | 최종수정일자 | 최종수정자 | 경도 | 위도
102134754 | AZB004 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.056449 | 37.730644
351916793 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.023433 | 37.755919
186833851 | AZB006 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.056158 | 37.751854
50989672 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.049174 | 37.705097
625717009 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.038775 | 37.758918
66586548 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.084914 | 37.738029
19923202031591 | AZB012 | 전기맨홀 | 2020-11-13 | 지오스토리컨소시엄 | 2020-11-13 | 지오스토리컨소시엄 | 127.040814 | 37.763108
245020218 | AZB000 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.036716 | 37.73511
392191120024 | AZB015 | 통신맨홀 | 2019-12-16 | 지오스토리컨소시엄 | 2019-12-16 | 컨소시엄 | 127.107207 | 37.710867
131446663 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.081241 | 37.742618

Row index + 관리번호 (run together in the source) | 맨홀형식 | 맨홀명칭 | 입력일자 | 입력자 | 최종수정일자 | 최종수정자 | 경도 | 위도
169967391 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.083511 | 37.748492
90423789 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.053042 | 37.748823
1677910289 | AZB006 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.052806 | 37.733309
1371191130958 | AZB015 | 통신맨홀 | 2019-12-16 | 지오스토리컨소시엄 | 2019-12-16 | 컨소시엄 | 127.102096 | 37.750582
190191110054 | AZB999 | 기타맨홀 | 2019-12-16 | 지오스토리컨소시엄 | 2019-12-16 | 컨소시엄 | 127.060589 | 37.751379
80734678 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.05716 | 37.732375
169267252 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.078759 | 37.744139
28469605 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.048222 | 37.705581
11160592 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.048241 | 37.743064
26251866 | AZB005 | 공동구맨홀 | 확인불가 | 확인불가 | 확인불가 | 확인불가 | 127.05467 | 37.736272