Overview

Dataset statistics

Number of variables9
Number of observations204
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory15.1 KiB
Average record size in memory75.6 B

Variable types

Text2
Categorical5
Numeric2

Dataset

Description관리번호,구분코드(01:전력구,02:통신구),관리기관,관리부서,시설물명,집수정위치,자치구,X좌표,Y좌표
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-21118/S/1/datasetView.do

Alerts

구분코드(01:전력구,02:통신구) has constant value ""Constant
관리기관 has constant value ""Constant
시설물명 has constant value ""Constant
Dataset has 1 (0.5%) duplicate rowsDuplicates
X좌표 is highly overall correlated with 관리부서 and 1 other fieldsHigh correlation
Y좌표 is highly overall correlated with 관리부서 and 1 other fieldsHigh correlation
관리부서 is highly overall correlated with X좌표 and 2 other fieldsHigh correlation
자치구 is highly overall correlated with X좌표 and 2 other fieldsHigh correlation

Reproduction

Analysis started2023-12-11 03:57:28.069345
Analysis finished2023-12-11 03:57:29.286889
Duration1.22 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct162
Distinct (%)79.4%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T12:57:29.504598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters2244
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique161 ?
Unique (%)78.9%

Sample

1st row2018_1_0212
2nd row2018_1_0214
3rd row2018_1_0215
4th row2018_1_0216
5th row2018_1_0217
ValueCountFrequency (%)
2018_2_0307 43
 
21.1%
2018_2_0264 1
 
0.5%
2018_2_0240 1
 
0.5%
2018_2_0250 1
 
0.5%
2018_2_0241 1
 
0.5%
2018_2_0242 1
 
0.5%
2018_2_0244 1
 
0.5%
2018_2_0245 1
 
0.5%
2018_2_0247 1
 
0.5%
2018_2_0248 1
 
0.5%
Other values (152) 152
74.5%
2023-12-11T12:57:29.907005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 501
22.3%
0 484
21.6%
_ 408
18.2%
1 313
13.9%
8 238
10.6%
3 92
 
4.1%
7 78
 
3.5%
6 36
 
1.6%
4 32
 
1.4%
9 32
 
1.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1836
81.8%
Connector Punctuation 408
 
18.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 501
27.3%
0 484
26.4%
1 313
17.0%
8 238
13.0%
3 92
 
5.0%
7 78
 
4.2%
6 36
 
2.0%
4 32
 
1.7%
9 32
 
1.7%
5 30
 
1.6%
Connector Punctuation
ValueCountFrequency (%)
_ 408
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2244
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 501
22.3%
0 484
21.6%
_ 408
18.2%
1 313
13.9%
8 238
10.6%
3 92
 
4.1%
7 78
 
3.5%
6 36
 
1.6%
4 32
 
1.4%
9 32
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2244
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 501
22.3%
0 484
21.6%
_ 408
18.2%
1 313
13.9%
8 238
10.6%
3 92
 
4.1%
7 78
 
3.5%
6 36
 
1.6%
4 32
 
1.4%
9 32
 
1.4%
Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2
204 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2
5th row2

Common Values

ValueCountFrequency (%)
2 204
100.0%

Length

2023-12-11T12:57:30.047921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:57:30.144498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 204
100.0%

관리기관
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
㈜KT
204 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row㈜KT
2nd row㈜KT
3rd row㈜KT
4th row㈜KT
5th row㈜KT

Common Values

ValueCountFrequency (%)
㈜KT 204
100.0%

Length

2023-12-11T12:57:30.266069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:57:30.382660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
㈜kt 204
100.0%

관리부서
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
통신구팀
126 
kt 신촌지점
14 
kt 광화문지사(혜화)
 
12
kt 성수지점
 
10
kt 서대문지사(가좌)
 
10
Other values (5)
32 

Length

Max length12
Median length4
Mean length5.7352941
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowkt 성수지점
2nd rowkt 성수지점
3rd rowkt 성수지점
4th rowkt 성수지점
5th rowkt 성수지점

Common Values

ValueCountFrequency (%)
통신구팀 126
61.8%
kt 신촌지점 14
 
6.9%
kt 광화문지사(혜화) 12
 
5.9%
kt 성수지점 10
 
4.9%
kt 서대문지사(가좌) 10
 
4.9%
kt 원효지점 10
 
4.9%
KT구로지사 10
 
4.9%
KT동작지사 6
 
2.9%
kt 도봉지점(방학) 4
 
2.0%
kt 서대문지사(홍제) 2
 
1.0%

Length

2023-12-11T12:57:30.499632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:57:30.666161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
통신구팀 126
47.4%
kt 62
23.3%
신촌지점 14
 
5.3%
광화문지사(혜화 12
 
4.5%
성수지점 10
 
3.8%
서대문지사(가좌 10
 
3.8%
원효지점 10
 
3.8%
kt구로지사 10
 
3.8%
kt동작지사 6
 
2.3%
도봉지점(방학 4
 
1.5%

시설물명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
통신구
204 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row통신구
2nd row통신구
3rd row통신구
4th row통신구
5th row통신구

Common Values

ValueCountFrequency (%)
통신구 204
100.0%

Length

2023-12-11T12:57:30.831271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:57:30.940760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
통신구 204
100.0%
Distinct80
Distinct (%)39.2%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
2023-12-11T12:57:31.175488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length27
Mean length23.754902
Min length14

Characters and Unicode

Total characters4846
Distinct characters198
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울시 성동구 아차산로 13길47 (KT성수지점앞)
2nd row서울시 성동구 아차산로 13길54(신한은행앞)
3rd row서울시 광진구 동일로 190(화양사거리)
4th row서울시 성동구 아차산로 13길47 (KT 성수지점앞)
5th row서울시 성동구 아차산로 13길47 (KT 성수지점앞)
ValueCountFrequency (%)
서울시 78
 
10.1%
집수정 36
 
4.6%
성북구 36
 
4.6%
동대문구 24
 
3.1%
종로구 21
 
2.7%
18
 
2.3%
서대문구 16
 
2.1%
마포구 14
 
1.8%
용산구 10
 
1.3%
중구 9
 
1.2%
Other values (182) 514
66.2%
2023-12-11T12:57:31.583352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
588
 
12.1%
234
 
4.8%
) 204
 
4.2%
( 204
 
4.2%
186
 
3.8%
167
 
3.4%
1 166
 
3.4%
2 128
 
2.6%
104
 
2.1%
4 103
 
2.1%
Other values (188) 2762
57.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2839
58.6%
Decimal Number 862
 
17.8%
Space Separator 588
 
12.1%
Close Punctuation 204
 
4.2%
Open Punctuation 204
 
4.2%
Dash Punctuation 78
 
1.6%
Other Punctuation 39
 
0.8%
Lowercase Letter 16
 
0.3%
Uppercase Letter 16
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
234
 
8.2%
186
 
6.6%
167
 
5.9%
104
 
3.7%
100
 
3.5%
87
 
3.1%
78
 
2.7%
77
 
2.7%
75
 
2.6%
64
 
2.3%
Other values (169) 1667
58.7%
Decimal Number
ValueCountFrequency (%)
1 166
19.3%
2 128
14.8%
4 103
11.9%
3 91
10.6%
7 68
7.9%
6 67
7.8%
5 62
 
7.2%
8 62
 
7.2%
9 59
 
6.8%
0 56
 
6.5%
Lowercase Letter
ValueCountFrequency (%)
t 8
50.0%
k 8
50.0%
Uppercase Letter
ValueCountFrequency (%)
K 8
50.0%
T 8
50.0%
Space Separator
ValueCountFrequency (%)
588
100.0%
Close Punctuation
ValueCountFrequency (%)
) 204
100.0%
Open Punctuation
ValueCountFrequency (%)
( 204
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 78
100.0%
Other Punctuation
ValueCountFrequency (%)
, 39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2839
58.6%
Common 1975
40.8%
Latin 32
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
234
 
8.2%
186
 
6.6%
167
 
5.9%
104
 
3.7%
100
 
3.5%
87
 
3.1%
78
 
2.7%
77
 
2.7%
75
 
2.6%
64
 
2.3%
Other values (169) 1667
58.7%
Common
ValueCountFrequency (%)
588
29.8%
) 204
 
10.3%
( 204
 
10.3%
1 166
 
8.4%
2 128
 
6.5%
4 103
 
5.2%
3 91
 
4.6%
- 78
 
3.9%
7 68
 
3.4%
6 67
 
3.4%
Other values (5) 278
14.1%
Latin
ValueCountFrequency (%)
t 8
25.0%
k 8
25.0%
K 8
25.0%
T 8
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2839
58.6%
ASCII 2007
41.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
588
29.3%
) 204
 
10.2%
( 204
 
10.2%
1 166
 
8.3%
2 128
 
6.4%
4 103
 
5.1%
3 91
 
4.5%
- 78
 
3.9%
7 68
 
3.4%
6 67
 
3.3%
Other values (9) 310
15.4%
Hangul
ValueCountFrequency (%)
234
 
8.2%
186
 
6.6%
167
 
5.9%
104
 
3.7%
100
 
3.5%
87
 
3.1%
78
 
2.7%
77
 
2.7%
75
 
2.6%
64
 
2.3%
Other values (169) 1667
58.7%

자치구
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)7.8%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
성북구
39 
종로구
27 
동대문구
27 
중 구
24 
마포구
17 
Other values (11)
70 

Length

Max length4
Median length3
Mean length3.3186275
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성동구
2nd row성동구
3rd row광진구
4th row성동구
5th row성동구

Common Values

ValueCountFrequency (%)
성북구 39
19.1%
종로구 27
13.2%
동대문구 27
13.2%
중 구 24
11.8%
마포구 17
8.3%
서대문구 14
 
6.9%
용산구 13
 
6.4%
성동구 8
 
3.9%
관악구 6
 
2.9%
동작구 6
 
2.9%
Other values (6) 23
11.3%

Length

2023-12-11T12:57:31.739298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성북구 39
17.1%
종로구 27
11.8%
동대문구 27
11.8%
24
10.5%
24
10.5%
마포구 17
7.5%
서대문구 14
 
6.1%
용산구 13
 
5.7%
성동구 8
 
3.5%
관악구 6
 
2.6%
Other values (7) 29
12.7%

X좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct76
Distinct (%)37.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean199487.09
Minimum189979.6
Maximum206022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-11T12:57:31.908152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum189979.6
5-th percentile192335.2
Q1196394.4
median200612
Q3202680.4
95-th percentile205464
Maximum206022
Range16042.4
Interquartile range (IQR)6286

Descriptive statistics

Standard deviation4420.0432
Coefficient of variation (CV)0.022157039
Kurtosis-1.0281931
Mean199487.09
Median Absolute Deviation (MAD)3387.2
Skewness-0.37862227
Sum40695367
Variance19536782
MonotonicityNot monotonic
2023-12-11T12:57:32.112664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
205470.4 6
 
2.9%
194195.2 6
 
2.9%
200129.2 4
 
2.0%
205190.0 3
 
1.5%
196480.0 3
 
1.5%
204030.0 3
 
1.5%
204759.2 3
 
1.5%
205464.0 3
 
1.5%
197224.8 3
 
1.5%
197292.8 3
 
1.5%
Other values (66) 167
81.9%
ValueCountFrequency (%)
189979.6 2
1.0%
190331.2 2
1.0%
191185.6 2
1.0%
191433.2 2
1.0%
192139.6 2
1.0%
192335.2 2
1.0%
192342.8 2
1.0%
192363.6 2
1.0%
192412.4 2
1.0%
193193.2 2
1.0%
ValueCountFrequency (%)
206022.0 2
 
1.0%
205551.6 2
 
1.0%
205470.4 6
2.9%
205464.0 3
1.5%
205348.8 3
1.5%
205190.0 3
1.5%
205045.2 3
1.5%
204990.8 3
1.5%
204938.8 3
1.5%
204831.6 3
1.5%

Y좌표
Real number (ℝ)

HIGH CORRELATION 

Distinct76
Distinct (%)37.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean452200.58
Minimum442229.2
Maximum463028
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.9 KiB
2023-12-11T12:57:32.300686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum442229.2
5-th percentile445781.2
Q1450896.8
median452454.4
Q3454277.5
95-th percentile456660
Maximum463028
Range20798.8
Interquartile range (IQR)3380.7

Descriptive statistics

Standard deviation3668.3147
Coefficient of variation (CV)0.0081121406
Kurtosis1.7348369
Mean452200.58
Median Absolute Deviation (MAD)1565.6
Skewness-0.45375049
Sum92248918
Variance13456533
MonotonicityNot monotonic
2023-12-11T12:57:32.442855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
449751.6 6
 
2.9%
445781.2 6
 
2.9%
452999.2 4
 
2.0%
451523.2 3
 
1.5%
452454.4 3
 
1.5%
456104.8 3
 
1.5%
456660.0 3
 
1.5%
455882.8 3
 
1.5%
451165.6 3
 
1.5%
451356.4 3
 
1.5%
Other values (66) 167
81.9%
ValueCountFrequency (%)
442229.2 2
 
1.0%
442266.4 2
 
1.0%
442519.6 2
 
1.0%
442535.2 2
 
1.0%
442724.4 2
 
1.0%
445781.2 6
2.9%
447359.6 3
1.5%
447432.0 3
1.5%
448432.0 2
 
1.0%
448483.2 2
 
1.0%
ValueCountFrequency (%)
463028.0 2
1.0%
462838.8 2
1.0%
458283.2 3
1.5%
457076.0 3
1.5%
456660.0 3
1.5%
456104.8 3
1.5%
455952.8 3
1.5%
455910.8 3
1.5%
455882.8 3
1.5%
455880.4 3
1.5%

Interactions

2023-12-11T12:57:28.886122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:57:28.417925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:57:28.983811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:57:28.801197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T12:57:32.516744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리부서집수정위치자치구X좌표Y좌표
관리부서1.0001.0000.9680.9380.915
집수정위치1.0001.0001.0001.0001.000
자치구0.9681.0001.0000.9010.970
X좌표0.9381.0000.9011.0000.690
Y좌표0.9151.0000.9700.6901.000
2023-12-11T12:57:32.606727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
자치구관리부서
자치구1.0000.839
관리부서0.8391.000
2023-12-11T12:57:32.696631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
X좌표Y좌표관리부서자치구
X좌표1.0000.4670.5940.647
Y좌표0.4671.0000.7310.862
관리부서0.5940.7311.0000.839
자치구0.6470.8620.8391.000

Missing values

2023-12-11T12:57:29.087591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:57:29.225633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관리번호구분코드(01:전력구,02:통신구)관리기관관리부서시설물명집수정위치자치구X좌표Y좌표
02018_1_02122㈜KTkt 성수지점통신구서울시 성동구 아차산로 13길47 (KT성수지점앞)성동구205470.4449751.6
12018_1_02142㈜KTkt 성수지점통신구서울시 성동구 아차산로 13길54(신한은행앞)성동구205551.6449792.8
22018_1_02152㈜KTkt 성수지점통신구서울시 광진구 동일로 190(화양사거리)광진구206022.0449916.4
32018_1_02162㈜KTkt 성수지점통신구서울시 성동구 아차산로 13길47 (KT 성수지점앞)성동구205470.4449751.6
42018_1_02172㈜KTkt 성수지점통신구서울시 성동구 아차산로 13길47 (KT 성수지점앞)성동구205470.4449751.6
52018_1_02202㈜KTkt 서대문지사(가좌)통신구서울시 서대문구 응암로121 (kt가좌지사 우측편)서대문구192363.6453791.6
62018_1_02212㈜KTkt 서대문지사(가좌)통신구서울시 서대문구 응암로113 (중소기업은행앞)서대문구192335.2453693.2
72018_1_02222㈜KTkt 서대문지사(가좌)통신구서울시 서대문구 증가로30길 25 (kt가좌지사 후면)서대문구192342.8453836.0
82018_1_02232㈜KTkt 서대문지사(가좌)통신구서울시 서대문구 증가로 261(증산2교 북단 우측)은평구192139.6453812.4
92018_1_02242㈜KTkt 서대문지사(가좌)통신구서울시 은평구 증산로 213(증산빗물펌프장앞)은평구191433.2453189.2
관리번호구분코드(01:전력구,02:통신구)관리기관관리부서시설물명집수정위치자치구X좌표Y좌표
1942018_2_03072㈜KT통신구팀통신구논현로872(신사동610-2), 압구정사거리 집수정강남구202546.8447432.0
1952018_2_03072㈜KT통신구팀통신구서빙고로4-12(한강3가 49-3), 용산병원 집수정용산구196919.6447359.6
1962018_2_03072㈜KT통신구팀통신구신촌로297(북아현동126-30), 아현삼거리 집수정마포구196394.4450896.8
1972018_2_03072㈜KT통신구팀통신구세종대로83(태평로2가344-3), 시청 집수정중 구197923.2451572.4
1982018_2_03072㈜KT통신구팀통신구을지로54(을지로2가199-78), 중앙국사 분기 집수정중 구198556.4451812.8
1992018_2_03072㈜KT통신구팀통신구을지로79(을지로2가50), 을지로2가 집수정중 구198812.8451885.2
2002018_2_03072㈜KT통신구팀통신구다산로248(신당동100-1), 율원파출소 집수정중 구201447.2451723.6
2012018_2_03072㈜KT통신구팀통신구종로266(종로6가262-1), 청계6가 집수정종로구200612.0452362.4
2022018_2_03072㈜KT통신구팀통신구낙산성곽길2(창신동697-3), 동대문 집수정종로구200872.4452469.2
2032018_2_03072㈜KT통신구팀통신구장충단로247(을지로6가18-21), 동대문운동장 집수장동대문구200643.2451926.4

Duplicate rows

Most frequently occurring

관리번호구분코드(01:전력구,02:통신구)관리기관관리부서시설물명집수정위치자치구X좌표Y좌표# duplicates
02018_2_03072㈜KT통신구팀통신구장충단로247(을지로6가18-21), 동대문운동장 집수장동대문구200643.2451926.42