Overview

Dataset statistics

Number of variables15
Number of observations31
Missing cells94
Missing cells (%)20.2%
Duplicate rows1
Duplicate rows (%)3.2%
Total size in memory3.8 KiB
Average record size in memory124.3 B

Variable types

Text3
Categorical1
Unsupported11

Dataset

Description파일 다운로드
Author서울교통공사
URLhttps://data.seoul.go.kr/dataList/OA-13213/F/1/datasetView.do

Alerts

Dataset has 1 (3.2%) duplicate rowsDuplicates
서울교통공사 정보통신설비 현황 (2017년 11월) has 15 (48.4%) missing valuesMissing
Unnamed: 1 has 11 (35.5%) missing valuesMissing
Unnamed: 2 has 29 (93.5%) missing valuesMissing
Unnamed: 4 has 2 (6.5%) missing valuesMissing
Unnamed: 5 has 2 (6.5%) missing valuesMissing
Unnamed: 6 has 3 (9.7%) missing valuesMissing
Unnamed: 7 has 3 (9.7%) missing valuesMissing
Unnamed: 8 has 2 (6.5%) missing valuesMissing
Unnamed: 9 has 4 (12.9%) missing valuesMissing
Unnamed: 10 has 6 (19.4%) missing valuesMissing
Unnamed: 11 has 7 (22.6%) missing valuesMissing
Unnamed: 12 has 6 (19.4%) missing valuesMissing
Unnamed: 13 has 2 (6.5%) missing valuesMissing
Unnamed: 14 has 2 (6.5%) missing valuesMissing
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-29 16:44:02.940621
Analysis finished2024-04-29 16:44:04.180113
Duration1.24 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct16
Distinct (%)100.0%
Missing15
Missing (%)48.4%
Memory size380.0 B
2024-04-30T01:44:04.301127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length7
Mean length6.125
Min length3

Characters and Unicode

Total characters98
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)100.0%

Sample

1st row설 비 명
2nd row광전송설비
3rd row통합정보통신망
4th row종합화상설비
5th row교환설비
ValueCountFrequency (%)
1
 
5.6%
1
 
5.6%
행선정보안내시스템 1
 
5.6%
음성유도기 1
 
5.6%
비상통화장치 1
 
5.6%
전화기 1
 
5.6%
토크백(주장치 1
 
5.6%
방송설비(역사용 1
 
5.6%
전기시계(부모 1
 
5.6%
무정전전원설비 1
 
5.6%
Other values (8) 8
44.4%
2024-04-30T01:44:04.618047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
9.2%
8
 
8.2%
5
 
5.1%
4
 
4.1%
4
 
4.1%
) 3
 
3.1%
3
 
3.1%
3
 
3.1%
( 3
 
3.1%
3
 
3.1%
Other values (41) 53
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 88
89.8%
Space Separator 4
 
4.1%
Close Punctuation 3
 
3.1%
Open Punctuation 3
 
3.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
10.2%
8
 
9.1%
5
 
5.7%
4
 
4.5%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
2
 
2.3%
Other values (38) 45
51.1%
Space Separator
ValueCountFrequency (%)
4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 88
89.8%
Common 10
 
10.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
10.2%
8
 
9.1%
5
 
5.7%
4
 
4.5%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
2
 
2.3%
Other values (38) 45
51.1%
Common
ValueCountFrequency (%)
4
40.0%
) 3
30.0%
( 3
30.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 88
89.8%
ASCII 10
 
10.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
10.2%
8
 
9.1%
5
 
5.7%
4
 
4.5%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
3
 
3.4%
2
 
2.3%
Other values (38) 45
51.1%
ASCII
ValueCountFrequency (%)
4
40.0%
) 3
30.0%
( 3
30.0%

Unnamed: 1
Text

MISSING 

Distinct20
Distinct (%)100.0%
Missing11
Missing (%)35.5%
Memory size380.0 B
2024-04-30T01:44:04.799594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length8
Mean length5.2
Min length2

Characters and Unicode

Total characters104
Distinct characters74
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)100.0%

Sample

1st row전송설비(주/부)
2nd rowXMP1
3rd row백본스위치
4th row에지스위치
5th row워크그룹스위치
ValueCountFrequency (%)
전송설비(주/부 1
 
5.0%
xmp1 1
 
5.0%
광통신용 1
 
5.0%
열차무선안테나 1
 
5.0%
콜폰 1
 
5.0%
승강장 1
 
5.0%
각종 1
 
5.0%
연선전화 1
 
5.0%
ircp 1
 
5.0%
이동국 1
 
5.0%
Other values (10) 10
50.0%
2024-04-30T01:44:05.098884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 4
 
3.8%
) 4
 
3.8%
4
 
3.8%
3
 
2.9%
3
 
2.9%
3
 
2.9%
R 3
 
2.9%
P 3
 
2.9%
3
 
2.9%
2
 
1.9%
Other values (64) 72
69.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 72
69.2%
Uppercase Letter 15
 
14.4%
Lowercase Letter 7
 
6.7%
Open Punctuation 4
 
3.8%
Close Punctuation 4
 
3.8%
Decimal Number 1
 
1.0%
Other Punctuation 1
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
5.6%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
Other values (45) 46
63.9%
Uppercase Letter
ValueCountFrequency (%)
R 3
20.0%
P 3
20.0%
V 2
13.3%
I 2
13.3%
C 1
 
6.7%
D 1
 
6.7%
M 1
 
6.7%
N 1
 
6.7%
X 1
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
a 2
28.6%
t 1
14.3%
w 1
14.3%
y 1
14.3%
e 1
14.3%
g 1
14.3%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 72
69.2%
Latin 22
 
21.2%
Common 10
 
9.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
5.6%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
Other values (45) 46
63.9%
Latin
ValueCountFrequency (%)
R 3
13.6%
P 3
13.6%
a 2
 
9.1%
V 2
 
9.1%
I 2
 
9.1%
t 1
 
4.5%
w 1
 
4.5%
y 1
 
4.5%
C 1
 
4.5%
e 1
 
4.5%
Other values (5) 5
22.7%
Common
ValueCountFrequency (%)
( 4
40.0%
) 4
40.0%
1 1
 
10.0%
/ 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 72
69.2%
ASCII 32
30.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 4
12.5%
) 4
12.5%
R 3
 
9.4%
P 3
 
9.4%
a 2
 
6.2%
V 2
 
6.2%
I 2
 
6.2%
t 1
 
3.1%
w 1
 
3.1%
y 1
 
3.1%
Other values (9) 9
28.1%
Hangul
ValueCountFrequency (%)
4
 
5.6%
3
 
4.2%
3
 
4.2%
3
 
4.2%
3
 
4.2%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
2
 
2.8%
Other values (45) 46
63.9%

Unnamed: 2
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing29
Missing (%)93.5%
Memory size380.0 B
2024-04-30T01:44:05.243882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2.5
Mean length2.5
Min length2

Characters and Unicode

Total characters5
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row역사
2nd row전동차
ValueCountFrequency (%)
역사 1
50.0%
전동차 1
50.0%
2024-04-30T01:44:05.468144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Unnamed: 3
Categorical

Distinct6
Distinct (%)19.4%
Missing0
Missing (%)0.0%
Memory size380.0 B
<NA>
Km
장치

Length

Max length4
Median length1
Mean length1.8709677
Min length1

Unique

Unique1 ?
Unique (%)3.2%

Sample

1st row<NA>
2nd row단위
3rd row<NA>
4th row
5th row<NA>

Common Values

ValueCountFrequency (%)
9
29.0%
9
29.0%
<NA> 7
22.6%
Km 3
 
9.7%
장치 2
 
6.5%
단위 1
 
3.2%

Length

2024-04-30T01:44:05.589431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T01:44:05.703540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
9
29.0%
9
29.0%
na 7
22.6%
km 3
 
9.7%
장치 2
 
6.5%
단위 1
 
3.2%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.5%
Memory size380.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.5%
Memory size380.0 B

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)9.7%
Memory size380.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)9.7%
Memory size380.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.5%
Memory size380.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)12.9%
Memory size380.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing6
Missing (%)19.4%
Memory size380.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7
Missing (%)22.6%
Memory size380.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing6
Missing (%)19.4%
Memory size380.0 B

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.5%
Memory size380.0 B

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)6.5%
Memory size380.0 B

Correlations

2024-04-30T01:44:05.788287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서울교통공사 정보통신설비 현황 (2017년 11월)Unnamed: 1Unnamed: 2Unnamed: 3
서울교통공사 정보통신설비 현황 (2017년 11월)1.0001.000NaN1.000
Unnamed: 11.0001.000NaN1.000
Unnamed: 2NaNNaN1.000NaN
Unnamed: 31.0001.000NaN1.000

Missing values

2024-04-30T01:44:03.610062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T01:44:03.798902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-30T01:44:04.011519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

서울교통공사 정보통신설비 현황 (2017년 11월)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14
0<NA><NA><NA><NA>NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
1설 비 명<NA><NA>단위1~4호선NaNNaNNaNNaN5~8호선NaNNaNNaNNaN
2<NA><NA><NA><NA>1호선2호선3호선4호선소계5호선6호선7호선8호선소계NaN
3광전송설비전송설비(주/부)<NA>1163383714953395318163312
4<NA>XMP1<NA><NA>6-191641NaNNaNNaNNaN041
5통합정보통신망백본스위치<NA>2104622NaNNaNNaNNaN022
6<NA>에지스위치<NA><NA>201057056251NaNNaNNaNNaN0251
7<NA>워크그룹스위치<NA><NA>5530019715670861409020211919
8종합화상설비모니터링시스템<NA>1051342612151385117157278
9<NA>카메라역사4862281163812585663135011291657450458610249
서울교통공사 정보통신설비 현황 (2017년 11월)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14
21토크백(주장치)<NA><NA>4131293817111654987
22전화기연선전화<NA>382311461045192731412701047881307
23<NA>각종<NA>68715791466119049221589991151653146279549
24비상통화장치승강장<NA>562821721406503052023021009092468
25<NA>콜폰<NA>150685317370152264842166922419623484
26음성유도기<NA><NA>2199354595712184916702108430430065190
27행선정보안내시스템<NA><NA>1050342612051385117157277
28선로설비열차무선안테나<NA>Km1077654719920011515954528727
29<NA>광통신용<NA>Km292261521325393471072721128381377
30<NA>본선용(반송)<NA>Km12159846331850406731188506

Duplicate rows

Most frequently occurring

서울교통공사 정보통신설비 현황 (2017년 11월)Unnamed: 1Unnamed: 2Unnamed: 3# duplicates
0<NA><NA><NA><NA>2