Overview

Dataset statistics

Number of variables6
Number of observations24
Missing cells28
Missing cells (%)19.4%
Duplicate rows3
Duplicate rows (%)12.5%
Total size in memory1.3 KiB
Average record size in memory53.5 B

Variable types

Text1
Unsupported5

Dataset

Description파일 다운로드
Author서울교통공사
URLhttps://data.seoul.go.kr/dataList/OA-12926/F/1/datasetView.do

Alerts

Dataset has 3 (12.5%) duplicate rowsDuplicates
위치별 현황 has 4 (16.7%) missing valuesMissing
Unnamed: 1 has 5 (20.8%) missing valuesMissing
Unnamed: 2 has 5 (20.8%) missing valuesMissing
Unnamed: 3 has 5 (20.8%) missing valuesMissing
Unnamed: 4 has 4 (16.7%) missing valuesMissing
Unnamed: 5 has 5 (20.8%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-29 16:38:24.928353
Analysis finished2024-04-29 16:38:25.291783
Duration0.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

위치별 현황
Text

MISSING 

Distinct17
Distinct (%)85.0%
Missing4
Missing (%)16.7%
Memory size324.0 B
2024-04-30T01:38:25.383241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length2
Mean length2.65
Min length1

Characters and Unicode

Total characters53
Distinct characters31
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)75.0%

Sample

1st row
2nd row보도
3rd row중앙
4th row녹지
5th row차도
ValueCountFrequency (%)
3
15.0%
구분 2
 
10.0%
급기 1
 
5.0%
소계 1
 
5.0%
30~120 1
 
5.0%
30미만 1
 
5.0%
높이별현황 1
 
5.0%
급배기 1
 
5.0%
배기 1
 
5.0%
용도별현황 1
 
5.0%
Other values (7) 7
35.0%
2024-04-30T01:38:25.638992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
 
7.5%
0 4
 
7.5%
4
 
7.5%
3
 
5.7%
2
 
3.8%
2 2
 
3.8%
1 2
 
3.8%
3 2
 
3.8%
2
 
3.8%
2
 
3.8%
Other values (21) 26
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42
79.2%
Decimal Number 10
 
18.9%
Math Symbol 1
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
9.5%
4
 
9.5%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
Other values (16) 17
40.5%
Decimal Number
ValueCountFrequency (%)
0 4
40.0%
2 2
20.0%
1 2
20.0%
3 2
20.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42
79.2%
Common 11
 
20.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
9.5%
4
 
9.5%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
Other values (16) 17
40.5%
Common
ValueCountFrequency (%)
0 4
36.4%
2 2
18.2%
1 2
18.2%
3 2
18.2%
~ 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42
79.2%
ASCII 11
 
20.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
 
9.5%
4
 
9.5%
3
 
7.1%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
2
 
4.8%
Other values (16) 17
40.5%
ASCII
ValueCountFrequency (%)
0 4
36.4%
2 2
18.2%
1 2
18.2%
3 2
18.2%
~ 1
 
9.1%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)16.7%
Memory size324.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Missing values

2024-04-30T01:38:25.009099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T01:38:25.111212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-30T01:38:25.217110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

위치별 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5
0<NA>1호선2호선3호선4호선
1997111439269178
2보도71383263207160
3중앙123011436
4녹지1101451387
5차도75110
6기타44910205
7<NA>NaNNaNNaN※ 3호선 1개소 폐쇄NaN
8<NA>NaNNaNNaNNaNNaN
9용도별현황NaNNaNNaNNaNNaN
위치별 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5
14급기3092211011265
15배기3752514012882
16급배기80440
17<NA>NaNNaNNaNNaNNaN
18높이별현황NaNNaNNaNNaNNaN
19구분1호선2호선3호선4호선
20997111439269178
2130미만1366266440
2230~120472752278486
23120이상3893018612152

Duplicate rows

Most frequently occurring

위치별 현황# duplicates
2<NA>4
03
1구분2