Overview

Dataset statistics

Number of variables6
Number of observations24
Missing cells30
Missing cells (%)20.8%
Duplicate rows2
Duplicate rows (%)8.3%
Total size in memory1.3 KiB
Average record size in memory53.5 B

Variable types

Text1
Unsupported5

Dataset

Description파일 다운로드
Author서울교통공사
URLhttps://data.seoul.go.kr/dataList/OA-12926/F/1/datasetView.do

Alerts

Dataset has 2 (8.3%) duplicate rowsDuplicates
위치별 현황 has 5 (20.8%) missing valuesMissing
Unnamed: 1 has 5 (20.8%) missing valuesMissing
Unnamed: 2 has 5 (20.8%) missing valuesMissing
Unnamed: 3 has 5 (20.8%) missing valuesMissing
Unnamed: 4 has 5 (20.8%) missing valuesMissing
Unnamed: 5 has 5 (20.8%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-04-29 16:38:26.460166
Analysis finished2024-04-29 16:38:26.861561
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

위치별 현황
Text

MISSING 

Distinct17
Distinct (%)89.5%
Missing5
Missing (%)20.8%
Memory size324.0 B
2024-04-30T01:38:26.945326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length2
Mean length2.7368421
Min length1

Characters and Unicode

Total characters52
Distinct characters32
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)84.2%

Sample

1st row
2nd row보도
3rd row중앙
4th row녹지
5th row차도
ValueCountFrequency (%)
3
 
15.0%
보도 1
 
5.0%
30~120 1
 
5.0%
30미만 1
 
5.0%
현황 1
 
5.0%
높이별 1
 
5.0%
급배기 1
 
5.0%
배기 1
 
5.0%
급기 1
 
5.0%
소계 1
 
5.0%
Other values (8) 8
40.0%
2024-04-30T01:38:27.222744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
 
7.7%
4
 
7.7%
0 4
 
7.7%
3
 
5.8%
2 2
 
3.8%
1 2
 
3.8%
2
 
3.8%
3 2
 
3.8%
2
 
3.8%
2
 
3.8%
Other values (22) 25
48.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40
76.9%
Decimal Number 10
 
19.2%
Math Symbol 1
 
1.9%
Space Separator 1
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
10.0%
4
 
10.0%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
1
 
2.5%
Other values (16) 16
40.0%
Decimal Number
ValueCountFrequency (%)
0 4
40.0%
2 2
20.0%
1 2
20.0%
3 2
20.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40
76.9%
Common 12
 
23.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
10.0%
4
 
10.0%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
1
 
2.5%
Other values (16) 16
40.0%
Common
ValueCountFrequency (%)
0 4
33.3%
2 2
16.7%
1 2
16.7%
3 2
16.7%
~ 1
 
8.3%
1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 40
76.9%
ASCII 12
 
23.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
 
10.0%
4
 
10.0%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
1
 
2.5%
Other values (16) 16
40.0%
ASCII
ValueCountFrequency (%)
0 4
33.3%
2 2
16.7%
1 2
16.7%
3 2
16.7%
~ 1
 
8.3%
1
 
8.3%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Missing values

2024-04-30T01:38:26.548873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T01:38:26.665351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-30T01:38:26.790544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

위치별 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5
0<NA>1호선2호선3호선4호선
1990111431270178
2보도71392254207160
3중앙126011736
4녹지99747387
5차도75110
6기타45712215
7<NA>NaNNaNNaNNaNNaN
8<NA>NaNNaNNaNNaNNaN
9용도별현황NaNNaNNaNNaNNaN
위치별 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5
14급기3022210311265
15배기3692513312982
16급배기80440
17<NA>NaNNaNNaNNaNNaN
18<NA>NaNNaNNaNNaNNaN
19높이별 현황1호선2호선3호선4호선
20990111431270178
2130미만1368246440
2230~120472772248586
23120이상3822618312152

Duplicate rows

Most frequently occurring

위치별 현황# duplicates
1<NA>5
03