Overview

Dataset statistics

Number of variables13
Number of observations290
Missing cells291
Missing cells (%)7.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory30.1 KiB
Average record size in memory106.5 B

Variable types

Numeric1
Text1
Categorical1
Boolean9
Unsupported1

Dataset

Description지하철역ID,지하철역명,호선,엘리베이터유무,휘체어리프트유무,환승주차자유무,자전거보관소유무,무인민원발급기유무,환전키오스크유무,기차예매역유무,문화공간유무,만남의장소유무,유아수유방유무
Author서울교통공사
URLhttps://data.seoul.go.kr/dataList/OA-13321/S/1/datasetView.do

Alerts

지하철역ID is highly overall correlated with 호선High correlation
호선 is highly overall correlated with 지하철역IDHigh correlation
엘리베이터유무 is highly imbalanced (85.5%)Imbalance
환전키오스크유무 is highly imbalanced (56.5%)Imbalance
기차예매역유무 is highly imbalanced (80.0%)Imbalance
자전거보관소유무 has 290 (100.0%) missing valuesMissing
지하철역ID has unique valuesUnique
자전거보관소유무 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-05-11 06:11:53.278697
Analysis finished2024-05-11 06:11:54.691429
Duration1.41 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지하철역ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct290
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1726.4276
Minimum150
Maximum4138
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.7 KiB
2024-05-11T15:11:54.777212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum150
5-th percentile204.45
Q1320.25
median2533.5
Q32695.5
95-th percentile2826.55
Maximum4138
Range3988
Interquartile range (IQR)2375.25

Descriptive statistics

Standard deviation1259.7797
Coefficient of variation (CV)0.72970318
Kurtosis-1.5149557
Mean1726.4276
Median Absolute Deviation (MAD)285
Skewness-0.099047292
Sum500664
Variance1587044.9
MonotonicityStrictly increasing
2024-05-11T15:11:54.942079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
150 1
 
0.3%
2712 1
 
0.3%
2631 1
 
0.3%
2630 1
 
0.3%
2629 1
 
0.3%
2628 1
 
0.3%
2627 1
 
0.3%
2626 1
 
0.3%
2625 1
 
0.3%
2624 1
 
0.3%
Other values (280) 280
96.6%
ValueCountFrequency (%)
150 1
0.3%
151 1
0.3%
152 1
0.3%
153 1
0.3%
154 1
0.3%
155 1
0.3%
156 1
0.3%
157 1
0.3%
158 1
0.3%
159 1
0.3%
ValueCountFrequency (%)
4138 1
0.3%
4137 1
0.3%
4136 1
0.3%
4135 1
0.3%
4134 1
0.3%
4133 1
0.3%
4132 1
0.3%
4131 1
0.3%
4130 1
0.3%
4129 1
0.3%
Distinct250
Distinct (%)86.2%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2024-05-11T15:11:55.345795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length2
Mean length2.9586207
Min length2

Characters and Unicode

Total characters858
Distinct characters215
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique212 ?
Unique (%)73.1%

Sample

1st row서울역
2nd row시청
3rd row종각
4th row종로3가
5th row종로5가
ValueCountFrequency (%)
종로3가 3
 
1.0%
동대문역사문화공원 3
 
1.0%
시청 2
 
0.7%
서울역 2
 
0.7%
가산디지털단지 2
 
0.7%
사당 2
 
0.7%
노원 2
 
0.7%
잠실 2
 
0.7%
약수 2
 
0.7%
천호 2
 
0.7%
Other values (240) 268
92.4%
2024-05-11T15:11:56.000684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32
 
3.7%
28
 
3.3%
24
 
2.8%
23
 
2.7%
21
 
2.4%
16
 
1.9%
15
 
1.7%
15
 
1.7%
15
 
1.7%
14
 
1.6%
Other values (205) 655
76.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 850
99.1%
Decimal Number 8
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
3.8%
28
 
3.3%
24
 
2.8%
23
 
2.7%
21
 
2.5%
16
 
1.9%
15
 
1.8%
15
 
1.8%
15
 
1.8%
14
 
1.6%
Other values (202) 647
76.1%
Decimal Number
ValueCountFrequency (%)
3 5
62.5%
4 2
 
25.0%
5 1
 
12.5%

Most occurring scripts

ValueCountFrequency (%)
Hangul 850
99.1%
Common 8
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
3.8%
28
 
3.3%
24
 
2.8%
23
 
2.7%
21
 
2.5%
16
 
1.9%
15
 
1.8%
15
 
1.8%
15
 
1.8%
14
 
1.6%
Other values (202) 647
76.1%
Common
ValueCountFrequency (%)
3 5
62.5%
4 2
 
25.0%
5 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 850
99.1%
ASCII 8
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
32
 
3.8%
28
 
3.3%
24
 
2.8%
23
 
2.7%
21
 
2.5%
16
 
1.9%
15
 
1.8%
15
 
1.8%
15
 
1.8%
14
 
1.6%
Other values (202) 647
76.1%
ASCII
ValueCountFrequency (%)
3 5
62.5%
4 2
 
25.0%
5 1
 
12.5%

호선
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
05호선
56 
02호선
51 
07호선
42 
06호선
39 
03호선
34 
Other values (4)
68 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row01호선
2nd row01호선
3rd row01호선
4th row01호선
5th row01호선

Common Values

ValueCountFrequency (%)
05호선 56
19.3%
02호선 51
17.6%
07호선 42
14.5%
06호선 39
13.4%
03호선 34
11.7%
04호선 26
9.0%
08호선 18
 
6.2%
09호선 13
 
4.5%
01호선 11
 
3.8%

Length

2024-05-11T15:11:56.158573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-11T15:11:56.321239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
05호선 56
19.3%
02호선 51
17.6%
07호선 42
14.5%
06호선 39
13.4%
03호선 34
11.7%
04호선 26
9.0%
08호선 18
 
6.2%
09호선 13
 
4.5%
01호선 11
 
3.8%

엘리베이터유무
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size422.0 B
True
284 
False
 
6
ValueCountFrequency (%)
True 284
97.9%
False 6
 
2.1%
2024-05-11T15:11:56.476769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size422.0 B
False
231 
True
59 
ValueCountFrequency (%)
False 231
79.7%
True 59
 
20.3%
2024-05-11T15:11:56.573073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size422.0 B
False
254 
True
36 
ValueCountFrequency (%)
False 254
87.6%
True 36
 
12.4%
2024-05-11T15:11:56.682000image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

자전거보관소유무
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing290
Missing (%)100.0%
Memory size2.7 KiB
Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size422.0 B
False
211 
True
79 
ValueCountFrequency (%)
False 211
72.8%
True 79
 
27.2%
2024-05-11T15:11:56.787826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

환전키오스크유무
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size422.0 B
False
264 
True
 
26
ValueCountFrequency (%)
False 264
91.0%
True 26
 
9.0%
2024-05-11T15:11:56.876118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

기차예매역유무
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size422.0 B
False
281 
True
 
9
ValueCountFrequency (%)
False 281
96.9%
True 9
 
3.1%
2024-05-11T15:11:56.984656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size422.0 B
False
253 
True
37 
ValueCountFrequency (%)
False 253
87.2%
True 37
 
12.8%
2024-05-11T15:11:57.150506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size422.0 B
True
183 
False
107 
ValueCountFrequency (%)
True 183
63.1%
False 107
36.9%
2024-05-11T15:11:57.245491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.7%
Missing1
Missing (%)0.3%
Memory size712.0 B
False
186 
True
103 
(Missing)
 
1
ValueCountFrequency (%)
False 186
64.1%
True 103
35.5%
(Missing) 1
 
0.3%
2024-05-11T15:11:57.347416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2024-05-11T15:11:54.272623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-11T15:11:57.434698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지하철역ID호선엘리베이터유무휘체어리프트유무환승주차자유무무인민원발급기유무환전키오스크유무기차예매역유무문화공간유무만남의장소유무유아수유방유무
지하철역ID1.0000.9000.3230.1910.0000.0630.1710.0910.0000.2560.252
호선0.9001.0000.0000.2480.0000.1160.2190.0530.0000.3060.290
엘리베이터유무0.3230.0001.0000.0000.0000.0000.0000.0000.0000.0000.000
휘체어리프트유무0.1910.2480.0001.0000.0000.0560.0000.0000.0770.0000.188
환승주차자유무0.0000.0000.0000.0001.0000.0000.0370.0000.0000.0900.000
무인민원발급기유무0.0630.1160.0000.0560.0001.0000.1130.0000.0000.1080.000
환전키오스크유무0.1710.2190.0000.0000.0370.1131.0000.3840.0830.1690.000
기차예매역유무0.0910.0530.0000.0000.0000.0000.3841.0000.0000.0000.000
문화공간유무0.0000.0000.0000.0770.0000.0000.0830.0001.0000.1460.193
만남의장소유무0.2560.3060.0000.0000.0900.1080.1690.0000.1461.0000.045
유아수유방유무0.2520.2900.0000.1880.0000.0000.0000.0000.1930.0451.000
2024-05-11T15:11:57.598700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
휘체어리프트유무엘리베이터유무만남의장소유무유아수유방유무기차예매역유무문화공간유무무인민원발급기유무환승주차자유무호선환전키오스크유무
휘체어리프트유무1.0000.0000.0000.1200.0000.0490.0360.0000.2450.000
엘리베이터유무0.0001.0000.0000.0000.0000.0000.0000.0000.0000.000
만남의장소유무0.0000.0001.0000.0280.0000.0940.0690.0570.3020.108
유아수유방유무0.1200.0000.0281.0000.0000.1230.0000.0000.2860.000
기차예매역유무0.0000.0000.0000.0001.0000.0000.0000.0000.0520.251
문화공간유무0.0490.0000.0940.1230.0001.0000.0000.0000.0000.053
무인민원발급기유무0.0360.0000.0690.0000.0000.0001.0000.0000.1140.072
환승주차자유무0.0000.0000.0570.0000.0000.0000.0001.0000.0000.023
호선0.2450.0000.3020.2860.0520.0000.1140.0001.0000.216
환전키오스크유무0.0000.0000.1080.0000.2510.0530.0720.0230.2161.000
2024-05-11T15:11:57.772725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지하철역ID호선엘리베이터유무휘체어리프트유무환승주차자유무무인민원발급기유무환전키오스크유무기차예매역유무문화공간유무만남의장소유무유아수유방유무
지하철역ID1.0000.7850.3920.2250.0000.0740.2060.1220.0000.3100.308
호선0.7851.0000.0000.2450.0000.1140.2160.0520.0000.3020.286
엘리베이터유무0.3920.0001.0000.0000.0000.0000.0000.0000.0000.0000.000
휘체어리프트유무0.2250.2450.0001.0000.0000.0360.0000.0000.0490.0000.120
환승주차자유무0.0000.0000.0000.0001.0000.0000.0230.0000.0000.0570.000
무인민원발급기유무0.0740.1140.0000.0360.0001.0000.0720.0000.0000.0690.000
환전키오스크유무0.2060.2160.0000.0000.0230.0721.0000.2510.0530.1080.000
기차예매역유무0.1220.0520.0000.0000.0000.0000.2511.0000.0000.0000.000
문화공간유무0.0000.0000.0000.0490.0000.0000.0530.0001.0000.0940.123
만남의장소유무0.3100.3020.0000.0000.0570.0690.1080.0000.0941.0000.028
유아수유방유무0.3080.2860.0000.1200.0000.0000.0000.0000.1230.0281.000

Missing values

2024-05-11T15:11:54.427349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-11T15:11:54.615496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지하철역ID지하철역명호선엘리베이터유무휘체어리프트유무환승주차자유무자전거보관소유무무인민원발급기유무환전키오스크유무기차예매역유무문화공간유무만남의장소유무유아수유방유무
0150서울역01호선YYN<NA>NNYNYN
1151시청01호선YNN<NA>NNNNYN
2152종각01호선YNN<NA>YNNNNN
3153종로3가01호선YNN<NA>NNNYNY
4154종로5가01호선YNN<NA>NNNNNN
5155동대문01호선YNN<NA>NNNNYY
6156신설동01호선YYN<NA>NNNNNN
7157제기동01호선YNN<NA>NNNNNN
8158청량리01호선YYN<NA>NNNNNN
9159동묘앞01호선YNN<NA>NNNNNN
지하철역ID지하철역명호선엘리베이터유무휘체어리프트유무환승주차자유무자전거보관소유무무인민원발급기유무환전키오스크유무기차예매역유무문화공간유무만남의장소유무유아수유방유무
2804129봉은사09호선YNN<NA>NNNNNY
2814130종합운동장09호선YNN<NA>NNNNNY
2824131삼전09호선YNN<NA>NNNNNY
2834132석촌고분09호선YNN<NA>NNNNNY
2844133석촌09호선YNN<NA>NNNNNY
2854134송파나루09호선YNN<NA>YNNNNY
2864135한성백제09호선YNN<NA>NNNNNY
2874136올림픽공원09호선YNN<NA>NNNNNY
2884137둔촌오륜09호선YNN<NA>NNNNNY
2894138중앙보훈병원09호선YNN<NA>NNNNNY