Dataset statistics
Number of variables | 14 |
---|---|
Number of observations | 124 |
Missing cells | 623 |
Missing cells (%) | 35.9% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 13.8 KiB |
Average record size in memory | 114.1 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 1 |
Text | 1 |
Unsupported | 11 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-13294/F/1/datasetView.do |
역번호 is highly overall correlated with 호선 | High correlation |
호선 is highly overall correlated with 역번호 | High correlation |
역번호 has 5 (4.0%) missing values | Missing |
역 명 has 5 (4.0%) missing values | Missing |
턴스타일게이트 has 9 (7.3%) missing values | Missing |
Unnamed: 4 has 9 (7.3%) missing values | Missing |
Unnamed: 5 has 112 (90.3%) missing values | Missing |
Unnamed: 6 has 9 (7.3%) missing values | Missing |
Unnamed: 7 has 9 (7.3%) missing values | Missing |
슬림게이트 has 87 (70.2%) missing values | Missing |
Unnamed: 9 has 87 (70.2%) missing values | Missing |
Unnamed: 10 has 117 (94.4%) missing values | Missing |
Unnamed: 11 has 87 (70.2%) missing values | Missing |
Unnamed: 12 has 87 (70.2%) missing values | Missing |
턴스타일게이트 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
슬림게이트 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
스피드게이트 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-04-29 16:49:44.116742 |
---|---|
Analysis finished | 2024-04-29 16:49:45.162956 |
Duration | 1.05 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
호선
Categorical
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 7.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
2호선 | |
---|---|
3호선 | |
4호선 | |
1호선 | |
<NA> | 1 |
Other values (4) | 4 |
Length
Max length | 6 |
---|---|
Median length | 3 |
Mean length | 3.1048387 |
Min length | 3 |
Unique
Unique | 5 ? |
---|---|
Unique (%) | 4.0% |
Sample
1st row | <NA> |
---|---|
2nd row | 1호선 |
3rd row | 1호선 |
4th row | 1호선 |
5th row | 1호선 |
Common Values
Value | Count | Frequency (%) |
2호선 | 50 | |
3호선 | 33 | |
4호선 | 26 | |
1호선 | 10 | 8.1% |
<NA> | 1 | 0.8% |
1호선 합계 | 1 | 0.8% |
2호선 합계 | 1 | 0.8% |
3호선 합계 | 1 | 0.8% |
4호선 합계 | 1 | 0.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2호선 | 51 | |
3호선 | 34 | |
4호선 | 27 | |
1호선 | 11 | 8.6% |
합계 | 4 | 3.1% |
na | 1 | 0.8% |
역번호
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 119 |
---|---|
Distinct (%) | 100.0% |
Missing | 5 |
Missing (%) | 4.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 290.12605 |
Minimum | 150 |
---|---|
Maximum | 434 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.2 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 155.9 |
Q1 | 220.5 |
median | 250 |
Q3 | 338.5 |
95-th percentile | 428.1 |
Maximum | 434 |
Range | 284 |
Interquartile range (IQR) | 118 |
Descriptive statistics
Standard deviation | 87.25227 |
---|---|
Coefficient of variation (CV) | 0.30073918 |
Kurtosis | -1.1541503 |
Mean | 290.12605 |
Median Absolute Deviation (MAD) | 68 |
Skewness | 0.27063256 |
Sum | 34525 |
Variance | 7612.9586 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
151 | 1 | 0.8% |
338 | 1 | 0.8% |
337 | 1 | 0.8% |
336 | 1 | 0.8% |
335 | 1 | 0.8% |
334 | 1 | 0.8% |
333 | 1 | 0.8% |
332 | 1 | 0.8% |
331 | 1 | 0.8% |
330 | 1 | 0.8% |
Other values (109) | 109 | |
(Missing) | 5 | 4.0% |
Value | Count | Frequency (%) |
150 | 1 | |
151 | 1 | |
152 | 1 | |
153 | 1 | |
154 | 1 | |
155 | 1 | |
156 | 1 | |
157 | 1 | |
158 | 1 | |
159 | 1 |
Value | Count | Frequency (%) |
434 | 1 | |
433 | 1 | |
432 | 1 | |
431 | 1 | |
430 | 1 | |
429 | 1 | |
428 | 1 | |
427 | 1 | |
426 | 1 | |
425 | 1 |
역 명
Text
MISSING
 
Distinct | 118 |
---|---|
Distinct (%) | 99.2% |
Missing | 5 |
Missing (%) | 4.0% |
Memory size | 1.1 KiB |
Value | Count | Frequency (%) |
수 | 3 | 2.2% |
대 | 3 | 2.2% |
동대문역사문화공원 | 2 | 1.5% |
금 | 2 | 1.5% |
원 | 2 | 1.5% |
재 | 1 | 0.7% |
교 | 1 | 0.7% |
남부터미널 | 1 | 0.7% |
양 | 1 | 0.7% |
매 | 1 | 0.7% |
Other values (119) | 119 |
Most occurring characters
Value | Count | Frequency (%) |
32 | 7.7% | |
대 | 21 | 5.0% |
신 | 14 | 3.4% |
동 | 13 | 3.1% |
구 | 13 | 3.1% |
( | 13 | 3.1% |
) | 13 | 3.1% |
문 | 9 | 2.2% |
지 | 8 | 1.9% |
입 | 7 | 1.7% |
Other values (137) | 273 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 339 | |
Space Separator | 32 | 7.7% |
Decimal Number | 19 | 4.6% |
Open Punctuation | 13 | 3.1% |
Close Punctuation | 13 | 3.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 21 | 6.2% |
신 | 14 | 4.1% |
동 | 13 | 3.8% |
구 | 13 | 3.8% |
문 | 9 | 2.7% |
지 | 8 | 2.4% |
입 | 7 | 2.1% |
가 | 7 | 2.1% |
로 | 6 | 1.8% |
청 | 6 | 1.8% |
Other values (129) | 235 |
Decimal Number
Value | Count | Frequency (%) |
3 | 6 | |
1 | 5 | |
2 | 5 | |
4 | 2 | 10.5% |
5 | 1 | 5.3% |
Space Separator
Value | Count | Frequency (%) |
32 |
Open Punctuation
Value | Count | Frequency (%) |
( | 13 |
Close Punctuation
Value | Count | Frequency (%) |
) | 13 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 339 | |
Common | 77 | 18.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 21 | 6.2% |
신 | 14 | 4.1% |
동 | 13 | 3.8% |
구 | 13 | 3.8% |
문 | 9 | 2.7% |
지 | 8 | 2.4% |
입 | 7 | 2.1% |
가 | 7 | 2.1% |
로 | 6 | 1.8% |
청 | 6 | 1.8% |
Other values (129) | 235 |
Common
Value | Count | Frequency (%) |
32 | ||
( | 13 | |
) | 13 | |
3 | 6 | 7.8% |
1 | 5 | 6.5% |
2 | 5 | 6.5% |
4 | 2 | 2.6% |
5 | 1 | 1.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 339 | |
ASCII | 77 | 18.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
32 | ||
( | 13 | |
) | 13 | |
3 | 6 | 7.8% |
1 | 5 | 6.5% |
2 | 5 | 6.5% |
4 | 2 | 2.6% |
5 | 1 | 1.3% |
Hangul
Value | Count | Frequency (%) |
대 | 21 | 6.2% |
신 | 14 | 4.1% |
동 | 13 | 3.8% |
구 | 13 | 3.8% |
문 | 9 | 2.7% |
지 | 8 | 2.4% |
입 | 7 | 2.1% |
가 | 7 | 2.1% |
로 | 6 | 1.8% |
청 | 6 | 1.8% |
Other values (129) | 235 |
턴스타일게이트
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 9 |
---|---|
Missing (%) | 7.3% |
Memory size | 1.1 KiB |
Unnamed: 4
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 9 |
---|---|
Missing (%) | 7.3% |
Memory size | 1.1 KiB |
Unnamed: 5
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 112 |
---|---|
Missing (%) | 90.3% |
Memory size | 1.1 KiB |
Unnamed: 6
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 9 |
---|---|
Missing (%) | 7.3% |
Memory size | 1.1 KiB |
Unnamed: 7
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 9 |
---|---|
Missing (%) | 7.3% |
Memory size | 1.1 KiB |
슬림게이트
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 87 |
---|---|
Missing (%) | 70.2% |
Memory size | 1.1 KiB |
Unnamed: 9
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 87 |
---|---|
Missing (%) | 70.2% |
Memory size | 1.1 KiB |
Unnamed: 10
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 117 |
---|---|
Missing (%) | 94.4% |
Memory size | 1.1 KiB |
Unnamed: 11
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 87 |
---|---|
Missing (%) | 70.2% |
Memory size | 1.1 KiB |
Unnamed: 12
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 87 |
---|---|
Missing (%) | 70.2% |
Memory size | 1.1 KiB |
스피드게이트
Unsupported
REJECTED
  UNSUPPORTED
 
Missing | 0 |
---|---|
Missing (%) | 0.0% |
Memory size | 1.1 KiB |
호선 | 역번호 | |
---|---|---|
호선 | 1.000 | 1.000 |
역번호 | 1.000 | 1.000 |
역번호 | 호선 | |
---|---|---|
역번호 | 1.000 | 0.987 |
호선 | 0.987 | 1.000 |
호선 | 역번호 | 역 명 | 턴스타일게이트 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | 슬림게이트 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | 스피드게이트 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | 소계 | EN | EX | REV | FL | 소계 | 1형 | 2형 | 3형 | 5형 | 설치 |
1 | 1호선 | 150 | 서울역(1) | 46 | 10 | NaN | 30 | 6 | 16 | 1 | NaN | 14 | 1 | 5 |
2 | 1호선 | 151 | 시청(1) | NaN | NaN | NaN | NaN | NaN | 64 | 4 | NaN | 56 | 4 | 4 |
3 | 1호선 | 152 | 종각 | 37 | 6 | NaN | 27 | 4 | 11 | 2 | NaN | 7 | 2 | 5 |
4 | 1호선 | 153 | 종로3가(1) | 34 | 5 | NaN | 24 | 5 | NaN | NaN | NaN | NaN | NaN | 4 |
5 | 1호선 | 154 | 종로5가 | 31 | 4 | NaN | 23 | 4 | NaN | NaN | NaN | NaN | NaN | 3 |
6 | 1호선 | 155 | 동대문(1) | 32 | 5 | NaN | 22 | 5 | NaN | NaN | NaN | NaN | NaN | 5 |
7 | 1호선 | 156 | 신설동(1) | 25 | 4 | NaN | 17 | 4 | 11 | 2 | NaN | 7 | 2 | 5 |
8 | 1호선 | 157 | 제기동 | 20 | 4 | NaN | 13 | 3 | NaN | NaN | NaN | NaN | NaN | 2 |
9 | 1호선 | 158 | 청량리 | 36 | 8 | NaN | 23 | 5 | NaN | NaN | NaN | NaN | NaN | 3 |
호선 | 역번호 | 역 명 | 턴스타일게이트 | Unnamed: 4 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | 슬림게이트 | Unnamed: 9 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | 스피드게이트 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
114 | 4호선 | 426 | 서울역 | 9 | 2 | NaN | 5 | 2 | 8 | 1 | NaN | 6 | 1 | 2 |
115 | 4호선 | 427 | 숙대입구 | 28 | 4 | NaN | 20 | 4 | NaN | NaN | NaN | NaN | NaN | 4 |
116 | 4호선 | 428 | 삼각지 | 25 | 4 | NaN | 17 | 4 | NaN | NaN | NaN | NaN | NaN | 2 |
117 | 4호선 | 429 | 신용산 | 23 | 4 | NaN | 15 | 4 | NaN | NaN | NaN | NaN | NaN | 4 |
118 | 4호선 | 430 | 이 촌 | 19 | 4 | NaN | 13 | 2 | NaN | NaN | NaN | NaN | NaN | 2 |
119 | 4호선 | 431 | 동 작 | 10 | 2 | NaN | 6 | 2 | NaN | NaN | NaN | NaN | NaN | 2 |
120 | 4호선 | 432 | 총신대 | 23 | 4 | NaN | 15 | 4 | NaN | NaN | NaN | NaN | NaN | 3 |
121 | 4호선 | 433 | 사당(4) | 25 | 6 | NaN | 13 | 6 | NaN | NaN | NaN | NaN | NaN | 3 |
122 | 4호선 | 434 | 남태령 | 7 | 2 | NaN | 4 | 1 | NaN | NaN | NaN | NaN | NaN | 1 |
123 | 4호선 합계 | <NA> | <NA> | 549 | 92 | 0 | 376 | 81 | 118 | 13 | 0 | 92 | 13 | 71 |