Dataset statistics
Number of variables | 18 |
---|---|
Number of observations | 170 |
Missing cells | 1487 |
Missing cells (%) | 48.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 24.5 KiB |
Average record size in memory | 147.8 B |
Variable types
Categorical | 1 |
---|---|
Numeric | 3 |
Text | 1 |
Unsupported | 13 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-13294/F/1/datasetView.do |
외부 역번호 is highly overall correlated with 호선 | High correlation |
개집표기 is highly overall correlated with 호선 | High correlation |
EV is highly overall correlated with 호선 | High correlation |
호선 is highly overall correlated with 외부 역번호 and 2 other fields | High correlation |
외부 역번호 has 13 (7.6%) missing values | Missing |
역명 has 5 (2.9%) missing values | Missing |
개집표기 has 4 (2.4%) missing values | Missing |
플랩형 has 9 (5.3%) missing values | Missing |
Unnamed: 5 has 53 (31.2%) missing values | Missing |
Unnamed: 6 has 9 (5.3%) missing values | Missing |
Unnamed: 7 has 116 (68.2%) missing values | Missing |
Unnamed: 8 has 78 (45.9%) missing values | Missing |
장애인/비상 has 121 (71.2%) missing values | Missing |
Unnamed: 10 has 160 (94.1%) missing values | Missing |
Unnamed: 11 has 126 (74.1%) missing values | Missing |
Unnamed: 12 has 86 (50.6%) missing values | Missing |
Unnamed: 13 has 86 (50.6%) missing values | Missing |
개방형 has 163 (95.9%) missing values | Missing |
Unnamed: 15 has 163 (95.9%) missing values | Missing |
Unnamed: 16 has 163 (95.9%) missing values | Missing |
EV has 132 (77.6%) missing values | Missing |
플랩형 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
장애인/비상 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
개방형 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
개집표기 has 6 (3.5%) zeros | Zeros |
EV has 2 (1.2%) zeros | Zeros |
Reproduction
Analysis started | 2024-04-29 16:49:37.461766 |
---|---|
Analysis finished | 2024-04-29 16:49:40.188053 |
Duration | 2.73 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
호선
Categorical
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 5.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.5 KiB |
5호선 | |
---|---|
7호선 | |
6호선 | |
8호선 | |
<NA> | 3 |
Other values (4) | 4 |
Length
Max length | 6 |
---|---|
Median length | 3 |
Mean length | 3.0882353 |
Min length | 3 |
Unique
Unique | 4 ? |
---|---|
Unique (%) | 2.4% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | 5호선 |
5th row | 5호선 |
Common Values
Value | Count | Frequency (%) |
5호선 | 53 | |
7호선 | 53 | |
6호선 | 39 | |
8호선 | 18 | 10.6% |
<NA> | 3 | 1.8% |
5호선 합계 | 1 | 0.6% |
6호선 합계 | 1 | 0.6% |
7호선 합계 | 1 | 0.6% |
8호선 합계 | 1 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
5호선 | 54 | |
7호선 | 54 | |
6호선 | 40 | |
8호선 | 19 | 10.9% |
합계 | 4 | 2.3% |
na | 3 | 1.7% |
외부 역번호
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 157 |
---|---|
Distinct (%) | 100.0% |
Missing | 13 |
Missing (%) | 7.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2654.242 |
Minimum | 2511 |
---|---|
Maximum | 2827 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.6 KiB |
Quantile statistics
Minimum | 2511 |
---|---|
5-th percentile | 2518.8 |
Q1 | 2550 |
median | 2638 |
Q3 | 2739 |
95-th percentile | 2819.2 |
Maximum | 2827 |
Range | 316 |
Interquartile range (IQR) | 189 |
Descriptive statistics
Standard deviation | 100.18415 |
---|---|
Coefficient of variation (CV) | 0.037744919 |
Kurtosis | -1.3226005 |
Mean | 2654.242 |
Median Absolute Deviation (MAD) | 95 |
Skewness | 0.11408772 |
Sum | 416716 |
Variance | 10036.864 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
2730 | 1 | 0.6% |
2723 | 1 | 0.6% |
2724 | 1 | 0.6% |
2725 | 1 | 0.6% |
2726 | 1 | 0.6% |
2727 | 1 | 0.6% |
2728 | 1 | 0.6% |
2729 | 1 | 0.6% |
2731 | 1 | 0.6% |
2721 | 1 | 0.6% |
Other values (147) | 147 | |
(Missing) | 13 | 7.6% |
Value | Count | Frequency (%) |
2511 | 1 | |
2512 | 1 | |
2513 | 1 | |
2514 | 1 | |
2515 | 1 | |
2516 | 1 | |
2517 | 1 | |
2518 | 1 | |
2519 | 1 | |
2520 | 1 |
Value | Count | Frequency (%) |
2827 | 1 | |
2826 | 1 | |
2825 | 1 | |
2824 | 1 | |
2823 | 1 | |
2822 | 1 | |
2821 | 1 | |
2820 | 1 | |
2819 | 1 | |
2818 | 1 |
역명
Text
MISSING
 
Distinct | 165 |
---|---|
Distinct (%) | 100.0% |
Missing | 5 |
Missing (%) | 2.9% |
Memory size | 1.5 KiB |
Value | Count | Frequency (%) |
여의도 | 1 | 0.6% |
청담 | 1 | 0.6% |
사가정 | 1 | 0.6% |
용마산 | 1 | 0.6% |
중곡 | 1 | 0.6% |
군자(7 | 1 | 0.6% |
어린이대공원 | 1 | 0.6% |
건대입구 | 1 | 0.6% |
뚝섬 | 1 | 0.6% |
유원지 | 1 | 0.6% |
Other values (161) | 161 |
Most occurring characters
Value | Count | Frequency (%) |
산 | 16 | 3.0% |
구 | 14 | 2.6% |
지 | 13 | 2.4% |
) | 12 | 2.3% |
( | 12 | 2.3% |
동 | 12 | 2.3% |
신 | 11 | 2.1% |
대 | 11 | 2.1% |
장 | 10 | 1.9% |
청 | 9 | 1.7% |
Other values (181) | 412 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 490 | |
Close Punctuation | 12 | 2.3% |
Open Punctuation | 12 | 2.3% |
Decimal Number | 12 | 2.3% |
Control | 6 | 1.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 16 | 3.3% |
구 | 14 | 2.9% |
지 | 13 | 2.7% |
동 | 12 | 2.4% |
신 | 11 | 2.2% |
대 | 11 | 2.2% |
장 | 10 | 2.0% |
청 | 9 | 1.8% |
화 | 9 | 1.8% |
원 | 8 | 1.6% |
Other values (172) | 377 |
Decimal Number
Value | Count | Frequency (%) |
5 | 4 | |
6 | 3 | |
7 | 2 | |
8 | 1 | 8.3% |
4 | 1 | 8.3% |
3 | 1 | 8.3% |
Close Punctuation
Value | Count | Frequency (%) |
) | 12 |
Open Punctuation
Value | Count | Frequency (%) |
( | 12 |
Control
Value | Count | Frequency (%) |
6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 490 | |
Common | 42 | 7.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 16 | 3.3% |
구 | 14 | 2.9% |
지 | 13 | 2.7% |
동 | 12 | 2.4% |
신 | 11 | 2.2% |
대 | 11 | 2.2% |
장 | 10 | 2.0% |
청 | 9 | 1.8% |
화 | 9 | 1.8% |
원 | 8 | 1.6% |
Other values (172) | 377 |
Common
Value | Count | Frequency (%) |
) | 12 | |
( | 12 | |
6 | ||
5 | 4 | 9.5% |
6 | 3 | 7.1% |
7 | 2 | 4.8% |
8 | 1 | 2.4% |
4 | 1 | 2.4% |
3 | 1 | 2.4% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 490 | |
ASCII | 42 | 7.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
산 | 16 | 3.3% |
구 | 14 | 2.9% |
지 | 13 | 2.7% |
동 | 12 | 2.4% |
신 | 11 | 2.2% |
대 | 11 | 2.2% |
장 | 10 | 2.0% |
청 | 9 | 1.8% |
화 | 9 | 1.8% |
원 | 8 | 1.6% |
Other values (172) | 377 |
ASCII
Value | Count | Frequency (%) |
) | 12 | |
( | 12 | |
6 | ||
5 | 4 | 9.5% |
6 | 3 | 7.1% |
7 | 2 | 4.8% |
8 | 1 | 2.4% |
4 | 1 | 2.4% |
3 | 1 | 2.4% |
개집표기
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 40 |
---|---|
Distinct (%) | 24.1% |
Missing | 4 |
Missing (%) | 2.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 32.60241 |
Minimum | 0 |
---|---|
Maximum | 1058 |
Zeros | 6 |
Zeros (%) | 3.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 8 |
Q1 | 11 |
median | 15.5 |
Q3 | 20 |
95-th percentile | 37.5 |
Maximum | 1058 |
Range | 1058 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 112.20279 |
---|---|
Coefficient of variation (CV) | 3.4415489 |
Kurtosis | 59.756939 |
Mean | 32.60241 |
Median Absolute Deviation (MAD) | 4.5 |
Skewness | 7.5748836 |
Sum | 5412 |
Variance | 12589.465 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
16 | 18 | 10.6% |
10 | 17 | 10.0% |
13 | 14 | 8.2% |
12 | 9 | 5.3% |
14 | 9 | 5.3% |
11 | 8 | 4.7% |
8 | 7 | 4.1% |
19 | 7 | 4.1% |
20 | 7 | 4.1% |
17 | 6 | 3.5% |
Other values (30) | 64 |
Value | Count | Frequency (%) |
0 | 6 | 3.5% |
6 | 1 | 0.6% |
7 | 1 | 0.6% |
8 | 7 | |
9 | 5 | 2.9% |
10 | 17 | |
11 | 8 | |
12 | 9 | |
13 | 14 | |
14 | 9 |
Value | Count | Frequency (%) |
1058 | 1 | |
816 | 1 | |
595 | 1 | |
237 | 1 | |
58 | 1 | |
53 | 1 | |
45 | 1 | |
39 | 1 | |
38 | 1 | |
36 | 1 |
플랩형
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 9 |
---|---|
Missing (%) | 5.3% |
Memory size | 1.5 KiB |
Unnamed: 5
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 53 |
---|---|
Missing (%) | 31.2% |
Memory size | 1.5 KiB |
Unnamed: 6
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 9 |
---|---|
Missing (%) | 5.3% |
Memory size | 1.5 KiB |
Unnamed: 7
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 116 |
---|---|
Missing (%) | 68.2% |
Memory size | 1.5 KiB |
Unnamed: 8
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 78 |
---|---|
Missing (%) | 45.9% |
Memory size | 1.5 KiB |
장애인/비상
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 121 |
---|---|
Missing (%) | 71.2% |
Memory size | 1.5 KiB |
Unnamed: 10
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 160 |
---|---|
Missing (%) | 94.1% |
Memory size | 1.5 KiB |
Unnamed: 11
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 126 |
---|---|
Missing (%) | 74.1% |
Memory size | 1.5 KiB |
Unnamed: 12
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 86 |
---|---|
Missing (%) | 50.6% |
Memory size | 1.5 KiB |
Unnamed: 13
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 86 |
---|---|
Missing (%) | 50.6% |
Memory size | 1.5 KiB |
개방형
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 163 |
---|---|
Missing (%) | 95.9% |
Memory size | 1.5 KiB |
Unnamed: 15
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 163 |
---|---|
Missing (%) | 95.9% |
Memory size | 1.5 KiB |
Unnamed: 16
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 163 |
---|---|
Missing (%) | 95.9% |
Memory size | 1.5 KiB |
EV
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 6 |
---|---|
Distinct (%) | 15.8% |
Missing | 132 |
Missing (%) | 77.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.5263158 |
Minimum | 0 |
---|---|
Maximum | 23 |
Zeros | 2 |
Zeros (%) | 1.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0.85 |
Q1 | 1 |
median | 2 |
Q3 | 2 |
95-th percentile | 8.15 |
Maximum | 23 |
Range | 23 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 3.9437727 |
---|---|
Coefficient of variation (CV) | 1.5610767 |
Kurtosis | 20.31887 |
Mean | 2.5263158 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 4.2276488 |
Sum | 96 |
Variance | 15.553343 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 16 | 9.4% |
1 | 16 | 9.4% |
8 | 2 | 1.2% |
0 | 2 | 1.2% |
23 | 1 | 0.6% |
9 | 1 | 0.6% |
(Missing) | 132 |
Value | Count | Frequency (%) |
0 | 2 | 1.2% |
1 | 16 | |
2 | 16 | |
8 | 2 | 1.2% |
9 | 1 | 0.6% |
23 | 1 | 0.6% |
Value | Count | Frequency (%) |
23 | 1 | 0.6% |
9 | 1 | 0.6% |
8 | 2 | 1.2% |
2 | 16 | |
1 | 16 | |
0 | 2 | 1.2% |
호선 | 외부 역번호 | 개집표기 | EV | |
---|---|---|---|---|
호선 | 1.000 | 1.000 | 1.000 | 1.000 |
외부 역번호 | 1.000 | 1.000 | NaN | NaN |
개집표기 | 1.000 | NaN | 1.000 | 1.000 |
EV | 1.000 | NaN | 1.000 | 1.000 |
외부 역번호 | 개집표기 | EV | 호선 | |
---|---|---|---|---|
외부 역번호 | 1.000 | 0.083 | 0.032 | 0.990 |
개집표기 | 0.083 | 1.000 | 0.450 | 0.991 |
EV | 0.032 | 0.450 | 1.000 | 0.926 |
호선 | 0.990 | 0.991 | 0.926 | 1.000 |
호선 | 외부 역번호 | 역명 | 개집표기 | 플랩형 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | 장애인/비상 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | 개방형 | Unnamed: 15 | Unnamed: 16 | EV | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | 1형 | 2형 | 3형 | 4형 | 5형 | 1형 | 2형 | 5형 | 6형 | 7형 | 1형 | 2형 | 3형 | <NA> |
1 | <NA> | <NA> | 전자처(랩실) | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> |
2 | <NA> | <NA> | 도봉중정비 | <NA> | 1 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> |
3 | 5호선 | <NA> | 방화기지 | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> |
4 | 5호선 | 2511 | 방화 | 10 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | NaN | 1 | 1 | NaN | NaN | NaN | <NA> |
5 | 5호선 | 2512 | 개화산 | 11 | 2 | NaN | 2 | 1 | 2 | NaN | NaN | NaN | 1 | 1 | NaN | NaN | NaN | 2 |
6 | 5호선 | 2513 | 김포공항 | 25 | 5 | 3 | 7 | NaN | 4 | NaN | NaN | NaN | 2 | 2 | NaN | NaN | NaN | 2 |
7 | 5호선 | 2514 | 송정 | 11 | 2 | 2 | 4 | NaN | 1 | NaN | NaN | NaN | 1 | 1 | NaN | NaN | NaN | <NA> |
8 | 5호선 | 2515 | 마곡 | 16 | 2 | 4 | 2 | NaN | 1 | NaN | NaN | NaN | 1 | 1 | 1 | 3 | 1 | <NA> |
9 | 5호선 | 2516 | 발산 | 29 | 7 | 5 | 7 | 2 | 4 | 1 | NaN | 1 | 1 | 1 | NaN | NaN | NaN | <NA> |
호선 | 외부 역번호 | 역명 | 개집표기 | 플랩형 | Unnamed: 5 | Unnamed: 6 | Unnamed: 7 | Unnamed: 8 | 장애인/비상 | Unnamed: 10 | Unnamed: 11 | Unnamed: 12 | Unnamed: 13 | 개방형 | Unnamed: 15 | Unnamed: 16 | EV | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
160 | 8호선 | 2820 | 장지 | 19 | 4 | 6 | 5 | NaN | 2 | 1 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | <NA> |
161 | 8호선 | 2821 | 복정 | 10 | 2 | 2 | 2 | 1 | 1 | NaN | NaN | NaN | 1 | 1 | NaN | NaN | NaN | <NA> |
162 | 8호선 | 2822 | 산성 | 13 | 3 | 2 | 5 | NaN | 1 | NaN | NaN | NaN | 1 | 1 | NaN | NaN | NaN | <NA> |
163 | 8호선 | 2823 | 남한산성입구 | 13 | 3 | 3 | 4 | 1 | 2 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0 |
164 | 8호선 | 2824 | 단대오거리 | 15 | 3 | 2 | 4 | 2 | 2 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2 |
165 | 8호선 | 2825 | 신흥 | 6 | 1 | 1 | 3 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> |
166 | 8호선 | 2826 | 수진 | 9 | 2 | 2 | 4 | NaN | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> |
167 | 8호선 | 2827 | 모란 | 8 | 1 | 1 | 2 | 1 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 2 |
168 | 8호선 | <NA> | 모란기지 | 0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> |
169 | 8호선 합계 | <NA> | <NA> | 237 | 46 | 44 | 71 | 13 | 26 | 4 | 0 | 4 | 10 | 10 | 0 | 0 | 0 | 9 |