Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 281 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 17.1 KiB |
Average record size in memory | 62.5 B |
Variable types
Numeric | 6 |
---|---|
Text | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-21720/F/1/datasetView.do |
연번 is highly overall correlated with 호선 and 1 other fields | High correlation |
호선 is highly overall correlated with 연번 and 1 other fields | High correlation |
역번호 is highly overall correlated with 연번 and 1 other fields | High correlation |
경로 is highly overall correlated with 장애 and 1 other fields | High correlation |
장애 is highly overall correlated with 경로 and 1 other fields | High correlation |
유공자 is highly overall correlated with 경로 and 1 other fields | High correlation |
연번 has unique values | Unique |
역번호 has unique values | Unique |
경로 has unique values | Unique |
장애 has unique values | Unique |
유공자 has unique values | Unique |
Reproduction
Analysis started | 2024-04-29 21:09:57.165289 |
---|---|
Analysis finished | 2024-04-29 21:10:00.599275 |
Duration | 3.43 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 281 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 141 |
Minimum | 1 |
---|---|
Maximum | 281 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 15 |
Q1 | 71 |
median | 141 |
Q3 | 211 |
95-th percentile | 267 |
Maximum | 281 |
Range | 280 |
Interquartile range (IQR) | 140 |
Descriptive statistics
Standard deviation | 81.261922 |
---|---|
Coefficient of variation (CV) | 0.57632569 |
Kurtosis | -1.2 |
Mean | 141 |
Median Absolute Deviation (MAD) | 70 |
Skewness | 0 |
Sum | 39621 |
Variance | 6603.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
186 | 1 | 0.4% |
192 | 1 | 0.4% |
191 | 1 | 0.4% |
190 | 1 | 0.4% |
189 | 1 | 0.4% |
188 | 1 | 0.4% |
187 | 1 | 0.4% |
185 | 1 | 0.4% |
194 | 1 | 0.4% |
Other values (271) | 271 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
281 | 1 | |
280 | 1 | |
279 | 1 | |
278 | 1 | |
277 | 1 | |
276 | 1 | |
275 | 1 | |
274 | 1 | |
273 | 1 | |
272 | 1 |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 2.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.683274 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 6 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.0202646 |
---|---|
Coefficient of variation (CV) | 0.4313787 |
Kurtosis | -1.1788697 |
Mean | 4.683274 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.11088965 |
Sum | 1316 |
Variance | 4.0814692 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
5 | 56 | |
7 | 51 | |
2 | 50 | |
6 | 37 | |
3 | 33 | |
4 | 26 | |
8 | 18 | 6.4% |
1 | 10 | 3.6% |
Value | Count | Frequency (%) |
1 | 10 | 3.6% |
2 | 50 | |
3 | 33 | |
4 | 26 | |
5 | 56 | |
6 | 37 | |
7 | 51 | |
8 | 18 | 6.4% |
Value | Count | Frequency (%) |
8 | 18 | 6.4% |
7 | 51 | |
6 | 37 | |
5 | 56 | |
4 | 26 | |
3 | 33 | |
2 | 50 | |
1 | 10 | 3.6% |
역번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 281 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1652.2206 |
Minimum | 150 |
---|---|
Maximum | 2828 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 205 |
Q1 | 319 |
median | 2532 |
Q3 | 2647 |
95-th percentile | 2814 |
Maximum | 2828 |
Range | 2678 |
Interquartile range (IQR) | 2328 |
Descriptive statistics
Standard deviation | 1173.3538 |
---|---|
Coefficient of variation (CV) | 0.71016775 |
Kurtosis | -1.8931459 |
Mean | 1652.2206 |
Median Absolute Deviation (MAD) | 226 |
Skewness | -0.30543982 |
Sum | 464274 |
Variance | 1376759.2 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
2622 | 1 | 0.4% |
2628 | 1 | 0.4% |
2627 | 1 | 0.4% |
2626 | 1 | 0.4% |
2625 | 1 | 0.4% |
2624 | 1 | 0.4% |
2623 | 1 | 0.4% |
2621 | 1 | 0.4% |
2630 | 1 | 0.4% |
Other values (271) | 271 |
Value | Count | Frequency (%) |
150 | 1 | |
151 | 1 | |
152 | 1 | |
153 | 1 | |
154 | 1 | |
155 | 1 | |
156 | 1 | |
157 | 1 | |
158 | 1 | |
159 | 1 |
Value | Count | Frequency (%) |
2828 | 1 | |
2827 | 1 | |
2826 | 1 | |
2825 | 1 | |
2824 | 1 | |
2823 | 1 | |
2822 | 1 | |
2821 | 1 | |
2820 | 1 | |
2819 | 1 |
역명
Text
Distinct | 248 |
---|---|
Distinct (%) | 88.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Value | Count | Frequency (%) |
종로3가 | 3 | 1.1% |
동대문역사문화공원(ddp | 3 | 1.1% |
을지로4가 | 2 | 0.7% |
사당 | 2 | 0.7% |
서울역 | 2 | 0.7% |
대림(구로구청 | 2 | 0.7% |
불광 | 2 | 0.7% |
교대(법원.검찰청 | 2 | 0.7% |
노원 | 2 | 0.7% |
잠실(송파구청 | 2 | 0.7% |
Other values (238) | 259 |
Most occurring characters
Value | Count | Frequency (%) |
( | 64 | 5.2% |
) | 64 | 5.2% |
구 | 50 | 4.0% |
대 | 49 | 3.9% |
동 | 35 | 2.8% |
청 | 32 | 2.6% |
신 | 26 | 2.1% |
원 | 23 | 1.9% |
산 | 22 | 1.8% |
문 | 20 | 1.6% |
Other values (233) | 856 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1092 | |
Open Punctuation | 64 | 5.2% |
Close Punctuation | 64 | 5.2% |
Uppercase Letter | 9 | 0.7% |
Decimal Number | 8 | 0.6% |
Other Punctuation | 4 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 50 | 4.6% |
대 | 49 | 4.5% |
동 | 35 | 3.2% |
청 | 32 | 2.9% |
신 | 26 | 2.4% |
원 | 23 | 2.1% |
산 | 22 | 2.0% |
문 | 20 | 1.8% |
입 | 19 | 1.7% |
로 | 16 | 1.5% |
Other values (224) | 800 |
Decimal Number
Value | Count | Frequency (%) |
3 | 5 | |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Uppercase Letter
Value | Count | Frequency (%) |
D | 6 | |
P | 3 |
Other Punctuation
Value | Count | Frequency (%) |
. | 3 | |
· | 1 | 25.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 64 |
Close Punctuation
Value | Count | Frequency (%) |
) | 64 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1092 | |
Common | 140 | 11.3% |
Latin | 9 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 50 | 4.6% |
대 | 49 | 4.5% |
동 | 35 | 3.2% |
청 | 32 | 2.9% |
신 | 26 | 2.4% |
원 | 23 | 2.1% |
산 | 22 | 2.0% |
문 | 20 | 1.8% |
입 | 19 | 1.7% |
로 | 16 | 1.5% |
Other values (224) | 800 |
Common
Value | Count | Frequency (%) |
( | 64 | |
) | 64 | |
3 | 5 | 3.6% |
. | 3 | 2.1% |
4 | 2 | 1.4% |
5 | 1 | 0.7% |
· | 1 | 0.7% |
Latin
Value | Count | Frequency (%) |
D | 6 | |
P | 3 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1092 | |
ASCII | 148 | 11.9% |
None | 1 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 64 | |
) | 64 | |
D | 6 | 4.1% |
3 | 5 | 3.4% |
P | 3 | 2.0% |
. | 3 | 2.0% |
4 | 2 | 1.4% |
5 | 1 | 0.7% |
Hangul
Value | Count | Frequency (%) |
구 | 50 | 4.6% |
대 | 49 | 4.5% |
동 | 35 | 3.2% |
청 | 32 | 2.9% |
신 | 26 | 2.4% |
원 | 23 | 2.1% |
산 | 22 | 2.0% |
문 | 20 | 1.8% |
입 | 19 | 1.7% |
로 | 16 | 1.5% |
Other values (224) | 800 |
None
Value | Count | Frequency (%) |
· | 1 |
경로
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 281 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 607726.54 |
Minimum | 6178 |
---|---|
Maximum | 2489647 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 6178 |
---|---|
5-th percentile | 153982 |
Q1 | 322108 |
median | 519002 |
Q3 | 742675 |
95-th percentile | 1458120 |
Maximum | 2489647 |
Range | 2483469 |
Interquartile range (IQR) | 420567 |
Descriptive statistics
Standard deviation | 422099.26 |
---|---|
Coefficient of variation (CV) | 0.69455459 |
Kurtosis | 4.3043538 |
Mean | 607726.54 |
Median Absolute Deviation (MAD) | 204652 |
Skewness | 1.7965571 |
Sum | 1.7077116 × 108 |
Variance | 1.7816778 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1460872 | 1 | 0.4% |
700301 | 1 | 0.4% |
322108 | 1 | 0.4% |
605707 | 1 | 0.4% |
367409 | 1 | 0.4% |
377835 | 1 | 0.4% |
180386 | 1 | 0.4% |
318100 | 1 | 0.4% |
593026 | 1 | 0.4% |
150345 | 1 | 0.4% |
Other values (271) | 271 |
Value | Count | Frequency (%) |
6178 | 1 | |
62778 | 1 | |
68680 | 1 | |
76384 | 1 | |
80199 | 1 | |
86231 | 1 | |
90356 | 1 | |
93552 | 1 | |
107244 | 1 | |
112234 | 1 |
Value | Count | Frequency (%) |
2489647 | 1 | |
2452804 | 1 | |
2358285 | 1 | |
2303767 | 1 | |
2188055 | 1 | |
1820075 | 1 | |
1792211 | 1 | |
1586915 | 1 | |
1536611 | 1 | |
1531379 | 1 |
장애
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 281 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 116968.67 |
Minimum | 1202 |
---|---|
Maximum | 406955 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1202 |
---|---|
5-th percentile | 27844 |
Q1 | 62634 |
median | 97350 |
Q3 | 153572 |
95-th percentile | 282633 |
Maximum | 406955 |
Range | 405753 |
Interquartile range (IQR) | 90938 |
Descriptive statistics
Standard deviation | 78933.472 |
---|---|
Coefficient of variation (CV) | 0.67482579 |
Kurtosis | 1.9671453 |
Mean | 116968.67 |
Median Absolute Deviation (MAD) | 38748 |
Skewness | 1.3838752 |
Sum | 32868195 |
Variance | 6.230493 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
388272 | 1 | 0.4% |
121000 | 1 | 0.4% |
58602 | 1 | 0.4% |
109169 | 1 | 0.4% |
61623 | 1 | 0.4% |
56009 | 1 | 0.4% |
27995 | 1 | 0.4% |
63926 | 1 | 0.4% |
167030 | 1 | 0.4% |
29547 | 1 | 0.4% |
Other values (271) | 271 |
Value | Count | Frequency (%) |
1202 | 1 | |
10298 | 1 | |
12587 | 1 | |
12820 | 1 | |
13377 | 1 | |
15270 | 1 | |
16658 | 1 | |
17609 | 1 | |
23290 | 1 | |
23566 | 1 |
Value | Count | Frequency (%) |
406955 | 1 | |
405814 | 1 | |
388272 | 1 | |
383345 | 1 | |
364559 | 1 | |
353724 | 1 | |
342727 | 1 | |
321227 | 1 | |
318872 | 1 | |
310723 | 1 |
유공자
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 281 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7504.79 |
Minimum | 76 |
---|---|
Maximum | 34588 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 76 |
---|---|
5-th percentile | 1702 |
Q1 | 3583 |
median | 6153 |
Q3 | 9509 |
95-th percentile | 19721 |
Maximum | 34588 |
Range | 34512 |
Interquartile range (IQR) | 5926 |
Descriptive statistics
Standard deviation | 5452.9029 |
---|---|
Coefficient of variation (CV) | 0.72658967 |
Kurtosis | 3.5704153 |
Mean | 7504.79 |
Median Absolute Deviation (MAD) | 2951 |
Skewness | 1.656351 |
Sum | 2108846 |
Variance | 29734150 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
29791 | 1 | 0.4% |
5760 | 1 | 0.4% |
3471 | 1 | 0.4% |
7634 | 1 | 0.4% |
6052 | 1 | 0.4% |
5659 | 1 | 0.4% |
2863 | 1 | 0.4% |
4837 | 1 | 0.4% |
6141 | 1 | 0.4% |
1722 | 1 | 0.4% |
Other values (271) | 271 |
Value | Count | Frequency (%) |
76 | 1 | |
282 | 1 | |
550 | 1 | |
929 | 1 | |
1025 | 1 | |
1189 | 1 | |
1234 | 1 | |
1291 | 1 | |
1336 | 1 | |
1377 | 1 |
Value | Count | Frequency (%) |
34588 | 1 | |
29791 | 1 | |
25689 | 1 | |
24058 | 1 | |
23967 | 1 | |
23192 | 1 | |
22791 | 1 | |
22076 | 1 | |
22069 | 1 | |
21137 | 1 |
연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.923 | 0.931 | 0.531 | 0.422 | 0.492 |
호선 | 0.923 | 1.000 | 0.995 | 0.530 | 0.403 | 0.465 |
역번호 | 0.931 | 0.995 | 1.000 | 0.347 | 0.340 | 0.378 |
경로 | 0.531 | 0.530 | 0.347 | 1.000 | 0.918 | 0.884 |
장애 | 0.422 | 0.403 | 0.340 | 0.918 | 1.000 | 0.880 |
유공자 | 0.492 | 0.465 | 0.378 | 0.884 | 0.880 | 1.000 |
연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.988 | 1.000 | -0.372 | -0.330 | -0.382 |
호선 | 0.988 | 1.000 | 0.988 | -0.343 | -0.296 | -0.354 |
역번호 | 1.000 | 0.988 | 1.000 | -0.372 | -0.330 | -0.382 |
경로 | -0.372 | -0.343 | -0.372 | 1.000 | 0.938 | 0.907 |
장애 | -0.330 | -0.296 | -0.330 | 0.938 | 1.000 | 0.898 |
유공자 | -0.382 | -0.354 | -0.382 | 0.907 | 0.898 | 1.000 |
연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|---|
0 | 1 | 1 | 150 | 서울역 | 1460872 | 388272 | 29791 |
1 | 2 | 1 | 151 | 시청 | 565414 | 136378 | 11388 |
2 | 3 | 1 | 152 | 종각 | 1032047 | 219725 | 16035 |
3 | 4 | 1 | 153 | 종로3가 | 2489647 | 405814 | 34588 |
4 | 5 | 1 | 154 | 종로5가 | 2303767 | 321227 | 23192 |
5 | 6 | 1 | 155 | 동대문 | 1036170 | 199931 | 9011 |
6 | 7 | 1 | 156 | 신설동 | 996356 | 180221 | 9934 |
7 | 8 | 1 | 157 | 제기동 | 2452804 | 307701 | 18213 |
8 | 9 | 1 | 158 | 청량리(서울시립대입구) | 2358285 | 364559 | 23967 |
9 | 10 | 1 | 159 | 동묘앞 | 1350866 | 251952 | 16072 |
연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|---|
271 | 272 | 8 | 2819 | 문정 | 454496 | 98300 | 6240 |
272 | 273 | 8 | 2820 | 장지 | 606606 | 135750 | 8783 |
273 | 274 | 8 | 2821 | 복정 | 370027 | 80973 | 5505 |
274 | 275 | 8 | 2822 | 산성 | 298976 | 49335 | 3397 |
275 | 276 | 8 | 2823 | 남한산성입구(성남법원.검찰청) | 651239 | 139509 | 6359 |
276 | 277 | 8 | 2824 | 단대오거리 | 562345 | 133288 | 4211 |
277 | 278 | 8 | 2825 | 신흥 | 306458 | 73008 | 2452 |
278 | 279 | 8 | 2826 | 수진 | 339066 | 77884 | 2871 |
279 | 280 | 8 | 2827 | 모란 | 443197 | 85920 | 4251 |
280 | 281 | 8 | 2828 | 남위례 | 6178 | 1202 | 76 |