Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 275 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 16.8 KiB |
Average record size in memory | 62.5 B |
Variable types
Numeric | 6 |
---|---|
Text | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-21720/F/1/datasetView.do |
연번 is highly overall correlated with 호선 and 1 other fields | High correlation |
호선 is highly overall correlated with 연번 and 1 other fields | High correlation |
역번호 is highly overall correlated with 연번 and 1 other fields | High correlation |
경로 is highly overall correlated with 장애 and 1 other fields | High correlation |
장애 is highly overall correlated with 경로 and 1 other fields | High correlation |
유공자 is highly overall correlated with 경로 and 1 other fields | High correlation |
연번 has unique values | Unique |
역번호 has unique values | Unique |
경로 has unique values | Unique |
장애 has unique values | Unique |
Reproduction
Analysis started | 2024-04-29 21:10:10.012928 |
---|---|
Analysis finished | 2024-04-29 21:10:13.375322 |
Duration | 3.36 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 138 |
Minimum | 1 |
---|---|
Maximum | 275 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 14.7 |
Q1 | 69.5 |
median | 138 |
Q3 | 206.5 |
95-th percentile | 261.3 |
Maximum | 275 |
Range | 274 |
Interquartile range (IQR) | 137 |
Descriptive statistics
Standard deviation | 79.529869 |
---|---|
Coefficient of variation (CV) | 0.5763034 |
Kurtosis | -1.2 |
Mean | 138 |
Median Absolute Deviation (MAD) | 69 |
Skewness | 0 |
Sum | 37950 |
Variance | 6325 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
183 | 1 | 0.4% |
189 | 1 | 0.4% |
188 | 1 | 0.4% |
187 | 1 | 0.4% |
186 | 1 | 0.4% |
185 | 1 | 0.4% |
184 | 1 | 0.4% |
182 | 1 | 0.4% |
174 | 1 | 0.4% |
Other values (265) | 265 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
275 | 1 | |
274 | 1 | |
273 | 1 | |
272 | 1 | |
271 | 1 | |
270 | 1 | |
269 | 1 | |
268 | 1 | |
267 | 1 | |
266 | 1 |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.6654545 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 6 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.0318826 |
---|---|
Coefficient of variation (CV) | 0.43551653 |
Kurtosis | -1.2116408 |
Mean | 4.6654545 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.10095916 |
Sum | 1283 |
Variance | 4.1285468 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
5 | 51 | |
7 | 51 | |
2 | 50 | |
6 | 37 | |
3 | 33 | |
4 | 26 | |
8 | 17 | 6.2% |
1 | 10 | 3.6% |
Value | Count | Frequency (%) |
1 | 10 | 3.6% |
2 | 50 | |
3 | 33 | |
4 | 26 | |
5 | 51 | |
6 | 37 | |
7 | 51 | |
8 | 17 | 6.2% |
Value | Count | Frequency (%) |
8 | 17 | 6.2% |
7 | 51 | |
6 | 37 | |
5 | 51 | |
4 | 26 | |
3 | 33 | |
2 | 50 | |
1 | 10 | 3.6% |
역번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1631.3673 |
Minimum | 150 |
---|---|
Maximum | 2827 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 204.7 |
Q1 | 317.5 |
median | 2529 |
Q3 | 2647.5 |
95-th percentile | 2813.3 |
Maximum | 2827 |
Range | 2677 |
Interquartile range (IQR) | 2330 |
Descriptive statistics
Standard deviation | 1177.3932 |
---|---|
Coefficient of variation (CV) | 0.72172173 |
Kurtosis | -1.9158526 |
Mean | 1631.3673 |
Median Absolute Deviation (MAD) | 231 |
Skewness | -0.26764389 |
Sum | 448626 |
Variance | 1386254.8 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
2624 | 1 | 0.4% |
2630 | 1 | 0.4% |
2629 | 1 | 0.4% |
2628 | 1 | 0.4% |
2627 | 1 | 0.4% |
2626 | 1 | 0.4% |
2625 | 1 | 0.4% |
2623 | 1 | 0.4% |
2614 | 1 | 0.4% |
Other values (265) | 265 |
Value | Count | Frequency (%) |
150 | 1 | |
151 | 1 | |
152 | 1 | |
153 | 1 | |
154 | 1 | |
155 | 1 | |
156 | 1 | |
157 | 1 | |
158 | 1 | |
159 | 1 |
Value | Count | Frequency (%) |
2827 | 1 | |
2826 | 1 | |
2825 | 1 | |
2824 | 1 | |
2823 | 1 | |
2822 | 1 | |
2821 | 1 | |
2820 | 1 | |
2819 | 1 | |
2818 | 1 |
역명
Text
Distinct | 242 |
---|---|
Distinct (%) | 88.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Value | Count | Frequency (%) |
종로3가 | 3 | 1.1% |
동대문역사문화공원 | 3 | 1.1% |
천호(풍납토성 | 2 | 0.7% |
사당 | 2 | 0.7% |
서울역 | 2 | 0.7% |
영등포구청 | 2 | 0.7% |
대림(구로구청 | 2 | 0.7% |
불광 | 2 | 0.7% |
약수 | 2 | 0.7% |
오금 | 2 | 0.7% |
Other values (232) | 253 |
Most occurring characters
Value | Count | Frequency (%) |
) | 58 | 4.9% |
( | 58 | 4.9% |
구 | 50 | 4.2% |
대 | 49 | 4.1% |
동 | 35 | 3.0% |
청 | 31 | 2.6% |
신 | 25 | 2.1% |
원 | 22 | 1.9% |
산 | 20 | 1.7% |
문 | 20 | 1.7% |
Other values (226) | 818 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1059 | |
Close Punctuation | 58 | 4.9% |
Open Punctuation | 58 | 4.9% |
Decimal Number | 8 | 0.7% |
Other Punctuation | 3 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 50 | 4.7% |
대 | 49 | 4.6% |
동 | 35 | 3.3% |
청 | 31 | 2.9% |
신 | 25 | 2.4% |
원 | 22 | 2.1% |
산 | 20 | 1.9% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
로 | 16 | 1.5% |
Other values (220) | 772 |
Decimal Number
Value | Count | Frequency (%) |
3 | 5 | |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 58 |
Open Punctuation
Value | Count | Frequency (%) |
( | 58 |
Other Punctuation
Value | Count | Frequency (%) |
. | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1059 | |
Common | 127 | 10.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 50 | 4.7% |
대 | 49 | 4.6% |
동 | 35 | 3.3% |
청 | 31 | 2.9% |
신 | 25 | 2.4% |
원 | 22 | 2.1% |
산 | 20 | 1.9% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
로 | 16 | 1.5% |
Other values (220) | 772 |
Common
Value | Count | Frequency (%) |
) | 58 | |
( | 58 | |
3 | 5 | 3.9% |
. | 3 | 2.4% |
4 | 2 | 1.6% |
5 | 1 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1059 | |
ASCII | 127 | 10.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
) | 58 | |
( | 58 | |
3 | 5 | 3.9% |
. | 3 | 2.4% |
4 | 2 | 1.6% |
5 | 1 | 0.8% |
Hangul
Value | Count | Frequency (%) |
구 | 50 | 4.7% |
대 | 49 | 4.6% |
동 | 35 | 3.3% |
청 | 31 | 2.9% |
신 | 25 | 2.4% |
원 | 22 | 2.1% |
산 | 20 | 1.9% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
로 | 16 | 1.5% |
Other values (220) | 772 |
경로
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 818523.21 |
Minimum | 71898 |
---|---|
Maximum | 3972545 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 71898 |
---|---|
5-th percentile | 179931.9 |
Q1 | 434132.5 |
median | 669873 |
Q3 | 998479 |
95-th percentile | 2069412.7 |
Maximum | 3972545 |
Range | 3900647 |
Interquartile range (IQR) | 564346.5 |
Descriptive statistics
Standard deviation | 592558.66 |
---|---|
Coefficient of variation (CV) | 0.72393629 |
Kurtosis | 5.6975056 |
Mean | 818523.21 |
Median Absolute Deviation (MAD) | 268446 |
Skewness | 1.9993698 |
Sum | 2.2509388 × 108 |
Variance | 3.5112576 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2091960 | 1 | 0.4% |
234594 | 1 | 0.4% |
203425 | 1 | 0.4% |
174796 | 1 | 0.4% |
423483 | 1 | 0.4% |
761679 | 1 | 0.4% |
509912 | 1 | 0.4% |
617413 | 1 | 0.4% |
429902 | 1 | 0.4% |
217837 | 1 | 0.4% |
Other values (265) | 265 |
Value | Count | Frequency (%) |
71898 | 1 | |
84971 | 1 | |
88631 | 1 | |
91292 | 1 | |
101958 | 1 | |
107252 | 1 | |
109672 | 1 | |
137457 | 1 | |
153453 | 1 | |
157787 | 1 |
Value | Count | Frequency (%) |
3972545 | 1 | |
3566296 | 1 | |
3330151 | 1 | |
2926950 | 1 | |
2858251 | 1 | |
2433535 | 1 | |
2258915 | 1 | |
2251765 | 1 | |
2241971 | 1 | |
2157517 | 1 |
장애
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 165029.78 |
Minimum | 14527 |
---|---|
Maximum | 607798 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 14527 |
---|---|
5-th percentile | 39066.3 |
Q1 | 90154.5 |
median | 138392 |
Q3 | 214224 |
95-th percentile | 397562.5 |
Maximum | 607798 |
Range | 593271 |
Interquartile range (IQR) | 124069.5 |
Descriptive statistics
Standard deviation | 111974.13 |
---|---|
Coefficient of variation (CV) | 0.67850861 |
Kurtosis | 2.3126618 |
Mean | 165029.78 |
Median Absolute Deviation (MAD) | 54356 |
Skewness | 1.4548405 |
Sum | 45383190 |
Variance | 1.2538205 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
590159 | 1 | 0.4% |
41335 | 1 | 0.4% |
40202 | 1 | 0.4% |
39591 | 1 | 0.4% |
83234 | 1 | 0.4% |
149219 | 1 | 0.4% |
89951 | 1 | 0.4% |
84036 | 1 | 0.4% |
95304 | 1 | 0.4% |
36122 | 1 | 0.4% |
Other values (265) | 265 |
Value | Count | Frequency (%) |
14527 | 1 | |
15602 | 1 | |
18708 | 1 | |
18856 | 1 | |
18898 | 1 | |
20654 | 1 | |
22913 | 1 | |
25659 | 1 | |
28399 | 1 | |
32815 | 1 |
Value | Count | Frequency (%) |
607798 | 1 | |
590159 | 1 | |
570098 | 1 | |
519538 | 1 | |
518170 | 1 | |
496421 | 1 | |
493901 | 1 | |
462315 | 1 | |
451972 | 1 | |
446438 | 1 |
유공자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 272 |
---|---|
Distinct (%) | 98.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12232.651 |
Minimum | 647 |
---|---|
Maximum | 66278 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 647 |
---|---|
5-th percentile | 2323.6 |
Q1 | 5729 |
median | 9668 |
Q3 | 15180.5 |
95-th percentile | 33131.8 |
Maximum | 66278 |
Range | 65631 |
Interquartile range (IQR) | 9451.5 |
Descriptive statistics
Standard deviation | 10088.507 |
---|---|
Coefficient of variation (CV) | 0.82471959 |
Kurtosis | 6.7311839 |
Mean | 12232.651 |
Median Absolute Deviation (MAD) | 4514 |
Skewness | 2.2251708 |
Sum | 3363979 |
Variance | 1.0177797 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
10534 | 2 | 0.7% |
4078 | 2 | 0.7% |
4438 | 2 | 0.7% |
7735 | 1 | 0.4% |
9855 | 1 | 0.4% |
6967 | 1 | 0.4% |
9654 | 1 | 0.4% |
8052 | 1 | 0.4% |
6105 | 1 | 0.4% |
4679 | 1 | 0.4% |
Other values (262) | 262 |
Value | Count | Frequency (%) |
647 | 1 | |
817 | 1 | |
821 | 1 | |
1343 | 1 | |
1609 | 1 | |
1694 | 1 | |
1702 | 1 | |
1724 | 1 | |
1834 | 1 | |
1958 | 1 |
Value | Count | Frequency (%) |
66278 | 1 | |
62324 | 1 | |
55719 | 1 | |
50744 | 1 | |
47798 | 1 | |
41992 | 1 | |
40322 | 1 | |
36687 | 1 | |
36083 | 1 | |
35014 | 1 |
연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.918 | 0.921 | 0.438 | 0.494 | 0.498 |
호선 | 0.918 | 1.000 | 0.996 | 0.493 | 0.450 | 0.474 |
역번호 | 0.921 | 0.996 | 1.000 | 0.386 | 0.370 | 0.397 |
경로 | 0.438 | 0.493 | 0.386 | 1.000 | 0.925 | 0.903 |
장애 | 0.494 | 0.450 | 0.370 | 0.925 | 1.000 | 0.920 |
유공자 | 0.498 | 0.474 | 0.397 | 0.903 | 0.920 | 1.000 |
연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.988 | 1.000 | -0.411 | -0.348 | -0.389 |
호선 | 0.988 | 1.000 | 0.988 | -0.385 | -0.318 | -0.361 |
역번호 | 1.000 | 0.988 | 1.000 | -0.411 | -0.348 | -0.389 |
경로 | -0.411 | -0.385 | -0.411 | 1.000 | 0.936 | 0.907 |
장애 | -0.348 | -0.318 | -0.348 | 0.936 | 1.000 | 0.907 |
유공자 | -0.389 | -0.361 | -0.389 | 0.907 | 0.907 | 1.000 |
연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|---|
0 | 1 | 1 | 150 | 서울역 | 2091960 | 590159 | 62324 |
1 | 2 | 1 | 151 | 시청 | 1118281 | 237822 | 25435 |
2 | 3 | 1 | 152 | 종각 | 1653008 | 357227 | 33775 |
3 | 4 | 1 | 153 | 종로3가 | 3972545 | 607798 | 66278 |
4 | 5 | 1 | 154 | 종로5가 | 2926950 | 430773 | 36687 |
5 | 6 | 1 | 155 | 동대문 | 1366758 | 280563 | 16416 |
6 | 7 | 1 | 156 | 신설동 | 1231602 | 254227 | 15959 |
7 | 8 | 1 | 157 | 제기동 | 3566296 | 446438 | 30965 |
8 | 9 | 1 | 158 | 청량리(서울시립대입구) | 3330151 | 519538 | 41992 |
9 | 10 | 1 | 159 | 동묘앞 | 1512766 | 303865 | 21589 |
연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|---|
265 | 266 | 8 | 2818 | 가락시장 | 618610 | 101418 | 8334 |
266 | 267 | 8 | 2819 | 문정 | 541887 | 129273 | 8813 |
267 | 268 | 8 | 2820 | 장지 | 741629 | 184335 | 13726 |
268 | 269 | 8 | 2821 | 복정 | 486905 | 105811 | 9523 |
269 | 270 | 8 | 2822 | 산성 | 334663 | 77567 | 3708 |
270 | 271 | 8 | 2823 | 남한산성입구(성남법원.검찰청) | 769622 | 172685 | 10112 |
271 | 272 | 8 | 2824 | 단대오거리 | 656667 | 173986 | 5087 |
272 | 273 | 8 | 2825 | 신흥 | 424513 | 105959 | 4416 |
273 | 274 | 8 | 2826 | 수진 | 492646 | 113934 | 3835 |
274 | 275 | 8 | 2827 | 모란 | 541359 | 112723 | 6928 |