Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 275 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 16.8 KiB |
Average record size in memory | 62.5 B |
Variable types
Numeric | 6 |
---|---|
Text | 1 |
Dataset
Description | File download |
---|---|
Author | Seoul Metro (서울교통공사) |
URL | https://data.seoul.go.kr/dataList/OA-21720/F/1/datasetView.do |
연번 is highly overall correlated with 호선 and 1 other fields | High correlation |
호선 is highly overall correlated with 연번 and 1 other fields | High correlation |
역번호 is highly overall correlated with 연번 and 1 other fields | High correlation |
경로 is highly overall correlated with 장애 and 1 other fields | High correlation |
장애 is highly overall correlated with 경로 and 1 other fields | High correlation |
유공자 is highly overall correlated with 경로 and 1 other fields | High correlation |
연번 has unique values | Unique |
역번호 has unique values | Unique |
경로 has unique values | Unique |
장애 has unique values | Unique |
Reproduction
Analysis started | 2024-04-29 21:10:22.725514 |
---|---|
Analysis finished | 2024-04-29 21:10:26.109683 |
Duration | 3.38 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
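The Duration row follows directly from the two timestamps above it. A minimal sketch of that arithmetic, using only the values shown in the Reproduction table (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2024-04-29 21:10:22.725514")
finished = datetime.fromisoformat("2024-04-29 21:10:26.109683")

# Elapsed wall-clock time of the profiling run, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```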
연번
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 138 |
Minimum | 1 |
---|---|
Maximum | 275 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 14.7 |
Q1 | 69.5 |
median | 138 |
Q3 | 206.5 |
95-th percentile | 261.3 |
Maximum | 275 |
Range | 274 |
Interquartile range (IQR) | 137 |
Descriptive statistics
Standard deviation | 79.529869 |
---|---|
Coefficient of variation (CV) | 0.5763034 |
Kurtosis | -1.2 |
Mean | 138 |
Median Absolute Deviation (MAD) | 69 |
Skewness | 0 |
Sum | 37950 |
Variance | 6325 |
Monotonicity | Strictly increasing |
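The figures for 연번 can be reproduced by hand, which helps when reading the same tables for the other variables. The sketch below assumes that 연번 is exactly the sequence 1..275 (consistent with 275 unique, strictly increasing values, minimum 1 and maximum 275), that variance and standard deviation use the sample (n - 1) form, and that percentiles use linear interpolation. The `percentile` helper is illustrative, not part of ydata-profiling.

```python
import statistics

def percentile(sorted_values, q):
    """Linear-interpolation percentile (q in [0, 1]) over pre-sorted data."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 276))  # assumption: 연번 is exactly 1..275

total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)   # sample variance (n - 1)
std = statistics.stdev(values)           # sample standard deviation
cv = std / mean                          # coefficient of variation
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

p5 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)
iqr = q3 - q1

print(total, mean, median, variance, round(std, 6), round(cv, 7), mad)
print(round(p5, 1), q1, q3, round(p95, 1), iqr)
```

Under these assumptions the results agree with the Quantile statistics and Descriptive statistics tables for 연번.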
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
183 | 1 | 0.4% |
189 | 1 | 0.4% |
188 | 1 | 0.4% |
187 | 1 | 0.4% |
186 | 1 | 0.4% |
185 | 1 | 0.4% |
184 | 1 | 0.4% |
182 | 1 | 0.4% |
174 | 1 | 0.4% |
Other values (265) | 265 | 96.4% |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
2 | 1 | 0.4% |
3 | 1 | 0.4% |
4 | 1 | 0.4% |
5 | 1 | 0.4% |
6 | 1 | 0.4% |
7 | 1 | 0.4% |
8 | 1 | 0.4% |
9 | 1 | 0.4% |
10 | 1 | 0.4% |
Value | Count | Frequency (%) |
275 | 1 | 0.4% |
274 | 1 | 0.4% |
273 | 1 | 0.4% |
272 | 1 | 0.4% |
271 | 1 | 0.4% |
270 | 1 | 0.4% |
269 | 1 | 0.4% |
268 | 1 | 0.4% |
267 | 1 | 0.4% |
266 | 1 | 0.4% |
호선
Real number (ℝ)
HIGH CORRELATION
Distinct | 8 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.6654545 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 6 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.0318826 |
---|---|
Coefficient of variation (CV) | 0.43551653 |
Kurtosis | -1.2116408 |
Mean | 4.6654545 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.10095916 |
Sum | 1283 |
Variance | 4.1285468 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
5 | 51 | 18.5% |
7 | 51 | 18.5% |
2 | 50 | 18.2% |
6 | 37 | 13.5% |
3 | 33 | 12.0% |
4 | 26 | 9.5% |
8 | 17 | 6.2% |
1 | 10 | 3.6% |
Value | Count | Frequency (%) |
1 | 10 | 3.6% |
2 | 50 | 18.2% |
3 | 33 | 12.0% |
4 | 26 | 9.5% |
5 | 51 | 18.5% |
6 | 37 | 13.5% |
7 | 51 | 18.5% |
8 | 17 | 6.2% |
Value | Count | Frequency (%) |
8 | 17 | 6.2% |
7 | 51 | 18.5% |
6 | 37 | 13.5% |
5 | 51 | 18.5% |
4 | 26 | 9.5% |
3 | 33 | 12.0% |
2 | 50 | 18.2% |
1 | 10 | 3.6% |
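Because 호선 has only eight distinct values, its summary statistics can be rebuilt from the value counts alone. The following sketch takes the counts from the tables above and derives the frequency percentages, sum, mean, and variance; it assumes the report uses the sample (n - 1) variance, and the variable names are illustrative.

```python
# Station counts per line number (호선), copied from the value tables.
counts = {1: 10, 2: 50, 3: 33, 4: 26, 5: 51, 6: 37, 7: 51, 8: 17}

n = sum(counts.values())  # number of observations

# Share of rows per line, rounded to one decimal like the report.
frequency_pct = {line: round(100 * c / n, 1) for line, c in counts.items()}

# Weighted sum and mean of the line numbers.
total = sum(line * c for line, c in counts.items())
mean = total / n

# Sample variance (n - 1 in the denominator), an assumption about the report.
variance = sum(c * (line - mean) ** 2 for line, c in counts.items()) / (n - 1)

print(n, total, round(mean, 7), round(variance, 7))
print(frequency_pct)
```

The same approach applies to any low-cardinality column: the value table carries enough information to recompute every moment-based statistic.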
역번호
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1631.3673 |
Minimum | 150 |
---|---|
Maximum | 2827 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 204.7 |
Q1 | 317.5 |
median | 2529 |
Q3 | 2647.5 |
95-th percentile | 2813.3 |
Maximum | 2827 |
Range | 2677 |
Interquartile range (IQR) | 2330 |
Descriptive statistics
Standard deviation | 1177.3932 |
---|---|
Coefficient of variation (CV) | 0.72172173 |
Kurtosis | -1.9158526 |
Mean | 1631.3673 |
Median Absolute Deviation (MAD) | 231 |
Skewness | -0.26764389 |
Sum | 448626 |
Variance | 1386254.8 |
Monotonicity | Strictly increasing |
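The Monotonicity row distinguishes "Strictly increasing" (연번, 역번호) from "Increasing" (호선, where repeated values are allowed) and "Not monotonic". A minimal sketch of such a classification follows; the function name and the two decreasing labels are assumptions made for symmetry, and the short input lists are illustrative excerpts rather than the full columns.

```python
def monotonicity(values):
    """Classify a sequence by comparing each element with its successor."""
    pairs = list(zip(values, values[1:]))
    if all(a < b for a, b in pairs):
        return "Strictly increasing"
    if all(a <= b for a, b in pairs):
        return "Increasing"
    if all(a > b for a, b in pairs):
        return "Strictly decreasing"
    if all(a >= b for a, b in pairs):
        return "Decreasing"
    return "Not monotonic"

station_numbers = [150, 151, 152, 153, 154]   # 역번호: no repeats, always rising
line_numbers = [1, 1, 1, 2, 2]                # 호선: repeats within a line
senior_rides = [1964968, 1077904, 1488014]    # 경로: goes down, then up

print(monotonicity(station_numbers))
print(monotonicity(line_numbers))
print(monotonicity(senior_rides))
```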
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
2624 | 1 | 0.4% |
2630 | 1 | 0.4% |
2629 | 1 | 0.4% |
2628 | 1 | 0.4% |
2627 | 1 | 0.4% |
2626 | 1 | 0.4% |
2625 | 1 | 0.4% |
2623 | 1 | 0.4% |
2614 | 1 | 0.4% |
Other values (265) | 265 | 96.4% |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
151 | 1 | 0.4% |
152 | 1 | 0.4% |
153 | 1 | 0.4% |
154 | 1 | 0.4% |
155 | 1 | 0.4% |
156 | 1 | 0.4% |
157 | 1 | 0.4% |
158 | 1 | 0.4% |
159 | 1 | 0.4% |
Value | Count | Frequency (%) |
2827 | 1 | 0.4% |
2826 | 1 | 0.4% |
2825 | 1 | 0.4% |
2824 | 1 | 0.4% |
2823 | 1 | 0.4% |
2822 | 1 | 0.4% |
2821 | 1 | 0.4% |
2820 | 1 | 0.4% |
2819 | 1 | 0.4% |
2818 | 1 | 0.4% |
역명
Text
Distinct | 242 |
---|---|
Distinct (%) | 88.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Value | Count | Frequency (%) |
종로3가 | 3 | 1.1% |
동대문역사문화공원 | 3 | 1.1% |
천호(풍납토성 | 2 | 0.7% |
사당 | 2 | 0.7% |
서울역 | 2 | 0.7% |
영등포구청 | 2 | 0.7% |
대림(구로구청 | 2 | 0.7% |
불광 | 2 | 0.7% |
약수 | 2 | 0.7% |
오금 | 2 | 0.7% |
Other values (232) | 253 | 92.0% |
Most occurring characters
Value | Count | Frequency (%) |
) | 58 | 4.9% |
( | 58 | 4.9% |
구 | 50 | 4.2% |
대 | 49 | 4.1% |
동 | 35 | 3.0% |
청 | 31 | 2.6% |
신 | 25 | 2.1% |
원 | 22 | 1.9% |
산 | 20 | 1.7% |
문 | 20 | 1.7% |
Other values (226) | 818 | 69.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1059 | 89.3% |
Close Punctuation | 58 | 4.9% |
Open Punctuation | 58 | 4.9% |
Decimal Number | 8 | 0.7% |
Other Punctuation | 3 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 50 | 4.7% |
대 | 49 | 4.6% |
동 | 35 | 3.3% |
청 | 31 | 2.9% |
신 | 25 | 2.4% |
원 | 22 | 2.1% |
산 | 20 | 1.9% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
로 | 16 | 1.5% |
Other values (220) | 772 | 72.9% |
Decimal Number
Value | Count | Frequency (%) |
3 | 5 | 62.5% |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 58 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 58 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1059 | 89.3% |
Common | 127 | 10.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 50 | 4.7% |
대 | 49 | 4.6% |
동 | 35 | 3.3% |
청 | 31 | 2.9% |
신 | 25 | 2.4% |
원 | 22 | 2.1% |
산 | 20 | 1.9% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
로 | 16 | 1.5% |
Other values (220) | 772 | 72.9% |
Common
Value | Count | Frequency (%) |
) | 58 | 45.7% |
( | 58 | 45.7% |
3 | 5 | 3.9% |
. | 3 | 2.4% |
4 | 2 | 1.6% |
5 | 1 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1059 | 89.3% |
ASCII | 127 | 10.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
) | 58 | 45.7% |
( | 58 | 45.7% |
3 | 5 | 3.9% |
. | 3 | 2.4% |
4 | 2 | 1.6% |
5 | 1 | 0.8% |
Hangul
Value | Count | Frequency (%) |
구 | 50 | 4.7% |
대 | 49 | 4.6% |
동 | 35 | 3.3% |
청 | 31 | 2.9% |
신 | 25 | 2.4% |
원 | 22 | 2.1% |
산 | 20 | 1.9% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
로 | 16 | 1.5% |
Other values (220) | 772 | 72.9% |
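The category names in the 역명 tables (Other Letter, Decimal Number, Open/Close Punctuation, Other Punctuation) correspond to Unicode general categories. As a rough illustration, the sketch below counts those categories with Python's standard `unicodedata` module for three station names taken from the sample rows of this report. It is not the profiler's own implementation, and the Hangul test based on the character name is a simplification.

```python
import unicodedata
from collections import Counter

# Three station names taken from the sample rows of this report.
names = ["종로3가", "청량리(서울시립대입구)", "남한산성입구(성남법원.검찰청)"]

# Normalize to NFC so each Hangul syllable is a single code point.
text = "".join(unicodedata.normalize("NFC", name) for name in names)

# Unicode general category per character:
# Lo = Other Letter, Nd = Decimal Number, Ps/Pe = Open/Close Punctuation,
# Po = Other Punctuation.
category_counts = Counter(unicodedata.category(ch) for ch in text)

# Simplified script check: Hangul characters have names starting with "HANGUL".
hangul_count = sum(1 for ch in text if unicodedata.name(ch, "").startswith("HANGUL"))

print(dict(category_counts))
print(hangul_count, len(text))
```

Run over the full 역명 column, the same counting would be expected to yield totals like those in the tables above (1059 letters, 58 of each parenthesis, 8 digits, 3 periods).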
경로
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 758058.96 |
Minimum | 59099 |
---|---|
Maximum | 3685909 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 59099 |
---|---|
5-th percentile | 170230.5 |
Q1 | 403043.5 |
median | 614074 |
Q3 | 903142 |
95-th percentile | 1877775.4 |
Maximum | 3685909 |
Range | 3626810 |
Interquartile range (IQR) | 500098.5 |
Descriptive statistics
Standard deviation | 561303.92 |
---|---|
Coefficient of variation (CV) | 0.74044889 |
Kurtosis | 5.9306483 |
Mean | 758058.96 |
Median Absolute Deviation (MAD) | 242886 |
Skewness | 2.052748 |
Sum | 2.0846621 × 10⁸ |
Variance | 3.1506209 × 10¹¹ |
Monotonicity | Not monotonic |
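The large Sum and Variance values for 경로 are easier to trust once they are tied back to the smaller figures in the same table: the sum is the mean times the number of observations, the variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. A short consistency check, using only numbers reported above (variable names are illustrative):

```python
import math

n = 275                      # number of observations
mean = 758058.96             # reported Mean
std = 561303.92              # reported Standard deviation
reported_sum = 2.0846621e8   # reported Sum (8 order of magnitude)
reported_var = 3.1506209e11  # reported Variance (11 order of magnitude)
reported_cv = 0.74044889     # reported Coefficient of variation

# Each derived quantity should match the reported one up to rounding.
sum_ok = math.isclose(mean * n, reported_sum, rel_tol=1e-6)
var_ok = math.isclose(std ** 2, reported_var, rel_tol=1e-6)
cv_ok = math.isclose(std / mean, reported_cv, rel_tol=1e-6)

print(sum_ok, var_ok, cv_ok)
```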
Value | Count | Frequency (%) |
1964968 | 1 | 0.4% |
226810 | 1 | 0.4% |
203900 | 1 | 0.4% |
204962 | 1 | 0.4% |
407342 | 1 | 0.4% |
696311 | 1 | 0.4% |
436190 | 1 | 0.4% |
597453 | 1 | 0.4% |
380586 | 1 | 0.4% |
204684 | 1 | 0.4% |
Other values (265) | 265 | 96.4% |
Value | Count | Frequency (%) |
59099 | 1 | 0.4% |
76792 | 1 | 0.4% |
95126 | 1 | 0.4% |
98957 | 1 | 0.4% |
99160 | 1 | 0.4% |
102532 | 1 | 0.4% |
115179 | 1 | 0.4% |
130249 | 1 | 0.4% |
142450 | 1 | 0.4% |
149737 | 1 | 0.4% |
Value | Count | Frequency (%) |
3685909 | 1 | 0.4% |
3413844 | 1 | 0.4% |
3322146 | 1 | 0.4% |
2718723 | 1 | 0.4% |
2601023 | 1 | 0.4% |
2323720 | 1 | 0.4% |
2253205 | 1 | 0.4% |
2179579 | 1 | 0.4% |
2118338 | 1 | 0.4% |
2038213 | 1 | 0.4% |
장애
Real number (ℝ)
HIGH CORRELATION, UNIQUE
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 168744.23 |
Minimum | 14972 |
---|---|
Maximum | 644969 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 14972 |
---|---|
5-th percentile | 40506.7 |
Q1 | 90059.5 |
median | 138428 |
Q3 | 211662.5 |
95-th percentile | 398415 |
Maximum | 644969 |
Range | 629997 |
Interquartile range (IQR) | 121603 |
Descriptive statistics
Standard deviation | 116848.37 |
---|---|
Coefficient of variation (CV) | 0.69245847 |
Kurtosis | 2.7726028 |
Mean | 168744.23 |
Median Absolute Deviation (MAD) | 56141 |
Skewness | 1.5469953 |
Sum | 46404664 |
Variance | 1.3653542 × 10¹⁰ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
644969 | 1 | 0.4% |
48461 | 1 | 0.4% |
42244 | 1 | 0.4% |
41466 | 1 | 0.4% |
84188 | 1 | 0.4% |
152411 | 1 | 0.4% |
88985 | 1 | 0.4% |
92619 | 1 | 0.4% |
93413 | 1 | 0.4% |
39295 | 1 | 0.4% |
Other values (265) | 265 | 96.4% |
Value | Count | Frequency (%) |
14972 | 1 | 0.4% |
18275 | 1 | 0.4% |
19797 | 1 | 0.4% |
20430 | 1 | 0.4% |
20732 | 1 | 0.4% |
21206 | 1 | 0.4% |
24292 | 1 | 0.4% |
26310 | 1 | 0.4% |
27716 | 1 | 0.4% |
33170 | 1 | 0.4% |
Value | Count | Frequency (%) |
644969 | 1 | 0.4% |
633436 | 1 | 0.4% |
590176 | 1 | 0.4% |
576792 | 1 | 0.4% |
569897 | 1 | 0.4% |
531625 | 1 | 0.4% |
502462 | 1 | 0.4% |
481082 | 1 | 0.4% |
453907 | 1 | 0.4% |
443097 | 1 | 0.4% |
유공자
Real number (ℝ)
HIGH CORRELATION
Distinct | 274 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12287.451 |
Minimum | 482 |
---|---|
Maximum | 99725 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 482 |
---|---|
5-th percentile | 2495.8 |
Q1 | 5479.5 |
median | 9461 |
Q3 | 15347.5 |
95-th percentile | 31705 |
Maximum | 99725 |
Range | 99243 |
Interquartile range (IQR) | 9868 |
Descriptive statistics
Standard deviation | 11461.637 |
---|---|
Coefficient of variation (CV) | 0.93279211 |
Kurtosis | 17.773397 |
Mean | 12287.451 |
Median Absolute Deviation (MAD) | 4674 |
Skewness | 3.3801995 |
Sum | 3379049 |
Variance | 1.3136913 × 10⁸ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7219 | 2 | 0.7% |
50650 | 1 | 0.4% |
5802 | 1 | 0.4% |
9134 | 1 | 0.4% |
4879 | 1 | 0.4% |
12198 | 1 | 0.4% |
6223 | 1 | 0.4% |
6125 | 1 | 0.4% |
3950 | 1 | 0.4% |
7938 | 1 | 0.4% |
Other values (264) | 264 | 96.0% |
Value | Count | Frequency (%) |
482 | 1 | 0.4% |
743 | 1 | 0.4% |
772 | 1 | 0.4% |
1166 | 1 | 0.4% |
1342 | 1 | 0.4% |
1389 | 1 | 0.4% |
1431 | 1 | 0.4% |
1462 | 1 | 0.4% |
1723 | 1 | 0.4% |
1742 | 1 | 0.4% |
Value | Count | Frequency (%) |
99725 | 1 | 0.4% |
78821 | 1 | 0.4% |
65557 | 1 | 0.4% |
55513 | 1 | 0.4% |
50650 | 1 | 0.4% |
40047 | 1 | 0.4% |
38620 | 1 | 0.4% |
37250 | 1 | 0.4% |
35426 | 1 | 0.4% |
34994 | 1 | 0.4% |
Variable | 연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.918 | 0.921 | 0.528 | 0.473 | 0.409 |
호선 | 0.918 | 1.000 | 0.996 | 0.520 | 0.444 | 0.583 |
역번호 | 0.921 | 0.996 | 1.000 | 0.417 | 0.376 | 0.466 |
경로 | 0.528 | 0.520 | 0.417 | 1.000 | 0.926 | 0.762 |
장애 | 0.473 | 0.444 | 0.376 | 0.926 | 1.000 | 0.790 |
유공자 | 0.409 | 0.583 | 0.466 | 0.762 | 0.790 | 1.000 |
Variable | 연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.988 | 1.000 | -0.430 | -0.369 | -0.389 |
호선 | 0.988 | 1.000 | 0.988 | -0.402 | -0.338 | -0.359 |
역번호 | 1.000 | 0.988 | 1.000 | -0.430 | -0.369 | -0.389 |
경로 | -0.430 | -0.402 | -0.430 | 1.000 | 0.937 | 0.907 |
장애 | -0.369 | -0.338 | -0.369 | 0.937 | 1.000 | 0.910 |
유공자 | -0.389 | -0.359 | -0.389 | 0.907 | 0.910 | 1.000 |
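The report does not label which correlation measure each matrix uses. The exact 1.000 between 연번 and 역번호 in the second matrix is what a rank-based measure such as Spearman's coefficient produces for two strictly increasing columns, so the sketch below assumes that interpretation. It implements Spearman's coefficient in plain Python (average ranks followed by Pearson correlation) and applies it to the first ten rows of the dataset. The helper names are illustrative.

```python
import math

def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

# First ten rows of the dataset (Line 1 stations).
seq = list(range(1, 11))          # 연번
station = list(range(150, 160))   # 역번호
senior = [1964968, 1077904, 1488014, 3685909, 2718723,
          1309184, 1172936, 3322146, 3413844, 1364429]   # 경로
disabled = [644969, 248367, 364349, 633436, 443097,
            298448, 264811, 429647, 576792, 305029]      # 장애

rho_ids = spearman(seq, station)       # two strictly increasing columns
rho_rides = spearman(senior, disabled)
print(round(rho_ids, 3), round(rho_rides, 3))
```

Ten rows are far too few to reproduce the matrix entries, which are computed over all 275 stations; the point is only that any two strictly increasing columns have identical ranks and therefore a rank correlation of 1.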
Index | 연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|---|
0 | 1 | 1 | 150 | 서울역 | 1964968 | 644969 | 50650 |
1 | 2 | 1 | 151 | 시청 | 1077904 | 248367 | 22747 |
2 | 3 | 1 | 152 | 종각 | 1488014 | 364349 | 30246 |
3 | 4 | 1 | 153 | 종로3가 | 3685909 | 633436 | 65557 |
4 | 5 | 1 | 154 | 종로5가 | 2718723 | 443097 | 35426 |
5 | 6 | 1 | 155 | 동대문 | 1309184 | 298448 | 16730 |
6 | 7 | 1 | 156 | 신설동 | 1172936 | 264811 | 16064 |
7 | 8 | 1 | 157 | 제기동 | 3322146 | 429647 | 28539 |
8 | 9 | 1 | 158 | 청량리(서울시립대입구) | 3413844 | 576792 | 40047 |
9 | 10 | 1 | 159 | 동묘앞 | 1364429 | 305029 | 22619 |
Index | 연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|---|
265 | 266 | 8 | 2818 | 가락시장 | 523206 | 97689 | 6451 |
266 | 267 | 8 | 2819 | 문정 | 399719 | 108811 | 5697 |
267 | 268 | 8 | 2820 | 장지 | 585989 | 174058 | 11439 |
268 | 269 | 8 | 2821 | 복정 | 417628 | 101586 | 9005 |
269 | 270 | 8 | 2822 | 산성 | 337061 | 84633 | 3882 |
270 | 271 | 8 | 2823 | 남한산성입구(성남법원.검찰청) | 704978 | 170164 | 9592 |
271 | 272 | 8 | 2824 | 단대오거리 | 615751 | 179511 | 5573 |
272 | 273 | 8 | 2825 | 신흥 | 409355 | 107219 | 4119 |
273 | 274 | 8 | 2826 | 수진 | 415415 | 107873 | 2810 |
274 | 275 | 8 | 2827 | 모란 | 505181 | 111855 | 5311 |