Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 272 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 16.6 KiB |
Average record size in memory | 62.5 B |
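The derived figures in this table can be checked from the raw counts. The sketch below assumes (the report does not state this) that the average record size is the total in-memory size divided by the number of observations, and that the missing-cell percentage is taken over rows × variables. The function names are illustrative only.

```python
def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Average bytes per row, given a total in-memory size in KiB."""
    return total_kib * 1024 / n_rows


def missing_cells_pct(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells over all rows x columns, in percent."""
    return 100 * n_missing / (n_rows * n_cols)


# Figures taken from the table above: 16.6 KiB, 272 observations, 7 variables.
avg_bytes = average_record_size_bytes(16.6, 272)
missing_pct = missing_cells_pct(0, 272, 7)
print(f"{avg_bytes:.1f} B")    # 62.5 B
print(f"{missing_pct:.1f}%")   # 0.0%
```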
Variable types
Numeric | 6 |
---|---|
Text | 1 |
Dataset
Description | File download |
---|---|
Author | 서울교통공사 (Seoul Metro) |
URL | https://data.seoul.go.kr/dataList/OA-21720/F/1/datasetView.do |
연번 is highly overall correlated with 호선 and 1 other fields | High correlation |
호선 is highly overall correlated with 연번 and 1 other fields | High correlation |
역번호 is highly overall correlated with 연번 and 1 other fields | High correlation |
경로 is highly overall correlated with 장애 and 1 other fields | High correlation |
장애 is highly overall correlated with 경로 and 1 other fields | High correlation |
유공자 is highly overall correlated with 경로 and 1 other fields | High correlation |
연번 has unique values | Unique |
역번호 has unique values | Unique |
경로 has unique values | Unique |
장애 has unique values | Unique |
Reproduction
Analysis started | 2024-04-29 21:09:48.932490 |
---|---|
Analysis finished | 2024-04-29 21:09:53.840773 |
Duration | 4.91 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
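A report like this one could be regenerated roughly as follows. This is a sketch under assumptions, not the exact command used here: the CSV path, the `cp949` encoding, the report title, and the `build_profile` helper are all hypothetical, and the default configuration stands in for the `config.json` referenced above.

```python
from pathlib import Path


def build_profile(csv_path: str, html_path: str = "report.html",
                  encoding: str = "cp949") -> Path:
    """Profile a CSV file and write the HTML report (hypothetical helper)."""
    # Third-party imports are local so the sketch can be loaded
    # even where pandas / ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)  # encoding is an assumption
    report = ProfileReport(
        df,
        title="Dataset profile",  # assumed title
        dataset={
            "description": "파일 다운로드",
            "author": "서울교통공사",
            "url": "https://data.seoul.go.kr/dataList/OA-21720/F/1/datasetView.do",
        },
    )
    report.to_file(html_path)
    return Path(html_path)
```

Calling `build_profile("stations.csv")` (a placeholder file name) would write `report.html` next to the script.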
연번 (serial number)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 272 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 136.5 |
Minimum | 1 |
---|---|
Maximum | 272 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 14.55 |
Q1 | 68.75 |
median | 136.5 |
Q3 | 204.25 |
95-th percentile | 258.45 |
Maximum | 272 |
Range | 271 |
Interquartile range (IQR) | 135.5 |
Descriptive statistics
Standard deviation | 78.663842 |
---|---|
Coefficient of variation (CV) | 0.57629188 |
Kurtosis | -1.2 |
Mean | 136.5 |
Median Absolute Deviation (MAD) | 68 |
Skewness | 0 |
Sum | 37128 |
Variance | 6188 |
Monotonicity | Strictly increasing |
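Because 연번 is simply the integers 1 to 272, the statistics above can be reproduced with the standard library. The sketch assumes percentiles use linear interpolation between closest ranks and that the variance uses the sample (n − 1) denominator; both assumptions are consistent with the reported values. The `percentile` helper is illustrative.

```python
import statistics

values = list(range(1, 273))  # 연번 runs 1..272 with no gaps


def percentile(sorted_values, q):
    """q-th percentile (0-100) by linear interpolation between closest ranks."""
    pos = (len(sorted_values) - 1) * q / 100
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    step = sorted_values[upper] - sorted_values[lower]
    return sorted_values[lower] + (pos - lower) * step


mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
mad = statistics.median(abs(v - median) for v in values)

print(mean, median, variance, mad)       # 136.5 136.5 6188 68.0
print(f"{std:.6f} {cv:.8f}")
print(percentile(values, 25), percentile(values, 75))  # 68.75 204.25
```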
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
181 | 1 | 0.4% |
187 | 1 | 0.4% |
186 | 1 | 0.4% |
185 | 1 | 0.4% |
184 | 1 | 0.4% |
183 | 1 | 0.4% |
182 | 1 | 0.4% |
180 | 1 | 0.4% |
138 | 1 | 0.4% |
Other values (262) | 262 | 96.3% |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
2 | 1 | 0.4% |
3 | 1 | 0.4% |
4 | 1 | 0.4% |
5 | 1 | 0.4% |
6 | 1 | 0.4% |
7 | 1 | 0.4% |
8 | 1 | 0.4% |
9 | 1 | 0.4% |
10 | 1 | 0.4% |
Value | Count | Frequency (%) |
272 | 1 | 0.4% |
271 | 1 | 0.4% |
270 | 1 | 0.4% |
269 | 1 | 0.4% |
268 | 1 | 0.4% |
267 | 1 | 0.4% |
266 | 1 | 0.4% |
265 | 1 | 0.4% |
264 | 1 | 0.4% |
263 | 1 | 0.4% |
호선 (line number)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.6066176 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 6 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.008201 |
---|---|
Coefficient of variation (CV) | 0.43593828 |
Kurtosis | -1.1478026 |
Mean | 4.6066176 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.052624328 |
Sum | 1253 |
Variance | 4.0328712 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
5 | 56 | 20.6% |
2 | 50 | 18.4% |
7 | 42 | 15.4% |
6 | 37 | 13.6% |
3 | 33 | 12.1% |
4 | 26 | 9.6% |
8 | 18 | 6.6% |
1 | 10 | 3.7% |
Value | Count | Frequency (%) |
1 | 10 | 3.7% |
2 | 50 | 18.4% |
3 | 33 | 12.1% |
4 | 26 | 9.6% |
5 | 56 | 20.6% |
6 | 37 | 13.6% |
7 | 42 | 15.4% |
8 | 18 | 6.6% |
Value | Count | Frequency (%) |
8 | 18 | 6.6% |
7 | 42 | 15.4% |
6 | 37 | 13.6% |
5 | 56 | 20.6% |
4 | 26 | 9.6% |
3 | 33 | 12.1% |
2 | 50 | 18.4% |
1 | 10 | 3.7% |
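Since 호선 has only eight distinct values, its summary statistics can be recomputed directly from the value counts listed in this section. A minimal sketch, assuming the sample (n − 1) variance; the variable names are illustrative.

```python
from math import sqrt

# Value -> count, copied from the 호선 frequency table.
counts = {1: 10, 2: 50, 3: 33, 4: 26, 5: 56, 6: 37, 7: 42, 8: 18}

n = sum(counts.values())                              # number of observations
total = sum(value * c for value, c in counts.items())  # the "Sum" row
mean = total / n
squared_dev = sum(c * (value - mean) ** 2 for value, c in counts.items())
variance = squared_dev / (n - 1)                      # sample variance
std = sqrt(variance)
shares = {value: round(100 * c / n, 1) for value, c in counts.items()}

print(n, total)                 # 272 1253
print(f"{mean:.7f}")            # 4.6066176
print(shares[8], shares[1])     # 6.6 3.7
```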
역번호 (station number)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 272 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1615.6654 |
Minimum | 150 |
---|---|
Maximum | 2828 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 204.55 |
Q1 | 316.75 |
median | 2527.5 |
Q3 | 2640.25 |
95-th percentile | 2814.45 |
Maximum | 2828 |
Range | 2678 |
Interquartile range (IQR) | 2323.5 |
Descriptive statistics
Standard deviation | 1174.9919 |
---|---|
Coefficient of variation (CV) | 0.7272495 |
Kurtosis | -1.9259226 |
Mean | 1615.6654 |
Median Absolute Deviation (MAD) | 284 |
Skewness | -0.24809565 |
Sum | 439461 |
Variance | 1380605.9 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
2617 | 1 | 0.4% |
2623 | 1 | 0.4% |
2622 | 1 | 0.4% |
2621 | 1 | 0.4% |
2620 | 1 | 0.4% |
2619 | 1 | 0.4% |
2618 | 1 | 0.4% |
2616 | 1 | 0.4% |
2529 | 1 | 0.4% |
Other values (262) | 262 | 96.3% |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
151 | 1 | 0.4% |
152 | 1 | 0.4% |
153 | 1 | 0.4% |
154 | 1 | 0.4% |
155 | 1 | 0.4% |
156 | 1 | 0.4% |
157 | 1 | 0.4% |
158 | 1 | 0.4% |
159 | 1 | 0.4% |
Value | Count | Frequency (%) |
2828 | 1 | 0.4% |
2827 | 1 | 0.4% |
2826 | 1 | 0.4% |
2825 | 1 | 0.4% |
2824 | 1 | 0.4% |
2823 | 1 | 0.4% |
2822 | 1 | 0.4% |
2821 | 1 | 0.4% |
2820 | 1 | 0.4% |
2819 | 1 | 0.4% |
역명 (station name)
Text
Distinct | 239 |
---|---|
Distinct (%) | 87.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Value | Count | Frequency (%) |
종로3가 | 3 | 1.1% |
동대문역사문화공원(ddp | 3 | 1.1% |
동대문 | 2 | 0.7% |
잠실(송파구청 | 2 | 0.7% |
불광 | 2 | 0.7% |
고속터미널 | 2 | 0.7% |
가락시장 | 2 | 0.7% |
충정로(경기대입구 | 2 | 0.7% |
사당 | 2 | 0.7% |
영등포구청 | 2 | 0.7% |
Other values (229) | 250 | 91.9% |
Most occurring characters
Value | Count | Frequency (%) |
) | 64 | 5.3% |
( | 64 | 5.3% |
대 | 49 | 4.1% |
구 | 49 | 4.1% |
동 | 32 | 2.6% |
청 | 30 | 2.5% |
신 | 25 | 2.1% |
원 | 23 | 1.9% |
산 | 21 | 1.7% |
문 | 20 | 1.7% |
Other values (230) | 831 | 68.8% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1059 | 87.7% |
Close Punctuation | 64 | 5.3% |
Open Punctuation | 64 | 5.3% |
Uppercase Letter | 9 | 0.7% |
Decimal Number | 8 | 0.7% |
Other Punctuation | 4 | 0.3% |
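The category names in this table correspond to Unicode general categories (`Lo` Other Letter, `Ps` Open Punctuation, `Pe` Close Punctuation, `Po` Other Punctuation, `Lu` Uppercase Letter, `Nd` Decimal Number). The sketch below shows how one station name from the sample rows breaks down, assuming the report classifies characters this way; it uses only Python's standard `unicodedata` module, and the NFC normalization step is a precaution added here.

```python
import unicodedata
from collections import Counter

# One station name taken from the sample rows of this dataset.
name = unicodedata.normalize("NFC", "남한산성입구(성남법원.검찰청)")

# Count characters by Unicode general category code.
by_category = Counter(unicodedata.category(ch) for ch in name)

for code, count in sorted(by_category.items()):
    print(code, count)
# Lo 13
# Pe 1
# Po 1
# Ps 1
```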
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 49 | 4.6% |
구 | 49 | 4.6% |
동 | 32 | 3.0% |
청 | 30 | 2.8% |
신 | 25 | 2.4% |
원 | 23 | 2.2% |
산 | 21 | 2.0% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
성 | 16 | 1.5% |
Other values (221) | 775 | 73.2% |
Decimal Number
Value | Count | Frequency (%) |
3 | 5 | 62.5% |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Uppercase Letter
Value | Count | Frequency (%) |
D | 6 | 66.7% |
P | 3 | 33.3% |
Other Punctuation
Value | Count | Frequency (%) |
. | 3 | 75.0% |
· | 1 | 25.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 64 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 64 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1059 | 87.7% |
Common | 140 | 11.6% |
Latin | 9 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 49 | 4.6% |
구 | 49 | 4.6% |
동 | 32 | 3.0% |
청 | 30 | 2.8% |
신 | 25 | 2.4% |
원 | 23 | 2.2% |
산 | 21 | 2.0% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
성 | 16 | 1.5% |
Other values (221) | 775 | 73.2% |
Common
Value | Count | Frequency (%) |
) | 64 | 45.7% |
( | 64 | 45.7% |
3 | 5 | 3.6% |
. | 3 | 2.1% |
4 | 2 | 1.4% |
· | 1 | 0.7% |
5 | 1 | 0.7% |
Latin
Value | Count | Frequency (%) |
D | 6 | 66.7% |
P | 3 | 33.3% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1059 | 87.7% |
ASCII | 148 | 12.3% |
None | 1 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
) | 64 | 43.2% |
( | 64 | 43.2% |
D | 6 | 4.1% |
3 | 5 | 3.4% |
P | 3 | 2.0% |
. | 3 | 2.0% |
4 | 2 | 1.4% |
5 | 1 | 0.7% |
Hangul
Value | Count | Frequency (%) |
대 | 49 | 4.6% |
구 | 49 | 4.6% |
동 | 32 | 3.0% |
청 | 30 | 2.8% |
신 | 25 | 2.4% |
원 | 23 | 2.2% |
산 | 21 | 2.0% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
성 | 16 | 1.5% |
Other values (221) | 775 | 73.2% |
None
Value | Count | Frequency (%) |
· | 1 | 100.0% |
경로 (senior citizens)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 272 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 722962.08 |
Minimum | 78165 |
---|---|
Maximum | 2917042 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 78165 |
---|---|
5-th percentile | 170359.45 |
Q1 | 393744 |
median | 614096.5 |
Q3 | 887249 |
95-th percentile | 1738779.9 |
Maximum | 2917042 |
Range | 2838877 |
Interquartile range (IQR) | 493505 |
Descriptive statistics
Standard deviation | 484291.62 |
---|---|
Coefficient of variation (CV) | 0.6698714 |
Kurtosis | 3.6474604 |
Mean | 722962.08 |
Median Absolute Deviation (MAD) | 243161.5 |
Skewness | 1.6678809 |
Sum | 1.9664569 × 10⁸ |
Variance | 2.3453837 × 10¹¹ |
Monotonicity | Not monotonic |
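Several rows in the tables above are derived from others: the range from minimum and maximum, the IQR from the quartiles, the coefficient of variation from standard deviation and mean, and the variance from the standard deviation. A small sketch that recomputes them from the reported 경로 figures as a consistency check; the dictionary keys are illustrative names, not fields of the report.

```python
from math import isclose

# Reported 경로 statistics, copied from the tables above.
reported = {
    "min": 78165, "max": 2917042,
    "q1": 393744, "q3": 887249,
    "mean": 722962.08, "std": 484291.62,
}

value_range = reported["max"] - reported["min"]
iqr = reported["q3"] - reported["q1"]
cv = reported["std"] / reported["mean"]   # coefficient of variation
variance = reported["std"] ** 2

print(value_range, iqr)                   # 2838877 493505
# Compare against the reported CV and variance, allowing for rounding.
print(isclose(cv, 0.6698714, rel_tol=1e-6))
print(isclose(variance, 2.3453837e11, rel_tol=1e-6))
```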
Value | Count | Frequency (%) |
1738809 | 1 | 0.4% |
809758 | 1 | 0.4% |
370357 | 1 | 0.4% |
753942 | 1 | 0.4% |
685350 | 1 | 0.4% |
574072 | 1 | 0.4% |
525246 | 1 | 0.4% |
663267 | 1 | 0.4% |
475460 | 1 | 0.4% |
583812 | 1 | 0.4% |
Other values (262) | 262 | 96.3% |
Value | Count | Frequency (%) |
78165 | 1 | 0.4% |
78620 | 1 | 0.4% |
85221 | 1 | 0.4% |
91964 | 1 | 0.4% |
99557 | 1 | 0.4% |
100763 | 1 | 0.4% |
113536 | 1 | 0.4% |
128031 | 1 | 0.4% |
132712 | 1 | 0.4% |
141188 | 1 | 0.4% |
Value | Count | Frequency (%) |
2917042 | 1 | 0.4% |
2764406 | 1 | 0.4% |
2518370 | 1 | 0.4% |
2514663 | 1 | 0.4% |
2452955 | 1 | 0.4% |
2038522 | 1 | 0.4% |
1943248 | 1 | 0.4% |
1875656 | 1 | 0.4% |
1874012 | 1 | 0.4% |
1831750 | 1 | 0.4% |
장애 (persons with disabilities)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 272 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 124179.76 |
Minimum | 10928 |
---|---|
Maximum | 418217 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 10928 |
---|---|
5-th percentile | 30762.45 |
Q1 | 68148.5 |
median | 105213.5 |
Q3 | 162098.5 |
95-th percentile | 293163.25 |
Maximum | 418217 |
Range | 407289 |
Interquartile range (IQR) | 93950 |
Descriptive statistics
Standard deviation | 81646.755 |
---|---|
Coefficient of variation (CV) | 0.6574884 |
Kurtosis | 1.6760117 |
Mean | 124179.76 |
Median Absolute Deviation (MAD) | 40917 |
Skewness | 1.3123065 |
Sum | 33776896 |
Variance | 6.6661926 × 10⁹ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
407971 | 1 | 0.4% |
136081 | 1 | 0.4% |
70012 | 1 | 0.4% |
123015 | 1 | 0.4% |
173537 | 1 | 0.4% |
76509 | 1 | 0.4% |
96439 | 1 | 0.4% |
121567 | 1 | 0.4% |
81207 | 1 | 0.4% |
84316 | 1 | 0.4% |
Other values (262) | 262 | 96.3% |
Value | Count | Frequency (%) |
10928 | 1 | 0.4% |
11173 | 1 | 0.4% |
12542 | 1 | 0.4% |
13197 | 1 | 0.4% |
15785 | 1 | 0.4% |
17120 | 1 | 0.4% |
17851 | 1 | 0.4% |
19195 | 1 | 0.4% |
24803 | 1 | 0.4% |
25847 | 1 | 0.4% |
Value | Count | Frequency (%) |
418217 | 1 | 0.4% |
407971 | 1 | 0.4% |
402867 | 1 | 0.4% |
393181 | 1 | 0.4% |
358409 | 1 | 0.4% |
357505 | 1 | 0.4% |
354410 | 1 | 0.4% |
347635 | 1 | 0.4% |
333570 | 1 | 0.4% |
324073 | 1 | 0.4% |
유공자 (persons of national merit)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 270 |
---|---|
Distinct (%) | 99.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8112.7463 |
Minimum | 293 |
---|---|
Maximum | 40317 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 293 |
---|---|
5-th percentile | 1678.35 |
Q1 | 3990 |
median | 6587 |
Q3 | 10055 |
95-th percentile | 20029.45 |
Maximum | 40317 |
Range | 40024 |
Interquartile range (IQR) | 6065 |
Descriptive statistics
Standard deviation | 5855.5773 |
---|---|
Coefficient of variation (CV) | 0.72177498 |
Kurtosis | 4.5937158 |
Mean | 8112.7463 |
Median Absolute Deviation (MAD) | 3119.5 |
Skewness | 1.7751886 |
Sum | 2206667 |
Variance | 34287786 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
7086 | 2 | 0.7% |
2302 | 2 | 0.7% |
6295 | 1 | 0.4% |
4344 | 1 | 0.4% |
5738 | 1 | 0.4% |
6460 | 1 | 0.4% |
5665 | 1 | 0.4% |
8096 | 1 | 0.4% |
6009 | 1 | 0.4% |
31156 | 1 | 0.4% |
Other values (260) | 260 | 95.6% |
Value | Count | Frequency (%) |
293 | 1 | 0.4% |
546 | 1 | 0.4% |
1110 | 1 | 0.4% |
1190 | 1 | 0.4% |
1422 | 1 | 0.4% |
1448 | 1 | 0.4% |
1483 | 1 | 0.4% |
1484 | 1 | 0.4% |
1506 | 1 | 0.4% |
1527 | 1 | 0.4% |
Value | Count | Frequency (%) |
40317 | 1 | 0.4% |
31156 | 1 | 0.4% |
27822 | 1 | 0.4% |
27374 | 1 | 0.4% |
25307 | 1 | 0.4% |
25068 | 1 | 0.4% |
24996 | 1 | 0.4% |
24118 | 1 | 0.4% |
22195 | 1 | 0.4% |
21660 | 1 | 0.4% |
 | 연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.918 | 0.916 | 0.376 | 0.423 | 0.373 |
호선 | 0.918 | 1.000 | 0.994 | 0.467 | 0.401 | 0.458 |
역번호 | 0.916 | 0.994 | 1.000 | 0.326 | 0.348 | 0.487 |
경로 | 0.376 | 0.467 | 0.326 | 1.000 | 0.792 | 0.927 |
장애 | 0.423 | 0.401 | 0.348 | 0.792 | 1.000 | 0.798 |
유공자 | 0.373 | 0.458 | 0.487 | 0.927 | 0.798 | 1.000 |
 | 연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.988 | 1.000 | -0.356 | -0.329 | -0.367 |
호선 | 0.988 | 1.000 | 0.988 | -0.335 | -0.299 | -0.349 |
역번호 | 1.000 | 0.988 | 1.000 | -0.356 | -0.329 | -0.367 |
경로 | -0.356 | -0.335 | -0.356 | 1.000 | 0.939 | 0.908 |
장애 | -0.329 | -0.299 | -0.329 | 0.939 | 1.000 | 0.895 |
유공자 | -0.367 | -0.349 | -0.367 | 0.908 | 0.895 | 1.000 |
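The report does not label the two correlation matrices. The second one shows 1.000 between 연번 and 역번호, which is what a rank-based coefficient such as Spearman's would give for two strictly increasing columns; treating it as Spearman's is an assumption. The sketch below computes a rank correlation on the first ten sample rows only, so its 경로/장애 value is illustrative and will not match the full-data matrix. The helper functions are hypothetical and assume no tied values.

```python
from math import sqrt


def ranks(values):
    """1-based ranks in ascending order (assumes no ties)."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# First ten sample rows of the dataset.
serial = list(range(1, 11))            # 연번
station_no = list(range(150, 160))     # 역번호
senior = [1738809, 814078, 1283102, 2917042, 2514663,
          1140455, 1127890, 2764406, 2518370, 1453014]   # 경로
disabled = [407971, 161447, 242896, 418217, 324073,
            195115, 185365, 316144, 358409, 255207]      # 장애

rho_index = spearman(serial, station_no)   # both strictly increasing
rho_riders = spearman(senior, disabled)
print(round(rho_index, 3), round(rho_riders, 3))
```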
 | 연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|---|
0 | 1 | 1 | 150 | 서울역 | 1738809 | 407971 | 31156 |
1 | 2 | 1 | 151 | 시청 | 814078 | 161447 | 13011 |
2 | 3 | 1 | 152 | 종각 | 1283102 | 242896 | 18474 |
3 | 4 | 1 | 153 | 종로3가 | 2917042 | 418217 | 40317 |
4 | 5 | 1 | 154 | 종로5가 | 2514663 | 324073 | 25307 |
5 | 6 | 1 | 155 | 동대문 | 1140455 | 195115 | 9997 |
6 | 7 | 1 | 156 | 신설동 | 1127890 | 185365 | 10236 |
7 | 8 | 1 | 157 | 제기동 | 2764406 | 316144 | 20239 |
8 | 9 | 1 | 158 | 청량리(서울시립대입구) | 2518370 | 358409 | 24996 |
9 | 10 | 1 | 159 | 동묘앞 | 1453014 | 255207 | 16830 |
 | 연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|---|
262 | 263 | 8 | 2819 | 문정 | 545724 | 102071 | 5529 |
263 | 264 | 8 | 2820 | 장지 | 722479 | 144652 | 7879 |
264 | 265 | 8 | 2821 | 복정 | 358605 | 74148 | 5455 |
265 | 266 | 8 | 2822 | 산성 | 302164 | 44001 | 3369 |
266 | 267 | 8 | 2823 | 남한산성입구(성남법원.검찰청) | 727019 | 143399 | 6614 |
267 | 268 | 8 | 2824 | 단대오거리 | 653658 | 142018 | 4509 |
268 | 269 | 8 | 2825 | 신흥 | 338571 | 71345 | 2470 |
269 | 270 | 8 | 2826 | 수진 | 411929 | 84605 | 3006 |
270 | 271 | 8 | 2827 | 모란 | 512846 | 91099 | 4627 |
271 | 272 | 8 | 2828 | 남위례 | 244866 | 35756 | 2815 |