Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 277 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 16.9 KiB |
Average record size in memory | 62.5 B |
Variable types
Numeric | 6 |
---|---|
Text | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울교통공사 |
URL | https://data.seoul.go.kr/dataList/OA-21720/F/1/datasetView.do |
연번 is highly overall correlated with 호선 and 1 other fields | High correlation |
호선 is highly overall correlated with 연번 and 1 other fields | High correlation |
역번호 is highly overall correlated with 연번 and 1 other fields | High correlation |
경로 is highly overall correlated with 장애 and 1 other fields | High correlation |
장애 is highly overall correlated with 경로 and 1 other fields | High correlation |
유공자 is highly overall correlated with 경로 and 1 other fields | High correlation |
연번 has unique values | Unique |
역번호 has unique values | Unique |
경로 has unique values | Unique |
Reproduction
Analysis started | 2024-04-29 21:10:03.513896 |
---|---|
Analysis finished | 2024-04-29 21:10:06.852248 |
Duration | 3.34 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 277 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 139 |
Minimum | 1 |
---|---|
Maximum | 277 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 14.8 |
Q1 | 70 |
median | 139 |
Q3 | 208 |
95-th percentile | 263.2 |
Maximum | 277 |
Range | 276 |
Interquartile range (IQR) | 138 |
Descriptive statistics
Standard deviation | 80.10722 |
---|---|
Coefficient of variation (CV) | 0.57631093 |
Kurtosis | -1.2 |
Mean | 139 |
Median Absolute Deviation (MAD) | 69 |
Skewness | 0 |
Sum | 38503 |
Variance | 6417.1667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
184 | 1 | 0.4% |
190 | 1 | 0.4% |
189 | 1 | 0.4% |
188 | 1 | 0.4% |
187 | 1 | 0.4% |
186 | 1 | 0.4% |
185 | 1 | 0.4% |
183 | 1 | 0.4% |
175 | 1 | 0.4% |
Other values (267) | 267 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
277 | 1 | |
276 | 1 | |
275 | 1 | |
274 | 1 | |
273 | 1 | |
272 | 1 | |
271 | 1 | |
270 | 1 | |
269 | 1 | |
268 | 1 |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.66787 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 6 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.0247061 |
---|---|
Coefficient of variation (CV) | 0.43375375 |
Kurtosis | -1.1986103 |
Mean | 4.66787 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.10486387 |
Sum | 1293 |
Variance | 4.0994349 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
5 | 53 | |
7 | 51 | |
2 | 50 | |
6 | 37 | |
3 | 33 | |
4 | 26 | |
8 | 17 | 6.1% |
1 | 10 | 3.6% |
Value | Count | Frequency (%) |
1 | 10 | 3.6% |
2 | 50 | |
3 | 33 | |
4 | 26 | |
5 | 53 | |
6 | 37 | |
7 | 51 | |
8 | 17 | 6.1% |
Value | Count | Frequency (%) |
8 | 17 | 6.1% |
7 | 51 | |
6 | 37 | |
5 | 53 | |
4 | 26 | |
3 | 33 | |
2 | 50 | |
1 | 10 | 3.6% |
역번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 277 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1638.0975 |
Minimum | 150 |
---|---|
Maximum | 2827 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 204.8 |
Q1 | 318 |
median | 2530 |
Q3 | 2647 |
95-th percentile | 2813.2 |
Maximum | 2827 |
Range | 2677 |
Interquartile range (IQR) | 2329 |
Descriptive statistics
Standard deviation | 1175.7807 |
---|---|
Coefficient of variation (CV) | 0.7177721 |
Kurtosis | -1.9085471 |
Mean | 1638.0975 |
Median Absolute Deviation (MAD) | 229 |
Skewness | -0.28044538 |
Sum | 453753 |
Variance | 1382460.2 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
2623 | 1 | 0.4% |
2629 | 1 | 0.4% |
2628 | 1 | 0.4% |
2627 | 1 | 0.4% |
2626 | 1 | 0.4% |
2625 | 1 | 0.4% |
2624 | 1 | 0.4% |
2622 | 1 | 0.4% |
2613 | 1 | 0.4% |
Other values (267) | 267 |
Value | Count | Frequency (%) |
150 | 1 | |
151 | 1 | |
152 | 1 | |
153 | 1 | |
154 | 1 | |
155 | 1 | |
156 | 1 | |
157 | 1 | |
158 | 1 | |
159 | 1 |
Value | Count | Frequency (%) |
2827 | 1 | |
2826 | 1 | |
2825 | 1 | |
2824 | 1 | |
2823 | 1 | |
2822 | 1 | |
2821 | 1 | |
2820 | 1 | |
2819 | 1 | |
2818 | 1 |
역명
Text
Distinct | 244 |
---|---|
Distinct (%) | 88.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Value | Count | Frequency (%) |
종로3가 | 3 | 1.1% |
동대문역사문화공원 | 3 | 1.1% |
공덕 | 2 | 0.7% |
사당 | 2 | 0.7% |
서울역 | 2 | 0.7% |
영등포구청 | 2 | 0.7% |
대림 | 2 | 0.7% |
불광 | 2 | 0.7% |
약수 | 2 | 0.7% |
오금 | 2 | 0.7% |
Other values (234) | 255 |
Most occurring characters
Value | Count | Frequency (%) |
대 | 32 | 3.9% |
구 | 29 | 3.6% |
동 | 25 | 3.1% |
신 | 23 | 2.8% |
산 | 19 | 2.3% |
지 | 15 | 1.8% |
청 | 15 | 1.8% |
문 | 15 | 1.8% |
원 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (200) | 613 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 806 | |
Decimal Number | 8 | 1.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 32 | 4.0% |
구 | 29 | 3.6% |
동 | 25 | 3.1% |
신 | 23 | 2.9% |
산 | 19 | 2.4% |
지 | 15 | 1.9% |
청 | 15 | 1.9% |
문 | 15 | 1.9% |
원 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (197) | 605 |
Decimal Number
Value | Count | Frequency (%) |
3 | 5 | |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 806 | |
Common | 8 | 1.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 32 | 4.0% |
구 | 29 | 3.6% |
동 | 25 | 3.1% |
신 | 23 | 2.9% |
산 | 19 | 2.4% |
지 | 15 | 1.9% |
청 | 15 | 1.9% |
문 | 15 | 1.9% |
원 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (197) | 605 |
Common
Value | Count | Frequency (%) |
3 | 5 | |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 806 | |
ASCII | 8 | 1.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 32 | 4.0% |
구 | 29 | 3.6% |
동 | 25 | 3.1% |
신 | 23 | 2.9% |
산 | 19 | 2.4% |
지 | 15 | 1.9% |
청 | 15 | 1.9% |
문 | 15 | 1.9% |
원 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (197) | 605 |
ASCII
Value | Count | Frequency (%) |
3 | 5 | |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
경로
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 277 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 577810.46 |
Minimum | 53396 |
---|---|
Maximum | 2554908 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 53396 |
---|---|
5-th percentile | 133383.8 |
Q1 | 312152 |
median | 479804 |
Q3 | 712902 |
95-th percentile | 1373997.2 |
Maximum | 2554908 |
Range | 2501512 |
Interquartile range (IQR) | 400750 |
Descriptive statistics
Standard deviation | 411310.15 |
---|---|
Coefficient of variation (CV) | 0.71184268 |
Kurtosis | 5.0874206 |
Mean | 577810.46 |
Median Absolute Deviation (MAD) | 192276 |
Skewness | 1.9187913 |
Sum | 1.600535 × 108 |
Variance | 1.6917604 × 1011 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1391862 | 1 | 0.4% |
288738 | 1 | 0.4% |
117741 | 1 | 0.4% |
307429 | 1 | 0.4% |
580354 | 1 | 0.4% |
347906 | 1 | 0.4% |
366647 | 1 | 0.4% |
170527 | 1 | 0.4% |
661533 | 1 | 0.4% |
324281 | 1 | 0.4% |
Other values (267) | 267 |
Value | Count | Frequency (%) |
53396 | 1 | |
68596 | 1 | |
68843 | 1 | |
73809 | 1 | |
75087 | 1 | |
79204 | 1 | |
84989 | 1 | |
98360 | 1 | |
110331 | 1 | |
116090 | 1 |
Value | Count | Frequency (%) |
2554908 | 1 | |
2419969 | 1 | |
2395667 | 1 | |
2183302 | 1 | |
2103893 | 1 | |
1758463 | 1 | |
1692153 | 1 | |
1540968 | 1 | |
1517381 | 1 | |
1472638 | 1 |
장애
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 276 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 120793.84 |
Minimum | 10974 |
---|---|
Maximum | 434534 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 10974 |
---|---|
5-th percentile | 28320.2 |
Q1 | 65410 |
median | 99777 |
Q3 | 155429 |
95-th percentile | 302903.4 |
Maximum | 434534 |
Range | 423560 |
Interquartile range (IQR) | 90019 |
Descriptive statistics
Standard deviation | 82820.67 |
---|---|
Coefficient of variation (CV) | 0.68563653 |
Kurtosis | 2.1981431 |
Mean | 120793.84 |
Median Absolute Deviation (MAD) | 39317 |
Skewness | 1.4428554 |
Sum | 33459894 |
Variance | 6.8592633 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
101136 | 2 | 0.7% |
422653 | 1 | 0.4% |
68147 | 1 | 0.4% |
29508 | 1 | 0.4% |
63851 | 1 | 0.4% |
113608 | 1 | 0.4% |
63224 | 1 | 0.4% |
56161 | 1 | 0.4% |
28113 | 1 | 0.4% |
124025 | 1 | 0.4% |
Other values (266) | 266 |
Value | Count | Frequency (%) |
10974 | 1 | |
12170 | 1 | |
14720 | 1 | |
15999 | 1 | |
16589 | 1 | |
17293 | 1 | |
17448 | 1 | |
17986 | 1 | |
22601 | 1 | |
24354 | 1 |
Value | Count | Frequency (%) |
434534 | 1 | |
422653 | 1 | |
420447 | 1 | |
398776 | 1 | |
396489 | 1 | |
367591 | 1 | |
352770 | 1 | |
335899 | 1 | |
321039 | 1 | |
320507 | 1 |
유공자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 275 |
---|---|
Distinct (%) | 99.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7858.2491 |
Minimum | 330 |
---|---|
Maximum | 37296 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 330 |
---|---|
5-th percentile | 1516.2 |
Q1 | 3687 |
median | 6263 |
Q3 | 10044 |
95-th percentile | 21334.8 |
Maximum | 37296 |
Range | 36966 |
Interquartile range (IQR) | 6357 |
Descriptive statistics
Standard deviation | 5977.6394 |
---|---|
Coefficient of variation (CV) | 0.76068337 |
Kurtosis | 3.8943776 |
Mean | 7858.2491 |
Median Absolute Deviation (MAD) | 3082 |
Skewness | 1.7614692 |
Sum | 2176735 |
Variance | 35732173 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2274 | 2 | 0.7% |
3941 | 2 | 0.7% |
29653 | 1 | 0.4% |
2575 | 1 | 0.4% |
1493 | 1 | 0.4% |
5500 | 1 | 0.4% |
3596 | 1 | 0.4% |
8890 | 1 | 0.4% |
5260 | 1 | 0.4% |
5421 | 1 | 0.4% |
Other values (265) | 265 |
Value | Count | Frequency (%) |
330 | 1 | |
501 | 1 | |
918 | 1 | |
939 | 1 | |
953 | 1 | |
959 | 1 | |
1123 | 1 | |
1136 | 1 | |
1171 | 1 | |
1396 | 1 |
Value | Count | Frequency (%) |
37296 | 1 | |
32211 | 1 | |
29653 | 1 | |
27650 | 1 | |
25864 | 1 | |
25573 | 1 | |
24163 | 1 | |
23890 | 1 | |
23502 | 1 | |
23430 | 1 |
연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.920 | 0.924 | 0.386 | 0.490 | 0.554 |
호선 | 0.920 | 1.000 | 0.995 | 0.497 | 0.466 | 0.470 |
역번호 | 0.924 | 0.995 | 1.000 | 0.359 | 0.390 | 0.411 |
경로 | 0.386 | 0.497 | 0.359 | 1.000 | 0.827 | 0.808 |
장애 | 0.490 | 0.466 | 0.390 | 0.827 | 1.000 | 0.877 |
유공자 | 0.554 | 0.470 | 0.411 | 0.808 | 0.877 | 1.000 |
연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.988 | 1.000 | -0.377 | -0.332 | -0.380 |
호선 | 0.988 | 1.000 | 0.988 | -0.347 | -0.298 | -0.351 |
역번호 | 1.000 | 0.988 | 1.000 | -0.377 | -0.332 | -0.380 |
경로 | -0.377 | -0.347 | -0.377 | 1.000 | 0.941 | 0.905 |
장애 | -0.332 | -0.298 | -0.332 | 0.941 | 1.000 | 0.900 |
유공자 | -0.380 | -0.351 | -0.380 | 0.905 | 0.900 | 1.000 |
연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|---|
0 | 1 | 1 | 150 | 서울역 | 1391862 | 422653 | 29653 |
1 | 2 | 1 | 151 | 시청 | 592974 | 145889 | 11317 |
2 | 3 | 1 | 152 | 종각 | 1062297 | 253125 | 18286 |
3 | 4 | 1 | 153 | 종로3가 | 2554908 | 434534 | 37296 |
4 | 5 | 1 | 154 | 종로5가 | 2183302 | 318621 | 23890 |
5 | 6 | 1 | 155 | 동대문 | 994364 | 210921 | 10462 |
6 | 7 | 1 | 156 | 신설동 | 957366 | 192788 | 11000 |
7 | 8 | 1 | 157 | 제기동 | 2419969 | 321039 | 18679 |
8 | 9 | 1 | 158 | 청량리 | 2395667 | 396489 | 25864 |
9 | 10 | 1 | 159 | 동묘앞 | 1225852 | 246929 | 15832 |
연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 | |
---|---|---|---|---|---|---|---|
267 | 268 | 8 | 2818 | 가락시장 | 513081 | 81645 | 6311 |
268 | 269 | 8 | 2819 | 문정 | 431388 | 99777 | 6904 |
269 | 270 | 8 | 2820 | 장지 | 543185 | 135437 | 9700 |
270 | 271 | 8 | 2821 | 복정 | 332606 | 75902 | 5628 |
271 | 272 | 8 | 2822 | 산성 | 285000 | 60460 | 3115 |
272 | 273 | 8 | 2823 | 남한산성입구 | 600218 | 136411 | 7592 |
273 | 274 | 8 | 2824 | 단대오거리 | 505520 | 131365 | 3339 |
274 | 275 | 8 | 2825 | 신흥 | 310474 | 77568 | 3021 |
275 | 276 | 8 | 2826 | 수진 | 335197 | 80489 | 2758 |
276 | 277 | 8 | 2827 | 모란 | 415945 | 85813 | 4157 |