Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 275 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 16.8 KiB |
Average record size in memory | 62.5 B |
Variable types
Numeric | 6 |
---|---|
Text | 1 |
Dataset
Description | File download (파일 다운로드) |
---|---|
Author | Seoul Metro (서울교통공사) |
URL | https://data.seoul.go.kr/dataList/OA-21720/F/1/datasetView.do |
연번 is highly overall correlated with 호선 and 1 other fields | High correlation |
호선 is highly overall correlated with 연번 and 1 other fields | High correlation |
역번호 is highly overall correlated with 연번 and 1 other fields | High correlation |
경로 is highly overall correlated with 장애 and 1 other fields | High correlation |
장애 is highly overall correlated with 경로 and 1 other fields | High correlation |
유공자 is highly overall correlated with 경로 and 1 other fields | High correlation |
연번 has unique values | Unique |
역번호 has unique values | Unique |
경로 has unique values | Unique |
장애 has unique values | Unique |
Reproduction
Analysis started | 2024-04-29 21:10:16.593120 |
---|---|
Analysis finished | 2024-04-29 21:10:19.848933 |
Duration | 3.26 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
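A report like this one can presumably be regenerated along the following lines. This is a minimal sketch, not the exact invocation used here: the CSV path, the output file name, the report title, and the `cp949` encoding are all assumptions (the source only records the software version and a `config.json`), and `build_report` is a hypothetical helper name.

```python
from pathlib import Path


def build_report(csv_path, out_path="report.html", encoding="cp949"):
    """Profile a CSV file and write an HTML report (hypothetical helper).

    The imports are local so that defining the function does not require
    pandas or ydata-profiling to be installed.
    """
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the downloaded file is a CSV in a Korean legacy encoding.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Default settings; the original run may have used a custom config.json.
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(out_path)
    return Path(out_path)
```

Calling `build_report("stations.csv")` (file name hypothetical) would write `report.html` next to the script. Timings and memory figures will differ from the ones listed above.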
연번 (serial number)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 138 |
Minimum | 1 |
---|---|
Maximum | 275 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 14.7 |
Q1 | 69.5 |
median | 138 |
Q3 | 206.5 |
95-th percentile | 261.3 |
Maximum | 275 |
Range | 274 |
Interquartile range (IQR) | 137 |
Descriptive statistics
Standard deviation | 79.529869 |
---|---|
Coefficient of variation (CV) | 0.5763034 |
Kurtosis | -1.2 |
Mean | 138 |
Median Absolute Deviation (MAD) | 69 |
Skewness | 0 |
Sum | 37950 |
Variance | 6325 |
Monotonicity | Strictly increasing |
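The figures above are what one would expect if 연번 is simply the sequence 1 to 275. The sketch below recomputes them with the standard library under that assumption. It also assumes linear-interpolation percentiles, the sample (n - 1) variance, and the bias-corrected sample excess kurtosis; the report does not state which estimators it uses.

```python
import math
import statistics

# Assumption: 연번 is the plain sequence 1..275 (275 unique, strictly
# increasing values with minimum 1 and maximum 275).
values = list(range(1, 276))
n = len(values)


def percentile(sorted_values, q):
    """Linear-interpolation percentile, q in [0, 1], on pre-sorted data."""
    pos = q * (len(sorted_values) - 1)
    lo, hi = math.floor(pos), math.ceil(pos)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])


mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)          # sample variance, n - 1
std = math.sqrt(variance)
mad = statistics.median([abs(x - median) for x in values])

# Bias-corrected sample excess kurtosis (assumed estimator).
dev = [x - mean for x in values]
m2 = sum(d ** 2 for d in dev) / n
m4 = sum(d ** 4 for d in dev) / n
g2 = m4 / m2 ** 2 - 3
kurtosis = (n - 1) / ((n - 2) * (n - 3)) * ((n + 1) * g2 + 6)

summary = {
    "sum": sum(values),
    "mean": mean,
    "median": median,
    "variance": variance,
    "std": std,
    "cv": std / mean,
    "mad": mad,
    "kurtosis": kurtosis,
    "p05": percentile(values, 0.05),
    "q1": percentile(values, 0.25),
    "q3": percentile(values, 0.75),
    "p95": percentile(values, 0.95),
}
```

Under these assumptions every entry in `summary` agrees with the table to the precision shown, which suggests the column is a row counter rather than a measured quantity.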
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
183 | 1 | 0.4% |
189 | 1 | 0.4% |
188 | 1 | 0.4% |
187 | 1 | 0.4% |
186 | 1 | 0.4% |
185 | 1 | 0.4% |
184 | 1 | 0.4% |
182 | 1 | 0.4% |
174 | 1 | 0.4% |
Other values (265) | 265 | 96.4% |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
2 | 1 | 0.4% |
3 | 1 | 0.4% |
4 | 1 | 0.4% |
5 | 1 | 0.4% |
6 | 1 | 0.4% |
7 | 1 | 0.4% |
8 | 1 | 0.4% |
9 | 1 | 0.4% |
10 | 1 | 0.4% |
Value | Count | Frequency (%) |
275 | 1 | 0.4% |
274 | 1 | 0.4% |
273 | 1 | 0.4% |
272 | 1 | 0.4% |
271 | 1 | 0.4% |
270 | 1 | 0.4% |
269 | 1 | 0.4% |
268 | 1 | 0.4% |
267 | 1 | 0.4% |
266 | 1 | 0.4% |
호선 (line number)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.6654545 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 6 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.0318826 |
---|---|
Coefficient of variation (CV) | 0.43551653 |
Kurtosis | -1.2116408 |
Mean | 4.6654545 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.10095916 |
Sum | 1283 |
Variance | 4.1285468 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
5 | 51 | 18.5% |
7 | 51 | 18.5% |
2 | 50 | 18.2% |
6 | 37 | 13.5% |
3 | 33 | 12.0% |
4 | 26 | 9.5% |
8 | 17 | 6.2% |
1 | 10 | 3.6% |
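Because 호선 takes only eight values, its summary statistics can be rebuilt from the frequency table alone. The sketch below does that with the counts listed above, assuming the report uses the sample (n - 1) variance; the variable names are illustrative.

```python
import math

# Value -> count, copied from the frequency table above.
counts = {5: 51, 7: 51, 2: 50, 6: 37, 3: 33, 4: 26, 8: 17, 1: 10}

n = sum(counts.values())                           # number of observations
total = sum(v * c for v, c in counts.items())      # the "Sum" row
mean = total / n

# Sample variance (n - 1 denominator), weighted by the counts.
ss = sum(c * (v - mean) ** 2 for v, c in counts.items())
variance = ss / (n - 1)
std = math.sqrt(variance)
cv = std / mean

# Share of rows per line, as a percentage rounded to one decimal.
freq_pct = {v: round(100 * c / n, 1) for v, c in counts.items()}
```

The recomputed `n`, `total`, `mean`, `variance`, `std`, and `cv` line up with the 호선 tables, and `freq_pct` gives the per-line shares, for example 6.2 for line 8 and 3.6 for line 1.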
Value | Count | Frequency (%) |
1 | 10 | 3.6% |
2 | 50 | 18.2% |
3 | 33 | 12.0% |
4 | 26 | 9.5% |
5 | 51 | 18.5% |
6 | 37 | 13.5% |
7 | 51 | 18.5% |
8 | 17 | 6.2% |
Value | Count | Frequency (%) |
8 | 17 | 6.2% |
7 | 51 | 18.5% |
6 | 37 | 13.5% |
5 | 51 | 18.5% |
4 | 26 | 9.5% |
3 | 33 | 12.0% |
2 | 50 | 18.2% |
1 | 10 | 3.6% |
역번호 (station number)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1631.3673 |
Minimum | 150 |
---|---|
Maximum | 2827 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 204.7 |
Q1 | 317.5 |
median | 2529 |
Q3 | 2647.5 |
95-th percentile | 2813.3 |
Maximum | 2827 |
Range | 2677 |
Interquartile range (IQR) | 2330 |
Descriptive statistics
Standard deviation | 1177.3932 |
---|---|
Coefficient of variation (CV) | 0.72172173 |
Kurtosis | -1.9158526 |
Mean | 1631.3673 |
Median Absolute Deviation (MAD) | 231 |
Skewness | -0.26764389 |
Sum | 448626 |
Variance | 1386254.8 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
2624 | 1 | 0.4% |
2630 | 1 | 0.4% |
2629 | 1 | 0.4% |
2628 | 1 | 0.4% |
2627 | 1 | 0.4% |
2626 | 1 | 0.4% |
2625 | 1 | 0.4% |
2623 | 1 | 0.4% |
2614 | 1 | 0.4% |
Other values (265) | 265 | 96.4% |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
151 | 1 | 0.4% |
152 | 1 | 0.4% |
153 | 1 | 0.4% |
154 | 1 | 0.4% |
155 | 1 | 0.4% |
156 | 1 | 0.4% |
157 | 1 | 0.4% |
158 | 1 | 0.4% |
159 | 1 | 0.4% |
Value | Count | Frequency (%) |
2827 | 1 | 0.4% |
2826 | 1 | 0.4% |
2825 | 1 | 0.4% |
2824 | 1 | 0.4% |
2823 | 1 | 0.4% |
2822 | 1 | 0.4% |
2821 | 1 | 0.4% |
2820 | 1 | 0.4% |
2819 | 1 | 0.4% |
2818 | 1 | 0.4% |
역명 (station name)
Text
Distinct | 242 |
---|---|
Distinct (%) | 88.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.3 KiB |
Value | Count | Frequency (%) |
종로3가 | 3 | 1.1% |
동대문역사문화공원 | 3 | 1.1% |
천호(풍납토성 | 2 | 0.7% |
사당 | 2 | 0.7% |
서울역 | 2 | 0.7% |
영등포구청 | 2 | 0.7% |
대림(구로구청 | 2 | 0.7% |
불광 | 2 | 0.7% |
약수 | 2 | 0.7% |
오금 | 2 | 0.7% |
Other values (232) | 253 | 92.0% |
Most occurring characters
Value | Count | Frequency (%) |
) | 58 | 4.9% |
( | 58 | 4.9% |
구 | 50 | 4.2% |
대 | 49 | 4.1% |
동 | 35 | 3.0% |
청 | 31 | 2.6% |
신 | 25 | 2.1% |
원 | 22 | 1.9% |
산 | 20 | 1.7% |
문 | 20 | 1.7% |
Other values (226) | 818 | 69.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1059 | 89.3% |
Close Punctuation | 58 | 4.9% |
Open Punctuation | 58 | 4.9% |
Decimal Number | 8 | 0.7% |
Other Punctuation | 3 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 50 | 4.7% |
대 | 49 | 4.6% |
동 | 35 | 3.3% |
청 | 31 | 2.9% |
신 | 25 | 2.4% |
원 | 22 | 2.1% |
산 | 20 | 1.9% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
로 | 16 | 1.5% |
Other values (220) | 772 | 72.9% |
Decimal Number
Value | Count | Frequency (%) |
3 | 5 | 62.5% |
4 | 2 | 25.0% |
5 | 1 | 12.5% |
Close Punctuation
Value | Count | Frequency (%) |
) | 58 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 58 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1059 | 89.3% |
Common | 127 | 10.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 50 | 4.7% |
대 | 49 | 4.6% |
동 | 35 | 3.3% |
청 | 31 | 2.9% |
신 | 25 | 2.4% |
원 | 22 | 2.1% |
산 | 20 | 1.9% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
로 | 16 | 1.5% |
Other values (220) | 772 | 72.9% |
Common
Value | Count | Frequency (%) |
) | 58 | 45.7% |
( | 58 | 45.7% |
3 | 5 | 3.9% |
. | 3 | 2.4% |
4 | 2 | 1.6% |
5 | 1 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1059 | 89.3% |
ASCII | 127 | 10.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
) | 58 | 45.7% |
( | 58 | 45.7% |
3 | 5 | 3.9% |
. | 3 | 2.4% |
4 | 2 | 1.6% |
5 | 1 | 0.8% |
Hangul
Value | Count | Frequency (%) |
구 | 50 | 4.7% |
대 | 49 | 4.6% |
동 | 35 | 3.3% |
청 | 31 | 2.9% |
신 | 25 | 2.4% |
원 | 22 | 2.1% |
산 | 20 | 1.9% |
문 | 20 | 1.9% |
입 | 19 | 1.8% |
로 | 16 | 1.5% |
Other values (220) | 772 | 72.9% |
경로 (senior citizens)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 773975.29 |
Minimum | 65798 |
---|---|
Maximum | 3755786 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 65798 |
---|---|
5-th percentile | 171192.8 |
Q1 | 410687.5 |
median | 634172 |
Q3 | 923288 |
95-th percentile | 1954591.6 |
Maximum | 3755786 |
Range | 3689988 |
Interquartile range (IQR) | 512600.5 |
Descriptive statistics
Standard deviation | 567302.74 |
---|---|
Coefficient of variation (CV) | 0.73297267 |
Kurtosis | 6.046397 |
Mean | 773975.29 |
Median Absolute Deviation (MAD) | 252127 |
Skewness | 2.0618372 |
Sum | 2.128432 × 10^8 |
Variance | 3.218324 × 10^11 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1960487 | 1 | 0.4% |
226030 | 1 | 0.4% |
199957 | 1 | 0.4% |
173408 | 1 | 0.4% |
406270 | 1 | 0.4% |
715849 | 1 | 0.4% |
470491 | 1 | 0.4% |
598495 | 1 | 0.4% |
403206 | 1 | 0.4% |
210025 | 1 | 0.4% |
Other values (265) | 265 | 96.4% |
Value | Count | Frequency (%) |
65798 | 1 | 0.4% |
80851 | 1 | 0.4% |
92290 | 1 | 0.4% |
101117 | 1 | 0.4% |
104786 | 1 | 0.4% |
106490 | 1 | 0.4% |
109386 | 1 | 0.4% |
139474 | 1 | 0.4% |
140856 | 1 | 0.4% |
156907 | 1 | 0.4% |
Value | Count | Frequency (%) |
3755786 | 1 | 0.4% |
3532050 | 1 | 0.4% |
3282323 | 1 | 0.4% |
2760605 | 1 | 0.4% |
2685403 | 1 | 0.4% |
2289648 | 1 | 0.4% |
2250342 | 1 | 0.4% |
2181299 | 1 | 0.4% |
2120884 | 1 | 0.4% |
2072221 | 1 | 0.4% |
장애 (persons with disabilities)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 275 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 163517.83 |
Minimum | 14106 |
---|---|
Maximum | 606567 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 14106 |
---|---|
5-th percentile | 38192.6 |
Q1 | 88937.5 |
median | 135983 |
Q3 | 206040 |
95-th percentile | 389509.2 |
Maximum | 606567 |
Range | 592461 |
Interquartile range (IQR) | 117102.5 |
Descriptive statistics
Standard deviation | 111324.95 |
---|---|
Coefficient of variation (CV) | 0.68081229 |
Kurtosis | 2.4290181 |
Mean | 163517.83 |
Median Absolute Deviation (MAD) | 55813 |
Skewness | 1.4822155 |
Sum | 44967403 |
Variance | 1.2393244 × 10^10 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
593788 | 1 | 0.4% |
42791 | 1 | 0.4% |
40848 | 1 | 0.4% |
39367 | 1 | 0.4% |
79630 | 1 | 0.4% |
149225 | 1 | 0.4% |
91942 | 1 | 0.4% |
85971 | 1 | 0.4% |
96253 | 1 | 0.4% |
37090 | 1 | 0.4% |
Other values (265) | 265 | 96.4% |
Value | Count | Frequency (%) |
14106 | 1 | 0.4% |
19657 | 1 | 0.4% |
20103 | 1 | 0.4% |
20674 | 1 | 0.4% |
20834 | 1 | 0.4% |
21117 | 1 | 0.4% |
23048 | 1 | 0.4% |
23184 | 1 | 0.4% |
25956 | 1 | 0.4% |
32149 | 1 | 0.4% |
Value | Count | Frequency (%) |
606567 | 1 | 0.4% |
593788 | 1 | 0.4% |
555845 | 1 | 0.4% |
540005 | 1 | 0.4% |
516051 | 1 | 0.4% |
495089 | 1 | 0.4% |
487012 | 1 | 0.4% |
473527 | 1 | 0.4% |
441996 | 1 | 0.4% |
439436 | 1 | 0.4% |
유공자 (persons of national merit)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 273 |
---|---|
Distinct (%) | 99.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11770.062 |
Minimum | 468 |
---|---|
Maximum | 91219 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.5 KiB |
Quantile statistics
Minimum | 468 |
---|---|
5-th percentile | 2268.7 |
Q1 | 5172 |
median | 9229 |
Q3 | 14415 |
95-th percentile | 30667.1 |
Maximum | 91219 |
Range | 90751 |
Interquartile range (IQR) | 9243 |
Descriptive statistics
Standard deviation | 10835.421 |
---|---|
Coefficient of variation (CV) | 0.9205917 |
Kurtosis | 16.658963 |
Mean | 11770.062 |
Median Absolute Deviation (MAD) | 4485 |
Skewness | 3.297367 |
Sum | 3236767 |
Variance | 1.1740635 × 10^8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11878 | 2 | 0.7% |
4651 | 2 | 0.7% |
47379 | 1 | 0.4% |
8192 | 1 | 0.4% |
4474 | 1 | 0.4% |
11754 | 1 | 0.4% |
5596 | 1 | 0.4% |
8234 | 1 | 0.4% |
3789 | 1 | 0.4% |
5680 | 1 | 0.4% |
Other values (263) | 263 | 95.6% |
Value | Count | Frequency (%) |
468 | 1 | 0.4% |
572 | 1 | 0.4% |
867 | 1 | 0.4% |
1312 | 1 | 0.4% |
1353 | 1 | 0.4% |
1380 | 1 | 0.4% |
1567 | 1 | 0.4% |
1689 | 1 | 0.4% |
1705 | 1 | 0.4% |
1777 | 1 | 0.4% |
Value | Count | Frequency (%) |
91219 | 1 | 0.4% |
75843 | 1 | 0.4% |
64739 | 1 | 0.4% |
53347 | 1 | 0.4% |
47379 | 1 | 0.4% |
37395 | 1 | 0.4% |
36932 | 1 | 0.4% |
33811 | 1 | 0.4% |
33659 | 1 | 0.4% |
33176 | 1 | 0.4% |
Variable | 연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.918 | 0.921 | 0.456 | 0.462 | 0.332 |
호선 | 0.918 | 1.000 | 0.996 | 0.508 | 0.439 | 0.402 |
역번호 | 0.921 | 0.996 | 1.000 | 0.386 | 0.371 | 0.304 |
경로 | 0.456 | 0.508 | 0.386 | 1.000 | 0.927 | 0.793 |
장애 | 0.462 | 0.439 | 0.371 | 0.927 | 1.000 | 0.763 |
유공자 | 0.332 | 0.402 | 0.304 | 0.793 | 0.763 | 1.000 |
Variable | 연번 | 호선 | 역번호 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|
연번 | 1.000 | 0.988 | 1.000 | -0.411 | -0.360 | -0.377 |
호선 | 0.988 | 1.000 | 0.988 | -0.385 | -0.328 | -0.349 |
역번호 | 1.000 | 0.988 | 1.000 | -0.411 | -0.360 | -0.377 |
경로 | -0.411 | -0.385 | -0.411 | 1.000 | 0.937 | 0.904 |
장애 | -0.360 | -0.328 | -0.360 | 0.937 | 1.000 | 0.901 |
유공자 | -0.377 | -0.349 | -0.377 | 0.904 | 0.901 | 1.000 |
Index | 연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|---|
0 | 1 | 1 | 150 | 서울역 | 1960487 | 593788 | 47379 |
1 | 2 | 1 | 151 | 시청 | 953448 | 226423 | 19220 |
2 | 3 | 1 | 152 | 종각 | 1498456 | 350490 | 27428 |
3 | 4 | 1 | 153 | 종로3가 | 3755786 | 606567 | 64739 |
4 | 5 | 1 | 154 | 종로5가 | 2760605 | 428839 | 33811 |
5 | 6 | 1 | 155 | 동대문 | 1303227 | 286572 | 15091 |
6 | 7 | 1 | 156 | 신설동 | 1182020 | 252949 | 16538 |
7 | 8 | 1 | 157 | 제기동 | 3532050 | 439436 | 29509 |
8 | 9 | 1 | 158 | 청량리(서울시립대입구) | 3282323 | 540005 | 36932 |
9 | 10 | 1 | 159 | 동묘앞 | 1402166 | 296813 | 20281 |
Index | 연번 | 호선 | 역번호 | 역명 | 경로 | 장애 | 유공자 |
---|---|---|---|---|---|---|---|
265 | 266 | 8 | 2818 | 가락시장 | 554845 | 96312 | 6484 |
266 | 267 | 8 | 2819 | 문정 | 474996 | 117728 | 6929 |
267 | 268 | 8 | 2820 | 장지 | 666465 | 180066 | 13625 |
268 | 269 | 8 | 2821 | 복정 | 460119 | 104492 | 10126 |
269 | 270 | 8 | 2822 | 산성 | 319624 | 78603 | 3787 |
270 | 271 | 8 | 2823 | 남한산성입구(성남법원.검찰청) | 723855 | 170239 | 9155 |
271 | 272 | 8 | 2824 | 단대오거리 | 614993 | 172036 | 4541 |
272 | 273 | 8 | 2825 | 신흥 | 402273 | 99515 | 3560 |
273 | 274 | 8 | 2826 | 수진 | 445481 | 108561 | 2701 |
274 | 275 | 8 | 2827 | 모란 | 518937 | 111089 | 5243 |
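The twenty sample rows above are enough to sanity-check the signed correlation matrix. Its value of 1.000 between 연번 and 역번호 is what a rank-based coefficient gives for two strictly increasing columns. The sketch below computes a Spearman rank correlation on the sample rows only. It is an assumption that the matrix is Spearman, since this extract does not label the matrices, and twenty rows will not reproduce the full-data values.

```python
import math

# (연번, 역번호, 경로, 장애) for the first and last ten rows shown above.
rows = [
    (1, 150, 1960487, 593788), (2, 151, 953448, 226423),
    (3, 152, 1498456, 350490), (4, 153, 3755786, 606567),
    (5, 154, 2760605, 428839), (6, 155, 1303227, 286572),
    (7, 156, 1182020, 252949), (8, 157, 3532050, 439436),
    (9, 158, 3282323, 540005), (10, 159, 1402166, 296813),
    (266, 2818, 554845, 96312), (267, 2819, 474996, 117728),
    (268, 2820, 666465, 180066), (269, 2821, 460119, 104492),
    (270, 2822, 319624, 78603), (271, 2823, 723855, 170239),
    (272, 2824, 614993, 172036), (273, 2825, 402273, 99515),
    (274, 2826, 445481, 108561), (275, 2827, 518937, 111089),
]


def ranks(xs):
    """Rank values from 1..n; assumes no ties, which holds for these rows."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    result = [0] * len(xs)
    for rank, i in enumerate(order, start=1):
        result[i] = rank
    return result


def pearson(xs, ys):
    """Plain Pearson correlation coefficient."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


seq, station, senior, disabled = (list(col) for col in zip(*rows))
rho_seq_station = spearman(seq, station)
rho_senior_disabled = spearman(senior, disabled)
```

On these rows `rho_seq_station` is 1 up to floating-point error, and `rho_senior_disabled` is strongly positive, in line with the 0.937 reported for the full data.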