Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 285 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 12.4 KiB |
Average record size in memory | 44.5 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Dataset
Description | 서울교통공사의 일평균 승차순위에 대한 데이터 입니다. 이 데이터는 승차순위, 호선, 역번호, 역명, 일평균승차인원 데이터를 제공합니다.해당데이터는 2023년 12월 기준으로 업데이트 되었습니다. |
---|---|
Author | 서울교통공사 |
URL | https://www.data.go.kr/data/15044252/fileData.do |
순위 is highly overall correlated with 일평균승차인원 | High correlation |
호선 is highly overall correlated with 역코드 | High correlation |
역코드 is highly overall correlated with 호선 | High correlation |
일평균승차인원 is highly overall correlated with 순위 | High correlation |
순위 has unique values | Unique |
역코드 has unique values | Unique |
역명 has unique values | Unique |
Reproduction
Analysis started | 2024-05-04 08:08:42.153237 |
---|---|
Analysis finished | 2024-05-04 08:08:48.711947 |
Duration | 6.56 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
순위
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 143 |
Minimum | 1 |
---|---|
Maximum | 285 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 15.2 |
Q1 | 72 |
median | 143 |
Q3 | 214 |
95-th percentile | 270.8 |
Maximum | 285 |
Range | 284 |
Interquartile range (IQR) | 142 |
Descriptive statistics
Standard deviation | 82.416625 |
---|---|
Coefficient of variation (CV) | 0.57634003 |
Kurtosis | -1.2 |
Mean | 143 |
Median Absolute Deviation (MAD) | 71 |
Skewness | 0 |
Sum | 40755 |
Variance | 6792.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
189 | 1 | 0.4% |
195 | 1 | 0.4% |
194 | 1 | 0.4% |
193 | 1 | 0.4% |
192 | 1 | 0.4% |
191 | 1 | 0.4% |
190 | 1 | 0.4% |
188 | 1 | 0.4% |
197 | 1 | 0.4% |
Other values (275) | 275 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
285 | 1 | |
284 | 1 | |
283 | 1 | |
282 | 1 | |
281 | 1 | |
280 | 1 | |
279 | 1 | |
278 | 1 | |
277 | 1 | |
276 | 1 |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.8070175 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 8 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.165987 |
---|---|
Coefficient of variation (CV) | 0.45058855 |
Kurtosis | -0.98900904 |
Mean | 4.8070175 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.064358114 |
Sum | 1370 |
Variance | 4.6914999 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 56 | |
2 | 50 | |
7 | 42 | |
6 | 37 | |
3 | 33 | |
4 | 26 | |
8 | 18 | 6.3% |
9 | 13 | 4.6% |
1 | 10 | 3.5% |
Value | Count | Frequency (%) |
1 | 10 | 3.5% |
2 | 50 | |
3 | 33 | |
4 | 26 | |
5 | 56 | |
6 | 37 | |
7 | 42 | |
8 | 18 | 6.3% |
9 | 13 | 4.6% |
Value | Count | Frequency (%) |
9 | 13 | 4.6% |
8 | 18 | 6.3% |
7 | 42 | |
6 | 37 | |
5 | 56 | |
4 | 26 | |
3 | 33 | |
2 | 50 | |
1 | 10 | 3.5% |
역코드
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1730.4456 |
Minimum | 150 |
---|---|
Maximum | 4138 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 205.2 |
Q1 | 320 |
median | 2534 |
Q3 | 2712 |
95-th percentile | 2826.8 |
Maximum | 4138 |
Range | 3988 |
Interquartile range (IQR) | 2392 |
Descriptive statistics
Standard deviation | 1262.5495 |
---|---|
Coefficient of variation (CV) | 0.72960947 |
Kurtosis | -1.5156214 |
Mean | 1730.4456 |
Median Absolute Deviation (MAD) | 284 |
Skewness | -0.10121826 |
Sum | 493177 |
Variance | 1594031.2 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
216 | 1 | 0.4% |
2719 | 1 | 0.4% |
325 | 1 | 0.4% |
2712 | 1 | 0.4% |
2746 | 1 | 0.4% |
2821 | 1 | 0.4% |
4133 | 1 | 0.4% |
4127 | 1 | 0.4% |
2551 | 1 | 0.4% |
2742 | 1 | 0.4% |
Other values (275) | 275 |
Value | Count | Frequency (%) |
150 | 1 | |
151 | 1 | |
152 | 1 | |
153 | 1 | |
154 | 1 | |
155 | 1 | |
156 | 1 | |
157 | 1 | |
158 | 1 | |
159 | 1 |
Value | Count | Frequency (%) |
4138 | 1 | |
4137 | 1 | |
4136 | 1 | |
4135 | 1 | |
4134 | 1 | |
4133 | 1 | |
4132 | 1 | |
4131 | 1 | |
4130 | 1 | |
4129 | 1 |
역명
Text
UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Value | Count | Frequency (%) |
잠실(2 | 1 | 0.4% |
송파 | 1 | 0.4% |
옥수 | 1 | 0.4% |
도봉산(7 | 1 | 0.4% |
대림(7 | 1 | 0.4% |
복정(8 | 1 | 0.4% |
석촌(9 | 1 | 0.4% |
선정릉 | 1 | 0.4% |
굽은다리 | 1 | 0.4% |
뚝섬유원지 | 1 | 0.4% |
Other values (275) | 275 |
Most occurring characters
Value | Count | Frequency (%) |
( | 87 | 7.9% |
) | 87 | 7.9% |
대 | 32 | 2.9% |
구 | 28 | 2.5% |
동 | 23 | 2.1% |
신 | 22 | 2.0% |
산 | 19 | 1.7% |
5 | 18 | 1.6% |
원 | 16 | 1.4% |
2 | 16 | 1.4% |
Other values (213) | 757 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 837 | |
Decimal Number | 94 | 8.5% |
Open Punctuation | 87 | 7.9% |
Close Punctuation | 87 | 7.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
문 | 15 | 1.8% |
지 | 15 | 1.8% |
로 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (202) | 639 |
Decimal Number
Value | Count | Frequency (%) |
5 | 18 | |
2 | 16 | |
3 | 14 | |
6 | 11 | |
7 | 11 | |
4 | 9 | |
8 | 6 | 6.4% |
1 | 6 | 6.4% |
9 | 3 | 3.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 87 |
Close Punctuation
Value | Count | Frequency (%) |
) | 87 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 837 | |
Common | 268 | 24.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
문 | 15 | 1.8% |
지 | 15 | 1.8% |
로 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (202) | 639 |
Common
Value | Count | Frequency (%) |
( | 87 | |
) | 87 | |
5 | 18 | 6.7% |
2 | 16 | 6.0% |
3 | 14 | 5.2% |
6 | 11 | 4.1% |
7 | 11 | 4.1% |
4 | 9 | 3.4% |
8 | 6 | 2.2% |
1 | 6 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 837 | |
ASCII | 268 | 24.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 87 | |
) | 87 | |
5 | 18 | 6.7% |
2 | 16 | 6.0% |
3 | 14 | 5.2% |
6 | 11 | 4.1% |
7 | 11 | 4.1% |
4 | 9 | 3.4% |
8 | 6 | 2.2% |
1 | 6 | 2.2% |
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
문 | 15 | 1.8% |
지 | 15 | 1.8% |
로 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (202) | 639 |
일평균승차인원
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 284 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 15228.825 |
Minimum | 1050 |
---|---|
Maximum | 76010 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1050 |
---|---|
5-th percentile | 3436 |
Q1 | 7530 |
median | 12168 |
Q3 | 18569 |
95-th percentile | 37634.2 |
Maximum | 76010 |
Range | 74960 |
Interquartile range (IQR) | 11039 |
Descriptive statistics
Standard deviation | 11914.193 |
---|---|
Coefficient of variation (CV) | 0.78234488 |
Kurtosis | 6.1139591 |
Mean | 15228.825 |
Median Absolute Deviation (MAD) | 5333 |
Skewness | 2.1372001 |
Sum | 4340215 |
Variance | 1.4194799 × 108 |
Monotonicity | Decreasing |
Value | Count | Frequency (%) |
9936 | 2 | 0.7% |
76010 | 1 | 0.4% |
9554 | 1 | 0.4% |
8734 | 1 | 0.4% |
8745 | 1 | 0.4% |
8779 | 1 | 0.4% |
8814 | 1 | 0.4% |
8830 | 1 | 0.4% |
8831 | 1 | 0.4% |
8958 | 1 | 0.4% |
Other values (274) | 274 |
Value | Count | Frequency (%) |
1050 | 1 | |
1213 | 1 | |
1359 | 1 | |
1594 | 1 | |
1833 | 1 | |
2084 | 1 | |
2367 | 1 | |
2406 | 1 | |
2477 | 1 | |
2647 | 1 |
Value | Count | Frequency (%) |
76010 | 1 | |
74899 | 1 | |
67806 | 1 | |
53483 | 1 | |
53411 | 1 | |
51390 | 1 | |
51361 | 1 | |
50152 | 1 | |
48559 | 1 | |
47816 | 1 |
순위 | 호선 | 역코드 | 일평균승차인원 | |
---|---|---|---|---|
순위 | 1.000 | 0.428 | 0.458 | 0.848 |
호선 | 0.428 | 1.000 | 0.941 | 0.531 |
역코드 | 0.458 | 0.941 | 1.000 | 0.378 |
일평균승차인원 | 0.848 | 0.531 | 0.378 | 1.000 |
순위 | 호선 | 역코드 | 일평균승차인원 | |
---|---|---|---|---|
순위 | 1.000 | 0.381 | 0.408 | -1.000 |
호선 | 0.381 | 1.000 | 0.989 | -0.381 |
역코드 | 0.408 | 0.989 | 1.000 | -0.408 |
일평균승차인원 | -1.000 | -0.381 | -0.408 | 1.000 |
순위 | 호선 | 역코드 | 역명 | 일평균승차인원 | |
---|---|---|---|---|---|
0 | 1 | 2 | 216 | 잠실(2) | 76010 |
1 | 2 | 2 | 222 | 강남 | 74899 |
2 | 3 | 2 | 239 | 홍대입구 | 67806 |
3 | 4 | 2 | 232 | 구로디지털단지 | 53483 |
4 | 5 | 2 | 230 | 신림 | 53411 |
5 | 6 | 1 | 150 | 서울역(1) | 51390 |
6 | 7 | 2 | 219 | 삼성 | 51361 |
7 | 8 | 2 | 220 | 선릉 | 50152 |
8 | 9 | 3 | 329 | 고속터미널(3) | 48559 |
9 | 10 | 2 | 234 | 신도림 | 47816 |
순위 | 호선 | 역코드 | 역명 | 일평균승차인원 | |
---|---|---|---|---|---|
275 | 276 | 6 | 2614 | 독바위 | 2647 |
276 | 277 | 7 | 2711 | 장암 | 2477 |
277 | 278 | 3 | 336 | 학여울 | 2406 |
278 | 279 | 2 | 250 | 용두 | 2367 |
279 | 280 | 6 | 2633 | 버티고개 | 2084 |
280 | 281 | 4 | 431 | 동작 | 1833 |
281 | 282 | 2 | 245 | 신답 | 1594 |
282 | 283 | 4 | 434 | 남태령 | 1359 |
283 | 284 | 2 | 247 | 도림천 | 1213 |
284 | 285 | 9 | 4137 | 둔촌오륜 | 1050 |