Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 285 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 12.4 KiB |
Average record size in memory | 44.5 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Dataset
Description | 서울교통공사의 일평균 하차순위 데이터입니다. 해당 데이터는 하차순위, 호선, 역번호, 역명, 하차인원(명/일) 데이터를 포함하고 있습니다. |
---|---|
URL | https://www.data.go.kr/data/15044248/fileData.do |
순위 is highly overall correlated with 일평균하차인원수 | High correlation |
호선 is highly overall correlated with 역번호 | High correlation |
역번호 is highly overall correlated with 호선 | High correlation |
일평균하차인원수 is highly overall correlated with 순위 | High correlation |
순위 has unique values | Unique |
역번호 has unique values | Unique |
역명 has unique values | Unique |
일평균하차인원수 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 16:19:57.615235 |
---|---|
Analysis finished | 2023-12-12 16:19:59.488000 |
Duration | 1.87 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
순위
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 143 |
Minimum | 1 |
---|---|
Maximum | 285 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 15.2 |
Q1 | 72 |
median | 143 |
Q3 | 214 |
95-th percentile | 270.8 |
Maximum | 285 |
Range | 284 |
Interquartile range (IQR) | 142 |
Descriptive statistics
Standard deviation | 82.416625 |
---|---|
Coefficient of variation (CV) | 0.57634003 |
Kurtosis | -1.2 |
Mean | 143 |
Median Absolute Deviation (MAD) | 71 |
Skewness | 0 |
Sum | 40755 |
Variance | 6792.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
189 | 1 | 0.4% |
195 | 1 | 0.4% |
194 | 1 | 0.4% |
193 | 1 | 0.4% |
192 | 1 | 0.4% |
191 | 1 | 0.4% |
190 | 1 | 0.4% |
188 | 1 | 0.4% |
197 | 1 | 0.4% |
Other values (275) | 275 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
285 | 1 | |
284 | 1 | |
283 | 1 | |
282 | 1 | |
281 | 1 | |
280 | 1 | |
279 | 1 | |
278 | 1 | |
277 | 1 | |
276 | 1 |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.8070175 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 8 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.165987 |
---|---|
Coefficient of variation (CV) | 0.45058855 |
Kurtosis | -0.98900904 |
Mean | 4.8070175 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.064358114 |
Sum | 1370 |
Variance | 4.6914999 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 56 | |
2 | 50 | |
7 | 42 | |
6 | 37 | |
3 | 33 | |
4 | 26 | |
8 | 18 | 6.3% |
9 | 13 | 4.6% |
1 | 10 | 3.5% |
Value | Count | Frequency (%) |
1 | 10 | 3.5% |
2 | 50 | |
3 | 33 | |
4 | 26 | |
5 | 56 | |
6 | 37 | |
7 | 42 | |
8 | 18 | 6.3% |
9 | 13 | 4.6% |
Value | Count | Frequency (%) |
9 | 13 | 4.6% |
8 | 18 | 6.3% |
7 | 42 | |
6 | 37 | |
5 | 56 | |
4 | 26 | |
3 | 33 | |
2 | 50 | |
1 | 10 | 3.5% |
역번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1730.4456 |
Minimum | 150 |
---|---|
Maximum | 4138 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 205.2 |
Q1 | 320 |
median | 2534 |
Q3 | 2712 |
95-th percentile | 2826.8 |
Maximum | 4138 |
Range | 3988 |
Interquartile range (IQR) | 2392 |
Descriptive statistics
Standard deviation | 1262.5495 |
---|---|
Coefficient of variation (CV) | 0.72960947 |
Kurtosis | -1.5156214 |
Mean | 1730.4456 |
Median Absolute Deviation (MAD) | 284 |
Skewness | -0.10121826 |
Sum | 493177 |
Variance | 1594031.2 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
222 | 1 | 0.4% |
2742 | 1 | 0.4% |
2637 | 1 | 0.4% |
2719 | 1 | 0.4% |
340 | 1 | 0.4% |
2746 | 1 | 0.4% |
2626 | 1 | 0.4% |
2644 | 1 | 0.4% |
330 | 1 | 0.4% |
2636 | 1 | 0.4% |
Other values (275) | 275 |
Value | Count | Frequency (%) |
150 | 1 | |
151 | 1 | |
152 | 1 | |
153 | 1 | |
154 | 1 | |
155 | 1 | |
156 | 1 | |
157 | 1 | |
158 | 1 | |
159 | 1 |
Value | Count | Frequency (%) |
4138 | 1 | |
4137 | 1 | |
4136 | 1 | |
4135 | 1 | |
4134 | 1 | |
4133 | 1 | |
4132 | 1 | |
4131 | 1 | |
4130 | 1 | |
4129 | 1 |
역명
Text
UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Value | Count | Frequency (%) |
강남 | 1 | 0.4% |
가락시장(8 | 1 | 0.4% |
동묘앞(6 | 1 | 0.4% |
태릉입구(7 | 1 | 0.4% |
가락시장(3 | 1 | 0.4% |
대림(7 | 1 | 0.4% |
대흥 | 1 | 0.4% |
돌곶이 | 1 | 0.4% |
교대(3 | 1 | 0.4% |
고덕 | 1 | 0.4% |
Other values (275) | 275 |
Most occurring characters
Value | Count | Frequency (%) |
( | 87 | 7.9% |
) | 87 | 7.9% |
대 | 32 | 2.9% |
구 | 28 | 2.5% |
동 | 23 | 2.1% |
신 | 22 | 2.0% |
산 | 19 | 1.7% |
5 | 18 | 1.6% |
2 | 16 | 1.4% |
원 | 16 | 1.4% |
Other values (213) | 757 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 837 | |
Decimal Number | 94 | 8.5% |
Open Punctuation | 87 | 7.9% |
Close Punctuation | 87 | 7.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
청 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (202) | 639 |
Decimal Number
Value | Count | Frequency (%) |
5 | 18 | |
2 | 16 | |
3 | 14 | |
7 | 11 | |
6 | 11 | |
4 | 9 | |
8 | 6 | 6.4% |
1 | 6 | 6.4% |
9 | 3 | 3.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 87 |
Close Punctuation
Value | Count | Frequency (%) |
) | 87 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 837 | |
Common | 268 | 24.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
청 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (202) | 639 |
Common
Value | Count | Frequency (%) |
( | 87 | |
) | 87 | |
5 | 18 | 6.7% |
2 | 16 | 6.0% |
3 | 14 | 5.2% |
7 | 11 | 4.1% |
6 | 11 | 4.1% |
4 | 9 | 3.4% |
8 | 6 | 2.2% |
1 | 6 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 837 | |
ASCII | 268 | 24.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 87 | |
) | 87 | |
5 | 18 | 6.7% |
2 | 16 | 6.0% |
3 | 14 | 5.2% |
7 | 11 | 4.1% |
6 | 11 | 4.1% |
4 | 9 | 3.4% |
8 | 6 | 2.2% |
1 | 6 | 2.2% |
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
청 | 14 | 1.7% |
입 | 14 | 1.7% |
Other values (202) | 639 |
일평균하차인원수
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13699.674 |
Minimum | 323 |
---|---|
Maximum | 70404 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 323 |
---|---|
5-th percentile | 1704.4 |
Q1 | 6240 |
median | 10852 |
Q3 | 17952 |
95-th percentile | 36127.8 |
Maximum | 70404 |
Range | 70081 |
Interquartile range (IQR) | 11712 |
Descriptive statistics
Standard deviation | 11220.027 |
---|---|
Coefficient of variation (CV) | 0.81899961 |
Kurtosis | 5.4641887 |
Mean | 13699.674 |
Median Absolute Deviation (MAD) | 5187 |
Skewness | 2.0110836 |
Sum | 3904407 |
Variance | 1.2588901 × 108 |
Monotonicity | Strictly decreasing |
Value | Count | Frequency (%) |
70404 | 1 | 0.4% |
7967 | 1 | 0.4% |
7669 | 1 | 0.4% |
7762 | 1 | 0.4% |
7817 | 1 | 0.4% |
7858 | 1 | 0.4% |
7912 | 1 | 0.4% |
7938 | 1 | 0.4% |
7974 | 1 | 0.4% |
7632 | 1 | 0.4% |
Other values (275) | 275 |
Value | Count | Frequency (%) |
323 | 1 | |
658 | 1 | |
716 | 1 | |
977 | 1 | |
1016 | 1 | |
1164 | 1 | |
1257 | 1 | |
1274 | 1 | |
1365 | 1 | |
1370 | 1 |
Value | Count | Frequency (%) |
70404 | 1 | |
67651 | 1 | |
61255 | 1 | |
52910 | 1 | |
50452 | 1 | |
46905 | 1 | |
46331 | 1 | |
44051 | 1 | |
42521 | 1 | |
42042 | 1 |
순위 | 호선 | 역번호 | 일평균하차인원수 | |
---|---|---|---|---|
순위 | 1.000 | 0.586 | 0.671 | 0.925 |
호선 | 0.586 | 1.000 | 0.941 | 0.429 |
역번호 | 0.671 | 0.941 | 1.000 | 0.477 |
일평균하차인원수 | 0.925 | 0.429 | 0.477 | 1.000 |
순위 | 호선 | 역번호 | 일평균하차인원수 | |
---|---|---|---|---|
순위 | 1.000 | 0.434 | 0.458 | -1.000 |
호선 | 0.434 | 1.000 | 0.989 | -0.434 |
역번호 | 0.458 | 0.989 | 1.000 | -0.458 |
일평균하차인원수 | -1.000 | -0.434 | -0.458 | 1.000 |
순위 | 호선 | 역번호 | 역명 | 일평균하차인원수 | |
---|---|---|---|---|---|
0 | 1 | 2 | 222 | 강남 | 70404 |
1 | 2 | 2 | 216 | 잠실(2) | 67651 |
2 | 3 | 2 | 239 | 홍대입구 | 61255 |
3 | 4 | 2 | 230 | 신림 | 52910 |
4 | 5 | 2 | 232 | 구로디지털단지 | 50452 |
5 | 6 | 2 | 221 | 역삼 | 46905 |
6 | 7 | 2 | 219 | 삼성 | 46331 |
7 | 8 | 2 | 234 | 신도림 | 44051 |
8 | 9 | 3 | 329 | 고속터미널(3) | 42521 |
9 | 10 | 2 | 228 | 서울대입구 | 42042 |
순위 | 호선 | 역번호 | 역명 | 일평균하차인원수 | |
---|---|---|---|---|---|
275 | 276 | 9 | 4134 | 송파나루 | 1370 |
276 | 277 | 9 | 4136 | 올림픽공원(9) | 1365 |
277 | 278 | 9 | 4132 | 석촌고분 | 1274 |
278 | 279 | 2 | 247 | 도림천 | 1257 |
279 | 280 | 9 | 4128 | 삼성중앙 | 1164 |
280 | 281 | 4 | 434 | 남태령 | 1016 |
281 | 282 | 7 | 2711 | 장암 | 977 |
282 | 283 | 9 | 4135 | 한성백제 | 716 |
283 | 284 | 9 | 4130 | 종합운동장(9) | 658 |
284 | 285 | 9 | 4137 | 둔촌오륜 | 323 |