Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 285 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 12.4 KiB |
Average record size in memory | 44.5 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Dataset
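Description | Seoul Metro (서울교통공사) ranking of stations by daily average transfer inflow. The data contains the rank (순위), line (해당호선), station number (역번호), station name (역명), and transfer inflow passenger count (환승유입인원수). Data as of December 2022. |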
---|---|
URL | https://www.data.go.kr/data/15044246/fileData.do |
순위 is highly overall correlated with 일평균환승유입인원수 | High correlation |
호선 is highly overall correlated with 역번호 | High correlation |
역번호 is highly overall correlated with 호선 | High correlation |
일평균환승유입인원수 is highly overall correlated with 순위 | High correlation |
순위 has unique values | Unique |
역번호 has unique values | Unique |
역명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 14:08:30.230129 |
---|---|
Analysis finished | 2023-12-12 14:08:32.366388 |
Duration | 2.14 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
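A report of this kind can be regenerated along the following lines. This is a minimal sketch, not the configuration actually used (that is in the `config.json` referenced above): the function name `build_report`, the file paths, the report title, and the `cp949` encoding are all assumptions made for illustration.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV file and write the result as an HTML report (sketch)."""
    # Imported lazily so that defining this function does not require the
    # third-party packages to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Default settings; the original run may have used different options.
    report = ProfileReport(df, title="Dataset profile")
    report.to_file(html_path)
```

Calling `build_report("transfer_inflow.csv", "report.html")` with a hypothetical local copy of the dataset would write the HTML report next to it.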
순위
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 143 |
Minimum | 1 |
---|---|
Maximum | 285 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 15.2 |
Q1 | 72 |
median | 143 |
Q3 | 214 |
95-th percentile | 270.8 |
Maximum | 285 |
Range | 284 |
Interquartile range (IQR) | 142 |
Descriptive statistics
Standard deviation | 82.416625 |
---|---|
Coefficient of variation (CV) | 0.57634003 |
Kurtosis | -1.2 |
Mean | 143 |
Median Absolute Deviation (MAD) | 71 |
Skewness | 0 |
Sum | 40755 |
Variance | 6792.5 |
Monotonicity | Strictly increasing |
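Because 순위 takes each integer from 1 to 285 exactly once, the statistics above can be checked by hand. The sketch below recomputes them with the standard library; the `percentile` helper is a hypothetical name and assumes linear interpolation between neighbouring values, which matches the reported 5-th and 95-th percentiles.

```python
import statistics

ranks = list(range(1, 286))  # each rank 1..285 appears exactly once

total = sum(ranks)
mean = statistics.mean(ranks)
variance = statistics.variance(ranks)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(ranks)
cv = std_dev / mean                    # coefficient of variation
median = statistics.median(ranks)
mad = statistics.median(abs(r - median) for r in ranks)


def percentile(sorted_values, p):
    """Percentile with linear interpolation between neighbouring values."""
    pos = (len(sorted_values) - 1) * p
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])


p5 = percentile(ranks, 0.05)
q1 = percentile(ranks, 0.25)
q3 = percentile(ranks, 0.75)
p95 = percentile(ranks, 0.95)
```

The sum follows from n(n + 1)/2 = 285 × 286 / 2 = 40755, and the sample variance of 1..n is n(n + 1)/12 = 6792.5.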
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
189 | 1 | 0.4% |
195 | 1 | 0.4% |
194 | 1 | 0.4% |
193 | 1 | 0.4% |
192 | 1 | 0.4% |
191 | 1 | 0.4% |
190 | 1 | 0.4% |
188 | 1 | 0.4% |
197 | 1 | 0.4% |
Other values (275) | 275 | 96.5% |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
2 | 1 | 0.4% |
3 | 1 | 0.4% |
4 | 1 | 0.4% |
5 | 1 | 0.4% |
6 | 1 | 0.4% |
7 | 1 | 0.4% |
8 | 1 | 0.4% |
9 | 1 | 0.4% |
10 | 1 | 0.4% |
Value | Count | Frequency (%) |
285 | 1 | 0.4% |
284 | 1 | 0.4% |
283 | 1 | 0.4% |
282 | 1 | 0.4% |
281 | 1 | 0.4% |
280 | 1 | 0.4% |
279 | 1 | 0.4% |
278 | 1 | 0.4% |
277 | 1 | 0.4% |
276 | 1 | 0.4% |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.8070175 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 8 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.165987 |
---|---|
Coefficient of variation (CV) | 0.45058855 |
Kurtosis | -0.98900904 |
Mean | 4.8070175 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.064358114 |
Sum | 1370 |
Variance | 4.6914999 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 56 | 19.6% |
2 | 50 | 17.5% |
7 | 42 | 14.7% |
6 | 37 | 13.0% |
3 | 33 | 11.6% |
4 | 26 | 9.1% |
8 | 18 | 6.3% |
9 | 13 | 4.6% |
1 | 10 | 3.5% |
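Each Frequency (%) value is the count divided by the 285 observations. The short sketch below recomputes the percentages from the counts in the table and also confirms the reported Sum and Mean for 호선; the variable names are illustrative only.

```python
# Counts per line (호선), copied from the table above.
line_counts = {5: 56, 2: 50, 7: 42, 6: 37, 3: 33, 4: 26, 8: 18, 9: 13, 1: 10}

n = sum(line_counts.values())  # total number of observations

# Frequency as a percentage, rounded to one decimal place as in the report.
frequency_pct = {line: round(100 * count / n, 1) for line, count in line_counts.items()}

# Sum and mean of the 호선 column, reconstructed from the counts.
weighted_sum = sum(line * count for line, count in line_counts.items())
mean_line = weighted_sum / n
```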
Value | Count | Frequency (%) |
1 | 10 | 3.5% |
2 | 50 | 17.5% |
3 | 33 | 11.6% |
4 | 26 | 9.1% |
5 | 56 | 19.6% |
6 | 37 | 13.0% |
7 | 42 | 14.7% |
8 | 18 | 6.3% |
9 | 13 | 4.6% |
Value | Count | Frequency (%) |
9 | 13 | 4.6% |
8 | 18 | 6.3% |
7 | 42 | 14.7% |
6 | 37 | 13.0% |
5 | 56 | 19.6% |
4 | 26 | 9.1% |
3 | 33 | 11.6% |
2 | 50 | 17.5% |
1 | 10 | 3.5% |
역번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1730.4456 |
Minimum | 150 |
---|---|
Maximum | 4138 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 205.2 |
Q1 | 320 |
median | 2534 |
Q3 | 2712 |
95-th percentile | 2826.8 |
Maximum | 4138 |
Range | 3988 |
Interquartile range (IQR) | 2392 |
Descriptive statistics
Standard deviation | 1262.5495 |
---|---|
Coefficient of variation (CV) | 0.72960947 |
Kurtosis | -1.5156214 |
Mean | 1730.4456 |
Median Absolute Deviation (MAD) | 284 |
Skewness | -0.10121826 |
Sum | 493177 |
Variance | 1594031.2 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
239 | 1 | 0.4% |
2727 | 1 | 0.4% |
2551 | 1 | 0.4% |
422 | 1 | 0.4% |
4132 | 1 | 0.4% |
205 | 1 | 0.4% |
2555 | 1 | 0.4% |
2626 | 1 | 0.4% |
337 | 1 | 0.4% |
2752 | 1 | 0.4% |
Other values (275) | 275 | 96.5% |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
151 | 1 | 0.4% |
152 | 1 | 0.4% |
153 | 1 | 0.4% |
154 | 1 | 0.4% |
155 | 1 | 0.4% |
156 | 1 | 0.4% |
157 | 1 | 0.4% |
158 | 1 | 0.4% |
159 | 1 | 0.4% |
Value | Count | Frequency (%) |
4138 | 1 | 0.4% |
4137 | 1 | 0.4% |
4136 | 1 | 0.4% |
4135 | 1 | 0.4% |
4134 | 1 | 0.4% |
4133 | 1 | 0.4% |
4132 | 1 | 0.4% |
4131 | 1 | 0.4% |
4130 | 1 | 0.4% |
4129 | 1 | 0.4% |
역명
Text
UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Value | Count | Frequency (%) |
홍대입구 | 1 | 0.4% |
경찰병원 | 1 | 0.4% |
굽은다리 | 1 | 0.4% |
동대문역사문화공원(4 | 1 | 0.4% |
석촌고분 | 1 | 0.4% |
동대문역사문화공원(2 | 1 | 0.4% |
둔촌동 | 1 | 0.4% |
대흥 | 1 | 0.4% |
대청 | 1 | 0.4% |
신정 | 1 | 0.4% |
Other values (275) | 275 | 96.5% |
Most occurring characters
Value | Count | Frequency (%) |
( | 87 | 7.9% |
) | 87 | 7.9% |
대 | 32 | 2.9% |
구 | 28 | 2.5% |
동 | 23 | 2.1% |
신 | 22 | 2.0% |
산 | 19 | 1.7% |
5 | 18 | 1.6% |
원 | 16 | 1.4% |
2 | 16 | 1.4% |
Other values (213) | 757 | 68.5% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 837 | 75.7% |
Decimal Number | 94 | 8.5% |
Open Punctuation | 87 | 7.9% |
Close Punctuation | 87 | 7.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
청 | 14 | 1.7% |
로 | 14 | 1.7% |
Other values (202) | 639 | 76.3% |
Decimal Number
Value | Count | Frequency (%) |
5 | 18 | 19.1% |
2 | 16 | 17.0% |
3 | 14 | 14.9% |
6 | 11 | 11.7% |
7 | 11 | 11.7% |
4 | 9 | 9.6% |
1 | 6 | 6.4% |
8 | 6 | 6.4% |
9 | 3 | 3.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 87 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 87 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 837 | 75.7% |
Common | 268 | 24.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
청 | 14 | 1.7% |
로 | 14 | 1.7% |
Other values (202) | 639 | 76.3% |
Common
Value | Count | Frequency (%) |
( | 87 | 32.5% |
) | 87 | 32.5% |
5 | 18 | 6.7% |
2 | 16 | 6.0% |
3 | 14 | 5.2% |
6 | 11 | 4.1% |
7 | 11 | 4.1% |
4 | 9 | 3.4% |
1 | 6 | 2.2% |
8 | 6 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 837 | 75.7% |
ASCII | 268 | 24.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 87 | 32.5% |
) | 87 | 32.5% |
5 | 18 | 6.7% |
2 | 16 | 6.0% |
3 | 14 | 5.2% |
6 | 11 | 4.1% |
7 | 11 | 4.1% |
4 | 9 | 3.4% |
1 | 6 | 2.2% |
8 | 6 | 2.2% |
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
청 | 14 | 1.7% |
로 | 14 | 1.7% |
Other values (202) | 639 | 76.3% |
일평균환승유입인원수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 283 |
---|---|
Distinct (%) | 99.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7327.8281 |
Minimum | 319 |
---|---|
Maximum | 34234 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 319 |
---|---|
5-th percentile | 1306.2 |
Q1 | 3614 |
median | 5953 |
Q3 | 8861 |
95-th percentile | 20290.8 |
Maximum | 34234 |
Range | 33915 |
Interquartile range (IQR) | 5247 |
Descriptive statistics
Standard deviation | 5689.5122 |
---|---|
Coefficient of variation (CV) | 0.77642545 |
Kurtosis | 3.3717608 |
Mean | 7327.8281 |
Median Absolute Deviation (MAD) | 2414 |
Skewness | 1.715994 |
Sum | 2088431 |
Variance | 32370549 |
Monotonicity | Decreasing |
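Several of the figures above are derived from one another, so they can be cross-checked without the raw data. The sketch below does this for 일평균환승유입인원수 using only values printed in this section; because the report rounds to about eight significant digits, the comparisons are approximate.

```python
import math

# Values copied from the tables above.
n_obs = 285
total = 2088431
std_dev = 5689.5122
q1, q3 = 3614, 8861
minimum, maximum = 319, 34234

mean = total / n_obs          # Mean = Sum / number of observations
cv = std_dev / mean           # Coefficient of variation = std / mean
variance = std_dev ** 2       # Variance = std squared
iqr = q3 - q1                 # Interquartile range = Q3 - Q1
value_range = maximum - minimum

print(f"mean={mean:.4f} cv={cv:.8f} variance={variance:.0f} "
      f"iqr={iqr} range={value_range}")
```

The mean (about 7328) sitting well above the median (5953) is consistent with the positive skewness of 1.715994: a small number of very busy transfer stations pull the average up.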
Value | Count | Frequency (%) |
2876 | 2 | 0.7% |
9993 | 2 | 0.7% |
34234 | 1 | 0.4% |
4273 | 1 | 0.4% |
4088 | 1 | 0.4% |
4102 | 1 | 0.4% |
4213 | 1 | 0.4% |
4216 | 1 | 0.4% |
4260 | 1 | 0.4% |
4291 | 1 | 0.4% |
Other values (273) | 273 | 95.8% |
Value | Count | Frequency (%) |
319 | 1 | 0.4% |
468 | 1 | 0.4% |
608 | 1 | 0.4% |
612 | 1 | 0.4% |
660 | 1 | 0.4% |
714 | 1 | 0.4% |
730 | 1 | 0.4% |
760 | 1 | 0.4% |
889 | 1 | 0.4% |
938 | 1 | 0.4% |
Value | Count | Frequency (%) |
34234 | 1 | 0.4% |
28840 | 1 | 0.4% |
27345 | 1 | 0.4% |
25473 | 1 | 0.4% |
24971 | 1 | 0.4% |
24041 | 1 | 0.4% |
23652 | 1 | 0.4% |
23147 | 1 | 0.4% |
23119 | 1 | 0.4% |
22871 | 1 | 0.4% |
| 순위 | 호선 | 역번호 | 일평균환승유입인원수 |
---|---|---|---|---|
순위 | 1.000 | 0.339 | 0.400 | 0.951 |
호선 | 0.339 | 1.000 | 0.941 | 0.393 |
역번호 | 0.400 | 0.941 | 1.000 | 0.350 |
일평균환승유입인원수 | 0.951 | 0.393 | 0.350 | 1.000 |
| 순위 | 호선 | 역번호 | 일평균환승유입인원수 |
---|---|---|---|---|
순위 | 1.000 | 0.312 | 0.334 | -1.000 |
호선 | 0.312 | 1.000 | 0.989 | -0.312 |
역번호 | 0.334 | 0.989 | 1.000 | -0.333 |
일평균환승유입인원수 | -1.000 | -0.312 | -0.333 | 1.000 |
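The export does not say which coefficient each matrix uses. A value of -1.000 between 순위 and 일평균환승유입인원수 is what a rank-based measure such as Spearman's would give, since the rank is assigned in decreasing order of inflow. The sketch below illustrates this on the ten highest-ranked stations only; treating the matrix as Spearman is an assumption, and the helper names are hypothetical.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# 순위 and 일평균환승유입인원수 for the ten highest-ranked stations in the report.
rank_col = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
inflow = [34234, 28840, 27345, 25473, 24971, 24041, 23652, 23147, 23119, 22871]

rho = spearman(rank_col, inflow)
print(round(rho, 3))
```

Any perfectly decreasing relationship yields a Spearman coefficient of -1 regardless of how unevenly the inflow values are spaced, so the high-correlation alert for this pair reflects how the rank is defined rather than an independent finding.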
| 순위 | 호선 | 역번호 | 역명 | 일평균환승유입인원수 |
---|---|---|---|---|---|
0 | 1 | 2 | 239 | 홍대입구 | 34234 |
1 | 2 | 1 | 150 | 서울역(1) | 28840 |
2 | 3 | 2 | 222 | 강남 | 27345 |
3 | 4 | 1 | 152 | 종각 | 25473 |
4 | 5 | 2 | 216 | 잠실(2) | 24971 |
5 | 6 | 2 | 232 | 구로디지털단지 | 24041 |
6 | 7 | 7 | 2748 | 가산디지털단지(7) | 23652 |
7 | 8 | 2 | 234 | 신도림 | 23147 |
8 | 9 | 2 | 221 | 역삼 | 23119 |
9 | 10 | 3 | 332 | 양재 | 22871 |
| 순위 | 호선 | 역번호 | 역명 | 일평균환승유입인원수 |
---|---|---|---|---|---|
275 | 276 | 5 | 2541 | 왕십리(5) | 938 |
276 | 277 | 2 | 244 | 용답 | 889 |
277 | 278 | 5 | 2524 | 영등포구청(5) | 760 |
278 | 279 | 2 | 247 | 도림천 | 730 |
279 | 280 | 4 | 434 | 남태령 | 714 |
280 | 281 | 2 | 250 | 용두 | 660 |
281 | 282 | 8 | 2827 | 모란(8) | 612 |
282 | 283 | 9 | 4137 | 둔촌오륜 | 608 |
283 | 284 | 7 | 2711 | 장암 | 468 |
284 | 285 | 2 | 245 | 신답 | 319 |