Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 285 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 12.4 KiB |
Average record size in memory | 44.5 B |
Variable types
Numeric | 4 |
---|---|
Text | 1 |
Dataset
Description | 서울교통공사 연간수송인원 수송순위에 대한 데이터 입니다. 이 데이터는 수송순위, 역명, 연간수송인원(명) 데이터를 제공합니다. |
---|---|
Author | 서울교통공사 |
URL | https://www.data.go.kr/data/15044243/fileData.do |
순위 is highly overall correlated with 연간수송인원(명) | High correlation |
호선 is highly overall correlated with 역번호 | High correlation |
역번호 is highly overall correlated with 호선 | High correlation |
연간수송인원(명) is highly overall correlated with 순위 | High correlation |
순위 has unique values | Unique |
역번호 has unique values | Unique |
역명 has unique values | Unique |
연간수송인원(명) has unique values | Unique |
Reproduction
Analysis started | 2024-04-21 01:10:23.763644 |
---|---|
Analysis finished | 2024-04-21 01:10:26.698475 |
Duration | 2.93 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
순위
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 143 |
Minimum | 1 |
---|---|
Maximum | 285 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 15.2 |
Q1 | 72 |
median | 143 |
Q3 | 214 |
95-th percentile | 270.8 |
Maximum | 285 |
Range | 284 |
Interquartile range (IQR) | 142 |
Descriptive statistics
Standard deviation | 82.416625 |
---|---|
Coefficient of variation (CV) | 0.57634003 |
Kurtosis | -1.2 |
Mean | 143 |
Median Absolute Deviation (MAD) | 71 |
Skewness | 0 |
Sum | 40755 |
Variance | 6792.5 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
189 | 1 | 0.4% |
195 | 1 | 0.4% |
194 | 1 | 0.4% |
193 | 1 | 0.4% |
192 | 1 | 0.4% |
191 | 1 | 0.4% |
190 | 1 | 0.4% |
188 | 1 | 0.4% |
197 | 1 | 0.4% |
Other values (275) | 275 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
285 | 1 | |
284 | 1 | |
283 | 1 | |
282 | 1 | |
281 | 1 | |
280 | 1 | |
279 | 1 | |
278 | 1 | |
277 | 1 | |
276 | 1 |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 9 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.8070175 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 8 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.165987 |
---|---|
Coefficient of variation (CV) | 0.45058855 |
Kurtosis | -0.98900904 |
Mean | 4.8070175 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.064358114 |
Sum | 1370 |
Variance | 4.6914999 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 56 | |
2 | 50 | |
7 | 42 | |
6 | 37 | |
3 | 33 | |
4 | 26 | |
8 | 18 | 6.3% |
9 | 13 | 4.6% |
1 | 10 | 3.5% |
Value | Count | Frequency (%) |
1 | 10 | 3.5% |
2 | 50 | |
3 | 33 | |
4 | 26 | |
5 | 56 | |
6 | 37 | |
7 | 42 | |
8 | 18 | 6.3% |
9 | 13 | 4.6% |
Value | Count | Frequency (%) |
9 | 13 | 4.6% |
8 | 18 | 6.3% |
7 | 42 | |
6 | 37 | |
5 | 56 | |
4 | 26 | |
3 | 33 | |
2 | 50 | |
1 | 10 | 3.5% |
역번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1730.4456 |
Minimum | 150 |
---|---|
Maximum | 4138 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 205.2 |
Q1 | 320 |
median | 2534 |
Q3 | 2712 |
95-th percentile | 2826.8 |
Maximum | 4138 |
Range | 3988 |
Interquartile range (IQR) | 2392 |
Descriptive statistics
Standard deviation | 1262.5495 |
---|---|
Coefficient of variation (CV) | 0.72960947 |
Kurtosis | -1.5156214 |
Mean | 1730.4456 |
Median Absolute Deviation (MAD) | 284 |
Skewness | -0.10121826 |
Sum | 493177 |
Variance | 1594031.2 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
239 | 1 | 0.4% |
2641 | 1 | 0.4% |
2626 | 1 | 0.4% |
2553 | 1 | 0.4% |
325 | 1 | 0.4% |
2752 | 1 | 0.4% |
2648 | 1 | 0.4% |
2712 | 1 | 0.4% |
2817 | 1 | 0.4% |
243 | 1 | 0.4% |
Other values (275) | 275 |
Value | Count | Frequency (%) |
150 | 1 | |
151 | 1 | |
152 | 1 | |
153 | 1 | |
154 | 1 | |
155 | 1 | |
156 | 1 | |
157 | 1 | |
158 | 1 | |
159 | 1 |
Value | Count | Frequency (%) |
4138 | 1 | |
4137 | 1 | |
4136 | 1 | |
4135 | 1 | |
4134 | 1 | |
4133 | 1 | |
4132 | 1 | |
4131 | 1 | |
4130 | 1 | |
4129 | 1 |
역명
Text
UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.4 KiB |
Value | Count | Frequency (%) |
홍대입구 | 1 | 0.4% |
동묘앞(6 | 1 | 0.4% |
대흥 | 1 | 0.4% |
고덕 | 1 | 0.4% |
옥수 | 1 | 0.4% |
온수(7 | 1 | 0.4% |
봉화산 | 1 | 0.4% |
도봉산(7 | 1 | 0.4% |
송파 | 1 | 0.4% |
거여 | 1 | 0.4% |
Other values (275) | 275 |
Most occurring characters
Value | Count | Frequency (%) |
( | 87 | 7.9% |
) | 87 | 7.9% |
대 | 32 | 2.9% |
구 | 28 | 2.5% |
동 | 23 | 2.1% |
신 | 22 | 2.0% |
산 | 19 | 1.7% |
5 | 18 | 1.6% |
2 | 16 | 1.4% |
원 | 16 | 1.4% |
Other values (213) | 757 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 837 | |
Decimal Number | 94 | 8.5% |
Open Punctuation | 87 | 7.9% |
Close Punctuation | 87 | 7.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
로 | 14 | 1.7% |
청 | 14 | 1.7% |
Other values (202) | 639 |
Decimal Number
Value | Count | Frequency (%) |
5 | 18 | |
2 | 16 | |
3 | 14 | |
7 | 11 | |
6 | 11 | |
4 | 9 | |
1 | 6 | 6.4% |
8 | 6 | 6.4% |
9 | 3 | 3.2% |
Open Punctuation
Value | Count | Frequency (%) |
( | 87 |
Close Punctuation
Value | Count | Frequency (%) |
) | 87 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 837 | |
Common | 268 | 24.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
로 | 14 | 1.7% |
청 | 14 | 1.7% |
Other values (202) | 639 |
Common
Value | Count | Frequency (%) |
( | 87 | |
) | 87 | |
5 | 18 | 6.7% |
2 | 16 | 6.0% |
3 | 14 | 5.2% |
7 | 11 | 4.1% |
6 | 11 | 4.1% |
4 | 9 | 3.4% |
1 | 6 | 2.2% |
8 | 6 | 2.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 837 | |
ASCII | 268 | 24.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 87 | |
) | 87 | |
5 | 18 | 6.7% |
2 | 16 | 6.0% |
3 | 14 | 5.2% |
7 | 11 | 4.1% |
6 | 11 | 4.1% |
4 | 9 | 3.4% |
1 | 6 | 2.2% |
8 | 6 | 2.2% |
Hangul
Value | Count | Frequency (%) |
대 | 32 | 3.8% |
구 | 28 | 3.3% |
동 | 23 | 2.7% |
신 | 22 | 2.6% |
산 | 19 | 2.3% |
원 | 16 | 1.9% |
지 | 15 | 1.8% |
문 | 15 | 1.8% |
로 | 14 | 1.7% |
청 | 14 | 1.7% |
Other values (202) | 639 |
연간수송인원(명)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 285 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 8482665.7 |
Minimum | 628118 |
---|---|
Maximum | 39441541 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.6 KiB |
Quantile statistics
Minimum | 628118 |
---|---|
5-th percentile | 1858798 |
Q1 | 4222218 |
median | 6832157 |
Q3 | 10606584 |
95-th percentile | 22611032 |
Maximum | 39441541 |
Range | 38813423 |
Interquartile range (IQR) | 6384366 |
Descriptive statistics
Standard deviation | 6531478.2 |
---|---|
Coefficient of variation (CV) | 0.76997944 |
Kurtosis | 4.7467445 |
Mean | 8482665.7 |
Median Absolute Deviation (MAD) | 2976962 |
Skewness | 1.9399669 |
Sum | 2.4175597 × 109 |
Variance | 4.2660207 × 1013 |
Monotonicity | Strictly decreasing |
Value | Count | Frequency (%) |
39441541 | 1 | 0.4% |
5064988 | 1 | 0.4% |
4974809 | 1 | 0.4% |
5010191 | 1 | 0.4% |
5032924 | 1 | 0.4% |
5042731 | 1 | 0.4% |
5062633 | 1 | 0.4% |
5064436 | 1 | 0.4% |
5179956 | 1 | 0.4% |
4872939 | 1 | 0.4% |
Other values (275) | 275 |
Value | Count | Frequency (%) |
628118 | 1 | |
702166 | 1 | |
713074 | 1 | |
781113 | 1 | |
1057939 | 1 | |
1106808 | 1 | |
1120565 | 1 | |
1244658 | 1 | |
1397469 | 1 | |
1406953 | 1 |
Value | Count | Frequency (%) |
39441541 | 1 | |
37873306 | 1 | |
37224977 | 1 | |
31086993 | 1 | |
28687464 | 1 | |
27296469 | 1 | |
27142292 | 1 | |
26726964 | 1 | |
26516158 | 1 | |
25489268 | 1 |
순위 | 호선 | 역번호 | 연간수송인원(명) | |
---|---|---|---|---|
순위 | 1.000 | 0.387 | 0.447 | 0.866 |
호선 | 0.387 | 1.000 | 0.941 | 0.544 |
역번호 | 0.447 | 0.941 | 1.000 | 0.379 |
연간수송인원(명) | 0.866 | 0.544 | 0.379 | 1.000 |
순위 | 호선 | 역번호 | 연간수송인원(명) | |
---|---|---|---|---|
순위 | 1.000 | 0.358 | 0.384 | -1.000 |
호선 | 0.358 | 1.000 | 0.989 | -0.358 |
역번호 | 0.384 | 0.989 | 1.000 | -0.384 |
연간수송인원(명) | -1.000 | -0.358 | -0.384 | 1.000 |
순위 | 호선 | 역번호 | 역명 | 연간수송인원(명) | |
---|---|---|---|---|---|
0 | 1 | 2 | 239 | 홍대입구 | 39441541 |
1 | 2 | 2 | 216 | 잠실(2) | 37873306 |
2 | 3 | 2 | 222 | 강남 | 37224977 |
3 | 4 | 1 | 150 | 서울역(1) | 31086993 |
4 | 5 | 2 | 232 | 구로디지털단지 | 28687464 |
5 | 6 | 2 | 230 | 신림 | 27296469 |
6 | 7 | 2 | 219 | 삼성 | 27142292 |
7 | 8 | 3 | 329 | 고속터미널(3) | 26726964 |
8 | 9 | 2 | 234 | 신도림 | 26516158 |
9 | 10 | 2 | 221 | 역삼 | 25489268 |
순위 | 호선 | 역번호 | 역명 | 연간수송인원(명) | |
---|---|---|---|---|---|
275 | 276 | 3 | 336 | 학여울 | 1406953 |
276 | 277 | 2 | 244 | 용답 | 1397469 |
277 | 278 | 6 | 2633 | 버티고개 | 1244658 |
278 | 279 | 2 | 250 | 용두 | 1120565 |
279 | 280 | 7 | 2711 | 장암 | 1106808 |
280 | 281 | 4 | 431 | 동작 | 1057939 |
281 | 282 | 4 | 434 | 남태령 | 781113 |
282 | 283 | 2 | 247 | 도림천 | 713074 |
283 | 284 | 2 | 245 | 신답 | 702166 |
284 | 285 | 9 | 4137 | 둔촌오륜 | 628118 |