Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 262 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 13.7 KiB |
Average record size in memory | 53.5 B |
Variable types
Numeric | 5 |
---|---|
Text | 1 |
Dataset
Description | 서울교통공사 1-8호선 역별 일평균 승하차인원 정보 입니다. 해당 데이터는 연번, 호선, 역번호, 역명, 연평균승차정보, 연평균 하차정보로 구성되어 있습니다. (2006년 이전데이터는 정보 부존재로 2006년 데이터 부터 게시합니다.) |
---|---|
Author | 서울교통공사 |
URL | https://www.data.go.kr/data/15099899/fileData.do |
연번 is highly overall correlated with 호선 and 1 other fields | High correlation |
호선 is highly overall correlated with 연번 and 1 other fields | High correlation |
역번호 is highly overall correlated with 연번 and 1 other fields | High correlation |
2007년 승차인원(일평균) is highly overall correlated with 2007년 하차인원(일평균) | High correlation |
2007년 하차인원(일평균) is highly overall correlated with 2007년 승차인원(일평균) | High correlation |
연번 has unique values | Unique |
역번호 has unique values | Unique |
역명 has unique values | Unique |
2007년 하차인원(일평균) has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 14:30:55.590445 |
---|---|
Analysis finished | 2023-12-12 14:30:58.228099 |
Duration | 2.64 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 262 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 131.5 |
Minimum | 1 |
---|---|
Maximum | 262 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.4 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 14.05 |
Q1 | 66.25 |
median | 131.5 |
Q3 | 196.75 |
95-th percentile | 248.95 |
Maximum | 262 |
Range | 261 |
Interquartile range (IQR) | 130.5 |
Descriptive statistics
Standard deviation | 75.777085 |
---|---|
Coefficient of variation (CV) | 0.5762516 |
Kurtosis | -1.2 |
Mean | 131.5 |
Median Absolute Deviation (MAD) | 65.5 |
Skewness | 0 |
Sum | 34453 |
Variance | 5742.1667 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
1 | 1 | 0.4% |
166 | 1 | 0.4% |
168 | 1 | 0.4% |
169 | 1 | 0.4% |
170 | 1 | 0.4% |
171 | 1 | 0.4% |
172 | 1 | 0.4% |
173 | 1 | 0.4% |
174 | 1 | 0.4% |
175 | 1 | 0.4% |
Other values (252) | 252 |
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 | |
3 | 1 | |
4 | 1 | |
5 | 1 | |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 |
Value | Count | Frequency (%) |
262 | 1 | |
261 | 1 | |
260 | 1 | |
259 | 1 | |
258 | 1 | |
257 | 1 | |
256 | 1 | |
255 | 1 | |
254 | 1 | |
253 | 1 |
호선
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8 |
---|---|
Distinct (%) | 3.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.6030534 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.4 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 5 |
Q3 | 6 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.0273206 |
---|---|
Coefficient of variation (CV) | 0.44042952 |
Kurtosis | -1.1839681 |
Mean | 4.6030534 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.060319695 |
Sum | 1206 |
Variance | 4.110029 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
2 | 50 | |
5 | 50 | |
7 | 42 | |
6 | 37 | |
3 | 30 | |
4 | 26 | |
8 | 17 | 6.5% |
1 | 10 | 3.8% |
Value | Count | Frequency (%) |
1 | 10 | 3.8% |
2 | 50 | |
3 | 30 | |
4 | 26 | |
5 | 50 | |
6 | 37 | |
7 | 42 | |
8 | 17 | 6.5% |
Value | Count | Frequency (%) |
8 | 17 | 6.5% |
7 | 42 | |
6 | 37 | |
5 | 50 | |
4 | 26 | |
3 | 30 | |
2 | 50 | |
1 | 10 | 3.8% |
역번호
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 262 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1604.1031 |
Minimum | 150 |
---|---|
Maximum | 2827 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.4 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 204.05 |
Q1 | 314.25 |
median | 2526.5 |
Q3 | 2641.75 |
95-th percentile | 2813.95 |
Maximum | 2827 |
Range | 2677 |
Interquartile range (IQR) | 2327.5 |
Descriptive statistics
Standard deviation | 1178.4456 |
---|---|
Coefficient of variation (CV) | 0.73464459 |
Kurtosis | -1.9369219 |
Mean | 1604.1031 |
Median Absolute Deviation (MAD) | 286 |
Skewness | -0.2270754 |
Sum | 420275 |
Variance | 1388734.1 |
Monotonicity | Strictly increasing |
Value | Count | Frequency (%) |
150 | 1 | 0.4% |
2561 | 1 | 0.4% |
2612 | 1 | 0.4% |
2613 | 1 | 0.4% |
2614 | 1 | 0.4% |
2616 | 1 | 0.4% |
2617 | 1 | 0.4% |
2618 | 1 | 0.4% |
2619 | 1 | 0.4% |
2620 | 1 | 0.4% |
Other values (252) | 252 |
Value | Count | Frequency (%) |
150 | 1 | |
151 | 1 | |
152 | 1 | |
153 | 1 | |
154 | 1 | |
155 | 1 | |
156 | 1 | |
157 | 1 | |
158 | 1 | |
159 | 1 |
Value | Count | Frequency (%) |
2827 | 1 | |
2826 | 1 | |
2825 | 1 | |
2824 | 1 | |
2823 | 1 | |
2822 | 1 | |
2821 | 1 | |
2820 | 1 | |
2819 | 1 | |
2818 | 1 |
역명
Text
UNIQUE
 
Distinct | 262 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.2 KiB |
Value | Count | Frequency (%) |
서울역(1 | 1 | 0.4% |
개롱 | 1 | 0.4% |
마천 | 1 | 0.4% |
응암 | 1 | 0.4% |
역촌 | 1 | 0.4% |
불광(6 | 1 | 0.4% |
독바위 | 1 | 0.4% |
구산 | 1 | 0.4% |
새절 | 1 | 0.4% |
증산 | 1 | 0.4% |
Other values (252) | 252 |
Most occurring characters
Value | Count | Frequency (%) |
( | 79 | 7.9% |
) | 79 | 7.9% |
대 | 32 | 3.2% |
구 | 28 | 2.8% |
신 | 22 | 2.2% |
동 | 22 | 2.2% |
5 | 17 | 1.7% |
산 | 17 | 1.7% |
2 | 15 | 1.5% |
문 | 15 | 1.5% |
Other values (200) | 675 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 757 | |
Decimal Number | 86 | 8.6% |
Open Punctuation | 79 | 7.9% |
Close Punctuation | 79 | 7.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 32 | 4.2% |
구 | 28 | 3.7% |
신 | 22 | 2.9% |
동 | 22 | 2.9% |
산 | 17 | 2.2% |
문 | 15 | 2.0% |
지 | 15 | 2.0% |
입 | 14 | 1.8% |
로 | 14 | 1.8% |
원 | 13 | 1.7% |
Other values (190) | 565 |
Decimal Number
Value | Count | Frequency (%) |
5 | 17 | |
2 | 15 | |
3 | 12 | |
7 | 11 | |
6 | 11 | |
4 | 9 | |
1 | 6 | 7.0% |
8 | 5 | 5.8% |
Open Punctuation
Value | Count | Frequency (%) |
( | 79 |
Close Punctuation
Value | Count | Frequency (%) |
) | 79 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 757 | |
Common | 244 | 24.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 32 | 4.2% |
구 | 28 | 3.7% |
신 | 22 | 2.9% |
동 | 22 | 2.9% |
산 | 17 | 2.2% |
문 | 15 | 2.0% |
지 | 15 | 2.0% |
입 | 14 | 1.8% |
로 | 14 | 1.8% |
원 | 13 | 1.7% |
Other values (190) | 565 |
Common
Value | Count | Frequency (%) |
( | 79 | |
) | 79 | |
5 | 17 | 7.0% |
2 | 15 | 6.1% |
3 | 12 | 4.9% |
7 | 11 | 4.5% |
6 | 11 | 4.5% |
4 | 9 | 3.7% |
1 | 6 | 2.5% |
8 | 5 | 2.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 757 | |
ASCII | 244 | 24.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 79 | |
) | 79 | |
5 | 17 | 7.0% |
2 | 15 | 6.1% |
3 | 12 | 4.9% |
7 | 11 | 4.5% |
6 | 11 | 4.5% |
4 | 9 | 3.7% |
1 | 6 | 2.5% |
8 | 5 | 2.0% |
Hangul
Value | Count | Frequency (%) |
대 | 32 | 4.2% |
구 | 28 | 3.7% |
신 | 22 | 2.9% |
동 | 22 | 2.9% |
산 | 17 | 2.2% |
문 | 15 | 2.0% |
지 | 15 | 2.0% |
입 | 14 | 1.8% |
로 | 14 | 1.8% |
원 | 13 | 1.7% |
Other values (190) | 565 |
2007년 승차인원(일평균)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 260 |
---|---|
Distinct (%) | 99.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17298.454 |
Minimum | 626 |
---|---|
Maximum | 93652 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.4 KiB |
Quantile statistics
Minimum | 626 |
---|---|
5-th percentile | 3094.45 |
Q1 | 7663.25 |
median | 13032 |
Q3 | 21367.5 |
95-th percentile | 47853.5 |
Maximum | 93652 |
Range | 93026 |
Interquartile range (IQR) | 13704.25 |
Descriptive statistics
Standard deviation | 14612.251 |
---|---|
Coefficient of variation (CV) | 0.84471429 |
Kurtosis | 4.8795806 |
Mean | 17298.454 |
Median Absolute Deviation (MAD) | 6138.5 |
Skewness | 1.9864686 |
Sum | 4532195 |
Variance | 2.1351789 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
12523 | 2 | 0.8% |
9563 | 2 | 0.8% |
57360 | 1 | 0.4% |
8067 | 1 | 0.4% |
3469 | 1 | 0.4% |
3984 | 1 | 0.4% |
2143 | 1 | 0.4% |
6958 | 1 | 0.4% |
12339 | 1 | 0.4% |
9791 | 1 | 0.4% |
Other values (250) | 250 |
Value | Count | Frequency (%) |
626 | 1 | |
1144 | 1 | |
1541 | 1 | |
1607 | 1 | |
1949 | 1 | |
2032 | 1 | |
2143 | 1 | |
2148 | 1 | |
2476 | 1 | |
2772 | 1 |
Value | Count | Frequency (%) |
93652 | 1 | |
75134 | 1 | |
72765 | 1 | |
72269 | 1 | |
60186 | 1 | |
60095 | 1 | |
58992 | 1 | |
57360 | 1 | |
54292 | 1 | |
52871 | 1 |
2007년 하차인원(일평균)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 262 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17164.027 |
Minimum | 711 |
---|---|
Maximum | 100635 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 2.4 KiB |
Quantile statistics
Minimum | 711 |
---|---|
5-th percentile | 2873.4 |
Q1 | 7353.25 |
median | 12651 |
Q3 | 22213.75 |
95-th percentile | 48477.25 |
Maximum | 100635 |
Range | 99924 |
Interquartile range (IQR) | 14860.5 |
Descriptive statistics
Standard deviation | 14892.496 |
---|---|
Coefficient of variation (CV) | 0.86765749 |
Kurtosis | 5.2741764 |
Mean | 17164.027 |
Median Absolute Deviation (MAD) | 6064.5 |
Skewness | 2.0222128 |
Sum | 4496975 |
Variance | 2.2178645 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
48510 | 1 | 0.4% |
5636 | 1 | 0.4% |
3598 | 1 | 0.4% |
3607 | 1 | 0.4% |
2518 | 1 | 0.4% |
5418 | 1 | 0.4% |
11567 | 1 | 0.4% |
8673 | 1 | 0.4% |
12637 | 1 | 0.4% |
10038 | 1 | 0.4% |
Other values (252) | 252 |
Value | Count | Frequency (%) |
711 | 1 | |
766 | 1 | |
1105 | 1 | |
2054 | 1 | |
2221 | 1 | |
2339 | 1 | |
2473 | 1 | |
2477 | 1 | |
2518 | 1 | |
2667 | 1 |
Value | Count | Frequency (%) |
100635 | 1 | |
75374 | 1 | |
68923 | 1 | |
68648 | 1 | |
62012 | 1 | |
58665 | 1 | |
57692 | 1 | |
56878 | 1 | |
53808 | 1 | |
53310 | 1 |
연번 | 호선 | 역번호 | 2007년 승차인원(일평균) | 2007년 하차인원(일평균) | |
---|---|---|---|---|---|
연번 | 1.000 | 0.914 | 0.917 | 0.602 | 0.419 |
호선 | 0.914 | 1.000 | 0.996 | 0.428 | 0.390 |
역번호 | 0.917 | 0.996 | 1.000 | 0.415 | 0.384 |
2007년 승차인원(일평균) | 0.602 | 0.428 | 0.415 | 1.000 | 0.941 |
2007년 하차인원(일평균) | 0.419 | 0.390 | 0.384 | 0.941 | 1.000 |
연번 | 호선 | 역번호 | 2007년 승차인원(일평균) | 2007년 하차인원(일평균) | |
---|---|---|---|---|---|
연번 | 1.000 | 0.988 | 1.000 | -0.432 | -0.446 |
호선 | 0.988 | 1.000 | 0.988 | -0.409 | -0.424 |
역번호 | 1.000 | 0.988 | 1.000 | -0.432 | -0.446 |
2007년 승차인원(일평균) | -0.432 | -0.409 | -0.432 | 1.000 | 0.987 |
2007년 하차인원(일평균) | -0.446 | -0.424 | -0.446 | 0.987 | 1.000 |
연번 | 호선 | 역번호 | 역명 | 2007년 승차인원(일평균) | 2007년 하차인원(일평균) | |
---|---|---|---|---|---|---|
0 | 1 | 1 | 150 | 서울역(1) | 57360 | 48510 |
1 | 2 | 1 | 151 | 시청(1) | 23706 | 24594 |
2 | 3 | 1 | 152 | 종각 | 52333 | 51151 |
3 | 4 | 1 | 153 | 종로3가(1) | 43357 | 42162 |
4 | 5 | 1 | 154 | 종로5가 | 27031 | 26981 |
5 | 6 | 1 | 155 | 동대문(1) | 17827 | 20576 |
6 | 7 | 1 | 156 | 신설동(1) | 16547 | 15987 |
7 | 8 | 1 | 157 | 제기동 | 19845 | 20481 |
8 | 9 | 1 | 158 | 청량리 | 40929 | 40385 |
9 | 10 | 1 | 159 | 동묘앞(1) | 6565 | 7211 |
연번 | 호선 | 역번호 | 역명 | 2007년 승차인원(일평균) | 2007년 하차인원(일평균) | |
---|---|---|---|---|---|---|
252 | 253 | 8 | 2818 | 가락시장(8) | 11036 | 12003 |
253 | 254 | 8 | 2819 | 문정 | 6237 | 6172 |
254 | 255 | 8 | 2820 | 장지 | 3255 | 2800 |
255 | 256 | 8 | 2821 | 복정(8) | 5634 | 4166 |
256 | 257 | 8 | 2822 | 산성 | 7179 | 6964 |
257 | 258 | 8 | 2823 | 남한산성입구 | 14170 | 12738 |
258 | 259 | 8 | 2824 | 단대오거리 | 13876 | 12078 |
259 | 260 | 8 | 2825 | 신흥 | 6016 | 6369 |
260 | 261 | 8 | 2826 | 수진 | 5657 | 5139 |
261 | 262 | 8 | 2827 | 모란(8) | 4517 | 3223 |