Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 585.9 KiB |
Average record size in memory | 60.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-12914/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 06:20:45.211863 |
---|---|
Analysis finished | 2024-05-11 06:20:49.388067 |
Duration | 4.18 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
사용일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 170 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20190348 |
Minimum | 20190101 |
---|---|
Maximum | 20190619 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20190101 |
---|---|
5-th percentile | 20190109 |
Q1 | 20190211 |
median | 20190326 |
Q3 | 20190507 |
95-th percentile | 20190610 |
Maximum | 20190619 |
Range | 518 |
Interquartile range (IQR) | 296 |
Descriptive statistics
Standard deviation | 162.0542 |
---|---|
Coefficient of variation (CV) | 8.0263204 × 10-6 |
Kurtosis | -1.2124412 |
Mean | 20190348 |
Median Absolute Deviation (MAD) | 122 |
Skewness | 0.021460156 |
Sum | 2.0190348 × 1011 |
Variance | 26261.564 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20190405 | 79 | 0.8% |
20190420 | 77 | 0.8% |
20190130 | 77 | 0.8% |
20190310 | 75 | 0.8% |
20190413 | 74 | 0.7% |
20190128 | 72 | 0.7% |
20190430 | 72 | 0.7% |
20190216 | 72 | 0.7% |
20190307 | 72 | 0.7% |
20190219 | 71 | 0.7% |
Other values (160) | 9259 |
Value | Count | Frequency (%) |
20190101 | 58 | |
20190102 | 62 | |
20190103 | 46 | |
20190104 | 59 | |
20190105 | 58 | |
20190106 | 58 | |
20190107 | 66 | |
20190108 | 54 | |
20190109 | 49 | |
20190110 | 60 |
Value | Count | Frequency (%) |
20190619 | 4 | < 0.1% |
20190618 | 54 | |
20190617 | 65 | |
20190616 | 51 | |
20190615 | 54 | |
20190614 | 58 | |
20190613 | 66 | |
20190612 | 67 | |
20190611 | 61 | |
20190610 | 51 |
노선명
Categorical
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
5호선 | |
---|---|
7호선 | |
2호선 | |
경부선 | |
6호선 | |
Other values (20) |
Length
Max length | 8 |
---|---|
Median length | 3 |
Mean length | 3.2735 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 4호선 |
---|---|
2nd row | 경인선 |
3rd row | 2호선 |
4th row | 8호선 |
5th row | 경부선 |
Common Values
Value | Count | Frequency (%) |
5호선 | 896 | 9.0% |
7호선 | 845 | 8.5% |
2호선 | 816 | 8.2% |
경부선 | 658 | 6.6% |
6호선 | 619 | 6.2% |
분당선 | 608 | 6.1% |
3호선 | 598 | 6.0% |
경원선 | 489 | 4.9% |
경의선 | 431 | 4.3% |
4호선 | 417 | 4.2% |
Other values (15) | 3623 |
Length
Value | Count | Frequency (%) |
5호선 | 896 | 8.7% |
7호선 | 845 | 8.2% |
2호선 | 816 | 8.0% |
경부선 | 658 | 6.4% |
6호선 | 619 | 6.0% |
분당선 | 608 | 5.9% |
3호선 | 598 | 5.8% |
경원선 | 489 | 4.8% |
경의선 | 431 | 4.2% |
1호선 | 422 | 4.1% |
Other values (15) | 3872 |
역명
Text
Distinct | 509 |
---|---|
Distinct (%) | 5.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
서울역 | 87 | 0.9% |
공덕 | 74 | 0.7% |
김포공항 | 55 | 0.5% |
디지털미디어시티 | 54 | 0.5% |
종로3가 | 53 | 0.5% |
왕십리(성동구청 | 52 | 0.5% |
신설동 | 52 | 0.5% |
홍대입구 | 52 | 0.5% |
고속터미널 | 48 | 0.5% |
오금 | 44 | 0.4% |
Other values (499) | 9429 |
Most occurring characters
Value | Count | Frequency (%) |
대 | 1251 | 3.5% |
) | 1181 | 3.3% |
( | 1181 | 3.3% |
구 | 1177 | 3.3% |
동 | 846 | 2.4% |
신 | 748 | 2.1% |
산 | 747 | 2.1% |
청 | 706 | 2.0% |
원 | 697 | 1.9% |
정 | 563 | 1.6% |
Other values (282) | 26671 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 33110 | |
Close Punctuation | 1181 | 3.3% |
Open Punctuation | 1181 | 3.3% |
Decimal Number | 233 | 0.7% |
Other Punctuation | 63 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 1251 | 3.8% |
구 | 1177 | 3.6% |
동 | 846 | 2.6% |
신 | 748 | 2.3% |
산 | 747 | 2.3% |
청 | 706 | 2.1% |
원 | 697 | 2.1% |
정 | 563 | 1.7% |
천 | 561 | 1.7% |
서 | 528 | 1.6% |
Other values (273) | 25286 |
Decimal Number
Value | Count | Frequency (%) |
3 | 88 | |
4 | 56 | |
1 | 36 | |
5 | 20 | 8.6% |
2 | 20 | 8.6% |
9 | 13 | 5.6% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1181 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1181 |
Other Punctuation
Value | Count | Frequency (%) |
. | 63 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 33110 | |
Common | 2658 | 7.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 1251 | 3.8% |
구 | 1177 | 3.6% |
동 | 846 | 2.6% |
신 | 748 | 2.3% |
산 | 747 | 2.3% |
청 | 706 | 2.1% |
원 | 697 | 2.1% |
정 | 563 | 1.7% |
천 | 561 | 1.7% |
서 | 528 | 1.6% |
Other values (273) | 25286 |
Common
Value | Count | Frequency (%) |
) | 1181 | |
( | 1181 | |
3 | 88 | 3.3% |
. | 63 | 2.4% |
4 | 56 | 2.1% |
1 | 36 | 1.4% |
5 | 20 | 0.8% |
2 | 20 | 0.8% |
9 | 13 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 33110 | |
ASCII | 2658 | 7.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 1251 | 3.8% |
구 | 1177 | 3.6% |
동 | 846 | 2.6% |
신 | 748 | 2.3% |
산 | 747 | 2.3% |
청 | 706 | 2.1% |
원 | 697 | 2.1% |
정 | 563 | 1.7% |
천 | 561 | 1.7% |
서 | 528 | 1.6% |
Other values (273) | 25286 |
ASCII
Value | Count | Frequency (%) |
) | 1181 | |
( | 1181 | |
3 | 88 | 3.3% |
. | 63 | 2.4% |
4 | 56 | 2.1% |
1 | 36 | 1.4% |
5 | 20 | 0.8% |
2 | 20 | 0.8% |
9 | 13 | 0.5% |
승차총승객수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8221 |
---|---|
Distinct (%) | 82.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12497.362 |
Minimum | 1 |
---|---|
Maximum | 125284 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1079.6 |
Q1 | 4121.5 |
median | 8836.5 |
Q3 | 16307.25 |
95-th percentile | 37424.45 |
Maximum | 125284 |
Range | 125283 |
Interquartile range (IQR) | 12185.75 |
Descriptive statistics
Standard deviation | 12881.787 |
---|---|
Coefficient of variation (CV) | 1.0307605 |
Kurtosis | 10.948277 |
Mean | 12497.362 |
Median Absolute Deviation (MAD) | 5481 |
Skewness | 2.6530986 |
Sum | 1.2497362 × 108 |
Variance | 1.6594043 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 47 | 0.5% |
2 | 15 | 0.1% |
3 | 7 | 0.1% |
2178 | 6 | 0.1% |
1374 | 5 | 0.1% |
4411 | 5 | 0.1% |
4557 | 4 | < 0.1% |
8547 | 4 | < 0.1% |
2179 | 4 | < 0.1% |
4030 | 4 | < 0.1% |
Other values (8211) | 9899 |
Value | Count | Frequency (%) |
1 | 47 | |
2 | 15 | 0.1% |
3 | 7 | 0.1% |
4 | 1 | < 0.1% |
5 | 2 | < 0.1% |
6 | 1 | < 0.1% |
20 | 1 | < 0.1% |
26 | 1 | < 0.1% |
29 | 1 | < 0.1% |
32 | 1 | < 0.1% |
Value | Count | Frequency (%) |
125284 | 1 | |
124516 | 1 | |
120636 | 1 | |
119912 | 1 | |
119816 | 1 | |
118278 | 1 | |
115657 | 1 | |
113431 | 1 | |
110394 | 1 | |
108878 | 1 |
하차총승객수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8208 |
---|---|
Distinct (%) | 82.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12447.271 |
Minimum | 0 |
---|---|
Maximum | 129588 |
Zeros | 73 |
Zeros (%) | 0.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1034.95 |
Q1 | 3951.5 |
median | 8590.5 |
Q3 | 16277.25 |
95-th percentile | 38668.15 |
Maximum | 129588 |
Range | 129588 |
Interquartile range (IQR) | 12325.75 |
Descriptive statistics
Standard deviation | 13079.112 |
---|---|
Coefficient of variation (CV) | 1.0507614 |
Kurtosis | 10.613678 |
Mean | 12447.271 |
Median Absolute Deviation (MAD) | 5347 |
Skewness | 2.6205803 |
Sum | 1.2447271 × 108 |
Variance | 1.7106317 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 73 | 0.7% |
988 | 5 | 0.1% |
10757 | 5 | 0.1% |
7617 | 5 | 0.1% |
4945 | 5 | 0.1% |
2142 | 4 | < 0.1% |
12743 | 4 | < 0.1% |
3902 | 4 | < 0.1% |
4953 | 4 | < 0.1% |
4890 | 4 | < 0.1% |
Other values (8198) | 9887 |
Value | Count | Frequency (%) |
0 | 73 | |
25 | 2 | < 0.1% |
26 | 1 | < 0.1% |
29 | 1 | < 0.1% |
30 | 1 | < 0.1% |
31 | 1 | < 0.1% |
33 | 3 | < 0.1% |
34 | 4 | < 0.1% |
35 | 3 | < 0.1% |
36 | 1 | < 0.1% |
Value | Count | Frequency (%) |
129588 | 1 | |
125097 | 1 | |
124399 | 1 | |
121176 | 1 | |
120632 | 1 | |
120374 | 1 | |
118613 | 1 | |
114449 | 1 | |
111249 | 1 | |
109494 | 1 |
등록일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 170 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20190357 |
Minimum | 20190104 |
---|---|
Maximum | 20190622 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20190104 |
---|---|
5-th percentile | 20190112 |
Q1 | 20190214 |
median | 20190329 |
Q3 | 20190510 |
95-th percentile | 20190613 |
Maximum | 20190622 |
Range | 518 |
Interquartile range (IQR) | 296 |
Descriptive statistics
Standard deviation | 162.44891 |
---|---|
Coefficient of variation (CV) | 8.0458662 × 10-6 |
Kurtosis | -1.2073239 |
Mean | 20190357 |
Median Absolute Deviation (MAD) | 124 |
Skewness | -0.0002721537 |
Sum | 2.0190357 × 1011 |
Variance | 26389.648 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20190408 | 79 | 0.8% |
20190423 | 77 | 0.8% |
20190202 | 77 | 0.8% |
20190313 | 75 | 0.8% |
20190416 | 74 | 0.7% |
20190131 | 72 | 0.7% |
20190503 | 72 | 0.7% |
20190219 | 72 | 0.7% |
20190310 | 72 | 0.7% |
20190222 | 71 | 0.7% |
Other values (160) | 9259 |
Value | Count | Frequency (%) |
20190104 | 58 | |
20190105 | 62 | |
20190106 | 46 | |
20190107 | 59 | |
20190108 | 58 | |
20190109 | 58 | |
20190110 | 66 | |
20190111 | 54 | |
20190112 | 49 | |
20190113 | 60 |
Value | Count | Frequency (%) |
20190622 | 4 | < 0.1% |
20190621 | 54 | |
20190620 | 65 | |
20190619 | 51 | |
20190618 | 54 | |
20190617 | 58 | |
20190616 | 66 | |
20190615 | 67 | |
20190614 | 61 | |
20190613 | 51 |
사용일자 | 노선명 | 승차총승객수 | 하차총승객수 | 등록일자 | |
---|---|---|---|---|---|
사용일자 | 1.000 | 0.000 | 0.075 | 0.074 | 0.997 |
노선명 | 0.000 | 1.000 | 0.545 | 0.536 | 0.000 |
승차총승객수 | 0.075 | 0.545 | 1.000 | 0.986 | 0.073 |
하차총승객수 | 0.074 | 0.536 | 0.986 | 1.000 | 0.070 |
등록일자 | 0.997 | 0.000 | 0.073 | 0.070 | 1.000 |
사용일자 | 승차총승객수 | 하차총승객수 | 등록일자 | 노선명 | |
---|---|---|---|---|---|
사용일자 | 1.000 | 0.061 | 0.061 | 1.000 | 0.000 |
승차총승객수 | 0.061 | 1.000 | 0.993 | 0.061 | 0.224 |
하차총승객수 | 0.061 | 0.993 | 1.000 | 0.061 | 0.219 |
등록일자 | 1.000 | 0.061 | 0.061 | 1.000 | 0.000 |
노선명 | 0.000 | 0.224 | 0.219 | 0.000 | 1.000 |
사용일자 | 노선명 | 역명 | 승차총승객수 | 하차총승객수 | 등록일자 | |
---|---|---|---|---|---|---|
30943 | 20190222 | 4호선 | 충무로 | 36974 | 37778 | 20190225 |
13765 | 20190124 | 경인선 | 부평 | 38638 | 41983 | 20190127 |
19098 | 20190202 | 2호선 | 왕십리(성동구청) | 14282 | 11979 | 20190205 |
43673 | 20190315 | 8호선 | 잠실(송파구청) | 17595 | 20007 | 20190318 |
92421 | 20190606 | 경부선 | 독산 | 9165 | 9350 | 20190609 |
17551 | 20190130 | 5호선 | 왕십리(성동구청) | 5917 | 6788 | 20190202 |
93684 | 20190608 | 과천선 | 인덕원 | 21885 | 20937 | 20190611 |
37907 | 20190306 | 3호선 | 일원 | 12024 | 12206 | 20190309 |
83868 | 20190522 | 7호선 | 군자(능동) | 17168 | 13542 | 20190525 |
57706 | 20190408 | 수인선 | 소래포구 | 5523 | 5098 | 20190411 |
사용일자 | 노선명 | 역명 | 승차총승객수 | 하차총승객수 | 등록일자 | |
---|---|---|---|---|---|---|
96630 | 20190613 | 분당선 | 선정릉 | 9248 | 10084 | 20190616 |
28348 | 20190217 | 9호선2~3단계 | 종합운동장 | 3040 | 2305 | 20190220 |
66907 | 20190424 | 3호선 | 매봉 | 14114 | 13470 | 20190427 |
93267 | 20190607 | 5호선 | 마천 | 5591 | 5528 | 20190610 |
8094 | 20190114 | 5호선 | 천호(풍납토성) | 20489 | 21380 | 20190117 |
36312 | 20190303 | 중앙선 | 회기 | 21344 | 22536 | 20190306 |
31062 | 20190222 | 과천선 | 인덕원 | 30956 | 30641 | 20190225 |
98760 | 20190617 | 1호선 | 신설동 | 18116 | 17639 | 20190620 |
95098 | 20190610 | 6호선 | 역촌 | 4597 | 5399 | 20190613 |
22825 | 20190208 | 분당선 | 개포동 | 3742 | 3881 | 20190211 |