Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 585.9 KiB |
Average record size in memory | 60.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | 파일 다운로드 |
---|---|
Author | 서울특별시 |
URL | https://data.seoul.go.kr/dataList/OA-12914/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 06:21:08.379465 |
---|---|
Analysis finished | 2024-05-11 06:21:13.942007 |
Duration | 5.56 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
사용일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 182 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20160365 |
Minimum | 20160101 |
---|---|
Maximum | 20160630 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20160101 |
---|---|
5-th percentile | 20160110 |
Q1 | 20160214 |
median | 20160401 |
Q3 | 20160514 |
95-th percentile | 20160621 |
Maximum | 20160630 |
Range | 529 |
Interquartile range (IQR) | 300 |
Descriptive statistics
Standard deviation | 170.12512 |
---|---|
Coefficient of variation (CV) | 8.4385934 × 10-6 |
Kurtosis | -1.248385 |
Mean | 20160365 |
Median Absolute Deviation (MAD) | 128 |
Skewness | -0.012112141 |
Sum | 2.0160365 × 1011 |
Variance | 28942.558 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20160122 | 81 | 0.8% |
20160330 | 72 | 0.7% |
20160529 | 70 | 0.7% |
20160117 | 68 | 0.7% |
20160423 | 68 | 0.7% |
20160528 | 67 | 0.7% |
20160302 | 67 | 0.7% |
20160426 | 67 | 0.7% |
20160513 | 67 | 0.7% |
20160610 | 66 | 0.7% |
Other values (172) | 9307 |
Value | Count | Frequency (%) |
20160101 | 49 | |
20160102 | 41 | |
20160103 | 61 | |
20160104 | 60 | |
20160105 | 50 | |
20160106 | 55 | |
20160107 | 64 | |
20160108 | 61 | |
20160109 | 45 | |
20160110 | 50 |
Value | Count | Frequency (%) |
20160630 | 17 | 0.2% |
20160629 | 60 | |
20160628 | 55 | |
20160627 | 57 | |
20160626 | 63 | |
20160625 | 55 | |
20160624 | 46 | |
20160623 | 56 | |
20160622 | 60 | |
20160621 | 60 |
노선명
Categorical
Distinct | 23 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
5호선 | |
---|---|
7호선 | |
2호선 | |
경부선 | |
6호선 | |
Other values (18) |
Length
Max length | 8 |
---|---|
Median length | 3 |
Mean length | 3.1361 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 9호선2단계 |
---|---|
2nd row | 7호선 |
3rd row | 5호선 |
4th row | 분당선 |
5th row | 분당선 |
Common Values
Value | Count | Frequency (%) |
5호선 | 967 | 9.7% |
7호선 | 933 | 9.3% |
2호선 | 907 | 9.1% |
경부선 | 701 | 7.0% |
6호선 | 659 | 6.6% |
분당선 | 609 | 6.1% |
3호선 | 586 | 5.9% |
경원선 | 512 | 5.1% |
9호선 | 458 | 4.6% |
경의선 | 456 | 4.6% |
Other values (13) | 3212 |
Length
Value | Count | Frequency (%) |
5호선 | 967 | 9.5% |
7호선 | 933 | 9.1% |
2호선 | 907 | 8.9% |
경부선 | 701 | 6.9% |
6호선 | 659 | 6.5% |
분당선 | 609 | 6.0% |
3호선 | 586 | 5.7% |
경원선 | 512 | 5.0% |
9호선 | 458 | 4.5% |
경의선 | 456 | 4.5% |
Other values (13) | 3423 |
역명
Text
Distinct | 483 |
---|---|
Distinct (%) | 4.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
서울역 | 88 | 0.9% |
공덕 | 69 | 0.7% |
종로3가 | 67 | 0.7% |
동대문역사문화공원 | 55 | 0.5% |
홍대입구 | 55 | 0.5% |
김포공항 | 54 | 0.5% |
디지털미디어시티 | 54 | 0.5% |
고속터미널 | 53 | 0.5% |
청구 | 51 | 0.5% |
여의도 | 46 | 0.5% |
Other values (472) | 9421 |
Most occurring characters
Value | Count | Frequency (%) |
대 | 926 | 3.3% |
구 | 908 | 3.2% |
산 | 807 | 2.8% |
신 | 707 | 2.5% |
동 | 702 | 2.5% |
천 | 615 | 2.2% |
정 | 544 | 1.9% |
청 | 538 | 1.9% |
원 | 500 | 1.8% |
수 | 441 | 1.6% |
Other values (258) | 21715 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 27936 | |
Decimal Number | 164 | 0.6% |
Close Punctuation | 145 | 0.5% |
Open Punctuation | 145 | 0.5% |
Space Separator | 13 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 926 | 3.3% |
구 | 908 | 3.3% |
산 | 807 | 2.9% |
신 | 707 | 2.5% |
동 | 702 | 2.5% |
천 | 615 | 2.2% |
정 | 544 | 1.9% |
청 | 538 | 1.9% |
원 | 500 | 1.8% |
수 | 441 | 1.6% |
Other values (252) | 21248 |
Decimal Number
Value | Count | Frequency (%) |
3 | 110 | |
4 | 35 | 21.3% |
5 | 19 | 11.6% |
Close Punctuation
Value | Count | Frequency (%) |
) | 145 |
Open Punctuation
Value | Count | Frequency (%) |
( | 145 |
Space Separator
Value | Count | Frequency (%) |
13 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 27936 | |
Common | 467 | 1.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 926 | 3.3% |
구 | 908 | 3.3% |
산 | 807 | 2.9% |
신 | 707 | 2.5% |
동 | 702 | 2.5% |
천 | 615 | 2.2% |
정 | 544 | 1.9% |
청 | 538 | 1.9% |
원 | 500 | 1.8% |
수 | 441 | 1.6% |
Other values (252) | 21248 |
Common
Value | Count | Frequency (%) |
) | 145 | |
( | 145 | |
3 | 110 | |
4 | 35 | 7.5% |
5 | 19 | 4.1% |
13 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 27936 | |
ASCII | 467 | 1.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 926 | 3.3% |
구 | 908 | 3.3% |
산 | 807 | 2.9% |
신 | 707 | 2.5% |
동 | 702 | 2.5% |
천 | 615 | 2.2% |
정 | 544 | 1.9% |
청 | 538 | 1.9% |
원 | 500 | 1.8% |
수 | 441 | 1.6% |
Other values (252) | 21248 |
ASCII
Value | Count | Frequency (%) |
) | 145 | |
( | 145 | |
3 | 110 | |
4 | 35 | 7.5% |
5 | 19 | 4.1% |
13 | 2.8% |
승차총승객수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8335 |
---|---|
Distinct (%) | 83.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13204.762 |
Minimum | 1 |
---|---|
Maximum | 124139 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1101.95 |
Q1 | 4456.5 |
median | 9247.5 |
Q3 | 17124.25 |
95-th percentile | 38818.75 |
Maximum | 124139 |
Range | 124138 |
Interquartile range (IQR) | 12667.75 |
Descriptive statistics
Standard deviation | 13485.51 |
---|---|
Coefficient of variation (CV) | 1.0212611 |
Kurtosis | 9.1112197 |
Mean | 13204.762 |
Median Absolute Deviation (MAD) | 5740 |
Skewness | 2.5082716 |
Sum | 1.3204762 × 108 |
Variance | 1.8185897 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 20 | 0.2% |
2 | 11 | 0.1% |
2808 | 6 | 0.1% |
5601 | 5 | 0.1% |
373 | 5 | 0.1% |
2412 | 4 | < 0.1% |
5368 | 4 | < 0.1% |
4650 | 4 | < 0.1% |
5912 | 4 | < 0.1% |
7820 | 4 | < 0.1% |
Other values (8325) | 9933 |
Value | Count | Frequency (%) |
1 | 20 | |
2 | 11 | |
3 | 2 | < 0.1% |
4 | 1 | < 0.1% |
6 | 1 | < 0.1% |
15 | 1 | < 0.1% |
30 | 1 | < 0.1% |
31 | 1 | < 0.1% |
37 | 1 | < 0.1% |
39 | 1 | < 0.1% |
Value | Count | Frequency (%) |
124139 | 1 | |
122270 | 1 | |
117892 | 1 | |
115593 | 1 | |
111807 | 1 | |
109540 | 1 | |
109357 | 1 | |
109347 | 1 | |
106547 | 1 | |
106465 | 1 |
하차총승객수
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 8348 |
---|---|
Distinct (%) | 83.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13142.806 |
Minimum | 0 |
---|---|
Maximum | 124517 |
Zeros | 35 |
Zeros (%) | 0.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1064 |
Q1 | 4317.75 |
median | 9045 |
Q3 | 17123.5 |
95-th percentile | 39233.2 |
Maximum | 124517 |
Range | 124517 |
Interquartile range (IQR) | 12805.75 |
Descriptive statistics
Standard deviation | 13635.503 |
---|---|
Coefficient of variation (CV) | 1.0374879 |
Kurtosis | 8.6844222 |
Mean | 13142.806 |
Median Absolute Deviation (MAD) | 5607.5 |
Skewness | 2.4726591 |
Sum | 1.3142806 × 108 |
Variance | 1.8592693 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 35 | 0.4% |
8211 | 5 | 0.1% |
2685 | 5 | 0.1% |
4512 | 4 | < 0.1% |
958 | 4 | < 0.1% |
964 | 4 | < 0.1% |
625 | 4 | < 0.1% |
2957 | 4 | < 0.1% |
2615 | 4 | < 0.1% |
2213 | 4 | < 0.1% |
Other values (8338) | 9927 |
Value | Count | Frequency (%) |
0 | 35 | |
21 | 1 | < 0.1% |
22 | 1 | < 0.1% |
23 | 1 | < 0.1% |
27 | 1 | < 0.1% |
29 | 1 | < 0.1% |
30 | 3 | < 0.1% |
31 | 1 | < 0.1% |
32 | 2 | < 0.1% |
34 | 3 | < 0.1% |
Value | Count | Frequency (%) |
124517 | 1 | |
121767 | 1 | |
120455 | 1 | |
116654 | 1 | |
111527 | 1 | |
110346 | 1 | |
109698 | 1 | |
109468 | 1 | |
108548 | 1 | |
106917 | 1 |
등록일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 182 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20160391 |
Minimum | 20160109 |
---|---|
Maximum | 20160708 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20160109 |
---|---|
5-th percentile | 20160118 |
Q1 | 20160222 |
median | 20160409 |
Q3 | 20160522 |
95-th percentile | 20160629 |
Maximum | 20160708 |
Range | 599 |
Interquartile range (IQR) | 300 |
Descriptive statistics
Standard deviation | 174.46244 |
---|---|
Coefficient of variation (CV) | 8.6537229 × 10-6 |
Kurtosis | -1.1520483 |
Mean | 20160391 |
Median Absolute Deviation (MAD) | 121 |
Skewness | -0.010226496 |
Sum | 2.0160391 × 1011 |
Variance | 30437.143 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20160130 | 81 | 0.8% |
20160407 | 72 | 0.7% |
20160606 | 70 | 0.7% |
20160125 | 68 | 0.7% |
20160501 | 68 | 0.7% |
20160605 | 67 | 0.7% |
20160310 | 67 | 0.7% |
20160504 | 67 | 0.7% |
20160521 | 67 | 0.7% |
20160618 | 66 | 0.7% |
Other values (172) | 9307 |
Value | Count | Frequency (%) |
20160109 | 49 | |
20160110 | 41 | |
20160111 | 61 | |
20160112 | 60 | |
20160113 | 50 | |
20160114 | 55 | |
20160115 | 64 | |
20160116 | 61 | |
20160117 | 45 | |
20160118 | 50 |
Value | Count | Frequency (%) |
20160708 | 17 | 0.2% |
20160707 | 60 | |
20160706 | 55 | |
20160705 | 57 | |
20160704 | 63 | |
20160703 | 55 | |
20160702 | 46 | |
20160701 | 56 | |
20160630 | 60 | |
20160629 | 60 |
사용일자 | 노선명 | 승차총승객수 | 하차총승객수 | 등록일자 | |
---|---|---|---|---|---|
사용일자 | 1.000 | 0.000 | 0.044 | 0.060 | 0.969 |
노선명 | 0.000 | 1.000 | 0.509 | 0.508 | 0.000 |
승차총승객수 | 0.044 | 0.509 | 1.000 | 0.987 | 0.067 |
하차총승객수 | 0.060 | 0.508 | 0.987 | 1.000 | 0.080 |
등록일자 | 0.969 | 0.000 | 0.067 | 0.080 | 1.000 |
사용일자 | 승차총승객수 | 하차총승객수 | 등록일자 | 노선명 | |
---|---|---|---|---|---|
사용일자 | 1.000 | 0.058 | 0.055 | 1.000 | 0.000 |
승차총승객수 | 0.058 | 1.000 | 0.993 | 0.058 | 0.214 |
하차총승객수 | 0.055 | 0.993 | 1.000 | 0.055 | 0.213 |
등록일자 | 1.000 | 0.058 | 0.055 | 1.000 | 0.000 |
노선명 | 0.000 | 0.214 | 0.213 | 0.000 | 1.000 |
사용일자 | 노선명 | 역명 | 승차총승객수 | 하차총승객수 | 등록일자 | |
---|---|---|---|---|---|---|
37952 | 20160310 | 9호선2단계 | 언주 | 8304 | 8416 | 20160318 |
48630 | 20160329 | 7호선 | 하계 | 24567 | 23476 | 20160406 |
8851 | 20160117 | 5호선 | 송정 | 6621 | 7105 | 20160125 |
81521 | 20160527 | 분당선 | 매교 | 3392 | 3361 | 20160604 |
20215 | 20160206 | 분당선 | 서현 | 19608 | 20121 | 20160214 |
62299 | 20160423 | 7호선 | 대림 | 10140 | 11263 | 20160501 |
96837 | 20160624 | 6호선 | 화랑대 | 14629 | 10649 | 20160702 |
40594 | 20160314 | 7호선 | 보라매 | 11860 | 11975 | 20160322 |
13471 | 20160125 | 5호선 | 우장산 | 16460 | 16355 | 20160202 |
12254 | 20160123 | 경원선 | 회룡 | 10526 | 10100 | 20160131 |
사용일자 | 노선명 | 역명 | 승차총승객수 | 하차총승객수 | 등록일자 | |
---|---|---|---|---|---|---|
78532 | 20160522 | 경부선 | 광명 | 763 | 522 | 20160530 |
87442 | 20160607 | 3호선 | 남부터미널 | 38549 | 40031 | 20160615 |
53260 | 20160406 | 안산선 | 고잔 | 10651 | 10476 | 20160414 |
29465 | 20160223 | 경부선 | 가산디지털단지 | 19625 | 23339 | 20160302 |
66482 | 20160430 | 경부선 | 명학 | 8427 | 8201 | 20160508 |
98781 | 20160628 | 2호선 | 구의 | 28606 | 27924 | 20160706 |
33697 | 20160302 | 2호선 | 당산 | 25876 | 29196 | 20160310 |
13807 | 20160126 | 6호선 | 불광 | 5856 | 5671 | 20160203 |
95944 | 20160622 | 2호선 | 사당 | 48307 | 53247 | 20160630 |
34331 | 20160303 | 3호선 | 무악재 | 4954 | 4989 | 20160311 |