Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 585.9 KiB |
Average record size in memory | 60.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 1 |
Text | 1 |
Dataset
Description | File download |
---|---|
Author | Seoul Metropolitan Government (서울특별시) |
URL | https://data.seoul.go.kr/dataList/OA-12914/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-11 06:20:56.699837 |
---|---|
Analysis finished | 2024-05-11 06:21:01.711073 |
Duration | 5.01 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
사용일자 (usage date)
Real number (ℝ)
HIGH CORRELATION
Distinct | 172 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20180352 |
Minimum | 20180101 |
---|---|
Maximum | 20180621 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20180101 |
---|---|
5-th percentile | 20180109 |
Q1 | 20180213 |
median | 20180327 |
Q3 | 20180509 |
95-th percentile | 20180613 |
Maximum | 20180621 |
Range | 520 |
Interquartile range (IQR) | 296 |
Descriptive statistics
Standard deviation | 164.09481 |
---|---|
Coefficient of variation (CV) | 8.1314146 × 10⁻⁶ |
Kurtosis | -1.2163644 |
Mean | 20180352 |
Median Absolute Deviation (MAD) | 123 |
Skewness | 0.028961866 |
Sum | 2.0180352 × 10¹¹ |
Variance | 26927.107 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20180223 | 76 | 0.8% |
20180313 | 74 | 0.7% |
20180419 | 72 | 0.7% |
20180210 | 72 | 0.7% |
20180226 | 72 | 0.7% |
20180324 | 72 | 0.7% |
20180311 | 71 | 0.7% |
20180530 | 71 | 0.7% |
20180510 | 71 | 0.7% |
20180323 | 70 | 0.7% |
Other values (162) | 9279 |
Value | Count | Frequency (%) |
20180101 | 61 | |
20180102 | 53 | |
20180103 | 45 | |
20180104 | 49 | |
20180105 | 68 | |
20180106 | 65 | |
20180107 | 52 | |
20180108 | 54 | |
20180109 | 55 | |
20180110 | 62 |
Value | Count | Frequency (%) |
20180621 | 56 | |
20180620 | 65 | |
20180619 | 60 | |
20180618 | 47 | |
20180617 | 58 | |
20180616 | 54 | |
20180615 | 62 | |
20180614 | 59 | |
20180613 | 56 | |
20180612 | 51 |
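The report treats 사용일자 as a real number, so the range (520) and IQR (296) above are differences between YYYYMMDD integers, not day counts. The following is a minimal sketch, assuming the values are calendar dates encoded as YYYYMMDD; the helper name `parse_yyyymmdd` is hypothetical and not part of the report. It converts the reported minimum and maximum to dates and computes the actual span.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 20180101 as a calendar date (assumed encoding)."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# Minimum and maximum copied from the quantile statistics of this column.
first_day = parse_yyyymmdd(20180101)
last_day = parse_yyyymmdd(20180621)

integer_range = 20180621 - 20180101      # what the report lists as "Range"
span_days = (last_day - first_day).days  # elapsed calendar days
inclusive_days = span_days + 1           # calendar days covered, both ends included

print(integer_range, span_days, inclusive_days)
```

Under that assumption the data cover 172 calendar days, which agrees with the 172 distinct values reported for the column.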
노선명 (line name)
Categorical
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2호선 | |
---|---|
5호선 | |
7호선 | |
경부선 | |
6호선 | |
Other values (20) |
Length
Max length | 8 |
---|---|
Median length | 3 |
Mean length | 3.1831 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 3호선 |
---|---|
2nd row | 중앙선 |
3rd row | 경부선 |
4th row | 3호선 |
5th row | 4호선 |
Common Values
Value | Count | Frequency (%) |
2호선 | 876 | 8.8% |
5호선 | 867 | 8.7% |
7호선 | 855 | 8.6% |
경부선 | 659 | 6.6% |
6호선 | 650 | 6.5% |
3호선 | 612 | 6.1% |
분당선 | 573 | 5.7% |
경원선 | 561 | 5.6% |
4호선 | 445 | 4.5% |
경의선 | 443 | 4.4% |
Other values (15) | 3459 |
Length
Value | Count | Frequency (%) |
2호선 | 876 | 8.6% |
5호선 | 867 | 8.5% |
7호선 | 855 | 8.4% |
경부선 | 659 | 6.4% |
6호선 | 650 | 6.4% |
3호선 | 612 | 6.0% |
분당선 | 573 | 5.6% |
경원선 | 561 | 5.5% |
4호선 | 445 | 4.3% |
경의선 | 443 | 4.3% |
Other values (15) | 3693 |
역명 (station name)
Text
Distinct | 503 |
---|---|
Distinct (%) | 5.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
서울역 | 103 | 1.0% |
공덕 | 72 | 0.7% |
왕십리(성동구청 | 59 | 0.6% |
신설동 | 58 | 0.6% |
김포공항 | 55 | 0.5% |
디지털미디어시티 | 54 | 0.5% |
홍대입구 | 51 | 0.5% |
고속터미널 | 50 | 0.5% |
충정로(경기대입구 | 46 | 0.5% |
건대입구 | 46 | 0.5% |
Other values (493) | 9406 |
Most occurring characters
Value | Count | Frequency (%) |
대 | 1306 | 3.6% |
구 | 1243 | 3.5% |
) | 1217 | 3.4% |
( | 1217 | 3.4% |
동 | 864 | 2.4% |
산 | 812 | 2.3% |
신 | 805 | 2.2% |
청 | 717 | 2.0% |
원 | 714 | 2.0% |
정 | 577 | 1.6% |
Other values (279) | 26431 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 33181 | 92.4% |
Close Punctuation | 1217 | 3.4% |
Open Punctuation | 1217 | 3.4% |
Decimal Number | 209 | 0.6% |
Other Punctuation | 79 | 0.2% |
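The category labels in this table appear to correspond to Unicode general categories. The sketch below assumes that mapping (Other Letter = `Lo`, Open Punctuation = `Ps`, Close Punctuation = `Pe`, Decimal Number = `Nd`, Other Punctuation = `Po`) and tallies categories for a few station names taken from the sample rows of this report. It is an illustration with Python's standard `unicodedata` module, not the profiler's own implementation.

```python
import unicodedata
from collections import Counter

# Station names copied from the sample rows of this report.
station_names = ["회현(남대문시장)", "군자(능동)", "강변(동서울터미널)", "충무로"]

# unicodedata.category returns a two-letter general-category code per character.
category_counts = Counter(
    unicodedata.category(ch) for name in station_names for ch in name
)

total = sum(category_counts.values())
for code, count in category_counts.most_common():
    # Share of all characters that fall in this category.
    print(f"{code}\t{count}\t{count / total:.1%}")
```

Hangul syllables fall under `Lo`, which is why "Other Letter" dominates the table, while the parentheses in names such as 군자(능동) account for the equal open and close punctuation counts.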
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 1306 | 3.9% |
구 | 1243 | 3.7% |
동 | 864 | 2.6% |
산 | 812 | 2.4% |
신 | 805 | 2.4% |
청 | 717 | 2.2% |
원 | 714 | 2.2% |
정 | 577 | 1.7% |
천 | 572 | 1.7% |
서 | 532 | 1.6% |
Other values (270) | 25039 |
Decimal Number
Value | Count | Frequency (%) |
3 | 81 | |
4 | 39 | |
1 | 38 | |
2 | 20 | 9.6% |
5 | 17 | 8.1% |
9 | 14 | 6.7% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1217 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1217 |
Other Punctuation
Value | Count | Frequency (%) |
. | 79 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 33181 | 92.4% |
Common | 2722 | 7.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 1306 | 3.9% |
구 | 1243 | 3.7% |
동 | 864 | 2.6% |
산 | 812 | 2.4% |
신 | 805 | 2.4% |
청 | 717 | 2.2% |
원 | 714 | 2.2% |
정 | 577 | 1.7% |
천 | 572 | 1.7% |
서 | 532 | 1.6% |
Other values (270) | 25039 |
Common
Value | Count | Frequency (%) |
) | 1217 | |
( | 1217 | |
3 | 81 | 3.0% |
. | 79 | 2.9% |
4 | 39 | 1.4% |
1 | 38 | 1.4% |
2 | 20 | 0.7% |
5 | 17 | 0.6% |
9 | 14 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 33181 | 92.4% |
ASCII | 2722 | 7.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 1306 | 3.9% |
구 | 1243 | 3.7% |
동 | 864 | 2.6% |
산 | 812 | 2.4% |
신 | 805 | 2.4% |
청 | 717 | 2.2% |
원 | 714 | 2.2% |
정 | 577 | 1.7% |
천 | 572 | 1.7% |
서 | 532 | 1.6% |
Other values (270) | 25039 |
ASCII
Value | Count | Frequency (%) |
) | 1217 | |
( | 1217 | |
3 | 81 | 3.0% |
. | 79 | 2.9% |
4 | 39 | 1.4% |
1 | 38 | 1.4% |
2 | 20 | 0.7% |
5 | 17 | 0.6% |
9 | 14 | 0.5% |
승차총승객수 (total boarding passengers)
Real number (ℝ)
HIGH CORRELATION
Distinct | 8240 |
---|---|
Distinct (%) | 82.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12536.301 |
Minimum | 1 |
---|---|
Maximum | 133098 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1097.7 |
Q1 | 4042.75 |
median | 8855 |
Q3 | 16404.25 |
95-th percentile | 38017.2 |
Maximum | 133098 |
Range | 133097 |
Interquartile range (IQR) | 12361.5 |
Descriptive statistics
Standard deviation | 12942.538 |
---|---|
Coefficient of variation (CV) | 1.0324049 |
Kurtosis | 9.7814071 |
Mean | 12536.301 |
Median Absolute Deviation (MAD) | 5593 |
Skewness | 2.5616248 |
Sum | 1.2536301 × 10⁸ |
Variance | 1.675093 × 10⁸ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 36 | 0.4% |
2 | 8 | 0.1% |
2076 | 6 | 0.1% |
2374 | 5 | 0.1% |
1218 | 5 | 0.1% |
4660 | 5 | 0.1% |
4245 | 4 | < 0.1% |
3205 | 4 | < 0.1% |
2060 | 4 | < 0.1% |
1587 | 4 | < 0.1% |
Other values (8230) | 9919 |
Value | Count | Frequency (%) |
1 | 36 | |
2 | 8 | 0.1% |
3 | 3 | < 0.1% |
4 | 2 | < 0.1% |
5 | 2 | < 0.1% |
6 | 2 | < 0.1% |
8 | 1 | < 0.1% |
19 | 1 | < 0.1% |
31 | 1 | < 0.1% |
32 | 1 | < 0.1% |
Value | Count | Frequency (%) |
133098 | 1 | |
121007 | 1 | |
118692 | 1 | |
118128 | 1 | |
116410 | 1 | |
111586 | 1 | |
108915 | 1 | |
101418 | 1 | |
101308 | 1 | |
99000 | 1 |
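The descriptive statistics for this column are related by simple arithmetic: the coefficient of variation is the standard deviation divided by the mean, the sum is the mean times the number of observations, and the variance is the square of the standard deviation. As a rough cross-check, the sketch below recomputes them from the rounded figures printed in this report, so small rounding differences are expected.

```python
# Figures copied from this report (rounded as printed).
n_observations = 10000
mean = 12536.301
std_dev = 12942.538

coefficient_of_variation = std_dev / mean  # reported as 1.0324049
total = mean * n_observations              # reported as 1.2536301 × 10⁸
variance = std_dev ** 2                    # reported as 1.675093 × 10⁸

print(f"CV       = {coefficient_of_variation:.7f}")
print(f"Sum      = {total:.7e}")
print(f"Variance = {variance:.6e}")
```

A CV slightly above 1 means the spread is about as large as the mean itself, in line with the strong right skew (skewness about 2.56) reported for the column.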
하차총승객수 (total alighting passengers)
Real number (ℝ)
HIGH CORRELATION
Distinct | 8165 |
---|---|
Distinct (%) | 81.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 12466.186 |
Minimum | 0 |
---|---|
Maximum | 136675 |
Zeros | 54 |
Zeros (%) | 0.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 1019.95 |
Q1 | 3889 |
median | 8598 |
Q3 | 16220.25 |
95-th percentile | 38471.6 |
Maximum | 136675 |
Range | 136675 |
Interquartile range (IQR) | 12331.25 |
Descriptive statistics
Standard deviation | 13182.22 |
---|---|
Coefficient of variation (CV) | 1.0574381 |
Kurtosis | 10.041702 |
Mean | 12466.186 |
Median Absolute Deviation (MAD) | 5456 |
Skewness | 2.5883346 |
Sum | 1.2466186 × 10⁸ |
Variance | 1.7377093 × 10⁸ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 54 | 0.5% |
1963 | 5 | 0.1% |
2431 | 5 | 0.1% |
1749 | 5 | 0.1% |
1823 | 5 | 0.1% |
2524 | 5 | 0.1% |
2803 | 4 | < 0.1% |
2334 | 4 | < 0.1% |
4967 | 4 | < 0.1% |
4369 | 4 | < 0.1% |
Other values (8155) | 9905 |
Value | Count | Frequency (%) |
0 | 54 | |
17 | 1 | < 0.1% |
19 | 1 | < 0.1% |
20 | 1 | < 0.1% |
21 | 1 | < 0.1% |
24 | 1 | < 0.1% |
25 | 2 | < 0.1% |
26 | 2 | < 0.1% |
27 | 1 | < 0.1% |
28 | 1 | < 0.1% |
Value | Count | Frequency (%) |
136675 | 1 | |
124113 | 1 | |
120685 | 1 | |
119053 | 1 | |
116245 | 1 | |
114779 | 1 | |
112640 | 1 | |
111583 | 1 | |
111411 | 1 | |
106722 | 1 |
등록일자 (registration date)
Real number (ℝ)
HIGH CORRELATION
Distinct | 172 |
---|---|
Distinct (%) | 1.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20180361 |
Minimum | 20180104 |
---|---|
Maximum | 20180624 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 20180104 |
---|---|
5-th percentile | 20180112 |
Q1 | 20180216 |
median | 20180330 |
Q3 | 20180512 |
95-th percentile | 20180616 |
Maximum | 20180624 |
Range | 520 |
Interquartile range (IQR) | 296 |
Descriptive statistics
Standard deviation | 164.49149 |
---|---|
Coefficient of variation (CV) | 8.1510676 × 10⁻⁶ |
Kurtosis | -1.2096439 |
Mean | 20180361 |
Median Absolute Deviation (MAD) | 125 |
Skewness | 0.0016875755 |
Sum | 2.0180361 × 10¹¹ |
Variance | 27057.45 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20180226 | 76 | 0.8% |
20180316 | 74 | 0.7% |
20180422 | 72 | 0.7% |
20180213 | 72 | 0.7% |
20180301 | 72 | 0.7% |
20180327 | 72 | 0.7% |
20180314 | 71 | 0.7% |
20180602 | 71 | 0.7% |
20180513 | 71 | 0.7% |
20180326 | 70 | 0.7% |
Other values (162) | 9279 |
Value | Count | Frequency (%) |
20180104 | 61 | |
20180105 | 53 | |
20180106 | 45 | |
20180107 | 49 | |
20180108 | 68 | |
20180109 | 65 | |
20180110 | 52 | |
20180111 | 54 | |
20180112 | 55 | |
20180113 | 62 |
Value | Count | Frequency (%) |
20180624 | 56 | |
20180623 | 65 | |
20180622 | 60 | |
20180621 | 47 | |
20180620 | 58 | |
20180619 | 54 | |
20180618 | 62 | |
20180617 | 59 | |
20180616 | 56 | |
20180615 | 51 |
 | 사용일자 | 노선명 | 승차총승객수 | 하차총승객수 | 등록일자 |
---|---|---|---|---|---|
사용일자 | 1.000 | 0.000 | 0.059 | 0.048 | 0.997 |
노선명 | 0.000 | 1.000 | 0.530 | 0.519 | 0.000 |
승차총승객수 | 0.059 | 0.530 | 1.000 | 0.978 | 0.061 |
하차총승객수 | 0.048 | 0.519 | 0.978 | 1.000 | 0.020 |
등록일자 | 0.997 | 0.000 | 0.061 | 0.020 | 1.000 |
 | 사용일자 | 승차총승객수 | 하차총승객수 | 등록일자 | 노선명 |
---|---|---|---|---|---|
사용일자 | 1.000 | 0.034 | 0.031 | 1.000 | 0.000 |
승차총승객수 | 0.034 | 1.000 | 0.991 | 0.034 | 0.216 |
하차총승객수 | 0.031 | 0.991 | 1.000 | 0.031 | 0.210 |
등록일자 | 1.000 | 0.034 | 0.031 | 1.000 | 0.000 |
노선명 | 0.000 | 0.216 | 0.210 | 0.000 | 1.000 |
 | 사용일자 | 노선명 | 역명 | 승차총승객수 | 하차총승객수 | 등록일자 |
---|---|---|---|---|---|---|
53550 | 20180403 | 3호선 | 충무로 | 4 | 0 | 20180406 |
13050 | 20180123 | 중앙선 | 회기 | 26749 | 26030 | 20180126 |
78675 | 20180516 | 경부선 | 천안 | 8720 | 7240 | 20180519 |
10518 | 20180119 | 3호선 | 독립문 | 8786 | 8803 | 20180122 |
71062 | 20180503 | 4호선 | 회현(남대문시장) | 37744 | 40079 | 20180506 |
63789 | 20180420 | 5호선 | 군자(능동) | 13208 | 15138 | 20180423 |
20392 | 20180205 | 5호선 | 개롱 | 7415 | 7609 | 20180208 |
17233 | 20180130 | 경원선 | 방학 | 10489 | 10374 | 20180202 |
1479 | 20180103 | 경의선 | 금촌 | 6892 | 6782 | 20180106 |
66492 | 20180425 | 경원선 | 방학 | 12166 | 11673 | 20180428 |
 | 사용일자 | 노선명 | 역명 | 승차총승객수 | 하차총승객수 | 등록일자 |
---|---|---|---|---|---|---|
94847 | 20180613 | 1호선 | 신설동 | 10457 | 9932 | 20180616 |
73262 | 20180506 | 공항철도 1호선 | 운서 | 4801 | 5185 | 20180509 |
8412 | 20180115 | 중앙선 | 오빈 | 371 | 335 | 20180118 |
98493 | 20180619 | 6호선 | 불광 | 5652 | 5707 | 20180622 |
56945 | 20180408 | 공항철도 1호선 | 검암 | 5871 | 6360 | 20180411 |
90980 | 20180606 | 안산선 | 고잔 | 6141 | 6062 | 20180609 |
18226 | 20180201 | 경강선 | 곤지암 | 1954 | 1956 | 20180204 |
68068 | 20180428 | 2호선 | 강변(동서울터미널) | 56170 | 58228 | 20180501 |
30555 | 20180222 | 분당선 | 미금 | 22982 | 24706 | 20180225 |
40842 | 20180312 | 경인선 | 오류동 | 13285 | 12430 | 20180315 |
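The sample rows suggest why 사용일자 and 등록일자 are flagged as highly correlated: the registration date appears to trail the usage date by a fixed lag. The sketch below assumes both columns are YYYYMMDD-encoded dates (the helper `parse_yyyymmdd` is hypothetical) and measures the lag in days for a few pairs copied from the rows above, including two that cross a month boundary.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 20180403 as a calendar date (assumed encoding)."""
    return datetime.strptime(str(value), "%Y%m%d").date()


# (사용일자, 등록일자) pairs copied from the sample rows of this report.
sample_pairs = [
    (20180403, 20180406),
    (20180130, 20180202),  # crosses January -> February
    (20180428, 20180501),  # crosses April -> May
    (20180619, 20180622),
]

# Lag in calendar days between usage and registration for each pair.
lags = [
    (parse_yyyymmdd(registered) - parse_yyyymmdd(used)).days
    for used, registered in sample_pairs
]
print(lags)
```

In the twenty sample rows shown, 등록일자 is three days after 사용일자 in every case. The sample does not establish whether that holds for all 10,000 observations, but a near-constant lag would explain the correlations of 0.997 and 1.000 between the two columns.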