Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 500 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 54.8 KiB |
Average record size in memory | 112.3 B |
Variable types
Categorical | 5 |
---|---|
Numeric | 6 |
Text | 2 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 서울시(스마트카드사) |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=70 |
사용자구분(BILL_USER) is highly imbalanced (50.3%) | Imbalance |
Reproduction
Analysis started | 2024-01-14 06:50:23.350733 |
---|---|
Analysis finished | 2024-01-14 06:50:28.695972 |
Duration | 5.35 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
년(YEAR)
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
2018 | |
---|---|
2019 | |
2020 | |
2017 | |
2021 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019 |
---|---|
2nd row | 2021 |
3rd row | 2020 |
4th row | 2018 |
5th row | 2019 |
Common Values
Value | Count | Frequency (%) |
2018 | 122 | |
2019 | 102 | |
2020 | 99 | |
2017 | 92 | |
2021 | 85 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2018 | 122 | |
2019 | 102 | |
2020 | 99 | |
2017 | 92 | |
2021 | 85 |
월(MONTH)
Real number (ℝ)
Distinct | 12 |
---|---|
Distinct (%) | 2.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.226 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 6 |
Q3 | 9 |
95-th percentile | 12 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 3.3599856 |
---|---|
Coefficient of variation (CV) | 0.53967002 |
Kurtosis | -1.1802568 |
Mean | 6.226 |
Median Absolute Deviation (MAD) | 3 |
Skewness | 0.067244765 |
Sum | 3113 |
Variance | 11.289503 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 54 | |
7 | 52 | |
9 | 48 | |
10 | 47 | |
2 | 43 | |
1 | 43 | |
5 | 42 | |
6 | 41 | |
4 | 38 | |
8 | 32 | |
Other values (2) | 60 |
Value | Count | Frequency (%) |
1 | 43 | |
2 | 43 | |
3 | 54 | |
4 | 38 | |
5 | 42 | |
6 | 41 | |
7 | 52 | |
8 | 32 | |
9 | 48 | |
10 | 47 |
Value | Count | Frequency (%) |
12 | 32 | |
11 | 28 | |
10 | 47 | |
9 | 48 | |
8 | 32 | |
7 | 52 | |
6 | 41 | |
5 | 42 | |
4 | 38 | |
3 | 54 |
일(DAY)
Real number (ℝ)
Distinct | 31 |
---|---|
Distinct (%) | 6.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.552 |
Minimum | 1 |
---|---|
Maximum | 31 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 9 |
median | 17 |
Q3 | 24 |
95-th percentile | 30 |
Maximum | 31 |
Range | 30 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 8.7837124 |
---|---|
Coefficient of variation (CV) | 0.53067378 |
Kurtosis | -1.1597745 |
Mean | 16.552 |
Median Absolute Deviation (MAD) | 8 |
Skewness | -0.060814941 |
Sum | 8276 |
Variance | 77.153603 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20 | 25 | 5.0% |
9 | 25 | 5.0% |
12 | 21 | 4.2% |
14 | 21 | 4.2% |
21 | 20 | 4.0% |
27 | 19 | 3.8% |
30 | 19 | 3.8% |
19 | 18 | 3.6% |
25 | 18 | 3.6% |
8 | 18 | 3.6% |
Other values (21) | 296 |
Value | Count | Frequency (%) |
1 | 15 | |
2 | 15 | |
3 | 9 | 1.8% |
4 | 16 | |
5 | 7 | 1.4% |
6 | 15 | |
7 | 18 | |
8 | 18 | |
9 | 25 | |
10 | 15 |
Value | Count | Frequency (%) |
31 | 15 | |
30 | 19 | |
29 | 17 | |
28 | 17 | |
27 | 19 | |
26 | 17 | |
25 | 18 | |
24 | 14 | |
23 | 11 | |
22 | 16 |
시간(HOUR)
Real number (ℝ)
Distinct | 19 |
---|---|
Distinct (%) | 3.8% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.75 |
Minimum | 5 |
---|---|
Maximum | 23 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 5 |
---|---|
5-th percentile | 6 |
Q1 | 9 |
median | 14 |
Q3 | 18 |
95-th percentile | 22 |
Maximum | 23 |
Range | 18 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 5.0197506 |
---|---|
Coefficient of variation (CV) | 0.36507277 |
Kurtosis | -1.213784 |
Mean | 13.75 |
Median Absolute Deviation (MAD) | 4 |
Skewness | -0.047035598 |
Sum | 6875 |
Variance | 25.197896 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
18 | 42 | 8.4% |
8 | 41 | 8.2% |
16 | 40 | 8.0% |
17 | 35 | 7.0% |
7 | 33 | 6.6% |
9 | 32 | 6.4% |
13 | 30 | 6.0% |
15 | 27 | 5.4% |
12 | 27 | 5.4% |
19 | 25 | 5.0% |
Other values (9) | 168 |
Value | Count | Frequency (%) |
5 | 12 | 2.4% |
6 | 24 | |
7 | 33 | |
8 | 41 | |
9 | 32 | |
10 | 20 | |
11 | 18 | |
12 | 27 | |
13 | 30 | |
14 | 20 |
Value | Count | Frequency (%) |
23 | 4 | 0.8% |
22 | 22 | |
21 | 25 | |
20 | 23 | |
19 | 25 | |
18 | 42 | |
17 | 35 | |
16 | 40 | |
15 | 27 | |
14 | 20 |
분_30분단위(HALF_HOUR)
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
0 | |
---|---|
30 |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.476 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 30 |
---|---|
2nd row | 30 |
3rd row | 0 |
4th row | 30 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 262 | |
30 | 238 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 262 | |
30 | 238 |
사용자구분(BILL_USER)
Categorical
IMBALANCE
 
Distinct | 6 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
일반 | |
---|---|
경로 | |
장애인 | 23 |
청소년 | 15 |
국가유공자 | 4 |
Length
Max length | 5 |
---|---|
Median length | 2 |
Mean length | 2.104 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 일반 |
---|---|
2nd row | 일반 |
3rd row | 일반 |
4th row | 일반 |
5th row | 경로 |
Common Values
Value | Count | Frequency (%) |
일반 | 348 | |
경로 | 108 | 21.6% |
장애인 | 23 | 4.6% |
청소년 | 15 | 3.0% |
국가유공자 | 4 | 0.8% |
어린이 | 2 | 0.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
일반 | 348 | |
경로 | 108 | 21.6% |
장애인 | 23 | 4.6% |
청소년 | 15 | 3.0% |
국가유공자 | 4 | 0.8% |
어린이 | 2 | 0.4% |
승차역ID(GETON_STATION_ID)
Real number (ℝ)
Distinct | 233 |
---|---|
Distinct (%) | 46.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1676.432 |
Minimum | 150 |
---|---|
Maximum | 4710 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 198.9 |
Q1 | 311 |
median | 2519 |
Q3 | 2715 |
95-th percentile | 4116.05 |
Maximum | 4710 |
Range | 4560 |
Interquartile range (IQR) | 2404 |
Descriptive statistics
Standard deviation | 1373.8533 |
---|---|
Coefficient of variation (CV) | 0.81951033 |
Kurtosis | -1.3160076 |
Mean | 1676.432 |
Median Absolute Deviation (MAD) | 1608 |
Skewness | 0.22451864 |
Sum | 838216 |
Variance | 1887473 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
311 | 6 | 1.2% |
221 | 5 | 1.0% |
220 | 5 | 1.0% |
219 | 5 | 1.0% |
229 | 5 | 1.0% |
201 | 5 | 1.0% |
340 | 5 | 1.0% |
212 | 5 | 1.0% |
206 | 5 | 1.0% |
2718 | 5 | 1.0% |
Other values (223) | 449 |
Value | Count | Frequency (%) |
150 | 4 | |
151 | 2 | 0.4% |
152 | 3 | |
153 | 2 | 0.4% |
154 | 2 | 0.4% |
155 | 4 | |
156 | 2 | 0.4% |
157 | 4 | |
159 | 2 | 0.4% |
201 | 5 |
Value | Count | Frequency (%) |
4710 | 2 | |
4709 | 3 | |
4708 | 1 | 0.2% |
4706 | 1 | 0.2% |
4703 | 1 | 0.2% |
4138 | 1 | 0.2% |
4136 | 2 | |
4134 | 1 | 0.2% |
4131 | 1 | 0.2% |
4129 | 2 |
승차_호선명(GETON_LINE_NM)
Categorical
Distinct | 11 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
2호선 | |
---|---|
5호선 | |
7호선 | |
3호선 | |
6호선 | |
Other values (6) |
Length
Max length | 8 |
---|---|
Median length | 3 |
Mean length | 3.138 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 9호선 |
---|---|
2nd row | 8호선 |
3rd row | 5호선 |
4th row | 5호선 |
5th row | 4호선 |
Common Values
Value | Count | Frequency (%) |
2호선 | 106 | |
5호선 | 81 | |
7호선 | 64 | |
3호선 | 57 | |
6호선 | 49 | |
4호선 | 48 | |
8호선 | 30 | 6.0% |
9호선 | 27 | 5.4% |
1호선 | 20 | 4.0% |
9호선2~3단계 | 11 | 2.2% |
Length
Value | Count | Frequency (%) |
2호선 | 106 | |
5호선 | 81 | |
7호선 | 64 | |
3호선 | 57 | |
6호선 | 49 | |
4호선 | 48 | |
8호선 | 30 | 6.0% |
9호선 | 27 | 5.4% |
1호선 | 20 | 4.0% |
9호선2~3단계 | 11 | 2.2% |
Distinct | 210 |
---|---|
Distinct (%) | 42.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
잠실(송파구청 | 13 | 2.6% |
종로3가 | 13 | 2.6% |
당산 | 9 | 1.8% |
고속터미널 | 8 | 1.6% |
강남 | 6 | 1.2% |
홍대입구 | 6 | 1.2% |
공덕 | 6 | 1.2% |
서울역 | 6 | 1.2% |
구로디지털단지 | 6 | 1.2% |
신림 | 6 | 1.2% |
Other values (200) | 421 |
Most occurring characters
Value | Count | Frequency (%) |
( | 120 | 5.3% |
) | 120 | 5.3% |
구 | 103 | 4.6% |
대 | 76 | 3.4% |
신 | 67 | 3.0% |
청 | 56 | 2.5% |
동 | 52 | 2.3% |
로 | 43 | 1.9% |
입 | 42 | 1.9% |
사 | 37 | 1.6% |
Other values (222) | 1539 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1968 | |
Open Punctuation | 120 | 5.3% |
Close Punctuation | 120 | 5.3% |
Decimal Number | 23 | 1.0% |
Uppercase Letter | 15 | 0.7% |
Other Punctuation | 9 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 103 | 5.2% |
대 | 76 | 3.9% |
신 | 67 | 3.4% |
청 | 56 | 2.8% |
동 | 52 | 2.6% |
로 | 43 | 2.2% |
입 | 42 | 2.1% |
사 | 37 | 1.9% |
산 | 36 | 1.8% |
성 | 34 | 1.7% |
Other values (211) | 1422 |
Decimal Number
Value | Count | Frequency (%) |
3 | 16 | |
4 | 4 | 17.4% |
5 | 1 | 4.3% |
9 | 1 | 4.3% |
1 | 1 | 4.3% |
Uppercase Letter
Value | Count | Frequency (%) |
D | 10 | |
P | 5 |
Other Punctuation
Value | Count | Frequency (%) |
. | 8 | |
· | 1 | 11.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 120 |
Close Punctuation
Value | Count | Frequency (%) |
) | 120 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1968 | |
Common | 272 | 12.1% |
Latin | 15 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 103 | 5.2% |
대 | 76 | 3.9% |
신 | 67 | 3.4% |
청 | 56 | 2.8% |
동 | 52 | 2.6% |
로 | 43 | 2.2% |
입 | 42 | 2.1% |
사 | 37 | 1.9% |
산 | 36 | 1.8% |
성 | 34 | 1.7% |
Other values (211) | 1422 |
Common
Value | Count | Frequency (%) |
( | 120 | |
) | 120 | |
3 | 16 | 5.9% |
. | 8 | 2.9% |
4 | 4 | 1.5% |
5 | 1 | 0.4% |
9 | 1 | 0.4% |
· | 1 | 0.4% |
1 | 1 | 0.4% |
Latin
Value | Count | Frequency (%) |
D | 10 | |
P | 5 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1968 | |
ASCII | 286 | 12.7% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
( | 120 | |
) | 120 | |
3 | 16 | 5.6% |
D | 10 | 3.5% |
. | 8 | 2.8% |
P | 5 | 1.7% |
4 | 4 | 1.4% |
5 | 1 | 0.3% |
9 | 1 | 0.3% |
1 | 1 | 0.3% |
Hangul
Value | Count | Frequency (%) |
구 | 103 | 5.2% |
대 | 76 | 3.9% |
신 | 67 | 3.4% |
청 | 56 | 2.8% |
동 | 52 | 2.6% |
로 | 43 | 2.2% |
입 | 42 | 2.1% |
사 | 37 | 1.9% |
산 | 36 | 1.8% |
성 | 34 | 1.7% |
Other values (211) | 1422 |
None
Value | Count | Frequency (%) |
· | 1 |
하차역ID(GETOFF_STATION_ID)
Real number (ℝ)
Distinct | 235 |
---|---|
Distinct (%) | 47.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1699.51 |
Minimum | 150 |
---|---|
Maximum | 4709 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 202 |
Q1 | 242.75 |
median | 2533.5 |
Q3 | 2721 |
95-th percentile | 4117.05 |
Maximum | 4709 |
Range | 4559 |
Interquartile range (IQR) | 2478.25 |
Descriptive statistics
Standard deviation | 1380.0865 |
---|---|
Coefficient of variation (CV) | 0.81204967 |
Kurtosis | -1.4234782 |
Mean | 1699.51 |
Median Absolute Deviation (MAD) | 1589.5 |
Skewness | 0.16435701 |
Sum | 849755 |
Variance | 1904638.8 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
239 | 7 | 1.4% |
327 | 6 | 1.2% |
420 | 6 | 1.2% |
211 | 6 | 1.2% |
216 | 6 | 1.2% |
214 | 6 | 1.2% |
331 | 5 | 1.0% |
225 | 5 | 1.0% |
329 | 5 | 1.0% |
2816 | 5 | 1.0% |
Other values (225) | 443 |
Value | Count | Frequency (%) |
150 | 3 | |
151 | 3 | |
153 | 2 | |
154 | 1 | 0.2% |
155 | 2 | |
157 | 4 | |
158 | 3 | |
159 | 1 | 0.2% |
201 | 3 | |
202 | 4 |
Value | Count | Frequency (%) |
4709 | 3 | |
4138 | 2 | |
4136 | 3 | |
4134 | 3 | |
4133 | 1 | 0.2% |
4132 | 1 | 0.2% |
4127 | 1 | 0.2% |
4125 | 3 | |
4124 | 2 | |
4123 | 2 |
하차_호선명(GETOFF_LINE_NM)
Categorical
Distinct | 11 |
---|---|
Distinct (%) | 2.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
2호선 | |
---|---|
5호선 | |
7호선 | |
3호선 | |
4호선 | |
Other values (6) |
Length
Max length | 8 |
---|---|
Median length | 3 |
Mean length | 3.12 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 6호선 |
---|---|
2nd row | 4호선 |
3rd row | 2호선 |
4th row | 6호선 |
5th row | 2호선 |
Common Values
Value | Count | Frequency (%) |
2호선 | 123 | |
5호선 | 75 | |
7호선 | 59 | |
3호선 | 56 | |
4호선 | 48 | 9.6% |
6호선 | 42 | 8.4% |
9호선 | 33 | 6.6% |
8호선 | 26 | 5.2% |
1호선 | 20 | 4.0% |
우이신설선 | 10 | 2.0% |
Length
Value | Count | Frequency (%) |
2호선 | 123 | |
5호선 | 75 | |
7호선 | 59 | |
3호선 | 56 | |
4호선 | 48 | 9.6% |
6호선 | 42 | 8.4% |
9호선 | 33 | 6.6% |
8호선 | 26 | 5.2% |
1호선 | 20 | 4.0% |
우이신설선 | 10 | 2.0% |
Distinct | 223 |
---|---|
Distinct (%) | 44.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.0 KiB |
Value | Count | Frequency (%) |
압구정 | 8 | 1.6% |
을지로3가 | 8 | 1.6% |
건대입구 | 7 | 1.4% |
강남 | 7 | 1.4% |
을지로입구 | 6 | 1.2% |
신당 | 6 | 1.2% |
당산 | 6 | 1.2% |
동대문역사문화공원(ddp | 6 | 1.2% |
양재(서초구청 | 6 | 1.2% |
종로3가 | 6 | 1.2% |
Other values (213) | 434 |
Most occurring characters
Value | Count | Frequency (%) |
) | 104 | 4.8% |
( | 104 | 4.8% |
구 | 90 | 4.2% |
대 | 82 | 3.8% |
동 | 67 | 3.1% |
청 | 50 | 2.3% |
원 | 43 | 2.0% |
신 | 41 | 1.9% |
문 | 38 | 1.8% |
입 | 35 | 1.6% |
Other values (228) | 1512 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1910 | |
Close Punctuation | 104 | 4.8% |
Open Punctuation | 104 | 4.8% |
Decimal Number | 22 | 1.0% |
Uppercase Letter | 18 | 0.8% |
Other Punctuation | 8 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 90 | 4.7% |
대 | 82 | 4.3% |
동 | 67 | 3.5% |
청 | 50 | 2.6% |
원 | 43 | 2.3% |
신 | 41 | 2.1% |
문 | 38 | 2.0% |
입 | 35 | 1.8% |
지 | 33 | 1.7% |
산 | 32 | 1.7% |
Other values (217) | 1399 |
Decimal Number
Value | Count | Frequency (%) |
3 | 14 | |
4 | 3 | 13.6% |
5 | 3 | 13.6% |
1 | 1 | 4.5% |
9 | 1 | 4.5% |
Uppercase Letter
Value | Count | Frequency (%) |
D | 12 | |
P | 6 |
Other Punctuation
Value | Count | Frequency (%) |
. | 6 | |
· | 2 | 25.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 104 |
Open Punctuation
Value | Count | Frequency (%) |
( | 104 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1910 | |
Common | 238 | 11.0% |
Latin | 18 | 0.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 90 | 4.7% |
대 | 82 | 4.3% |
동 | 67 | 3.5% |
청 | 50 | 2.6% |
원 | 43 | 2.3% |
신 | 41 | 2.1% |
문 | 38 | 2.0% |
입 | 35 | 1.8% |
지 | 33 | 1.7% |
산 | 32 | 1.7% |
Other values (217) | 1399 |
Common
Value | Count | Frequency (%) |
) | 104 | |
( | 104 | |
3 | 14 | 5.9% |
. | 6 | 2.5% |
4 | 3 | 1.3% |
5 | 3 | 1.3% |
· | 2 | 0.8% |
1 | 1 | 0.4% |
9 | 1 | 0.4% |
Latin
Value | Count | Frequency (%) |
D | 12 | |
P | 6 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1910 | |
ASCII | 254 | 11.7% |
None | 2 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
) | 104 | |
( | 104 | |
3 | 14 | 5.5% |
D | 12 | 4.7% |
P | 6 | 2.4% |
. | 6 | 2.4% |
4 | 3 | 1.2% |
5 | 3 | 1.2% |
1 | 1 | 0.4% |
9 | 1 | 0.4% |
Hangul
Value | Count | Frequency (%) |
구 | 90 | 4.7% |
대 | 82 | 4.3% |
동 | 67 | 3.5% |
청 | 50 | 2.6% |
원 | 43 | 2.3% |
신 | 41 | 2.1% |
문 | 38 | 2.0% |
입 | 35 | 1.8% |
지 | 33 | 1.7% |
산 | 32 | 1.7% |
Other values (217) | 1399 |
None
Value | Count | Frequency (%) |
· | 2 |
인원합계(PASSN_CNT)
Real number (ℝ)
Distinct | 22 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.644 |
Minimum | 1 |
---|---|
Maximum | 47 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 8.05 |
Maximum | 47 |
Range | 46 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 4.2162564 |
---|---|
Coefficient of variation (CV) | 1.5946507 |
Kurtosis | 40.376813 |
Mean | 2.644 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.4702155 |
Sum | 1322 |
Variance | 17.776818 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 303 | |
2 | 75 | 15.0% |
3 | 36 | 7.2% |
4 | 23 | 4.6% |
5 | 15 | 3.0% |
8 | 9 | 1.8% |
6 | 9 | 1.8% |
7 | 5 | 1.0% |
11 | 3 | 0.6% |
12 | 3 | 0.6% |
Other values (12) | 19 | 3.8% |
Value | Count | Frequency (%) |
1 | 303 | |
2 | 75 | 15.0% |
3 | 36 | 7.2% |
4 | 23 | 4.6% |
5 | 15 | 3.0% |
6 | 9 | 1.8% |
7 | 5 | 1.0% |
8 | 9 | 1.8% |
9 | 2 | 0.4% |
10 | 2 | 0.4% |
Value | Count | Frequency (%) |
47 | 1 | |
35 | 1 | |
34 | 1 | |
23 | 2 | |
20 | 2 | |
18 | 2 | |
16 | 2 | |
15 | 1 | |
14 | 2 | |
13 | 1 |
년(YEAR) | 월(MONTH) | 일(DAY) | 시간(HOUR) | 분_30분단위(HALF_HOUR) | 사용자구분(BILL_USER) | 승차역ID(GETON_STATION_ID) | 승차_호선명(GETON_LINE_NM) | 하차역ID(GETOFF_STATION_ID) | 하차_호선명(GETOFF_LINE_NM) | 인원합계(PASSN_CNT) | |
---|---|---|---|---|---|---|---|---|---|---|---|
년(YEAR) | 1.000 | 0.245 | 0.000 | 0.000 | 0.098 | 0.000 | 0.081 | 0.000 | 0.000 | 0.145 | 0.000 |
월(MONTH) | 0.245 | 1.000 | 0.000 | 0.357 | 0.000 | 0.048 | 0.000 | 0.000 | 0.065 | 0.121 | 0.000 |
일(DAY) | 0.000 | 0.000 | 1.000 | 0.049 | 0.010 | 0.061 | 0.135 | 0.132 | 0.135 | 0.000 | 0.092 |
시간(HOUR) | 0.000 | 0.357 | 0.049 | 1.000 | 0.000 | 0.000 | 0.000 | 0.113 | 0.052 | 0.000 | 0.049 |
분_30분단위(HALF_HOUR) | 0.098 | 0.000 | 0.010 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.129 | 0.074 | 0.000 |
사용자구분(BILL_USER) | 0.000 | 0.048 | 0.061 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
승차역ID(GETON_STATION_ID) | 0.081 | 0.000 | 0.135 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.263 | 0.000 | 0.080 |
승차_호선명(GETON_LINE_NM) | 0.000 | 0.000 | 0.132 | 0.113 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
하차역ID(GETOFF_STATION_ID) | 0.000 | 0.065 | 0.135 | 0.052 | 0.129 | 0.000 | 0.263 | 0.000 | 1.000 | 0.096 | 0.000 |
하차_호선명(GETOFF_LINE_NM) | 0.145 | 0.121 | 0.000 | 0.000 | 0.074 | 0.000 | 0.000 | 0.000 | 0.096 | 1.000 | 0.000 |
인원합계(PASSN_CNT) | 0.000 | 0.000 | 0.092 | 0.049 | 0.000 | 0.000 | 0.080 | 0.000 | 0.000 | 0.000 | 1.000 |
년(YEAR) | 분_30분단위(HALF_HOUR) | 승차_호선명(GETON_LINE_NM) | 사용자구분(BILL_USER) | 하차_호선명(GETOFF_LINE_NM) | |
---|---|---|---|---|---|
년(YEAR) | 1.000 | 0.119 | 0.000 | 0.000 | 0.079 |
분_30분단위(HALF_HOUR) | 0.119 | 1.000 | 0.000 | 0.000 | 0.070 |
승차_호선명(GETON_LINE_NM) | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
사용자구분(BILL_USER) | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
하차_호선명(GETOFF_LINE_NM) | 0.079 | 0.070 | 0.000 | 0.000 | 1.000 |
월(MONTH) | 일(DAY) | 시간(HOUR) | 승차역ID(GETON_STATION_ID) | 하차역ID(GETOFF_STATION_ID) | 인원합계(PASSN_CNT) | 년(YEAR) | 분_30분단위(HALF_HOUR) | 사용자구분(BILL_USER) | 승차_호선명(GETON_LINE_NM) | 하차_호선명(GETOFF_LINE_NM) | |
---|---|---|---|---|---|---|---|---|---|---|---|
월(MONTH) | 1.000 | 0.019 | 0.013 | 0.011 | 0.081 | -0.045 | 0.104 | 0.000 | 0.024 | 0.000 | 0.051 |
일(DAY) | 0.019 | 1.000 | -0.029 | 0.035 | 0.049 | 0.016 | 0.000 | 0.000 | 0.000 | 0.104 | 0.000 |
시간(HOUR) | 0.013 | -0.029 | 1.000 | -0.083 | 0.009 | -0.069 | 0.000 | 0.000 | 0.000 | 0.040 | 0.000 |
승차역ID(GETON_STATION_ID) | 0.011 | 0.035 | -0.083 | 1.000 | -0.035 | -0.027 | 0.067 | 0.000 | 0.000 | 0.000 | 0.000 |
하차역ID(GETOFF_STATION_ID) | 0.081 | 0.049 | 0.009 | -0.035 | 1.000 | 0.057 | 0.000 | 0.085 | 0.000 | 0.000 | 0.052 |
인원합계(PASSN_CNT) | -0.045 | 0.016 | -0.069 | -0.027 | 0.057 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
년(YEAR) | 0.104 | 0.000 | 0.000 | 0.067 | 0.000 | 0.000 | 1.000 | 0.119 | 0.000 | 0.000 | 0.079 |
분_30분단위(HALF_HOUR) | 0.000 | 0.000 | 0.000 | 0.000 | 0.085 | 0.000 | 0.119 | 1.000 | 0.000 | 0.000 | 0.070 |
사용자구분(BILL_USER) | 0.024 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
승차_호선명(GETON_LINE_NM) | 0.000 | 0.104 | 0.040 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 |
하차_호선명(GETOFF_LINE_NM) | 0.051 | 0.000 | 0.000 | 0.000 | 0.052 | 0.000 | 0.079 | 0.070 | 0.000 | 0.000 | 1.000 |
년(YEAR) | 월(MONTH) | 일(DAY) | 시간(HOUR) | 분_30분단위(HALF_HOUR) | 사용자구분(BILL_USER) | 승차역ID(GETON_STATION_ID) | 승차_호선명(GETON_LINE_NM) | 승차_역명(GETON_STATION_NM) | 하차역ID(GETOFF_STATION_ID) | 하차_호선명(GETOFF_LINE_NM) | 하차_역명(GETOFF_STATION_NM) | 인원합계(PASSN_CNT) | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2019 | 2 | 23 | 9 | 30 | 일반 | 233 | 9호선 | 마천 | 412 | 6호선 | 을지로입구 | 3 |
1 | 2021 | 6 | 13 | 14 | 30 | 일반 | 243 | 8호선 | 홍대입구 | 228 | 4호선 | 신당 | 2 |
2 | 2020 | 5 | 17 | 22 | 0 | 일반 | 2726 | 5호선 | 흑석(중앙대입구) | 207 | 2호선 | 상월곡(한국과학기술연구원) | 1 |
3 | 2018 | 4 | 18 | 17 | 30 | 일반 | 425 | 5호선 | 건대입구 | 2523 | 6호선 | 고려대(종암) | 2 |
4 | 2019 | 7 | 10 | 18 | 0 | 경로 | 226 | 4호선 | 경복궁(정부서울청사) | 420 | 2호선 | 청량리(서울시립대입구) | 2 |
5 | 2018 | 3 | 7 | 15 | 30 | 일반 | 2539 | 9호선 | 잠실(송파구청) | 226 | 3호선 | 종각 | 1 |
6 | 2020 | 3 | 20 | 22 | 30 | 일반 | 204 | 5호선 | 여의도 | 2550 | 2호선 | 답십리 | 1 |
7 | 2020 | 2 | 22 | 13 | 0 | 경로 | 331 | 3호선 | 홍대입구 | 2620 | 2호선 | 종로3가 | 16 |
8 | 2019 | 1 | 6 | 9 | 0 | 일반 | 2644 | 7호선 | 망원 | 4110 | 2호선 | 공덕 | 1 |
9 | 2019 | 11 | 22 | 14 | 30 | 일반 | 2628 | 7호선 | 압구정 | 2630 | 2호선 | 개롱 | 5 |
년(YEAR) | 월(MONTH) | 일(DAY) | 시간(HOUR) | 분_30분단위(HALF_HOUR) | 사용자구분(BILL_USER) | 승차역ID(GETON_STATION_ID) | 승차_호선명(GETON_LINE_NM) | 승차_역명(GETON_STATION_NM) | 하차역ID(GETOFF_STATION_ID) | 하차_호선명(GETOFF_LINE_NM) | 하차_역명(GETOFF_STATION_NM) | 인원합계(PASSN_CNT) | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
490 | 2020 | 4 | 9 | 18 | 30 | 일반 | 2736 | 8호선 | 종로3가 | 250 | 8호선 | 염창 | 1 |
491 | 2020 | 7 | 30 | 15 | 30 | 일반 | 248 | 2호선 | 학동 | 2565 | 5호선 | 홍제 | 1 |
492 | 2020 | 10 | 5 | 8 | 0 | 일반 | 2740 | 2호선 | 동대문역사문화공원(DDP) | 219 | 9호선 | 우장산 | 1 |
493 | 2018 | 6 | 2 | 21 | 30 | 일반 | 4110 | 6호선 | 천왕 | 420 | 5호선 | 상월곡(한국과학기술연구원) | 1 |
494 | 2018 | 12 | 20 | 22 | 30 | 일반 | 157 | 6호선 | 충무로 | 4117 | 2호선 | 종로5가 | 1 |
495 | 2021 | 6 | 29 | 18 | 0 | 일반 | 420 | 5호선 | 신목동 | 214 | 3호선 | 명동 | 13 |
496 | 2021 | 7 | 23 | 12 | 0 | 경로 | 2548 | 6호선 | 오금 | 2753 | 6호선 | 사당 | 2 |
497 | 2019 | 3 | 12 | 21 | 0 | 일반 | 230 | 7호선 | 올림픽공원(한국체대) | 2757 | 4호선 | 증산(명지대앞) | 3 |
498 | 2019 | 12 | 14 | 6 | 0 | 일반 | 2827 | 2호선 | 봉천 | 2627 | 6호선 | 강변(동서울터미널) | 4 |
499 | 2021 | 10 | 9 | 21 | 0 | 일반 | 4123 | 7호선 | 잠실(송파구청) | 2640 | 7호선 | 을지로3가 | 1 |