Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 654.3 KiB |
Average record size in memory | 67.0 B |
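The average record size in the table above can be cross-checked from the total memory size and the number of observations. The following is a minimal sketch of that arithmetic; it assumes the report uses binary units (1 KiB = 1024 bytes), and the variable names are illustrative only.

```python
# Values copied from the "Dataset statistics" table.
total_size_kib = 654.3     # Total size in memory (KiB)
n_observations = 10_000    # Number of observations

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

# Format to one decimal place, as the report does.
print(f"Average record size: {avg_record_bytes:.1f} B")
```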
Variable types
Text | 4 |
---|---|
Numeric | 1 |
Categorical | 2 |
Dataset
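Description | 외부코드 (external code), 전철역코드 (station code), 전철역명 (station name), 종착역명 (terminal station name), 출발시간 (departure time), 요일 (day of week), 상/하행선 (up/down line) |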
---|---|
Author | 서울교통공사 (Seoul Metro) |
URL | https://data.seoul.go.kr/dataList/OA-109/S/1/datasetView.do |
Reproduction
Analysis started | 2024-05-18 07:30:43.678783 |
---|---|
Analysis finished | 2024-05-18 07:30:46.170771 |
Duration | 2.49 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
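The reported duration follows directly from the two timestamps above. As a small sketch using only the Python standard library, the difference can be recomputed like this:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2024-05-18 07:30:43.678783")
finished = datetime.fromisoformat("2024-05-18 07:30:46.170771")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()

# Format to two decimal places, as the report does.
print(f"Duration: {duration:.2f} seconds")
```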
외부코드
Text
Distinct | 450 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
141 | 50 | 0.5% |
140 | 50 | 0.5% |
135 | 48 | 0.5% |
920 | 47 | 0.5% |
204 | 46 | 0.5% |
936 | 46 | 0.5% |
136 | 45 | 0.4% |
917 | 45 | 0.4% |
206 | 44 | 0.4% |
137 | 44 | 0.4% |
Other values (440) | 9535 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 5352 | 17.4% |
2 | 4877 | 15.8% |
3 | 4160 | 13.5% |
4 | 3893 | 12.6% |
5 | 3111 | 10.1% |
7 | 2130 | 6.9% |
6 | 1874 | 6.1% |
9 | 1796 | 5.8% |
0 | 1717 | 5.6% |
8 | 1225 | 4.0% |
Other values (2) | 674 | 2.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 30135 | 97.8% |
Uppercase Letter | 539 | 1.7% |
Dash Punctuation | 135 | 0.4% |
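The category names in this table match Unicode general categories. The sketch below shows one way such counts could be derived with the standard `unicodedata` module; it is an illustration, not the profiler's actual implementation. The helper `category_counts` and the value `"P142"` are hypothetical, while `"141"` and `"211-2"` appear in the report.

```python
import unicodedata
from collections import Counter

# Display names for the general-category codes seen in this column.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Pd": "Dash Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# "P142" is a hypothetical code used only to exercise the letter branch.
sample = ["141", "211-2", "P142"]
counts = category_counts(sample)
total = sum(counts.values())

for name, count in counts.most_common():
    print(f"{name} | {count} | {count / total:.1%} |")
```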
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 5352 | 17.8% |
2 | 4877 | 16.2% |
3 | 4160 | 13.8% |
4 | 3893 | 12.9% |
5 | 3111 | 10.3% |
7 | 2130 | 7.1% |
6 | 1874 | 6.2% |
9 | 1796 | 6.0% |
0 | 1717 | 5.7% |
8 | 1225 | 4.1% |
Uppercase Letter
Value | Count | Frequency (%) |
P | 539 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 135 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 30270 | 98.3% |
Latin | 539 | 1.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 5352 | 17.7% |
2 | 4877 | 16.1% |
3 | 4160 | 13.7% |
4 | 3893 | 12.9% |
5 | 3111 | 10.3% |
7 | 2130 | 7.0% |
6 | 1874 | 6.2% |
9 | 1796 | 5.9% |
0 | 1717 | 5.7% |
8 | 1225 | 4.0% |
Latin
Value | Count | Frequency (%) |
P | 539 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 30809 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 5352 | 17.4% |
2 | 4877 | 15.8% |
3 | 4160 | 13.5% |
4 | 3893 | 12.6% |
5 | 3111 | 10.1% |
7 | 2130 | 6.9% |
6 | 1874 | 6.1% |
9 | 1796 | 5.8% |
0 | 1717 | 5.6% |
8 | 1225 | 4.0% |
Other values (2) | 674 | 2.2% |
전철역코드
Real number (ℝ)
Distinct | 450 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1747.2437 |
Minimum | 150 |
---|---|
Maximum | 4138 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 150 |
---|---|
5-th percentile | 205 |
Q1 | 336 |
median | 1808 |
Q3 | 2644.25 |
95-th percentile | 4118 |
Maximum | 4138 |
Range | 3988 |
Interquartile range (IQR) | 2308.25 |
Descriptive statistics
Standard deviation | 1271.1999 |
---|---|
Coefficient of variation (CV) | 0.72754586 |
Kurtosis | -1.0837311 |
Mean | 1747.2437 |
Median Absolute Deviation (MAD) | 939 |
Skewness | 0.2315259 |
Sum | 17472437 |
Variance | 1615949.2 |
Monotonicity | Not monotonic |
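Several of the statistics above are derived from others in the same tables, so they can be checked against each other. The sketch below recomputes the mean, range, IQR, coefficient of variation, and variance from the reported values; the variable names are illustrative, and small differences from the report are expected because the inputs are already rounded.

```python
# Values copied from the statistics tables for 전철역코드.
n = 10_000
total = 17_472_437            # Sum
minimum, maximum = 150, 4138
q1, q3 = 336, 2644.25
std = 1271.1999               # Standard deviation

mean = total / n              # report: 1747.2437
value_range = maximum - minimum   # report: 3988
iqr = q3 - q1                 # report: 2308.25
cv = std / mean               # report: 0.72754586
variance = std ** 2           # report: 1615949.2

print(f"Mean      {mean:.4f}")
print(f"Range     {value_range}")
print(f"IQR       {iqr}")
print(f"CV        {cv:.6f}")
print(f"Variance  {variance:.1f}")
```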
Value | Count | Frequency (%) |
1701 | 50 | 0.5% |
1007 | 50 | 0.5% |
1003 | 48 | 0.5% |
4120 | 47 | 0.5% |
204 | 46 | 0.5% |
4136 | 46 | 0.5% |
1004 | 45 | 0.4% |
4117 | 45 | 0.4% |
206 | 44 | 0.4% |
1005 | 44 | 0.4% |
Other values (440) | 9535 |
Value | Count | Frequency (%) |
150 | 38 | |
151 | 40 | |
152 | 34 | |
153 | 41 | |
154 | 40 | |
155 | 34 | |
156 | 23 | |
157 | 31 | |
158 | 25 | |
159 | 41 |
Value | Count | Frequency (%) |
4138 | 16 | 0.2% |
4137 | 16 | 0.2% |
4136 | 46 | |
4135 | 20 | |
4134 | 21 | |
4133 | 35 | |
4132 | 16 | 0.2% |
4131 | 8 | 0.1% |
4130 | 36 | |
4129 | 20 |
전철역명
Text
Distinct | 397 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
종로3가 | 96 | 1.0% |
노량진 | 90 | 0.9% |
신도림 | 83 | 0.8% |
동대문역사문화공원 | 81 | 0.8% |
을지로4가 | 76 | 0.8% |
충무로 | 71 | 0.7% |
고속터미널 | 71 | 0.7% |
동대문 | 70 | 0.7% |
동작 | 69 | 0.7% |
대림 | 68 | 0.7% |
Other values (387) | 9225 |
Most occurring characters
Value | Count | Frequency (%) |
대 | 1127 | 3.9% |
구 | 814 | 2.8% |
동 | 802 | 2.8% |
신 | 794 | 2.8% |
산 | 737 | 2.6% |
로 | 533 | 1.9% |
원 | 503 | 1.8% |
지 | 465 | 1.6% |
가 | 462 | 1.6% |
정 | 457 | 1.6% |
Other values (241) | 21873 | 76.6% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 28298 | 99.1% |
Decimal Number | 269 | 0.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 1127 | 4.0% |
구 | 814 | 2.9% |
동 | 802 | 2.8% |
신 | 794 | 2.8% |
산 | 737 | 2.6% |
로 | 533 | 1.9% |
원 | 503 | 1.8% |
지 | 465 | 1.6% |
가 | 462 | 1.6% |
정 | 457 | 1.6% |
Other values (238) | 21604 | 76.3% |
Decimal Number
Value | Count | Frequency (%) |
3 | 153 | 56.9% |
4 | 76 | 28.3% |
5 | 40 | 14.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 28298 | 99.1% |
Common | 269 | 0.9% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 1127 | 4.0% |
구 | 814 | 2.9% |
동 | 802 | 2.8% |
신 | 794 | 2.8% |
산 | 737 | 2.6% |
로 | 533 | 1.9% |
원 | 503 | 1.8% |
지 | 465 | 1.6% |
가 | 462 | 1.6% |
정 | 457 | 1.6% |
Other values (238) | 21604 | 76.3% |
Common
Value | Count | Frequency (%) |
3 | 153 | 56.9% |
4 | 76 | 28.3% |
5 | 40 | 14.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 28298 | 99.1% |
ASCII | 269 | 0.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 1127 | 4.0% |
구 | 814 | 2.9% |
동 | 802 | 2.8% |
신 | 794 | 2.8% |
산 | 737 | 2.6% |
로 | 533 | 1.9% |
원 | 503 | 1.8% |
지 | 465 | 1.6% |
가 | 462 | 1.6% |
정 | 457 | 1.6% |
Other values (238) | 21604 | 76.3% |
ASCII
Value | Count | Frequency (%) |
3 | 153 | 56.9% |
4 | 76 | 28.3% |
5 | 40 | 14.9% |
종착역명
Text
Distinct | 89 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
성수 | 1217 | 12.2% |
방화 | 557 | 5.6% |
인천 | 446 | 4.5% |
당고개 | 415 | 4.2% |
중앙보훈병원 | 406 | 4.1% |
오금 | 351 | 3.5% |
도봉산 | 343 | 3.4% |
오이도 | 332 | 3.3% |
개화 | 324 | 3.2% |
대화 | 307 | 3.1% |
Other values (79) | 5302 | 53.0% |
Most occurring characters
Value | Count | Frequency (%) |
수 | 1612 | 6.3% |
화 | 1440 | 5.6% |
성 | 1241 | 4.8% |
산 | 1111 | 4.3% |
천 | 1013 | 3.9% |
도 | 862 | 3.4% |
개 | 750 | 2.9% |
오 | 694 | 2.7% |
대 | 658 | 2.6% |
암 | 651 | 2.5% |
Other values (117) | 15620 | 60.9% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 25652 | 100.0% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
수 | 1612 | 6.3% |
화 | 1440 | 5.6% |
성 | 1241 | 4.8% |
산 | 1111 | 4.3% |
천 | 1013 | 3.9% |
도 | 862 | 3.4% |
개 | 750 | 2.9% |
오 | 694 | 2.7% |
대 | 658 | 2.6% |
암 | 651 | 2.5% |
Other values (117) | 15620 | 60.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 25652 | 100.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
수 | 1612 | 6.3% |
화 | 1440 | 5.6% |
성 | 1241 | 4.8% |
산 | 1111 | 4.3% |
천 | 1013 | 3.9% |
도 | 862 | 3.4% |
개 | 750 | 2.9% |
오 | 694 | 2.7% |
대 | 658 | 2.6% |
암 | 651 | 2.5% |
Other values (117) | 15620 | 60.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 25652 | 100.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
수 | 1612 | 6.3% |
화 | 1440 | 5.6% |
성 | 1241 | 4.8% |
산 | 1111 | 4.3% |
천 | 1013 | 3.9% |
도 | 862 | 3.4% |
개 | 750 | 2.9% |
오 | 694 | 2.7% |
대 | 658 | 2.6% |
암 | 651 | 2.5% |
Other values (117) | 15620 | 60.9% |
출발시간
Text
Distinct | 2195 |
---|---|
Distinct (%) | 21.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
19:09:30 | 23 | 0.2% |
19:57:30 | 23 | 0.2% |
20:56:00 | 22 | 0.2% |
19:50:00 | 22 | 0.2% |
19:11:30 | 21 | 0.2% |
19:08:30 | 21 | 0.2% |
19:11:00 | 21 | 0.2% |
19:16:00 | 21 | 0.2% |
20:25:30 | 21 | 0.2% |
20:27:30 | 21 | 0.2% |
Other values (2185) | 9784 | 97.8% |
Most occurring characters
Value | Count | Frequency (%) |
: | 20000 | 25.0% |
0 | 17866 | 22.3% |
2 | 13221 | 16.5% |
1 | 7822 | 9.8% |
3 | 7480 | 9.3% |
5 | 3795 | 4.7% |
4 | 3598 | 4.5% |
9 | 3184 | 4.0% |
7 | 1079 | 1.3% |
6 | 990 | 1.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 60000 | 75.0% |
Other Punctuation | 20000 | 25.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 17866 | 29.8% |
2 | 13221 | 22.0% |
1 | 7822 | 13.0% |
3 | 7480 | 12.5% |
5 | 3795 | 6.3% |
4 | 3598 | 6.0% |
9 | 3184 | 5.3% |
7 | 1079 | 1.8% |
6 | 990 | 1.7% |
8 | 965 | 1.6% |
Other Punctuation
Value | Count | Frequency (%) |
: | 20000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 80000 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
: | 20000 | 25.0% |
0 | 17866 | 22.3% |
2 | 13221 | 16.5% |
1 | 7822 | 9.8% |
3 | 7480 | 9.3% |
5 | 3795 | 4.7% |
4 | 3598 | 4.5% |
9 | 3184 | 4.0% |
7 | 1079 | 1.3% |
6 | 990 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 80000 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
: | 20000 | 25.0% |
0 | 17866 | 22.3% |
2 | 13221 | 16.5% |
1 | 7822 | 9.8% |
3 | 7480 | 9.3% |
5 | 3795 | 4.7% |
4 | 3598 | 4.5% |
9 | 3184 | 4.0% |
7 | 1079 | 1.3% |
6 | 990 | 1.2% |
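The character counts for 출발시간 are consistent with a fixed-width `HH:MM:SS` string: 10,000 values of 8 characters give 80,000 characters, of which 20,000 are colons. Because the profiler treats the column as text, a reader who wants numeric times has to parse it. The sketch below shows one possible conversion to seconds after midnight; the helper `to_seconds` is hypothetical, and it assumes every value is a valid clock time with an hour from 00 to 23, which the report does not confirm.

```python
from datetime import datetime

def to_seconds(hhmmss: str) -> int:
    """Parse an 'HH:MM:SS' string and return seconds after midnight."""
    t = datetime.strptime(hhmmss, "%H:%M:%S").time()
    return t.hour * 3600 + t.minute * 60 + t.second

# Three departure times taken from the sample rows of the report.
samples = ["19:45:30", "22:43:00", "22:38:05"]
seconds = [to_seconds(s) for s in samples]

# Each value contributes 8 characters, 2 of them colons.
n_chars = sum(len(s) for s in samples)
n_colons = sum(s.count(":") for s in samples)

for s, sec in zip(samples, seconds):
    print(f"{s} is {sec} seconds after midnight")
print(f"{n_chars} characters, {n_colons} colons")
```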
요일
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
[Bar chart: frequency of categories 1, 2, 3]
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 1 |
3rd row | 1 |
4th row | 2 |
5th row | 3 |
Common Values
Value | Count | Frequency (%) |
1 | 3799 | 38.0% |
2 | 3148 | 31.5% |
3 | 3053 | 30.5% |
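The frequency share of each 요일 category follows from its count divided by the total number of observations. A minimal sketch of that calculation, using the counts from the table above and illustrative variable names:

```python
# Counts copied from the "Common Values" table for 요일.
counts = {"1": 3799, "2": 3148, "3": 3053}

# The counts should add up to the number of observations.
n = sum(counts.values())

# Share of each category, formatted to one decimal place.
shares = {value: f"{count / n:.1%}" for value, count in counts.items()}

for value, count in counts.items():
    print(f"{value} | {count} | {shares[value]} |")
```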
상/하행선
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
[Bar chart: frequency of categories 2, 1]
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 1 |
3rd row | 2 |
4th row | 1 |
5th row | 1 |
Common Values
Value | Count | Frequency (%) |
2 | 5151 | 51.5% |
1 | 4849 | 48.5% |
Variable | 전철역코드 | 종착역명 | 요일 | 상/하행선 |
---|---|---|---|---|
전철역코드 | 1.000 | 0.950 | 0.043 | 0.031 |
종착역명 | 0.950 | 1.000 | 0.138 | 0.966 |
요일 | 0.043 | 0.138 | 1.000 | 0.000 |
상/하행선 | 0.031 | 0.966 | 0.000 | 1.000 |
Variable | 요일 | 상/하행선 |
---|---|---|
요일 | 1.000 | 0.000 |
상/하행선 | 0.000 | 1.000 |
Variable | 전철역코드 | 요일 | 상/하행선 |
---|---|---|---|
전철역코드 | 1.000 | 0.028 | 0.034 |
요일 | 0.028 | 1.000 | 0.000 |
상/하행선 | 0.034 | 0.000 | 1.000 |
Index | 외부코드 | 전철역코드 | 전철역명 | 종착역명 | 출발시간 | 요일 | 상/하행선 |
---|---|---|---|---|---|---|---|
84237 | 155 | 1816 | 간석 | 인천 | 19:45:30 | 2 | 2 |
22518 | 322 | 312 | 불광 | 대화 | 22:43:00 | 1 | 1 |
53254 | 221 | 221 | 역삼 | 성수 | 21:09:30 | 1 | 2 |
39027 | 211-2 | 245 | 신답 | 성수 | 21:51:00 | 2 | 1 |
23953 | 910 | 4110 | 염창 | 중앙보훈병원 | 22:38:05 | 3 | 1 |
86629 | 440 | 1455 | 인덕원 | 오이도 | 19:39:30 | 3 | 2 |
78750 | 525 | 2526 | 신길 | 마천 | 19:59:50 | 1 | 2 |
37057 | 511 | 2512 | 개화산 | 방화 | 21:57:00 | 2 | 1 |
44682 | 711 | 2713 | 수락산 | 온수 | 21:34:20 | 2 | 2 |
44952 | 910 | 4110 | 염창 | 중앙보훈병원 | 21:33:30 | 2 | 1 |
Index | 외부코드 | 전철역코드 | 전철역명 | 종착역명 | 출발시간 | 요일 | 상/하행선 |
---|---|---|---|---|---|---|---|
36595 | 147 | 1814 | 소사 | 연천 | 21:58:30 | 3 | 1 |
26883 | 414 | 414 | 수유 | 진접 | 22:28:30 | 1 | 1 |
81996 | 633 | 2634 | 약수 | 응암 | 19:51:20 | 2 | 1 |
85388 | 543 | 2544 | 장한평 | 마천 | 19:42:30 | 1 | 2 |
68945 | 434 | 434 | 남태령 | 오이도 | 20:26:00 | 1 | 2 |
72293 | 112 | 1904 | 망월사 | 인천 | 20:17:00 | 1 | 2 |
26200 | 430 | 430 | 이촌 | 안산 | 22:31:00 | 3 | 2 |
97778 | 133 | 150 | 서울역 | 광운대 | 19:11:00 | 1 | 1 |
60832 | 141 | 1701 | 구로 | 소요산 | 20:48:30 | 3 | 1 |
87021 | 614 | 2615 | 연신내 | 봉화산 | 19:38:20 | 3 | 2 |