Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 5104 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 37 |
Duplicate rows (%) | 0.7% |
Total size in memory | 214.5 KiB |
Average record size in memory | 43.0 B |
Variable types
Numeric | 3 |
---|---|
Text | 2 |
Dataset
Description | 경기도_BMS 노선/정류소 실측 현황 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=6KBYRR0MU2RPORDO33W834395723&infSeq=1 |
Dataset has 37 (0.7%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2024-04-19 05:20:22.941682 |
---|---|
Analysis finished | 2024-04-19 05:20:24.478090 |
Duration | 1.54 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
이력ID
Real number (ℝ)
Distinct | 57 |
---|---|
Distinct (%) | 1.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.0000019 × 109 |
Minimum | 1 × 109 |
---|---|
Maximum | 1.0000033 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 45.0 KiB |
Quantile statistics
Minimum | 1 × 109 |
---|---|
5-th percentile | 1 × 109 |
Q1 | 1.0000015 × 109 |
median | 1.000002 × 109 |
Q3 | 1.0000025 × 109 |
95-th percentile | 1.0000031 × 109 |
Maximum | 1.0000033 × 109 |
Range | 3321 |
Interquartile range (IQR) | 1063 |
Descriptive statistics
Standard deviation | 858.80181 |
---|---|
Coefficient of variation (CV) | 8.5880015 × 10-7 |
Kurtosis | 0.11530357 |
Mean | 1.0000019 × 109 |
Median Absolute Deviation (MAD) | 554 |
Skewness | -0.69418759 |
Sum | 5.1040098 × 1012 |
Variance | 737540.54 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1000000000 | 407 | 8.0% |
1000001463 | 170 | 3.3% |
1000002017 | 167 | 3.3% |
1000002941 | 156 | 3.1% |
1000002942 | 154 | 3.0% |
1000001453 | 149 | 2.9% |
1000002018 | 142 | 2.8% |
1000002931 | 135 | 2.6% |
1000001471 | 133 | 2.6% |
1000002301 | 129 | 2.5% |
Other values (47) | 3362 |
Value | Count | Frequency (%) |
1000000000 | 407 | |
1000000048 | 96 | 1.9% |
1000001063 | 13 | 0.3% |
1000001124 | 60 | 1.2% |
1000001161 | 101 | 2.0% |
1000001289 | 59 | 1.2% |
1000001336 | 97 | 1.9% |
1000001384 | 64 | 1.3% |
1000001401 | 61 | 1.2% |
1000001416 | 77 | 1.5% |
Value | Count | Frequency (%) |
1000003321 | 115 | |
1000003155 | 110 | |
1000003081 | 73 | |
1000002942 | 154 | |
1000002941 | 156 | |
1000002938 | 57 | 1.1% |
1000002931 | 135 | |
1000002930 | 81 | |
1000002900 | 126 | |
1000002854 | 122 |
노선ID
Real number (ℝ)
Distinct | 60 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.2144187 × 108 |
Minimum | 2.0000014 × 108 |
---|---|
Maximum | 2.3400034 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 45.0 KiB |
Quantile statistics
Minimum | 2.0000014 × 108 |
---|---|
5-th percentile | 2.0700003 × 108 |
Q1 | 2.1500001 × 108 |
median | 2.2200002 × 108 |
Q3 | 2.2900003 × 108 |
95-th percentile | 2.3400003 × 108 |
Maximum | 2.3400034 × 108 |
Range | 34000201 |
Interquartile range (IQR) | 14000022 |
Descriptive statistics
Standard deviation | 9366625.8 |
---|---|
Coefficient of variation (CV) | 0.042298351 |
Kurtosis | -0.90348203 |
Mean | 2.2144187 × 108 |
Median Absolute Deviation (MAD) | 7000003 |
Skewness | -0.39604431 |
Sum | 1.1302393 × 1012 |
Variance | 8.7733679 × 1013 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
234000033 | 197 | 3.9% |
234000026 | 170 | 3.3% |
217000006 | 167 | 3.3% |
207000027 | 156 | 3.1% |
207000028 | 154 | 3.0% |
234000315 | 149 | 2.9% |
222000004 | 147 | 2.9% |
217000001 | 142 | 2.8% |
207000026 | 135 | 2.6% |
222000014 | 133 | 2.6% |
Other values (50) | 3554 |
Value | Count | Frequency (%) |
200000143 | 72 | |
204000026 | 61 | 1.2% |
204000030 | 25 | 0.5% |
207000026 | 135 | |
207000027 | 156 | |
207000028 | 154 | |
207000033 | 81 | |
207000048 | 57 | 1.1% |
207000054 | 97 | |
207000055 | 101 |
Value | Count | Frequency (%) |
234000344 | 59 | 1.2% |
234000315 | 149 | |
234000033 | 197 | |
234000026 | 170 | |
234000015 | 81 | |
234000007 | 13 | 0.3% |
232000004 | 110 | |
232000001 | 84 | |
231000101 | 71 | 1.4% |
231000099 | 60 | 1.2% |
정류소ID
Real number (ℝ)
Distinct | 3413 |
---|---|
Distinct (%) | 66.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.1085642 × 108 |
Minimum | 1 × 108 |
---|---|
Maximum | 2.3800028 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 45.0 KiB |
Quantile statistics
Minimum | 1 × 108 |
---|---|
5-th percentile | 1.11 × 108 |
Q1 | 2.1000028 × 108 |
median | 2.1900041 × 108 |
Q3 | 2.2800085 × 108 |
95-th percentile | 2.3500031 × 108 |
Maximum | 2.3800028 × 108 |
Range | 1.3800027 × 108 |
Interquartile range (IQR) | 18000562 |
Descriptive statistics
Standard deviation | 32827267 |
---|---|
Coefficient of variation (CV) | 0.15568541 |
Kurtosis | 4.9298369 |
Mean | 2.1085642 × 108 |
Median Absolute Deviation (MAD) | 9000338.5 |
Skewness | -2.478621 |
Sum | 1.0762111 × 1012 |
Variance | 1.0776295 × 1015 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
207000027 | 5 | 0.1% |
228001084 | 5 | 0.1% |
215000024 | 5 | 0.1% |
219000193 | 5 | 0.1% |
228001552 | 5 | 0.1% |
228001143 | 5 | 0.1% |
207000152 | 5 | 0.1% |
219000190 | 5 | 0.1% |
219000192 | 5 | 0.1% |
235000242 | 5 | 0.1% |
Other values (3403) | 5054 |
Value | Count | Frequency (%) |
100000005 | 1 | |
100000034 | 1 | |
100000088 | 2 | |
100000097 | 2 | |
100000169 | 2 | |
100000174 | 1 | |
101000001 | 2 | |
101000002 | 2 | |
101000005 | 2 | |
101000007 | 1 |
Value | Count | Frequency (%) |
238000277 | 2 | |
238000271 | 2 | |
238000270 | 2 | |
238000269 | 2 | |
238000268 | 2 | |
238000267 | 2 | |
238000266 | 2 | |
238000265 | 2 | |
238000264 | 2 | |
238000263 | 2 |
노선명
Text
Distinct | 59 |
---|---|
Distinct (%) | 1.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 40.0 KiB |
Length
Max length | 9 |
---|---|
Median length | 8 |
Mean length | 3.2893809 |
Min length | 1 |
Characters and Unicode
Total characters | 16789 |
---|---|
Distinct characters | 20 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 30-1(지게동) |
---|---|
2nd row | 30-1(지게동) |
3rd row | 30-1(지게동) |
4th row | 30-1(지게동) |
5th row | 30-1(지게동) |
Value | Count | Frequency (%) |
11-1 | 197 | 3.9% |
720-2 | 170 | 3.3% |
61 | 167 | 3.3% |
36 | 156 | 3.1% |
37 | 154 | 3.0% |
9001 | 149 | 2.9% |
202 | 147 | 2.9% |
1 | 142 | 2.8% |
73 | 137 | 2.7% |
133 | 135 | 2.6% |
Other values (49) | 3550 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 2886 | |
0 | 2622 | |
3 | 1935 | |
- | 1836 | |
2 | 1606 | |
7 | 1450 | |
8 | 960 | 5.7% |
5 | 847 | 5.0% |
6 | 835 | 5.0% |
9 | 545 | 3.2% |
Other values (10) | 1267 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 13724 | |
Dash Punctuation | 1836 | 10.9% |
Other Letter | 665 | 4.0% |
Close Punctuation | 282 | 1.7% |
Open Punctuation | 282 | 1.7% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 2886 | |
0 | 2622 | |
3 | 1935 | |
2 | 1606 | |
7 | 1450 | |
8 | 960 | 7.0% |
5 | 847 | 6.2% |
6 | 835 | 6.1% |
9 | 545 | 4.0% |
4 | 38 | 0.3% |
Other Letter
Value | Count | Frequency (%) |
게 | 101 | |
동 | 101 | |
지 | 101 | |
현 | 97 | |
대 | 97 | |
일 | 84 | |
산 | 84 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1836 |
Close Punctuation
Value | Count | Frequency (%) |
) | 282 |
Open Punctuation
Value | Count | Frequency (%) |
( | 282 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 16124 | |
Hangul | 665 | 4.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 2886 | |
0 | 2622 | |
3 | 1935 | |
- | 1836 | |
2 | 1606 | |
7 | 1450 | |
8 | 960 | 6.0% |
5 | 847 | 5.3% |
6 | 835 | 5.2% |
9 | 545 | 3.4% |
Other values (3) | 602 | 3.7% |
Hangul
Value | Count | Frequency (%) |
게 | 101 | |
동 | 101 | |
지 | 101 | |
현 | 97 | |
대 | 97 | |
일 | 84 | |
산 | 84 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 16124 | |
Hangul | 665 | 4.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 2886 | |
0 | 2622 | |
3 | 1935 | |
- | 1836 | |
2 | 1606 | |
7 | 1450 | |
8 | 960 | 6.0% |
5 | 847 | 5.3% |
6 | 835 | 5.2% |
9 | 545 | 3.4% |
Other values (3) | 602 | 3.7% |
Hangul
Value | Count | Frequency (%) |
게 | 101 | |
동 | 101 | |
지 | 101 | |
현 | 97 | |
대 | 97 | |
일 | 84 | |
산 | 84 |
정류소명
Text
Distinct | 2678 |
---|---|
Distinct (%) | 52.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 40.0 KiB |
Value | Count | Frequency (%) |
부대앞 | 20 | 0.4% |
주공5단지 | 15 | 0.3% |
녹양역 | 11 | 0.2% |
현대아파트 | 11 | 0.2% |
원각사 | 11 | 0.2% |
양주시청 | 10 | 0.2% |
일산동구청 | 10 | 0.2% |
연신내역 | 9 | 0.2% |
대화역 | 9 | 0.2% |
주택앞 | 9 | 0.2% |
Other values (2665) | 4990 |
Most occurring characters
Value | Count | Frequency (%) |
. | 945 | 3.0% |
동 | 835 | 2.7% |
아 | 762 | 2.4% |
트 | 726 | 2.3% |
리 | 720 | 2.3% |
파 | 717 | 2.3% |
앞 | 671 | 2.2% |
교 | 636 | 2.0% |
마 | 578 | 1.9% |
지 | 556 | 1.8% |
Other values (506) | 24030 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 29028 | |
Other Punctuation | 945 | 3.0% |
Decimal Number | 662 | 2.1% |
Uppercase Letter | 247 | 0.8% |
Close Punctuation | 144 | 0.5% |
Open Punctuation | 136 | 0.4% |
Dash Punctuation | 9 | < 0.1% |
Lowercase Letter | 4 | < 0.1% |
Space Separator | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 835 | 2.9% |
아 | 762 | 2.6% |
트 | 726 | 2.5% |
리 | 720 | 2.5% |
파 | 717 | 2.5% |
앞 | 671 | 2.3% |
교 | 636 | 2.2% |
마 | 578 | 2.0% |
지 | 556 | 1.9% |
원 | 551 | 1.9% |
Other values (474) | 22276 |
Uppercase Letter
Value | Count | Frequency (%) |
A | 106 | |
S | 39 | 15.8% |
G | 36 | 14.6% |
L | 17 | 6.9% |
K | 14 | 5.7% |
C | 11 | 4.5% |
B | 8 | 3.2% |
T | 6 | 2.4% |
E | 4 | 1.6% |
D | 2 | 0.8% |
Other values (3) | 4 | 1.6% |
Decimal Number
Value | Count | Frequency (%) |
2 | 167 | |
1 | 162 | |
3 | 97 | |
5 | 56 | 8.5% |
4 | 51 | 7.7% |
7 | 42 | 6.3% |
6 | 30 | 4.5% |
8 | 24 | 3.6% |
0 | 23 | 3.5% |
9 | 10 | 1.5% |
Lowercase Letter
Value | Count | Frequency (%) |
s | 1 | |
k | 1 | |
l | 1 | |
g | 1 |
Other Punctuation
Value | Count | Frequency (%) |
. | 945 |
Close Punctuation
Value | Count | Frequency (%) |
) | 144 |
Open Punctuation
Value | Count | Frequency (%) |
( | 136 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9 |
Space Separator
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 29028 | |
Common | 1897 | 6.1% |
Latin | 251 | 0.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 835 | 2.9% |
아 | 762 | 2.6% |
트 | 726 | 2.5% |
리 | 720 | 2.5% |
파 | 717 | 2.5% |
앞 | 671 | 2.3% |
교 | 636 | 2.2% |
마 | 578 | 2.0% |
지 | 556 | 1.9% |
원 | 551 | 1.9% |
Other values (474) | 22276 |
Latin
Value | Count | Frequency (%) |
A | 106 | |
S | 39 | 15.5% |
G | 36 | 14.3% |
L | 17 | 6.8% |
K | 14 | 5.6% |
C | 11 | 4.4% |
B | 8 | 3.2% |
T | 6 | 2.4% |
E | 4 | 1.6% |
D | 2 | 0.8% |
Other values (7) | 8 | 3.2% |
Common
Value | Count | Frequency (%) |
. | 945 | |
2 | 167 | 8.8% |
1 | 162 | 8.5% |
) | 144 | 7.6% |
( | 136 | 7.2% |
3 | 97 | 5.1% |
5 | 56 | 3.0% |
4 | 51 | 2.7% |
7 | 42 | 2.2% |
6 | 30 | 1.6% |
Other values (5) | 67 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 29028 | |
ASCII | 2148 | 6.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
. | 945 | |
2 | 167 | 7.8% |
1 | 162 | 7.5% |
) | 144 | 6.7% |
( | 136 | 6.3% |
A | 106 | 4.9% |
3 | 97 | 4.5% |
5 | 56 | 2.6% |
4 | 51 | 2.4% |
7 | 42 | 2.0% |
Other values (22) | 242 | 11.3% |
Hangul
Value | Count | Frequency (%) |
동 | 835 | 2.9% |
아 | 762 | 2.6% |
트 | 726 | 2.5% |
리 | 720 | 2.5% |
파 | 717 | 2.5% |
앞 | 671 | 2.3% |
교 | 636 | 2.2% |
마 | 578 | 2.0% |
지 | 556 | 1.9% |
원 | 551 | 1.9% |
Other values (474) | 22276 |
이력ID | 노선ID | 정류소ID | 노선명 | |
---|---|---|---|---|
이력ID | 1.000 | 0.875 | 0.412 | 0.999 |
노선ID | 0.875 | 1.000 | 0.549 | 1.000 |
정류소ID | 0.412 | 0.549 | 1.000 | 0.857 |
노선명 | 0.999 | 1.000 | 0.857 | 1.000 |
이력ID | 노선ID | 정류소ID | |
---|---|---|---|
이력ID | 1.000 | -0.313 | -0.091 |
노선ID | -0.313 | 1.000 | 0.145 |
정류소ID | -0.091 | 0.145 | 1.000 |
이력ID | 노선ID | 정류소ID | 노선명 | 정류소명 | |
---|---|---|---|---|---|
0 | 1000001161 | 207000055 | 235000099 | 30-1(지게동) | 주공아파트 |
1 | 1000001161 | 207000055 | 235000089 | 30-1(지게동) | 덕정고등학교 |
2 | 1000001161 | 207000055 | 235000098 | 30-1(지게동) | 조은마을(주공6단지) |
3 | 1000001161 | 207000055 | 235000097 | 30-1(지게동) | 서재말 |
4 | 1000001161 | 207000055 | 235000601 | 30-1(지게동) | 회암2동.회암편의점 |
5 | 1000001161 | 207000055 | 235000521 | 30-1(지게동) | 율정삼거리 |
6 | 1000001161 | 207000055 | 235000087 | 30-1(지게동) | 천보초등학교앞 |
7 | 1000001161 | 207000055 | 235000086 | 30-1(지게동) | 모정동 |
8 | 1000001161 | 207000055 | 235000085 | 30-1(지게동) | 귀율동 |
9 | 1000001161 | 207000055 | 235000084 | 30-1(지게동) | 기우리다리앞 |
이력ID | 노선ID | 정류소ID | 노선명 | 정류소명 | |
---|---|---|---|---|---|
5094 | 1000002941 | 207000027 | 215000049 | 36 | 지방산업단지앞 |
5095 | 1000002941 | 207000027 | 215000048 | 36 | 성보주택 |
5096 | 1000002941 | 207000027 | 215000047 | 36 | 소요산역앞 |
5097 | 1000002941 | 207000027 | 215000217 | 36 | 소요산차고지 |
5098 | 1000002942 | 207000028 | 207000061 | 37 | 다락원앞 |
5099 | 1000001495 | 234000033 | 124000021 | 11-1 | 둔촌2동사무소.보훈병원 |
5100 | 1000001495 | 234000033 | 124000023 | 11-1 | 강동성심병원.길동사거리 |
5101 | 1000001495 | 234000033 | 124000025 | 11-1 | 강동전철역.강동성심병원 |
5102 | 1000001495 | 234000033 | 124000027 | 11-1 | 천호동.현대백화점.이마트 |
5103 | 1000001495 | 234000033 | 104000080 | 11-1 | 워커힐아파트.워커힐호텔.한강웨딩홀 |
Most frequently occurring
이력ID | 노선ID | 정류소ID | 노선명 | 정류소명 | # duplicates | |
---|---|---|---|---|---|---|
0 | 1000000000 | 231000005 | 231000521 | 370 | 죽산시외버스터미널 | 2 |
1 | 1000000000 | 231000101 | 214001295 | 7-6 | 은산1리 | 2 |
2 | 1000000000 | 231000101 | 214001296 | 7-6 | 산하리 | 2 |
3 | 1000000000 | 231000101 | 231000076 | 7-6 | 양성터미널 | 2 |
4 | 1000000000 | 231000101 | 231001134 | 7-6 | 산하리(평동) | 2 |
5 | 1000000000 | 231000101 | 231001224 | 7-6 | 산하리삼거리 | 2 |
6 | 1000001124 | 229000034 | 229000849 | 700 | 트리플메디컬타운 | 2 |
7 | 1000001453 | 234000315 | 100000169 | 9001 | 조계사.인사동 | 2 |
8 | 1000001453 | 234000315 | 206000174 | 9001 | 오리초등학교 | 2 |
9 | 1000001453 | 234000315 | 206000175 | 9001 | 미금초등학교 | 2 |