Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 1652 |
Missing cells (%) | 3.3% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
Variable types
Numeric | 2 |
---|---|
Text | 2 |
Categorical | 1 |
Dataset
Description | 경기도_BMS 차량 정보 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=X6P4TP1N764LMR76PS9Z34494333&infSeq=1 |
Reproduction
Analysis started | 2023-12-10 22:02:49.368460 |
---|---|
Analysis finished | 2023-12-10 22:02:50.790694 |
Duration | 1.42 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
차량ID
Real number (ℝ)
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.2294873 × 108 |
Minimum | 2 × 108 |
---|---|
Maximum | 8 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2 × 108 |
---|---|
5-th percentile | 2.000011 × 108 |
Q1 | 2.0890054 × 108 |
median | 2.2400013 × 108 |
Q3 | 2.3400022 × 108 |
95-th percentile | 2.4901148 × 108 |
Maximum | 8 × 108 |
Range | 6 × 108 |
Interquartile range (IQR) | 25099680 |
Descriptive statistics
Standard deviation | 16107578 |
---|---|
Coefficient of variation (CV) | 0.072247904 |
Kurtosis | 163.43766 |
Mean | 2.2294873 × 108 |
Median Absolute Deviation (MAD) | 10002346 |
Skewness | 4.6660438 |
Sum | 2.2294873 × 1012 |
Variance | 2.5945407 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
213000210 | 1 | < 0.1% |
228300614 | 1 | < 0.1% |
210301382 | 1 | < 0.1% |
207000117 | 1 | < 0.1% |
220010058 | 1 | < 0.1% |
208900562 | 1 | < 0.1% |
236000067 | 1 | < 0.1% |
233000958 | 1 | < 0.1% |
204001356 | 1 | < 0.1% |
229900780 | 1 | < 0.1% |
Other values (9990) | 9990 |
Value | Count | Frequency (%) |
200000002 | 1 | |
200000004 | 1 | |
200000005 | 1 | |
200000006 | 1 | |
200000009 | 1 | |
200000010 | 1 | |
200000011 | 1 | |
200000012 | 1 | |
200000013 | 1 | |
200000017 | 1 |
Value | Count | Frequency (%) |
800000002 | 1 | |
289901825 | 1 | |
289901817 | 1 | |
289901813 | 1 | |
289901245 | 1 | |
249912246 | 1 | |
249912244 | 1 | |
249911587 | 1 | |
249911531 | 1 | |
249911523 | 1 |
차량번호
Text
Distinct | 9693 |
---|---|
Distinct (%) | 96.9% |
Missing | 1 |
Missing (%) | < 0.1% |
Memory size | 156.2 KiB |
Length
Max length | 9 |
---|---|
Median length | 9 |
Mean length | 8.9994999 |
Min length | 5 |
Characters and Unicode
Total characters | 89986 |
---|---|
Distinct characters | 36 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 9416 ? |
---|---|
Unique (%) | 94.2% |
Sample
1st row | 경기72사1198 |
---|---|
2nd row | 경기74아1353 |
3rd row | 경기72아3016 |
4th row | 경기77바5913 |
5th row | 경기78아1048 |
Value | Count | Frequency (%) |
서울71바3246 | 5 | < 0.1% |
서울71바3248 | 4 | < 0.1% |
경기78아7095 | 4 | < 0.1% |
경기70사1505 | 4 | < 0.1% |
경기74자7401 | 3 | < 0.1% |
경기76아1002 | 3 | < 0.1% |
경기78사7744 | 3 | < 0.1% |
경기78아5916 | 3 | < 0.1% |
경기72아2048 | 3 | < 0.1% |
서울71바3285 | 3 | < 0.1% |
Other values (9684) | 9965 |
Most occurring characters
Value | Count | Frequency (%) |
7 | 14910 | |
경 | 9797 | |
기 | 9797 | |
1 | 8587 | |
0 | 5836 | 6.5% |
3 | 5194 | 5.8% |
6 | 4974 | 5.5% |
바 | 4858 | 5.4% |
2 | 4691 | 5.2% |
8 | 4656 | 5.2% |
Other values (26) | 16686 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 59987 | |
Other Letter | 29997 | |
Space Separator | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
경 | 9797 | |
기 | 9797 | |
바 | 4858 | |
아 | 3499 | 11.7% |
사 | 827 | 2.8% |
자 | 812 | 2.7% |
울 | 164 | 0.5% |
서 | 163 | 0.5% |
천 | 15 | 0.1% |
인 | 15 | 0.1% |
Other values (15) | 50 | 0.2% |
Decimal Number
Value | Count | Frequency (%) |
7 | 14910 | |
1 | 8587 | |
0 | 5836 | 9.7% |
3 | 5194 | 8.7% |
6 | 4974 | 8.3% |
2 | 4691 | 7.8% |
8 | 4656 | 7.8% |
5 | 4099 | 6.8% |
4 | 3924 | 6.5% |
9 | 3116 | 5.2% |
Space Separator
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 59989 | |
Hangul | 29997 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
경 | 9797 | |
기 | 9797 | |
바 | 4858 | |
아 | 3499 | 11.7% |
사 | 827 | 2.8% |
자 | 812 | 2.7% |
울 | 164 | 0.5% |
서 | 163 | 0.5% |
천 | 15 | 0.1% |
인 | 15 | 0.1% |
Other values (15) | 50 | 0.2% |
Common
Value | Count | Frequency (%) |
7 | 14910 | |
1 | 8587 | |
0 | 5836 | 9.7% |
3 | 5194 | 8.7% |
6 | 4974 | 8.3% |
2 | 4691 | 7.8% |
8 | 4656 | 7.8% |
5 | 4099 | 6.8% |
4 | 3924 | 6.5% |
9 | 3116 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 59989 | |
Hangul | 29997 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
7 | 14910 | |
1 | 8587 | |
0 | 5836 | 9.7% |
3 | 5194 | 8.7% |
6 | 4974 | 8.3% |
2 | 4691 | 7.8% |
8 | 4656 | 7.8% |
5 | 4099 | 6.8% |
4 | 3924 | 6.5% |
9 | 3116 | 5.2% |
Hangul
Value | Count | Frequency (%) |
경 | 9797 | |
기 | 9797 | |
바 | 4858 | |
아 | 3499 | 11.7% |
사 | 827 | 2.8% |
자 | 812 | 2.7% |
울 | 164 | 0.5% |
서 | 163 | 0.5% |
천 | 15 | 0.1% |
인 | 15 | 0.1% |
Other values (15) | 50 | 0.2% |
업체ID
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 192 |
---|---|
Distinct (%) | 1.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4113108.4 |
Minimum | 4100100 |
---|---|
Maximum | 9999999 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 4100100 |
---|---|
5-th percentile | 4100200 |
Q1 | 4101100 |
median | 4103600 |
Q3 | 4109100 |
95-th percentile | 4150300 |
Maximum | 9999999 |
Range | 5899899 |
Interquartile range (IQR) | 8000 |
Descriptive statistics
Standard deviation | 103341.43 |
---|---|
Coefficient of variation (CV) | 0.025124899 |
Kurtosis | 3158.3418 |
Mean | 4113108.4 |
Median Absolute Deviation (MAD) | 3200 |
Skewness | 55.47928 |
Sum | 4.1131084 × 1010 |
Variance | 1.0679452 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4100200 | 629 | 6.3% |
4100300 | 489 | 4.9% |
4103100 | 432 | 4.3% |
4150200 | 361 | 3.6% |
4150300 | 330 | 3.3% |
4103600 | 320 | 3.2% |
4100600 | 313 | 3.1% |
4100700 | 306 | 3.1% |
4100400 | 296 | 3.0% |
4100500 | 270 | 2.7% |
Other values (182) | 6254 |
Value | Count | Frequency (%) |
4100100 | 15 | 0.1% |
4100200 | 629 | |
4100300 | 489 | |
4100400 | 296 | |
4100500 | 270 | |
4100600 | 313 | |
4100700 | 306 | |
4100800 | 96 | 1.0% |
4100900 | 23 | 0.2% |
4101100 | 196 | 2.0% |
Value | Count | Frequency (%) |
9999999 | 3 | < 0.1% |
4159800 | 3 | < 0.1% |
4159300 | 12 | 0.1% |
4158500 | 1 | < 0.1% |
4155500 | 1 | < 0.1% |
4155200 | 53 | |
4155100 | 13 | 0.1% |
4155000 | 28 | |
4154500 | 3 | < 0.1% |
4154400 | 2 | < 0.1% |
제조사
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 25 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
현대 | |
---|---|
대우 | |
<NA> | |
에디슨모터스 | 122 |
기아 | 121 |
Other values (20) |
Length
Max length | 8 |
---|---|
Median length | 2 |
Mean length | 2.4874 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 대우 |
---|---|
2nd row | 대우 |
3rd row | 현대 |
4th row | 대우 |
5th row | 현대 |
Common Values
Value | Count | Frequency (%) |
현대 | 4953 | |
대우 | 2485 | |
<NA> | 1622 | 16.2% |
에디슨모터스 | 122 | 1.2% |
기아 | 121 | 1.2% |
이엠코리아 | 102 | 1.0% |
MAN | 101 | 1.0% |
하이거 | 94 | 0.9% |
볼보 | 72 | 0.7% |
BLK | 48 | 0.5% |
Other values (15) | 280 | 2.8% |
Length
Value | Count | Frequency (%) |
현대 | 4953 | |
대우 | 2485 | |
na | 1622 | 16.2% |
에디슨모터스 | 122 | 1.2% |
기아 | 121 | 1.2% |
이엠코리아 | 102 | 1.0% |
man | 101 | 1.0% |
하이거 | 94 | 0.9% |
볼보 | 72 | 0.7% |
blk | 48 | 0.5% |
Other values (15) | 280 | 2.8% |
차량명
Text
MISSING
 
Distinct | 99 |
---|---|
Distinct (%) | 1.2% |
Missing | 1651 |
Missing (%) | 16.5% |
Memory size | 156.2 KiB |
Length
Max length | 23 |
---|---|
Median length | 20 |
Mean length | 12.108276 |
Min length | 6 |
Characters and Unicode
Total characters | 101092 |
---|---|
Distinct characters | 161 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 11 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | [대우] BS090 |
---|---|
2nd row | [대우] BS106 |
3rd row | [현대] 일렉시티 |
4th row | [대우] FX116 |
5th row | [현대] 뉴슈퍼에어로시티 |
Value | Count | Frequency (%) |
현대 | 4839 | |
대우 | 2485 | |
cng | 2480 | |
유니버스 | 1620 | 8.3% |
뉴슈퍼에어로시티 | 1164 | 6.0% |
bs106 | 740 | 3.8% |
fx116 | 669 | 3.4% |
그린시티 | 591 | 3.0% |
bs090 | 453 | 2.3% |
일렉시티 | 413 | 2.1% |
Other values (109) | 4071 |
Most occurring characters
Value | Count | Frequency (%) |
11176 | 11.1% | |
[ | 8348 | 8.3% |
] | 8348 | 8.3% |
대 | 7409 | 7.3% |
현 | 4924 | 4.9% |
티 | 3183 | 3.1% |
1 | 3176 | 3.1% |
시 | 2824 | 2.8% |
N | 2694 | 2.7% |
스 | 2678 | 2.6% |
Other values (151) | 46332 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 49545 | |
Uppercase Letter | 14543 | 14.4% |
Space Separator | 11176 | 11.1% |
Open Punctuation | 8367 | 8.3% |
Close Punctuation | 8367 | 8.3% |
Decimal Number | 7938 | 7.9% |
Lowercase Letter | 907 | 0.9% |
Dash Punctuation | 244 | 0.2% |
Other Punctuation | 5 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 7409 | |
현 | 4924 | 9.9% |
티 | 3183 | 6.4% |
시 | 2824 | 5.7% |
스 | 2678 | 5.4% |
우 | 2503 | 5.1% |
버 | 2192 | 4.4% |
유 | 1929 | 3.9% |
니 | 1929 | 3.9% |
에 | 1845 | 3.7% |
Other values (95) | 18129 |
Uppercase Letter
Value | Count | Frequency (%) |
N | 2694 | |
G | 2511 | |
C | 2502 | |
S | 1502 | |
B | 1497 | |
F | 1083 | |
X | 1076 | 7.4% |
E | 199 | 1.4% |
Y | 198 | 1.4% |
A | 190 | 1.3% |
Other values (14) | 1091 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 153 | |
o | 147 | |
e | 119 | |
s | 112 | |
n | 111 | |
t | 54 | 6.0% |
c | 51 | 5.6% |
l | 46 | 5.1% |
y | 42 | 4.6% |
w | 28 | 3.1% |
Other values (6) | 44 | 4.9% |
Decimal Number
Value | Count | Frequency (%) |
1 | 3176 | |
0 | 2297 | |
6 | 1478 | |
9 | 484 | 6.1% |
2 | 478 | 6.0% |
8 | 15 | 0.2% |
3 | 5 | 0.1% |
7 | 3 | < 0.1% |
4 | 2 | < 0.1% |
Open Punctuation
Value | Count | Frequency (%) |
[ | 8348 | |
( | 19 | 0.2% |
Close Punctuation
Value | Count | Frequency (%) |
] | 8348 | |
) | 19 | 0.2% |
Space Separator
Value | Count | Frequency (%) |
11176 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 244 |
Other Punctuation
Value | Count | Frequency (%) |
. | 5 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 49545 | |
Common | 36097 | |
Latin | 15450 | 15.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 7409 | |
현 | 4924 | 9.9% |
티 | 3183 | 6.4% |
시 | 2824 | 5.7% |
스 | 2678 | 5.4% |
우 | 2503 | 5.1% |
버 | 2192 | 4.4% |
유 | 1929 | 3.9% |
니 | 1929 | 3.9% |
에 | 1845 | 3.7% |
Other values (95) | 18129 |
Latin
Value | Count | Frequency (%) |
N | 2694 | |
G | 2511 | |
C | 2502 | |
S | 1502 | |
B | 1497 | |
F | 1083 | |
X | 1076 | 7.0% |
E | 199 | 1.3% |
Y | 198 | 1.3% |
A | 190 | 1.2% |
Other values (30) | 1998 |
Common
Value | Count | Frequency (%) |
11176 | ||
[ | 8348 | |
] | 8348 | |
1 | 3176 | 8.8% |
0 | 2297 | 6.4% |
6 | 1478 | 4.1% |
9 | 484 | 1.3% |
2 | 478 | 1.3% |
- | 244 | 0.7% |
( | 19 | 0.1% |
Other values (6) | 49 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 51547 | |
Hangul | 49545 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
11176 | ||
[ | 8348 | |
] | 8348 | |
1 | 3176 | 6.2% |
N | 2694 | 5.2% |
G | 2511 | 4.9% |
C | 2502 | 4.9% |
0 | 2297 | 4.5% |
S | 1502 | 2.9% |
B | 1497 | 2.9% |
Other values (46) | 7496 |
Hangul
Value | Count | Frequency (%) |
대 | 7409 | |
현 | 4924 | 9.9% |
티 | 3183 | 6.4% |
시 | 2824 | 5.7% |
스 | 2678 | 5.4% |
우 | 2503 | 5.1% |
버 | 2192 | 4.4% |
유 | 1929 | 3.9% |
니 | 1929 | 3.9% |
에 | 1845 | 3.7% |
Other values (95) | 18129 |
차량ID | 업체ID | 제조사 | 차량명 | |
---|---|---|---|---|
차량ID | 1.000 | 0.000 | 0.093 | 1.000 |
업체ID | 0.000 | 1.000 | NaN | NaN |
제조사 | 0.093 | NaN | 1.000 | 1.000 |
차량명 | 1.000 | NaN | 1.000 | 1.000 |
차량ID | 업체ID | 제조사 | |
---|---|---|---|
차량ID | 1.000 | 0.194 | 0.073 |
업체ID | 0.194 | 1.000 | 1.000 |
제조사 | 0.073 | 1.000 | 1.000 |
차량ID | 차량번호 | 업체ID | 제조사 | 차량명 | |
---|---|---|---|---|---|
9651 | 213000210 | 경기72사1198 | 4104500 | 대우 | [대우] BS090 |
15131 | 222000206 | 경기74아1353 | 4107800 | 대우 | [대우] BS106 |
6038 | 214000017 | 경기72아3016 | 4102200 | 현대 | [현대] 일렉시티 |
18961 | 249010883 | 경기77바5913 | 4150300 | 대우 | [대우] FX116 |
16756 | 228000080 | 경기78아1048 | 4100600 | 현대 | [현대] 뉴슈퍼에어로시티 |
11451 | 222000010 | 경기74아3703 | 4107700 | 이엠코리아 | [이엠코리아] 에픽시티 |
16084 | 233000765 | 서울70바9240 | 4108600 | 현대 | [현대] 유니버스 |
8132 | 232000106 | 경기79바6269 | 4110100 | 현대 | [현대] 뉴슈퍼에어로시티 CNG |
9646 | 249012181 | 경기78아6120 | 4150600 | 기아 | [기아] 뉴그랜버드 |
187 | 216000371 | 경기73바1478 | 4100700 | 현대 | [현대] 그린시티 |
차량ID | 차량번호 | 업체ID | 제조사 | 차량명 | |
---|---|---|---|---|---|
19315 | 210000159 | 경기71아8160 | 4106300 | 대우 | [대우] BS090 |
2216 | 200000169 | 경기70사1262 | 4102700 | 현대 | [현대] 뉴슈퍼에어로시티 |
17494 | 234001044 | 경기77바2447 | 4100300 | 대우 | [대우] FX116 |
17220 | 214300382 | 경기72아8075 | 4128300 | 대우 | [대우] 레스타 |
13307 | 200011019 | 경기70사6605 | 4105000 | 현대 | [현대] 유니버스 CNG |
17549 | 200000954 | 경기70바5530 | 4103600 | 현대 | [현대] 에어로시티 CNG |
17491 | 234001040 | 경기77바2443 | 4100300 | 대우 | [대우] FX116 |
16500 | 249011143 | 경기77바6920 | 4150300 | 대우 | [대우] FX116 |
795 | 238000010 | 경기77사3557 | 4103500 | 현대 | [현대] 뉴카운티 |
5897 | 234001125 | 경기77바2562 | 4100300 | 대우 | [대우] BS090 |