Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 673.8 KiB |
Average record size in memory | 69.0 B |
Variable types
Numeric | 4 |
---|---|
Categorical | 2 |
Text | 1 |
Dataset
Description | 경기도_BMS GIS 경로 단위 정보 |
---|---|
Author | 경기도 |
URL | https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=HNIP5FSE7EYEJITHJR3X33138663&infSeq=1 |
등록일자 is highly overall correlated with 사용구분 | High correlation |
사용구분 is highly overall correlated with 등록일자 | High correlation |
시외버스추가경로구분 is highly imbalanced (96.9%) | Imbalance |
Reproduction
Analysis started | 2023-12-10 21:01:00.925421 |
---|---|
Analysis finished | 2023-12-10 21:01:03.860670 |
Duration | 2.94 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
노선아이디
Real number (ℝ)
Distinct | 1851 |
---|---|
Distinct (%) | 18.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.2962998 × 108 |
Minimum | 2.0000001 × 108 |
---|---|
Maximum | 2.4120301 × 108 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.0000001 × 108 |
---|---|
5-th percentile | 2.0000028 × 108 |
Q1 | 2.2300004 × 108 |
median | 2.3400047 × 108 |
Q3 | 2.4100102 × 108 |
95-th percentile | 2.4100704 × 108 |
Maximum | 2.4120301 × 108 |
Range | 41203001 |
Interquartile range (IQR) | 18000982 |
Descriptive statistics
Standard deviation | 12197687 |
---|---|
Coefficient of variation (CV) | 0.05311888 |
Kurtosis | 0.032676724 |
Mean | 2.2962998 × 108 |
Median Absolute Deviation (MAD) | 7002126 |
Skewness | -1.0501257 |
Sum | 2.2962998 × 1012 |
Variance | 1.4878358 × 1014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
200000028 | 87 | 0.9% |
241006942 | 79 | 0.8% |
241006799 | 69 | 0.7% |
224000014 | 67 | 0.7% |
241006390 | 65 | 0.7% |
241000040 | 63 | 0.6% |
229000035 | 56 | 0.6% |
241003900 | 48 | 0.5% |
241000320 | 48 | 0.5% |
233000383 | 47 | 0.5% |
Other values (1841) | 9371 |
Value | Count | Frequency (%) |
200000009 | 3 | < 0.1% |
200000015 | 16 | 0.2% |
200000017 | 5 | 0.1% |
200000021 | 1 | < 0.1% |
200000024 | 6 | 0.1% |
200000028 | 87 | |
200000032 | 8 | 0.1% |
200000034 | 3 | < 0.1% |
200000042 | 2 | < 0.1% |
200000057 | 2 | < 0.1% |
Value | Count | Frequency (%) |
241203010 | 7 | 0.1% |
241105900 | 6 | 0.1% |
241103250 | 1 | < 0.1% |
241103010 | 5 | 0.1% |
241007245 | 16 | |
241007243 | 24 | |
241007225 | 18 | |
241007203 | 22 | |
241007199 | 39 | |
241007197 | 10 | 0.1% |
링크순서
Real number (ℝ)
Distinct | 440 |
---|---|
Distinct (%) | 4.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 80.5772 |
Minimum | 1 |
---|---|
Maximum | 584 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5 |
Q1 | 24 |
median | 55 |
Q3 | 112 |
95-th percentile | 236 |
Maximum | 584 |
Range | 583 |
Interquartile range (IQR) | 88 |
Descriptive statistics
Standard deviation | 79.314539 |
---|---|
Coefficient of variation (CV) | 0.9843298 |
Kurtosis | 5.0094006 |
Mean | 80.5772 |
Median Absolute Deviation (MAD) | 37 |
Skewness | 1.938253 |
Sum | 805772 |
Variance | 6290.7961 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 128 | 1.3% |
17 | 124 | 1.2% |
11 | 123 | 1.2% |
5 | 116 | 1.2% |
10 | 115 | 1.1% |
24 | 113 | 1.1% |
1 | 113 | 1.1% |
25 | 111 | 1.1% |
6 | 111 | 1.1% |
18 | 109 | 1.1% |
Other values (430) | 8837 |
Value | Count | Frequency (%) |
1 | 113 | |
2 | 128 | |
3 | 101 | |
4 | 82 | |
5 | 116 | |
6 | 111 | |
7 | 102 | |
8 | 94 | |
9 | 96 | |
10 | 115 |
Value | Count | Frequency (%) |
584 | 1 | |
582 | 1 | |
571 | 1 | |
558 | 1 | |
544 | 1 | |
542 | 1 | |
539 | 1 | |
537 | 1 | |
527 | 1 | |
526 | 1 |
링크아이디
Real number (ℝ)
Distinct | 6226 |
---|---|
Distinct (%) | 62.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.1537979 × 109 |
Minimum | 1.0000009 × 109 |
---|---|
Maximum | 3.6900008 × 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1.0000009 × 109 |
---|---|
5-th percentile | 1.1700209 × 109 |
Q1 | 2.1200302 × 109 |
median | 2.2700615 × 109 |
Q3 | 2.3303149 × 109 |
95-th percentile | 2.4000749 × 109 |
Maximum | 3.6900008 × 109 |
Range | 2.6899999 × 109 |
Interquartile range (IQR) | 2.1028475 × 108 |
Descriptive statistics
Standard deviation | 3.6841695 × 108 |
---|---|
Coefficient of variation (CV) | 0.17105456 |
Kurtosis | 3.6584331 |
Mean | 2.1537979 × 109 |
Median Absolute Deviation (MAD) | 90063350 |
Skewness | -1.5255746 |
Sum | 2.1537979 × 1013 |
Variance | 1.3573105 × 1017 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2370156600 | 22 | 0.2% |
2300118900 | 20 | 0.2% |
2390057100 | 20 | 0.2% |
2060114700 | 18 | 0.2% |
1040022100 | 18 | 0.2% |
2390056700 | 17 | 0.2% |
2390056900 | 16 | 0.2% |
2330113200 | 15 | 0.1% |
1040021900 | 15 | 0.1% |
2370160400 | 15 | 0.1% |
Other values (6216) | 9824 |
Value | Count | Frequency (%) |
1000000900 | 1 | |
1000001000 | 2 | |
1000001700 | 1 | |
1000001800 | 1 | |
1000002400 | 1 | |
1000013800 | 1 | |
1000014200 | 2 | |
1000014300 | 1 | |
1000014500 | 1 | |
1000014600 | 1 |
Value | Count | Frequency (%) |
3690000800 | 1 | |
3690000700 | 1 | |
3650000401 | 1 | |
3630004700 | 2 | |
3630004300 | 1 | |
3630004200 | 1 | |
3630002301 | 1 | |
3630002201 | 1 | |
3630002101 | 1 | |
3630001001 | 1 |
사용구분
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Y | |
---|---|
9 | |
0 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Y |
---|---|
2nd row | Y |
3rd row | 9 |
4th row | 9 |
5th row | Y |
Common Values
Value | Count | Frequency (%) |
Y | 5153 | |
9 | 3121 | |
0 | 1726 | 17.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
y | 5153 | |
9 | 3121 | |
0 | 1726 | 17.3% |
등록아이디
Text
Distinct | 86 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
0 | 4847 | |
kdbus01 | 500 | 5.0% |
busdy000 | 276 | 2.8% |
seedsp1400 | 232 | 2.3% |
kdbus03 | 211 | 2.1% |
kwbus119 | 164 | 1.6% |
kd0112 | 143 | 1.4% |
hckim082 | 140 | 1.4% |
kkks1952 | 139 | 1.4% |
shinsung00 | 137 | 1.4% |
Other values (76) | 3211 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 10519 | |
s | 3338 | 7.8% |
k | 3084 | 7.2% |
1 | 3064 | 7.2% |
d | 2579 | 6.1% |
u | 2072 | 4.9% |
b | 1586 | 3.7% |
2 | 1402 | 3.3% |
n | 1121 | 2.6% |
8 | 1057 | 2.5% |
Other values (24) | 12720 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 21675 | |
Decimal Number | 20830 | |
Dash Punctuation | 37 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
s | 3338 | |
k | 3084 | |
d | 2579 | |
u | 2072 | |
b | 1586 | 7.3% |
n | 1121 | 5.2% |
h | 1009 | 4.7% |
i | 849 | 3.9% |
y | 842 | 3.9% |
w | 726 | 3.3% |
Other values (13) | 4469 |
Decimal Number
Value | Count | Frequency (%) |
0 | 10519 | |
1 | 3064 | 14.7% |
2 | 1402 | 6.7% |
8 | 1057 | 5.1% |
3 | 1020 | 4.9% |
4 | 1017 | 4.9% |
5 | 854 | 4.1% |
9 | 703 | 3.4% |
6 | 635 | 3.0% |
7 | 559 | 2.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 37 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 21675 | |
Common | 20867 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
s | 3338 | |
k | 3084 | |
d | 2579 | |
u | 2072 | |
b | 1586 | 7.3% |
n | 1121 | 5.2% |
h | 1009 | 4.7% |
i | 849 | 3.9% |
y | 842 | 3.9% |
w | 726 | 3.3% |
Other values (13) | 4469 |
Common
Value | Count | Frequency (%) |
0 | 10519 | |
1 | 3064 | 14.7% |
2 | 1402 | 6.7% |
8 | 1057 | 5.1% |
3 | 1020 | 4.9% |
4 | 1017 | 4.9% |
5 | 854 | 4.1% |
9 | 703 | 3.4% |
6 | 635 | 3.0% |
7 | 559 | 2.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 42542 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 10519 | |
s | 3338 | 7.8% |
k | 3084 | 7.2% |
1 | 3064 | 7.2% |
d | 2579 | 6.1% |
u | 2072 | 4.9% |
b | 1586 | 3.7% |
2 | 1402 | 3.3% |
n | 1121 | 2.6% |
8 | 1057 | 2.5% |
Other values (24) | 12720 |
등록일자
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 321 |
---|---|
Distinct (%) | 3.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.0206503 × 1013 |
Minimum | 2.019083 × 1013 |
---|---|
Maximum | 2.0230907 × 1013 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 2.019083 × 1013 |
---|---|
5-th percentile | 2.019083 × 1013 |
Q1 | 2.019083 × 1013 |
median | 2.0200207 × 1013 |
Q3 | 2.0230327 × 1013 |
95-th percentile | 2.0230719 × 1013 |
Maximum | 2.0230907 × 1013 |
Range | 4.007694 × 1010 |
Interquartile range (IQR) | 3.9496926 × 1010 |
Descriptive statistics
Standard deviation | 1.78032 × 1010 |
---|---|
Coefficient of variation (CV) | 0.00088106291 |
Kurtosis | -1.6229774 |
Mean | 2.0206503 × 1013 |
Median Absolute Deviation (MAD) | 9.376918 × 109 |
Skewness | 0.47146739 |
Sum | 2.0206503 × 1017 |
Variance | 3.1695394 × 1020 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20190830185541 | 4847 | |
20230329172451 | 87 | 0.9% |
20200211182427 | 79 | 0.8% |
20210312163443 | 69 | 0.7% |
20230327111357 | 67 | 0.7% |
20200214101250 | 65 | 0.7% |
20200219153506 | 63 | 0.6% |
20230418104933 | 56 | 0.6% |
20210527170513 | 48 | 0.5% |
20200210093348 | 48 | 0.5% |
Other values (311) | 4571 |
Value | Count | Frequency (%) |
20190830185541 | 4847 | |
20191031104625 | 7 | 0.1% |
20191031104656 | 2 | < 0.1% |
20191101132221 | 2 | < 0.1% |
20191217154625 | 5 | 0.1% |
20191218173216 | 12 | 0.1% |
20191220145012 | 2 | < 0.1% |
20191226110423 | 6 | 0.1% |
20200115164018 | 43 | 0.4% |
20200117171026 | 8 | 0.1% |
Value | Count | Frequency (%) |
20230907125904 | 7 | 0.1% |
20230831175513 | 18 | |
20230830153749 | 37 | |
20230830112556 | 13 | 0.1% |
20230829104407 | 27 | |
20230829104238 | 30 | |
20230823112003 | 6 | 0.1% |
20230821152026 | 7 | 0.1% |
20230821110819 | 6 | 0.1% |
20230821110809 | 11 | 0.1% |
시외버스추가경로구분
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
0 | |
---|---|
1 | 32 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 0 |
Common Values
Value | Count | Frequency (%) |
0 | 9968 | |
1 | 32 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 9968 | |
1 | 32 | 0.3% |
노선아이디 | 링크순서 | 링크아이디 | 사용구분 | 등록아이디 | 등록일자 | 시외버스추가경로구분 | |
---|---|---|---|---|---|---|---|
노선아이디 | 1.000 | 0.374 | 0.498 | 0.357 | 0.960 | 0.528 | 0.096 |
링크순서 | 0.374 | 1.000 | 0.187 | 0.187 | 0.386 | 0.186 | 0.041 |
링크아이디 | 0.498 | 0.187 | 1.000 | 0.316 | 0.753 | 0.349 | 0.129 |
사용구분 | 0.357 | 0.187 | 0.316 | 1.000 | 0.896 | NaN | 0.032 |
등록아이디 | 0.960 | 0.386 | 0.753 | 0.896 | 1.000 | 0.893 | 0.788 |
등록일자 | 0.528 | 0.186 | 0.349 | NaN | 0.893 | 1.000 | 0.204 |
시외버스추가경로구분 | 0.096 | 0.041 | 0.129 | 0.032 | 0.788 | 0.204 | 1.000 |
사용구분 | 시외버스추가경로구분 | |
---|---|---|
사용구분 | 1.000 | 0.053 |
시외버스추가경로구분 | 0.053 | 1.000 |
노선아이디 | 링크순서 | 링크아이디 | 등록일자 | 사용구분 | 시외버스추가경로구분 | |
---|---|---|---|---|---|---|
노선아이디 | 1.000 | 0.193 | 0.205 | -0.293 | 0.229 | 0.074 |
링크순서 | 0.193 | 1.000 | -0.106 | -0.019 | 0.113 | 0.032 |
링크아이디 | 0.205 | -0.106 | 1.000 | -0.080 | 0.146 | 0.129 |
등록일자 | -0.293 | -0.019 | -0.080 | 1.000 | 0.702 | 0.158 |
사용구분 | 0.229 | 0.113 | 0.146 | 0.702 | 1.000 | 0.053 |
시외버스추가경로구분 | 0.074 | 0.032 | 0.129 | 0.158 | 0.053 | 1.000 |
노선아이디 | 링크순서 | 링크아이디 | 사용구분 | 등록아이디 | 등록일자 | 시외버스추가경로구분 | |
---|---|---|---|---|---|---|---|
20072 | 241006799 | 81 | 2280182500 | Y | kdbus01 | 20210312163443 | 0 |
42380 | 200000265 | 4 | 2000020800 | Y | suwon345 | 20230328135443 | 0 |
44126 | 241006870 | 174 | 1240029100 | 9 | 0 | 20190830185541 | 0 |
13725 | 241005570 | 138 | 2050043100 | 9 | 0 | 20190830185541 | 0 |
13880 | 233000281 | 21 | 2330154100 | Y | kd0112 | 20230830153749 | 0 |
16020 | 219000004 | 108 | 1180016900 | Y | crowkyh | 20230420112158 | 0 |
16248 | 214000193 | 18 | 2140115900 | Y | pt8556 | 20230406164433 | 0 |
3656 | 231000135 | 50 | 2310010700 | Y | chh72018 | 20230907125904 | 0 |
31421 | 234001333 | 77 | 2300114100 | 9 | 0 | 20190830185541 | 0 |
27615 | 222000129 | 50 | 2210038100 | 0 | 0 | 20190830185541 | 0 |
노선아이디 | 링크순서 | 링크아이디 | 사용구분 | 등록아이디 | 등록일자 | 시외버스추가경로구분 | |
---|---|---|---|---|---|---|---|
45677 | 234001570 | 126 | 2330141400 | Y | kd0112 | 20230330100741 | 0 |
34616 | 241006837 | 500 | 2260058700 | 0 | 0 | 20190830185541 | 0 |
23184 | 241006270 | 37 | 2240235700 | 9 | 0 | 20190830185541 | 0 |
25211 | 204000139 | 95 | 1230009200 | Y | sncbbgy | 20230719141349 | 0 |
1778 | 204000158 | 14 | 1210030000 | Y | kd0004 | 20210419132415 | 0 |
4196 | 210000013 | 85 | 1150011600 | Y | soshin11 | 20230608130353 | 0 |
51737 | 229000153 | 118 | 2290083200 | 9 | 0 | 20190830185541 | 0 |
47875 | 241000040 | 152 | 2310148800 | Y | ky7266 | 20200219153506 | 0 |
36427 | 226000032 | 14 | 2260072400 | Y | hagi2205 | 20230522140908 | 0 |
30030 | 229000035 | 195 | 2290136000 | Y | 4110400 | 20230418104933 | 0 |