Overview

Dataset statistics

Number of variables: 3
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 341.8 KiB
Average record size in memory: 35.0 B
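
The average record size can be checked against the total memory figure by dividing by the number of observations. A minimal sketch, assuming 1 KiB = 1024 bytes; the variable names are illustrative only:

```python
# Values copied from the dataset statistics.
total_kib = 341.8      # total size in memory, in KiB
n_rows = 10_000        # number of observations

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_kib * 1024 / n_rows

print(f"{avg_record_bytes:.1f} B")  # prints "35.0 B"
```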

Variable types

Numeric: 3

Dataset

Description: Gyeonggi-do BMS company name and route authorization history information (경기도_BMS 업체명 노선 인가 이력 정보)
Author: Gyeonggi-do (경기도)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=R7GDUI51PSS9APNULJ0O34305149&infSeq=1

Reproduction

Analysis started: 2023-12-10 21:42:51.707250
Analysis finished: 2023-12-10 21:42:52.903778
Duration: 1.2 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

이력ID (history ID)
Real number (ℝ)

Distinct: 9555
Distinct (%): 95.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.3810271 × 10^9
Minimum: 1 × 10^9
Maximum: 2.0001599 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1 × 10^9
5th percentile: 1.0000046 × 10^9
Q1: 1.0000565 × 10^9
Median: 1.0003664 × 10^9
Q3: 2.000056 × 10^9
95th percentile: 2.0001458 × 10^9
Maximum: 2.0001599 × 10^9
Range: 1.0001599 × 10^9
Interquartile range (IQR): 9.9999951 × 10^8

Descriptive statistics

Standard deviation: 4.8559789 × 10^8
Coefficient of variation (CV): 0.35162083
Kurtosis: -1.7596707
Mean: 1.3810271 × 10^9
Median Absolute Deviation (MAD): 346205
Skewness: 0.49059243
Sum: 1.3810271 × 10^13
Variance: 2.3580531 × 10^17
Monotonicity: Not monotonic
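
Several of these descriptive statistics are tied together by simple identities: the CV is the standard deviation divided by the mean, the variance is the standard deviation squared, and the sum is the mean times the number of observations. The sketch below re-derives them from the rounded values reported for 이력ID. It is a consistency check only, not a description of how the profiling tool computes them, and small differences from rounding are expected.

```python
import math

# Rounded values reported for 이력ID.
n = 10_000
mean = 1.3810271e9
std = 4.8559789e8

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance
total = mean * n       # sum of all values

# Compare against the reported figures, allowing for rounding.
print(math.isclose(cv, 0.35162083, rel_tol=1e-6))
print(math.isclose(variance, 2.3580531e17, rel_tol=1e-6))
print(math.isclose(total, 1.3810271e13, rel_tol=1e-9))
```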
Histogram with fixed size bins (bins=50)
Common values

Value                 Count   Frequency (%)
1000000000            406     4.1%
2000133454            16      0.2%
1000444458            6       0.1%
1000455055            5       0.1%
1000382367            4       < 0.1%
1000476262            3       < 0.1%
1000249092            3       < 0.1%
1000029170            3       < 0.1%
2000157582            2       < 0.1%
1000114033            2       < 0.1%
Other values (9545)   9550    95.5%
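
The "Other values" row can be derived from the ten listed counts: it covers every row and every distinct value that is not listed. The sketch below redoes that arithmetic for 이력ID. The `pct` helper and its "< 0.1%" cutoff are assumptions inferred from how the table displays small shares, not taken from the profiling tool.

```python
n = 10_000       # number of observations
distinct = 9555  # distinct values reported for 이력ID

# The ten listed values and their counts.
top10 = {
    1000000000: 406, 2000133454: 16, 1000444458: 6, 1000455055: 5,
    1000382367: 4, 1000476262: 3, 1000249092: 3, 1000029170: 3,
    2000157582: 2, 1000114033: 2,
}

listed_rows = sum(top10.values())        # rows covered by the listed values
other_rows = n - listed_rows             # rows in "Other values"
other_distinct = distinct - len(top10)   # distinct values in "Other values"


def pct(count, total):
    """Format a share the way the table appears to (assumed rule)."""
    share = 100 * count / total
    return "< 0.1%" if share < 0.05 else f"{share:.1f}%"


print(other_distinct, other_rows, pct(other_rows, n))
```

With the counts above, this gives 9545 distinct values and 9550 rows, matching the last row of the table.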
Minimum 10 values

Value                 Count   Frequency (%)
1000000000            406     4.1%
1000001124            1       < 0.1%
1000001431            1       < 0.1%
1000001442            1       < 0.1%
1000001448            1       < 0.1%
1000001501            1       < 0.1%
1000001532            1       < 0.1%
1000001693            1       < 0.1%
1000001700            1       < 0.1%
1000001701            1       < 0.1%
Maximum 10 values

Value                 Count   Frequency (%)
2000159865            1       < 0.1%
2000159858            1       < 0.1%
2000159857            1       < 0.1%
2000159805            1       < 0.1%
2000159801            1       < 0.1%
2000159754            1       < 0.1%
2000159752            1       < 0.1%
2000159751            1       < 0.1%
2000159726            1       < 0.1%
2000159718            1       < 0.1%

노선ID (route ID)
Real number (ℝ)

Distinct: 3382
Distinct (%): 33.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.2305148 × 10^8
Minimum: 2.0000001 × 10^8
Maximum: 2.49 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2.0000001 × 10^8
5th percentile: 2.0000014 × 10^8
Q1: 2.1300001 × 10^8
Median: 2.2800002 × 10^8
Q3: 2.3400068 × 10^8
95th percentile: 2.4000008 × 10^8
Maximum: 2.49 × 10^8
Range: 48999995
Interquartile range (IQR): 21000672

Descriptive statistics

Standard deviation: 12534146
Coefficient of variation (CV): 0.056193961
Kurtosis: -1.1634992
Mean: 2.2305148 × 10^8
Median Absolute Deviation (MAD): 8000187.5
Skewness: -0.39579036
Sum: 2.2305148 × 10^12
Variance: 1.5710482 × 10^14
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                 Count   Frequency (%)
210000027             20      0.2%
232000067             19      0.2%
236000052             18      0.2%
208000008             18      0.2%
233000087             17      0.2%
236000048             17      0.2%
210000073             17      0.2%
212000005             17      0.2%
229000039             17      0.2%
213000016             16      0.2%
Other values (3372)   9824    98.2%
Minimum 10 values

Value                 Count   Frequency (%)
200000006             6       0.1%
200000008             8       0.1%
200000009             4       < 0.1%
200000010             11      0.1%
200000012             3       < 0.1%
200000013             4       < 0.1%
200000014             1       < 0.1%
200000015             5       0.1%
200000016             4       < 0.1%
200000017             4       < 0.1%
Maximum 10 values

Value                 Count   Frequency (%)
249000001             1       < 0.1%
241106430             1       < 0.1%
241103250             1       < 0.1%
241102280             1       < 0.1%
241101640             1       < 0.1%
241100080             2       < 0.1%
241007243             1       < 0.1%
241007242             1       < 0.1%
241007238             1       < 0.1%
241007235             1       < 0.1%

업체ID (company ID)
Real number (ℝ)

Distinct: 178
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4105646.7
Minimum: 4100100
Maximum: 4155500
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 4100100
5th percentile: 4100200
Q1: 4100600
Median: 4102600
Q3: 4104600
95th percentile: 4142300
Maximum: 4155500
Range: 55400
Interquartile range (IQR): 4000
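
The range and IQR follow directly from the other quantiles, and together they hint at how long the upper tail of 업체ID is. The sketch below re-derives both and compares the 95th percentile with the upper Tukey fence (Q3 + 1.5 × IQR). The fence is a common convention brought in for illustration; the report itself does not compute it.

```python
# Quantiles reported for 업체ID.
minimum, q1, q3, p95, maximum = 4100100, 4100600, 4104600, 4142300, 4155500

value_range = maximum - minimum   # 55400, as reported
iqr = q3 - q1                     # 4000, as reported

# Upper Tukey fence: an illustrative convention, not part of the report.
upper_fence = q3 + 1.5 * iqr      # 4110600.0

print(value_range, iqr, upper_fence)
print(p95 > upper_fence)          # True: at least 5% of rows lie above the fence
```

A 95th percentile well above the fence is consistent with the strongly positive skewness reported for this variable.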

Descriptive statistics

Standard deviation: 11043.593
Coefficient of variation (CV): 0.0026898546
Kurtosis: 10.88086
Mean: 4105646.7
Median Absolute Deviation (MAD): 2000
Skewness: 3.4315479
Sum: 4.1056467 × 10^10
Variance: 1.2196094 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                 Count   Frequency (%)
4100200               994     9.9%
4100300               903     9.0%
4103100               395     4.0%
4100700               377     3.8%
4101400               347     3.5%
4100600               347     3.5%
4101800               274     2.7%
4103200               270     2.7%
4102100               266     2.7%
4103600               264     2.6%
Other values (168)    5563    55.6%
Minimum 10 values

Value                 Count   Frequency (%)
4100100               43      0.4%
4100200               994     9.9%
4100300               903     9.0%
4100400               133     1.3%
4100500               237     2.4%
4100600               347     3.5%
4100700               377     3.8%
4100800               106     1.1%
4100900               185     1.8%
4101100               51      0.5%
Maximum 10 values

Value                 Count   Frequency (%)
4155500               1       < 0.1%
4155200               20      0.2%
4155100               5       0.1%
4155000               3       < 0.1%
4154500               9       0.1%
4154400               2       < 0.1%
4153900               9       0.1%
4153600               10      0.1%
4151200               5       0.1%
4151100               5       0.1%

Interactions

[Figures: nine pairwise interaction plots of 이력ID, 노선ID, and 업체ID; images not reproduced here]

Correlations

          이력ID    노선ID    업체ID
이력ID    1.000     0.231     0.170
노선ID    0.231     1.000     0.732
업체ID    0.170     0.732     1.000

          이력ID    노선ID    업체ID
이력ID    1.000     -0.121    0.153
노선ID    -0.121    1.000     -0.156
업체ID    0.153     -0.156    1.000
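
The two matrices give quite different values for the same pairs of columns, for example 0.732 versus -0.156 for 노선ID and 업체ID. The report text does not name the coefficient behind each matrix, so the sketch below makes no claim about which is which. It only illustrates, on invented toy data, that a linear coefficient (Pearson) and a rank-based one (Spearman) can disagree on the same data. The helper names are hypothetical, and the rank helper assumes there are no tied values.

```python
import math


def pearson(x, y):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def ranks(values):
    """Ranks 1..n in ascending order (assumes no ties)."""
    order = sorted(range(len(values)), key=values.__getitem__)
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Toy data (not from the dataset): monotonic but strongly nonlinear.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [v ** 5 for v in x]

print(f"Pearson:  {pearson(x, y):.3f}")
print(f"Spearman: {spearman(x, y):.3f}")
```

On this toy data the rank-based value is 1 (up to floating-point error) because the relationship is perfectly monotonic, while the linear value is clearly lower.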

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion by eye.

Sample

First rows

Index    이력ID        노선ID       업체ID
40981    1000000000    204000108    4122100
27458    1000424223    241100080    4150100
54642    2000114273    200000199    4103600
34770    1000453651    241007034    4150200
43103    2000017563    210000007    4106300
1485     1000248839    234000316    4100300
22566    1000065663    207000080    4101700
49567    1000372369    233000140    4106100
11657    1000025840    215000124    4101400
41504    1000113275    222000129    4107700

Last rows

Index    이력ID        노선ID       업체ID
55482    2000133826    200000265    4103100
13041    2000006628    233000141    4108600
11007    1000040095    234000055    4100200
45249    1000371105    214000050    4104400
24766    1000112313    228000262    4100300
49066    1000342243    234001583    4100300
19682    1000058145    234000553    4100200
6935     1000058858    234000398    4100200
60886    2000071184    210000027    4103200
49148    1000333087    238000025    4103500