Overview

Dataset statistics

Number of variables: 4
Number of observations: 3141
Missing cells: 1
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 107.5 KiB
Average record size in memory: 35.0 B
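
The two derived figures can be checked from the raw counts. The sketch below assumes that the missing-cell share is taken over all cells (observations × variables) and that the average record size is total memory divided by the number of observations; the variable names are illustrative and do not come from the profiling tool.

```python
# Raw figures copied from the dataset statistics.
n_variables = 4
n_observations = 3141
missing_cells = 1
total_memory_kib = 107.5

# Share of missing cells over all cells (assumed definition).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes (1 KiB = 1024 bytes).
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the results agree with the reported "< 0.1%" and "35.0 B".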

Variable types

Numeric: 3
Text: 1

Dataset

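Description: Stop names, service IDs, and X/Y coordinate values for city bus stops in the Cheongju Bus Information System (BIS); 3,141 stops in total, e.g. ADO(에이디오)
Author: Cheongju City, Chungcheongbuk-do (충청북도 청주시)
URL: https://www.data.go.kr/data/15041896/fileData.do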

Alerts

좌표(X) has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 15:48:23.172599
Analysis finished: 2023-12-12 15:48:25.360635
Duration: 2.19 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
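
A report of this kind could be regenerated roughly as follows. This is a sketch, not the configuration actually used: the CSV path, the output path, and the `cp949` encoding are assumptions (the source names none of them), and the exact settings live in the `config.json` listed above. `ProfileReport` and its `to_file` method are part of ydata-profiling 4.x.

```python
def build_profile(csv_path, html_path, encoding="cp949"):
    """Profile a CSV export and write the HTML report.

    csv_path, html_path, and the default encoding are hypothetical;
    adjust them to match the file downloaded from data.go.kr.
    """
    # Imported lazily so the function can be defined even when the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Dataset")
    report.to_file(html_path)
    return report
```

A call such as `build_profile("bus_stops.csv", "report.html")` would then write the report, where both file names are placeholders.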

Variables

서비스ID (service ID)
Real number (ℝ)

Distinct: 3140
Distinct (%): 100.0%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 2581.3847
Minimum: 1000
Maximum: 4375
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 27.7 KiB

Quantile statistics

Minimum: 1000
5-th percentile: 1156.95
Q1: 1785.75
Median: 2570.5
Q3: 3355.25
95-th percentile: 4121.05
Maximum: 4375
Range: 3375
Interquartile range (IQR): 1569.5

Descriptive statistics

Standard deviation: 925.36457
Coefficient of variation (CV): 0.35847604
Kurtosis: -1.1107818
Mean: 2581.3847
Median Absolute Deviation (MAD): 785
Skewness: 0.066728365
Sum: 8105548
Variance: 856299.59
Monotonicity: Not monotonic
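
Several of these figures are functions of the others, which gives a quick consistency check. The sketch below uses the standard definitions (range = maximum − minimum, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared); that the report uses exactly these definitions is an assumption, though the numbers agree.

```python
import math

# Values copied from the 서비스ID statistics.
mean = 2581.3847
std = 925.36457
minimum, maximum = 1000, 4375
q1, q3 = 1785.75, 3355.25

value_range = maximum - minimum   # reported: 3375
iqr = q3 - q1                     # reported: 1569.5
cv = std / mean                   # reported: 0.35847604
variance = std ** 2               # reported: 856299.59

print(value_range, iqr)
print(f"CV = {cv:.8f}, variance = {variance:.2f}")
print(math.isclose(cv, 0.35847604, rel_tol=1e-6))
```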
Histogram with fixed size bins (bins=50)

Common values
Value                Count  Frequency (%)
3043                 1      < 0.1%
2378                 1      < 0.1%
2542                 1      < 0.1%
1768                 1      < 0.1%
1745                 1      < 0.1%
3509                 1      < 0.1%
3510                 1      < 0.1%
3505                 1      < 0.1%
3506                 1      < 0.1%
3503                 1      < 0.1%
Other values (3130)  3130   99.6%

Minimum 10 values
Value  Count  Frequency (%)
1000   1      < 0.1%
1001   1      < 0.1%
1002   1      < 0.1%
1003   1      < 0.1%
1004   1      < 0.1%
1005   1      < 0.1%
1006   1      < 0.1%
1007   1      < 0.1%
1008   1      < 0.1%
1009   1      < 0.1%

Maximum 10 values
Value  Count  Frequency (%)
4375   1      < 0.1%
4374   1      < 0.1%
4373   1      < 0.1%
4372   1      < 0.1%
4371   1      < 0.1%
4370   1      < 0.1%
4369   1      < 0.1%
4368   1      < 0.1%
4367   1      < 0.1%
4366   1      < 0.1%

정류소명 (stop name)
Text

Distinct: 1756
Distinct (%): 55.9%
Missing: 0
Missing (%): 0.0%
Memory size: 24.7 KiB

Length

Max length: 18
Median length: 16
Mean length: 5.7166507
Min length: 2

Characters and Unicode

Total characters: 17956
Distinct characters: 467
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
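
As an illustration of such character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories in one stop name taken from this dataset; it is not the profiling tool's own implementation, and the helper name `category_counts` is made up for this example.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category code (e.g. 'Lo', 'Lu')."""
    return Counter(unicodedata.category(ch) for ch in text)

counts = category_counts("ADO(에이디오)")
print(counts)
```

The two-letter codes map onto the category names used in this report: `Lo` is Other Letter (Hangul syllables fall here), `Lu` Uppercase Letter, `Ll` Lowercase Letter, `Nd` Decimal Number, `Po` Other Punctuation, `Ps` Open Punctuation, `Pe` Close Punctuation, and `Zs` Space Separator. The standard library does not expose the script or block properties, so those would need another source.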

Unique

Unique: 621
Unique (%): 19.8%
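
"Distinct" and "Unique" measure different things. The sketch below assumes that Distinct counts the different values present and Unique counts the values that occur exactly once, which is consistent with 1756 distinct but only 621 unique values among 3141 rows. The helper name is hypothetical, and the input is the five sample rows listed for this variable.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

sample = ["ADO(에이디오)", "CJB컨벤션센터", "CJB컨벤션센터", "GS메디칼", "GS메디칼"]
print(distinct_and_unique(sample))
```

Here three different names appear, but only ADO(에이디오) occurs once; stops on opposite sides of a road typically share a name.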

Sample

1st row: ADO(에이디오)
2nd row: CJB컨벤션센터
3rd row: CJB컨벤션센터
4th row: GS메디칼
5th row: GS메디칼

Value                Count  Frequency (%)
옥산산업단지         10     0.3%
송정2리              8      0.3%
구암                 8      0.3%
시동리               7      0.2%
척산3리              7      0.2%
양지리               7      0.2%
공북2리              6      0.2%
죽암2리              6      0.2%
대련리               6      0.2%
서촌동               6      0.2%
Other values (1756)  3116   97.8%

Most occurring characters

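Value               Count  Frequency (%)
(not preserved)     1232   6.9%
(not preserved)     390    2.2%
(not preserved)     382    2.1%
(not preserved)     360    2.0%
(not preserved)     337    1.9%
(not preserved)     333    1.9%
1                   321    1.8%
(not preserved)     320    1.8%
2                   320    1.8%
(not preserved)     319    1.8%
Other values (457)  13642  76.0%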

Most occurring categories

Value              Count  Frequency (%)
Other Letter       16283  90.7%
Decimal Number     896    5.0%
Other Punctuation  221    1.2%
Close Punctuation  195    1.1%
Open Punctuation   195    1.1%
Uppercase Letter   118    0.7%
Space Separator    46     0.3%
Lowercase Letter   2      < 0.1%

Most frequent character per category

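Other Letter
Value               Count  Frequency (%)
(not preserved)     1232   7.6%
(not preserved)     390    2.4%
(not preserved)     382    2.3%
(not preserved)     360    2.2%
(not preserved)     337    2.1%
(not preserved)     333    2.0%
(not preserved)     320    2.0%
(not preserved)     319    2.0%
(not preserved)     271    1.7%
(not preserved)     249    1.5%
Other values (424)  12090  74.2%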

Uppercase Letter
Value             Count  Frequency (%)
B                 17     14.4%
L                 17     14.4%
S                 17     14.4%
K                 14     11.9%
G                 11     9.3%
H                 8      6.8%
I                 8      6.8%
C                 7      5.9%
T                 4      3.4%
M                 4      3.4%
Other values (5)  11     9.3%

Decimal Number
Value  Count  Frequency (%)
1      321    35.8%
2      320    35.7%
3      129    14.4%
4      41     4.6%
5      27     3.0%
6      18     2.0%
7      11     1.2%
9      11     1.2%
0      9      1.0%
8      9      1.0%

Other Punctuation
Value  Count  Frequency (%)
.      196    88.7%
,      18     8.1%
/      6      2.7%
·      1      0.5%

Close Punctuation
Value  Count  Frequency (%)
)      195    100.0%

Open Punctuation
Value  Count  Frequency (%)
(      195    100.0%

Space Separator
Value    Count  Frequency (%)
(space)  46     100.0%

Lowercase Letter
Value  Count  Frequency (%)
e      2      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  16283  90.7%
Common  1553   8.6%
Latin   120    0.7%

Most frequent character per script

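Hangul
Value               Count  Frequency (%)
(not preserved)     1232   7.6%
(not preserved)     390    2.4%
(not preserved)     382    2.3%
(not preserved)     360    2.2%
(not preserved)     337    2.1%
(not preserved)     333    2.0%
(not preserved)     320    2.0%
(not preserved)     319    2.0%
(not preserved)     271    1.7%
(not preserved)     249    1.5%
Other values (424)  12090  74.2%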

Common
Value             Count  Frequency (%)
1                 321    20.7%
2                 320    20.6%
.                 196    12.6%
)                 195    12.6%
(                 195    12.6%
3                 129    8.3%
(space)           46     3.0%
4                 41     2.6%
5                 27     1.7%
6                 18     1.2%
Other values (7)  65     4.2%

Latin
Value             Count  Frequency (%)
B                 17     14.2%
L                 17     14.2%
S                 17     14.2%
K                 14     11.7%
G                 11     9.2%
H                 8      6.7%
I                 8      6.7%
C                 7      5.8%
T                 4      3.3%
M                 4      3.3%
Other values (6)  13     10.8%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  16283  90.7%
ASCII   1672   9.3%
None    1      < 0.1%

Most frequent character per block

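Hangul
Value               Count  Frequency (%)
(not preserved)     1232   7.6%
(not preserved)     390    2.4%
(not preserved)     382    2.3%
(not preserved)     360    2.2%
(not preserved)     337    2.1%
(not preserved)     333    2.0%
(not preserved)     320    2.0%
(not preserved)     319    2.0%
(not preserved)     271    1.7%
(not preserved)     249    1.5%
Other values (424)  12090  74.2%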

ASCII
Value              Count  Frequency (%)
1                  321    19.2%
2                  320    19.1%
.                  196    11.7%
)                  195    11.7%
(                  195    11.7%
3                  129    7.7%
(space)            46     2.8%
4                  41     2.5%
5                  27     1.6%
6                  18     1.1%
Other values (22)  184    11.0%

None
Value  Count  Frequency (%)
·      1      100.0%

좌표(X) (X coordinate)
Real number (ℝ)

UNIQUE

Distinct: 3141
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 127.44158
Minimum: 127.18135
Maximum: 127.70903
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 27.7 KiB

Quantile statistics

Minimum: 127.18135
5-th percentile: 127.28712
Q1: 127.37058
Median: 127.44854
Q3: 127.50501
95-th percentile: 127.59388
Maximum: 127.70903
Range: 0.5276812
Interquartile range (IQR): 0.1344274

Descriptive statistics

Standard deviation: 0.092034183
Coefficient of variation (CV): 0.0007221676
Kurtosis: -0.52299062
Mean: 127.44158
Median Absolute Deviation (MAD): 0.0631804
Skewness: -0.14030288
Sum: 400294.02
Variance: 0.0084702909
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
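
Fixed-size binning splits the span between the minimum and maximum into equal-width intervals and counts the values in each. The sketch below is a generic illustration of that idea, not the plotting code behind the report; the function name and the choice to place the maximum in the last bin are assumptions.

```python
def fixed_width_histogram(values, bins, lo, hi):
    """Count values into `bins` equal-width bins spanning [lo, hi].

    Assumes every value lies within [lo, hi].
    """
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum value falls into the last bin rather than a new one.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return width, counts

# Rounded minimum and maximum of 좌표(X) as reported in the statistics.
width, _ = fixed_width_histogram([], bins=50, lo=127.18135, hi=127.70903)
print(f"bin width: {width:.6f}")
```

With 50 bins over a range of roughly 0.528, each bin is about 0.0106 wide.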

Common values
Value                Count  Frequency (%)
127.421797           1      < 0.1%
127.596854           1      < 0.1%
127.5964679          1      < 0.1%
127.4885414          1      < 0.1%
127.4882765          1      < 0.1%
127.4279369          1      < 0.1%
127.4280616          1      < 0.1%
127.4286665          1      < 0.1%
127.4287288          1      < 0.1%
127.4291561          1      < 0.1%
Other values (3131)  3131   99.7%

Minimum 10 values
Value        Count  Frequency (%)
127.1813507  1      < 0.1%
127.1819939  1      < 0.1%
127.1917791  1      < 0.1%
127.1920221  1      < 0.1%
127.196523   1      < 0.1%
127.197339   1      < 0.1%
127.2035991  1      < 0.1%
127.2036302  1      < 0.1%
127.2066953  1      < 0.1%
127.2068663  1      < 0.1%

Maximum 10 values
Value        Count  Frequency (%)
127.7090319  1      < 0.1%
127.6634882  1      < 0.1%
127.663201   1      < 0.1%
127.6590481  1      < 0.1%
127.6589231  1      < 0.1%
127.6583206  1      < 0.1%
127.6578371  1      < 0.1%
127.6578251  1      < 0.1%
127.657717   1      < 0.1%
127.6562954  1      < 0.1%

좌표(Y) (Y coordinate)
Real number (ℝ)

Distinct: 3140
Distinct (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36.630244
Minimum: 36.332854
Maximum: 36.860917
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 27.7 KiB

Quantile statistics

Minimum: 36.332854
5-th percentile: 36.495997
Q1: 36.58643
Median: 36.629718
Q3: 36.671641
95-th percentile: 36.758231
Maximum: 36.860917
Range: 0.52806233
Interquartile range (IQR): 0.08521015

Descriptive statistics

Standard deviation: 0.077289654
Coefficient of variation (CV): 0.0021099956
Kurtosis: 0.67324412
Mean: 36.630244
Median Absolute Deviation (MAD): 0.0429397
Skewness: -0.19939085
Sum: 115055.6
Variance: 0.0059736907
Monotonicity: Not monotonic
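
The median absolute deviation is the median of the absolute distances from the median. The sketch below assumes the unscaled definition (no 1.4826 consistency factor); the reported MAD of 0.0429397 is close to half the IQR of 0.08521015, which is what the unscaled form gives for roughly symmetric data. The input list is toy data, not drawn from the dataset.

```python
from statistics import median

def median_absolute_deviation(values):
    """Unscaled MAD: median of |x - median(x)|."""
    values = list(values)
    m = median(values)
    return median([abs(v - m) for v in values])

# Toy data: the outlier 100 barely affects the result.
print(median_absolute_deviation([1, 2, 3, 4, 100]))
```

Because it is built on medians, the MAD is far less sensitive to a few extreme coordinates than the standard deviation.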
Histogram with fixed size bins (bins=50)

Common values
Value                Count  Frequency (%)
36.63486487          2      0.1%
36.6997317           1      < 0.1%
36.50005041          1      < 0.1%
36.49560859          1      < 0.1%
36.59009378          1      < 0.1%
36.58987564          1      < 0.1%
36.50143717          1      < 0.1%
36.5012569           1      < 0.1%
36.51509218          1      < 0.1%
36.51386816          1      < 0.1%
Other values (3130)  3130   99.6%

Minimum 10 values
Value        Count  Frequency (%)
36.33285421  1      < 0.1%
36.33337437  1      < 0.1%
36.33369601  1      < 0.1%
36.33378137  1      < 0.1%
36.34067481  1      < 0.1%
36.34122138  1      < 0.1%
36.34354114  1      < 0.1%
36.34392016  1      < 0.1%
36.34610407  1      < 0.1%
36.34656406  1      < 0.1%

Maximum 10 values
Value        Count  Frequency (%)
36.86091654  1      < 0.1%
36.86059604  1      < 0.1%
36.85993352  1      < 0.1%
36.85986846  1      < 0.1%
36.85861038  1      < 0.1%
36.85796831  1      < 0.1%
36.85680151  1      < 0.1%
36.85562399  1      < 0.1%
36.85474984  1      < 0.1%
36.85460669  1      < 0.1%

Interactions

[Figures: nine pairwise interaction plots for 서비스ID, 좌표(X), and 좌표(Y); images not preserved]

Correlations

          서비스ID  좌표(X)  좌표(Y)
서비스ID  1.000     0.767    0.715
좌표(X)   0.767     1.000    0.551
좌표(Y)   0.715     0.551    1.000

          서비스ID  좌표(X)  좌표(Y)
서비스ID  1.000     -0.126   -0.128
좌표(X)   -0.126    1.000    0.099
좌표(Y)   -0.128    0.099    1.000
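
The source does not say which coefficients the two matrices contain. Purely as an illustration of how one common coefficient is computed, the sketch below implements Pearson's r from its definition; it is not claimed to be the method behind either matrix, and the function name is made up for this example.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # Sum of co-deviations, and the root sum of squared deviations per series.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    dev_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    dev_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (dev_x * dev_y)

print(pearson_r([1, 2, 3], [2, 4, 6]))   # perfectly increasing relationship
print(pearson_r([1, 2, 3], [3, 2, 1]))   # perfectly decreasing relationship
```

Any such coefficient is symmetric in its two arguments and equals 1 for a variable paired with itself, which is why both matrices are symmetric with 1.000 on the diagonal.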

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Row  서비스ID  정류소명             좌표(X)     좌표(Y)
0    3043      ADO(에이디오)        127.421797  36.699732
1    1673      CJB컨벤션센터        127.469359  36.619056
2    1670      CJB컨벤션센터        127.469152  36.618715
3    3269      GS메디칼             127.330321  36.64389
4    3268      GS메디칼             127.329457  36.64396
5    3266      JPI헬스케어          127.325559  36.64302
6    2074      KBS                  127.455413  36.619699
7    2073      KBS                  127.455507  36.619439
8    2019      KBS                  127.457998  36.619244
9    1943      LG.SK하이닉스기숙사  127.453898  36.648627

Last rows
Row   서비스ID  정류소명               좌표(X)     좌표(Y)
3131  2405      휴암동.푸르미환경공원  127.403892  36.62134
3132  1969      흥덕고등학교           127.424652  36.636237
3133  1176      흥덕구청               127.374543  36.628406
3134  1175      흥덕구청               127.374712  36.62819
3135  2849      흥덕구청종점           127.375034  36.628946
3136  1591      흥덕대교               127.479623  36.645577
3137  1590      흥덕대교               127.479566  36.645833
3138  1594      흥덕사지.충청매일      127.474177  36.643263
3139  1945      흥덕새마을금고         127.457354  36.65013
3140  3234      힐데스하임아파트       127.322414  36.633821