Overview

Dataset statistics

Number of variables6
Number of observations4080
Missing cells1256
Missing cells (%)5.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory203.3 KiB
Average record size in memory51.0 B

Variable types

Numeric3
Text2
Categorical1

Dataset

Description제주특별자치도에서 관리하는 버스정류소에 대한 데이터로 정류소아이디, 정류소명, 경도, 위도, 위치 정보를 제공합니다.
Author제주특별자치도
URLhttps://www.data.go.kr/data/15010850/fileData.do

Alerts

데이터 기준일자 has constant value ""Constant
정류소아이디 is highly overall correlated with 위도High correlation
위도 is highly overall correlated with 정류소아이디High correlation
위치정보(주변설명) has 1252 (30.7%) missing valuesMissing

Reproduction

Analysis started2023-12-12 13:51:21.715061
Analysis finished2023-12-12 13:51:24.054736
Duration2.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

정류소아이디
Real number (ℝ)

HIGH CORRELATION 

Distinct4079
Distinct (%)100.0%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean4.0543415 × 108
Minimum4.05 × 108
Maximum4.0600206 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size36.0 KiB
2023-12-12T22:51:24.149676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.05 × 108
5-th percentile4.0500023 × 108
Q14.0500119 × 108
median4.0500233 × 108
Q34.0600089 × 108
95-th percentile4.0600184 × 108
Maximum4.0600206 × 108
Range1002058
Interquartile range (IQR)999704

Descriptive statistics

Standard deviation495399.39
Coefficient of variation (CV)0.0012218985
Kurtosis-1.9276343
Mean4.0543415 × 108
Median Absolute Deviation (MAD)2022
Skewness0.27074501
Sum1.6537659 × 1012
Variance2.4542056 × 1011
MonotonicityNot monotonic
2023-12-12T22:51:24.321250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
406001167 1
 
< 0.1%
405001033 1
 
< 0.1%
405000596 1
 
< 0.1%
405000176 1
 
< 0.1%
405002494 1
 
< 0.1%
405002495 1
 
< 0.1%
406000195 1
 
< 0.1%
406000194 1
 
< 0.1%
405000101 1
 
< 0.1%
405000100 1
 
< 0.1%
Other values (4069) 4069
99.7%
ValueCountFrequency (%)
405000001 1
< 0.1%
405000002 1
< 0.1%
405000003 1
< 0.1%
405000004 1
< 0.1%
405000005 1
< 0.1%
405000006 1
< 0.1%
405000007 1
< 0.1%
405000009 1
< 0.1%
405000010 1
< 0.1%
405000011 1
< 0.1%
ValueCountFrequency (%)
406002059 1
< 0.1%
406002058 1
< 0.1%
406002057 1
< 0.1%
406002056 1
< 0.1%
406002055 1
< 0.1%
406002054 1
< 0.1%
406002053 1
< 0.1%
406002052 1
< 0.1%
406002051 1
< 0.1%
406002050 1
< 0.1%
Distinct2181
Distinct (%)53.5%
Missing1
Missing (%)< 0.1%
Memory size32.0 KiB
2023-12-12T22:51:24.576597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length5.7820544
Min length2

Characters and Unicode

Total characters23585
Distinct characters547
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique505 ?
Unique (%)12.4%

Sample

1st row(구)구판장
2nd row(구)구판장
3rd row(구)삼양검문소
4th row(구)삼양검문소
5th row(구)중앙파출소
ValueCountFrequency (%)
입구 102
 
2.1%
사거리 19
 
0.4%
18
 
0.4%
하천리 11
 
0.2%
귀덕3리 10
 
0.2%
수산2리 10
 
0.2%
삼거리 10
 
0.2%
하도리 10
 
0.2%
신안동 10
 
0.2%
제주 9
 
0.2%
Other values (2245) 4543
95.6%
2023-12-12T22:51:25.058523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
981
 
4.2%
977
 
4.1%
701
 
3.0%
536
 
2.3%
442
 
1.9%
400
 
1.7%
376
 
1.6%
372
 
1.6%
336
 
1.4%
335
 
1.4%
Other values (537) 18129
76.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22009
93.3%
Space Separator 701
 
3.0%
Decimal Number 449
 
1.9%
Close Punctuation 145
 
0.6%
Open Punctuation 145
 
0.6%
Other Punctuation 77
 
0.3%
Uppercase Letter 59
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
981
 
4.5%
977
 
4.4%
536
 
2.4%
442
 
2.0%
400
 
1.8%
376
 
1.7%
372
 
1.7%
336
 
1.5%
335
 
1.5%
331
 
1.5%
Other values (503) 16923
76.9%
Uppercase Letter
ValueCountFrequency (%)
S 9
15.3%
C 9
15.3%
I 6
10.2%
L 6
10.2%
H 5
8.5%
G 4
6.8%
M 4
6.8%
N 3
 
5.1%
B 2
 
3.4%
K 2
 
3.4%
Other values (7) 9
15.3%
Decimal Number
ValueCountFrequency (%)
1 164
36.5%
2 135
30.1%
3 70
15.6%
0 20
 
4.5%
6 16
 
3.6%
4 16
 
3.6%
9 14
 
3.1%
5 10
 
2.2%
8 3
 
0.7%
7 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
/ 40
51.9%
, 30
39.0%
. 5
 
6.5%
· 2
 
2.6%
Space Separator
ValueCountFrequency (%)
701
100.0%
Close Punctuation
ValueCountFrequency (%)
) 145
100.0%
Open Punctuation
ValueCountFrequency (%)
( 145
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22009
93.3%
Common 1517
 
6.4%
Latin 59
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
981
 
4.5%
977
 
4.4%
536
 
2.4%
442
 
2.0%
400
 
1.8%
376
 
1.7%
372
 
1.7%
336
 
1.5%
335
 
1.5%
331
 
1.5%
Other values (503) 16923
76.9%
Common
ValueCountFrequency (%)
701
46.2%
1 164
 
10.8%
) 145
 
9.6%
( 145
 
9.6%
2 135
 
8.9%
3 70
 
4.6%
/ 40
 
2.6%
, 30
 
2.0%
0 20
 
1.3%
6 16
 
1.1%
Other values (7) 51
 
3.4%
Latin
ValueCountFrequency (%)
S 9
15.3%
C 9
15.3%
I 6
10.2%
L 6
10.2%
H 5
8.5%
G 4
6.8%
M 4
6.8%
N 3
 
5.1%
B 2
 
3.4%
K 2
 
3.4%
Other values (7) 9
15.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22009
93.3%
ASCII 1574
 
6.7%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
981
 
4.5%
977
 
4.4%
536
 
2.4%
442
 
2.0%
400
 
1.8%
376
 
1.7%
372
 
1.7%
336
 
1.5%
335
 
1.5%
331
 
1.5%
Other values (503) 16923
76.9%
ASCII
ValueCountFrequency (%)
701
44.5%
1 164
 
10.4%
) 145
 
9.2%
( 145
 
9.2%
2 135
 
8.6%
3 70
 
4.4%
/ 40
 
2.5%
, 30
 
1.9%
0 20
 
1.3%
6 16
 
1.0%
Other values (23) 108
 
6.9%
None
ValueCountFrequency (%)
· 2
100.0%

경도
Real number (ℝ)

Distinct4056
Distinct (%)99.4%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean126.53774
Minimum126.16504
Maximum126.93516
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size36.0 KiB
2023-12-12T22:51:25.197983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.16504
5-th percentile126.24273
Q1126.40081
median126.53888
Q3126.67166
95-th percentile126.86161
Maximum126.93516
Range0.77012
Interquartile range (IQR)0.2708525

Descriptive statistics

Standard deviation0.18736292
Coefficient of variation (CV)0.0014806881
Kurtosis-0.77328766
Mean126.53774
Median Absolute Deviation (MAD)0.135523
Skewness0.0740184
Sum516147.44
Variance0.035104864
MonotonicityNot monotonic
2023-12-12T22:51:25.331661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
126.56115 2
 
< 0.1%
126.55038 2
 
< 0.1%
126.56835 2
 
< 0.1%
126.331967 2
 
< 0.1%
126.609549 2
 
< 0.1%
126.817072 2
 
< 0.1%
126.199101 2
 
< 0.1%
126.560435 2
 
< 0.1%
126.297133 2
 
< 0.1%
126.4268 2
 
< 0.1%
Other values (4046) 4059
99.5%
ValueCountFrequency (%)
126.165039 1
< 0.1%
126.165121 1
< 0.1%
126.16618 1
< 0.1%
126.167481 1
< 0.1%
126.167517 1
< 0.1%
126.167633 1
< 0.1%
126.167831 1
< 0.1%
126.168738 1
< 0.1%
126.168835 1
< 0.1%
126.169256 1
< 0.1%
ValueCountFrequency (%)
126.935159 1
< 0.1%
126.934898 1
< 0.1%
126.934417 1
< 0.1%
126.933767 1
< 0.1%
126.933033 1
< 0.1%
126.931317 1
< 0.1%
126.931267 1
< 0.1%
126.93008 1
< 0.1%
126.92994 1
< 0.1%
126.92955 1
< 0.1%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct4013
Distinct (%)98.4%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean33.391304
Minimum33.208351
Maximum33.558401
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size36.0 KiB
2023-12-12T22:51:25.486156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum33.208351
5-th percentile33.2447
Q133.287712
median33.418633
Q333.485458
95-th percentile33.521263
Maximum33.558401
Range0.35005
Interquartile range (IQR)0.197745

Descriptive statistics

Standard deviation0.10175315
Coefficient of variation (CV)0.0030472947
Kurtosis-1.5188786
Mean33.391304
Median Absolute Deviation (MAD)0.087392
Skewness-0.17268149
Sum136203.13
Variance0.010353703
MonotonicityNot monotonic
2023-12-12T22:51:25.652873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33.491287 3
 
0.1%
33.4897 3
 
0.1%
33.496007 2
 
< 0.1%
33.275267 2
 
< 0.1%
33.287967 2
 
< 0.1%
33.250679 2
 
< 0.1%
33.249335 2
 
< 0.1%
33.361436 2
 
< 0.1%
33.264187 2
 
< 0.1%
33.324151 2
 
< 0.1%
Other values (4003) 4057
99.4%
ValueCountFrequency (%)
33.208351 1
< 0.1%
33.208467 1
< 0.1%
33.210974 1
< 0.1%
33.211189 1
< 0.1%
33.212011 1
< 0.1%
33.212295 1
< 0.1%
33.216521 1
< 0.1%
33.217541 1
< 0.1%
33.218455 1
< 0.1%
33.218752 1
< 0.1%
ValueCountFrequency (%)
33.558401 1
< 0.1%
33.558294 1
< 0.1%
33.556167 1
< 0.1%
33.555959 1
< 0.1%
33.555819 1
< 0.1%
33.555643 1
< 0.1%
33.555522 1
< 0.1%
33.555439 1
< 0.1%
33.555307 1
< 0.1%
33.555302 1
< 0.1%
Distinct2695
Distinct (%)95.3%
Missing1252
Missing (%)30.7%
Memory size32.0 KiB
2023-12-12T22:51:25.945225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length27
Mean length9.5452617
Min length2

Characters and Unicode

Total characters26994
Distinct characters682
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2602 ?
Unique (%)92.0%

Sample

1st row세화1리노인회관입구
2nd row세화1리노인회관입구
3rd row자연농원 우측
4th row조천읍이정표 좌측
5th row윤성현내과의원 건너편
ValueCountFrequency (%)
780
 
13.3%
방향 339
 
5.8%
맞은편 299
 
5.1%
우측 298
 
5.1%
좌측 238
 
4.1%
건너편 160
 
2.7%
입구 77
 
1.3%
단독주택 46
 
0.8%
방면 33
 
0.6%
이정표 20
 
0.3%
Other values (2866) 3568
60.9%
2023-12-12T22:51:26.353476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3142
 
11.6%
1044
 
3.9%
739
 
2.7%
641
 
2.4%
605
 
2.2%
597
 
2.2%
558
 
2.1%
443
 
1.6%
427
 
1.6%
367
 
1.4%
Other values (672) 18431
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 22247
82.4%
Space Separator 3142
 
11.6%
Decimal Number 692
 
2.6%
Uppercase Letter 252
 
0.9%
Open Punctuation 226
 
0.8%
Close Punctuation 225
 
0.8%
Other Punctuation 147
 
0.5%
Lowercase Letter 47
 
0.2%
Dash Punctuation 16
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1044
 
4.7%
739
 
3.3%
641
 
2.9%
605
 
2.7%
597
 
2.7%
558
 
2.5%
443
 
2.0%
427
 
1.9%
367
 
1.6%
346
 
1.6%
Other values (614) 16480
74.1%
Uppercase Letter
ValueCountFrequency (%)
S 36
14.3%
G 35
13.9%
C 26
10.3%
K 20
 
7.9%
L 16
 
6.3%
U 14
 
5.6%
M 13
 
5.2%
B 10
 
4.0%
R 9
 
3.6%
T 9
 
3.6%
Other values (13) 64
25.4%
Lowercase Letter
ValueCountFrequency (%)
r 6
12.8%
a 6
12.8%
e 5
10.6%
k 4
 
8.5%
u 3
 
6.4%
o 3
 
6.4%
l 3
 
6.4%
h 3
 
6.4%
t 2
 
4.3%
s 2
 
4.3%
Other values (7) 10
21.3%
Decimal Number
ValueCountFrequency (%)
1 183
26.4%
2 137
19.8%
3 73
 
10.5%
5 66
 
9.5%
0 54
 
7.8%
6 50
 
7.2%
4 40
 
5.8%
9 34
 
4.9%
7 28
 
4.0%
8 27
 
3.9%
Other Punctuation
ValueCountFrequency (%)
. 137
93.2%
/ 7
 
4.8%
: 2
 
1.4%
& 1
 
0.7%
Space Separator
ValueCountFrequency (%)
3142
100.0%
Open Punctuation
ValueCountFrequency (%)
( 226
100.0%
Close Punctuation
ValueCountFrequency (%)
) 225
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 22246
82.4%
Common 4448
 
16.5%
Latin 299
 
1.1%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1044
 
4.7%
739
 
3.3%
641
 
2.9%
605
 
2.7%
597
 
2.7%
558
 
2.5%
443
 
2.0%
427
 
1.9%
367
 
1.6%
346
 
1.6%
Other values (613) 16479
74.1%
Latin
ValueCountFrequency (%)
S 36
 
12.0%
G 35
 
11.7%
C 26
 
8.7%
K 20
 
6.7%
L 16
 
5.4%
U 14
 
4.7%
M 13
 
4.3%
B 10
 
3.3%
R 9
 
3.0%
T 9
 
3.0%
Other values (30) 111
37.1%
Common
ValueCountFrequency (%)
3142
70.6%
( 226
 
5.1%
) 225
 
5.1%
1 183
 
4.1%
2 137
 
3.1%
. 137
 
3.1%
3 73
 
1.6%
5 66
 
1.5%
0 54
 
1.2%
6 50
 
1.1%
Other values (8) 155
 
3.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 22246
82.4%
ASCII 4747
 
17.6%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3142
66.2%
( 226
 
4.8%
) 225
 
4.7%
1 183
 
3.9%
2 137
 
2.9%
. 137
 
2.9%
3 73
 
1.5%
5 66
 
1.4%
0 54
 
1.1%
6 50
 
1.1%
Other values (48) 454
 
9.6%
Hangul
ValueCountFrequency (%)
1044
 
4.7%
739
 
3.3%
641
 
2.9%
605
 
2.7%
597
 
2.7%
558
 
2.5%
443
 
2.0%
427
 
1.9%
367
 
1.6%
346
 
1.6%
Other values (613) 16479
74.1%
CJK
ValueCountFrequency (%)
1
100.0%

데이터 기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size32.0 KiB
2022-11-01
4080 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-11-01
2nd row2022-11-01
3rd row2022-11-01
4th row2022-11-01
5th row2022-11-01

Common Values

ValueCountFrequency (%)
2022-11-01 4080
100.0%

Length

2023-12-12T22:51:26.496503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:51:26.592868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-11-01 4080
100.0%

Interactions

2023-12-12T22:51:23.246971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:51:22.438617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:51:22.840847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:51:23.377683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:51:22.567721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:51:22.968423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:51:23.527783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:51:22.699443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:51:23.095109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:51:26.658361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정류소아이디경도위도
정류소아이디1.0000.3940.957
경도0.3941.0000.727
위도0.9570.7271.000
2023-12-12T22:51:26.740677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정류소아이디경도위도
정류소아이디1.0000.208-0.682
경도0.2081.0000.264
위도-0.6820.2641.000

Missing values

2023-12-12T22:51:23.708047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:51:23.841168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T22:51:23.976537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

정류소아이디정류소명경도위도위치정보(주변설명)데이터 기준일자
0406001167(구)구판장126.79805133.329675세화1리노인회관입구2022-11-01
1406001168(구)구판장126.79787933.329732세화1리노인회관입구2022-11-01
2405000186(구)삼양검문소126.60003433.521022자연농원 우측2022-11-01
3405000185(구)삼양검문소126.60084733.520956조천읍이정표 좌측2022-11-01
4406000324(구)중앙파출소126.56089433.247598윤성현내과의원 건너편2022-11-01
5406000467(구)중앙파출소126.56036733.247M-STAY제주호텔 앞2022-11-01
6406001176(구)중앙파출소126.5608933.247515에이스모텔(후문) 앞2022-11-01
7406001837(구)화산초등학교126.79726633.326228<NA>2022-11-01
8406000607(구)화산초등학교126.79724333.326291세화3리 방향2022-11-01
94060018941100고지휴게소126.46294233.35782<NA>2022-11-01
정류소아이디정류소명경도위도위치정보(주변설명)데이터 기준일자
4070406001667흙담솔사가126.55240233.258853<NA>2022-11-01
4071406001668흙담솔사가126.55227233.258707<NA>2022-11-01
4072406000295흙통126.58227733.26191탐라주유소 건너편2022-11-01
4073406000294흙통126.58290333.262391토평교회좌측2022-11-01
4074405001115흥국사126.3777433.452319용흥3길이정표앞2022-11-01
4075405001116흥국사126.37840533.452615납읍방향.용흥3길 입구2022-11-01
4076406001766흥덕사126.87606633.399128<NA>2022-11-01
4077406001767흥덕사126.87600533.399031<NA>2022-11-01
4078406001055희진주유소126.87364433.37661희진주유소 건너편2022-11-01
4079406001056희진주유소126.87403733.377568희진주유소 앞2022-11-01