Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 1
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 664.1 KiB
Average record size in memory: 68.0 B
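
The derived figures in this table follow from the raw counts. The sketch below only re-does that arithmetic with the numbers reported above; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 7
n_observations = 10_000
missing_cells = 1
total_size_kib = 664.1

# Every observation has one cell per variable.
total_cells = n_variables * n_observations

# Share of missing cells, in percent (the report rounds this to "< 0.1%").
missing_pct = 100 * missing_cells / total_cells

# Average record size in bytes: total memory divided by the number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```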

Variable types

Numeric: 4
Text: 1
Categorical: 2

Dataset

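Description: 정류장_ID, 정류장_명칭, 정류장_유형, 정류장_번호, 위도, 경도, 버스도착정보안내기_설치_여부 (stop ID, stop name, stop type, stop number, latitude, longitude, whether a bus arrival information display is installed)
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-21231/S/1/datasetView.do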

Alerts

정류장_ID is highly overall correlated with 정류장_번호 and 1 other field (High correlation)
정류장_번호 is highly overall correlated with 정류장_ID and 1 other field (High correlation)
위도 is highly overall correlated with 정류장_ID and 1 other field (High correlation)
정류장_유형 is highly overall correlated with 버스도착정보안내기_설치_여부 (High correlation)
버스도착정보안내기_설치_여부 is highly overall correlated with 정류장_유형 (High correlation)
정류장_ID has unique values (Unique)

Reproduction

Analysis started: 2024-05-03 21:14:03.195330
Analysis finished: 2024-05-03 21:14:13.681994
Duration: 10.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
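
The reported duration is the difference between the two timestamps. A minimal check using only the standard library, with the timestamps copied from the lines above:

```python
from datetime import datetime

# Timestamps as printed in the "Reproduction" section.
started = datetime.fromisoformat("2024-05-03 21:14:03.195330")
finished = datetime.fromisoformat("2024-05-03 21:14:13.681994")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```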

Variables

정류장_ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.1322338 × 10^8
Minimum: 1 × 10^8
Maximum: 1.2900024 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1 × 10^8
5-th percentile: 1.0100025 × 10^8
Q1: 1.0790018 × 10^8
median: 1.1390008 × 10^8
Q3: 1.1990003 × 10^8
95-th percentile: 1.2300061 × 10^8
Maximum: 1.2900024 × 10^8
Range: 29000239
Interquartile range (IQR): 11999854

Descriptive statistics

Standard deviation: 7006549
Coefficient of variation (CV): 0.061882527
Kurtosis: -1.0572137
Mean: 1.1322338 × 10^8
Median Absolute Deviation (MAD): 5999927
Skewness: -0.14390899
Sum: 1.1322338 × 10^12
Variance: 4.9091729 × 10^13
Monotonicity: Not monotonic
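
Several of these statistics are simple functions of the others. The following sketch recomputes the coefficient of variation, the variance, and the range for 정류장_ID from the reported values. It is a consistency check on rounded figures, not the profiler's own computation. The exact minimum and maximum are taken from the extreme-value lists for this variable.

```python
# Reported figures for 정류장_ID (rounded as printed in the report).
mean = 1.1322338e8
std = 7006549
minimum = 100_000_001  # smallest value listed for this variable
maximum = 129_000_240  # largest value listed for this variable

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

# Variance is the square of the standard deviation.
variance = std ** 2

# Range is the distance between the extremes.
value_range = maximum - minimum

print(f"CV       = {cv:.6f}")
print(f"variance = {variance:.7e}")
print(f"range    = {value_range}")
```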
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
121900055 | 1 | < 0.1%
109000103 | 1 | < 0.1%
122900061 | 1 | < 0.1%
100000150 | 1 | < 0.1%
123000291 | 1 | < 0.1%
115000103 | 1 | < 0.1%
122000325 | 1 | < 0.1%
119900008 | 1 | < 0.1%
115000639 | 1 | < 0.1%
113000214 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Value | Count | Frequency (%)
100000001 | 1 | < 0.1%
100000003 | 1 | < 0.1%
100000004 | 1 | < 0.1%
100000005 | 1 | < 0.1%
100000006 | 1 | < 0.1%
100000007 | 1 | < 0.1%
100000008 | 1 | < 0.1%
100000009 | 1 | < 0.1%
100000010 | 1 | < 0.1%
100000011 | 1 | < 0.1%

Value | Count | Frequency (%)
129000240 | 1 | < 0.1%
129000239 | 1 | < 0.1%
129000227 | 1 | < 0.1%
129000204 | 1 | < 0.1%
129000203 | 1 | < 0.1%
129000195 | 1 | < 0.1%
129000194 | 1 | < 0.1%
129000192 | 1 | < 0.1%
129000191 | 1 | < 0.1%
129000115 | 1 | < 0.1%

정류장_명칭
Text

Distinct: 6811
Distinct (%): 68.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 26
Median length: 19
Mean length: 7.7142
Min length: 2

Characters and Unicode

Total characters: 77142
Distinct characters: 667
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4467
Unique (%): 44.7%
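
"Distinct" and "Unique" measure different things: distinct counts how many different values occur at all, while unique counts the values that occur exactly once. A small sketch on a hypothetical list of stop names (the list is made up for illustration and is not drawn from the dataset):

```python
from collections import Counter

# Hypothetical sample of stop names, with some repeats.
names = ["벽산아파트", "광화문", "벽산아파트", "서울역", "도림사거리", "광화문"]

counts = Counter(names)

# Distinct: number of different values.
distinct = len(counts)

# Unique: number of values that appear exactly once.
unique = sum(1 for c in counts.values() if c == 1)

print(distinct, unique)
```

In the report, the same distinction gives 6811 distinct names but only 4467 that appear once.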

Sample

1st row: 반포리체
2nd row: SK아파트.벽산아파트상가
3rd row: 독립문파크빌
4th row: 도림사거리
5th row: 명륜3가종점
Value | Count | Frequency (%)
벽산아파트 | 11 | 0.1%
새마을금고 | 10 | 0.1%
현대아파트 | 10 | 0.1%
북서울꿈의숲 | 9 | 0.1%
우성아파트 | 9 | 0.1%
삼성래미안아파트 | 9 | 0.1%
가산디지털단지역 | 8 | 0.1%
광화문 | 8 | 0.1%
신대방역 | 8 | 0.1%
봉천역 | 8 | 0.1%
Other values (6802) | 9911 | 99.1%

Most occurring characters

ValueCountFrequency (%)
2260
 
2.9%
2106
 
2.7%
2090
 
2.7%
. 2071
 
2.7%
2022
 
2.6%
1729
 
2.2%
1572
 
2.0%
1529
 
2.0%
1279
 
1.7%
1243
 
1.6%
Other values (657) 59241
76.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 71470 | 92.6%
Decimal Number | 2439 | 3.2%
Other Punctuation | 2095 | 2.7%
Uppercase Letter | 692 | 0.9%
Close Punctuation | 203 | 0.3%
Open Punctuation | 201 | 0.3%
Lowercase Letter | 29 | < 0.1%
Dash Punctuation | 12 | < 0.1%
Space Separator | 1 | < 0.1%
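
The category names in this table are Unicode general categories. As a rough illustration, Python's standard `unicodedata` module returns the two-letter codes for them: `Lo` is Other Letter (which covers Hangul syllables), `Lu` is Uppercase Letter, `Nd` is Decimal Number, and `Po` is Other Punctuation. The sketch below classifies the characters of two stop names from the report's sample rows. How the profiler itself derives the categories is not shown in this report.

```python
import unicodedata
from collections import Counter

# Two stop names taken from the sample rows of 정류장_명칭.
names = ["SK아파트.벽산아파트상가", "명륜3가종점"]

# Count characters by Unicode general category (two-letter code).
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for code, count in category_counts.most_common():
    print(code, count)
```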

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2260
 
3.2%
2106
 
2.9%
2090
 
2.9%
2022
 
2.8%
1729
 
2.4%
1572
 
2.2%
1529
 
2.1%
1279
 
1.8%
1243
 
1.7%
1178
 
1.6%
Other values (611) 54462
76.2%
Uppercase Letter
ValueCountFrequency (%)
C 92
13.3%
T 84
12.1%
K 70
10.1%
S 67
9.7%
A 57
8.2%
P 46
 
6.6%
I 37
 
5.3%
M 36
 
5.2%
G 33
 
4.8%
B 32
 
4.6%
Other values (14) 138
19.9%
Decimal Number
ValueCountFrequency (%)
1 694
28.5%
2 476
19.5%
3 344
14.1%
4 213
 
8.7%
5 171
 
7.0%
0 142
 
5.8%
7 119
 
4.9%
6 115
 
4.7%
9 92
 
3.8%
8 73
 
3.0%
Other Punctuation
ValueCountFrequency (%)
. 2071
98.9%
? 13
 
0.6%
& 10
 
0.5%
1
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
e 23
79.3%
k 3
 
10.3%
t 2
 
6.9%
s 1
 
3.4%
Close Punctuation
ValueCountFrequency (%)
) 203
100.0%
Open Punctuation
ValueCountFrequency (%)
( 201
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 71470 | 92.6%
Common | 4951 | 6.4%
Latin | 721 | 0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2260
 
3.2%
2106
 
2.9%
2090
 
2.9%
2022
 
2.8%
1729
 
2.4%
1572
 
2.2%
1529
 
2.1%
1279
 
1.8%
1243
 
1.7%
1178
 
1.6%
Other values (611) 54462
76.2%
Latin
ValueCountFrequency (%)
C 92
12.8%
T 84
11.7%
K 70
9.7%
S 67
9.3%
A 57
 
7.9%
P 46
 
6.4%
I 37
 
5.1%
M 36
 
5.0%
G 33
 
4.6%
B 32
 
4.4%
Other values (18) 167
23.2%
Common
ValueCountFrequency (%)
. 2071
41.8%
1 694
 
14.0%
2 476
 
9.6%
3 344
 
6.9%
4 213
 
4.3%
) 203
 
4.1%
( 201
 
4.1%
5 171
 
3.5%
0 142
 
2.9%
7 119
 
2.4%
Other values (8) 317
 
6.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 71470 | 92.6%
ASCII | 5671 | 7.4%
None | 1 | < 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2260
 
3.2%
2106
 
2.9%
2090
 
2.9%
2022
 
2.8%
1729
 
2.4%
1572
 
2.2%
1529
 
2.1%
1279
 
1.8%
1243
 
1.7%
1178
 
1.6%
Other values (611) 54462
76.2%
ASCII
ValueCountFrequency (%)
. 2071
36.5%
1 694
 
12.2%
2 476
 
8.4%
3 344
 
6.1%
4 213
 
3.8%
) 203
 
3.6%
( 201
 
3.5%
5 171
 
3.0%
0 142
 
2.5%
7 119
 
2.1%
Other values (35) 1037
18.3%
None
ValueCountFrequency (%)
1
100.0%

정류장_유형
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
일반차로: 5350
마을버스: 3776
중앙차로: 331
가로변시간: 269
가로변전일: 140

Length

Max length: 5
Median length: 4
Mean length: 4.0543
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 마을버스
2nd row: 일반차로
3rd row: 일반차로
4th row: 일반차로
5th row: 마을버스

Common Values

Value | Count | Frequency (%)
일반차로 | 5350 | 53.5%
마을버스 | 3776 | 37.8%
중앙차로 | 331 | 3.3%
가로변시간 | 269 | 2.7%
가로변전일 | 140 | 1.4%
가상정류장 | 134 | 1.3%
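
The frequency column is each count divided by the total number of observations. A short sketch that recomputes the percentages from the counts listed for 정류장_유형:

```python
# Counts per category, as listed under "Common Values".
counts = {
    "일반차로": 5350,
    "마을버스": 3776,
    "중앙차로": 331,
    "가로변시간": 269,
    "가로변전일": 140,
    "가상정류장": 134,
}

total = sum(counts.values())

# Percentage share of each category, rounded to one decimal like the report.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

for name, pct in shares.items():
    print(f"{name}: {pct}%")
```

The counts add up to the 10000 observations, which is consistent with the variable having no missing values.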

Length

Histogram of lengths of the category

정류장_번호
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9909
Distinct (%): 99.1%
Missing: 1
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 14457.202
Minimum: 0
Maximum: 92691
Zeros: 54
Zeros (%): 0.5%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2187.6
Q1: 8717
median: 14585
Q3: 20540.5
95-th percentile: 24505.1
Maximum: 92691
Range: 92691
Interquartile range (IQR): 11823.5

Descriptive statistics

Standard deviation: 8021.7416
Coefficient of variation (CV): 0.55486129
Kurtosis: 16.793196
Mean: 14457.202
Median Absolute Deviation (MAD): 5952
Skewness: 1.8574362
Sum: 1.4455756 × 10^8
Variance: 64348338
Monotonicity: Not monotonic
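
The high kurtosis and positive skewness suggest a long right tail. One common way to quantify that, which the report itself does not apply, is Tukey's 1.5 × IQR rule. The sketch below applies it to the reported quartiles of 정류장_번호 and also checks that the reported sum matches the mean times the number of non-missing values. The choice of rule and the variable names are assumptions made for illustration.

```python
# Reported quantiles and moments for 정류장_번호.
q1 = 8717
q3 = 20540.5
maximum = 92691
mean = 14457.202
n_observations = 10_000
n_missing = 1

# Tukey fences: values beyond 1.5 * IQR from the quartiles count as outliers.
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
max_is_outlier = maximum > upper_fence

# The sum is computed over non-missing values only.
estimated_sum = mean * (n_observations - n_missing)

print(f"IQR = {iqr}, fences = ({lower_fence}, {upper_fence})")
print(f"maximum flagged as outlier: {max_is_outlier}")
print(f"estimated sum = {estimated_sum:.7e}")
```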
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
0 | 54 | 0.5%
4526 | 2 | < 0.1%
24509 | 2 | < 0.1%
24306 | 2 | < 0.1%
17244 | 2 | < 0.1%
21199 | 2 | < 0.1%
4540 | 2 | < 0.1%
5256 | 2 | < 0.1%
4532 | 2 | < 0.1%
5298 | 2 | < 0.1%
Other values (9899) | 9927 | 99.3%

Value | Count | Frequency (%)
0 | 54 | 0.5%
1001 | 1 | < 0.1%
1003 | 1 | < 0.1%
1004 | 1 | < 0.1%
1005 | 1 | < 0.1%
1007 | 1 | < 0.1%
1008 | 1 | < 0.1%
1009 | 1 | < 0.1%
1010 | 1 | < 0.1%
1012 | 1 | < 0.1%

Value | Count | Frequency (%)
92691 | 1 | < 0.1%
92690 | 1 | < 0.1%
92648 | 1 | < 0.1%
92647 | 1 | < 0.1%
92643 | 1 | < 0.1%
92641 | 1 | < 0.1%
92625 | 1 | < 0.1%
92624 | 1 | < 0.1%
92623 | 1 | < 0.1%
92622 | 1 | < 0.1%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 9740
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.550542
Minimum: 37.329166
Maximum: 37.6948
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 37.329166
5-th percentile: 37.471921
Q1: 37.503141
median: 37.549913
Q3: 37.589735
95-th percentile: 37.647764
Maximum: 37.6948
Range: 0.365634
Interquartile range (IQR): 0.086594

Descriptive statistics

Standard deviation: 0.054848125
Coefficient of variation (CV): 0.001460648
Kurtosis: -0.69402942
Mean: 37.550542
Median Absolute Deviation (MAD): 0.043815
Skewness: 0.24013138
Sum: 375505.42
Variance: 0.0030083168
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
37.570337 | 3 | < 0.1%
37.553372 | 3 | < 0.1%
37.477082 | 3 | < 0.1%
37.634433 | 3 | < 0.1%
37.476304 | 3 | < 0.1%
37.486274 | 3 | < 0.1%
37.560081 | 2 | < 0.1%
37.480979 | 2 | < 0.1%
37.493303 | 2 | < 0.1%
37.480603 | 2 | < 0.1%
Other values (9730) | 9974 | 99.7%

Value | Count | Frequency (%)
37.329166 | 1 | < 0.1%
37.329418 | 1 | < 0.1%
37.364795 | 1 | < 0.1%
37.401377 | 1 | < 0.1%
37.416098 | 1 | < 0.1%
37.417036 | 1 | < 0.1%
37.417117 | 1 | < 0.1%
37.417571 | 1 | < 0.1%
37.43052 | 1 | < 0.1%
37.430947 | 1 | < 0.1%

Value | Count | Frequency (%)
37.6948 | 1 | < 0.1%
37.692548 | 1 | < 0.1%
37.690489 | 1 | < 0.1%
37.690177 | 1 | < 0.1%
37.689876 | 1 | < 0.1%
37.689668 | 1 | < 0.1%
37.689331 | 1 | < 0.1%
37.689218 | 1 | < 0.1%
37.689012 | 1 | < 0.1%
37.687988 | 1 | < 0.1%

경도
Real number (ℝ)

Distinct: 9792
Distinct (%): 97.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 126.98405
Minimum: 126.45131
Maximum: 127.18179
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 126.45131
5-th percentile: 126.8399
Q1: 126.91501
median: 126.99227
Q3: 127.04994
95-th percentile: 127.12765
Maximum: 127.18179
Range: 0.730475
Interquartile range (IQR): 0.134932

Descriptive statistics

Standard deviation: 0.087571114
Coefficient of variation (CV): 0.00068962294
Kurtosis: -0.48665415
Mean: 126.98405
Median Absolute Deviation (MAD): 0.068152
Skewness: -0.099991673
Sum: 1269840.5
Variance: 0.0076687
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
127.070211 | 3 | < 0.1%
127.01542 | 3 | < 0.1%
127.044896 | 3 | < 0.1%
127.034811 | 3 | < 0.1%
127.088669 | 3 | < 0.1%
126.884787 | 2 | < 0.1%
127.009282 | 2 | < 0.1%
127.034097 | 2 | < 0.1%
127.024805 | 2 | < 0.1%
126.90782 | 2 | < 0.1%
Other values (9782) | 9975 | 99.8%

Value | Count | Frequency (%)
126.45131 | 1 | < 0.1%
126.45723 | 1 | < 0.1%
126.463502 | 1 | < 0.1%
126.722015 | 1 | < 0.1%
126.766228 | 1 | < 0.1%
126.768582 | 1 | < 0.1%
126.768871 | 1 | < 0.1%
126.786594 | 1 | < 0.1%
126.79341 | 1 | < 0.1%
126.794236 | 1 | < 0.1%

Value | Count | Frequency (%)
127.181785 | 1 | < 0.1%
127.18176 | 1 | < 0.1%
127.181734 | 1 | < 0.1%
127.181667 | 1 | < 0.1%
127.180151 | 1 | < 0.1%
127.180138 | 1 | < 0.1%
127.18013 | 1 | < 0.1%
127.1799 | 1 | < 0.1%
127.179839 | 1 | < 0.1%
127.179726 | 1 | < 0.1%

버스도착정보안내기_설치_여부
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
미설치: 6933
설치: 3067

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 미설치
2nd row: 미설치
3rd row: 미설치
4th row: 미설치
5th row: 미설치

Common Values

Value | Count | Frequency (%)
미설치 | 6933 | 69.3%
설치 | 3067 | 30.7%

Length

Histogram of lengths of the category

Interactions

[16 interaction plots omitted]

Correlations

Variable | 정류장_ID | 정류장_유형 | 정류장_번호 | 위도 | 경도 | 버스도착정보안내기_설치_여부
정류장_ID | 1.000 | 0.263 | 0.821 | 0.872 | 0.796 | 0.190
정류장_유형 | 0.263 | 1.000 | 0.323 | 0.115 | 0.212 | 0.736
정류장_번호 | 0.821 | 0.323 | 1.000 | 0.688 | 0.573 | 0.118
위도 | 0.872 | 0.115 | 0.688 | 1.000 | 0.387 | 0.109
경도 | 0.796 | 0.212 | 0.573 | 0.387 | 1.000 | 0.139
버스도착정보안내기_설치_여부 | 0.190 | 0.736 | 0.118 | 0.109 | 0.139 | 1.000

Variable | 버스도착정보안내기_설치_여부 | 정류장_유형
버스도착정보안내기_설치_여부 | 1.000 | 0.545
정류장_유형 | 0.545 | 1.000

Variable | 정류장_ID | 정류장_번호 | 위도 | 경도 | 정류장_유형 | 버스도착정보안내기_설치_여부
정류장_ID | 1.000 | 0.980 | -0.663 | -0.072 | 0.142 | 0.146
정류장_번호 | 0.980 | 1.000 | -0.652 | -0.074 | 0.167 | 0.118
위도 | -0.663 | -0.652 | 1.000 | 0.228 | 0.061 | 0.084
경도 | -0.072 | -0.074 | 0.228 | 1.000 | 0.119 | 0.104
정류장_유형 | 0.142 | 0.167 | 0.061 | 0.119 | 1.000 | 0.545
버스도착정보안내기_설치_여부 | 0.146 | 0.118 | 0.084 | 0.104 | 0.545 | 1.000
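
The labels naming the correlation method for each matrix did not survive in this text. For pairs of categorical variables such as 정류장_유형 and 버스도착정보안내기_설치_여부, a common association measure is Cramér's V, and the sketch below assumes that is the kind of coefficient shown. It is an uncorrected textbook version, and the contingency tables are hypothetical because the report does not list the cross-tabulated counts, so it is not expected to reproduce the 0.545 above.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows.

    Assumes every row and column has a non-zero total.
    """
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    # Normalise by the sample size and the smaller table dimension minus one.
    k = min(len(table), len(table[0])) - 1
    return math.sqrt(chi2 / (n * k))


# Hypothetical tables: one with perfect association, one with none.
perfect = [[50, 0], [0, 50]]
independent = [[30, 20], [30, 20]]

print(cramers_v(perfect))
print(cramers_v(independent))
```

The coefficient ranges from 0 (no association) to 1 (one variable fully determines the other), which is why it never takes the negative values seen among the numeric pairs.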

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
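
Both plots summarize where values are missing. As a rough text equivalent, a per-column count of missing entries can be computed as below. The three records are hypothetical and use only two of the dataset's columns; in the actual report the single missing cell is in 정류장_번호.

```python
# Hypothetical records; None marks a missing value.
rows = [
    {"정류장_ID": 100000001, "정류장_번호": 1001},
    {"정류장_ID": 100000003, "정류장_번호": None},
    {"정류장_ID": 100000004, "정류장_번호": 1004},
]

# Count missing entries column by column.
missing_per_column = {
    column: sum(1 for row in rows if row[column] is None)
    for column in rows[0]
}

print(missing_per_column)
```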

Sample

(row index) | 정류장_ID | 정류장_명칭 | 정류장_유형 | 정류장_번호 | 위도 | 경도 | 버스도착정보안내기_설치_여부
10858 | 121900055 | 반포리체 | 마을버스 | 22480 | 37.502384 | 127.014005 | 미설치
3522 | 108000137 | SK아파트.벽산아파트상가 | 일반차로 | 9225 | 37.619732 | 127.011197 | 미설치
5733 | 112000428 | 독립문파크빌 | 일반차로 | 13337 | 37.574578 | 126.951858 | 미설치
9054 | 118000316 | 도림사거리 | 일반차로 | 19407 | 37.507051 | 126.901271 | 미설치
296 | 100900040 | 명륜3가종점 | 마을버스 | 1503 | 37.590948 | 126.992864 | 미설치
4479 | 110000178 | 광운대역 | 일반차로 | 11278 | 37.622578 | 127.061673 | 미설치
636 | 101000250 | 서울역 | 일반차로 | 2685 | 37.55816 | 126.971336 | 설치
5892 | 112900148 | 홍제2동주민센터 | 마을버스 | 13913 | 37.586186 | 126.949649 | 미설치
11466 | 122000343 | 개포중학교 | 일반차로 | 23457 | 37.480019 | 127.063631 | 미설치
1823 | 104900001 | 종로약국앞 | 마을버스 | 5716 | 37.557588 | 127.088106 | 미설치

(row index) | 정류장_ID | 정류장_명칭 | 정류장_유형 | 정류장_번호 | 위도 | 경도 | 버스도착정보안내기_설치_여부
9077 | 118000480 | 신풍프라자 | 일반차로 | 19847 | 37.501786 | 126.901512 | 미설치
2835 | 107000106 | 석관고등학교 | 일반차로 | 8196 | 37.609435 | 127.064362 | 설치
7202 | 115000183 | 방화2단지아파트 | 일반차로 | 16280 | 37.573514 | 126.818829 | 미설치
8125 | 116900054 | 한신빌라 | 마을버스 | 17545 | 37.491226 | 126.850966 | 미설치
240 | 100000404 | 종묘.세운상가 | 일반차로 | 1914 | 37.570374 | 126.9935 | 미설치
6752 | 114000161 | 목동대학학원 | 일반차로 | 15264 | 37.524726 | 126.872864 | 설치
5780 | 112900036 | 동명여중 | 마을버스 | 13804 | 37.568064 | 126.962919 | 미설치
286 | 100900030 | 서울대치과대학 | 마을버스 | 1878 | 37.577408 | 126.997801 | 미설치
11069 | 121900267 | 연세사랑병원 | 마을버스 | 22442 | 37.476278 | 126.986388 | 미설치
4059 | 109000189 | 세그루학원 | 일반차로 | 10277 | 37.655624 | 127.027833 | 설치