Overview

Dataset statistics

Number of variables: 6
Number of observations: 356
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 18.2 KiB
Average record size in memory: 52.4 B
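
The average record size can be cross-checked from the two figures reported with it: total size in memory divided by the number of observations. The sketch below assumes 1 KiB = 1024 B and that the report rounds to one decimal; the function name is illustrative and not part of ydata-profiling.

```python
def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Return the mean number of bytes per record, given a total size in KiB."""
    # Assumption: 1 KiB = 1024 bytes.
    return total_kib * 1024 / n_rows


# Values taken from the dataset statistics (18.2 KiB, 356 observations).
avg = average_record_size_bytes(18.2, 356)
print(f"{avg:.1f} B")  # 52.4 B
```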

Variable types

Numeric: 4
Text: 1
DateTime: 1

Dataset

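Description: Provides the location of bus stops in Dongducheon-si, Gyeonggi-do: stop number (정류장번호), management number (관리번호), stop name (정류장명), longitude (경도), and latitude (위도).
Author: Dongducheon-si, Gyeonggi-do
URL: https://www.data.go.kr/data/15125436/fileData.do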

Alerts

데이터기준일 (data reference date) has constant value "" [Constant]
관리번호 is highly overall correlated with 정류장번호 [High correlation]
정류장번호 is highly overall correlated with 관리번호 [High correlation]
관리번호 has unique values [Unique]
정류장번호 has unique values [Unique]

Reproduction

Analysis started: 2023-12-16 15:42:30.752809
Analysis finished: 2023-12-16 15:42:41.319666
Duration: 10.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
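
The reported duration is the difference between the two analysis timestamps. A minimal sketch of that check with the standard library; the format string and helper name are assumptions chosen to match how the timestamps are printed here.

```python
from datetime import datetime

# Assumed format, matching the timestamps as printed in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"


def duration_seconds(started: str, finished: str) -> float:
    """Return the elapsed time in seconds between two timestamp strings."""
    delta = datetime.strptime(finished, FMT) - datetime.strptime(started, FMT)
    return delta.total_seconds()


elapsed = duration_seconds("2023-12-16 15:42:30.752809", "2023-12-16 15:42:41.319666")
print(f"{elapsed:.2f} seconds")  # 10.57 seconds
```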

Variables

관리번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 356
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.150002 × 10^8
Minimum: 2.15 × 10^8
Maximum: 2.1500043 × 10^8
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.3 KiB

Quantile statistics

Minimum: 2.15 × 10^8
5-th percentile: 2.1500002 × 10^8
Q1: 2.1500009 × 10^8
Median: 2.1500018 × 10^8
Q3: 2.1500031 × 10^8
95-th percentile: 2.1500041 × 10^8
Maximum: 2.1500043 × 10^8
Range: 431
Interquartile range (IQR): 218.75

Descriptive statistics

Standard deviation: 127.96856
Coefficient of variation (CV): 5.9520203 × 10^-7
Kurtosis: -1.2088385
Mean: 2.150002 × 10^8
Median Absolute Deviation (MAD): 104.5
Skewness: 0.21740323
Sum: 7.6540071 × 10^10
Variance: 16375.952
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
215000166           1      0.3%
215000030           1      0.3%
215000050           1      0.3%
215000049           1      0.3%
215000048           1      0.3%
215000047           1      0.3%
215000046           1      0.3%
215000045           1      0.3%
215000044           1      0.3%
215000043           1      0.3%
Other values (346)  346    97.2%

Value      Count  Frequency (%)
215000001  1      0.3%
215000002  1      0.3%
215000003  1      0.3%
215000004  1      0.3%
215000005  1      0.3%
215000006  1      0.3%
215000007  1      0.3%
215000008  1      0.3%
215000009  1      0.3%
215000010  1      0.3%

Value      Count  Frequency (%)
215000432  1      0.3%
215000431  1      0.3%
215000427  1      0.3%
215000425  1      0.3%
215000424  1      0.3%
215000422  1      0.3%
215000421  1      0.3%
215000420  1      0.3%
215000419  1      0.3%
215000418  1      0.3%

정류장번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 356
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16217.593
Minimum: 16001
Maximum: 16459
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.3 KiB

Quantile statistics

Minimum: 16001
5-th percentile: 16018.75
Q1: 16089.75
Median: 16200.5
Q3: 16341.5
95-th percentile: 16431
Maximum: 16459
Range: 458
Interquartile range (IQR): 251.75

Descriptive statistics

Standard deviation: 138.92018
Coefficient of variation (CV): 0.0085660174
Kurtosis: -1.3869029
Mean: 16217.593
Median Absolute Deviation (MAD): 126
Skewness: 0.074894199
Sum: 5773463
Variance: 19298.817
Monotonicity: Not monotonic
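
Several of the figures reported for 정류장번호 are tied together by standard definitions: mean = sum / count, variance = standard deviation squared, CV = standard deviation / mean, range = maximum - minimum, and IQR = Q3 - Q1. The sketch below recomputes them from the rounded values printed in the report, so small differences in the last digit are expected; the variable names are illustrative.

```python
# Figures reported for 정류장번호 (rounded as printed in the report).
n = 356
total = 5773463
std = 138.92018
minimum, maximum = 16001, 16459
q1, q3 = 16089.75, 16341.5

mean = total / n                 # arithmetic mean = sum / count
variance = std ** 2              # variance = squared standard deviation
cv = std / mean                  # coefficient of variation = std / mean
value_range = maximum - minimum  # range = max - min
iqr = q3 - q1                    # interquartile range = Q3 - Q1

print(f"mean={mean:.3f} variance={variance:.2f} cv={cv:.7f}")
print(f"range={value_range} iqr={iqr}")
```

The recomputed values agree with the reported mean (16217.593), CV (0.0085660174), range (458), and IQR (251.75); the variance matches 19298.817 to within the rounding of the standard deviation.
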
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
16122               1      0.3%
16088               1      0.3%
16071               1      0.3%
16108               1      0.3%
16130               1      0.3%
16140               1      0.3%
16059               1      0.3%
16105               1      0.3%
16103               1      0.3%
16097               1      0.3%
Other values (346)  346    97.2%

Value  Count  Frequency (%)
16001  1      0.3%
16002  1      0.3%
16003  1      0.3%
16004  1      0.3%
16005  1      0.3%
16006  1      0.3%
16007  1      0.3%
16008  1      0.3%
16009  1      0.3%
16010  1      0.3%

Value  Count  Frequency (%)
16459  1      0.3%
16458  1      0.3%
16457  1      0.3%
16456  1      0.3%
16451  1      0.3%
16450  1      0.3%
16446  1      0.3%
16445  1      0.3%
16444  1      0.3%
16443  1      0.3%

정류장명
Text

Distinct: 206
Distinct (%): 57.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Length

Max length: 16
Median length: 14
Mean length: 6.0898876
Min length: 2

Characters and Unicode

Total characters: 2168
Distinct characters: 233
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 65
Unique (%): 18.3%
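
The report gives both a distinct count (206) and a unique count (65) for this column. As read here, "distinct" counts how many different values occur, while "unique" counts values that occur exactly once, and both percentages are taken over the 356 observations. A small sketch of that distinction; the helper name and the toy list are made up for illustration and are not the real column.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Toy list, not the real data: "A" repeats, "B" and "C" each appear once.
print(distinct_and_unique(["A", "A", "B", "C"]))  # (3, 2)

# Percentages from the reported counts, over 356 observations.
print(f"{206 / 356:.1%} {65 / 356:.1%}")  # 57.9% 18.3%
```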

Sample

1st row: 자연휴양림
2nd row: 소요초등학교
3rd row: 지행역4번출구
4th row: 지행역1번출구
5th row: 송내주민센터송내주공2단지
Value                           Count  Frequency (%)
동두천중앙역                    4      1.1%
동두천역                        4      1.1%
생연주공아파트                  4      1.1%
미2사단후문                     4      1.1%
생골사거리.고용복지플러스센터   3      0.8%
구)동연파출소                   2      0.6%
북보산동                        2      0.6%
미2사단정문                     2      0.6%
보산역.보산초등학교입구         2      0.6%
한빛누리중고등학교.은성교회     2      0.6%
Other values (196)              327    91.9%

Most occurring characters

Value               Count  Frequency (%)
(lost)              87     4.0%
(lost)              57     2.6%
(lost)              47     2.2%
.                   45     2.1%
(lost)              45     2.1%
(lost)              42     1.9%
(lost)              39     1.8%
(lost)              38     1.8%
(lost)              36     1.7%
(lost)              36     1.7%
Other values (223)  1696   78.2%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       2064   95.2%
Other Punctuation  45     2.1%
Decimal Number     44     2.0%
Close Punctuation  6      0.3%
Uppercase Letter   5      0.2%
Open Punctuation   4      0.2%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
(lost)              87     4.2%
(lost)              57     2.8%
(lost)              47     2.3%
(lost)              45     2.2%
(lost)              42     2.0%
(lost)              39     1.9%
(lost)              38     1.8%
(lost)              36     1.7%
(lost)              36     1.7%
(lost)              36     1.7%
Other values (207)  1601   77.6%

Decimal Number
Value  Count  Frequency (%)
2      14     31.8%
1      7      15.9%
3      7      15.9%
5      6      13.6%
4      5      11.4%
8      2      4.5%
9      2      4.5%
6      1      2.3%

Uppercase Letter
Value  Count  Frequency (%)
E      1      20.0%
T      1      20.0%
K      1      20.0%
C      1      20.0%
A      1      20.0%

Other Punctuation
Value  Count  Frequency (%)
.      45     100.0%

Close Punctuation
Value  Count  Frequency (%)
)      6      100.0%

Open Punctuation
Value  Count  Frequency (%)
(      4      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  2064   95.2%
Common  99     4.6%
Latin   5      0.2%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
(lost)              87     4.2%
(lost)              57     2.8%
(lost)              47     2.3%
(lost)              45     2.2%
(lost)              42     2.0%
(lost)              39     1.9%
(lost)              38     1.8%
(lost)              36     1.7%
(lost)              36     1.7%
(lost)              36     1.7%
Other values (207)  1601   77.6%

Common
Value  Count  Frequency (%)
.      45     45.5%
2      14     14.1%
1      7      7.1%
3      7      7.1%
5      6      6.1%
)      6      6.1%
4      5      5.1%
(      4      4.0%
8      2      2.0%
9      2      2.0%

Latin
Value  Count  Frequency (%)
E      1      20.0%
T      1      20.0%
K      1      20.0%
C      1      20.0%
A      1      20.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  2064   95.2%
ASCII   104    4.8%

Most frequent character per block

Hangul
Value               Count  Frequency (%)
(lost)              87     4.2%
(lost)              57     2.8%
(lost)              47     2.3%
(lost)              45     2.2%
(lost)              42     2.0%
(lost)              39     1.9%
(lost)              38     1.8%
(lost)              36     1.7%
(lost)              36     1.7%
(lost)              36     1.7%
Other values (207)  1601   77.6%

ASCII
Value             Count  Frequency (%)
.                 45     43.3%
2                 14     13.5%
1                 7      6.7%
3                 7      6.7%
5                 6      5.8%
)                 6      5.8%
4                 5      4.8%
(                 4      3.8%
8                 2      1.9%
9                 2      1.9%
Other values (6)  6      5.8%

위도
Real number (ℝ)

Distinct: 330
Distinct (%): 92.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.905897
Minimum: 37.86825
Maximum: 37.971633
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.3 KiB

Quantile statistics

Minimum: 37.86825
5-th percentile: 37.880179
Q1: 37.892333
Median: 37.901083
Q3: 37.911375
95-th percentile: 37.951433
Maximum: 37.971633
Range: 0.1033833
Interquartile range (IQR): 0.0190417

Descriptive statistics

Standard deviation: 0.021007743
Coefficient of variation (CV): 0.00055420778
Kurtosis: 0.97532177
Mean: 37.905897
Median Absolute Deviation (MAD): 0.00907495
Skewness: 1.155418
Sum: 13494.499
Variance: 0.00044132527
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
37.9055667          2      0.6%
37.89215            2      0.6%
37.8995             2      0.6%
37.89695            2      0.6%
37.90985            2      0.6%
37.92775            2      0.6%
37.897              2      0.6%
37.9111333          2      0.6%
37.9023667          2      0.6%
37.9008             2      0.6%
Other values (320)  336    94.4%

Value       Count  Frequency (%)
37.86825    1      0.3%
37.8719167  1      0.3%
37.8736333  1      0.3%
37.87385    1      0.3%
37.87415    1      0.3%
37.8744833  1      0.3%
37.8763     1      0.3%
37.87725    1      0.3%
37.8775833  1      0.3%
37.8777667  1      0.3%

Value       Count  Frequency (%)
37.9716333  1      0.3%
37.9715333  1      0.3%
37.9689333  1      0.3%
37.9688333  1      0.3%
37.96625    1      0.3%
37.9627833  1      0.3%
37.9627333  1      0.3%
37.9624333  1      0.3%
37.9621667  1      0.3%
37.96205    1      0.3%

경도
Real number (ℝ)

Distinct: 332
Distinct (%): 93.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 127.06287
Minimum: 127.01362
Maximum: 127.13733
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.3 KiB

Quantile statistics

Minimum: 127.01362
5-th percentile: 127.02876
Q1: 127.0524
Median: 127.05731
Q3: 127.06557
95-th percentile: 127.11183
Maximum: 127.13733
Range: 0.1237166
Interquartile range (IQR): 0.0131708

Descriptive statistics

Standard deviation: 0.023468245
Coefficient of variation (CV): 0.00018469789
Kurtosis: 1.9026649
Mean: 127.06287
Median Absolute Deviation (MAD): 0.00619165
Skewness: 1.2320536
Sum: 45234.383
Variance: 0.00055075852
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
127.0563            4      1.1%
127.0543833         3      0.8%
127.05635           3      0.8%
127.0539667         3      0.8%
127.1216            2      0.6%
127.0565167         2      0.6%
127.0572833         2      0.6%
127.0484167         2      0.6%
127.0584333         2      0.6%
127.05675           2      0.6%
Other values (322)  331    93.0%

Value        Count  Frequency (%)
127.0136167  1      0.3%
127.0136333  1      0.3%
127.0173     1      0.3%
127.0176833  1      0.3%
127.0193833  1      0.3%
127.0197333  1      0.3%
127.0242167  1      0.3%
127.02425    1      0.3%
127.0245833  1      0.3%
127.0252     1      0.3%

Value        Count  Frequency (%)
127.1373333  1      0.3%
127.13655    1      0.3%
127.13645    1      0.3%
127.136      1      0.3%
127.13595    1      0.3%
127.1358833  1      0.3%
127.1354     1      0.3%
127.1352333  1      0.3%
127.1325333  2      0.6%
127.1323333  1      0.3%

데이터기준일
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB
Minimum: 2023-12-07 00:00:00
Maximum: 2023-12-07 00:00:00
Histogram with fixed size bins (bins=1)

Interactions

[Interaction plots: 16 figures, images not preserved]

Correlations

            관리번호  정류장번호  위도    경도
관리번호    1.000     0.878       0.651   0.658
정류장번호  0.878     1.000       0.682   0.668
위도        0.651     0.682       1.000   0.635
경도        0.658     0.668       0.635   1.000

            관리번호  정류장번호  위도    경도
관리번호    1.000     0.752       -0.096  -0.025
정류장번호  0.752     1.000       -0.104  0.135
위도        -0.096    -0.104      1.000   -0.273
경도        -0.025    0.135       -0.273  1.000
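
The report's alerts flag only the 관리번호 and 정류장번호 pair as highly correlated. One way to read that against the second matrix above is a simple absolute-value cutoff. The sketch below uses a cutoff of 0.5, which is an assumption here: the report states neither the threshold nor which matrix drives the alerts, and the function name is illustrative.

```python
names = ["관리번호", "정류장번호", "위도", "경도"]

# Second correlation matrix, transcribed from the report.
matrix = [
    [1.000, 0.752, -0.096, -0.025],
    [0.752, 1.000, -0.104, 0.135],
    [-0.096, -0.104, 1.000, -0.273],
    [-0.025, 0.135, -0.273, 1.000],
]


def high_pairs(names, matrix, threshold=0.5):
    """Return off-diagonal pairs whose absolute correlation is at least `threshold`.

    The default threshold of 0.5 is an assumption, not a value stated in the report.
    """
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):  # upper triangle only; the matrix is symmetric
            if abs(matrix[i][j]) >= threshold:
                pairs.append((names[i], names[j], matrix[i][j]))
    return pairs


print(high_pairs(names, matrix))  # [('관리번호', '정류장번호', 0.752)]
```

Under this assumed cutoff, only the 관리번호 and 정류장번호 pair is returned, which is consistent with the two High correlation alerts.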

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

     관리번호   정류장번호  정류장명                   위도       경도        데이터기준일
0    215000166  16122       자연휴양림                 37.889467  127.13645   2023-12-07
1    215000167  16210       소요초등학교               37.947583  127.05435   2023-12-07
2    215000168  16217       지행역4번출구              37.892383  127.056133  2023-12-07
3    215000169  16281       지행역1번출구              37.891367  127.055117  2023-12-07
4    215000170  16250       송내주민센터송내주공2단지  37.89005   127.054767  2023-12-07
5    215000171  16251       송내주공4단지              37.886833  127.054517  2023-12-07
6    215000172  16254       꿈나무도서관               37.892583  127.051167  2023-12-07
7    215000173  16123       자연휴양림                 37.889483  127.13655   2023-12-07
8    215000174  16124       놀자숲                     37.886867  127.137333  2023-12-07
9    215000175  16261       송내주공5단지후문          37.8832    127.053217  2023-12-07

     관리번호   정류장번호  정류장명                   위도       경도        데이터기준일
346  215000155  16198       아차노리입구               37.877583  127.057067  2023-12-07
347  215000156  16199       아차노리입구               37.8763    127.057167  2023-12-07
348  215000157  16200       상우아파트                 37.898783  127.066933  2023-12-07
349  215000158  16201       만복사주유소               37.8957    127.071583  2023-12-07
350  215000159  16456       생연2동행정복지센터        37.9011    127.050317  2023-12-07
351  215000161  16110       니지모리스튜디오입구       37.8811    127.094883  2023-12-07
352  215000162  16111       생연주공아파트             37.912183  127.06555   2023-12-07
353  215000163  16116       생연주공아파트             37.912117  127.06535   2023-12-07
354  215000164  16120       중앙파출소                 37.913083  127.060967  2023-12-07
355  215000165  16121       중앙파출소                 37.9132    127.060883  2023-12-07