Overview

Dataset statistics

Number of variables5
Number of observations2794
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory117.5 KiB
Average record size in memory43.0 B

Variable types

Text2
Numeric3

Dataset

Description전북특별자치도 전주시 버스정류장 자료로 정류장 ID, 정류장명, 정류장 번호, 위도, 경도 등을 제공합니다.제공항목 : 정류장ID, 정류장명, 정류장번호, 위도, 경도자료제공: 시민교통과
Author전북특별자치도 전주시
URLhttps://www.data.go.kr/data/15080859/fileData.do

Alerts

정류장번호 is highly overall correlated with 위도High correlation
위도 is highly overall correlated with 정류장번호High correlation
정류장ID has unique valuesUnique
정류장번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 17:08:13.437680
Analysis finished2024-03-14 17:08:16.071320
Duration2.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

정류장ID
Text

UNIQUE 

Distinct2794
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size22.0 KiB
2024-03-15T02:08:16.908036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters33528
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2794 ?
Unique (%)100.0%

Sample

1st rowJUB312100690
2nd rowJUB312100693
3rd rowJUB306101433
4th rowJUB305100683
5th rowJUB305032322
ValueCountFrequency (%)
jub312100690 1
 
< 0.1%
jub312100156 1
 
< 0.1%
jub305032387 1
 
< 0.1%
jub312101776 1
 
< 0.1%
jub311100095 1
 
< 0.1%
jub312032250 1
 
< 0.1%
jub312032730 1
 
< 0.1%
jub312032731 1
 
< 0.1%
jub312032349 1
 
< 0.1%
jub312032350 1
 
< 0.1%
Other values (2784) 2784
99.6%
2024-03-15T02:08:18.592705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 5922
17.7%
1 5291
15.8%
3 4620
13.8%
2 2966
8.8%
J 2794
8.3%
U 2794
8.3%
B 2794
8.3%
5 1716
 
5.1%
6 1432
 
4.3%
4 851
 
2.5%
Other values (3) 2348
 
7.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 25146
75.0%
Uppercase Letter 8382
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5922
23.6%
1 5291
21.0%
3 4620
18.4%
2 2966
11.8%
5 1716
 
6.8%
6 1432
 
5.7%
4 851
 
3.4%
7 834
 
3.3%
8 775
 
3.1%
9 739
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
J 2794
33.3%
U 2794
33.3%
B 2794
33.3%

Most occurring scripts

ValueCountFrequency (%)
Common 25146
75.0%
Latin 8382
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 5922
23.6%
1 5291
21.0%
3 4620
18.4%
2 2966
11.8%
5 1716
 
6.8%
6 1432
 
5.7%
4 851
 
3.4%
7 834
 
3.3%
8 775
 
3.1%
9 739
 
2.9%
Latin
ValueCountFrequency (%)
J 2794
33.3%
U 2794
33.3%
B 2794
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 33528
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5922
17.7%
1 5291
15.8%
3 4620
13.8%
2 2966
8.8%
J 2794
8.3%
U 2794
8.3%
B 2794
8.3%
5 1716
 
5.1%
6 1432
 
4.3%
4 851
 
2.5%
Other values (3) 2348
 
7.0%
Distinct1469
Distinct (%)52.6%
Missing0
Missing (%)0.0%
Memory size22.0 KiB
2024-03-15T02:08:19.841368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length15
Mean length5.309592
Min length2

Characters and Unicode

Total characters14835
Distinct characters431
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique273 ?
Unique (%)9.8%

Sample

1st rowBYC
2nd rowBYC
3rd rowcbs기독교방송
4th rowCTS전북방송
5th rowJTV전주방송
ValueCountFrequency (%)
용정마을 8
 
0.3%
신흥리 8
 
0.3%
신덕마을 8
 
0.3%
효자휴먼시아 8
 
0.3%
신흥마을 6
 
0.2%
새터 6
 
0.2%
만성동로 6
 
0.2%
학동마을 5
 
0.2%
월곡마을 4
 
0.1%
신정리 4
 
0.1%
Other values (1481) 2800
97.8%
2024-03-15T02:08:21.427754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
874
 
5.9%
816
 
5.5%
400
 
2.7%
305
 
2.1%
286
 
1.9%
283
 
1.9%
265
 
1.8%
258
 
1.7%
244
 
1.6%
242
 
1.6%
Other values (421) 10862
73.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14425
97.2%
Other Punctuation 139
 
0.9%
Decimal Number 122
 
0.8%
Space Separator 69
 
0.5%
Uppercase Letter 60
 
0.4%
Lowercase Letter 15
 
0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
874
 
6.1%
816
 
5.7%
400
 
2.8%
305
 
2.1%
286
 
2.0%
283
 
2.0%
265
 
1.8%
258
 
1.8%
244
 
1.7%
242
 
1.7%
Other values (389) 10452
72.5%
Uppercase Letter
ValueCountFrequency (%)
T 11
18.3%
S 9
15.0%
B 6
10.0%
L 6
10.0%
C 5
8.3%
K 5
8.3%
N 4
 
6.7%
H 4
 
6.7%
I 2
 
3.3%
V 2
 
3.3%
Other values (3) 6
10.0%
Decimal Number
ValueCountFrequency (%)
2 47
38.5%
1 30
24.6%
3 12
 
9.8%
4 11
 
9.0%
6 8
 
6.6%
5 7
 
5.7%
0 3
 
2.5%
7 3
 
2.5%
8 1
 
0.8%
Lowercase Letter
ValueCountFrequency (%)
t 6
40.0%
s 4
26.7%
e 3
20.0%
b 1
 
6.7%
c 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
. 139
100.0%
Space Separator
ValueCountFrequency (%)
69
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14425
97.2%
Common 335
 
2.3%
Latin 75
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
874
 
6.1%
816
 
5.7%
400
 
2.8%
305
 
2.1%
286
 
2.0%
283
 
2.0%
265
 
1.8%
258
 
1.8%
244
 
1.7%
242
 
1.7%
Other values (389) 10452
72.5%
Latin
ValueCountFrequency (%)
T 11
14.7%
S 9
12.0%
t 6
 
8.0%
B 6
 
8.0%
L 6
 
8.0%
C 5
 
6.7%
K 5
 
6.7%
N 4
 
5.3%
H 4
 
5.3%
s 4
 
5.3%
Other values (8) 15
20.0%
Common
ValueCountFrequency (%)
. 139
41.5%
69
20.6%
2 47
 
14.0%
1 30
 
9.0%
3 12
 
3.6%
4 11
 
3.3%
6 8
 
2.4%
5 7
 
2.1%
0 3
 
0.9%
7 3
 
0.9%
Other values (4) 6
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14425
97.2%
ASCII 410
 
2.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
874
 
6.1%
816
 
5.7%
400
 
2.8%
305
 
2.1%
286
 
2.0%
283
 
2.0%
265
 
1.8%
258
 
1.8%
244
 
1.7%
242
 
1.7%
Other values (389) 10452
72.5%
ASCII
ValueCountFrequency (%)
. 139
33.9%
69
16.8%
2 47
 
11.5%
1 30
 
7.3%
3 12
 
2.9%
4 11
 
2.7%
T 11
 
2.7%
S 9
 
2.2%
6 8
 
2.0%
5 7
 
1.7%
Other values (22) 67
16.3%

정류장번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct2794
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32193.398
Minimum30001
Maximum36626
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size24.7 KiB
2024-03-15T02:08:21.843563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30001
5-th percentile30142.65
Q130740.25
median31494.5
Q332248.75
95-th percentile36485.35
Maximum36626
Range6625
Interquartile range (IQR)1508.5

Descriptive statistics

Standard deviation2123.2291
Coefficient of variation (CV)0.065952314
Kurtosis0.0036971207
Mean32193.398
Median Absolute Deviation (MAD)754.5
Skewness1.237989
Sum89948353
Variance4508101.7
MonotonicityNot monotonic
2024-03-15T02:08:22.126686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
30690 1
 
< 0.1%
30095 1
 
< 0.1%
36064 1
 
< 0.1%
36065 1
 
< 0.1%
32349 1
 
< 0.1%
32350 1
 
< 0.1%
30156 1
 
< 0.1%
30159 1
 
< 0.1%
30078 1
 
< 0.1%
30079 1
 
< 0.1%
Other values (2784) 2784
99.6%
ValueCountFrequency (%)
30001 1
< 0.1%
30002 1
< 0.1%
30003 1
< 0.1%
30004 1
< 0.1%
30005 1
< 0.1%
30006 1
< 0.1%
30007 1
< 0.1%
30008 1
< 0.1%
30009 1
< 0.1%
30010 1
< 0.1%
ValueCountFrequency (%)
36626 1
< 0.1%
36625 1
< 0.1%
36624 1
< 0.1%
36623 1
< 0.1%
36622 1
< 0.1%
36621 1
< 0.1%
36620 1
< 0.1%
36619 1
< 0.1%
36618 1
< 0.1%
36617 1
< 0.1%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct2792
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.854915
Minimum35.58788
Maximum36.092916
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size24.7 KiB
2024-03-15T02:08:22.558163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum35.58788
5-th percentile35.728182
Q135.809788
median35.844951
Q335.892665
95-th percentile36.002535
Maximum36.092916
Range0.50503609
Interquartile range (IQR)0.08287791

Descriptive statistics

Standard deviation0.080082728
Coefficient of variation (CV)0.0022335216
Kurtosis0.80741436
Mean35.854915
Median Absolute Deviation (MAD)0.04038386
Skewness0.1665611
Sum100178.63
Variance0.0064132433
MonotonicityNot monotonic
2024-03-15T02:08:22.974618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
35.85863312 2
 
0.1%
35.91678768 2
 
0.1%
35.82043092 1
 
< 0.1%
35.71432152 1
 
< 0.1%
36.0040709 1
 
< 0.1%
36.0039898 1
 
< 0.1%
36.00414125 1
 
< 0.1%
36.00397817 1
 
< 0.1%
35.74837624 1
 
< 0.1%
35.74879912 1
 
< 0.1%
Other values (2782) 2782
99.6%
ValueCountFrequency (%)
35.58787971 1
< 0.1%
35.58802884 1
< 0.1%
35.59366113 1
< 0.1%
35.59382187 1
< 0.1%
35.59397733 1
< 0.1%
35.5976497 1
< 0.1%
35.59772402 1
< 0.1%
35.6010317 1
< 0.1%
35.6011782 1
< 0.1%
35.60420968 1
< 0.1%
ValueCountFrequency (%)
36.0929158 1
< 0.1%
36.09287326 1
< 0.1%
36.0928611 1
< 0.1%
36.09278434 1
< 0.1%
36.09179828 1
< 0.1%
36.09141748 1
< 0.1%
36.0884646 1
< 0.1%
36.0882845 1
< 0.1%
36.0835839 1
< 0.1%
36.0834754 1
< 0.1%

경도
Real number (ℝ)

Distinct2787
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.13199
Minimum126.91482
Maximum127.34194
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size24.7 KiB
2024-03-15T02:08:23.406857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.91482
5-th percentile127.01293
Q1127.09022
median127.12736
Q3127.16752
95-th percentile127.26267
Maximum127.34194
Range0.4271218
Interquartile range (IQR)0.077304

Descriptive statistics

Standard deviation0.071012167
Coefficient of variation (CV)0.00055857038
Kurtosis0.10238419
Mean127.13199
Median Absolute Deviation (MAD)0.03936035
Skewness0.22642766
Sum355206.79
Variance0.0050427278
MonotonicityNot monotonic
2024-03-15T02:08:23.843653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.279984 2
 
0.1%
126.9969262 2
 
0.1%
127.157249 2
 
0.1%
127.1660291 2
 
0.1%
127.1392574 2
 
0.1%
127.0957418 2
 
0.1%
127.1677861 2
 
0.1%
127.2222981 1
 
< 0.1%
127.2204006 1
 
< 0.1%
127.2205598 1
 
< 0.1%
Other values (2777) 2777
99.4%
ValueCountFrequency (%)
126.9148188 1
< 0.1%
126.9504653 1
< 0.1%
126.9506806 1
< 0.1%
126.9558342 1
< 0.1%
126.9559817 1
< 0.1%
126.9587505 1
< 0.1%
126.958964 1
< 0.1%
126.9589772 1
< 0.1%
126.9591361 1
< 0.1%
126.9606283 1
< 0.1%
ValueCountFrequency (%)
127.3419406 1
< 0.1%
127.341871 1
< 0.1%
127.3377429 1
< 0.1%
127.3376235 1
< 0.1%
127.3285249 1
< 0.1%
127.3276355 1
< 0.1%
127.3274573 1
< 0.1%
127.3189715 1
< 0.1%
127.3188604 1
< 0.1%
127.3173265 1
< 0.1%

Interactions

2024-03-15T02:08:15.127225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:08:13.909586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:08:14.468032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:08:15.340036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:08:14.131019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:08:14.641041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:08:15.502206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:08:14.299950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T02:08:14.853269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T02:08:24.112543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정류장번호위도경도
정류장번호1.0000.8090.384
위도0.8091.0000.519
경도0.3840.5191.000
2024-03-15T02:08:24.355764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정류장번호위도경도
정류장번호1.0000.5710.095
위도0.5711.0000.255
경도0.0950.2551.000

Missing values

2024-03-15T02:08:15.710560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T02:08:15.941753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

정류장ID정류장명정류장번호위도경도
0JUB312100690BYC3069035.820431127.007781
1JUB312100693BYC3069335.820713127.00797
2JUB306101433cbs기독교방송3143235.874691127.055197
3JUB305100683CTS전북방송3068335.819997127.146407
4JUB305032322JTV전주방송3621435.845972127.082964
5JUB305032323JTV전주방송3621535.846052127.082676
6JUB306101050KT북전주지점.덕진광장3104935.842929127.123182
7JUB306101053KT북전주지점.덕진광장3105235.843102127.123084
8JUB305100912KT정보센터3091235.833751127.118557
9JUB312032801LS엠트론엔진공장3627035.96984127.118182
정류장ID정류장명정류장번호위도경도
2784JUB305100038흑석골입구3203835.804199127.149954
2785JUB312032244흑석골입구3224435.805298127.151115
2786JUB305032648흑석골종점3658235.795091127.148183
2787JUB305032649흑석골종점3658335.795163127.148183
2788JUB305100429흑석대승푸른맨션3042935.799535127.150721
2789JUB305100431흑석대승푸른맨션3043135.799576127.150543
2790JUB305100460흑석송원아파트3046035.801901127.150376
2791JUB305100463흑석송원아파트3046335.802335127.150055
2792JUB306100817흥국생명3081735.82812127.14192
2793JUB305100666흥국화재3066635.819177127.14684