Overview

Dataset statistics

Number of variables: 4
Number of observations: 847
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 28.3 KiB
Average record size in memory: 34.2 B

Variable types

Numeric: 2
Text: 2

Dataset

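Description: Installation status of the bus information terminals within Changwon-si, Gyeongsangnam-do (stop name, address, ID, mobile ID), provided for reference.
Author: Changwon-si, Gyeongsangnam-do (경상남도 창원시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15035291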

Alerts

ID is highly overall correlated with 모바일ID (High correlation)
모바일ID is highly overall correlated with ID (High correlation)
ID has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 23:16:58.862375
Analysis finished: 2023-12-10 23:16:59.715333
Duration: 0.85 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 847
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1867.2774
Minimum: 1001
Maximum: 9996
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.6 KiB

Quantile statistics

Minimum: 1001
5th percentile: 1044.3
Q1: 1214.5
Median: 2033
Q3: 2248.5
95th percentile: 3095.7
Maximum: 9996
Range: 8995
Interquartile range (IQR): 1034

Descriptive statistics

Standard deviation: 745.24317
Coefficient of variation (CV): 0.39910683
Kurtosis: 15.288632
Mean: 1867.2774
Median Absolute Deviation (MAD): 739
Skewness: 1.8673527
Sum: 1581584
Variance: 555387.39
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
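
To make the fixed-size binning concrete, here is a minimal sketch that sorts the 20 ID values shown in this report's sample rows into 50 equal-width bins between the smallest and largest value. The helper name `fixed_width_histogram` is hypothetical, and the sketch is not necessarily how ydata-profiling or Matplotlib computes the plotted histogram; it only illustrates the idea.

```python
def fixed_width_histogram(values, bins):
    """Count values in `bins` equal-width intervals spanning [min, max]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: put everything in the first bin.
        return 0.0, [len(values)] + [0] * (bins - 1)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum sits on the right edge, so clamp it into the last bin.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return width, counts


# ID values from the first and last sample rows of this report (20 of 847 rows).
sample_ids = list(range(1001, 1011)) + list(range(3129, 3138)) + [9996]

width, counts = fixed_width_histogram(sample_ids, bins=50)
non_empty = {i: c for i, c in enumerate(counts) if c}
print(round(width, 1), non_empty)
```

Because the single value 9996 sets the upper edge, each bin is about 179.9 wide. The second-largest ID in the report is 3137, which falls in bin 11, so every ID except 9996 lands in the first twelve bins.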
Common values

Value | Count | Frequency (%)
1001 | 1 | 0.1%
2167 | 1 | 0.1%
2169 | 1 | 0.1%
2170 | 1 | 0.1%
2171 | 1 | 0.1%
2172 | 1 | 0.1%
2173 | 1 | 0.1%
2174 | 1 | 0.1%
2175 | 1 | 0.1%
2176 | 1 | 0.1%
Other values (837) | 837 | 98.8%

Minimum 10 values

Value | Count | Frequency (%)
1001 | 1 | 0.1%
1002 | 1 | 0.1%
1003 | 1 | 0.1%
1004 | 1 | 0.1%
1005 | 1 | 0.1%
1006 | 1 | 0.1%
1007 | 1 | 0.1%
1008 | 1 | 0.1%
1009 | 1 | 0.1%
1010 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
9996 | 1 | 0.1%
3137 | 1 | 0.1%
3136 | 1 | 0.1%
3135 | 1 | 0.1%
3134 | 1 | 0.1%
3133 | 1 | 0.1%
3132 | 1 | 0.1%
3131 | 1 | 0.1%
3130 | 1 | 0.1%
3129 | 1 | 0.1%

모바일ID
Real number (ℝ)

HIGH CORRELATION 

Distinct: 836
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 293392.04
Minimum: 102503
Maximum: 630346
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.6 KiB

Quantile statistics

Minimum: 102503
5th percentile: 104713.3
Q1: 119122.5
Median: 300628
Q3: 408606.5
95th percentile: 519819.7
Maximum: 630346
Range: 527843
Interquartile range (IQR): 289484

Descriptive statistics

Standard deviation: 154065.09
Coefficient of variation (CV): 0.5251168
Kurtosis: -1.1576271
Mean: 293392.04
Median Absolute Deviation (MAD): 119989
Skewness: 0.27458253
Sum: 2.4850306 × 10^8
Variance: 2.3736051 × 10^10
Monotonicity: Not monotonic
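
Several of the 모바일ID figures above are derived from one another, which gives a quick consistency check. The sketch below recomputes the coefficient of variation, variance, sum, IQR, and range from the reported mean, standard deviation, quartiles, and extremes. It relies only on the textbook definitions (CV = standard deviation / mean, variance = standard deviation squared, sum = mean × count), not on ydata-profiling internals, and the inputs are the rounded values printed in this report, so small rounding differences are expected.

```python
import math

# Values copied from this report (모바일ID); n is the number of observations.
n = 847
mean = 293392.04
std = 154065.09
q1, q3 = 119122.5, 408606.5
minimum, maximum = 102503, 630346

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
total = mean * n                 # sum of all values
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(f"CV       = {cv:.7f}")
print(f"Variance = {variance:.7e}")
print(f"Sum      = {total:.7e}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")

# Each recomputed value should agree with the reported one up to rounding.
assert math.isclose(cv, 0.5251168, rel_tol=1e-6)
assert math.isclose(variance, 2.3736051e10, rel_tol=1e-6)
assert math.isclose(total, 2.4850306e8, rel_tol=1e-6)
```

The recomputed sum and variance are on the order of 10^8 and 10^10 respectively, matching the magnitudes reported for this variable.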
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
630020 | 3 | 0.4%
518122 | 2 | 0.2%
400112 | 2 | 0.2%
114225 | 2 | 0.2%
300636 | 2 | 0.2%
300118 | 2 | 0.2%
408402 | 2 | 0.2%
118815 | 2 | 0.2%
218824 | 2 | 0.2%
115316 | 2 | 0.2%
Other values (826) | 826 | 97.5%

Minimum 10 values

Value | Count | Frequency (%)
102503 | 1 | 0.1%
103103 | 1 | 0.1%
103104 | 1 | 0.1%
103105 | 1 | 0.1%
103106 | 1 | 0.1%
103107 | 1 | 0.1%
103108 | 1 | 0.1%
103109 | 1 | 0.1%
103115 | 1 | 0.1%
103118 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
630346 | 1 | 0.1%
630238 | 1 | 0.1%
630224 | 1 | 0.1%
630218 | 1 | 0.1%
630216 | 1 | 0.1%
630215 | 1 | 0.1%
630214 | 1 | 0.1%
630213 | 1 | 0.1%
630204 | 1 | 0.1%
630081 | 1 | 0.1%

설치정류소명
Text

Distinct: 833
Distinct (%): 98.3%
Missing: 0
Missing (%): 0.0%
Memory size: 6.7 KiB

Length

Max length: 22
Median length: 17
Mean length: 8.837072
Min length: 2

Characters and Unicode

Total characters: 7485
Distinct characters: 369
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
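
As an illustration of these character properties, the sketch below uses Python's standard `unicodedata` module to count the general category of each character in one stop name from this report's sample rows. The `CATEGORY_NAMES` mapping and the `category_counts` helper are hypothetical names introduced here; the report itself may derive its category, script, and block counts differently.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general-category codes seen in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
}


def category_counts(text):
    """Return {category name: number of characters} for `text`."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    return {CATEGORY_NAMES.get(code, code): n for code, n in codes.items()}


stop_name = "창원역(맞은편PAT앞)"  # first sample row of 설치정류소명
counts = category_counts(stop_name)
print(counts)
```

Hangul syllables are classified as Other Letter, which is why that category dominates the counts for both text variables.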

Unique

Unique: 819
Unique (%): 96.7%
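
The report lists both a Distinct and a Unique count for this variable. The sketch below shows the difference on a small made-up list, assuming Distinct means the number of different values and Unique means the number of values that occur exactly once. The list contents and the helper name `distinct_and_unique` are illustrative and are not taken from the dataset.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Toy example: one name is repeated, two names occur once.
names = ["정우상가", "정우상가", "지귀상가", "은아아파트"]
distinct, unique = distinct_and_unique(names)
print(distinct, unique)
```

Under that reading, the figures for this variable (833 distinct, 819 unique, 847 rows) mean that 14 stop names appear more than once and together account for the remaining 28 rows.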

Sample

1st row: 창원역(맞은편PAT앞)
2nd row: 도계동.서부스포츠센터
3rd row: 도계동(창원씨엘여성병원)
4th row: 도계동만남의광장(김해행버스정류소)
5th row: 명서다리.명곡교회(맞은편)

Value | Count | Frequency (%)
더샵센트럴파크 | 3 | 0.4%
lh자은프라임 | 2 | 0.2%
영남주차장 | 2 | 0.2%
마산역(동마산병원앞 | 2 | 0.2%
마산합포구청.의료원 | 2 | 0.2%
구암2동주민센터 | 2 | 0.2%
정우상가 | 2 | 0.2%
창원종합터미널 | 2 | 0.2%
의창구청.서부경찰서원흥사입구 | 2 | 0.2%
중리역(건너편 | 2 | 0.2%
Other values (830) | 835 | 97.5%

Most occurring characters

ValueCountFrequency (%)
) 439
 
5.9%
( 439
 
5.9%
222
 
3.0%
203
 
2.7%
184
 
2.5%
183
 
2.4%
179
 
2.4%
177
 
2.4%
142
 
1.9%
128
 
1.7%
Other values (359) 5189
69.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6406 | 85.6%
Close Punctuation | 439 | 5.9%
Open Punctuation | 439 | 5.9%
Decimal Number | 91 | 1.2%
Uppercase Letter | 53 | 0.7%
Other Punctuation | 34 | 0.5%
Lowercase Letter | 14 | 0.2%
Space Separator | 9 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
222
 
3.5%
203
 
3.2%
184
 
2.9%
183
 
2.9%
179
 
2.8%
177
 
2.8%
142
 
2.2%
128
 
2.0%
121
 
1.9%
118
 
1.8%
Other values (328) 4749
74.1%
Uppercase Letter
ValueCountFrequency (%)
T 10
18.9%
G 7
13.2%
L 7
13.2%
A 6
11.3%
S 5
9.4%
P 5
9.4%
H 4
 
7.5%
K 4
 
7.5%
X 2
 
3.8%
M 1
 
1.9%
Other values (2) 2
 
3.8%
Decimal Number
ValueCountFrequency (%)
1 33
36.3%
2 27
29.7%
3 14
15.4%
9 5
 
5.5%
5 5
 
5.5%
7 4
 
4.4%
0 2
 
2.2%
4 1
 
1.1%
Lowercase Letter
ValueCountFrequency (%)
t 5
35.7%
k 3
21.4%
e 2
 
14.3%
h 2
 
14.3%
s 1
 
7.1%
g 1
 
7.1%
Other Punctuation
ValueCountFrequency (%)
. 32
94.1%
& 2
 
5.9%
Close Punctuation
ValueCountFrequency (%)
) 439
100.0%
Open Punctuation
ValueCountFrequency (%)
( 439
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6406 | 85.6%
Common | 1012 | 13.5%
Latin | 67 | 0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
222
 
3.5%
203
 
3.2%
184
 
2.9%
183
 
2.9%
179
 
2.8%
177
 
2.8%
142
 
2.2%
128
 
2.0%
121
 
1.9%
118
 
1.8%
Other values (328) 4749
74.1%
Latin
ValueCountFrequency (%)
T 10
14.9%
G 7
10.4%
L 7
10.4%
A 6
9.0%
S 5
7.5%
P 5
7.5%
t 5
7.5%
H 4
 
6.0%
K 4
 
6.0%
k 3
 
4.5%
Other values (8) 11
16.4%
Common
ValueCountFrequency (%)
) 439
43.4%
( 439
43.4%
1 33
 
3.3%
. 32
 
3.2%
2 27
 
2.7%
3 14
 
1.4%
9
 
0.9%
9 5
 
0.5%
5 5
 
0.5%
7 4
 
0.4%
Other values (3) 5
 
0.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6406 | 85.6%
ASCII | 1079 | 14.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 439
40.7%
( 439
40.7%
1 33
 
3.1%
. 32
 
3.0%
2 27
 
2.5%
3 14
 
1.3%
T 10
 
0.9%
9
 
0.8%
G 7
 
0.6%
L 7
 
0.6%
Other values (21) 62
 
5.7%
Hangul
ValueCountFrequency (%)
222
 
3.5%
203
 
3.2%
184
 
2.9%
183
 
2.9%
179
 
2.8%
177
 
2.8%
142
 
2.2%
128
 
2.0%
121
 
1.9%
118
 
1.8%
Other values (328) 4749
74.1%

주소
Text

Distinct: 802
Distinct (%): 94.7%
Missing: 0
Missing (%): 0.0%
Memory size: 6.7 KiB

Length

Max length: 21
Median length: 18
Mean length: 14.167651
Min length: 9

Characters and Unicode

Total characters: 12000
Distinct characters: 118
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 759
Unique (%): 89.6%

Sample

1st row: 의창구 동정동 156-3
2nd row: 의창구 도계동 465-3
3rd row: 의창구 도계동 333-4
4th row: 의창구 도계동 305-7
5th row: 의창구 명서동 131-4

Value | Count | Frequency (%)
의창구 | 238 | 9.0%
마산회원구 | 179 | 6.7%
성산구 | 157 | 5.9%
마산합포구 | 140 | 5.3%
진해구 | 133 | 5.0%
내서읍 | 44 | 1.7%
팔용동 | 34 | 1.3%
사림동 | 25 | 0.9%
북면 | 23 | 0.9%
동읍 | 23 | 0.9%
Other values (855) | 1659 | 62.5%

Most occurring characters

ValueCountFrequency (%)
2334
19.4%
869
 
7.2%
802
 
6.7%
1 696
 
5.8%
- 618
 
5.1%
535
 
4.5%
2 415
 
3.5%
3 343
 
2.9%
324
 
2.7%
4 310
 
2.6%
Other values (108) 4754
39.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5985 | 49.9%
Decimal Number | 3063 | 25.5%
Space Separator | 2334 | 19.4%
Dash Punctuation | 618 | 5.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
869
14.5%
802
 
13.4%
535
 
8.9%
324
 
5.4%
238
 
4.0%
238
 
4.0%
224
 
3.7%
213
 
3.6%
212
 
3.5%
158
 
2.6%
Other values (96) 2172
36.3%
Decimal Number
ValueCountFrequency (%)
1 696
22.7%
2 415
13.5%
3 343
11.2%
4 310
10.1%
5 272
 
8.9%
6 229
 
7.5%
0 219
 
7.1%
7 210
 
6.9%
8 187
 
6.1%
9 182
 
5.9%
Space Separator
ValueCountFrequency (%)
2334
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 618
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 6015 | 50.1%
Hangul | 5985 | 49.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
869
14.5%
802
 
13.4%
535
 
8.9%
324
 
5.4%
238
 
4.0%
238
 
4.0%
224
 
3.7%
213
 
3.6%
212
 
3.5%
158
 
2.6%
Other values (96) 2172
36.3%
Common
ValueCountFrequency (%)
2334
38.8%
1 696
 
11.6%
- 618
 
10.3%
2 415
 
6.9%
3 343
 
5.7%
4 310
 
5.2%
5 272
 
4.5%
6 229
 
3.8%
0 219
 
3.6%
7 210
 
3.5%
Other values (2) 369
 
6.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 6015 | 50.1%
Hangul | 5985 | 49.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2334
38.8%
1 696
 
11.6%
- 618
 
10.3%
2 415
 
6.9%
3 343
 
5.7%
4 310
 
5.2%
5 272
 
4.5%
6 229
 
3.8%
0 219
 
3.6%
7 210
 
3.5%
Other values (2) 369
 
6.1%
Hangul
ValueCountFrequency (%)
869
14.5%
802
 
13.4%
535
 
8.9%
324
 
5.4%
238
 
4.0%
238
 
4.0%
224
 
3.7%
213
 
3.6%
212
 
3.5%
158
 
2.6%
Other values (96) 2172
36.3%

Interactions

[Four interaction plots for the numeric variables ID and 모바일ID]

Correlations

Variable | ID | 모바일ID
ID | 1.000 | 0.888
모바일ID | 0.888 | 1.000

Variable | ID | 모바일ID
ID | 1.000 | 0.810
모바일ID | 0.810 | 1.000
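
The report does not say which coefficient each of the two matrices uses. As a hedged illustration of how such pairwise values can be obtained, the sketch below computes Pearson and Spearman correlations, two common choices, for ID and 모바일ID using only the 20 sample rows listed in this report. The helper names `pearson`, `ranks`, and `spearman` are hypothetical, and because only 20 of the 847 rows are used, the results are not expected to reproduce 0.888 or 0.810.

```python
from math import sqrt
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation: covariance divided by the product of spreads."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


# ID and 모바일ID from the first and last sample rows of this report.
ids = list(range(1001, 1011)) + list(range(3129, 3138)) + [9996]
mobile_ids = [
    115302, 114202, 114201, 115316, 114210, 114209, 119303, 119111, 119112, 114227,
    503208, 503209, 630204, 522316, 513810, 510402, 513005, 630346, 518122, 300636,
]

print("Pearson :", round(pearson(ids, mobile_ids), 3))
print("Spearman:", round(spearman(ids, mobile_ids), 3))
```

Spearman works on ranks, so it is less affected by the single large ID 9996 than Pearson is, which is one reason two correlation measures on the same pair of columns can differ.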

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | ID | 모바일ID | 설치정류소명 | 주소
0 | 1001 | 115302 | 창원역(맞은편PAT앞) | 의창구 동정동 156-3
1 | 1002 | 114202 | 도계동.서부스포츠센터 | 의창구 도계동 465-3
2 | 1003 | 114201 | 도계동(창원씨엘여성병원) | 의창구 도계동 333-4
3 | 1004 | 115316 | 도계동만남의광장(김해행버스정류소) | 의창구 도계동 305-7
4 | 1005 | 114210 | 명서다리.명곡교회(맞은편) | 의창구 명서동 131-4
5 | 1006 | 114209 | 명서다리.명곡교회 | 의창구 명서동 82-10
6 | 1007 | 119303 | 흥한웰가아파트 | 의창구 서상동 272-10
7 | 1008 | 119111 | 지귀상가 | 의창구 봉곡동 61-14
8 | 1009 | 119112 | 지귀상가(건너편) | 의창구 봉곡동 61-4
9 | 1010 | 114227 | 은아아파트 | 의창구 신월동 93

Last rows

Index | ID | 모바일ID | 설치정류소명 | 주소
837 | 3129 | 503208 | 냉천중학교 | 진해구 자은동 109
838 | 3130 | 503209 | LH자은프라임 | 진해구 자은동 124
839 | 3131 | 630204 | 덕산초등학교(앞) | 진해구 자은동 807-3
840 | 3132 | 522316 | 아래장천(침례교회맞은편) | 진해구 장천동 713-3
841 | 3133 | 513810 | 의곡마을 | 진해구 두동 1232
842 | 3134 | 510402 | 안청초교맞은편 | 진해구 안골동 389
843 | 3135 | 513005 | 반짓골(앞) | 진해구 용원동 1307
844 | 3136 | 630346 | 부영13단지 | 진해구 용원동 1362
845 | 3137 | 518122 | 롯데마트(석동주민센터맞은편) | 진해구 석동 264
846 | 9996 | 300636 | 정우상가 | 의창구 용호동 505