Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells6
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory410.2 KiB
Average record size in memory42.0 B

Variable types

Numeric2
Text2

Dataset

Description빅데이터시스템_창원시 버스정보 자료
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15063982

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2024-04-19 06:50:09.456474
Analysis finished2024-04-19 06:50:10.630067
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7599.3688
Minimum1
Maximum15193
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:50:10.694722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile764.9
Q13765.75
median7601
Q311409.5
95-th percentile14444.05
Maximum15193
Range15192
Interquartile range (IQR)7643.75

Descriptive statistics

Standard deviation4396.3437
Coefficient of variation (CV)0.57851433
Kurtosis-1.2076738
Mean7599.3688
Median Absolute Deviation (MAD)3822
Skewness0.0032053302
Sum75993688
Variance19327838
MonotonicityNot monotonic
2024-04-19T15:50:10.827942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
6408 1
 
< 0.1%
1333 1
 
< 0.1%
13316 1
 
< 0.1%
5630 1
 
< 0.1%
12438 1
 
< 0.1%
7516 1
 
< 0.1%
9314 1
 
< 0.1%
5186 1
 
< 0.1%
12951 1
 
< 0.1%
14781 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
11 1
< 0.1%
13 1
< 0.1%
ValueCountFrequency (%)
15193 1
< 0.1%
15192 1
< 0.1%
15190 1
< 0.1%
15189 1
< 0.1%
15188 1
< 0.1%
15187 1
< 0.1%
15186 1
< 0.1%
15185 1
< 0.1%
15184 1
< 0.1%
15180 1
< 0.1%

노선
Text

Distinct169
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-19T15:50:11.161469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length2.8596
Min length1

Characters and Unicode

Total characters28596
Distinct characters14
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row33
2nd row302
3rd row762
4th rowJan-73
5th row75
ValueCountFrequency (%)
250 106
 
1.1%
105 106
 
1.1%
110 105
 
1.1%
27 105
 
1.1%
41 103
 
1.0%
116 102
 
1.0%
22 100
 
1.0%
111 98
 
1.0%
17 97
 
1.0%
109 97
 
1.0%
Other values (159) 8981
89.8%
2024-04-19T15:50:11.698259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5896
20.6%
2 3972
13.9%
0 3783
13.2%
5 3422
12.0%
3 3028
10.6%
7 2222
 
7.8%
6 1985
 
6.9%
4 1445
 
5.1%
- 1141
 
4.0%
9 448
 
1.6%
Other values (4) 1254
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 26636
93.1%
Dash Punctuation 1141
 
4.0%
Lowercase Letter 546
 
1.9%
Uppercase Letter 273
 
1.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 5896
22.1%
2 3972
14.9%
0 3783
14.2%
5 3422
12.8%
3 3028
11.4%
7 2222
 
8.3%
6 1985
 
7.5%
4 1445
 
5.4%
9 448
 
1.7%
8 435
 
1.6%
Lowercase Letter
ValueCountFrequency (%)
a 273
50.0%
n 273
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 1141
100.0%
Uppercase Letter
ValueCountFrequency (%)
J 273
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 27777
97.1%
Latin 819
 
2.9%

Most frequent character per script

Common
ValueCountFrequency (%)
1 5896
21.2%
2 3972
14.3%
0 3783
13.6%
5 3422
12.3%
3 3028
10.9%
7 2222
 
8.0%
6 1985
 
7.1%
4 1445
 
5.2%
- 1141
 
4.1%
9 448
 
1.6%
Latin
ValueCountFrequency (%)
J 273
33.3%
a 273
33.3%
n 273
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 28596
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5896
20.6%
2 3972
13.9%
0 3783
13.2%
5 3422
12.0%
3 3028
10.6%
7 2222
 
7.8%
6 1985
 
6.9%
4 1445
 
5.1%
- 1141
 
4.0%
9 448
 
1.6%
Other values (4) 1254
 
4.4%

정류장코드
Real number (ℝ)

Distinct2172
Distinct (%)21.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4843901
Minimum246
Maximum9129300
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-19T15:50:11.856239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum246
5-th percentile4801070
Q14805412
median4812642
Q34818560
95-th percentile4826662.4
Maximum9129300
Range9129054
Interquartile range (IQR)13148

Descriptive statistics

Standard deviation411579.74
Coefficient of variation (CV)0.084968652
Kurtosis107.49805
Mean4843901
Median Absolute Deviation (MAD)6651
Skewness7.8610814
Sum4.843901 × 1010
Variance1.6939788 × 1011
MonotonicityNot monotonic
2024-04-19T15:50:11.986440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4808190 42
 
0.4%
4819610 42
 
0.4%
4804000 42
 
0.4%
4803990 40
 
0.4%
4813610 38
 
0.4%
4819720 38
 
0.4%
4813790 38
 
0.4%
4813800 38
 
0.4%
4804180 36
 
0.4%
4813590 36
 
0.4%
Other values (2162) 9610
96.1%
ValueCountFrequency (%)
246 1
 
< 0.1%
297 1
 
< 0.1%
574 1
 
< 0.1%
640 1
 
< 0.1%
735 1
 
< 0.1%
773 1
 
< 0.1%
966 1
 
< 0.1%
1517 1
 
< 0.1%
4800012 16
0.2%
4800022 3
 
< 0.1%
ValueCountFrequency (%)
9129300 1
< 0.1%
9129280 1
< 0.1%
9129240 1
< 0.1%
9126540 2
< 0.1%
9126530 1
< 0.1%
9126122 1
< 0.1%
9126092 1
< 0.1%
9125593 1
< 0.1%
9125213 1
< 0.1%
9124993 1
< 0.1%
Distinct1322
Distinct (%)13.2%
Missing6
Missing (%)0.1%
Memory size156.2 KiB
2024-04-19T15:50:12.179714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length5.8808285
Min length2

Characters and Unicode

Total characters58773
Distinct characters455
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique254 ?
Unique (%)2.5%

Sample

1st row마산리
2nd row이동119안전센터
3rd row마산역앞.동마산병원
4th row오서시장
5th row금암마을
ValueCountFrequency (%)
경동메르빌아파트 82
 
0.8%
문화동 76
 
0.8%
경남데파트 75
 
0.8%
반월민원센터 74
 
0.7%
kt마산지사 71
 
0.7%
마산합포구청.의료원 70
 
0.7%
중부경찰서 69
 
0.7%
롯데백화점앞어시장 69
 
0.7%
자유무역지역정문 66
 
0.7%
경남대남부터미널종점 65
 
0.7%
Other values (1312) 9277
92.8%
2024-04-19T15:50:12.536010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2312
 
3.9%
1649
 
2.8%
1480
 
2.5%
1464
 
2.5%
1460
 
2.5%
1404
 
2.4%
. 1220
 
2.1%
1193
 
2.0%
1162
 
2.0%
1097
 
1.9%
Other values (445) 44332
75.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56142
95.5%
Other Punctuation 1224
 
2.1%
Decimal Number 805
 
1.4%
Uppercase Letter 326
 
0.6%
Lowercase Letter 246
 
0.4%
Close Punctuation 13
 
< 0.1%
Open Punctuation 13
 
< 0.1%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2312
 
4.1%
1649
 
2.9%
1480
 
2.6%
1464
 
2.6%
1460
 
2.6%
1404
 
2.5%
1193
 
2.1%
1162
 
2.1%
1097
 
2.0%
862
 
1.5%
Other values (412) 42059
74.9%
Uppercase Letter
ValueCountFrequency (%)
S 78
23.9%
T 68
20.9%
X 58
17.8%
H 26
 
8.0%
L 16
 
4.9%
D 14
 
4.3%
K 13
 
4.0%
A 10
 
3.1%
P 10
 
3.1%
N 9
 
2.8%
Other values (5) 24
 
7.4%
Decimal Number
ValueCountFrequency (%)
1 309
38.4%
3 177
22.0%
2 113
 
14.0%
5 82
 
10.2%
9 47
 
5.8%
7 29
 
3.6%
6 26
 
3.2%
4 19
 
2.4%
8 3
 
0.4%
Lowercase Letter
ValueCountFrequency (%)
k 113
45.9%
t 113
45.9%
g 10
 
4.1%
s 10
 
4.1%
Other Punctuation
ValueCountFrequency (%)
. 1220
99.7%
& 4
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56146
95.5%
Common 2055
 
3.5%
Latin 572
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2312
 
4.1%
1649
 
2.9%
1480
 
2.6%
1464
 
2.6%
1460
 
2.6%
1404
 
2.5%
1193
 
2.1%
1162
 
2.1%
1097
 
2.0%
862
 
1.5%
Other values (413) 42063
74.9%
Latin
ValueCountFrequency (%)
k 113
19.8%
t 113
19.8%
S 78
13.6%
T 68
11.9%
X 58
10.1%
H 26
 
4.5%
L 16
 
2.8%
D 14
 
2.4%
K 13
 
2.3%
g 10
 
1.7%
Other values (9) 63
11.0%
Common
ValueCountFrequency (%)
. 1220
59.4%
1 309
 
15.0%
3 177
 
8.6%
2 113
 
5.5%
5 82
 
4.0%
9 47
 
2.3%
7 29
 
1.4%
6 26
 
1.3%
4 19
 
0.9%
) 13
 
0.6%
Other values (3) 20
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56142
95.5%
ASCII 2627
 
4.5%
None 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2312
 
4.1%
1649
 
2.9%
1480
 
2.6%
1464
 
2.6%
1460
 
2.6%
1404
 
2.5%
1193
 
2.1%
1162
 
2.1%
1097
 
2.0%
862
 
1.5%
Other values (412) 42059
74.9%
ASCII
ValueCountFrequency (%)
. 1220
46.4%
1 309
 
11.8%
3 177
 
6.7%
k 113
 
4.3%
2 113
 
4.3%
t 113
 
4.3%
5 82
 
3.1%
S 78
 
3.0%
T 68
 
2.6%
X 58
 
2.2%
Other values (22) 296
 
11.3%
None
ValueCountFrequency (%)
4
100.0%

Interactions

2024-04-19T15:50:10.042803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:50:09.812484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:50:10.145376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T15:50:09.911577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-19T15:50:12.620201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번정류장코드
연번1.0000.077
정류장코드0.0771.000
2024-04-19T15:50:12.692256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번정류장코드
연번1.0000.101
정류장코드0.1011.000

Missing values

2024-04-19T15:50:10.255600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T15:50:10.595722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번노선정류장코드정류장명
64076408334815022마산리
627162723024811073이동119안전센터
428342847624815940마산역앞.동마산병원
28812882Jan-734823450오서시장
1459014591754823210금암마을
11277112783514814563마천공단입구
857185722504817620내서초등학교
725726714818310국제주유소
206720682144806152유목교
13118131193024804963장천동.이순신리더쉽국제센터
연번노선정류장코드정류장명
10984109852164803002창원시청
234423451704801932트리비앙아파트
6896907044813040봉암동
370337042594800480현동초등학교
65486549774823490정달마을
11169111701134808402여성회관창원관
1188011881314814332명서다리.명곡교회
1454714548544813790문화동
91189119364802322칠성아파트
37603761504815940마산역앞.동마산병원