Overview

Dataset statistics

Number of variables4
Number of observations10000
Missing cells10
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory410.2 KiB
Average record size in memory42.0 B

Variable types

Numeric2
Text2

Dataset

Description빅데이터시스템_창원시 버스정보 자료
Author경상남도 창원시
URLhttps://www.data.go.kr/data/15063982/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-11 22:46:01.914357
Analysis finished2023-12-11 22:46:03.300424
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7614.8164
Minimum2
Maximum15194
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T07:46:03.426224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile779.9
Q13812.5
median7628
Q311386.25
95-th percentile14479.15
Maximum15194
Range15192
Interquartile range (IQR)7573.75

Descriptive statistics

Standard deviation4391.6113
Coefficient of variation (CV)0.57671926
Kurtosis-1.1966499
Mean7614.8164
Median Absolute Deviation (MAD)3789
Skewness-0.0028854286
Sum76148164
Variance19286250
MonotonicityNot monotonic
2023-12-12T07:46:03.678726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9068 1
 
< 0.1%
1307 1
 
< 0.1%
2 1
 
< 0.1%
14271 1
 
< 0.1%
4307 1
 
< 0.1%
653 1
 
< 0.1%
9668 1
 
< 0.1%
4752 1
 
< 0.1%
574 1
 
< 0.1%
7832 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2 1
< 0.1%
4 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
14 1
< 0.1%
ValueCountFrequency (%)
15194 1
< 0.1%
15193 1
< 0.1%
15192 1
< 0.1%
15191 1
< 0.1%
15190 1
< 0.1%
15189 1
< 0.1%
15185 1
< 0.1%
15184 1
< 0.1%
15182 1
< 0.1%
15180 1
< 0.1%

노선
Text

Distinct169
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T07:46:04.223593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length2.8693
Min length1

Characters and Unicode

Total characters28693
Distinct characters14
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row530-1
2nd rowJan-65
3rd row75
4th rowJan-65
5th row11
ValueCountFrequency (%)
110 112
 
1.1%
17 109
 
1.1%
116 103
 
1.0%
40 102
 
1.0%
41 102
 
1.0%
111 101
 
1.0%
105 100
 
1.0%
27 97
 
1.0%
752 96
 
1.0%
30 96
 
1.0%
Other values (159) 8982
89.8%
2023-12-12T07:46:04.985958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5925
20.6%
2 3960
13.8%
0 3836
13.4%
5 3430
12.0%
3 3093
10.8%
7 2175
 
7.6%
6 2003
 
7.0%
4 1447
 
5.0%
- 1177
 
4.1%
8 432
 
1.5%
Other values (4) 1215
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 26706
93.1%
Dash Punctuation 1177
 
4.1%
Lowercase Letter 540
 
1.9%
Uppercase Letter 270
 
0.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 5925
22.2%
2 3960
14.8%
0 3836
14.4%
5 3430
12.8%
3 3093
11.6%
7 2175
 
8.1%
6 2003
 
7.5%
4 1447
 
5.4%
8 432
 
1.6%
9 405
 
1.5%
Lowercase Letter
ValueCountFrequency (%)
a 270
50.0%
n 270
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 1177
100.0%
Uppercase Letter
ValueCountFrequency (%)
J 270
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 27883
97.2%
Latin 810
 
2.8%

Most frequent character per script

Common
ValueCountFrequency (%)
1 5925
21.2%
2 3960
14.2%
0 3836
13.8%
5 3430
12.3%
3 3093
11.1%
7 2175
 
7.8%
6 2003
 
7.2%
4 1447
 
5.2%
- 1177
 
4.2%
8 432
 
1.5%
Latin
ValueCountFrequency (%)
J 270
33.3%
a 270
33.3%
n 270
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 28693
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5925
20.6%
2 3960
13.8%
0 3836
13.4%
5 3430
12.0%
3 3093
10.8%
7 2175
 
7.6%
6 2003
 
7.0%
4 1447
 
5.0%
- 1177
 
4.1%
8 432
 
1.5%
Other values (4) 1215
 
4.2%

정류장코드
Real number (ℝ)

Distinct2193
Distinct (%)21.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4843809.7
Minimum246
Maximum9129320
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T07:46:05.225842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum246
5-th percentile4801070
Q14805720.5
median4812760
Q34818550
95-th percentile4826670
Maximum9129320
Range9129074
Interquartile range (IQR)12829.5

Descriptive statistics

Standard deviation426542.09
Coefficient of variation (CV)0.088059217
Kurtosis100.84194
Mean4843809.7
Median Absolute Deviation (MAD)6548
Skewness6.92803
Sum4.8438097 × 1010
Variance1.8193815 × 1011
MonotonicityNot monotonic
2023-12-12T07:46:05.864773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4814650 42
 
0.4%
4808190 40
 
0.4%
4813800 40
 
0.4%
4803990 40
 
0.4%
4813790 39
 
0.4%
4813590 39
 
0.4%
4820460 36
 
0.4%
4804180 36
 
0.4%
4804000 36
 
0.4%
4819720 36
 
0.4%
Other values (2183) 9616
96.2%
ValueCountFrequency (%)
246 1
< 0.1%
411 1
< 0.1%
528 1
< 0.1%
574 1
< 0.1%
640 1
< 0.1%
673 1
< 0.1%
735 1
< 0.1%
962 1
< 0.1%
966 1
< 0.1%
1136 1
< 0.1%
ValueCountFrequency (%)
9129320 1
< 0.1%
9129300 1
< 0.1%
9129280 1
< 0.1%
9129240 1
< 0.1%
9127520 1
< 0.1%
9126540 2
< 0.1%
9126530 1
< 0.1%
9125593 1
< 0.1%
9124993 1
< 0.1%
9124880 1
< 0.1%
Distinct1322
Distinct (%)13.2%
Missing10
Missing (%)0.1%
Memory size156.2 KiB
2023-12-12T07:46:06.185321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15
Mean length5.8524525
Min length2

Characters and Unicode

Total characters58466
Distinct characters451
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique235 ?
Unique (%)2.4%

Sample

1st row고속버스터미널
2nd row문화동
3rd row고속버스터미널
4th row합성동
5th row임마뉴엘교회
ValueCountFrequency (%)
마산합포구청.의료원 81
 
0.8%
문화동 79
 
0.8%
경동메르빌아파트 76
 
0.8%
kt마산지사 74
 
0.7%
경남데파트 73
 
0.7%
롯데백화점앞어시장 71
 
0.7%
반월민원센터 71
 
0.7%
신세계백화점 65
 
0.7%
중부경찰서 65
 
0.7%
경남대남부터미널종점 63
 
0.6%
Other values (1312) 9272
92.8%
2023-12-12T07:46:06.782127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2203
 
3.8%
1649
 
2.8%
1453
 
2.5%
1453
 
2.5%
1443
 
2.5%
1398
 
2.4%
1205
 
2.1%
. 1177
 
2.0%
1149
 
2.0%
1076
 
1.8%
Other values (441) 44260
75.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 55920
95.6%
Other Punctuation 1184
 
2.0%
Decimal Number 776
 
1.3%
Uppercase Letter 332
 
0.6%
Lowercase Letter 220
 
0.4%
Close Punctuation 17
 
< 0.1%
Open Punctuation 15
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2203
 
3.9%
1649
 
2.9%
1453
 
2.6%
1453
 
2.6%
1443
 
2.6%
1398
 
2.5%
1205
 
2.2%
1149
 
2.1%
1076
 
1.9%
888
 
1.6%
Other values (410) 42003
75.1%
Uppercase Letter
ValueCountFrequency (%)
S 83
25.0%
T 69
20.8%
X 57
17.2%
H 30
 
9.0%
D 17
 
5.1%
K 14
 
4.2%
L 13
 
3.9%
N 10
 
3.0%
B 10
 
3.0%
A 9
 
2.7%
Other values (3) 20
 
6.0%
Decimal Number
ValueCountFrequency (%)
1 308
39.7%
3 154
19.8%
2 117
 
15.1%
5 71
 
9.1%
9 51
 
6.6%
7 25
 
3.2%
4 25
 
3.2%
6 23
 
3.0%
8 2
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
t 104
47.3%
k 104
47.3%
s 6
 
2.7%
g 6
 
2.7%
Other Punctuation
ValueCountFrequency (%)
. 1177
99.4%
& 7
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 55922
95.6%
Common 1992
 
3.4%
Latin 552
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2203
 
3.9%
1649
 
2.9%
1453
 
2.6%
1453
 
2.6%
1443
 
2.6%
1398
 
2.5%
1205
 
2.2%
1149
 
2.1%
1076
 
1.9%
888
 
1.6%
Other values (411) 42005
75.1%
Latin
ValueCountFrequency (%)
t 104
18.8%
k 104
18.8%
S 83
15.0%
T 69
12.5%
X 57
10.3%
H 30
 
5.4%
D 17
 
3.1%
K 14
 
2.5%
L 13
 
2.4%
N 10
 
1.8%
Other values (7) 51
9.2%
Common
ValueCountFrequency (%)
. 1177
59.1%
1 308
 
15.5%
3 154
 
7.7%
2 117
 
5.9%
5 71
 
3.6%
9 51
 
2.6%
7 25
 
1.3%
4 25
 
1.3%
6 23
 
1.2%
) 17
 
0.9%
Other values (3) 24
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 55920
95.6%
ASCII 2544
 
4.4%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2203
 
3.9%
1649
 
2.9%
1453
 
2.6%
1453
 
2.6%
1443
 
2.6%
1398
 
2.5%
1205
 
2.2%
1149
 
2.1%
1076
 
1.9%
888
 
1.6%
Other values (410) 42003
75.1%
ASCII
ValueCountFrequency (%)
. 1177
46.3%
1 308
 
12.1%
3 154
 
6.1%
2 117
 
4.6%
t 104
 
4.1%
k 104
 
4.1%
S 83
 
3.3%
5 71
 
2.8%
T 69
 
2.7%
X 57
 
2.2%
Other values (20) 300
 
11.8%
None
ValueCountFrequency (%)
2
100.0%

Interactions

2023-12-12T07:46:02.705143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:46:02.391365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:46:02.871727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T07:46:02.556168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T07:46:06.909611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번정류장코드
연번1.0000.079
정류장코드0.0791.000
2023-12-12T07:46:07.032848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번정류장코드
연번1.0000.094
정류장코드0.0941.000

Missing values

2023-12-12T07:46:03.081749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T07:46:03.230121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번노선정류장코드정류장명
90679068530-14819100고속버스터미널
93879388Jan-654813800문화동
1279512796754819100고속버스터미널
93459346Jan-654800910합성동
1139211393114805452임마뉴엘교회
966196622114813192봉곡동상가입구
89478948304814272명서초등학교
14633146345064826272동민정공
87988799224801812팔룡동행정복지센터
82648265104816552대천
연번노선정류장코드정류장명
1420614207114814422명당주유소
518151821084820412S&T중공업
350335041114804802정우상가
1140411405314818612신성
1126711268114820962북면농협.북면온
1831842134804612가음정럭키아파트
334133422664814730마산역.365병원
649965001154801802팔룡주구운동장
65786579507-14801322경남선관위
1496614967504815200두척