Overview

Dataset statistics

Number of variables: 4
Number of observations: 564
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 18.9 KiB
Average record size in memory: 34.2 B

Variable types

Categorical: 2
Text: 1
Numeric: 1

Dataset

Description: Bus stop ridership by user type (이용자 유형별 버스정류소 이용 인원 현황)
Author: Jeju Data Hub (제주데이터허브)
URL: https://www.jejudatahub.net/data/view/data/743

Alerts

base_date has constant value "20190101" (Constant)
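
The Constant alert fires when a column holds a single distinct value. The check can be sketched in plain Python as follows. The `rows` list is a hypothetical three-row miniature built from the report's Sample section, and `constant_columns` is an illustrative helper, not part of ydata-profiling.

```python
# Hypothetical miniature of the dataset, built from rows shown in the Sample section.
rows = [
    {"base_date": "20190101", "station_name": "(구)동홍동주민센터", "user_type": "경로", "user_count": 3},
    {"base_date": "20190101", "station_name": "(구)동홍동주민센터", "user_type": "어린이", "user_count": 2},
    {"base_date": "20190101", "station_name": "(구)삼일금고", "user_type": "경로", "user_count": 24},
]


def constant_columns(records):
    """Return {column: value} for every column with exactly one distinct value."""
    distinct = {}
    for record in records:
        for column, value in record.items():
            distinct.setdefault(column, set()).add(value)
    return {col: next(iter(vals)) for col, vals in distinct.items() if len(vals) == 1}


constant = constant_columns(rows)
print(constant)
```

A constant column carries no information for analysis within this extract, so it is usually safe to drop it after recording its value.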

Reproduction

Analysis started: 2024-03-13 12:03:06.117561
Analysis finished: 2024-03-13 12:03:06.576511
Duration: 0.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

base_date
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB
20190101: 564

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20190101
2nd row: 20190101
3rd row: 20190101
4th row: 20190101
5th row: 20190101

Common Values

Value | Count | Frequency (%)
20190101 | 564 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
20190101 | 564 | 100.0%

station_name
Text

Distinct: 176
Distinct (%): 31.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB

Length

Max length: 98
Median length: 96
Mean length: 95.156028
Min length: 85

Characters and Unicode

Total characters: 53668
Distinct characters: 227
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
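
As a rough illustration of how characters map to Unicode general categories, the sketch below counts categories with Python's standard `unicodedata` module. The two input strings are station names that appear in this report. The `CATEGORY_NAMES` mapping is an assumption added for readability, and the standard library exposes no script or block property, so only categories are covered.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general-category codes seen in this report.
CATEGORY_NAMES = {
    "Zs": "Space Separator",
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Pe": "Close Punctuation",
    "Ps": "Open Punctuation",
    "Lu": "Uppercase Letter",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}

# Two station names taken from the report; any strings would work here.
values = ["(구)동홍동주민센터", "1100고지휴게소"]

category_counts = Counter()
for value in values:
    # Normalize to NFC so each Hangul syllable is a single code point.
    for ch in unicodedata.normalize("NFC", value):
        code = unicodedata.category(ch)
        category_counts[CATEGORY_NAMES.get(code, code)] += 1

for name, count in category_counts.most_common():
    print(name, count)
```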

Unique

Unique: 33
Unique (%): 5.9%

Sample

1st row: (구)동홍동주민센터
2nd row: (구)동홍동주민센터
3rd row: (구)동홍동주민센터
4th row: (구)동홍동주민센터
5th row: (구)동홍동주민센터

Value | Count | Frequency (%)
관덕정 | 7 | 1.2%
1100고지휴게소 | 7 | 1.2%
고성리구성산농협 | 7 | 1.2%
구터미널 | 7 | 1.2%
고산동산 | 7 | 1.2%
광양사거리 | 7 | 1.2%
광양 | 7 | 1.2%
구)삼일금고 | 7 | 1.2%
구)중앙파출소 | 7 | 1.2%
김녕환승정류장(김녕초등학교 | 6 | 1.1%
Other values (166) | 495 | 87.8%

Most occurring characters

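Value | Count | Frequency (%)
(space) | 50682 | 94.4%
(not preserved) | 165 | 0.3%
(not preserved) | 139 | 0.3%
(not preserved) | 116 | 0.2%
(not preserved) | 109 | 0.2%
(not preserved) | 79 | 0.1%
(not preserved) | 71 | 0.1%
1 | 69 | 0.1%
(not preserved) | 55 | 0.1%
(not preserved) | 54 | 0.1%
Other values (217) | 2129 | 4.0%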

Most occurring categories

Value | Count | Frequency (%)
Space Separator | 50682 | 94.4%
Other Letter | 2732 | 5.1%
Decimal Number | 127 | 0.2%
Close Punctuation | 42 | 0.1%
Open Punctuation | 42 | 0.1%
Uppercase Letter | 30 | 0.1%
Other Punctuation | 7 | < 0.1%
Dash Punctuation | 6 | < 0.1%
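
The length and category figures for station_name point to whitespace padding: the mean length is about 95 characters, yet Space Separator accounts for 94.4% of all characters and the sample values are only a few characters long. A minimal sketch of measuring and removing such padding follows. It assumes the padding is leading or trailing, which the report does not state, and it uses a hypothetical fixed width of 96 (the reported median length).

```python
# Hypothetical padded values: station names from the report, right-padded
# to an assumed fixed width. The real padding layout is not known.
WIDTH = 96
names = ["(구)동홍동주민센터", "관덕정", "1100고지휴게소"]
padded = [name.ljust(WIDTH) for name in names]

# Share of space characters across all padded values.
total_chars = sum(len(value) for value in padded)
space_chars = sum(value.count(" ") for value in padded)
space_share = space_chars / total_chars

# str.strip() removes only leading and trailing whitespace, so values with an
# internal space (such as the user_type value "장애 동반") would keep it.
stripped = [value.strip() for value in padded]

print(f"space share before stripping: {space_share:.1%}")
print([len(value) for value in stripped])
```

Stripping before profiling would make the length, character, and memory statistics describe the names themselves rather than the padding.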

Most frequent character per category

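Other Letter
Value | Count | Frequency (%)
(not preserved) | 165 | 6.0%
(not preserved) | 139 | 5.1%
(not preserved) | 116 | 4.2%
(not preserved) | 109 | 4.0%
(not preserved) | 79 | 2.9%
(not preserved) | 71 | 2.6%
(not preserved) | 55 | 2.0%
(not preserved) | 54 | 2.0%
(not preserved) | 54 | 2.0%
(not preserved) | 53 | 1.9%
Other values (196) | 1837 | 67.2%

Decimal Number
Value | Count | Frequency (%)
1 | 69 | 54.3%
0 | 22 | 17.3%
2 | 15 | 11.8%
3 | 9 | 7.1%
9 | 5 | 3.9%
6 | 4 | 3.1%
4 | 3 | 2.4%

Uppercase Letter
Value | Count | Frequency (%)
L | 10 | 33.3%
S | 7 | 23.3%
M | 4 | 13.3%
G | 4 | 13.3%
H | 3 | 10.0%
C | 1 | 3.3%
N | 1 | 3.3%

Other Punctuation
Value | Count | Frequency (%)
/ | 3 | 42.9%
. | 3 | 42.9%
, | 1 | 14.3%

Space Separator
Value | Count | Frequency (%)
(space) | 50682 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 42 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 42 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 6 | 100.0%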

Most occurring scripts

Value | Count | Frequency (%)
Common | 50906 | 94.9%
Hangul | 2732 | 5.1%
Latin | 30 | 0.1%

Most frequent character per script

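Hangul
Value | Count | Frequency (%)
(not preserved) | 165 | 6.0%
(not preserved) | 139 | 5.1%
(not preserved) | 116 | 4.2%
(not preserved) | 109 | 4.0%
(not preserved) | 79 | 2.9%
(not preserved) | 71 | 2.6%
(not preserved) | 55 | 2.0%
(not preserved) | 54 | 2.0%
(not preserved) | 54 | 2.0%
(not preserved) | 53 | 1.9%
Other values (196) | 1837 | 67.2%

Common
Value | Count | Frequency (%)
(space) | 50682 | 99.6%
1 | 69 | 0.1%
) | 42 | 0.1%
( | 42 | 0.1%
0 | 22 | < 0.1%
2 | 15 | < 0.1%
3 | 9 | < 0.1%
- | 6 | < 0.1%
9 | 5 | < 0.1%
6 | 4 | < 0.1%
Other values (4) | 10 | < 0.1%

Latin
Value | Count | Frequency (%)
L | 10 | 33.3%
S | 7 | 23.3%
M | 4 | 13.3%
G | 4 | 13.3%
H | 3 | 10.0%
C | 1 | 3.3%
N | 1 | 3.3%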

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 50936 | 94.9%
Hangul | 2732 | 5.1%

Most frequent character per block

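ASCII
Value | Count | Frequency (%)
(space) | 50682 | 99.5%
1 | 69 | 0.1%
) | 42 | 0.1%
( | 42 | 0.1%
0 | 22 | < 0.1%
2 | 15 | < 0.1%
L | 10 | < 0.1%
3 | 9 | < 0.1%
S | 7 | < 0.1%
- | 6 | < 0.1%
Other values (11) | 32 | 0.1%

Hangul
Value | Count | Frequency (%)
(not preserved) | 165 | 6.0%
(not preserved) | 139 | 5.1%
(not preserved) | 116 | 4.2%
(not preserved) | 109 | 4.0%
(not preserved) | 79 | 2.9%
(not preserved) | 71 | 2.6%
(not preserved) | 55 | 2.0%
(not preserved) | 54 | 2.0%
(not preserved) | 54 | 2.0%
(not preserved) | 53 | 1.9%
Other values (196) | 1837 | 67.2%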

user_type
Categorical

Distinct: 7
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.5 KiB
일반: 167
경로: 117
청소년: 107
장애 동반: 60
장애 일반: 60
Other values (2): 53

Length

Max length: 5
Median length: 2
Mean length: 3.0070922
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경로
2nd row: 어린이
3rd row: 일반
4th row: 장애 동반
5th row: 장애 일반

Common Values

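Value | Count | Frequency (%)
일반 (general) | 167 | 29.6%
경로 (senior) | 117 | 20.7%
청소년 (youth) | 107 | 19.0%
장애 동반 (disability, companion) | 60 | 10.6%
장애 일반 (disability, general) | 60 | 10.6%
어린이 (child) | 29 | 5.1%
유공 일반 (national merit, general) | 24 | 4.3%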

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

Words

Value | Count | Frequency (%)
일반 | 251 | 35.5%
장애 | 120 | 16.9%
경로 | 117 | 16.5%
청소년 | 107 | 15.1%
동반 | 60 | 8.5%
어린이 | 29 | 4.1%
유공 | 24 | 3.4%
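
The word counts above are consistent with splitting each user_type value on whitespace and weighting every word by the count of its category. For example, 일반 appears alone (167) and inside 장애 일반 (60) and 유공 일반 (24), giving 251. The sketch below reproduces the table from the Common Values counts. Whether ydata-profiling tokenizes exactly this way is an assumption.

```python
from collections import Counter

# Category counts copied from the Common Values table for user_type.
category_counts = {
    "일반": 167,
    "경로": 117,
    "청소년": 107,
    "장애 동반": 60,
    "장애 일반": 60,
    "어린이": 29,
    "유공 일반": 24,
}

# Split each category on whitespace and add its count to every word it contains.
word_counts = Counter()
for value, count in category_counts.items():
    for word in value.split():
        word_counts[word] += count

total_words = sum(word_counts.values())
shares = {word: round(100 * count / total_words, 1) for word, count in word_counts.items()}

for word, count in word_counts.most_common():
    print(word, count, f"{shares[word]}%")
```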

user_count
Real number (ℝ)

Distinct: 52
Distinct (%): 9.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.3723404
Minimum: 1
Maximum: 226
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 1
Median: 3
Q3: 9
95-th percentile: 37.85
Maximum: 226
Range: 225
Interquartile range (IQR): 8

Descriptive statistics

Standard deviation: 19.782019
Coefficient of variation (CV): 2.1106808
Kurtosis: 55.550008
Mean: 9.3723404
Median Absolute Deviation (MAD): 2
Skewness: 6.349333
Sum: 5286
Variance: 391.32826
Monotonicity: Not monotonic
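
Several of the figures above are derived from one another, which gives a quick consistency check on the report. The sketch below recomputes the mean, coefficient of variation, variance, range, and IQR from the reported sum, count, standard deviation, and quantiles. It uses only values printed in this report, and small differences in the last digits come from the rounding of the published standard deviation.

```python
import math

# Values copied from the report for user_count.
n_observations = 564
total = 5286
std_dev = 19.782019
minimum, maximum = 1, 226
q1, q3 = 1, 9

mean = total / n_observations   # sum divided by number of observations
cv = std_dev / mean             # coefficient of variation
variance = std_dev ** 2         # variance is the squared standard deviation
value_range = maximum - minimum
iqr = q3 - q1

print(f"mean={mean:.7f} cv={cv:.7f} variance={variance:.5f}")
print(f"range={value_range} iqr={iqr}")

# Compare against the reported figures, allowing for rounding.
assert math.isclose(mean, 9.3723404, abs_tol=1e-6)
assert math.isclose(cv, 2.1106808, abs_tol=1e-5)
assert math.isclose(variance, 391.32826, abs_tol=1e-3)
```

The mean (about 9.37) sits well above the median (3), which agrees with the strong positive skewness reported for this variable.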
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
1 | 175 | 31.0%
2 | 84 | 14.9%
3 | 51 | 9.0%
4 | 26 | 4.6%
5 | 25 | 4.4%
6 | 21 | 3.7%
8 | 19 | 3.4%
7 | 17 | 3.0%
10 | 15 | 2.7%
13 | 11 | 2.0%
Other values (42) | 120 | 21.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 175 | 31.0%
2 | 84 | 14.9%
3 | 51 | 9.0%
4 | 26 | 4.6%
5 | 25 | 4.4%
6 | 21 | 3.7%
7 | 17 | 3.0%
8 | 19 | 3.4%
9 | 9 | 1.6%
10 | 15 | 2.7%

Maximum 10 values

Value | Count | Frequency (%)
226 | 1 | 0.2%
220 | 1 | 0.2%
154 | 1 | 0.2%
121 | 1 | 0.2%
100 | 1 | 0.2%
77 | 3 | 0.5%
73 | 2 | 0.4%
67 | 2 | 0.4%
65 | 1 | 0.2%
58 | 2 | 0.4%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

 | user_type | user_count
user_type | 1.000 | 0.201
user_count | 0.201 | 1.000

[Figure: correlation heatmap]

 | user_count | user_type
user_count | 1.000 | 0.112
user_type | 0.112 | 1.000

Missing values

[Figure: nullity bar chart]
A simple visualization of nullity by column.
[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.

Sample

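First rows

Index | base_date | station_name | user_type | user_count
0 | 20190101 | (구)동홍동주민센터 | 경로 | 3
1 | 20190101 | (구)동홍동주민센터 | 어린이 | 2
2 | 20190101 | (구)동홍동주민센터 | 일반 | 14
3 | 20190101 | (구)동홍동주민센터 | 장애 동반 | 1
4 | 20190101 | (구)동홍동주민센터 | 장애 일반 | 1
5 | 20190101 | (구)동홍동주민센터 | 청소년 | 1
6 | 20190101 | (구)삼일금고 | 경로 | 24
7 | 20190101 | (구)삼일금고 | 어린이 | 1
8 | 20190101 | (구)삼일금고 | 유공 일반 | 4
9 | 20190101 | (구)삼일금고 | 일반 | 36

Last rows

Index | base_date | station_name | user_type | user_count
554 | 20190101 | 까끄래기오름 | 경로 | 2
555 | 20190101 | 까끄래기오름 | 일반 | 2
556 | 20190101 | 꽃동산 | 일반 | 4
557 | 20190101 | 낙선동 | 일반 | 1
558 | 20190101 | 낙천리 | 경로 | 6
559 | 20190101 | 낙천리 | 일반 | 1
560 | 20190101 | 난산리 | 경로 | 9
561 | 20190101 | 난산리 | 일반 | 2
562 | 20190101 | 난산리노인복지회관 | 경로 | 2
563 | 20190101 | 난산리노인복지회관 | 장애 동반 | 1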