Overview

Dataset statistics

Number of variables: 5
Number of observations: 39
Missing cells: 7
Missing cells (%): 3.6%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.7 KiB
Average record size in memory: 43.4 B
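
The missing-cell percentage follows directly from the counts above: 7 missing cells out of 39 × 5 = 195 cells. A minimal check, with illustrative variable names that are not part of the report:

```python
# Counts copied from the dataset statistics in this report.
n_variables = 5
n_observations = 39
missing_cells = 7

total_cells = n_variables * n_observations        # every cell in the table
missing_pct = 100 * missing_cells / total_cells   # share of cells that are empty

print(f"{missing_pct:.1f}%")
```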

Variable types

Text: 3
Categorical: 1
DateTime: 1

Dataset

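Description: Status data on taxi stands in Gimhae-si, Gyeongsangnam-do, consisting of fields such as the taxi stand name (택시승차대명), road-name address (도로명주소), lot-number address (지번주소), and stand type (승차대유형).
Author: Gimhae-si, Gyeongsangnam-do
URL: https://www.data.go.kr/data/15098640/fileData.do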

Alerts

데이터기준일자 (data reference date) has constant value "2023-09-20" [Constant]
도로명주소 (road-name address) has 7 (17.9%) missing values [Missing]
택시승차대명 (taxi stand name) has unique values [Unique]

Reproduction

Analysis started: 2023-12-11 22:58:43.130026
Analysis finished: 2023-12-11 22:58:43.484243
Duration: 0.35 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
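
The reported duration is the difference between the two timestamps above. A small sketch that recomputes it with the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction block of this report.
started = datetime.fromisoformat("2023-12-11 22:58:43.130026")
finished = datetime.fromisoformat("2023-12-11 22:58:43.484243")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```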

Variables

택시승차대명 (taxi stand name)
Text

Alert: UNIQUE

Distinct: 39
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

Length

Max length: 28
Median length: 24
Mean length: 17.974359
Min length: 10

Characters and Unicode

Total characters: 701
Distinct characters: 146
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
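
As an illustration of those character properties, the sketch below counts the Unicode general category of each character in one stand name from the sample rows, using Python's standard `unicodedata` module. It is not the report's own implementation, and it covers categories only, not scripts or blocks.

```python
import unicodedata
from collections import Counter

# One value of 택시승차대명 taken from the sample rows of this report.
name = "울트라상가 앞(갑오부영 4단지) 택시승강장"

# General category codes: Lo = Other Letter, Zs = Space Separator,
# Nd = Decimal Number, Ps = Open Punctuation, Pe = Close Punctuation.
categories = Counter(unicodedata.category(ch) for ch in name)

for code, count in categories.most_common():
    print(code, count)
```

Hangul syllables fall under Other Letter, which is why that category dominates the tables for the text columns.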

Unique

Unique: 39
Unique (%): 100.0%

Sample

1st row: 지내동 동원아파트 앞 파리바게트 건너편 택시승강장
2nd row: 삼정동 메가마트 택시승강장
3rd row: 부원동 롯데마트 정문 택시승강장
4th row: 부원역 아이스퀘어몰 택시승강장
5th row: 부원 센텀그린코아 택시승강장

Value | Count | Frequency (%)
택시승강장 | 39 | 25.7%
(value not preserved) | 19 | 12.5%
외동 | 5 | 3.3%
맞은편 | 4 | 2.6%
사이 | 3 | 2.0%
1 | 3 | 2.0%
2 | 3 | 2.0%
장유 | 2 | 1.3%
sw타워 | 2 | 1.3%
선천지구 | 2 | 1.3%
Other values (59) | 70 | 46.1%
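
A word-frequency table like the one above can be approximated by splitting each value on whitespace and counting tokens. The report does not state its tokenizer, so whitespace splitting after lowercasing is an assumption here (the lowercase entry sw타워 is consistent with it). The sketch runs on the five sample rows only, so its counts are smaller than the report's.

```python
from collections import Counter

# The five sample rows of 택시승차대명 shown in this report.
sample_names = [
    "지내동 동원아파트 앞 파리바게트 건너편 택시승강장",
    "삼정동 메가마트 택시승강장",
    "부원동 롯데마트 정문 택시승강장",
    "부원역 아이스퀘어몰 택시승강장",
    "부원 센텀그린코아 택시승강장",
]

# Assumed tokenization: lowercase, then split on whitespace.
words = Counter(w for name in sample_names for w in name.lower().split())
total_words = sum(words.values())

for word, count in words.most_common(3):
    print(word, count, f"{100 * count / total_words:.1f}%")
```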

Most occurring characters

ValueCountFrequency (%)
113
 
16.1%
43
 
6.1%
42
 
6.0%
39
 
5.6%
39
 
5.6%
39
 
5.6%
20
 
2.9%
17
 
2.4%
16
 
2.3%
15
 
2.1%
Other values (136) 318
45.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 563 | 80.3%
Space Separator | 113 | 16.1%
Decimal Number | 13 | 1.9%
Uppercase Letter | 7 | 1.0%
Other Punctuation | 3 | 0.4%
Close Punctuation | 1 | 0.1%
Open Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
 
7.6%
42
 
7.5%
39
 
6.9%
39
 
6.9%
39
 
6.9%
20
 
3.6%
17
 
3.0%
16
 
2.8%
15
 
2.7%
12
 
2.1%
Other values (123) 281
49.9%
Uppercase Letter
ValueCountFrequency (%)
W 2
28.6%
S 2
28.6%
T 1
14.3%
X 1
14.3%
K 1
14.3%
Decimal Number
ValueCountFrequency (%)
2 6
46.2%
1 5
38.5%
6 1
 
7.7%
4 1
 
7.7%
Space Separator
ValueCountFrequency (%)
113
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 563 | 80.3%
Common | 131 | 18.7%
Latin | 7 | 1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
 
7.6%
42
 
7.5%
39
 
6.9%
39
 
6.9%
39
 
6.9%
20
 
3.6%
17
 
3.0%
16
 
2.8%
15
 
2.7%
12
 
2.1%
Other values (123) 281
49.9%
Common
ValueCountFrequency (%)
113
86.3%
2 6
 
4.6%
1 5
 
3.8%
, 3
 
2.3%
6 1
 
0.8%
) 1
 
0.8%
4 1
 
0.8%
( 1
 
0.8%
Latin
ValueCountFrequency (%)
W 2
28.6%
S 2
28.6%
T 1
14.3%
X 1
14.3%
K 1
14.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 563 | 80.3%
ASCII | 138 | 19.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
113
81.9%
2 6
 
4.3%
1 5
 
3.6%
, 3
 
2.2%
W 2
 
1.4%
S 2
 
1.4%
T 1
 
0.7%
X 1
 
0.7%
K 1
 
0.7%
6 1
 
0.7%
Other values (3) 3
 
2.2%
Hangul
ValueCountFrequency (%)
43
 
7.6%
42
 
7.5%
39
 
6.9%
39
 
6.9%
39
 
6.9%
20
 
3.6%
17
 
3.0%
16
 
2.8%
15
 
2.7%
12
 
2.1%
Other values (123) 281
49.9%

도로명주소 (road-name address)
Text

Alert: MISSING

Distinct: 30
Distinct (%): 93.8%
Missing: 7
Missing (%): 17.9%
Memory size: 444.0 B

Length

Max length: 36
Median length: 25.5
Mean length: 24.1875
Min length: 17

Characters and Unicode

Total characters: 774
Distinct characters: 91
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 28
Unique (%): 87.5%
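
"Distinct" counts the different values present, while "Unique" counts the values that occur exactly once. The percentages are consistent with both being taken over the 32 non-missing rows (30/32 = 93.75% and 28/32 = 87.5%). The sketch below shows the distinction on a few addresses from the sample rows plus one missing entry. It is a plain-Python illustration, not the report's code.

```python
from collections import Counter

# Four 도로명주소 values from the sample rows and one missing entry (None).
addresses = [
    "경상남도 김해시 함박로 120 (외동)",
    "경상남도 김해시 함박로 120 (외동)",
    "경상남도 김해시 우암로 36 (외동, 뜨란채아파트)",
    None,
    "경상남도 김해시 계동로 23 (관동동)",
]

present = [a for a in addresses if a is not None]   # drop missing values first
counts = Counter(present)

distinct = len(counts)                               # different values seen
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

print(distinct, f"{100 * distinct / len(present):.1f}%")
print(unique, f"{100 * unique / len(present):.1f}%")
```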

Sample

1st row: 경상남도 김해시 분성로727번길 7 (지내동)
2nd row: 경상남도 김해시 김해대로2492번길 20 (삼정동)
3rd row: 경상남도 김해시 김해대로 2330 (부원동)
4th row: 경상남도 김해시 김해대로 2342 (부원동)
5th row: 경상남도 김해시 김해대로 2349 (부원동, 부원역그린코아더센텀)

Value | Count | Frequency (%)
경상남도 | 32 | 19.6%
김해시 | 32 | 19.6%
김해대로 | 7 | 4.3%
외동 | 6 | 3.7%
내동 | 4 | 2.5%
부원동 | 4 | 2.5%
7 | 3 | 1.8%
67 | 2 | 1.2%
삼정동 | 2 | 1.2%
내외로 | 2 | 1.2%
Other values (64) | 69 | 42.3%

Most occurring characters

ValueCountFrequency (%)
131
 
16.9%
44
 
5.7%
43
 
5.6%
33
 
4.3%
33
 
4.3%
32
 
4.1%
32
 
4.1%
32
 
4.1%
32
 
4.1%
31
 
4.0%
Other values (81) 331
42.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 469 | 60.6%
Space Separator | 131 | 16.9%
Decimal Number | 112 | 14.5%
Open Punctuation | 28 | 3.6%
Close Punctuation | 28 | 3.6%
Other Punctuation | 4 | 0.5%
Dash Punctuation | 2 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
44
 
9.4%
43
 
9.2%
33
 
7.0%
33
 
7.0%
32
 
6.8%
32
 
6.8%
32
 
6.8%
32
 
6.8%
31
 
6.6%
15
 
3.2%
Other values (66) 142
30.3%
Decimal Number
ValueCountFrequency (%)
2 27
24.1%
1 17
15.2%
3 15
13.4%
0 12
10.7%
7 12
10.7%
6 9
 
8.0%
5 7
 
6.2%
4 6
 
5.4%
9 6
 
5.4%
8 1
 
0.9%
Space Separator
ValueCountFrequency (%)
131
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 469 | 60.6%
Common | 305 | 39.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
44
 
9.4%
43
 
9.2%
33
 
7.0%
33
 
7.0%
32
 
6.8%
32
 
6.8%
32
 
6.8%
32
 
6.8%
31
 
6.6%
15
 
3.2%
Other values (66) 142
30.3%
Common
ValueCountFrequency (%)
131
43.0%
( 28
 
9.2%
) 28
 
9.2%
2 27
 
8.9%
1 17
 
5.6%
3 15
 
4.9%
0 12
 
3.9%
7 12
 
3.9%
6 9
 
3.0%
5 7
 
2.3%
Other values (5) 19
 
6.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 469 | 60.6%
ASCII | 305 | 39.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
131
43.0%
( 28
 
9.2%
) 28
 
9.2%
2 27
 
8.9%
1 17
 
5.6%
3 15
 
4.9%
0 12
 
3.9%
7 12
 
3.9%
6 9
 
3.0%
5 7
 
2.3%
Other values (5) 19
 
6.2%
Hangul
ValueCountFrequency (%)
44
 
9.4%
43
 
9.2%
33
 
7.0%
33
 
7.0%
32
 
6.8%
32
 
6.8%
32
 
6.8%
32
 
6.8%
31
 
6.6%
15
 
3.2%
Other values (66) 142
30.3%

지번주소 (lot-number address)
Text

Distinct: 36
Distinct (%): 92.3%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

Length

Max length: 25
Median length: 24
Mean length: 18.538462
Min length: 16

Characters and Unicode

Total characters: 723
Distinct characters: 53
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 33
Unique (%): 84.6%

Sample

1st row: 경상남도 김해시 지내동 313-9
2nd row: 경상남도 김해시 삼정동 378
3rd row: 경상남도 김해시 부원동 1041
4th row: 경상남도 김해시 부원동 1043
5th row: 경상남도 김해시 부원동 606-3

Value | Count | Frequency (%)
경상남도 | 39 | 23.8%
김해시 | 39 | 23.8%
외동 | 8 | 4.9%
부원동 | 4 | 2.4%
내동 | 4 | 2.4%
삼계동 | 3 | 1.8%
진영읍 | 3 | 1.8%
128-1 | 2 | 1.2%
진영리 | 2 | 1.2%
주촌면 | 2 | 1.2%
Other values (52) | 58 | 35.4%

Most occurring characters

ValueCountFrequency (%)
127
17.6%
1 51
 
7.1%
39
 
5.4%
39
 
5.4%
39
 
5.4%
39
 
5.4%
39
 
5.4%
39
 
5.4%
39
 
5.4%
35
 
4.8%
Other values (43) 237
32.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 394 | 54.5%
Decimal Number | 172 | 23.8%
Space Separator | 127 | 17.6%
Dash Punctuation | 30 | 4.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
9.9%
39
9.9%
39
9.9%
39
9.9%
39
9.9%
39
9.9%
39
9.9%
35
8.9%
8
 
2.0%
6
 
1.5%
Other values (31) 72
18.3%
Decimal Number
ValueCountFrequency (%)
1 51
29.7%
4 21
12.2%
2 19
 
11.0%
3 18
 
10.5%
6 16
 
9.3%
8 14
 
8.1%
0 13
 
7.6%
7 8
 
4.7%
5 6
 
3.5%
9 6
 
3.5%
Space Separator
ValueCountFrequency (%)
127
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 394 | 54.5%
Common | 329 | 45.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
9.9%
39
9.9%
39
9.9%
39
9.9%
39
9.9%
39
9.9%
39
9.9%
35
8.9%
8
 
2.0%
6
 
1.5%
Other values (31) 72
18.3%
Common
ValueCountFrequency (%)
127
38.6%
1 51
15.5%
- 30
 
9.1%
4 21
 
6.4%
2 19
 
5.8%
3 18
 
5.5%
6 16
 
4.9%
8 14
 
4.3%
0 13
 
4.0%
7 8
 
2.4%
Other values (2) 12
 
3.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 394 | 54.5%
ASCII | 329 | 45.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
127
38.6%
1 51
15.5%
- 30
 
9.1%
4 21
 
6.4%
2 19
 
5.8%
3 18
 
5.5%
6 16
 
4.9%
8 14
 
4.3%
0 13
 
4.0%
7 8
 
2.4%
Other values (2) 12
 
3.6%
Hangul
ValueCountFrequency (%)
39
9.9%
39
9.9%
39
9.9%
39
9.9%
39
9.9%
39
9.9%
39
9.9%
35
8.9%
8
 
2.0%
6
 
1.5%
Other values (31) 72
18.3%

승차대유형 (stand type)
Categorical

Distinct: 2
Distinct (%): 5.1%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B
승하차대: 21
표지판: 18

Length

Max length: 4
Median length: 4
Mean length: 3.5384615
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 표지판
2nd row: 표지판
3rd row: 표지판
4th row: 승하차대
5th row: 승하차대

Common Values

Value | Count | Frequency (%)
승하차대 | 21 | 53.8%
표지판 | 18 | 46.2%
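
The shares above, and the length statistics reported for 승차대유형, can be recomputed from the two counts alone, because each value has a fixed length (4 and 3 characters). A minimal sketch with illustrative variable names:

```python
import statistics

# Counts copied from the Common Values table of 승차대유형.
counts = {"승하차대": 21, "표지판": 18}
total = sum(counts.values())

# Share of each category among all observations.
shares = {value: 100 * n / total for value, n in counts.items()}

# One length per observation: 21 values of length 4 and 18 of length 3.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]
mean_length = statistics.mean(lengths)
median_length = statistics.median(lengths)

for value, share in shares.items():
    print(value, f"{share:.1f}%")
print(f"{mean_length:.7f}", median_length)
```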

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot; 승하차대 21 (53.8%), 표지판 18 (46.2%)]

데이터기준일자 (data reference date)
Date

Alert: CONSTANT

Distinct: 1
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B
Minimum: 2023-09-20 00:00:00
Maximum: 2023-09-20 00:00:00

[Figure: histogram with fixed size bins (bins=1)]

Correlations

[Figure: correlation heatmap]

 | 택시승차대명 | 도로명주소 | 지번주소 | 승차대유형
택시승차대명 | 1.000 | 1.000 | 1.000 | 1.000
도로명주소 | 1.000 | 1.000 | 1.000 | 0.710
지번주소 | 1.000 | 1.000 | 1.000 | 0.000
승차대유형 | 1.000 | 0.710 | 0.000 | 1.000
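
The report does not say which association measure produced this matrix. As an assumption, the sketch below uses plain (uncorrected) Cramér's V, a common choice for pairs of categorical columns. The toy data and the `cramers_v` helper are hypothetical, so the sketch is not expected to reproduce the values above.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V between two equally long label sequences."""
    n = len(xs)
    rows, cols = Counter(xs), Counter(ys)
    observed = Counter(zip(xs, ys))
    chi2 = 0.0
    for x, row_total in rows.items():
        for y, col_total in cols.items():
            expected = row_total * col_total / n   # count expected under independence
            chi2 += (observed[(x, y)] - expected) ** 2 / expected
    k = min(len(rows), len(cols)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Hypothetical toy data, not taken from the dataset.
stand_type = ["표지판", "표지판", "승하차대", "승하차대"]
district_same = ["A", "A", "B", "B"]    # fully determined by stand_type
district_mixed = ["A", "B", "A", "B"]   # independent of stand_type
all_distinct = ["w", "x", "y", "z"]     # every row has its own value

print(cramers_v(stand_type, district_same))
print(cramers_v(stand_type, district_mixed))
print(cramers_v(stand_type, all_distinct))
```

Under this plain formula, a column whose values are all distinct reaches 1 against any non-constant column. Entries of 1.000 involving near-unique text columns such as 택시승차대명 therefore say little about real association.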

Missing values

[Figure: count plot] A simple visualization of nullity by column.
[Figure: matrix plot] The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.

Sample

First rows

 | 택시승차대명 | 도로명주소 | 지번주소 | 승차대유형 | 데이터기준일자
0 | 지내동 동원아파트 앞 파리바게트 건너편 택시승강장 | 경상남도 김해시 분성로727번길 7 (지내동) | 경상남도 김해시 지내동 313-9 | 표지판 | 2023-09-20
1 | 삼정동 메가마트 택시승강장 | 경상남도 김해시 김해대로2492번길 20 (삼정동) | 경상남도 김해시 삼정동 378 | 표지판 | 2023-09-20
2 | 부원동 롯데마트 정문 택시승강장 | 경상남도 김해시 김해대로 2330 (부원동) | 경상남도 김해시 부원동 1041 | 표지판 | 2023-09-20
3 | 부원역 아이스퀘어몰 택시승강장 | 경상남도 김해시 김해대로 2342 (부원동) | 경상남도 김해시 부원동 1043 | 승하차대 | 2023-09-20
4 | 부원 센텀그린코아 택시승강장 | 경상남도 김해시 김해대로 2349 (부원동, 부원역그린코아더센텀) | 경상남도 김해시 부원동 606-3 | 승하차대 | 2023-09-20
5 | 김해시청역 택시승강장 | 경상남도 김해시 김해대로 2409 (부원동) | 경상남도 김해시 부원동 625-1 | 승하차대 | 2023-09-20
6 | 인제대역 앞 택시승강장 | 경상남도 김해시 김해대로 2521 (삼정동) | 경상남도 김해시 삼정동 665-9 | 표지판 | 2023-09-20
7 | 외동 한국1차아파트, 덕산아파트 사이 택시승강장 1 | 경상남도 김해시 함박로 120 (외동) | 경상남도 김해시 외동 1261-9 | 승하차대 | 2023-09-20
8 | 외동 한국1차아파트, 덕산아파트 사이 택시승강장 2 | 경상남도 김해시 함박로 120 (외동) | 경상남도 김해시 외동 1261-9 | 표지판 | 2023-09-20
9 | 외동 뜨란채 아파트 앞 택시승강장 | 경상남도 김해시 우암로 36 (외동, 뜨란채아파트) | 경상남도 김해시 외동 1250-1 | 승하차대 | 2023-09-20

Last rows

 | 택시승차대명 | 도로명주소 | 지번주소 | 승차대유형 | 데이터기준일자
29 | 장유 롯데마트 택시승강장 | 경상남도 김해시 번화1로56번길 15 (대청동) | 경상남도 김해시 대청동 300 | 승하차대 | 2023-09-20
30 | 울트라상가 앞(갑오부영 4단지) 택시승강장 | 경상남도 김해시 계동로102번길 27 (대청동) | 경상남도 김해시 대청동 322-2 | 승하차대 | 2023-09-20
31 | 팔판마을 관동우체국 맞은편 택시승강장 | 경상남도 김해시 계동로 23 (관동동) | 경상남도 김해시 관동동 448-4 | 승하차대 | 2023-09-20
32 | 율하 다이소 앞 택시승강장 | <NA> | 경상남도 김해시 율하동 1348-3 | 승하차대 | 2023-09-20
33 | 율하2지구 중심상가 부근 택시승강장 | <NA> | 경상남도 김해시 장유동 824 - 1 | 승하차대 | 2023-09-20
34 | 율하2지구 시티프라디움 앞 택시승강장 | 경상남도 김해시 율하5로 11 (장유동, 율하시티프라디움) | 경상남도 김해시 장유동 870 | 승하차대 | 2023-09-20
35 | 경남부산경마공원 앞 택시승강장 | 경상남도 김해시 가락대로 929-1 (수가동) | 경상남도 김해시 수가동 1333 | 표지판 | 2023-09-20
36 | KTX진영역 앞 택시승강장 | <NA> | 경상남도 김해시 진영읍 설창리 산 15-10 | 승하차대 | 2023-09-20
37 | 진영코아루아파트 맞은편 택시승강장 | 경상남도 김해시 진영읍 김해대로361번길 16 | 경상남도 김해시 진영읍 진영리 1612-4 | 표지판 | 2023-09-20
38 | 진영신도시 우리은행 부근 택시승강장 | 경상남도 김해시 진영읍 김해대로365번길 6-4 | 경상남도 김해시 진영읍 진영리 1614-7 | 표지판 | 2023-09-20