Overview

Dataset statistics

Number of variables5
Number of observations68
Missing cells12
Missing cells (%)3.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.8 KiB
Average record size in memory41.9 B

Variable types

Categorical2
Text3

Alerts

철도운영기관명 has constant value ""Constant
지번주소 has 12 (17.6%) missing valuesMissing
도로명주소 has unique valuesUnique

Reproduction

Analysis started2024-04-16 12:17:57.585478
Analysis finished2024-04-16 12:17:59.920876
Duration2.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
인천교통공사
68 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천교통공사
2nd row인천교통공사
3rd row인천교통공사
4th row인천교통공사
5th row인천교통공사

Common Values

ValueCountFrequency (%)
인천교통공사 68
100.0%

Length

2024-04-16T21:17:59.986246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T21:18:00.104813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인천교통공사 68
100.0%

선명
Categorical

Distinct3
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size676.0 B
인천1호선
30 
인천2호선
27 
7호선
11 

Length

Max length5
Median length5
Mean length4.6764706
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천1호선
2nd row인천1호선
3rd row인천1호선
4th row인천1호선
5th row인천1호선

Common Values

ValueCountFrequency (%)
인천1호선 30
44.1%
인천2호선 27
39.7%
7호선 11
 
16.2%

Length

2024-04-16T21:18:00.219563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T21:18:00.329868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인천1호선 30
44.1%
인천2호선 27
39.7%
7호선 11
 
16.2%

역명
Text

Distinct66
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size676.0 B
2024-04-16T21:18:00.564006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length3.7352941
Min length2

Characters and Unicode

Total characters254
Distinct characters118
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)94.1%

Sample

1st row간석오거리
2nd row갈산
3rd row경인교대입구
4th row계산
5th row계양
ValueCountFrequency (%)
부평구청 2
 
2.9%
인천시청 2
 
2.9%
석남(거북시장 1
 
1.5%
간석오거리 1
 
1.5%
서부여성회관 1
 
1.5%
시민공원 1
 
1.5%
석천사거리 1
 
1.5%
석바위시장 1
 
1.5%
검암 1
 
1.5%
서구청 1
 
1.5%
Other values (56) 56
82.4%
2024-04-16T21:18:00.965792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
3.9%
10
 
3.9%
8
 
3.1%
8
 
3.1%
8
 
3.1%
7
 
2.8%
7
 
2.8%
7
 
2.8%
5
 
2.0%
5
 
2.0%
Other values (108) 179
70.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 252
99.2%
Close Punctuation 1
 
0.4%
Open Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
4.0%
10
 
4.0%
8
 
3.2%
8
 
3.2%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
5
 
2.0%
5
 
2.0%
Other values (106) 177
70.2%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 252
99.2%
Common 2
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
4.0%
10
 
4.0%
8
 
3.2%
8
 
3.2%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
5
 
2.0%
5
 
2.0%
Other values (106) 177
70.2%
Common
ValueCountFrequency (%)
) 1
50.0%
( 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 252
99.2%
ASCII 2
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10
 
4.0%
10
 
4.0%
8
 
3.2%
8
 
3.2%
8
 
3.2%
7
 
2.8%
7
 
2.8%
7
 
2.8%
5
 
2.0%
5
 
2.0%
Other values (106) 177
70.2%
ASCII
ValueCountFrequency (%)
) 1
50.0%
( 1
50.0%

지번주소
Text

MISSING 

Distinct53
Distinct (%)94.6%
Missing12
Missing (%)17.6%
Memory size676.0 B
2024-04-16T21:18:01.224819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length18.053571
Min length16

Characters and Unicode

Total characters1011
Distinct characters75
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)89.3%

Sample

1st row인천광역시 남동구 간석동 239-3
2nd row인천광역시 부편구 청천동 112
3rd row인천광역시 계양구 계산동 1034
4th row인청광역시 계양구 계산동 1014
5th row인천광역시 계양구 귤현동 451-264
ValueCountFrequency (%)
인천광역시 54
24.2%
서구 17
 
7.6%
연수구 12
 
5.4%
남동구 9
 
4.0%
계양구 7
 
3.1%
송도동 6
 
2.7%
부평구 5
 
2.2%
미추홀구 4
 
1.8%
선학동 3
 
1.3%
부평동 3
 
1.3%
Other values (81) 103
46.2%
2024-04-16T21:18:01.600163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
167
16.5%
67
 
6.6%
59
 
5.8%
56
 
5.5%
56
 
5.5%
56
 
5.5%
56
 
5.5%
56
 
5.5%
1 41
 
4.1%
- 29
 
2.9%
Other values (65) 368
36.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 618
61.1%
Decimal Number 197
 
19.5%
Space Separator 167
 
16.5%
Dash Punctuation 29
 
2.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
67
10.8%
59
 
9.5%
56
 
9.1%
56
 
9.1%
56
 
9.1%
56
 
9.1%
56
 
9.1%
17
 
2.8%
16
 
2.6%
15
 
2.4%
Other values (53) 164
26.5%
Decimal Number
ValueCountFrequency (%)
1 41
20.8%
4 25
12.7%
2 23
11.7%
7 19
9.6%
9 17
8.6%
3 16
 
8.1%
0 15
 
7.6%
6 15
 
7.6%
8 14
 
7.1%
5 12
 
6.1%
Space Separator
ValueCountFrequency (%)
167
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 618
61.1%
Common 393
38.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
67
10.8%
59
 
9.5%
56
 
9.1%
56
 
9.1%
56
 
9.1%
56
 
9.1%
56
 
9.1%
17
 
2.8%
16
 
2.6%
15
 
2.4%
Other values (53) 164
26.5%
Common
ValueCountFrequency (%)
167
42.5%
1 41
 
10.4%
- 29
 
7.4%
4 25
 
6.4%
2 23
 
5.9%
7 19
 
4.8%
9 17
 
4.3%
3 16
 
4.1%
0 15
 
3.8%
6 15
 
3.8%
Other values (2) 26
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 618
61.1%
ASCII 393
38.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
167
42.5%
1 41
 
10.4%
- 29
 
7.4%
4 25
 
6.4%
2 23
 
5.9%
7 19
 
4.8%
9 17
 
4.3%
3 16
 
4.1%
0 15
 
3.8%
6 15
 
3.8%
Other values (2) 26
 
6.6%
Hangul
ValueCountFrequency (%)
67
10.8%
59
 
9.5%
56
 
9.1%
56
 
9.1%
56
 
9.1%
56
 
9.1%
56
 
9.1%
17
 
2.8%
16
 
2.6%
15
 
2.4%
Other values (53) 164
26.5%

도로명주소
Text

UNIQUE 

Distinct68
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size676.0 B
2024-04-16T21:18:01.893853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length18.235294
Min length15

Characters and Unicode

Total characters1240
Distinct characters72
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)100.0%

Sample

1st row인천광역시 남동구 경인로 지하 642
2nd row인천광역시 부평구 부평대로 지하 286
3rd row인천광역시 계양구 계양대로 지하 162
4th row인천광역시 계양구 경명대로 지하 1089
5th row인천광역시 계양구 다남로 24
ValueCountFrequency (%)
인천광역시 57
18.8%
지하 31
 
10.2%
서구 17
 
5.6%
연수구 13
 
4.3%
경기도 11
 
3.6%
길주로 11
 
3.6%
부천시 11
 
3.6%
남동구 10
 
3.3%
계양구 7
 
2.3%
부평구 6
 
2.0%
Other values (95) 129
42.6%
2024-04-16T21:18:02.315203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
235
19.0%
72
 
5.8%
68
 
5.5%
67
 
5.4%
67
 
5.4%
62
 
5.0%
58
 
4.7%
57
 
4.6%
32
 
2.6%
31
 
2.5%
Other values (62) 491
39.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 812
65.5%
Space Separator 235
 
19.0%
Decimal Number 192
 
15.5%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
 
8.9%
68
 
8.4%
67
 
8.3%
67
 
8.3%
62
 
7.6%
58
 
7.1%
57
 
7.0%
32
 
3.9%
31
 
3.8%
21
 
2.6%
Other values (50) 277
34.1%
Decimal Number
ValueCountFrequency (%)
2 27
14.1%
1 24
12.5%
6 21
10.9%
3 20
10.4%
8 18
9.4%
7 18
9.4%
4 17
8.9%
0 16
8.3%
9 16
8.3%
5 15
7.8%
Space Separator
ValueCountFrequency (%)
235
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 812
65.5%
Common 428
34.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
 
8.9%
68
 
8.4%
67
 
8.3%
67
 
8.3%
62
 
7.6%
58
 
7.1%
57
 
7.0%
32
 
3.9%
31
 
3.8%
21
 
2.6%
Other values (50) 277
34.1%
Common
ValueCountFrequency (%)
235
54.9%
2 27
 
6.3%
1 24
 
5.6%
6 21
 
4.9%
3 20
 
4.7%
8 18
 
4.2%
7 18
 
4.2%
4 17
 
4.0%
0 16
 
3.7%
9 16
 
3.7%
Other values (2) 16
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 812
65.5%
ASCII 428
34.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
235
54.9%
2 27
 
6.3%
1 24
 
5.6%
6 21
 
4.9%
3 20
 
4.7%
8 18
 
4.2%
7 18
 
4.2%
4 17
 
4.0%
0 16
 
3.7%
9 16
 
3.7%
Other values (2) 16
 
3.7%
Hangul
ValueCountFrequency (%)
72
 
8.9%
68
 
8.4%
67
 
8.3%
67
 
8.3%
62
 
7.6%
58
 
7.1%
57
 
7.0%
32
 
3.9%
31
 
3.8%
21
 
2.6%
Other values (50) 277
34.1%

Correlations

2024-04-16T21:18:02.414704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
선명역명지번주소도로명주소
선명1.0000.0001.0001.000
역명0.0001.0000.9901.000
지번주소1.0000.9901.0001.000
도로명주소1.0001.0001.0001.000

Missing values

2024-04-16T21:17:59.740189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-16T21:17:59.879081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명지번주소도로명주소
0인천교통공사인천1호선간석오거리인천광역시 남동구 간석동 239-3인천광역시 남동구 경인로 지하 642
1인천교통공사인천1호선갈산인천광역시 부편구 청천동 112인천광역시 부평구 부평대로 지하 286
2인천교통공사인천1호선경인교대입구인천광역시 계양구 계산동 1034인천광역시 계양구 계양대로 지하 162
3인천교통공사인천1호선계산인청광역시 계양구 계산동 1014인천광역시 계양구 경명대로 지하 1089
4인천교통공사인천1호선계양인천광역시 계양구 귤현동 451-264인천광역시 계양구 다남로 24
5인천교통공사인천1호선국제업무지구인천광역시 연수구 송도동 78인천광역시 연수구 인천타워대로 지하 334
6인천교통공사인천1호선귤현인천광역시 계양구 귤현동 334-1인천광역시 계양구 장제로 1136
7인천교통공사인천1호선동막인천광역시 연수구 동춘동 927-1인천광역시 연수구 경원대로 80
8인천교통공사인천1호선동수인천광역시 부평구 부평동 686인천광역시 부평구 경인로 지하 877
9인천교통공사인천1호선동춘인천광역시 연수구 동춘동 926-11인천광역시 연수구 경원대로 지하 180
철도운영기관명선명역명지번주소도로명주소
58인천교통공사7호선까치울<NA>경기도 부천시 길주로 626
59인천교통공사7호선부천시청<NA>경기도 부천시 길주로 202
60인천교통공사7호선부천종합운동장<NA>경기도 부천시 길주로 502
61인천교통공사7호선부평구청<NA>경기도 부천시 길주로 189
62인천교통공사7호선산곡<NA>경기도 부천시 길주로 379
63인천교통공사7호선삼산체육관<NA>경기도 부천시 길주로 713
64인천교통공사7호선상동<NA>경기도 부천시 길주로 104
65인천교통공사7호선석남(거북시장)<NA>경기도 부천시 길주로 120
66인천교통공사7호선신중동<NA>경기도 부천시 길주로 314
67인천교통공사7호선춘의<NA>경기도 부천시 길주로 406