Overview

Dataset statistics

Number of variables6
Number of observations446
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory21.9 KiB
Average record size in memory50.3 B

Variable types

Numeric1
Categorical3
Text2

Dataset

Description부산교통공사 114개 역사의 엘리베이터 정보입니다. 호선, 역명, (지상과 연결되는 엘리베이터의 경우 인근)출입구번호, (엘리베이터에서 가장 가까운)상세위치, 운행구간 정보가 포함되어 있습니다.
URLhttps://www.data.go.kr/data/15119875/fileData.do

Alerts

연번 is highly overall correlated with 호선High correlation
호선 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:34:07.570960
Analysis finished2023-12-12 17:34:08.319132
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct446
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean223.5
Minimum1
Maximum446
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.0 KiB
2023-12-13T02:34:08.416873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile23.25
Q1112.25
median223.5
Q3334.75
95-th percentile423.75
Maximum446
Range445
Interquartile range (IQR)222.5

Descriptive statistics

Standard deviation128.89337
Coefficient of variation (CV)0.5767041
Kurtosis-1.2
Mean223.5
Median Absolute Deviation (MAD)111.5
Skewness0
Sum99681
Variance16613.5
MonotonicityStrictly increasing
2023-12-13T02:34:08.579612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
295 1
 
0.2%
306 1
 
0.2%
305 1
 
0.2%
304 1
 
0.2%
303 1
 
0.2%
302 1
 
0.2%
301 1
 
0.2%
300 1
 
0.2%
299 1
 
0.2%
Other values (436) 436
97.8%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
446 1
0.2%
445 1
0.2%
444 1
0.2%
443 1
0.2%
442 1
0.2%
441 1
0.2%
440 1
0.2%
439 1
0.2%
438 1
0.2%
437 1
0.2%

호선
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2
172 
1
159 
3
68 
4
47 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
2 172
38.6%
1 159
35.7%
3 68
 
15.2%
4 47
 
10.5%

Length

2023-12-13T02:34:08.720802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:34:08.846279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 172
38.6%
1 159
35.7%
3 68
 
15.2%
4 47
 
10.5%

역명
Text

Distinct110
Distinct (%)24.7%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023-12-13T02:34:09.188453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length4.6300448
Min length2

Characters and Unicode

Total characters2065
Distinct characters172
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)0.7%

Sample

1st row노포(종합버스터미널)
2nd row노포(종합버스터미널)
3rd row범어사
4th row범어사
5th row범어사
ValueCountFrequency (%)
동매 10
 
2.2%
미남 7
 
1.6%
덕천(부산과기대 7
 
1.6%
만덕 7
 
1.6%
벡스코(시립미술관 7
 
1.6%
다대포해수욕장 6
 
1.3%
센텀시티(bexco·신세계 6
 
1.3%
중동 6
 
1.3%
망미(병무청 6
 
1.3%
배산 6
 
1.3%
Other values (100) 378
84.8%
2023-12-13T02:34:09.688997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 124
 
6.0%
( 124
 
6.0%
97
 
4.7%
90
 
4.4%
63
 
3.1%
58
 
2.8%
40
 
1.9%
39
 
1.9%
36
 
1.7%
35
 
1.7%
Other values (162) 1359
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1736
84.1%
Close Punctuation 124
 
6.0%
Open Punctuation 124
 
6.0%
Uppercase Letter 42
 
2.0%
Other Punctuation 31
 
1.5%
Decimal Number 8
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
97
 
5.6%
90
 
5.2%
63
 
3.6%
58
 
3.3%
40
 
2.3%
39
 
2.2%
36
 
2.1%
35
 
2.0%
33
 
1.9%
32
 
1.8%
Other values (150) 1213
69.9%
Uppercase Letter
ValueCountFrequency (%)
B 10
23.8%
O 6
14.3%
C 6
14.3%
X 6
14.3%
E 6
14.3%
S 4
 
9.5%
K 4
 
9.5%
Decimal Number
ValueCountFrequency (%)
2 5
62.5%
1 3
37.5%
Close Punctuation
ValueCountFrequency (%)
) 124
100.0%
Open Punctuation
ValueCountFrequency (%)
( 124
100.0%
Other Punctuation
ValueCountFrequency (%)
· 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1736
84.1%
Common 287
 
13.9%
Latin 42
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
97
 
5.6%
90
 
5.2%
63
 
3.6%
58
 
3.3%
40
 
2.3%
39
 
2.2%
36
 
2.1%
35
 
2.0%
33
 
1.9%
32
 
1.8%
Other values (150) 1213
69.9%
Latin
ValueCountFrequency (%)
B 10
23.8%
O 6
14.3%
C 6
14.3%
X 6
14.3%
E 6
14.3%
S 4
 
9.5%
K 4
 
9.5%
Common
ValueCountFrequency (%)
) 124
43.2%
( 124
43.2%
· 31
 
10.8%
2 5
 
1.7%
1 3
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1736
84.1%
ASCII 298
 
14.4%
None 31
 
1.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 124
41.6%
( 124
41.6%
B 10
 
3.4%
O 6
 
2.0%
C 6
 
2.0%
X 6
 
2.0%
E 6
 
2.0%
2 5
 
1.7%
S 4
 
1.3%
K 4
 
1.3%
Hangul
ValueCountFrequency (%)
97
 
5.6%
90
 
5.2%
63
 
3.6%
58
 
3.3%
40
 
2.3%
39
 
2.2%
36
 
2.1%
35
 
2.0%
33
 
1.9%
32
 
1.8%
Other values (150) 1213
69.9%
None
ValueCountFrequency (%)
· 31
100.0%

출입구번호
Categorical

Distinct17
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
<NA>
174 
1
53 
2
47 
3
45 
4
44 
Other values (12)
83 

Length

Max length4
Median length1
Mean length2.206278
Min length1

Unique

Unique3 ?
Unique (%)0.7%

Sample

1st row2
2nd row2
3rd row3
4th row4
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 174
39.0%
1 53
 
11.9%
2 47
 
10.5%
3 45
 
10.1%
4 44
 
9.9%
5 27
 
6.1%
6 24
 
5.4%
8 7
 
1.6%
7 7
 
1.6%
12 4
 
0.9%
Other values (7) 14
 
3.1%

Length

2023-12-13T02:34:09.850681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 174
39.0%
1 53
 
11.9%
2 47
 
10.5%
3 45
 
10.1%
4 44
 
9.9%
5 27
 
6.1%
6 24
 
5.4%
7 7
 
1.6%
8 7
 
1.6%
12 4
 
0.9%
Other values (7) 14
 
3.1%
Distinct432
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
2023-12-13T02:34:10.157974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length84
Median length63
Mean length37.737668
Min length4

Characters and Unicode

Total characters16831
Distinct characters268
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique420 ?
Unique (%)94.2%

Sample

1st row(2F) 2번 출입구 앞(1F) 밤어사역 방향 승강장 6-1 출입문 앞
2nd row(2F) 2번 출입구 앞(1F) 노포행 승강장 6-1 출입문 앞
3rd row(1F) 3번출입구(B2) 1번/3번 출입구 방향
4th row(1F) 4번출입구(B2) 2번/4번 출입구 방향
5th row(B2) 남산역 방향 표내는 곳 내, 1번/3번 출입구 방향(B3) 남산역 방향 승강장 8-1 출입문 앞
ValueCountFrequency (%)
출입구 420
 
9.9%
방향 325
 
7.6%
1f 193
 
4.5%
승강장 191
 
4.5%
189
 
4.4%
b1 186
 
4.4%
출입문 165
 
3.9%
155
 
3.6%
92
 
2.2%
사이 77
 
1.8%
Other values (504) 2270
53.2%
2023-12-13T02:34:10.780707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3866
23.0%
1 861
 
5.1%
( 843
 
5.0%
) 843
 
5.0%
655
 
3.9%
653
 
3.9%
615
 
3.7%
B 544
 
3.2%
506
 
3.0%
406
 
2.4%
Other values (258) 7039
41.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7862
46.7%
Space Separator 3866
23.0%
Decimal Number 2056
 
12.2%
Uppercase Letter 847
 
5.0%
Open Punctuation 843
 
5.0%
Close Punctuation 843
 
5.0%
Other Punctuation 305
 
1.8%
Dash Punctuation 191
 
1.1%
Math Symbol 13
 
0.1%
Lowercase Letter 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
655
 
8.3%
653
 
8.3%
615
 
7.8%
506
 
6.4%
406
 
5.2%
387
 
4.9%
268
 
3.4%
250
 
3.2%
231
 
2.9%
209
 
2.7%
Other values (228) 3682
46.8%
Decimal Number
ValueCountFrequency (%)
1 861
41.9%
2 398
19.4%
3 278
 
13.5%
4 202
 
9.8%
5 100
 
4.9%
6 81
 
3.9%
0 51
 
2.5%
7 37
 
1.8%
8 35
 
1.7%
9 13
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
B 544
64.2%
F 286
33.8%
S 4
 
0.5%
E 4
 
0.5%
G 3
 
0.4%
X 3
 
0.4%
L 1
 
0.1%
M 1
 
0.1%
A 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
, 150
49.2%
/ 149
48.9%
. 6
 
2.0%
Math Symbol
ValueCountFrequency (%)
> 11
84.6%
~ 2
 
15.4%
Lowercase Letter
ValueCountFrequency (%)
m 4
80.0%
x 1
 
20.0%
Space Separator
ValueCountFrequency (%)
3866
100.0%
Open Punctuation
ValueCountFrequency (%)
( 843
100.0%
Close Punctuation
ValueCountFrequency (%)
) 843
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 191
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8117
48.2%
Hangul 7862
46.7%
Latin 852
 
5.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
655
 
8.3%
653
 
8.3%
615
 
7.8%
506
 
6.4%
406
 
5.2%
387
 
4.9%
268
 
3.4%
250
 
3.2%
231
 
2.9%
209
 
2.7%
Other values (228) 3682
46.8%
Common
ValueCountFrequency (%)
3866
47.6%
1 861
 
10.6%
( 843
 
10.4%
) 843
 
10.4%
2 398
 
4.9%
3 278
 
3.4%
4 202
 
2.5%
- 191
 
2.4%
, 150
 
1.8%
/ 149
 
1.8%
Other values (9) 336
 
4.1%
Latin
ValueCountFrequency (%)
B 544
63.8%
F 286
33.6%
S 4
 
0.5%
E 4
 
0.5%
m 4
 
0.5%
G 3
 
0.4%
X 3
 
0.4%
L 1
 
0.1%
M 1
 
0.1%
x 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8969
53.3%
Hangul 7862
46.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3866
43.1%
1 861
 
9.6%
( 843
 
9.4%
) 843
 
9.4%
B 544
 
6.1%
2 398
 
4.4%
F 286
 
3.2%
3 278
 
3.1%
4 202
 
2.3%
- 191
 
2.1%
Other values (20) 657
 
7.3%
Hangul
ValueCountFrequency (%)
655
 
8.3%
653
 
8.3%
615
 
7.8%
506
 
6.4%
406
 
5.2%
387
 
4.9%
268
 
3.4%
250
 
3.2%
231
 
2.9%
209
 
2.7%
Other values (228) 3682
46.8%

운행구간
Categorical

Distinct21
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size3.6 KiB
B1-1F
201 
B2-B1
92 
B3-B1
39 
1F-2F
34 
B2-1F
 
12
Other values (16)
68 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row1F-2F
2nd row1F-2F
3rd rowB2-1F
4th rowB2-1F
5th rowB3-B2

Common Values

ValueCountFrequency (%)
B1-1F 201
45.1%
B2-B1 92
20.6%
B3-B1 39
 
8.7%
1F-2F 34
 
7.6%
B2-1F 12
 
2.7%
2F-3F 9
 
2.0%
B4-B1 8
 
1.8%
B3-B2 7
 
1.6%
B3-1F 6
 
1.3%
B9-B1 5
 
1.1%
Other values (11) 33
 
7.4%

Length

2023-12-13T02:34:10.943876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
b1-1f 201
45.1%
b2-b1 92
20.6%
b3-b1 39
 
8.7%
1f-2f 34
 
7.6%
b2-1f 12
 
2.7%
2f-3f 9
 
2.0%
b4-b1 8
 
1.8%
b3-b2 7
 
1.6%
b3-1f 6
 
1.3%
b4-b2 5
 
1.1%
Other values (11) 33
 
7.4%

Interactions

2023-12-13T02:34:07.973712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:34:11.038759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번호선출입구번호운행구간
연번1.0000.9770.3000.651
호선0.9771.0000.2480.641
출입구번호0.3000.2481.0000.383
운행구간0.6510.6410.3831.000
2023-12-13T02:34:11.168444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출입구번호호선운행구간
출입구번호1.0000.1150.144
호선0.1151.0000.396
운행구간0.1440.3961.000
2023-12-13T02:34:11.645261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번호선출입구번호운행구간
연번1.0000.9180.1190.301
호선0.9181.0000.1150.396
출입구번호0.1190.1151.0000.144
운행구간0.3010.3960.1441.000

Missing values

2023-12-13T02:34:08.115522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:34:08.276123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번호선역명출입구번호상세위치운행구간
011노포(종합버스터미널)2(2F) 2번 출입구 앞(1F) 밤어사역 방향 승강장 6-1 출입문 앞1F-2F
121노포(종합버스터미널)2(2F) 2번 출입구 앞(1F) 노포행 승강장 6-1 출입문 앞1F-2F
231범어사3(1F) 3번출입구(B2) 1번/3번 출입구 방향B2-1F
341범어사4(1F) 4번출입구(B2) 2번/4번 출입구 방향B2-1F
451범어사<NA>(B2) 남산역 방향 표내는 곳 내, 1번/3번 출입구 방향(B3) 남산역 방향 승강장 8-1 출입문 앞B3-B2
561범어사<NA>(B2) 노포역 방향 표내는 곳 내, 2번/4번 출입구방향(B3) 노포역 방향 승강장 8-1출입문 앞B3-B2
671남산(부산외국대학교)3(1F) 3번 출입구 앞(B1) 10번 표내는 곳 옆B1-1F
781남산(부산외국대학교)4(1F) 4번 출입구 앞(B1) 20번 표내는 곳 옆B1-1F
891남산(부산외국대학교)<NA>(B1) 15번 표내는 곳 앞(B2) 두실역 방향 승강장 5-3 출입문 앞B2-B1
9101남산(부산외국대학교)<NA>(B1) 25번 표내는 곳 앞, 2번/4번 출입구 방향(B2) 범어사역 방향 승강장 8-2 출입문 앞B2-B1
연번호선역명출입구번호상세위치운행구간
4364374영산대(아랫반송)<NA>(2F) E/S 앞2F-3F
4374384윗반송1(1F) 1번/3번 출입구 사이 엘리베이터(2F) 1번/3번 출입구 사이 엘리베이터, 대합실 옆1F-2F
4384394윗반송2(1F) 2번/4번 출입구 사이 엘리베이터(2F) 2번/4번 출입구 사이 엘리베이터, 대합실 옆1F-2F
4394404윗반송<NA>(2F) 표내는곳 옆(3F) 승강장 1-2 출입문 앞2F-3F
4404414고촌1(1F) 1번/3번 출입구 사이 엘리베이터(2F) 1번/3번 출입구 사이 엘리베이터, 대합실 옆1F-2F
4414424고촌2(1F) 2번/4번 출입구 사이 엘리베이터(2F) 2번/4번 출입구 사이 엘리베이터, 대합실 옆1F-2F
4424434고촌<NA>(2F) 표내는곳 옆(3F) 승강장 1-2 출입문 앞2F-3F
4434444안평(고촌주택단지)1(1) 1,3번 출입구 사이 (2) 1번 출입구 계단 앞1F-2F
4444454안평(고촌주택단지)2(1) 2,4번 출입구 사이 (2) 2번 출입구 계단 앞1F-2F
4454464안평(고촌주택단지)<NA>(2) 표내는 곳 안쪽 에스컬레이터 앞(3) 승강장 미남방향 1-1 옆2F-3F