Overview

Dataset statistics

Number of variables7
Number of observations68
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)2.9%
Total size in memory4.0 KiB
Average record size in memory59.9 B

Variable types

Categorical6
Text1

Dataset

Description부산교통공사에서 운영하는 부산3호선의 엘리베이터에 대한 데이터로 철도운영기관명, 선명, 역명, 출입구번호, 상세위치, 정원인원, 정원중량의데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041378/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
Dataset has 2 (2.9%) duplicate rowsDuplicates
정원_중량(kg) is highly overall correlated with 역명 and 1 other fieldsHigh correlation
정원_인원 is highly overall correlated with 역명 and 1 other fieldsHigh correlation
역명 is highly overall correlated with 정원_인원 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 04:47:33.996544
Analysis finished2023-12-12 04:47:34.476885
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
부산교통공사
68 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 68
100.0%

Length

2023-12-12T13:47:34.545206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:47:34.641474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 68
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
3호선
68 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3호선
2nd row3호선
3rd row3호선
4th row3호선
5th row3호선

Common Values

ValueCountFrequency (%)
3호선 68
100.0%

Length

2023-12-12T13:47:34.738662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:47:34.840351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3호선 68
100.0%

역명
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)23.5%
Missing0
Missing (%)0.0%
Memory size676.0 B
만덕
망미
배산
거제
종합운동장
Other values (11)
38 

Length

Max length5
Median length2
Mean length2.5588235
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row망미
2nd row망미
3rd row망미
4th row망미
5th row망미

Common Values

ValueCountFrequency (%)
만덕 7
10.3%
망미 6
 
8.8%
배산 6
 
8.8%
거제 6
 
8.8%
종합운동장 5
 
7.4%
덕천 5
 
7.4%
물만골 4
 
5.9%
사직 4
 
5.9%
남산정 4
 
5.9%
숙등 4
 
5.9%
Other values (6) 17
25.0%

Length

2023-12-12T13:47:34.955579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
만덕 7
10.3%
망미 6
 
8.8%
배산 6
 
8.8%
거제 6
 
8.8%
종합운동장 5
 
7.4%
덕천 5
 
7.4%
물만골 4
 
5.9%
사직 4
 
5.9%
남산정 4
 
5.9%
숙등 4
 
5.9%
Other values (6) 17
25.0%

출입구번호
Categorical

Distinct17
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size676.0 B
<NA>
32 
4번
10 
3번
1번
5번
 
2
Other values (12)
14 

Length

Max length5
Median length4.5
Mean length3.1764706
Min length2

Unique

Unique10 ?
Unique (%)14.7%

Sample

1st row3번
2nd row4번
3rd row3번
4th row4번
5th row3번

Common Values

ValueCountFrequency (%)
<NA> 32
47.1%
4번 10
 
14.7%
3번 6
 
8.8%
1번 4
 
5.9%
5번 2
 
2.9%
1번 3번 2
 
2.9%
2번 2
 
2.9%
7번 1
 
1.5%
6번 1
 
1.5%
2번 4번 1
 
1.5%
Other values (7) 7
 
10.3%

Length

2023-12-12T13:47:35.082563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 32
45.1%
4번 11
 
15.5%
3번 8
 
11.3%
1번 6
 
8.5%
2번 3
 
4.2%
5번 2
 
2.8%
7번 1
 
1.4%
6번 1
 
1.4%
17번 1
 
1.4%
9번 1
 
1.4%
Other values (5) 5
 
7.0%
Distinct66
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size676.0 B
2023-12-12T13:47:35.409854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length79
Median length62
Mean length34.205882
Min length11

Characters and Unicode

Total characters2326
Distinct characters139
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)94.1%

Sample

1st rowB1 3번 출입구 앞
2nd rowB1 4번 출입구 앞
3rd rowB1 10번대 표내는곳 방향 > B6 수영역 방향 승강장 4-2 출입문 앞
4th rowB1 20번대 표내는곳 방향 > B6 배산역 방향 승강장 4-2 출입문 앞
5th rowB1 10번대 표내는곳 방향 > B6 수영역 방향 승강장 4-2 출입문 앞
ValueCountFrequency (%)
출입구 54
 
9.2%
방향 48
 
8.2%
승강장 33
 
5.6%
27
 
4.6%
b1 26
 
4.4%
출입문 22
 
3.7%
21
 
3.6%
1f 15
 
2.6%
11
 
1.9%
대합실 10
 
1.7%
Other values (137) 321
54.6%
2023-12-12T13:47:35.930131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
522
22.4%
1 114
 
4.9%
) 102
 
4.4%
( 102
 
4.4%
95
 
4.1%
86
 
3.7%
B 80
 
3.4%
77
 
3.3%
76
 
3.3%
71
 
3.1%
Other values (129) 1001
43.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1134
48.8%
Space Separator 522
22.4%
Decimal Number 296
 
12.7%
Uppercase Letter 109
 
4.7%
Close Punctuation 102
 
4.4%
Open Punctuation 102
 
4.4%
Dash Punctuation 30
 
1.3%
Other Punctuation 20
 
0.9%
Math Symbol 11
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
95
 
8.4%
86
 
7.6%
77
 
6.8%
76
 
6.7%
71
 
6.3%
71
 
6.3%
47
 
4.1%
40
 
3.5%
38
 
3.4%
34
 
3.0%
Other values (111) 499
44.0%
Decimal Number
ValueCountFrequency (%)
1 114
38.5%
2 58
19.6%
4 39
 
13.2%
3 37
 
12.5%
0 12
 
4.1%
5 11
 
3.7%
6 11
 
3.7%
9 8
 
2.7%
8 6
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
B 80
73.4%
F 28
 
25.7%
M 1
 
0.9%
Space Separator
ValueCountFrequency (%)
522
100.0%
Close Punctuation
ValueCountFrequency (%)
) 102
100.0%
Open Punctuation
ValueCountFrequency (%)
( 102
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 20
100.0%
Math Symbol
ValueCountFrequency (%)
> 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1134
48.8%
Common 1083
46.6%
Latin 109
 
4.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
95
 
8.4%
86
 
7.6%
77
 
6.8%
76
 
6.7%
71
 
6.3%
71
 
6.3%
47
 
4.1%
40
 
3.5%
38
 
3.4%
34
 
3.0%
Other values (111) 499
44.0%
Common
ValueCountFrequency (%)
522
48.2%
1 114
 
10.5%
) 102
 
9.4%
( 102
 
9.4%
2 58
 
5.4%
4 39
 
3.6%
3 37
 
3.4%
- 30
 
2.8%
/ 20
 
1.8%
0 12
 
1.1%
Other values (5) 47
 
4.3%
Latin
ValueCountFrequency (%)
B 80
73.4%
F 28
 
25.7%
M 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1192
51.2%
Hangul 1134
48.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
522
43.8%
1 114
 
9.6%
) 102
 
8.6%
( 102
 
8.6%
B 80
 
6.7%
2 58
 
4.9%
4 39
 
3.3%
3 37
 
3.1%
- 30
 
2.5%
F 28
 
2.3%
Other values (8) 80
 
6.7%
Hangul
ValueCountFrequency (%)
95
 
8.4%
86
 
7.6%
77
 
6.8%
76
 
6.7%
71
 
6.3%
71
 
6.3%
47
 
4.1%
40
 
3.5%
38
 
3.4%
34
 
3.0%
Other values (111) 499
44.0%

정원_인원
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size676.0 B
13
50 
15
13 
10
 
5

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row13
2nd row13
3rd row15
4th row15
5th row15

Common Values

ValueCountFrequency (%)
13 50
73.5%
15 13
 
19.1%
10 5
 
7.4%

Length

2023-12-12T13:47:36.083919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:47:36.203403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
13 50
73.5%
15 13
 
19.1%
10 5
 
7.4%

정원_중량(kg)
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size676.0 B
1000
50 
1150
13 
750
 
5

Length

Max length4
Median length4
Mean length3.9264706
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1000
2nd row1000
3rd row1150
4th row1150
5th row1150

Common Values

ValueCountFrequency (%)
1000 50
73.5%
1150 13
 
19.1%
750 5
 
7.4%

Length

2023-12-12T13:47:36.304810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:47:36.407860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1000 50
73.5%
1150 13
 
19.1%
750 5
 
7.4%

Correlations

2023-12-12T13:47:36.482322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명출입구번호상세위치정원_인원정원_중량(kg)
역명1.0000.7941.0000.8010.801
출입구번호0.7941.0001.0000.5150.515
상세위치1.0001.0001.0001.0001.000
정원_인원0.8010.5151.0001.0001.000
정원_중량(kg)0.8010.5151.0001.0001.000
2023-12-12T13:47:36.597498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명정원_중량(kg)출입구번호정원_인원
역명1.0000.5690.2750.569
정원_중량(kg)0.5691.0000.2331.000
출입구번호0.2750.2331.0000.233
정원_인원0.5691.0000.2331.000
2023-12-12T13:47:36.693707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명출입구번호정원_인원정원_중량(kg)
역명1.0000.2750.5690.569
출입구번호0.2751.0000.2330.233
정원_인원0.5690.2331.0001.000
정원_중량(kg)0.5690.2331.0001.000

Missing values

2023-12-12T13:47:34.314447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:47:34.430676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명출입구번호상세위치정원_인원정원_중량(kg)
0부산교통공사3호선망미3번B1 3번 출입구 앞131000
1부산교통공사3호선망미4번B1 4번 출입구 앞131000
2부산교통공사3호선망미3번B1 10번대 표내는곳 방향 > B6 수영역 방향 승강장 4-2 출입문 앞151150
3부산교통공사3호선망미4번B1 20번대 표내는곳 방향 > B6 배산역 방향 승강장 4-2 출입문 앞151150
4부산교통공사3호선망미3번B1 10번대 표내는곳 방향 > B6 수영역 방향 승강장 4-2 출입문 앞151150
5부산교통공사3호선망미4번B1 20번대 표내는곳 방향 > B6 배산역 방향 승강장 4-2 출입문 앞151150
6부산교통공사3호선배산5번(1F) 5번 출입구 옆(B1) 5번 출입구 방향131000
7부산교통공사3호선배산6번(1F) 6번 출입구 옆(B1) 6번 출입구 방향131000
8부산교통공사3호선배산<NA>(B1) 10번대 개집표기 내(B8) 승강장(수영방면)151150
9부산교통공사3호선배산<NA>(B1) 20번대 개집표기 내(B8) 승강장(대저방면)151150
철도운영기관명선명역명출입구번호상세위치정원_인원정원_중량(kg)
58부산교통공사3호선구포<NA>(2F) 1번대 표내는곳 앞 (1F) 덕천역 방향 승강장 2-1 출입문 앞131000
59부산교통공사3호선구포<NA>(2F) 30번대 표내는곳 앞 (1F) 강서구청역 방향 승강장 2-1 출입문 앞131000
60부산교통공사3호선강서구청1번1층 1번 출입구와 주차장 출입구 중간에 위치131000
61부산교통공사3호선강서구청<NA>(4F) 4층 대저방향 게이트 출입구 방향131000
62부산교통공사3호선강서구청<NA>(4F) 4층 수영방향 게이트 출입구 방향131000
63부산교통공사3호선체육공원1번2번2번 출구 맞은편/1번 출구 옆131000
64부산교통공사3호선체육공원<NA>게이트 20번대 안쪽131000
65부산교통공사3호선체육공원<NA>게이트 10번대 안쪽131000
66부산교통공사3호선대저1번(1F) 2번 출입구 근처 수방함 옆(MF) 1번 출입구 근처 벤치 옆(2F) 1번 출입구 근처 전동휠체어 충전기 옆131000
67부산교통공사3호선대저<NA>(2F) 1번 출입구 근처 13번 개찰구 옆(3F) 체육공원역 방향 승강장 4-3 출입문 앞131000

Duplicate rows

Most frequently occurring

철도운영기관명선명역명출입구번호상세위치정원_인원정원_중량(kg)# duplicates
0부산교통공사3호선망미3번B1 10번대 표내는곳 방향 > B6 수영역 방향 승강장 4-2 출입문 앞1511502
1부산교통공사3호선망미4번B1 20번대 표내는곳 방향 > B6 배산역 방향 승강장 4-2 출입문 앞1511502