Overview

Dataset statistics

Number of variables10
Number of observations39
Missing cells19
Missing cells (%)4.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.3 KiB
Average record size in memory85.4 B

Variable types

Categorical7
Text3

Dataset

Description부산1호선에 포함된 도시광역철도역들의 철도운영기관명, 선명, 역명, 지상지하구분, 제세동기의 역층, 출입구번호, 상세위치, 제세동기출력에너지, 제세동기운영방식, 수량에 대한 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041480/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
수량 has constant value ""Constant
역층 is highly imbalanced (52.3%)Imbalance
제세동기운영방식 is highly imbalanced (82.8%)Imbalance
출입구번호 has 19 (48.7%) missing valuesMissing
역명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:44:22.657559
Analysis finished2023-12-12 17:44:23.357269
Duration0.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size444.0 B
부산교통공사
39 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산교통공사
2nd row부산교통공사
3rd row부산교통공사
4th row부산교통공사
5th row부산교통공사

Common Values

ValueCountFrequency (%)
부산교통공사 39
100.0%

Length

2023-12-13T02:44:23.468579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:44:23.656800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산교통공사 39
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size444.0 B
1호선
39 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
1호선 39
100.0%

Length

2023-12-13T02:44:23.822217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:44:23.945327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1호선 39
100.0%

역명
Text

UNIQUE 

Distinct39
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size444.0 B
2023-12-13T02:44:24.185451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length2
Mean length2.4358974
Min length2

Characters and Unicode

Total characters95
Distinct characters59
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)100.0%

Sample

1st row다대포해수욕장
2nd row다대포항
3rd row낫개
4th row신장림
5th row장림
ValueCountFrequency (%)
다대포해수욕장 1
 
2.6%
부산진 1
 
2.6%
범일 1
 
2.6%
범내골 1
 
2.6%
서면 1
 
2.6%
부전 1
 
2.6%
시청 1
 
2.6%
연산 1
 
2.6%
교대 1
 
2.6%
좌천 1
 
2.6%
Other values (29) 29
74.4%
2023-12-13T02:44:24.597083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7
 
7.4%
5
 
5.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.1%
Other values (49) 55
57.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 95
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
7.4%
5
 
5.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.1%
Other values (49) 55
57.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 95
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
7.4%
5
 
5.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.1%
Other values (49) 55
57.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 95
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7
 
7.4%
5
 
5.3%
5
 
5.3%
4
 
4.2%
4
 
4.2%
4
 
4.2%
3
 
3.2%
3
 
3.2%
3
 
3.2%
2
 
2.1%
Other values (49) 55
57.9%
Distinct2
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size444.0 B
지하
32 
지상

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지하
2nd row지하
3rd row지하
4th row지하
5th row지하

Common Values

ValueCountFrequency (%)
지하 32
82.1%
지상 7
 
17.9%

Length

2023-12-13T02:44:24.767664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:44:24.905519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지하 32
82.1%
지상 7
 
17.9%

역층
Categorical

IMBALANCE 

Distinct2
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size444.0 B
1
35 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 35
89.7%
2 4
 
10.3%

Length

2023-12-13T02:44:25.036735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:44:25.171501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 35
89.7%
2 4
 
10.3%

출입구번호
Text

MISSING 

Distinct11
Distinct (%)55.0%
Missing19
Missing (%)48.7%
Memory size444.0 B
2023-12-13T02:44:25.309168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length3
Mean length4.7
Min length3

Characters and Unicode

Total characters94
Distinct characters9
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)25.0%

Sample

1st row2번 4번
2nd row1번 2번
3rd row1번
4th row3번
5th row1번
ValueCountFrequency (%)
3번 8
25.8%
4번 6
19.4%
1번 5
16.1%
2번 5
16.1%
5번 3
 
9.7%
6번 2
 
6.5%
17번 1
 
3.2%
7번 1
 
3.2%
2023-12-13T02:44:25.584298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
31
33.0%
31
33.0%
3 8
 
8.5%
1 6
 
6.4%
4 6
 
6.4%
2 5
 
5.3%
5 3
 
3.2%
6 2
 
2.1%
7 2
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 32
34.0%
Other Letter 31
33.0%
Space Separator 31
33.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 8
25.0%
1 6
18.8%
4 6
18.8%
2 5
15.6%
5 3
 
9.4%
6 2
 
6.2%
7 2
 
6.2%
Other Letter
ValueCountFrequency (%)
31
100.0%
Space Separator
ValueCountFrequency (%)
31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 63
67.0%
Hangul 31
33.0%

Most frequent character per script

Common
ValueCountFrequency (%)
31
49.2%
3 8
 
12.7%
1 6
 
9.5%
4 6
 
9.5%
2 5
 
7.9%
5 3
 
4.8%
6 2
 
3.2%
7 2
 
3.2%
Hangul
ValueCountFrequency (%)
31
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 63
67.0%
Hangul 31
33.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
31
100.0%
ASCII
ValueCountFrequency (%)
31
49.2%
3 8
 
12.7%
1 6
 
9.5%
4 6
 
9.5%
2 5
 
7.9%
5 3
 
4.8%
6 2
 
3.2%
7 2
 
3.2%
Distinct20
Distinct (%)51.3%
Missing0
Missing (%)0.0%
Memory size444.0 B
2023-12-13T02:44:25.767797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length10
Mean length12.25641
Min length8

Characters and Unicode

Total characters478
Distinct characters48
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)38.5%

Sample

1st row(B1) 역무안전실 옆
2nd row(B1) 역무안전실 앞
3rd row(B1) E/L 1호기 인근 표사는 곳 옆
4th row(B1) 역무안전실 근처
5th row(B1)역무안전실 근처
ValueCountFrequency (%)
역무안전실 34
31.8%
b1 29
27.1%
6
 
5.6%
1f 5
 
4.7%
4
 
3.7%
근처 4
 
3.7%
대합실 3
 
2.8%
b2 2
 
1.9%
2f 2
 
1.9%
인근 2
 
1.9%
Other values (14) 16
15.0%
2023-12-13T02:44:26.109632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
70
14.6%
( 40
8.4%
40
8.4%
) 40
8.4%
1 36
7.5%
36
7.5%
36
7.5%
35
7.3%
35
7.3%
B 32
6.7%
Other values (38) 78
16.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 242
50.6%
Space Separator 70
 
14.6%
Decimal Number 43
 
9.0%
Uppercase Letter 41
 
8.6%
Open Punctuation 40
 
8.4%
Close Punctuation 40
 
8.4%
Other Punctuation 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
16.5%
36
14.9%
36
14.9%
35
14.5%
35
14.5%
6
 
2.5%
6
 
2.5%
4
 
1.7%
4
 
1.7%
3
 
1.2%
Other values (26) 37
15.3%
Decimal Number
ValueCountFrequency (%)
1 36
83.7%
2 5
 
11.6%
4 1
 
2.3%
7 1
 
2.3%
Uppercase Letter
ValueCountFrequency (%)
B 32
78.0%
F 7
 
17.1%
E 1
 
2.4%
L 1
 
2.4%
Space Separator
ValueCountFrequency (%)
70
100.0%
Open Punctuation
ValueCountFrequency (%)
( 40
100.0%
Close Punctuation
ValueCountFrequency (%)
) 40
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 242
50.6%
Common 195
40.8%
Latin 41
 
8.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
16.5%
36
14.9%
36
14.9%
35
14.5%
35
14.5%
6
 
2.5%
6
 
2.5%
4
 
1.7%
4
 
1.7%
3
 
1.2%
Other values (26) 37
15.3%
Common
ValueCountFrequency (%)
70
35.9%
( 40
20.5%
) 40
20.5%
1 36
18.5%
2 5
 
2.6%
/ 2
 
1.0%
4 1
 
0.5%
7 1
 
0.5%
Latin
ValueCountFrequency (%)
B 32
78.0%
F 7
 
17.1%
E 1
 
2.4%
L 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 242
50.6%
ASCII 236
49.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
70
29.7%
( 40
16.9%
) 40
16.9%
1 36
15.3%
B 32
13.6%
F 7
 
3.0%
2 5
 
2.1%
/ 2
 
0.8%
4 1
 
0.4%
7 1
 
0.4%
Other values (2) 2
 
0.8%
Hangul
ValueCountFrequency (%)
40
16.5%
36
14.9%
36
14.9%
35
14.5%
35
14.5%
6
 
2.5%
6
 
2.5%
4
 
1.7%
4
 
1.7%
3
 
1.2%
Other values (26) 37
15.3%
Distinct2
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size444.0 B
성인 150 / 소아 50
33 
200

Length

Max length14
Median length14
Mean length12.307692
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성인 150 / 소아 50
2nd row성인 150 / 소아 50
3rd row성인 150 / 소아 50
4th row성인 150 / 소아 50
5th row성인 150 / 소아 50

Common Values

ValueCountFrequency (%)
성인 150 / 소아 50 33
84.6%
200 6
 
15.4%

Length

2023-12-13T02:44:26.241966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:44:26.343534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
성인 33
19.3%
150 33
19.3%
33
19.3%
소아 33
19.3%
50 33
19.3%
200 6
 
3.5%

제세동기운영방식
Categorical

IMBALANCE 

Distinct2
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size444.0 B
자동
38 
수동
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)2.6%

Sample

1st row자동
2nd row자동
3rd row자동
4th row자동
5th row자동

Common Values

ValueCountFrequency (%)
자동 38
97.4%
수동 1
 
2.6%

Length

2023-12-13T02:44:26.446707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:44:26.569510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자동 38
97.4%
수동 1
 
2.6%

수량
Categorical

CONSTANT 

Distinct1
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size444.0 B
1
39 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 39
100.0%

Length

2023-12-13T02:44:26.687067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:44:26.783868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 39
100.0%

Correlations

2023-12-13T02:44:26.844438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명지상지하구분역층출입구번호상세위치제세동기출력에너지제세동기운영방식
역명1.0001.0001.0001.0001.0001.0001.000
지상지하구분1.0001.0000.1010.0001.0000.0000.000
역층1.0000.1011.0000.0001.0000.0000.000
출입구번호1.0000.0000.0001.0000.0000.0000.000
상세위치1.0001.0001.0000.0001.0000.0000.000
제세동기출력에너지1.0000.0000.0000.0000.0001.0000.000
제세동기운영방식1.0000.0000.0000.0000.0000.0001.000
2023-12-13T02:44:26.965164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제세동기운영방식제세동기출력에너지지상지하구분역층
제세동기운영방식1.0000.0000.0000.000
제세동기출력에너지0.0001.0000.0000.000
지상지하구분0.0000.0001.0000.059
역층0.0000.0000.0591.000
2023-12-13T02:44:27.064876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지상지하구분역층제세동기출력에너지제세동기운영방식
지상지하구분1.0000.0590.0000.000
역층0.0591.0000.0000.000
제세동기출력에너지0.0000.0001.0000.000
제세동기운영방식0.0000.0000.0001.000

Missing values

2023-12-13T02:44:23.099880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:44:23.279014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명지상지하구분역층출입구번호상세위치제세동기출력에너지제세동기운영방식수량
0부산교통공사1호선다대포해수욕장지하12번 4번(B1) 역무안전실 옆성인 150 / 소아 50자동1
1부산교통공사1호선다대포항지하11번 2번(B1) 역무안전실 앞성인 150 / 소아 50자동1
2부산교통공사1호선낫개지하11번(B1) E/L 1호기 인근 표사는 곳 옆성인 150 / 소아 50자동1
3부산교통공사1호선신장림지하1<NA>(B1) 역무안전실 근처성인 150 / 소아 50자동1
4부산교통공사1호선장림지하13번(B1)역무안전실 근처성인 150 / 소아 50자동1
5부산교통공사1호선동매지하11번(B1) 역무안전실 맞은편 벽면성인 150 / 소아 50자동1
6부산교통공사1호선신평지하1<NA>(B1) 역무안전실 근처성인 150 / 소아 50자동1
7부산교통공사1호선하단지하13번 4번 5번 6번(B1) 대합실 역무안전실 맞은편성인 150 / 소아 50자동1
8부산교통공사1호선당리지하1<NA>(B1) 역무안전실성인 150 / 소아 50자동1
9부산교통공사1호선사하지하1<NA>(B1) 역무안전실 옆성인 150 / 소아 50자동1
철도운영기관명선명역명지상지하구분역층출입구번호상세위치제세동기출력에너지제세동기운영방식수량
29부산교통공사1호선동래지상1<NA>(1F) 대합실 중앙 역무안전실성인 150 / 소아 50자동1
30부산교통공사1호선명륜지상11번 2번(1F) 대합실성인 150 / 소아 50자동1
31부산교통공사1호선온천장지상13번 4번(1F) 역무안전실성인 150 / 소아 50자동1
32부산교통공사1호선부산대지상1<NA>(1F) 역무안전실성인 150 / 소아 50자동1
33부산교통공사1호선장전지상13번(1F) 역무안전실성인 150 / 소아 50자동1
34부산교통공사1호선구서지상2<NA>(2F) 역무안전실성인 150 / 소아 50자동1
35부산교통공사1호선두실지하1<NA>(B1) 역무안전실성인 150 / 소아 50자동1
36부산교통공사1호선남산지하1<NA>(B1) 역무안전실 옆성인 150 / 소아 50자동1
37부산교통공사1호선범어사지하1<NA>(B1) 역무안전실성인 150 / 소아 50자동1
38부산교통공사1호선노포지상21번(2F) 화장실 앞성인 150 / 소아 50자동1