Overview

Dataset statistics

Number of variables7
Number of observations76
Missing cells41
Missing cells (%)7.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.5 KiB
Average record size in memory60.7 B

Variable types

Categorical5
Numeric1
Text1

Dataset

Description대전교통공사에서 운영하는 노선의 엘리베이터에 대한 데이터로 철도운영기관명, 선명, 역명, 출입구번호, 상세위치, 정원인원, 정원중량의데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041384/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
정원_중량(kg) is highly overall correlated with 역명 and 1 other fieldsHigh correlation
정원_인원 is highly overall correlated with 역명 and 1 other fieldsHigh correlation
역명 is highly overall correlated with 정원_인원 and 1 other fieldsHigh correlation
출입구번호 has 41 (53.9%) missing valuesMissing

Reproduction

Analysis started2023-12-12 22:40:48.443638
Analysis finished2023-12-12 22:40:49.096046
Duration0.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size740.0 B
대전교통공사
76 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대전교통공사
2nd row대전교통공사
3rd row대전교통공사
4th row대전교통공사
5th row대전교통공사

Common Values

ValueCountFrequency (%)
대전교통공사 76
100.0%

Length

2023-12-13T07:40:49.173746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:40:49.272863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대전교통공사 76
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size740.0 B
1호선
76 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1호선
2nd row1호선
3rd row1호선
4th row1호선
5th row1호선

Common Values

ValueCountFrequency (%)
1호선 76
100.0%

Length

2023-12-13T07:40:49.367168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:40:49.457210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1호선 76
100.0%

역명
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)28.9%
Missing0
Missing (%)0.0%
Memory size740.0 B
월드컵경기장(노은도매시장)
갈마
 
5
갑천
 
4
판암(대전대)
 
4
정부청사
 
4
Other values (17)
53 

Length

Max length14
Median length13
Mean length5.4342105
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row갈마
2nd row갈마
3rd row갈마
4th row갈마
5th row갈마

Common Values

ValueCountFrequency (%)
월드컵경기장(노은도매시장) 6
 
7.9%
갈마 5
 
6.6%
갑천 4
 
5.3%
판암(대전대) 4
 
5.3%
정부청사 4
 
5.3%
유성온천(충남대·목원대) 4
 
5.3%
월평(한국과학기술원) 4
 
5.3%
현충원(한밭대) 4
 
5.3%
노은 4
 
5.3%
대동(우송대) 3
 
3.9%
Other values (12) 34
44.7%

Length

2023-12-13T07:40:49.547243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
월드컵경기장(노은도매시장 6
 
7.9%
갈마 5
 
6.6%
갑천 4
 
5.3%
판암(대전대 4
 
5.3%
정부청사 4
 
5.3%
유성온천(충남대·목원대 4
 
5.3%
월평(한국과학기술원 4
 
5.3%
현충원(한밭대 4
 
5.3%
노은 4
 
5.3%
대전 3
 
3.9%
Other values (12) 34
44.7%

출입구번호
Real number (ℝ)

MISSING 

Distinct7
Distinct (%)20.0%
Missing41
Missing (%)53.9%
Infinite0
Infinite (%)0.0%
Mean2.6857143
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size816.0 B
2023-12-13T07:40:49.633706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q33
95-th percentile7
Maximum8
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.8593394
Coefficient of variation (CV)0.69230721
Kurtosis1.5801037
Mean2.6857143
Median Absolute Deviation (MAD)2
Skewness1.334117
Sum94
Variance3.4571429
MonotonicityNot monotonic
2023-12-13T07:40:49.725394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 13
 
17.1%
3 13
 
17.1%
2 3
 
3.9%
7 2
 
2.6%
5 2
 
2.6%
8 1
 
1.3%
4 1
 
1.3%
(Missing) 41
53.9%
ValueCountFrequency (%)
1 13
17.1%
2 3
 
3.9%
3 13
17.1%
4 1
 
1.3%
5 2
 
2.6%
7 2
 
2.6%
8 1
 
1.3%
ValueCountFrequency (%)
8 1
 
1.3%
7 2
 
2.6%
5 2
 
2.6%
4 1
 
1.3%
3 13
17.1%
2 3
 
3.9%
1 13
17.1%
Distinct60
Distinct (%)78.9%
Missing0
Missing (%)0.0%
Memory size740.0 B
2023-12-13T07:40:49.893665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length50
Mean length39.710526
Min length24

Characters and Unicode

Total characters3018
Distinct characters89
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)68.4%

Sample

1st row(1F) 2번 출입구 옆 (B1) 대합실 1번/2번 출입구 방향
2nd row(1F) 3번 출입구 옆 (B1) 대합실 3번/4번 출입구 방향
3rd row(B1) 대합실 (B2) 3번/4번 출입구 방향 연결통로
4th row(B1) 표내는곳 내 대합실 (B2) 정부청사역 방향 승강장 3-4 출입문 앞
5th row(B1) 표내는곳 내 대합실 (B2) 월평역 방향 승강장 4-1 출입문 앞
ValueCountFrequency (%)
대합실 74
 
9.5%
방향 72
 
9.2%
출입구 69
 
8.8%
b1 66
 
8.5%
b2 42
 
5.4%
40
 
5.1%
40
 
5.1%
출입문 40
 
5.1%
승강장 40
 
5.1%
표내는곳 38
 
4.9%
Other values (55) 259
33.2%
2023-12-13T07:40:50.184204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
705
23.4%
( 158
 
5.2%
) 158
 
5.2%
1 149
 
4.9%
B 123
 
4.1%
111
 
3.7%
109
 
3.6%
109
 
3.6%
84
 
2.8%
2 78
 
2.6%
Other values (79) 1234
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1408
46.7%
Space Separator 705
23.4%
Decimal Number 349
 
11.6%
Open Punctuation 158
 
5.2%
Close Punctuation 158
 
5.2%
Uppercase Letter 158
 
5.2%
Other Punctuation 42
 
1.4%
Dash Punctuation 40
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
111
 
7.9%
109
 
7.7%
109
 
7.7%
84
 
6.0%
78
 
5.5%
78
 
5.5%
74
 
5.3%
73
 
5.2%
73
 
5.2%
72
 
5.1%
Other values (64) 547
38.8%
Decimal Number
ValueCountFrequency (%)
1 149
42.7%
2 78
22.3%
3 51
 
14.6%
4 48
 
13.8%
5 14
 
4.0%
6 5
 
1.4%
7 2
 
0.6%
8 2
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
B 123
77.8%
F 35
 
22.2%
Space Separator
ValueCountFrequency (%)
705
100.0%
Open Punctuation
ValueCountFrequency (%)
( 158
100.0%
Close Punctuation
ValueCountFrequency (%)
) 158
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 42
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1452
48.1%
Hangul 1408
46.7%
Latin 158
 
5.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
111
 
7.9%
109
 
7.7%
109
 
7.7%
84
 
6.0%
78
 
5.5%
78
 
5.5%
74
 
5.3%
73
 
5.2%
73
 
5.2%
72
 
5.1%
Other values (64) 547
38.8%
Common
ValueCountFrequency (%)
705
48.6%
( 158
 
10.9%
) 158
 
10.9%
1 149
 
10.3%
2 78
 
5.4%
3 51
 
3.5%
4 48
 
3.3%
/ 42
 
2.9%
- 40
 
2.8%
5 14
 
1.0%
Other values (3) 9
 
0.6%
Latin
ValueCountFrequency (%)
B 123
77.8%
F 35
 
22.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1610
53.3%
Hangul 1408
46.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
705
43.8%
( 158
 
9.8%
) 158
 
9.8%
1 149
 
9.3%
B 123
 
7.6%
2 78
 
4.8%
3 51
 
3.2%
4 48
 
3.0%
/ 42
 
2.6%
- 40
 
2.5%
Other values (5) 58
 
3.6%
Hangul
ValueCountFrequency (%)
111
 
7.9%
109
 
7.7%
109
 
7.7%
84
 
6.0%
78
 
5.5%
78
 
5.5%
74
 
5.3%
73
 
5.2%
73
 
5.2%
72
 
5.1%
Other values (64) 547
38.8%

정원_인원
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size740.0 B
15
40 
11
32 
13
 
3
9
 
1

Length

Max length2
Median length2
Mean length1.9868421
Min length1

Unique

Unique1 ?
Unique (%)1.3%

Sample

1st row15
2nd row15
3rd row11
4th row11
5th row11

Common Values

ValueCountFrequency (%)
15 40
52.6%
11 32
42.1%
13 3
 
3.9%
9 1
 
1.3%

Length

2023-12-13T07:40:50.291126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:40:50.376221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
15 40
52.6%
11 32
42.1%
13 3
 
3.9%
9 1
 
1.3%

정원_중량(kg)
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size740.0 B
1000
40 
750
32 
900
 
3
650
 
1

Length

Max length4
Median length4
Mean length3.5263158
Min length3

Unique

Unique1 ?
Unique (%)1.3%

Sample

1st row1000
2nd row1000
3rd row750
4th row750
5th row750

Common Values

ValueCountFrequency (%)
1000 40
52.6%
750 32
42.1%
900 3
 
3.9%
650 1
 
1.3%

Length

2023-12-13T07:40:50.463469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:40:50.541707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1000 40
52.6%
750 32
42.1%
900 3
 
3.9%
650 1
 
1.3%

Interactions

2023-12-13T07:40:48.761569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:40:50.601666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명출입구번호상세위치정원_인원정원_중량(kg)
역명1.0000.6510.8460.9280.928
출입구번호0.6511.0000.8880.6410.641
상세위치0.8460.8881.0000.3030.303
정원_인원0.9280.6410.3031.0001.000
정원_중량(kg)0.9280.6410.3031.0001.000
2023-12-13T07:40:50.677296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정원_중량(kg)정원_인원역명
정원_중량(kg)1.0001.0000.677
정원_인원1.0001.0000.677
역명0.6770.6771.000
2023-12-13T07:40:50.746648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
출입구번호역명정원_인원정원_중량(kg)
출입구번호1.0000.1670.4720.472
역명0.1671.0000.6770.677
정원_인원0.4720.6771.0001.000
정원_중량(kg)0.4720.6771.0001.000

Missing values

2023-12-13T07:40:48.888690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:40:49.036108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명출입구번호상세위치정원_인원정원_중량(kg)
0대전교통공사1호선갈마1(1F) 2번 출입구 옆 (B1) 대합실 1번/2번 출입구 방향151000
1대전교통공사1호선갈마3(1F) 3번 출입구 옆 (B1) 대합실 3번/4번 출입구 방향151000
2대전교통공사1호선갈마3(B1) 대합실 (B2) 3번/4번 출입구 방향 연결통로11750
3대전교통공사1호선갈마<NA>(B1) 표내는곳 내 대합실 (B2) 정부청사역 방향 승강장 3-4 출입문 앞11750
4대전교통공사1호선갈마<NA>(B1) 표내는곳 내 대합실 (B2) 월평역 방향 승강장 4-1 출입문 앞11750
5대전교통공사1호선갑천1(1F) 1번/2번 출입구 근처 (B1) 대합실 1번/2번 출입구 방향151000
6대전교통공사1호선갑천3(1F) 3번 출입구 근처 (B1) 대합실 3번 출입구 방향151000
7대전교통공사1호선갑천<NA>(B1) 표내는곳 내 대합실 (B2) 월평역 방향 승강장 4-1 출입문 앞151000
8대전교통공사1호선갑천<NA>(B1) 표내는곳 내 대합실 (B2) 유성온천역 방향 승강장 3-4 출입문 앞151000
9대전교통공사1호선구암1(1F) 1번/2번 출입구 근처 (B1) 대합실 1번/2번 출입구 방향151000
철도운영기관명선명역명출입구번호상세위치정원_인원정원_중량(kg)
66대전교통공사1호선탄방<NA>(B1) 표내는곳 내 대합실 (B2) 용문역 방향 승강장 2-2 출입문 앞11750
67대전교통공사1호선탄방<NA>(B1) 표내는곳 내 대합실 (B2) 시청역 방향 승강장 5-4 출입문 앞11750
68대전교통공사1호선판암(대전대)1(1F) 1번/2번 출입구 근처 (B1) 1번/2번 출입구 방향 계단 옆11750
69대전교통공사1호선판암(대전대)3(1F) 3번/4번 출입구 근처 (B1) 3번/4번 출입구 방향 계단 옆11750
70대전교통공사1호선판암(대전대)<NA>(B1) 표내는곳 내 대합실 (B2) 승강장 4-1 출입문 앞11750
71대전교통공사1호선판암(대전대)<NA>(B1) 표내는곳 내 대합실 (B2) 신흥역 방향 승강장 3-4 출입문 앞11750
72대전교통공사1호선현충원(한밭대)<NA>(1F) 1번/2번 출입구 근처 (B1) 대합실 1번/2번 출입구 방향11750
73대전교통공사1호선현충원(한밭대)3(1F) 3번/4번 출입구 옆 (B1) 대합실 3번/4번 출입구 방향151000
74대전교통공사1호선현충원(한밭대)<NA>(B1) 표내는곳 내 대합실 (B2) 현충원역 방향 승강장 4-1 출입문 앞11750
75대전교통공사1호선현충원(한밭대)<NA>(B1) 표내는곳 내 대합실 (B2) 월드컵경기장역 방향 승강장 3-4 출입문 앞11750