Overview

Dataset statistics

Number of variables10
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory86.4 B

Variable types

Categorical6
Text1
Boolean3

Dataset

Description용인경전철에서 운영하는 노선의 승강장 정보에 대한 데이터로 철도운영기관명, 선명, 역명, 승강장번호, 상하행구분, 지상구분, 역층, 승강장연결 여부, 스크린도어 유무, 안전발판 유무의 데이터가 있습니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15041182/fileData.do

Alerts

철도운영기관명 has constant value ""Constant
선명 has constant value ""Constant
지상구분 has constant value ""Constant
승강장연결 여부 has constant value ""Constant
스크린도어 유무 has constant value ""Constant
안전발판 유무 has constant value ""Constant
상하행 is highly overall correlated with 승강장번호High correlation
승강장번호 is highly overall correlated with 상하행High correlation

Reproduction

Analysis started2023-12-12 09:07:21.830961
Analysis finished2023-12-12 09:07:22.474055
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

철도운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
용인경량전철주식회사
30 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용인경량전철주식회사
2nd row용인경량전철주식회사
3rd row용인경량전철주식회사
4th row용인경량전철주식회사
5th row용인경량전철주식회사

Common Values

ValueCountFrequency (%)
용인경량전철주식회사 30
100.0%

Length

2023-12-12T18:07:22.569221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:07:22.685204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
용인경량전철주식회사 30
100.0%

선명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
용인에버라인
30 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용인에버라인
2nd row용인에버라인
3rd row용인에버라인
4th row용인에버라인
5th row용인에버라인

Common Values

ValueCountFrequency (%)
용인에버라인 30
100.0%

Length

2023-12-12T18:07:22.794434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:07:22.907402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
용인에버라인 30
100.0%

역명
Text

Distinct15
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-12T18:07:23.077583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length2
Mean length3.7333333
Min length2

Characters and Unicode

Total characters112
Distinct characters44
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강남대
2nd row강남대
3rd row고진
4th row고진
5th row기흥(백남준아트센터)
ValueCountFrequency (%)
강남대 2
 
6.7%
고진 2
 
6.7%
기흥(백남준아트센터 2
 
6.7%
김량장 2
 
6.7%
동백 2
 
6.7%
둔전 2
 
6.7%
명지대 2
 
6.7%
보평 2
 
6.7%
삼가 2
 
6.7%
시청·용인대 2
 
6.7%
Other values (5) 10
33.3%
2023-12-12T18:07:23.491192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
8.9%
· 6
 
5.4%
4
 
3.6%
4
 
3.6%
4
 
3.6%
4
 
3.6%
4
 
3.6%
4
 
3.6%
2
 
1.8%
2
 
1.8%
Other values (34) 68
60.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 102
91.1%
Other Punctuation 6
 
5.4%
Open Punctuation 2
 
1.8%
Close Punctuation 2
 
1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
9.8%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (31) 62
60.8%
Other Punctuation
ValueCountFrequency (%)
· 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 102
91.1%
Common 10
 
8.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
9.8%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (31) 62
60.8%
Common
ValueCountFrequency (%)
· 6
60.0%
( 2
 
20.0%
) 2
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 102
91.1%
None 6
 
5.4%
ASCII 4
 
3.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10
 
9.8%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
2
 
2.0%
2
 
2.0%
2
 
2.0%
Other values (31) 62
60.8%
None
ValueCountFrequency (%)
· 6
100.0%
ASCII
ValueCountFrequency (%)
( 2
50.0%
) 2
50.0%

승강장번호
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
2
15 
1
15 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row1
3rd row2
4th row1
5th row1

Common Values

ValueCountFrequency (%)
2 15
50.0%
1 15
50.0%

Length

2023-12-12T18:07:23.661387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:07:23.777833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2 15
50.0%
1 15
50.0%

상하행
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size372.0 B
하행
15 
상행
15 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row하행
2nd row상행
3rd row하행
4th row상행
5th row상행

Common Values

ValueCountFrequency (%)
하행 15
50.0%
상행 15
50.0%

Length

2023-12-12T18:07:23.917187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:07:24.042538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
하행 15
50.0%
상행 15
50.0%

지상구분
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
지상
30 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지상
2nd row지상
3rd row지상
4th row지상
5th row지상

Common Values

ValueCountFrequency (%)
지상 30
100.0%

Length

2023-12-12T18:07:24.173153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:07:24.301320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지상 30
100.0%

역층
Categorical

Distinct4
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
3
24 
1
 
2
4
 
2
2
 
2

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row3
4th row3
5th row3

Common Values

ValueCountFrequency (%)
3 24
80.0%
1 2
 
6.7%
4 2
 
6.7%
2 2
 
6.7%

Length

2023-12-12T18:07:24.429983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:07:24.577816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 24
80.0%
1 2
 
6.7%
4 2
 
6.7%
2 2
 
6.7%

승강장연결 여부
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
True
30 
ValueCountFrequency (%)
True 30
100.0%
2023-12-12T18:07:24.675693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

스크린도어 유무
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
False
30 
ValueCountFrequency (%)
False 30
100.0%
2023-12-12T18:07:24.767997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

안전발판 유무
Boolean

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size162.0 B
False
30 
ValueCountFrequency (%)
False 30
100.0%
2023-12-12T18:07:24.880957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:07:24.964039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
역명승강장번호상하행역층
역명1.0000.0000.0001.000
승강장번호0.0001.0000.9940.000
상하행0.0000.9941.0000.000
역층1.0000.0000.0001.000
2023-12-12T18:07:25.103631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상하행승강장번호역층
상하행1.0000.9310.000
승강장번호0.9311.0000.000
역층0.0000.0001.000
2023-12-12T18:07:25.232406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
승강장번호상하행역층
승강장번호1.0000.9310.000
상하행0.9311.0000.000
역층0.0000.0001.000

Missing values

2023-12-12T18:07:22.151816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:07:22.386679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

철도운영기관명선명역명승강장번호상하행지상구분역층승강장연결 여부스크린도어 유무안전발판 유무
0용인경량전철주식회사용인에버라인강남대2하행지상1YNN
1용인경량전철주식회사용인에버라인강남대1상행지상1YNN
2용인경량전철주식회사용인에버라인고진2하행지상3YNN
3용인경량전철주식회사용인에버라인고진1상행지상3YNN
4용인경량전철주식회사용인에버라인기흥(백남준아트센터)1상행지상3YNN
5용인경량전철주식회사용인에버라인기흥(백남준아트센터)2하행지상3YNN
6용인경량전철주식회사용인에버라인김량장1상행지상3YNN
7용인경량전철주식회사용인에버라인김량장2하행지상3YNN
8용인경량전철주식회사용인에버라인동백2하행지상3YNN
9용인경량전철주식회사용인에버라인동백1상행지상3YNN
철도운영기관명선명역명승강장번호상하행지상구분역층승강장연결 여부스크린도어 유무안전발판 유무
20용인경량전철주식회사용인에버라인어정2하행지상3YNN
21용인경량전철주식회사용인에버라인어정1상행지상3YNN
22용인경량전철주식회사용인에버라인운동장·송담대2하행지상3YNN
23용인경량전철주식회사용인에버라인운동장·송담대1상행지상3YNN
24용인경량전철주식회사용인에버라인전대·에버랜드2하행지상2YNN
25용인경량전철주식회사용인에버라인전대·에버랜드1상행지상2YNN
26용인경량전철주식회사용인에버라인지석2하행지상3YNN
27용인경량전철주식회사용인에버라인지석1상행지상3YNN
28용인경량전철주식회사용인에버라인초당1상행지상3YNN
29용인경량전철주식회사용인에버라인초당2하행지상3YNN