Overview

Dataset statistics

Number of variables13
Number of observations77
Missing cells543
Missing cells (%)54.2%
Duplicate rows1
Duplicate rows (%)1.3%
Total size in memory7.9 KiB
Average record size in memory105.7 B

Variable types

Text2
Unsupported11

Dataset

Description매년 한국철도공사에서 발행하는 철도통계연보에 수록된 동력차(KTX, 전기기관차 등) 차형별 제원 현황자료입니다.
URLhttps://www.data.go.kr/data/15053623/fileData.do

Alerts

Dataset has 1 (1.3%) duplicate rowsDuplicates
동 력 차 형 별 제 원 has 67 (87.0%) missing valuesMissing
Unnamed: 1 has 16 (20.8%) missing valuesMissing
Unnamed: 2 has 24 (31.2%) missing valuesMissing
Unnamed: 3 has 17 (22.1%) missing valuesMissing
Unnamed: 4 has 26 (33.8%) missing valuesMissing
Unnamed: 5 has 26 (33.8%) missing valuesMissing
Unnamed: 6 has 74 (96.1%) missing valuesMissing
Unnamed: 7 has 44 (57.1%) missing valuesMissing
Unnamed: 8 has 55 (71.4%) missing valuesMissing
Unnamed: 9 has 50 (64.9%) missing valuesMissing
Unnamed: 10 has 44 (57.1%) missing valuesMissing
Unnamed: 11 has 50 (64.9%) missing valuesMissing
Unnamed: 12 has 50 (64.9%) missing valuesMissing
Unnamed: 1 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 17:55:56.335427
Analysis finished2023-12-12 17:55:57.239662
Duration0.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9
Distinct (%)90.0%
Missing67
Missing (%)87.0%
Memory size748.0 B
2023-12-13T02:55:57.481025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length9.5
Mean length8.5
Min length5

Characters and Unicode

Total characters85
Distinct characters32
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)80.0%

Sample

1st row구 분
2nd row고속차량(KTX)
3rd row고속차량(KTX-산천)
4th row고속차량(KTX-산천Ⅱ,Ⅳ)
5th rowKTX-이음
ValueCountFrequency (%)
2
12.5%
2
12.5%
고속차량(ktx 1
 
6.2%
고속차량(ktx-산천 1
 
6.2%
고속차량(ktx-산천ⅱ,ⅳ 1
 
6.2%
ktx-이음 1
 
6.2%
디젤기관차 1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (4) 4
25.0%
2023-12-13T02:55:58.080056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
20.0%
6
 
7.1%
X 5
 
5.9%
T 5
 
5.9%
K 4
 
4.7%
- 4
 
4.7%
) 3
 
3.5%
3
 
3.5%
( 3
 
3.5%
3
 
3.5%
Other values (22) 32
37.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 40
47.1%
Space Separator 17
20.0%
Uppercase Letter 15
 
17.6%
Dash Punctuation 4
 
4.7%
Close Punctuation 3
 
3.5%
Open Punctuation 3
 
3.5%
Letter Number 2
 
2.4%
Other Punctuation 1
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
15.0%
3
 
7.5%
3
 
7.5%
3
 
7.5%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
Other values (11) 12
30.0%
Uppercase Letter
ValueCountFrequency (%)
X 5
33.3%
T 5
33.3%
K 4
26.7%
I 1
 
6.7%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 40
47.1%
Common 28
32.9%
Latin 17
20.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
15.0%
3
 
7.5%
3
 
7.5%
3
 
7.5%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
Other values (11) 12
30.0%
Latin
ValueCountFrequency (%)
X 5
29.4%
T 5
29.4%
K 4
23.5%
I 1
 
5.9%
1
 
5.9%
1
 
5.9%
Common
ValueCountFrequency (%)
17
60.7%
- 4
 
14.3%
) 3
 
10.7%
( 3
 
10.7%
, 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 43
50.6%
Hangul 40
47.1%
Number Forms 2
 
2.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17
39.5%
X 5
 
11.6%
T 5
 
11.6%
K 4
 
9.3%
- 4
 
9.3%
) 3
 
7.0%
( 3
 
7.0%
I 1
 
2.3%
, 1
 
2.3%
Hangul
ValueCountFrequency (%)
6
15.0%
3
 
7.5%
3
 
7.5%
3
 
7.5%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
Other values (11) 12
30.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%

Unnamed: 1
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing16
Missing (%)20.8%
Memory size748.0 B

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing24
Missing (%)31.2%
Memory size748.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing17
Missing (%)22.1%
Memory size748.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing26
Missing (%)33.8%
Memory size748.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing26
Missing (%)33.8%
Memory size748.0 B

Unnamed: 6
Text

MISSING 

Distinct3
Distinct (%)100.0%
Missing74
Missing (%)96.1%
Memory size748.0 B
2023-12-13T02:55:58.305989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length7
Mean length7.3333333
Min length5

Characters and Unicode

Total characters22
Distinct characters10
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)100.0%

Sample

1st row구 분
2nd row디 젤 동 차
3rd row전기기관차
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
전기기관차 1
14.3%
2023-12-13T02:55:58.765935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
50.0%
2
 
9.1%
2
 
9.1%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Space Separator 11
50.0%
Other Letter 11
50.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
18.2%
2
18.2%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
Space Separator
ValueCountFrequency (%)
11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 11
50.0%
Hangul 11
50.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
18.2%
2
18.2%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
Common
ValueCountFrequency (%)
11
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 11
50.0%
Hangul 11
50.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11
100.0%
Hangul
ValueCountFrequency (%)
2
18.2%
2
18.2%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing44
Missing (%)57.1%
Memory size748.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing55
Missing (%)71.4%
Memory size748.0 B

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing50
Missing (%)64.9%
Memory size748.0 B

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing44
Missing (%)57.1%
Memory size748.0 B

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing50
Missing (%)64.9%
Memory size748.0 B

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing50
Missing (%)64.9%
Memory size748.0 B

Correlations

2023-12-13T02:55:58.869969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
동 력 차 형 별 제 원Unnamed: 6
동 력 차 형 별 제 원1.0000.000
Unnamed: 60.0001.000

Missing values

2023-12-13T02:55:56.540175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:55:56.780396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T02:55:57.030444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

동 력 차 형 별 제 원Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12
0<NA>NaNNaNNaNNaNNaN<NA>NaNNaNNaNNaNNaNNaN
1<NA>NaNNaNNaNNaNNaN<NA>NaNNaNNaNNaNNaNNaN
2구 분형 별마 력보 유 대 수자 중(톤)내용연수구 분형 별NaN마 력보 유 대 수자 중(톤)내용연수
3<NA>NaNNaNNaNNaNNaN<NA>NaNNaNNaNNaNNaNNaN
4고속차량(KTX)NaN920NaNNaN디 젤 동 차NaNNaN121NaNNaN
5<NA>동력차(PC1, PC2)121189213630<NA>새마을형NaNNaN30NaNNaN
6<NA>동력객차(TR1, TR18)60599274.7130<NA>PMC130-206198006920
7<NA>객 차(TR2~TR17)-736490.3930<NA>PMC251-262198006920
8<NA>NaNNaNNaNNaNNaN<NA>PB345-444-213920
9고속차량(KTX-산천)NaN240NaNNaN<NA>PC521-557-53920
동 력 차 형 별 제 원Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12
67<NA>TC-4644.125<NA>NaNNaNNaNNaNNaNNaN
68<NA>M1,000KW2344.625<NA>NaNNaNNaNNaNNaNNaN
69<NA>M'1,000KW4648.625<NA>NaNNaNNaNNaNNaNNaN
70<NA>T-2335.825<NA>NaNNaNNaNNaNNaNNaN
71<NA>NaNNaNNaNNaNNaN<NA>NaNNaNNaNNaNNaNNaN
72ITX-청춘ITX-청춘NaN64NaNNaN<NA>NaNNaNNaNNaNNaNNaN
73<NA>TC-1643.925<NA>NaNNaNNaNNaNNaNNaN
74<NA>M1,000KW1646.525<NA>NaNNaNNaNNaNNaNNaN
75<NA>M'1,000KW1648.325<NA>NaNNaNNaNNaNNaNNaN
76<NA>T-164325<NA>NaNNaNNaNNaNNaNNaN

Duplicate rows

Most frequently occurring

동 력 차 형 별 제 원Unnamed: 6# duplicates
0<NA><NA>66