Overview

Dataset statistics

Number of variables: 7
Number of observations: 277
Missing cells: 29
Missing cells (%): 1.5%
Duplicate rows: 1
Duplicate rows (%): 0.4%
Total size in memory: 15.3 KiB
Average record size in memory: 56.5 B
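
The percentages in this block follow from the raw counts. A minimal sketch of the arithmetic, assuming that "cells" means variables times observations and that 1 KiB is 1024 bytes (the variable names are illustrative only):

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 7
n_observations = 277
missing_cells = 29
duplicate_rows = 1
total_memory_kib = 15.3

total_cells = n_variables * n_observations  # 1939 cells in total
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
# The reported total is already rounded, so this per-record figure can
# differ slightly from the 56.5 B shown in the report.
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: about {avg_record_bytes:.1f} B")
```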

Variable types

Text: 3
Categorical: 2
DateTime: 2

Dataset

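Description: Under the Railroad Safety Act (철도안전법), standard specifications for rolling stock and railway products are established to ensure the safety and interoperability of railways. This dataset from the Korea Railroad Research Institute provides information on the Korean Railway Standards (한국철도표준규격).
Author: 한국철도기술연구원 (Korea Railroad Research Institute)
URL: https://www.data.go.kr/data/3043237/fileData.do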

Alerts

Dataset has 1 (0.4%) duplicate rows [Duplicates]
중분야 is highly overall correlated with 대분야 [High correlation]
대분야 is highly overall correlated with 중분야 [High correlation]
최종개정일 has 28 (10.1%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 02:37:50.083268
Analysis finished: 2023-12-12 02:37:50.833024
Duration: 0.75 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

규격번호 (standard number)
Text

Distinct: 276
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 18
Median length: 17
Mean length: 16.722022
Min length: 14

Characters and Unicode

Total characters: 4632
Distinct characters: 29
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
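
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one value taken from this variable's sample rows; it covers categories only, since the standard library has no direct lookup for scripts or blocks.

```python
import unicodedata
from collections import Counter

value = "KRS RN 0004-13(R)"  # one of the sample values of this variable

# unicodedata.category returns two-letter general-category codes, e.g.
# "Lu" = Uppercase Letter, "Nd" = Decimal Number, "Zs" = Space Separator,
# "Pd" = Dash Punctuation, "Ps" / "Pe" = Open / Close Punctuation.
categories = Counter(unicodedata.category(ch) for ch in value)

for code, count in categories.most_common():
    print(code, count)
```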

Unique

Unique: 275
Unique (%): 99.3%

Sample

1st row: KRS RN 0004-13(R)
2nd row: KRS PR 0004-21(R)
3rd row: KRS PR 0010-10(R)
4th row: KRS CP 0001-13(R)
5th row: KRS RN 0007-21(R)
Value | Count | Frequency (%)
krs | 277 | 33.3%
pw | 73 | 8.8%
sg | 70 | 8.4%
rn | 26 | 3.1%
cm | 26 | 3.1%
br | 25 | 3.0%
pr | 14 | 1.7%
tr | 14 | 1.7%
ap | 9 | 1.1%
cb | 8 | 1.0%
Other values (209) | 289 | 34.8%
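
The word counts are consistent with lowercasing each value and splitting it on whitespace: `krs` occurs 277 times, once per row, and the counts add up to 831 tokens, three per row. A minimal sketch under that assumption, applied to the five sample rows only (the report's actual tokenizer is not shown here):

```python
from collections import Counter

# The five sample values listed for this variable; the full column has 277 rows.
samples = [
    "KRS RN 0004-13(R)",
    "KRS PR 0004-21(R)",
    "KRS PR 0010-10(R)",
    "KRS CP 0001-13(R)",
    "KRS RN 0007-21(R)",
]

# Assumption: tokens are lowercased and split on whitespace.
word_counts = Counter(
    token for value in samples for token in value.lower().split()
)
total = sum(word_counts.values())

for word, count in word_counts.most_common(3):
    print(f"{word} | {count} | {100 * count / total:.1f}%")
```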

Most occurring characters

Value | Count | Frequency (%)
0 | 730 | 15.8%
R | 607 | 13.1%
(space) | 557 | 12.0%
2 | 360 | 7.8%
S | 351 | 7.6%
K | 277 | 6.0%
- | 277 | 6.0%
( | 250 | 5.4%
) | 250 | 5.4%
1 | 220 | 4.7%
Other values (19) | 753 | 16.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1662 | 35.9%
Uppercase Letter | 1636 | 35.3%
Space Separator | 557 | 12.0%
Dash Punctuation | 277 | 6.0%
Open Punctuation | 250 | 5.4%
Close Punctuation | 250 | 5.4%

Most frequent character per category

Uppercase Letter
Value | Count | Frequency (%)
R | 607 | 37.1%
S | 351 | 21.5%
K | 277 | 16.9%
P | 100 | 6.1%
W | 73 | 4.5%
G | 70 | 4.3%
C | 44 | 2.7%
B | 33 | 2.0%
N | 26 | 1.6%
M | 26 | 1.6%
Other values (5) | 29 | 1.8%

Decimal Number
Value | Count | Frequency (%)
0 | 730 | 43.9%
2 | 360 | 21.7%
1 | 220 | 13.2%
5 | 81 | 4.9%
4 | 58 | 3.5%
6 | 54 | 3.2%
3 | 54 | 3.2%
7 | 47 | 2.8%
8 | 33 | 2.0%
9 | 25 | 1.5%

Space Separator
Value | Count | Frequency (%)
(space) | 557 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 277 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 250 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 250 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2996 | 64.7%
Latin | 1636 | 35.3%

Most frequent character per script

Latin
Value | Count | Frequency (%)
R | 607 | 37.1%
S | 351 | 21.5%
K | 277 | 16.9%
P | 100 | 6.1%
W | 73 | 4.5%
G | 70 | 4.3%
C | 44 | 2.7%
B | 33 | 2.0%
N | 26 | 1.6%
M | 26 | 1.6%
Other values (5) | 29 | 1.8%

Common
Value | Count | Frequency (%)
0 | 730 | 24.4%
(space) | 557 | 18.6%
2 | 360 | 12.0%
- | 277 | 9.2%
( | 250 | 8.3%
) | 250 | 8.3%
1 | 220 | 7.3%
5 | 81 | 2.7%
4 | 58 | 1.9%
6 | 54 | 1.8%
Other values (4) | 159 | 5.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 4632 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 730 | 15.8%
R | 607 | 13.1%
(space) | 557 | 12.0%
2 | 360 | 7.8%
S | 351 | 7.6%
K | 277 | 6.0%
- | 277 | 6.0%
( | 250 | 5.4%
) | 250 | 5.4%
1 | 220 | 4.7%
Other values (19) | 753 | 16.3%

규격명 (standard name)
Text

Distinct: 276
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 32
Median length: 25
Mean length: 11.743682
Min length: 2

Characters and Unicode

Total characters: 3253
Distinct characters: 322
Distinct categories: 9
Distinct scripts: 4
Distinct blocks: 3

Unique

Unique: 275
Unique (%): 99.3%

Sample

1st row: 화차용 용접 대차
2nd row: 디젤전기기관차 실린더 헤드
3rd row: MTU 엔진용 밸브가이드
4th row: 자동연결기
5th row: 철도차량 차륜 시험방법
Value | Count | Frequency (%)
전동차용 | 38 | 5.9%
건널목 | 11 | 1.7%
(not recovered) | 10 | 1.6%
시험방법 | 9 | 1.4%
계전기 | 9 | 1.4%
철도차량용 | 8 | 1.2%
고속 | 8 | 1.2%
초단파 | 8 | 1.2%
자기부상열차용 | 8 | 1.2%
철도차량 | 7 | 1.1%
Other values (418) | 525 | 81.9%

Most occurring characters

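Value | Count | Frequency (%)
(space) | 364 | 11.2%
(not recovered) | 143 | 4.4%
(not recovered) | 132 | 4.1%
(not recovered) | 130 | 4.0%
(not recovered) | 125 | 3.8%
(not recovered) | 87 | 2.7%
(not recovered) | 65 | 2.0%
(not recovered) | 61 | 1.9%
( | 53 | 1.6%
(not recovered) | 53 | 1.6%
Other values (312) | 2040 | 62.7%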

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2624 | 80.7%
Space Separator | 364 | 11.2%
Uppercase Letter | 114 | 3.5%
Open Punctuation | 53 | 1.6%
Close Punctuation | 53 | 1.6%
Decimal Number | 20 | 0.6%
Dash Punctuation | 14 | 0.4%
Lowercase Letter | 10 | 0.3%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not recovered) | 143 | 5.4%
(not recovered) | 132 | 5.0%
(not recovered) | 130 | 5.0%
(not recovered) | 125 | 4.8%
(not recovered) | 87 | 3.3%
(not recovered) | 65 | 2.5%
(not recovered) | 61 | 2.3%
(not recovered) | 53 | 2.0%
(not recovered) | 53 | 2.0%
(not recovered) | 46 | 1.8%
Other values (271) | 1729 | 65.9%

Uppercase Letter
Value | Count | Frequency (%)
T | 15 | 13.2%
S | 14 | 12.3%
C | 12 | 10.5%
A | 12 | 10.5%
M | 7 | 6.1%
F | 7 | 6.1%
P | 6 | 5.3%
L | 6 | 5.3%
D | 5 | 4.4%
N | 5 | 4.4%
Other values (12) | 25 | 21.9%

Decimal Number
Value | Count | Frequency (%)
0 | 4 | 20.0%
3 | 4 | 20.0%
4 | 3 | 15.0%
2 | 3 | 15.0%
5 | 3 | 15.0%
8 | 2 | 10.0%
1 | 1 | 5.0%

Lowercase Letter
Value | Count | Frequency (%)
s | 2 | 20.0%
x | 2 | 20.0%
m | 2 | 20.0%
k | 1 | 10.0%
a | 1 | 10.0%
u | 1 | 10.0%
z | 1 | 10.0%

Space Separator
Value | Count | Frequency (%)
(space) | 364 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 53 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 53 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 14 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2624 | 80.7%
Common | 505 | 15.5%
Latin | 123 | 3.8%
Greek | 1 | < 0.1%

Most frequent character per script

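Hangul
Value | Count | Frequency (%)
(not recovered) | 143 | 5.4%
(not recovered) | 132 | 5.0%
(not recovered) | 130 | 5.0%
(not recovered) | 125 | 4.8%
(not recovered) | 87 | 3.3%
(not recovered) | 65 | 2.5%
(not recovered) | 61 | 2.3%
(not recovered) | 53 | 2.0%
(not recovered) | 53 | 2.0%
(not recovered) | 46 | 1.8%
Other values (271) | 1729 | 65.9%

Latin
Value | Count | Frequency (%)
T | 15 | 12.2%
S | 14 | 11.4%
C | 12 | 9.8%
A | 12 | 9.8%
M | 7 | 5.7%
F | 7 | 5.7%
P | 6 | 4.9%
L | 6 | 4.9%
D | 5 | 4.1%
N | 5 | 4.1%
Other values (18) | 34 | 27.6%

Common
Value | Count | Frequency (%)
(space) | 364 | 72.1%
( | 53 | 10.5%
) | 53 | 10.5%
- | 14 | 2.8%
0 | 4 | 0.8%
3 | 4 | 0.8%
4 | 3 | 0.6%
2 | 3 | 0.6%
5 | 3 | 0.6%
8 | 2 | 0.4%
Other values (2) | 2 | 0.4%

Greek
Value | Count | Frequency (%)
Φ | 1 | 100.0%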

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2624 | 80.7%
ASCII | 628 | 19.3%
None | 1 | < 0.1%

Most frequent character per block

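ASCII
Value | Count | Frequency (%)
(space) | 364 | 58.0%
( | 53 | 8.4%
) | 53 | 8.4%
T | 15 | 2.4%
S | 14 | 2.2%
- | 14 | 2.2%
C | 12 | 1.9%
A | 12 | 1.9%
M | 7 | 1.1%
F | 7 | 1.1%
Other values (30) | 77 | 12.3%

Hangul
Value | Count | Frequency (%)
(not recovered) | 143 | 5.4%
(not recovered) | 132 | 5.0%
(not recovered) | 130 | 5.0%
(not recovered) | 125 | 4.8%
(not recovered) | 87 | 3.3%
(not recovered) | 65 | 2.5%
(not recovered) | 61 | 2.3%
(not recovered) | 53 | 2.0%
(not recovered) | 53 | 2.0%
(not recovered) | 46 | 1.8%
Other values (271) | 1729 | 65.9%

None
Value | Count | Frequency (%)
Φ | 1 | 100.0%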

영문규격명 (English standard name)
Text

Distinct: 273
Distinct (%): 98.9%
Missing: 1
Missing (%): 0.4%
Memory size: 2.3 KiB

Length

Max length: 90
Median length: 52
Mean length: 34.210145
Min length: 5

Characters and Unicode

Total characters: 9442
Distinct characters: 66
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 270
Unique (%): 97.8%

Sample

1st row: Welding Bogie for Freight Car
2nd row: Cylinder Head for Diesel-electric Locomotives
3rd row: Valve Guide for MTU Engine
4th row: Automatic Coupler
5th row: Railway rolling stock - Test methods : Wheel
Value | Count | Frequency (%)
for | 118 | 9.0%
multiple | 37 | 2.8%
electric | 30 | 2.3%
type | 27 | 2.1%
device | 23 | 1.8%
units | 22 | 1.7%
unit | 21 | 1.6%
brake | 20 | 1.5%
railroad | 18 | 1.4%
electrical | 16 | 1.2%
Other values (450) | 976 | 74.6%

Most occurring characters

Value | Count | Frequency (%)
(space) | 1036 | 11.0%
e | 862 | 9.1%
r | 691 | 7.3%
i | 690 | 7.3%
o | 605 | 6.4%
t | 583 | 6.2%
n | 528 | 5.6%
a | 505 | 5.3%
l | 493 | 5.2%
c | 333 | 3.5%
Other values (56) | 3116 | 33.0%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 7041 | 74.6%
Uppercase Letter | 1202 | 12.7%
Space Separator | 1036 | 11.0%
Dash Punctuation | 52 | 0.6%
Close Punctuation | 38 | 0.4%
Open Punctuation | 38 | 0.4%
Decimal Number | 19 | 0.2%
Other Punctuation | 16 | 0.2%
0.2%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 862 | 12.2%
r | 691 | 9.8%
i | 690 | 9.8%
o | 605 | 8.6%
t | 583 | 8.3%
n | 528 | 7.5%
a | 505 | 7.2%
l | 493 | 7.0%
c | 333 | 4.7%
s | 327 | 4.6%
Other values (16) | 1424 | 20.2%

Uppercase Letter
Value | Count | Frequency (%)
C | 135 | 11.2%
S | 127 | 10.6%
T | 126 | 10.5%
R | 101 | 8.4%
M | 86 | 7.2%
D | 75 | 6.2%
E | 74 | 6.2%
A | 71 | 5.9%
P | 63 | 5.2%
B | 55 | 4.6%
Other values (15) | 289 | 24.0%

Decimal Number
Value | Count | Frequency (%)
3 | 4 | 21.1%
0 | 3 | 15.8%
5 | 3 | 15.8%
2 | 3 | 15.8%
1 | 2 | 10.5%
4 | 2 | 10.5%
8 | 2 | 10.5%

Other Punctuation
Value | Count | Frequency (%)
: | 8 | 50.0%
/ | 5 | 31.2%
. | 2 | 12.5%
& | 1 | 6.2%

Space Separator
Value | Count | Frequency (%)
(space) | 1036 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 52 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 38 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 38 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 8242 | 87.3%
Common | 1199 | 12.7%
Greek | 1 | < 0.1%

Most frequent character per script

Latin
Value | Count | Frequency (%)
e | 862 | 10.5%
r | 691 | 8.4%
i | 690 | 8.4%
o | 605 | 7.3%
t | 583 | 7.1%
n | 528 | 6.4%
a | 505 | 6.1%
l | 493 | 6.0%
c | 333 | 4.0%
s | 327 | 4.0%
Other values (40) | 2625 | 31.8%

Common
Value | Count | Frequency (%)
(space) | 1036 | 86.4%
- | 52 | 4.3%
) | 38 | 3.2%
( | 38 | 3.2%
: | 8 | 0.7%
/ | 5 | 0.4%
3 | 4 | 0.3%
0 | 3 | 0.3%
5 | 3 | 0.3%
2 | 3 | 0.3%
Other values (5) | 9 | 0.8%

Greek
Value | Count | Frequency (%)
Φ | 1 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 9441 | > 99.9%
None | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1036 | 11.0%
e | 862 | 9.1%
r | 691 | 7.3%
i | 690 | 7.3%
o | 605 | 6.4%
t | 583 | 6.2%
n | 528 | 5.6%
a | 505 | 5.3%
l | 493 | 5.2%
c | 333 | 3.5%
Other values (55) | 3115 | 33.0%

None
Value | Count | Frequency (%)
Φ | 1 | 100.0%

대분야 (major category)
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 10
Median length: 10
Mean length: 7.6606498
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 철도차량
2nd row: 철도차량
3rd row: 철도차량
4th row: 철도차량
5th row: 철도차량

Common Values

Value | Count | Frequency (%)
철도전력신호정보통신 | 169 | 61.0%
철도차량 | 93 | 33.6%
철도시설 | 15 | 5.4%

Length

Histogram of lengths of the category

중분야 (subcategory)
Categorical

HIGH CORRELATION 

Distinct: 16
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 27
Median length: 10
Mean length: 9.3646209
Min length: 8

Unique

Unique: 4
Unique (%): 1.4%

Sample

1st row: RN(주행장치용품)
2nd row: PR(추진장치용품)
3rd row: PR(추진장치용품)
4th row: CP(연결장치용품)
5th row: RN(주행장치용품)

Common Values

Value | Count | Frequency (%)
PW(전철전력용품) | 73 | 26.4%
SG(신호용품) | 70 | 25.3%
RN(주행장치용품) | 26 | 9.4%
CM(통신용품) | 26 | 9.4%
BR(제동장치용품) | 25 | 9.0%
PR(추진장치용품) | 14 | 5.1%
TR(궤도용품) | 14 | 5.1%
AP(보조전원장치용품) | 8 | 2.9%
CB(차체설비용품) | 8 | 2.9%
CP(연결장치용품) | 4 | 1.4%
Other values (6) | 9 | 3.2%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
pw(전철전력용품 | 73 | 26.1%
sg(신호용품 | 70 | 25.0%
rn(주행장치용품 | 26 | 9.3%
cm(통신용품 | 26 | 9.3%
br(제동장치용품 | 25 | 8.9%
pr(추진장치용품 | 14 | 5.0%
tr(궤도용품 | 14 | 5.0%
ap(보조전원장치용품 | 8 | 2.9%
cb(차체설비용품 | 8 | 2.9%
cp(연결장치용품 | 4 | 1.4%
Other values (9) | 12 | 4.3%

제정일 (enactment date)
Date

Distinct: 17
Distinct (%): 6.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
Minimum: 2006-03-07 00:00:00
Maximum: 2022-12-23 00:00:00
Histogram with fixed size bins (bins=17)

최종개정일 (last revision date)
Date

MISSING 

Distinct: 23
Distinct (%): 9.2%
Missing: 28
Missing (%): 10.1%
Memory size: 2.3 KiB
Minimum: 2007-10-29 00:00:00
Maximum: 2022-12-23 00:00:00
Histogram with fixed size bins (bins=23)
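
A fixed-size-bin histogram divides the range between the minimum and maximum into equal-width intervals and counts the values in each. The sketch below illustrates the idea on the seven non-missing 최종개정일 values from the first ten rows of the sample table, using 4 bins rather than 23; `fixed_width_histogram` is a hypothetical helper, not the routine the report itself uses.

```python
from datetime import date

def fixed_width_histogram(dates, bins):
    """Count dates in `bins` equal-width intervals spanning min..max."""
    ordinals = [d.toordinal() for d in dates]
    lo, hi = min(ordinals), max(ordinals)
    width = (hi - lo) / bins or 1  # fall back to 1 if all dates are equal
    counts = [0] * bins
    for value in ordinals:
        # The maximum sits on the right edge, so clamp it into the last bin.
        index = min(int((value - lo) / width), bins - 1)
        counts[index] += 1
    return counts

# Non-missing 최종개정일 values from the first ten sample rows.
revised = [
    date(2013, 7, 17), date(2021, 10, 22), date(2010, 11, 22),
    date(2013, 11, 22), date(2021, 10, 22), date(2021, 10, 22),
    date(2014, 11, 28),
]
counts = fixed_width_histogram(revised, bins=4)
print(counts)
```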

Correlations

 | 대분야 | 중분야 | 제정일 | 최종개정일
대분야 | 1.000 | 1.000 | 0.833 | 0.642
중분야 | 1.000 | 1.000 | 0.660 | 0.800
제정일 | 0.833 | 0.660 | 1.000 | 0.606
최종개정일 | 0.642 | 0.800 | 0.606 | 1.000

 | 중분야 | 대분야
중분야 | 1.000 | 0.976
대분야 | 0.976 | 1.000

 | 대분야 | 중분야
대분야 | 1.000 | 0.976
중분야 | 0.976 | 1.000
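
The report does not label which coefficient produced each matrix. For a pair of categorical columns such as 대분야 and 중분야, one common measure of association is Cramér's V. The sketch below implements the plain, uncorrected statistic on a few pairs taken from the sample table; it is an illustration and will not necessarily reproduce the 0.976 shown above.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Plain (uncorrected) Cramér's V for two equal-length categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for x, row_total in row_totals.items():
        for y, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    # With a single category on either side the statistic is undefined.
    return math.sqrt(chi2 / (n * k)) if k > 0 else float("nan")

# (대분야, 중분야) pairs copied from rows of the sample table.
pairs = [
    ("철도차량", "RN(주행장치용품)"),
    ("철도차량", "PR(추진장치용품)"),
    ("철도차량", "PR(추진장치용품)"),
    ("철도차량", "CP(연결장치용품)"),
    ("철도전력신호정보통신", "SG(신호용품)"),
    ("철도전력신호정보통신", "PW(전철전력용품)"),
]
major, middle = zip(*pairs)
v = cramers_v(major, middle)
print(round(v, 3))
```

In these rows every 중분야 value belongs to exactly one 대분야, so the statistic reaches its maximum of 1, which matches the strong association flagged in the alerts.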

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
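
Nullity correlation can be read as the ordinary Pearson correlation between the "is missing" indicators of two columns. A minimal sketch under that reading, using made-up column values (the data and the `nullity_correlation` helper are hypothetical, and `None` stands in for a missing cell):

```python
import math

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the is-missing indicators of two columns."""
    a = [1.0 if v is None else 0.0 for v in col_a]
    b = [1.0 if v is None else 0.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A column that is never (or always) missing has no defined correlation.
        return float("nan")
    return cov / math.sqrt(var_a * var_b)

# Hypothetical values: the second column is missing in a subset of the
# rows where the first one is missing.
revised = ["2013-07-17", None, "2021-10-22", None]
english = ["Automatic Coupler", None, "Signal Bond", "Brake Unit"]
r = nullity_correlation(revised, english)
print(round(r, 3))
```

A value near 1 means the two columns tend to be missing together, a value near -1 means one is present when the other is missing, and columns with no missing cells drop out because their indicator is constant.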

Sample

 | 규격번호 | 규격명 | 영문규격명 | 대분야 | 중분야 | 제정일 | 최종개정일
0 | KRS RN 0004-13(R) | 화차용 용접 대차 | Welding Bogie for Freight Car | 철도차량 | RN(주행장치용품) | 2006-03-07 | 2013-07-17
1 | KRS PR 0004-21(R) | 디젤전기기관차 실린더 헤드 | Cylinder Head for Diesel-electric Locomotives | 철도차량 | PR(추진장치용품) | 2006-03-07 | 2021-10-22
2 | KRS PR 0010-10(R) | MTU 엔진용 밸브가이드 | Valve Guide for MTU Engine | 철도차량 | PR(추진장치용품) | 2006-03-07 | 2010-11-22
3 | KRS CP 0001-13(R) | 자동연결기 | Automatic Coupler | 철도차량 | CP(연결장치용품) | 2006-03-07 | 2013-11-22
4 | KRS RN 0007-21(R) | 철도차량 차륜 시험방법 | Railway rolling stock - Test methods : Wheel | 철도차량 | RN(주행장치용품) | 2014-11-28 | 2021-10-22
5 | KRS RN 0008-14 | 철도차량 차축 시험방법 | Railway rolling stock - Test methods : Axle | 철도차량 | RN(주행장치용품) | 2014-11-28 | <NA>
6 | KRS RN 0009-21(R) | 철도차량 코일스프링 시험방법 | Railway rolling stock - Test methods : Coil spring | 철도차량 | RN(주행장치용품) | 2014-11-28 | 2021-10-22
7 | KRS RN 0010-14 | 철도차량 고무스프링 시험방법 | Railway rolling stock - Test methods : Rubber spring | 철도차량 | RN(주행장치용품) | 2014-11-28 | <NA>
8 | KRS RN 0011-14 | 철도차량 공기스프링 시험방법 | Railway rolling stock - Test methods : Air spring | 철도차량 | RN(주행장치용품) | 2014-11-28 | <NA>
9 | KRS CS 0002-14(R) | 정보형 ATS 차상장치 | Informative Automatic Train Stop | 철도차량 | CS(차상신호장치용품) | 2006-03-07 | 2014-11-28

 | 규격번호 | 규격명 | 영문규격명 | 대분야 | 중분야 | 제정일 | 최종개정일
267 | KRS SG 0032-20(R) | 절연체 이음매판 | Insulated Joint Plate | 철도전력신호정보통신 | SG(신호용품) | 2006-05-16 | 2020-12-14
268 | KRS SG 0034-21(R) | 가청주파수(AF)궤도회로제어케이블 | Audio Frequency Track Circuit Control Cable | 철도전력신호정보통신 | SG(신호용품) | 2006-05-16 | 2021-12-30
269 | KRS SG 0051-20(R) | 건널목제어유니트(삽입형식) | Railroad Crossing Control Unit | 철도전력신호정보통신 | SG(신호용품) | 2006-05-16 | 2020-12-14
270 | KRS SG 0057-20(R) | ATC지상장치 시험기 | Testing Instrument of ATC Transmitter | 철도전력신호정보통신 | SG(신호용품) | 2006-05-16 | 2020-12-14
271 | KRS SG 0059-20(R) | ATS 지상장치 | Automatic Train Stop Wayside Transmitter | 철도전력신호정보통신 | SG(신호용품) | 2006-05-16 | 2020-12-14
272 | KRS SG 0064-20(R) | 신호본드 | Signal Bond | 철도전력신호정보통신 | SG(신호용품) | 2007-06-22 | 2020-12-14
273 | KRS SG 0063-17(R) | 철도신호시스템 (네트워크 정보전송방식) | Network Protocol for Railroad Signal System | 철도전력신호정보통신 | SG(신호용품) | 2006-05-16 | 2017-10-12
274 | KRS SG 0068-17(R) | 승강장 안전문 설비 | Paltform Screen Door System | 철도전력신호정보통신 | SG(신호용품) | 2015-05-28 | 2017-12-14
275 | KRS PW 0016-20(R) | 현수애자(전철용 고분자 T-s) | Suspension Insulator(Polymer Type for Electric-railway Application T-s) | 철도전력신호정보통신 | PW(전철전력용품) | 2006-05-16 | 2020-12-14
276 | KRS SG 0070-22 | 일반 및 고속철도용 열차제어시스템(KTCS) | KTCS for Conventional & High speed railway | 철도전력신호정보통신 | SG(신호용품) | 2022-12-23 | <NA>

Duplicate rows

Most frequently occurring

 | 규격번호 | 규격명 | 영문규격명 | 대분야 | 중분야 | 제정일 | 최종개정일 | # duplicates
0 | KRS RN 0022-21(R) | 전동차용 속도 검출기 | Speed Sensor for Electric Multiple Units | 철도차량 | RN(주행장치용품) | 2015-05-28 | 2021-10-22 | 2
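
Exact duplicates like this one can be found by counting whole rows. A minimal sketch using only the standard library; `duplicate_rows` is a hypothetical helper, and the input is the duplicated record listed twice plus one unrelated row from the sample table for contrast.

```python
from collections import Counter

def duplicate_rows(rows):
    """Return {row: occurrences} for every row that appears more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}

dup = ("KRS RN 0022-21(R)", "전동차용 속도 검출기",
       "Speed Sensor for Electric Multiple Units",
       "철도차량", "RN(주행장치용품)", "2015-05-28", "2021-10-22")
other = ("KRS CP 0001-13(R)", "자동연결기", "Automatic Coupler",
         "철도차량", "CP(연결장치용품)", "2006-03-07", "2013-11-22")

found = duplicate_rows([dup, other, dup])
for row, occurrences in found.items():
    print(row[0], occurrences)
```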