Overview

Dataset statistics

Number of variables4
Number of observations47
Missing cells7
Missing cells (%)3.7%
Duplicate rows1
Duplicate rows (%)2.1%
Total size in memory1.6 KiB
Average record size in memory34.8 B

Variable types

Categorical2
Text1
Unsupported1

Dataset

Description해외 철도관련 교육자료를 제공합니다.
Author국가철도공단
URLhttps://www.data.go.kr/data/15067576/fileData.do

Alerts

Dataset has 1 (2.1%) duplicate rowsDuplicates
해외철도교육자료 목록 is highly imbalanced (68.3%)Imbalance
Unnamed: 2 is highly imbalanced (64.3%)Imbalance
Unnamed: 1 has 4 (8.5%) missing valuesMissing
Unnamed: 3 has 3 (6.4%) missing valuesMissing
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 23:56:52.598707
Analysis finished2023-12-12 23:56:52.972174
Duration0.37 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

해외철도교육자료 목록
Categorical

IMBALANCE 

Distinct4
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size508.0 B
2017.12.13
42 
<NA>
 
3
등록일
 
1
Copyright ⓒ KRIC All Right Reserved.
 
1

Length

Max length36
Median length10
Mean length10.021277
Min length3

Unique

Unique2 ?
Unique (%)4.3%

Sample

1st row<NA>
2nd row등록일
3rd row2017.12.13
4th row2017.12.13
5th row2017.12.13

Common Values

ValueCountFrequency (%)
2017.12.13 42
89.4%
<NA> 3
 
6.4%
등록일 1
 
2.1%
Copyright ⓒ KRIC All Right Reserved. 1
 
2.1%

Length

2023-12-13T08:56:53.039807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:56:53.135263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017.12.13 42
80.8%
na 3
 
5.8%
등록일 1
 
1.9%
copyright 1
 
1.9%
1
 
1.9%
kric 1
 
1.9%
all 1
 
1.9%
right 1
 
1.9%
reserved 1
 
1.9%

Unnamed: 1
Text

MISSING 

Distinct43
Distinct (%)100.0%
Missing4
Missing (%)8.5%
Memory size508.0 B
2023-12-13T08:56:53.332763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length63
Median length63
Mean length61.627907
Min length4

Characters and Unicode

Total characters2650
Distinct characters71
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row첨부파일
2nd rowhttp://www.kric.go.kr/KricFileDownload.do?file=8DvU119t8wk6hEUa
3rd rowhttp://www.kric.go.kr/KricFileDownload.do?file=U83fPuS519cso0Jx
4th rowhttp://www.kric.go.kr/KricFileDownload.do?file=LAOy63d3K1KYTgFz
5th rowhttp://www.kric.go.kr/KricFileDownload.do?file=uk9e8Mt19587Of1J
ValueCountFrequency (%)
첨부파일 1
 
2.3%
http://www.kric.go.kr/kricfiledownload.do?file=7417t14dfid3q5f0 1
 
2.3%
http://www.kric.go.kr/kricfiledownload.do?file=15nl44w3v6gz9s94 1
 
2.3%
http://www.kric.go.kr/kricfiledownload.do?file=3o408ogl4eynxox7 1
 
2.3%
http://www.kric.go.kr/kricfiledownload.do?file=9g96ix86hxky910j 1
 
2.3%
http://www.kric.go.kr/kricfiledownload.do?file=97cw3ws5o1b11d62 1
 
2.3%
http://www.kric.go.kr/kricfiledownload.do?file=2i61bg55khd6pekq 1
 
2.3%
http://www.kric.go.kr/kricfiledownload.do?file=yn958km1mg39i127 1
 
2.3%
http://www.kric.go.kr/kricfiledownload.do?file=dj1wt3yb55ex8ks8 1
 
2.3%
http://www.kric.go.kr/kricfiledownload.do?file=82hgj0y5497t15ur 1
 
2.3%
Other values (33) 33
76.7%
2023-12-13T08:56:53.909542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 177
 
6.7%
w 175
 
6.6%
o 172
 
6.5%
. 168
 
6.3%
r 134
 
5.1%
l 133
 
5.0%
/ 126
 
4.8%
t 96
 
3.6%
k 96
 
3.6%
c 91
 
3.4%
Other values (61) 1282
48.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1634
61.7%
Other Punctuation 378
 
14.3%
Uppercase Letter 314
 
11.8%
Decimal Number 278
 
10.5%
Math Symbol 42
 
1.6%
Other Letter 4
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 177
10.8%
w 175
10.7%
o 172
10.5%
r 134
 
8.2%
l 133
 
8.1%
t 96
 
5.9%
k 96
 
5.9%
c 91
 
5.6%
d 91
 
5.6%
e 90
 
5.5%
Other values (16) 379
23.2%
Uppercase Letter
ValueCountFrequency (%)
D 52
16.6%
K 47
15.0%
F 47
15.0%
Q 15
 
4.8%
I 11
 
3.5%
Y 11
 
3.5%
X 11
 
3.5%
J 10
 
3.2%
O 9
 
2.9%
R 9
 
2.9%
Other values (16) 92
29.3%
Decimal Number
ValueCountFrequency (%)
9 41
14.7%
1 39
14.0%
7 35
12.6%
3 27
9.7%
8 27
9.7%
4 26
9.4%
5 24
8.6%
2 20
7.2%
6 20
7.2%
0 19
6.8%
Other Punctuation
ValueCountFrequency (%)
. 168
44.4%
/ 126
33.3%
? 42
 
11.1%
: 42
 
11.1%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Math Symbol
ValueCountFrequency (%)
= 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1948
73.5%
Common 698
 
26.3%
Hangul 4
 
0.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 177
 
9.1%
w 175
 
9.0%
o 172
 
8.8%
r 134
 
6.9%
l 133
 
6.8%
t 96
 
4.9%
k 96
 
4.9%
c 91
 
4.7%
d 91
 
4.7%
e 90
 
4.6%
Other values (42) 693
35.6%
Common
ValueCountFrequency (%)
. 168
24.1%
/ 126
18.1%
? 42
 
6.0%
: 42
 
6.0%
= 42
 
6.0%
9 41
 
5.9%
1 39
 
5.6%
7 35
 
5.0%
3 27
 
3.9%
8 27
 
3.9%
Other values (5) 109
15.6%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2646
99.8%
Hangul 4
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 177
 
6.7%
w 175
 
6.6%
o 172
 
6.5%
. 168
 
6.3%
r 134
 
5.1%
l 133
 
5.0%
/ 126
 
4.8%
t 96
 
3.6%
k 96
 
3.6%
c 91
 
3.4%
Other values (57) 1278
48.3%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 2
Categorical

IMBALANCE 

Distinct3
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size508.0 B
철도협회
42 
<NA>
 
4
출처명
 
1

Length

Max length4
Median length4
Mean length3.9787234
Min length3

Unique

Unique1 ?
Unique (%)2.1%

Sample

1st row<NA>
2nd row출처명
3rd row철도협회
4th row철도협회
5th row철도협회

Common Values

ValueCountFrequency (%)
철도협회 42
89.4%
<NA> 4
 
8.5%
출처명 1
 
2.1%

Length

2023-12-13T08:56:54.024777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:56:54.111184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
철도협회 42
89.4%
na 4
 
8.5%
출처명 1
 
2.1%

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing3
Missing (%)6.4%
Memory size508.0 B

Correlations

2023-12-13T08:56:54.174773image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해외철도교육자료 목록Unnamed: 1Unnamed: 2
해외철도교육자료 목록1.0001.0000.672
Unnamed: 11.0001.0001.000
Unnamed: 20.6721.0001.000
2023-12-13T08:56:54.264032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 2해외철도교육자료 목록
Unnamed: 21.0000.469
해외철도교육자료 목록0.4691.000
2023-12-13T08:56:54.333192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
해외철도교육자료 목록Unnamed: 2
해외철도교육자료 목록1.0000.469
Unnamed: 20.4691.000

Missing values

2023-12-13T08:56:52.740250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:56:52.810649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T08:56:52.894470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

해외철도교육자료 목록Unnamed: 1Unnamed: 2Unnamed: 3
0<NA><NA><NA>NaN
1등록일첨부파일출처명제목
22017.12.13http://www.kric.go.kr/KricFileDownload.do?file=8DvU119t8wk6hEUa철도협회Construction_System Testing and Commission_2014
32017.12.13http://www.kric.go.kr/KricFileDownload.do?file=U83fPuS519cso0Jx철도협회ERTMS_ETCS 체계 및 국내외 사례 비교_2014
42017.12.13http://www.kric.go.kr/KricFileDownload.do?file=LAOy63d3K1KYTgFz철도협회FIDIC 계약조건과 사례 분석_2014
52017.12.13http://www.kric.go.kr/KricFileDownload.do?file=uk9e8Mt19587Of1J철도협회FIDIC의 구성 및 이해_2014
62017.12.13http://www.kric.go.kr/KricFileDownload.do?file=7e1DM14u8P0c044L철도협회Presentation Skills_2014
72017.12.13http://www.kric.go.kr/KricFileDownload.do?file=9g7Fjvx000Mk3EV9철도협회RFP의 이해와 사례_2014
82017.12.13http://www.kric.go.kr/KricFileDownload.do?file=RivZd64yKNr76Nvi철도협회궤도 기술에 의한 철도산업 부가가치 확대 방안_2014
92017.12.13http://www.kric.go.kr/KricFileDownload.do?file=i7MfDRx8J9jw4b73철도협회궤도시스템 기술 전망_2014
해외철도교육자료 목록Unnamed: 1Unnamed: 2Unnamed: 3
372017.12.13http://www.kric.go.kr/KricFileDownload.do?file=2BmZwW083D79ksQb철도협회철도교통 수요 분석_2015
382017.12.13http://www.kric.go.kr/KricFileDownload.do?file=B99tMo2QW8yo78M8철도협회철도투자사업 분석과정_2015
392017.12.13http://www.kric.go.kr/KricFileDownload.do?file=dHENl21aW48Uz8sg철도협회해외 인프라 투자사업의 재무분석 및 타당성분석_2015
402017.12.13http://www.kric.go.kr/KricFileDownload.do?file=53haQkLCyy9QY6i4철도협회해외철도 투자사업의 이해_2015
412017.12.13http://www.kric.go.kr/KricFileDownload.do?file=4990m5rt3gc83d7x철도협회해외철도사업 재원조달 심층분석_2015
422017.12.13http://www.kric.go.kr/KricFileDownload.do?file=1x63QjR9c71S47T5철도협회해외철도사업의 추진 단계별 수주 전략_2015
432017.12.13http://www.kric.go.kr/KricFileDownload.do?file=7i276Xfqtavq02Rx철도협회해외철도의 현황과 철도사업 추진단계의 이해_2015
44<NA><NA><NA>NaN
45<NA><NA><NA>NaN
46Copyright ⓒ KRIC All Right Reserved.<NA><NA>2022-05-24 23:59:35.657000

Duplicate rows

Most frequently occurring

해외철도교육자료 목록Unnamed: 1Unnamed: 2# duplicates
0<NA><NA><NA>3