Overview

Dataset statistics

Number of variables6
Number of observations50
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory51.6 B

Variable types

Text3
Categorical1
Numeric1
DateTime1

Dataset

Description한국교통안전공단 통합홈페이지시스템에서 관리하고 있는 안전지원지식 관련 정보입니다
Author한국교통안전공단
URLhttps://www.data.go.kr/data/15066122/fileData.do

Alerts

원본파일명 has unique valuesUnique
저장파일명 has unique valuesUnique
파일크기 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:24:25.649302
Analysis finished2023-12-12 06:24:26.460841
Duration0.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

원본파일명
Text

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T15:24:26.719909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length28
Mean length22.9
Min length10

Characters and Unicode

Total characters1145
Distinct characters150
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)100.0%

Sample

1st row2016년도 경량항공정보메뉴얼.pdf
2nd row2020년_교통문화발전대회_정부포상_후보자_게시용_(2).hwp
3rd row2020년 현대차 정몽구 재단 대학생 장학금 신청안내(변경).hwp
4th row교통안전관리규정 심사지침(2016).hwp
5th row2019년 방문케어서비스 운영 성과.ppt
ValueCountFrequency (%)
gyro 9
 
7.2%
지원제도 5
 
4.0%
자이로 5
 
4.0%
2-2 3
 
2.4%
피해가족 3
 
2.4%
no 3
 
2.4%
자동차사고 3
 
2.4%
4-1 3
 
2.4%
2-1 3
 
2.4%
2016년도 2
 
1.6%
Other values (81) 86
68.8%
2023-12-12T15:24:27.253249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
75
 
6.6%
. 75
 
6.6%
p 51
 
4.5%
1 40
 
3.5%
R 36
 
3.1%
( 35
 
3.1%
d 35
 
3.1%
) 35
 
3.1%
f 33
 
2.9%
2 30
 
2.6%
Other values (140) 700
61.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 361
31.5%
Uppercase Letter 194
16.9%
Lowercase Letter 181
15.8%
Decimal Number 148
12.9%
Space Separator 75
 
6.6%
Other Punctuation 75
 
6.6%
Open Punctuation 36
 
3.1%
Close Punctuation 36
 
3.1%
Connector Punctuation 24
 
2.1%
Dash Punctuation 15
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
4.2%
15
 
4.2%
12
 
3.3%
11
 
3.0%
11
 
3.0%
11
 
3.0%
11
 
3.0%
10
 
2.8%
9
 
2.5%
9
 
2.5%
Other values (96) 247
68.4%
Lowercase Letter
ValueCountFrequency (%)
p 51
28.2%
d 35
19.3%
f 33
18.2%
o 13
 
7.2%
h 12
 
6.6%
w 12
 
6.6%
g 8
 
4.4%
r 4
 
2.2%
n 4
 
2.2%
j 4
 
2.2%
Other values (3) 5
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
R 36
18.6%
O 30
15.5%
G 25
12.9%
E 24
12.4%
Y 17
8.8%
N 15
7.7%
K 11
 
5.7%
V 6
 
3.1%
A 6
 
3.1%
P 6
 
3.1%
Other values (3) 18
9.3%
Decimal Number
ValueCountFrequency (%)
1 40
27.0%
2 30
20.3%
9 24
16.2%
0 21
14.2%
4 10
 
6.8%
6 8
 
5.4%
7 5
 
3.4%
8 5
 
3.4%
5 3
 
2.0%
3 2
 
1.4%
Open Punctuation
ValueCountFrequency (%)
( 35
97.2%
[ 1
 
2.8%
Close Punctuation
ValueCountFrequency (%)
) 35
97.2%
] 1
 
2.8%
Space Separator
ValueCountFrequency (%)
75
100.0%
Other Punctuation
ValueCountFrequency (%)
. 75
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 24
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 409
35.7%
Latin 375
32.8%
Hangul 361
31.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
4.2%
15
 
4.2%
12
 
3.3%
11
 
3.0%
11
 
3.0%
11
 
3.0%
11
 
3.0%
10
 
2.8%
9
 
2.5%
9
 
2.5%
Other values (96) 247
68.4%
Latin
ValueCountFrequency (%)
p 51
13.6%
R 36
 
9.6%
d 35
 
9.3%
f 33
 
8.8%
O 30
 
8.0%
G 25
 
6.7%
E 24
 
6.4%
Y 17
 
4.5%
N 15
 
4.0%
o 13
 
3.5%
Other values (16) 96
25.6%
Common
ValueCountFrequency (%)
75
18.3%
. 75
18.3%
1 40
9.8%
( 35
8.6%
) 35
8.6%
2 30
 
7.3%
_ 24
 
5.9%
9 24
 
5.9%
0 21
 
5.1%
- 15
 
3.7%
Other values (8) 35
8.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 784
68.5%
Hangul 361
31.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
75
 
9.6%
. 75
 
9.6%
p 51
 
6.5%
1 40
 
5.1%
R 36
 
4.6%
( 35
 
4.5%
d 35
 
4.5%
) 35
 
4.5%
f 33
 
4.2%
2 30
 
3.8%
Other values (34) 339
43.2%
Hangul
ValueCountFrequency (%)
15
 
4.2%
15
 
4.2%
12
 
3.3%
11
 
3.0%
11
 
3.0%
11
 
3.0%
11
 
3.0%
10
 
2.8%
9
 
2.5%
9
 
2.5%
Other values (96) 247
68.4%

저장파일명
Text

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T15:24:27.544499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length25
Mean length25
Min length25

Characters and Unicode

Total characters1250
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)100.0%

Sample

1st rowkcmBBS_202003161144246410
2nd rowkcmBBS_202003120242432640
3rd rowkcmBBS_202003051230298760
4th rowkcmBBS_202002131003143790
5th rowkcmBBS_202002110122416550
ValueCountFrequency (%)
kcmbbs_202003161144246410 1
 
2.0%
kcmbbs_201805080532330870 1
 
2.0%
kcmbbs_201707060544106190 1
 
2.0%
kcmbbs_201811130339594921 1
 
2.0%
kcmbbs_201811130339594910 1
 
2.0%
kcmbbs_201811130338221061 1
 
2.0%
kcmbbs_201811130338221020 1
 
2.0%
kcmbbs_201808090929331750 1
 
2.0%
kcmbbs_201805080532330944 1
 
2.0%
kcmbbs_201805080532330933 1
 
2.0%
Other values (40) 40
80.0%
2023-12-12T15:24:28.019732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 220
17.6%
1 171
13.7%
2 149
11.9%
B 100
 
8.0%
3 71
 
5.7%
9 67
 
5.4%
8 56
 
4.5%
5 54
 
4.3%
k 50
 
4.0%
c 50
 
4.0%
Other values (6) 262
21.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 900
72.0%
Uppercase Letter 150
 
12.0%
Lowercase Letter 150
 
12.0%
Connector Punctuation 50
 
4.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 220
24.4%
1 171
19.0%
2 149
16.6%
3 71
 
7.9%
9 67
 
7.4%
8 56
 
6.2%
5 54
 
6.0%
4 45
 
5.0%
7 35
 
3.9%
6 32
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
k 50
33.3%
c 50
33.3%
m 50
33.3%
Uppercase Letter
ValueCountFrequency (%)
B 100
66.7%
S 50
33.3%
Connector Punctuation
ValueCountFrequency (%)
_ 50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 950
76.0%
Latin 300
 
24.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 220
23.2%
1 171
18.0%
2 149
15.7%
3 71
 
7.5%
9 67
 
7.1%
8 56
 
5.9%
5 54
 
5.7%
_ 50
 
5.3%
4 45
 
4.7%
7 35
 
3.7%
Latin
ValueCountFrequency (%)
B 100
33.3%
k 50
16.7%
c 50
16.7%
m 50
16.7%
S 50
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1250
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 220
17.6%
1 171
13.7%
2 149
11.9%
B 100
 
8.0%
3 71
 
5.7%
9 67
 
5.4%
8 56
 
4.5%
5 54
 
4.3%
k 50
 
4.0%
c 50
 
4.0%
Other values (6) 262
21.0%

파일확장자
Categorical

Distinct5
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
pdf
32 
hwp
11 
jpg
docx
 
2
ppt
 
1

Length

Max length4
Median length3
Mean length3.04
Min length3

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st rowpdf
2nd rowhwp
3rd rowhwp
4th rowhwp
5th rowppt

Common Values

ValueCountFrequency (%)
pdf 32
64.0%
hwp 11
 
22.0%
jpg 4
 
8.0%
docx 2
 
4.0%
ppt 1
 
2.0%

Length

2023-12-12T15:24:28.221431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:24:28.361849image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
pdf 32
64.0%
hwp 11
 
22.0%
jpg 4
 
8.0%
docx 2
 
4.0%
ppt 1
 
2.0%

파일크기
Real number (ℝ)

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4048077.2
Minimum12800
Maximum32499768
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2023-12-12T15:24:28.522453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12800
5-th percentile20049.85
Q1100864
median1037641
Q34210947.5
95-th percentile16980226
Maximum32499768
Range32486968
Interquartile range (IQR)4110083.5

Descriptive statistics

Standard deviation7062446.8
Coefficient of variation (CV)1.7446423
Kurtosis8.2848423
Mean4048077.2
Median Absolute Deviation (MAD)1011192.5
Skewness2.8173157
Sum2.0240386 × 108
Variance4.9878155 × 1013
MonotonicityNot monotonic
2023-12-12T15:24:28.717930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11795316 1
 
2.0%
100352 1
 
2.0%
872918 1
 
2.0%
2029945 1
 
2.0%
2071797 1
 
2.0%
30703942 1
 
2.0%
653075 1
 
2.0%
705103 1
 
2.0%
3755465 1
 
2.0%
4090014 1
 
2.0%
Other values (40) 40
80.0%
ValueCountFrequency (%)
12800 1
2.0%
14295 1
2.0%
17920 1
2.0%
22653 1
2.0%
32768 1
2.0%
32802 1
2.0%
34335 1
2.0%
47616 1
2.0%
53760 1
2.0%
69120 1
2.0%
ValueCountFrequency (%)
32499768 1
2.0%
30703942 1
2.0%
17195792 1
2.0%
16716756 1
2.0%
14651904 1
2.0%
11795316 1
2.0%
8549999 1
2.0%
7807158 1
2.0%
4511157 1
2.0%
4490677 1
2.0%
Distinct46
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-12T15:24:29.066408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length24
Mean length18.86
Min length6

Characters and Unicode

Total characters943
Distinct characters146
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)88.0%

Sample

1st row2016년도 경량항공정보메뉴얼
2nd row2020년_교통문화발전대회_정부포상_후보자_게시용_(2)
3rd row2020년 현대차 정몽구 재단 대학생 장학금 신청안내(변경)
4th row교통안전관리규정 심사지침(2016)
5th row2019년 방문케어서비스 운영 성과
ValueCountFrequency (%)
gyro 9
 
7.2%
자이로 5
 
4.0%
지원제도 5
 
4.0%
eng)avsec_report 3
 
2.4%
2-2 3
 
2.4%
no 3
 
2.4%
2-1 3
 
2.4%
kor)avsec_report 3
 
2.4%
피해가족 3
 
2.4%
자동차사고 3
 
2.4%
Other values (77) 85
68.0%
2023-12-12T15:24:29.594501image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
75
 
8.0%
1 40
 
4.2%
R 36
 
3.8%
( 35
 
3.7%
) 35
 
3.7%
2 30
 
3.2%
O 30
 
3.2%
. 25
 
2.7%
G 25
 
2.7%
_ 24
 
2.5%
Other values (136) 588
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 361
38.3%
Uppercase Letter 194
20.6%
Decimal Number 148
15.7%
Space Separator 75
 
8.0%
Open Punctuation 36
 
3.8%
Close Punctuation 36
 
3.8%
Lowercase Letter 29
 
3.1%
Other Punctuation 25
 
2.7%
Connector Punctuation 24
 
2.5%
Dash Punctuation 15
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15
 
4.2%
15
 
4.2%
12
 
3.3%
11
 
3.0%
11
 
3.0%
11
 
3.0%
11
 
3.0%
10
 
2.8%
9
 
2.5%
9
 
2.5%
Other values (96) 247
68.4%
Uppercase Letter
ValueCountFrequency (%)
R 36
18.6%
O 30
15.5%
G 25
12.9%
E 24
12.4%
Y 17
8.8%
N 15
7.7%
K 11
 
5.7%
V 6
 
3.1%
A 6
 
3.1%
S 6
 
3.1%
Other values (3) 18
9.3%
Decimal Number
ValueCountFrequency (%)
1 40
27.0%
2 30
20.3%
9 24
16.2%
0 21
14.2%
4 10
 
6.8%
6 8
 
5.4%
7 5
 
3.4%
8 5
 
3.4%
5 3
 
2.0%
3 2
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
o 11
37.9%
g 4
 
13.8%
r 4
 
13.8%
n 4
 
13.8%
p 2
 
6.9%
h 1
 
3.4%
d 1
 
3.4%
f 1
 
3.4%
w 1
 
3.4%
Open Punctuation
ValueCountFrequency (%)
( 35
97.2%
[ 1
 
2.8%
Close Punctuation
ValueCountFrequency (%)
) 35
97.2%
] 1
 
2.8%
Space Separator
ValueCountFrequency (%)
75
100.0%
Other Punctuation
ValueCountFrequency (%)
. 25
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 24
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 361
38.3%
Common 359
38.1%
Latin 223
23.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15
 
4.2%
15
 
4.2%
12
 
3.3%
11
 
3.0%
11
 
3.0%
11
 
3.0%
11
 
3.0%
10
 
2.8%
9
 
2.5%
9
 
2.5%
Other values (96) 247
68.4%
Latin
ValueCountFrequency (%)
R 36
16.1%
O 30
13.5%
G 25
11.2%
E 24
10.8%
Y 17
7.6%
N 15
 
6.7%
K 11
 
4.9%
o 11
 
4.9%
V 6
 
2.7%
A 6
 
2.7%
Other values (12) 42
18.8%
Common
ValueCountFrequency (%)
75
20.9%
1 40
11.1%
( 35
9.7%
) 35
9.7%
2 30
 
8.4%
. 25
 
7.0%
_ 24
 
6.7%
9 24
 
6.7%
0 21
 
5.8%
- 15
 
4.2%
Other values (8) 35
9.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 582
61.7%
Hangul 361
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
75
 
12.9%
1 40
 
6.9%
R 36
 
6.2%
( 35
 
6.0%
) 35
 
6.0%
2 30
 
5.2%
O 30
 
5.2%
. 25
 
4.3%
G 25
 
4.3%
_ 24
 
4.1%
Other values (30) 227
39.0%
Hangul
ValueCountFrequency (%)
15
 
4.2%
15
 
4.2%
12
 
3.3%
11
 
3.0%
11
 
3.0%
11
 
3.0%
11
 
3.0%
10
 
2.8%
9
 
2.5%
9
 
2.5%
Other values (96) 247
68.4%
Distinct23
Distinct (%)46.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
Minimum2017-06-27 00:00:00
Maximum2020-03-16 00:00:00
2023-12-12T15:24:29.762224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:24:29.904246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=23)

Interactions

2023-12-12T15:24:26.091542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:24:30.029842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원본파일명저장파일명파일확장자파일크기파일설명최초등록일시
원본파일명1.0001.0001.0001.0001.0001.000
저장파일명1.0001.0001.0001.0001.0001.000
파일확장자1.0001.0001.0000.0000.0000.849
파일크기1.0001.0000.0001.0001.0000.924
파일설명1.0001.0000.0001.0001.0001.000
최초등록일시1.0001.0000.8490.9241.0001.000
2023-12-12T15:24:30.158598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파일크기파일확장자
파일크기1.0000.000
파일확장자0.0001.000

Missing values

2023-12-12T15:24:26.259476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:24:26.407032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

원본파일명저장파일명파일확장자파일크기파일설명최초등록일시
02016년도 경량항공정보메뉴얼.pdfkcmBBS_202003161144246410pdf117953162016년도 경량항공정보메뉴얼2020-03-16
12020년_교통문화발전대회_정부포상_후보자_게시용_(2).hwpkcmBBS_202003120242432640hwp691202020년_교통문화발전대회_정부포상_후보자_게시용_(2)2020-03-12
22020년 현대차 정몽구 재단 대학생 장학금 신청안내(변경).hwpkcmBBS_202003051230298760hwp327682020년 현대차 정몽구 재단 대학생 장학금 신청안내(변경)2020-03-05
3교통안전관리규정 심사지침(2016).hwpkcmBBS_202002131003143790hwp129871교통안전관리규정 심사지침(2016)2020-02-13
42019년 방문케어서비스 운영 성과.pptkcmBBS_202002110122416550ppt17812482019년 방문케어서비스 운영 성과2020-02-11
52-2. GYRO No.200(ENG).pdfkcmBBS_202001090156557381pdf42253052-2. GYRO No.200(ENG)2020-01-09
62-1. 자이로 200호(KOR).pdfkcmBBS_202001090156557320pdf41168042-1. 자이로 200호(KOR)2020-01-09
7(Eng)AVSEC_REPORT.docxkcmBBS_201912130215122915docx22653(Eng)AVSEC_REPORT2019-12-13
8(Eng)AVSEC_REPORT.hwpkcmBBS_201912130215122904hwp47616(Eng)AVSEC_REPORT2019-12-13
9(Eng)AVSEC_REPORT.pdfkcmBBS_201912130215122903pdf32802(Eng)AVSEC_REPORT2019-12-13
원본파일명저장파일명파일확장자파일크기파일설명최초등록일시
40교통안전관리규정샘플(버스).hwpkcmBBS_201803290926378730hwp102400교통안전관리규정샘플(버스)2018-03-29
41GYRO_192_ENG.pdfkcmBBS_201801080420505491pdf2151098GYRO_192_ENG2018-01-08
42GYRO_192_KOR.pdfkcmBBS_201801080420505430pdf2284846GYRO_192_KOR2018-01-08
432016년도 운수교통안전진단결과분석보고서.pdfkcmBBS_201711141009166650pdf43711792016년도 운수교통안전진단결과분석보고서2017-11-14
44GYRO_191_ENG.pdfkcmBBS_201711021007491521pdf666585GYRO_191_ENG2017-11-02
45GYRO_191_KOR.pdfkcmBBS_201711021007491490pdf690781GYRO_191_KOR2017-11-02
462017년 항공정보매뉴얼(제7호).pdfkcmBBS_201711021005293810pdf167167562017년 항공정보매뉴얼(제7호)2017-11-02
47GYRO_190_ENG.pdfkcmBBS_201707060544106271pdf2045038GYRO_190_ENG2017-07-06
48GYRO_190_KOR.pdfkcmBBS_201707060544106190pdf2599944GYRO_190_KOR2017-07-06
49[별지 제7호서식] 작업지시서.hwpkcmBBS_201706271019178680hwp17920[별지 제7호서식] 작업지시서2017-06-27