Overview

Dataset statistics

Number of variables: 6
Number of observations: 50
Missing cells: 64
Missing cells (%): 21.3%
Duplicate rows: 1
Duplicate rows (%): 2.0%
Total size in memory: 2.5 KiB
Average record size in memory: 51.6 B
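
The percentages in this list follow from the raw counts. The sketch below recomputes them from the figures reported above. The variable names are illustrative and are not part of the report. The recomputed average record size comes out at 51.2 B rather than the reported 51.6 B, presumably because the 2.5 KiB total is itself rounded.

```python
# Figures copied from the "Dataset statistics" list.
n_variables = 6
n_observations = 50
missing_cells = 64
duplicate_rows = 1
total_bytes = 2.5 * 1024  # 2.5 KiB as reported (already rounded)

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations

missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_bytes / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```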

Variable types

Text: 3
Categorical: 2
Numeric: 1

Dataset

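Description: Materials on laws and regulations (법령지식) managed in the integrated website system of the Korea Transportation Safety Authority.
Author: 한국교통안전공단 (Korea Transportation Safety Authority)
URL: https://www.data.go.kr/data/15066119/fileData.do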

Alerts

Dataset has 1 (2.0%) duplicate row [Duplicates]
파일크기 is highly overall correlated with 최초등록일시 [High correlation]
파일확장자 is highly overall correlated with 최초등록일시 [High correlation]
최초등록일시 is highly overall correlated with 파일크기 and 1 other field [High correlation]
저장파일명 has 32 (64.0%) missing values [Missing]
파일설명 has 32 (64.0%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 09:26:13.953046
Analysis finished: 2023-12-12 09:26:14.855573
Duration: 0.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

원본파일명
Text

Distinct: 48
Distinct (%): 96.0%
Missing: 0
Missing (%): 0.0%
Memory size: 532.0 B

Length

Max length: 38
Median length: 31
Mean length: 22.06
Min length: 7

Characters and Unicode

Total characters: 1103
Distinct characters: 141
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
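
As a rough illustration of how such per-category counts can be derived, the sketch below uses Python's standard `unicodedata` module on one file name taken from this dataset. The helper name `category_counts` is hypothetical, and this is not necessarily how the profiling tool computes its figures. The two-letter codes map to the labels used in this report: `Lo` is Other Letter, `Ll` is Lowercase Letter, and `Po` is Other Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (for example 'Lo' or 'Ll')."""
    return Counter(unicodedata.category(ch) for ch in text)


# One original file name from the dataset sample.
sample = "도로법.pdf"
counts = category_counts(sample)

for code, n in counts.most_common():
    print(code, n)
```

Script and block properties are not exposed by `unicodedata`, so reproducing those tables would need additional data or a third-party package.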

Unique

Unique: 47
Unique (%): 94.0%

Sample

1st row: 자전거 이용 활성화에 관한 법률 시행규칙.pdf
2nd row: 자전거 이용 활성화에 관한 법률 시행령.pdf
3rd row: 자전거 이용 활성화에 관한 법률.pdf
4th row: 보행안전 및 편의증진에 관한 법률 시행규칙.pdf
5th row: 보행안전 및 편의증진에 관한 법률 시행령.pdf

Value | Count | Frequency (%)
대중교통시책평가 | 10 | 6.8%
관한 | 9 | 6.1%
11년 | 7 | 4.8%
 | 7 | 4.8%
법률 | 6 | 4.1%
시행규칙.pdf | 5 | 3.4%
시행령.pdf | 5 | 3.4%
1.평가개요_공통.pdf | 3 | 2.0%
자전거 | 3 | 2.0%
관련법규 | 3 | 2.0%
Other values (63) | 89 | 60.5%

Most occurring characters

Value | Count | Frequency (%)
(space) | 97 | 8.8%
. | 60 | 5.4%
p | 52 | 4.7%
1 | 31 | 2.8%
_ | 28 | 2.5%
 | 28 | 2.5%
f | 26 | 2.4%
 | 26 | 2.4%
d | 26 | 2.4%
 | 24 | 2.2%
Other values (131) | 705 | 63.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 642 | 58.2%
Lowercase Letter | 151 | 13.7%
Space Separator | 97 | 8.8%
Decimal Number | 82 | 7.4%
Other Punctuation | 67 | 6.1%
Connector Punctuation | 28 | 2.5%
Uppercase Letter | 13 | 1.2%
Close Punctuation | 12 | 1.1%
Open Punctuation | 11 | 1.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 28 | 4.4%
 | 26 | 4.0%
 | 24 | 3.7%
 | 23 | 3.6%
 | 21 | 3.3%
 | 21 | 3.3%
 | 19 | 3.0%
 | 17 | 2.6%
 | 15 | 2.3%
 | 15 | 2.3%
Other values (99) | 433 | 67.4%

Lowercase Letter
Value | Count | Frequency (%)
p | 52 | 34.4%
f | 26 | 17.2%
d | 26 | 17.2%
w | 21 | 13.9%
h | 21 | 13.9%
t | 2 | 1.3%
x | 1 | 0.7%
z | 1 | 0.7%
i | 1 | 0.7%

Decimal Number
Value | Count | Frequency (%)
1 | 31 | 37.8%
0 | 17 | 20.7%
2 | 16 | 19.5%
3 | 9 | 11.0%
4 | 5 | 6.1%
5 | 1 | 1.2%
8 | 1 | 1.2%
6 | 1 | 1.2%
9 | 1 | 1.2%

Uppercase Letter
Value | Count | Frequency (%)
S | 2 | 15.4%
B | 2 | 15.4%
T | 2 | 15.4%
F | 2 | 15.4%
A | 2 | 15.4%
E | 1 | 7.7%
D | 1 | 7.7%
C | 1 | 7.7%

Other Punctuation
Value | Count | Frequency (%)
. | 60 | 89.6%
' | 7 | 10.4%

Space Separator
Value | Count | Frequency (%)
(space) | 97 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 28 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 12 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 11 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 642 | 58.2%
Common | 297 | 26.9%
Latin | 164 | 14.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 28 | 4.4%
 | 26 | 4.0%
 | 24 | 3.7%
 | 23 | 3.6%
 | 21 | 3.3%
 | 21 | 3.3%
 | 19 | 3.0%
 | 17 | 2.6%
 | 15 | 2.3%
 | 15 | 2.3%
Other values (99) | 433 | 67.4%

Latin
Value | Count | Frequency (%)
p | 52 | 31.7%
f | 26 | 15.9%
d | 26 | 15.9%
w | 21 | 12.8%
h | 21 | 12.8%
t | 2 | 1.2%
S | 2 | 1.2%
B | 2 | 1.2%
T | 2 | 1.2%
F | 2 | 1.2%
Other values (7) | 8 | 4.9%

Common
Value | Count | Frequency (%)
(space) | 97 | 32.7%
. | 60 | 20.2%
1 | 31 | 10.4%
_ | 28 | 9.4%
0 | 17 | 5.7%
2 | 16 | 5.4%
) | 12 | 4.0%
( | 11 | 3.7%
3 | 9 | 3.0%
' | 7 | 2.4%
Other values (5) | 9 | 3.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 642 | 58.2%
ASCII | 461 | 41.8%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 97 | 21.0%
. | 60 | 13.0%
p | 52 | 11.3%
1 | 31 | 6.7%
_ | 28 | 6.1%
f | 26 | 5.6%
d | 26 | 5.6%
w | 21 | 4.6%
h | 21 | 4.6%
0 | 17 | 3.7%
Other values (22) | 82 | 17.8%

Hangul
Value | Count | Frequency (%)
 | 28 | 4.4%
 | 26 | 4.0%
 | 24 | 3.7%
 | 23 | 3.6%
 | 21 | 3.3%
 | 21 | 3.3%
 | 19 | 3.0%
 | 17 | 2.6%
 | 15 | 2.3%
 | 15 | 2.3%
Other values (99) | 433 | 67.4%

저장파일명
Text

MISSING 

Distinct: 18
Distinct (%): 100.0%
Missing: 32
Missing (%): 64.0%
Memory size: 532.0 B

Length

Max length: 25
Median length: 25
Mean length: 25
Min length: 25
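
Every stored file name has the same length, which is why all four length statistics coincide. The sketch below checks this on the five sample values that this report lists for 저장파일명. It is only an illustration over those five values, not a recomputation over all 18 non-missing entries.

```python
from statistics import mean, median

# The five sample values listed for 저장파일명 in this report.
samples = [
    "kcmBBS_201412080308128320",
    "kcmBBS_201412080307375480",
    "kcmBBS_201412080306461220",
    "kcmBBS_201412080303457480",
    "kcmBBS_201412080303004290",
]

lengths = [len(s) for s in samples]
summary = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(summary)
```

A fixed length of 25 is also consistent with the reported total of 450 characters across 18 values (18 × 25 = 450).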

Characters and Unicode

Total characters: 450
Distinct characters: 16
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 18
Unique (%): 100.0%

Sample

1st row: kcmBBS_201412080308128320
2nd row: kcmBBS_201412080307375480
3rd row: kcmBBS_201412080306461220
4th row: kcmBBS_201412080303457480
5th row: kcmBBS_201412080303004290

Value | Count | Frequency (%)
kcmbbs_201412080307375480 | 1 | 5.6%
kcmbbs_201412080306461220 | 1 | 5.6%
kcmbbs_201412040539288170 | 1 | 5.6%
kcmbbs_201412040540442700 | 1 | 5.6%
kcmbbs_201412040541458160 | 1 | 5.6%
kcmbbs_201412080254599670 | 1 | 5.6%
kcmbbs_201412080256211490 | 1 | 5.6%
kcmbbs_201412080257436830 | 1 | 5.6%
kcmbbs_201412080258256260 | 1 | 5.6%
kcmbbs_201412080259015900 | 1 | 5.6%
Other values (8) | 8 | 44.4%

Most occurring characters

Value | Count | Frequency (%)
0 | 90 | 20.0%
2 | 56 | 12.4%
1 | 47 | 10.4%
B | 36 | 8.0%
4 | 35 | 7.8%
8 | 28 | 6.2%
3 | 21 | 4.7%
k | 18 | 4.0%
c | 18 | 4.0%
m | 18 | 4.0%
Other values (6) | 83 | 18.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 324 | 72.0%
Uppercase Letter | 54 | 12.0%
Lowercase Letter | 54 | 12.0%
Connector Punctuation | 18 | 4.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 90 | 27.8%
2 | 56 | 17.3%
1 | 47 | 14.5%
4 | 35 | 10.8%
8 | 28 | 8.6%
3 | 21 | 6.5%
5 | 17 | 5.2%
9 | 11 | 3.4%
6 | 10 | 3.1%
7 | 9 | 2.8%

Lowercase Letter
Value | Count | Frequency (%)
k | 18 | 33.3%
c | 18 | 33.3%
m | 18 | 33.3%

Uppercase Letter
Value | Count | Frequency (%)
B | 36 | 66.7%
S | 18 | 33.3%

Connector Punctuation
Value | Count | Frequency (%)
_ | 18 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 342 | 76.0%
Latin | 108 | 24.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 90 | 26.3%
2 | 56 | 16.4%
1 | 47 | 13.7%
4 | 35 | 10.2%
8 | 28 | 8.2%
3 | 21 | 6.1%
_ | 18 | 5.3%
5 | 17 | 5.0%
9 | 11 | 3.2%
6 | 10 | 2.9%

Latin
Value | Count | Frequency (%)
B | 36 | 33.3%
k | 18 | 16.7%
c | 18 | 16.7%
m | 18 | 16.7%
S | 18 | 16.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 450 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 90 | 20.0%
2 | 56 | 12.4%
1 | 47 | 10.4%
B | 36 | 8.0%
4 | 35 | 7.8%
8 | 28 | 6.2%
3 | 21 | 4.7%
k | 18 | 4.0%
c | 18 | 4.0%
m | 18 | 4.0%
Other values (6) | 83 | 18.4%

파일확장자
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 6.0%
Missing: 0
Missing (%): 0.0%
Memory size: 532.0 B

Length

Max length: 3
Median length: 1
Mean length: 1.72
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: pdf
2nd row: pdf
3rd row: pdf
4th row: pdf
5th row: pdf

Common Values

Value | Count | Frequency (%)
G | 32 | 64.0%
pdf | 15 | 30.0%
hwp | 3 | 6.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
g | 32 | 64.0%
pdf | 15 | 30.0%
hwp | 3 | 6.0%

파일크기
Real number (ℝ)

HIGH CORRELATION 

Distinct: 48
Distinct (%): 96.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 416797.8
Minimum: 8124
Maximum: 4138874
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 582.0 B

Quantile statistics

Minimum: 8124
5-th percentile: 13207.3
Q1: 42568.25
Median: 81920
Q3: 435596.75
95-th percentile: 1957927.4
Maximum: 4138874
Range: 4130750
Interquartile range (IQR): 393028.5

Descriptive statistics

Standard deviation: 803598.15
Coefficient of variation (CV): 1.9280288
Kurtosis: 11.528335
Mean: 416797.8
Median Absolute Deviation (MAD): 61994.5
Skewness: 3.2717732
Sum: 20839890
Variance: 6.4576999 × 10^11
Monotonicity: Not monotonic
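
Several of the figures for 파일크기 are derived from one another. As a sanity check, the sketch below recomputes the mean, range, IQR, coefficient of variation, and variance from the other reported values. It starts from the summary numbers printed in this report, not from the raw data, and the variable names are illustrative.

```python
# Summary figures reported for 파일크기.
n = 50
total = 20839890
std = 803598.15
minimum, maximum = 8124, 4138874
q1, q3 = 42568.25, 435596.75

mean = total / n               # sum divided by the number of observations
value_range = maximum - minimum
iqr = q3 - q1                  # interquartile range
cv = std / mean                # coefficient of variation
variance = std ** 2            # variance is the squared standard deviation

print(f"Mean: {mean}")
print(f"Range: {value_range}")
print(f"IQR: {iqr}")
print(f"CV: {cv:.7f}")
print(f"Variance: {variance:.7e}")
```

A CV close to 2 together with a skewness above 3 indicates a strongly right-skewed distribution: most files are small, and a few multi-megabyte files dominate the mean.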
Histogram with fixed size bins (bins=48)
Value | Count | Frequency (%)
746238 | 3 | 6.0%
8124 | 1 | 2.0%
1994660 | 1 | 2.0%
146944 | 1 | 2.0%
74752 | 1 | 2.0%
4138874 | 1 | 2.0%
3225600 | 1 | 2.0%
11264 | 1 | 2.0%
1913032 | 1 | 2.0%
26624 | 1 | 2.0%
Other values (38) | 38 | 76.0%

Value | Count | Frequency (%)
8124 | 1 | 2.0%
11264 | 1 | 2.0%
11500 | 1 | 2.0%
15294 | 1 | 2.0%
17748 | 1 | 2.0%
22103 | 1 | 2.0%
26112 | 1 | 2.0%
26624 | 1 | 2.0%
31575 | 1 | 2.0%
34062 | 1 | 2.0%

Value | Count | Frequency (%)
4138874 | 1 | 2.0%
3225600 | 1 | 2.0%
1994660 | 1 | 2.0%
1913032 | 1 | 2.0%
1027072 | 1 | 2.0%
746238 | 3 | 6.0%
725504 | 1 | 2.0%
681542 | 1 | 2.0%
639284 | 1 | 2.0%
457245 | 1 | 2.0%

파일설명
Text

MISSING 

Distinct: 18
Distinct (%): 100.0%
Missing: 32
Missing (%): 64.0%
Memory size: 532.0 B

Length

Max length: 34
Median length: 21.5
Mean length: 15.833333
Min length: 3

Characters and Unicode

Total characters: 285
Distinct characters: 43
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 18
Unique (%): 100.0%

Sample

1st row: 자전거 이용 활성화에 관한 법률 시행규칙
2nd row: 자전거 이용 활성화에 관한 법률 시행령
3rd row: 자전거 이용 활성화에 관한 법률
4th row: 보행안전 및 편의증진에 관한 법률 시행규칙
5th row: 보행안전 및 편의증진에 관한 법률 시행령

Value | Count | Frequency (%)
관한 | 9 | 13.0%
법률 | 8 | 11.6%
 | 6 | 8.7%
시행규칙 | 5 | 7.2%
시행령 | 5 | 7.2%
도로교통법 | 3 | 4.3%
이용촉진에 | 3 | 4.3%
육성 | 3 | 4.3%
대중교통의 | 3 | 4.3%
교통안전법 | 3 | 4.3%
Other values (9) | 21 | 30.4%

Most occurring characters

Value | Count | Frequency (%)
(space) | 51 | 17.9%
 | 18 | 6.3%
 | 15 | 5.3%
 | 12 | 4.2%
 | 9 | 3.2%
 | 9 | 3.2%
 | 9 | 3.2%
 | 9 | 3.2%
 | 9 | 3.2%
 | 9 | 3.2%
Other values (33) | 135 | 47.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 213 | 74.7%
Space Separator | 51 | 17.9%
Decimal Number | 15 | 5.3%
Open Punctuation | 3 | 1.1%
Close Punctuation | 3 | 1.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 18 | 8.5%
 | 15 | 7.0%
 | 12 | 5.6%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
Other values (23) | 105 | 49.3%

Decimal Number
Value | Count | Frequency (%)
0 | 4 | 26.7%
1 | 3 | 20.0%
2 | 3 | 20.0%
4 | 2 | 13.3%
5 | 1 | 6.7%
8 | 1 | 6.7%
6 | 1 | 6.7%

Space Separator
Value | Count | Frequency (%)
(space) | 51 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 213 | 74.7%
Common | 72 | 25.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 18 | 8.5%
 | 15 | 7.0%
 | 12 | 5.6%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
Other values (23) | 105 | 49.3%

Common
Value | Count | Frequency (%)
(space) | 51 | 70.8%
0 | 4 | 5.6%
( | 3 | 4.2%
1 | 3 | 4.2%
) | 3 | 4.2%
2 | 3 | 4.2%
4 | 2 | 2.8%
5 | 1 | 1.4%
8 | 1 | 1.4%
6 | 1 | 1.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 213 | 74.7%
ASCII | 72 | 25.3%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 51 | 70.8%
0 | 4 | 5.6%
( | 3 | 4.2%
1 | 3 | 4.2%
) | 3 | 4.2%
2 | 3 | 4.2%
4 | 2 | 2.8%
5 | 1 | 1.4%
8 | 1 | 1.4%
6 | 1 | 1.4%

Hangul
Value | Count | Frequency (%)
 | 18 | 8.5%
 | 15 | 7.0%
 | 12 | 5.6%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
 | 9 | 4.2%
Other values (23) | 105 | 49.3%

최초등록일시
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 6.0%
Missing: 0
Missing (%): 0.0%
Memory size: 532.0 B

Length

Max length: 10
Median length: 4
Mean length: 6.16
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2014-12-08
2nd row: 2014-12-08
3rd row: 2014-12-08
4th row: 2014-12-08
5th row: 2014-12-08

Common Values

Value | Count | Frequency (%)
<NA> | 32 | 64.0%
2014-12-08 | 15 | 30.0%
2014-12-04 | 3 | 6.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 32 | 64.0%
2014-12-08 | 15 | 30.0%
2014-12-04 | 3 | 6.0%

Interactions


Correlations

Variable | 원본파일명 | 저장파일명 | 파일확장자 | 파일크기 | 파일설명 | 최초등록일시
원본파일명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
저장파일명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
파일확장자 | 1.000 | 1.000 | 1.000 | 0.341 | 1.000 | 0.944
파일크기 | 1.000 | 1.000 | 0.341 | 1.000 | 1.000 | 0.716
파일설명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
최초등록일시 | 1.000 | 1.000 | 0.944 | 0.716 | 1.000 | 1.000

Variable | 최초등록일시 | 파일확장자
최초등록일시 | 1.000 | 0.786
파일확장자 | 0.786 | 1.000

Variable | 파일크기 | 파일확장자 | 최초등록일시
파일크기 | 1.000 | 0.155 | 0.513
파일확장자 | 0.155 | 1.000 | 0.786
최초등록일시 | 0.513 | 0.786 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
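
One common way to compute a nullity correlation is to take the Pearson correlation between the present/missing indicators of two columns. The sketch below does this in plain Python. It assumes this definition, which may differ in detail from what the profiling tool does, and the function name and toy columns are hypothetical. The toy data mimics the pattern suggested by the sample rows of this dataset, where 저장파일명 and 파일설명 appear to be present or missing together.

```python
from math import sqrt


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns.

    A value is treated as missing when it is None. Returns NaN when either
    column is fully present or fully missing, because its indicator is constant.
    """
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")
    return cov / sqrt(var_a * var_b)


# Toy columns: 18 present values followed by 32 missing ones in both columns.
stored_name = ["name"] * 18 + [None] * 32
description = ["text"] * 18 + [None] * 32
r_together = nullity_correlation(stored_name, description)

# Two columns that are never present at the same time.
r_opposite = nullity_correlation(["a", None], [None, "b"])

print(r_together, r_opposite)
```

A value near 1 means the two columns tend to be missing in the same rows, and a value near -1 means one is present exactly when the other is missing.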

Sample

Row | 원본파일명 | 저장파일명 | 파일확장자 | 파일크기 | 파일설명 | 최초등록일시
0 | 자전거 이용 활성화에 관한 법률 시행규칙.pdf | kcmBBS_201412080308128320 | pdf | 8124 | 자전거 이용 활성화에 관한 법률 시행규칙 | 2014-12-08
1 | 자전거 이용 활성화에 관한 법률 시행령.pdf | kcmBBS_201412080307375480 | pdf | 15294 | 자전거 이용 활성화에 관한 법률 시행령 | 2014-12-08
2 | 자전거 이용 활성화에 관한 법률.pdf | kcmBBS_201412080306461220 | pdf | 22103 | 자전거 이용 활성화에 관한 법률 | 2014-12-08
3 | 보행안전 및 편의증진에 관한 법률 시행규칙.pdf | kcmBBS_201412080303457480 | pdf | 11500 | 보행안전 및 편의증진에 관한 법률 시행규칙 | 2014-12-08
4 | 보행안전 및 편의증진에 관한 법률 시행령.pdf | kcmBBS_201412080303004290 | pdf | 17748 | 보행안전 및 편의증진에 관한 법률 시행령 | 2014-12-08
5 | 보행안전 및 편의증진에 관한 법률.pdf | kcmBBS_201412080302205730 | pdf | 31575 | 보행안전 및 편의증진에 관한 법률 | 2014-12-08
6 | 도로법 시행규칙.pdf | kcmBBS_201412080301382670 | pdf | 35761 | 도로법 시행규칙 | 2014-12-08
7 | 도로법 시행령.pdf | kcmBBS_201412080300526890 | pdf | 94863 | 도로법 시행령 | 2014-12-08
8 | 도로법.pdf | kcmBBS_201412080300133480 | pdf | 105779 | 도로법 | 2014-12-08
9 | 도로교통법 시행규칙.pdf | kcmBBS_201412080259391890 | pdf | 134563 | 도로교통법 시행규칙 | 2014-12-08

Row | 원본파일명 | 저장파일명 | 파일확장자 | 파일크기 | 파일설명 | 최초등록일시
40 | 3.제출양식_여객자동차운송사업자.hwp | <NA> | G | 64000 | <NA> | <NA>
41 | 2.평가항목_여객자동차운송사업자.pdf | <NA> | G | 639284 | <NA> | <NA>
42 | 3.제출양식_도시철도및철도.hwp | <NA> | G | 60928 | <NA> | <NA>
43 | 1.평가개요_공통.pdf | <NA> | G | 746238 | <NA> | <NA>
44 | 2.평가항목_도시철도및철도사업자.pdf | <NA> | G | 410158 | <NA> | <NA>
45 | 1.평가개요_공통.pdf | <NA> | G | 746238 | <NA> | <NA>
46 | 대중교통경영서비스평가요령(20100319).hwp | <NA> | G | 77312 | <NA> | <NA>
47 | 환경 관련법규 온라인 정보.pdf | <NA> | G | 77330 | <NA> | <NA>
48 | 화물 관련법규 온라인 정보.pdf | <NA> | G | 74038 | <NA> | <NA>
49 | 항공 관련법규 온라인 정보.pdf | <NA> | G | 72078 | <NA> | <NA>

Duplicate rows

Most frequently occurring

Row | 원본파일명 | 저장파일명 | 파일확장자 | 파일크기 | 파일설명 | 최초등록일시 | # duplicates
0 | 1.평가개요_공통.pdf | <NA> | G | 746238 | <NA> | <NA> | 3
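
Duplicate rows of this kind can be found by counting identical records. The sketch below applies `collections.Counter` to three rows taken from the tail of the sample above, with `None` standing in for `<NA>`. It is an illustration on an excerpt, so it finds two occurrences of the repeated row, whereas the full dataset contains three.

```python
from collections import Counter

# Rows 43, 44 and 45 from the sample above; None stands in for <NA>.
rows = [
    ("1.평가개요_공통.pdf", None, "G", 746238, None, None),
    ("2.평가항목_도시철도및철도사업자.pdf", None, "G", 410158, None, None),
    ("1.평가개요_공통.pdf", None, "G", 746238, None, None),
]

# Tuples are hashable, so identical rows collapse into one counter key.
counts = Counter(rows)
duplicated = {row: n for row, n in counts.items() if n > 1}

for row, n in duplicated.items():
    print(row[0], n)
```

Under this reading, the report's "1 (2.0%) duplicate rows" alert counts distinct rows that repeat, while the "# duplicates" column gives how many times that row occurs.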