Overview

Dataset statistics

Number of variables5
Number of observations46
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory44.9 B

Variable types

Text2
Numeric1
Categorical1
DateTime1

Dataset

Description충청북도농업기술원에서 시행하는 연간 교육과정에 대한 첨부파일에 대한 정보로 파일명, 원본파일명, 파일크기, 등록년도, 등록일자에 대한 데이터를 제공합니다.
Author충청북도
URLhttps://www.data.go.kr/data/15038863/fileData.do

Alerts

파일크기 is highly overall correlated with 등록년도High correlation
등록년도 is highly overall correlated with 파일크기High correlation
파일명 has unique valuesUnique
등록일자 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:27:15.141740
Analysis finished2023-12-12 19:27:15.789245
Duration0.65 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

파일명
Text

UNIQUE 

Distinct46
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-13T04:27:15.975656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters690
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)100.0%

Sample

1st row15665260431.hwp
2nd row15665352011.hwp
3rd row15665353911.hwp
4th row15665357861.hwp
5th row15665359241.hwp
ValueCountFrequency (%)
15665260431.hwp 1
 
2.2%
15900448931.hwp 1
 
2.2%
15924462111.hwp 1
 
2.2%
15898552041.hwp 1
 
2.2%
15898552991.hwp 1
 
2.2%
15898555401.hwp 1
 
2.2%
15898556491.hwp 1
 
2.2%
15898557741.hwp 1
 
2.2%
15898558831.hwp 1
 
2.2%
15900446141.hwp 1
 
2.2%
Other values (36) 36
78.3%
2023-12-13T04:27:16.370711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 117
17.0%
5 87
12.6%
9 50
7.2%
8 49
7.1%
. 46
 
6.7%
h 46
 
6.7%
w 46
 
6.7%
p 46
 
6.7%
7 43
 
6.2%
6 42
 
6.1%
Other values (4) 118
17.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 506
73.3%
Lowercase Letter 138
 
20.0%
Other Punctuation 46
 
6.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 117
23.1%
5 87
17.2%
9 50
9.9%
8 49
9.7%
7 43
 
8.5%
6 42
 
8.3%
4 35
 
6.9%
3 29
 
5.7%
0 28
 
5.5%
2 26
 
5.1%
Lowercase Letter
ValueCountFrequency (%)
h 46
33.3%
w 46
33.3%
p 46
33.3%
Other Punctuation
ValueCountFrequency (%)
. 46
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 552
80.0%
Latin 138
 
20.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 117
21.2%
5 87
15.8%
9 50
9.1%
8 49
8.9%
. 46
 
8.3%
7 43
 
7.8%
6 42
 
7.6%
4 35
 
6.3%
3 29
 
5.3%
0 28
 
5.1%
Latin
ValueCountFrequency (%)
h 46
33.3%
w 46
33.3%
p 46
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 690
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 117
17.0%
5 87
12.6%
9 50
7.2%
8 49
7.1%
. 46
 
6.7%
h 46
 
6.7%
w 46
 
6.7%
p 46
 
6.7%
7 43
 
6.2%
6 42
 
6.1%
Other values (4) 118
17.1%
Distinct41
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-13T04:27:16.657659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length28
Mean length25.391304
Min length20

Characters and Unicode

Total characters1168
Distinct characters115
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)84.8%

Sample

1st row관리기 및 예초기 사용자 과정(8.27.).hwp
2nd row스마트팜 활용 과정(9.3~9.4).hwp
3rd row농업용 굴삭기 사용자 과정(9.10).hwp
4th row아열대작물 재배 과정(9.19).hwp
5th row주말 농업기계 과정(9.21).hwp
ValueCountFrequency (%)
과정 24
 
11.9%
교육계획.hwp 22
 
10.9%
사용자 14
 
6.9%
교육계획(시군).hwp 12
 
5.9%
2020년 8
 
4.0%
트랙터 6
 
3.0%
농업용 6
 
3.0%
굴삭기 6
 
3.0%
2019년 6
 
3.0%
6월 4
 
2.0%
Other values (58) 94
46.5%
2023-12-13T04:27:17.072344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
157
 
13.4%
. 57
 
4.9%
p 46
 
3.9%
w 46
 
3.9%
h 46
 
3.9%
45
 
3.9%
43
 
3.7%
40
 
3.4%
40
 
3.4%
39
 
3.3%
Other values (105) 609
52.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 656
56.2%
Space Separator 157
 
13.4%
Lowercase Letter 138
 
11.8%
Decimal Number 105
 
9.0%
Other Punctuation 57
 
4.9%
Close Punctuation 26
 
2.2%
Open Punctuation 26
 
2.2%
Math Symbol 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
6.9%
43
 
6.6%
40
 
6.1%
40
 
6.1%
39
 
5.9%
37
 
5.6%
35
 
5.3%
27
 
4.1%
27
 
4.1%
26
 
4.0%
Other values (87) 297
45.3%
Decimal Number
ValueCountFrequency (%)
2 33
31.4%
0 23
21.9%
9 16
15.2%
1 12
 
11.4%
6 6
 
5.7%
3 4
 
3.8%
4 4
 
3.8%
7 3
 
2.9%
5 3
 
2.9%
8 1
 
1.0%
Lowercase Letter
ValueCountFrequency (%)
p 46
33.3%
w 46
33.3%
h 46
33.3%
Space Separator
ValueCountFrequency (%)
157
100.0%
Other Punctuation
ValueCountFrequency (%)
. 57
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 656
56.2%
Common 374
32.0%
Latin 138
 
11.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
6.9%
43
 
6.6%
40
 
6.1%
40
 
6.1%
39
 
5.9%
37
 
5.6%
35
 
5.3%
27
 
4.1%
27
 
4.1%
26
 
4.0%
Other values (87) 297
45.3%
Common
ValueCountFrequency (%)
157
42.0%
. 57
 
15.2%
2 33
 
8.8%
) 26
 
7.0%
( 26
 
7.0%
0 23
 
6.1%
9 16
 
4.3%
1 12
 
3.2%
6 6
 
1.6%
3 4
 
1.1%
Other values (5) 14
 
3.7%
Latin
ValueCountFrequency (%)
p 46
33.3%
w 46
33.3%
h 46
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 656
56.2%
ASCII 512
43.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
157
30.7%
. 57
 
11.1%
p 46
 
9.0%
w 46
 
9.0%
h 46
 
9.0%
2 33
 
6.4%
) 26
 
5.1%
( 26
 
5.1%
0 23
 
4.5%
9 16
 
3.1%
Other values (8) 36
 
7.0%
Hangul
ValueCountFrequency (%)
45
 
6.9%
43
 
6.6%
40
 
6.1%
40
 
6.1%
39
 
5.9%
37
 
5.6%
35
 
5.3%
27
 
4.1%
27
 
4.1%
26
 
4.0%
Other values (87) 297
45.3%

파일크기
Real number (ℝ)

HIGH CORRELATION 

Distinct21
Distinct (%)45.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean187826.09
Minimum13824
Maximum467968
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size546.0 B
2023-12-13T04:27:17.197706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum13824
5-th percentile13952
Q118688
median107008
Q3433152
95-th percentile457728
Maximum467968
Range454144
Interquartile range (IQR)414464

Descriptive statistics

Standard deviation183247.24
Coefficient of variation (CV)0.97562188
Kurtosis-1.4733022
Mean187826.09
Median Absolute Deviation (MAD)90112
Skewness0.63764981
Sum8640000
Variance3.3579551 × 1010
MonotonicityNot monotonic
2023-12-13T04:27:17.344792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
107008 7
15.2%
433152 5
10.9%
103936 4
 
8.7%
457728 4
 
8.7%
16896 4
 
8.7%
13824 3
 
6.5%
432640 2
 
4.3%
58880 2
 
4.3%
14336 2
 
4.3%
18432 2
 
4.3%
Other values (11) 11
23.9%
ValueCountFrequency (%)
13824 3
6.5%
14336 2
4.3%
15872 1
 
2.2%
16896 4
8.7%
18432 2
4.3%
19456 1
 
2.2%
58368 1
 
2.2%
58880 2
4.3%
59392 1
 
2.2%
103936 4
8.7%
ValueCountFrequency (%)
467968 1
 
2.2%
457728 4
8.7%
438272 1
 
2.2%
437248 1
 
2.2%
434688 1
 
2.2%
433152 5
10.9%
432640 2
 
4.3%
147456 1
 
2.2%
135680 1
 
2.2%
107008 7
15.2%

등록년도
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size500.0 B
2020
29 
2019
17 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2020 29
63.0%
2019 17
37.0%

Length

2023-12-13T04:27:17.481149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:27:17.582165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 29
63.0%
2019 17
37.0%

등록일자
Date

UNIQUE 

Distinct46
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size500.0 B
Minimum2019-08-23 11:07:00
Maximum2020-06-23 09:56:00
2023-12-13T04:27:17.701284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:27:17.870222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=46)

Interactions

2023-12-13T04:27:15.425690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:27:17.961586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파일명원본파일명파일크기등록년도등록일자
파일명1.0001.0001.0001.0001.000
원본파일명1.0001.0001.0001.0001.000
파일크기1.0001.0001.0000.8391.000
등록년도1.0001.0000.8391.0001.000
등록일자1.0001.0001.0001.0001.000
2023-12-13T04:27:18.064891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
파일크기등록년도
파일크기1.0000.665
등록년도0.6651.000

Missing values

2023-12-13T04:27:15.590583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:27:15.725024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

파일명원본파일명파일크기등록년도등록일자
015665260431.hwp관리기 및 예초기 사용자 과정(8.27.).hwp1433620192019-08-23 11:07
115665352011.hwp스마트팜 활용 과정(9.3~9.4).hwp1433620192019-08-23 13:40
215665353911.hwp농업용 굴삭기 사용자 과정(9.10).hwp1689620192019-08-23 13:43
315665357861.hwp아열대작물 재배 과정(9.19).hwp1382420192019-08-23 13:49
415665359241.hwp주말 농업기계 과정(9.21).hwp1382420192019-08-23 13:52
515665360601.hwp공무원 농업기계 과정(9.25~9.27).hwp1587220192019-08-23 13:54
615665362091.hwp양봉 사육 과정(9.26~9.27).hwp1382420192019-08-23 13:56
715718136331.hwp2019년 버섯재배 과정 교육계획.hwp1843220192019-10-23 15:53
815719679421.hwp제3기 공무원농업기계 과정 교육계획.hwp1843220192019-10-25 10:45
915727567001.hwp데이터 기반 경영성과 분석 농업회계 교육(배포용).hwp1689620192019-11-03 13:51
파일명원본파일명파일크기등록년도등록일자
3615900454381.hwp2020년 친환경농업 과정 교육계획.hwp43264020202020-05-21 16:17
3715917787101.hwp소형농업기계 정비과정 교육계획(시군).hwp10700820202020-06-10 17:45
3815917787981.hwp제5기 농업용 굴삭기 사용자 과정 교육계획(시군).hwp10700820202020-06-10 17:46
3915917788701.hwp제6기 트랙터 사용자 과정 교육계획(시군).hwp10700820202020-06-10 17:47
4015917789311.hwp중대형 농업기계 과정 교육계획(시군).hwp10649620202020-06-10 17:48
4115918526441.hwp2020년 포도재배 과정 교육계획.hwp43315220202020-06-11 14:17
4215918531651.hwp2020년 토종벌사육 과정 교육계획.hwp43264020202020-06-11 14:26
4315918533171.hwp2020년 아열대작물 재배 과정 교육계획.hwp43315220202020-06-11 14:28
4415924462111.hwp2020년 청년농업인 역량강화 교육 계획(발송용).hwp13568020202020-06-18 11:10
4515928737791.hwp2020년 농촌융복합산업 과정 교육계획.hwp43724820202020-06-23 09:56