Overview

Dataset statistics

Number of variables4
Number of observations743
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.1 KiB
Average record size in memory33.2 B

Variable types

Numeric1
Text2
DateTime1

Dataset

Description환경관련 온라인 환경실무교육 콘텐츠 및 분야별 교육과정 목록 (항목: 사이버환경실무교육 운영 과목명, 차시, 날짜 등) 제공
URLhttps://www.data.go.kr/data/15052614/fileData.do

Alerts

차시명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:11:07.312112
Analysis finished2023-12-12 20:11:07.930477
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

Distinct651
Distinct (%)87.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean339.74428
Minimum1
Maximum651
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.7 KiB
2023-12-13T05:11:08.004019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile38.1
Q1186.5
median372
Q3465.5
95-th percentile613.9
Maximum651
Range650
Interquartile range (IQR)279

Descriptive statistics

Standard deviation179.80363
Coefficient of variation (CV)0.52923224
Kurtosis-1.0673868
Mean339.74428
Median Absolute Deviation (MAD)140
Skewness-0.20082541
Sum252430
Variance32329.344
MonotonicityNot monotonic
2023-12-13T05:11:08.175415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
429 5
 
0.7%
439 5
 
0.7%
432 5
 
0.7%
433 5
 
0.7%
434 5
 
0.7%
435 5
 
0.7%
436 5
 
0.7%
437 5
 
0.7%
438 5
 
0.7%
440 5
 
0.7%
Other values (641) 693
93.3%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
651 1
0.1%
650 1
0.1%
649 1
0.1%
648 1
0.1%
647 1
0.1%
646 1
0.1%
645 1
0.1%
644 1
0.1%
643 1
0.1%
642 1
0.1%
Distinct54
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
2023-12-13T05:11:08.395217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length21
Mean length10.039031
Min length4

Characters and Unicode

Total characters7459
Distinct characters150
Distinct categories6 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row新대기오염처리기술
2nd row新대기오염처리기술
3rd row新대기오염처리기술
4th row新대기오염처리기술
5th row新대기오염처리기술
ValueCountFrequency (%)
128
 
9.3%
온실가스실무자 40
 
2.9%
실무 32
 
2.3%
중소환경기업 25
 
1.8%
환경측정 23
 
1.7%
소각처리시설운영실무 22
 
1.6%
매립시설운영실무 21
 
1.5%
환경공간정보이론 21
 
1.5%
하수처리기술 21
 
1.5%
폐수처리기술 20
 
1.5%
Other values (79) 1024
74.4%
2023-12-13T05:11:08.776251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
634
 
8.5%
319
 
4.3%
304
 
4.1%
291
 
3.9%
286
 
3.8%
255
 
3.4%
242
 
3.2%
198
 
2.7%
167
 
2.2%
156
 
2.1%
Other values (140) 4607
61.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6611
88.6%
Space Separator 634
 
8.5%
Letter Number 83
 
1.1%
Uppercase Letter 80
 
1.1%
Other Punctuation 27
 
0.4%
Lowercase Letter 24
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
319
 
4.8%
304
 
4.6%
291
 
4.4%
286
 
4.3%
255
 
3.9%
242
 
3.7%
198
 
3.0%
167
 
2.5%
156
 
2.4%
154
 
2.3%
Other values (126) 4239
64.1%
Uppercase Letter
ValueCountFrequency (%)
E 16
20.0%
S 16
20.0%
G 16
20.0%
X 16
20.0%
D 8
10.0%
T 8
10.0%
Lowercase Letter
ValueCountFrequency (%)
h 8
33.3%
c 8
33.3%
e 8
33.3%
Letter Number
ValueCountFrequency (%)
44
53.0%
39
47.0%
Other Punctuation
ValueCountFrequency (%)
· 19
70.4%
: 8
29.6%
Space Separator
ValueCountFrequency (%)
634
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6571
88.1%
Common 661
 
8.9%
Latin 187
 
2.5%
Han 40
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
319
 
4.9%
304
 
4.6%
291
 
4.4%
286
 
4.4%
255
 
3.9%
242
 
3.7%
198
 
3.0%
167
 
2.5%
156
 
2.4%
154
 
2.3%
Other values (125) 4199
63.9%
Latin
ValueCountFrequency (%)
44
23.5%
39
20.9%
E 16
 
8.6%
S 16
 
8.6%
G 16
 
8.6%
X 16
 
8.6%
D 8
 
4.3%
h 8
 
4.3%
c 8
 
4.3%
e 8
 
4.3%
Common
ValueCountFrequency (%)
634
95.9%
· 19
 
2.9%
: 8
 
1.2%
Han
ValueCountFrequency (%)
40
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6571
88.1%
ASCII 746
 
10.0%
Number Forms 83
 
1.1%
CJK 40
 
0.5%
None 19
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
634
85.0%
E 16
 
2.1%
S 16
 
2.1%
G 16
 
2.1%
X 16
 
2.1%
D 8
 
1.1%
: 8
 
1.1%
h 8
 
1.1%
c 8
 
1.1%
e 8
 
1.1%
Hangul
ValueCountFrequency (%)
319
 
4.9%
304
 
4.6%
291
 
4.4%
286
 
4.4%
255
 
3.9%
242
 
3.7%
198
 
3.0%
167
 
2.5%
156
 
2.4%
154
 
2.3%
Other values (125) 4199
63.9%
Number Forms
ValueCountFrequency (%)
44
53.0%
39
47.0%
CJK
ValueCountFrequency (%)
40
100.0%
None
ValueCountFrequency (%)
· 19
100.0%

차시명
Text

UNIQUE 

Distinct743
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
2023-12-13T05:11:09.153012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length34
Mean length17.979812
Min length6

Characters and Unicode

Total characters13359
Distinct characters428
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique743 ?
Unique (%)100.0%

Sample

1st row대기오염물질 종류 및 특성
2nd row미세먼지(PM) 제어기술
3rd row황산화물(SOx) 배출원 및 제어기술
4th row질소산화물(NOx) 배출원 및 제어기술
5th row휘발성유기화합물질(VOCs) 제어기술
ValueCountFrequency (%)
170
 
5.3%
59
 
1.9%
2 45
 
1.4%
1 45
 
1.4%
5 38
 
1.2%
3 38
 
1.2%
4 38
 
1.2%
6 37
 
1.2%
7 36
 
1.1%
8 36
 
1.1%
Other values (1329) 2641
83.0%
2023-12-13T05:11:09.672429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2467
 
18.5%
. 635
 
4.8%
1 367
 
2.7%
255
 
1.9%
237
 
1.8%
198
 
1.5%
180
 
1.3%
171
 
1.3%
) 165
 
1.2%
( 165
 
1.2%
Other values (418) 8519
63.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8311
62.2%
Space Separator 2467
 
18.5%
Decimal Number 1074
 
8.0%
Other Punctuation 749
 
5.6%
Uppercase Letter 204
 
1.5%
Close Punctuation 165
 
1.2%
Open Punctuation 165
 
1.2%
Lowercase Letter 116
 
0.9%
Dash Punctuation 58
 
0.4%
Letter Number 32
 
0.2%
Other values (2) 18
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
255
 
3.1%
237
 
2.9%
198
 
2.4%
180
 
2.2%
171
 
2.1%
154
 
1.9%
141
 
1.7%
141
 
1.7%
141
 
1.7%
135
 
1.6%
Other values (349) 6558
78.9%
Uppercase Letter
ValueCountFrequency (%)
S 39
19.1%
I 23
11.3%
C 17
8.3%
M 16
7.8%
G 15
 
7.4%
R 13
 
6.4%
O 12
 
5.9%
T 10
 
4.9%
A 10
 
4.9%
E 10
 
4.9%
Other values (12) 39
19.1%
Lowercase Letter
ValueCountFrequency (%)
e 19
16.4%
i 12
10.3%
n 12
10.3%
s 10
 
8.6%
t 8
 
6.9%
l 7
 
6.0%
o 7
 
6.0%
r 6
 
5.2%
a 5
 
4.3%
m 5
 
4.3%
Other values (10) 25
21.6%
Decimal Number
ValueCountFrequency (%)
1 367
34.2%
2 149
13.9%
0 94
 
8.8%
3 82
 
7.6%
4 72
 
6.7%
5 68
 
6.3%
6 65
 
6.1%
7 62
 
5.8%
8 60
 
5.6%
9 55
 
5.1%
Other Punctuation
ValueCountFrequency (%)
. 635
84.8%
, 42
 
5.6%
· 41
 
5.5%
/ 14
 
1.9%
? 13
 
1.7%
& 2
 
0.3%
: 2
 
0.3%
Letter Number
ValueCountFrequency (%)
15
46.9%
14
43.8%
2
 
6.2%
1
 
3.1%
Space Separator
ValueCountFrequency (%)
2467
100.0%
Close Punctuation
ValueCountFrequency (%)
) 165
100.0%
Open Punctuation
ValueCountFrequency (%)
( 165
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%
Control
ValueCountFrequency (%)
17
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8309
62.2%
Common 4696
35.2%
Latin 352
 
2.6%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
255
 
3.1%
237
 
2.9%
198
 
2.4%
180
 
2.2%
171
 
2.1%
154
 
1.9%
141
 
1.7%
141
 
1.7%
141
 
1.7%
135
 
1.6%
Other values (347) 6556
78.9%
Latin
ValueCountFrequency (%)
S 39
 
11.1%
I 23
 
6.5%
e 19
 
5.4%
C 17
 
4.8%
M 16
 
4.5%
G 15
 
4.3%
15
 
4.3%
14
 
4.0%
R 13
 
3.7%
i 12
 
3.4%
Other values (36) 169
48.0%
Common
ValueCountFrequency (%)
2467
52.5%
. 635
 
13.5%
1 367
 
7.8%
) 165
 
3.5%
( 165
 
3.5%
2 149
 
3.2%
0 94
 
2.0%
3 82
 
1.7%
4 72
 
1.5%
5 68
 
1.4%
Other values (13) 432
 
9.2%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8295
62.1%
ASCII 4974
37.2%
None 41
 
0.3%
Number Forms 32
 
0.2%
Compat Jamo 14
 
0.1%
CJK 2
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2467
49.6%
. 635
 
12.8%
1 367
 
7.4%
) 165
 
3.3%
( 165
 
3.3%
2 149
 
3.0%
0 94
 
1.9%
3 82
 
1.6%
4 72
 
1.4%
5 68
 
1.4%
Other values (53) 710
 
14.3%
Hangul
ValueCountFrequency (%)
255
 
3.1%
237
 
2.9%
198
 
2.4%
180
 
2.2%
171
 
2.1%
154
 
1.9%
141
 
1.7%
141
 
1.7%
141
 
1.7%
135
 
1.6%
Other values (346) 6542
78.9%
None
ValueCountFrequency (%)
· 41
100.0%
Number Forms
ValueCountFrequency (%)
15
46.9%
14
43.8%
2
 
6.2%
1
 
3.1%
Compat Jamo
ValueCountFrequency (%)
14
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

날짜
Date

Distinct14
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size5.9 KiB
Minimum2019-05-31 00:00:00
Maximum2023-03-13 00:00:00
2023-12-13T05:11:09.804463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:11:09.922528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)

Interactions

2023-12-13T05:11:07.667277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:11:10.020995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번과목명날짜
순번1.0000.9960.668
과목명0.9961.0001.000
날짜0.6681.0001.000

Missing values

2023-12-13T05:11:07.811917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:11:07.900811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번과목명차시명날짜
01新대기오염처리기술대기오염물질 종류 및 특성2023-01-10
12新대기오염처리기술미세먼지(PM) 제어기술2023-01-10
23新대기오염처리기술황산화물(SOx) 배출원 및 제어기술2023-01-10
34新대기오염처리기술질소산화물(NOx) 배출원 및 제어기술2023-01-10
45新대기오염처리기술휘발성유기화합물질(VOCs) 제어기술2023-01-10
56新대기오염처리기술악취 제어기술2023-01-10
67新대기오염처리기술실내 공기질 관리기술2023-01-10
78新대기오염처리기술온실가스 배출원 및 처리기술2023-01-10
89新대기오염처리기술흡수/흡착기술2023-01-10
910新대기오염처리기술촉매산화기술2023-01-10
순번과목명차시명날짜
733642대기환경관리계획 및 방지시설관리Ⅱ1. 대기오염 확산2019-05-31
734643대기환경관리계획 및 방지시설관리Ⅱ2. 대기오염 확산에 따른 민원 대응방안2019-05-31
735644대기환경관리계획 및 방지시설관리Ⅱ3. 대기확산 모델링(1)2019-05-31
736645대기환경관리계획 및 방지시설관리Ⅱ4. 대기확산 모델링(2)2019-05-31
737646대기환경관리계획 및 방지시설관리Ⅱ5. 대기오염물질 측정분석(1)2019-05-31
738647대기환경관리계획 및 방지시설관리Ⅱ6. 대기오염물질 측정분석(2)2019-05-31
739648대기환경관리계획 및 방지시설관리Ⅱ7. 굴뚝자동측정시스템(TMS, CleanSYS)(1)2019-05-31
740649대기환경관리계획 및 방지시설관리Ⅱ8. 굴뚝자동측정시스템(TMS, CleanSYS)(2)2019-05-31
741650대기환경관리계획 및 방지시설관리Ⅱ9. 설계경제성 검토(VE)2019-05-31
742651대기환경관리계획 및 방지시설관리Ⅱ10. 입찰안내서(ITB) 작성2019-05-31