Overview

Dataset statistics

Number of variables12
Number of observations476
Missing cells500
Missing cells (%)8.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory45.7 KiB
Average record size in memory98.3 B

Variable types

Numeric2
Text4
Categorical4
DateTime2

Dataset

Description한국환경산업기술원 환경정보공개시스템 외부위원 교육과정 이수 데이터(교육과정, 검증위원 등) 2023년도 정보 입니다.
URLhttps://www.data.go.kr/data/15120684/fileData.do

Alerts

교육과정 기간 종료 is highly overall correlated with 수료증계정 비식별화 and 3 other fieldsHigh correlation
교육과정 기간 시작 is highly overall correlated with 수료증계정 비식별화 and 3 other fieldsHigh correlation
종류구분 is highly overall correlated with 수료증계정 비식별화 and 3 other fieldsHigh correlation
번호 is highly overall correlated with 수료증계정 비식별화High correlation
수료증계정 비식별화 is highly overall correlated with 번호 and 3 other fieldsHigh correlation
교육기관이름 is highly overall correlated with 종류구분 and 2 other fieldsHigh correlation
교육기관이름 is highly imbalanced (72.8%)Imbalance
교육과정 수정일 has 246 (51.7%) missing valuesMissing
교육과정 수정자 비식별화 has 246 (51.7%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:07:58.183389
Analysis finished2023-12-12 12:08:00.266235
Duration2.08 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct476
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean239.5
Minimum2
Maximum477
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2023-12-12T21:08:00.719295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile25.75
Q1120.75
median239.5
Q3358.25
95-th percentile453.25
Maximum477
Range475
Interquartile range (IQR)237.5

Descriptive statistics

Standard deviation137.55363
Coefficient of variation (CV)0.57433664
Kurtosis-1.2
Mean239.5
Median Absolute Deviation (MAD)119
Skewness0
Sum114002
Variance18921
MonotonicityNot monotonic
2023-12-12T21:08:00.894481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
477 1
 
0.2%
163 1
 
0.2%
151 1
 
0.2%
152 1
 
0.2%
153 1
 
0.2%
155 1
 
0.2%
154 1
 
0.2%
156 1
 
0.2%
157 1
 
0.2%
158 1
 
0.2%
Other values (466) 466
97.9%
ValueCountFrequency (%)
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
11 1
0.2%
ValueCountFrequency (%)
477 1
0.2%
476 1
0.2%
475 1
0.2%
474 1
0.2%
473 1
0.2%
472 1
0.2%
471 1
0.2%
470 1
0.2%
469 1
0.2%
468 1
0.2%
Distinct116
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-12T21:08:01.120270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters7140
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)2.1%

Sample

1st rowV00000000000098
2nd rowV00000000000085
3rd rowV00000000000088
4th rowV00000000000008
5th rowV00000000000001
ValueCountFrequency (%)
v00000000000009 6
 
1.3%
v00000000000011 6
 
1.3%
v00000000000100 6
 
1.3%
v00000000000032 6
 
1.3%
v00000000000027 6
 
1.3%
v00000000000002 5
 
1.1%
v00000000000031 5
 
1.1%
v00000000000096 5
 
1.1%
v00000000000066 5
 
1.1%
v00000000000062 5
 
1.1%
Other values (106) 421
88.4%
2023-12-12T21:08:01.476552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 5778
80.9%
V 476
 
6.7%
1 149
 
2.1%
6 100
 
1.4%
2 97
 
1.4%
7 92
 
1.3%
3 92
 
1.3%
8 92
 
1.3%
9 91
 
1.3%
4 88
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6664
93.3%
Uppercase Letter 476
 
6.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5778
86.7%
1 149
 
2.2%
6 100
 
1.5%
2 97
 
1.5%
7 92
 
1.4%
3 92
 
1.4%
8 92
 
1.4%
9 91
 
1.4%
4 88
 
1.3%
5 85
 
1.3%
Uppercase Letter
ValueCountFrequency (%)
V 476
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6664
93.3%
Latin 476
 
6.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 5778
86.7%
1 149
 
2.2%
6 100
 
1.5%
2 97
 
1.5%
7 92
 
1.4%
3 92
 
1.4%
8 92
 
1.4%
9 91
 
1.4%
4 88
 
1.3%
5 85
 
1.3%
Latin
ValueCountFrequency (%)
V 476
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7140
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5778
80.9%
V 476
 
6.7%
1 149
 
2.1%
6 100
 
1.4%
2 97
 
1.4%
7 92
 
1.3%
3 92
 
1.3%
8 92
 
1.3%
9 91
 
1.3%
4 88
 
1.2%

종류구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
MAIN
289 
OPTION
184 
SUB
 
3

Length

Max length6
Median length4
Mean length4.7668067
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMAIN
2nd rowMAIN
3rd rowMAIN
4th rowMAIN
5th rowMAIN

Common Values

ValueCountFrequency (%)
MAIN 289
60.7%
OPTION 184
38.7%
SUB 3
 
0.6%

Length

2023-12-12T21:08:01.643463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:08:01.742564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
main 289
60.7%
option 184
38.7%
sub 3
 
0.6%

교육기관이름
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct31
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
한국환경산업기술원
398 
한국표준협회
 
10
국립환경인재개발원
 
9
한국환경산업기술원장
 
8
사이버환경실무교육
 
7
Other values (26)
44 

Length

Max length31
Median length9
Mean length9.0294118
Min length1

Unique

Unique19 ?
Unique (%)4.0%

Sample

1st row한국환경산업기술원
2nd row<NA>
3rd row한국산업환경기술원
4th row한국환경산업기술원
5th row한국환경산업기술원

Common Values

ValueCountFrequency (%)
한국환경산업기술원 398
83.6%
한국표준협회 10
 
2.1%
국립환경인재개발원 9
 
1.9%
한국환경산업기술원장 8
 
1.7%
사이버환경실무교육 7
 
1.5%
환경산업기술원 7
 
1.5%
한국환경산업기술 6
 
1.3%
한국환경산업기술원(국가환경정보센터) 3
 
0.6%
사이버환경실무 교육(한국환경산업기술원) 3
 
0.6%
국가과학기술인력개발원 2
 
0.4%
Other values (21) 23
 
4.8%

Length

2023-12-12T21:08:01.874853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한국환경산업기술원 398
82.2%
한국표준협회 10
 
2.1%
국립환경인재개발원 9
 
1.9%
한국환경산업기술원장 8
 
1.7%
사이버환경실무교육 7
 
1.4%
환경산업기술원 7
 
1.4%
한국환경산업기술 6
 
1.2%
교육(한국환경산업기술원 3
 
0.6%
기술원 3
 
0.6%
사이버환경실무 3
 
0.6%
Other values (25) 30
 
6.2%
Distinct141
Distinct (%)29.7%
Missing1
Missing (%)0.2%
Memory size3.8 KiB
2023-12-12T21:08:02.141027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length25
Mean length15.96
Min length1

Characters and Unicode

Total characters7581
Distinct characters162
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)20.4%

Sample

1st row환경정보 검증위원 온라인 교육
2nd row검증위원기본교육
3rd row환경정보 검증위원 온라인교육
4th row환경정보검증위원온라인교육
5th row환경정보 검증위원 온라인 교육
ValueCountFrequency (%)
검증위원 249
15.7%
환경정보 242
15.2%
교육 132
 
8.3%
온라인 128
 
8.1%
제1기 108
 
6.8%
기초교육 84
 
5.3%
2022년 71
 
4.5%
제4기 56
 
3.5%
제3기 45
 
2.8%
온라인교육 41
 
2.6%
Other values (131) 433
27.2%
2023-12-12T21:08:02.653494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1119
 
14.8%
418
 
5.5%
360
 
4.7%
341
 
4.5%
2 312
 
4.1%
311
 
4.1%
309
 
4.1%
308
 
4.1%
306
 
4.0%
296
 
3.9%
Other values (152) 3501
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5699
75.2%
Space Separator 1119
 
14.8%
Decimal Number 681
 
9.0%
Lowercase Letter 30
 
0.4%
Uppercase Letter 18
 
0.2%
Letter Number 12
 
0.2%
Close Punctuation 7
 
0.1%
Open Punctuation 6
 
0.1%
Dash Punctuation 5
 
0.1%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
418
 
7.3%
360
 
6.3%
341
 
6.0%
311
 
5.5%
309
 
5.4%
308
 
5.4%
306
 
5.4%
296
 
5.2%
291
 
5.1%
288
 
5.1%
Other values (116) 2471
43.4%
Lowercase Letter
ValueCountFrequency (%)
i 5
16.7%
n 4
13.3%
r 4
13.3%
a 3
10.0%
o 3
10.0%
s 2
 
6.7%
u 2
 
6.7%
t 2
 
6.7%
e 2
 
6.7%
d 2
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
S 5
27.8%
I 2
 
11.1%
K 2
 
11.1%
O 2
 
11.1%
T 2
 
11.1%
L 1
 
5.6%
A 1
 
5.6%
C 1
 
5.6%
E 1
 
5.6%
G 1
 
5.6%
Decimal Number
ValueCountFrequency (%)
2 312
45.8%
1 148
21.7%
0 102
 
15.0%
4 58
 
8.5%
3 53
 
7.8%
5 8
 
1.2%
Letter Number
ValueCountFrequency (%)
11
91.7%
1
 
8.3%
Close Punctuation
ValueCountFrequency (%)
) 6
85.7%
] 1
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 5
83.3%
[ 1
 
16.7%
Space Separator
ValueCountFrequency (%)
1119
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5676
74.9%
Common 1822
 
24.0%
Latin 60
 
0.8%
Han 23
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
418
 
7.4%
360
 
6.3%
341
 
6.0%
311
 
5.5%
309
 
5.4%
308
 
5.4%
306
 
5.4%
296
 
5.2%
291
 
5.1%
288
 
5.1%
Other values (115) 2448
43.1%
Latin
ValueCountFrequency (%)
11
18.3%
i 5
 
8.3%
S 5
 
8.3%
n 4
 
6.7%
r 4
 
6.7%
a 3
 
5.0%
o 3
 
5.0%
I 2
 
3.3%
K 2
 
3.3%
s 2
 
3.3%
Other values (13) 19
31.7%
Common
ValueCountFrequency (%)
1119
61.4%
2 312
 
17.1%
1 148
 
8.1%
0 102
 
5.6%
4 58
 
3.2%
3 53
 
2.9%
5 8
 
0.4%
) 6
 
0.3%
( 5
 
0.3%
- 5
 
0.3%
Other values (3) 6
 
0.3%
Han
ValueCountFrequency (%)
23
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5676
74.9%
ASCII 1870
 
24.7%
CJK 23
 
0.3%
Number Forms 12
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1119
59.8%
2 312
 
16.7%
1 148
 
7.9%
0 102
 
5.5%
4 58
 
3.1%
3 53
 
2.8%
5 8
 
0.4%
) 6
 
0.3%
i 5
 
0.3%
S 5
 
0.3%
Other values (24) 54
 
2.9%
Hangul
ValueCountFrequency (%)
418
 
7.4%
360
 
6.3%
341
 
6.0%
311
 
5.5%
309
 
5.4%
308
 
5.4%
306
 
5.4%
296
 
5.2%
291
 
5.1%
288
 
5.1%
Other values (115) 2448
43.1%
CJK
ValueCountFrequency (%)
23
100.0%
Number Forms
ValueCountFrequency (%)
11
91.7%
1
 
8.3%

교육과정 기간 시작
Categorical

HIGH CORRELATION 

Distinct39
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2021-04-19
93 
2022-04-08
87 
0004-50-05
85 
2021-06-01
75 
2021-05-03
47 
Other values (34)
89 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique23 ?
Unique (%)4.8%

Sample

1st row0004-50-05
2nd row0004-50-26
3rd row0004-50-05
4th row0004-50-05
5th row0004-50-05

Common Values

ValueCountFrequency (%)
2021-04-19 93
19.5%
2022-04-08 87
18.3%
0004-50-05 85
17.9%
2021-06-01 75
15.8%
2021-05-03 47
9.9%
2021-04-05 24
 
5.0%
2021-07-05 12
 
2.5%
2021-05-24 8
 
1.7%
2021-06-10 6
 
1.3%
2021-05-17 4
 
0.8%
Other values (29) 35
 
7.4%

Length

2023-12-12T21:08:02.853563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2021-04-19 93
19.5%
2022-04-08 87
18.3%
0004-50-05 85
17.9%
2021-06-01 75
15.8%
2021-05-03 47
9.9%
2021-04-05 24
 
5.0%
2021-07-05 12
 
2.5%
2021-05-24 8
 
1.7%
2021-06-10 6
 
1.3%
2021-05-17 4
 
0.8%
Other values (29) 35
 
7.4%

교육과정 기간 종료
Categorical

HIGH CORRELATION 

Distinct40
Distinct (%)8.4%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2021-06-30
96 
2022-04-29
87 
0004-50-21
81 
2021-06-25
73 
2021-05-27
47 
Other values (35)
92 

Length

Max length10
Median length10
Mean length9.9621849
Min length4

Unique

Unique23 ?
Unique (%)4.8%

Sample

1st row0004-50-21
2nd row0004-50-26
3rd row0004-50-21
4th row0004-50-21
5th row0004-50-21

Common Values

ValueCountFrequency (%)
2021-06-30 96
20.2%
2022-04-29 87
18.3%
0004-50-21 81
17.0%
2021-06-25 73
15.3%
2021-05-27 47
9.9%
2021-04-29 24
 
5.0%
2021-07-29 12
 
2.5%
2021-05-28 8
 
1.7%
2021-06-11 6
 
1.3%
2021-05-18 4
 
0.8%
Other values (30) 38
 
8.0%

Length

2023-12-12T21:08:03.004971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2021-06-30 96
20.2%
2022-04-29 87
18.3%
0004-50-21 81
17.0%
2021-06-25 73
15.3%
2021-05-27 47
9.9%
2021-04-29 24
 
5.0%
2021-07-29 12
 
2.5%
2021-05-28 8
 
1.7%
2021-06-11 6
 
1.3%
2021-05-18 4
 
0.8%
Other values (30) 38
 
8.0%
Distinct63
Distinct (%)13.3%
Missing4
Missing (%)0.8%
Memory size3.8 KiB
2023-12-12T21:08:03.230896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters4720
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique31 ?
Unique (%)6.6%

Sample

1st row0004-50-22
2nd row0004-50-26
3rd row0004-50-22
4th row0004-50-21
5th row0004-50-21
ValueCountFrequency (%)
2021-07-01 95
20.1%
2021-06-28 71
15.0%
2022-05-02 65
13.8%
0004-50-22 48
10.2%
2021-05-28 46
9.7%
2021-04-30 24
 
5.1%
2021-07-30 11
 
2.3%
2022-04-29 8
 
1.7%
0004-50-19 7
 
1.5%
0004-50-20 6
 
1.3%
Other values (53) 91
19.3%
2023-12-12T21:08:03.658638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1348
28.6%
2 1185
25.1%
- 944
20.0%
1 440
 
9.3%
5 220
 
4.7%
4 154
 
3.3%
8 138
 
2.9%
7 118
 
2.5%
6 104
 
2.2%
3 48
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3776
80.0%
Dash Punctuation 944
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1348
35.7%
2 1185
31.4%
1 440
 
11.7%
5 220
 
5.8%
4 154
 
4.1%
8 138
 
3.7%
7 118
 
3.1%
6 104
 
2.8%
3 48
 
1.3%
9 21
 
0.6%
Dash Punctuation
ValueCountFrequency (%)
- 944
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4720
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1348
28.6%
2 1185
25.1%
- 944
20.0%
1 440
 
9.3%
5 220
 
4.7%
4 154
 
3.3%
8 138
 
2.9%
7 118
 
2.5%
6 104
 
2.2%
3 48
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4720
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1348
28.6%
2 1185
25.1%
- 944
20.0%
1 440
 
9.3%
5 220
 
4.7%
4 154
 
3.3%
8 138
 
2.9%
7 118
 
2.5%
6 104
 
2.2%
3 48
 
1.0%

수료증계정 비식별화
Real number (ℝ)

HIGH CORRELATION 

Distinct473
Distinct (%)100.0%
Missing3
Missing (%)0.6%
Infinite0
Infinite (%)0.0%
Mean2337118.1
Minimum2296348
Maximum2452614
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2023-12-12T21:08:03.856540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2296348
5-th percentile2296792.6
Q12297022
median2297611
Q32341506
95-th percentile2452507.4
Maximum2452614
Range156266
Interquartile range (IQR)44484

Descriptive statistics

Standard deviation59691.918
Coefficient of variation (CV)0.025540822
Kurtosis-0.12079452
Mean2337118.1
Median Absolute Deviation (MAD)694
Skewness1.2489294
Sum1.1054569 × 109
Variance3.563125 × 109
MonotonicityNot monotonic
2023-12-12T21:08:04.050813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2297068 1
 
0.2%
2297053 1
 
0.2%
2297054 1
 
0.2%
2297058 1
 
0.2%
2297057 1
 
0.2%
2297059 1
 
0.2%
2297060 1
 
0.2%
2297061 1
 
0.2%
2297062 1
 
0.2%
2297064 1
 
0.2%
Other values (463) 463
97.3%
(Missing) 3
 
0.6%
ValueCountFrequency (%)
2296348 1
0.2%
2296352 1
0.2%
2296353 1
0.2%
2296364 1
0.2%
2296365 1
0.2%
2296366 1
0.2%
2296369 1
0.2%
2296370 1
0.2%
2296371 1
0.2%
2296377 1
0.2%
ValueCountFrequency (%)
2452614 1
0.2%
2452609 1
0.2%
2452597 1
0.2%
2452571 1
0.2%
2452570 1
0.2%
2452566 1
0.2%
2452550 1
0.2%
2452549 1
0.2%
2452544 1
0.2%
2452543 1
0.2%
Distinct39
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
Minimum2021-07-21 00:00:00
Maximum2023-04-11 00:00:00
2023-12-12T21:08:04.238951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:08:04.392658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
Distinct31
Distinct (%)13.5%
Missing246
Missing (%)51.7%
Memory size3.8 KiB
Minimum2021-07-22 00:00:00
Maximum2023-04-06 00:00:00
2023-12-12T21:08:04.524359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:08:04.682566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
Distinct95
Distinct (%)41.3%
Missing246
Missing (%)51.7%
Memory size3.8 KiB
2023-12-12T21:08:04.911655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length15
Mean length15
Min length15

Characters and Unicode

Total characters3450
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)10.0%

Sample

1st rowV00000000000249
2nd rowV00000000000250
3rd rowV00000000000251
4th rowV00000000000252
5th rowV00000000000253
ValueCountFrequency (%)
v00000000000301 5
 
2.2%
v00000000000282 5
 
2.2%
v00000000000261 5
 
2.2%
v00000000000271 5
 
2.2%
v00000000000310 5
 
2.2%
v00000000000290 5
 
2.2%
v00000000000273 4
 
1.7%
v00000000000264 4
 
1.7%
v00000000000324 4
 
1.7%
v00000000000308 4
 
1.7%
Other values (85) 184
80.0%
2023-12-12T21:08:05.294208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 2585
74.9%
V 230
 
6.7%
2 171
 
5.0%
3 141
 
4.1%
1 59
 
1.7%
6 52
 
1.5%
9 49
 
1.4%
8 49
 
1.4%
7 46
 
1.3%
5 37
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3220
93.3%
Uppercase Letter 230
 
6.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2585
80.3%
2 171
 
5.3%
3 141
 
4.4%
1 59
 
1.8%
6 52
 
1.6%
9 49
 
1.5%
8 49
 
1.5%
7 46
 
1.4%
5 37
 
1.1%
4 31
 
1.0%
Uppercase Letter
ValueCountFrequency (%)
V 230
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3220
93.3%
Latin 230
 
6.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2585
80.3%
2 171
 
5.3%
3 141
 
4.4%
1 59
 
1.8%
6 52
 
1.6%
9 49
 
1.5%
8 49
 
1.5%
7 46
 
1.4%
5 37
 
1.1%
4 31
 
1.0%
Latin
ValueCountFrequency (%)
V 230
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3450
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2585
74.9%
V 230
 
6.7%
2 171
 
5.0%
3 141
 
4.1%
1 59
 
1.7%
6 52
 
1.5%
9 49
 
1.4%
8 49
 
1.4%
7 46
 
1.3%
5 37
 
1.1%

Interactions

2023-12-12T21:07:59.423200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:07:59.197489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:07:59.559079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:07:59.301467image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:08:05.448445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호종류구분교육기관이름교육과정 기간 시작교육과정 기간 종료교육과정 수료일수료증계정 비식별화교육과정 등록일교육과정 수정일교육과정 수정자 비식별화
번호1.0000.6040.4000.8490.8500.8740.9070.9410.9350.865
종류구분0.6041.0000.9291.0000.9920.9960.8450.9320.9250.824
교육기관이름0.4000.9291.0000.9710.9640.9660.4310.8360.7070.911
교육과정 기간 시작0.8491.0000.9711.0000.9980.9950.9640.9590.8490.000
교육과정 기간 종료0.8500.9920.9640.9981.0000.9970.9420.8920.8070.000
교육과정 수료일0.8740.9960.9660.9950.9971.0000.9450.9560.9070.000
수료증계정 비식별화0.9070.8450.4310.9640.9420.9451.0000.9981.0000.000
교육과정 등록일0.9410.9320.8360.9590.8920.9560.9981.0000.9920.892
교육과정 수정일0.9350.9250.7070.8490.8070.9071.0000.9921.0000.949
교육과정 수정자 비식별화0.8650.8240.9110.0000.0000.0000.0000.8920.9491.000
2023-12-12T21:08:05.606349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육과정 기간 종료교육과정 기간 시작교육기관이름종류구분
교육과정 기간 종료1.0000.9320.6000.943
교육과정 기간 시작0.9321.0000.6250.955
교육기관이름0.6000.6251.0000.714
종류구분0.9430.9550.7141.000
2023-12-12T21:08:05.754077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호수료증계정 비식별화종류구분교육기관이름교육과정 기간 시작교육과정 기간 종료
번호1.0000.9880.4450.1350.4790.479
수료증계정 비식별화0.9881.0000.6380.2110.7650.760
종류구분0.4450.6381.0000.7140.9550.943
교육기관이름0.1350.2110.7141.0000.6250.600
교육과정 기간 시작0.4790.7650.9550.6251.0000.932
교육과정 기간 종료0.4790.7600.9430.6000.9321.000

Missing values

2023-12-12T21:07:59.779089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:07:59.977014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T21:08:00.161224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호회원아이디 비식별화종류구분교육기관이름교육과정명교육과정 기간 시작교육과정 기간 종료교육과정 수료일수료증계정 비식별화교육과정 등록일교육과정 수정일교육과정 수정자 비식별화
0477V00000000000098MAIN한국환경산업기술원환경정보 검증위원 온라인 교육0004-50-050004-50-210004-50-2224526142023-04-11<NA><NA>
1476V00000000000085MAIN<NA><NA>0004-50-260004-50-260004-50-2624526092023-04-10<NA><NA>
2475V00000000000088MAIN한국산업환경기술원검증위원기본교육0004-50-050004-50-210004-50-2224525972023-04-062023-04-06V00000000000249
3474V00000000000008MAIN한국환경산업기술원환경정보 검증위원 온라인교육0004-50-050004-50-21<NA>24525712023-04-06<NA><NA>
4473V00000000000001MAIN한국환경산업기술원환경정보검증위원온라인교육0004-50-050004-50-210004-50-2124525702023-04-062023-04-06V00000000000250
5472V00000000000075MAIN한국환경산업기술원환경정보 검증위원 온라인 교육0004-50-050004-50-210004-50-2124525662023-04-05<NA><NA>
6471V00000000000013MAIN사이버환경실무교육환경정보 검증위원 온라인 교육0004-50-050004-50-210004-50-2024525492023-04-042023-04-04V00000000000251
7470V00000000000058MAINKEITI환경정보검증위원 온라인교육0004-50-050004-50-210004-50-2024525442023-04-04<NA><NA>
8469V00000000000023MAIN한국환경산업기술원환경정보 검증위원 온라인 교육0004-50-050004-50-210004-50-2024525432023-04-04<NA><NA>
9468V00000000000039MAIN환경산업기술원환경정보 검증위원 온라인 교육0004-50-050004-50-210004-50-1924525402023-04-042023-04-04V00000000000252
번호회원아이디 비식별화종류구분교육기관이름교육과정명교육과정 기간 시작교육과정 기간 종료교육과정 수료일수료증계정 비식별화교육과정 등록일교육과정 수정일교육과정 수정자 비식별화
46611V00000000000059OPTION한국환경산업기술원제4기 기후변화대응2021-06-012021-06-252021-06-2822963712021-07-22<NA><NA>
46710V00000000000059OPTION한국환경산업기술원제1기 환경분쟁조정2021-06-012021-06-252021-06-2822963702021-07-22<NA><NA>
4689V00000000000059MAIN한국환경산업기술원제1기 환경정보 검증위원 온라인 교육2021-04-192021-06-302021-07-0122963692021-07-22<NA><NA>
4698V00000000000064OPTION한국환경산업기술원제3기 환경안전업무실무2021-05-032021-05-272021-05-2822963662021-07-22<NA><NA>
4707V00000000000064OPTION한국환경산업기술원제3기 대기환경관리계획 및 방지시설관리12021-05-032021-05-272021-05-2822963652021-07-22<NA><NA>
4716V00000000000064MAIN한국환경산업기술원제1기 환경정보 검증위원 온라인 교육2021-04-192021-06-302021-07-0122963642021-07-22<NA><NA>
4724V00000000000026OPTION한국환경산업기술원수처리기술Ⅰ2021-06-012021-06-252021-06-2822963522021-07-22<NA><NA>
4735V00000000000026OPTION한국환경산업기술원환경분쟁조정2021-06-012021-06-252021-06-2822963532021-07-22<NA><NA>
4743V00000000000026MAIN한국환경산업기술원환경정보 검증위원 온라인 교육2021-04-192021-06-302021-07-0122963482021-07-222021-07-22V00000000000267
4752V00000000000195SUB한국품질재단(비대면 시행)환경정보공개시스템 검증위원 보수교육2021-07-27<NA><NA><NA>2021-07-212021-07-27V00000000000343