Overview

Dataset statistics

Number of variables7
Number of observations1024
Missing cells0
Missing cells (%)0.0%
Duplicate rows5
Duplicate rows (%)0.5%
Total size in memory59.1 KiB
Average record size in memory59.1 B

Variable types

Categorical4
Text1
Numeric2

Dataset

Description한국가스안전공사 실시하는 교육과정에 따른 세부정보(교육인원, 교육장소, 분류기준)를 공개하여 일반국민분들에게 편의를 제공하기 위한 데이티업니다.
Author한국가스안전공사
URLhttps://www.data.go.kr/data/15067785/fileData.do

Alerts

교육년도 has constant value ""Constant
Dataset has 5 (0.5%) duplicate rowsDuplicates
교육장소 is highly overall correlated with 차수 and 2 other fieldsHigh correlation
부서 is highly overall correlated with 교육장소High correlation
차수 is highly overall correlated with 교육장소High correlation
교육종류 is highly overall correlated with 교육장소High correlation
대상인원 has 19 (1.9%) zerosZeros

Reproduction

Analysis started2023-12-12 22:40:39.498247
Analysis finished2023-12-12 22:40:40.339415
Duration0.84 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

교육년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023
1024 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023
2nd row2023
3rd row2023
4th row2023
5th row2023

Common Values

ValueCountFrequency (%)
2023 1024
100.0%

Length

2023-12-13T07:40:40.397132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:40:40.469516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023 1024
100.0%

부서
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
교육연수실
147 
서울광역본부
 
63
충남본부
 
62
충북본부
 
49
경남본부
 
49
Other values (25)
654 

Length

Max length6
Median length6
Mean length5.2568359
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교육연수실
2nd row교육연수실
3rd row교육연수실
4th row교육연수실
5th row교육연수실

Common Values

ValueCountFrequency (%)
교육연수실 147
 
14.4%
서울광역본부 63
 
6.2%
충남본부 62
 
6.1%
충북본부 49
 
4.8%
경남본부 49
 
4.8%
대구광역본부 48
 
4.7%
대전광역본부 48
 
4.7%
전북본부 47
 
4.6%
경기광역본부 47
 
4.6%
인천본부 44
 
4.3%
Other values (20) 420
41.0%

Length

2023-12-13T07:40:40.575406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교육연수실 147
 
14.4%
서울광역본부 63
 
6.2%
충남본부 62
 
6.1%
충북본부 49
 
4.8%
경남본부 49
 
4.8%
대구광역본부 48
 
4.7%
대전광역본부 48
 
4.7%
전북본부 47
 
4.6%
경기광역본부 47
 
4.6%
인천본부 44
 
4.3%
Other values (20) 420
41.0%

교육종류
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
전문교육 보수과정
626 
특별교육
171 
전문교육 신규과정
162 
양성교육
 
52
위탁교육
 
13

Length

Max length9
Median length9
Mean length7.8476562
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row특별교육
2nd row특별교육
3rd row양성교육
4th row양성교육
5th row양성교육

Common Values

ValueCountFrequency (%)
전문교육 보수과정 626
61.1%
특별교육 171
 
16.7%
전문교육 신규과정 162
 
15.8%
양성교육 52
 
5.1%
위탁교육 13
 
1.3%

Length

2023-12-13T07:40:40.699432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:40:40.791880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전문교육 788
43.5%
보수과정 626
34.5%
특별교육 171
 
9.4%
신규과정 162
 
8.9%
양성교육 52
 
2.9%
위탁교육 13
 
0.7%
Distinct80
Distinct (%)7.8%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-13T07:40:40.985104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length25
Mean length11.969727
Min length3

Characters and Unicode

Total characters12257
Distinct characters111
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)2.9%

Sample

1st row공기충전시설 안전관리책임자(온라인)
2nd row보수·유지관리원
3rd row일반시설안전관리자
4th row일반시설안전관리자
5th row일반시설안전관리자
ValueCountFrequency (%)
운반책임자 50
 
4.2%
가스시설시공업제1종 45
 
3.7%
가스시설시공업제2종 44
 
3.7%
냉동·냉동기제조시설 42
 
3.5%
특고사용시설(온라인교육 41
 
3.4%
사용시설(lp가스)(온라인교육 41
 
3.4%
사용시설(도시가스)(온라인교육 41
 
3.4%
lpg충전·저장 38
 
3.2%
집단공급시설 38
 
3.2%
가스판매시설(액 37
 
3.1%
Other values (85) 784
65.3%
2023-12-13T07:40:41.287858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1015
 
8.3%
785
 
6.4%
( 631
 
5.1%
) 631
 
5.1%
487
 
4.0%
486
 
4.0%
427
 
3.5%
421
 
3.4%
342
 
2.8%
297
 
2.4%
Other values (101) 6735
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9977
81.4%
Open Punctuation 631
 
5.1%
Close Punctuation 631
 
5.1%
Uppercase Letter 466
 
3.8%
Other Punctuation 243
 
2.0%
Space Separator 177
 
1.4%
Decimal Number 131
 
1.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1015
 
10.2%
785
 
7.9%
487
 
4.9%
486
 
4.9%
427
 
4.3%
421
 
4.2%
342
 
3.4%
297
 
3.0%
297
 
3.0%
296
 
3.0%
Other values (89) 5124
51.4%
Uppercase Letter
ValueCountFrequency (%)
P 181
38.8%
L 179
38.4%
G 104
22.3%
E 2
 
0.4%
Decimal Number
ValueCountFrequency (%)
1 49
37.4%
2 47
35.9%
3 35
26.7%
Open Punctuation
ValueCountFrequency (%)
( 631
100.0%
Close Punctuation
ValueCountFrequency (%)
) 631
100.0%
Other Punctuation
ValueCountFrequency (%)
· 243
100.0%
Space Separator
ValueCountFrequency (%)
177
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9977
81.4%
Common 1813
 
14.8%
Latin 467
 
3.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1015
 
10.2%
785
 
7.9%
487
 
4.9%
486
 
4.9%
427
 
4.3%
421
 
4.2%
342
 
3.4%
297
 
3.0%
297
 
3.0%
296
 
3.0%
Other values (89) 5124
51.4%
Common
ValueCountFrequency (%)
( 631
34.8%
) 631
34.8%
· 243
 
13.4%
177
 
9.8%
1 49
 
2.7%
2 47
 
2.6%
3 35
 
1.9%
Latin
ValueCountFrequency (%)
P 181
38.8%
L 179
38.3%
G 104
22.3%
E 2
 
0.4%
1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9977
81.4%
ASCII 2036
 
16.6%
None 243
 
2.0%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1015
 
10.2%
785
 
7.9%
487
 
4.9%
486
 
4.9%
427
 
4.3%
421
 
4.2%
342
 
3.4%
297
 
3.0%
297
 
3.0%
296
 
3.0%
Other values (89) 5124
51.4%
ASCII
ValueCountFrequency (%)
( 631
31.0%
) 631
31.0%
P 181
 
8.9%
L 179
 
8.8%
177
 
8.7%
G 104
 
5.1%
1 49
 
2.4%
2 47
 
2.3%
3 35
 
1.7%
E 2
 
0.1%
None
ValueCountFrequency (%)
· 243
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

차수
Real number (ℝ)

HIGH CORRELATION 

Distinct64
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.691406
Minimum0
Maximum999
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size9.1 KiB
2023-12-13T07:40:41.408490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32.25
95-th percentile26.85
Maximum999
Range999
Interquartile range (IQR)1.25

Descriptive statistics

Standard deviation78.450615
Coefficient of variation (CV)6.1813966
Kurtosis100.30016
Mean12.691406
Median Absolute Deviation (MAD)0
Skewness9.7133187
Sum12996
Variance6154.499
MonotonicityNot monotonic
2023-12-13T07:40:41.515406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 559
54.6%
2 208
 
20.3%
3 71
 
6.9%
4 26
 
2.5%
5 14
 
1.4%
6 11
 
1.1%
7 10
 
1.0%
9 10
 
1.0%
8 10
 
1.0%
10 8
 
0.8%
Other values (54) 97
 
9.5%
ValueCountFrequency (%)
0 1
 
0.1%
1 559
54.6%
2 208
 
20.3%
3 71
 
6.9%
4 26
 
2.5%
5 14
 
1.4%
6 11
 
1.1%
7 10
 
1.0%
8 10
 
1.0%
9 10
 
1.0%
ValueCountFrequency (%)
999 3
 
0.3%
666 1
 
0.1%
601 8
0.8%
71 1
 
0.1%
70 1
 
0.1%
69 1
 
0.1%
68 1
 
0.1%
67 4
0.4%
66 2
 
0.2%
65 1
 
0.1%

대상인원
Real number (ℝ)

ZEROS 

Distinct78
Distinct (%)7.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean590.87598
Minimum0
Maximum9999
Zeros19
Zeros (%)1.9%
Negative0
Negative (%)0.0%
Memory size9.1 KiB
2023-12-13T07:40:41.620085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5
Q118
median35
Q3170
95-th percentile5000
Maximum9999
Range9999
Interquartile range (IQR)152

Descriptive statistics

Standard deviation1530.4402
Coefficient of variation (CV)2.5901209
Kurtosis14.062178
Mean590.87598
Median Absolute Deviation (MAD)25
Skewness3.5699698
Sum605057
Variance2342247.3
MonotonicityNot monotonic
2023-12-13T07:40:41.758204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
500 80
 
7.8%
30 77
 
7.5%
20 71
 
6.9%
5000 64
 
6.2%
10 55
 
5.4%
50 53
 
5.2%
5 48
 
4.7%
40 48
 
4.7%
1000 48
 
4.7%
15 39
 
3.8%
Other values (68) 441
43.1%
ValueCountFrequency (%)
0 19
 
1.9%
1 3
 
0.3%
2 4
 
0.4%
3 14
 
1.4%
4 8
 
0.8%
5 48
4.7%
6 1
 
0.1%
7 4
 
0.4%
8 10
 
1.0%
9 2
 
0.2%
ValueCountFrequency (%)
9999 9
 
0.9%
5000 64
6.2%
3000 12
 
1.2%
2000 17
 
1.7%
1500 1
 
0.1%
1350 1
 
0.1%
1300 1
 
0.1%
1000 48
4.7%
810 1
 
0.1%
550 1
 
0.1%

교육장소
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
<NA>
407 
서울광역본부 교육장
51 
충남본부 교육장
50 
대전광역본부 3층 교육장
 
37
경남본부 교육장
 
37
Other values (21)
442 

Length

Max length24
Median length22
Mean length8.6796875
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 407
39.7%
서울광역본부 교육장 51
 
5.0%
충남본부 교육장 50
 
4.9%
대전광역본부 3층 교육장 37
 
3.6%
경남본부 교육장 37
 
3.6%
충북본부(청주시 흥덕구 송정동 140-54) 36
 
3.5%
대구광역본부 3층 교육장 36
 
3.5%
인천본부 교육장 32
 
3.1%
전북본부 교육장 31
 
3.0%
제주본부 29
 
2.8%
Other values (16) 278
27.1%

Length

2023-12-13T07:40:41.871377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 407
23.2%
교육장 394
22.5%
3층 73
 
4.2%
서울광역본부 51
 
2.9%
충남본부 50
 
2.8%
대전광역본부 37
 
2.1%
경남본부 37
 
2.1%
한국가스안전공사 36
 
2.1%
대구광역본부 36
 
2.1%
140-54 36
 
2.1%
Other values (29) 598
34.1%

Interactions

2023-12-13T07:40:39.993132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:40:39.826936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:40:40.071114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T07:40:39.914324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:40:41.932312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
부서교육종류교육과정차수대상인원교육장소
부서1.0000.7560.0000.5660.8091.000
교육종류0.7561.0000.9930.1090.3880.728
교육과정0.0000.9931.0000.0000.4230.348
차수0.5660.1090.0001.0000.7231.000
대상인원0.8090.3880.4230.7231.0000.715
교육장소1.0000.7280.3481.0000.7151.000
2023-12-13T07:40:42.010113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교육종류교육장소부서
교육종류1.0000.5060.432
교육장소0.5061.0000.997
부서0.4320.9971.000
2023-12-13T07:40:42.323402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
차수대상인원부서교육종류교육장소
차수1.000-0.2350.3150.0820.981
대상인원-0.2351.0000.4700.2750.491
부서0.3150.4701.0000.4320.997
교육종류0.0820.2750.4321.0000.506
교육장소0.9810.4910.9970.5061.000

Missing values

2023-12-13T07:40:40.195043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:40:40.297051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

교육년도부서교육종류교육과정차수대상인원교육장소
02023교육연수실특별교육공기충전시설 안전관리책임자(온라인)11500<NA>
12023교육연수실특별교육보수·유지관리원525<NA>
22023교육연수실양성교육일반시설안전관리자885<NA>
32023교육연수실양성교육일반시설안전관리자990<NA>
42023교육연수실양성교육일반시설안전관리자1090<NA>
52023교육연수실양성교육냉동시설안전관리자100<NA>
62023교육연수실양성교육양성교육745<NA>
72023교육연수실양성교육LPG충전시설안전관리자840<NA>
82023교육연수실양성교육LPG충전시설안전관리자940<NA>
92023교육연수실양성교육LPG충전시설안전관리자1040<NA>
교육년도부서교육종류교육과정차수대상인원교육장소
10142023충북북부지사전문교육 보수과정판매시설110충북북부지사(충주시 탄금대로117 2층)
10152023충북북부지사전문교육 보수과정특고사용시설15충북북부지사(충주시 탄금대로117 2층)
10162023충북북부지사전문교육 보수과정특고사용시설(온라인교육)1500온라인
10172023충북북부지사전문교육 보수과정가스판매시설(액)120충북북부지사(충주시 탄금대로117 2층)
10182023충북북부지사전문교육 보수과정사용시설(LP가스)110충북북부지사(충주시 탄금대로117 2층)
10192023충북북부지사전문교육 보수과정사용시설(LP가스)(온라인교육)1500온라인
10202023충북북부지사전문교육 보수과정사용시설(도시가스)115충북북부지사(충주시 탄금대로117 2층)
10212023충북북부지사전문교육 보수과정사용시설(도시가스)(온라인교육)1500온라인
10222023충북북부지사전문교육 보수과정가스시설시공업제1종115충북북부지사(충주시 탄금대로117 2층)
10232023충북북부지사전문교육 보수과정가스시설시공업제2종115충북북부지사(충주시 탄금대로117 2층)

Duplicate rows

Most frequently occurring

교육년도부서교육종류교육과정차수대상인원교육장소# duplicates
02023강원영동지사전문교육 보수과정운반책임자115강원영동지사 교육장2
12023경북북부지사전문교육 보수과정운반책임자110경북북부지사2
22023대구광역본부전문교육 보수과정운반책임자240대구광역본부 3층 교육장2
32023대전광역본부전문교육 보수과정운반책임자120대전광역본부 3층 교육장2
42023서울광역본부전문교육 보수과정운반책임자2100서울광역본부 교육장2