Overview

Dataset statistics

Number of variables: 11
Number of observations: 277
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 24.5 KiB
Average record size in memory: 90.5 B

Variable types

Numeric: 2
DateTime: 3
Categorical: 5
Text: 1

Dataset

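Description: Data on Korean Red Cross (대한적십자사) disaster education courses (disaster relief and psychological first aid). The columns are 강습번호 (course number), 강습연도 (course year), 강습월 (course month), 기관명 (organization name), 사업구분 (program category), 과정명 (course name), 강습제목 (course title), 강습구분 (course type), 강습일정 (course schedule), 강습장소(명) (venue name), and 강습비 (course fee).
URL: https://www.data.go.kr/data/3068146/fileData.do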

Alerts

사업구분 has constant value "재난" (Constant)
강습구분 is highly overall correlated with 강습비 and 2 other fields (High correlation)
과정명 is highly overall correlated with 강습제목 and 1 other field (High correlation)
강습제목 is highly overall correlated with 과정명 and 1 other field (High correlation)
강습비 is highly overall correlated with 강습구분 (High correlation)
강습구분 is highly imbalanced (68.2%) (Imbalance)
강습번호 has unique values (Unique)
강습비 has 212 (76.5%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 17:57:17.771855
Analysis finished: 2023-12-12 17:57:18.922494
Duration: 1.15 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
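
The reported duration follows directly from the two timestamps above. A minimal sketch of that arithmetic, using only the Python standard library (the variable names are illustrative and not part of the report):

```python
from datetime import datetime

# Timestamp layout used by the "Analysis started/finished" fields.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 17:57:17.771855", FMT)
finished = datetime.strptime("2023-12-12 17:57:18.922494", FMT)

# Elapsed wall-clock time in seconds, before rounding for display.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```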

Variables

강습번호
Real number (ℝ)

UNIQUE 

Distinct: 277
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 68747.238
Minimum: 63264
Maximum: 72408
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.6 KiB

Quantile statistics

Minimum: 63264
5-th percentile: 64961.2
Q1: 66826
Median: 69297
Q3: 70806
95-th percentile: 71497.2
Maximum: 72408
Range: 9144
Interquartile range (IQR): 3980

Descriptive statistics

Standard deviation: 2233.2784
Coefficient of variation (CV): 0.032485355
Kurtosis: -0.91996714
Mean: 68747.238
Median Absolute Deviation (MAD): 1760
Skewness: -0.47688589
Sum: 19042985
Variance: 4987532.6
Monotonicity: Not monotonic
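
Several of the 강습번호 figures are derived from others in the same listing: the range and IQR from the quantiles, the mean from the sum and the row count, and the CV and variance from the standard deviation. The sketch below recomputes them from the rounded values printed in the report, so small rounding differences are expected; the variable names are illustrative only.

```python
# Inputs copied from the report (already rounded for display).
minimum, maximum = 63264, 72408
q1, q3 = 66826, 70806
total, n = 19042985, 277
std = 2233.2784

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
mean = total / n                  # Mean = Sum / number of observations
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance = standard deviation squared

print(value_range, iqr, round(mean, 3), round(cv, 6), round(variance, 1))
```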
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
63284 | 1 | 0.4%
70070 | 1 | 0.4%
71452 | 1 | 0.4%
70328 | 1 | 0.4%
71113 | 1 | 0.4%
70129 | 1 | 0.4%
70215 | 1 | 0.4%
68608 | 1 | 0.4%
69979 | 1 | 0.4%
70850 | 1 | 0.4%
Other values (267) | 267 | 96.4%

Minimum 10 values

Value | Count | Frequency (%)
63264 | 1 | 0.4%
63284 | 1 | 0.4%
63744 | 1 | 0.4%
63930 | 1 | 0.4%
64205 | 1 | 0.4%
64246 | 1 | 0.4%
64379 | 1 | 0.4%
64573 | 1 | 0.4%
64574 | 1 | 0.4%
64575 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
72408 | 1 | 0.4%
72027 | 1 | 0.4%
71825 | 1 | 0.4%
71824 | 1 | 0.4%
71807 | 1 | 0.4%
71621 | 1 | 0.4%
71571 | 1 | 0.4%
71569 | 1 | 0.4%
71567 | 1 | 0.4%
71557 | 1 | 0.4%

강습연월
DateTime

Distinct: 11
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
Minimum: 2022-02-01 00:00:00
Maximum: 2022-12-01 00:00:00
Histogram with fixed size bins (bins=11)

기관명
Categorical

Distinct: 16
Distinct (%): 5.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
경기지사: 30
부산지사: 26
경북지사: 24
서울지사: 21
대구지사: 19
Other values (11): 157

Length

Max length: 7
Median length: 4
Mean length: 4.2599278
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제주지사
2nd row: 경북지사
3rd row: 경북지사
4th row: 부산지사
5th row: 경북지사

Common Values

Value | Count | Frequency (%)
경기지사 | 30 | 10.8%
부산지사 | 26 | 9.4%
경북지사 | 24 | 8.7%
서울지사 | 21 | 7.6%
대구지사 | 19 | 6.9%
경남지사 | 19 | 6.9%
충북지사 | 18 | 6.5%
충남지사 | 16 | 5.8%
울산지사 | 16 | 5.8%
강원지사 | 16 | 5.8%
Other values (6) | 72 | 26.0%

Length

Histogram of lengths of the category

사업구분
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
재난: 277

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 재난
2nd row: 재난
3rd row: 재난
4th row: 재난
5th row: 재난

Common Values

Value | Count | Frequency (%)
재난 | 277 | 100.0%

Length

Histogram of lengths of the category

과정명
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
재해구호 전문인력 양성교육 기본과정: 113
심리적 응급처치: 86
심리사회적지지교육 일반과정(재해구호전문인력 양성교육 특화과정): 39
재해구호 전문인력 양성교육 전문과정: 19
심리사회적지지 강사: 15
Other values (2): 5

Length

Max length: 34
Median length: 19
Mean length: 17.093863
Min length: 8

Unique

Unique: 1
Unique (%): 0.4%

Sample

1st row: 재해구호 전문인력 양성교육 기본과정
2nd row: 재해구호 전문인력 양성교육 기본과정
3rd row: 심리적 응급처치
4th row: 재해구호 전문인력 양성교육 기본과정
5th row: 재해구호 전문인력 양성교육 전문과정

Common Values

Value | Count | Frequency (%)
재해구호 전문인력 양성교육 기본과정 | 113 | 40.8%
심리적 응급처치 | 86 | 31.0%
심리사회적지지교육 일반과정(재해구호전문인력 양성교육 특화과정) | 39 | 14.1%
재해구호 전문인력 양성교육 전문과정 | 19 | 6.9%
심리사회적지지 강사 | 15 | 5.4%
영유아재난안전지도사 과정 | 4 | 1.4%
재난구호교육 강사과정 | 1 | 0.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Word | Count | Frequency (%)
양성교육 | 171 | 19.1%
재해구호 | 132 | 14.7%
전문인력 | 132 | 14.7%
기본과정 | 113 | 12.6%
심리적 | 86 | 9.6%
응급처치 | 86 | 9.6%
심리사회적지지교육 | 39 | 4.4%
일반과정(재해구호전문인력 | 39 | 4.4%
특화과정 | 39 | 4.4%
전문과정 | 19 | 2.1%
Other values (6) | 40 | 4.5%
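
The last 과정명 table counts whitespace-separated words rather than whole values. The sketch below reproduces those counts from the Common Values table. The tokenization rule (split on whitespace, then strip leading and trailing parentheses) is an assumption that happens to match the figures shown; it is not taken from the profiler's source code.

```python
from collections import Counter

# Course names and row counts, copied from the Common Values table.
course_counts = {
    "재해구호 전문인력 양성교육 기본과정": 113,
    "심리적 응급처치": 86,
    "심리사회적지지교육 일반과정(재해구호전문인력 양성교육 특화과정)": 39,
    "재해구호 전문인력 양성교육 전문과정": 19,
    "심리사회적지지 강사": 15,
    "영유아재난안전지도사 과정": 4,
    "재난구호교육 강사과정": 1,
}

word_counts = Counter()
for name, rows in course_counts.items():
    for token in name.split():
        # Assumed rule: drop parentheses at the ends of a token only.
        word_counts[token.strip("()")] += rows

total_words = sum(word_counts.values())
for word, count in word_counts.most_common(4):
    print(word, count, f"{100 * count / total_words:.1f}%")
```

Under this rule a parenthesis inside a token is kept, which is why 일반과정(재해구호전문인력 appears as a single word in the table.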

강습제목
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
재해구호 전문인력 양성교육 기본: 113
심리적 응급처치 과정: 86
심리사회적지지 일반과정: 39
재해구호 전문인력 양성교육 전문: 18
심리사회적지지 강사 보수과정: 15
Other values (3): 6

Length

Max length: 17
Median length: 15
Mean length: 14.212996
Min length: 11

Unique

Unique: 2
Unique (%): 0.7%

Sample

1st row: 재해구호 전문인력 양성교육 기본
2nd row: 재해구호 전문인력 양성교육 기본
3rd row: 심리적 응급처치 과정
4th row: 재해구호 전문인력 양성교육 기본
5th row: 재해구호 전문인력 양성교육 전문

Common Values

Value | Count | Frequency (%)
재해구호 전문인력 양성교육 기본 | 113 | 40.8%
심리적 응급처치 과정 | 86 | 31.0%
심리사회적지지 일반과정 | 39 | 14.1%
재해구호 전문인력 양성교육 전문 | 18 | 6.5%
심리사회적지지 강사 보수과정 | 15 | 5.4%
영유아재난안전지도사과정 | 4 | 1.4%
재난구호교육 강사과정 | 1 | 0.4%
재해구호양성교육전문보수 | 1 | 0.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Word | Count | Frequency (%)
재해구호 | 131 | 14.4%
전문인력 | 131 | 14.4%
양성교육 | 131 | 14.4%
기본 | 113 | 12.4%
심리적 | 86 | 9.4%
응급처치 | 86 | 9.4%
과정 | 86 | 9.4%
심리사회적지지 | 54 | 5.9%
일반과정 | 39 | 4.3%
전문 | 18 | 2.0%
Other values (6) | 37 | 4.1%

강습구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
신규: 261
재강습: 16

Length

Max length: 3
Median length: 2
Mean length: 2.0577617
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신규
2nd row: 신규
3rd row: 신규
4th row: 신규
5th row: 신규

Common Values

Value | Count | Frequency (%)
신규 | 261 | 94.2%
재강습 | 16 | 5.8%
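
The 68.2% imbalance figure reported for 강습구분 is consistent with a score of one minus the normalized Shannon entropy of the class frequencies. The sketch below assumes that definition; the function name `imbalance_score` is hypothetical and this is not presented as the profiler's own code.

```python
from math import log2

# Class counts copied from the Common Values table.
counts = {"신규": 261, "재강습": 16}

def imbalance_score(class_counts):
    """Return 1 - H/log2(k): 0 for a uniform split, near 1 for a dominant class."""
    total = sum(class_counts.values())
    n_classes = len(class_counts)
    if n_classes < 2:
        return 0.0  # a single class has no split to measure
    entropy = -sum(
        (c / total) * log2(c / total) for c in class_counts.values() if c > 0
    )
    return 1 - entropy / log2(n_classes)

score = imbalance_score(counts)
print(f"{score:.3f}")
```

With 261 rows of 신규 against 16 of 재강습 the score comes out at roughly 0.68, in line with the alert.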

Length

Histogram of lengths of the category

강습시작일
DateTime

Distinct: 150
Distinct (%): 54.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
Minimum: 2021-02-24 00:00:00
Maximum: 2021-12-30 00:00:00
Histogram with fixed size bins (bins=50)

강습종료일
DateTime

Distinct: 146
Distinct (%): 52.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
Minimum: 2021-02-24 00:00:00
Maximum: 2021-12-30 00:00:00
Histogram with fixed size bins (bins=50)

강습장소(명)
Text

Distinct: 133
Distinct (%): 48.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB

Length

Max length: 42
Median length: 39
Mean length: 15.126354
Min length: 4

Characters and Unicode

Total characters: 4190
Distinct characters: 163
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
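
As an illustration of those character properties, the sketch below looks up the Unicode general category of every character in one sample value of 강습장소(명) with Python's standard `unicodedata` module. The two-letter codes correspond to the category names used in this report: Lo is Other Letter, Nd is Decimal Number, Zs is Space Separator, Ps is Open Punctuation, Pe is Close Punctuation, and Sm is Math Symbol.

```python
import unicodedata
from collections import Counter

# One of the sample values listed for this column.
sample = "봉사회안동지구협의회 강의장 (20200901~20200902)"

# General category code for each character, tallied.
category_counts = Counter(unicodedata.category(ch) for ch in sample)

for code, count in category_counts.most_common():
    print(code, count)
```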

Unique

Unique: 96
Unique (%): 34.7%

Sample

1st row: 제주지사 2층 강당
2nd row: 경산지구사무실
3rd row: 대한적십자사 경북지사
4th row: 대한적십자사 부산지사
5th row: 봉사회안동지구협의회 강의장 (20200901~20200902)

Word | Count | Frequency (%)
대한적십자사 | 73 | 12.8%
강의장 | 70 | 12.3%
강당 | 25 | 4.4%
부산지사 | 25 | 4.4%
대구적십자사 | 18 | 3.2%
울산지사 | 15 | 2.6%
경남지사 | 13 | 2.3%
적십자 | 12 | 2.1%
제주지사 | 10 | 1.8%
2층 | 10 | 1.8%
Other values (170) | 298 | 52.4%

Most occurring characters

Value | Count | Frequency (%)
2 | 346 | 8.3%
 | 306 | 7.3%
1 | 299 | 7.1%
0 | 299 | 7.1%
(space) | 293 | 7.0%
 | 136 | 3.2%
 | 135 | 3.2%
 | 125 | 3.0%
 | 125 | 3.0%
 | 110 | 2.6%
Other values (153) | 2016 | 48.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2515 | 60.0%
Decimal Number | 1146 | 27.4%
Space Separator | 293 | 7.0%
Close Punctuation | 82 | 2.0%
Open Punctuation | 82 | 2.0%
Math Symbol | 70 | 1.7%
Connector Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 306 | 12.2%
 | 136 | 5.4%
 | 135 | 5.4%
 | 125 | 5.0%
 | 125 | 5.0%
 | 110 | 4.4%
 | 105 | 4.2%
 | 92 | 3.7%
 | 92 | 3.7%
 | 82 | 3.3%
Other values (138) | 1207 | 48.0%

Decimal Number
Value | Count | Frequency (%)
2 | 346 | 30.2%
1 | 299 | 26.1%
0 | 299 | 26.1%
3 | 57 | 5.0%
9 | 39 | 3.4%
6 | 31 | 2.7%
7 | 24 | 2.1%
5 | 21 | 1.8%
4 | 17 | 1.5%
8 | 13 | 1.1%

Space Separator
Value | Count | Frequency (%)
(space) | 293 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 82 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 82 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 70 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2515 | 60.0%
Common | 1675 | 40.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 306 | 12.2%
 | 136 | 5.4%
 | 135 | 5.4%
 | 125 | 5.0%
 | 125 | 5.0%
 | 110 | 4.4%
 | 105 | 4.2%
 | 92 | 3.7%
 | 92 | 3.7%
 | 82 | 3.3%
Other values (138) | 1207 | 48.0%

Common
Value | Count | Frequency (%)
2 | 346 | 20.7%
1 | 299 | 17.9%
0 | 299 | 17.9%
(space) | 293 | 17.5%
) | 82 | 4.9%
( | 82 | 4.9%
~ | 70 | 4.2%
3 | 57 | 3.4%
9 | 39 | 2.3%
6 | 31 | 1.9%
Other values (5) | 77 | 4.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2515 | 60.0%
ASCII | 1675 | 40.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 346 | 20.7%
1 | 299 | 17.9%
0 | 299 | 17.9%
(space) | 293 | 17.5%
) | 82 | 4.9%
( | 82 | 4.9%
~ | 70 | 4.2%
3 | 57 | 3.4%
9 | 39 | 2.3%
6 | 31 | 1.9%
Other values (5) | 77 | 4.6%

Hangul
Value | Count | Frequency (%)
 | 306 | 12.2%
 | 136 | 5.4%
 | 135 | 5.4%
 | 125 | 5.0%
 | 125 | 5.0%
 | 110 | 4.4%
 | 105 | 4.2%
 | 92 | 3.7%
 | 92 | 3.7%
 | 82 | 3.3%
Other values (138) | 1207 | 48.0%

강습비
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 7
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9265.343
Minimum: 0
Maximum: 60000
Zeros: 212
Zeros (%): 76.5%
Negative: 0
Negative (%): 0.0%
Memory size: 2.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 50000
Maximum: 60000
Range: 60000
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 17728.605
Coefficient of variation (CV): 1.9134321
Kurtosis: 0.97306719
Mean: 9265.343
Median Absolute Deviation (MAD): 0
Skewness: 1.5933917
Sum: 2566500
Variance: 3.1430343 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)

Common values

Value | Count | Frequency (%)
0 | 212 | 76.5%
30000 | 29 | 10.5%
50000 | 22 | 7.9%
40000 | 7 | 2.5%
60000 | 5 | 1.8%
15000 | 1 | 0.4%
1500 | 1 | 0.4%

Minimum 10 values

Value | Count | Frequency (%)
0 | 212 | 76.5%
1500 | 1 | 0.4%
15000 | 1 | 0.4%
30000 | 29 | 10.5%
40000 | 7 | 2.5%
50000 | 22 | 7.9%
60000 | 5 | 1.8%

Maximum 10 values

Value | Count | Frequency (%)
60000 | 5 | 1.8%
50000 | 22 | 7.9%
40000 | 7 | 2.5%
30000 | 29 | 10.5%
15000 | 1 | 0.4%
1500 | 1 | 0.4%
0 | 212 | 76.5%
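
Because 강습비 takes only seven distinct values, its summary statistics can be rebuilt from the frequency table alone. The sketch below expands the table into a list and recomputes the sum, mean, share of zeros, median, and the sample (n - 1) standard deviation and variance with Python's `statistics` module. The use of the sample formulas is an assumption, chosen because it agrees with the reported figures.

```python
import statistics

# Fee value -> number of rows, copied from the frequency table.
fee_counts = {0: 212, 30000: 29, 50000: 22, 40000: 7, 60000: 5, 15000: 1, 1500: 1}

# Expand the table into one entry per row.
fees = [fee for fee, rows in fee_counts.items() for _ in range(rows)]

n = len(fees)
total = sum(fees)
mean = total / n
zeros_share = fee_counts[0] / n
median = statistics.median(fees)
std = statistics.stdev(fees)          # sample standard deviation (n - 1)
variance = statistics.variance(fees)  # sample variance (n - 1)
cv = std / mean

print(n, total, round(mean, 3), f"{zeros_share:.1%}", median)
print(round(std, 3), f"{variance:.7e}", round(cv, 4))
```

More than three quarters of the courses are free, which is why the median, Q3, and IQR are all 0 while the mean is about 9,265.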

Interactions

[Figures: four interaction plots]

Correlations

Variable | 강습번호 | 강습연월 | 기관명 | 과정명 | 강습제목 | 강습구분 | 강습비
강습번호 | 1.000 | 0.842 | 0.423 | 0.148 | 0.127 | 0.173 | 0.276
강습연월 | 0.842 | 1.000 | 0.363 | 0.164 | 0.123 | 0.087 | 0.219
기관명 | 0.423 | 0.363 | 1.000 | 0.625 | 0.696 | 0.000 | 0.271
과정명 | 0.148 | 0.164 | 0.625 | 1.000 | 1.000 | 0.901 | 0.573
강습제목 | 0.127 | 0.123 | 0.696 | 1.000 | 1.000 | 1.000 | 0.609
강습구분 | 0.173 | 0.087 | 0.000 | 0.901 | 1.000 | 1.000 | 0.811
강습비 | 0.276 | 0.219 | 0.271 | 0.573 | 0.609 | 0.811 | 1.000

Variable | 기관명 | 강습구분 | 과정명 | 강습제목
기관명 | 1.000 | 0.000 | 0.341 | 0.314
강습구분 | 0.000 | 1.000 | 0.959 | 0.989
과정명 | 0.341 | 0.959 | 1.000 | 0.998
강습제목 | 0.314 | 0.989 | 0.998 | 1.000

Variable | 강습번호 | 강습비 | 기관명 | 과정명 | 강습제목 | 강습구분
강습번호 | 1.000 | -0.097 | 0.189 | 0.066 | 0.052 | 0.108
강습비 | -0.097 | 1.000 | 0.130 | 0.386 | 0.394 | 0.611
기관명 | 0.189 | 0.130 | 1.000 | 0.341 | 0.314 | 0.000
과정명 | 0.066 | 0.386 | 0.341 | 1.000 | 0.998 | 0.959
강습제목 | 0.052 | 0.394 | 0.314 | 0.998 | 1.000 | 0.989
강습구분 | 0.108 | 0.611 | 0.000 | 0.959 | 0.989 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Row | 강습번호 | 강습연월 | 기관명 | 사업구분 | 과정명 | 강습제목 | 강습구분 | 강습시작일 | 강습종료일 | 강습장소(명) | 강습비
0 | 63284 | 2022-02 | 제주지사 | 재난 | 재해구호 전문인력 양성교육 기본과정 | 재해구호 전문인력 양성교육 기본 | 신규 | 2021-02-24 | 2021-02-24 | 제주지사 2층 강당 | 50000
1 | 63930 | 2022-03 | 경북지사 | 재난 | 재해구호 전문인력 양성교육 기본과정 | 재해구호 전문인력 양성교육 기본 | 신규 | 2021-03-06 | 2021-03-06 | 경산지구사무실 | 50000
2 | 64246 | 2022-03 | 경북지사 | 재난 | 심리적 응급처치 | 심리적 응급처치 과정 | 신규 | 2021-03-23 | 2021-03-23 | 대한적십자사 경북지사 | 0
3 | 64379 | 2022-03 | 부산지사 | 재난 | 재해구호 전문인력 양성교육 기본과정 | 재해구호 전문인력 양성교육 기본 | 신규 | 2021-03-23 | 2021-03-23 | 대한적십자사 부산지사 | 0
4 | 64205 | 2022-03 | 경북지사 | 재난 | 재해구호 전문인력 양성교육 전문과정 | 재해구호 전문인력 양성교육 전문 | 신규 | 2021-03-20 | 2021-03-27 | 봉사회안동지구협의회 강의장 (20200901~20200902) | 0
5 | 64573 | 2022-03 | 대구지사 | 재난 | 심리적 응급처치 | 심리적 응급처치 과정 | 신규 | 2021-03-29 | 2021-03-29 | 대구적십자사 | 0
6 | 64574 | 2022-03 | 대구지사 | 재난 | 심리적 응급처치 | 심리적 응급처치 과정 | 신규 | 2021-03-30 | 2021-03-30 | 대구적십자사 | 0
7 | 64575 | 2022-03 | 대구지사 | 재난 | 심리적 응급처치 | 심리적 응급처치 과정 | 신규 | 2021-03-31 | 2021-03-31 | 대구적십자사 | 0
8 | 63264 | 2022-03 | 제주지사 | 재난 | 심리사회적지지교육 일반과정(재해구호전문인력 양성교육 특화과정) | 심리사회적지지 일반과정 | 신규 | 2021-03-30 | 2021-03-31 | 제주지사 2층 강당 | 0
9 | 64576 | 2022-04 | 대구지사 | 재난 | 심리적 응급처치 | 심리적 응급처치 과정 | 신규 | 2021-04-01 | 2021-04-01 | 대구적십자사 | 0

Last rows

Row | 강습번호 | 강습연월 | 기관명 | 사업구분 | 과정명 | 강습제목 | 강습구분 | 강습시작일 | 강습종료일 | 강습장소(명) | 강습비
267 | 71453 | 2022-12 | 충북지사 | 재난 | 심리사회적지지교육 일반과정(재해구호전문인력 양성교육 특화과정) | 심리사회적지지 일반과정 | 신규 | 2021-12-16 | 2021-12-16 | 보은봉사관 | 0
268 | 71569 | 2022-12 | 대전·세종지사 | 재난 | 재해구호 전문인력 양성교육 전문과정 | 재해구호 전문인력 양성교육 전문 | 신규 | 2021-12-14 | 2021-12-16 | 대한적십자사 대전세종지사 | 0
269 | 71502 | 2022-12 | 경기지사 | 재난 | 재해구호 전문인력 양성교육 전문과정 | 재해구호 전문인력 양성교육 전문 | 신규 | 2021-12-15 | 2021-12-17 | 적십자 중부봉사관 강의장 (20211215~20211217) | 0
270 | 71319 | 2022-12 | 제주지사 | 재난 | 재해구호 전문인력 양성교육 기본과정 | 재해구호 전문인력 양성교육 기본 | 신규 | 2021-12-17 | 2021-12-17 | 제주지사 2층 강당 | 0
271 | 71479 | 2022-12 | 울산지사 | 재난 | 재해구호 전문인력 양성교육 기본과정 | 재해구호 전문인력 양성교육 기본 | 신규 | 2021-12-17 | 2021-12-17 | 대한적십자사 울산지사 | 0
272 | 70943 | 2022-12 | 대구지사 | 재난 | 심리사회적지지교육 일반과정(재해구호전문인력 양성교육 특화과정) | 심리사회적지지 일반과정 | 신규 | 2021-12-18 | 2021-12-18 | 대구적십자사 | 60000
273 | 71505 | 2022-12 | 부산지사 | 재난 | 재해구호 전문인력 양성교육 기본과정 | 재해구호 전문인력 양성교육 기본 | 신규 | 2021-12-18 | 2021-12-18 | 대한적십자사 부산지사 | 50000
274 | 71488 | 2022-12 | 경기지사 | 재난 | 심리적 응급처치 | 심리적 응급처치 과정 | 신규 | 2021-12-23 | 2021-12-23 | 적십자 서부봉사관 강의장 (20211223~20211223) | 0
275 | 71621 | 2022-12 | 전북지사 | 재난 | 재해구호 전문인력 양성교육 전문과정 | 재해구호 전문인력 양성교육 전문 | 신규 | 2021-12-27 | 2021-12-29 | 대한적십자사 전북지사 | 0
276 | 71497 | 2022-12 | 경기지사 | 재난 | 심리적 응급처치 | 심리적 응급처치 과정 | 신규 | 2021-12-30 | 2021-12-30 | 적십자 서부봉사관 강의장 (20211230~20211230) | 0