Overview

Dataset statistics

Number of variables: 5
Number of observations: 53
Missing cells: 63
Missing cells (%): 23.8%
Duplicate rows: 1
Duplicate rows (%): 1.9%
Total size in memory: 2.3 KiB
Average record size in memory: 44.5 B
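
The missing-cell percentage follows from the counts above: 63 missing cells out of 53 × 5 = 265 cells. A minimal sketch of that arithmetic, where the helper name `missing_cell_pct` is an illustrative assumption and not part of the profiling tool:

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells among all cells, as a percentage."""
    total_cells = n_rows * n_cols
    if total_cells == 0:
        raise ValueError("dataset has no cells")
    return 100.0 * missing_cells / total_cells


# Counts taken from the dataset statistics in this report.
pct = missing_cell_pct(missing_cells=63, n_rows=53, n_cols=5)
print(f"{pct:.1f}%")
```

The same ratio applied to the duplicate count (1 of 53 rows) gives the 1.9% reported for duplicate rows.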

Variable types

Numeric: 1
Categorical: 2
Text: 2

Dataset

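Description: Training schedule data for childcare staff from the Namdong-gu Support Center for Childcare (육아종합지원센터) in Incheon Metropolitan City. It provides five fields: serial number (연번), training category (교육구분), training date and time (교육일시), training title (교육명), and capacity (정원).
Author: Namdong-gu, Incheon Metropolitan City (인천광역시 남동구)
URL: https://www.data.go.kr/data/15090758/fileData.do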

Alerts

Dataset has 1 (1.9%) duplicate rows [Duplicates]
교육구분 is highly overall correlated with 정원(명) [High correlation]
정원(명) is highly overall correlated with 교육구분 [High correlation]
연번 has 21 (39.6%) missing values [Missing]
교육일시 has 21 (39.6%) missing values [Missing]
교육명 has 21 (39.6%) missing values [Missing]

Reproduction

Analysis started: 2023-12-11 23:06:40.713251
Analysis finished: 2023-12-11 23:06:41.443922
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

MISSING 

Distinct: 32
Distinct (%): 100.0%
Missing: 21
Missing (%): 39.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 16.5
Minimum: 1
Maximum: 32
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 609.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.55
Q1: 8.75
Median: 16.5
Q3: 24.25
95-th percentile: 30.45
Maximum: 32
Range: 31
Interquartile range (IQR): 15.5

Descriptive statistics

Standard deviation: 9.3808315
Coefficient of variation (CV): 0.56853524
Kurtosis: -1.2
Mean: 16.5
Median Absolute Deviation (MAD): 8
Skewness: 0
Sum: 528
Variance: 88
Monotonicity: Strictly increasing
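
Because 연번 is strictly increasing with 32 distinct values between 1 and 32, the non-missing values must be the integers 1 through 32, so most of the figures above can be checked by hand. The sketch below uses only Python's standard `statistics` module. It assumes sample (n - 1) variance and linearly interpolated quantiles; those choices are inferred from the reported numbers, not taken from the profiling tool's documentation.

```python
import statistics

# Assumption: the 32 non-missing 연번 values are exactly 1..32.
values = [float(i) for i in range(1, 33)]

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divides by n - 1
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# "inclusive" interpolates linearly between the min and max of the data.
quartiles = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

print(f"mean={mean} variance={variance} std={std:.7f} cv={cv:.8f}")
print(f"Q1={quartiles[0]} median={quartiles[1]} Q3={quartiles[2]}")
print(f"p5={p5:.2f} p95={p95:.2f} MAD={mad}")
```

Under these assumptions the script reproduces the mean, variance, standard deviation, CV, MAD, quartiles, and 5th/95th percentiles listed in this section.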
Histogram with fixed size bins (bins=32)
Value               Count  Frequency (%)
18                  1      1.9%
32                  1      1.9%
31                  1      1.9%
30                  1      1.9%
29                  1      1.9%
28                  1      1.9%
27                  1      1.9%
26                  1      1.9%
25                  1      1.9%
24                  1      1.9%
Other values (22)   22     41.5%
(Missing)           21     39.6%
Value  Count  Frequency (%)
1      1      1.9%
2      1      1.9%
3      1      1.9%
4      1      1.9%
5      1      1.9%
6      1      1.9%
7      1      1.9%
8      1      1.9%
9      1      1.9%
10     1      1.9%

Value  Count  Frequency (%)
32     1      1.9%
31     1      1.9%
30     1      1.9%
29     1      1.9%
28     1      1.9%
27     1      1.9%
26     1      1.9%
25     1      1.9%
24     1      1.9%
23     1      1.9%

교육구분
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 13.2%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 6
Median length: 4
Mean length: 4.5849057
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 역량강화교육
2nd row: 안전교육
3rd row: 역량강화교육
4th row: 안전교육
5th row: 역량강화교육

Common Values

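Value                                        Count  Frequency (%)
<NA>                                         21     39.6%
안전교육 (safety training)                     12     22.6%
역량강화교육 (capacity-building training)       5      9.4%
보육컨설팅 (childcare consulting)               5      9.4%
표준보육과정 (standard childcare curriculum)    4      7.5%
힐링프로그램 (healing program)                  4      7.5%
취약보육 (vulnerable-group childcare)          2      3.8%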

Length

Histogram of lengths of the category

Common Values (Plot)

Value          Count  Frequency (%)
na             21     39.6%
안전교육        12     22.6%
역량강화교육     5      9.4%
보육컨설팅       5      9.4%
표준보육과정     4      7.5%
힐링프로그램     4      7.5%
취약보육        2      3.8%

교육일시
Text

MISSING 

Distinct: 32
Distinct (%): 100.0%
Missing: 21
Missing (%): 39.6%
Memory size: 556.0 B

Length

Max length: 28
Median length: 25
Mean length: 25.46875
Min length: 24

Characters and Unicode

Total characters: 815
Distinct characters: 24
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 32
Unique (%): 100.0%

Sample

1st row: 1월12일(목) 16:00~18:00(2시간)
2nd row: 1월26일(목) 16:00~18:00(2시간)
3rd row: 2월2일(목) 16:00~18:00(2시간)
4th row: 2월7일(화) 16:00~18:00(2시간)
5th row: 3월22일(수) 16:00~18:00(2시간)
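
The sample rows share one layout: month (월), day (일), a one-character weekday in parentheses, a start~end time range, and a duration in parentheses. Since 교육일시 is profiled as free text, a consumer would have to parse it before doing any date arithmetic. The sketch below is one possible parser; the `Session` type, the `parse_session` name, and the regular expression are illustrative assumptions derived from the sample rows only, and the year is not present in the field.

```python
import re
from typing import NamedTuple, Optional


class Session(NamedTuple):
    month: int
    day: int
    weekday: str         # single Hangul weekday character, e.g. "목"
    start_minutes: int   # minutes after midnight
    end_minutes: int


# Optional whitespace between "월" and the day is tolerated as a precaution.
_PATTERN = re.compile(
    r"(\d{1,2})월\s*(\d{1,2})일\((.)\)\s*"
    r"(\d{1,2}):(\d{2})~(\d{1,2}):(\d{2})"
)


def parse_session(text: str) -> Optional[Session]:
    """Parse one 교육일시 string; return None if the layout does not match."""
    m = _PATTERN.match(text.strip())
    if m is None:
        return None
    start = int(m.group(4)) * 60 + int(m.group(5))
    end = int(m.group(6)) * 60 + int(m.group(7))
    return Session(int(m.group(1)), int(m.group(2)), m.group(3), start, end)


session = parse_session("1월12일(목) 16:00~18:00(2시간)")
if session is not None:
    print(session, session.end_minutes - session.start_minutes)
```

Returning `None` instead of raising keeps the 21 missing or otherwise malformed values easy to filter out in a single pass.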
Value                   Count  Frequency (%)
16:00~18:00(2시간        28     35.9%
7월                     8      10.3%
6월                     6      7.7%
16:00~17:30(1시간30분    2      2.6%
9일(금                  1      1.3%
15일(목                 1      1.3%
16일(금                 1      1.3%
21일(수                 1      1.3%
28일(수                 1      1.3%
30일(금                 1      1.3%
Other values (28)       28     35.9%

Most occurring characters

ValueCountFrequency (%)
0 129
15.8%
1 88
10.8%
( 64
 
7.9%
) 64
 
7.9%
: 64
 
7.9%
46
 
5.6%
2 44
 
5.4%
6 39
 
4.8%
34
 
4.2%
32
 
3.9%
Other values (14) 211
25.9%

Most occurring categories

Value              Count  Frequency (%)
Decimal Number     383    47.0%
Other Letter       162    19.9%
Open Punctuation   64     7.9%
Close Punctuation  64     7.9%
Other Punctuation  64     7.9%
Space Separator    46     5.6%
Math Symbol        32     3.9%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      129    33.7%
1      88     23.0%
2      44     11.5%
6      39     10.2%
8      32     8.4%
3      15     3.9%
7      14     3.7%
5      12     3.1%
4      6      1.6%
9      4      1.0%
Other Letter
ValueCountFrequency (%)
34
21.0%
32
19.8%
32
19.8%
32
19.8%
10
 
6.2%
9
 
5.6%
8
 
4.9%
3
 
1.9%
2
 
1.2%
Open Punctuation
Value    Count  Frequency (%)
(        64     100.0%
Close Punctuation
Value    Count  Frequency (%)
)        64     100.0%
Other Punctuation
Value    Count  Frequency (%)
:        64     100.0%
Space Separator
Value    Count  Frequency (%)
(space)  46     100.0%
Math Symbol
Value    Count  Frequency (%)
~        32     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  653    80.1%
Hangul  162    19.9%

Most frequent character per script

Common
Value             Count  Frequency (%)
0                 129    19.8%
1                 88     13.5%
(                 64     9.8%
)                 64     9.8%
:                 64     9.8%
(space)           46     7.0%
2                 44     6.7%
6                 39     6.0%
8                 32     4.9%
~                 32     4.9%
Other values (5)  51     7.8%
Hangul
ValueCountFrequency (%)
34
21.0%
32
19.8%
32
19.8%
32
19.8%
10
 
6.2%
9
 
5.6%
8
 
4.9%
3
 
1.9%
2
 
1.2%

Most occurring blocks

Value   Count  Frequency (%)
ASCII   653    80.1%
Hangul  162    19.9%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
0                 129    19.8%
1                 88     13.5%
(                 64     9.8%
)                 64     9.8%
:                 64     9.8%
(space)           46     7.0%
2                 44     6.7%
6                 39     6.0%
8                 32     4.9%
~                 32     4.9%
Other values (5)  51     7.8%
Hangul
ValueCountFrequency (%)
34
21.0%
32
19.8%
32
19.8%
32
19.8%
10
 
6.2%
9
 
5.6%
8
 
4.9%
3
 
1.9%
2
 
1.2%

교육명
Text

MISSING 

Distinct: 28
Distinct (%): 87.5%
Missing: 21
Missing (%): 39.6%
Memory size: 556.0 B

Length

Max length: 34
Median length: 22
Mean length: 18.84375
Min length: 12

Characters and Unicode

Total characters: 603
Distinct characters: 151
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 25
Unique (%): 78.1%

Sample

1st row: 2023년 어린이집 홍보영상 제작에서 편집까지! 교육
2nd row: 심폐소생술 및 응급처치 교육
3rd row: 어린이집 원장 노무교육
4th row: 2023년 어린이집 개인정보 보호 및 CCTV 운영ㆍ관리 교육
5th row: 신입교사를 위한 보육일지 작성법 교육
Value                        Count  Frequency (%)
교육                          21     16.0%
어린이집                       5      3.8%
열린어린이집                    4      3.1%
아동권리존중                    3      2.3%
컨설팅                         3      2.3%
교육(아동학대신고의무자교육       3      2.3%
1차                           3      2.3%
보육교직원                      3      2.3%
2차                           3      2.3%
만들기                         3      2.3%
Other values (62)            80     61.1%

Most occurring characters

ValueCountFrequency (%)
100
 
16.6%
40
 
6.6%
37
 
6.1%
13
 
2.2%
13
 
2.2%
12
 
2.0%
11
 
1.8%
10
 
1.7%
) 9
 
1.5%
2 9
 
1.5%
Other values (141) 349
57.9%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       452    75.0%
Space Separator    100    16.6%
Decimal Number     22     3.6%
Close Punctuation  10     1.7%
Open Punctuation   10     1.7%
Uppercase Letter   4      0.7%
Dash Punctuation   3      0.5%
Other Punctuation  2      0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
8.8%
37
 
8.2%
13
 
2.9%
13
 
2.9%
12
 
2.7%
11
 
2.4%
10
 
2.2%
9
 
2.0%
9
 
2.0%
8
 
1.8%
Other values (124) 290
64.2%
Decimal Number
Value  Count  Frequency (%)
2      9      40.9%
3      4      18.2%
0      4      18.2%
1      3      13.6%
4      1      4.5%
5      1      4.5%
Uppercase Letter
Value  Count  Frequency (%)
C      2      50.0%
V      1      25.0%
T      1      25.0%
Close Punctuation
ValueCountFrequency (%)
) 9
90.0%
1
 
10.0%
Open Punctuation
ValueCountFrequency (%)
( 9
90.0%
1
 
10.0%
Other Punctuation
Value    Count  Frequency (%)
&        1      50.0%
!        1      50.0%
Space Separator
Value    Count  Frequency (%)
(space)  100    100.0%
Dash Punctuation
Value    Count  Frequency (%)
-        3      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  452    75.0%
Common  147    24.4%
Latin   4      0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
8.8%
37
 
8.2%
13
 
2.9%
13
 
2.9%
12
 
2.7%
11
 
2.4%
10
 
2.2%
9
 
2.0%
9
 
2.0%
8
 
1.8%
Other values (124) 290
64.2%
Common
ValueCountFrequency (%)
100
68.0%
) 9
 
6.1%
2 9
 
6.1%
( 9
 
6.1%
3 4
 
2.7%
0 4
 
2.7%
1 3
 
2.0%
- 3
 
2.0%
1
 
0.7%
& 1
 
0.7%
Other values (4) 4
 
2.7%
Latin
Value  Count  Frequency (%)
C      2      50.0%
V      1      25.0%
T      1      25.0%

Most occurring blocks

Value        Count  Frequency (%)
Hangul       451    74.8%
ASCII        149    24.7%
None         2      0.3%
Compat Jamo  1      0.2%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
(space)           100    67.1%
)                 9      6.0%
2                 9      6.0%
(                 9      6.0%
3                 4      2.7%
0                 4      2.7%
1                 3      2.0%
-                 3      2.0%
C                 2      1.3%
&                 1      0.7%
Other values (5)  5      3.4%
Hangul
ValueCountFrequency (%)
40
 
8.9%
37
 
8.2%
13
 
2.9%
13
 
2.9%
12
 
2.7%
11
 
2.4%
10
 
2.2%
9
 
2.0%
9
 
2.0%
8
 
1.8%
Other values (123) 289
64.1%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

정원(명)
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 556.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.8113208
Min length: 2

Unique

Unique: 1
Unique (%): 1.9%

Sample

1st row: 30
2nd row: 30
3rd row: 30
4th row: 30
5th row: 50

Common Values

Value  Count  Frequency (%)
30     23     43.4%
<NA>   21     39.6%
50     4      7.5%
25     4      7.5%
300    1      1.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
30     23     43.4%
na     21     39.6%
50     4      7.5%
25     4      7.5%
300    1      1.9%

Interactions


Correlations

            연번     교육구분   교육일시   교육명    정원(명)
연번         1.000   0.285    1.000    0.697   0.660
교육구분      0.285   1.000    1.000    1.000   0.696
교육일시      1.000   1.000    1.000    1.000   1.000
교육명       0.697   1.000    1.000    1.000   0.808
정원(명)     0.660   0.696    1.000    0.808   1.000

            교육구분   정원(명)
교육구분      1.000    0.503
정원(명)     0.503    1.000

            연번     교육구분   정원(명)
연번         1.000   0.098    0.399
교육구분      0.098   1.000    0.503
정원(명)     0.399   0.503    1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

    연번  교육구분      교육일시                          교육명                                          정원(명)
0   1    역량강화교육   1월12일(목) 16:00~18:00(2시간)   2023년 어린이집 홍보영상 제작에서 편집까지! 교육        30
1   2    안전교육      1월26일(목) 16:00~18:00(2시간)   심폐소생술 및 응급처치 교육                          30
2   3    역량강화교육   2월2일(목) 16:00~18:00(2시간)    어린이집 원장 노무교육                              30
3   4    안전교육      2월7일(화) 16:00~18:00(2시간)    2023년 어린이집 개인정보 보호 및 CCTV 운영ㆍ관리 교육   30
4   5    역량강화교육   3월22일(수) 16:00~18:00(2시간)   신입교사를 위한 보육일지 작성법 교육                   50
5   6    안전교육      3월24일(금) 16:00~18:00(2시간)   아동권리존중 교육(아동학대신고의무자교육)               50
6   7    안전교육      3월29일(수) 16:00~18:00(2시간)   심폐소생술 및 응급처치 교육                          50
7   8    표준보육과정   3월31일(금) 16:00~18:00(2시간)   놀면서 자란다(3-5세 보육과정) 교육                   30
8   9    표준보육과정   4월14일(금) 16:00~18:00(2시간)   놀면서 자란다(0-2세 보육과정) 교육                   30
9   10   안전교육      4월19일(수) 16:00~18:00(2시간)   아동권리존중 교육(아동학대신고의무자교육)               50

    연번    교육구분  교육일시  교육명   정원(명)
43  <NA>  <NA>    <NA>    <NA>   <NA>
44  <NA>  <NA>    <NA>    <NA>   <NA>
45  <NA>  <NA>    <NA>    <NA>   <NA>
46  <NA>  <NA>    <NA>    <NA>   <NA>
47  <NA>  <NA>    <NA>    <NA>   <NA>
48  <NA>  <NA>    <NA>    <NA>   <NA>
49  <NA>  <NA>    <NA>    <NA>   <NA>
50  <NA>  <NA>    <NA>    <NA>   <NA>
51  <NA>  <NA>    <NA>    <NA>   <NA>
52  <NA>  <NA>    <NA>    <NA>   <NA>

Duplicate rows

Most frequently occurring

    연번    교육구분  교육일시  교육명   정원(명)  # duplicates
0   <NA>  <NA>    <NA>    <NA>   <NA>    21
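
The single duplicated pattern is a row in which every field is <NA>, repeated 21 times. That one pattern accounts for the Duplicates alert and for the 21 missing values in 연번, 교육일시, and 교육명, leaving 53 - 21 = 32 rows that carry data. A minimal sketch of filtering such rows before analysis, assuming missing cells are represented as `None` once the file is loaded (the loader and the `drop_empty_rows` helper are hypothetical; the two data rows are copied from the sample above):

```python
from typing import List, Optional, Sequence

Row = Sequence[Optional[str]]


def drop_empty_rows(rows: Sequence[Row]) -> List[Row]:
    """Keep only rows that have at least one non-missing cell."""
    return [row for row in rows if any(cell is not None for cell in row)]


rows: List[Row] = [
    ("1", "역량강화교육", "1월12일(목) 16:00~18:00(2시간)",
     "2023년 어린이집 홍보영상 제작에서 편집까지! 교육", "30"),
    ("2", "안전교육", "1월26일(목) 16:00~18:00(2시간)",
     "심폐소생술 및 응급처치 교육", "30"),
    (None, None, None, None, None),   # trailing empty row, as in rows 43..52
    (None, None, None, None, None),
]

kept = drop_empty_rows(rows)
print(f"{len(rows) - len(kept)} empty rows dropped, {len(kept)} kept")
```

Dropping on "all cells missing" rather than on any single column avoids discarding rows that are only partially filled in.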