Overview

Dataset statistics

Number of variables7
Number of observations148
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.7 KiB
Average record size in memory59.9 B

Variable types

Numeric3
Text2
Categorical2

Dataset

Description제주특별자치도교육청에서 발간한 간행물 정보로 연번,자료명, 기관구분, 발간예정일, 면수,부수,본청등록번호를 제공합니다.
Author제주특별자치도교육청
URLhttps://www.data.go.kr/data/3073079/fileData.do

Alerts

연번 is highly overall correlated with 부수High correlation
면수 is highly overall correlated with 발간예정일High correlation
부수 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
기관구분 is highly overall correlated with 부수 and 1 other fieldsHigh correlation
발간예정일 is highly overall correlated with 면수 and 2 other fieldsHigh correlation
기관구분 is highly imbalanced (54.2%)Imbalance
연번 has unique valuesUnique
자료명 has unique valuesUnique
본청등록번호 has unique valuesUnique

Reproduction

Analysis started2024-04-06 08:40:38.354594
Analysis finished2024-04-06 08:40:42.580976
Duration4.23 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct148
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.5
Minimum1
Maximum148
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-04-06T17:40:42.790423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.35
Q137.75
median74.5
Q3111.25
95-th percentile140.65
Maximum148
Range147
Interquartile range (IQR)73.5

Descriptive statistics

Standard deviation42.868014
Coefficient of variation (CV)0.57540959
Kurtosis-1.2
Mean74.5
Median Absolute Deviation (MAD)37
Skewness0
Sum11026
Variance1837.6667
MonotonicityStrictly increasing
2024-04-06T17:40:43.198268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
95 1
 
0.7%
97 1
 
0.7%
98 1
 
0.7%
99 1
 
0.7%
100 1
 
0.7%
101 1
 
0.7%
102 1
 
0.7%
103 1
 
0.7%
104 1
 
0.7%
Other values (138) 138
93.2%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
148 1
0.7%
147 1
0.7%
146 1
0.7%
145 1
0.7%
144 1
0.7%
143 1
0.7%
142 1
0.7%
141 1
0.7%
140 1
0.7%
139 1
0.7%

자료명
Text

UNIQUE 

Distinct148
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-04-06T17:40:43.918214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length46
Mean length18.97973
Min length11

Characters and Unicode

Total characters2809
Distinct characters251
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique148 ?
Unique (%)100.0%

Sample

1st row중등 평가 역량 강화 직무연수
2nd row2024 한국어교육 강사 역량강화 연수
3rd row2024 기초학력 보장의 이해 직무연수(초등)
4th row회복적 생활교육(RD1) 직무연수(1기)
5th row2024학년도 학교교육계획서
ValueCountFrequency (%)
보고서 91
 
16.8%
위험성평가 90
 
16.6%
2024 15
 
2.8%
직무연수 10
 
1.8%
2024학년도 10
 
1.8%
교육과정 8
 
1.5%
요청형 6
 
1.1%
현장 6
 
1.1%
운영 4
 
0.7%
2023 4
 
0.7%
Other values (260) 299
55.1%
2024-04-06T17:40:45.058845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
396
 
14.1%
157
 
5.6%
140
 
5.0%
109
 
3.9%
102
 
3.6%
99
 
3.5%
96
 
3.4%
96
 
3.4%
95
 
3.4%
93
 
3.3%
Other values (241) 1426
50.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2191
78.0%
Space Separator 396
 
14.1%
Decimal Number 161
 
5.7%
Close Punctuation 18
 
0.6%
Open Punctuation 18
 
0.6%
Uppercase Letter 12
 
0.4%
Other Punctuation 5
 
0.2%
Math Symbol 4
 
0.1%
Lowercase Letter 3
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
157
 
7.2%
140
 
6.4%
109
 
5.0%
102
 
4.7%
99
 
4.5%
96
 
4.4%
96
 
4.4%
95
 
4.3%
93
 
4.2%
91
 
4.2%
Other values (214) 1113
50.8%
Decimal Number
ValueCountFrequency (%)
2 79
49.1%
0 37
23.0%
4 29
 
18.0%
3 8
 
5.0%
1 6
 
3.7%
7 1
 
0.6%
6 1
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
R 3
25.0%
D 3
25.0%
A 2
16.7%
P 1
 
8.3%
I 1
 
8.3%
M 1
 
8.3%
E 1
 
8.3%
Math Symbol
ValueCountFrequency (%)
+ 1
25.0%
~ 1
25.0%
> 1
25.0%
< 1
25.0%
Lowercase Letter
ValueCountFrequency (%)
s 1
33.3%
u 1
33.3%
l 1
33.3%
Other Punctuation
ValueCountFrequency (%)
· 4
80.0%
! 1
 
20.0%
Space Separator
ValueCountFrequency (%)
396
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2191
78.0%
Common 603
 
21.5%
Latin 15
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
157
 
7.2%
140
 
6.4%
109
 
5.0%
102
 
4.7%
99
 
4.5%
96
 
4.4%
96
 
4.4%
95
 
4.3%
93
 
4.2%
91
 
4.2%
Other values (214) 1113
50.8%
Common
ValueCountFrequency (%)
396
65.7%
2 79
 
13.1%
0 37
 
6.1%
4 29
 
4.8%
) 18
 
3.0%
( 18
 
3.0%
3 8
 
1.3%
1 6
 
1.0%
· 4
 
0.7%
7 1
 
0.2%
Other values (7) 7
 
1.2%
Latin
ValueCountFrequency (%)
R 3
20.0%
D 3
20.0%
A 2
13.3%
s 1
 
6.7%
u 1
 
6.7%
l 1
 
6.7%
P 1
 
6.7%
I 1
 
6.7%
M 1
 
6.7%
E 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2191
78.0%
ASCII 614
 
21.9%
None 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
396
64.5%
2 79
 
12.9%
0 37
 
6.0%
4 29
 
4.7%
) 18
 
2.9%
( 18
 
2.9%
3 8
 
1.3%
1 6
 
1.0%
R 3
 
0.5%
D 3
 
0.5%
Other values (16) 17
 
2.8%
Hangul
ValueCountFrequency (%)
157
 
7.2%
140
 
6.4%
109
 
5.0%
102
 
4.7%
99
 
4.5%
96
 
4.4%
96
 
4.4%
95
 
4.3%
93
 
4.2%
91
 
4.2%
Other values (214) 1113
50.8%
None
ValueCountFrequency (%)
· 4
100.0%

기관구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct17
Distinct (%)11.5%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
본청
101 
직속기관
16 
탐라교육원
11 
중등교육과
 
5
안전관리과
 
2
Other values (12)
13 

Length

Max length9
Median length2
Mean length2.9121622
Min length2

Unique

Unique11 ?
Unique (%)7.4%

Sample

1st row탐라교육원
2nd row제주국제교육원
3rd row탐라교육원
4th row탐라교육원
5th row제주중학교

Common Values

ValueCountFrequency (%)
본청 101
68.2%
직속기관 16
 
10.8%
탐라교육원 11
 
7.4%
중등교육과 5
 
3.4%
안전관리과 2
 
1.4%
초등교육과 2
 
1.4%
제주국제교육원 1
 
0.7%
제주중학교 1
 
0.7%
서귀포시교육지원청 1
 
0.7%
강희주 1
 
0.7%
Other values (7) 7
 
4.7%

Length

2024-04-06T17:40:45.508935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
본청 101
68.2%
직속기관 16
 
10.8%
탐라교육원 11
 
7.4%
중등교육과 5
 
3.4%
안전관리과 2
 
1.4%
초등교육과 2
 
1.4%
정책기획과 1
 
0.7%
중학교 1
 
0.7%
제주시교육지원청 1
 
0.7%
체육건강과 1
 
0.7%
Other values (7) 7
 
4.7%

발간예정일
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)19.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2022-12-01
90 
2024-03-22
 
7
2024-02-19
 
7
2024-02-23
 
4
2024-03-15
 
3
Other values (24)
37 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique14 ?
Unique (%)9.5%

Sample

1st row2024-03-21
2nd row2024-03-22
3rd row2024-03-22
4th row2023-03-21
5th row2024-03-22

Common Values

ValueCountFrequency (%)
2022-12-01 90
60.8%
2024-03-22 7
 
4.7%
2024-02-19 7
 
4.7%
2024-02-23 4
 
2.7%
2024-03-15 3
 
2.0%
2024-03-05 3
 
2.0%
2024-02-08 3
 
2.0%
2024-02-27 3
 
2.0%
2024-01-30 2
 
1.4%
2024-03-18 2
 
1.4%
Other values (19) 24
 
16.2%

Length

2024-04-06T17:40:45.883431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2022-12-01 90
60.8%
2024-02-19 7
 
4.7%
2024-03-22 7
 
4.7%
2024-02-23 4
 
2.7%
2024-03-15 3
 
2.0%
2024-03-05 3
 
2.0%
2024-02-08 3
 
2.0%
2024-02-27 3
 
2.0%
2024-02-14 2
 
1.4%
2024-02-22 2
 
1.4%
Other values (19) 24
 
16.2%

면수
Real number (ℝ)

HIGH CORRELATION 

Distinct57
Distinct (%)38.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean156.35811
Minimum50
Maximum650
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-04-06T17:40:46.853562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum50
5-th percentile94.8
Q1143.5
median156.5
Q3164
95-th percentile235.2
Maximum650
Range600
Interquartile range (IQR)20.5

Descriptive statistics

Standard deviation64.221768
Coefficient of variation (CV)0.41073513
Kurtosis28.799814
Mean156.35811
Median Absolute Deviation (MAD)8.5
Skewness4.3628735
Sum23141
Variance4124.4355
MonotonicityNot monotonic
2024-04-06T17:40:47.319643image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100 18
 
12.2%
162 10
 
6.8%
159 8
 
5.4%
164 7
 
4.7%
150 7
 
4.7%
155 5
 
3.4%
156 5
 
3.4%
146 5
 
3.4%
161 4
 
2.7%
160 4
 
2.7%
Other values (47) 75
50.7%
ValueCountFrequency (%)
50 1
 
0.7%
69 1
 
0.7%
70 3
 
2.0%
72 1
 
0.7%
85 1
 
0.7%
92 1
 
0.7%
100 18
12.2%
104 1
 
0.7%
112 1
 
0.7%
114 1
 
0.7%
ValueCountFrequency (%)
650 1
 
0.7%
500 1
 
0.7%
345 1
 
0.7%
334 1
 
0.7%
240 3
2.0%
238 1
 
0.7%
230 1
 
0.7%
214 1
 
0.7%
209 1
 
0.7%
206 1
 
0.7%

부수
Real number (ℝ)

HIGH CORRELATION 

Distinct43
Distinct (%)29.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean334.13514
Minimum2
Maximum16000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-04-06T17:40:47.767895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q12
median2
Q392.5
95-th percentile1590
Maximum16000
Range15998
Interquartile range (IQR)90.5

Descriptive statistics

Standard deviation1574.6037
Coefficient of variation (CV)4.7124756
Kurtosis73.106307
Mean334.13514
Median Absolute Deviation (MAD)0
Skewness8.0704846
Sum49452
Variance2479376.7
MonotonicityNot monotonic
2024-04-06T17:40:48.241134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
2 90
60.8%
200 3
 
2.0%
300 3
 
2.0%
100 3
 
2.0%
535 2
 
1.4%
50 2
 
1.4%
110 2
 
1.4%
120 2
 
1.4%
400 2
 
1.4%
250 2
 
1.4%
Other values (33) 37
25.0%
ValueCountFrequency (%)
2 90
60.8%
12 1
 
0.7%
15 1
 
0.7%
17 1
 
0.7%
20 1
 
0.7%
21 1
 
0.7%
23 1
 
0.7%
25 1
 
0.7%
30 1
 
0.7%
33 1
 
0.7%
ValueCountFrequency (%)
16000 1
0.7%
9000 1
0.7%
3500 1
0.7%
3000 1
0.7%
2800 1
0.7%
2000 1
0.7%
1800 2
1.4%
1200 1
0.7%
900 1
0.7%
680 1
0.7%

본청등록번호
Text

UNIQUE 

Distinct148
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-04-06T17:40:48.827024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length13
Mean length13
Min length13

Characters and Unicode

Total characters1924
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique148 ?
Unique (%)100.0%

Sample

1st row제주교육-2024-173
2nd row제주교육-2024-172
3rd row제주교육-2024-171
4th row제주교육-2024-170
5th row제주교육-2024-169
ValueCountFrequency (%)
제주교육-2024-173 1
 
0.7%
제주교육-2024-081 1
 
0.7%
제주교육-2024-072 1
 
0.7%
제주교육-2024-078 1
 
0.7%
제주교육-2024-077 1
 
0.7%
제주교육-2024-076 1
 
0.7%
제주교육-2024-075 1
 
0.7%
제주교육-2024-074 1
 
0.7%
제주교육-2024-073 1
 
0.7%
제주교육-2024-071 1
 
0.7%
Other values (138) 138
93.2%
2024-04-06T17:40:49.716438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 325
16.9%
- 296
15.4%
0 247
12.8%
4 182
9.5%
148
7.7%
148
7.7%
148
7.7%
148
7.7%
1 99
 
5.1%
3 35
 
1.8%
Other values (5) 148
7.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1036
53.8%
Other Letter 592
30.8%
Dash Punctuation 296
 
15.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 325
31.4%
0 247
23.8%
4 182
17.6%
1 99
 
9.6%
3 35
 
3.4%
6 35
 
3.4%
5 34
 
3.3%
7 29
 
2.8%
9 25
 
2.4%
8 25
 
2.4%
Other Letter
ValueCountFrequency (%)
148
25.0%
148
25.0%
148
25.0%
148
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 296
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1332
69.2%
Hangul 592
30.8%

Most frequent character per script

Common
ValueCountFrequency (%)
2 325
24.4%
- 296
22.2%
0 247
18.5%
4 182
13.7%
1 99
 
7.4%
3 35
 
2.6%
6 35
 
2.6%
5 34
 
2.6%
7 29
 
2.2%
9 25
 
1.9%
Hangul
ValueCountFrequency (%)
148
25.0%
148
25.0%
148
25.0%
148
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1332
69.2%
Hangul 592
30.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 325
24.4%
- 296
22.2%
0 247
18.5%
4 182
13.7%
1 99
 
7.4%
3 35
 
2.6%
6 35
 
2.6%
5 34
 
2.6%
7 29
 
2.2%
9 25
 
1.9%
Hangul
ValueCountFrequency (%)
148
25.0%
148
25.0%
148
25.0%
148
25.0%

Interactions

2024-04-06T17:40:41.151392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:40:39.458092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:40:40.282144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:40:41.472943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:40:39.704872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:40:40.579928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:40:41.781637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:40:39.973600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-06T17:40:40.838239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:40:50.018740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번기관구분발간예정일면수부수
연번1.0000.6900.8030.3940.432
기관구분0.6901.0000.9520.5100.930
발간예정일0.8030.9521.0000.9050.860
면수0.3940.5100.9051.0000.028
부수0.4320.9300.8600.0281.000
2024-04-06T17:40:50.339406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발간예정일기관구분
발간예정일1.0000.634
기관구분0.6341.000
2024-04-06T17:40:50.628979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번면수부수기관구분발간예정일
연번1.0000.223-0.7470.3430.405
면수0.2231.000-0.2060.2480.610
부수-0.747-0.2061.0000.7720.566
기관구분0.3430.2480.7721.0000.634
발간예정일0.4050.6100.5660.6341.000

Missing values

2024-04-06T17:40:42.134252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:40:42.451471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번자료명기관구분발간예정일면수부수본청등록번호
01중등 평가 역량 강화 직무연수탐라교육원2024-03-2124090제주교육-2024-173
122024 한국어교육 강사 역량강화 연수제주국제교육원2024-03-2210050제주교육-2024-172
232024 기초학력 보장의 이해 직무연수(초등)탐라교육원2024-03-2210080제주교육-2024-171
34회복적 생활교육(RD1) 직무연수(1기)탐라교육원2023-03-2110025제주교육-2024-170
452024학년도 학교교육계획서제주중학교2024-03-2217260제주교육-2024-169
562024학년도 학교생활기록부 기재요령(초등학교)초등교육과2024-03-221723000제주교육-2024-168
67교육공무직원 인사·노무 실무과정탐라교육원2024-03-15152127제주교육-2024-167
78지방공무원 현장요청형 과정(조사 업무 처리 및 처분요구서 작성 방법)탐라교육원2024-04-1411223제주교육-2024-166
892024 마음 따뜻한 공책 봄길서귀포시교육지원청2024-03-151223500제주교육-2024-165
910회복적 생활교육(RD1) 직무연수탐라교육원2024-03-2210035제주교육-2024-164
연번자료명기관구분발간예정일면수부수본청등록번호
138139현장 요청형 검사 도구를 활용한 학습 상담과 부모 상담 지구무연수직속기관2024-02-0510021제주교육-2024-035
139140덕수초등학교 위험성평가 보고서본청2022-12-011502제주교육-2024-034
140141구좌중앙초등학교 위험성평가 보고서본청2022-12-011612제주교육-2024-033
141142곽금초등학교 위험성평가 보고서본청2022-12-011652제주교육-2024-032
142143대기고등학교 위험성평가 보고서본청2022-12-011592제주교육-2024-031
143144북촌초등학교 위험성평가 보고서본청2022-12-011582제주교육-2024-030
144145귀덕초등학교 위험성평가 보고서본청2022-12-011652제주교육-2024-029
145146도순초등학교 위험성평가 보고서본청2022-12-011562제주교육-2024-028
146147하귀초등학교 위험성평가 보고서본청2022-12-011612제주교육-2024-027
1471482024 교육공무직원 기본역량 연수(1기)직속기관2024-01-307033제주교육-2024-026