Overview

Dataset statistics

Number of variables6
Number of observations228
Missing cells17
Missing cells (%)1.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.0 KiB
Average record size in memory49.6 B

Variable types

Numeric1
Text2
Categorical3

Dataset

Description충청남도 공주시의 평생학습 강사에 대한 정보입니다. 강사명, 분야, 대표강좌 등에 대한 내용을 포함하고 있습니다.
Author충청남도 공주시
URLhttps://www.data.go.kr/data/15093495/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
순번 is highly overall correlated with 강사구분High correlation
강사구분 is highly overall correlated with 순번High correlation
대표강좌명 has 17 (7.5%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2024-04-17 11:19:03.564371
Analysis finished2024-04-17 11:19:04.042028
Duration0.48 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct228
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean114.5
Minimum1
Maximum228
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2024-04-17T20:19:04.101795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.35
Q157.75
median114.5
Q3171.25
95-th percentile216.65
Maximum228
Range227
Interquartile range (IQR)113.5

Descriptive statistics

Standard deviation65.96211
Coefficient of variation (CV)0.5760883
Kurtosis-1.2
Mean114.5
Median Absolute Deviation (MAD)57
Skewness0
Sum26106
Variance4351
MonotonicityStrictly increasing
2024-04-17T20:19:04.208651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
145 1
 
0.4%
147 1
 
0.4%
148 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
Other values (218) 218
95.6%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
228 1
0.4%
227 1
0.4%
226 1
0.4%
225 1
0.4%
224 1
0.4%
223 1
0.4%
222 1
0.4%
221 1
0.4%
220 1
0.4%
219 1
0.4%
Distinct186
Distinct (%)81.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-04-17T20:19:04.537125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters684
Distinct characters104
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique158 ?
Unique (%)69.3%

Sample

1st row왕*연
2nd row김*정
3rd row편*의
4th row김*연
5th row송*희
ValueCountFrequency (%)
이*희 5
 
2.2%
김*희 4
 
1.8%
김*숙 4
 
1.8%
김*진 3
 
1.3%
신*희 3
 
1.3%
박*희 3
 
1.3%
이*숙 3
 
1.3%
김*정 3
 
1.3%
김*태 3
 
1.3%
박*영 3
 
1.3%
Other values (176) 194
85.1%
2024-04-17T20:19:04.964884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 228
33.3%
40
 
5.8%
39
 
5.7%
23
 
3.4%
17
 
2.5%
16
 
2.3%
15
 
2.2%
13
 
1.9%
12
 
1.8%
11
 
1.6%
Other values (94) 270
39.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 456
66.7%
Other Punctuation 228
33.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
8.8%
39
 
8.6%
23
 
5.0%
17
 
3.7%
16
 
3.5%
15
 
3.3%
13
 
2.9%
12
 
2.6%
11
 
2.4%
10
 
2.2%
Other values (93) 260
57.0%
Other Punctuation
ValueCountFrequency (%)
* 228
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 456
66.7%
Common 228
33.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
8.8%
39
 
8.6%
23
 
5.0%
17
 
3.7%
16
 
3.5%
15
 
3.3%
13
 
2.9%
12
 
2.6%
11
 
2.4%
10
 
2.2%
Other values (93) 260
57.0%
Common
ValueCountFrequency (%)
* 228
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 456
66.7%
ASCII 228
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 228
100.0%
Hangul
ValueCountFrequency (%)
40
 
8.8%
39
 
8.6%
23
 
5.0%
17
 
3.7%
16
 
3.5%
15
 
3.3%
13
 
2.9%
12
 
2.6%
11
 
2.4%
10
 
2.2%
Other values (93) 260
57.0%

분야
Categorical

Distinct10
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
문화+예술
102 
인문+교양
26 
건강+생활체육
25 
교육
17 
IT+정보화
14 
Other values (5)
44 

Length

Max length7
Median length5
Mean length4.9605263
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row시민참여
2nd row시민참여
3rd row문화+예술
4th row인문+교양
5th row건강+생활체육

Common Values

ValueCountFrequency (%)
문화+예술 102
44.7%
인문+교양 26
 
11.4%
건강+생활체육 25
 
11.0%
교육 17
 
7.5%
IT+정보화 14
 
6.1%
문해교육 13
 
5.7%
취업+자격증 12
 
5.3%
기타 8
 
3.5%
언어+외국어 7
 
3.1%
시민참여 4
 
1.8%

Length

2024-04-17T20:19:05.345159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T20:19:05.444691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
문화+예술 102
44.7%
인문+교양 26
 
11.4%
건강+생활체육 25
 
11.0%
교육 17
 
7.5%
it+정보화 14
 
6.1%
문해교육 13
 
5.7%
취업+자격증 12
 
5.3%
기타 8
 
3.5%
언어+외국어 7
 
3.1%
시민참여 4
 
1.8%

대표강좌명
Text

MISSING 

Distinct199
Distinct (%)94.3%
Missing17
Missing (%)7.5%
Memory size1.9 KiB
2024-04-17T20:19:05.627224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length36
Mean length10.587678
Min length2

Characters and Unicode

Total characters2234
Distinct characters396
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique192 ?
Unique (%)91.0%

Sample

1st row환경보드게임
2nd row하브루타환경독서
3rd row그대봄이오면'신중년삶을낭독하다
4th row그림책함께읽기
5th row기체조
ValueCountFrequency (%)
실버건강지도 6
 
2.8%
천아트 3
 
1.4%
오카리나+팬플룻 2
 
0.9%
전통매듭 2
 
0.9%
전통매듭공예 2
 
0.9%
캘리그라피 2
 
0.9%
야생화자수 2
 
0.9%
프랑스자수+퀼트소품 1
 
0.5%
미술실기 1
 
0.5%
소잉바느질+프랑스자수+미싱으로소품만들기 1
 
0.5%
Other values (189) 189
89.6%
2024-04-17T20:19:05.956351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
+ 128
 
5.7%
48
 
2.1%
42
 
1.9%
37
 
1.7%
36
 
1.6%
32
 
1.4%
31
 
1.4%
30
 
1.3%
29
 
1.3%
27
 
1.2%
Other values (386) 1794
80.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1996
89.3%
Math Symbol 128
 
5.7%
Lowercase Letter 29
 
1.3%
Uppercase Letter 25
 
1.1%
Open Punctuation 21
 
0.9%
Close Punctuation 21
 
0.9%
Decimal Number 7
 
0.3%
Other Punctuation 5
 
0.2%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
 
2.4%
42
 
2.1%
37
 
1.9%
36
 
1.8%
32
 
1.6%
31
 
1.6%
30
 
1.5%
29
 
1.5%
27
 
1.4%
25
 
1.3%
Other values (351) 1659
83.1%
Uppercase Letter
ValueCountFrequency (%)
P 6
24.0%
C 3
12.0%
O 2
 
8.0%
F 2
 
8.0%
A 2
 
8.0%
T 2
 
8.0%
D 2
 
8.0%
W 1
 
4.0%
S 1
 
4.0%
V 1
 
4.0%
Other values (3) 3
12.0%
Lowercase Letter
ValueCountFrequency (%)
y 4
13.8%
u 4
13.8%
n 4
13.8%
o 3
10.3%
s 2
6.9%
p 2
6.9%
r 2
6.9%
l 2
6.9%
t 2
6.9%
i 2
6.9%
Other values (2) 2
6.9%
Decimal Number
ValueCountFrequency (%)
2 5
71.4%
3 1
 
14.3%
1 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
& 3
60.0%
' 1
 
20.0%
! 1
 
20.0%
Math Symbol
ValueCountFrequency (%)
+ 128
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1996
89.3%
Common 184
 
8.2%
Latin 54
 
2.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
 
2.4%
42
 
2.1%
37
 
1.9%
36
 
1.8%
32
 
1.6%
31
 
1.6%
30
 
1.5%
29
 
1.5%
27
 
1.4%
25
 
1.3%
Other values (351) 1659
83.1%
Latin
ValueCountFrequency (%)
P 6
 
11.1%
y 4
 
7.4%
u 4
 
7.4%
n 4
 
7.4%
o 3
 
5.6%
C 3
 
5.6%
s 2
 
3.7%
p 2
 
3.7%
r 2
 
3.7%
O 2
 
3.7%
Other values (15) 22
40.7%
Common
ValueCountFrequency (%)
+ 128
69.6%
( 21
 
11.4%
) 21
 
11.4%
2 5
 
2.7%
& 3
 
1.6%
- 2
 
1.1%
3 1
 
0.5%
1 1
 
0.5%
' 1
 
0.5%
! 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1996
89.3%
ASCII 238
 
10.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
+ 128
53.8%
( 21
 
8.8%
) 21
 
8.8%
P 6
 
2.5%
2 5
 
2.1%
y 4
 
1.7%
u 4
 
1.7%
n 4
 
1.7%
o 3
 
1.3%
& 3
 
1.3%
Other values (25) 39
 
16.4%
Hangul
ValueCountFrequency (%)
48
 
2.4%
42
 
2.1%
37
 
1.9%
36
 
1.8%
32
 
1.6%
31
 
1.6%
30
 
1.5%
29
 
1.5%
27
 
1.4%
25
 
1.3%
Other values (351) 1659
83.1%

강사구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
일반강사
157 
일반+행복학습나눔강사
58 
행복학습나눔강사
 
13

Length

Max length11
Median length4
Mean length6.0087719
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반강사
2nd row행복학습나눔강사
3rd row일반강사
4th row일반+행복학습나눔강사
5th row일반강사

Common Values

ValueCountFrequency (%)
일반강사 157
68.9%
일반+행복학습나눔강사 58
 
25.4%
행복학습나눔강사 13
 
5.7%

Length

2024-04-17T20:19:06.079782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T20:19:06.188215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반강사 157
68.9%
일반+행복학습나눔강사 58
 
25.4%
행복학습나눔강사 13
 
5.7%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2022-11-16
228 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022-11-16
2nd row2022-11-16
3rd row2022-11-16
4th row2022-11-16
5th row2022-11-16

Common Values

ValueCountFrequency (%)
2022-11-16 228
100.0%

Length

2024-04-17T20:19:06.286812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T20:19:06.359161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022-11-16 228
100.0%

Interactions

2024-04-17T20:19:03.845336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T20:19:06.407154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번분야강사구분
순번1.0000.3060.676
분야0.3061.0000.345
강사구분0.6760.3451.000
2024-04-17T20:19:06.481094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강사구분분야
강사구분1.0000.217
분야0.2171.000
2024-04-17T20:19:06.549790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번분야강사구분
순번1.0000.0970.520
분야0.0971.0000.217
강사구분0.5200.2171.000

Missing values

2024-04-17T20:19:03.926040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T20:19:04.007525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번강사명분야대표강좌명강사구분데이터기준일자
01왕*연시민참여환경보드게임일반강사2022-11-16
12김*정시민참여하브루타환경독서행복학습나눔강사2022-11-16
23편*의문화+예술그대봄이오면'신중년삶을낭독하다일반강사2022-11-16
34김*연인문+교양그림책함께읽기일반+행복학습나눔강사2022-11-16
45송*희건강+생활체육기체조일반강사2022-11-16
56김*경인문+교양그림책소통하브루타일반+행복학습나눔강사2022-11-16
67김*진IT+정보화로봇+코딩+게임메이킹+VR+유튜브+크리에이터+영상편집+사진편집+PPT+문서작성+구글웤스+모바일어플리케이션등디지털및SW분야+보드게임+드론+강사양성일반+행복학습나눔강사2022-11-16
78서*영취업+자격증수납전문가2급과정일반+행복학습나눔강사2022-11-16
89문*기문화+예술스토리창작과작문능력향상일반강사2022-11-16
910서*례기타자연요리(사찰요리)행복학습나눔강사2022-11-16
순번강사명분야대표강좌명강사구분데이터기준일자
218219안*희취업+자격증내인생이빛나는정리수납일반강사2022-11-16
219220강*종인문+교양풍수지리학일반강사2022-11-16
220221변*란문해교육중등문해(국어)일반강사2022-11-16
221222김*태취업+자격증스피치교육지도사2급과정일반강사2022-11-16
222223조*현문화+예술램플로우+스트링아트일반강사2022-11-16
223224윤*현문화+예술생활리본&선물포장일반강사2022-11-16
224225오*택교육요리교실일반강사2022-11-16
225226이*호문화+예술문인화일반강사2022-11-16
226227박*리건강+생활체육실버체조와웃음지도일반강사2022-11-16
227228조*경문화+예술아동요리+미술+공예일반강사2022-11-16