Overview

Dataset statistics

Number of variables: 6
Number of observations: 115
Missing cells: 6
Missing cells (%): 0.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.6 KiB
Average record size in memory: 50.1 B
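
The percentage figures above follow directly from the counts. A minimal sketch of the arithmetic, using only numbers reported in this overview (the variable names are illustrative):

```python
# Counts copied from the dataset statistics above.
n_variables = 6
n_observations = 115
missing_cells = 6

total_cells = n_variables * n_observations        # 690 cells in total
missing_pct = 100 * missing_cells / total_cells   # share of all cells

# The Alerts section attributes all 6 missing cells to 대표강좌명, so the
# per-column share is computed against the 115 observations instead.
column_missing_pct = 100 * missing_cells / n_observations

print(f"{missing_pct:.1f}%")         # 0.9%
print(f"{column_missing_pct:.1f}%")  # 5.2%
```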

Variable types

Numeric: 1
Text: 2
Categorical: 3

Dataset

Description: Information on lifelong-learning instructors in Gongju-si, Chungcheongnam-do, including each instructor's name, field, and representative course.
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=311&beforeMenuCd=DOM_000000201001001000&publicdatapk=15093495

Alerts

데이터기준일자 has constant value "2021-10-25" (Constant)
순번 is highly overall correlated with 강사구분 (High correlation)
강사구분 is highly overall correlated with 순번 (High correlation)
대표강좌명 has 6 (5.2%) missing values (Missing)
순번 has unique values (Unique)

Reproduction

Analysis started: 2024-01-09 20:04:14.771056
Analysis finished: 2024-01-09 20:04:15.454085
Duration: 0.68 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
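
A report of this kind can be regenerated with ydata-profiling. The sketch below is an outline under stated assumptions, not the exact invocation used for this report: the `cp949` default encoding and the helper name `build_report` are assumptions, and the settings in the original `config.json` are not reproduced.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile a CSV file and write an HTML report (sketch)."""
    # Imported lazily so the function can be defined even where the
    # libraries are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding=encoding)
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(html_path)

# Example call with hypothetical file names:
# build_report("instructors.csv", "report.html")
```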

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 115
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 58.73913
Minimum: 1
Maximum: 116
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.7
Q1: 29.5
median: 59
Q3: 87.5
95-th percentile: 110.3
Maximum: 116
Range: 115
Interquartile range (IQR): 58

Descriptive statistics

Standard deviation: 33.678325
Coefficient of variation (CV): 0.57335417
Kurtosis: -1.1980803
Mean: 58.73913
Median Absolute Deviation (MAD): 29
Skewness: -0.016818379
Sum: 6755
Variance: 1134.2296
Monotonicity: Strictly increasing
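
The reported statistics are enough to reconstruct the values of 순번. With 115 distinct values between 1 and 116, exactly one integer in that range is absent if the column holds only integers, and the sum identifies which one. The sketch below makes that integer assumption (the report types the column as a real number) and re-derives several of the figures above from the reconstructed sequence:

```python
import statistics

n, lo, hi, reported_sum = 115, 1, 116, 6755

# Sum of every integer in [lo, hi], minus the reported sum, gives the gap.
full_sum = (lo + hi) * (hi - lo + 1) // 2
missing = full_sum - reported_sum

values = [v for v in range(lo, hi + 1) if v != missing]
assert len(values) == n

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (divides by n - 1)
std = statistics.stdev(values)
# "inclusive" interpolates linearly between the two nearest order statistics.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")

print(missing)              # 31
print(q1, median, q3)       # 29.5 59.0 87.5
print(mean, variance, std, std / mean)
```

Under that assumption the reconstructed quartiles, mean, variance, and coefficient of variation agree with the reported values, which is consistent with 31 being the one skipped sequence number.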
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.9%
75 | 1 | 0.9%
87 | 1 | 0.9%
86 | 1 | 0.9%
85 | 1 | 0.9%
84 | 1 | 0.9%
83 | 1 | 0.9%
82 | 1 | 0.9%
81 | 1 | 0.9%
80 | 1 | 0.9%
Other values (105) | 105 | 91.3%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.9%
2 | 1 | 0.9%
3 | 1 | 0.9%
4 | 1 | 0.9%
5 | 1 | 0.9%
6 | 1 | 0.9%
7 | 1 | 0.9%
8 | 1 | 0.9%
9 | 1 | 0.9%
10 | 1 | 0.9%

Maximum 10 values

Value | Count | Frequency (%)
116 | 1 | 0.9%
115 | 1 | 0.9%
114 | 1 | 0.9%
113 | 1 | 0.9%
112 | 1 | 0.9%
111 | 1 | 0.9%
110 | 1 | 0.9%
109 | 1 | 0.9%
108 | 1 | 0.9%
107 | 1 | 0.9%

강사명
Text

Distinct: 105
Distinct (%): 91.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 4
Median length: 3
Mean length: 3.1565217
Min length: 3

Characters and Unicode

Total characters: 363
Distinct characters: 83
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
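
As a rough illustration of that idea, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for the five sample names listed in this section. It is not the profiler's own implementation, and the helper name `category_counts` is made up for the example.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count Unicode general categories over all characters in `texts`."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)


# The five sample values listed under "Sample" in this section.
sample = ["신*희", "이*희", "김*숙", "박*라", "조*미"]
counts = category_counts(sample)

# "Lo" is Other Letter (the Hangul syllables); "Po" is Other Punctuation ("*").
print(counts["Lo"], counts["Po"])  # 10 5
```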

Unique

Unique: 98
Unique (%): 85.2%

Sample

1st row: 신*희
2nd row: 이*희
3rd row: 김*숙
4th row: 박*라
5th row: 조*미

Value | Count | Frequency (%)
이*희 | 4 | 3.5%
이*숙 | 3 | 2.6%
안*신 | 2 | 1.7%
백*숙 | 2 | 1.7%
박*영 | 2 | 1.7%
이*민 | 2 | 1.7%
김*숙 | 2 | 1.7%
박*희 | 2 | 1.7%
조*경 | 2 | 1.7%
고*아 | 1 | 0.9%
Other values (93) | 93 | 80.9%

Most occurring characters

ValueCountFrequency (%)
* 115
31.7%
23
 
6.3%
18
 
5.0%
16
 
4.4%
10
 
2.8%
10
 
2.8%
10
 
2.8%
8
 
2.2%
7
 
1.9%
7
 
1.9%
Other values (73) 139
38.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 230 | 63.4%
Other Punctuation | 115 | 31.7%
Space Separator | 18 | 5.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
10.0%
16
 
7.0%
10
 
4.3%
10
 
4.3%
10
 
4.3%
8
 
3.5%
7
 
3.0%
7
 
3.0%
6
 
2.6%
5
 
2.2%
Other values (71) 128
55.7%
Other Punctuation
Value | Count | Frequency (%)
* | 115 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 18 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 230 | 63.4%
Common | 133 | 36.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
10.0%
16
 
7.0%
10
 
4.3%
10
 
4.3%
10
 
4.3%
8
 
3.5%
7
 
3.0%
7
 
3.0%
6
 
2.6%
5
 
2.2%
Other values (71) 128
55.7%
Common
Value | Count | Frequency (%)
* | 115 | 86.5%
(space) | 18 | 13.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 230 | 63.4%
ASCII | 133 | 36.6%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
* | 115 | 86.5%
(space) | 18 | 13.5%
Hangul
ValueCountFrequency (%)
23
 
10.0%
16
 
7.0%
10
 
4.3%
10
 
4.3%
10
 
4.3%
8
 
3.5%
7
 
3.0%
7
 
3.0%
6
 
2.6%
5
 
2.2%
Other values (71) 128
55.7%

분야
Categorical

Distinct: 19
Distinct (%): 16.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Value | Count
문화/예술 | 52
교육 | 11
인문/교양 | 11
건강/생활체육 | 10
문해교육 | 8
Other values (14) | 23

Length

Max length: 27
Median length: 5
Mean length: 5.426087
Min length: 2

Unique

Unique: 7
Unique (%): 6.1%

Sample

1st row: 문화/예술
2nd row: 문해교육
3rd row: 문해교육
4th row: 문화/예술
5th row: 교육

Common Values

Value | Count | Frequency (%)
문화/예술 | 52 | 45.2%
교육 | 11 | 9.6%
인문/교양 | 11 | 9.6%
건강/생활체육 | 10 | 8.7%
문해교육 | 8 | 7.0%
취업/자격증 | 3 | 2.6%
언어/외국어 | 3 | 2.6%
분야 IT/정보화 | 2 | 1.7%
분야 취업/자격증 | 2 | 1.7%
분야 문화/예술 | 2 | 1.7%
Other values (9) | 11 | 9.6%
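
Several entries in the table above repeat the column name as a prefix (for example `분야 문화/예술`), which splits one field into two categories. A minimal clean-up sketch, assuming the prefix is always `분야` followed by a space and using only the counts listed above (the helper `normalize_field` is hypothetical):

```python
from collections import Counter

# Counts copied from the Common Values table (top ten entries only).
common_values = {
    "문화/예술": 52, "교육": 11, "인문/교양": 11, "건강/생활체육": 10,
    "문해교육": 8, "취업/자격증": 3, "언어/외국어": 3,
    "분야 IT/정보화": 2, "분야 취업/자격증": 2, "분야 문화/예술": 2,
}


def normalize_field(value: str, prefix: str = "분야 ") -> str:
    """Drop a leading column-name prefix and surrounding whitespace."""
    value = value.strip()
    return value[len(prefix):] if value.startswith(prefix) else value


merged = Counter()
for value, count in common_values.items():
    merged[normalize_field(value)] += count

print(merged["문화/예술"], merged["취업/자격증"])  # 54 5
```

The merged totals of 54 and 5 match the word-level counts reported for 문화/예술 and 취업/자격증 later in this section, which suggests the prefixed entries belong to the same fields.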

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
문화/예술 | 54 | 40.9%
인문/교양 | 12 | 9.1%
교육 | 11 | 8.3%
건강/생활체육 | 11 | 8.3%
분야 | 11 | 8.3%
문해교육 | 9 | 6.8%
취업/자격증 | 5 | 3.8%
언어/외국어 | 4 | 3.0%
it/정보화 | 4 | 3.0%
기타 | 3 | 2.3%
Other values (8) | 8 | 6.1%

대표강좌명
Text

MISSING 

Distinct: 108
Distinct (%): 99.1%
Missing: 6
Missing (%): 5.2%
Memory size: 1.0 KiB

Length

Max length: 23
Median length: 19
Mean length: 9.5321101
Min length: 2

Characters and Unicode

Total characters: 1039
Distinct characters: 253
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 107
Unique (%): 98.2%

Sample

1st row: POP 손글씨
2nd row: 찾아가는 성인문해
3rd row: 배움의 나무
4th row: 칼림바 ,오카리나, 우쿨렐레, 피아노등
5th row: 천연비누만들기

Value | Count | Frequency (%)
레크레이션 | 3 | 1.7%
노인 | 3 | 1.7%
천아트 | 2 | 1.1%
오카리나 | 2 | 1.1%
부모교육 | 2 | 1.1%
라인댄스 | 2 | 1.1%
교실 | 2 | 1.1%
통기타 | 2 | 1.1%
토탈공예 | 2 | 1.1%
중국어 | 2 | 1.1%
Other values (150) | 153 | 87.4%

Most occurring characters

ValueCountFrequency (%)
76
 
7.3%
, 39
 
3.8%
24
 
2.3%
22
 
2.1%
20
 
1.9%
18
 
1.7%
17
 
1.6%
17
 
1.6%
16
 
1.5%
13
 
1.3%
Other values (243) 777
74.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 853 | 82.1%
Space Separator | 76 | 7.3%
Other Punctuation | 60 | 5.8%
Lowercase Letter | 16 | 1.5%
Uppercase Letter | 14 | 1.3%
Open Punctuation | 8 | 0.8%
Close Punctuation | 8 | 0.8%
Decimal Number | 3 | 0.3%
Dash Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
2.8%
22
 
2.6%
20
 
2.3%
18
 
2.1%
17
 
2.0%
17
 
2.0%
16
 
1.9%
13
 
1.5%
13
 
1.5%
12
 
1.4%
Other values (216) 681
79.8%
Uppercase Letter
Value | Count | Frequency (%)
P | 4 | 28.6%
O | 2 | 14.3%
F | 2 | 14.3%
C | 2 | 14.3%
T | 1 | 7.1%
I | 1 | 7.1%
U | 1 | 7.1%
D | 1 | 7.1%
Lowercase Letter
Value | Count | Frequency (%)
n | 4 | 25.0%
y | 3 | 18.8%
p | 2 | 12.5%
o | 2 | 12.5%
u | 2 | 12.5%
r | 1 | 6.2%
t | 1 | 6.2%
s | 1 | 6.2%
Other Punctuation
Value | Count | Frequency (%)
, | 39 | 65.0%
/ | 9 | 15.0%
. | 9 | 15.0%
& | 3 | 5.0%
Decimal Number
Value | Count | Frequency (%)
1 | 1 | 33.3%
3 | 1 | 33.3%
2 | 1 | 33.3%
Space Separator
Value | Count | Frequency (%)
(space) | 76 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 8 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 8 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 853 | 82.1%
Common | 156 | 15.0%
Latin | 30 | 2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
2.8%
22
 
2.6%
20
 
2.3%
18
 
2.1%
17
 
2.0%
17
 
2.0%
16
 
1.9%
13
 
1.5%
13
 
1.5%
12
 
1.4%
Other values (216) 681
79.8%
Latin
Value | Count | Frequency (%)
P | 4 | 13.3%
n | 4 | 13.3%
y | 3 | 10.0%
O | 2 | 6.7%
p | 2 | 6.7%
o | 2 | 6.7%
F | 2 | 6.7%
C | 2 | 6.7%
u | 2 | 6.7%
T | 1 | 3.3%
Other values (6) | 6 | 20.0%
Common
Value | Count | Frequency (%)
(space) | 76 | 48.7%
, | 39 | 25.0%
/ | 9 | 5.8%
. | 9 | 5.8%
( | 8 | 5.1%
) | 8 | 5.1%
& | 3 | 1.9%
1 | 1 | 0.6%
- | 1 | 0.6%
3 | 1 | 0.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 853 | 82.1%
ASCII | 186 | 17.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 76 | 40.9%
, | 39 | 21.0%
/ | 9 | 4.8%
. | 9 | 4.8%
( | 8 | 4.3%
) | 8 | 4.3%
P | 4 | 2.2%
n | 4 | 2.2%
& | 3 | 1.6%
y | 3 | 1.6%
Other values (17) | 23 | 12.4%
Hangul
ValueCountFrequency (%)
24
 
2.8%
22
 
2.6%
20
 
2.3%
18
 
2.1%
17
 
2.0%
17
 
2.0%
16
 
1.9%
13
 
1.5%
13
 
1.5%
12
 
1.4%
Other values (216) 681
79.8%

강사구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Value | Count
일반 강사 | 91
일반 + 행복학습나눔 강사 | 24

Length

Max length: 14
Median length: 5
Mean length: 6.8782609
Min length: 5
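
These length statistics follow from the two category values and their counts. A small check, using the counts reported for 강사구분 and assuming lengths are counted in Unicode code points:

```python
import statistics

# Category values and counts as reported for 강사구분.
counts = {"일반 강사": 91, "일반 + 행복학습나눔 강사": 24}

# Expand into one length per observation.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

print(min(lengths), max(lengths))          # 5 14
print(statistics.median(lengths))          # 5
print(round(statistics.mean(lengths), 7))  # 6.8782609
```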

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반 + 행복학습나눔 강사
2nd row: 일반 + 행복학습나눔 강사
3rd row: 일반 강사
4th row: 일반 강사
5th row: 일반 + 행복학습나눔 강사

Common Values

Value | Count | Frequency (%)
일반 강사 | 91 | 79.1%
일반 + 행복학습나눔 강사 | 24 | 20.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
일반 | 115 | 41.4%
강사 | 115 | 41.4%
+ | 24 | 8.6%
행복학습나눔 | 24 | 8.6%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Value | Count
2021-10-25 | 115

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-10-25
2nd row: 2021-10-25
3rd row: 2021-10-25
4th row: 2021-10-25
5th row: 2021-10-25

Common Values

Value | Count | Frequency (%)
2021-10-25 | 115 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2021-10-25 | 115 | 100.0%

Interactions


Correlations

[Figure: correlation heatmap]

Variable | 순번 | 분야 | 강사구분
순번 | 1.000 | 0.402 | 0.953
분야 | 0.402 | 1.000 | 0.000
강사구분 | 0.953 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 분야 | 강사구분
분야 | 1.000 | 0.000
강사구분 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 순번 | 분야 | 강사구분
순번 | 1.000 | 0.151 | 0.796
분야 | 0.151 | 1.000 | 0.000
강사구분 | 0.796 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

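First rows

Index | 순번 | 강사명 | 분야 | 대표강좌명 | 강사구분 | 데이터기준일자
0 | 1 | 신*희 | 문화/예술 | POP 손글씨 | 일반 + 행복학습나눔 강사 | 2021-10-25
1 | 2 | 이*희 | 문해교육 | 찾아가는 성인문해 | 일반 + 행복학습나눔 강사 | 2021-10-25
2 | 3 | 김*숙 | 문해교육 | 배움의 나무 | 일반 강사 | 2021-10-25
3 | 4 | 박*라 | 문화/예술 | 칼림바 ,오카리나, 우쿨렐레, 피아노등 | 일반 강사 | 2021-10-25
4 | 5 | 조*미 | 교육 | 천연비누만들기 | 일반 + 행복학습나눔 강사 | 2021-10-25
5 | 6 | 나*애 | 인문/교양 | 손뜨개, 코딩 | 일반 + 행복학습나눔 강사 | 2021-10-25
6 | 7 | 박*영 | 문화/예술 | 마크라메 | 일반 + 행복학습나눔 강사 | 2021-10-25
7 | 8 | 김*선 | 문화/예술 | 즐거운 통기타 교실 | 일반 + 행복학습나눔 강사 | 2021-10-25
8 | 9 | 이*민 | 인문/교양 | 사주명리학 | 일반 + 행복학습나눔 강사 | 2021-10-25
9 | 10 | 이*희 | 교육 | 목공체험 | 일반 + 행복학습나눔 강사 | 2021-10-25

Last rows

Index | 순번 | 강사명 | 분야 | 대표강좌명 | 강사구분 | 데이터기준일자
105 | 107 | 엄*용 | 분야 취업/자격증 | 놀이심리상담사1급자격증과정 | 일반 강사 | 2021-10-25
106 | 108 | 허*자 | 언어/외국어 | FunnyFunny story | 일반 강사 | 2021-10-25
107 | 109 | 안*희 | 취업/자격증 | 내인생이빛나는정리수납 | 일반 강사 | 2021-10-25
108 | 110 | 강*종 | 인문/교양 | 풍수지리학 | 일반 강사 | 2021-10-25
109 | 111 | 윤*현 | 문화/예술 | 생활리본&선물포장 | 일반 강사 | 2021-10-25
110 | 112 | 조*현 | 문화/예술 | 램플로우,스트링아트 | 일반 강사 | 2021-10-25
111 | 113 | 오*택 | 교육 | 요리교실 | 일반 강사 | 2021-10-25
112 | 114 | 이*호 | 문화/예술 | 문인화 | 일반 강사 | 2021-10-25
113 | 115 | 조*경 | 문화/예술 | 아동요리,미술,공예 | 일반 강사 | 2021-10-25
114 | 116 | 박*리 | 건강/생활체육 | 실버체조와웃음지도 | 일반 강사 | 2021-10-25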