Overview

Dataset statistics

Number of variables: 5
Number of observations: 203
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.5%
Total size in memory: 8.3 KiB
Average record size in memory: 41.6 B
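
The two percentage fields above can be recomputed from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables times observations) and that the duplicate percentage is taken over observations; the variable names are illustrative only and are not part of the report.

```python
# Recompute the percentage fields of the "Dataset statistics" panel
# from the raw counts reported there (assumed denominators, see lead-in).
n_variables = 5
n_observations = 203
missing_cells = 0
duplicate_rows = 1  # rows flagged as duplicates in the report

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
```

Under these assumptions the rounded results agree with the reported 0.0% and 0.5%.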

Variable types

Text: 2
Categorical: 1
Numeric: 1
DateTime: 1

Dataset

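Description: Instructor management information for the Buk-gu Lifelong Education Center in Ulsan Metropolitan City. It covers each instructor's course name and affiliated field, along with the value that defines the affiliated field.
Author: Buk-gu, Ulsan Metropolitan City (울산광역시 북구)
URL: https://www.data.go.kr/data/15090075/fileData.do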

Alerts

데이터기준일 (data reference date) has constant value "2024-02-19" (Constant)
Dataset has 1 (0.5%) duplicate row (Duplicates)
분야별값 (field value) is highly overall correlated with 소속분야 (affiliated field) (High correlation)
소속분야 is highly overall correlated with 분야별값 (High correlation)

Reproduction

Analysis started: 2024-03-15 01:49:59.885153
Analysis finished: 2024-03-15 01:50:01.302517
Duration: 1.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

강사명 (instructor name)
Text

Distinct: 191
Distinct (%): 94.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 16
Median length: 3
Mean length: 3.0837438
Min length: 2
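
As an illustration of how these length statistics could be derived, the sketch below measures string lengths in Unicode code points for the five sample values shown in this report. The helper name `length_stats` is hypothetical, and the assumption that the profiler counts code points (rather than bytes) is mine. The last line checks that the reported mean length equals the report's total character count (626) divided by the number of observations (203).

```python
from statistics import mean, median

def length_stats(values):
    """Return max/median/mean/min string length, counted in code points."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }

# The five sample values listed for this variable.
sample = ["이정희", "한경미", "김영선", "최현호", "강웅"]
stats = length_stats(sample)
print(stats)

# Full-column consistency check using totals reported in this section.
reported_mean = 626 / 203
print(round(reported_mean, 7))
```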

Characters and Unicode

Total characters: 626
Distinct characters: 135
Distinct categories: 3
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
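
A minimal sketch of such a per-category count, using Python's standard `unicodedata` module, is given below. It is not the profiler's own implementation; the `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative, and the input strings are sample values taken from this report. The standard library exposes general categories but not scripts or blocks, so only the category breakdown is shown.

```python
import unicodedata
from collections import Counter

# Human-readable names for the general-category codes used in this sketch.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

sample = ["강웅", "autocad기초", "누구나 할 수 있는 상권분석"]
counts = category_counts(sample)
for name, n in counts.most_common():
    print(name, n)
```

Hangul syllables fall under "Other Letter" (Lo), which is why that category dominates the tables in this section.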

Unique

Unique: 180
Unique (%): 88.7%
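
"Distinct" and "Unique" measure different things: a reasonable reading is that distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below illustrates that reading on a toy list (the list is made up for illustration, reusing names that appear in this report), then checks that the reported figures are mutually consistent: 191 - 180 = 11 values repeat, and they account for the 203 - 180 = 23 remaining observations.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# Toy data, not the real column.
toy = ["노상훈", "노상훈", "노상훈", "김보경", "김보경", "강웅"]
print(distinct_and_unique(toy))

# Consistency check on the figures reported for this variable.
observations, distinct, unique = 203, 191, 180
repeated_values = distinct - unique
repeated_observations = observations - unique
print(repeated_values, repeated_observations)
```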

Sample

1st row: 이정희
2nd row: 한경미
3rd row: 김영선
4th row: 최현호
5th row: 강웅
Value | Count | Frequency (%)
노상훈 | 3 | 1.5%
김보경 | 2 | 1.0%
김혜정 | 2 | 1.0%
박영희 | 2 | 1.0%
정휘빈 | 2 | 1.0%
조현정 | 2 | 1.0%
한아름 | 2 | 1.0%
김은진 | 2 | 1.0%
서효숙 | 2 | 1.0%
이윤정 | 2 | 1.0%
Other values (184) | 185 | 89.8%

Most occurring characters

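Value | Count | Frequency (%)
(not preserved) | 47 | 7.5%
(not preserved) | 43 | 6.9%
(not preserved) | 33 | 5.3%
(not preserved) | 25 | 4.0%
(not preserved) | 18 | 2.9%
(not preserved) | 18 | 2.9%
(not preserved) | 14 | 2.2%
(not preserved) | 13 | 2.1%
(not preserved) | 13 | 2.1%
(not preserved) | 12 | 1.9%
Other values (125) | 390 | 62.3%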

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 600 | 95.8%
Uppercase Letter | 23 | 3.7%
Space Separator | 3 | 0.5%

Most frequent character per category

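Other Letter
Value | Count | Frequency (%)
(not preserved) | 47 | 7.8%
(not preserved) | 43 | 7.2%
(not preserved) | 33 | 5.5%
(not preserved) | 25 | 4.2%
(not preserved) | 18 | 3.0%
(not preserved) | 18 | 3.0%
(not preserved) | 14 | 2.3%
(not preserved) | 13 | 2.2%
(not preserved) | 13 | 2.2%
(not preserved) | 12 | 2.0%
Other values (112) | 364 | 60.7%

Uppercase Letter
Value | Count | Frequency (%)
N | 5 | 21.7%
O | 3 | 13.0%
F | 2 | 8.7%
A | 2 | 8.7%
S | 2 | 8.7%
H | 2 | 8.7%
J | 2 | 8.7%
I | 1 | 4.3%
E | 1 | 4.3%
Y | 1 | 4.3%
Other values (2) | 2 | 8.7%

Space Separator
Value | Count | Frequency (%)
(space) | 3 | 100.0%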

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 600 | 95.8%
Latin | 23 | 3.7%
Common | 3 | 0.5%

Most frequent character per script

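Hangul
Value | Count | Frequency (%)
(not preserved) | 47 | 7.8%
(not preserved) | 43 | 7.2%
(not preserved) | 33 | 5.5%
(not preserved) | 25 | 4.2%
(not preserved) | 18 | 3.0%
(not preserved) | 18 | 3.0%
(not preserved) | 14 | 2.3%
(not preserved) | 13 | 2.2%
(not preserved) | 13 | 2.2%
(not preserved) | 12 | 2.0%
Other values (112) | 364 | 60.7%

Latin
Value | Count | Frequency (%)
N | 5 | 21.7%
O | 3 | 13.0%
F | 2 | 8.7%
A | 2 | 8.7%
S | 2 | 8.7%
H | 2 | 8.7%
J | 2 | 8.7%
I | 1 | 4.3%
E | 1 | 4.3%
Y | 1 | 4.3%
Other values (2) | 2 | 8.7%

Common
Value | Count | Frequency (%)
(space) | 3 | 100.0%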

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 600 | 95.8%
ASCII | 26 | 4.2%

Most frequent character per block

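Hangul
Value | Count | Frequency (%)
(not preserved) | 47 | 7.8%
(not preserved) | 43 | 7.2%
(not preserved) | 33 | 5.5%
(not preserved) | 25 | 4.2%
(not preserved) | 18 | 3.0%
(not preserved) | 18 | 3.0%
(not preserved) | 14 | 2.3%
(not preserved) | 13 | 2.2%
(not preserved) | 13 | 2.2%
(not preserved) | 12 | 2.0%
Other values (112) | 364 | 60.7%

ASCII
Value | Count | Frequency (%)
N | 5 | 19.2%
(space) | 3 | 11.5%
O | 3 | 11.5%
F | 2 | 7.7%
A | 2 | 7.7%
S | 2 | 7.7%
H | 2 | 7.7%
J | 2 | 7.7%
I | 1 | 3.8%
E | 1 | 3.8%
Other values (3) | 3 | 11.5%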

강좌명 (course name)
Text

Distinct: 172
Distinct (%): 84.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 99
Median length: 33
Mean length: 14.236453
Min length: 2

Characters and Unicode

Total characters: 2890
Distinct characters: 395
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 157
Unique (%): 77.3%

Sample

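1st row: 라탄공예 (rattan craft)
2nd row: 스마트폰으로 이모티콘 크리에이터되기 (becoming an emoticon creator with a smartphone)
3rd row: 바리스타 자격증, 에스프레소 센서리 & 라떼아트 테크닉 (barista certification, espresso sensory and latte art techniques)
4th row: autocad기초 (AutoCAD basics)
5th row: 누구나 할 수 있는 상권분석 (commercial district analysis that anyone can do)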
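Value | Count | Frequency (%)
평생학습대학 (lifelong learning college) | 31 | 5.5%
조경가드닝학과 (landscape gardening department) | 9 | 1.6%
만들기 (making) | 9 | 1.6%
바리스타 (barista) | 8 | 1.4%
역사인문학과 (history and humanities department) | 7 | 1.2%
(not preserved) | 7 | 1.2%
(not preserved) | 5 | 0.9%
과정 (course) | 5 | 0.9%
교육 (education) | 4 | 0.7%
시네마영어 (cinema English) | 4 | 0.7%
Other values (389) | 471 | 84.1%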
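
The table above appears to count whitespace-separated words rather than whole cell values: its counts sum to 560, which matches 203 rows plus the 357 space characters reported for this variable. The sketch below shows one way such a word count could be produced, assuming plain whitespace tokenization. The `word_counts` helper is hypothetical, and the input is a handful of course names taken from the sample rows of this report, not the full column.

```python
from collections import Counter

def word_counts(values):
    """Count whitespace-separated tokens across all strings."""
    counts = Counter()
    for value in values:
        counts.update(value.split())
    return counts

# A few course names from the report's sample rows.
sample = [
    "평생학습대학 조경가드닝학과",
    "평생학습대학 조경가드닝학과",
    "목공 교육",
    "목공 교육",
    "목공 교육",
    "컴퓨터 활용 능력 2급",
]
counts = word_counts(sample)
for word, n in counts.most_common(3):
    print(word, n)

# Consistency check on the reported totals: rows + spaces == listed word total.
listed_total = 31 + 9 + 9 + 8 + 7 + 7 + 5 + 5 + 4 + 4 + 471
print(listed_total, 203 + 357)
```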

Most occurring characters

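Value | Count | Frequency (%)
(space) | 357 | 12.4%
(not preserved) | 90 | 3.1%
, | 70 | 2.4%
(not preserved) | 60 | 2.1%
(not preserved) | 59 | 2.0%
(not preserved) | 50 | 1.7%
(not preserved) | 43 | 1.5%
(not preserved) | 40 | 1.4%
(not preserved) | 37 | 1.3%
(not preserved) | 36 | 1.2%
Other values (385) | 2048 | 70.9%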

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2321 | 80.3%
Space Separator | 357 | 12.4%
Other Punctuation | 112 | 3.9%
Uppercase Letter | 37 | 1.3%
Lowercase Letter | 21 | 0.7%
Decimal Number | 20 | 0.7%
Open Punctuation | 9 | 0.3%
Close Punctuation | 9 | 0.3%
Dash Punctuation | 2 | 0.1%
Math Symbol | 2 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 90 | 3.9%
(not preserved) | 60 | 2.6%
(not preserved) | 59 | 2.5%
(not preserved) | 50 | 2.2%
(not preserved) | 43 | 1.9%
(not preserved) | 40 | 1.7%
(not preserved) | 37 | 1.6%
(not preserved) | 36 | 1.6%
(not preserved) | 36 | 1.6%
(not preserved) | 33 | 1.4%
Other values (341) | 1837 | 79.1%

Uppercase Letter
Value | Count | Frequency (%)
S | 10 | 27.0%
N | 5 | 13.5%
I | 5 | 13.5%
T | 4 | 10.8%
D | 2 | 5.4%
M | 2 | 5.4%
Q | 2 | 5.4%
O | 2 | 5.4%
B | 1 | 2.7%
Y | 1 | 2.7%
Other values (3) | 3 | 8.1%

Lowercase Letter
Value | Count | Frequency (%)
s | 3 | 14.3%
t | 3 | 14.3%
a | 3 | 14.3%
d | 2 | 9.5%
n | 2 | 9.5%
i | 1 | 4.8%
y | 1 | 4.8%
f | 1 | 4.8%
e | 1 | 4.8%
r | 1 | 4.8%
Other values (3) | 3 | 14.3%

Other Punctuation
Value | Count | Frequency (%)
, | 70 | 62.5%
/ | 22 | 19.6%
. | 7 | 6.2%
& | 5 | 4.5%
! | 4 | 3.6%
' | 2 | 1.8%
" | 2 | 1.8%

Decimal Number
Value | Count | Frequency (%)
2 | 9 | 45.0%
1 | 5 | 25.0%
4 | 3 | 15.0%
0 | 2 | 10.0%
3 | 1 | 5.0%

Math Symbol
Value | Count | Frequency (%)
< | 1 | 50.0%
> | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 357 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 9 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 9 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2321 | 80.3%
Common | 511 | 17.7%
Latin | 58 | 2.0%

Most frequent character per script

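Hangul
Value | Count | Frequency (%)
(not preserved) | 90 | 3.9%
(not preserved) | 60 | 2.6%
(not preserved) | 59 | 2.5%
(not preserved) | 50 | 2.2%
(not preserved) | 43 | 1.9%
(not preserved) | 40 | 1.7%
(not preserved) | 37 | 1.6%
(not preserved) | 36 | 1.6%
(not preserved) | 36 | 1.6%
(not preserved) | 33 | 1.4%
Other values (341) | 1837 | 79.1%

Latin
Value | Count | Frequency (%)
S | 10 | 17.2%
N | 5 | 8.6%
I | 5 | 8.6%
T | 4 | 6.9%
s | 3 | 5.2%
t | 3 | 5.2%
a | 3 | 5.2%
d | 2 | 3.4%
D | 2 | 3.4%
M | 2 | 3.4%
Other values (16) | 19 | 32.8%

Common
Value | Count | Frequency (%)
(space) | 357 | 69.9%
, | 70 | 13.7%
/ | 22 | 4.3%
( | 9 | 1.8%
2 | 9 | 1.8%
) | 9 | 1.8%
. | 7 | 1.4%
& | 5 | 1.0%
1 | 5 | 1.0%
! | 4 | 0.8%
Other values (8) | 14 | 2.7%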

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2321 | 80.3%
ASCII | 569 | 19.7%

Most frequent character per block

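ASCII
Value | Count | Frequency (%)
(space) | 357 | 62.7%
, | 70 | 12.3%
/ | 22 | 3.9%
S | 10 | 1.8%
( | 9 | 1.6%
2 | 9 | 1.6%
) | 9 | 1.6%
. | 7 | 1.2%
N | 5 | 0.9%
I | 5 | 0.9%
Other values (34) | 66 | 11.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 90 | 3.9%
(not preserved) | 60 | 2.6%
(not preserved) | 59 | 2.5%
(not preserved) | 50 | 2.2%
(not preserved) | 43 | 1.9%
(not preserved) | 40 | 1.7%
(not preserved) | 37 | 1.6%
(not preserved) | 36 | 1.6%
(not preserved) | 36 | 1.6%
(not preserved) | 33 | 1.4%
Other values (341) | 1837 | 79.1%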

소속분야 (affiliated field)
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Value | Count
인문교양 | 96
문화예술 | 56
직업능력 | 37
학력보완 | 11
기초문해 | 2

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 1
Unique (%): 0.5%

Sample

1st row: 문화예술
2nd row: 문화예술
3rd row: 직업능력
4th row: 직업능력
5th row: 인문교양

Common Values

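Value | Count | Frequency (%)
인문교양 (humanities and liberal arts) | 96 | 47.3%
문화예술 (culture and arts) | 56 | 27.6%
직업능력 (vocational skills) | 37 | 18.2%
학력보완 (academic credential supplementation) | 11 | 5.4%
기초문해 (basic literacy) | 2 | 1.0%
시민참여 (civic participation) | 1 | 0.5%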

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the six common values and their frequencies, as tabulated under Common Values]

분야별값 (field value)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 6
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.1625616
Minimum: 1
Maximum: 6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5th percentile: 2
Q1: 4
Median: 4
Q3: 5
95th percentile: 5
Maximum: 6
Range: 5
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.97907682
Coefficient of variation (CV): 0.23521017
Kurtosis: 0.10111724
Mean: 4.1625616
Median Absolute Deviation (MAD): 1
Skewness: -0.90724801
Sum: 845
Variance: 0.95859143
Monotonicity: Not monotonic
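
Several of these figures can be reproduced from the value counts listed for this variable (5 occurs 96 times, 4 occurs 56 times, and so on). The sketch below rebuilds the column from those counts and recomputes the sum, mean, median, variance, standard deviation, and coefficient of variation with Python's `statistics` module. It assumes the sample (n - 1) form of the variance, which is the form that matches the reported numbers; skewness and kurtosis are left out because their exact estimator is not stated in the report.

```python
from statistics import mean, median, stdev, variance

# Value counts for 분야별값 as listed in this report.
value_counts = {5: 96, 4: 56, 3: 37, 2: 11, 1: 2, 6: 1}

# Expand the counts back into one value per observation.
values = [v for v, n in value_counts.items() for _ in range(n)]

n = len(values)
total = sum(values)
avg = mean(values)
med = median(values)
var = variance(values)  # sample variance, n - 1 denominator
std = stdev(values)     # sample standard deviation
cv = std / avg          # coefficient of variation

print(f"n={n} sum={total} mean={avg:.7f} median={med}")
print(f"variance={var:.8f} std={std:.8f} cv={cv:.8f}")
```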
[Figure: Histogram with fixed size bins (bins=6)]
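
Values by frequency
Value | Count | Frequency (%)
5 | 96 | 47.3%
4 | 56 | 27.6%
3 | 37 | 18.2%
2 | 11 | 5.4%
1 | 2 | 1.0%
6 | 1 | 0.5%

Values in ascending order
Value | Count | Frequency (%)
1 | 2 | 1.0%
2 | 11 | 5.4%
3 | 37 | 18.2%
4 | 56 | 27.6%
5 | 96 | 47.3%
6 | 1 | 0.5%

Values in descending order
Value | Count | Frequency (%)
6 | 1 | 0.5%
5 | 96 | 47.3%
4 | 56 | 27.6%
3 | 37 | 18.2%
2 | 11 | 5.4%
1 | 2 | 1.0%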

데이터기준일 (data reference date)
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB
Minimum: 2024-02-19 00:00:00
Maximum: 2024-02-19 00:00:00
[Figure: Histogram with fixed size bins (bins=1)]
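
A column is flagged as constant when it holds a single distinct value, as 데이터기준일 does here (minimum and maximum are both 2024-02-19). The following sketch shows one simple way to detect such columns in row-oriented data; the `constant_columns` helper is hypothetical, and the rows are a small excerpt built from the sample rows in this report.

```python
def constant_columns(rows, columns):
    """Return the names of columns whose values are identical in every row."""
    constant = []
    for index, name in enumerate(columns):
        distinct_values = {row[index] for row in rows}
        if len(distinct_values) == 1:
            constant.append(name)
    return constant

columns = ["소속분야", "분야별값", "데이터기준일"]
rows = [
    ("문화예술", 4, "2024-02-19"),
    ("직업능력", 3, "2024-02-19"),
    ("인문교양", 5, "2024-02-19"),
]
print(constant_columns(rows, columns))
```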

Interactions


Correlations

Variable | 소속분야 | 분야별값
소속분야 | 1.000 | 1.000
분야별값 | 1.000 | 1.000

Variable | 분야별값 | 소속분야
분야별값 | 1.000 | 1.000
소속분야 | 1.000 | 1.000
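
A correlation of 1.000 between a categorical column and a numeric code is what one would expect if each category maps to exactly one code and vice versa; the matching value counts of the two variables (96, 56, 37, 11, 2, 1) point the same way. The sketch below checks for such a one-to-one mapping. It is not the association measure the report itself computed, the `is_one_to_one` helper is hypothetical, and the pairs come from the five sample rows shown for these variables.

```python
def is_one_to_one(pairs):
    """Return True if every left value maps to one right value and vice versa."""
    forward, backward = {}, {}
    for left, right in pairs:
        # setdefault stores the first partner seen and returns it afterwards.
        if forward.setdefault(left, right) != right:
            return False
        if backward.setdefault(right, left) != left:
            return False
    return True

# (소속분야, 분야별값) pairs from the sample rows.
pairs = [
    ("문화예술", 4),
    ("문화예술", 4),
    ("직업능력", 3),
    ("직업능력", 3),
    ("인문교양", 5),
]
print(is_one_to_one(pairs))
```

When such a mapping holds, one of the two columns is redundant for modelling purposes.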

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

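First rows
Index | 강사명 | 강좌명 | 소속분야 | 분야별값 | 데이터기준일
0 | 이정희 | 라탄공예 | 문화예술 | 4 | 2024-02-19
1 | 한경미 | 스마트폰으로 이모티콘 크리에이터되기 | 문화예술 | 4 | 2024-02-19
2 | 김영선 | 바리스타 자격증, 에스프레소 센서리 & 라떼아트 테크닉 | 직업능력 | 3 | 2024-02-19
3 | 최현호 | autocad기초 | 직업능력 | 3 | 2024-02-19
4 | 강웅 | 누구나 할 수 있는 상권분석 | 인문교양 | 5 | 2024-02-19
5 | 김규석 | 건강증진 자연정혈프로그램(모든 병 집에서 쉽게 고치기) | 인문교양 | 5 | 2024-02-19
6 | 이순휘 | 나를 지키는 뇌건강 | 인문교양 | 5 | 2024-02-19
7 | 김유나 | 영어 동화 읽기 | 인문교양 | 5 | 2024-02-19
8 | 정이온 | 반려인과 함께 사용하는 케어제품 만들기 | 문화예술 | 4 | 2024-02-19
9 | 한아름 | 평생학습대학 조경가드닝학과 | 인문교양 | 5 | 2024-02-19

Last rows
Index | 강사명 | 강좌명 | 소속분야 | 분야별값 | 데이터기준일
193 | 이재은 | 목공 교육 | 문화예술 | 4 | 2024-02-19
194 | 김보경 | 목공 교육 | 문화예술 | 4 | 2024-02-19
195 | 김지선 | 목공 교육 | 문화예술 | 4 | 2024-02-19
196 | 정연아 | 정보/미디어/컴퓨터관련자격증 | 직업능력 | 3 | 2024-02-19
197 | 류점순 | 컴퓨터 활용 능력 2급 | 직업능력 | 3 | 2024-02-19
198 | 박명란 | 조리자격반,중년남성요리,여성요리교실등 | 직업능력 | 3 | 2024-02-19
199 | 이한결 | 마크라메/니트레터링/위빙/인테리어소품 | 문화예술 | 4 | 2024-02-19
200 | 김명아 | 가죽공예 | 문화예술 | 4 | 2024-02-19
201 | 전시환 | 기타교실 | 문화예술 | 4 | 2024-02-19
202 | 김동해 | "간강백세"산야초활용법 | 인문교양 | 5 | 2024-02-19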

Duplicate rows

Most frequently occurring

Index | 강사명 | 강좌명 | 소속분야 | 분야별값 | 데이터기준일 | # duplicates
0 | 한아름 | 평생학습대학 조경가드닝학과 | 인문교양 | 5 | 2024-02-19 | 2
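
A row counts as a duplicate when every column matches another row, so the entry above means the same instructor and course record appears twice. The sketch below shows one way to list such rows and how often each occurs. The `duplicate_rows` helper is hypothetical, and the input is a small excerpt: the duplicated record from the table above plus one other sample row.

```python
from collections import Counter

def duplicate_rows(rows):
    """Return a dict of rows that occur more than once, with their counts."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

# Rows must be hashable, so each record is a tuple.
rows = [
    ("한아름", "평생학습대학 조경가드닝학과", "인문교양", 5, "2024-02-19"),
    ("이정희", "라탄공예", "문화예술", 4, "2024-02-19"),
    ("한아름", "평생학습대학 조경가드닝학과", "인문교양", 5, "2024-02-19"),
]
duplicates = duplicate_rows(rows)
for row, n in duplicates.items():
    print(row, n)
```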