Overview

Dataset statistics

Number of variables6
Number of observations114
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.6 KiB
Average record size in memory50.2 B

Variable types

Text1
Categorical5

Dataset

Description강좌명,구분,장소,교육기간,교육시간,수강료
Author강북구
URLhttps://data.seoul.go.kr/dataList/OA-11577/S/1/datasetView.do

Alerts

수강료 is highly overall correlated with 장소 and 2 other fieldsHigh correlation
장소 is highly overall correlated with 수강료High correlation
교육기간 is highly overall correlated with 교육시간 and 1 other fieldsHigh correlation
교육시간 is highly overall correlated with 교육기간 and 1 other fieldsHigh correlation

Reproduction

Analysis started2024-04-06 09:48:34.663114
Analysis finished2024-04-06 09:48:38.248339
Duration3.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct71
Distinct (%)62.3%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2024-04-06T18:48:38.498233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length21
Mean length10.885965
Min length3

Characters and Unicode

Total characters1241
Distinct characters142
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)43.0%

Sample

1st row?(온라인) 나도 스마트폰 정보검색 왕
2nd row(온라인) 디지털 시대 코딩과 친해지기
3rd row(온라인) 편리한 스마트폰 앱 활용
4th row(온라인) 초보자도 쉽게 배우는 한글 2010
5th row(온라인) 픽슬러로 멋지게 사진 편집하기
ValueCountFrequency (%)
온라인 35
 
13.9%
스마트폰 14
 
5.6%
활용 13
 
5.2%
기초 9
 
3.6%
컴퓨터 7
 
2.8%
만들기 7
 
2.8%
컴퓨터기초 6
 
2.4%
문서작성 6
 
2.4%
인터넷활용 5
 
2.0%
엑셀기초 5
 
2.0%
Other values (86) 144
57.4%
2024-04-06T18:48:39.127187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
137
 
11.0%
72
 
5.8%
62
 
5.0%
) 57
 
4.6%
( 57
 
4.6%
43
 
3.5%
35
 
2.8%
35
 
2.8%
31
 
2.5%
31
 
2.5%
Other values (132) 681
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 896
72.2%
Space Separator 137
 
11.0%
Close Punctuation 57
 
4.6%
Open Punctuation 57
 
4.6%
Uppercase Letter 44
 
3.5%
Decimal Number 41
 
3.3%
Other Punctuation 9
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
 
8.0%
62
 
6.9%
43
 
4.8%
35
 
3.9%
35
 
3.9%
31
 
3.5%
31
 
3.5%
31
 
3.5%
29
 
3.2%
24
 
2.7%
Other values (115) 503
56.1%
Uppercase Letter
ValueCountFrequency (%)
C 10
22.7%
T 9
20.5%
I 9
20.5%
Q 7
15.9%
U 5
11.4%
A 2
 
4.5%
D 2
 
4.5%
Decimal Number
ValueCountFrequency (%)
0 20
48.8%
1 9
22.0%
2 9
22.0%
7 3
 
7.3%
Other Punctuation
ValueCountFrequency (%)
& 6
66.7%
! 2
 
22.2%
? 1
 
11.1%
Space Separator
ValueCountFrequency (%)
137
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 896
72.2%
Common 301
 
24.3%
Latin 44
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
 
8.0%
62
 
6.9%
43
 
4.8%
35
 
3.9%
35
 
3.9%
31
 
3.5%
31
 
3.5%
31
 
3.5%
29
 
3.2%
24
 
2.7%
Other values (115) 503
56.1%
Common
ValueCountFrequency (%)
137
45.5%
) 57
18.9%
( 57
18.9%
0 20
 
6.6%
1 9
 
3.0%
2 9
 
3.0%
& 6
 
2.0%
7 3
 
1.0%
! 2
 
0.7%
? 1
 
0.3%
Latin
ValueCountFrequency (%)
C 10
22.7%
T 9
20.5%
I 9
20.5%
Q 7
15.9%
U 5
11.4%
A 2
 
4.5%
D 2
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 896
72.2%
ASCII 345
 
27.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
137
39.7%
) 57
16.5%
( 57
16.5%
0 20
 
5.8%
C 10
 
2.9%
T 9
 
2.6%
I 9
 
2.6%
1 9
 
2.6%
2 9
 
2.6%
Q 7
 
2.0%
Other values (7) 21
 
6.1%
Hangul
ValueCountFrequency (%)
72
 
8.0%
62
 
6.9%
43
 
4.8%
35
 
3.9%
35
 
3.9%
31
 
3.5%
31
 
3.5%
31
 
3.5%
29
 
3.2%
24
 
2.7%
Other values (115) 503
56.1%

구분
Categorical

Distinct6
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
중급
46 
중급
26 
초급
21 
초급
11 
고급

Length

Max length3
Median length2
Mean length2.3596491
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중급
2nd row중급
3rd row중급
4th row초급
5th row중급

Common Values

ValueCountFrequency (%)
중급 46
40.4%
중급 26
22.8%
초급 21
18.4%
초급 11
 
9.6%
고급 6
 
5.3%
고급 4
 
3.5%

Length

2024-04-06T18:48:39.356710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:48:39.567528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
중급 72
63.2%
초급 32
28.1%
고급 10
 
8.8%

장소
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
솔샘문화정보도서관
40 
강북문화예술회관
39 
강북구 제2교육장
18 
강북구 제1교육장
17 

Length

Max length9
Median length9
Mean length8.6578947
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강북구 제1교육장
2nd row강북구 제2교육장
3rd row강북구 제2교육장
4th row강북구 제2교육장
5th row강북구 제1교육장

Common Values

ValueCountFrequency (%)
솔샘문화정보도서관 40
35.1%
강북문화예술회관 39
34.2%
강북구 제2교육장 18
15.8%
강북구 제1교육장 17
14.9%

Length

2024-04-06T18:48:39.784457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:48:39.965822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
솔샘문화정보도서관 40
26.8%
강북문화예술회관 39
26.2%
강북구 35
23.5%
제2교육장 18
12.1%
제1교육장 17
11.4%

교육기간
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)21.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2020.07.07~2020.07.30
12 
2020.08.04.~2020.8.27
11 
2020.09.02~2020.09.29
11 
2014.10.02~10.30
 
6
2016.07.05~07.28
 
6
Other values (20)
68 

Length

Max length22
Median length21
Mean length17.824561
Min length16

Unique

Unique7 ?
Unique (%)6.1%

Sample

1st row2020.09.02~2020.09.29
2nd row2020.09.02~2020.09.29
3rd row2020.09.02~2020.09.29
4th row2020.09.02~2020.09.29
5th row2020.09.02~2020.09.29

Common Values

ValueCountFrequency (%)
2020.07.07~2020.07.30 12
 
10.5%
2020.08.04.~2020.8.27 11
 
9.6%
2020.09.02~2020.09.29 11
 
9.6%
2014.10.02~10.30 6
 
5.3%
2016.07.05~07.28 6
 
5.3%
2016.07.04~07.27 6
 
5.3%
2014.08-04~08-27 6
 
5.3%
2014.09-01~09-29 6
 
5.3%
2014.09-02~09-30 6
 
5.3%
2014.10.01~10.27 6
 
5.3%
Other values (15) 38
33.3%

Length

2024-04-06T18:48:40.169585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2020.07.07~2020.07.30 12
 
10.5%
2020.08.04.~2020.8.27 12
 
10.5%
2020.09.02~2020.09.29 11
 
9.6%
2014.10.02~10.30 6
 
5.3%
2016.07.05~07.28 6
 
5.3%
2016.07.04~07.27 6
 
5.3%
2014.08-04~08-27 6
 
5.3%
2014.09-01~09-29 6
 
5.3%
2014.09-02~09-30 6
 
5.3%
2014.10.01~10.27 6
 
5.3%
Other values (13) 37
32.5%

교육시간
Categorical

HIGH CORRELATION 

Distinct47
Distinct (%)41.2%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
MON,WED 13시~15시30분
 
6
TUE,THU 13시~15시30분
 
6
TUE,THU 9시30분~12시
 
6
MON,WED 16시~18시30분
 
6
TUE,THU 16시~18시30분
 
6
Other values (42)
84 

Length

Max length24
Median length17
Mean length15.517544
Min length10

Unique

Unique19 ?
Unique (%)16.7%

Sample

1st rowMON,WED 13시~15시30분
2nd rowTUE,THU 16시~18시30분
3rd rowTUE,THU 13시~15시30분
4th rowTUE,THU 9시30분~12시
5th rowTUE,THU 16시~18시30분

Common Values

ValueCountFrequency (%)
MON,WED 13시~15시30분 6
 
5.3%
TUE,THU 13시~15시30분 6
 
5.3%
TUE,THU 9시30분~12시 6
 
5.3%
MON,WED 16시~18시30분 6
 
5.3%
TUE,THU 16시~18시30분 6
 
5.3%
MON,WED 9시30분~12시 5
 
4.4%
금 14시~18시 4
 
3.5%
월,수 13시~15시 4
 
3.5%
금 09시~13시 4
 
3.5%
화,목 13시~15시 4
 
3.5%
Other values (37) 63
55.3%

Length

2024-04-06T18:48:40.391282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
tue,thu 30
 
11.6%
mon,wed 29
 
11.2%
월,수 19
 
7.4%
화,목 18
 
7.0%
15시 16
 
6.2%
13시 13
 
5.0%
16시~18시30분 12
 
4.7%
12
 
4.7%
13시~15시30분 12
 
4.7%
9시30분~12시 11
 
4.3%
Other values (21) 86
33.3%

수강료
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
15000
79 
0
35 

Length

Max length5
Median length5
Mean length3.7719298
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
15000 79
69.3%
0 35
30.7%

Length

2024-04-06T18:48:40.608603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T18:48:40.750763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
15000 79
69.3%
0 35
30.7%

Correlations

2024-04-06T18:48:40.851871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강좌명구분장소교육기간교육시간수강료
강좌명1.0000.9230.9030.8710.0001.000
구분0.9231.0000.3970.7290.8400.645
장소0.9030.3971.0000.7010.2711.000
교육기간0.8710.7290.7011.0000.9751.000
교육시간0.0000.8400.2710.9751.0001.000
수강료1.0000.6451.0001.0001.0001.000
2024-04-06T18:48:41.020507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수강료교육기간교육시간구분장소
수강료1.0000.8910.7730.4620.991
교육기간0.8911.0000.5820.3910.405
교육시간0.7730.5821.0000.4290.091
구분0.4620.3910.4291.0000.263
장소0.9910.4050.0910.2631.000
2024-04-06T18:48:41.192471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분장소교육기간교육시간수강료
구분1.0000.2630.3910.4290.462
장소0.2631.0000.4050.0910.991
교육기간0.3910.4051.0000.5820.891
교육시간0.4290.0910.5821.0000.773
수강료0.4620.9910.8910.7731.000

Missing values

2024-04-06T18:48:38.006598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T18:48:38.174703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

강좌명구분장소교육기간교육시간수강료
0?(온라인) 나도 스마트폰 정보검색 왕중급강북구 제1교육장2020.09.02~2020.09.29MON,WED 13시~15시30분0
1(온라인) 디지털 시대 코딩과 친해지기중급강북구 제2교육장2020.09.02~2020.09.29TUE,THU 16시~18시30분0
2(온라인) 편리한 스마트폰 앱 활용중급강북구 제2교육장2020.09.02~2020.09.29TUE,THU 13시~15시30분0
3(온라인) 초보자도 쉽게 배우는 한글 2010초급강북구 제2교육장2020.09.02~2020.09.29TUE,THU 9시30분~12시0
4(온라인) 픽슬러로 멋지게 사진 편집하기중급강북구 제1교육장2020.09.02~2020.09.29TUE,THU 16시~18시30분0
5(온라인) 나도 유튜버! 동영상 제작하기중급강북구 제1교육장2020.09.02~2020.09.29MON,WED 16시~18시30분0
6(온라인) 쉬운 계산을 위한 엑셀2010중급강북구 제1교육장2020.09.02~2020.09.29TUE,THU 13시~15시30분0
7(온라인) 도전! 스마트폰과 친해지기초급강북구 제1교육장2020.09.02~2020.09.29TUE,THU 9시30분~12시0
8(온라인) 기초가 튼튼한 파워포인트2010초급강북구 제2교육장2020.09.02~2020.09.29MON,WED 16시~18시30분0
9(온라인) 포토스케이프로 멋지게 사진 꾸미기중급강북구 제2교육장2020.09.02~2020.09.29MON,WED 13시~15시30분0
강좌명구분장소교육기간교육시간수강료
104스위시 활용중급강북문화예술회관2014.08-04~08-27월,수 10시~12시15000
105인터넷기초초급솔샘문화정보도서관2014.08-04~08-27월,수 10시~12시15000
106문서작성활용중급솔샘문화정보도서관2014.08-04~08-27월,수 13시~15시15000
107컴퓨터 왕기초(누구나)초급솔샘문화정보도서관2014.08-04~08-27월,수 15시30분~17시30분15000
108인터넷 활용중급강북문화예술회관2014.08-04~08-27월,수 13시~15시15000
109포토샵중급강북문화예술회관2014.08-04~08-27월,수 15시30분~17시30분15000
110사진편집&UCC중급강북문화예술회관2014.08-01~08-29금 14시~18시15000
111ITQ 파워포인트고급솔샘문화정보도서관2014.08-01~08-29금 14시~18시15000
112카페&블로그 만들기중급솔샘문화정보도서관2014.08-01~08-29금 09시~13시15000
113엑셀활용중급강북문화예술회관2014.08-01~08-29금 09시~13시15000