Overview

Dataset statistics

Number of variables: 8
Number of observations: 158
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.0 KiB
Average record size in memory: 64.8 B

Variable types

Categorical: 6
Text: 2

Dataset

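Description: 대구광역시 동구_주민자치센터 강좌_20230810 (Dong-gu, Daegu Metropolitan City: resident community center courses, 2023-08-10)
Author: 대구광역시 동구 (Dong-gu, Daegu Metropolitan City)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15102916&dataSetDetailId=151029161d5b799c53988&provdMethod=FILE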

Alerts

High correlation: 동명 is highly correlated overall with 접수기간 and 1 other field
High correlation: 교육장소 is highly correlated overall with 동명 and 1 other field
High correlation: 접수기간 is highly correlated overall with 동명 and 1 other field
Imbalance: 접수기간 is highly imbalanced (52.6%)

Reproduction

Analysis started: 2023-08-19 13:08:11.239820
Analysis finished: 2023-08-19 13:08:16.995382
Duration: 5.76 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

동명
Categorical

HIGH CORRELATION 

Distinct: 20
Distinct (%): 12.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
안심1동 | 15
동촌동 | 11
신암5동 | 11
신암4동 | 10
불로봉무동 | 10
Other values (15) | 101

Length

Max length: 6
Median length: 4
Mean length: 3.9113924
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신암1동
2nd row: 신암1동
3rd row: 신암1동
4th row: 신암1동
5th row: 신암1동

Common Values

Value | Count | Frequency (%)
안심1동 | 15 | 9.5%
동촌동 | 11 | 7.0%
신암5동 | 11 | 7.0%
신암4동 | 10 | 6.3%
불로봉무동 | 10 | 6.3%
신천1,2동 | 9 | 5.7%
효목1동 | 9 | 5.7%
신암3동 | 8 | 5.1%
신천4동 | 8 | 5.1%
지저동 | 8 | 5.1%
Other values (10) | 59 | 37.3%

Length

Histogram of lengths of the category

강좌명
Text

Distinct: 96
Distinct (%): 60.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 16
Median length: 4
Mean length: 5.0126582
Min length: 2

Characters and Unicode

Total characters: 792
Distinct characters: 146
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
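
As a rough illustration of how such character properties can be read, the sketch below uses Python's standard `unicodedata` module to count Unicode general categories for one course name from this variable's sample rows. The helper name `category_counts` is hypothetical. The two-letter codes it returns (`Lo`, `Po`) are the Unicode abbreviations for the Other Letter and Other Punctuation categories listed in this report. Scripts and blocks are not exposed by the standard library, so they are not covered here.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general categories (e.g. 'Lo', 'Po') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


# One course name taken from the sample rows of this variable.
counts = category_counts("라인.스포츠댄스")
print(counts)  # seven Hangul syllables ('Lo') and one full stop ('Po')
```

Summing such counters over all 158 values would give a table comparable to "Most occurring categories", though this has not been checked against the profiler's own implementation.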

Unique

Unique: 73
Unique (%): 46.2%

Sample

1st row: 국악교실
2nd row: 가요교실
3rd row: 요가교실
4th row: 라인.스포츠댄스
5th row: 서예교실
Value | Count | Frequency (%)
서예교실 | 12 | 7.1%
가요교실 | 11 | 6.5%
요가교실 | 6 | 3.5%
탁구교실 | 6 | 3.5%
기체조 | 5 | 2.9%
라인댄스 | 4 | 2.4%
에어로빅 | 4 | 2.4%
한국무용 | 3 | 1.8%
풍물교실 | 3 | 1.8%
오카리나 | 3 | 1.8%
Other values (95) | 113 | 66.5%

Most occurring characters

Value | Count | Frequency (%)
 | 72 | 9.1%
 | 71 | 9.0%
 | 38 | 4.8%
 | 35 | 4.4%
 | 32 | 4.0%
 | 22 | 2.8%
( | 21 | 2.7%
) | 21 | 2.7%
 | 17 | 2.1%
 | 16 | 2.0%
Other values (136) | 447 | 56.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 714 | 90.2%
Open Punctuation | 21 | 2.7%
Close Punctuation | 21 | 2.7%
Space Separator | 16 | 2.0%
Uppercase Letter | 12 | 1.5%
Other Punctuation | 6 | 0.8%
Decimal Number | 2 | 0.3%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 72 | 10.1%
 | 71 | 9.9%
 | 38 | 5.3%
 | 35 | 4.9%
 | 32 | 4.5%
 | 22 | 3.1%
 | 17 | 2.4%
 | 15 | 2.1%
 | 14 | 2.0%
 | 11 | 1.5%
Other values (125) | 387 | 54.2%

Uppercase Letter
Value | Count | Frequency (%)
P | 4 | 33.3%
B | 3 | 25.0%
A | 3 | 25.0%
O | 2 | 16.7%

Other Punctuation
Value | Count | Frequency (%)
. | 4 | 66.7%
, | 2 | 33.3%

Decimal Number
Value | Count | Frequency (%)
2 | 1 | 50.0%
1 | 1 | 50.0%

Open Punctuation
Value | Count | Frequency (%)
( | 21 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 21 | 100.0%

Space Separator
Value | Count | Frequency (%)
 | 16 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 714 | 90.2%
Common | 66 | 8.3%
Latin | 12 | 1.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 72 | 10.1%
 | 71 | 9.9%
 | 38 | 5.3%
 | 35 | 4.9%
 | 32 | 4.5%
 | 22 | 3.1%
 | 17 | 2.4%
 | 15 | 2.1%
 | 14 | 2.0%
 | 11 | 1.5%
Other values (125) | 387 | 54.2%

Common
Value | Count | Frequency (%)
( | 21 | 31.8%
) | 21 | 31.8%
 | 16 | 24.2%
. | 4 | 6.1%
, | 2 | 3.0%
2 | 1 | 1.5%
1 | 1 | 1.5%

Latin
Value | Count | Frequency (%)
P | 4 | 33.3%
B | 3 | 25.0%
A | 3 | 25.0%
O | 2 | 16.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 714 | 90.2%
ASCII | 78 | 9.8%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 72 | 10.1%
 | 71 | 9.9%
 | 38 | 5.3%
 | 35 | 4.9%
 | 32 | 4.5%
 | 22 | 3.1%
 | 17 | 2.4%
 | 15 | 2.1%
 | 14 | 2.0%
 | 11 | 1.5%
Other values (125) | 387 | 54.2%

ASCII
Value | Count | Frequency (%)
( | 21 | 26.9%
) | 21 | 26.9%
 | 16 | 20.5%
P | 4 | 5.1%
. | 4 | 5.1%
B | 3 | 3.8%
A | 3 | 3.8%
O | 2 | 2.6%
, | 2 | 2.6%
2 | 1 | 1.3%

요일
Categorical

Distinct: 19
Distinct (%): 12.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
화+목 | 23
월+수+금 | 18
월+수 | 14
 | 14
 | 14
Other values (14) | 75

Length

Max length: 13
Median length: 9
Mean length: 3.0632911
Min length: 1

Unique

Unique: 5
Unique (%): 3.2%

Sample

1st row: 월+목
2nd row: 월+목
3rd row: 화+금
4th row: 화+금
5th row: 화+목

Common Values

Value | Count | Frequency (%)
화+목 | 23 | 14.6%
월+수+금 | 18 | 11.4%
월+수 | 14 | 8.9%
 | 14 | 8.9%
 | 14 | 8.9%
월+화+수+목+금 | 11 | 7.0%
 | 11 | 7.0%
 | 10 | 6.3%
월+목 | 10 | 6.3%
화+금 | 9 | 5.7%
Other values (9) | 24 | 15.2%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
화+목 | 23 | 14.0%
월+수+금 | 18 | 11.0%
 | 15 | 9.1%
월+수 | 14 | 8.5%
 | 14 | 8.5%
 | 12 | 7.3%
월+화+수+목+금 | 11 | 6.7%
 | 11 | 6.7%
월+목 | 10 | 6.1%
화+금 | 9 | 5.5%
Other values (11) | 27 | 16.5%

강의시간
Text

Distinct: 70
Distinct (%): 44.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Length

Max length: 11
Median length: 11
Mean length: 11
Min length: 11

Characters and Unicode

Total characters: 1738
Distinct characters: 13
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 50
Unique (%): 31.6%

Sample

1st row: 10:00~12:00
2nd row: 14:00~16:00
3rd row: 10:00~12:00
4th row: 14:00~16:00
5th row: 10:00~16:00
Value | Count | Frequency (%)
10:00~12:00 | 18 | 11.4%
10:00~11:00 | 12 | 7.6%
09:00~18:00 | 10 | 6.3%
14:00~16:00 | 8 | 5.1%
14:00~15:00 | 8 | 5.1%
11:00~12:00 | 7 | 4.4%
10:30~12:00 | 6 | 3.8%
10:30~11:30 | 5 | 3.2%
09:30~10:30 | 4 | 2.5%
15:30~18:00 | 4 | 2.5%
Other values (60) | 76 | 48.1%

Most occurring characters

Value | Count | Frequency (%)
0 | 626 | 36.0%
1 | 330 | 19.0%
: | 316 | 18.2%
~ | 156 | 9.0%
3 | 94 | 5.4%
2 | 62 | 3.6%
4 | 39 | 2.2%
5 | 35 | 2.0%
9 | 27 | 1.6%
6 | 21 | 1.2%
Other values (3) | 32 | 1.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1264 | 72.7%
Other Punctuation | 316 | 18.2%
Math Symbol | 158 | 9.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 626 | 49.5%
1 | 330 | 26.1%
3 | 94 | 7.4%
2 | 62 | 4.9%
4 | 39 | 3.1%
5 | 35 | 2.8%
9 | 27 | 2.1%
6 | 21 | 1.7%
8 | 19 | 1.5%
7 | 11 | 0.9%

Math Symbol
Value | Count | Frequency (%)
~ | 156 | 98.7%
 | 2 | 1.3%

Other Punctuation
Value | Count | Frequency (%)
: | 316 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1738 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 626 | 36.0%
1 | 330 | 19.0%
: | 316 | 18.2%
~ | 156 | 9.0%
3 | 94 | 5.4%
2 | 62 | 3.6%
4 | 39 | 2.2%
5 | 35 | 2.0%
9 | 27 | 1.6%
6 | 21 | 1.2%
Other values (3) | 32 | 1.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1736 | 99.9%
None | 2 | 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 626 | 36.1%
1 | 330 | 19.0%
: | 316 | 18.2%
~ | 156 | 9.0%
3 | 94 | 5.4%
2 | 62 | 3.6%
4 | 39 | 2.2%
5 | 35 | 2.0%
9 | 27 | 1.6%
6 | 21 | 1.2%
Other values (2) | 30 | 1.7%

None
Value | Count | Frequency (%)
 | 2 | 100.0%

수강료
Categorical

Distinct: 22
Distinct (%): 13.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
1개월 10,000원 | 55
무료 | 23
3개월 30,000원 | 23
3개월 45,000원 | 16
1개월 20,000원 | 9
Other values (17) | 32

Length

Max length: 54
Median length: 11
Mean length: 9.8987342
Min length: 2

Unique

Unique: 13
Unique (%): 8.2%

Sample

1st row: 1개월 10,000원
2nd row: 1개월 10,000원
3rd row: 1개월 10,000원
4th row: 1개월 14,000원
5th row: 1개월 10,000원

Common Values

Value | Count | Frequency (%)
1개월 10,000원 | 55 | 34.8%
무료 | 23 | 14.6%
3개월 30,000원 | 23 | 14.6%
3개월 45,000원 | 16 | 10.1%
1개월 20,000원 | 9 | 5.7%
3개월 50,000원 | 9 | 5.7%
1개월 15,000원 | 5 | 3.2%
3개월 40,000원 | 3 | 1.9%
1개월 30,000원 | 2 | 1.3%
1개월 40,000원 | 1 | 0.6%
Other values (12) | 12 | 7.6%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
1개월 | 77 | 25.5%
10,000원 | 56 | 18.5%
3개월 | 56 | 18.5%
30,000원 | 25 | 8.3%
무료 | 24 | 7.9%
45,000원 | 16 | 5.3%
20,000원 | 11 | 3.6%
50,000원 | 9 | 3.0%
15,000원 | 5 | 1.7%
40,000원 | 4 | 1.3%
Other values (16) | 19 | 6.3%

접수기간
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
수시 | 124
상시 | 27
연중 | 5
분기 | 2

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수시
2nd row: 수시
3rd row: 수시
4th row: 수시
5th row: 수시

Common Values

Value | Count | Frequency (%)
수시 | 124 | 78.5%
상시 | 27 | 17.1%
연중 | 5 | 3.2%
분기 | 2 | 1.3%
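
The 52.6% imbalance reported for this variable can be reproduced from the counts above. The sketch below assumes the score is one minus the Shannon entropy of the class frequencies divided by its maximum (log2 of the number of classes). That definition is an assumption about how the profiler scores imbalance, not something the report states, and the function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts: list[int]) -> float:
    """Return 1 - H / H_max, where H is the Shannon entropy in bits.

    0.0 means perfectly balanced classes; values near 1.0 mean that
    one class dominates.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # convention chosen for this sketch only
    total = sum(counts)
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in counts if c > 0
    )
    return 1.0 - entropy / math.log2(n_classes)


# Counts for 수시, 상시, 연중, 분기 from the Common Values table.
score = imbalance_score([124, 27, 5, 2])
print(f"{score:.1%}")  # 52.6%
```

Under this definition the score is driven mostly by 수시, which alone accounts for 78.5% of the 158 rows.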

Length

Histogram of lengths of the category

Common Values (Plot)


교육장소
Categorical

HIGH CORRELATION 

Distinct: 33
Distinct (%): 20.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
안심1동 행정복지센터 | 14
신암5동 행정복지센터 | 11
신암4동 행정복지센터 | 10
효목1동 행정복지센터 | 9
동촌동 행정복지센터 | 9
Other values (28) | 105

Length

Max length: 13
Median length: 11
Mean length: 10.702532
Min length: 3

Unique

Unique: 10
Unique (%): 6.3%

Sample

1st row: 신암1동 행정복지센터
2nd row: 신암1동 행정복지센터
3rd row: 신암1동 행정복지센터
4th row: 신암1동 행정복지센터
5th row: 새터경로당 2층

Common Values

Value | Count | Frequency (%)
안심1동 행정복지센터 | 14 | 8.9%
신암5동 행정복지센터 | 11 | 7.0%
신암4동 행정복지센터 | 10 | 6.3%
효목1동 행정복지센터 | 9 | 5.7%
동촌동 행정복지센터 | 9 | 5.7%
신천4동 행정복지센터 | 8 | 5.1%
안심2동 행정복지센터 | 7 | 4.4%
안심4동 행정복지센터 | 7 | 4.4%
지저동 행정복지센터 | 7 | 4.4%
신천1,2동 행정복지센터 | 6 | 3.8%
Other values (23) | 70 | 44.3%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
행정복지센터 | 134 | 42.3%
안심1동 | 14 | 4.4%
신암5동 | 11 | 3.5%
신암4동 | 10 | 3.2%
효목1동 | 9 | 2.8%
동촌동 | 9 | 2.8%
신천4동 | 8 | 2.5%
안심4동 | 7 | 2.2%
지저동 | 7 | 2.2%
안심2동 | 7 | 2.2%
Other values (31) | 101 | 31.9%

모집정원
Categorical

Distinct: 25
Distinct (%): 15.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 KiB

Value | Count
20명 | 52
30명 | 20
25명 | 13
15명 | 12
40명 | 9
Other values (20) | 52

Length

Max length: 4
Median length: 3
Mean length: 2.9936709
Min length: 2

Unique

Unique: 8
Unique (%): 5.1%

Sample

1st row: 20명
2nd row: 40명
3rd row: 30명
4th row: 30명
5th row: 20명

Common Values

Value | Count | Frequency (%)
20명 | 52 | 32.9%
30명 | 20 | 12.7%
25명 | 13 | 8.2%
15명 | 12 | 7.6%
40명 | 9 | 5.7%
10명 | 7 | 4.4%
35명 | 7 | 4.4%
60명 | 5 | 3.2%
50명 | 3 | 1.9%
32명 | 3 | 1.9%
Other values (15) | 27 | 17.1%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
20명 | 52 | 32.9%
30명 | 20 | 12.7%
25명 | 13 | 8.2%
15명 | 12 | 7.6%
40명 | 9 | 5.7%
10명 | 7 | 4.4%
35명 | 7 | 4.4%
60명 | 5 | 3.2%
12명 | 3 | 1.9%
18명 | 3 | 1.9%
Other values (15) | 27 | 17.1%

Correlations

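 | 동명 | 강좌명 | 요일 | 강의시간 | 수강료 | 접수기간 | 교육장소 | 모집정원
동명 | 1.000 | 0.000 | 0.421 | 0.148 | 0.710 | 0.988 | 1.000 | 0.655
강좌명 | 0.000 | 1.000 | 0.000 | 0.915 | 0.967 | 0.670 | 0.000 | 0.860
요일 | 0.421 | 0.000 | 1.000 | 0.000 | 0.807 | 0.000 | 0.695 | 0.608
강의시간 | 0.148 | 0.915 | 0.000 | 1.000 | 0.000 | 0.755 | 0.000 | 0.890
수강료 | 0.710 | 0.967 | 0.807 | 0.000 | 1.000 | 0.729 | 0.881 | 0.611
접수기간 | 0.988 | 0.670 | 0.000 | 0.755 | 0.729 | 1.000 | 0.961 | 0.000
교육장소 | 1.000 | 0.000 | 0.695 | 0.000 | 0.881 | 0.961 | 1.000 | 0.601
모집정원 | 0.655 | 0.860 | 0.608 | 0.890 | 0.611 | 0.000 | 0.601 | 1.000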
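
 | 접수기간 | 요일 | 모집정원 | 수강료 | 동명 | 교육장소
접수기간 | 1.000 | 0.000 | 0.000 | 0.456 | 0.816 | 0.763
요일 | 0.000 | 1.000 | 0.205 | 0.368 | 0.131 | 0.237
모집정원 | 0.000 | 0.205 | 1.000 | 0.198 | 0.225 | 0.169
수강료 | 0.456 | 0.368 | 0.198 | 1.000 | 0.275 | 0.392
동명 | 0.816 | 0.131 | 0.225 | 0.275 | 1.000 | 0.952
교육장소 | 0.763 | 0.237 | 0.169 | 0.392 | 0.952 | 1.000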
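
 | 동명 | 요일 | 수강료 | 접수기간 | 교육장소 | 모집정원
동명 | 1.000 | 0.131 | 0.275 | 0.816 | 0.952 | 0.225
요일 | 0.131 | 1.000 | 0.368 | 0.000 | 0.237 | 0.205
수강료 | 0.275 | 0.368 | 1.000 | 0.456 | 0.392 | 0.198
접수기간 | 0.816 | 0.000 | 0.456 | 1.000 | 0.763 | 0.000
교육장소 | 0.952 | 0.237 | 0.392 | 0.763 | 1.000 | 0.169
모집정원 | 0.225 | 0.205 | 0.198 | 0.000 | 0.169 | 1.000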
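
Association between categorical pairs such as these is often measured with Cramér's V. The sketch below assumes that is what the matrices contain, which the report itself does not state. It also omits any bias correction the profiler may apply, so it should not be expected to reproduce the values above exactly. The function name `cramers_v` and the toy rows are hypothetical.

```python
import math
from collections import Counter


def cramers_v(xs: list[str], ys: list[str]) -> float:
    """Plain (uncorrected) Cramér's V between two categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))  # observed cell counts
    row = Counter(xs)             # marginal counts of the first variable
    col = Counter(ys)             # marginal counts of the second variable
    k = min(len(row), len(col)) - 1
    if n == 0 or k == 0:
        return 0.0  # undefined when a variable has a single level
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


# Toy data: every first-variable level maps to exactly one second-variable
# level, so the association is perfect.
dong = ["A", "A", "B", "B", "C", "C"]
venue = ["a", "a", "b", "b", "c", "c"]
print(round(cramers_v(dong, venue), 3))  # 1.0
```

A value near 1, like the 0.952 shown for 동명 and 교육장소, would be consistent with each neighbourhood holding most of its courses at a single venue.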

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

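First rows

Index | 동명 | 강좌명 | 요일 | 강의시간 | 수강료 | 접수기간 | 교육장소 | 모집정원
0 | 신암1동 | 국악교실 | 월+목 | 10:00~12:00 | 1개월 10,000원 | 수시 | 신암1동 행정복지센터 | 20명
1 | 신암1동 | 가요교실 | 월+목 | 14:00~16:00 | 1개월 10,000원 | 수시 | 신암1동 행정복지센터 | 40명
2 | 신암1동 | 요가교실 | 화+금 | 10:00~12:00 | 1개월 10,000원 | 수시 | 신암1동 행정복지센터 | 30명
3 | 신암1동 | 라인.스포츠댄스 | 화+금 | 14:00~16:00 | 1개월 14,000원 | 수시 | 신암1동 행정복지센터 | 30명
4 | 신암1동 | 서예교실 | 화+목 | 10:00~16:00 | 1개월 10,000원 | 수시 | 새터경로당 2층 | 20명
5 | 신암1동 | 문화탐방 | 넷째주 화 | 09:00~18:00 | 1개월 30,000원 | 수시 | 신암1동 행정복지센터 | 40명
6 | 신암2동 | 가요교실 |  | 10:30~12:00 | 3개월 30,000원 | 수시 | 신암2동 행정복지센터 | 48명
7 | 신암2동 | 서예교실 | 화+금 | 13:00~15:00 | 3개월 30,000원 | 수시 | 신암2동 행정복지센터 | 10명
8 | 신암2동 | 풍물교실.힐링장구 | 수+금 | 10:00~12:00 | 3개월 30,000원 | 수시 | 신암2동 행정복지센터 | 16명
9 | 신암2동 | 동화구연.스피치 |  | 15:40~17:30 | 3개월 30,000원 | 수시 | 신암2동 행정복지센터 | 18명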
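
Last rows

Index | 동명 | 강좌명 | 요일 | 강의시간 | 수강료 | 접수기간 | 교육장소 | 모집정원
148 | 안심4동 | 줌바댄스 | 월+목 | 14:00~15:00 | 1개월 10,000원 | 수시 | 안심4동 행정복지센터 | 20명
149 | 안심4동 | 가요교실 |  | 12:00~13:30 | 1개월 10,000원 | 수시 | 안심4동 행정복지센터 | 60명
150 | 안심4동 | 가요교실 |  | 12:30~14:00 | 1개월 10,000원 | 수시 | 안심4동 행정복지센터 | 60명
151 | 안심4동 | 음악난타 | 수+금 | 15:40~16:40 | 1개월 10,000원 | 수시 | 안심4동 행정복지센터 | 20명
152 | 안심4동 | 경기민요 | 화+수 | 14:00~15:00 | 1개월 10,000원 | 수시 | 안심4동 행정복지센터 | 20명
153 | 안심4동 | 서예교실 | 수+금 | 14:00~17:00 | 1개월 10,000원 | 수시 | 안심4동 행정복지센터 | 20명
154 | 공산동 | 다이어트댄스 | 월+수+금 | 09:00~10:00 | 1개월 20,000원 | 수시 | 공산동 행정복지센터 | 20명
155 | 공산동 | 라인댄스 | 화+목 | 10:30~11:30 | 3개월 50,000원 | 수시 | 공산동 행정복지센터 | 20명
156 | 공산동 | 한국무용 | 월+금 | 10:30~11:30 | 3개월 50,000원 | 수시 | 공산동 행정복지센터 | 20명
157 | 공산동 | 몸살림운동 |  | 10:10~12:00 | 3개월 50,000원 | 수시 | 공산동 행정복지센터 | 20명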
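
Because every column is stored as text, numeric analysis needs a parsing step first. The sketch below shows one way to turn 강의시간, 수강료, and 모집정원 strings like those in the rows above into numbers. The function names, the regular expressions, and the set of tilde-like separators accepted are assumptions made for illustration. Values that do not fit the patterns (for example the longer 수강료 entries) are returned as `None` rather than guessed.

```python
import re
from typing import Optional

# Assumed formats, based only on the values visible in this report.
TIME_RE = re.compile(r"^(\d{2}):(\d{2})[~～∼](\d{2}):(\d{2})$")
FEE_RE = re.compile(r"^(\d+)개월 ([\d,]+)원$")
CAPACITY_RE = re.compile(r"^(\d+)명$")


def duration_minutes(value: str) -> Optional[int]:
    """Length of a class in minutes, e.g. '10:00~12:00' -> 120."""
    m = TIME_RE.match(value)
    if m is None:
        return None
    h1, m1, h2, m2 = map(int, m.groups())
    return (h2 * 60 + m2) - (h1 * 60 + m1)


def monthly_fee_won(value: str) -> Optional[float]:
    """Fee per month in won; '무료' (free) is treated as 0."""
    if value == "무료":
        return 0.0
    m = FEE_RE.match(value)
    if m is None:
        return None
    months = int(m.group(1))
    if months == 0:
        return None
    total = int(m.group(2).replace(",", ""))
    return total / months


def capacity(value: str) -> Optional[int]:
    """Enrollment capacity as an integer, e.g. '20명' -> 20."""
    m = CAPACITY_RE.match(value)
    return int(m.group(1)) if m else None


print(duration_minutes("10:00~12:00"))    # 120
print(monthly_fee_won("3개월 45,000원"))  # 15000.0
print(capacity("20명"))                   # 20
```

Normalizing fees to a per-month figure makes the 1개월 and 3개월 entries comparable, which the raw strings are not.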