Overview

Dataset statistics

Number of variables7
Number of observations85
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.9 KiB
Average record size in memory58.6 B

Variable types

Categorical3
Text1
DateTime2
Numeric1

Dataset

Description대구공공시설관리공단(구.대구시설공단)이 운영하는 서재문화체육센터의 (강좌명, 시작 및 종료시간, 수강료 등)데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15120522/fileData.do

Alerts

수강료 is highly overall correlated with 분류 and 1 other fieldsHigh correlation
분류 is highly overall correlated with 수강료 High correlation
강습요일 is highly overall correlated with 수강료 High correlation
강좌명_반명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:18:05.010289
Analysis finished2023-12-12 12:18:06.269282
Duration1.26 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분류
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)8.2%
Missing0
Missing (%)0.0%
Memory size812.0 B
수영
53 
탁구
11 
배드민턴
댄스
 
5
아쿠아로빅
 
4
Other values (2)

Length

Max length5
Median length2
Mean length2.3294118
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수영
2nd row수영
3rd row수영
4th row수영
5th row수영

Common Values

ValueCountFrequency (%)
수영 53
62.4%
탁구 11
 
12.9%
배드민턴 6
 
7.1%
댄스 5
 
5.9%
아쿠아로빅 4
 
4.7%
요가 4
 
4.7%
힐링몸짱 2
 
2.4%

Length

2023-12-12T21:18:06.378831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:18:06.561755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수영 53
62.4%
탁구 11
 
12.9%
배드민턴 6
 
7.1%
댄스 5
 
5.9%
아쿠아로빅 4
 
4.7%
요가 4
 
4.7%
힐링몸짱 2
 
2.4%

강좌명_반명
Text

UNIQUE 

Distinct85
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-12T21:18:06.856162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length12.882353
Min length9

Characters and Unicode

Total characters1095
Distinct characters59
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)100.0%

Sample

1st row주3회 초급반(07시)
2nd row주2회 초급반(20시)
3rd row주3회 중급반(06시)
4th row주3회 중급반(11시)
5th row주3회 중급반(20시)
ValueCountFrequency (%)
주3회 48
24.5%
주2회 37
18.9%
10시 6
 
3.1%
고급탁구 6
 
3.1%
배드민턴 6
 
3.1%
초급탁구 5
 
2.6%
19시 5
 
2.6%
11시 4
 
2.0%
15시 4
 
2.0%
연수반(17시 2
 
1.0%
Other values (55) 73
37.2%
2023-12-12T21:18:07.334731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
111
 
10.1%
85
 
7.8%
) 85
 
7.8%
85
 
7.8%
( 85
 
7.8%
85
 
7.8%
1 63
 
5.8%
53
 
4.8%
3 49
 
4.5%
2 49
 
4.5%
Other values (49) 345
31.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 533
48.7%
Decimal Number 255
23.3%
Space Separator 111
 
10.1%
Close Punctuation 85
 
7.8%
Open Punctuation 85
 
7.8%
Uppercase Letter 25
 
2.3%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
85
15.9%
85
15.9%
85
15.9%
53
9.9%
29
 
5.4%
27
 
5.1%
22
 
4.1%
15
 
2.8%
15
 
2.8%
11
 
2.1%
Other values (29) 106
19.9%
Decimal Number
ValueCountFrequency (%)
1 63
24.7%
3 49
19.2%
2 49
19.2%
0 40
15.7%
9 18
 
7.1%
6 14
 
5.5%
7 14
 
5.5%
5 4
 
1.6%
8 2
 
0.8%
4 2
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
A 10
40.0%
B 10
40.0%
P 2
 
8.0%
O 1
 
4.0%
C 1
 
4.0%
K 1
 
4.0%
Space Separator
ValueCountFrequency (%)
111
100.0%
Close Punctuation
ValueCountFrequency (%)
) 85
100.0%
Open Punctuation
ValueCountFrequency (%)
( 85
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 537
49.0%
Hangul 533
48.7%
Latin 25
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
85
15.9%
85
15.9%
85
15.9%
53
9.9%
29
 
5.4%
27
 
5.1%
22
 
4.1%
15
 
2.8%
15
 
2.8%
11
 
2.1%
Other values (29) 106
19.9%
Common
ValueCountFrequency (%)
111
20.7%
) 85
15.8%
( 85
15.8%
1 63
11.7%
3 49
9.1%
2 49
9.1%
0 40
 
7.4%
9 18
 
3.4%
6 14
 
2.6%
7 14
 
2.6%
Other values (4) 9
 
1.7%
Latin
ValueCountFrequency (%)
A 10
40.0%
B 10
40.0%
P 2
 
8.0%
O 1
 
4.0%
C 1
 
4.0%
K 1
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 562
51.3%
Hangul 533
48.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
111
19.8%
) 85
15.1%
( 85
15.1%
1 63
11.2%
3 49
8.7%
2 49
8.7%
0 40
 
7.1%
9 18
 
3.2%
6 14
 
2.5%
7 14
 
2.5%
Other values (10) 34
 
6.0%
Hangul
ValueCountFrequency (%)
85
15.9%
85
15.9%
85
15.9%
53
9.9%
29
 
5.4%
27
 
5.1%
22
 
4.1%
15
 
2.8%
15
 
2.8%
11
 
2.1%
Other values (29) 106
19.9%

강습요일
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size812.0 B
월,수,금
48 
화,목
37 

Length

Max length5
Median length5
Mean length4.1294118
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row월,수,금
2nd row화,목
3rd row월,수,금
4th row월,수,금
5th row월,수,금

Common Values

ValueCountFrequency (%)
월,수,금 48
56.5%
화,목 37
43.5%

Length

2023-12-12T21:18:07.542894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:18:07.723058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
월,수,금 48
56.5%
화,목 37
43.5%
Distinct15
Distinct (%)17.6%
Missing0
Missing (%)0.0%
Memory size812.0 B
Minimum2023-12-12 06:00:00
Maximum2023-12-12 21:00:00
2023-12-12T21:18:07.866004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:18:08.027328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
Distinct19
Distinct (%)22.4%
Missing0
Missing (%)0.0%
Memory size812.0 B
Minimum2023-12-12 06:50:00
Maximum2023-12-12 22:00:00
2023-12-12T21:18:08.166931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:18:08.318057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)

대상
Categorical

Distinct2
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Memory size812.0 B
성인
73 
어린이
12 

Length

Max length3
Median length2
Mean length2.1411765
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성인
2nd row성인
3rd row성인
4th row성인
5th row성인

Common Values

ValueCountFrequency (%)
성인 73
85.9%
어린이 12
 
14.1%

Length

2023-12-12T21:18:08.451978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:18:08.597204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
성인 73
85.9%
어린이 12
 
14.1%

수강료
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)14.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49117.647
Minimum25000
Maximum104000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size897.0 B
2023-12-12T21:18:08.713360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum25000
5-th percentile28400
Q132000
median50000
Q350000
95-th percentile90000
Maximum104000
Range79000
Interquartile range (IQR)18000

Descriptive statistics

Standard deviation19775.774
Coefficient of variation (CV)0.40262055
Kurtosis0.93879382
Mean49117.647
Median Absolute Deviation (MAD)18000
Skewness1.2546531
Sum4175000
Variance3.9108123 × 108
MonotonicityNot monotonic
2023-12-12T21:18:08.871841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
50000 27
31.8%
32000 19
22.4%
45000 6
 
7.1%
42000 6
 
7.1%
84000 6
 
7.1%
30000 5
 
5.9%
28000 4
 
4.7%
90000 3
 
3.5%
60000 3
 
3.5%
104000 3
 
3.5%
Other values (2) 3
 
3.5%
ValueCountFrequency (%)
25000 1
 
1.2%
28000 4
 
4.7%
30000 5
 
5.9%
32000 19
22.4%
42000 6
 
7.1%
45000 6
 
7.1%
50000 27
31.8%
60000 3
 
3.5%
71000 2
 
2.4%
84000 6
 
7.1%
ValueCountFrequency (%)
104000 3
 
3.5%
90000 3
 
3.5%
84000 6
 
7.1%
71000 2
 
2.4%
60000 3
 
3.5%
50000 27
31.8%
45000 6
 
7.1%
42000 6
 
7.1%
32000 19
22.4%
30000 5
 
5.9%

Interactions

2023-12-12T21:18:05.429937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:18:08.992975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류강좌명_반명강습요일시작시간종료시간대상수강료
분류1.0001.0000.0000.8040.8610.0750.784
강좌명_반명1.0001.0001.0001.0001.0001.0001.000
강습요일0.0001.0001.0000.0000.0000.0000.998
시작시간0.8041.0000.0001.0000.9941.0000.569
종료시간0.8611.0000.0000.9941.0001.0000.735
대상0.0751.0000.0001.0001.0001.0000.575
수강료0.7841.0000.9980.5690.7350.5751.000
2023-12-12T21:18:09.132101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강습요일분류대상
강습요일1.0000.0000.000
분류0.0001.0000.072
대상0.0000.0721.000
2023-12-12T21:18:09.255011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수강료분류강습요일대상
수강료1.0000.5670.9250.407
분류0.5671.0000.0000.072
강습요일0.9250.0001.0000.000
대상0.4070.0720.0001.000

Missing values

2023-12-12T21:18:05.602471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:18:06.211831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분류강좌명_반명강습요일시작시간종료시간대상수강료
0수영주3회 초급반(07시)월,수,금07:0007:50성인50000
1수영주2회 초급반(20시)화,목20:0020:50성인32000
2수영주3회 중급반(06시)월,수,금06:0006:50성인50000
3수영주3회 중급반(11시)월,수,금11:0011:50성인50000
4수영주3회 중급반(20시)월,수,금20:0020:50성인50000
5수영주2회 중급반(07시)화,목07:0007:50성인32000
6수영주2회 상급반(06시)화,목06:0006:50성인32000
7수영주2회 상급반(07시)화,목07:0007:50성인32000
8수영주2회 고급반(19시)화,목19:0019:50성인32000
9수영주3회 교정A반(06시)월,수,금06:0006:50성인50000
분류강좌명_반명강습요일시작시간종료시간대상수강료
75탁구주3회 고급탁구 (11시)월,수,금11:0012:00성인104000
76탁구주3회 초급탁구 (15시)월,수,금15:0017:00성인84000
77탁구주3회 고급탁구 (15시)월,수,금15:0016:00성인104000
78탁구주3회 초급탁구 (19시)월,수,금19:0020:00성인84000
79탁구주3회 고급탁구 (19시)월,수,금19:0021:00성인104000
80탁구주2회 초급탁구 (10시)화,목10:0012:00성인71000
81탁구주2회 고급탁구 (11시)화,목11:0012:00성인84000
82탁구주2회 초급탁구 (15시)화,목15:0017:00성인71000
83탁구주2회 고급탁구 (15시)화,목15:0016:00성인84000
84탁구주2회 고급탁구 (19시)화,목19:0021:00성인84000