Overview

Dataset statistics

Number of variables7
Number of observations66
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory59.0 B

Variable types

Categorical5
Text1
Numeric1

Dataset

Description대구시설공단 올림픽기념국민생활관의 강습별 이용시간, 금액에 관한 정보
Author대구시설공단
URLhttps://www.data.go.kr/data/15017228/fileData.do

Alerts

시작시간 is highly overall correlated with 분류 and 2 other fieldsHigh correlation
종료시간 is highly overall correlated with 분류 and 2 other fieldsHigh correlation
수강료 is highly overall correlated with 강습요일High correlation
분류 is highly overall correlated with 시작시간 and 2 other fieldsHigh correlation
강습요일 is highly overall correlated with 수강료High correlation
대상 is highly overall correlated with 분류 and 2 other fieldsHigh correlation
대상 is highly imbalanced (72.3%)Imbalance

Reproduction

Analysis started2023-12-12 06:16:30.328603
Analysis finished2023-12-12 06:16:30.951065
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분류
Categorical

HIGH CORRELATION 

Distinct23
Distinct (%)34.8%
Missing0
Missing (%)0.0%
Memory size660.0 B
수영_연수
14 
수영_교정
수영_교정1
수영_교정2
수영_배영
 
3
Other values (18)
31 

Length

Max length11
Median length10
Mean length6.2121212
Min length2

Unique

Unique7 ?
Unique (%)10.6%

Sample

1st row수영_배영
2nd row수영_배영
3rd row수영_배영
4th row수영_평영
5th row수영_평영

Common Values

ValueCountFrequency (%)
수영_연수 14
21.2%
수영_교정 8
12.1%
수영_교정1 5
 
7.6%
수영_교정2 5
 
7.6%
수영_배영 3
 
4.5%
에어로빅(방송댄스) 3
 
4.5%
아쿠아로빅_화목토 3
 
4.5%
요가 2
 
3.0%
댄스스포츠_중급,고급 2
 
3.0%
수영_연수1 2
 
3.0%
Other values (13) 19
28.8%

Length

2023-12-12T15:16:31.035590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수영_연수 14
21.2%
수영_교정 8
12.1%
수영_교정1 5
 
7.6%
수영_교정2 5
 
7.6%
수영_배영 3
 
4.5%
에어로빅(방송댄스 3
 
4.5%
아쿠아로빅_화목토 3
 
4.5%
수영_연수2 2
 
3.0%
수영_자유형 2
 
3.0%
어린이수영_평영 2
 
3.0%
Other values (13) 19
28.8%
Distinct64
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size660.0 B
2023-12-12T15:16:31.252117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length15.393939
Min length12

Characters and Unicode

Total characters1016
Distinct characters61
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique62 ?
Unique (%)93.9%

Sample

1st row수영_화목토_배영반(09시)
2nd row수영_월수금_배영반(11시)
3rd row수영_화목토_배영반(20시)
4th row수영_화목토_평영반(07시)
5th row수영_월수금_평영반(20시)
ValueCountFrequency (%)
아쿠아로빅_월수금(13시 2
 
3.0%
수영_월수금_교정반(20시 2
 
3.0%
아쿠아로빅_화목토(12시 1
 
1.5%
아쿠아로빅_화목토(13시 1
 
1.5%
수영_화목_연수반(20시 1
 
1.5%
수영_월수금_연수1반(10시 1
 
1.5%
수영_월수금_연수1반(11시 1
 
1.5%
수영_월수금_연수2반(10시 1
 
1.5%
수영_월수금_연수2반(11시 1
 
1.5%
수영_화목토_자유형(11시 1
 
1.5%
Other values (54) 54
81.8%
2023-12-12T15:16:31.594045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 113
 
11.1%
101
 
9.9%
( 66
 
6.5%
66
 
6.5%
) 66
 
6.5%
1 55
 
5.4%
54
 
5.3%
0 48
 
4.7%
45
 
4.4%
34
 
3.3%
Other values (51) 368
36.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 597
58.8%
Decimal Number 172
 
16.9%
Connector Punctuation 113
 
11.1%
Open Punctuation 66
 
6.5%
Close Punctuation 66
 
6.5%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
101
16.9%
66
11.1%
54
 
9.0%
45
 
7.5%
34
 
5.7%
34
 
5.7%
31
 
5.2%
31
 
5.2%
22
 
3.7%
22
 
3.7%
Other values (37) 157
26.3%
Decimal Number
ValueCountFrequency (%)
1 55
32.0%
0 48
27.9%
2 25
14.5%
9 12
 
7.0%
6 10
 
5.8%
7 7
 
4.1%
3 7
 
4.1%
5 3
 
1.7%
8 3
 
1.7%
4 2
 
1.2%
Connector Punctuation
ValueCountFrequency (%)
_ 113
100.0%
Open Punctuation
ValueCountFrequency (%)
( 66
100.0%
Close Punctuation
ValueCountFrequency (%)
) 66
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 597
58.8%
Common 419
41.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
101
16.9%
66
11.1%
54
 
9.0%
45
 
7.5%
34
 
5.7%
34
 
5.7%
31
 
5.2%
31
 
5.2%
22
 
3.7%
22
 
3.7%
Other values (37) 157
26.3%
Common
ValueCountFrequency (%)
_ 113
27.0%
( 66
15.8%
) 66
15.8%
1 55
13.1%
0 48
11.5%
2 25
 
6.0%
9 12
 
2.9%
6 10
 
2.4%
7 7
 
1.7%
3 7
 
1.7%
Other values (4) 10
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 597
58.8%
ASCII 419
41.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 113
27.0%
( 66
15.8%
) 66
15.8%
1 55
13.1%
0 48
11.5%
2 25
 
6.0%
9 12
 
2.9%
6 10
 
2.4%
7 7
 
1.7%
3 7
 
1.7%
Other values (4) 10
 
2.4%
Hangul
ValueCountFrequency (%)
101
16.9%
66
11.1%
54
 
9.0%
45
 
7.5%
34
 
5.7%
34
 
5.7%
31
 
5.2%
31
 
5.2%
22
 
3.7%
22
 
3.7%
Other values (37) 157
26.3%

강습요일
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size660.0 B
월,수,금
31 
화,목
12 
화,목,토
11 
화,목,토(자유)
 
3

Length

Max length9
Median length5
Mean length4.8181818
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row화,목,토(자유)
2nd row월,수,금
3rd row화,목
4th row화,목,토
5th row월,수,금

Common Values

ValueCountFrequency (%)
월,수,금 31
47.0%
화,목 12
 
18.2%
화,목,토 11
 
16.7%
화,목,토(자유) 7
 
10.6%
3
 
4.5%
월~금 2
 
3.0%

Length

2023-12-12T15:16:31.740552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:16:31.889208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
월,수,금 31
47.0%
화,목 12
 
18.2%
화,목,토 11
 
16.7%
화,목,토(자유 7
 
10.6%
3
 
4.5%
월~금 2
 
3.0%

시작시간
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)31.8%
Missing0
Missing (%)0.0%
Memory size660.0 B
20:00
6:00
19:00
11:00
7:00
Other values (16)
36 

Length

Max length5
Median length5
Mean length4.6969697
Min length4

Unique

Unique8 ?
Unique (%)12.1%

Sample

1st row9:00
2nd row11:00
3rd row20:00
4th row7:00
5th row20:00

Common Values

ValueCountFrequency (%)
20:00 6
9.1%
6:00 6
9.1%
19:00 6
9.1%
11:00 6
9.1%
7:00 6
9.1%
9:00 5
 
7.6%
12:00 5
 
7.6%
10:00 4
 
6.1%
10:20 4
 
6.1%
16:00 4
 
6.1%
Other values (11) 14
21.2%

Length

2023-12-12T15:16:32.033976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
20:00 6
9.1%
19:00 6
9.1%
11:00 6
9.1%
7:00 6
9.1%
6:00 6
9.1%
9:00 5
 
7.6%
12:00 5
 
7.6%
10:00 4
 
6.1%
10:20 4
 
6.1%
16:00 4
 
6.1%
Other values (11) 14
21.2%

종료시간
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)31.8%
Missing0
Missing (%)0.0%
Memory size660.0 B
20:50
6:50
19:50
11:50
7:50
Other values (16)
36 

Length

Max length5
Median length5
Mean length4.6969697
Min length4

Unique

Unique8 ?
Unique (%)12.1%

Sample

1st row9:50
2nd row11:50
3rd row20:50
4th row7:50
5th row20:50

Common Values

ValueCountFrequency (%)
20:50 6
9.1%
6:50 6
9.1%
19:50 6
9.1%
11:50 6
9.1%
7:50 6
9.1%
9:50 5
 
7.6%
12:50 5
 
7.6%
11:10 4
 
6.1%
10:50 4
 
6.1%
16:50 4
 
6.1%
Other values (11) 14
21.2%

Length

2023-12-12T15:16:32.152333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
20:50 6
9.1%
19:50 6
9.1%
11:50 6
9.1%
7:50 6
9.1%
6:50 6
9.1%
9:50 5
 
7.6%
12:50 5
 
7.6%
11:10 4
 
6.1%
10:50 4
 
6.1%
16:50 4
 
6.1%
Other values (11) 14
21.2%

대상
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size660.0 B
성인
60 
어린이
 
4
초등중등(2개월)
 
1
초등생(2개월)
 
1

Length

Max length9
Median length2
Mean length2.2575758
Min length2

Unique

Unique2 ?
Unique (%)3.0%

Sample

1st row성인
2nd row성인
3rd row성인
4th row성인
5th row성인

Common Values

ValueCountFrequency (%)
성인 60
90.9%
어린이 4
 
6.1%
초등중등(2개월) 1
 
1.5%
초등생(2개월) 1
 
1.5%

Length

2023-12-12T15:16:32.291271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:16:32.405234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
성인 60
90.9%
어린이 4
 
6.1%
초등중등(2개월 1
 
1.5%
초등생(2개월 1
 
1.5%

수강료
Real number (ℝ)

HIGH CORRELATION 

Distinct14
Distinct (%)21.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47354.545
Minimum10000
Maximum77000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size726.0 B
2023-12-12T15:16:32.515626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10000
5-th percentile31000
Q139375
median52800
Q352800
95-th percentile57450
Maximum77000
Range67000
Interquartile range (IQR)13425

Descriptive statistics

Standard deviation9878.3085
Coefficient of variation (CV)0.20860317
Kurtosis2.9835939
Mean47354.545
Median Absolute Deviation (MAD)1650
Skewness-0.89614987
Sum3125400
Variance97580979
MonotonicityNot monotonic
2023-12-12T15:16:32.649666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
52800 33
50.0%
46000 7
 
10.6%
38500 6
 
9.1%
36000 4
 
6.1%
31000 3
 
4.5%
49500 2
 
3.0%
36300 2
 
3.0%
42000 2
 
3.0%
59000 2
 
3.0%
48400 1
 
1.5%
Other values (4) 4
 
6.1%
ValueCountFrequency (%)
10000 1
 
1.5%
24000 1
 
1.5%
31000 3
4.5%
36000 4
6.1%
36300 2
 
3.0%
38500 6
9.1%
42000 2
 
3.0%
46000 7
10.6%
48400 1
 
1.5%
49500 2
 
3.0%
ValueCountFrequency (%)
77000 1
 
1.5%
60000 1
 
1.5%
59000 2
 
3.0%
52800 33
50.0%
49500 2
 
3.0%
48400 1
 
1.5%
46000 7
 
10.6%
42000 2
 
3.0%
38500 6
 
9.1%
36300 2
 
3.0%

Interactions

2023-12-12T15:16:30.657218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:16:32.736985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류강좌명 / 반명강습요일시작시간종료시간대상수강료
분류1.0001.0000.6640.9230.9161.0000.682
강좌명 / 반명1.0001.0000.9790.9710.8891.0000.985
강습요일0.6640.9791.0000.7340.7830.5860.836
시작시간0.9230.9710.7341.0001.0000.9590.741
종료시간0.9160.8890.7831.0001.0000.9590.820
대상1.0001.0000.5860.9590.9591.0000.584
수강료0.6820.9850.8360.7410.8200.5841.000
2023-12-12T15:16:32.855770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상강습요일분류시작시간종료시간
대상1.0000.4100.8330.7400.740
강습요일0.4101.0000.3090.3750.424
분류0.8330.3091.0000.5360.519
시작시간0.7400.3750.5361.0000.973
종료시간0.7400.4240.5190.9731.000
2023-12-12T15:16:32.944700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수강료분류강습요일시작시간종료시간대상
수강료1.0000.4260.6880.4740.3840.414
분류0.4261.0000.3090.5360.5190.833
강습요일0.6880.3091.0000.3750.4240.410
시작시간0.4740.5360.3751.0000.9730.740
종료시간0.3840.5190.4240.9731.0000.740
대상0.4140.8330.4100.7400.7401.000

Missing values

2023-12-12T15:16:30.767032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:16:30.876118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분류강좌명 / 반명강습요일시작시간종료시간대상수강료
0수영_배영수영_화목토_배영반(09시)화,목,토(자유)9:009:50성인46000
1수영_배영수영_월수금_배영반(11시)월,수,금11:0011:50성인52800
2수영_배영수영_화목토_배영반(20시)화,목20:0020:50성인38500
3수영_평영수영_화목토_평영반(07시)화,목,토7:007:50성인52800
4수영_평영수영_월수금_평영반(20시)월,수,금20:0020:50성인52800
5수영_교정수영_화목토_교정반(07시)화,목,토7:007:50성인52800
6수영_교정수영_월수금_교정반(09시)월,수,금9:009:50성인52800
7수영_교정수영_화목토_교정반(09시)화,목,토(자유)9:009:50성인46000
8수영_교정수영_화목토_교정반(10시)화,목,토(자유)10:0010:50성인46000
9수영_교정수영_화목토_교정반(11시)화,목,토(자유)11:0011:50성인46000
분류강좌명 / 반명강습요일시작시간종료시간대상수강료
56댄스스포츠_중급,고급댄스스포츠_화목토(11시20분)화,목,토11:2012:10성인36000
57댄스스포츠_초급댄스스포츠_화목토(10시20분)화,목,토10:2011:10성인36000
58실버댄스실버댄스_토(14시30분)14:3015:30성인10000
59힐링몸짱교실힐링몸짱교실_월수금(07시50분)월,수,금7:508:40성인42000
60바디웨이트바디웨이트_월수금(12시)월,수,금12:0012:50성인42000
61줌바댄스줌바댄스_화목(19시30분)화,목19:3020:20성인31000
62요가요가_화목(12시20분)화,목12:2013:10성인31000
63요가요가_화목(18시30분)화,목18:3019:20성인31000
64농구농구_토(15시30분)15:3017:00초등중등(2개월)59000
65체능교실(축구+농구)체능교실_토(14시00분)14:0015:30초등생(2개월)59000