Overview

Dataset statistics

Number of variables7
Number of observations133
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.8%
Total size in memory7.5 KiB
Average record size in memory58.0 B

Variable types

Categorical6
Text1

Dataset

Description두류수영장의 강습 및 이용에 관한 정보
Author대구시설공단
URLhttps://www.data.go.kr/data/3055153/fileData.do

Alerts

Dataset has 1 (0.8%) duplicate rowsDuplicates
대상 is highly overall correlated with 시작시간 and 1 other fieldsHigh correlation
시작시간 is highly overall correlated with 종료시간 and 1 other fieldsHigh correlation
수강료 is highly overall correlated with 강습요일High correlation
강습요일 is highly overall correlated with 분류 and 1 other fieldsHigh correlation
종료시간 is highly overall correlated with 시작시간 and 1 other fieldsHigh correlation
분류 is highly overall correlated with 강습요일High correlation
대상 is highly imbalanced (88.7%)Imbalance

Reproduction

Analysis started2023-12-12 05:14:22.918746
Analysis finished2023-12-12 05:14:23.956728
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분류
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
수영_교정반
23 
수영_선수반
17 
수영_연수반
16 
수영_마스터반
14 
수영_안전반
13 
Other values (9)
50 

Length

Max length9
Median length6
Mean length6.1578947
Min length2

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row수영_초급반
2nd row수영_초급반
3rd row수영_초급반
4th row수영_중급반
5th row수영_중급반

Common Values

ValueCountFrequency (%)
수영_교정반 23
17.3%
수영_선수반 17
12.8%
수영_연수반 16
12.0%
수영_마스터반 14
10.5%
수영_안전반 13
9.8%
요가 12
9.0%
아쿠아로빅_월수금 9
 
6.8%
아쿠아로빅_화목토 6
 
4.5%
아쿠아로빅_화목 6
 
4.5%
수영_상급반 5
 
3.8%
Other values (4) 12
9.0%

Length

2023-12-12T14:14:24.074303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수영_교정반 23
17.3%
수영_선수반 17
12.8%
수영_연수반 16
12.0%
수영_마스터반 14
10.5%
수영_안전반 13
9.8%
요가 12
9.0%
아쿠아로빅_월수금 9
 
6.8%
아쿠아로빅_화목토 6
 
4.5%
아쿠아로빅_화목 6
 
4.5%
수영_상급반 5
 
3.8%
Other values (4) 12
9.0%
Distinct132
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T14:14:24.432207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length17
Mean length16.120301
Min length4

Characters and Unicode

Total characters2144
Distinct characters47
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique131 ?
Unique (%)98.5%

Sample

1st row수영_화목토_초급반(06시30분)
2nd row수영_화목토_초급반(10시)
3rd row수영_화목_초급반(18시30분)
4th row수영_월수금_중급반(07시30분)
5th row수영_월수금_중급반(15시)
ValueCountFrequency (%)
수영_월수금_안전a반(19시30분 2
 
1.5%
수영_월수금_선수반(07시30분 1
 
0.8%
수영_화목토_선수반(07시30분 1
 
0.8%
수영_화목_선수반(19시30분 1
 
0.8%
수영_월수금_선수반(19시30분 1
 
0.8%
수영_화목_선수반(18시30분 1
 
0.8%
수영_월수금_선수반(18시30분 1
 
0.8%
수영_월수금_선수반(15시 1
 
0.8%
수영_월수금_선수반(14시 1
 
0.8%
수영_화목토_선수반(11시 1
 
0.8%
Other values (122) 122
91.7%
2023-12-12T14:14:24.998062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 231
 
10.8%
208
 
9.7%
( 132
 
6.2%
) 132
 
6.2%
132
 
6.2%
0 126
 
5.9%
1 100
 
4.7%
99
 
4.6%
99
 
4.6%
76
 
3.5%
Other values (37) 809
37.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1210
56.4%
Decimal Number 398
 
18.6%
Connector Punctuation 231
 
10.8%
Open Punctuation 132
 
6.2%
Close Punctuation 132
 
6.2%
Uppercase Letter 41
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
208
17.2%
132
10.9%
99
 
8.2%
99
 
8.2%
76
 
6.3%
76
 
6.3%
67
 
5.5%
56
 
4.6%
56
 
4.6%
42
 
3.5%
Other values (22) 299
24.7%
Decimal Number
ValueCountFrequency (%)
0 126
31.7%
1 100
25.1%
3 66
16.6%
9 31
 
7.8%
6 18
 
4.5%
7 18
 
4.5%
8 16
 
4.0%
4 12
 
3.0%
5 8
 
2.0%
2 3
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
A 23
56.1%
B 18
43.9%
Connector Punctuation
ValueCountFrequency (%)
_ 231
100.0%
Open Punctuation
ValueCountFrequency (%)
( 132
100.0%
Close Punctuation
ValueCountFrequency (%)
) 132
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1210
56.4%
Common 893
41.7%
Latin 41
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
208
17.2%
132
10.9%
99
 
8.2%
99
 
8.2%
76
 
6.3%
76
 
6.3%
67
 
5.5%
56
 
4.6%
56
 
4.6%
42
 
3.5%
Other values (22) 299
24.7%
Common
ValueCountFrequency (%)
_ 231
25.9%
( 132
14.8%
) 132
14.8%
0 126
14.1%
1 100
11.2%
3 66
 
7.4%
9 31
 
3.5%
6 18
 
2.0%
7 18
 
2.0%
8 16
 
1.8%
Other values (3) 23
 
2.6%
Latin
ValueCountFrequency (%)
A 23
56.1%
B 18
43.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1210
56.4%
ASCII 934
43.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 231
24.7%
( 132
14.1%
) 132
14.1%
0 126
13.5%
1 100
10.7%
3 66
 
7.1%
9 31
 
3.3%
A 23
 
2.5%
6 18
 
1.9%
7 18
 
1.9%
Other values (5) 57
 
6.1%
Hangul
ValueCountFrequency (%)
208
17.2%
132
10.9%
99
 
8.2%
99
 
8.2%
76
 
6.3%
76
 
6.3%
67
 
5.5%
56
 
4.6%
56
 
4.6%
42
 
3.5%
Other values (22) 299
24.7%

강습요일
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
월,수,금
76 
화,목,토
29 
화,목
27 
월,수,금,토
 
1

Length

Max length7
Median length5
Mean length4.6090226
Min length3

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row화,목,토
2nd row화,목,토
3rd row화,목
4th row월,수,금
5th row월,수,금

Common Values

ValueCountFrequency (%)
월,수,금 76
57.1%
화,목,토 29
 
21.8%
화,목 27
 
20.3%
월,수,금,토 1
 
0.8%

Length

2023-12-12T14:14:25.176358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:14:25.328014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
월,수,금 76
57.1%
화,목,토 29
 
21.8%
화,목 27
 
20.3%
월,수,금,토 1
 
0.8%

시작시간
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
19::
19 
06::
16 
18::
16 
07::
16 
10::
15 
Other values (9)
51 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row06::
2nd row10::
3rd row18::
4th row07::
5th row15::

Common Values

ValueCountFrequency (%)
19:: 19
14.3%
06:: 16
12.0%
18:: 16
12.0%
07:: 16
12.0%
10:: 15
11.3%
09:: 12
9.0%
11:: 12
9.0%
14:: 10
7.5%
15:: 8
6.0%
17:: 2
 
1.5%
Other values (4) 7
 
5.3%

Length

2023-12-12T14:14:25.474422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
19 19
14.3%
06 16
12.0%
18 16
12.0%
07 16
12.0%
10 15
11.3%
09 12
9.0%
11 12
9.0%
14 10
7.5%
15 8
6.0%
17 2
 
1.5%
Other values (4) 7
 
5.3%

종료시간
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
20::
19 
07::
18 
19::
17 
10::
14 
08::
14 
Other values (8)
51 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row07::
2nd row10::
3rd row19::
4th row08::
5th row15::

Common Values

ValueCountFrequency (%)
20:: 19
14.3%
07:: 18
13.5%
19:: 17
12.8%
10:: 14
10.5%
08:: 14
10.5%
11:: 12
9.0%
09:: 12
9.0%
14:: 10
7.5%
15:: 8
6.0%
12:: 3
 
2.3%
Other values (3) 6
 
4.5%

Length

2023-12-12T14:14:25.626750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
20 19
14.3%
07 18
13.5%
19 17
12.8%
10 14
10.5%
08 14
10.5%
11 12
9.0%
09 12
9.0%
14 10
7.5%
15 8
6.0%
12 3
 
2.3%
Other values (3) 6
 
4.5%

대상
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
성인
131 
어린이
 
2

Length

Max length3
Median length2
Mean length2.0150376
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성인
2nd row성인
3rd row성인
4th row성인
5th row성인

Common Values

ValueCountFrequency (%)
성인 131
98.5%
어린이 2
 
1.5%

Length

2023-12-12T14:14:25.801354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:14:25.942902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
성인 131
98.5%
어린이 2
 
1.5%

수강료
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
48000
106 
36000
27 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row48000
2nd row48000
3rd row36000
4th row48000
5th row48000

Common Values

ValueCountFrequency (%)
48000 106
79.7%
36000 27
 
20.3%

Length

2023-12-12T14:14:26.077428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:14:26.216010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
48000 106
79.7%
36000 27
 
20.3%

Correlations

2023-12-12T14:14:26.316117image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류강습요일시작시간종료시간대상수강료
분류1.0000.8490.0000.0000.4560.539
강습요일0.8491.0000.4860.5090.0001.000
시작시간0.0000.4861.0000.9941.0000.613
종료시간0.0000.5090.9941.0001.0000.525
대상0.4560.0001.0001.0001.0000.000
수강료0.5391.0000.6130.5250.0001.000
2023-12-12T14:14:26.494998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상시작시간수강료분류강습요일종료시간
대상1.0000.9530.0000.3400.0000.957
시작시간0.9531.0000.4610.0000.2830.956
수강료0.0000.4611.0000.4030.9920.470
분류0.3400.0000.4031.0000.6430.000
강습요일0.0000.2830.9920.6431.0000.309
종료시간0.9570.9560.4700.0000.3091.000
2023-12-12T14:14:26.652017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분류강습요일시작시간종료시간대상수강료
분류1.0000.6430.0000.0000.3400.403
강습요일0.6431.0000.2830.3090.0000.992
시작시간0.0000.2831.0000.9560.9530.461
종료시간0.0000.3090.9561.0000.9570.470
대상0.3400.0000.9530.9571.0000.000
수강료0.4030.9920.4610.4700.0001.000

Missing values

2023-12-12T14:14:23.417609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:14:23.900101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분류강좌명 / 반명강습요일시작시간종료시간대상수강료
0수영_초급반수영_화목토_초급반(06시30분)화,목,토06::07::성인48000
1수영_초급반수영_화목토_초급반(10시)화,목,토10::10::성인48000
2수영_초급반수영_화목_초급반(18시30분)화,목18::19::성인36000
3수영_중급반수영_월수금_중급반(07시30분)월,수,금07::08::성인48000
4수영_중급반수영_월수금_중급반(15시)월,수,금15::15::성인48000
5수영_중급반수영_월수금_중급반(17시)월,수,금17::17::어린이48000
6수영_중급반수영_화목_중급반(19시30분)화,목19::20::성인36000
7수영_고급반수영_화목토_고급반(06시30분)화,목,토06::07::성인48000
8수영_고급반수영_월수금_고급반(10시)월,수,금10::11::성인48000
9수영_고급반수영_화목_고급반(14시)화,목14::14::성인36000
분류강좌명 / 반명강습요일시작시간종료시간대상수강료
123요가요가_월수금(09시)월,수,금09::09::성인48000
124요가요가_화목토(09시)화,목,토09::09::성인48000
125요가요가_화목(10시)화,목10::10::성인36000
126요가요가_화목(11시10분)화,목11::12::성인36000
127요가요가_월수금(11시)월,수,금11::11::성인48000
128요가요가_화목(18시40분)화,목18::19::성인36000
129요가요가_월수금(18시30분)월,수,금18::19::성인48000
130요가요가_화목(19시40분)화,목19::20::성인36000
131요가요가_월수금(19시30분)월,수,금19::20::성인48000
132에어로빅에어로빅월,수,금,토10::10::성인48000

Duplicate rows

Most frequently occurring

분류강좌명 / 반명강습요일시작시간종료시간대상수강료# duplicates
0수영_안전반수영_월수금_안전A반(19시30분)월,수,금19::20::성인480002