Overview

Dataset statistics

Number of variables10
Number of observations330
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.2 KiB
Average record size in memory81.4 B

Variable types

Categorical8
Text1
Numeric1

Dataset

Description의왕도시공사가 보유한 체육시설 강좌정보에 대한 데이터로서 운영시간, 이용인원, 수강대상, 전화번호 등의 정보를 제공합니다.
Author의왕도시공사
URLhttps://www.data.go.kr/data/15096659/fileData.do

Alerts

운영기관명 has constant value ""Constant
시설전화번호 is highly overall correlated with 시설명High correlation
시설명 is highly overall correlated with 시설전화번호High correlation
정원 is highly overall correlated with 프로그램 구분 and 1 other fieldsHigh correlation
프로그램 구분 is highly overall correlated with 정원 and 2 other fieldsHigh correlation
강좌시작시간 is highly overall correlated with 프로그램 구분 and 1 other fieldsHigh correlation
강좌종료시간 is highly overall correlated with 정원 and 1 other fieldsHigh correlation
강좌대상 is highly overall correlated with 프로그램 구분High correlation
프로그램 구분 is highly imbalanced (55.9%)Imbalance
운영요일 is highly imbalanced (52.8%)Imbalance

Reproduction

Analysis started2024-04-06 08:12:20.534988
Analysis finished2024-04-06 08:12:22.833285
Duration2.3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시설명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
국민체육센터
106 
부곡스포츠센터
65 
포일스포츠센터
65 
백운커뮤니티센터
54 
평생학습관
38 

Length

Max length8
Median length7
Mean length6.6060606
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row평생학습관
2nd row평생학습관
3rd row평생학습관
4th row평생학습관
5th row평생학습관

Common Values

ValueCountFrequency (%)
국민체육센터 106
32.1%
부곡스포츠센터 65
19.7%
포일스포츠센터 65
19.7%
백운커뮤니티센터 54
16.4%
평생학습관 38
 
11.5%
고천체육공원 2
 
0.6%

Length

2024-04-06T17:12:22.970701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:12:23.189301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국민체육센터 106
32.1%
부곡스포츠센터 65
19.7%
포일스포츠센터 65
19.7%
백운커뮤니티센터 54
16.4%
평생학습관 38
 
11.5%
고천체육공원 2
 
0.6%

프로그램 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct17
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
수영
230 
탁구
31 
배드민턴
29 
볼링
 
6
웰빙댄스
 
4
Other values (12)
30 

Length

Max length7
Median length2
Mean length2.330303
Min length2

Unique

Unique2 ?
Unique (%)0.6%

Sample

1st row수영
2nd row수영
3rd row수영
4th row수영
5th row수영

Common Values

ValueCountFrequency (%)
수영 230
69.7%
탁구 31
 
9.4%
배드민턴 29
 
8.8%
볼링 6
 
1.8%
웰빙댄스 4
 
1.2%
줌바댄스 4
 
1.2%
요가 4
 
1.2%
당구 4
 
1.2%
발레 3
 
0.9%
에어로빅 3
 
0.9%
Other values (7) 12
 
3.6%

Length

2024-04-06T17:12:23.458873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수영 230
69.7%
탁구 31
 
9.4%
배드민턴 29
 
8.8%
볼링 6
 
1.8%
웰빙댄스 4
 
1.2%
줌바댄스 4
 
1.2%
요가 4
 
1.2%
당구 4
 
1.2%
에어로빅 3
 
0.9%
발레 3
 
0.9%
Other values (7) 12
 
3.6%
Distinct287
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2024-04-06T17:12:23.860827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length21
Mean length13.733333
Min length6

Characters and Unicode

Total characters4532
Distinct characters109
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique249 ?
Unique (%)75.5%

Sample

1st row수상인명구조반
2nd row06시 월수금 중급1
3rd row07시 월수금 중급1
4th row06시 화목 중급2
5th row07시 화목 중급2
ValueCountFrequency (%)
월수금 160
 
15.1%
화목 145
 
13.7%
연수 51
 
4.8%
10시 44
 
4.1%
성인,청소년 42
 
4.0%
16시 37
 
3.5%
17시 35
 
3.3%
07시 34
 
3.2%
교정 31
 
2.9%
09시 31
 
2.9%
Other values (83) 451
42.5%
2024-04-06T17:12:24.565531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
731
 
16.1%
323
 
7.1%
1 274
 
6.0%
231
 
5.1%
172
 
3.8%
165
 
3.6%
0 162
 
3.6%
149
 
3.3%
149
 
3.3%
) 131
 
2.9%
Other values (99) 2045
45.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2691
59.4%
Space Separator 731
 
16.1%
Decimal Number 725
 
16.0%
Close Punctuation 145
 
3.2%
Open Punctuation 145
 
3.2%
Other Punctuation 83
 
1.8%
Uppercase Letter 7
 
0.2%
Math Symbol 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
323
 
12.0%
231
 
8.6%
172
 
6.4%
165
 
6.1%
149
 
5.5%
149
 
5.5%
108
 
4.0%
94
 
3.5%
93
 
3.5%
92
 
3.4%
Other values (78) 1115
41.4%
Decimal Number
ValueCountFrequency (%)
1 274
37.8%
0 162
22.3%
7 69
 
9.5%
6 69
 
9.5%
9 62
 
8.6%
2 61
 
8.4%
8 14
 
1.9%
5 6
 
0.8%
4 4
 
0.6%
3 4
 
0.6%
Other Punctuation
ValueCountFrequency (%)
, 50
60.2%
/ 32
38.6%
1
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 131
90.3%
] 14
 
9.7%
Open Punctuation
ValueCountFrequency (%)
( 131
90.3%
[ 14
 
9.7%
Uppercase Letter
ValueCountFrequency (%)
A 4
57.1%
B 3
42.9%
Space Separator
ValueCountFrequency (%)
731
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2691
59.4%
Common 1834
40.5%
Latin 7
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
323
 
12.0%
231
 
8.6%
172
 
6.4%
165
 
6.1%
149
 
5.5%
149
 
5.5%
108
 
4.0%
94
 
3.5%
93
 
3.5%
92
 
3.4%
Other values (78) 1115
41.4%
Common
ValueCountFrequency (%)
731
39.9%
1 274
 
14.9%
0 162
 
8.8%
) 131
 
7.1%
( 131
 
7.1%
7 69
 
3.8%
6 69
 
3.8%
9 62
 
3.4%
2 61
 
3.3%
, 50
 
2.7%
Other values (9) 94
 
5.1%
Latin
ValueCountFrequency (%)
A 4
57.1%
B 3
42.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2691
59.4%
ASCII 1840
40.6%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
731
39.7%
1 274
 
14.9%
0 162
 
8.8%
) 131
 
7.1%
( 131
 
7.1%
7 69
 
3.8%
6 69
 
3.8%
9 62
 
3.4%
2 61
 
3.3%
, 50
 
2.7%
Other values (10) 100
 
5.4%
Hangul
ValueCountFrequency (%)
323
 
12.0%
231
 
8.6%
172
 
6.4%
165
 
6.1%
149
 
5.5%
149
 
5.5%
108
 
4.0%
94
 
3.5%
93
 
3.5%
92
 
3.4%
Other values (78) 1115
41.4%
Punctuation
ValueCountFrequency (%)
1
100.0%

운영요일
Categorical

IMBALANCE 

Distinct6
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
월,수,금
168 
화,목
152 
월,화,수,목,금
 
4
월,수
 
3
일,토
 
2

Length

Max length9
Median length5
Mean length4.0939394
Min length3

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row일,토
2nd row월,수,금
3rd row월,수,금
4th row화,목
5th row화,목

Common Values

ValueCountFrequency (%)
월,수,금 168
50.9%
화,목 152
46.1%
월,화,수,목,금 4
 
1.2%
월,수 3
 
0.9%
일,토 2
 
0.6%
제한없음 1
 
0.3%

Length

2024-04-06T17:12:24.962152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:12:25.199761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
월,수,금 168
50.9%
화,목 152
46.1%
월,화,수,목,금 4
 
1.2%
월,수 3
 
0.9%
일,토 2
 
0.6%
제한없음 1
 
0.3%

강좌시작시간
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
10:00
41 
16:00
37 
07:00
34 
17:00
33 
06:00
30 
Other values (24)
155 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique6 ?
Unique (%)1.8%

Sample

1st row09:00
2nd row06:00
3rd row07:00
4th row06:00
5th row07:00

Common Values

ValueCountFrequency (%)
10:00 41
12.4%
16:00 37
11.2%
07:00 34
10.3%
17:00 33
10.0%
06:00 30
9.1%
09:00 29
8.8%
19:00 29
8.8%
11:00 29
8.8%
20:00 17
 
5.2%
18:00 7
 
2.1%
Other values (19) 44
13.3%

Length

2024-04-06T17:12:25.420657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
10:00 41
12.4%
16:00 37
11.2%
07:00 34
10.3%
17:00 33
10.0%
06:00 30
9.1%
09:00 29
8.8%
19:00 29
8.8%
11:00 29
8.8%
20:00 17
 
5.2%
18:00 7
 
2.1%
Other values (19) 44
13.3%

강좌종료시간
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
10:50
41 
17:50
38 
16:50
36 
07:50
34 
09:50
33 
Other values (14)
148 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row17:50
2nd row06:50
3rd row07:50
4th row06:50
5th row07:50

Common Values

ValueCountFrequency (%)
10:50 41
12.4%
17:50 38
11.5%
16:50 36
10.9%
07:50 34
10.3%
09:50 33
10.0%
11:50 32
9.7%
06:50 30
9.1%
20:50 23
7.0%
19:50 22
6.7%
18:50 9
 
2.7%
Other values (9) 32
9.7%

Length

2024-04-06T17:12:25.634189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
10:50 41
12.4%
17:50 38
11.5%
16:50 36
10.9%
07:50 34
10.3%
09:50 33
10.0%
11:50 32
9.7%
06:50 30
9.1%
20:50 23
7.0%
19:50 22
6.7%
18:50 9
 
2.7%
Other values (9) 32
9.7%

정원
Real number (ℝ)

HIGH CORRELATION 

Distinct11
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.954545
Minimum7
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.0 KiB
2024-04-06T17:12:25.816218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile7
Q120
median20
Q325
95-th percentile35
Maximum100
Range93
Interquartile range (IQR)5

Descriptive statistics

Standard deviation9.5022503
Coefficient of variation (CV)0.43281471
Kurtosis12.951152
Mean21.954545
Median Absolute Deviation (MAD)0
Skewness1.9197754
Sum7245
Variance90.29276
MonotonicityNot monotonic
2024-04-06T17:12:26.040261image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
20 185
56.1%
35 66
 
20.0%
9 24
 
7.3%
7 19
 
5.8%
25 11
 
3.3%
15 10
 
3.0%
30 7
 
2.1%
8 4
 
1.2%
50 2
 
0.6%
19 1
 
0.3%
ValueCountFrequency (%)
7 19
 
5.8%
8 4
 
1.2%
9 24
 
7.3%
15 10
 
3.0%
19 1
 
0.3%
20 185
56.1%
25 11
 
3.3%
30 7
 
2.1%
35 66
 
20.0%
50 2
 
0.6%
ValueCountFrequency (%)
100 1
 
0.3%
50 2
 
0.6%
35 66
 
20.0%
30 7
 
2.1%
25 11
 
3.3%
20 185
56.1%
19 1
 
0.3%
15 10
 
3.0%
9 24
 
7.3%
8 4
 
1.2%

강좌대상
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
성인/청소년
157 
어린이
71 
성인
66 
제한없음
33 
어린이/청소년
 
2

Length

Max length7
Median length6
Mean length4.3545455
Min length2

Unique

Unique1 ?
Unique (%)0.3%

Sample

1st row제한없음
2nd row성인/청소년
3rd row성인/청소년
4th row성인/청소년
5th row성인/청소년

Common Values

ValueCountFrequency (%)
성인/청소년 157
47.6%
어린이 71
21.5%
성인 66
20.0%
제한없음 33
 
10.0%
어린이/청소년 2
 
0.6%
성인남녀 1
 
0.3%

Length

2024-04-06T17:12:26.379486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:12:26.666358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
성인/청소년 157
47.6%
어린이 71
21.5%
성인 66
20.0%
제한없음 33
 
10.0%
어린이/청소년 2
 
0.6%
성인남녀 1
 
0.3%

운영기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
의왕도시공사
330 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의왕도시공사
2nd row의왕도시공사
3rd row의왕도시공사
4th row의왕도시공사
5th row의왕도시공사

Common Values

ValueCountFrequency (%)
의왕도시공사 330
100.0%

Length

2024-04-06T17:12:26.946084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:12:27.724924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
의왕도시공사 330
100.0%

시설전화번호
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
031-8086-7430
106 
031-8086-7390~1
65 
031-8045-8060
59 
031-8045-8000~1
54 
031-8086-7412
38 
Other values (2)
 
8

Length

Max length15
Median length13
Mean length13.715152
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row031-8086-7412
2nd row031-8086-7412
3rd row031-8086-7412
4th row031-8086-7412
5th row031-8086-7412

Common Values

ValueCountFrequency (%)
031-8086-7430 106
32.1%
031-8086-7390~1 65
19.7%
031-8045-8060 59
17.9%
031-8045-8000~1 54
16.4%
031-8086-7412 38
 
11.5%
031-8045-8070 6
 
1.8%
031-477-4636 2
 
0.6%

Length

2024-04-06T17:12:27.889128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-06T17:12:28.120070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
031-8086-7430 106
32.1%
031-8086-7390~1 65
19.7%
031-8045-8060 59
17.9%
031-8045-8000~1 54
16.4%
031-8086-7412 38
 
11.5%
031-8045-8070 6
 
1.8%
031-477-4636 2
 
0.6%

Interactions

2024-04-06T17:12:22.121725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-06T17:12:28.310731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설명프로그램 구분운영요일강좌시작시간강좌종료시간정원강좌대상시설전화번호
시설명1.0000.5970.4390.3320.4500.6540.6001.000
프로그램 구분0.5971.0000.7230.9170.7320.8090.7800.778
운영요일0.4390.7231.0000.0000.7230.8370.5430.269
강좌시작시간0.3320.9170.0001.0000.9900.4270.7750.292
강좌종료시간0.4500.7320.7230.9901.0000.7980.7440.431
정원0.6540.8090.8370.4270.7981.0000.7320.487
강좌대상0.6000.7800.5430.7750.7440.7321.0000.488
시설전화번호1.0000.7780.2690.2920.4310.4870.4881.000
2024-04-06T17:12:28.548374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
프로그램 구분강좌시작시간강좌대상운영요일시설전화번호시설명강좌종료시간
프로그램 구분1.0000.5420.5030.4380.4880.3260.320
강좌시작시간0.5421.0000.4600.0000.1200.1470.861
강좌대상0.5030.4601.0000.2220.3160.2530.453
운영요일0.4380.0000.2221.0000.1630.1710.431
시설전화번호0.4880.1200.3160.1631.0000.9980.203
시설명0.3260.1470.2530.1710.9981.0000.222
강좌종료시간0.3200.8610.4530.4310.2030.2221.000
2024-04-06T17:12:28.779157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정원시설명프로그램 구분운영요일강좌시작시간강좌종료시간강좌대상시설전화번호
정원1.0000.2860.5420.4540.1960.5180.3440.315
시설명0.2861.0000.3260.1710.1470.2220.2530.998
프로그램 구분0.5420.3261.0000.4380.5420.3200.5030.488
운영요일0.4540.1710.4381.0000.0000.4310.2220.163
강좌시작시간0.1960.1470.5420.0001.0000.8610.4600.120
강좌종료시간0.5180.2220.3200.4310.8611.0000.4530.203
강좌대상0.3440.2530.5030.2220.4600.4531.0000.316
시설전화번호0.3150.9980.4880.1630.1200.2030.3161.000

Missing values

2024-04-06T17:12:22.399894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T17:12:22.719785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설명프로그램 구분강좌명운영요일강좌시작시간강좌종료시간정원강좌대상운영기관명시설전화번호
0평생학습관수영수상인명구조반일,토09:0017:5019제한없음의왕도시공사031-8086-7412
1평생학습관수영06시 월수금 중급1월,수,금06:0006:5020성인/청소년의왕도시공사031-8086-7412
2평생학습관수영07시 월수금 중급1월,수,금07:0007:5020성인/청소년의왕도시공사031-8086-7412
3평생학습관수영06시 화목 중급2화,목06:0006:5020성인/청소년의왕도시공사031-8086-7412
4평생학습관수영07시 화목 중급2화,목07:0007:5020성인/청소년의왕도시공사031-8086-7412
5평생학습관수영06시 월수금 연수월,수,금06:0006:5035성인/청소년의왕도시공사031-8086-7412
6평생학습관수영07시 월수금 연수월,수,금07:0007:5035성인/청소년의왕도시공사031-8086-7412
7평생학습관수영06시 화목 연수화,목06:0006:5035성인/청소년의왕도시공사031-8086-7412
8평생학습관수영07시 화목 교정화,목07:0007:5020성인/청소년의왕도시공사031-8086-7412
9평생학습관수영09시 화목 초급화,목09:0009:5020성인/청소년의왕도시공사031-8086-7412
시설명프로그램 구분강좌명운영요일강좌시작시간강좌종료시간정원강좌대상운영기관명시설전화번호
320국민체육센터배드민턴18시 화목 소그룹 배드민턴화,목18:0018:507성인의왕도시공사031-8086-7430
321국민체육센터배드민턴19시 화목 배드민턴화,목19:0020:5025성인의왕도시공사031-8086-7430
322국민체육센터배드민턴06시 화목 소그룹 배드민턴화,목06:0006:507성인의왕도시공사031-8086-7430
323국민체육센터배드민턴07시 월수금 소그룹 배드민턴월,수,금07:0007:507성인의왕도시공사031-8086-7430
324국민체육센터배드민턴07시 화목 소그룹 배드민턴화,목07:0007:507성인의왕도시공사031-8086-7430
325국민체육센터배드민턴08시 월수금 소그룹 배드민턴월,수,금08:0008:507성인의왕도시공사031-8086-7430
326국민체육센터배드민턴08시 화목 소그룹 배드민턴화,목08:0008:507성인의왕도시공사031-8086-7430
327국민체육센터배드민턴09시 월수금 소그룹 배드민턴월,수,금09:0009:507성인의왕도시공사031-8086-7430
328국민체육센터배드민턴09시 화목 소그룹 배드민턴화,목09:0009:507성인의왕도시공사031-8086-7430
329국민체육센터배드민턴18시 월수금 소그룹 배드민턴월,수,금18:0018:507성인의왕도시공사031-8086-7430