Overview

Dataset statistics

Number of variables: 8
Number of observations: 413
Missing cells: 242
Missing cells (%): 7.3%
Duplicate rows: 2
Duplicate rows (%): 0.5%
Total size in memory: 26.3 KiB
Average record size in memory: 65.3 B
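
The percentages in this block follow from the raw counts. The sketch below recomputes them, assuming that "Missing cells (%)" is taken over all cells (variables × observations) and that the average record size is the total memory divided by the number of observations; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block.
n_variables = 8
n_observations = 413
missing_cells = 242
duplicate_rows = 2
total_size_kib = 26.3

# Assumption: the missing-cell percentage is relative to every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: the duplicate percentage is relative to the number of rows.
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 KiB = 1024 bytes, spread evenly over the rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"duplicate rows: {duplicate_pct:.1f}%")
print(f"average record size: about {avg_record_bytes:.1f} B")
```

The recomputed record size lands within about 0.1 B of the reported 65.3 B; the gap is presumably because the total size is itself rounded to 26.3 KiB.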

Variable types

Categorical: 7
Numeric: 1

Dataset

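Description: Sports program information for 성남종합운동장 (Seongnam Sports Complex): 장소 (venue), 강좌명 (course name), 수강시간 (class time), 강습일 (class days), 대상 (target audience), 정원 (capacity), 이용금액 (fee), and so on. Note: because of COVID-19, sports program courses are currently being replaced by daily (on-site) ticketing or online classes. For details, see the website (https://spo.isdc.co.kr/).
URL: https://www.data.go.kr/data/15031495/fileData.do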

Alerts

Dataset has 2 (0.5%) duplicate rows [Duplicates]
수강시간 is highly overall correlated with 시설명 and 1 other field [High correlation]
시설명 is highly overall correlated with 장소 and 4 other fields [High correlation]
강좌명 is highly overall correlated with 정원 and 5 other fields [High correlation]
장소 is highly overall correlated with 시설명 and 3 other fields [High correlation]
정원 is highly overall correlated with 강좌명 and 1 other field [High correlation]
강습일 is highly overall correlated with 시설명 and 3 other fields [High correlation]
대상 is highly overall correlated with 시설명 and 3 other fields [High correlation]
이용금액 is highly overall correlated with 정원 and 4 other fields [High correlation]
정원 has 242 (58.6%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 08:06:16.521906
Analysis finished: 2023-12-12 08:06:17.594410
Duration: 1.07 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
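
The reported duration can be checked against the two timestamps. This is a minimal sketch using only the standard library; the format string is an assumption about how the timestamps are written in this report.

```python
from datetime import datetime

# Assumed timestamp layout: date, time, and microseconds, as printed in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 08:06:16.521906", FMT)
finished = datetime.strptime("2023-12-12 08:06:17.594410", FMT)

# Elapsed wall-clock time between the start and the end of the analysis.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```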

Variables

시설명
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB
성남종합운동장: 335
성남시 평생학습관 스포츠센터: 78

Length

Max length: 15
Median length: 7
Mean length: 8.5108959
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 성남종합운동장
2nd row: 성남종합운동장
3rd row: 성남종합운동장
4th row: 성남종합운동장
5th row: 성남종합운동장

Common Values

Value | Count | Frequency (%)
성남종합운동장 | 335 | 81.1%
성남시 평생학습관 스포츠센터 | 78 | 18.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value | Count | Frequency (%)
성남종합운동장 | 335 | 58.9%
성남시 | 78 | 13.7%
평생학습관 | 78 | 13.7%
스포츠센터 | 78 | 13.7%
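
The second frequency table for 시설명 does not match the row counts (58.9% rather than 81.1%). One reading that reproduces its numbers is that each value is split on whitespace and the resulting words are counted; this is an assumption, since the report does not label the table. The sketch below also recomputes the mean length from the two value counts.

```python
from collections import Counter

# Value counts copied from the "Common Values" table for 시설명.
value_counts = {"성남종합운동장": 335, "성남시 평생학습관 스포츠센터": 78}

# Assumption: words are obtained by splitting each value on whitespace.
word_counts = Counter()
for value, count in value_counts.items():
    for word in value.split():
        word_counts[word] += count

total_words = sum(word_counts.values())
word_share = {w: round(100 * c / total_words, 1) for w, c in word_counts.items()}

# Mean string length, weighted by how often each value occurs.
total_rows = sum(value_counts.values())
mean_length = sum(len(v) * c for v, c in value_counts.items()) / total_rows

print(total_words, word_share)
print(f"mean length: {mean_length:.7f}")
```

Under that assumption there are 335 + 3 × 78 = 569 words in total, which gives the 58.9% and 13.7% shares shown in the table.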

장소
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB
수영장: 213
골프장: 72
다목적실: 36
기구필라테스실: 32
실내체육관: 28
Other values (3): 32

Length

Max length: 7
Median length: 3
Mean length: 3.590799
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수영장
2nd row: 수영장
3rd row: 수영장
4th row: 수영장
5th row: 수영장

Common Values

Value | Count | Frequency (%)
수영장 | 213 | 51.6%
골프장 | 72 | 17.4%
다목적실 | 36 | 8.7%
기구필라테스실 | 32 | 7.7%
실내체육관 | 28 | 6.8%
라켓볼장 | 18 | 4.4%
헬스장 | 12 | 2.9%
다목적체육관 | 2 | 0.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

강좌명
Categorical

HIGH CORRELATION 

Distinct: 41
Distinct (%): 9.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB
골프: 44
수영(연수): 40
수영(고급): 36
기구필라테스: 32
소그룹 레슨: 28
Other values (36): 233

Length

Max length: 16
Median length: 15
Mean length: 6.7021792
Min length: 2

Unique

Unique: 3
Unique (%): 0.7%

Sample

1st row: 수영(고급)
2nd row: 수영(고급)
3rd row: 수영(고급)
4th row: 수영(고급)
5th row: 수영(고급)

Common Values

Value | Count | Frequency (%)
골프 | 44 | 10.7%
수영(연수) | 40 | 9.7%
수영(고급) | 36 | 8.7%
기구필라테스 | 32 | 7.7%
소그룹 레슨 | 28 | 6.8%
월자유 | 24 | 5.8%
일일자유이용(수영) | 21 | 5.1%
라켓볼 | 18 | 4.4%
필라테스 | 18 | 4.4%
수영(중급) | 18 | 4.4%
Other values (31) | 134 | 32.4%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
골프 | 44 | 8.6%
수영(연수 | 40 | 7.8%
수영(고급 | 36 | 7.0%
기구필라테스 | 32 | 6.2%
주5회 | 30 | 5.8%
소그룹 | 28 | 5.5%
레슨 | 28 | 5.5%
월자유 | 24 | 4.7%
일일자유이용(수영 | 21 | 4.1%
라켓볼 | 18 | 3.5%
Other values (32) | 212 | 41.3%

수강시간
Categorical

HIGH CORRELATION 

Distinct: 33
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB
09:00: 53
19:00: 44
06:00: 42
20:00: 41
10:00: 39
Other values (28): 194

Length

Max length: 11
Median length: 5
Mean length: 6.1331719
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 06:00
2nd row: 06:00
3rd row: 06:00
4th row: 06:00
5th row: 06:00

Common Values

Value | Count | Frequency (%)
09:00 | 53 | 12.8%
19:00 | 44 | 10.7%
06:00 | 42 | 10.2%
20:00 | 41 | 9.9%
10:00 | 39 | 9.4%
07:00 | 37 | 9.0%
16:00 | 21 | 5.1%
17:00 | 17 | 4.1%
12:00 | 10 | 2.4%
13:00 | 10 | 2.4%
Other values (23) | 99 | 24.0%

Length

[Figure: Histogram of lengths of the category]

강습일
Categorical

HIGH CORRELATION 

Distinct: 16
Distinct (%): 3.9%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB
월+화+수+목+금+토: 84
월+화+수+목+금: 70
월+수+금: 67
화+목: 58
월+화+목+금: 36
Other values (11): 98

Length

Max length: 11
Median length: 9
Mean length: 7.3050847
Min length: 1

Unique

Unique: 1
Unique (%): 0.2%

Sample

1st row: 월+화+목+금
2nd row: 월+화+수+목+금
3rd row: 월+화+수+목+금+토
4th row: 월+화+목+금
5th row: 월+화+수+목+금+토

Common Values

Value | Count | Frequency (%)
월+화+수+목+금+토 | 84 | 20.3%
월+화+수+목+금 | 70 | 16.9%
월+수+금 | 67 | 16.2%
화+목 | 58 | 14.0%
월+화+목+금 | 36 | 8.7%
월+화+수+목+금 | 34 | 8.2%
화+목+토 | 14 | 3.4%
월+수+금 | 14 | 3.4%
월+화+목+금 | 12 | 2.9%
토+일+공휴일 | 9 | 2.2%
Other values (6) | 15 | 3.6%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
월+화+수+목+금 | 104 | 25.2%
월+화+수+목+금+토 | 84 | 20.3%
월+수+금 | 81 | 19.6%
화+목 | 62 | 15.0%
월+화+목+금 | 48 | 11.6%
화+목+토 | 14 | 3.4%
토+일+공휴일 | 9 | 2.2%
화+목+금 | 4 | 1.0%
공휴일 | 2 | 0.5%
(blank) | 2 | 0.5%
Other values (2) | 3 | 0.7%

대상
Categorical

HIGH CORRELATION 

Distinct: 11
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB
청소년: 176
일반: 157
성인: 26
어린이: 25
청소년: 9
Other values (6): 20

Length

Max length: 11
Median length: 8
Mean length: 2.8280872
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반
2nd row: 일반
3rd row: 일반
4th row: 청소년
5th row: 청소년

Common Values

Value | Count | Frequency (%)
청소년 | 176 | 42.6%
일반 | 157 | 38.0%
성인 | 26 | 6.3%
어린이 | 25 | 6.1%
청소년 | 9 | 2.2%
성인 | 8 | 1.9%
초등생3학년이상 | 4 | 1.0%
성인 | 2 | 0.5%
성인여성(60세미만) | 2 | 0.5%
청소년여성 | 2 | 0.5%

Length

[Figure: Histogram of lengths of the category]

Value | Count | Frequency (%)
청소년 | 185 | 44.8%
일반 | 157 | 38.0%
성인 | 36 | 8.7%
어린이 | 25 | 6.1%
초등생3학년이상 | 4 | 1.0%
성인여성(60세미만 | 2 | 0.5%
청소년여성 | 2 | 0.5%
어린이(초등생 | 2 | 0.5%

정원
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 13
Distinct (%): 7.6%
Missing: 242
Missing (%): 58.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 44.783626
Minimum: 3
Maximum: 400
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.8 KiB

Quantile statistics

Minimum: 3
5-th percentile: 3
Q1: 10
Median: 20
Q3: 40
95-th percentile: 200
Maximum: 400
Range: 397
Interquartile range (IQR): 30

Descriptive statistics

Standard deviation: 62.421197
Coefficient of variation (CV): 1.3938397
Kurtosis: 7.3678398
Mean: 44.783626
Median Absolute Deviation (MAD): 10
Skewness: 2.5817132
Sum: 7658
Variance: 3896.4058
Monotonicity: Not monotonic
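
Several of the 정원 statistics are derived from one another. The sketch below recomputes them from the reported figures, assuming the mean and the distinct percentage are taken over non-missing values only (413 − 242 = 171 rows); variable names are illustrative.

```python
# Figures copied from the 정원 section of the report.
n_observations = 413
n_missing = 242
total = 7658           # Sum
std = 62.421197        # Standard deviation
q1, q3 = 10, 40
minimum, maximum = 3, 400
n_distinct = 13

# Assumption: statistics are computed over non-missing values only.
n_present = n_observations - n_missing

mean = total / n_present                     # Mean = Sum / count
cv = std / mean                              # Coefficient of variation
variance = std ** 2                          # Variance = std squared
iqr = q3 - q1                                # Interquartile range
value_range = maximum - minimum              # Range
distinct_pct = 100 * n_distinct / n_present  # Distinct (%)

print(f"n={n_present} mean={mean:.6f} cv={cv:.7f} variance={variance:.4f}")
print(f"iqr={iqr} range={value_range} distinct={distinct_pct:.1f}%")
```

Note that 13 / 413 would give about 3.1%, so the reported 7.6% is only consistent with a denominator of 171 non-missing rows.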
[Figure: Histogram with fixed size bins (bins=13)]

Common values

Value | Count | Frequency (%)
30 | 30 | 7.3%
20 | 26 | 6.3%
10 | 22 | 5.3%
200 | 17 | 4.1%
6 | 16 | 3.9%
40 | 16 | 3.9%
3 | 14 | 3.4%
12 | 10 | 2.4%
60 | 8 | 1.9%
100 | 4 | 1.0%
Other values (3) | 8 | 1.9%
(Missing) | 242 | 58.6%

Minimum 10 values

Value | Count | Frequency (%)
3 | 14 | 3.4%
6 | 16 | 3.9%
10 | 22 | 5.3%
12 | 10 | 2.4%
20 | 26 | 6.3%
30 | 30 | 7.3%
40 | 16 | 3.9%
50 | 4 | 1.0%
60 | 8 | 1.9%
80 | 3 | 0.7%

Maximum 10 values

Value | Count | Frequency (%)
400 | 1 | 0.2%
200 | 17 | 4.1%
100 | 4 | 1.0%
80 | 3 | 0.7%
60 | 8 | 1.9%
50 | 4 | 1.0%
40 | 16 | 3.9%
30 | 30 | 7.3%
20 | 26 | 6.3%
12 | 10 | 2.4%

이용금액
Categorical

HIGH CORRELATION 

Distinct: 26
Distinct (%): 6.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.4 KiB
50000: 52
40000: 46
48000: 45
60000: 43
54000: 24
Other values (21): 203

Length

Max length: 6
Median length: 5
Mean length: 5.0072639
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 48000
2nd row: 54000
3rd row: 60000
4th row: 40000
5th row: 50000

Common Values

Value | Count | Frequency (%)
50000 | 52 | 12.6%
40000 | 46 | 11.1%
48000 | 45 | 10.9%
60000 | 43 | 10.4%
54000 | 24 | 5.8%
45000 | 24 | 5.8%
100000 | 23 | 5.6%
31000 | 20 | 4.8%
39000 | 18 | 4.4%
150000 | 16 | 3.9%
Other values (16) | 102 | 24.7%

Length

[Figure: Histogram of lengths of the category]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

 | 시설명 | 장소 | 강좌명 | 수강시간 | 강습일 | 대상 | 정원 | 이용금액
시설명 | 1.000 | 0.792 | 1.000 | 1.000 | 0.986 | 0.825 | 0.000 | 0.393
장소 | 0.792 | 1.000 | 1.000 | 0.752 | 0.871 | 0.565 | 0.489 | 0.900
강좌명 | 1.000 | 1.000 | 1.000 | 0.927 | 0.948 | 0.977 | 0.979 | 0.959
수강시간 | 1.000 | 0.752 | 0.927 | 1.000 | 0.752 | 0.899 | 0.632 | 0.611
강습일 | 0.986 | 0.871 | 0.948 | 0.752 | 1.000 | 0.675 | 0.608 | 0.902
대상 | 0.825 | 0.565 | 0.977 | 0.899 | 0.675 | 1.000 | 0.556 | 0.901
정원 | 0.000 | 0.489 | 0.979 | 0.632 | 0.608 | 0.556 | 1.000 | 0.849
이용금액 | 0.393 | 0.900 | 0.959 | 0.611 | 0.902 | 0.901 | 0.849 | 1.000

[Figure: correlation heatmap]

 | 강습일 | 대상 | 이용금액 | 수강시간 | 시설명 | 강좌명 | 장소
강습일 | 1.000 | 0.329 | 0.529 | 0.307 | 0.886 | 0.620 | 0.504
대상 | 0.329 | 1.000 | 0.588 | 0.510 | 0.813 | 0.797 | 0.308
이용금액 | 0.529 | 0.588 | 1.000 | 0.179 | 0.303 | 0.579 | 0.631
수강시간 | 0.307 | 0.510 | 0.179 | 1.000 | 0.962 | 0.444 | 0.395
시설명 | 0.886 | 0.813 | 0.303 | 0.962 | 1.000 | 0.951 | 0.610
강좌명 | 0.620 | 0.797 | 0.579 | 0.444 | 0.951 | 1.000 | 0.958
장소 | 0.504 | 0.308 | 0.631 | 0.395 | 0.610 | 0.958 | 1.000

[Figure: correlation heatmap]

 | 정원 | 시설명 | 장소 | 강좌명 | 수강시간 | 강습일 | 대상 | 이용금액
정원 | 1.000 | 0.000 | 0.324 | 0.827 | 0.327 | 0.295 | 0.358 | 0.518
시설명 | 0.000 | 1.000 | 0.610 | 0.951 | 0.962 | 0.886 | 0.813 | 0.303
장소 | 0.324 | 0.610 | 1.000 | 0.958 | 0.395 | 0.504 | 0.308 | 0.631
강좌명 | 0.827 | 0.951 | 0.958 | 1.000 | 0.444 | 0.620 | 0.797 | 0.579
수강시간 | 0.327 | 0.962 | 0.395 | 0.444 | 1.000 | 0.307 | 0.510 | 0.179
강습일 | 0.295 | 0.886 | 0.504 | 0.620 | 0.307 | 1.000 | 0.329 | 0.529
대상 | 0.358 | 0.813 | 0.308 | 0.797 | 0.510 | 0.329 | 1.000 | 0.588
이용금액 | 0.518 | 0.303 | 0.631 | 0.579 | 0.179 | 0.529 | 0.588 | 1.000
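
The report does not state which association measure each matrix uses. Cramér's V is one of the measures ydata-profiling offers for pairs of categorical variables, so the sketch below shows how an uncorrected Cramér's V can be computed from two label columns. The function name and the toy labels are hypothetical, and the library's own computation may differ (for example by applying a bias correction), so this is not expected to reproduce the values above.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length sequences of labels."""
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed counts per (x, y) cell
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    k = min(len(row_totals), len(col_totals))
    if n == 0 or k < 2:
        return 0.0  # convention chosen here: no association with fewer than two levels
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * (k - 1)))


# Hypothetical toy labels: the second column is fully determined by the first.
facility = ["A", "A", "B", "B"]
venue = ["pool", "pool", "gym", "gym"]
# Hypothetical toy labels: the second column is independent of the first.
slot = ["am", "pm", "am", "pm"]

v_dependent = cramers_v(facility, venue)
v_independent = cramers_v(facility, slot)
print(v_dependent, v_independent)
```

A value near 1 means one variable almost determines the other, which is the kind of relationship the "High correlation" alerts flag; a value near 0 means the two columns vary independently.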

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]
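
The nullity plots summarize how many values are missing in each column. As a minimal sketch of that count, the snippet below tallies missing entries for two columns using the first ten rows shown in the Sample section, with `None` standing in for `<NA>`; the dictionary layout is an illustrative choice, not the report's own data structure.

```python
# 정원 and 이용금액 for rows 0-9 of the Sample section; None stands in for <NA>.
rows = [
    {"정원": 30, "이용금액": 48000},
    {"정원": None, "이용금액": 54000},
    {"정원": None, "이용금액": 60000},
    {"정원": None, "이용금액": 40000},
    {"정원": None, "이용금액": 50000},
    {"정원": None, "이용금액": 45000},
    {"정원": 30, "이용금액": 48000},
    {"정원": None, "이용금액": 54000},
    {"정원": None, "이용금액": 60000},
    {"정원": None, "이용금액": 45000},
]

# Count missing entries per column.
missing_per_column = {
    column: sum(1 for row in rows if row[column] is None)
    for column in rows[0]
}
print(missing_per_column)
```

In this small excerpt only 정원 has gaps, matching the alert that 정원 is the one variable with missing values.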

Sample

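First rows

Index | 시설명 | 장소 | 강좌명 | 수강시간 | 강습일 | 대상 | 정원 | 이용금액
0 | 성남종합운동장 | 수영장 | 수영(고급) | 06:00 | 월+화+목+금 | 일반 | 30 | 48000
1 | 성남종합운동장 | 수영장 | 수영(고급) | 06:00 | 월+화+수+목+금 | 일반 | <NA> | 54000
2 | 성남종합운동장 | 수영장 | 수영(고급) | 06:00 | 월+화+수+목+금+토 | 일반 | <NA> | 60000
3 | 성남종합운동장 | 수영장 | 수영(고급) | 06:00 | 월+화+목+금 | 청소년 | <NA> | 40000
4 | 성남종합운동장 | 수영장 | 수영(고급) | 06:00 | 월+화+수+목+금+토 | 청소년 | <NA> | 50000
5 | 성남종합운동장 | 수영장 | 수영(고급) | 06:00 | 월+화+수+목+금 | 청소년 | <NA> | 45000
6 | 성남종합운동장 | 수영장 | 수영(고급) | 07:00 | 월+화+목+금 | 일반 | 30 | 48000
7 | 성남종합운동장 | 수영장 | 수영(고급) | 07:00 | 월+화+수+목+금 | 일반 | <NA> | 54000
8 | 성남종합운동장 | 수영장 | 수영(고급) | 07:00 | 월+화+수+목+금+토 | 일반 | <NA> | 60000
9 | 성남종합운동장 | 수영장 | 수영(고급) | 07:00 | 월+화+수+목+금 | 청소년 | <NA> | 45000

Last rows

Index | 시설명 | 장소 | 강좌명 | 수강시간 | 강습일 | 대상 | 정원 | 이용금액
403 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주3회 라인댄스(1개월) | 13:00~13:50 | 월+수+금 | 성인 | 30 | 39000
404 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주3회 라인댄스(1개월) | 13:00~13:50 | 월+수+금 | 청소년 | <NA> | 31000
405 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주3회 라인댄스(1개월) | 14:00~14:50 | 월+수+금 | 성인 | 30 | 39000
406 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주3회 라인댄스(1개월) | 15:00~15:50 | 월+수+금 | 청소년 | <NA> | 31000
407 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주3회 라인댄스(1개월) | 14:00~14:50 | 월+수+금 | 성인 | 30 | 39000
408 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주3회 라인댄스(1개월) | 15:00~15:50 | 월+수+금 | 청소년 | <NA> | 31000
409 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주2회 성인벨리댄스(1개월) | 11:00~11:50 | 화+목 | 성인 | 20 | 26000
410 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주2회 청소년벨리댄스(1개월) | 11:00~11:50 | 화+목 | 청소년 | <NA> | 20000
411 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주2회 줄넘기(1개월) | 16:00~16:50 | 화+목 | 어린이(초등생) | 30 | 16000
412 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주2회 줄넘기(1개월) | 17:00~17:50 | 화+목 | 어린이(초등생) | 30 | 16000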

Duplicate rows

Most frequently occurring

Index | 시설명 | 장소 | 강좌명 | 수강시간 | 강습일 | 대상 | 정원 | 이용금액 | # duplicates
0 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주3회 라인댄스(1개월) | 14:00~14:50 | 월+수+금 | 성인 | 30 | 39000 | 2
1 | 성남시 평생학습관 스포츠센터 | 실내체육관 | 주3회 라인댄스(1개월) | 15:00~15:50 | 월+수+금 | 청소년 | <NA> | 31000 | 2
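
Duplicate rows of this kind can be found by counting identical records. The sketch below does so for rows 403 to 408 of the Sample section, keeping only the four columns that differ between them (수강시간, 대상, 정원, 이용금액) and using `None` for `<NA>`; treating two missing values as equal is an assumption made here for illustration.

```python
from collections import Counter

# (수강시간, 대상, 정원, 이용금액) for rows 403-408; the other columns are identical.
rows = [
    ("13:00~13:50", "성인", 30, 39000),
    ("13:00~13:50", "청소년", None, 31000),
    ("14:00~14:50", "성인", 30, 39000),
    ("15:00~15:50", "청소년", None, 31000),
    ("14:00~14:50", "성인", 30, 39000),
    ("15:00~15:50", "청소년", None, 31000),
]

# Count each distinct record and keep those that occur more than once.
counts = Counter(rows)
duplicates = {row: n for row, n in counts.items() if n > 1}

for row, n in duplicates.items():
    print(row, n)
```

This yields two duplicated records, each occurring twice, which matches the two entries in the table above.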