Overview

Dataset statistics

Number of variables7
Number of observations119
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory57.1 B

Variable types

Categorical7

Dataset

Description남동구도시관리공단에서 운영하는 남동국민체육센터 및 남동수영장의 체육 강좌 정보 제공
Author인천광역시남동구도시관리공단
URLhttps://www.data.go.kr/data/15001930/fileData.do

Alerts

시설구분 is highly overall correlated with 회원구분 and 1 other fieldsHigh correlation
종목구분 is highly overall correlated with 회원구분 and 4 other fieldsHigh correlation
회원구분 is highly overall correlated with 시설구분 and 4 other fieldsHigh correlation
대상(등급) is highly overall correlated with 종목구분 and 2 other fieldsHigh correlation
요일 is highly overall correlated with 시설구분 and 4 other fieldsHigh correlation
시간 is highly overall correlated with 종목구분 and 1 other fieldsHigh correlation
회비 is highly overall correlated with 종목구분 and 3 other fieldsHigh correlation
종목구분 is highly imbalanced (58.0%)Imbalance

Reproduction

Analysis started2023-12-12 00:51:16.301179
Analysis finished2023-12-12 00:51:17.378507
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시설구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
남동국민체육센터
64 
남동수영장
55 

Length

Max length8
Median length8
Mean length6.6134454
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남동국민체육센터
2nd row남동국민체육센터
3rd row남동국민체육센터
4th row남동국민체육센터
5th row남동국민체육센터

Common Values

ValueCountFrequency (%)
남동국민체육센터 64
53.8%
남동수영장 55
46.2%

Length

2023-12-12T09:51:17.475314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:51:17.621825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남동국민체육센터 64
53.8%
남동수영장 55
46.2%

종목구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct14
Distinct (%)11.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
수영
91 
아쿠아로빅
 
4
에어로빅
 
4
요가
 
4
탁구
 
4
Other values (9)
12 

Length

Max length7
Median length2
Mean length2.4537815
Min length2

Unique

Unique6 ?
Unique (%)5.0%

Sample

1st row수영
2nd row수영
3rd row수영
4th row수영
5th row수영

Common Values

ValueCountFrequency (%)
수영 91
76.5%
아쿠아로빅 4
 
3.4%
에어로빅 4
 
3.4%
요가 4
 
3.4%
탁구 4
 
3.4%
필라테스 2
 
1.7%
저녁요가 2
 
1.7%
아쿠아피트니스 2
 
1.7%
헬스 1
 
0.8%
프리댄스 1
 
0.8%
Other values (4) 4
 
3.4%

Length

2023-12-12T09:51:17.822831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수영 91
75.8%
아쿠아로빅 4
 
3.3%
에어로빅 4
 
3.3%
요가 4
 
3.3%
탁구 4
 
3.3%
필라테스 2
 
1.7%
저녁요가 2
 
1.7%
아쿠아피트니스 2
 
1.7%
헬스 1
 
0.8%
프리댄스 1
 
0.8%
Other values (5) 5
 
4.2%

회원구분
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)21.8%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
어린이반
16 
<NA>
12 
20:00-20:50
 
5
어린이2반
 
5
직장1반
 
5
Other values (21)
76 

Length

Max length11
Median length8
Mean length6.2436975
Min length2

Unique

Unique5 ?
Unique (%)4.2%

Sample

1st row새벽1반
2nd row새벽1반
3rd row새벽1반
4th row새벽1반
5th row새벽2반

Common Values

ValueCountFrequency (%)
어린이반 16
 
13.4%
<NA> 12
 
10.1%
20:00-20:50 5
 
4.2%
어린이2반 5
 
4.2%
직장1반 5
 
4.2%
직장2반 5
 
4.2%
10:00-10:50 5
 
4.2%
어린이1반 5
 
4.2%
19:00-19:50 5
 
4.2%
직장인반 5
 
4.2%
Other values (16) 51
42.9%

Length

2023-12-12T09:51:18.025705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
어린이반 16
 
13.4%
na 12
 
10.1%
19:00-19:50 5
 
4.2%
09:00-09:50 5
 
4.2%
07:00-07:50 5
 
4.2%
06:00-06:50 5
 
4.2%
직장인반 5
 
4.2%
11:00-11:50 5
 
4.2%
어린이1반 5
 
4.2%
10:00-10:50 5
 
4.2%
Other values (16) 51
42.9%

대상(등급)
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)10.1%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
<NA>
23 
마스터즈
18 
연수
16 
초급
15 
중급
10 
Other values (7)
37 

Length

Max length13
Median length4
Mean length2.9663866
Min length2

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row초중급
2nd row상고급
3rd row연수
4th row마스터즈
5th row초중급

Common Values

ValueCountFrequency (%)
<NA> 23
19.3%
마스터즈 18
15.1%
연수 16
13.4%
초급 15
12.6%
중급 10
8.4%
상급 9
 
7.6%
초중급 7
 
5.9%
상고급 7
 
5.9%
고급 7
 
5.9%
중상급 4
 
3.4%
Other values (2) 3
 
2.5%

Length

2023-12-12T09:51:18.184562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 23
19.0%
마스터즈 18
14.9%
연수 16
13.2%
초급 15
12.4%
중급 10
8.3%
상급 9
 
7.4%
초중급 7
 
5.8%
상고급 7
 
5.8%
고급 7
 
5.8%
중상급 4
 
3.3%
Other values (4) 5
 
4.1%

요일
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
화,목
37 
월,수,금
30 
화,목(월,수,금)
18 
월,수,금(화,목)
16 
월-금
15 
Other values (3)
 
3

Length

Max length18
Median length10
Mean length5.6470588
Min length3

Unique

Unique3 ?
Unique (%)2.5%

Sample

1st row화,목(월,수,금)
2nd row화,목(월,수,금)
3rd row월,수,금(화,목)
4th row월,수,금(화,목)
5th row화,목(월,수,금)

Common Values

ValueCountFrequency (%)
화,목 37
31.1%
월,수,금 30
25.2%
화,목(월,수,금) 18
15.1%
월,수,금(화,목) 16
13.4%
월-금 15
12.6%
월-금(토 09:00~16:00) 1
 
0.8%
월,수 1
 
0.8%
화,목,토 1
 
0.8%

Length

2023-12-12T09:51:18.329228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:51:18.454067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
화,목 37
30.8%
월,수,금 30
25.0%
화,목(월,수,금 18
15.0%
월,수,금(화,목 16
13.3%
월-금 15
12.5%
월-금(토 1
 
0.8%
09:00~16:00 1
 
0.8%
월,수 1
 
0.8%
화,목,토 1
 
0.8%

시간
Categorical

HIGH CORRELATION 

Distinct18
Distinct (%)15.1%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
16:00-16:50
13 
17:00-17:50
13 
19:00-19:50
13 
20:00-20:50
13 
10:00-10:50
12 
Other values (13)
55 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique5 ?
Unique (%)4.2%

Sample

1st row06:00-06:50
2nd row06:00-06:50
3rd row06:00-06:50
4th row06:00-06:50
5th row07:00-07:50

Common Values

ValueCountFrequency (%)
16:00-16:50 13
10.9%
17:00-17:50 13
10.9%
19:00-19:50 13
10.9%
20:00-20:50 13
10.9%
10:00-10:50 12
10.1%
11:00-11:50 12
10.1%
09:00-09:50 10
8.4%
06:00-06:50 9
7.6%
07:00-07:50 9
7.6%
15:00-15:50 4
 
3.4%
Other values (8) 11
9.2%

Length

2023-12-12T09:51:18.624561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
16:00-16:50 13
10.9%
17:00-17:50 13
10.9%
19:00-19:50 13
10.9%
20:00-20:50 13
10.9%
10:00-10:50 12
10.1%
11:00-11:50 12
10.1%
09:00-09:50 10
8.4%
06:00-06:50 9
7.6%
07:00-07:50 9
7.6%
15:00-15:50 4
 
3.4%
Other values (8) 11
9.2%

회비
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
56,000
40 
60,000
32 
44,000
17 
50,000
10 
48,000
10 
Other values (2)
10 

Length

Max length6
Median length6
Mean length5.9915966
Min length5

Unique

Unique1 ?
Unique (%)0.8%

Sample

1st row56,000
2nd row56,000
3rd row60,000
4th row60,000
5th row56,000

Common Values

ValueCountFrequency (%)
56,000 40
33.6%
60,000 32
26.9%
44,000 17
14.3%
50,000 10
 
8.4%
48,000 10
 
8.4%
40,000 9
 
7.6%
3,000 1
 
0.8%

Length

2023-12-12T09:51:18.761198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:51:18.888961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
56,000 40
33.6%
60,000 32
26.9%
44,000 17
14.3%
50,000 10
 
8.4%
48,000 10
 
8.4%
40,000 9
 
7.6%
3,000 1
 
0.8%

Correlations

2023-12-12T09:51:18.985686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설구분종목구분회원구분대상(등급)요일시간회비
시설구분1.0000.4921.0000.4950.9500.0000.396
종목구분0.4921.0000.9530.8170.8340.9310.879
회원구분1.0000.9531.0000.5020.8890.9790.927
대상(등급)0.4950.8170.5021.0000.7960.5160.817
요일0.9500.8340.8890.7961.0000.8010.773
시간0.0000.9310.9790.5160.8011.0000.718
회비0.3960.8790.9270.8170.7730.7181.000
2023-12-12T09:51:19.108807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종목구분시설구분회비요일회원구분시간대상(등급)
종목구분1.0000.3650.5170.5510.7120.6510.670
시설구분0.3651.0000.4150.7830.8840.0000.451
회비0.5170.4151.0000.5540.6620.4000.574
요일0.5510.7830.5541.0000.5630.4760.577
회원구분0.7120.8840.6620.5631.0000.7780.196
시간0.6510.0000.4000.4760.7781.0000.245
대상(등급)0.6700.4510.5740.5770.1960.2451.000
2023-12-12T09:51:19.240107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설구분종목구분회원구분대상(등급)요일시간회비
시설구분1.0000.3650.8840.4510.7830.0000.415
종목구분0.3651.0000.7120.6700.5510.6510.517
회원구분0.8840.7121.0000.1960.5630.7780.662
대상(등급)0.4510.6700.1961.0000.5770.2450.574
요일0.7830.5510.5630.5771.0000.4760.554
시간0.0000.6510.7780.2450.4761.0000.400
회비0.4150.5170.6620.5740.5540.4001.000

Missing values

2023-12-12T09:51:17.179221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:51:17.322569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설구분종목구분회원구분대상(등급)요일시간회비
0남동국민체육센터수영새벽1반초중급화,목(월,수,금)06:00-06:5056,000
1남동국민체육센터수영새벽1반상고급화,목(월,수,금)06:00-06:5056,000
2남동국민체육센터수영새벽1반연수월,수,금(화,목)06:00-06:5060,000
3남동국민체육센터수영새벽1반마스터즈월,수,금(화,목)06:00-06:5060,000
4남동국민체육센터수영새벽2반초중급화,목(월,수,금)07:00-07:5056,000
5남동국민체육센터수영새벽2반상고급화,목(월,수,금)07:00-07:5056,000
6남동국민체육센터수영새벽2반연수월,수,금(화,목)07:00-07:5060,000
7남동국민체육센터수영새벽2반마스터즈월,수,금(화,목)07:00-07:5060,000
8남동국민체육센터수영오전1반초중급화,목(월,수,금)09:00-09:5056,000
9남동국민체육센터수영오전1반상고급화,목(월,수,금)09:00-09:5056,000
시설구분종목구분회원구분대상(등급)요일시간회비
109남동수영장수영어린이반고급화,목17:00-17:5040,000
110남동수영장수영어린이반마스터즈화,목17:00-17:5040,000
111남동수영장수영어린이반초급월,수,금17:00-17:5044,000
112남동수영장수영어린이반중상급월,수,금17:00-17:5044,000
113남동수영장수영어린이반고급월,수,금17:00-17:5044,000
114남동수영장수영어린이반마스터즈월,수,금17:00-17:5044,000
115남동수영장아쿠아워킹조깅<NA><NA>월,수,금14:00-14:5060,000
116남동수영장아쿠아피트니스<NA><NA>월,수,금15:00-15:5060,000
117남동수영장아쿠아피트니스<NA><NA>화,목15:00-15:5056,000
118남동수영장생존수영단체(초중고교)1인당 1회 3,000원화,목14:00-14:503,000