Overview

Dataset statistics

Number of variables8
Number of observations32
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory69.1 B

Variable types

Numeric1
Categorical5
Text2

Dataset

Description동대문구체육관에서 운영하는 체육, 문화 프로그램별 수강시기, 수강료, 대상 등 정보 제공
Author동대문구시설관리공단
URLhttps://www.data.go.kr/data/15044057/fileData.do

Alerts

연번 is highly overall correlated with 분야 and 1 other fieldsHigh correlation
분야 is highly overall correlated with 연번 and 4 other fieldsHigh correlation
요일 is highly overall correlated with 분야High correlation
대상 is highly overall correlated with 분야 and 1 other fieldsHigh correlation
수강료 is highly overall correlated with 분야 and 1 other fieldsHigh correlation
정원 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 has unique valuesUnique
프로그램명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:15:25.537379
Analysis finished2023-12-12 06:15:26.217277
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct32
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.5
Minimum1
Maximum32
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size420.0 B
2023-12-12T15:15:26.291184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.55
Q18.75
median16.5
Q324.25
95-th percentile30.45
Maximum32
Range31
Interquartile range (IQR)15.5

Descriptive statistics

Standard deviation9.3808315
Coefficient of variation (CV)0.56853524
Kurtosis-1.2
Mean16.5
Median Absolute Deviation (MAD)8
Skewness0
Sum528
Variance88
MonotonicityStrictly increasing
2023-12-12T15:15:26.420085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
1 1
 
3.1%
18 1
 
3.1%
32 1
 
3.1%
31 1
 
3.1%
30 1
 
3.1%
29 1
 
3.1%
28 1
 
3.1%
27 1
 
3.1%
26 1
 
3.1%
25 1
 
3.1%
Other values (22) 22
68.8%
ValueCountFrequency (%)
1 1
3.1%
2 1
3.1%
3 1
3.1%
4 1
3.1%
5 1
3.1%
6 1
3.1%
7 1
3.1%
8 1
3.1%
9 1
3.1%
10 1
3.1%
ValueCountFrequency (%)
32 1
3.1%
31 1
3.1%
30 1
3.1%
29 1
3.1%
28 1
3.1%
27 1
3.1%
26 1
3.1%
25 1
3.1%
24 1
3.1%
23 1
3.1%

분야
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)6.2%
Missing0
Missing (%)0.0%
Memory size388.0 B
문화
20 
체육
12 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row문화
2nd row문화
3rd row문화
4th row문화
5th row문화

Common Values

ValueCountFrequency (%)
문화 20
62.5%
체육 12
37.5%

Length

2023-12-12T15:15:26.538077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:15:26.641536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
문화 20
62.5%
체육 12
37.5%

프로그램명
Text

UNIQUE 

Distinct32
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size388.0 B
2023-12-12T15:15:26.842335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length9.625
Min length2

Characters and Unicode

Total characters308
Distinct characters79
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row웰빙 요가 A
2nd row웰빙 요가 B
3rd row웰빙 요가 C
4th row재즈 댄스
5th row실버라인댄스 (중급)
ValueCountFrequency (%)
초급 5
 
7.1%
배드민턴 4
 
5.7%
인라인 4
 
5.7%
4
 
5.7%
웰빙 3
 
4.3%
요가 3
 
4.3%
실버라인댄스 3
 
4.3%
중급 3
 
4.3%
난타 2
 
2.9%
탁구 2
 
2.9%
Other values (32) 37
52.9%
2023-12-12T15:15:27.196447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
12.3%
) 19
 
6.2%
( 19
 
6.2%
16
 
5.2%
16
 
5.2%
11
 
3.6%
10
 
3.2%
9
 
2.9%
8
 
2.6%
6
 
1.9%
Other values (69) 156
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 219
71.1%
Space Separator 38
 
12.3%
Close Punctuation 19
 
6.2%
Open Punctuation 19
 
6.2%
Uppercase Letter 9
 
2.9%
Dash Punctuation 4
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
7.3%
16
 
7.3%
11
 
5.0%
10
 
4.6%
9
 
4.1%
8
 
3.7%
6
 
2.7%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (61) 125
57.1%
Uppercase Letter
ValueCountFrequency (%)
B 4
44.4%
A 3
33.3%
C 1
 
11.1%
T 1
 
11.1%
Space Separator
ValueCountFrequency (%)
38
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 219
71.1%
Common 80
 
26.0%
Latin 9
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
7.3%
16
 
7.3%
11
 
5.0%
10
 
4.6%
9
 
4.1%
8
 
3.7%
6
 
2.7%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (61) 125
57.1%
Common
ValueCountFrequency (%)
38
47.5%
) 19
23.8%
( 19
23.8%
- 4
 
5.0%
Latin
ValueCountFrequency (%)
B 4
44.4%
A 3
33.3%
C 1
 
11.1%
T 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 219
71.1%
ASCII 89
28.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38
42.7%
) 19
21.3%
( 19
21.3%
- 4
 
4.5%
B 4
 
4.5%
A 3
 
3.4%
C 1
 
1.1%
T 1
 
1.1%
Hangul
ValueCountFrequency (%)
16
 
7.3%
16
 
7.3%
11
 
5.0%
10
 
4.6%
9
 
4.1%
8
 
3.7%
6
 
2.7%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (61) 125
57.1%

시간
Text

Distinct22
Distinct (%)68.8%
Missing0
Missing (%)0.0%
Memory size388.0 B
2023-12-12T15:15:27.367854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length13
Mean length12.9375
Min length11

Characters and Unicode

Total characters414
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)46.9%

Sample

1st row06:00 ~ 06:50
2nd row11:00 ~ 11:50
3rd row20:30 ~ 21:40
4th row09:00 ~ 10:20
5th row14:00 ~ 14:50
ValueCountFrequency (%)
31
33.0%
15:50 5
 
5.3%
21:50 5
 
5.3%
15:00 4
 
4.3%
06:00 4
 
4.3%
14:00 4
 
4.3%
17:00 3
 
3.2%
19:50 2
 
2.1%
13:20 2
 
2.1%
10:20 2
 
2.1%
Other values (25) 32
34.0%
2023-12-12T15:15:27.731028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 108
26.1%
: 64
15.5%
62
15.0%
1 55
13.3%
~ 32
 
7.7%
5 29
 
7.0%
2 24
 
5.8%
6 9
 
2.2%
4 8
 
1.9%
3 8
 
1.9%
Other values (3) 15
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 256
61.8%
Other Punctuation 64
 
15.5%
Space Separator 62
 
15.0%
Math Symbol 32
 
7.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 108
42.2%
1 55
21.5%
5 29
 
11.3%
2 24
 
9.4%
6 9
 
3.5%
4 8
 
3.1%
3 8
 
3.1%
9 7
 
2.7%
7 4
 
1.6%
8 4
 
1.6%
Other Punctuation
ValueCountFrequency (%)
: 64
100.0%
Space Separator
ValueCountFrequency (%)
62
100.0%
Math Symbol
ValueCountFrequency (%)
~ 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 414
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 108
26.1%
: 64
15.5%
62
15.0%
1 55
13.3%
~ 32
 
7.7%
5 29
 
7.0%
2 24
 
5.8%
6 9
 
2.2%
4 8
 
1.9%
3 8
 
1.9%
Other values (3) 15
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 414
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 108
26.1%
: 64
15.5%
62
15.0%
1 55
13.3%
~ 32
 
7.7%
5 29
 
7.0%
2 24
 
5.8%
6 9
 
2.2%
4 8
 
1.9%
3 8
 
1.9%
Other values (3) 15
 
3.6%

요일
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)28.1%
Missing0
Missing (%)0.0%
Memory size388.0 B
화·목
월~금
월·수·금
월·수
Other values (4)

Length

Max length5
Median length3
Mean length3
Min length1

Unique

Unique4 ?
Unique (%)12.5%

Sample

1st row월·수·금
2nd row월·수·금
3rd row월·수·금
4th row월·수·금
5th row월·화

Common Values

ValueCountFrequency (%)
화·목 9
28.1%
월~금 7
21.9%
월·수·금 5
15.6%
4
12.5%
월·수 3
 
9.4%
월·화 1
 
3.1%
월·금 1
 
3.1%
1
 
3.1%
화,목 1
 
3.1%

Length

2023-12-12T15:15:27.882971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:15:28.068524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
화·목 9
28.1%
월~금 7
21.9%
월·수·금 5
15.6%
4
12.5%
월·수 3
 
9.4%
월·화 1
 
3.1%
월·금 1
 
3.1%
1
 
3.1%
화,목 1
 
3.1%

대상
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)37.5%
Missing0
Missing (%)0.0%
Memory size388.0 B
성인
11 
누구나
55세이상
성인·청소년
초등
Other values (7)

Length

Max length6
Median length5
Mean length3.40625
Min length2

Unique

Unique5 ?
Unique (%)15.6%

Sample

1st row성인
2nd row성인
3rd row성인
4th row성인여성
5th row55세이상

Common Values

ValueCountFrequency (%)
성인 11
34.4%
누구나 4
 
12.5%
55세이상 3
 
9.4%
성인·청소년 3
 
9.4%
초등 2
 
6.2%
청소년 2
 
6.2%
6세~초등 2
 
6.2%
성인여성 1
 
3.1%
7~13세 1
 
3.1%
어린이 1
 
3.1%
Other values (2) 2
 
6.2%

Length

2023-12-12T15:15:28.201989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성인 11
34.4%
누구나 4
 
12.5%
55세이상 3
 
9.4%
성인·청소년 3
 
9.4%
초등 2
 
6.2%
청소년 2
 
6.2%
6세~초등 2
 
6.2%
성인여성 1
 
3.1%
7~13세 1
 
3.1%
어린이 1
 
3.1%
Other values (2) 2
 
6.2%

수강료
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)43.8%
Missing0
Missing (%)0.0%
Memory size388.0 B
40000
10 
10000
20000
35000
32000
Other values (9)
11 

Length

Max length5
Median length5
Mean length4.90625
Min length2

Unique

Unique7 ?
Unique (%)21.9%

Sample

1st row40000
2nd row40000
3rd row40000
4th row50000
5th row10000

Common Values

ValueCountFrequency (%)
40000 10
31.2%
10000 3
 
9.4%
20000 3
 
9.4%
35000 3
 
9.4%
32000 2
 
6.2%
88000 2
 
6.2%
70000 2
 
6.2%
50000 1
 
3.1%
30000 1
 
3.1%
45000 1
 
3.1%
Other values (4) 4
 
12.5%

Length

2023-12-12T15:15:28.330920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
40000 10
31.2%
10000 3
 
9.4%
20000 3
 
9.4%
35000 3
 
9.4%
32000 2
 
6.2%
88000 2
 
6.2%
70000 2
 
6.2%
50000 1
 
3.1%
30000 1
 
3.1%
45000 1
 
3.1%
Other values (4) 4
 
12.5%

정원
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)37.5%
Missing0
Missing (%)0.0%
Memory size388.0 B
30
20
25
60
10
Other values (7)
10 

Length

Max length4
Median length2
Mean length2
Min length1

Unique

Unique4 ?
Unique (%)12.5%

Sample

1st row30
2nd row30
3rd row30
4th row20
5th row60

Common Values

ValueCountFrequency (%)
30 7
21.9%
20 6
18.8%
25 4
12.5%
60 3
9.4%
10 2
 
6.2%
16 2
 
6.2%
35 2
 
6.2%
6 2
 
6.2%
40 1
 
3.1%
4 1
 
3.1%
Other values (2) 2
 
6.2%

Length

2023-12-12T15:15:28.454619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
30 7
21.2%
20 6
18.2%
25 4
12.1%
60 3
9.1%
6 3
9.1%
10 2
 
6.1%
16 2
 
6.1%
35 2
 
6.1%
40 1
 
3.0%
4 1
 
3.0%
Other values (2) 2
 
6.1%

Interactions

2023-12-12T15:15:25.879635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:15:28.547887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분야프로그램명시간요일대상수강료정원
연번1.0000.9951.0000.9370.7090.7580.8350.860
분야0.9951.0001.0000.9550.6290.9290.9400.832
프로그램명1.0001.0001.0001.0001.0001.0001.0001.000
시간0.9370.9551.0001.0000.0000.7770.8000.822
요일0.7090.6291.0000.0001.0000.4040.2440.659
대상0.7580.9291.0000.7770.4041.0000.8930.824
수강료0.8350.9401.0000.8000.2440.8931.0000.731
정원0.8600.8321.0000.8220.6590.8240.7311.000
2023-12-12T15:15:28.650517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상분야정원요일수강료
대상1.0000.6350.4890.1220.572
분야0.6351.0000.6860.5520.620
정원0.4890.6861.0000.3370.358
요일0.1220.5520.3371.0000.000
수강료0.5720.6200.3580.0001.000
2023-12-12T15:15:28.738107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분야요일대상수강료정원
연번1.0000.8010.4000.4110.4700.574
분야0.8011.0000.5520.6350.6200.686
요일0.4000.5521.0000.1220.0000.337
대상0.4110.6350.1221.0000.5720.489
수강료0.4700.6200.0000.5721.0000.358
정원0.5740.6860.3370.4890.3581.000

Missing values

2023-12-12T15:15:26.019735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:15:26.161092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번분야프로그램명시간요일대상수강료정원
01문화웰빙 요가 A06:00 ~ 06:50월·수·금성인4000030
12문화웰빙 요가 B11:00 ~ 11:50월·수·금성인4000030
23문화웰빙 요가 C20:30 ~ 21:40월·수·금성인4000030
34문화재즈 댄스09:00 ~ 10:20월·수·금성인여성5000020
45문화실버라인댄스 (중급)14:00 ~ 14:50월·화55세이상1000060
56문화실버라인댄스 (초급)15:00 ~ 15:50월·금55세이상1000060
67문화실버라인댄스 (왕초보)15:00 ~ 15:50화·목55세이상1000060
78문화라인댄스(상급반)17:00 ~ 18:20월·수성인2000030
89문화라인댄스(소수정예반)17:00 ~ 18:20성인2000010
910문화체형교정 필라테스09:30 ~ 10:20화,목성인3500030
연번분야프로그램명시간요일대상수강료정원
2223체육배드민턴 - 어린이(자유)06:00 ~ 21:50월~금어린이2772025
2324체육배드민턴 - 성인(자유)06:00 ~ 21:50월~금성인3850025
2425체육배드민턴 - 청소년(자유)06:00 ~ 21:50월~금청소년3113025
2526체육배드민턴 - 개인레슨09:00 ~ 22:00월~금누구나88000각 6
2627체육어린이배드민턴교실17:00 ~ 17:50월·수·금4~6학년4000020
2728체육인라인 스케이트(A)15:00 ~ 15:50화·목5세~9세3500020
2829체육인라인 스케이트(B)15:50 ~ 16:40화·목초등3500020
2930체육인라인 스케이트 소그룹반(A)11:30 ~ 12:206세~초등700006
3031체육인라인 스케이트 소그룹반(B)12:30 ~ 13:206세~초등700006
3132체육조깅06:00~21:30월~금누구나무료<NA>