Overview

Dataset statistics

Number of variables8
Number of observations363
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.9 KiB
Average record size in memory67.4 B

Variable types

Numeric3
Categorical3
Text2

Dataset

Description구민체육센터에서 운영하는 체육, 문화프로그램별 수강시기, 수강료, 대상 등 정보 제공
Author동대문구시설관리공단
URLhttps://www.data.go.kr/data/15044059/fileData.do

Alerts

연번 is highly overall correlated with 분야High correlation
분야 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
대상 is highly overall correlated with 분야High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:12:41.945899
Analysis finished2023-12-12 14:12:43.731735
Duration1.79 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct363
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean182
Minimum1
Maximum363
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T23:12:43.818320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile19.1
Q191.5
median182
Q3272.5
95-th percentile344.9
Maximum363
Range362
Interquartile range (IQR)181

Descriptive statistics

Standard deviation104.93331
Coefficient of variation (CV)0.57655666
Kurtosis-1.2
Mean182
Median Absolute Deviation (MAD)91
Skewness0
Sum66066
Variance11011
MonotonicityStrictly increasing
2023-12-12T23:12:43.980448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
251 1
 
0.3%
249 1
 
0.3%
248 1
 
0.3%
247 1
 
0.3%
246 1
 
0.3%
245 1
 
0.3%
244 1
 
0.3%
243 1
 
0.3%
242 1
 
0.3%
Other values (353) 353
97.2%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
363 1
0.3%
362 1
0.3%
361 1
0.3%
360 1
0.3%
359 1
0.3%
358 1
0.3%
357 1
0.3%
356 1
0.3%
355 1
0.3%
354 1
0.3%

분야
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
수영
184 
문화
138 
공동사업
19 
유아체능단
 
18
체육
 
4

Length

Max length5
Median length2
Mean length2.2534435
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공동사업
2nd row공동사업
3rd row공동사업
4th row공동사업
5th row공동사업

Common Values

ValueCountFrequency (%)
수영 184
50.7%
문화 138
38.0%
공동사업 19
 
5.2%
유아체능단 18
 
5.0%
체육 4
 
1.1%

Length

2023-12-12T23:12:44.177070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:12:44.330590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수영 184
50.7%
문화 138
38.0%
공동사업 19
 
5.2%
유아체능단 18
 
5.0%
체육 4
 
1.1%
Distinct362
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T23:12:44.684173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length20
Mean length12.752066
Min length3

Characters and Unicode

Total characters4629
Distinct characters274
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique361 ?
Unique (%)99.4%

Sample

1st row강덕 쿵후 태극권
2nd row스피닝A(화목, 11시)
3rd row스피닝B(화목,20시)
4th row스피닝D(월수금,21시)
5th row스피드인라인(단체-토)14시
ValueCountFrequency (%)
10시 22
 
3.1%
11시 20
 
2.8%
소수정예 20
 
2.8%
09시 18
 
2.5%
20시 15
 
2.1%
15시 15
 
2.1%
16시 14
 
2.0%
17시 13
 
1.8%
체능]18년 13
 
1.8%
유소년 13
 
1.8%
Other values (315) 553
77.2%
2023-12-12T23:12:45.193947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
353
 
7.6%
190
 
4.1%
1 187
 
4.0%
) 168
 
3.6%
( 168
 
3.6%
0 124
 
2.7%
118
 
2.5%
- 101
 
2.2%
98
 
2.1%
82
 
1.8%
Other values (264) 3040
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2913
62.9%
Decimal Number 625
 
13.5%
Space Separator 353
 
7.6%
Close Punctuation 191
 
4.1%
Open Punctuation 173
 
3.7%
Uppercase Letter 156
 
3.4%
Dash Punctuation 101
 
2.2%
Math Symbol 80
 
1.7%
Other Punctuation 37
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
190
 
6.5%
118
 
4.1%
98
 
3.4%
82
 
2.8%
78
 
2.7%
75
 
2.6%
74
 
2.5%
73
 
2.5%
66
 
2.3%
61
 
2.1%
Other values (229) 1998
68.6%
Uppercase Letter
ValueCountFrequency (%)
B 47
30.1%
A 46
29.5%
D 18
 
11.5%
C 12
 
7.7%
P 11
 
7.1%
E 8
 
5.1%
O 4
 
2.6%
N 3
 
1.9%
S 3
 
1.9%
K 2
 
1.3%
Other values (2) 2
 
1.3%
Decimal Number
ValueCountFrequency (%)
1 187
29.9%
0 124
19.8%
2 66
 
10.6%
7 43
 
6.9%
5 42
 
6.7%
3 40
 
6.4%
8 38
 
6.1%
9 33
 
5.3%
6 33
 
5.3%
4 19
 
3.0%
Math Symbol
ValueCountFrequency (%)
~ 30
37.5%
> 18
22.5%
< 18
22.5%
+ 14
17.5%
Other Punctuation
ValueCountFrequency (%)
, 20
54.1%
& 13
35.1%
: 4
 
10.8%
Close Punctuation
ValueCountFrequency (%)
) 168
88.0%
] 23
 
12.0%
Open Punctuation
ValueCountFrequency (%)
( 168
97.1%
[ 5
 
2.9%
Space Separator
ValueCountFrequency (%)
353
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 101
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2913
62.9%
Common 1560
33.7%
Latin 156
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
190
 
6.5%
118
 
4.1%
98
 
3.4%
82
 
2.8%
78
 
2.7%
75
 
2.6%
74
 
2.5%
73
 
2.5%
66
 
2.3%
61
 
2.1%
Other values (229) 1998
68.6%
Common
ValueCountFrequency (%)
353
22.6%
1 187
12.0%
) 168
10.8%
( 168
10.8%
0 124
 
7.9%
- 101
 
6.5%
2 66
 
4.2%
7 43
 
2.8%
5 42
 
2.7%
3 40
 
2.6%
Other values (13) 268
17.2%
Latin
ValueCountFrequency (%)
B 47
30.1%
A 46
29.5%
D 18
 
11.5%
C 12
 
7.7%
P 11
 
7.1%
E 8
 
5.1%
O 4
 
2.6%
N 3
 
1.9%
S 3
 
1.9%
K 2
 
1.3%
Other values (2) 2
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2913
62.9%
ASCII 1716
37.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
353
20.6%
1 187
10.9%
) 168
9.8%
( 168
9.8%
0 124
 
7.2%
- 101
 
5.9%
2 66
 
3.8%
B 47
 
2.7%
A 46
 
2.7%
7 43
 
2.5%
Other values (25) 413
24.1%
Hangul
ValueCountFrequency (%)
190
 
6.5%
118
 
4.1%
98
 
3.4%
82
 
2.8%
78
 
2.7%
75
 
2.6%
74
 
2.5%
73
 
2.5%
66
 
2.3%
61
 
2.1%
Other values (229) 1998
68.6%

시간
Text

Distinct55
Distinct (%)15.2%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
2023-12-12T23:12:45.433075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters3993
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)7.2%

Sample

1st row11:00~11:50
2nd row11:00~11:50
3rd row20:00~20:50
4th row21:00~21:50
5th row14:00~14:50
ValueCountFrequency (%)
15:00~15:50 35
 
9.6%
16:00~16:50 31
 
8.5%
10:00~10:50 28
 
7.7%
11:00~11:50 26
 
7.2%
17:00~17:50 23
 
6.3%
09:00~09:50 22
 
6.1%
14:00~14:50 21
 
5.8%
20:00~20:50 19
 
5.2%
19:00~19:50 14
 
3.9%
09:00~13:50 12
 
3.3%
Other values (45) 132
36.4%
2023-12-12T23:12:45.847179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1318
33.0%
: 726
18.2%
1 618
15.5%
5 404
 
10.1%
~ 363
 
9.1%
2 120
 
3.0%
6 99
 
2.5%
9 95
 
2.4%
3 74
 
1.9%
7 70
 
1.8%
Other values (2) 106
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2904
72.7%
Other Punctuation 726
 
18.2%
Math Symbol 363
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1318
45.4%
1 618
21.3%
5 404
 
13.9%
2 120
 
4.1%
6 99
 
3.4%
9 95
 
3.3%
3 74
 
2.5%
7 70
 
2.4%
4 56
 
1.9%
8 50
 
1.7%
Other Punctuation
ValueCountFrequency (%)
: 726
100.0%
Math Symbol
ValueCountFrequency (%)
~ 363
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3993
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1318
33.0%
: 726
18.2%
1 618
15.5%
5 404
 
10.1%
~ 363
 
9.1%
2 120
 
3.0%
6 99
 
2.5%
9 95
 
2.4%
3 74
 
1.9%
7 70
 
1.8%
Other values (2) 106
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3993
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1318
33.0%
: 726
18.2%
1 618
15.5%
5 404
 
10.1%
~ 363
 
9.1%
2 120
 
3.0%
6 99
 
2.5%
9 95
 
2.4%
3 74
 
1.9%
7 70
 
1.8%
Other values (2) 106
 
2.7%

요일
Categorical

Distinct16
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
월수금
72 
화목
58 
월화수목금
54 
35 
토일
30 
Other values (11)
114 

Length

Max length6
Median length5
Mean length2.4407713
Min length1

Unique

Unique2 ?
Unique (%)0.6%

Sample

1st row화목
2nd row화목
3rd row화목
4th row월수금
5th row

Common Values

ValueCountFrequency (%)
월수금 72
19.8%
화목 58
16.0%
월화수목금 54
14.9%
35
9.6%
토일 30
8.3%
화목토 26
 
7.2%
20
 
5.5%
16
 
4.4%
15
 
4.1%
월수 11
 
3.0%
Other values (6) 26
 
7.2%

Length

2023-12-12T23:12:45.999707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
월수금 72
19.8%
화목 58
16.0%
월화수목금 54
14.9%
35
9.6%
토일 30
8.3%
화목토 26
 
7.2%
20
 
5.5%
16
 
4.4%
15
 
4.1%
월수 11
 
3.0%
Other values (6) 26
 
7.2%

대상
Categorical

HIGH CORRELATION 

Distinct34
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Memory size3.0 KiB
성인
97 
어린이
44 
성인여자
41 
초등생
40 
청소년
36 
Other values (29)
105 

Length

Max length6
Median length5
Mean length3.1101928
Min length2

Unique

Unique13 ?
Unique (%)3.6%

Sample

1st row성인
2nd row성인
3rd row성인
4th row성인
5th row초등생

Common Values

ValueCountFrequency (%)
성인 97
26.7%
어린이 44
12.1%
성인여자 41
11.3%
초등생 40
11.0%
청소년 36
 
9.9%
6~7세 14
 
3.9%
5~7세 11
 
3.0%
7세 10
 
2.8%
6세~초등 9
 
2.5%
7세~초등 8
 
2.2%
Other values (24) 53
14.6%

Length

2023-12-12T23:12:46.131919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성인 97
26.7%
어린이 44
12.1%
성인여자 41
11.3%
초등생 40
11.0%
청소년 36
 
9.9%
6~7세 14
 
3.9%
5~7세 11
 
3.0%
7세 10
 
2.8%
6세~초등 9
 
2.5%
7세~초등 8
 
2.2%
Other values (24) 53
14.6%

수강요금
Real number (ℝ)

Distinct48
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean75037.273
Minimum20000
Maximum990000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T23:12:46.270555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20000
5-th percentile23100
Q130000
median45000
Q364500
95-th percentile249000
Maximum990000
Range970000
Interquartile range (IQR)34500

Descriptive statistics

Standard deviation131024.47
Coefficient of variation (CV)1.7461252
Kurtosis28.874983
Mean75037.273
Median Absolute Deviation (MAD)15000
Skewness5.1511444
Sum27238530
Variance1.7167412 × 1010
MonotonicityNot monotonic
2023-12-12T23:12:46.765623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=48)
ValueCountFrequency (%)
30000 39
 
10.7%
46000 37
 
10.2%
80000 30
 
8.3%
50000 28
 
7.7%
69000 23
 
6.3%
40000 18
 
5.0%
35000 17
 
4.7%
26000 15
 
4.1%
24000 14
 
3.9%
25000 12
 
3.3%
Other values (38) 130
35.8%
ValueCountFrequency (%)
20000 10
 
2.8%
22000 8
 
2.2%
23000 1
 
0.3%
24000 14
 
3.9%
25000 12
 
3.3%
26000 15
 
4.1%
27720 1
 
0.3%
28000 9
 
2.5%
29000 3
 
0.8%
30000 39
10.7%
ValueCountFrequency (%)
990000 2
 
0.6%
900000 2
 
0.6%
870000 2
 
0.6%
660000 1
 
0.3%
420000 6
1.7%
342000 1
 
0.3%
320000 1
 
0.3%
300000 1
 
0.3%
291000 1
 
0.3%
270000 1
 
0.3%

정원
Real number (ℝ)

Distinct26
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.101928
Minimum1
Maximum120
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.3 KiB
2023-12-12T23:12:46.918907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5
Q17.5
median15
Q320
95-th percentile35
Maximum120
Range119
Interquartile range (IQR)12.5

Descriptive statistics

Standard deviation11.937594
Coefficient of variation (CV)0.74137668
Kurtosis21.458152
Mean16.101928
Median Absolute Deviation (MAD)7
Skewness3.211926
Sum5845
Variance142.50616
MonotonicityNot monotonic
2023-12-12T23:12:47.049347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
20 71
19.6%
10 49
13.5%
6 37
10.2%
5 33
9.1%
30 21
 
5.8%
8 20
 
5.5%
15 18
 
5.0%
16 16
 
4.4%
25 16
 
4.4%
35 15
 
4.1%
Other values (16) 67
18.5%
ValueCountFrequency (%)
1 3
 
0.8%
4 11
 
3.0%
5 33
9.1%
6 37
10.2%
7 7
 
1.9%
8 20
5.5%
10 49
13.5%
12 11
 
3.0%
13 2
 
0.6%
14 1
 
0.3%
ValueCountFrequency (%)
120 1
 
0.3%
100 1
 
0.3%
50 3
 
0.8%
45 2
 
0.6%
40 3
 
0.8%
39 1
 
0.3%
35 15
4.1%
30 21
5.8%
28 2
 
0.6%
25 16
4.4%

Interactions

2023-12-12T23:12:43.152352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:12:42.499165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:12:42.817144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:12:43.261836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:12:42.596301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:12:42.932015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:12:43.365326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:12:42.704881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T23:12:43.049699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:12:47.159791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분야시간요일대상수강요금정원
연번1.0000.9430.9020.6770.8040.5460.142
분야0.9431.0000.9290.7500.8540.5940.218
시간0.9020.9291.0000.8620.9080.9190.315
요일0.6770.7500.8621.0000.7960.4350.372
대상0.8040.8540.9080.7961.0000.7330.343
수강요금0.5460.5940.9190.4350.7331.0000.091
정원0.1420.2180.3150.3720.3430.0911.000
2023-12-12T23:12:47.294237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분야대상요일
분야1.0000.5780.496
대상0.5781.0000.348
요일0.4960.3481.000
2023-12-12T23:12:47.389879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번수강요금정원분야요일대상
연번1.0000.1560.0190.6730.3400.422
수강요금0.1561.000-0.0930.4170.1680.374
정원0.019-0.0931.0000.1400.1770.141
분야0.6730.4170.1401.0000.4960.578
요일0.3400.1680.1770.4961.0000.348
대상0.4220.3740.1410.5780.3481.000

Missing values

2023-12-12T23:12:43.524410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:12:43.672135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번분야프로그램시간요일대상수강요금정원
01공동사업강덕 쿵후 태극권11:00~11:50화목성인4000020
12공동사업스피닝A(화목, 11시)11:00~11:50화목성인4800025
23공동사업스피닝B(화목,20시)20:00~20:50화목성인4800025
34공동사업스피닝D(월수금,21시)21:00~21:50월수금성인6400025
45공동사업스피드인라인(단체-토)14시14:00~14:50초등생3000028
56공동사업어린이배드민턴A-초등생16:00~16:50월수금초등생3000030
67공동사업어린이배드민턴B-초등생17:00~17:50월수금초등생3000030
78공동사업어린이배드민턴C-초등생12:00~12:50초등생3000020
89공동사업어린이배드민턴D-초,중등11:00~11:50초,중등생3000020
910공동사업인라인 소그룹(평일)15시15:00~15:506세~초등500008
연번분야프로그램시간요일대상수강요금정원
353354유아체능단체능]18년 해마반(5세)09:00~13:50월화수목금5세99000016
354355유아체능단체능]방과후-FC동대문슛돌이 (7세)14:00~14:50화목7세5000018
355356유아체능단체능]방과후-리틀와이 코딩(7세)14:00~14:507세3500020
356357유아체능단체능]방과후-생각대통령(7세)14:00~14:507세3000020
357358유아체능단체능]방과후-영어특별반A(7세)15:00~15:50월수7세5000012
358359유아체능단체능]방과후-영어특별반B(7세)15:00~15:50화목7세5000012
359360체육유도(성인)19:30~20:30월화수목금성인4268015
360361체육유도(청소년)19:30~20:30월화수목금청소년3113010
361362체육유도(초등)19:30~20:30월화수목금초2이상2772010
362363체육조기배드민턴06:00~07:20월화수목금토성인4200030