Overview

Dataset statistics

Number of variables6
Number of observations256
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.4 KiB
Average record size in memory49.5 B

Variable types

Numeric1
Categorical3
Text1
Unsupported1

Dataset

Description성북구 소재 문화 체육 스포츠 센터 강좌 및 수강신청 정보
Author성북구도시관리공단
URLhttps://www.data.go.kr/data/3077169/fileData.do

Alerts

번호 is highly overall correlated with 프로그램그룹 and 1 other fieldsHigh correlation
프로그램그룹 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
대상 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
번호 has unique valuesUnique
수강료(원) is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 21:52:30.226477
Analysis finished2023-12-12 21:52:30.757616
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct256
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.5
Minimum1
Maximum256
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2023-12-13T06:52:30.831892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.75
Q164.75
median128.5
Q3192.25
95-th percentile243.25
Maximum256
Range255
Interquartile range (IQR)127.5

Descriptive statistics

Standard deviation74.045031
Coefficient of variation (CV)0.57622592
Kurtosis-1.2
Mean128.5
Median Absolute Deviation (MAD)64
Skewness0
Sum32896
Variance5482.6667
MonotonicityStrictly increasing
2023-12-13T06:52:30.979900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
130 1
 
0.4%
164 1
 
0.4%
165 1
 
0.4%
166 1
 
0.4%
167 1
 
0.4%
168 1
 
0.4%
169 1
 
0.4%
170 1
 
0.4%
171 1
 
0.4%
Other values (246) 246
96.1%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
256 1
0.4%
255 1
0.4%
254 1
0.4%
253 1
0.4%
252 1
0.4%
251 1
0.4%
250 1
0.4%
249 1
0.4%
248 1
0.4%
247 1
0.4%

프로그램그룹
Categorical

HIGH CORRELATION 

Distinct22
Distinct (%)8.6%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
성북레포츠타운>스포츠프로그램>수영
67 
성북레포츠타운>실용음악교실>피아노
39 
성북레포츠타운>문화프로그램>문화취미
37 
성북레포츠타운>스포츠프로그램>스쿼시
26 
성북레포츠타운>스포츠프로그램>체육관
17 
Other values (17)
70 

Length

Max length22
Median length21
Mean length18.5
Min length17

Unique

Unique3 ?
Unique (%)1.2%

Sample

1st row성북레포츠타운>스포츠프로그램>수영
2nd row성북레포츠타운>스포츠프로그램>수영
3rd row성북레포츠타운>스포츠프로그램>수영
4th row성북레포츠타운>스포츠프로그램>수영
5th row성북레포츠타운>스포츠프로그램>수영

Common Values

ValueCountFrequency (%)
성북레포츠타운>스포츠프로그램>수영 67
26.2%
성북레포츠타운>실용음악교실>피아노 39
15.2%
성북레포츠타운>문화프로그램>문화취미 37
14.5%
성북레포츠타운>스포츠프로그램>스쿼시 26
 
10.2%
성북레포츠타운>스포츠프로그램>체육관 17
 
6.6%
성북레포츠타운>실용음악교실>드럼 15
 
5.9%
성북레포츠타운>스포츠프로그램>헬스 9
 
3.5%
성북레포츠타운>스포츠프로그램>스피닝 7
 
2.7%
성북레포츠타운>스포츠문화프로그램>댄스 6
 
2.3%
성북레포츠타운>스포츠문화프로그램>발레 5
 
2.0%
Other values (12) 28
10.9%

Length

2023-12-13T06:52:31.121414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성북레포츠타운>스포츠프로그램>수영 67
26.2%
성북레포츠타운>실용음악교실>피아노 39
15.2%
성북레포츠타운>문화프로그램>문화취미 37
14.5%
성북레포츠타운>스포츠프로그램>스쿼시 26
 
10.2%
성북레포츠타운>스포츠프로그램>체육관 17
 
6.6%
성북레포츠타운>실용음악교실>드럼 15
 
5.9%
성북레포츠타운>스포츠프로그램>헬스 9
 
3.5%
성북레포츠타운>스포츠프로그램>스피닝 7
 
2.7%
성북레포츠타운>스포츠문화프로그램>댄스 6
 
2.3%
성북레포츠타운>스포츠문화프로그램>발레 5
 
2.0%
Other values (12) 28
10.9%
Distinct206
Distinct (%)80.5%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-13T06:52:31.286933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length25
Mean length14.792969
Min length6

Characters and Unicode

Total characters3787
Distinct characters217
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique190 ?
Unique (%)74.2%

Sample

1st row직장인A06시
2nd row직장인B06시
3rd row직장인A07시
4th row직장인B07시
5th row직장인A08시
ValueCountFrequency (%)
피아노교실(화목 13
 
4.7%
피아노교실(월수금 13
 
4.7%
피아노교실(월~금 13
 
4.7%
초등수영 4
 
1.4%
월자유/토,일(2개월)1~3부 3
 
1.1%
스쿼시]b18시/토(자유 2
 
0.7%
체형관리반 2
 
0.7%
통기타]금(초등3학년~성인)18시30분 2
 
0.7%
스쿼시]a16시 2
 
0.7%
스쿼시]b16시/토(자유 2
 
0.7%
Other values (214) 223
79.9%
2023-12-13T06:52:31.609071image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 235
 
6.2%
) 235
 
6.2%
175
 
4.6%
1 166
 
4.4%
[ 151
 
4.0%
] 150
 
4.0%
96
 
2.5%
92
 
2.4%
, 79
 
2.1%
74
 
2.0%
Other values (207) 2334
61.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2208
58.3%
Decimal Number 471
 
12.4%
Open Punctuation 386
 
10.2%
Close Punctuation 385
 
10.2%
Uppercase Letter 126
 
3.3%
Other Punctuation 117
 
3.1%
Math Symbol 56
 
1.5%
Space Separator 24
 
0.6%
Dash Punctuation 14
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
175
 
7.9%
96
 
4.3%
92
 
4.2%
74
 
3.4%
64
 
2.9%
64
 
2.9%
56
 
2.5%
54
 
2.4%
52
 
2.4%
50
 
2.3%
Other values (170) 1431
64.8%
Uppercase Letter
ValueCountFrequency (%)
A 48
38.1%
B 47
37.3%
P 6
 
4.8%
S 5
 
4.0%
N 4
 
3.2%
E 3
 
2.4%
T 2
 
1.6%
I 2
 
1.6%
X 2
 
1.6%
G 2
 
1.6%
Other values (5) 5
 
4.0%
Decimal Number
ValueCountFrequency (%)
1 166
35.2%
6 70
14.9%
0 47
 
10.0%
7 45
 
9.6%
2 38
 
8.1%
5 34
 
7.2%
3 23
 
4.9%
9 23
 
4.9%
8 15
 
3.2%
4 10
 
2.1%
Other Punctuation
ValueCountFrequency (%)
, 79
67.5%
/ 26
 
22.2%
. 9
 
7.7%
& 3
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 235
60.9%
[ 151
39.1%
Close Punctuation
ValueCountFrequency (%)
) 235
61.0%
] 150
39.0%
Math Symbol
ValueCountFrequency (%)
~ 55
98.2%
+ 1
 
1.8%
Space Separator
ValueCountFrequency (%)
24
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2208
58.3%
Common 1453
38.4%
Latin 126
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
175
 
7.9%
96
 
4.3%
92
 
4.2%
74
 
3.4%
64
 
2.9%
64
 
2.9%
56
 
2.5%
54
 
2.4%
52
 
2.4%
50
 
2.3%
Other values (170) 1431
64.8%
Common
ValueCountFrequency (%)
( 235
16.2%
) 235
16.2%
1 166
11.4%
[ 151
10.4%
] 150
10.3%
, 79
 
5.4%
6 70
 
4.8%
~ 55
 
3.8%
0 47
 
3.2%
7 45
 
3.1%
Other values (12) 220
15.1%
Latin
ValueCountFrequency (%)
A 48
38.1%
B 47
37.3%
P 6
 
4.8%
S 5
 
4.0%
N 4
 
3.2%
E 3
 
2.4%
T 2
 
1.6%
I 2
 
1.6%
X 2
 
1.6%
G 2
 
1.6%
Other values (5) 5
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2208
58.3%
ASCII 1579
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 235
14.9%
) 235
14.9%
1 166
10.5%
[ 151
9.6%
] 150
9.5%
, 79
 
5.0%
6 70
 
4.4%
~ 55
 
3.5%
A 48
 
3.0%
0 47
 
3.0%
Other values (27) 343
21.7%
Hangul
ValueCountFrequency (%)
175
 
7.9%
96
 
4.3%
92
 
4.2%
74
 
3.4%
64
 
2.9%
64
 
2.9%
56
 
2.5%
54
 
2.4%
52
 
2.4%
50
 
2.3%
Other values (170) 1431
64.8%

시간
Categorical

Distinct31
Distinct (%)12.1%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
14:00~14:50
34 
15:00~15:50
33 
16:00~16:50
20 
17:00~17:50
19 
10:00~10:50
16 
Other values (26)
134 

Length

Max length11
Median length11
Mean length10.9375
Min length4

Unique

Unique8 ?
Unique (%)3.1%

Sample

1st row06:00~06:50
2nd row06:00~06:50
3rd row07:00~07:50
4th row07:00~07:50
5th row08:00~08:50

Common Values

ValueCountFrequency (%)
14:00~14:50 34
13.3%
15:00~15:50 33
12.9%
16:00~16:50 20
 
7.8%
17:00~17:50 19
 
7.4%
10:00~10:50 16
 
6.2%
11:00~11:50 15
 
5.9%
19:00~19:50 15
 
5.9%
18:00~18:50 14
 
5.5%
20:00~20:50 14
 
5.5%
21:00~21:50 13
 
5.1%
Other values (21) 63
24.6%

Length

2023-12-13T06:52:31.745075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
14:00~14:50 34
13.3%
15:00~15:50 33
12.9%
16:00~16:50 20
 
7.8%
17:00~17:50 19
 
7.4%
10:00~10:50 16
 
6.2%
11:00~11:50 15
 
5.9%
19:00~19:50 15
 
5.9%
18:00~18:50 14
 
5.5%
20:00~20:50 14
 
5.5%
21:00~21:50 13
 
5.1%
Other values (21) 63
24.6%

대상
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
성인
96 
유아
45 
초등
41 
유아~성인
39 
누구나
 
9
Other values (6)
26 

Length

Max length7
Median length2
Mean length2.7617188
Min length2

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row성인
2nd row성인
3rd row성인
4th row성인
5th row성인

Common Values

ValueCountFrequency (%)
성인 96
37.5%
유아 45
17.6%
초등 41
16.0%
유아~성인 39
15.2%
누구나 9
 
3.5%
청소년 8
 
3.1%
청소년~성인 6
 
2.3%
유아,어린이 5
 
2.0%
어린이 4
 
1.6%
어린이~성인 2
 
0.8%

Length

2023-12-13T06:52:31.858277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성인 96
37.5%
유아 45
17.6%
초등 41
16.0%
유아~성인 39
15.2%
누구나 9
 
3.5%
청소년 8
 
3.1%
청소년~성인 6
 
2.3%
유아,어린이 5
 
2.0%
어린이 4
 
1.6%
어린이~성인 2
 
0.8%

수강료(원)
Unsupported

REJECTED  UNSUPPORTED 

Missing0
Missing (%)0.0%
Memory size2.1 KiB

Interactions

2023-12-13T06:52:30.494673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:52:31.936405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호프로그램그룹시간대상
번호1.0000.9370.6340.801
프로그램그룹0.9371.0000.8630.922
시간0.6340.8631.0000.823
대상0.8010.9220.8231.000
2023-12-13T06:52:32.016022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대상프로그램그룹시간
대상1.0000.5530.432
프로그램그룹0.5531.0000.396
시간0.4320.3961.000
2023-12-13T06:52:32.093771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호프로그램그룹시간대상
번호1.0000.7050.2660.506
프로그램그룹0.7051.0000.3960.553
시간0.2660.3961.0000.432
대상0.5060.5530.4321.000

Missing values

2023-12-13T06:52:30.614048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:52:30.721728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호프로그램그룹프로그램명시간대상수강료(원)
01성북레포츠타운>스포츠프로그램>수영직장인A06시06:00~06:50성인49000
12성북레포츠타운>스포츠프로그램>수영직장인B06시06:00~06:50성인49000
23성북레포츠타운>스포츠프로그램>수영직장인A07시07:00~07:50성인49000
34성북레포츠타운>스포츠프로그램>수영직장인B07시07:00~07:50성인49000
45성북레포츠타운>스포츠프로그램>수영직장인A08시08:00~08:50성인49000
56성북레포츠타운>스포츠프로그램>수영여성A09시09:00~09:50성인49000
67성북레포츠타운>스포츠프로그램>수영여성B09시09:00~09:50성인49000
78성북레포츠타운>스포츠프로그램>수영여성A10시10:00~10:50성인49000
89성북레포츠타운>스포츠프로그램>수영여성B10시10:00~10:50성인49000
910성북레포츠타운>스포츠프로그램>수영여성A11시11:00~11:50성인49000
번호프로그램그룹프로그램명시간대상수강료(원)
246247성북레포츠타운>실용음악교실>피아노피아노교실(화목)12:00~12:50유아~성인40,000~45,000
247248성북레포츠타운>실용음악교실>피아노피아노교실(화목)13:00~13:50유아~성인40,000~45,000
248249성북레포츠타운>실용음악교실>피아노피아노교실(화목)14:00~14:50유아~성인40,000~45,000
249250성북레포츠타운>실용음악교실>피아노피아노교실(화목)15:00~15:50유아~성인40,000~45,000
250251성북레포츠타운>실용음악교실>피아노피아노교실(화목)16:00~16:50유아~성인40,000~45,000
251252성북레포츠타운>실용음악교실>피아노피아노교실(화목)17:00~17:50유아~성인40,000~45,000
252253성북레포츠타운>실용음악교실>피아노피아노교실(화목)18:00~18:50유아~성인40,000~45,000
253254성북레포츠타운>실용음악교실>피아노피아노교실(화목)19:00~19:50유아~성인40,000~45,000
254255성북레포츠타운>실용음악교실>피아노피아노교실(화목)20:00~20:50유아~성인40,000~45,000
255256성북레포츠타운>실용음악교실>피아노피아노교실(화목)21:00~21:50유아~성인40,000~45,000