Overview

Dataset statistics

Number of variables8
Number of observations30
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 KiB
Average record size in memory70.4 B

Variable types

Numeric1
Categorical5
Text2

Dataset

Description정보화도서관 교육 및 문화 프로그램 종류, 수강시기, 대상 등 수강정보 제공
Author동대문구시설관리공단
URLhttps://www.data.go.kr/data/15044065/fileData.do

Alerts

분야 has constant value ""Constant
연번 is highly overall correlated with 수강료(1개월)High correlation
대상 is highly overall correlated with 수강료(1개월)High correlation
수강료(1개월) is highly overall correlated with 연번 and 1 other fieldsHigh correlation
연번 has unique valuesUnique
프로그램명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:12:09.644507
Analysis finished2023-12-12 01:12:10.368793
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.5
Minimum1
Maximum30
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size402.0 B
2023-12-12T10:12:10.471142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.45
Q18.25
median15.5
Q322.75
95-th percentile28.55
Maximum30
Range29
Interquartile range (IQR)14.5

Descriptive statistics

Standard deviation8.8034084
Coefficient of variation (CV)0.56796183
Kurtosis-1.2
Mean15.5
Median Absolute Deviation (MAD)7.5
Skewness0
Sum465
Variance77.5
MonotonicityStrictly increasing
2023-12-12T10:12:10.631579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=30)
ValueCountFrequency (%)
1 1
 
3.3%
17 1
 
3.3%
30 1
 
3.3%
29 1
 
3.3%
28 1
 
3.3%
27 1
 
3.3%
26 1
 
3.3%
25 1
 
3.3%
24 1
 
3.3%
23 1
 
3.3%
Other values (20) 20
66.7%
ValueCountFrequency (%)
1 1
3.3%
2 1
3.3%
3 1
3.3%
4 1
3.3%
5 1
3.3%
6 1
3.3%
7 1
3.3%
8 1
3.3%
9 1
3.3%
10 1
3.3%
ValueCountFrequency (%)
30 1
3.3%
29 1
3.3%
28 1
3.3%
27 1
3.3%
26 1
3.3%
25 1
3.3%
24 1
3.3%
23 1
3.3%
22 1
3.3%
21 1
3.3%

분야
Categorical

CONSTANT 

Distinct1
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
문화
30 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row문화
2nd row문화
3rd row문화
4th row문화
5th row문화

Common Values

ValueCountFrequency (%)
문화 30
100.0%

Length

2023-12-12T10:12:10.769093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:12:10.876930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
문화 30
100.0%

프로그램명
Text

UNIQUE 

Distinct30
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-12T10:12:11.082618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length11
Mean length8.6666667
Min length5

Characters and Unicode

Total characters260
Distinct characters102
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)100.0%

Sample

1st row나도과학자
2nd row사서와 함께하는 책놀이
3rd row이수연 작가의 즐거운 캐리커쳐
4th row부모교육A
5th row보타니컬 꽃그림그리기
ValueCountFrequency (%)
nie 3
 
6.4%
나오미 2
 
4.3%
전통악기 2
 
4.3%
즐거운 2
 
4.3%
시사중등논술 1
 
2.1%
노부영스토리텔링b 1
 
2.1%
노부영스토리텔링c 1
 
2.1%
미술a 1
 
2.1%
미술b 1
 
2.1%
데생수채화 1
 
2.1%
Other values (32) 32
68.1%
2023-12-12T10:12:11.457152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
6.5%
B 8
 
3.1%
( 7
 
2.7%
A 7
 
2.7%
7
 
2.7%
) 7
 
2.7%
7
 
2.7%
6
 
2.3%
6
 
2.3%
5
 
1.9%
Other values (92) 183
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 196
75.4%
Uppercase Letter 28
 
10.8%
Space Separator 17
 
6.5%
Open Punctuation 7
 
2.7%
Close Punctuation 7
 
2.7%
Other Punctuation 3
 
1.2%
Math Symbol 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
3.6%
7
 
3.6%
6
 
3.1%
6
 
3.1%
5
 
2.6%
5
 
2.6%
5
 
2.6%
5
 
2.6%
4
 
2.0%
4
 
2.0%
Other values (79) 142
72.4%
Uppercase Letter
ValueCountFrequency (%)
B 8
28.6%
A 7
25.0%
C 3
 
10.7%
N 3
 
10.7%
I 3
 
10.7%
E 3
 
10.7%
D 1
 
3.6%
Other Punctuation
ValueCountFrequency (%)
& 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 196
75.4%
Common 36
 
13.8%
Latin 28
 
10.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
3.6%
7
 
3.6%
6
 
3.1%
6
 
3.1%
5
 
2.6%
5
 
2.6%
5
 
2.6%
5
 
2.6%
4
 
2.0%
4
 
2.0%
Other values (79) 142
72.4%
Latin
ValueCountFrequency (%)
B 8
28.6%
A 7
25.0%
C 3
 
10.7%
N 3
 
10.7%
I 3
 
10.7%
E 3
 
10.7%
D 1
 
3.6%
Common
ValueCountFrequency (%)
17
47.2%
( 7
19.4%
) 7
19.4%
+ 2
 
5.6%
& 2
 
5.6%
, 1
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 196
75.4%
ASCII 64
 
24.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17
26.6%
B 8
12.5%
( 7
10.9%
A 7
10.9%
) 7
10.9%
C 3
 
4.7%
N 3
 
4.7%
I 3
 
4.7%
E 3
 
4.7%
+ 2
 
3.1%
Other values (3) 4
 
6.2%
Hangul
ValueCountFrequency (%)
7
 
3.6%
7
 
3.6%
6
 
3.1%
6
 
3.1%
5
 
2.6%
5
 
2.6%
5
 
2.6%
5
 
2.6%
4
 
2.0%
4
 
2.0%
Other values (79) 142
72.4%

시간
Text

Distinct18
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
2023-12-12T10:12:11.922397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters330
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)30.0%

Sample

1st row10:00~12:00
2nd row14:00~15:00
3rd row19:00~20:20
4th row10:00~12:00
5th row10:30~11:50
ValueCountFrequency (%)
10:00~12:00 4
13.3%
16:00~16:50 3
 
10.0%
09:30~11:30 2
 
6.7%
11:00~11:50 2
 
6.7%
10:00~10:50 2
 
6.7%
17:00~17:50 2
 
6.7%
15:00~15:50 2
 
6.7%
10:30~11:50 2
 
6.7%
12:00~14:00 2
 
6.7%
18:30~19:20 1
 
3.3%
Other values (8) 8
26.7%
2023-12-12T10:12:12.254397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 110
33.3%
1 61
18.5%
: 60
18.2%
~ 30
 
9.1%
5 22
 
6.7%
2 15
 
4.5%
9 9
 
2.7%
3 9
 
2.7%
6 6
 
1.8%
7 4
 
1.2%
Other values (2) 4
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 240
72.7%
Other Punctuation 60
 
18.2%
Math Symbol 30
 
9.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 110
45.8%
1 61
25.4%
5 22
 
9.2%
2 15
 
6.2%
9 9
 
3.8%
3 9
 
3.8%
6 6
 
2.5%
7 4
 
1.7%
4 3
 
1.2%
8 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
: 60
100.0%
Math Symbol
ValueCountFrequency (%)
~ 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 330
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 110
33.3%
1 61
18.5%
: 60
18.2%
~ 30
 
9.1%
5 22
 
6.7%
2 15
 
4.5%
9 9
 
2.7%
3 9
 
2.7%
6 6
 
1.8%
7 4
 
1.2%
Other values (2) 4
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 330
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 110
33.3%
1 61
18.5%
: 60
18.2%
~ 30
 
9.1%
5 22
 
6.7%
2 15
 
4.5%
9 9
 
2.7%
3 9
 
2.7%
6 6
 
1.8%
7 4
 
1.2%
Other values (2) 4
 
1.2%

요일
Categorical

Distinct9
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
Other values (4)

Length

Max length8
Median length1
Mean length1.7333333
Min length1

Unique

Unique2 ?
Unique (%)6.7%

Sample

1st row매월 둘째주 토
2nd row매월 넷째주 토
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
7
23.3%
6
20.0%
4
13.3%
4
13.3%
3
10.0%
화,목 2
 
6.7%
수,금 2
 
6.7%
매월 둘째주 토 1
 
3.3%
매월 넷째주 토 1
 
3.3%

Length

2023-12-12T10:12:12.416917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:12:12.547244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
8
23.5%
7
20.6%
4
11.8%
4
11.8%
3
 
8.8%
화,목 2
 
5.9%
수,금 2
 
5.9%
매월 2
 
5.9%
둘째주 1
 
2.9%
넷째주 1
 
2.9%

대상
Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
성인
15 
초1~2
초4~6
초등
초1~3
 
1
Other values (7)

Length

Max length5
Median length2
Mean length2.9
Min length2

Unique

Unique8 ?
Unique (%)26.7%

Sample

1st row초4~6
2nd row초1~3
3rd row성인
4th row성인
5th row성인

Common Values

ValueCountFrequency (%)
성인 15
50.0%
초1~2 3
 
10.0%
초4~6 2
 
6.7%
초등 2
 
6.7%
초1~3 1
 
3.3%
초2~3 1
 
3.3%
6-7세 1
 
3.3%
6~7세 1
 
3.3%
초3~중등 1
 
3.3%
중1이상 1
 
3.3%
Other values (2) 2
 
6.7%

Length

2023-12-12T10:12:12.697318image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성인 15
50.0%
초1~2 3
 
10.0%
초4~6 2
 
6.7%
초등 2
 
6.7%
초1~3 1
 
3.3%
초2~3 1
 
3.3%
6-7세 1
 
3.3%
6~7세 1
 
3.3%
초3~중등 1
 
3.3%
중1이상 1
 
3.3%
Other values (2) 2
 
6.7%

수강료(1개월)
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)23.3%
Missing0
Missing (%)0.0%
Memory size372.0 B
25000
12 
23000
30000
10000
무료
Other values (2)

Length

Max length5
Median length5
Mean length4.8
Min length2

Unique

Unique1 ?
Unique (%)3.3%

Sample

1st row무료
2nd row무료
3rd row30000
4th row30000
5th row25000

Common Values

ValueCountFrequency (%)
25000 12
40.0%
23000 5
16.7%
30000 4
 
13.3%
10000 4
 
13.3%
무료 2
 
6.7%
22000 2
 
6.7%
40000 1
 
3.3%

Length

2023-12-12T10:12:12.817119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:12:12.925833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
25000 12
40.0%
23000 5
16.7%
30000 4
 
13.3%
10000 4
 
13.3%
무료 2
 
6.7%
22000 2
 
6.7%
40000 1
 
3.3%

정원
Categorical

Distinct3
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size372.0 B
20
12 
15
10 
12

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row15
2nd row20
3rd row15
4th row20
5th row15

Common Values

ValueCountFrequency (%)
20 12
40.0%
15 10
33.3%
12 8
26.7%

Length

2023-12-12T10:12:13.039523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:12:13.141206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20 12
40.0%
15 10
33.3%
12 8
26.7%

Interactions

2023-12-12T10:12:10.049073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:12:13.217544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번프로그램명시간요일대상수강료(1개월)정원
연번1.0001.0000.6570.7510.6850.8410.684
프로그램명1.0001.0001.0001.0001.0001.0001.000
시간0.6571.0001.0000.4940.5010.8650.921
요일0.7511.0000.4941.0000.5640.7530.490
대상0.6851.0000.5010.5641.0000.8500.000
수강료(1개월)0.8411.0000.8650.7530.8501.0000.624
정원0.6841.0000.9210.4900.0000.6241.000
2023-12-12T10:12:13.335580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수강료(1개월)요일정원대상
수강료(1개월)1.0000.4980.4750.555
요일0.4981.0000.1960.226
정원0.4750.1961.0000.000
대상0.5550.2260.0001.000
2023-12-12T10:12:13.428632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번요일대상수강료(1개월)정원
연번1.0000.4430.3300.5890.455
요일0.4431.0000.2260.4980.196
대상0.3300.2261.0000.5550.000
수강료(1개월)0.5890.4980.5551.0000.475
정원0.4550.1960.0000.4751.000

Missing values

2023-12-12T10:12:10.178175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:12:10.310215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번분야프로그램명시간요일대상수강료(1개월)정원
01문화나도과학자10:00~12:00매월 둘째주 토초4~6무료15
12문화사서와 함께하는 책놀이14:00~15:00매월 넷째주 토초1~3무료20
23문화이수연 작가의 즐거운 캐리커쳐19:00~20:20성인3000015
34문화부모교육A10:00~12:00성인3000020
45문화보타니컬 꽃그림그리기10:30~11:50성인2500015
56문화부모교육B10:00~12:00성인3000020
67문화인간관계와자기이해10:00~12:00성인3000020
78문화우리집공간정리 노하우10:00~11:30성인2500015
89문화즐거운 중국어 입문반10:30~11:50성인2500015
910문화전통악기 해금A(초급)19:00~19:50성인2500012
연번분야프로그램명시간요일대상수강료(1개월)정원
2021문화통합교과 시사중등논술09:00~09:50중1이상4000020
2122문화NIE A(글쓰기)10:00~10:50초1~22300020
2223문화NIE B(토의)11:00~11:50초3~42300020
2324문화NIE C(논술)12:00~12:50초5~62300020
2425문화주산+암산A10:00~10:50초등2200015
2526문화주산+암산B11:00~11:50초등2200015
2627문화정보화교육A09:30~11:30화,목성인1000020
2728문화정보화교육B12:00~14:00화,목성인1000020
2829문화정보화교육C09:30~11:30수,금성인1000020
2930문화정보화교육D12:00~14:00수,금성인1000020