Overview

Dataset statistics

Number of variables: 10
Number of observations: 379
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 31.2 KiB
Average record size in memory: 84.3 B

Variable types

Numeric: 2
Categorical: 7
Text: 1

Dataset

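Description: A list of course names, tuition fees, and related fields drawn from the resident community center (주민자치센터) program data of Yeonsu-gu, Incheon Metropolitan City. Fields are divided into course name, tuition fee, educational institution, application period, course period, and status.
Author: Yeonsu-gu, Incheon Metropolitan City (인천광역시 연수구)
URL: https://www.data.go.kr/data/15087577/fileData.do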

Alerts

기수 has constant value "" (Constant)
연번 is highly overall correlated with 교육기관 and 1 other field (High correlation)
교육기관 is highly overall correlated with 연번 and 1 other field (High correlation)
수강기간 is highly overall correlated with 연번 and 1 other field (High correlation)
대상 is highly imbalanced (72.4%) (Imbalance)
연번 has unique values (Unique)

Reproduction

Analysis started: 2024-04-06 09:05:48.848469
Analysis finished: 2024-04-06 09:05:51.112269
Duration: 2.26 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 379
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 190
Minimum: 1
Maximum: 379
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 19.9
Q1: 95.5
Median: 190
Q3: 284.5
95-th percentile: 360.1
Maximum: 379
Range: 378
Interquartile range (IQR): 189
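
The quantiles reported for 연번 are consistent with linear interpolation over the sorted values 1 to 379. The following is a minimal sketch using only Python's standard library. It assumes the "inclusive" interpolation method matches what the profiler used, and it rebuilds the column from the report's own statements (unique, strictly increasing, minimum 1, maximum 379) rather than from the source file.

```python
from statistics import quantiles

# 연번 is reported as unique and strictly increasing from 1 to 379,
# so the column can be rebuilt without the original CSV.
serial = list(range(1, 380))

# Quartiles with inclusive (linear) interpolation.
q1, median, q3 = quantiles(serial, n=4, method="inclusive")

# Cutting into 20 groups yields the 5th, 10th, ..., 95th percentiles.
ventiles = quantiles(serial, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

print(q1, median, q3)  # 95.5 190.0 284.5
print(p5, p95)         # 19.9 360.1
print(q3 - q1)         # 189.0
```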

Descriptive statistics

Standard deviation: 109.55212
Coefficient of variation (CV): 0.5765901
Kurtosis: -1.2
Mean: 190
Median Absolute Deviation (MAD): 95
Skewness: 0
Sum: 72010
Variance: 12001.667
Monotonicity: Strictly increasing
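
Because 연번 is simply the integers 1 to 379, most of the descriptive statistics can be checked by hand. The sketch below recomputes them with the standard library. It assumes the reported variance and standard deviation use the sample (n - 1) denominator, which the values suggest but the report does not state.

```python
from statistics import mean, median, stdev, variance

serial = list(range(1, 380))

total = sum(serial)        # closed form: n * (n + 1) / 2
avg = mean(serial)
var = variance(serial)     # sample variance, n - 1 denominator
std = stdev(serial)
cv = std / avg             # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
center = median(serial)
mad = median(abs(x - center) for x in serial)

print(total, avg, mad)     # 72010 190 95
print(round(var, 3))       # 12001.667
print(round(std, 5))       # 109.55212
print(round(cv, 7))        # 0.5765901
```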
Histogram with fixed size bins (bins=50)

Common Values

Value | Count | Frequency (%)
1 | 1 | 0.3%
251 | 1 | 0.3%
260 | 1 | 0.3%
259 | 1 | 0.3%
258 | 1 | 0.3%
257 | 1 | 0.3%
256 | 1 | 0.3%
255 | 1 | 0.3%
254 | 1 | 0.3%
253 | 1 | 0.3%
Other values (369) | 369 | 97.4%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
379 | 1 | 0.3%
378 | 1 | 0.3%
377 | 1 | 0.3%
376 | 1 | 0.3%
375 | 1 | 0.3%
374 | 1 | 0.3%
373 | 1 | 0.3%
372 | 1 | 0.3%
371 | 1 | 0.3%
370 | 1 | 0.3%

기수 (term)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
2024년 1분기 | 379

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024년 1분기
2nd row: 2024년 1분기
3rd row: 2024년 1분기
4th row: 2024년 1분기
5th row: 2024년 1분기

Common Values

Value | Count | Frequency (%)
2024년 1분기 | 379 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2024년 | 379 | 50.0%
1분기 | 379 | 50.0%

교육기관 (educational institution)
Categorical

HIGH CORRELATION 

Distinct: 15
Distinct (%): 4.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
송도2동 행정복지센터 | 55
송도4동 행정복지센터 | 44
송도3동 행정복지센터 | 42
송도1동 행정복지센터 | 35
동춘3동 행정복지센터 | 26
Other values (10) | 177

Length

Max length: 11
Median length: 11
Mean length: 10.897098
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 옥련1동 행정복지센터
2nd row: 옥련1동 행정복지센터
3rd row: 옥련1동 행정복지센터
4th row: 옥련1동 행정복지센터
5th row: 옥련1동 행정복지센터

Common Values

Value | Count | Frequency (%)
송도2동 행정복지센터 | 55 | 14.5%
송도4동 행정복지센터 | 44 | 11.6%
송도3동 행정복지센터 | 42 | 11.1%
송도1동 행정복지센터 | 35 | 9.2%
동춘3동 행정복지센터 | 26 | 6.9%
옥련2동 행정복지센터 | 25 | 6.6%
연수1동 행정복지센터 | 22 | 5.8%
연수3동 행정복지센터 | 21 | 5.5%
선학동 행정복지센터 | 20 | 5.3%
연수2동 행정복지센터 | 20 | 5.3%
Other values (5) | 69 | 18.2%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
행정복지센터 | 379 | 50.0%
송도2동 | 55 | 7.3%
송도4동 | 44 | 5.8%
송도3동 | 42 | 5.5%
송도1동 | 35 | 4.6%
동춘3동 | 26 | 3.4%
옥련2동 | 25 | 3.3%
연수1동 | 22 | 2.9%
연수3동 | 21 | 2.8%
선학동 | 20 | 2.6%
Other values (6) | 89 | 11.7%

분류 (category)
Categorical

Distinct: 3
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
문화여가 | 207
생활체육 | 137
시민교육 | 35

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 문화여가
2nd row: 문화여가
3rd row: 문화여가
4th row: 문화여가
5th row: 문화여가

Common Values

Value | Count | Frequency (%)
문화여가 | 207 | 54.6%
생활체육 | 137 | 36.1%
시민교육 | 35 | 9.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
문화여가 | 207 | 54.6%
생활체육 | 137 | 36.1%
시민교육 | 35 | 9.2%

강좌명 (course name)
Text

Distinct: 328
Distinct (%): 86.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Length

Max length: 35
Median length: 22
Mean length: 7.6754617
Min length: 2

Characters and Unicode

Total characters: 2909
Distinct characters: 322
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 305
Unique (%): 80.5%

Sample

1st row: 요가
2nd row: 줌바&근력스트레칭
3rd row: 라인댄스
4th row: 실버태권도
5th row: 우쿨렐레

Value | Count | Frequency (%)
노래교실 | 14 | 2.8%
라인댄스 | 9 | 1.8%
사교댄스 | 8 | 1.6%
다이어트댄스 | 6 | 1.2%
캘리그라피 | 5 | 1.0%
줌바댄스 | 5 | 1.0%
어린이 | 5 | 1.0%
english | 4 | 0.8%
신나는 | 4 | 0.8%
a | 4 | 0.8%
Other values (375) | 437 | 87.2%

Most occurring characters

ValueCountFrequency (%)
127
 
4.4%
126
 
4.3%
) 112
 
3.9%
( 111
 
3.8%
73
 
2.5%
65
 
2.2%
55
 
1.9%
53
 
1.8%
51
 
1.8%
50
 
1.7%
Other values (312) 2086
71.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2265 | 77.9%
Space Separator | 126 | 4.3%
Close Punctuation | 112 | 3.9%
Open Punctuation | 111 | 3.8%
Uppercase Letter | 110 | 3.8%
Lowercase Letter | 89 | 3.1%
Decimal Number | 59 | 2.0%
Dash Punctuation | 21 | 0.7%
Other Punctuation | 16 | 0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
127
 
5.6%
73
 
3.2%
65
 
2.9%
55
 
2.4%
53
 
2.3%
51
 
2.3%
50
 
2.2%
48
 
2.1%
46
 
2.0%
42
 
1.9%
Other values (266) 1655
73.1%

Lowercase Letter
Value | Count | Frequency (%)
s | 12 | 13.5%
i | 11 | 12.4%
n | 10 | 11.2%
l | 9 | 10.1%
a | 8 | 9.0%
o | 5 | 5.6%
g | 5 | 5.6%
h | 5 | 5.6%
e | 4 | 4.5%
t | 4 | 4.5%
Other values (8) | 16 | 18.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 25 | 22.7%
A | 24 | 21.8%
P | 16 | 14.5%
E | 11 | 10.0%
N | 8 | 7.3%
C | 6 | 5.5%
S | 6 | 5.5%
K | 5 | 4.5%
O | 5 | 4.5%
M | 3 | 2.7%

Decimal Number
Value | Count | Frequency (%)
2 | 18 | 30.5%
1 | 14 | 23.7%
7 | 9 | 15.3%
6 | 5 | 8.5%
0 | 4 | 6.8%
5 | 3 | 5.1%
3 | 3 | 5.1%
8 | 2 | 3.4%
4 | 1 | 1.7%

Other Punctuation
Value | Count | Frequency (%)
& | 11 | 68.8%
, | 3 | 18.8%
. | 1 | 6.2%
/ | 1 | 6.2%

Space Separator
Value | Count | Frequency (%)
(space) | 126 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 112 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 111 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 21 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2265 | 77.9%
Common | 445 | 15.3%
Latin | 199 | 6.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
127
 
5.6%
73
 
3.2%
65
 
2.9%
55
 
2.4%
53
 
2.3%
51
 
2.3%
50
 
2.2%
48
 
2.1%
46
 
2.0%
42
 
1.9%
Other values (266) 1655
73.1%

Latin
Value | Count | Frequency (%)
B | 25 | 12.6%
A | 24 | 12.1%
P | 16 | 8.0%
s | 12 | 6.0%
i | 11 | 5.5%
E | 11 | 5.5%
n | 10 | 5.0%
l | 9 | 4.5%
a | 8 | 4.0%
N | 8 | 4.0%
Other values (19) | 65 | 32.7%

Common
Value | Count | Frequency (%)
(space) | 126 | 28.3%
) | 112 | 25.2%
( | 111 | 24.9%
- | 21 | 4.7%
2 | 18 | 4.0%
1 | 14 | 3.1%
& | 11 | 2.5%
7 | 9 | 2.0%
6 | 5 | 1.1%
0 | 4 | 0.9%
Other values (7) | 14 | 3.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2265 | 77.9%
ASCII | 644 | 22.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
127
 
5.6%
73
 
3.2%
65
 
2.9%
55
 
2.4%
53
 
2.3%
51
 
2.3%
50
 
2.2%
48
 
2.1%
46
 
2.0%
42
 
1.9%
Other values (266) 1655
73.1%

ASCII
Value | Count | Frequency (%)
(space) | 126 | 19.6%
) | 112 | 17.4%
( | 111 | 17.2%
B | 25 | 3.9%
A | 24 | 3.7%
- | 21 | 3.3%
2 | 18 | 2.8%
P | 16 | 2.5%
1 | 14 | 2.2%
s | 12 | 1.9%
Other values (36) | 165 | 25.6%

대상 (target audience)
Categorical

IMBALANCE 

Distinct: 20
Distinct (%): 5.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
성인 | 301
아동 | 52
성인여성 | 5
초1~초3 | 2
63세이하(61년생까지) | 2
Other values (15) | 17

Length

Max length: 13
Median length: 2
Mean length: 2.2664908
Min length: 2

Unique

Unique: 13
Unique (%): 3.4%

Sample

1st row: 성인
2nd row: 성인
3rd row: 성인
4th row: 성인
5th row: 성인

Common Values

Value | Count | Frequency (%)
성인 | 301 | 79.4%
아동 | 52 | 13.7%
성인여성 | 5 | 1.3%
초1~초3 | 2 | 0.5%
63세이하(61년생까지) | 2 | 0.5%
초4~성인 | 2 | 0.5%
초등 | 2 | 0.5%
중등~성인 | 1 | 0.3%
시니어 | 1 | 0.3%
성인(70세미만) | 1 | 0.3%
Other values (10) | 10 | 2.6%
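
The 72.4% in the Imbalance alert for 대상 is consistent with one minus the normalized Shannon entropy of the category counts. That this is how the profiler scores imbalance is an assumption, since the report does not state the formula. The sketch below uses only counts that appear in this report: seven repeated values (301, 52, 5, 2, 2, 2, 2) and 13 values that occur once, for 20 distinct values and 379 rows. The function name `imbalance_score` is hypothetical.

```python
from math import log2

# Category counts for 대상 as listed in the report: seven repeated values
# plus 13 values that occur exactly once (20 distinct, 379 rows in total).
counts = [301, 52, 5, 2, 2, 2, 2] + [1] * 13

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits.

    Assumed formula; 0 means perfectly balanced, 1 means a single class.
    """
    n = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / n) * log2(c / n) for c in counts if c > 0)
    return 1 - entropy / log2(k)

score = imbalance_score(counts)
print(f"{score:.1%}")  # 72.4%
```

A score this high reflects that 성인 alone covers 301 of 379 rows, so any model or summary that splits on 대상 is dominated by a single class.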

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
성인 | 301 | 79.4%
아동 | 52 | 13.7%
성인여성 | 5 | 1.3%
초1~초3 | 2 | 0.5%
63세이하(61년생까지 | 2 | 0.5%
초4~성인 | 2 | 0.5%
초등 | 2 | 0.5%
초4~초6 | 1 | 0.3%
64세이상(60년생부터 | 1 | 0.3%
성인(만65세이상 | 1 | 0.3%
Other values (10) | 10 | 2.6%

수강인원(명) (enrollment capacity, persons)
Real number (ℝ)

Distinct: 34
Distinct (%): 9.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20.350923
Minimum: 4
Maximum: 60
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.5 KiB

Quantile statistics

Minimum: 4
5-th percentile: 10
Q1: 15
Median: 20
Q3: 25
95-th percentile: 35
Maximum: 60
Range: 56
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 7.4219621
Coefficient of variation (CV): 0.36469903
Kurtosis: 1.9965906
Mean: 20.350923
Median Absolute Deviation (MAD): 5
Skewness: 1.0295361
Sum: 7713
Variance: 55.085522
Monotonicity: Not monotonic
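
Several of the 수강인원(명) statistics are derived from one another, so they can be cross-checked with plain arithmetic. The sketch below uses only figures copied from this report (379 rows, sum 7713, standard deviation 7.4219621) and recomputes the mean, the coefficient of variation, and the variance from them.

```python
# Figures copied from the report for 수강인원(명).
n_rows = 379
total = 7713
std = 7.4219621

mean = total / n_rows   # mean = sum / number of observations
cv = std / mean         # coefficient of variation = std / mean
variance = std ** 2     # variance = std squared

print(round(mean, 6))      # 20.350923
print(round(cv, 6))        # 0.364699
print(round(variance, 4))  # 55.0855
```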
Histogram with fixed size bins (bins=34)

Common Values

Value | Count | Frequency (%)
15 | 107 | 28.2%
20 | 88 | 23.2%
25 | 42 | 11.1%
30 | 32 | 8.4%
12 | 12 | 3.2%
10 | 9 | 2.4%
40 | 9 | 2.4%
16 | 9 | 2.4%
28 | 6 | 1.6%
8 | 5 | 1.3%
Other values (24) | 60 | 15.8%

Minimum 10 values

Value | Count | Frequency (%)
4 | 2 | 0.5%
7 | 2 | 0.5%
8 | 5 | 1.3%
9 | 3 | 0.8%
10 | 9 | 2.4%
11 | 3 | 0.8%
12 | 12 | 3.2%
13 | 3 | 0.8%
14 | 1 | 0.3%
15 | 107 | 28.2%

Maximum 10 values

Value | Count | Frequency (%)
60 | 1 | 0.3%
40 | 9 | 2.4%
38 | 3 | 0.8%
37 | 1 | 0.3%
36 | 3 | 0.8%
35 | 4 | 1.1%
34 | 2 | 0.5%
33 | 2 | 0.5%
31 | 1 | 0.3%
30 | 32 | 8.4%

횟수(1주) (sessions per week)
Categorical

Distinct: 4
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
1 | 214
2 | 119
3 | 43
5 | 3

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 3
2nd row: 3
3rd row: 2
4th row: 2
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 214 | 56.5%
2 | 119 | 31.4%
3 | 43 | 11.3%
5 | 3 | 0.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1 | 214 | 56.5%
2 | 119 | 31.4%
3 | 43 | 11.3%
5 | 3 | 0.8%

수강시간(1회) (hours per session)
Categorical

Distinct: 5
Distinct (%): 1.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
1.0 | 193
2.0 | 155
1.5 | 28
2.5 | 2
1.4 | 1

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 1
Unique (%): 0.3%

Sample

1st row: 1.0
2nd row: 1.0
3rd row: 1.0
4th row: 1.0
5th row: 2.0

Common Values

Value | Count | Frequency (%)
1.0 | 193 | 50.9%
2.0 | 155 | 40.9%
1.5 | 28 | 7.4%
2.5 | 2 | 0.5%
1.4 | 1 | 0.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1.0 | 193 | 50.9%
2.0 | 155 | 40.9%
1.5 | 28 | 7.4%
2.5 | 2 | 0.5%
1.4 | 1 | 0.3%

수강기간 (course period)
Categorical

HIGH CORRELATION 

Distinct: 25
Distinct (%): 6.6%
Missing: 0
Missing (%): 0.0%
Memory size: 3.1 KiB

Value | Count
01-02~03-25 | 135
01-02~03-31 | 56
01-02 ~03-25 | 45
01-01~03-22 | 22
01-02~03-29 | 19
Other values (20) | 102

Length

Max length: 12
Median length: 11
Mean length: 11.113456
Min length: 9

Unique

Unique: 5
Unique (%): 1.3%

Sample

1st row: 01-03~03-25
2nd row: 01-03~03-25
3rd row: 01-02~03-21
4th row: 01-02~03-21
5th row: 01-05~03-22

Common Values

Value | Count | Frequency (%)
01-02~03-25 | 135 | 35.6%
01-02~03-31 | 56 | 14.8%
01-02 ~03-25 | 45 | 11.9%
01-01~03-22 | 22 | 5.8%
01-02~03-29 | 19 | 5.0%
01-02~03-21 | 17 | 4.5%
01-01~03-31 | 16 | 4.2%
01-04~03-31 | 10 | 2.6%
01-08~03-25 | 10 | 2.6%
01-04~03-21 | 8 | 2.1%
Other values (15) | 41 | 10.8%
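
The Common Values table for 수강기간 lists "01-02~03-25" (135 rows) and "01-02 ~03-25" (45 rows) as separate categories, although they differ only by a space before the "~". The sketch below shows one way such variants could be merged before analysis. The helper name `normalize_period` is hypothetical, and only the top four counts from the report are used, so this is an illustration rather than a full cleanup of the column.

```python
import re
from collections import Counter

# Counts copied from the Common Values table; only the top rows are used.
reported = Counter({
    "01-02~03-25": 135,
    "01-02~03-31": 56,
    "01-02 ~03-25": 45,   # same period, stray space before "~"
    "01-01~03-22": 22,
})

def normalize_period(value: str) -> str:
    """Remove whitespace around the '~' separator and at both ends."""
    return re.sub(r"\s*~\s*", "~", value.strip())

# Re-aggregate the counts under the normalized keys.
merged = Counter()
for value, count in reported.items():
    merged[normalize_period(value)] += count

print(merged["01-02~03-25"])  # 180
print(len(merged))            # 3
```

Merging the two spellings would lower the distinct count of the column and would likely change the correlation figures that involve 수강기간.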

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
01-02~03-25 | 135 | 31.8%
01-02~03-31 | 56 | 13.2%
01-02 | 45 | 10.6%
03-25 | 45 | 10.6%
01-01~03-22 | 22 | 5.2%
01-02~03-29 | 19 | 4.5%
01-02~03-21 | 17 | 4.0%
01-01~03-31 | 16 | 3.8%
01-04~03-31 | 10 | 2.4%
01-08~03-25 | 10 | 2.4%
Other values (16) | 49 | 11.6%

Interactions

[Figure: interaction plots (4 panels)]

Correlations

[Figure: correlation heatmap]

 | 연번 | 교육기관 | 분류 | 대상 | 수강인원(명) | 횟수(1주) | 수강시간(1회) | 수강기간
연번 | 1.000 | 0.981 | 0.392 | 0.550 | 0.462 | 0.275 | 0.456 | 0.931
교육기관 | 0.981 | 1.000 | 0.609 | 0.352 | 0.594 | 0.229 | 0.378 | 0.961
분류 | 0.392 | 0.609 | 1.000 | 0.253 | 0.462 | 0.467 | 0.438 | 0.497
대상 | 0.550 | 0.352 | 0.253 | 1.000 | 0.762 | 0.232 | 0.486 | 0.798
수강인원(명) | 0.462 | 0.594 | 0.462 | 0.762 | 1.000 | 0.627 | 0.250 | 0.696
횟수(1주) | 0.275 | 0.229 | 0.467 | 0.232 | 0.627 | 1.000 | 0.499 | 0.462
수강시간(1회) | 0.456 | 0.378 | 0.438 | 0.486 | 0.250 | 0.499 | 1.000 | 0.596
수강기간 | 0.931 | 0.961 | 0.497 | 0.798 | 0.696 | 0.462 | 0.596 | 1.000

[Figure: correlation heatmap]

 | 횟수(1주) | 수강시간(1회) | 분류 | 대상 | 수강기간 | 교육기관
횟수(1주) | 1.000 | 0.427 | 0.463 | 0.108 | 0.254 | 0.130
수강시간(1회) | 0.427 | 1.000 | 0.368 | 0.225 | 0.293 | 0.168
분류 | 0.463 | 0.368 | 1.000 | 0.133 | 0.288 | 0.341
대상 | 0.108 | 0.225 | 0.133 | 1.000 | 0.341 | 0.115
수강기간 | 0.254 | 0.293 | 0.288 | 0.341 | 1.000 | 0.711
교육기관 | 0.130 | 0.168 | 0.341 | 0.115 | 0.711 | 1.000
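
The categorical-only matrix above holds values between 0 and 1, which is what an association measure such as Cramér's V produces. The report does not say which measure was used, so the following is only a sketch of plain (not bias-corrected) Cramér's V. The function name `cramers_v` and the toy inputs are hypothetical and are not taken from the dataset.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Plain Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed cell counts
    row = Counter(xs)              # marginal counts of xs
    col = Counter(ys)              # marginal counts of ys

    # Pearson chi-squared statistic over every cell of the contingency table.
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Hypothetical toy data: one perfectly associated pair, one independent pair.
perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(perfect, independent)  # 1.0 0.0
```

A value near 1, such as the 0.711 between 수강기간 and 교육기관, would then mean that knowing one column largely determines the other, which fits a dataset where each center sets its own course dates.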

[Figure: correlation heatmap]

 | 연번 | 수강인원(명) | 교육기관 | 분류 | 대상 | 횟수(1주) | 수강시간(1회) | 수강기간
연번 | 1.000 | -0.230 | 0.851 | 0.254 | 0.201 | 0.166 | 0.204 | 0.657
수강인원(명) | -0.230 | 1.000 | 0.300 | 0.329 | 0.422 | 0.319 | 0.155 | 0.354
교육기관 | 0.851 | 0.300 | 1.000 | 0.341 | 0.115 | 0.130 | 0.168 | 0.711
분류 | 0.254 | 0.329 | 0.341 | 1.000 | 0.133 | 0.463 | 0.368 | 0.288
대상 | 0.201 | 0.422 | 0.115 | 0.133 | 1.000 | 0.108 | 0.225 | 0.341
횟수(1주) | 0.166 | 0.319 | 0.130 | 0.463 | 0.108 | 1.000 | 0.427 | 0.254
수강시간(1회) | 0.204 | 0.155 | 0.168 | 0.368 | 0.225 | 0.427 | 1.000 | 0.293
수강기간 | 0.657 | 0.354 | 0.711 | 0.288 | 0.341 | 0.254 | 0.293 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

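First rows

(index) | 연번 | 기수 | 교육기관 | 분류 | 강좌명 | 대상 | 수강인원(명) | 횟수(1주) | 수강시간(1회) | 수강기간
0 | 1 | 2024년 1분기 | 옥련1동 행정복지센터 | 문화여가 | 요가 | 성인 | 25 | 3 | 1.0 | 01-03~03-25
1 | 2 | 2024년 1분기 | 옥련1동 행정복지센터 | 문화여가 | 줌바&근력스트레칭 | 성인 | 25 | 3 | 1.0 | 01-03~03-25
2 | 3 | 2024년 1분기 | 옥련1동 행정복지센터 | 문화여가 | 라인댄스 | 성인 | 25 | 2 | 1.0 | 01-02~03-21
3 | 4 | 2024년 1분기 | 옥련1동 행정복지센터 | 문화여가 | 실버태권도 | 성인 | 20 | 2 | 1.0 | 01-02~03-21
4 | 5 | 2024년 1분기 | 옥련1동 행정복지센터 | 문화여가 | 우쿨렐레 | 성인 | 20 | 1 | 2.0 | 01-05~03-22
5 | 6 | 2024년 1분기 | 옥련1동 행정복지센터 | 문화여가 | 포크기타 | 성인 | 20 | 1 | 2.0 | 01-05~03-22
6 | 7 | 2024년 1분기 | 옥련1동 행정복지센터 | 문화여가 | 색연필로 그리는 보태니컬아트 | 성인 | 20 | 1 | 2.0 | 01-02~03-19
7 | 8 | 2024년 1분기 | 옥련1동 행정복지센터 | 문화여가 | 신바람 노래교실 | 성인 | 22 | 1 | 2.0 | 01-04~03-21
8 | 9 | 2024년 1분기 | 옥련2동 행정복지센터 | 문화여가 | 서예교실 | 성인 | 15 | 1 | 2.0 | 01-02 ~03-25
9 | 10 | 2024년 1분기 | 옥련2동 행정복지센터 | 문화여가 | 컴퓨터 A반 | 성인 | 12 | 1 | 2.0 | 01-02 ~03-25

Last rows

(index) | 연번 | 기수 | 교육기관 | 분류 | 강좌명 | 대상 | 수강인원(명) | 횟수(1주) | 수강시간(1회) | 수강기간
369 | 370 | 2024년 1분기 | 송도5동 행정복지센터 | 생활체육 | 유아체육A(목요반) | 아동 | 15 | 1 | 1.0 | 01-04~03-31
370 | 371 | 2024년 1분기 | 송도5동 행정복지센터 | 생활체육 | 유아체육A(금요반) | 아동 | 15 | 1 | 1.0 | 01-04~03-31
371 | 372 | 2024년 1분기 | 송도5동 행정복지센터 | 생활체육 | 유아체육B | 아동 | 15 | 1 | 1.0 | 01-04~03-31
372 | 373 | 2024년 1분기 | 송도5동 행정복지센터 | 생활체육 | 유아체육C | 아동 | 15 | 1 | 1.0 | 01-04~03-31
373 | 374 | 2024년 1분기 | 송도5동 행정복지센터 | 문화여가 | 저학년미술교실 | 초등생 | 15 | 1 | 2.0 | 01-04~03-31
374 | 375 | 2024년 1분기 | 송도5동 행정복지센터 | 문화여가 | 조물조물클레이아트 | 아동 | 15 | 1 | 2.0 | 01-04~03-31
375 | 376 | 2024년 1분기 | 송도5동 행정복지센터 | 문화여가 | 서양화미술 | 성인 | 15 | 1 | 2.0 | 01-04~03-31
376 | 377 | 2024년 1분기 | 송도5동 행정복지센터 | 문화여가 | 노래교실 | 성인 | 30 | 1 | 2.0 | 01-04~03-31
377 | 378 | 2024년 1분기 | 송도5동 행정복지센터 | 문화여가 | 통키타(초급) | 성인 | 15 | 1 | 2.0 | 01-04~03-31
378 | 379 | 2024년 1분기 | 송도5동 행정복지센터 | 문화여가 | 통키타(중급) | 성인 | 15 | 1 | 2.0 | 01-04~03-31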