Overview

Dataset statistics

Number of variables: 10
Number of observations: 92
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.7 KiB
Average record size in memory: 85.4 B

Variable types

Categorical: 6
Text: 1
Numeric: 3

Dataset

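Description: 부산광역시여성회관운영프로그램현황_20240101 (Busan Metropolitan City Women's Center program operations, as of 2024-01-01)
Author: 부산광역시 (Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3077143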

Alerts

데이터기준일자 has constant value "2024-01-01" (Constant)
교육인원_1_2기(정원) is highly overall correlated with 교육인원_3_4기(정원) and 3 other fields (High correlation)
교육인원_3_4기(정원) is highly overall correlated with 교육인원_1_2기(정원) and 3 other fields (High correlation)
교육인원_연간(인원) is highly overall correlated with 교육인원_1_2기(정원) and 3 other fields (High correlation)
교육횟수 is highly overall correlated with 교육인원_1_2기(정원) and 4 other fields (High correlation)
교육시기 is highly overall correlated with 교육인원_1_2기(정원) and 3 other fields (High correlation)
분기별 교육횟수 is highly overall correlated with 교육횟수 (High correlation)
교육횟수 is highly imbalanced (81.0%) (Imbalance)
교육시기 is highly imbalanced (85.1%) (Imbalance)
교육인원_1_2기(정원) has 1 (1.1%) zeros (Zeros)
교육인원_3_4기(정원) has 2 (2.2%) zeros (Zeros)
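
The imbalance percentages in the alerts can be reproduced from the value counts listed under each variable. The sketch below assumes the score is one minus the Shannon entropy of the class frequencies, normalized by the logarithm of the number of classes. That formula is inferred from the reported numbers, not stated in the report, and the function name `imbalance_score` is hypothetical.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of the
    class frequencies and k is the number of classes.

    0 means perfectly balanced classes; values near 1 mean that one class
    holds almost every row.
    """
    k = len(counts)
    if k < 2:
        raise ValueError("need at least two classes")
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Value counts copied from the Common Values tables of this report.
training_count = [87, 3, 1, 1]       # 교육횟수: values 4, 1, 6, 3
training_period = [88, 1, 1, 1, 1]   # 교육시기: 1-4기, 1-3기, 1기, 2기, 3기

print(f"교육횟수: {imbalance_score(training_count):.1%}")
print(f"교육시기: {imbalance_score(training_period):.1%}")
```

Under this assumption the two scores round to the 81.0% and 85.1% shown in the alerts, which suggests both variables are flagged because a single value dominates.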

Reproduction

Analysis started: 2024-03-13 13:19:41.533793
Analysis finished: 2024-03-13 13:19:43.315826
Duration: 1.78 seconds
Software version: ydata-profiling v4.5.1
Configuration: config.json

Variables

구분 (category)
Categorical

Distinct: 5
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B

Length

Max length: 8
Median length: 5
Mean length: 5.7717391
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 자격증강좌
2nd row: 자격증강좌
3rd row: 자격증강좌
4th row: 자격증강좌
5th row: 자격증강좌

Common Values

Value | Count | Frequency (%)
기능생활교양강좌 (skills, daily-life, and liberal-arts courses) | 36 | 39.1%
자격증강좌 (certification courses) | 19 | 20.7%
심화강좌 (advanced courses) | 14 | 15.2%
야간강좌 (evening courses) | 13 | 14.1%
주말강좌 (weekend courses) | 10 | 10.9%

Figures (not reproduced): histogram of lengths of the category; plot of common values.

분야 (subject area)
Categorical

Distinct: 9
Distinct (%): 9.8%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.5108696
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공예
2nd row: 미용
3rd row: 미용
4th row: 미용
5th row: 미용

Common Values

Value | Count | Frequency (%)
봉제 (sewing) | 19 | 20.7%
미용 (beauty) | 12 | 13.0%
제과제빵 (baking and confectionery) | 12 | 13.0%
공예 (crafts) | 11 | 12.0%
요리 (cooking) | 11 | 12.0%
외국어 (foreign languages) | 11 | 12.0%
컴퓨터 (computers) | 8 | 8.7%
커피 (coffee) | 6 | 6.5%
노인교구 (teaching aids for seniors) | 2 | 2.2%

Figures (not reproduced): histogram of lengths of the category; plot of common values.

강좌명 (course name)
Text

Distinct: 85
Distinct (%): 92.4%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B

Length

Max length: 14
Median length: 11
Mean length: 6.4673913
Min length: 2

Characters and Unicode

Total characters: 595
Distinct characters: 191
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
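
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It runs only on the five sample course names shown in this report, not on the full column, so its totals are not the report's totals. The helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(strings):
    """Count Unicode general categories (e.g. 'Lo', 'Ps') over all characters."""
    counts = Counter()
    for s in strings:
        # Normalize to NFC so each Hangul syllable is a single code point.
        for ch in unicodedata.normalize("NFC", s):
            counts[unicodedata.category(ch)] += 1
    return counts


# The five sample rows of 강좌명 shown in this report.
sample_names = [
    "화훼장식기능사",
    "미용사(일반)",
    "미용사(네일)",
    "미용사(피부)",
    "미용사(메이크업)",
]

for code, n in category_counts(sample_names).most_common():
    print(code, n)
```

The two-letter codes correspond to the category names used in the tables of this section: `Lo` is Other Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.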

Unique

Unique: 79
Unique (%): 85.9%

Sample

1st row: 화훼장식기능사
2nd row: 미용사(일반)
3rd row: 미용사(네일)
4th row: 미용사(피부)
5th row: 미용사(메이크업)

Words

Value | Count | Frequency (%)
홈패션 | 3 | 2.3%
커피바리스타 | 3 | 2.3%
중급 | 3 | 2.3%
카페디저트 | 3 | 2.3%
남성커트 | 3 | 2.3%
프랑스자수 | 2 | 1.5%
창업 | 2 | 1.5%
그래픽 | 2 | 1.5%
한지공예 | 2 | 1.5%
동영상 | 2 | 1.5%
Other values (96) | 105 | 80.8%

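Most occurring characters

Value | Count | Frequency (%)
(space) | 38 | 6.4%
[character lost] | 17 | 2.9%
[character lost] | 15 | 2.5%
[character lost] | 14 | 2.4%
( | 11 | 1.8%
[character lost] | 11 | 1.8%
[character lost] | 11 | 1.8%
) | 11 | 1.8%
[character lost] | 11 | 1.8%
[character lost] | 10 | 1.7%
Other values (181) | 446 | 75.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 499 | 83.9%
Space Separator | 38 | 6.4%
Uppercase Letter | 25 | 4.2%
Open Punctuation | 11 | 1.8%
Close Punctuation | 11 | 1.8%
Other Punctuation | 7 | 1.2%
Decimal Number | 2 | 0.3%
Lowercase Letter | 2 | 0.3%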
0.3%

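Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[character lost] | 17 | 3.4%
[character lost] | 15 | 3.0%
[character lost] | 14 | 2.8%
[character lost] | 11 | 2.2%
[character lost] | 11 | 2.2%
[character lost] | 11 | 2.2%
[character lost] | 10 | 2.0%
[character lost] | 9 | 1.8%
[character lost] | 9 | 1.8%
[character lost] | 9 | 1.8%
Other values (158) | 383 | 76.8%

Uppercase Letter
Value | Count | Frequency (%)
T | 4 | 16.0%
P | 3 | 12.0%
O | 2 | 8.0%
A | 2 | 8.0%
B | 2 | 8.0%
C | 2 | 8.0%
Q | 2 | 8.0%
I | 2 | 8.0%
J | 1 | 4.0%
K | 1 | 4.0%
Other values (4) | 4 | 16.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 3 | 42.9%
& | 2 | 28.6%
, | 2 | 28.6%

Lowercase Letter
Value | Count | Frequency (%)
b | 1 | 50.0%
a | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 38 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 11 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 11 | 100.0%

Decimal Number
Value | Count | Frequency (%)
2 | 2 | 100.0%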

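Most occurring scripts

Value | Count | Frequency (%)
Hangul | 499 | 83.9%
Common | 69 | 11.6%
Latin | 27 | 4.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[character lost] | 17 | 3.4%
[character lost] | 15 | 3.0%
[character lost] | 14 | 2.8%
[character lost] | 11 | 2.2%
[character lost] | 11 | 2.2%
[character lost] | 11 | 2.2%
[character lost] | 10 | 2.0%
[character lost] | 9 | 1.8%
[character lost] | 9 | 1.8%
[character lost] | 9 | 1.8%
Other values (158) | 383 | 76.8%

Latin
Value | Count | Frequency (%)
T | 4 | 14.8%
P | 3 | 11.1%
O | 2 | 7.4%
A | 2 | 7.4%
B | 2 | 7.4%
C | 2 | 7.4%
Q | 2 | 7.4%
I | 2 | 7.4%
b | 1 | 3.7%
J | 1 | 3.7%
Other values (6) | 6 | 22.2%

Common
Value | Count | Frequency (%)
(space) | 38 | 55.1%
( | 11 | 15.9%
) | 11 | 15.9%
/ | 3 | 4.3%
2 | 2 | 2.9%
& | 2 | 2.9%
, | 2 | 2.9%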

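Most occurring blocks

Value | Count | Frequency (%)
Hangul | 499 | 83.9%
ASCII | 96 | 16.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 38 | 39.6%
( | 11 | 11.5%
) | 11 | 11.5%
T | 4 | 4.2%
P | 3 | 3.1%
/ | 3 | 3.1%
2 | 2 | 2.1%
O | 2 | 2.1%
& | 2 | 2.1%
A | 2 | 2.1%
Other values (13) | 18 | 18.8%

Hangul
Value | Count | Frequency (%)
[character lost] | 17 | 3.4%
[character lost] | 15 | 3.0%
[character lost] | 14 | 2.8%
[character lost] | 11 | 2.2%
[character lost] | 11 | 2.2%
[character lost] | 11 | 2.2%
[character lost] | 10 | 2.0%
[character lost] | 9 | 1.8%
[character lost] | 9 | 1.8%
[character lost] | 9 | 1.8%
Other values (158) | 383 | 76.8%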

교육횟수 (training count)
Categorical

Alerts: High correlation, Imbalance

Distinct: 4
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 2
Unique (%): 2.2%

Sample

1st row: 4
2nd row: 4
3rd row: 4
4th row: 4
5th row: 4

Common Values

Value | Count | Frequency (%)
4 | 87 | 94.6%
1 | 3 | 3.3%
6 | 1 | 1.1%
3 | 1 | 1.1%

Figures (not reproduced): histogram of lengths of the category; plot of common values.

교육시기 (training period)
Categorical

Alerts: High correlation, Imbalance

Distinct: 5
Distinct (%): 5.4%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B

Length

Max length: 4
Median length: 4
Mean length: 3.9347826
Min length: 2

Unique

Unique: 4
Unique (%): 4.3%

Sample

1st row: 1-4기
2nd row: 1-4기
3rd row: 1-4기
4th row: 1-4기
5th row: 1-4기

Common Values

Value | Count | Frequency (%)
1-4기 (terms 1-4) | 88 | 95.7%
1-3기 (terms 1-3) | 1 | 1.1%
1기 (term 1) | 1 | 1.1%
2기 (term 2) | 1 | 1.1%
3기 (term 3) | 1 | 1.1%

Figures (not reproduced): histogram of lengths of the category; plot of common values.

교육인원_1_2기(정원) (enrollment, terms 1-2, capacity)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 10
Distinct (%): 10.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 39.847826
Minimum: 0
Maximum: 50
Zeros: 1
Zeros (%): 1.1%
Negative: 0
Negative (%): 0.0%
Memory size: 960.0 B

Quantile statistics

Minimum: 0
5th percentile: 30
Q1: 40
Median: 40
Q3: 40
95th percentile: 50
Maximum: 50
Range: 50
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 7.5198416
Coefficient of variation (CV): 0.18871397
Kurtosis: 9.7380537
Mean: 39.847826
Median Absolute Deviation (MAD): 0
Skewness: -2.2674599
Sum: 3666
Variance: 56.548017
Monotonicity: Not monotonic

Figure (not reproduced): histogram with fixed size bins (bins=10).

Common values

Value | Count | Frequency (%)
40 | 61 | 66.3%
50 | 10 | 10.9%
48 | 8 | 8.7%
30 | 5 | 5.4%
32 | 3 | 3.3%
28 | 1 | 1.1%
36 | 1 | 1.1%
20 | 1 | 1.1%
12 | 1 | 1.1%
0 | 1 | 1.1%

Smallest values (ascending): 0, 12, 20, 28, 30, 32, 36, 40, 48, 50
Largest values (descending): 50, 48, 40, 36, 32, 30, 28, 20, 12, 0
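
The descriptive statistics for 교육인원_1_2기(정원) can be cross-checked against its frequency table. The sketch below expands the reported value counts back into 92 observations and recomputes the headline figures with the standard library. It assumes the reported variance uses the sample (n - 1) denominator, which is what `statistics.variance` computes and which agrees with the reported value.

```python
import statistics

# Value -> count, copied from the frequency table of 교육인원_1_2기(정원).
freq = {40: 61, 50: 10, 48: 8, 30: 5, 32: 3, 28: 1, 36: 1, 20: 1, 12: 1, 0: 1}

# Expand the table back into individual observations.
values = [value for value, count in freq.items() for _ in range(count)]

total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)      # square root of the sample variance
cv = std_dev / mean                     # coefficient of variation
zeros = sum(1 for v in values if v == 0)

print(f"n={len(values)} sum={total} mean={mean:.6f}")
print(f"variance={variance:.6f} std={std_dev:.7f} cv={cv:.8f} zeros={zeros}")
```

The interquartile range of 0 follows from the same table: 61 of the 92 rows share the value 40, so the first quartile, the median, and the third quartile all fall on 40.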

교육인원_3_4기(정원) (enrollment, terms 3-4, capacity)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 9
Distinct (%): 9.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 39.456522
Minimum: 0
Maximum: 50
Zeros: 2
Zeros (%): 2.2%
Negative: 0
Negative (%): 0.0%
Memory size: 960.0 B

Quantile statistics

Minimum: 0
5th percentile: 29.1
Q1: 40
Median: 40
Q3: 40
95th percentile: 50
Maximum: 50
Range: 50
Interquartile range (IQR): 0
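
The non-integer 5th percentile (29.1) is consistent with linear interpolation between neighbouring order statistics. The sketch below assumes that convention, with position = p * (n - 1), which is also NumPy's default percentile method, and applies it to the value counts reported for 교육인원_3_4기(정원). The helper name `percentile` is hypothetical.

```python
def percentile(sorted_values, p):
    """Percentile by linear interpolation between the two closest ranks.

    `p` is a fraction in [0, 1]; the fractional position is p * (n - 1).
    """
    if not sorted_values:
        raise ValueError("need at least one value")
    pos = p * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = pos - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# Value -> count, copied from the frequency table of 교육인원_3_4기(정원).
freq = {0: 2, 12: 1, 20: 1, 28: 1, 30: 5, 32: 3, 40: 61, 48: 8, 50: 10}
values = sorted(value for value, count in freq.items() for _ in range(count))

p5 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
median = percentile(values, 0.50)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)

print(round(p5, 1), q1, median, q3, p95)
```

With 92 observations the 5% position is 4.55, which falls between the fifth and sixth smallest values (28 and 30); interpolating 55% of the way gives 29.1.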

Descriptive statistics

Standard deviation: 8.583671
Coefficient of variation (CV): 0.21754758
Kurtosis: 9.421479
Mean: 39.456522
Median Absolute Deviation (MAD): 0
Skewness: -2.5074817
Sum: 3630
Variance: 73.679408
Monotonicity: Not monotonic

Figure (not reproduced): histogram with fixed size bins (bins=9).

Common values

Value | Count | Frequency (%)
40 | 61 | 66.3%
50 | 10 | 10.9%
48 | 8 | 8.7%
30 | 5 | 5.4%
32 | 3 | 3.3%
0 | 2 | 2.2%
28 | 1 | 1.1%
12 | 1 | 1.1%
20 | 1 | 1.1%

Smallest values (ascending): 0, 12, 20, 28, 30, 32, 40, 48, 50
Largest values (descending): 50, 48, 40, 32, 30, 28, 20, 12, 0

교육인원_연간(인원) (enrollment, annual, headcount)
Real number (ℝ)

Alerts: High correlation

Distinct: 9
Distinct (%): 9.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 79.304348
Minimum: 12
Maximum: 100
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 960.0 B

Quantile statistics

Minimum: 12
5th percentile: 58.2
Q1: 80
Median: 80
Q3: 80
95th percentile: 100
Maximum: 100
Range: 88
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 15.617502
Coefficient of variation (CV): 0.19693121
Kurtosis: 6.7302478
Mean: 79.304348
Median Absolute Deviation (MAD): 0
Skewness: -2.0235546
Sum: 7296
Variance: 243.90635
Monotonicity: Not monotonic

Figure (not reproduced): histogram with fixed size bins (bins=9).

Common values

Value | Count | Frequency (%)
80 | 61 | 66.3%
100 | 10 | 10.9%
96 | 8 | 8.7%
60 | 5 | 5.4%
64 | 3 | 3.3%
20 | 2 | 2.2%
56 | 1 | 1.1%
48 | 1 | 1.1%
12 | 1 | 1.1%

Smallest values (ascending): 12, 20, 48, 56, 60, 64, 80, 96, 100
Largest values (descending): 100, 96, 80, 64, 60, 56, 48, 20, 12

분기별 교육횟수 (training count per quarter)
Categorical

Alerts: High correlation

Distinct: 3
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B

Length

Max length: 6
Median length: 6
Mean length: 5.8695652
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20-24회
2nd row: 20-24회
3rd row: 20-24회
4th row: 20-24회
5th row: 20-24회

Common Values

Value | Count | Frequency (%)
10-12회 (10-12 sessions) | 66 | 71.7%
20-24회 (20-24 sessions) | 20 | 21.7%
1-6회 (1-6 sessions) | 6 | 6.5%

Figures (not reproduced): histogram of lengths of the category; plot of common values.

데이터기준일자 (data reference date)
Categorical

Alerts: Constant

Distinct: 1
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 868.0 B

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-01-01
2nd row: 2024-01-01
3rd row: 2024-01-01
4th row: 2024-01-01
5th row: 2024-01-01

Common Values

Value | Count | Frequency (%)
2024-01-01 | 92 | 100.0%

Figures (not reproduced): histogram of lengths of the category; plot of common values.

Interactions

Figures (not reproduced): nine interaction plots.

Correlations

Figures (not reproduced): one correlation heatmap per table below.

Correlation table 1 of 3
Columns, in order: 구분, 분야, 강좌명, 교육횟수, 교육시기, 교육인원_1_2기(정원), 교육인원_3_4기(정원), 교육인원_연간(인원), 분기별 교육횟수
구분: 1.000 0.176 0.000 0.338 0.508 0.342 0.390 0.638 0.550
분야: 0.176 1.000 0.996 0.404 0.166 0.562 0.610 0.690 0.641
강좌명: 0.000 0.996 1.000 1.000 1.000 0.000 0.000 0.998 0.982
교육횟수: 0.338 0.404 1.000 1.000 0.835 0.675 0.868 0.836 0.547
교육시기: 0.508 0.166 1.000 0.835 1.000 0.906 0.905 0.952 0.520
교육인원_1_2기(정원): 0.342 0.562 0.000 0.675 0.906 1.000 0.997 0.905 0.571
교육인원_3_4기(정원): 0.390 0.610 0.000 0.868 0.905 0.997 1.000 1.000 0.574
교육인원_연간(인원): 0.638 0.690 0.998 0.836 0.952 0.905 1.000 1.000 0.541
분기별 교육횟수: 0.550 0.641 0.982 0.547 0.520 0.571 0.574 0.541 1.000

Correlation table 2 of 3
Columns, in order: 교육횟수, 분야, 교육시기, 분기별 교육횟수, 구분
교육횟수: 1.000 0.260 0.802 0.549 0.280
분야: 0.260 1.000 0.089 0.344 0.094
교육시기: 0.802 0.089 1.000 0.452 0.209
분기별 교육횟수: 0.549 0.344 0.452 1.000 0.487
구분: 0.280 0.094 0.209 0.487 1.000

Correlation table 3 of 3
Columns, in order: 교육인원_1_2기(정원), 교육인원_3_4기(정원), 교육인원_연간(인원), 구분, 분야, 교육횟수, 교육시기, 분기별 교육횟수
교육인원_1_2기(정원): 1.000 0.999 0.999 0.208 0.318 0.529 0.847 0.455
교육인원_3_4기(정원): 0.999 1.000 1.000 0.245 0.360 0.789 0.846 0.457
교육인원_연간(인원): 0.999 1.000 1.000 0.264 0.467 0.687 0.587 0.477
구분: 0.208 0.245 0.264 1.000 0.094 0.280 0.209 0.487
분야: 0.318 0.360 0.467 0.094 1.000 0.260 0.089 0.344
교육횟수: 0.529 0.789 0.687 0.280 0.260 1.000 0.802 0.549
교육시기: 0.847 0.846 0.587 0.209 0.089 0.802 1.000 0.452
분기별 교육횟수: 0.455 0.457 0.477 0.487 0.344 0.549 0.452 1.000

Missing values

Figure (not reproduced): a simple visualization of nullity by column.
Figure (not reproduced): nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.

Sample

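First rows

Row | 구분 | 분야 | 강좌명 | 교육횟수 | 교육시기 | 교육인원_1_2기(정원) | 교육인원_3_4기(정원) | 교육인원_연간(인원) | 분기별 교육횟수 | 데이터기준일자
0 | 자격증강좌 | 공예 | 화훼장식기능사 | 4 | 1-4기 | 40 | 40 | 80 | 20-24회 | 2024-01-01
1 | 자격증강좌 | 미용 | 미용사(일반) | 4 | 1-4기 | 40 | 40 | 80 | 20-24회 | 2024-01-01
2 | 자격증강좌 | 미용 | 미용사(네일) | 4 | 1-4기 | 40 | 40 | 80 | 20-24회 | 2024-01-01
3 | 자격증강좌 | 미용 | 미용사(피부) | 4 | 1-4기 | 40 | 40 | 80 | 20-24회 | 2024-01-01
4 | 자격증강좌 | 미용 | 미용사(메이크업) | 4 | 1-4기 | 40 | 40 | 80 | 20-24회 | 2024-01-01
5 | 자격증강좌 | 요리 | 한식조리기능사 | 4 | 1-4기 | 40 | 40 | 80 | 20-24회 | 2024-01-01
6 | 자격증강좌 | 요리 | 양식/중식/일식조리기능사 | 4 | 1-4기 | 50 | 50 | 100 | 20-24회 | 2024-01-01
7 | 자격증강좌 | 제과제빵 | 제빵기능사 | 4 | 1-4기 | 40 | 40 | 80 | 20-24회 | 2024-01-01
8 | 자격증강좌 | 제과제빵 | 제과기능사 | 4 | 1-4기 | 40 | 40 | 80 | 20-24회 | 2024-01-01
9 | 자격증강좌 | 커피 | 바리스타A | 4 | 1-4기 | 40 | 40 | 80 | 10-12회 | 2024-01-01

Last rows

Row | 구분 | 분야 | 강좌명 | 교육횟수 | 교육시기 | 교육인원_1_2기(정원) | 교육인원_3_4기(정원) | 교육인원_연간(인원) | 분기별 교육횟수 | 데이터기준일자
82 | 주말강좌 | 공예 | 손맛나는 수채캘리그라피 | 4 | 1-4기 | 40 | 40 | 80 | 10-12회 | 2024-01-01
83 | 주말강좌 | 미용 | 남성커트 초급 | 4 | 1-4기 | 50 | 50 | 100 | 10-12회 | 2024-01-01
84 | 주말강좌 | 미용 | 네일아트 | 4 | 1-4기 | 40 | 40 | 80 | 10-12회 | 2024-01-01
85 | 주말강좌 | 제과제빵 | 반려동물 간식 | 1 | 1기 | 20 | 0 | 20 | 1-6회 | 2024-01-01
86 | 주말강좌 | 제과제빵 | 가족 홈 베이커리 | 1 | 2기 | 12 | 0 | 12 | 1-6회 | 2024-01-01
87 | 주말강좌 | 제과제빵 | 쿠키와 스콘 | 1 | 3기 | 0 | 20 | 20 | 1-6회 | 2024-01-01
88 | 주말강좌 | 봉제 | 누비제품 제작 | 4 | 1-4기 | 40 | 40 | 80 | 10-12회 | 2024-01-01
89 | 주말강좌 | 봉제 | 옷 만들기 | 4 | 1-4기 | 40 | 40 | 80 | 10-12회 | 2024-01-01
90 | 주말강좌 | 외국어 | 재밌는 영어회화 | 4 | 1-4기 | 50 | 50 | 100 | 10-12회 | 2024-01-01
91 | 주말강좌 | 컴퓨터 | 그래픽, 동영상 등 | 4 | 1-4기 | 40 | 40 | 80 | 10-12회 | 2024-01-01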
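
In the rows shown here, 교육인원_연간(인원) equals 교육인원_1_2기(정원) plus 교육인원_3_4기(정원), which would account for the near-perfect correlations reported among these three fields. The sketch below checks that relationship on the 20 sample rows and on the column sums from the descriptive statistics. It covers only the rows visible in this report, so it is a plausibility check and does not prove the identity for all 92 rows.

```python
# Row index -> (terms 1-2 capacity, terms 3-4 capacity, annual headcount),
# copied from the first and last rows shown in the Sample section.
sample_rows = {
    0: (40, 40, 80), 1: (40, 40, 80), 2: (40, 40, 80), 3: (40, 40, 80),
    4: (40, 40, 80), 5: (40, 40, 80), 6: (50, 50, 100), 7: (40, 40, 80),
    8: (40, 40, 80), 9: (40, 40, 80),
    82: (40, 40, 80), 83: (50, 50, 100), 84: (40, 40, 80), 85: (20, 0, 20),
    86: (12, 0, 12), 87: (0, 20, 20), 88: (40, 40, 80), 89: (40, 40, 80),
    90: (50, 50, 100), 91: (40, 40, 80),
}

# Rows where the annual figure is not the sum of the two half-year figures.
mismatches = {
    row: figures
    for row, figures in sample_rows.items()
    if figures[0] + figures[1] != figures[2]
}

# Column sums reported under "Descriptive statistics" for the three fields.
sum_first_half, sum_second_half, sum_annual = 3666, 3630, 7296
sums_agree = sum_first_half + sum_second_half == sum_annual

print("rows that break the identity:", mismatches)
print("column sums agree:", sums_agree)
```

If the identity holds across the whole dataset, the annual field carries no information beyond the two half-year fields, and one of the three could be dropped before modelling.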