Overview

Dataset statistics

Number of variables7
Number of observations653
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory36.5 KiB
Average record size in memory57.2 B

Variable types

Categorical2
Text2
DateTime2
Numeric1

Dataset

Description여성가족부 센터별 모집공고에 대한 정보(모집구분, 모집상태, 모집공고명,모집시작일, 모집종료일, 모집정원, 지역)를 제공
Author여성가족부
URLhttps://www.data.go.kr/data/15063152/fileData.do

Alerts

Dataset has 1 (0.2%) duplicate rowsDuplicates
모집상태 is highly imbalanced (69.2%)Imbalance
모집정원 has 501 (76.7%) zerosZeros

Reproduction

Analysis started2023-12-12 11:33:48.867647
Analysis finished2023-12-12 11:33:50.060083
Duration1.19 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

모집구분
Categorical

Distinct3
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
정기모집
450 
수시모집
169 
특별모집
 
34

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수시모집
2nd row수시모집
3rd row정기모집
4th row정기모집
5th row정기모집

Common Values

ValueCountFrequency (%)
정기모집 450
68.9%
수시모집 169
 
25.9%
특별모집 34
 
5.2%

Length

2023-12-12T20:33:50.266611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:33:50.544215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정기모집 450
68.9%
수시모집 169
 
25.9%
특별모집 34
 
5.2%

모집상태
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
마감
617 
모집중
 
36

Length

Max length3
Median length2
Mean length2.0551302
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row마감
2nd row마감
3rd row마감
4th row마감
5th row마감

Common Values

ValueCountFrequency (%)
마감 617
94.5%
모집중 36
 
5.5%

Length

2023-12-12T20:33:50.838673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:33:51.083697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
마감 617
94.5%
모집중 36
 
5.5%
Distinct566
Distinct (%)86.7%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
2023-12-12T20:33:51.592805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length47
Mean length22.064319
Min length16

Characters and Unicode

Total characters14408
Distinct characters211
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique503 ?
Unique (%)77.0%

Sample

1st row2021 옥천군 아이돌보미 수시모집
2nd row2021 옥천군 아이돌보미 수시모집
3rd row2021 옥천군 아이돌보미 정기 모집
4th row2021 옥천군 아이돌보미 정기모집
5th row2021 태백시 아이돌보미 2차
ValueCountFrequency (%)
아이돌보미 615
20.0%
2021 577
18.8%
1차 288
 
9.4%
2차 134
 
4.4%
정기모집 118
 
3.8%
모집 81
 
2.6%
2021년 68
 
2.2%
3차 53
 
1.7%
수시모집 53
 
1.7%
중구 25
 
0.8%
Other values (386) 1059
34.5%
2023-12-12T20:33:52.496942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2812
19.5%
2 1461
 
10.1%
1 967
 
6.7%
667
 
4.6%
663
 
4.6%
659
 
4.6%
657
 
4.6%
656
 
4.6%
0 648
 
4.5%
569
 
3.9%
Other values (201) 4649
32.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8118
56.3%
Decimal Number 3199
 
22.2%
Space Separator 2812
 
19.5%
Open Punctuation 120
 
0.8%
Close Punctuation 120
 
0.8%
Dash Punctuation 18
 
0.1%
Other Punctuation 13
 
0.1%
Connector Punctuation 4
 
< 0.1%
Uppercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
667
 
8.2%
663
 
8.2%
659
 
8.1%
657
 
8.1%
656
 
8.1%
569
 
7.0%
300
 
3.7%
300
 
3.7%
287
 
3.5%
252
 
3.1%
Other values (179) 3108
38.3%
Decimal Number
ValueCountFrequency (%)
2 1461
45.7%
1 967
30.2%
0 648
20.3%
3 61
 
1.9%
4 30
 
0.9%
5 14
 
0.4%
6 7
 
0.2%
7 6
 
0.2%
9 3
 
0.1%
8 2
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
A 1
25.0%
C 1
25.0%
W 1
25.0%
Y 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 8
61.5%
/ 4
30.8%
. 1
 
7.7%
Space Separator
ValueCountFrequency (%)
2812
100.0%
Open Punctuation
ValueCountFrequency (%)
( 120
100.0%
Close Punctuation
ValueCountFrequency (%)
) 120
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8118
56.3%
Common 6286
43.6%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
667
 
8.2%
663
 
8.2%
659
 
8.1%
657
 
8.1%
656
 
8.1%
569
 
7.0%
300
 
3.7%
300
 
3.7%
287
 
3.5%
252
 
3.1%
Other values (179) 3108
38.3%
Common
ValueCountFrequency (%)
2812
44.7%
2 1461
23.2%
1 967
 
15.4%
0 648
 
10.3%
( 120
 
1.9%
) 120
 
1.9%
3 61
 
1.0%
4 30
 
0.5%
- 18
 
0.3%
5 14
 
0.2%
Other values (8) 35
 
0.6%
Latin
ValueCountFrequency (%)
A 1
25.0%
C 1
25.0%
W 1
25.0%
Y 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8118
56.3%
ASCII 6290
43.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2812
44.7%
2 1461
23.2%
1 967
 
15.4%
0 648
 
10.3%
( 120
 
1.9%
) 120
 
1.9%
3 61
 
1.0%
4 30
 
0.5%
- 18
 
0.3%
5 14
 
0.2%
Other values (12) 39
 
0.6%
Hangul
ValueCountFrequency (%)
667
 
8.2%
663
 
8.2%
659
 
8.1%
657
 
8.1%
656
 
8.1%
569
 
7.0%
300
 
3.7%
300
 
3.7%
287
 
3.5%
252
 
3.1%
Other values (179) 3108
38.3%
Distinct164
Distinct (%)25.1%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
Minimum2020-11-18 00:00:00
Maximum2021-08-17 00:00:00
2023-12-12T20:33:52.859201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:33:53.229581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct185
Distinct (%)28.3%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
Minimum2020-12-23 00:00:00
Maximum2022-01-31 00:00:00
2023-12-12T20:33:53.566882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:33:53.913238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

모집정원
Real number (ℝ)

ZEROS 

Distinct25
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.4900459
Minimum0
Maximum88
Zeros501
Zeros (%)76.7%
Negative0
Negative (%)0.0%
Memory size5.9 KiB
2023-12-12T20:33:54.226450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile15
Maximum88
Range88
Interquartile range (IQR)0

Descriptive statistics

Standard deviation8.0313024
Coefficient of variation (CV)3.2253631
Kurtosis47.392121
Mean2.4900459
Median Absolute Deviation (MAD)0
Skewness6.090459
Sum1626
Variance64.501818
MonotonicityNot monotonic
2023-12-12T20:33:54.510157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
0 501
76.7%
10 23
 
3.5%
3 20
 
3.1%
2 19
 
2.9%
5 19
 
2.9%
15 18
 
2.8%
1 16
 
2.5%
20 8
 
1.2%
13 4
 
0.6%
8 3
 
0.5%
Other values (15) 22
 
3.4%
ValueCountFrequency (%)
0 501
76.7%
1 16
 
2.5%
2 19
 
2.9%
3 20
 
3.1%
4 2
 
0.3%
5 19
 
2.9%
6 2
 
0.3%
7 1
 
0.2%
8 3
 
0.5%
9 1
 
0.2%
ValueCountFrequency (%)
88 1
 
0.2%
73 2
 
0.3%
60 2
 
0.3%
47 1
 
0.2%
40 2
 
0.3%
30 1
 
0.2%
25 2
 
0.3%
24 1
 
0.2%
22 1
 
0.2%
20 8
1.2%

지역
Text

Distinct218
Distinct (%)33.4%
Missing0
Missing (%)0.0%
Memory size5.2 KiB
2023-12-12T20:33:55.123112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length5.9111792
Min length5

Characters and Unicode

Total characters3860
Distinct characters130
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)7.7%

Sample

1st row충북 옥천군
2nd row충북 옥천군
3rd row충북 옥천군
4th row충북 옥천군
5th row강원 태백시
ValueCountFrequency (%)
서울 120
 
9.2%
경기 94
 
7.2%
강원 56
 
4.3%
전남 54
 
4.1%
경북 52
 
4.0%
경남 43
 
3.3%
전북 41
 
3.1%
부산 31
 
2.4%
대구 30
 
2.3%
인천 30
 
2.3%
Other values (205) 755
57.8%
2023-12-12T20:33:55.848747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
653
16.9%
279
 
7.2%
237
 
6.1%
196
 
5.1%
184
 
4.8%
171
 
4.4%
154
 
4.0%
134
 
3.5%
127
 
3.3%
119
 
3.1%
Other values (120) 1606
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3207
83.1%
Space Separator 653
 
16.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
279
 
8.7%
237
 
7.4%
196
 
6.1%
184
 
5.7%
171
 
5.3%
154
 
4.8%
134
 
4.2%
127
 
4.0%
119
 
3.7%
94
 
2.9%
Other values (119) 1512
47.1%
Space Separator
ValueCountFrequency (%)
653
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3207
83.1%
Common 653
 
16.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
279
 
8.7%
237
 
7.4%
196
 
6.1%
184
 
5.7%
171
 
5.3%
154
 
4.8%
134
 
4.2%
127
 
4.0%
119
 
3.7%
94
 
2.9%
Other values (119) 1512
47.1%
Common
ValueCountFrequency (%)
653
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3207
83.1%
ASCII 653
 
16.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
653
100.0%
Hangul
ValueCountFrequency (%)
279
 
8.7%
237
 
7.4%
196
 
6.1%
184
 
5.7%
171
 
5.3%
154
 
4.8%
134
 
4.2%
127
 
4.0%
119
 
3.7%
94
 
2.9%
Other values (119) 1512
47.1%

Interactions

2023-12-12T20:33:49.430600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:33:56.012563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
모집구분모집상태모집정원
모집구분1.0000.1030.500
모집상태0.1031.0000.000
모집정원0.5000.0001.000
2023-12-12T20:33:56.157213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
모집구분모집상태
모집구분1.0000.170
모집상태0.1701.000
2023-12-12T20:33:56.303709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
모집정원모집구분모집상태
모집정원1.0000.2530.000
모집구분0.2531.0000.170
모집상태0.0000.1701.000

Missing values

2023-12-12T20:33:49.664324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:33:49.948537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

모집구분모집상태모집공고명모집시작일모집종료일모집정원지역
0수시모집마감2021 옥천군 아이돌보미 수시모집2021-07-302021-08-130충북 옥천군
1수시모집마감2021 옥천군 아이돌보미 수시모집2021-07-092021-07-230충북 옥천군
2정기모집마감2021 옥천군 아이돌보미 정기 모집2021-05-262021-06-090충북 옥천군
3정기모집마감2021 옥천군 아이돌보미 정기모집2021-05-062021-05-210충북 옥천군
4정기모집마감2021 태백시 아이돌보미 2차2021-03-162021-03-260강원 태백시
5정기모집마감2021 태백시 아이돌보미 1차2021-02-182021-03-030강원 태백시
6정기모집마감2021 평창군 아이돌보미 정기모집 3차2021-07-262021-08-120강원 평창군
7수시모집모집중2021 평창군 아이돌보미 수시모집 2차 (자격증 소지자 또는 양성교육 수료자)2021-06-162021-12-310강원 평창군
8정기모집마감2021 평창군 아이돌보미 정기모집 1차2021-03-262021-03-300강원 평창군
9정기모집마감2021 평창군 아이돌보미 정기모집 1차2021-03-152021-03-250강원 평창군
모집구분모집상태모집공고명모집시작일모집종료일모집정원지역
643정기모집마감2021 중구 아이돌보미 정기모집 2차2021-07-012021-07-230대전 중구
644수시모집마감2021 중구 아이돌보미 수시모집 1차2021-06-012021-10-290대전 중구
645정기모집마감2021 중구 아이돌보미 1차2021-03-302021-03-310대전 중구
646정기모집마감2021 중구 아이돌보미 1차2021-03-122021-03-260대전 중구
647정기모집마감2021 중구 아이돌보미 1차2021-01-262021-01-312대전 중구
648특별모집마감2021 중구 아이돌보미 1차2021-01-252021-01-2688대전 중구
649정기모집모집중2021 서구 아이돌보미 정기모집 2차2021-08-132021-09-025부산 서구
650수시모집마감2021 서구 아이돌보미 1차2021-03-192021-03-201부산 서구
651정기모집마감2021 부산서구 아이돌보미 1차 정기모집2021-03-052021-03-2420부산 서구
652특별모집마감2021 서구 아이돌보미 특별모집 1차2021-01-122021-01-140부산 서구

Duplicate rows

Most frequently occurring

모집구분모집상태모집공고명모집시작일모집종료일모집정원지역# duplicates
0수시모집마감2021 영주시 아이돌보미 1차2021-03-292021-04-050경북 영주시2