Overview

Dataset statistics

Number of variables5
Number of observations53
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory43.5 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description한국건강가정진흥원 아이돌봄지원사업 돌보미 양성교육 기관현황입니다.파일데이터 제공항목은 연번, 시도명, 교육기관명, 주소, 운영시간입니다.
Author한국건강가정진흥원
URLhttps://www.data.go.kr/data/3081657/fileData.do

Alerts

연번 is highly overall correlated with 시도명High correlation
시도명 is highly overall correlated with 연번High correlation
운영시간 is highly imbalanced (59.5%)Imbalance
연번 has unique valuesUnique
교육기관명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:09:20.436264
Analysis finished2023-12-12 22:09:20.987533
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct53
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27
Minimum1
Maximum53
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size609.0 B
2023-12-13T07:09:21.075539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.6
Q114
median27
Q340
95-th percentile50.4
Maximum53
Range52
Interquartile range (IQR)26

Descriptive statistics

Standard deviation15.443445
Coefficient of variation (CV)0.57197945
Kurtosis-1.2
Mean27
Median Absolute Deviation (MAD)13
Skewness0
Sum1431
Variance238.5
MonotonicityStrictly increasing
2023-12-13T07:09:21.554022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.9%
41 1
 
1.9%
30 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
36 1
 
1.9%
37 1
 
1.9%
Other values (43) 43
81.1%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
53 1
1.9%
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
45 1
1.9%
44 1
1.9%

시도명
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)32.1%
Missing0
Missing (%)0.0%
Memory size556.0 B
서울특별시
12 
경상북도
경기도
강원도
충청남도
Other values (12)
17 

Length

Max length7
Median length5
Mean length4.3584906
Min length3

Unique

Unique7 ?
Unique (%)13.2%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 12
22.6%
경상북도 9
17.0%
경기도 6
11.3%
강원도 5
9.4%
충청남도 4
 
7.5%
부산광역시 2
 
3.8%
제주특별자치도 2
 
3.8%
충청북도 2
 
3.8%
광주광역시 2
 
3.8%
인천광역시 2
 
3.8%
Other values (7) 7
13.2%

Length

2023-12-13T07:09:21.731882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 12
22.6%
경상북도 9
17.0%
경기도 6
11.3%
강원도 5
9.4%
충청남도 4
 
7.5%
충청북도 2
 
3.8%
인천광역시 2
 
3.8%
광주광역시 2
 
3.8%
제주특별자치도 2
 
3.8%
부산광역시 2
 
3.8%
Other values (7) 7
13.2%

교육기관명
Text

UNIQUE 

Distinct53
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size556.0 B
2023-12-13T07:09:22.011373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length10.698113
Min length6

Characters and Unicode

Total characters567
Distinct characters110
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)100.0%

Sample

1st row강서여성인력개발센터
2nd row북부여성발전센터
3rd row서대문여성인력개발센터
4th row서초여성인력개발센터
5th row종로여성인력개발센터
ValueCountFrequency (%)
강서여성인력개발센터 1
 
1.7%
전라남도 1
 
1.7%
정선여성새로일하기센터 1
 
1.7%
상지대학교부설평생교육원 1
 
1.7%
진천군가족센터 1
 
1.7%
충청북도아이돌봄광역지원센터 1
 
1.7%
논산여성인력개발센터 1
 
1.7%
한서대학교 1
 
1.7%
부설 1
 
1.7%
평생교육원 1
 
1.7%
Other values (49) 49
83.1%
2023-12-13T07:09:22.435600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
6.0%
34
 
6.0%
27
 
4.8%
25
 
4.4%
24
 
4.2%
22
 
3.9%
21
 
3.7%
21
 
3.7%
20
 
3.5%
17
 
3.0%
Other values (100) 322
56.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 541
95.4%
Uppercase Letter 20
 
3.5%
Space Separator 6
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
6.3%
34
 
6.3%
27
 
5.0%
25
 
4.6%
24
 
4.4%
22
 
4.1%
21
 
3.9%
21
 
3.9%
20
 
3.7%
17
 
3.1%
Other values (92) 296
54.7%
Uppercase Letter
ValueCountFrequency (%)
C 5
25.0%
Y 4
20.0%
A 4
20.0%
W 3
15.0%
M 2
 
10.0%
K 1
 
5.0%
E 1
 
5.0%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 541
95.4%
Latin 20
 
3.5%
Common 6
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
6.3%
34
 
6.3%
27
 
5.0%
25
 
4.6%
24
 
4.4%
22
 
4.1%
21
 
3.9%
21
 
3.9%
20
 
3.7%
17
 
3.1%
Other values (92) 296
54.7%
Latin
ValueCountFrequency (%)
C 5
25.0%
Y 4
20.0%
A 4
20.0%
W 3
15.0%
M 2
 
10.0%
K 1
 
5.0%
E 1
 
5.0%
Common
ValueCountFrequency (%)
6
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 541
95.4%
ASCII 26
 
4.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
34
 
6.3%
34
 
6.3%
27
 
5.0%
25
 
4.6%
24
 
4.4%
22
 
4.1%
21
 
3.9%
21
 
3.9%
20
 
3.7%
17
 
3.1%
Other values (92) 296
54.7%
ASCII
ValueCountFrequency (%)
6
23.1%
C 5
19.2%
Y 4
15.4%
A 4
15.4%
W 3
11.5%
M 2
 
7.7%
K 1
 
3.8%
E 1
 
3.8%

주소
Text

UNIQUE 

Distinct53
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size556.0 B
2023-12-13T07:09:22.749089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length32
Mean length24.716981
Min length13

Characters and Unicode

Total characters1310
Distinct characters200
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)100.0%

Sample

1st row서울시 강서구 까치산로 134 화곡빌딩 5층
2nd row서울 노원구 동일로 207길 50
3rd row서울 서대문구 신촌역로 10 (대현동, 혜우빌딩4층)
4th row서울 서초구 강남대로 216 양재프라자 3층
5th row서울 종로구 대학로 11길 23 대학로스타시티빌딩 2~4층
ValueCountFrequency (%)
서울 9
 
2.9%
3층 8
 
2.6%
경북 8
 
2.6%
2층 7
 
2.3%
경기 6
 
2.0%
강원 4
 
1.3%
충남 4
 
1.3%
5층 4
 
1.3%
중앙로 3
 
1.0%
4층 3
 
1.0%
Other values (237) 250
81.7%
2023-12-13T07:09:23.192381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
253
 
19.3%
47
 
3.6%
1 41
 
3.1%
36
 
2.7%
3 35
 
2.7%
34
 
2.6%
2 32
 
2.4%
29
 
2.2%
25
 
1.9%
5 24
 
1.8%
Other values (190) 754
57.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 812
62.0%
Space Separator 253
 
19.3%
Decimal Number 211
 
16.1%
Close Punctuation 8
 
0.6%
Open Punctuation 8
 
0.6%
Dash Punctuation 7
 
0.5%
Uppercase Letter 5
 
0.4%
Other Punctuation 3
 
0.2%
Math Symbol 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
5.8%
36
 
4.4%
34
 
4.2%
29
 
3.6%
25
 
3.1%
21
 
2.6%
21
 
2.6%
19
 
2.3%
19
 
2.3%
18
 
2.2%
Other values (170) 543
66.9%
Decimal Number
ValueCountFrequency (%)
1 41
19.4%
3 35
16.6%
2 32
15.2%
5 24
11.4%
0 18
8.5%
4 17
8.1%
7 16
 
7.6%
6 11
 
5.2%
9 9
 
4.3%
8 8
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
A 2
40.0%
W 1
20.0%
Y 1
20.0%
C 1
20.0%
Space Separator
ValueCountFrequency (%)
253
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 812
62.0%
Common 493
37.6%
Latin 5
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
5.8%
36
 
4.4%
34
 
4.2%
29
 
3.6%
25
 
3.1%
21
 
2.6%
21
 
2.6%
19
 
2.3%
19
 
2.3%
18
 
2.2%
Other values (170) 543
66.9%
Common
ValueCountFrequency (%)
253
51.3%
1 41
 
8.3%
3 35
 
7.1%
2 32
 
6.5%
5 24
 
4.9%
0 18
 
3.7%
4 17
 
3.4%
7 16
 
3.2%
6 11
 
2.2%
9 9
 
1.8%
Other values (6) 37
 
7.5%
Latin
ValueCountFrequency (%)
A 2
40.0%
W 1
20.0%
Y 1
20.0%
C 1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 812
62.0%
ASCII 498
38.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
253
50.8%
1 41
 
8.2%
3 35
 
7.0%
2 32
 
6.4%
5 24
 
4.8%
0 18
 
3.6%
4 17
 
3.4%
7 16
 
3.2%
6 11
 
2.2%
9 9
 
1.8%
Other values (10) 42
 
8.4%
Hangul
ValueCountFrequency (%)
47
 
5.8%
36
 
4.4%
34
 
4.2%
29
 
3.6%
25
 
3.1%
21
 
2.6%
21
 
2.6%
19
 
2.3%
19
 
2.3%
18
 
2.2%
Other values (170) 543
66.9%

운영시간
Categorical

IMBALANCE 

Distinct3
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Memory size556.0 B
평일 09:00~18:00
46 
평일 09:00~22:00
평일 09:00~22:30
 
1

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique1 ?
Unique (%)1.9%

Sample

1st row평일 09:00~18:00
2nd row평일 09:00~18:00
3rd row평일 09:00~22:00
4th row평일 09:00~22:00
5th row평일 09:00~22:00

Common Values

ValueCountFrequency (%)
평일 09:00~18:00 46
86.8%
평일 09:00~22:00 6
 
11.3%
평일 09:00~22:30 1
 
1.9%

Length

2023-12-13T07:09:23.342887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:09:23.422199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
평일 53
50.0%
09:00~18:00 46
43.4%
09:00~22:00 6
 
5.7%
09:00~22:30 1
 
0.9%

Interactions

2023-12-13T07:09:20.736094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:09:23.482479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시도명교육기관명주소운영시간
연번1.0000.9041.0001.0000.624
시도명0.9041.0001.0001.0000.676
교육기관명1.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.000
운영시간0.6240.6761.0001.0001.000
2023-12-13T07:09:23.568625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명운영시간
시도명1.0000.401
운영시간0.4011.000
2023-12-13T07:09:23.638208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시도명운영시간
연번1.0000.6120.432
시도명0.6121.0000.401
운영시간0.4320.4011.000

Missing values

2023-12-13T07:09:20.850676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:09:20.953393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시도명교육기관명주소운영시간
01서울특별시강서여성인력개발센터서울시 강서구 까치산로 134 화곡빌딩 5층평일 09:00~18:00
12서울특별시북부여성발전센터서울 노원구 동일로 207길 50평일 09:00~18:00
23서울특별시서대문여성인력개발센터서울 서대문구 신촌역로 10 (대현동, 혜우빌딩4층)평일 09:00~22:00
34서울특별시서초여성인력개발센터서울 서초구 강남대로 216 양재프라자 3층평일 09:00~22:00
45서울특별시종로여성인력개발센터서울 종로구 대학로 11길 23 대학로스타시티빌딩 2~4층평일 09:00~22:00
56서울특별시은평여성인력개발센터서울 은평구 녹번로 76평일 09:00~22:00
67서울특별시강북여성인력개발센터서울 강북구 덕릉로 108(미아동) 현웅빌딩 3층평일 09:00~22:00
78서울특별시송파여성인력개발센터서울 송파구 중대로9길 34(가락동) 대호빌딩 2층평일 09:00~22:00
89서울특별시구로여성인력개발센터서울 구로구 공원로 63 희훈타워빌 2층평일 09:00~18:00
910서울특별시성동여성인력개발센터서울 성동구 성수이로22길 37 성수아크벨리 5층평일 09:00~18:00
연번시도명교육기관명주소운영시간
4344경상북도경주가경사회서비스지원센터경주시 동문로 50, 3층평일 09:00~18:00
4445경상북도안동YMCA경북 안동시 영가로 19 삼보빌딩 3층평일 09:00~18:00
4546경상북도구미시가족센터경북 구미시 산책길 73 가족행복플라자평일 09:00~18:00
4647경상북도동양대학교산학협력단경북 영주시 풍기읍 동양대로 145평일 09:00~18:00
4748경상북도대경대학교평생교육원경북 경산시 자인면 단북1길 65평일 09:00~18:00
4849경상북도경북도립대학교평생교육원경북 예천군 예천읍 도립대학길 114평일 09:00~18:00
4950경상북도칠곡여성인력개발센터경북 칠곡군 왜관읍 평장2길 20평일 09:00~18:00
5051경상남도경상남도건강가정지원센터경남 창원시 마산회원구 봉암북7길 21 경남테크노파크 정보산업진흥본부 4동 301호평일 09:00~18:00
5152제주특별자치도제주여성인력개발센터제주 제주시 중앙로 165 제주고용복지플러스센터 4층평일 09:00~18:00
5253제주특별자치도서귀포여성새로일하기센터제주 서귀포시 부두로 3 서귀포YWCA평일 09:00~18:00