Overview

Dataset statistics

Number of variables5
Number of observations98
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory42.3 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description한국건강가정진흥원에서 제공하는 가족서비스지원 가족역량강화 지원사업 기관 현황입니다.파일데이터 항목 구성은 연번, 지역, 시설명, 전화번호, 사업대상년도입니다.
Author한국건강가정진흥원
URLhttps://www.data.go.kr/data/3081671/fileData.do

Alerts

사업대상년도 has constant value ""Constant
연번 is highly overall correlated with 지역High correlation
지역 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:01:48.899087
Analysis finished2023-12-12 12:01:49.708297
Duration0.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct98
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49.5
Minimum1
Maximum98
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1014.0 B
2023-12-12T21:01:49.784478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.85
Q125.25
median49.5
Q373.75
95-th percentile93.15
Maximum98
Range97
Interquartile range (IQR)48.5

Descriptive statistics

Standard deviation28.434134
Coefficient of variation (CV)0.57442696
Kurtosis-1.2
Mean49.5
Median Absolute Deviation (MAD)24.5
Skewness0
Sum4851
Variance808.5
MonotonicityStrictly increasing
2023-12-12T21:01:49.929464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
75 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
67 1
 
1.0%
66 1
 
1.0%
Other values (88) 88
89.8%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%
90 1
1.0%
89 1
1.0%

지역
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)17.3%
Missing0
Missing (%)0.0%
Memory size916.0 B
서울특별시
11 
경상남도
경상북도
경기도
전라남도
Other values (12)
55 

Length

Max length5
Median length4
Mean length4.244898
Min length3

Unique

Unique2 ?
Unique (%)2.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 11
11.2%
경상남도 9
9.2%
경상북도 8
 
8.2%
경기도 8
 
8.2%
전라남도 7
 
7.1%
인천광역시 7
 
7.1%
대구광역시 7
 
7.1%
충청남도 7
 
7.1%
부산광역시 7
 
7.1%
강원도 6
 
6.1%
Other values (7) 21
21.4%

Length

2023-12-12T21:01:50.415900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 11
11.2%
경상남도 9
9.2%
경상북도 8
 
8.2%
경기도 8
 
8.2%
대구광역시 7
 
7.1%
충청남도 7
 
7.1%
부산광역시 7
 
7.1%
인천광역시 7
 
7.1%
전라남도 7
 
7.1%
강원도 6
 
6.1%
Other values (7) 21
21.4%

센터
Text

Distinct90
Distinct (%)91.8%
Missing0
Missing (%)0.0%
Memory size916.0 B
2023-12-12T21:01:50.718150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length8
Mean length8.3163265
Min length7

Characters and Unicode

Total characters815
Distinct characters96
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique86 ?
Unique (%)87.8%

Sample

1st row관악구 가족센터
2nd row구로구 가족센터
3rd row금천구 가족센터
4th row도봉구 가족센터
5th row동대문구 가족센터
ValueCountFrequency (%)
가족센터 87
44.8%
건강가정지원센터 9
 
4.6%
서구 4
 
2.1%
남구 3
 
1.5%
북구 3
 
1.5%
동구 2
 
1.0%
청주시 1
 
0.5%
관악구 1
 
0.5%
남원시 1
 
0.5%
목포시 1
 
0.5%
Other values (82) 82
42.3%
2023-12-12T21:01:51.244990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
98
12.0%
98
12.0%
98
12.0%
96
11.8%
89
10.9%
50
 
6.1%
40
 
4.9%
12
 
1.5%
12
 
1.5%
11
 
1.3%
Other values (86) 211
25.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 719
88.2%
Space Separator 96
 
11.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
98
13.6%
98
13.6%
98
13.6%
89
12.4%
50
 
7.0%
40
 
5.6%
12
 
1.7%
12
 
1.7%
11
 
1.5%
11
 
1.5%
Other values (85) 200
27.8%
Space Separator
ValueCountFrequency (%)
96
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 719
88.2%
Common 96
 
11.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
98
13.6%
98
13.6%
98
13.6%
89
12.4%
50
 
7.0%
40
 
5.6%
12
 
1.7%
12
 
1.7%
11
 
1.5%
11
 
1.5%
Other values (85) 200
27.8%
Common
ValueCountFrequency (%)
96
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 719
88.2%
ASCII 96
 
11.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
98
13.6%
98
13.6%
98
13.6%
89
12.4%
50
 
7.0%
40
 
5.6%
12
 
1.7%
12
 
1.7%
11
 
1.5%
11
 
1.5%
Other values (85) 200
27.8%
ASCII
ValueCountFrequency (%)
96
100.0%
Distinct97
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size916.0 B
2023-12-12T21:01:51.527213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.918367
Min length11

Characters and Unicode

Total characters1168
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique96 ?
Unique (%)98.0%

Sample

1st row02-883-9390
2nd row02-830-0450
3rd row02-803-7747
4th row02-995-6800
5th row02-957-0760
ValueCountFrequency (%)
061-659-4167 2
 
2.0%
02-883-9390 1
 
1.0%
061-797-6800 1
 
1.0%
063-838-6046 1
 
1.0%
063-261-1033 1
 
1.0%
063-631-6700 1
 
1.0%
063-545-8506 1
 
1.0%
063-443-5300 1
 
1.0%
041-670-2396 1
 
1.0%
070-7733-8300 1
 
1.0%
Other values (87) 87
88.8%
2023-12-12T21:01:51.986599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 196
16.8%
0 183
15.7%
3 137
11.7%
5 116
9.9%
2 102
8.7%
6 92
7.9%
1 80
6.8%
4 77
 
6.6%
7 70
 
6.0%
8 59
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 972
83.2%
Dash Punctuation 196
 
16.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 183
18.8%
3 137
14.1%
5 116
11.9%
2 102
10.5%
6 92
9.5%
1 80
8.2%
4 77
7.9%
7 70
 
7.2%
8 59
 
6.1%
9 56
 
5.8%
Dash Punctuation
ValueCountFrequency (%)
- 196
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1168
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 196
16.8%
0 183
15.7%
3 137
11.7%
5 116
9.9%
2 102
8.7%
6 92
7.9%
1 80
6.8%
4 77
 
6.6%
7 70
 
6.0%
8 59
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1168
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 196
16.8%
0 183
15.7%
3 137
11.7%
5 116
9.9%
2 102
8.7%
6 92
7.9%
1 80
6.8%
4 77
 
6.6%
7 70
 
6.0%
8 59
 
5.1%

사업대상년도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size916.0 B
2023년도
98 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023년도
2nd row2023년도
3rd row2023년도
4th row2023년도
5th row2023년도

Common Values

ValueCountFrequency (%)
2023년도 98
100.0%

Length

2023-12-12T21:01:52.136451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:01:52.226631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023년도 98
100.0%

Interactions

2023-12-12T21:01:49.476740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:01:52.290796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역센터전화번호
연번1.0000.9740.7780.942
지역0.9741.0000.0001.000
센터0.7780.0001.0000.996
전화번호0.9421.0000.9961.000
2023-12-12T21:01:52.404644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역
연번1.0000.843
지역0.8431.000

Missing values

2023-12-12T21:01:49.576249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:01:49.672902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번지역센터전화번호사업대상년도
01서울특별시관악구 가족센터02-883-93902023년도
12서울특별시구로구 가족센터02-830-04502023년도
23서울특별시금천구 가족센터02-803-77472023년도
34서울특별시도봉구 가족센터02-995-68002023년도
45서울특별시동대문구 가족센터02-957-07602023년도
56서울특별시서대문구 가족센터02-322-75952023년도
67서울특별시서초구 가족센터02-576-28522023년도
78서울특별시영등포구 가족센터02-2678-21932023년도
89서울특별시은평구 가족센터02-376-37592023년도
910서울특별시종로구 가족센터02-764-35242023년도
연번지역센터전화번호사업대상년도
8889경상남도김해시 가족센터055-329-63552023년도
8990경상남도밀양시 가족센터055-351-44042023년도
9091경상남도사천시 가족센터055-832-03452023년도
9192경상남도양산시 가족센터055-382-09882023년도
9293경상남도통영시 가족센터055-640-77412023년도
9394경상남도하동군 가족센터055-880-65202023년도
9495경상남도함안군 가족센터055-582-57902023년도
9596경상남도거제시 가족센터055-682-49582023년도
9697제주시서귀포시 가족센터064-732-64822023년도
9798제주시제주시 가족센터064-725-80052023년도