Overview

Dataset statistics

Number of variables5
Number of observations232
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.2 KiB
Average record size in memory40.6 B

Variable types

Categorical2
Text3

Dataset

Description전국 다문화가족지원센터 시설현황 정보입니다. 소재 지역, 시군구명, 센터유형, 주소(우편번호), 연락처가 기재되어 있습니다.
Author여성가족부
URLhttps://www.data.go.kr/data/3077033/fileData.do

Alerts

센터유형 is highly imbalanced (70.8%)Imbalance
주소 has unique valuesUnique
전화 has unique valuesUnique

Reproduction

Analysis started2024-04-19 05:41:07.679716
Analysis finished2024-04-19 05:41:08.105022
Duration0.43 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

Distinct17
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
경기
31 
서울
26 
경북
24 
전남
22 
경남
20 
Other values (12)
109 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row서울
2nd row서울
3rd row서울
4th row서울
5th row서울

Common Values

ValueCountFrequency (%)
경기 31
13.4%
서울 26
11.2%
경북 24
10.3%
전남 22
9.5%
경남 20
8.6%
강원 18
7.8%
충남 15
6.5%
전북 14
 
6.0%
부산 14
 
6.0%
충북 12
 
5.2%
Other values (7) 36
15.5%

Length

2024-04-19T14:41:08.179363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기 31
13.4%
서울 26
11.2%
경북 24
10.3%
전남 22
9.5%
경남 20
8.6%
강원 18
7.8%
충남 15
6.5%
부산 14
 
6.0%
전북 14
 
6.0%
충북 12
 
5.2%
Other values (7) 36
15.5%
Distinct212
Distinct (%)91.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-04-19T14:41:08.468661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3
Min length2

Characters and Unicode

Total characters696
Distinct characters137
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique205 ?
Unique (%)88.4%

Sample

1st row서울특별시
2nd row강남구
3rd row강동구
4th row강북구
5th row강서구
ValueCountFrequency (%)
동구 6
 
2.6%
서구 5
 
2.2%
남구 4
 
1.7%
북구 4
 
1.7%
중구 4
 
1.7%
고성군 2
 
0.9%
청주시 2
 
0.9%
무안군 1
 
0.4%
여수시 1
 
0.4%
신안군 1
 
0.4%
Other values (202) 202
87.1%
2024-04-19T14:41:08.895855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
85
 
12.2%
82
 
11.8%
72
 
10.3%
22
 
3.2%
21
 
3.0%
18
 
2.6%
17
 
2.4%
17
 
2.4%
17
 
2.4%
14
 
2.0%
Other values (127) 331
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 696
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
85
 
12.2%
82
 
11.8%
72
 
10.3%
22
 
3.2%
21
 
3.0%
18
 
2.6%
17
 
2.4%
17
 
2.4%
17
 
2.4%
14
 
2.0%
Other values (127) 331
47.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 696
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
85
 
12.2%
82
 
11.8%
72
 
10.3%
22
 
3.2%
21
 
3.0%
18
 
2.6%
17
 
2.4%
17
 
2.4%
17
 
2.4%
14
 
2.0%
Other values (127) 331
47.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 696
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
85
 
12.2%
82
 
11.8%
72
 
10.3%
22
 
3.2%
21
 
3.0%
18
 
2.6%
17
 
2.4%
17
 
2.4%
17
 
2.4%
14
 
2.0%
Other values (127) 331
47.6%

센터유형
Categorical

IMBALANCE 

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
가족센터
211 
다가센터
 
20
다문화거점센터
 
1

Length

Max length7
Median length4
Mean length4.012931
Min length4

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row가족센터
2nd row가족센터
3rd row가족센터
4th row가족센터
5th row가족센터

Common Values

ValueCountFrequency (%)
가족센터 211
90.9%
다가센터 20
 
8.6%
다문화거점센터 1
 
0.4%

Length

2024-04-19T14:41:09.024344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-19T14:41:09.151892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가족센터 211
90.9%
다가센터 20
 
8.6%
다문화거점센터 1
 
0.4%

주소
Text

UNIQUE 

Distinct232
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-04-19T14:41:09.445376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length44
Mean length35.49569
Min length23

Characters and Unicode

Total characters8235
Distinct characters321
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique232 ?
Unique (%)100.0%

Sample

1st row(04628) 서울특별시 중구 소파로4길 6
2nd row(06336) 서울특별시 강남구 개포로 617-8
3rd row(05266) 서울특별시 강동구 양재대로 1634, 3층
4th row(01064) 서울특별시 강북구 한천로 129길 6
5th row(07781) 서울특별시 강서구 강서로 5길 50, 곰달래 문화복지센터 4층
ValueCountFrequency (%)
2층 40
 
2.5%
3층 33
 
2.1%
경기도 31
 
1.9%
서울특별시 26
 
1.6%
경상북도 23
 
1.4%
전라남도 22
 
1.4%
경상남도 20
 
1.3%
4층 19
 
1.2%
강원도 18
 
1.1%
충청남도 15
 
0.9%
Other values (1165) 1346
84.5%
2024-04-19T14:41:09.918379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1372
 
16.7%
2 334
 
4.1%
1 331
 
4.0%
( 298
 
3.6%
) 298
 
3.6%
3 281
 
3.4%
4 236
 
2.9%
5 229
 
2.8%
183
 
2.2%
0 183
 
2.2%
Other values (311) 4490
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3994
48.5%
Decimal Number 2127
25.8%
Space Separator 1372
 
16.7%
Open Punctuation 298
 
3.6%
Close Punctuation 298
 
3.6%
Other Punctuation 82
 
1.0%
Dash Punctuation 47
 
0.6%
Uppercase Letter 13
 
0.2%
Math Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
183
 
4.6%
179
 
4.5%
171
 
4.3%
153
 
3.8%
116
 
2.9%
104
 
2.6%
96
 
2.4%
85
 
2.1%
83
 
2.1%
79
 
2.0%
Other values (285) 2745
68.7%
Decimal Number
ValueCountFrequency (%)
2 334
15.7%
1 331
15.6%
3 281
13.2%
4 236
11.1%
5 229
10.8%
0 183
8.6%
7 158
7.4%
6 143
6.7%
9 121
 
5.7%
8 111
 
5.2%
Uppercase Letter
ValueCountFrequency (%)
B 3
23.1%
A 2
15.4%
C 2
15.4%
Y 2
15.4%
W 2
15.4%
H 1
 
7.7%
L 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 78
95.1%
. 2
 
2.4%
/ 1
 
1.2%
· 1
 
1.2%
Space Separator
ValueCountFrequency (%)
1372
100.0%
Open Punctuation
ValueCountFrequency (%)
( 298
100.0%
Close Punctuation
ValueCountFrequency (%)
) 298
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4228
51.3%
Hangul 3994
48.5%
Latin 13
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
183
 
4.6%
179
 
4.5%
171
 
4.3%
153
 
3.8%
116
 
2.9%
104
 
2.6%
96
 
2.4%
85
 
2.1%
83
 
2.1%
79
 
2.0%
Other values (285) 2745
68.7%
Common
ValueCountFrequency (%)
1372
32.5%
2 334
 
7.9%
1 331
 
7.8%
( 298
 
7.0%
) 298
 
7.0%
3 281
 
6.6%
4 236
 
5.6%
5 229
 
5.4%
0 183
 
4.3%
7 158
 
3.7%
Other values (9) 508
 
12.0%
Latin
ValueCountFrequency (%)
B 3
23.1%
A 2
15.4%
C 2
15.4%
Y 2
15.4%
W 2
15.4%
H 1
 
7.7%
L 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4240
51.5%
Hangul 3994
48.5%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1372
32.4%
2 334
 
7.9%
1 331
 
7.8%
( 298
 
7.0%
) 298
 
7.0%
3 281
 
6.6%
4 236
 
5.6%
5 229
 
5.4%
0 183
 
4.3%
7 158
 
3.7%
Other values (15) 520
 
12.3%
Hangul
ValueCountFrequency (%)
183
 
4.6%
179
 
4.5%
171
 
4.3%
153
 
3.8%
116
 
2.9%
104
 
2.6%
96
 
2.4%
85
 
2.1%
83
 
2.1%
79
 
2.0%
Other values (285) 2745
68.7%
None
ValueCountFrequency (%)
· 1
100.0%

전화
Text

UNIQUE 

Distinct232
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2024-04-19T14:41:10.149213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.900862
Min length6

Characters and Unicode

Total characters2761
Distinct characters17
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique232 ?
Unique (%)100.0%

Sample

1st row02-318-0227
2nd row02-3412-2222
3rd row02-471-0812
4th row02-987-2567
5th row02-2606-2017
ValueCountFrequency (%)
02-318-0227 1
 
0.4%
061-692-4172 1
 
0.4%
054-443-0541 1
 
0.4%
061-433-9004 1
 
0.4%
061-832-5399 1
 
0.4%
061-362-5411 1
 
0.4%
061-797-6800 1
 
0.4%
061-781-8003 1
 
0.4%
061-339-9800 1
 
0.4%
061-383-3655 1
 
0.4%
Other values (223) 223
95.7%
2024-04-19T14:41:10.501436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 462
16.7%
0 397
14.4%
3 356
12.9%
5 275
10.0%
1 225
8.1%
2 223
8.1%
4 218
7.9%
6 172
 
6.2%
8 148
 
5.4%
7 142
 
5.1%
Other values (7) 143
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2292
83.0%
Dash Punctuation 462
 
16.7%
Other Letter 3
 
0.1%
Space Separator 2
 
0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 397
17.3%
3 356
15.5%
5 275
12.0%
1 225
9.8%
2 223
9.7%
4 218
9.5%
6 172
7.5%
8 148
 
6.5%
7 142
 
6.2%
9 136
 
5.9%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 462
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2758
99.9%
Hangul 3
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 462
16.8%
0 397
14.4%
3 356
12.9%
5 275
10.0%
1 225
8.2%
2 223
8.1%
4 218
7.9%
6 172
 
6.2%
8 148
 
5.4%
7 142
 
5.1%
Other values (4) 140
 
5.1%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2758
99.9%
Hangul 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 462
16.8%
0 397
14.4%
3 356
12.9%
5 275
10.0%
1 225
8.2%
2 223
8.1%
4 218
7.9%
6 172
 
6.2%
8 148
 
5.4%
7 142
 
5.1%
Other values (4) 140
 
5.1%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Correlations

2024-04-19T14:41:10.590982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역센터유형
지역1.0000.332
센터유형0.3321.000
2024-04-19T14:41:10.664677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
센터유형지역
센터유형1.0000.183
지역0.1831.000
2024-04-19T14:41:10.740047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역센터유형
지역1.0000.183
센터유형0.1831.000

Missing values

2024-04-19T14:41:07.968070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T14:41:08.064205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역시군구명센터유형주소전화
0서울서울특별시가족센터(04628) 서울특별시 중구 소파로4길 602-318-0227
1서울강남구가족센터(06336) 서울특별시 강남구 개포로 617-802-3412-2222
2서울강동구가족센터(05266) 서울특별시 강동구 양재대로 1634, 3층02-471-0812
3서울강북구가족센터(01064) 서울특별시 강북구 한천로 129길 602-987-2567
4서울강서구가족센터(07781) 서울특별시 강서구 강서로 5길 50, 곰달래 문화복지센터 4층02-2606-2017
5서울관악구가족센터(08825) 서울특별시 관악구 신림로 3길 35 김삼준문화복지기념관 3층 사무실02-883-9383
6서울광진구가족센터(05072) 서울특별시 광진구 아차산로 24길 17, 자양공공힐링센터 5층02-458-0666
7서울구로구가족센터(08383) 서울특별시 구로구 우마2길 35, 구로구가족통합지원센터 2,3층02-869-0317
8서울금천구가족센터(08627) 서울특별시 금천구 금하로 11길 4002-803-7747
9서울노원구가족센터(01857) 서울특별시 노원구 동일로173가길 94 가온빌딩 3층02-979-3501
지역시군구명센터유형주소전화
222경기부천시다가센터(14646) 경기도 부천시 조종로 68번가길 4(원미동)032-327-1370
223경기안산시다가센터(15385) 경기도 안산시 단원구 화정로 26, 안산글로벌다문화센터 2층(초지동)031-599-1700
224충북청주시다가센터(28806) 충청북도 청주시 상당구 남일면 단재로 480 상당보건소 내 2층043-293-8887
225충남천안시다가센터(31129) 충청남도 천안시 동남구 은행길 15, 3층 천안시다문화가족지원센터041-558-8653
226충남당진시다가센터(31772) 충청남도 당진시 시청 1로 38 수청동(1005, 당진시종합복지타운 4층)041-360-3160
227전북전주시다가센터(54935) 전라북도 전주시 덕진구 팔달로 336, 5~7층063-243-0333
228경북청송군다가센터(37433) 경상북도 청송군 청송읍 복지타운길 77054-870-6790
229경북영양군다가센터(36540) 경상북도 영양군 영양읍 군민회관길 36054-683-5432
230경북고령군다가센터(40138) 경상북도 고령군 대가야읍 왕릉로 30 문화누리 3층054-956-6336
231경남경상남도다문화거점센터(51140) 경상남도 창원시 의창구 창원대학로 20 (사림동) 21호관 404호055-274-8338