Overview

Dataset statistics

Number of variables6
Number of observations231
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.2 KiB
Average record size in memory49.6 B

Variable types

Numeric1
Categorical1
Text4

Dataset

Description한국건강가정진흥원에서 제공하는 한국어교육 운영기관 현황정보입니다.파일데이터 제공항목은 NO, 지역명, 시군구, 센터명, 주소, 연락처입니다.
Author한국건강가정진흥원
URLhttps://www.data.go.kr/data/15025593/fileData.do

Alerts

순번(NO) is highly overall correlated with 지역명High correlation
지역명 is highly overall correlated with 순번(NO)High correlation
순번(NO) has unique valuesUnique
센터명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:38:00.805407
Analysis finished2023-12-12 04:38:01.671334
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번(NO)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct231
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean116
Minimum1
Maximum231
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-12T13:38:01.772739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.5
Q158.5
median116
Q3173.5
95-th percentile219.5
Maximum231
Range230
Interquartile range (IQR)115

Descriptive statistics

Standard deviation66.828138
Coefficient of variation (CV)0.57610464
Kurtosis-1.2
Mean116
Median Absolute Deviation (MAD)58
Skewness0
Sum26796
Variance4466
MonotonicityStrictly increasing
2023-12-12T13:38:02.011356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
160 1
 
0.4%
148 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
155 1
 
0.4%
Other values (221) 221
95.7%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
231 1
0.4%
230 1
0.4%
229 1
0.4%
228 1
0.4%
227 1
0.4%
226 1
0.4%
225 1
0.4%
224 1
0.4%
223 1
0.4%
222 1
0.4%

지역명
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)7.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
경기도
31 
서울특별시
25 
경상북도
23 
전라남도
22 
경상남도
19 
Other values (12)
111 

Length

Max length7
Median length5
Mean length4.1471861
Min length3

Unique

Unique1 ?
Unique (%)0.4%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
경기도 31
13.4%
서울특별시 25
10.8%
경상북도 23
10.0%
전라남도 22
9.5%
경상남도 19
8.2%
강원도 18
7.8%
부산광역시 16
6.9%
충청남도 15
6.5%
전라북도 14
 
6.1%
충청북도 12
 
5.2%
Other values (7) 36
15.6%

Length

2023-12-12T13:38:02.253440image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 31
13.4%
서울특별시 25
10.8%
경상북도 23
10.0%
전라남도 22
9.5%
경상남도 19
8.2%
강원도 18
7.8%
부산광역시 16
6.9%
충청남도 15
6.5%
전라북도 14
 
6.1%
충청북도 12
 
5.2%
Other values (7) 36
15.6%
Distinct206
Distinct (%)89.2%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T13:38:02.664441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9307359
Min length2

Characters and Unicode

Total characters677
Distinct characters132
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique196 ?
Unique (%)84.8%

Sample

1st row강남구
2nd row강동구
3rd row강북구
4th row강서구
5th row관악구
ValueCountFrequency (%)
동구 6
 
2.6%
중구 5
 
2.2%
남구 5
 
2.2%
서구 5
 
2.2%
북구 4
 
1.7%
고성군 2
 
0.9%
양양군 2
 
0.9%
창원시 2
 
0.9%
청주시 2
 
0.9%
강서구 2
 
0.9%
Other values (196) 196
84.8%
2023-12-12T13:38:03.124211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
84
 
12.4%
81
 
12.0%
75
 
11.1%
22
 
3.2%
21
 
3.1%
19
 
2.8%
18
 
2.7%
17
 
2.5%
15
 
2.2%
14
 
2.1%
Other values (122) 311
45.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 677
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
84
 
12.4%
81
 
12.0%
75
 
11.1%
22
 
3.2%
21
 
3.1%
19
 
2.8%
18
 
2.7%
17
 
2.5%
15
 
2.2%
14
 
2.1%
Other values (122) 311
45.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 677
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
84
 
12.4%
81
 
12.0%
75
 
11.1%
22
 
3.2%
21
 
3.1%
19
 
2.8%
18
 
2.7%
17
 
2.5%
15
 
2.2%
14
 
2.1%
Other values (122) 311
45.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 677
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
84
 
12.4%
81
 
12.0%
75
 
11.1%
22
 
3.2%
21
 
3.1%
19
 
2.8%
18
 
2.7%
17
 
2.5%
15
 
2.2%
14
 
2.1%
Other values (122) 311
45.9%

센터명
Text

UNIQUE 

Distinct231
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T13:38:03.436538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length8
Mean length8.6277056
Min length7

Characters and Unicode

Total characters1993
Distinct characters147
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique231 ?
Unique (%)100.0%

Sample

1st row강남구 가족센터
2nd row강동구 가족센터
3rd row강북구 가족센터
4th row강서구 가족센터
5th row관악구 가족센터
ValueCountFrequency (%)
가족센터 208
47.3%
순창군 1
 
0.2%
보성군 1
 
0.2%
예산군 1
 
0.2%
천안시다문화가족지원센터 1
 
0.2%
청양군 1
 
0.2%
태안군 1
 
0.2%
홍성군 1
 
0.2%
고창군 1
 
0.2%
군산시 1
 
0.2%
Other values (223) 223
50.7%
2023-12-12T13:38:03.915076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
229
11.5%
228
11.4%
228
11.4%
228
11.4%
209
 
10.5%
85
 
4.3%
81
 
4.1%
78
 
3.9%
30
 
1.5%
29
 
1.5%
Other values (137) 568
28.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1784
89.5%
Space Separator 209
 
10.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
229
12.8%
228
12.8%
228
12.8%
228
12.8%
85
 
4.8%
81
 
4.5%
78
 
4.4%
30
 
1.7%
29
 
1.6%
29
 
1.6%
Other values (136) 539
30.2%
Space Separator
ValueCountFrequency (%)
209
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1784
89.5%
Common 209
 
10.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
229
12.8%
228
12.8%
228
12.8%
228
12.8%
85
 
4.8%
81
 
4.5%
78
 
4.4%
30
 
1.7%
29
 
1.6%
29
 
1.6%
Other values (136) 539
30.2%
Common
ValueCountFrequency (%)
209
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1784
89.5%
ASCII 209
 
10.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
229
12.8%
228
12.8%
228
12.8%
228
12.8%
85
 
4.8%
81
 
4.5%
78
 
4.4%
30
 
1.7%
29
 
1.6%
29
 
1.6%
Other values (136) 539
30.2%
ASCII
ValueCountFrequency (%)
209
100.0%

주소
Text

Distinct230
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T13:38:04.261589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length34
Mean length25.350649
Min length12

Characters and Unicode

Total characters5856
Distinct characters313
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique229 ?
Unique (%)99.1%

Sample

1st row서울시 강남구 개포로 617-8
2nd row서울시 강동구 양재대로 138길 41 2층
3rd row서울시 강북구 한천로 129길 6
4th row서울시 강서구 강서로 5길 50 곰달래 문화복지센터 4층
5th row서울시 관악구 신림로 3길 35 김삼준문화복지기념관 3층 사무실
ValueCountFrequency (%)
2층 45
 
3.3%
3층 34
 
2.5%
경기도 29
 
2.1%
서울시 25
 
1.8%
경북 23
 
1.7%
전남 22
 
1.6%
4층 20
 
1.5%
강원도 18
 
1.3%
경남 18
 
1.3%
1층 17
 
1.2%
Other values (924) 1116
81.6%
2023-12-12T13:38:04.780062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1146
 
19.6%
1 211
 
3.6%
186
 
3.2%
164
 
2.8%
2 160
 
2.7%
152
 
2.6%
3 133
 
2.3%
111
 
1.9%
100
 
1.7%
4 97
 
1.7%
Other values (303) 3396
58.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3568
60.9%
Space Separator 1146
 
19.6%
Decimal Number 962
 
16.4%
Close Punctuation 57
 
1.0%
Open Punctuation 56
 
1.0%
Dash Punctuation 48
 
0.8%
Uppercase Letter 9
 
0.2%
Other Punctuation 7
 
0.1%
Math Symbol 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
186
 
5.2%
164
 
4.6%
152
 
4.3%
111
 
3.1%
100
 
2.8%
94
 
2.6%
84
 
2.4%
83
 
2.3%
82
 
2.3%
79
 
2.2%
Other values (279) 2433
68.2%
Decimal Number
ValueCountFrequency (%)
1 211
21.9%
2 160
16.6%
3 133
13.8%
4 97
10.1%
5 93
9.7%
0 69
 
7.2%
6 56
 
5.8%
7 56
 
5.8%
9 50
 
5.2%
8 37
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
A 2
22.2%
B 2
22.2%
Y 1
11.1%
W 1
11.1%
L 1
11.1%
H 1
11.1%
C 1
11.1%
Other Punctuation
ValueCountFrequency (%)
: 4
57.1%
. 3
42.9%
Space Separator
ValueCountFrequency (%)
1146
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Open Punctuation
ValueCountFrequency (%)
( 56
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3568
60.9%
Common 2279
38.9%
Latin 9
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
186
 
5.2%
164
 
4.6%
152
 
4.3%
111
 
3.1%
100
 
2.8%
94
 
2.6%
84
 
2.4%
83
 
2.3%
82
 
2.3%
79
 
2.2%
Other values (279) 2433
68.2%
Common
ValueCountFrequency (%)
1146
50.3%
1 211
 
9.3%
2 160
 
7.0%
3 133
 
5.8%
4 97
 
4.3%
5 93
 
4.1%
0 69
 
3.0%
) 57
 
2.5%
6 56
 
2.5%
( 56
 
2.5%
Other values (7) 201
 
8.8%
Latin
ValueCountFrequency (%)
A 2
22.2%
B 2
22.2%
Y 1
11.1%
W 1
11.1%
L 1
11.1%
H 1
11.1%
C 1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3568
60.9%
ASCII 2288
39.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1146
50.1%
1 211
 
9.2%
2 160
 
7.0%
3 133
 
5.8%
4 97
 
4.2%
5 93
 
4.1%
0 69
 
3.0%
) 57
 
2.5%
6 56
 
2.4%
( 56
 
2.4%
Other values (14) 210
 
9.2%
Hangul
ValueCountFrequency (%)
186
 
5.2%
164
 
4.6%
152
 
4.3%
111
 
3.1%
100
 
2.8%
94
 
2.6%
84
 
2.4%
83
 
2.3%
82
 
2.3%
79
 
2.2%
Other values (279) 2433
68.2%
Distinct230
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T13:38:05.067677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.935065
Min length11

Characters and Unicode

Total characters2757
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique229 ?
Unique (%)99.1%

Sample

1st row02-3412-2222
2nd row02-471-0812
3rd row02-987-2567
4th row02-2606-2017
5th row02-883-9383
ValueCountFrequency (%)
051-702-8002 2
 
0.9%
041-558-8653 1
 
0.4%
061-278-4222 1
 
0.4%
063-841-6040 1
 
0.4%
041-944-2333 1
 
0.4%
041-670-2396 1
 
0.4%
041-634-7432 1
 
0.4%
063-561-1366 1
 
0.4%
063-443-5300 1
 
0.4%
063-545-8506 1
 
0.4%
Other values (220) 220
95.2%
2023-12-12T13:38:05.534257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 462
16.8%
0 402
14.6%
3 362
13.1%
5 265
9.6%
2 227
8.2%
1 221
8.0%
4 212
7.7%
6 168
 
6.1%
8 149
 
5.4%
7 148
 
5.4%
Other values (2) 141
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2294
83.2%
Dash Punctuation 462
 
16.8%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 402
17.5%
3 362
15.8%
5 265
11.6%
2 227
9.9%
1 221
9.6%
4 212
9.2%
6 168
7.3%
8 149
 
6.5%
7 148
 
6.5%
9 140
 
6.1%
Dash Punctuation
ValueCountFrequency (%)
- 462
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2757
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 462
16.8%
0 402
14.6%
3 362
13.1%
5 265
9.6%
2 227
8.2%
1 221
8.0%
4 212
7.7%
6 168
 
6.1%
8 149
 
5.4%
7 148
 
5.4%
Other values (2) 141
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2757
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 462
16.8%
0 402
14.6%
3 362
13.1%
5 265
9.6%
2 227
8.2%
1 221
8.0%
4 212
7.7%
6 168
 
6.1%
8 149
 
5.4%
7 148
 
5.4%
Other values (2) 141
 
5.1%

Interactions

2023-12-12T13:38:01.315181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:38:05.652266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번(NO)지역명
순번(NO)1.0000.971
지역명0.9711.000
2023-12-12T13:38:05.762608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번(NO)지역명
순번(NO)1.0000.855
지역명0.8551.000

Missing values

2023-12-12T13:38:01.457047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:38:01.602646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번(NO)지역명시군구센터명주소연락처
01서울특별시강남구강남구 가족센터서울시 강남구 개포로 617-802-3412-2222
12서울특별시강동구강동구 가족센터서울시 강동구 양재대로 138길 41 2층02-471-0812
23서울특별시강북구강북구 가족센터서울시 강북구 한천로 129길 602-987-2567
34서울특별시강서구강서구 가족센터서울시 강서구 강서로 5길 50 곰달래 문화복지센터 4층02-2606-2017
45서울특별시관악구관악구 가족센터서울시 관악구 신림로 3길 35 김삼준문화복지기념관 3층 사무실02-883-9383
56서울특별시광진구광진구 가족센터1센터 : 서울시 광진구 능동로 30길 23 새마을회관 2층02-458-0622
67서울특별시구로구구로구 가족센터서울시 구로구 우마2길 35 구로구가족종합지원센터 2 3층02-869-0317
78서울특별시금천구금천구 가족센터서울시 금천구 금하로 11길 40 1층02-803-7747
89서울특별시노원구노원구 가족센터서울시 노원구 동일로173가길 94 가온빌딩 3층02-979-3501
910서울특별시도봉구도봉구 가족센터서울시 도봉구 도봉로 552 도봉구민회관 2층02-995-6800
순번(NO)지역명시군구센터명주소연락처
221222경상남도창녕군창녕군 가족센터경남 창녕군 창녕읍 창녕대로 172 사회복지타운 1층055-533-1305
222223경상남도창원시창원시 가족센터경남 창원시 성산구 대정로20번길 11 1층(가음동 여성회관 창원관)055-225-3951
223224경상남도창원시창원시마산 가족센터경남 창원시 마산합포구 중앙북5길 20. 마산YWCA 지하1층055-244-8745
224225경상남도통영시통영시 가족센터경남 통영시 안개 4길 94 (통영시종합사회복지관 내 3층)055-640-7741
225226경상남도하동군하동군 가족센터경남 하동군 섬진강대로 2214 종합사회복지관4층055-880-6522
226227경상남도함안군함안군 가족센터경남 함안군 산인면 가야로 217 산인면종합복지관 1층055-583-5430
227228경상남도함양군함양군 가족센터경남 함양군 함양읍 함양여중길 10055-963-2057
228229경상남도합천군합천군 가족센터경남 합천군 합천읍 옥산로 96-7 2층055-930-4732
229230제주특별자치도서귀포시서귀포시 가족센터제주 서귀포시 서호남로 19-42064-732-6482
230231제주특별자치도제주시제주시 가족센터제주 제주시 중앙로 14길 15(삼도이동) 2 3 4층064-725-8005