Overview

Dataset statistics

Number of variables4
Number of observations2620
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory82.0 KiB
Average record size in memory32.1 B

Variable types

Text4

Dataset

Description보건소, 진료소, 복지관 등 장애인 건강을 위한 검진 기관에 관한 정보로 제공하는 항목은 "지역, 보건기관명, 주소, 전화번호"입니다.
Author보건복지부 국립재활원
URLhttps://www.data.go.kr/data/15052276/fileData.do

Reproduction

Analysis started2023-12-12 12:20:55.785865
Analysis finished2023-12-12 12:20:56.869496
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Text

Distinct229
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size20.6 KiB
2023-12-12T21:20:57.167219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length8
Mean length7.2961832
Min length2

Characters and Unicode

Total characters19116
Distinct characters129
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)1.8%

Sample

1st row서울 금천구
2nd row서울 금천구
3rd row서울 종로구
4th row서울 종로구
5th row서울 종로구
ValueCountFrequency (%)
경상북도 552
 
10.7%
경상남도 408
 
7.9%
충청남도 380
 
7.4%
경기도 328
 
6.4%
충청북도 265
 
5.2%
강원도 246
 
4.8%
서울 89
 
1.7%
인천 70
 
1.4%
상주시 44
 
0.9%
부산 44
 
0.9%
Other values (212) 2713
52.8%
2023-12-12T21:20:57.701997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2519
 
13.2%
2216
 
11.6%
1405
 
7.3%
1300
 
6.8%
1011
 
5.3%
1006
 
5.3%
896
 
4.7%
883
 
4.6%
772
 
4.0%
690
 
3.6%
Other values (119) 6418
33.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16597
86.8%
Space Separator 2519
 
13.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2216
 
13.4%
1405
 
8.5%
1300
 
7.8%
1011
 
6.1%
1006
 
6.1%
896
 
5.4%
883
 
5.3%
772
 
4.7%
690
 
4.2%
436
 
2.6%
Other values (118) 5982
36.0%
Space Separator
ValueCountFrequency (%)
2519
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16597
86.8%
Common 2519
 
13.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2216
 
13.4%
1405
 
8.5%
1300
 
7.8%
1011
 
6.1%
1006
 
6.1%
896
 
5.4%
883
 
5.3%
772
 
4.7%
690
 
4.2%
436
 
2.6%
Other values (118) 5982
36.0%
Common
ValueCountFrequency (%)
2519
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16597
86.8%
ASCII 2519
 
13.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2519
100.0%
Hangul
ValueCountFrequency (%)
2216
 
13.4%
1405
 
8.5%
1300
 
7.8%
1011
 
6.1%
1006
 
6.1%
896
 
5.4%
883
 
5.3%
772
 
4.7%
690
 
4.2%
436
 
2.6%
Other values (118) 5982
36.0%
Distinct2419
Distinct (%)92.3%
Missing0
Missing (%)0.0%
Memory size20.6 KiB
2023-12-12T21:20:57.983473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length7
Mean length7.3671756
Min length3

Characters and Unicode

Total characters19302
Distinct characters338
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2266 ?
Unique (%)86.5%

Sample

1st row금천구보건소
2nd row독산보건분소
3rd row종로구보건소
4th row명륜보건분소
5th row종로구보건분소
ValueCountFrequency (%)
서면보건지소 8
 
0.3%
남면보건지소 6
 
0.2%
북면보건지소 6
 
0.2%
대구광역시 5
 
0.2%
동면보건지소 5
 
0.2%
봉산보건진료소 5
 
0.2%
창원시 4
 
0.2%
용산보건진료소 4
 
0.2%
양지보건진료소 4
 
0.2%
소곡보건진료소 3
 
0.1%
Other values (2414) 2586
98.1%
2023-12-12T21:20:58.442529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2456
 
12.7%
2447
 
12.7%
2432
 
12.6%
1321
 
6.8%
1264
 
6.5%
1170
 
6.1%
419
 
2.2%
269
 
1.4%
256
 
1.3%
231
 
1.2%
Other values (328) 7037
36.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19246
99.7%
Close Punctuation 20
 
0.1%
Open Punctuation 20
 
0.1%
Space Separator 16
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2456
 
12.8%
2447
 
12.7%
2432
 
12.6%
1321
 
6.9%
1264
 
6.6%
1170
 
6.1%
419
 
2.2%
269
 
1.4%
256
 
1.3%
231
 
1.2%
Other values (325) 6981
36.3%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19246
99.7%
Common 56
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2456
 
12.8%
2447
 
12.7%
2432
 
12.6%
1321
 
6.9%
1264
 
6.6%
1170
 
6.1%
419
 
2.2%
269
 
1.4%
256
 
1.3%
231
 
1.2%
Other values (325) 6981
36.3%
Common
ValueCountFrequency (%)
) 20
35.7%
( 20
35.7%
16
28.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19246
99.7%
ASCII 56
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2456
 
12.8%
2447
 
12.7%
2432
 
12.6%
1321
 
6.9%
1264
 
6.6%
1170
 
6.1%
419
 
2.2%
269
 
1.4%
256
 
1.3%
231
 
1.2%
Other values (325) 6981
36.3%
ASCII
ValueCountFrequency (%)
) 20
35.7%
( 20
35.7%
16
28.6%

주소
Text

Distinct2605
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size20.6 KiB
2023-12-12T21:20:58.846601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length40
Median length39
Mean length19.996183
Min length7

Characters and Unicode

Total characters52390
Distinct characters389
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2590 ?
Unique (%)98.9%

Sample

1st row서울 금천구 시흥동 시흥대로 73길 70
2nd row서울 금천구 독산본동 독산로87길 27
3rd row서울 종로구 옥인동 45-30(자수궁길)
4th row서울 종로구 옥인동 45-30(자수궁길)
5th row서울 종로구 창신1동 222-8
ValueCountFrequency (%)
경북 563
 
4.4%
경남 419
 
3.3%
충남 391
 
3.0%
경기 343
 
2.7%
충북 274
 
2.1%
강원 250
 
1.9%
서울 90
 
0.7%
인천 68
 
0.5%
남구 47
 
0.4%
상주시 45
 
0.3%
Other values (5657) 10389
80.7%
2023-12-12T21:20:59.478034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10266
 
19.6%
2169
 
4.1%
1 2127
 
4.1%
- 1966
 
3.8%
1917
 
3.7%
1429
 
2.7%
2 1383
 
2.6%
1363
 
2.6%
3 1214
 
2.3%
1150
 
2.2%
Other values (379) 27406
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29416
56.1%
Decimal Number 10483
 
20.0%
Space Separator 10266
 
19.6%
Dash Punctuation 1966
 
3.8%
Open Punctuation 122
 
0.2%
Close Punctuation 122
 
0.2%
Other Punctuation 12
 
< 0.1%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2169
 
7.4%
1917
 
6.5%
1429
 
4.9%
1363
 
4.6%
1150
 
3.9%
1131
 
3.8%
1034
 
3.5%
853
 
2.9%
814
 
2.8%
726
 
2.5%
Other values (361) 16830
57.2%
Decimal Number
ValueCountFrequency (%)
1 2127
20.3%
2 1383
13.2%
3 1214
11.6%
4 1094
10.4%
5 951
9.1%
6 862
8.2%
7 828
 
7.9%
9 700
 
6.7%
8 692
 
6.6%
0 632
 
6.0%
Other Punctuation
ValueCountFrequency (%)
, 7
58.3%
. 5
41.7%
Uppercase Letter
ValueCountFrequency (%)
A 2
66.7%
B 1
33.3%
Space Separator
ValueCountFrequency (%)
10266
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1966
100.0%
Open Punctuation
ValueCountFrequency (%)
( 122
100.0%
Close Punctuation
ValueCountFrequency (%)
) 122
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29416
56.1%
Common 22971
43.8%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2169
 
7.4%
1917
 
6.5%
1429
 
4.9%
1363
 
4.6%
1150
 
3.9%
1131
 
3.8%
1034
 
3.5%
853
 
2.9%
814
 
2.8%
726
 
2.5%
Other values (361) 16830
57.2%
Common
ValueCountFrequency (%)
10266
44.7%
1 2127
 
9.3%
- 1966
 
8.6%
2 1383
 
6.0%
3 1214
 
5.3%
4 1094
 
4.8%
5 951
 
4.1%
6 862
 
3.8%
7 828
 
3.6%
9 700
 
3.0%
Other values (6) 1580
 
6.9%
Latin
ValueCountFrequency (%)
A 2
66.7%
B 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29416
56.1%
ASCII 22974
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10266
44.7%
1 2127
 
9.3%
- 1966
 
8.6%
2 1383
 
6.0%
3 1214
 
5.3%
4 1094
 
4.8%
5 951
 
4.1%
6 862
 
3.8%
7 828
 
3.6%
9 700
 
3.0%
Other values (8) 1583
 
6.9%
Hangul
ValueCountFrequency (%)
2169
 
7.4%
1917
 
6.5%
1429
 
4.9%
1363
 
4.6%
1150
 
3.9%
1131
 
3.8%
1034
 
3.5%
853
 
2.9%
814
 
2.8%
726
 
2.5%
Other values (361) 16830
57.2%
Distinct2599
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size20.6 KiB
2023-12-12T21:20:59.809462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.994656
Min length11

Characters and Unicode

Total characters31426
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2579 ?
Unique (%)98.4%

Sample

1st row02-2627-2703
2nd row02-867-4631
3rd row02-2148-3522
4th row02-2148-3673
5th row02-2148-3563
ValueCountFrequency (%)
02-2147-3450 3
 
0.1%
053-852-9939 2
 
0.1%
02-920-1971 2
 
0.1%
043-838-8018 2
 
0.1%
031-538-4344 2
 
0.1%
041-852-4214 2
 
0.1%
032-560-5013 2
 
0.1%
054-571-3284 2
 
0.1%
053-667-5616 2
 
0.1%
055-639-6200 2
 
0.1%
Other values (2589) 2599
99.2%
2023-12-12T21:21:00.351054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 5240
16.7%
0 4501
14.3%
3 3675
11.7%
5 3542
11.3%
4 3030
9.6%
2 2300
7.3%
1 2198
7.0%
6 1885
 
6.0%
8 1860
 
5.9%
7 1838
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 26186
83.3%
Dash Punctuation 5240
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 4501
17.2%
3 3675
14.0%
5 3542
13.5%
4 3030
11.6%
2 2300
8.8%
1 2198
8.4%
6 1885
7.2%
8 1860
7.1%
7 1838
7.0%
9 1357
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 5240
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 31426
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 5240
16.7%
0 4501
14.3%
3 3675
11.7%
5 3542
11.3%
4 3030
9.6%
2 2300
7.3%
1 2198
7.0%
6 1885
 
6.0%
8 1860
 
5.9%
7 1838
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31426
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 5240
16.7%
0 4501
14.3%
3 3675
11.7%
5 3542
11.3%
4 3030
9.6%
2 2300
7.3%
1 2198
7.0%
6 1885
 
6.0%
8 1860
 
5.9%
7 1838
 
5.8%

Missing values

2023-12-12T21:20:56.697176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:20:56.815638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역보건기관명주소전화번호
0서울 금천구금천구보건소서울 금천구 시흥동 시흥대로 73길 7002-2627-2703
1서울 금천구독산보건분소서울 금천구 독산본동 독산로87길 2702-867-4631
2서울 종로구종로구보건소서울 종로구 옥인동 45-30(자수궁길)02-2148-3522
3서울 종로구명륜보건분소서울 종로구 옥인동 45-30(자수궁길)02-2148-3673
4서울 종로구종로구보건분소서울 종로구 창신1동 222-802-2148-3563
5서울 서초구서초구보건소서울 서초구 서초동 1376-302-2155-8000
6서울 서초구서초방배보건분소서울 서초구 방배동 936-1102-2155-8160
7서울 성동구성동구보건소서울 성동구 홍익동 16-102-2286-7063
8서울 성동구금호동보건분소서울 성동구 금호동 1가 580번지02-2286-7103
9서울 성동구성수동보건분소서울 성동구 성수1가1동 685-54번지02-2286-7172
지역보건기관명주소전화번호
2610충남부여군장애인종합복지관충남 부여군 규암면 내리246-3041-836-2157
2611충남서천군장애인종합복지관충남 서천군 종천면 종천리37-6번지041-950-1253
2612충남충청남도남부장애인종합복지관충남 공주시 계룡면 기산리627-1 (영규대사로 747)041-856-7071
2613충북보은군노인장애인복지관충북 보은군 보은읍 이평리뱃들4길 11-10043-544-5446
2614충북제천장애인종합복지관충북 제천시 청전동110번지043-652-0900
2615충북혜원장애인종합복지관충북 청주시 흥덕구 미평동271-7번지043-295-2505
2616충북충청북도장애인종합복지관충북 충주시 호암동751-7번지043-856-1100
2617충북옥천노인장애인복지관충북 옥천군 옥천읍 삼양리8길 9043-733-2500
2618충북영동군장애인복지관충북 영동군 영동읍 매천리460043-743-1500
2619충북증평군장애인복지관충북 증평군 증평읍 내성리45번지043-835-4288