Overview

Dataset statistics

Number of variables4
Number of observations509
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory16.0 KiB
Average record size in memory32.3 B

Variable types

Categorical1
Text2
DateTime1

Dataset

Description병무청지정병원
Author병무청
URLhttps://www.data.go.kr/data/15064429/fileData.do

Alerts

Dataset has 1 (0.2%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 01:58:57.349363
Analysis finished2023-12-12 01:58:57.770937
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관할청
Categorical

Distinct14
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
서울
89 
부산
57 
경남
52 
광주.전남
50 
대구.경북
43 
Other values (9)
218 

Length

Max length5
Median length2
Mean length2.870334
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울
2nd row서울
3rd row서울
4th row서울
5th row서울

Common Values

ValueCountFrequency (%)
서울 89
17.5%
부산 57
11.2%
경남 52
10.2%
광주.전남 50
9.8%
대구.경북 43
8.4%
경인 35
 
6.9%
인천 35
 
6.9%
대전.충남 34
 
6.7%
충북 28
 
5.5%
강원 20
 
3.9%
Other values (4) 66
13.0%

Length

2023-12-12T10:58:57.851589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울 89
17.5%
부산 57
11.2%
경남 52
10.2%
광주.전남 50
9.8%
대구.경북 43
8.4%
경인 35
 
6.9%
인천 35
 
6.9%
대전.충남 34
 
6.7%
충북 28
 
5.5%
강원 20
 
3.9%
Other values (4) 66
13.0%
Distinct275
Distinct (%)54.0%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
2023-12-12T10:58:58.041169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length9.9764244
Min length3

Characters and Unicode

Total characters5078
Distinct characters218
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)9.4%

Sample

1st row가톨릭대학교서울성모병원
2nd row가톨릭대학교서울성모병원
3rd row가톨릭대학교성바오로병원
4th row가톨릭대학교성바오로병원
5th row가톨릭대학교여의도성모병원
ValueCountFrequency (%)
의료법인 8
 
1.4%
근로복지공단 8
 
1.4%
광혜의료재단광혜병원 3
 
0.5%
강원대학교병원 3
 
0.5%
김원묵기념봉생병원 3
 
0.5%
국립춘천병원 3
 
0.5%
고신대학교복음병원 3
 
0.5%
대동병원 3
 
0.5%
강동병원 3
 
0.5%
의료법인철원길병원 2
 
0.4%
Other values (284) 514
92.9%
2023-12-12T10:58:58.488839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
581
 
11.4%
459
 
9.0%
307
 
6.0%
269
 
5.3%
212
 
4.2%
194
 
3.8%
162
 
3.2%
150
 
3.0%
130
 
2.6%
126
 
2.5%
Other values (208) 2488
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5010
98.7%
Space Separator 44
 
0.9%
Uppercase Letter 16
 
0.3%
Dash Punctuation 4
 
0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
581
 
11.6%
459
 
9.2%
307
 
6.1%
269
 
5.4%
212
 
4.2%
194
 
3.9%
162
 
3.2%
150
 
3.0%
130
 
2.6%
126
 
2.5%
Other values (198) 2420
48.3%
Uppercase Letter
ValueCountFrequency (%)
S 4
25.0%
H 4
25.0%
E 2
12.5%
B 2
12.5%
M 2
12.5%
K 2
12.5%
Space Separator
ValueCountFrequency (%)
44
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5010
98.7%
Common 52
 
1.0%
Latin 16
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
581
 
11.6%
459
 
9.2%
307
 
6.1%
269
 
5.4%
212
 
4.2%
194
 
3.9%
162
 
3.2%
150
 
3.0%
130
 
2.6%
126
 
2.5%
Other values (198) 2420
48.3%
Latin
ValueCountFrequency (%)
S 4
25.0%
H 4
25.0%
E 2
12.5%
B 2
12.5%
M 2
12.5%
K 2
12.5%
Common
ValueCountFrequency (%)
44
84.6%
- 4
 
7.7%
( 2
 
3.8%
) 2
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5010
98.7%
ASCII 68
 
1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
581
 
11.6%
459
 
9.2%
307
 
6.1%
269
 
5.4%
212
 
4.2%
194
 
3.9%
162
 
3.2%
150
 
3.0%
130
 
2.6%
126
 
2.5%
Other values (198) 2420
48.3%
ASCII
ValueCountFrequency (%)
44
64.7%
S 4
 
5.9%
H 4
 
5.9%
- 4
 
5.9%
E 2
 
2.9%
( 2
 
2.9%
) 2
 
2.9%
B 2
 
2.9%
M 2
 
2.9%
K 2
 
2.9%
Distinct138
Distinct (%)27.1%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
Minimum1977-05-01 00:00:00
Maximum2016-07-19 00:00:00
2023-12-12T10:58:58.636964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:58:58.778096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

주소
Text

Distinct503
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size4.1 KiB
2023-12-12T10:58:59.106761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length37
Mean length20.980354
Min length6

Characters and Unicode

Total characters10679
Distinct characters301
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique499 ?
Unique (%)98.0%

Sample

1st row서울특별시 서초구 반포대로 222
2nd row서울 서초구 반포동 505번지
3rd row서울 동대문구 전농1동 성바오로병원 620-56
4th row서울 동대문구 왕산로 180
5th row서울 영등포구 여의도동 여의도성모병원 62
ValueCountFrequency (%)
서울 50
 
2.1%
경기도 49
 
2.1%
서울특별시 40
 
1.7%
경남 37
 
1.6%
35
 
1.5%
0 35
 
1.5%
강원 25
 
1.1%
중구 24
 
1.0%
충북 24
 
1.0%
부산광역시 23
 
1.0%
Other values (1137) 2020
85.5%
2023-12-12T10:58:59.690108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2135
 
20.0%
406
 
3.8%
372
 
3.5%
347
 
3.2%
1 312
 
2.9%
299
 
2.8%
244
 
2.3%
2 228
 
2.1%
3 184
 
1.7%
173
 
1.6%
Other values (291) 5979
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6443
60.3%
Space Separator 2135
 
20.0%
Decimal Number 1613
 
15.1%
Dash Punctuation 143
 
1.3%
Close Punctuation 142
 
1.3%
Open Punctuation 142
 
1.3%
Other Punctuation 61
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
406
 
6.3%
372
 
5.8%
347
 
5.4%
299
 
4.6%
244
 
3.8%
173
 
2.7%
173
 
2.7%
171
 
2.7%
165
 
2.6%
160
 
2.5%
Other values (274) 3933
61.0%
Decimal Number
ValueCountFrequency (%)
1 312
19.3%
2 228
14.1%
3 184
11.4%
5 168
10.4%
0 156
9.7%
4 127
7.9%
7 122
 
7.6%
6 118
 
7.3%
9 103
 
6.4%
8 95
 
5.9%
Other Punctuation
ValueCountFrequency (%)
, 56
91.8%
. 3
 
4.9%
· 2
 
3.3%
Space Separator
ValueCountFrequency (%)
2135
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 143
100.0%
Close Punctuation
ValueCountFrequency (%)
) 142
100.0%
Open Punctuation
ValueCountFrequency (%)
( 142
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6443
60.3%
Common 4236
39.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
406
 
6.3%
372
 
5.8%
347
 
5.4%
299
 
4.6%
244
 
3.8%
173
 
2.7%
173
 
2.7%
171
 
2.7%
165
 
2.6%
160
 
2.5%
Other values (274) 3933
61.0%
Common
ValueCountFrequency (%)
2135
50.4%
1 312
 
7.4%
2 228
 
5.4%
3 184
 
4.3%
5 168
 
4.0%
0 156
 
3.7%
- 143
 
3.4%
) 142
 
3.4%
( 142
 
3.4%
4 127
 
3.0%
Other values (7) 499
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6443
60.3%
ASCII 4234
39.6%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2135
50.4%
1 312
 
7.4%
2 228
 
5.4%
3 184
 
4.3%
5 168
 
4.0%
0 156
 
3.7%
- 143
 
3.4%
) 142
 
3.4%
( 142
 
3.4%
4 127
 
3.0%
Other values (6) 497
 
11.7%
Hangul
ValueCountFrequency (%)
406
 
6.3%
372
 
5.8%
347
 
5.4%
299
 
4.6%
244
 
3.8%
173
 
2.7%
173
 
2.7%
171
 
2.7%
165
 
2.6%
160
 
2.5%
Other values (274) 3933
61.0%
None
ValueCountFrequency (%)
· 2
100.0%

Missing values

2023-12-12T10:58:57.650987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:58:57.735883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관할청병원명지정일자주소
0서울가톨릭대학교서울성모병원1992-03-03서울특별시 서초구 반포대로 222
1서울가톨릭대학교서울성모병원1992-03-03서울 서초구 반포동 505번지
2서울가톨릭대학교성바오로병원1997-09-29서울 동대문구 전농1동 성바오로병원 620-56
3서울가톨릭대학교성바오로병원1997-09-29서울 동대문구 왕산로 180
4서울가톨릭대학교여의도성모병원1992-03-03서울 영등포구 여의도동 여의도성모병원 62
5서울가톨릭대학교여의도성모병원1992-03-03서울특별시 영등포구 63로 10
6서울강동경희대학교의대병원2006-10-26서울 강동구 상일동 (구.경희대학교동서신의학병원)
7서울강동경희대학교의대병원2006-10-26서울특별시 강동구 동남로 892
8서울강동경희대학교치과병원2007-04-23서울 강동구 상일동 149
9서울강동경희대학교치과병원2007-04-23서울특별시 강동구 동남로 892
관할청병원명지정일자주소
499강원영동강원도삼척의료원1992-03-06강원 삼척시 남양동 55-9
500강원영동강원도삼척의료원1992-03-06강원 삼척시 오십천로 418(남양동)
501강원영동강원도속초의료원1992-03-06강원 속초시 영랑동 591-10
502강원영동강원도속초의료원1992-03-06강원 속초시 영랑호반길 3(영랑동)
503강원영동강릉아산병원1998-02-13강원 강릉시 사천면 강릉아산병원
504강원영동강릉아산병원1998-02-13강원도 강릉시 사천면 방동길 38 (방동리)
505강원영동강릉원주대학교치과병원2006-06-01강원 강릉시 지변동 강릉대학교
506강원영동강릉원주대학교치과병원2006-06-01강원 강릉시 죽헌길 7(지변동)
507강원영동강릉율곡병원2009-03-16강원 강릉시 난곡동 401-1번지
508강원영동강릉율곡병원2009-03-16강원 강릉시 동해대로3304번길 11(난곡동)

Duplicate rows

Most frequently occurring

관할청병원명지정일자주소# duplicates
0부산부민병원2016-07-19(비어있음)2