Overview

Dataset statistics

Number of variables4
Number of observations83
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.8 KiB
Average record size in memory34.6 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description건강관리의 기회가 부족한 건설근로자를 대상으로 초음파, 내시경, CT촬영 등의 종합건강검진을 제공해 드리는 복지서비스입니다. 대상에 해당되는 건설근로자 본인이 건설근로자공제회로 검진을 신청하시면 첨부된 검진기관을 통해 무료로 검진을 받을 수 있습니다.
URLhttps://www.data.go.kr/data/15032101/fileData.do

Alerts

연번 has unique valuesUnique
검진기관명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:17:05.261082
Analysis finished2023-12-12 13:17:05.835911
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct83
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42
Minimum1
Maximum83
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size879.0 B
2023-12-12T22:17:05.945450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.1
Q121.5
median42
Q362.5
95-th percentile78.9
Maximum83
Range82
Interquartile range (IQR)41

Descriptive statistics

Standard deviation24.103942
Coefficient of variation (CV)0.57390337
Kurtosis-1.2
Mean42
Median Absolute Deviation (MAD)21
Skewness0
Sum3486
Variance581
MonotonicityStrictly increasing
2023-12-12T22:17:06.106675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.2%
54 1
 
1.2%
62 1
 
1.2%
61 1
 
1.2%
60 1
 
1.2%
59 1
 
1.2%
58 1
 
1.2%
57 1
 
1.2%
56 1
 
1.2%
55 1
 
1.2%
Other values (73) 73
88.0%
ValueCountFrequency (%)
1 1
1.2%
2 1
1.2%
3 1
1.2%
4 1
1.2%
5 1
1.2%
6 1
1.2%
7 1
1.2%
8 1
1.2%
9 1
1.2%
10 1
1.2%
ValueCountFrequency (%)
83 1
1.2%
82 1
1.2%
81 1
1.2%
80 1
1.2%
79 1
1.2%
78 1
1.2%
77 1
1.2%
76 1
1.2%
75 1
1.2%
74 1
1.2%

지역
Categorical

Distinct17
Distinct (%)20.5%
Missing0
Missing (%)0.0%
Memory size796.0 B
서울특별시
17 
경기도
12 
인천광역시
부산광역시
충청남도
Other values (12)
37 

Length

Max length7
Median length5
Mean length4.5662651
Min length3

Unique

Unique2 ?
Unique (%)2.4%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row경기도
5th row인천광역시

Common Values

ValueCountFrequency (%)
서울특별시 17
20.5%
경기도 12
14.5%
인천광역시 6
 
7.2%
부산광역시 6
 
7.2%
충청남도 5
 
6.0%
강원특별자치도 4
 
4.8%
경상남도 4
 
4.8%
대구광역시 4
 
4.8%
충청북도 4
 
4.8%
전라북도 4
 
4.8%
Other values (7) 17
20.5%

Length

2023-12-12T22:17:06.282395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 17
20.5%
경기도 12
14.5%
인천광역시 6
 
7.2%
부산광역시 6
 
7.2%
충청남도 5
 
6.0%
전라남도 4
 
4.8%
전라북도 4
 
4.8%
충청북도 4
 
4.8%
대구광역시 4
 
4.8%
경상남도 4
 
4.8%
Other values (7) 17
20.5%

검진기관명
Text

UNIQUE 

Distinct83
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size796.0 B
2023-12-12T22:17:06.585447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length15
Mean length8.4698795
Min length4

Characters and Unicode

Total characters703
Distinct characters148
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)100.0%

Sample

1st row건강관리협회_서울강남지부
2nd row건강관리협회_서울동부지부
3rd row건강관리협회_서울서부지부
4th row건강관리협회_경기지부
5th row건강관리협회_인천지부
ValueCountFrequency (%)
건강관리협회_서울강남지부 1
 
1.2%
천안우리병원 1
 
1.2%
송도지안병원 1
 
1.2%
강릉동인병원 1
 
1.2%
안동성소 1
 
1.2%
전주우석대 1
 
1.2%
강릉아나병원 1
 
1.2%
부천우리병원 1
 
1.2%
대경영상의학과의원 1
 
1.2%
목포한국병원 1
 
1.2%
Other values (73) 73
88.0%
2023-12-12T22:17:07.091518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45
 
6.4%
35
 
5.0%
_ 30
 
4.3%
30
 
4.3%
28
 
4.0%
22
 
3.1%
20
 
2.8%
18
 
2.6%
17
 
2.4%
17
 
2.4%
Other values (138) 441
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 654
93.0%
Connector Punctuation 30
 
4.3%
Uppercase Letter 9
 
1.3%
Close Punctuation 3
 
0.4%
Open Punctuation 3
 
0.4%
Decimal Number 3
 
0.4%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
6.9%
35
 
5.4%
30
 
4.6%
28
 
4.3%
22
 
3.4%
20
 
3.1%
18
 
2.8%
17
 
2.6%
17
 
2.6%
17
 
2.6%
Other values (125) 405
61.9%
Uppercase Letter
ValueCountFrequency (%)
S 3
33.3%
G 2
22.2%
K 1
 
11.1%
C 1
 
11.1%
F 1
 
11.1%
I 1
 
11.1%
Decimal Number
ValueCountFrequency (%)
3 1
33.3%
6 1
33.3%
5 1
33.3%
Connector Punctuation
ValueCountFrequency (%)
_ 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 654
93.0%
Common 40
 
5.7%
Latin 9
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
6.9%
35
 
5.4%
30
 
4.6%
28
 
4.3%
22
 
3.4%
20
 
3.1%
18
 
2.8%
17
 
2.6%
17
 
2.6%
17
 
2.6%
Other values (125) 405
61.9%
Common
ValueCountFrequency (%)
_ 30
75.0%
) 3
 
7.5%
( 3
 
7.5%
, 1
 
2.5%
3 1
 
2.5%
6 1
 
2.5%
5 1
 
2.5%
Latin
ValueCountFrequency (%)
S 3
33.3%
G 2
22.2%
K 1
 
11.1%
C 1
 
11.1%
F 1
 
11.1%
I 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 654
93.0%
ASCII 49
 
7.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
45
 
6.9%
35
 
5.4%
30
 
4.6%
28
 
4.3%
22
 
3.4%
20
 
3.1%
18
 
2.8%
17
 
2.6%
17
 
2.6%
17
 
2.6%
Other values (125) 405
61.9%
ASCII
ValueCountFrequency (%)
_ 30
61.2%
) 3
 
6.1%
S 3
 
6.1%
( 3
 
6.1%
G 2
 
4.1%
, 1
 
2.0%
3 1
 
2.0%
6 1
 
2.0%
5 1
 
2.0%
K 1
 
2.0%
Other values (3) 3
 
6.1%

주소
Text

UNIQUE 

Distinct83
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size796.0 B
2023-12-12T22:17:07.434666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length35
Mean length23.626506
Min length15

Characters and Unicode

Total characters1961
Distinct characters224
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique83 ?
Unique (%)100.0%

Sample

1st row서울 송파구 올림픽로 269 (신천동, 롯데캐슬골드 5층)
2nd row서울 동대문구 왕산로 78 (용두동)
3rd row서울 강서구 화곡로 335 (화곡동)
4th row경기도 수원시 장안구 경수대로 857(조원동 779)
5th row인천 남구 독배로 500 (숭의동)
ValueCountFrequency (%)
서울 10
 
2.2%
경기도 8
 
1.8%
부산 6
 
1.3%
서구 6
 
1.3%
서울시 6
 
1.3%
인천 6
 
1.3%
강남구 5
 
1.1%
충남 5
 
1.1%
전북 4
 
0.9%
중구 4
 
0.9%
Other values (323) 390
86.7%
2023-12-12T22:17:07.897578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
368
 
18.8%
81
 
4.1%
78
 
4.0%
66
 
3.4%
( 64
 
3.3%
) 64
 
3.3%
1 56
 
2.9%
52
 
2.7%
2 44
 
2.2%
3 39
 
2.0%
Other values (214) 1049
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1132
57.7%
Space Separator 368
 
18.8%
Decimal Number 305
 
15.6%
Open Punctuation 64
 
3.3%
Close Punctuation 64
 
3.3%
Other Punctuation 17
 
0.9%
Dash Punctuation 6
 
0.3%
Math Symbol 3
 
0.2%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
81
 
7.2%
78
 
6.9%
66
 
5.8%
52
 
4.6%
37
 
3.3%
34
 
3.0%
33
 
2.9%
31
 
2.7%
22
 
1.9%
20
 
1.8%
Other values (196) 678
59.9%
Decimal Number
ValueCountFrequency (%)
1 56
18.4%
2 44
14.4%
3 39
12.8%
5 31
10.2%
4 28
9.2%
6 23
7.5%
7 22
 
7.2%
0 21
 
6.9%
9 21
 
6.9%
8 20
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
K 1
50.0%
Space Separator
ValueCountFrequency (%)
368
100.0%
Open Punctuation
ValueCountFrequency (%)
( 64
100.0%
Close Punctuation
ValueCountFrequency (%)
) 64
100.0%
Other Punctuation
ValueCountFrequency (%)
, 17
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1132
57.7%
Common 827
42.2%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
81
 
7.2%
78
 
6.9%
66
 
5.8%
52
 
4.6%
37
 
3.3%
34
 
3.0%
33
 
2.9%
31
 
2.7%
22
 
1.9%
20
 
1.8%
Other values (196) 678
59.9%
Common
ValueCountFrequency (%)
368
44.5%
( 64
 
7.7%
) 64
 
7.7%
1 56
 
6.8%
2 44
 
5.3%
3 39
 
4.7%
5 31
 
3.7%
4 28
 
3.4%
6 23
 
2.8%
7 22
 
2.7%
Other values (6) 88
 
10.6%
Latin
ValueCountFrequency (%)
A 1
50.0%
K 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1132
57.7%
ASCII 829
42.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
368
44.4%
( 64
 
7.7%
) 64
 
7.7%
1 56
 
6.8%
2 44
 
5.3%
3 39
 
4.7%
5 31
 
3.7%
4 28
 
3.4%
6 23
 
2.8%
7 22
 
2.7%
Other values (8) 90
 
10.9%
Hangul
ValueCountFrequency (%)
81
 
7.2%
78
 
6.9%
66
 
5.8%
52
 
4.6%
37
 
3.3%
34
 
3.0%
33
 
2.9%
31
 
2.7%
22
 
1.9%
20
 
1.8%
Other values (196) 678
59.9%

Interactions

2023-12-12T22:17:05.545944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:17:08.004951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역검진기관명주소
연번1.0000.5791.0001.000
지역0.5791.0001.0001.000
검진기관명1.0001.0001.0001.000
주소1.0001.0001.0001.000
2023-12-12T22:17:08.084875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역
연번1.0000.250
지역0.2501.000

Missing values

2023-12-12T22:17:05.694947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:17:05.801295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번지역검진기관명주소
01서울특별시건강관리협회_서울강남지부서울 송파구 올림픽로 269 (신천동, 롯데캐슬골드 5층)
12서울특별시건강관리협회_서울동부지부서울 동대문구 왕산로 78 (용두동)
23서울특별시건강관리협회_서울서부지부서울 강서구 화곡로 335 (화곡동)
34경기도건강관리협회_경기지부경기도 수원시 장안구 경수대로 857(조원동 779)
45인천광역시건강관리협회_인천지부인천 남구 독배로 500 (숭의동)
56강원특별자치도건강관리협회_강원지부강원 춘천시 남춘로 50 (효자동)
67경상남도건강관리협회_경남지부경남 창원시 마산회원구 삼호로 107 (양덕동)
78대구광역시건강관리협회_경북,대구북부지부대구 북구 팔달로 23 (노원동3가)
89광주광역시건강관리협회_광주전남지부광주 서구 대남대로 432 (농성동)
910대구광역시건강관리협회_대구지부대구 동구 장동로16(신천3동 128-1)
연번지역검진기관명주소
7374부산광역시부산온종합병원부산 부산진구 가야대로 721 (당감동)
7475충청남도예산명지병원충남 예산군 예산읍 신례원로 26
7576울산광역시울산세민병원울산 중구 학성로 184 (학성동)
7677경기도용인명주병원경기도 용인시 처인구 금령로39번길8-6
7778인천광역시검단탑병원인천 서구 청마로19번길 5 (당하동)
7879충청남도천안하나메디컬충남 천안시 서북구 검은들3길 46 (불당동)
7980전라북도남원의료원전북 남원시 충정로 365 남원의료원
8081경상남도박해동내과경남 거제시 서문로3길 26 5층, 6층
8182세종특별자치시세종365의원세종 한누리대로 1958 법조타운 A동 4층
8283경상북도포항성모병원경북 포항시 남구 대잠동길 17