Overview

Dataset statistics

Number of variables: 11
Number of observations: 74
Missing cells: 1
Missing cells (%): 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.6 KiB
Average record size in memory: 90.8 B
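
The missing-cell percentage follows directly from the counts above. A minimal sketch of that arithmetic, assuming the percentage is missing cells divided by variables times observations (the variable names are illustrative, not taken from the report):

```python
# Counts copied from the dataset statistics above.
n_variables = 11
n_observations = 74
missing_cells = 1

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")  # 814 cells, 0.1% missing
```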

Variable types

Numeric: 1
Text: 3
Boolean: 6
DateTime: 1

Dataset

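Description: Open data on medical institutions providing national cancer screening in Namdong-gu. Columns: 연번 (serial number), 행정동 (administrative district), 검진기관명 (institution name), 전화번호 (phone number), 소재지주소 (address), 위암검진여부 (stomach cancer screening offered), 간암검진여부 (liver cancer screening offered), 대장암검진여부 (colorectal cancer screening offered), 유방암검진여부 (breast cancer screening offered), 자궁경부암검진여부 (cervical cancer screening offered), 폐암검진여부 (lung cancer screening offered), 기준일자 (reference date).
URL: https://www.data.go.kr/data/15067995/fileData.do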

Alerts

기준일자 has constant value "2023-05-12" (Constant)
위암검진여부 is highly overall correlated with 간암검진여부 and 1 other field (High correlation)
간암검진여부 is highly overall correlated with 위암검진여부 and 1 other field (High correlation)
대장암검진여부 is highly overall correlated with 위암검진여부 and 2 other fields (High correlation)
유방암검진여부 is highly overall correlated with 대장암검진여부 (High correlation)
폐암검진여부 is highly imbalanced (82.1%) (Imbalance)
전화번호 has 1 (1.4%) missing value (Missing)
연번 has unique values (Unique)
검진기관명 has unique values (Unique)
소재지주소 has unique values (Unique)
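
The 82.1% imbalance figure for 폐암검진여부 can be reproduced from its class counts (72 False, 2 True). The sketch below assumes the score is one minus the normalized Shannon entropy. That definition yields the reported figure, but the report itself does not state the formula, and `imbalance_score` is a hypothetical helper name.

```python
from math import log2

def imbalance_score(counts):
    """Return 1 - (Shannon entropy in bits) / log2(number of classes).

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(n_classes)

# Class counts for 폐암검진여부 as listed in its variable section.
score = imbalance_score([72, 2])
print(f"{score:.1%}")  # 82.1%
```

A perfectly balanced column, for example counts of 37 and 37, scores 0.0 under the same definition.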

Reproduction

Analysis started: 2023-12-12 15:14:42.013787
Analysis finished: 2023-12-12 15:14:43.148010
Duration: 1.13 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 74
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.5
Minimum: 1
Maximum: 74
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 798.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.65
Q1: 19.25
Median: 37.5
Q3: 55.75
95-th percentile: 70.35
Maximum: 74
Range: 73
Interquartile range (IQR): 36.5
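
Because 연번 is strictly increasing with 74 distinct values from 1 to 74, the quantiles above can be checked without the original file. A minimal sketch, assuming the column is exactly the integers 1 through 74 and that percentiles use linear interpolation between closest ranks (the `inclusive` method of Python's `statistics.quantiles`):

```python
from statistics import quantiles

# Assumption: 연번 is the sequence 1, 2, ..., 74 (74 distinct values, min 1, max 74).
values = list(range(1, 75))

# Quartiles with linear interpolation that includes both endpoints.
q1, q2, q3 = quantiles(values, n=4, method="inclusive")

# Cutting into 20 groups gives the 5th and 95th percentiles as the first and last cut points.
twentieths = quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

print(p5, q1, q2, q3, p95)   # 4.65 19.25 37.5 55.75 70.35
print("IQR:", q3 - q1)       # IQR: 36.5
print("Range:", max(values) - min(values))  # Range: 73
```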

Descriptive statistics

Standard deviation: 21.505813
Coefficient of variation (CV): 0.57348835
Kurtosis: -1.2
Mean: 37.5
Median Absolute Deviation (MAD): 18.5
Skewness: 0
Sum: 2775
Variance: 462.5
Monotonicity: Strictly increasing
Histogram with fixed size bins (bins=50)
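
The descriptive statistics for 연번 can be checked the same way. The sketch below again assumes the column is the integers 1 through 74 and uses the sample (n - 1) variance, which is the convention that yields 462.5. Kurtosis and skewness are left out because the standard library has no direct function for them.

```python
from statistics import mean, median, stdev, variance

# Assumption: 연번 is the sequence 1, 2, ..., 74.
values = list(range(1, 75))

mu = mean(values)
var = variance(values)      # sample variance, n - 1 in the denominator
sd = stdev(values)          # square root of the sample variance
cv = sd / mu                # coefficient of variation
med = median(values)
mad = median(abs(v - med) for v in values)  # median absolute deviation

print("Sum:", sum(values))  # Sum: 2775
print("Mean:", mu)          # Mean: 37.5
print("Variance:", var)     # Variance: 462.5
print("MAD:", mad)          # MAD: 18.5
print(f"Std: {sd:.6f}, CV: {cv:.8f}")
```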
Value | Count | Frequency (%)
1 | 1 | 1.4%
57 | 1 | 1.4%
55 | 1 | 1.4%
54 | 1 | 1.4%
53 | 1 | 1.4%
52 | 1 | 1.4%
51 | 1 | 1.4%
50 | 1 | 1.4%
49 | 1 | 1.4%
48 | 1 | 1.4%
Other values (64) | 64 | 86.5%

Value | Count | Frequency (%)
1 | 1 | 1.4%
2 | 1 | 1.4%
3 | 1 | 1.4%
4 | 1 | 1.4%
5 | 1 | 1.4%
6 | 1 | 1.4%
7 | 1 | 1.4%
8 | 1 | 1.4%
9 | 1 | 1.4%
10 | 1 | 1.4%

Value | Count | Frequency (%)
74 | 1 | 1.4%
73 | 1 | 1.4%
72 | 1 | 1.4%
71 | 1 | 1.4%
70 | 1 | 1.4%
69 | 1 | 1.4%
68 | 1 | 1.4%
67 | 1 | 1.4%
66 | 1 | 1.4%
65 | 1 | 1.4%

검진기관명
Text

UNIQUE 

Distinct: 74
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 18
Median length: 12
Mean length: 7.6756757
Min length: 3

Characters and Unicode

Total characters: 568
Distinct characters: 138
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
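
As an illustration of the block property, the sketch below classifies each character of the first sample name by code point range. The `unicode_block` helper is hypothetical and covers only the two blocks reported for this column (ASCII and Hangul); it is not how the profiling tool performs the lookup.

```python
from collections import Counter

def unicode_block(ch):
    """Coarse block lookup limited to the two blocks reported for this column."""
    code = ord(ch)
    if code < 0x80:
        return "ASCII"
    if 0xAC00 <= code <= 0xD7AF:  # Hangul Syllables block
        return "Hangul"
    return "Other"

# First sample value of 검진기관명.
name = "21세기미소내과의원"
blocks = Counter(unicode_block(ch) for ch in name)
print(blocks)  # Counter({'Hangul': 8, 'ASCII': 2})
```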

Unique

Unique: 74
Unique (%): 100.0%

Sample

1st row: 21세기미소내과의원
2nd row: 국제바로병원
3rd row: 나은요양병원
4th row: 논현사랑내과의원
5th row: 논현새로운내과의원
Value | Count | Frequency (%)
21세기미소내과의원 | 1 | 1.3%
길의료재단 | 1 | 1.3%
인하내과의원 | 1 | 1.3%
인천힘찬종합병원 | 1 | 1.3%
인천아시아드병원 | 1 | 1.3%
인천속내과의원 | 1 | 1.3%
인구보건복지협회인천지회가족보건의원 | 1 | 1.3%
이광래내과의원 | 1 | 1.3%
길병원 | 1 | 1.3%
의료법인 | 1 | 1.3%
Other values (66) | 66 | 86.8%

Most occurring characters

ValueCountFrequency (%)
77
 
13.6%
68
 
12.0%
55
 
9.7%
42
 
7.4%
18
 
3.2%
10
 
1.8%
10
 
1.8%
10
 
1.8%
9
 
1.6%
9
 
1.6%
Other values (128) 260
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 564
99.3%
Space Separator 2
 
0.4%
Decimal Number 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
77
 
13.7%
68
 
12.1%
55
 
9.8%
42
 
7.4%
18
 
3.2%
10
 
1.8%
10
 
1.8%
10
 
1.8%
9
 
1.6%
9
 
1.6%
Other values (125) 256
45.4%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 564
99.3%
Common 4
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
77
 
13.7%
68
 
12.1%
55
 
9.8%
42
 
7.4%
18
 
3.2%
10
 
1.8%
10
 
1.8%
10
 
1.8%
9
 
1.6%
9
 
1.6%
Other values (125) 256
45.4%
Common
ValueCountFrequency (%)
2
50.0%
2 1
25.0%
1 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 564
99.3%
ASCII 4
 
0.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
77
 
13.7%
68
 
12.1%
55
 
9.8%
42
 
7.4%
18
 
3.2%
10
 
1.8%
10
 
1.8%
10
 
1.8%
9
 
1.6%
9
 
1.6%
Other values (125) 256
45.4%
ASCII
ValueCountFrequency (%)
2
50.0%
2 1
25.0%
1 1
25.0%

전화번호
Text

MISSING 

Distinct: 73
Distinct (%): 100.0%
Missing: 1
Missing (%): 1.4%
Memory size: 724.0 B

Length

Max length: 13
Median length: 12
Mean length: 11.890411
Min length: 9

Characters and Unicode

Total characters: 868
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
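
The two categories reported for this column (Decimal Number and Dash Punctuation) can be seen by looking up the general category of each character. A minimal sketch using Python's `unicodedata` module on the five sample phone numbers listed in this section; the profiling tool's own lookup may differ.

```python
import unicodedata
from collections import Counter

# The five sample values of 전화번호 shown in this section.
phones = [
    "032-431-7715",
    "032-722-8585",
    "032-710-6001",
    "032-433-9900",
    "032-424-3334",
]

# "Nd" is Decimal Number, "Pd" is Dash Punctuation.
categories = Counter(unicodedata.category(ch) for p in phones for ch in p)
print(categories)  # Counter({'Nd': 50, 'Pd': 10})
```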

Unique

Unique: 73
Unique (%): 100.0%

Sample

1st row: 032-431-7715
2nd row: 032-722-8585
3rd row: 032-710-6001
4th row: 032-433-9900
5th row: 032-424-3334
Value | Count | Frequency (%)
032-431-7715 | 1 | 1.4%
032-422-7580 | 1 | 1.4%
032-861-3335 | 1 | 1.4%
032-466-0101 | 1 | 1.4%
032-1899-2220 | 1 | 1.4%
032-222-7575 | 1 | 1.4%
032-431-0119 | 1 | 1.4%
032-451-4000 | 1 | 1.4%
032-891-3456 | 1 | 1.4%
032-426-9275 | 1 | 1.4%
Other values (63) | 63 | 86.3%

Most occurring characters

ValueCountFrequency (%)
- 143
16.5%
0 126
14.5%
3 121
13.9%
2 117
13.5%
4 81
9.3%
7 66
7.6%
5 57
 
6.6%
1 53
 
6.1%
6 48
 
5.5%
8 31
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 725
83.5%
Dash Punctuation 143
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 126
17.4%
3 121
16.7%
2 117
16.1%
4 81
11.2%
7 66
9.1%
5 57
7.9%
1 53
7.3%
6 48
 
6.6%
8 31
 
4.3%
9 25
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 143
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 868
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 143
16.5%
0 126
14.5%
3 121
13.9%
2 117
13.5%
4 81
9.3%
7 66
7.6%
5 57
 
6.6%
1 53
 
6.1%
6 48
 
5.5%
8 31
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 868
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 143
16.5%
0 126
14.5%
3 121
13.9%
2 117
13.5%
4 81
9.3%
7 66
7.6%
5 57
 
6.6%
1 53
 
6.1%
6 48
 
5.5%
8 31
 
3.6%

소재지주소
Text

UNIQUE 

Distinct: 74
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B

Length

Max length: 64
Median length: 44
Mean length: 36.189189
Min length: 23

Characters and Unicode

Total characters: 2678
Distinct characters: 158
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 74
Unique (%): 100.0%

Sample

1st row: 인천광역시 남동구 백범로 322 2층 (간석동, 진메디칼센터)
2nd row: 인천광역시 남동구 석정로 518 2층일부~9층 (간석동)
3rd row: 인천광역시 남동구 소래역남로16번길 20 둘리프라자 (논현동)
4th row: 인천광역시 남동구 논현역로 8 4층 일부호 (논현동, 아이엠프라자)
5th row: 인천광역시 남동구 논고개로 87 논현메디스타워 401,402,405,406.407호 (논현동)
Value | Count | Frequency (%)
인천광역시 | 74 | 14.4%
남동구 | 74 | 14.4%
구월동 | 23 | 4.5%
2층 | 16 | 3.1%
간석동 | 16 | 3.1%
논현동 | 13 | 2.5%
만수동 | 12 | 2.3%
3층 | 7 | 1.4%
구월로 | 7 | 1.4%
호구포로 | 7 | 1.4%
Other values (197) | 264 | 51.5%

Most occurring characters

ValueCountFrequency (%)
439
 
16.4%
158
 
5.9%
112
 
4.2%
92
 
3.4%
90
 
3.4%
2 80
 
3.0%
78
 
2.9%
77
 
2.9%
76
 
2.8%
76
 
2.8%
Other values (148) 1400
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1524
56.9%
Decimal Number 471
 
17.6%
Space Separator 439
 
16.4%
Close Punctuation 74
 
2.8%
Open Punctuation 74
 
2.8%
Other Punctuation 69
 
2.6%
Math Symbol 13
 
0.5%
Dash Punctuation 10
 
0.4%
Uppercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
158
 
10.4%
112
 
7.3%
92
 
6.0%
90
 
5.9%
78
 
5.1%
77
 
5.1%
76
 
5.0%
76
 
5.0%
74
 
4.9%
56
 
3.7%
Other values (129) 635
41.7%
Decimal Number
ValueCountFrequency (%)
2 80
17.0%
1 68
14.4%
0 63
13.4%
3 62
13.2%
4 48
10.2%
7 38
8.1%
5 36
7.6%
8 36
7.6%
6 23
 
4.9%
9 17
 
3.6%
Other Punctuation
ValueCountFrequency (%)
, 68
98.6%
. 1
 
1.4%
Uppercase Letter
ValueCountFrequency (%)
A 3
75.0%
C 1
 
25.0%
Space Separator
ValueCountFrequency (%)
439
100.0%
Close Punctuation
ValueCountFrequency (%)
) 74
100.0%
Open Punctuation
ValueCountFrequency (%)
( 74
100.0%
Math Symbol
ValueCountFrequency (%)
~ 13
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1524
56.9%
Common 1150
42.9%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
158
 
10.4%
112
 
7.3%
92
 
6.0%
90
 
5.9%
78
 
5.1%
77
 
5.1%
76
 
5.0%
76
 
5.0%
74
 
4.9%
56
 
3.7%
Other values (129) 635
41.7%
Common
ValueCountFrequency (%)
439
38.2%
2 80
 
7.0%
) 74
 
6.4%
( 74
 
6.4%
1 68
 
5.9%
, 68
 
5.9%
0 63
 
5.5%
3 62
 
5.4%
4 48
 
4.2%
7 38
 
3.3%
Other values (7) 136
 
11.8%
Latin
ValueCountFrequency (%)
A 3
75.0%
C 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1524
56.9%
ASCII 1154
43.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
439
38.0%
2 80
 
6.9%
) 74
 
6.4%
( 74
 
6.4%
1 68
 
5.9%
, 68
 
5.9%
0 63
 
5.5%
3 62
 
5.4%
4 48
 
4.2%
7 38
 
3.3%
Other values (9) 140
 
12.1%
Hangul
ValueCountFrequency (%)
158
 
10.4%
112
 
7.3%
92
 
6.0%
90
 
5.9%
78
 
5.1%
77
 
5.1%
76
 
5.0%
76
 
5.0%
74
 
4.9%
56
 
3.7%
Other values (129) 635
41.7%

위암검진여부
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 206.0 B

Value | Count | Frequency (%)
True | 57 | 77.0%
False | 17 | 23.0%

간암검진여부
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 206.0 B

Value | Count | Frequency (%)
True | 61 | 82.4%
False | 13 | 17.6%

대장암검진여부
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 206.0 B

Value | Count | Frequency (%)
True | 54 | 73.0%
False | 20 | 27.0%

유방암검진여부
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 206.0 B

Value | Count | Frequency (%)
True | 44 | 59.5%
False | 30 | 40.5%

자궁경부암검진여부
Boolean

Distinct: 2
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 206.0 B

Value | Count | Frequency (%)
True | 52 | 70.3%
False | 22 | 29.7%

폐암검진여부
Boolean

IMBALANCE 

Distinct: 2
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Memory size: 206.0 B

Value | Count | Frequency (%)
False | 72 | 97.3%
True | 2 | 2.7%

기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 724.0 B
Minimum: 2023-05-12 00:00:00
Maximum: 2023-05-12 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

 | 연번 | 검진기관명 | 전화번호 | 소재지주소 | 위암검진여부 | 간암검진여부 | 대장암검진여부 | 유방암검진여부 | 자궁경부암검진여부 | 폐암검진여부
연번 | 1.000 | 1.000 | 1.000 | 1.000 | 0.517 | 0.421 | 0.458 | 0.247 | 0.000 | 0.455
검진기관명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
위암검진여부 | 0.517 | 1.000 | 1.000 | 1.000 | 1.000 | 0.951 | 0.976 | 0.693 | 0.000 | 0.000
간암검진여부 | 0.421 | 1.000 | 1.000 | 1.000 | 0.951 | 1.000 | 0.901 | 0.539 | 0.362 | 0.000
대장암검진여부 | 0.458 | 1.000 | 1.000 | 1.000 | 0.976 | 0.901 | 1.000 | 0.785 | 0.000 | 0.000
유방암검진여부 | 0.247 | 1.000 | 1.000 | 1.000 | 0.693 | 0.539 | 0.785 | 1.000 | 0.643 | 0.000
자궁경부암검진여부 | 0.000 | 1.000 | 1.000 | 1.000 | 0.000 | 0.362 | 0.000 | 0.643 | 1.000 | 0.000
폐암검진여부 | 0.455 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000

 | 간암검진여부 | 대장암검진여부 | 폐암검진여부 | 유방암검진여부 | 자궁경부암검진여부 | 위암검진여부
간암검진여부 | 1.000 | 0.714 | 0.000 | 0.362 | 0.235 | 0.800
대장암검진여부 | 0.714 | 1.000 | 0.000 | 0.574 | 0.000 | 0.859
폐암검진여부 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000
유방암검진여부 | 0.362 | 0.574 | 0.000 | 1.000 | 0.444 | 0.487
자궁경부암검진여부 | 0.235 | 0.000 | 0.000 | 0.444 | 1.000 | 0.000
위암검진여부 | 0.800 | 0.859 | 0.000 | 0.487 | 0.000 | 1.000

 | 연번 | 위암검진여부 | 간암검진여부 | 대장암검진여부 | 유방암검진여부 | 자궁경부암검진여부 | 폐암검진여부
연번 | 1.000 | 0.373 | 0.302 | 0.330 | 0.174 | 0.000 | 0.328
위암검진여부 | 0.373 | 1.000 | 0.800 | 0.859 | 0.487 | 0.000 | 0.000
간암검진여부 | 0.302 | 0.800 | 1.000 | 0.714 | 0.362 | 0.235 | 0.000
대장암검진여부 | 0.330 | 0.859 | 0.714 | 1.000 | 0.574 | 0.000 | 0.000
유방암검진여부 | 0.174 | 0.487 | 0.362 | 0.574 | 1.000 | 0.444 | 0.000
자궁경부암검진여부 | 0.000 | 0.000 | 0.235 | 0.000 | 0.444 | 1.000 | 0.000
폐암검진여부 | 0.328 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000
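
The report does not state here which coefficient each matrix uses. As one illustration of how association between two Boolean screening flags can be measured, the sketch below computes an uncorrected Cramér's V. Both the choice of coefficient and the `cramers_v` helper are assumptions. The inputs are the Y/N flags of the first ten rows in the Sample section, so the results will not match the 74-row figures above.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two categorical sequences of equal length."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return 0.0  # a constant column carries no association
    chi2 = 0.0
    for a, row_n in row_totals.items():
        for b, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))

# Flags for the first ten sample rows (연번 1 to 10), read from the Sample section.
stomach = list("NYYYYYYYNN")     # 위암검진여부
colorectal = list("NYYYYYYYNN")  # 대장암검진여부
breast = list("NYNYYYYNYN")      # 유방암검진여부

print(round(cramers_v(stomach, colorectal), 3))  # columns coincide in these rows
print(round(cramers_v(stomach, breast), 3))
```

On these ten rows the 위암검진여부 and 대장암검진여부 flags are identical, so V is 1 up to floating-point rounding, while 위암검진여부 and 유방암검진여부 show a weaker association.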

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

index | 연번 | 검진기관명 | 전화번호 | 소재지주소 | 위암검진여부 | 간암검진여부 | 대장암검진여부 | 유방암검진여부 | 자궁경부암검진여부 | 폐암검진여부 | 기준일자
0 | 1 | 21세기미소내과의원 | 032-431-7715 | 인천광역시 남동구 백범로 322 2층 (간석동, 진메디칼센터) | N | Y | N | N | N | N | 2023-05-12
1 | 2 | 국제바로병원 | 032-722-8585 | 인천광역시 남동구 석정로 518 2층일부~9층 (간석동) | Y | Y | Y | Y | Y | N | 2023-05-12
2 | 3 | 나은요양병원 | 032-710-6001 | 인천광역시 남동구 소래역남로16번길 20 둘리프라자 (논현동) | Y | Y | Y | N | N | N | 2023-05-12
3 | 4 | 논현사랑내과의원 | 032-433-9900 | 인천광역시 남동구 논현역로 8 4층 일부호 (논현동, 아이엠프라자) | Y | Y | Y | Y | Y | N | 2023-05-12
4 | 5 | 논현새로운내과의원 | 032-424-3334 | 인천광역시 남동구 논고개로 87 논현메디스타워 401,402,405,406.407호 (논현동) | Y | Y | Y | Y | Y | N | 2023-05-12
5 | 6 | 논현이이주내과의원 | 032-432-7114 | 인천광역시 남동구 논고개로123번길 17 407~411호 (논현동, 아이플렉스) | Y | Y | Y | Y | N | N | 2023-05-12
6 | 7 | 논현탑내과의원 | 032-433-1175 | 인천광역시 남동구 논고개로 61 4층 403호 (논현동, 라피에스타) | Y | Y | Y | Y | Y | N | 2023-05-12
7 | 8 | 늘푸른내과의원 | 032-441-0096 | 인천광역시 남동구 논고개로 325 3층 (도림동, 명례빌딩) | Y | Y | Y | N | Y | N | 2023-05-12
8 | 9 | 다정산부인과의원 | 032-463-5500 | 인천광역시 남동구 호구포로 826 (구월동) | N | N | N | Y | Y | N | 2023-05-12
9 | 10 | 대찬병원 | 1522-3266 | 인천광역시 남동구 인주대로 590 1층일부, 2~8층, 9층일부 (구월동) | N | Y | N | N | N | N | 2023-05-12

index | 연번 | 검진기관명 | 전화번호 | 소재지주소 | 위암검진여부 | 간암검진여부 | 대장암검진여부 | 유방암검진여부 | 자궁경부암검진여부 | 폐암검진여부 | 기준일자
64 | 65 | 참조은내과의원 | 032-446-1273 | 인천광역시 남동구 에코중앙로156번길 5-19 2층 (논현동) | Y | Y | N | N | N | N | 2023-05-12
65 | 66 | 최앤박내과외과의원 | 032-441-7175 | 인천광역시 남동구 논고개로 114 에이스타워 401,402,403,502호 (논현동) | Y | Y | Y | Y | Y | N | 2023-05-12
66 | 67 | 추원석내과의원 | 032-422-3332 | 인천광역시 남동구 백범로 374 (간석동) | Y | Y | Y | N | N | N | 2023-05-12
67 | 68 | 파티마의원 | 032-471-9944 | 인천광역시 남동구 만수로 7 2,3층 (만수동) | Y | Y | N | N | N | N | 2023-05-12
68 | 69 | 한마음산부인과의원 | 032-467-3687 | 인천광역시 남동구 인주대로 865 2층 (만수동, 승창빌딩) | N | N | N | N | Y | N | 2023-05-12
69 | 70 | 한빛산부인과의원 | 032-466-3575 | 인천광역시 남동구 서창남로 77 301,302호 (서창동) | N | N | N | N | Y | N | 2023-05-12
70 | 71 | 한사랑의원 | 032-466-8275 | 인천광역시 남동구 만수로 107 2층 (만수동, 은성프라자) | N | Y | N | N | Y | N | 2023-05-12
71 | 72 | 해밀병원 | 032-427-1175 | 인천광역시 남동구 백범로 403 A동 지1층~지상 5층 (간석동) | Y | Y | Y | Y | Y | N | 2023-05-12
72 | 73 | 향촌사랑내과의원 | 032-462-3377 | 인천광역시 남동구 만수서로 56 303호 (만수동, 향촌메디칼) | Y | Y | Y | N | N | N | 2023-05-12
73 | 74 | 휴내과의원 | 032-437-5111 | 인천광역시 남동구 인하로 497-5 8층 801호 (구월동, 푸른세상안과빌딩) | Y | Y | Y | Y | Y | N | 2023-05-12