Overview

Dataset statistics

Number of variables4
Number of observations65
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory35.0 B

Variable types

Numeric1
Categorical1
Text2

Dataset

Description인천광역시 외국인환자 유치사업자 현황에 대한 데이터로 인천시 외국인환자 유치사업자의 상호 및 주소에 대한 정보를 제공합니다
Author인천광역시
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15048402&srcSe=7661IVAWM27C61E190

Alerts

연번 has unique valuesUnique
기업명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2024-03-13 05:39:31.052379
Analysis finished2024-03-13 05:39:31.592871
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct65
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33
Minimum1
Maximum65
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size717.0 B
2024-03-13T14:39:31.664382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.2
Q117
median33
Q349
95-th percentile61.8
Maximum65
Range64
Interquartile range (IQR)32

Descriptive statistics

Standard deviation18.90767
Coefficient of variation (CV)0.57295971
Kurtosis-1.2
Mean33
Median Absolute Deviation (MAD)16
Skewness0
Sum2145
Variance357.5
MonotonicityStrictly increasing
2024-03-13T14:39:31.797758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.5%
50 1
 
1.5%
36 1
 
1.5%
37 1
 
1.5%
38 1
 
1.5%
39 1
 
1.5%
40 1
 
1.5%
41 1
 
1.5%
42 1
 
1.5%
43 1
 
1.5%
Other values (55) 55
84.6%
ValueCountFrequency (%)
1 1
1.5%
2 1
1.5%
3 1
1.5%
4 1
1.5%
5 1
1.5%
6 1
1.5%
7 1
1.5%
8 1
1.5%
9 1
1.5%
10 1
1.5%
ValueCountFrequency (%)
65 1
1.5%
64 1
1.5%
63 1
1.5%
62 1
1.5%
61 1
1.5%
60 1
1.5%
59 1
1.5%
58 1
1.5%
57 1
1.5%
56 1
1.5%

군구명
Categorical

Distinct8
Distinct (%)12.3%
Missing0
Missing (%)0.0%
Memory size652.0 B
연수구
19 
중구
12 
서구
부평구
미추홀구
Other values (3)
10 

Length

Max length4
Median length3
Mean length2.7692308
Min length2

Unique

Unique1 ?
Unique (%)1.5%

Sample

1st row서구
2nd row부평구
3rd row미추홀구
4th row서구
5th row연수구

Common Values

ValueCountFrequency (%)
연수구 19
29.2%
중구 12
18.5%
서구 9
13.8%
부평구 9
13.8%
미추홀구 6
 
9.2%
남동구 5
 
7.7%
계양구 4
 
6.2%
강화군 1
 
1.5%

Length

2024-03-13T14:39:31.945434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T14:39:32.059094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연수구 19
29.2%
중구 12
18.5%
서구 9
13.8%
부평구 9
13.8%
미추홀구 6
 
9.2%
남동구 5
 
7.7%
계양구 4
 
6.2%
강화군 1
 
1.5%

기업명
Text

UNIQUE 

Distinct65
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size652.0 B
2024-03-13T14:39:32.313843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length14
Mean length8.9538462
Min length2

Characters and Unicode

Total characters582
Distinct characters172
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)100.0%

Sample

1st row메디엔인터내셔날
2nd row조이풀(JOYFUL)
3rd row주식회사 케이바이오
4th row㈜대윤산업
5th row포에버 차이나(4ever china)
ValueCountFrequency (%)
주식회사 23
 
22.3%
메디컬 2
 
1.9%
메디엔인터내셔날 1
 
1.0%
코리아베스트닥터 1
 
1.0%
세나스메디케어 1
 
1.0%
알지팩토리 1
 
1.0%
위투어코리아 1
 
1.0%
하트너 1
 
1.0%
유어코리아 1
 
1.0%
주)마루인스퍼레이션 1
 
1.0%
Other values (70) 70
68.0%
2024-03-13T14:39:32.738051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
6.5%
30
 
5.2%
27
 
4.6%
23
 
4.0%
23
 
4.0%
22
 
3.8%
19
 
3.3%
( 15
 
2.6%
) 14
 
2.4%
14
 
2.4%
Other values (162) 357
61.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 446
76.6%
Lowercase Letter 42
 
7.2%
Space Separator 38
 
6.5%
Uppercase Letter 19
 
3.3%
Open Punctuation 15
 
2.6%
Close Punctuation 14
 
2.4%
Other Symbol 4
 
0.7%
Other Punctuation 3
 
0.5%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
6.7%
27
 
6.1%
23
 
5.2%
23
 
5.2%
22
 
4.9%
19
 
4.3%
14
 
3.1%
12
 
2.7%
12
 
2.7%
12
 
2.7%
Other values (128) 252
56.5%
Lowercase Letter
ValueCountFrequency (%)
e 9
21.4%
n 6
14.3%
r 5
11.9%
o 4
9.5%
a 4
9.5%
t 3
 
7.1%
i 2
 
4.8%
v 2
 
4.8%
l 2
 
4.8%
g 1
 
2.4%
Other values (4) 4
9.5%
Uppercase Letter
ValueCountFrequency (%)
C 3
15.8%
L 2
10.5%
O 2
10.5%
I 2
10.5%
H 2
10.5%
R 1
 
5.3%
T 1
 
5.3%
U 1
 
5.3%
F 1
 
5.3%
Y 1
 
5.3%
Other values (3) 3
15.8%
Other Punctuation
ValueCountFrequency (%)
. 2
66.7%
, 1
33.3%
Space Separator
ValueCountFrequency (%)
38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Decimal Number
ValueCountFrequency (%)
4 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 450
77.3%
Common 71
 
12.2%
Latin 61
 
10.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
6.7%
27
 
6.0%
23
 
5.1%
23
 
5.1%
22
 
4.9%
19
 
4.2%
14
 
3.1%
12
 
2.7%
12
 
2.7%
12
 
2.7%
Other values (129) 256
56.9%
Latin
ValueCountFrequency (%)
e 9
14.8%
n 6
 
9.8%
r 5
 
8.2%
o 4
 
6.6%
a 4
 
6.6%
C 3
 
4.9%
t 3
 
4.9%
i 2
 
3.3%
L 2
 
3.3%
O 2
 
3.3%
Other values (17) 21
34.4%
Common
ValueCountFrequency (%)
38
53.5%
( 15
 
21.1%
) 14
 
19.7%
. 2
 
2.8%
, 1
 
1.4%
4 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 446
76.6%
ASCII 132
 
22.7%
None 4
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38
28.8%
( 15
 
11.4%
) 14
 
10.6%
e 9
 
6.8%
n 6
 
4.5%
r 5
 
3.8%
o 4
 
3.0%
a 4
 
3.0%
C 3
 
2.3%
t 3
 
2.3%
Other values (23) 31
23.5%
Hangul
ValueCountFrequency (%)
30
 
6.7%
27
 
6.1%
23
 
5.2%
23
 
5.2%
22
 
4.9%
19
 
4.3%
14
 
3.1%
12
 
2.7%
12
 
2.7%
12
 
2.7%
Other values (128) 252
56.5%
None
ValueCountFrequency (%)
4
100.0%

주소
Text

UNIQUE 

Distinct65
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size652.0 B
2024-03-13T14:39:33.091420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length48
Mean length39.984615
Min length26

Characters and Unicode

Total characters2599
Distinct characters237
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)100.0%

Sample

1st row인천광역시 서구 봉오재2로 37 (가정동, 엘에이치웨스턴블루힐) 210-1402
2nd row인천광역시 부평구 장제로 99 (부평동, 벨라루체) 벨라루체 1508호
3rd row인천광역시 미추홀구 경인로 305번길 17, 2층 205-1호(도화동)
4th row인천 서구 원석로196번길 16 (원창동) (원창동)
5th row인천광역시 연수구 대암로 39-1, 4층(옥련동, 대영빌딩)
ValueCountFrequency (%)
인천광역시 58
 
12.0%
연수구 19
 
3.9%
송도동 14
 
2.9%
중구 12
 
2.5%
서구 9
 
1.9%
부평구 9
 
1.9%
인천 7
 
1.4%
미추홀구 5
 
1.0%
남동구 5
 
1.0%
부평동 5
 
1.0%
Other values (264) 342
70.5%
2024-03-13T14:39:33.564683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
454
 
17.5%
1 91
 
3.5%
84
 
3.2%
80
 
3.1%
2 72
 
2.8%
71
 
2.7%
71
 
2.7%
, 67
 
2.6%
67
 
2.6%
( 66
 
2.5%
Other values (227) 1476
56.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1456
56.0%
Space Separator 454
 
17.5%
Decimal Number 445
 
17.1%
Other Punctuation 67
 
2.6%
Open Punctuation 66
 
2.5%
Close Punctuation 66
 
2.5%
Uppercase Letter 24
 
0.9%
Dash Punctuation 17
 
0.7%
Lowercase Letter 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
84
 
5.8%
80
 
5.5%
71
 
4.9%
71
 
4.9%
67
 
4.6%
66
 
4.5%
59
 
4.1%
59
 
4.1%
43
 
3.0%
38
 
2.6%
Other values (196) 818
56.2%
Uppercase Letter
ValueCountFrequency (%)
B 6
25.0%
C 3
12.5%
I 3
12.5%
T 2
 
8.3%
L 2
 
8.3%
E 2
 
8.3%
H 1
 
4.2%
A 1
 
4.2%
R 1
 
4.2%
F 1
 
4.2%
Other values (2) 2
 
8.3%
Decimal Number
ValueCountFrequency (%)
1 91
20.4%
2 72
16.2%
0 49
11.0%
3 48
10.8%
4 38
8.5%
6 37
8.3%
7 31
 
7.0%
5 31
 
7.0%
8 27
 
6.1%
9 21
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
s 1
25.0%
a 1
25.0%
e 1
25.0%
t 1
25.0%
Space Separator
ValueCountFrequency (%)
454
100.0%
Other Punctuation
ValueCountFrequency (%)
, 67
100.0%
Open Punctuation
ValueCountFrequency (%)
( 66
100.0%
Close Punctuation
ValueCountFrequency (%)
) 66
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1456
56.0%
Common 1115
42.9%
Latin 28
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
84
 
5.8%
80
 
5.5%
71
 
4.9%
71
 
4.9%
67
 
4.6%
66
 
4.5%
59
 
4.1%
59
 
4.1%
43
 
3.0%
38
 
2.6%
Other values (196) 818
56.2%
Latin
ValueCountFrequency (%)
B 6
21.4%
C 3
10.7%
I 3
10.7%
T 2
 
7.1%
L 2
 
7.1%
E 2
 
7.1%
s 1
 
3.6%
a 1
 
3.6%
H 1
 
3.6%
A 1
 
3.6%
Other values (6) 6
21.4%
Common
ValueCountFrequency (%)
454
40.7%
1 91
 
8.2%
2 72
 
6.5%
, 67
 
6.0%
( 66
 
5.9%
) 66
 
5.9%
0 49
 
4.4%
3 48
 
4.3%
4 38
 
3.4%
6 37
 
3.3%
Other values (5) 127
 
11.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1456
56.0%
ASCII 1143
44.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
454
39.7%
1 91
 
8.0%
2 72
 
6.3%
, 67
 
5.9%
( 66
 
5.8%
) 66
 
5.8%
0 49
 
4.3%
3 48
 
4.2%
4 38
 
3.3%
6 37
 
3.2%
Other values (21) 155
 
13.6%
Hangul
ValueCountFrequency (%)
84
 
5.8%
80
 
5.5%
71
 
4.9%
71
 
4.9%
67
 
4.6%
66
 
4.5%
59
 
4.1%
59
 
4.1%
43
 
3.0%
38
 
2.6%
Other values (196) 818
56.2%

Interactions

2024-03-13T14:39:31.355649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T14:39:33.685253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번군구명기업명주소
연번1.0000.3401.0001.000
군구명0.3401.0001.0001.000
기업명1.0001.0001.0001.000
주소1.0001.0001.0001.000
2024-03-13T14:39:33.793215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번군구명
연번1.0000.159
군구명0.1591.000

Missing values

2024-03-13T14:39:31.484377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T14:39:31.563699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번군구명기업명주소
01서구메디엔인터내셔날인천광역시 서구 봉오재2로 37 (가정동, 엘에이치웨스턴블루힐) 210-1402
12부평구조이풀(JOYFUL)인천광역시 부평구 장제로 99 (부평동, 벨라루체) 벨라루체 1508호
23미추홀구주식회사 케이바이오인천광역시 미추홀구 경인로 305번길 17, 2층 205-1호(도화동)
34서구㈜대윤산업인천 서구 원석로196번길 16 (원창동) (원창동)
45연수구포에버 차이나(4ever china)인천광역시 연수구 대암로 39-1, 4층(옥련동, 대영빌딩)
56연수구㈜티에이치인터내셔널(TH International Co., Ltd.)인천광역시 연수구 송도과학로 32, 엠902호(송도동, 송도테크노파크 IT센터)
67남동구메디코퍼인천광역시 남동구 인주대로 588, 6층 일부 (구월동, 세정빌딩)
78부평구리치코((RICHCO)인천광역시 부평구 부평대로 293, B117, 1104호
89중구주식회사 한스글로발인천광역시 중구 제물량로135번길 7, 1층(답동)
910미추홀구주식회사 제이제트그룹인천광역시 미추홀구 경인로 45 (숭의동) 3층, 드림타워
연번군구명기업명주소
5556서구다리인천광역시 서구 원당대로 848 (당하동, 유일프라자) 602호 B04
5657서구헤이 코리아인천광역시 서구 봉오재3로 120 (가정동) 가정역 봄2 프라자 7층 720호
5758부평구미래인천광역시 부평구 부평대로 68 (부평동) 동일빌딩 5층
5859중구(주) 에어맨인천광역시 중구 공항로 272 (운서동, 인천공항) 제1여객터미널 2025호 (22382)
5960중구씨코리아해운(주)인천광역시 중구 신포로 8 (사동, 농협,윤스골크,구몬,국민연금) 601호
6061남동구인천여행사인천광역시 남동구 논고개로123번길 35 (논현동, 칼리오페) 7층, 704-1호
6162계양구온유인천광역시 계양구 오조산로21번길 5-9 (작전동) 2층 일부
6263연수구위브이아이피트래블인천광역시 연수구 컨벤시아대로 165 (송도동, 포스코타워-송도) 26층
6364부평구우정국제여행사인천광역시 부평구 경인로931번길 12-4 (부평동) 지하1
6465연수구비케이시스텍인천광역시 연수구 송도과학로28번길 8 (송도동, 더샵 송도트리플타워 East) 5층 524호