Overview

Dataset statistics

Number of variables6
Number of observations2314
Missing cells367
Missing cells (%)2.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory108.6 KiB
Average record size in memory48.1 B

Variable types

Categorical1
DateTime2
Text3

Dataset

Description인천광역시 서구의 공중위생업소 정보 (업종명, 신고일자, 업소명, 영업소 소재지, 전화번호 등) 에 관한 데이터입니다.
Author인천광역시 서구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=3044218&srcSe=7661IVAWM27C61E190

Alerts

데이터기준일자 has constant value ""Constant
소재지전화 has 367 (15.9%) missing valuesMissing

Reproduction

Analysis started2024-03-18 04:05:45.198641
Analysis finished2024-03-18 04:05:45.905955
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct22
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
일반미용업
910 
피부미용업
217 
미용업
189 
세탁업
180 
네일미용업
175 
Other values (17)
643 

Length

Max length23
Median length5
Mean length5.9684529
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row숙박업(일반)
2nd row숙박업(일반)
3rd row숙박업(일반)
4th row숙박업(일반)
5th row숙박업(일반)

Common Values

ValueCountFrequency (%)
일반미용업 910
39.3%
피부미용업 217
 
9.4%
미용업 189
 
8.2%
세탁업 180
 
7.8%
네일미용업 175
 
7.6%
건물위생관리업 115
 
5.0%
이용업 104
 
4.5%
화장ㆍ분장 미용업 83
 
3.6%
네일미용업, 화장ㆍ분장 미용업 73
 
3.2%
숙박업(일반) 73
 
3.2%
Other values (12) 195
 
8.4%

Length

2024-03-18T13:05:45.976933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반미용업 945
33.4%
미용업 440
15.6%
피부미용업 333
 
11.8%
네일미용업 332
 
11.7%
화장ㆍ분장 251
 
8.9%
세탁업 180
 
6.4%
건물위생관리업 115
 
4.1%
이용업 104
 
3.7%
숙박업(일반 73
 
2.6%
목욕장업 28
 
1.0%
Other values (2) 25
 
0.9%
Distinct1745
Distinct (%)75.4%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
Minimum1967-11-07 00:00:00
Maximum2023-07-26 00:00:00
2024-03-18T13:05:46.093102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T13:05:46.208209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct2197
Distinct (%)94.9%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
2024-03-18T13:05:46.506613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length32
Mean length6.4723423
Min length2

Characters and Unicode

Total characters14977
Distinct characters695
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2113 ?
Unique (%)91.3%

Sample

1st row하얀장여관
2nd row루미호텔
3rd row뱅크호텔
4th row비앤비BnB
5th row파크포시즌
ValueCountFrequency (%)
헤어 31
 
1.1%
hair 21
 
0.7%
주식회사 20
 
0.7%
nail 17
 
0.6%
에스테틱 16
 
0.6%
청라점 15
 
0.5%
네일 13
 
0.5%
de 11
 
0.4%
헤어샵 9
 
0.3%
검단신도시점 9
 
0.3%
Other values (2385) 2661
94.3%
2024-03-18T13:05:46.984964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
791
 
5.3%
747
 
5.0%
509
 
3.4%
432
 
2.9%
327
 
2.2%
274
 
1.8%
261
 
1.7%
) 260
 
1.7%
( 259
 
1.7%
250
 
1.7%
Other values (685) 10867
72.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12046
80.4%
Uppercase Letter 828
 
5.5%
Lowercase Letter 798
 
5.3%
Space Separator 509
 
3.4%
Close Punctuation 260
 
1.7%
Open Punctuation 259
 
1.7%
Decimal Number 137
 
0.9%
Other Punctuation 124
 
0.8%
Dash Punctuation 8
 
0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
791
 
6.6%
747
 
6.2%
432
 
3.6%
327
 
2.7%
274
 
2.3%
261
 
2.2%
250
 
2.1%
223
 
1.9%
212
 
1.8%
178
 
1.5%
Other values (606) 8351
69.3%
Uppercase Letter
ValueCountFrequency (%)
A 88
 
10.6%
H 67
 
8.1%
E 62
 
7.5%
I 59
 
7.1%
N 57
 
6.9%
S 55
 
6.6%
R 52
 
6.3%
L 51
 
6.2%
B 43
 
5.2%
O 40
 
4.8%
Other values (16) 254
30.7%
Lowercase Letter
ValueCountFrequency (%)
a 109
13.7%
e 81
10.2%
i 80
10.0%
o 75
9.4%
n 65
 
8.1%
l 65
 
8.1%
r 42
 
5.3%
y 35
 
4.4%
h 31
 
3.9%
u 31
 
3.9%
Other values (15) 184
23.1%
Other Punctuation
ValueCountFrequency (%)
& 34
27.4%
, 27
21.8%
# 24
19.4%
. 13
 
10.5%
: 12
 
9.7%
' 7
 
5.6%
; 4
 
3.2%
! 1
 
0.8%
? 1
 
0.8%
1
 
0.8%
Decimal Number
ValueCountFrequency (%)
1 33
24.1%
2 26
19.0%
0 21
15.3%
3 15
10.9%
9 9
 
6.6%
6 8
 
5.8%
4 8
 
5.8%
5 6
 
4.4%
7 6
 
4.4%
8 5
 
3.6%
Math Symbol
ValueCountFrequency (%)
> 2
40.0%
< 2
40.0%
= 1
20.0%
Space Separator
ValueCountFrequency (%)
509
100.0%
Close Punctuation
ValueCountFrequency (%)
) 260
100.0%
Open Punctuation
ValueCountFrequency (%)
( 259
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12041
80.4%
Latin 1626
 
10.9%
Common 1305
 
8.7%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
791
 
6.6%
747
 
6.2%
432
 
3.6%
327
 
2.7%
274
 
2.3%
261
 
2.2%
250
 
2.1%
223
 
1.9%
212
 
1.8%
178
 
1.5%
Other values (603) 8346
69.3%
Latin
ValueCountFrequency (%)
a 109
 
6.7%
A 88
 
5.4%
e 81
 
5.0%
i 80
 
4.9%
o 75
 
4.6%
H 67
 
4.1%
n 65
 
4.0%
l 65
 
4.0%
E 62
 
3.8%
I 59
 
3.6%
Other values (41) 875
53.8%
Common
ValueCountFrequency (%)
509
39.0%
) 260
19.9%
( 259
19.8%
& 34
 
2.6%
1 33
 
2.5%
, 27
 
2.1%
2 26
 
2.0%
# 24
 
1.8%
0 21
 
1.6%
3 15
 
1.1%
Other values (18) 97
 
7.4%
Han
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12041
80.4%
ASCII 2930
 
19.6%
CJK 5
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
791
 
6.6%
747
 
6.2%
432
 
3.6%
327
 
2.7%
274
 
2.3%
261
 
2.2%
250
 
2.1%
223
 
1.9%
212
 
1.8%
178
 
1.5%
Other values (603) 8346
69.3%
ASCII
ValueCountFrequency (%)
509
 
17.4%
) 260
 
8.9%
( 259
 
8.8%
a 109
 
3.7%
A 88
 
3.0%
e 81
 
2.8%
i 80
 
2.7%
o 75
 
2.6%
H 67
 
2.3%
n 65
 
2.2%
Other values (68) 1337
45.6%
CJK
ValueCountFrequency (%)
3
60.0%
1
 
20.0%
1
 
20.0%
None
ValueCountFrequency (%)
1
100.0%
Distinct2268
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
2024-03-18T13:05:47.243686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length55
Mean length35.251945
Min length9

Characters and Unicode

Total characters81573
Distinct characters400
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2224 ?
Unique (%)96.1%

Sample

1st row인천광역시 서구 칠천왕로 21 (석남동)
2nd row인천광역시 서구 길주로63번길 3 (석남동)
3rd row인천광역시 서구 염곡로 250 (석남동)
4th row인천광역시 서구 서곶로301번길 20 (심곡동)
5th row인천광역시 서구 염곡로272번길 24 (석남동)
ValueCountFrequency (%)
인천광역시 2312
 
14.6%
서구 2312
 
14.6%
1층 436
 
2.7%
청라동 428
 
2.7%
석남동 279
 
1.8%
일부호 232
 
1.5%
가정동 219
 
1.4%
가좌동 200
 
1.3%
마전동 185
 
1.2%
원당동 168
 
1.1%
Other values (2248) 9104
57.3%
2024-03-18T13:05:47.626805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13565
 
16.6%
1 3466
 
4.2%
2751
 
3.4%
2544
 
3.1%
2456
 
3.0%
2414
 
3.0%
) 2354
 
2.9%
( 2353
 
2.9%
2349
 
2.9%
2339
 
2.9%
Other values (390) 44982
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45871
56.2%
Decimal Number 14114
 
17.3%
Space Separator 13565
 
16.6%
Close Punctuation 2354
 
2.9%
Open Punctuation 2353
 
2.9%
Other Punctuation 2341
 
2.9%
Dash Punctuation 424
 
0.5%
Uppercase Letter 404
 
0.5%
Lowercase Letter 131
 
0.2%
Letter Number 9
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2751
 
6.0%
2544
 
5.5%
2456
 
5.4%
2414
 
5.3%
2349
 
5.1%
2339
 
5.1%
2325
 
5.1%
2320
 
5.1%
2316
 
5.0%
1671
 
3.6%
Other values (340) 22386
48.8%
Uppercase Letter
ValueCountFrequency (%)
B 98
24.3%
A 64
15.8%
K 35
 
8.7%
E 30
 
7.4%
I 29
 
7.2%
L 26
 
6.4%
S 25
 
6.2%
W 24
 
5.9%
V 23
 
5.7%
M 16
 
4.0%
Other values (8) 34
 
8.4%
Decimal Number
ValueCountFrequency (%)
1 3466
24.6%
2 2261
16.0%
0 1825
12.9%
3 1386
 
9.8%
4 1078
 
7.6%
5 881
 
6.2%
8 854
 
6.1%
6 848
 
6.0%
7 846
 
6.0%
9 669
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
e 48
36.6%
s 20
15.3%
r 20
15.3%
a 19
 
14.5%
d 19
 
14.5%
k 2
 
1.5%
n 1
 
0.8%
p 1
 
0.8%
y 1
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 2313
98.8%
' 19
 
0.8%
@ 5
 
0.2%
. 3
 
0.1%
/ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 3
42.9%
< 2
28.6%
> 2
28.6%
Space Separator
ValueCountFrequency (%)
13565
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2354
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2353
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 424
100.0%
Letter Number
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45870
56.2%
Common 35158
43.1%
Latin 544
 
0.7%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2751
 
6.0%
2544
 
5.5%
2456
 
5.4%
2414
 
5.3%
2349
 
5.1%
2339
 
5.1%
2325
 
5.1%
2320
 
5.1%
2316
 
5.0%
1671
 
3.6%
Other values (339) 22385
48.8%
Latin
ValueCountFrequency (%)
B 98
18.0%
A 64
11.8%
e 48
 
8.8%
K 35
 
6.4%
E 30
 
5.5%
I 29
 
5.3%
L 26
 
4.8%
S 25
 
4.6%
W 24
 
4.4%
V 23
 
4.2%
Other values (18) 142
26.1%
Common
ValueCountFrequency (%)
13565
38.6%
1 3466
 
9.9%
) 2354
 
6.7%
( 2353
 
6.7%
, 2313
 
6.6%
2 2261
 
6.4%
0 1825
 
5.2%
3 1386
 
3.9%
4 1078
 
3.1%
5 881
 
2.5%
Other values (12) 3676
 
10.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45870
56.2%
ASCII 35693
43.8%
Number Forms 9
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13565
38.0%
1 3466
 
9.7%
) 2354
 
6.6%
( 2353
 
6.6%
, 2313
 
6.5%
2 2261
 
6.3%
0 1825
 
5.1%
3 1386
 
3.9%
4 1078
 
3.0%
5 881
 
2.5%
Other values (39) 4211
 
11.8%
Hangul
ValueCountFrequency (%)
2751
 
6.0%
2544
 
5.5%
2456
 
5.4%
2414
 
5.3%
2349
 
5.1%
2339
 
5.1%
2325
 
5.1%
2320
 
5.1%
2316
 
5.0%
1671
 
3.6%
Other values (339) 22385
48.8%
Number Forms
ValueCountFrequency (%)
9
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

소재지전화
Text

MISSING 

Distinct1121
Distinct (%)57.6%
Missing367
Missing (%)15.9%
Memory size18.2 KiB
2024-03-18T13:05:47.826787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length7.4278377
Min length1

Characters and Unicode

Total characters14462
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1105 ?
Unique (%)56.8%

Sample

1st row032-579-4506
2nd row032-574-2187
3rd row032-578-6495
4th row032-561-7320
5th row032-583-3066
ValueCountFrequency (%)
032-568-0199 3
 
0.3%
032-566-4142 2
 
0.2%
032-576-6383 2
 
0.2%
032-565-8990 2
 
0.2%
032-564-1109 2
 
0.2%
032-564-1489 2
 
0.2%
032-575-2361 2
 
0.2%
032-561-1172 2
 
0.2%
032-563-2530 2
 
0.2%
032-564-3461 2
 
0.2%
Other values (1110) 1115
98.2%
2024-03-18T13:05:48.182963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2266
15.7%
2 1804
12.5%
3 1742
12.0%
0 1717
11.9%
5 1609
11.1%
6 1038
7.2%
7 1019
7.0%
811
 
5.6%
1 717
 
5.0%
8 673
 
4.7%
Other values (2) 1066
7.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 11385
78.7%
Dash Punctuation 2266
 
15.7%
Space Separator 811
 
5.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1804
15.8%
3 1742
15.3%
0 1717
15.1%
5 1609
14.1%
6 1038
9.1%
7 1019
9.0%
1 717
 
6.3%
8 673
 
5.9%
4 566
 
5.0%
9 500
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 2266
100.0%
Space Separator
ValueCountFrequency (%)
811
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 14462
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2266
15.7%
2 1804
12.5%
3 1742
12.0%
0 1717
11.9%
5 1609
11.1%
6 1038
7.2%
7 1019
7.0%
811
 
5.6%
1 717
 
5.0%
8 673
 
4.7%
Other values (2) 1066
7.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 14462
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2266
15.7%
2 1804
12.5%
3 1742
12.0%
0 1717
11.9%
5 1609
11.1%
6 1038
7.2%
7 1019
7.0%
811
 
5.6%
1 717
 
5.0%
8 673
 
4.7%
Other values (2) 1066
7.4%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size18.2 KiB
Minimum2023-07-28 00:00:00
Maximum2023-07-28 00:00:00
2024-03-18T13:05:48.281063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T13:05:48.361904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2024-03-18T13:05:45.726346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T13:05:45.858949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명신고일자업소명영업소 주소(도로명)소재지전화데이터기준일자
0숙박업(일반)1989-11-09하얀장여관인천광역시 서구 칠천왕로 21 (석남동)032-579-45062023-07-28
1숙박업(일반)1996-04-24루미호텔인천광역시 서구 길주로63번길 3 (석남동)032-574-21872023-07-28
2숙박업(일반)1997-09-12뱅크호텔인천광역시 서구 염곡로 250 (석남동)032-578-64952023-07-28
3숙박업(일반)1999-01-19비앤비BnB인천광역시 서구 서곶로301번길 20 (심곡동)032-561-73202023-07-28
4숙박업(일반)1985-08-29파크포시즌인천광역시 서구 염곡로272번길 24 (석남동)032-583-30662023-07-28
5숙박업(일반)1985-12-21스타모텔인천광역시 서구 가정로 191 (석남동)032-571-88582023-07-28
6숙박업(일반)1987-12-30미니인천광역시 서구 옻우물로 32 (석남동)032-574-56782023-07-28
7숙박업(일반)1988-10-20케이모텔인천광역시 서구 건지로 281 (석남동)032-573-98622023-07-28
8숙박업(일반)1990-02-01갤러리모텔인천광역시 서구 옻우물로 29 (석남동)032-573-35962023-07-28
9숙박업(일반)1990-02-01파라오모텔인천광역시 서구 옻우물로 27 (석남동)032-571-79122023-07-28
업종명신고일자업소명영업소 주소(도로명)소재지전화데이터기준일자
2304피부미용업, 네일미용업, 화장ㆍ분장 미용업2021-12-07블링제이뷰티인천광역시 서구 청중로478번안길 6, 아트프라자 2층 일부호 (가정동)2023-07-28
2305피부미용업, 네일미용업, 화장ㆍ분장 미용업2021-12-07청라한네일인천광역시 서구 청라에메랄드로 99, 지젤엠청라 1층 69호 (청라동)2023-07-28
2306피부미용업, 네일미용업, 화장ㆍ분장 미용업2021-08-18라라뷰티인천광역시 서구 승학로 481, 한명빌딩 3층 일부호 (검암동)2023-07-28
2307화장ㆍ분장 미용업2023-06-12바라다(Barada)인천광역시 서구 이음대로 384, 서영아너시티플러스 303호 (원당동)<NA>2023-07-28
2308피부미용업, 네일미용업, 화장ㆍ분장 미용업2022-03-04네일루인천광역시 서구 검단로768번1길 11-6, 102호 (불로동)2023-07-28
2309피부미용업, 네일미용업, 화장ㆍ분장 미용업2022-10-19오늘네일인천광역시 서구 서곶로 16, 한신그랜드힐빌리지 상가1동 227호 (가정동)2023-07-28
2310화장ㆍ분장 미용업2023-06-23유즈브로우인천광역시 서구 청라에메랄드로102번길 8-12, 5층 504호 (청라동)<NA>2023-07-28
2311화장ㆍ분장 미용업2023-07-20뷰티바이슬인천광역시 서구 새오개로111번안길 19, 4층 일부호 (신현동)<NA>2023-07-28
2312피부미용업, 네일미용업, 화장ㆍ분장 미용업2023-01-04도르뷰티인천광역시 서구 청라에메랄드로102번길 8-22, 미라클프라자 104호 (청라동)2023-07-28
2313피부미용업, 네일미용업, 화장ㆍ분장 미용업2023-07-25쏠네일인천광역시 서구 청라에메랄드로 79, 커낼에비뉴 2층 C29호 (청라동)032-4747-09192023-07-28