Overview

Dataset statistics

Number of variables5
Number of observations148
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.1 KiB
Average record size in memory41.9 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description인천광역시 연수구의 공중위생서비스 평가 결과 현황 데이터로서 연번, 군구, 업소명, 소재지(도로명주소)의 항목으로 이루어져 있습니다.
Author인천광역시 연수구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15116602&srcSe=7661IVAWM27C61E190

Alerts

군구 has constant value ""Constant
연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-28 12:28:28.240902
Analysis finished2024-01-28 12:28:29.166680
Duration0.93 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct148
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean74.5
Minimum1
Maximum148
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-01-28T21:28:29.223118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.35
Q137.75
median74.5
Q3111.25
95-th percentile140.65
Maximum148
Range147
Interquartile range (IQR)73.5

Descriptive statistics

Standard deviation42.868014
Coefficient of variation (CV)0.57540959
Kurtosis-1.2
Mean74.5
Median Absolute Deviation (MAD)37
Skewness0
Sum11026
Variance1837.6667
MonotonicityStrictly increasing
2024-01-28T21:28:29.344845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
95 1
 
0.7%
97 1
 
0.7%
98 1
 
0.7%
99 1
 
0.7%
100 1
 
0.7%
101 1
 
0.7%
102 1
 
0.7%
103 1
 
0.7%
104 1
 
0.7%
Other values (138) 138
93.2%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
148 1
0.7%
147 1
0.7%
146 1
0.7%
145 1
0.7%
144 1
0.7%
143 1
0.7%
142 1
0.7%
141 1
0.7%
140 1
0.7%
139 1
0.7%

군구
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
연수구
148 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row연수구
2nd row연수구
3rd row연수구
4th row연수구
5th row연수구

Common Values

ValueCountFrequency (%)
연수구 148
100.0%

Length

2024-01-28T21:28:29.460006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T21:28:29.540514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
연수구 148
100.0%

업종
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
세탁업
85 
숙박업
53 
목욕장업
10 

Length

Max length4
Median length3
Mean length3.0675676
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row세탁업
2nd row세탁업
3rd row세탁업
4th row세탁업
5th row세탁업

Common Values

ValueCountFrequency (%)
세탁업 85
57.4%
숙박업 53
35.8%
목욕장업 10
 
6.8%

Length

2024-01-28T21:28:29.628309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T21:28:29.713103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
세탁업 85
57.4%
숙박업 53
35.8%
목욕장업 10
 
6.8%
Distinct142
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-01-28T21:28:29.966919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length14
Mean length6.6756757
Min length3

Characters and Unicode

Total characters988
Distinct characters235
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique136 ?
Unique (%)91.9%

Sample

1st row현대퍼크로세탁소
2nd row선학 세탁
3rd row현대세탁소
4th row금호세탁
5th row충청유천빨래방
ValueCountFrequency (%)
세탁 6
 
2.9%
호텔 4
 
1.9%
세탁소 4
 
1.9%
모텔 3
 
1.4%
스테이 3
 
1.4%
송도 3
 
1.4%
지금은 2
 
1.0%
송도파크호텔 2
 
1.0%
송도점 2
 
1.0%
인천 2
 
1.0%
Other values (164) 176
85.0%
2024-01-28T21:28:30.358810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
70
 
7.1%
69
 
7.0%
59
 
6.0%
43
 
4.4%
38
 
3.8%
36
 
3.6%
30
 
3.0%
23
 
2.3%
22
 
2.2%
19
 
1.9%
Other values (225) 579
58.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 872
88.3%
Space Separator 59
 
6.0%
Lowercase Letter 15
 
1.5%
Decimal Number 13
 
1.3%
Uppercase Letter 10
 
1.0%
Close Punctuation 9
 
0.9%
Open Punctuation 9
 
0.9%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
70
 
8.0%
69
 
7.9%
43
 
4.9%
38
 
4.4%
36
 
4.1%
30
 
3.4%
23
 
2.6%
22
 
2.5%
19
 
2.2%
17
 
1.9%
Other values (197) 505
57.9%
Lowercase Letter
ValueCountFrequency (%)
e 3
20.0%
n 3
20.0%
l 2
13.3%
i 2
13.3%
t 1
 
6.7%
o 1
 
6.7%
a 1
 
6.7%
g 1
 
6.7%
h 1
 
6.7%
Decimal Number
ValueCountFrequency (%)
2 3
23.1%
3 3
23.1%
0 2
15.4%
5 1
 
7.7%
8 1
 
7.7%
9 1
 
7.7%
4 1
 
7.7%
1 1
 
7.7%
Uppercase Letter
ValueCountFrequency (%)
C 2
20.0%
S 2
20.0%
I 2
20.0%
Q 1
10.0%
E 1
10.0%
M 1
10.0%
H 1
10.0%
Space Separator
ValueCountFrequency (%)
59
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 872
88.3%
Common 91
 
9.2%
Latin 25
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
70
 
8.0%
69
 
7.9%
43
 
4.9%
38
 
4.4%
36
 
4.1%
30
 
3.4%
23
 
2.6%
22
 
2.5%
19
 
2.2%
17
 
1.9%
Other values (197) 505
57.9%
Latin
ValueCountFrequency (%)
e 3
12.0%
n 3
12.0%
l 2
 
8.0%
C 2
 
8.0%
S 2
 
8.0%
I 2
 
8.0%
i 2
 
8.0%
Q 1
 
4.0%
E 1
 
4.0%
M 1
 
4.0%
Other values (6) 6
24.0%
Common
ValueCountFrequency (%)
59
64.8%
) 9
 
9.9%
( 9
 
9.9%
2 3
 
3.3%
3 3
 
3.3%
0 2
 
2.2%
5 1
 
1.1%
8 1
 
1.1%
9 1
 
1.1%
4 1
 
1.1%
Other values (2) 2
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 872
88.3%
ASCII 116
 
11.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
70
 
8.0%
69
 
7.9%
43
 
4.9%
38
 
4.4%
36
 
4.1%
30
 
3.4%
23
 
2.6%
22
 
2.5%
19
 
2.2%
17
 
1.9%
Other values (197) 505
57.9%
ASCII
ValueCountFrequency (%)
59
50.9%
) 9
 
7.8%
( 9
 
7.8%
2 3
 
2.6%
3 3
 
2.6%
e 3
 
2.6%
n 3
 
2.6%
0 2
 
1.7%
l 2
 
1.7%
C 2
 
1.7%
Other values (18) 21
 
18.1%
Distinct147
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-01-28T21:28:30.591153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length86
Median length49
Mean length38.777027
Min length21

Characters and Unicode

Total characters5739
Distinct characters203
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique146 ?
Unique (%)98.6%

Sample

1st row인천광역시 연수구 원인재로 56 (동춘동)
2nd row인천광역시 연수구 선학로 14 (선학동)
3rd row인천광역시 연수구 옥련로 33 (옥련동)
4th row인천광역시 연수구 선학로 100, 1층 2호 (선학동, 금호타운상가)
5th row인천광역시 연수구 먼우금로 302 (연수동)
ValueCountFrequency (%)
인천광역시 148
 
14.9%
연수구 148
 
14.9%
송도동 40
 
4.0%
옥련동 36
 
3.6%
1층 18
 
1.8%
연수동 17
 
1.7%
원인재로 12
 
1.2%
송도 9
 
0.9%
상가동 8
 
0.8%
대암로 8
 
0.8%
Other values (363) 549
55.3%
2024-01-28T21:28:30.952762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
845
 
14.7%
1 292
 
5.1%
199
 
3.5%
, 189
 
3.3%
175
 
3.0%
175
 
3.0%
169
 
2.9%
168
 
2.9%
( 165
 
2.9%
) 165
 
2.9%
Other values (193) 3197
55.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3290
57.3%
Decimal Number 1012
 
17.6%
Space Separator 845
 
14.7%
Other Punctuation 191
 
3.3%
Open Punctuation 165
 
2.9%
Close Punctuation 165
 
2.9%
Math Symbol 38
 
0.7%
Dash Punctuation 19
 
0.3%
Uppercase Letter 13
 
0.2%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
199
 
6.0%
175
 
5.3%
175
 
5.3%
169
 
5.1%
168
 
5.1%
157
 
4.8%
152
 
4.6%
149
 
4.5%
148
 
4.5%
148
 
4.5%
Other values (166) 1650
50.2%
Decimal Number
ValueCountFrequency (%)
1 292
28.9%
2 139
13.7%
0 118
11.7%
3 88
 
8.7%
4 86
 
8.5%
5 72
 
7.1%
7 60
 
5.9%
9 55
 
5.4%
6 51
 
5.0%
8 51
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
A 3
23.1%
B 2
15.4%
H 2
15.4%
D 1
 
7.7%
I 1
 
7.7%
P 1
 
7.7%
T 1
 
7.7%
G 1
 
7.7%
F 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 189
99.0%
@ 2
 
1.0%
Space Separator
ValueCountFrequency (%)
845
100.0%
Open Punctuation
ValueCountFrequency (%)
( 165
100.0%
Close Punctuation
ValueCountFrequency (%)
) 165
100.0%
Math Symbol
ValueCountFrequency (%)
~ 38
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3290
57.3%
Common 2435
42.4%
Latin 14
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
199
 
6.0%
175
 
5.3%
175
 
5.3%
169
 
5.1%
168
 
5.1%
157
 
4.8%
152
 
4.6%
149
 
4.5%
148
 
4.5%
148
 
4.5%
Other values (166) 1650
50.2%
Common
ValueCountFrequency (%)
845
34.7%
1 292
 
12.0%
, 189
 
7.8%
( 165
 
6.8%
) 165
 
6.8%
2 139
 
5.7%
0 118
 
4.8%
3 88
 
3.6%
4 86
 
3.5%
5 72
 
3.0%
Other values (7) 276
 
11.3%
Latin
ValueCountFrequency (%)
A 3
21.4%
B 2
14.3%
H 2
14.3%
e 1
 
7.1%
D 1
 
7.1%
I 1
 
7.1%
P 1
 
7.1%
T 1
 
7.1%
G 1
 
7.1%
F 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3290
57.3%
ASCII 2449
42.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
845
34.5%
1 292
 
11.9%
, 189
 
7.7%
( 165
 
6.7%
) 165
 
6.7%
2 139
 
5.7%
0 118
 
4.8%
3 88
 
3.6%
4 86
 
3.5%
5 72
 
2.9%
Other values (17) 290
 
11.8%
Hangul
ValueCountFrequency (%)
199
 
6.0%
175
 
5.3%
175
 
5.3%
169
 
5.1%
168
 
5.1%
157
 
4.8%
152
 
4.6%
149
 
4.5%
148
 
4.5%
148
 
4.5%
Other values (166) 1650
50.2%

Interactions

2024-01-28T21:28:28.602198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T21:28:31.032902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.858
업종0.8581.000
2024-01-28T21:28:31.101716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.761
업종0.7611.000

Missing values

2024-01-28T21:28:29.053755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T21:28:29.128417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번군구업종업소명소재지(도로명주소)
01연수구세탁업현대퍼크로세탁소인천광역시 연수구 원인재로 56 (동춘동)
12연수구세탁업선학 세탁인천광역시 연수구 선학로 14 (선학동)
23연수구세탁업현대세탁소인천광역시 연수구 옥련로 33 (옥련동)
34연수구세탁업금호세탁인천광역시 연수구 선학로 100, 1층 2호 (선학동, 금호타운상가)
45연수구세탁업충청유천빨래방인천광역시 연수구 먼우금로 302 (연수동)
56연수구세탁업유일세탁소인천광역시 연수구 청학로 23 (청학동)
67연수구세탁업은혜 세탁인천광역시 연수구 함박로8번길 40 (연수동)
78연수구세탁업다듬이 셀프 크리닝인천광역시 연수구 함박안로62번길 42 (연수동,(1동))
89연수구세탁업서울 세탁소인천광역시 연수구 함박로12번길 17 (연수동,(1동))
910연수구세탁업황해세탁소인천광역시 연수구 원인재로 222 (연수동)
연번군구업종업소명소재지(도로명주소)
138139연수구숙박업호텔스카이파크 인천송도인천광역시 연수구 컨벤시아대로 233 (송도동)
139140연수구숙박업송도센트럴파크호텔인천광역시 연수구 테크노파크로 193, 송도센트럴파크호텔 (송도동)
140141연수구숙박업오크우드 프리미어 인천인천광역시 연수구 컨벤시아대로 165 (송도동, 동북아트레이드타워, 36층~64층)
141142연수구숙박업셀럽스테이송도인천광역시 연수구 아트센터대로168번길 100, 한라 웨스턴파크 송도 5~15,17,20~35층 (송도동)
142143연수구숙박업더 스테이 송도인천광역시 연수구 아트센터대로168번길 100, 한라 웨스턴파크 송도 5~10,12,14,15,17~19,21~24,27~29,31~33,37층 (송도동)
143144연수구숙박업랜드마크 송도 스테이인천광역시 연수구 아트센터대로168번길 101, 송도랜드마크푸르지오시티 (송도동)
144145연수구숙박업달빛스테이인천광역시 연수구 아트센터대로168번길 100, 한라 웨스턴파크 송도 5,7~12,14,15,17~21,23~30,32~36층 (송도동)
145146연수구숙박업더노벰버스테이인천광역시 연수구 아트센터대로168번길 101, 송도랜드마크푸르지오시티 4~14,16~33,34,35층 (송도동)
146147연수구숙박업랜드마크 송도스테이 투인천광역시 연수구 아트센터대로168번길 101, 송도랜드마크푸르지오시티 1,4~14,16~20,22~29,31~34,36층 (송도동)
147148연수구숙박업어반스테이 더 파크 송도점인천광역시 연수구 아트센터대로168번길 101, 송도랜드마크푸르지오시티 1,4~14,16~35층 (송도동)