Overview

Dataset statistics

Number of variables5
Number of observations64
Missing cells2
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.7 KiB
Average record size in memory43.1 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description인천광역시 중구 관내에 위치한 호텔에 대한 정보입니다.
Author인천광역시
URLhttps://www.incheon.go.kr/data/DATA010201/view?docId=15074853

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 2 (3.1%) missing valuesMissing
순번 has unique valuesUnique
사업장명 has unique valuesUnique

Reproduction

Analysis started2024-01-28 07:40:14.282240
Analysis finished2024-01-28 07:40:14.892021
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct64
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.5
Minimum1
Maximum64
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size708.0 B
2024-01-28T16:40:14.959259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.15
Q116.75
median32.5
Q348.25
95-th percentile60.85
Maximum64
Range63
Interquartile range (IQR)31.5

Descriptive statistics

Standard deviation18.618987
Coefficient of variation (CV)0.5728919
Kurtosis-1.2
Mean32.5
Median Absolute Deviation (MAD)16
Skewness0
Sum2080
Variance346.66667
MonotonicityStrictly increasing
2024-01-28T16:40:15.083770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.6%
34 1
 
1.6%
36 1
 
1.6%
37 1
 
1.6%
38 1
 
1.6%
39 1
 
1.6%
40 1
 
1.6%
41 1
 
1.6%
42 1
 
1.6%
43 1
 
1.6%
Other values (54) 54
84.4%
ValueCountFrequency (%)
1 1
1.6%
2 1
1.6%
3 1
1.6%
4 1
1.6%
5 1
1.6%
6 1
1.6%
7 1
1.6%
8 1
1.6%
9 1
1.6%
10 1
1.6%
ValueCountFrequency (%)
64 1
1.6%
63 1
1.6%
62 1
1.6%
61 1
1.6%
60 1
1.6%
59 1
1.6%
58 1
1.6%
57 1
1.6%
56 1
1.6%
55 1
1.6%

사업장명
Text

UNIQUE 

Distinct64
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size644.0 B
2024-01-28T16:40:15.285612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length8.359375
Min length3

Characters and Unicode

Total characters535
Distinct characters130
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)100.0%

Sample

1st row네스트 호텔
2nd row파라다이스시티
3rd row골든튤립 인천공항 호텔&스위트
4th row리베라베리움
5th row베스트웨스턴 하버파크호텔
ValueCountFrequency (%)
호텔 9
 
8.9%
인천 3
 
3.0%
월미도 3
 
3.0%
인천공항 2
 
2.0%
2
 
2.0%
인천에어포트 2
 
2.0%
인천공항점 2
 
2.0%
환승호텔 2
 
2.0%
1
 
1.0%
베니키아호텔 1
 
1.0%
Other values (74) 74
73.3%
2024-01-28T16:40:15.906430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
60
 
11.2%
57
 
10.7%
38
 
7.1%
24
 
4.5%
19
 
3.6%
17
 
3.2%
16
 
3.0%
13
 
2.4%
11
 
2.1%
10
 
1.9%
Other values (120) 270
50.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 485
90.7%
Space Separator 38
 
7.1%
Uppercase Letter 9
 
1.7%
Decimal Number 2
 
0.4%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
60
 
12.4%
57
 
11.8%
24
 
4.9%
19
 
3.9%
17
 
3.5%
16
 
3.3%
13
 
2.7%
11
 
2.3%
10
 
2.1%
9
 
1.9%
Other values (107) 249
51.3%
Uppercase Letter
ValueCountFrequency (%)
T 1
11.1%
K 1
11.1%
Y 1
11.1%
W 1
11.1%
S 1
11.1%
R 1
11.1%
O 1
11.1%
A 1
11.1%
G 1
11.1%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
38
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 485
90.7%
Common 41
 
7.7%
Latin 9
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
60
 
12.4%
57
 
11.8%
24
 
4.9%
19
 
3.9%
17
 
3.5%
16
 
3.3%
13
 
2.7%
11
 
2.3%
10
 
2.1%
9
 
1.9%
Other values (107) 249
51.3%
Latin
ValueCountFrequency (%)
T 1
11.1%
K 1
11.1%
Y 1
11.1%
W 1
11.1%
S 1
11.1%
R 1
11.1%
O 1
11.1%
A 1
11.1%
G 1
11.1%
Common
ValueCountFrequency (%)
38
92.7%
1 1
 
2.4%
2 1
 
2.4%
& 1
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 485
90.7%
ASCII 50
 
9.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
60
 
12.4%
57
 
11.8%
24
 
4.9%
19
 
3.9%
17
 
3.5%
16
 
3.3%
13
 
2.7%
11
 
2.3%
10
 
2.1%
9
 
1.9%
Other values (107) 249
51.3%
ASCII
ValueCountFrequency (%)
38
76.0%
1 1
 
2.0%
2 1
 
2.0%
T 1
 
2.0%
K 1
 
2.0%
Y 1
 
2.0%
W 1
 
2.0%
S 1
 
2.0%
R 1
 
2.0%
O 1
 
2.0%
Other values (3) 3
 
6.0%
Distinct61
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size644.0 B
2024-01-28T16:40:16.140193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length28
Mean length21.3125
Min length15

Characters and Unicode

Total characters1364
Distinct characters117
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)90.6%

Sample

1st row인천광역시 중구 영종해안남로 19-5
2nd row인천광역시 중구 영종해안남로321번길 186
3rd row인천광역시 중구 흰바위로59번길 8
4th row인천광역시 중구 영종대로 881 리베라베리움 호텔 영종
5th row인천광역시 중구 제물량로 217
ValueCountFrequency (%)
인천광역시 64
22.5%
중구 64
22.5%
신도시남로149번길 5
 
1.8%
은하수로29번길 4
 
1.4%
12 4
 
1.4%
월미로 4
 
1.4%
마시란로 4
 
1.4%
6 3
 
1.1%
공항로424번길 3
 
1.1%
5 3
 
1.1%
Other values (105) 126
44.4%
2024-01-28T16:40:16.482758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
220
 
16.1%
77
 
5.6%
66
 
4.8%
65
 
4.8%
65
 
4.8%
65
 
4.8%
65
 
4.8%
65
 
4.8%
62
 
4.5%
1 51
 
3.7%
Other values (107) 563
41.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 876
64.2%
Decimal Number 256
 
18.8%
Space Separator 220
 
16.1%
Dash Punctuation 8
 
0.6%
Uppercase Letter 3
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
77
 
8.8%
66
 
7.5%
65
 
7.4%
65
 
7.4%
65
 
7.4%
65
 
7.4%
65
 
7.4%
62
 
7.1%
44
 
5.0%
42
 
4.8%
Other values (91) 260
29.7%
Decimal Number
ValueCountFrequency (%)
1 51
19.9%
2 46
18.0%
4 35
13.7%
3 26
10.2%
5 23
9.0%
9 22
8.6%
8 16
 
6.2%
7 15
 
5.9%
6 12
 
4.7%
0 10
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
I 1
33.3%
B 1
33.3%
C 1
33.3%
Space Separator
ValueCountFrequency (%)
220
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 876
64.2%
Common 485
35.6%
Latin 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
77
 
8.8%
66
 
7.5%
65
 
7.4%
65
 
7.4%
65
 
7.4%
65
 
7.4%
65
 
7.4%
62
 
7.1%
44
 
5.0%
42
 
4.8%
Other values (91) 260
29.7%
Common
ValueCountFrequency (%)
220
45.4%
1 51
 
10.5%
2 46
 
9.5%
4 35
 
7.2%
3 26
 
5.4%
5 23
 
4.7%
9 22
 
4.5%
8 16
 
3.3%
7 15
 
3.1%
6 12
 
2.5%
Other values (3) 19
 
3.9%
Latin
ValueCountFrequency (%)
I 1
33.3%
B 1
33.3%
C 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 876
64.2%
ASCII 488
35.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
220
45.1%
1 51
 
10.5%
2 46
 
9.4%
4 35
 
7.2%
3 26
 
5.3%
5 23
 
4.7%
9 22
 
4.5%
8 16
 
3.3%
7 15
 
3.1%
6 12
 
2.5%
Other values (6) 22
 
4.5%
Hangul
ValueCountFrequency (%)
77
 
8.8%
66
 
7.5%
65
 
7.4%
65
 
7.4%
65
 
7.4%
65
 
7.4%
65
 
7.4%
62
 
7.1%
44
 
5.0%
42
 
4.8%
Other values (91) 260
29.7%

전화번호
Text

MISSING 

Distinct61
Distinct (%)98.4%
Missing2
Missing (%)3.1%
Memory size644.0 B
2024-01-28T16:40:16.684053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.870968
Min length9

Characters and Unicode

Total characters736
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique60 ?
Unique (%)96.8%

Sample

1st row032-743-9000
2nd row1833-8855
3rd row032-232-2000
4th row032-751-7800
5th row032-770-9500
ValueCountFrequency (%)
1833-8855 2
 
3.2%
032-751-1177 1
 
1.6%
032-883-0083 1
 
1.6%
032-216-8000 1
 
1.6%
032-752-2066 1
 
1.6%
032-746-2270 1
 
1.6%
032-721-4110 1
 
1.6%
032-764-8993 1
 
1.6%
032-743-3040 1
 
1.6%
032-743-3000 1
 
1.6%
Other values (51) 51
82.3%
2024-01-28T16:40:16.997083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 160
21.7%
- 121
16.4%
2 98
13.3%
3 89
12.1%
7 85
11.5%
1 43
 
5.8%
5 39
 
5.3%
4 31
 
4.2%
6 31
 
4.2%
8 28
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 615
83.6%
Dash Punctuation 121
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 160
26.0%
2 98
15.9%
3 89
14.5%
7 85
13.8%
1 43
 
7.0%
5 39
 
6.3%
4 31
 
5.0%
6 31
 
5.0%
8 28
 
4.6%
9 11
 
1.8%
Dash Punctuation
ValueCountFrequency (%)
- 121
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 736
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 160
21.7%
- 121
16.4%
2 98
13.3%
3 89
12.1%
7 85
11.5%
1 43
 
5.8%
5 39
 
5.3%
4 31
 
4.2%
6 31
 
4.2%
8 28
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 736
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 160
21.7%
- 121
16.4%
2 98
13.3%
3 89
12.1%
7 85
11.5%
1 43
 
5.8%
5 39
 
5.3%
4 31
 
4.2%
6 31
 
4.2%
8 28
 
3.8%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size644.0 B
2020-12-11
64 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-12-11
2nd row2020-12-11
3rd row2020-12-11
4th row2020-12-11
5th row2020-12-11

Common Values

ValueCountFrequency (%)
2020-12-11 64
100.0%

Length

2024-01-28T16:40:17.106137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T16:40:17.194837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-12-11 64
100.0%

Interactions

2024-01-28T16:40:14.654374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T16:40:17.241250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번사업장명도로명주소전화번호
순번1.0001.0000.9830.942
사업장명1.0001.0001.0001.000
도로명주소0.9831.0001.0000.996
전화번호0.9421.0000.9961.000

Missing values

2024-01-28T16:40:14.763060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T16:40:14.849667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번사업장명도로명주소전화번호데이터기준일자
01네스트 호텔인천광역시 중구 영종해안남로 19-5032-743-90002020-12-11
12파라다이스시티인천광역시 중구 영종해안남로321번길 1861833-88552020-12-11
23골든튤립 인천공항 호텔&스위트인천광역시 중구 흰바위로59번길 8032-232-20002020-12-11
34리베라베리움인천광역시 중구 영종대로 881 리베라베리움 호텔 영종032-751-78002020-12-11
45베스트웨스턴 하버파크호텔인천광역시 중구 제물량로 217032-770-95002020-12-11
56에어스카이호텔인천광역시 중구 은하수로29번길 311666-12432020-12-11
67그랜드하얏트인천웨스트타워인천광역시 중구 영종해안남로321번길 208032-745-12342020-12-11
78웨스턴그레이스호텔인천광역시 중구 은하수로29번길 36032-717-00002020-12-11
89데이즈호텔 앤 스위트 인천에어포트인천광역시 중구 신도시남로142번길 6032-722-30002020-12-11
910베스트웨스턴인천에어포트호텔인천광역시 중구 공항로424번길 48-27032-743-10002020-12-11
순번사업장명도로명주소전화번호데이터기준일자
5455파라다이스시티아트파라디소인천광역시 중구 연안부두로43번길 121833-88552020-12-11
5556헤이든영종호텔인천광역시 중구 은하수로43번길 9032-747-19512020-12-11
5657딘관광호텔인천광역시 중구 연안부두로43번길 12032-889-02452020-12-11
5758무의 씨사이드호텔인천광역시 중구 대무의로 119032-752-77362020-12-11
5859호텔 그랜드스위트인천광역시 중구 월미로248번길 2032-777-56332020-12-11
5960갤럭시관광호텔인천광역시 중구 월미문화로 9032-777-25002020-12-11
6061인터 호텔인천광역시 중구 은하수로29번길 47<NA>2020-12-11
6162호텔레이인천광역시 중구 을왕로58번길 7032-752-83332020-12-11
6263인천비치호텔인천광역시 중구 용유서로 373-1032-751-11772020-12-11
6364세븐호텔인천광역시 중구 신포로23번길 3 세븐호텔032-773-75002020-12-11