Overview

Dataset statistics

Number of variables5
Number of observations73
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.1 KiB
Average record size in memory42.8 B

Variable types

Text3
Categorical1
Numeric1

Dataset

Description경상남도 창녕군 숙박업소 현황에 대한 데이터를 포함하고 있습니다.(업종명, 업소명, 영업소 주소, 소재지 전화번호, 객실수)
Author경상남도 창녕군
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15025315

Alerts

객실수 is highly overall correlated with 분류High correlation
분류 is highly overall correlated with 객실수High correlation
분류 is highly imbalanced (69.4%)Imbalance

Reproduction

Analysis started2023-12-11 00:47:09.880289
Analysis finished2023-12-11 00:47:10.795105
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct72
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size716.0 B
2023-12-11T09:47:11.006473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length5.7123288
Min length3

Characters and Unicode

Total characters417
Distinct characters137
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)97.3%

Sample

1st row춘산여관
2nd row일광여인숙
3rd rowDW모텔
4th row(주)레이크힐스호텔
5th row모텔퀸(MOTEL QUEEN)
ValueCountFrequency (%)
주식회사 3
 
3.9%
m모텔 2
 
2.6%
신일장모텔 1
 
1.3%
온천모텔 1
 
1.3%
c.f모텔(씨에프모텔 1
 
1.3%
주)레이크힐스골프텔 1
 
1.3%
러브홀릭 1
 
1.3%
j2모텔 1
 
1.3%
필모텔 1
 
1.3%
주)키즈스테이호텔인부곡 1
 
1.3%
Other values (64) 64
83.1%
2023-12-11T09:47:11.432171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
58
 
13.9%
31
 
7.4%
25
 
6.0%
12
 
2.9%
12
 
2.9%
10
 
2.4%
8
 
1.9%
7
 
1.7%
7
 
1.7%
7
 
1.7%
Other values (127) 240
57.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 377
90.4%
Uppercase Letter 22
 
5.3%
Close Punctuation 6
 
1.4%
Open Punctuation 6
 
1.4%
Space Separator 4
 
1.0%
Decimal Number 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
58
 
15.4%
31
 
8.2%
25
 
6.6%
12
 
3.2%
12
 
3.2%
10
 
2.7%
8
 
2.1%
7
 
1.9%
7
 
1.9%
7
 
1.9%
Other values (106) 200
53.1%
Uppercase Letter
ValueCountFrequency (%)
M 3
13.6%
E 3
13.6%
Q 2
 
9.1%
U 2
 
9.1%
S 1
 
4.5%
J 1
 
4.5%
K 1
 
4.5%
V 1
 
4.5%
C 1
 
4.5%
F 1
 
4.5%
Other values (6) 6
27.3%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 377
90.4%
Latin 22
 
5.3%
Common 18
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
58
 
15.4%
31
 
8.2%
25
 
6.6%
12
 
3.2%
12
 
3.2%
10
 
2.7%
8
 
2.1%
7
 
1.9%
7
 
1.9%
7
 
1.9%
Other values (106) 200
53.1%
Latin
ValueCountFrequency (%)
M 3
13.6%
E 3
13.6%
Q 2
 
9.1%
U 2
 
9.1%
S 1
 
4.5%
J 1
 
4.5%
K 1
 
4.5%
V 1
 
4.5%
C 1
 
4.5%
F 1
 
4.5%
Other values (6) 6
27.3%
Common
ValueCountFrequency (%)
) 6
33.3%
( 6
33.3%
4
22.2%
2 1
 
5.6%
. 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 377
90.4%
ASCII 40
 
9.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
58
 
15.4%
31
 
8.2%
25
 
6.6%
12
 
3.2%
12
 
3.2%
10
 
2.7%
8
 
2.1%
7
 
1.9%
7
 
1.9%
7
 
1.9%
Other values (106) 200
53.1%
ASCII
ValueCountFrequency (%)
) 6
15.0%
( 6
15.0%
4
 
10.0%
M 3
 
7.5%
E 3
 
7.5%
Q 2
 
5.0%
U 2
 
5.0%
S 1
 
2.5%
2 1
 
2.5%
J 1
 
2.5%
Other values (11) 11
27.5%

분류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size716.0 B
일반숙박업소
69 
호텔/콘도
 
4

Length

Max length6
Median length6
Mean length5.9452055
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반숙박업소
2nd row일반숙박업소
3rd row일반숙박업소
4th row호텔/콘도
5th row일반숙박업소

Common Values

ValueCountFrequency (%)
일반숙박업소 69
94.5%
호텔/콘도 4
 
5.5%

Length

2023-12-11T09:47:11.610713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:47:11.750725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반숙박업소 69
94.5%
호텔/콘도 4
 
5.5%
Distinct72
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size716.0 B
2023-12-11T09:47:12.034583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length24
Mean length21.520548
Min length19

Characters and Unicode

Total characters1571
Distinct characters72
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)97.3%

Sample

1st row경상남도 창녕군 남지읍 낙동로 503
2nd row경상남도 창녕군 남지읍 남지시장길 10-3
3rd row경상남도 창녕군 부곡면 온천2길 27
4th row경상남도 창녕군 부곡면 온천2길 41
5th row경상남도 창녕군 부곡면 온천1길 49
ValueCountFrequency (%)
경상남도 73
19.9%
창녕군 73
19.9%
부곡면 25
 
6.8%
창녕읍 13
 
3.6%
남지읍 11
 
3.0%
온천중앙로 10
 
2.7%
온천2길 9
 
2.5%
남지중앙1길 6
 
1.6%
영산계성로 5
 
1.4%
계성면 5
 
1.4%
Other values (101) 136
37.2%
2023-12-11T09:47:12.492709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
293
18.7%
94
 
6.0%
91
 
5.8%
89
 
5.7%
76
 
4.8%
76
 
4.8%
74
 
4.7%
73
 
4.6%
49
 
3.1%
1 46
 
2.9%
Other values (62) 610
38.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1030
65.6%
Space Separator 293
 
18.7%
Decimal Number 223
 
14.2%
Dash Punctuation 23
 
1.5%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
94
 
9.1%
91
 
8.8%
89
 
8.6%
76
 
7.4%
76
 
7.4%
74
 
7.2%
73
 
7.1%
49
 
4.8%
39
 
3.8%
34
 
3.3%
Other values (48) 335
32.5%
Decimal Number
ValueCountFrequency (%)
1 46
20.6%
2 32
14.3%
5 27
12.1%
3 23
10.3%
4 23
10.3%
6 22
9.9%
9 16
 
7.2%
0 14
 
6.3%
7 12
 
5.4%
8 8
 
3.6%
Space Separator
ValueCountFrequency (%)
293
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 23
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1030
65.6%
Common 541
34.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
94
 
9.1%
91
 
8.8%
89
 
8.6%
76
 
7.4%
76
 
7.4%
74
 
7.2%
73
 
7.1%
49
 
4.8%
39
 
3.8%
34
 
3.3%
Other values (48) 335
32.5%
Common
ValueCountFrequency (%)
293
54.2%
1 46
 
8.5%
2 32
 
5.9%
5 27
 
5.0%
3 23
 
4.3%
4 23
 
4.3%
- 23
 
4.3%
6 22
 
4.1%
9 16
 
3.0%
0 14
 
2.6%
Other values (4) 22
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1030
65.6%
ASCII 541
34.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
293
54.2%
1 46
 
8.5%
2 32
 
5.9%
5 27
 
5.0%
3 23
 
4.3%
4 23
 
4.3%
- 23
 
4.3%
6 22
 
4.1%
9 16
 
3.0%
0 14
 
2.6%
Other values (4) 22
 
4.1%
Hangul
ValueCountFrequency (%)
94
 
9.1%
91
 
8.8%
89
 
8.6%
76
 
7.4%
76
 
7.4%
74
 
7.2%
73
 
7.1%
49
 
4.8%
39
 
3.8%
34
 
3.3%
Other values (48) 335
32.5%
Distinct72
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size716.0 B
2023-12-11T09:47:12.731315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.013699
Min length12

Characters and Unicode

Total characters877
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)97.3%

Sample

1st row055-526-2224
2nd row055-526-2234
3rd row055-536-5555
4th row055-536-5181
5th row055-536-5511
ValueCountFrequency (%)
055-536-5181 2
 
2.7%
055-526-2224 1
 
1.4%
055-533-8176 1
 
1.4%
055-526-1377 1
 
1.4%
055-526-4745 1
 
1.4%
055-526-6065 1
 
1.4%
055-536-3345 1
 
1.4%
055-526-1557 1
 
1.4%
055-521-8200 1
 
1.4%
055-532-1255 1
 
1.4%
Other values (62) 62
84.9%
2023-12-11T09:47:13.105270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 254
29.0%
- 146
16.6%
0 127
14.5%
3 84
 
9.6%
2 71
 
8.1%
6 63
 
7.2%
1 48
 
5.5%
7 27
 
3.1%
8 26
 
3.0%
9 16
 
1.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 731
83.4%
Dash Punctuation 146
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 254
34.7%
0 127
17.4%
3 84
 
11.5%
2 71
 
9.7%
6 63
 
8.6%
1 48
 
6.6%
7 27
 
3.7%
8 26
 
3.6%
9 16
 
2.2%
4 15
 
2.1%
Dash Punctuation
ValueCountFrequency (%)
- 146
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 877
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 254
29.0%
- 146
16.6%
0 127
14.5%
3 84
 
9.6%
2 71
 
8.1%
6 63
 
7.2%
1 48
 
5.5%
7 27
 
3.1%
8 26
 
3.0%
9 16
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 877
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 254
29.0%
- 146
16.6%
0 127
14.5%
3 84
 
9.6%
2 71
 
8.1%
6 63
 
7.2%
1 48
 
5.5%
7 27
 
3.1%
8 26
 
3.0%
9 16
 
1.8%

객실수
Real number (ℝ)

HIGH CORRELATION 

Distinct40
Distinct (%)54.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.315068
Minimum8
Maximum238
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size789.0 B
2023-12-11T09:47:13.268394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8
5-th percentile11.6
Q115
median27
Q338
95-th percentile72.2
Maximum238
Range230
Interquartile range (IQR)23

Descriptive statistics

Standard deviation31.606028
Coefficient of variation (CV)0.92105392
Kurtosis23.749474
Mean34.315068
Median Absolute Deviation (MAD)12
Skewness4.1120552
Sum2505
Variance998.94102
MonotonicityNot monotonic
2023-12-11T09:47:13.403718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
14 5
 
6.8%
19 4
 
5.5%
15 4
 
5.5%
20 4
 
5.5%
29 4
 
5.5%
26 3
 
4.1%
12 3
 
4.1%
38 3
 
4.1%
34 3
 
4.1%
13 3
 
4.1%
Other values (30) 37
50.7%
ValueCountFrequency (%)
8 1
 
1.4%
9 1
 
1.4%
10 1
 
1.4%
11 1
 
1.4%
12 3
4.1%
13 3
4.1%
14 5
6.8%
15 4
5.5%
16 1
 
1.4%
18 2
 
2.7%
ValueCountFrequency (%)
238 1
1.4%
105 1
1.4%
98 1
1.4%
80 1
1.4%
67 1
1.4%
65 1
1.4%
64 1
1.4%
60 2
2.7%
58 1
1.4%
56 2
2.7%

Interactions

2023-12-11T09:47:10.520745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:47:13.503049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명분류소재지도로명주소대표전화객실수
업소명1.0001.0000.9980.9981.000
분류1.0001.0000.0000.0001.000
소재지도로명주소0.9980.0001.0001.0000.000
대표전화0.9980.0001.0001.0000.000
객실수1.0001.0000.0000.0001.000
2023-12-11T09:47:13.601248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
객실수분류
객실수1.0000.971
분류0.9711.000

Missing values

2023-12-11T09:47:10.628862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:47:10.758284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명분류소재지도로명주소대표전화객실수
0춘산여관일반숙박업소경상남도 창녕군 남지읍 낙동로 503055-526-222411
1일광여인숙일반숙박업소경상남도 창녕군 남지읍 남지시장길 10-3055-526-22348
2DW모텔일반숙박업소경상남도 창녕군 부곡면 온천2길 27055-536-555535
3(주)레이크힐스호텔호텔/콘도경상남도 창녕군 부곡면 온천2길 41055-536-5181105
4모텔퀸(MOTEL QUEEN)일반숙박업소경상남도 창녕군 부곡면 온천1길 49055-536-551138
5주식회사 부곡신라호텔일반숙박업소경상남도 창녕군 부곡면 온천중앙로 19055-520-660058
6오리온호텔일반숙박업소경상남도 창녕군 부곡면 온천1길 35055-536-571160
7호텔레인보우일반숙박업소경상남도 창녕군 부곡면 온천중앙로 33055-521-577765
8썬크루즈일반숙박업소경상남도 창녕군 영산면 연지길 26-14055-521-279818
9부일온천일반숙박업소경상남도 창녕군 부곡면 온천중앙로 62055-536-542038
업소명분류소재지도로명주소대표전화객실수
63씨엔케이호텔일반숙박업소경상남도 창녕군 대합면 대합산업단지로 119-9055-532-050640
64호텔궁일반숙박업소경상남도 창녕군 대지면 미산길 3-42055-532-222014
65동궁무인텔일반숙박업소경상남도 창녕군 도천면 치이골길 56-24055-521-666914
66U모텔일반숙박업소경상남도 창녕군 도천면 치이골길 56-22055-521-826014
67하이츠호텔일반숙박업소경상남도 창녕군 남지읍 남지중앙1길 47-9055-521-170046
68에스에스무인호텔일반숙박업소경상남도 창녕군 대지면 미산길 3-50055-532-233415
69문도트호텔일반숙박업소경상남도 창녕군 성산면 경남대로 5742-13 (문도트호텔)055-532-088919
70제이모텔일반숙박업소경상남도 창녕군 대합면 평지퇴산로 432055-533-822812
71동정호무인호텔일반숙박업소경상남도 창녕군 창녕읍 계성화왕산로 470055-521-661225
72썸호텔일반숙박업소경상남도 창녕군 대지면 경남대로 5055055-533-551913