Overview

Dataset statistics

Number of variables5
Number of observations117
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.8 KiB
Average record size in memory42.1 B

Variable types

Categorical2
Text2
Numeric1

Dataset

Description당진시 관내 숙박업소들에 대하여 업종, 업소명, 주소, 객실수에 대한 정보 제공
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=371&beforeMenuCd=DOM_000000201001001000&publicdatapk=15077225

Alerts

지역 has constant value ""Constant
업종 is highly imbalanced (87.7%)Imbalance
업소명 has unique valuesUnique

Reproduction

Analysis started2024-01-09 19:47:19.638220
Analysis finished2024-01-09 19:47:20.018587
Duration0.38 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
당진시
117 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row당진시
2nd row당진시
3rd row당진시
4th row당진시
5th row당진시

Common Values

ValueCountFrequency (%)
당진시 117
100.0%

Length

2024-01-10T04:47:20.065349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T04:47:20.132788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
당진시 117
100.0%

업종
Categorical

IMBALANCE 

Distinct3
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
숙박업
114 
한옥체험업
 
2
관광호텔업
 
1

Length

Max length5
Median length3
Mean length3.0512821
Min length3

Unique

Unique1 ?
Unique (%)0.9%

Sample

1st row관광호텔업
2nd row한옥체험업
3rd row한옥체험업
4th row숙박업
5th row숙박업

Common Values

ValueCountFrequency (%)
숙박업 114
97.4%
한옥체험업 2
 
1.7%
관광호텔업 1
 
0.9%

Length

2024-01-10T04:47:20.209978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T04:47:20.295316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
숙박업 114
97.4%
한옥체험업 2
 
1.7%
관광호텔업 1
 
0.9%

업소명
Text

UNIQUE 

Distinct117
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2024-01-10T04:47:20.488210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length9
Mean length4.9487179
Min length1

Characters and Unicode

Total characters579
Distinct characters177
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique117 ?
Unique (%)100.0%

Sample

1st row㈜당진관광호텔
2nd row시은고택
3rd row벽송재
4th row신흥여인숙
5th row금호여인숙
ValueCountFrequency (%)
호텔 2
 
1.6%
여관 2
 
1.6%
㈜당진관광호텔 1
 
0.8%
꿈의궁전여관 1
 
0.8%
모텔케이 1
 
0.8%
모텔비치타운 1
 
0.8%
노블리스 1
 
0.8%
행담도모텔 1
 
0.8%
츄리 1
 
0.8%
두바이호텔 1
 
0.8%
Other values (111) 111
90.2%
2024-01-10T04:47:20.800758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
59
 
10.2%
39
 
6.7%
39
 
6.7%
36
 
6.2%
24
 
4.1%
18
 
3.1%
13
 
2.2%
12
 
2.1%
12
 
2.1%
12
 
2.1%
Other values (167) 315
54.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 545
94.1%
Uppercase Letter 14
 
2.4%
Lowercase Letter 7
 
1.2%
Space Separator 6
 
1.0%
Close Punctuation 2
 
0.3%
Open Punctuation 2
 
0.3%
Decimal Number 2
 
0.3%
Other Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
10.8%
39
 
7.2%
39
 
7.2%
36
 
6.6%
24
 
4.4%
18
 
3.3%
13
 
2.4%
12
 
2.2%
12
 
2.2%
12
 
2.2%
Other values (146) 281
51.6%
Uppercase Letter
ValueCountFrequency (%)
M 2
14.3%
S 2
14.3%
X 2
14.3%
O 2
14.3%
K 1
7.1%
Q 1
7.1%
D 1
7.1%
F 1
7.1%
C 1
7.1%
L 1
7.1%
Lowercase Letter
ValueCountFrequency (%)
e 3
42.9%
a 1
 
14.3%
l 1
 
14.3%
t 1
 
14.3%
o 1
 
14.3%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
1 1
50.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 546
94.3%
Latin 21
 
3.6%
Common 12
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
10.8%
39
 
7.1%
39
 
7.1%
36
 
6.6%
24
 
4.4%
18
 
3.3%
13
 
2.4%
12
 
2.2%
12
 
2.2%
12
 
2.2%
Other values (147) 282
51.6%
Latin
ValueCountFrequency (%)
e 3
14.3%
M 2
 
9.5%
S 2
 
9.5%
X 2
 
9.5%
O 2
 
9.5%
K 1
 
4.8%
Q 1
 
4.8%
a 1
 
4.8%
D 1
 
4.8%
F 1
 
4.8%
Other values (5) 5
23.8%
Common
ValueCountFrequency (%)
6
50.0%
) 2
 
16.7%
( 2
 
16.7%
2 1
 
8.3%
1 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 545
94.1%
ASCII 33
 
5.7%
None 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
59
 
10.8%
39
 
7.2%
39
 
7.2%
36
 
6.6%
24
 
4.4%
18
 
3.3%
13
 
2.4%
12
 
2.2%
12
 
2.2%
12
 
2.2%
Other values (146) 281
51.6%
ASCII
ValueCountFrequency (%)
6
18.2%
e 3
 
9.1%
M 2
 
6.1%
) 2
 
6.1%
( 2
 
6.1%
S 2
 
6.1%
X 2
 
6.1%
O 2
 
6.1%
2 1
 
3.0%
K 1
 
3.0%
Other values (10) 10
30.3%
None
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct116
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2024-01-10T04:47:20.954321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length29
Mean length23.367521
Min length18

Characters and Unicode

Total characters2734
Distinct characters122
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique115 ?
Unique (%)98.3%

Sample

1st row충청남도 당진시 송악읍 반촌로 192
2nd row충청남도 당진시 면천면 소리벌길 109-22
3rd row충청남도 당진시 면천면 삼웅안길 27-15
4th row충청남도 당진시 합덕읍 합덕교동1길 20
5th row충청남도 당진시 당진시장길 99, 89호 (읍내동)
ValueCountFrequency (%)
충청남도 117
19.6%
당진시 117
19.6%
송악읍 32
 
5.4%
읍내동 29
 
4.9%
석문면 14
 
2.3%
당진중앙2로 14
 
2.3%
신평면 12
 
2.0%
당진중앙3로 10
 
1.7%
반촌로 9
 
1.5%
합덕읍 7
 
1.2%
Other values (180) 235
39.4%
2024-01-10T04:47:21.216539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
479
17.5%
149
 
5.4%
147
 
5.4%
126
 
4.6%
125
 
4.6%
118
 
4.3%
117
 
4.3%
117
 
4.3%
1 93
 
3.4%
72
 
2.6%
Other values (112) 1191
43.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1684
61.6%
Space Separator 479
 
17.5%
Decimal Number 419
 
15.3%
Dash Punctuation 57
 
2.1%
Close Punctuation 42
 
1.5%
Open Punctuation 42
 
1.5%
Other Punctuation 11
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
149
 
8.8%
147
 
8.7%
126
 
7.5%
125
 
7.4%
118
 
7.0%
117
 
6.9%
117
 
6.9%
72
 
4.3%
68
 
4.0%
47
 
2.8%
Other values (97) 598
35.5%
Decimal Number
ValueCountFrequency (%)
1 93
22.2%
2 69
16.5%
3 62
14.8%
6 35
 
8.4%
5 35
 
8.4%
8 32
 
7.6%
7 30
 
7.2%
4 25
 
6.0%
9 24
 
5.7%
0 14
 
3.3%
Space Separator
ValueCountFrequency (%)
479
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 57
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Other Punctuation
ValueCountFrequency (%)
, 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1684
61.6%
Common 1050
38.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
149
 
8.8%
147
 
8.7%
126
 
7.5%
125
 
7.4%
118
 
7.0%
117
 
6.9%
117
 
6.9%
72
 
4.3%
68
 
4.0%
47
 
2.8%
Other values (97) 598
35.5%
Common
ValueCountFrequency (%)
479
45.6%
1 93
 
8.9%
2 69
 
6.6%
3 62
 
5.9%
- 57
 
5.4%
) 42
 
4.0%
( 42
 
4.0%
6 35
 
3.3%
5 35
 
3.3%
8 32
 
3.0%
Other values (5) 104
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1684
61.6%
ASCII 1050
38.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
479
45.6%
1 93
 
8.9%
2 69
 
6.6%
3 62
 
5.9%
- 57
 
5.4%
) 42
 
4.0%
( 42
 
4.0%
6 35
 
3.3%
5 35
 
3.3%
8 32
 
3.0%
Other values (5) 104
 
9.9%
Hangul
ValueCountFrequency (%)
149
 
8.8%
147
 
8.7%
126
 
7.5%
125
 
7.4%
118
 
7.0%
117
 
6.9%
117
 
6.9%
72
 
4.3%
68
 
4.0%
47
 
2.8%
Other values (97) 598
35.5%

객실수
Real number (ℝ)

Distinct40
Distinct (%)34.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.598291
Minimum3
Maximum66
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-01-10T04:47:21.319165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile7
Q114
median19
Q330
95-th percentile45
Maximum66
Range63
Interquartile range (IQR)16

Descriptive statistics

Standard deviation12.169818
Coefficient of variation (CV)0.53852827
Kurtosis0.57160541
Mean22.598291
Median Absolute Deviation (MAD)9
Skewness0.81750265
Sum2644
Variance148.10448
MonotonicityNot monotonic
2024-01-10T04:47:21.417875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
19 9
 
7.7%
18 8
 
6.8%
8 6
 
5.1%
17 6
 
5.1%
10 5
 
4.3%
13 4
 
3.4%
24 4
 
3.4%
27 4
 
3.4%
9 4
 
3.4%
22 3
 
2.6%
Other values (30) 64
54.7%
ValueCountFrequency (%)
3 1
 
0.9%
4 2
 
1.7%
6 2
 
1.7%
7 2
 
1.7%
8 6
5.1%
9 4
3.4%
10 5
4.3%
12 3
2.6%
13 4
3.4%
14 3
2.6%
ValueCountFrequency (%)
66 1
 
0.9%
55 1
 
0.9%
48 2
1.7%
46 1
 
0.9%
45 3
2.6%
43 1
 
0.9%
42 1
 
0.9%
41 1
 
0.9%
40 2
1.7%
39 2
1.7%

Interactions

2024-01-10T04:47:19.841784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T04:47:21.484294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종객실수
업종1.0000.335
객실수0.3351.000
2024-01-10T04:47:21.543846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
객실수업종
객실수1.0000.205
업종0.2051.000

Missing values

2024-01-10T04:47:19.922439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T04:47:19.990870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역업종업소명주소객실수
0당진시관광호텔업㈜당진관광호텔충청남도 당진시 송악읍 반촌로 19245
1당진시한옥체험업시은고택충청남도 당진시 면천면 소리벌길 109-223
2당진시한옥체험업벽송재충청남도 당진시 면천면 삼웅안길 27-154
3당진시숙박업신흥여인숙충청남도 당진시 합덕읍 합덕교동1길 208
4당진시숙박업금호여인숙충청남도 당진시 당진시장길 99, 89호 (읍내동)9
5당진시숙박업동궁장여관충청남도 당진시 당진중앙2로 41-9 (읍내동)8
6당진시숙박업신정여인숙충청남도 당진시 당진중앙3로 44 (읍내동)8
7당진시숙박업삼일여관충청남도 당진시 당진중앙3로 53 (읍내동)9
8당진시숙박업현대여관충청남도 당진시 당진시장북길 37-19 (읍내동)7
9당진시숙박업모텔러브충청남도 당진시 당진중앙3로 33 (읍내동)19
지역업종업소명주소객실수
107당진시숙박업해와달 펜션충청남도 당진시 석문면 새골길 74-7510
108당진시숙박업펀다이어트캠프충청남도 당진시 대호지면 빈정들길 80-598
109당진시숙박업아미성한옥펜션충청남도 당진시 순성면 남부로 816-148
110당진시숙박업XO1호텔충청남도 당진시 송악읍 구래1길 2618
111당진시숙박업XO2호텔충청남도 당진시 송악읍 구래1길 2417
112당진시숙박업라메르펜션충청남도 당진시 석문면 석문해안로 133, 라메르펜션16
113당진시숙박업힐링인더삽교충청남도 당진시 신평면 삽교천3길 49-1, 2층4
114당진시숙박업하늘빛바다충청남도 당진시 석문면 석문해안로 19-26 (하늘빛바다펜션)10
115당진시숙박업왜목펜션충청남도 당진시 석문면 석문해안로 9, 왜목팬션14
116당진시숙박업왜목빌리지충청남도 당진시 석문면 석문해안로 19-4 (왜목빌리지)12