Overview

Dataset statistics

Number of variables4
Number of observations118
Missing cells7
Missing cells (%)1.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.9 KiB
Average record size in memory34.1 B

Variable types

Text3
Numeric1

Dataset

Description예산군에 있는 호텔및여관 정보(업소명, 전화번호, 객실수, 주소) 제공
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=408&beforeMenuCd=DOM_000000201001001000&publicdatapk=15049861

Alerts

소재지전화 has 7 (5.9%) missing valuesMissing
업소명 has unique valuesUnique

Reproduction

Analysis started2024-01-09 19:53:54.575673
Analysis finished2024-01-09 19:53:54.908886
Duration0.33 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct118
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-01-10T04:53:55.122781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length4.720339
Min length1

Characters and Unicode

Total characters557
Distinct characters177
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique118 ?
Unique (%)100.0%

Sample

1st row인천
2nd row제일여관
3rd row로또
4th row한양여관
5th row청수장
ValueCountFrequency (%)
인천 1
 
0.8%
스파모텔 1
 
0.8%
현미장 1
 
0.8%
에이투호텔디자이너스주식회사 1
 
0.8%
에이스모텔 1
 
0.8%
아이호텔 1
 
0.8%
태양여관 1
 
0.8%
스파(spa)텔 1
 
0.8%
티호텔 1
 
0.8%
스플라스리솜 1
 
0.8%
Other values (108) 108
91.5%
2024-01-10T04:53:55.500430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
46
 
8.3%
23
 
4.1%
21
 
3.8%
20
 
3.6%
19
 
3.4%
18
 
3.2%
17
 
3.1%
16
 
2.9%
13
 
2.3%
13
 
2.3%
Other values (167) 351
63.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 510
91.6%
Uppercase Letter 28
 
5.0%
Open Punctuation 7
 
1.3%
Close Punctuation 7
 
1.3%
Lowercase Letter 3
 
0.5%
Math Symbol 1
 
0.2%
Space Separator 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
46
 
9.0%
23
 
4.5%
21
 
4.1%
20
 
3.9%
19
 
3.7%
18
 
3.5%
17
 
3.3%
16
 
3.1%
13
 
2.5%
13
 
2.5%
Other values (146) 304
59.6%
Uppercase Letter
ValueCountFrequency (%)
A 5
17.9%
O 5
17.9%
G 4
14.3%
S 2
 
7.1%
J 2
 
7.1%
Z 2
 
7.1%
I 1
 
3.6%
V 1
 
3.6%
P 1
 
3.6%
B 1
 
3.6%
Other values (4) 4
14.3%
Lowercase Letter
ValueCountFrequency (%)
s 1
33.3%
p 1
33.3%
a 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 510
91.6%
Latin 31
 
5.6%
Common 16
 
2.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
46
 
9.0%
23
 
4.5%
21
 
4.1%
20
 
3.9%
19
 
3.7%
18
 
3.5%
17
 
3.3%
16
 
3.1%
13
 
2.5%
13
 
2.5%
Other values (146) 304
59.6%
Latin
ValueCountFrequency (%)
A 5
16.1%
O 5
16.1%
G 4
12.9%
S 2
 
6.5%
J 2
 
6.5%
Z 2
 
6.5%
I 1
 
3.2%
V 1
 
3.2%
P 1
 
3.2%
s 1
 
3.2%
Other values (7) 7
22.6%
Common
ValueCountFrequency (%)
( 7
43.8%
) 7
43.8%
+ 1
 
6.2%
1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 510
91.6%
ASCII 47
 
8.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
46
 
9.0%
23
 
4.5%
21
 
4.1%
20
 
3.9%
19
 
3.7%
18
 
3.5%
17
 
3.3%
16
 
3.1%
13
 
2.5%
13
 
2.5%
Other values (146) 304
59.6%
ASCII
ValueCountFrequency (%)
( 7
14.9%
) 7
14.9%
A 5
10.6%
O 5
10.6%
G 4
 
8.5%
S 2
 
4.3%
J 2
 
4.3%
Z 2
 
4.3%
I 1
 
2.1%
V 1
 
2.1%
Other values (11) 11
23.4%

소재지전화
Text

MISSING 

Distinct108
Distinct (%)97.3%
Missing7
Missing (%)5.9%
Memory size1.1 KiB
2024-01-10T04:53:55.706157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters1332
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique105 ?
Unique (%)94.6%

Sample

1st row041-334-5077
2nd row041-332-2513
3rd row041-335-8896
4th row041-334-3262
5th row041-334-6696
ValueCountFrequency (%)
041-338-0067 2
 
1.8%
041-333-1112 2
 
1.8%
041-331-4343 2
 
1.8%
041-337-1000 1
 
0.9%
041-338-1118 1
 
0.9%
041-330-8000 1
 
0.9%
041-332-7754 1
 
0.9%
041-337-6748 1
 
0.9%
041-334-0209 1
 
0.9%
041-338-1155 1
 
0.9%
Other values (98) 98
88.3%
2024-01-10T04:53:56.000519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 292
21.9%
- 222
16.7%
0 191
14.3%
1 177
13.3%
4 162
12.2%
8 62
 
4.7%
7 61
 
4.6%
5 48
 
3.6%
6 45
 
3.4%
2 43
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1110
83.3%
Dash Punctuation 222
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 292
26.3%
0 191
17.2%
1 177
15.9%
4 162
14.6%
8 62
 
5.6%
7 61
 
5.5%
5 48
 
4.3%
6 45
 
4.1%
2 43
 
3.9%
9 29
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 222
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1332
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 292
21.9%
- 222
16.7%
0 191
14.3%
1 177
13.3%
4 162
12.2%
8 62
 
4.7%
7 61
 
4.6%
5 48
 
3.6%
6 45
 
3.4%
2 43
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1332
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 292
21.9%
- 222
16.7%
0 191
14.3%
1 177
13.3%
4 162
12.2%
8 62
 
4.7%
7 61
 
4.6%
5 48
 
3.6%
6 45
 
3.4%
2 43
 
3.2%

객실수
Real number (ℝ)

Distinct38
Distinct (%)32.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.550847
Minimum4
Maximum407
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.2 KiB
2024-01-10T04:53:56.108027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4
5-th percentile6
Q19
median17.5
Q323.75
95-th percentile41.15
Maximum407
Range403
Interquartile range (IQR)14.75

Descriptive statistics

Standard deviation37.89044
Coefficient of variation (CV)1.7581879
Kurtosis93.426657
Mean21.550847
Median Absolute Deviation (MAD)7.5
Skewness9.2168299
Sum2543
Variance1435.6854
MonotonicityNot monotonic
2024-01-10T04:53:56.207205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
9 10
 
8.5%
19 8
 
6.8%
7 8
 
6.8%
8 6
 
5.1%
20 6
 
5.1%
6 6
 
5.1%
10 6
 
5.1%
18 6
 
5.1%
22 6
 
5.1%
12 5
 
4.2%
Other values (28) 51
43.2%
ValueCountFrequency (%)
4 2
 
1.7%
5 2
 
1.7%
6 6
5.1%
7 8
6.8%
8 6
5.1%
9 10
8.5%
10 6
5.1%
11 3
 
2.5%
12 5
4.2%
13 1
 
0.8%
ValueCountFrequency (%)
407 1
0.8%
97 1
0.8%
52 1
0.8%
48 1
0.8%
44 1
0.8%
42 1
0.8%
41 1
0.8%
38 1
0.8%
36 1
0.8%
35 2
1.7%
Distinct114
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-01-10T04:53:56.472981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length28
Mean length22.389831
Min length18

Characters and Unicode

Total characters2642
Distinct characters99
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique110 ?
Unique (%)93.2%

Sample

1st row충청남도 예산군 예산읍 신례원로212번길 116
2nd row충청남도 예산군 예산읍 주교로 65-20
3rd row충청남도 예산군 예산읍 아리랑로11번길 8
4th row충청남도 예산군 예산읍 아리랑로11번길 9-1
5th row충청남도 예산군 예산읍 창말로 4-8
ValueCountFrequency (%)
충청남도 118
19.8%
예산군 118
19.8%
예산읍 54
 
9.0%
덕산면 45
 
7.5%
주교로 12
 
2.0%
온천단지1로 9
 
1.5%
아리랑로11번길 8
 
1.3%
삽교읍 7
 
1.2%
아리랑로 5
 
0.8%
덕산온천로 5
 
0.8%
Other values (156) 216
36.2%
2024-01-10T04:53:57.072889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
479
18.1%
236
 
8.9%
181
 
6.9%
124
 
4.7%
121
 
4.6%
120
 
4.5%
120
 
4.5%
118
 
4.5%
1 115
 
4.4%
98
 
3.7%
Other values (89) 930
35.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1690
64.0%
Space Separator 479
 
18.1%
Decimal Number 407
 
15.4%
Dash Punctuation 53
 
2.0%
Other Punctuation 8
 
0.3%
Close Punctuation 2
 
0.1%
Open Punctuation 2
 
0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
236
14.0%
181
10.7%
124
 
7.3%
121
 
7.2%
120
 
7.1%
120
 
7.1%
118
 
7.0%
98
 
5.8%
61
 
3.6%
59
 
3.5%
Other values (73) 452
26.7%
Decimal Number
ValueCountFrequency (%)
1 115
28.3%
2 48
11.8%
5 41
 
10.1%
4 37
 
9.1%
0 37
 
9.1%
3 36
 
8.8%
6 33
 
8.1%
9 23
 
5.7%
7 22
 
5.4%
8 15
 
3.7%
Space Separator
ValueCountFrequency (%)
479
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 53
100.0%
Other Punctuation
ValueCountFrequency (%)
, 8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1690
64.0%
Common 951
36.0%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
236
14.0%
181
10.7%
124
 
7.3%
121
 
7.2%
120
 
7.1%
120
 
7.1%
118
 
7.0%
98
 
5.8%
61
 
3.6%
59
 
3.5%
Other values (73) 452
26.7%
Common
ValueCountFrequency (%)
479
50.4%
1 115
 
12.1%
- 53
 
5.6%
2 48
 
5.0%
5 41
 
4.3%
4 37
 
3.9%
0 37
 
3.9%
3 36
 
3.8%
6 33
 
3.5%
9 23
 
2.4%
Other values (5) 49
 
5.2%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1690
64.0%
ASCII 952
36.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
479
50.3%
1 115
 
12.1%
- 53
 
5.6%
2 48
 
5.0%
5 41
 
4.3%
4 37
 
3.9%
0 37
 
3.9%
3 36
 
3.8%
6 33
 
3.5%
9 23
 
2.4%
Other values (6) 50
 
5.3%
Hangul
ValueCountFrequency (%)
236
14.0%
181
10.7%
124
 
7.3%
121
 
7.2%
120
 
7.1%
120
 
7.1%
118
 
7.0%
98
 
5.8%
61
 
3.6%
59
 
3.5%
Other values (73) 452
26.7%

Interactions

2024-01-10T04:53:54.726685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-01-10T04:53:54.822289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T04:53:54.883262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명소재지전화객실수주소(도로명)
0인천041-334-50774충청남도 예산군 예산읍 신례원로212번길 116
1제일여관041-332-251310충청남도 예산군 예산읍 주교로 65-20
2로또041-335-88969충청남도 예산군 예산읍 아리랑로11번길 8
3한양여관041-334-32626충청남도 예산군 예산읍 아리랑로11번길 9-1
4청수장041-334-669612충청남도 예산군 예산읍 창말로 4-8
5일흥장041-333-800015충청남도 예산군 예산읍 아리랑로11번길 5
6한양장041-334-635315충청남도 예산군 예산읍 신례원로 219
7대호장041-334-511516충청남도 예산군 예산읍 창말로 12-1
8영신장041-332-258417충청남도 예산군 예산읍 아리랑로 11
9예일장041-335-260419충청남도 예산군 예산읍 임성로7번길 5
업소명소재지전화객실수주소(도로명)
108041-338-211011충청남도 예산군 덕산면 온천단지1로 101
109봉수산숲펜션041-332-19197충청남도 예산군 대흥면 임존성길 168
110초콜릿041-337-036610충청남도 예산군 덕산면 덕산향교길 107
111도영펜션<NA>7충청남도 예산군 덕산면 덕산향교길 100-3
112펜션허브041-337-600912충청남도 예산군 덕산면 대치남길 15-5
113형제펜션041-338-111810충청남도 예산군 덕산면 덕산향교길 108-8
114스파뷰호텔041-337-100097충청남도 예산군 덕산면 온천단지2로 77
115덕산참숯랜드041-337-639212충청남도 예산군 덕산면 노곡길 59
116온연프라이빗빌라<NA>6충청남도 예산군 덕산면 온천단지1로 69-5
117해월펜션<NA>6충청남도 예산군 응봉면 예당관광로 61, 3층