Overview

Dataset statistics

Number of variables5
Number of observations54
Missing cells7
Missing cells (%)2.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.3 KiB
Average record size in memory44.4 B

Variable types

Text3
Numeric2

Dataset

Description예산군에 있는 호텔및여관 정보(업소명, 전화번호, 객실수, 주소) 제공
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=408&beforeMenuCd=DOM_000000201001001000&publicdatapk=15049861

Alerts

주차대수 has 7 (13.0%) missing valuesMissing
업소명 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2024-01-09 19:53:51.538751
Analysis finished2024-01-09 19:53:52.090529
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct54
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size564.0 B
2024-01-10T04:53:52.220827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length9.5
Mean length6.7777778
Min length1

Characters and Unicode

Total characters366
Distinct characters105
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)100.0%

Sample

1st rowS드라이브무인텔
2nd rowZAZA호텔
3rd row가야관광호텔
4th row그랜드모텔
5th row궁전파크장(여관)
ValueCountFrequency (%)
s드라이브무인텔 1
 
1.8%
원모텔(여관 1
 
1.8%
신원파크장(여관 1
 
1.8%
아리장여관 1
 
1.8%
아이호텔 1
 
1.8%
애리장(여관 1
 
1.8%
애플파크장(여관 1
 
1.8%
에이스모텔 1
 
1.8%
에이투호텔디자이너스 1
 
1.8%
예당레이크하우스(여관 1
 
1.8%
Other values (45) 45
81.8%
2024-01-10T04:53:52.518057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30
 
8.2%
28
 
7.7%
( 26
 
7.1%
) 26
 
7.1%
25
 
6.8%
14
 
3.8%
13
 
3.6%
13
 
3.6%
13
 
3.6%
12
 
3.3%
Other values (95) 166
45.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 302
82.5%
Open Punctuation 26
 
7.1%
Close Punctuation 26
 
7.1%
Uppercase Letter 8
 
2.2%
Lowercase Letter 3
 
0.8%
Space Separator 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
9.9%
28
 
9.3%
25
 
8.3%
14
 
4.6%
13
 
4.3%
13
 
4.3%
13
 
4.3%
12
 
4.0%
8
 
2.6%
7
 
2.3%
Other values (83) 139
46.0%
Uppercase Letter
ValueCountFrequency (%)
Z 2
25.0%
A 2
25.0%
S 1
12.5%
X 1
12.5%
Y 1
12.5%
M 1
12.5%
Lowercase Letter
ValueCountFrequency (%)
a 1
33.3%
p 1
33.3%
s 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 302
82.5%
Common 53
 
14.5%
Latin 11
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
9.9%
28
 
9.3%
25
 
8.3%
14
 
4.6%
13
 
4.3%
13
 
4.3%
13
 
4.3%
12
 
4.0%
8
 
2.6%
7
 
2.3%
Other values (83) 139
46.0%
Latin
ValueCountFrequency (%)
Z 2
18.2%
A 2
18.2%
S 1
9.1%
a 1
9.1%
p 1
9.1%
s 1
9.1%
X 1
9.1%
Y 1
9.1%
M 1
9.1%
Common
ValueCountFrequency (%)
( 26
49.1%
) 26
49.1%
1
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 302
82.5%
ASCII 64
 
17.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
30
 
9.9%
28
 
9.3%
25
 
8.3%
14
 
4.6%
13
 
4.3%
13
 
4.3%
13
 
4.3%
12
 
4.0%
8
 
2.6%
7
 
2.3%
Other values (83) 139
46.0%
ASCII
ValueCountFrequency (%)
( 26
40.6%
) 26
40.6%
Z 2
 
3.1%
A 2
 
3.1%
S 1
 
1.6%
1
 
1.6%
a 1
 
1.6%
p 1
 
1.6%
s 1
 
1.6%
X 1
 
1.6%
Other values (2) 2
 
3.1%

전화번호
Text

UNIQUE 

Distinct54
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size564.0 B
2024-01-10T04:53:52.712700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters648
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54 ?
Unique (%)100.0%

Sample

1st row041-333-1112
2nd row041-331-9114
3rd row041-337-0101
4th row041-334-8934
5th row041-337-7893
ValueCountFrequency (%)
041-333-1112 1
 
1.9%
041-338-1992 1
 
1.9%
041-337-3070 1
 
1.9%
041-334-7001 1
 
1.9%
041-331-0310 1
 
1.9%
041-331-8888 1
 
1.9%
041-337-4020 1
 
1.9%
041-333-1221 1
 
1.9%
041-335-8500 1
 
1.9%
041-337-0611 1
 
1.9%
Other values (44) 44
81.5%
2024-01-10T04:53:52.996092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 140
21.6%
- 108
16.7%
1 96
14.8%
0 92
14.2%
4 77
11.9%
7 33
 
5.1%
8 31
 
4.8%
6 20
 
3.1%
5 19
 
2.9%
2 18
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 540
83.3%
Dash Punctuation 108
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 140
25.9%
1 96
17.8%
0 92
17.0%
4 77
14.3%
7 33
 
6.1%
8 31
 
5.7%
6 20
 
3.7%
5 19
 
3.5%
2 18
 
3.3%
9 14
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 108
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 648
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 140
21.6%
- 108
16.7%
1 96
14.8%
0 92
14.2%
4 77
11.9%
7 33
 
5.1%
8 31
 
4.8%
6 20
 
3.1%
5 19
 
2.9%
2 18
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 648
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 140
21.6%
- 108
16.7%
1 96
14.8%
0 92
14.2%
4 77
11.9%
7 33
 
5.1%
8 31
 
4.8%
6 20
 
3.1%
5 19
 
2.9%
2 18
 
2.8%

객실수
Real number (ℝ)

Distinct28
Distinct (%)51.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24.462963
Minimum7
Maximum54
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size618.0 B
2024-01-10T04:53:53.099491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum7
5-th percentile9.65
Q119
median24
Q328.75
95-th percentile45.4
Maximum54
Range47
Interquartile range (IQR)9.75

Descriptive statistics

Standard deviation10.302432
Coefficient of variation (CV)0.42114409
Kurtosis0.93799414
Mean24.462963
Median Absolute Deviation (MAD)5
Skewness0.85051681
Sum1321
Variance106.14011
MonotonicityNot monotonic
2024-01-10T04:53:53.185867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
24 7
 
13.0%
22 4
 
7.4%
19 4
 
7.4%
17 3
 
5.6%
20 3
 
5.6%
25 3
 
5.6%
26 3
 
5.6%
12 2
 
3.7%
18 2
 
3.7%
7 2
 
3.7%
Other values (18) 21
38.9%
ValueCountFrequency (%)
7 2
3.7%
9 1
 
1.9%
10 1
 
1.9%
11 1
 
1.9%
12 2
3.7%
14 1
 
1.9%
17 3
5.6%
18 2
3.7%
19 4
7.4%
20 3
5.6%
ValueCountFrequency (%)
54 1
1.9%
50 1
1.9%
48 1
1.9%
44 1
1.9%
41 1
1.9%
38 1
1.9%
35 2
3.7%
34 2
3.7%
33 1
1.9%
30 2
3.7%

주차대수
Real number (ℝ)

MISSING 

Distinct16
Distinct (%)34.0%
Missing7
Missing (%)13.0%
Infinite0
Infinite (%)0.0%
Mean31.06383
Minimum10
Maximum200
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size618.0 B
2024-01-10T04:53:53.263947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile10
Q115
median20
Q335
95-th percentile50
Maximum200
Range190
Interquartile range (IQR)20

Descriptive statistics

Standard deviation30.892805
Coefficient of variation (CV)0.99449441
Kurtosis20.078158
Mean31.06383
Median Absolute Deviation (MAD)10
Skewness4.0454428
Sum1460
Variance954.3654
MonotonicityNot monotonic
2024-01-10T04:53:53.346621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
20 9
16.7%
15 8
14.8%
50 7
13.0%
10 6
11.1%
30 4
7.4%
40 2
 
3.7%
35 2
 
3.7%
34 1
 
1.9%
110 1
 
1.9%
25 1
 
1.9%
Other values (6) 6
11.1%
(Missing) 7
13.0%
ValueCountFrequency (%)
10 6
11.1%
14 1
 
1.9%
15 8
14.8%
19 1
 
1.9%
20 9
16.7%
23 1
 
1.9%
24 1
 
1.9%
25 1
 
1.9%
30 4
7.4%
31 1
 
1.9%
ValueCountFrequency (%)
200 1
 
1.9%
110 1
 
1.9%
50 7
13.0%
40 2
 
3.7%
35 2
 
3.7%
34 1
 
1.9%
31 1
 
1.9%
30 4
7.4%
25 1
 
1.9%
24 1
 
1.9%

주소
Text

Distinct51
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size564.0 B
2024-01-10T04:53:53.550025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length25
Mean length21.240741
Min length18

Characters and Unicode

Total characters1147
Distinct characters73
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)88.9%

Sample

1st row충청남도 예산군 예산읍 벚꽃로 504-11
2nd row충청남도 예산군 예산읍 예산산업단지로 15-19
3rd row충청남도 예산군 덕산면 신평1길 14
4th row충청남도 예산군 예산읍 충서로 1344
5th row충청남도 예산군 덕산면 덕산온천로 14
ValueCountFrequency (%)
충청남도 54
20.1%
예산군 54
20.1%
덕산면 29
 
10.8%
예산읍 16
 
6.0%
14 4
 
1.5%
덕산온천로 4
 
1.5%
온천단지1로 4
 
1.5%
수암산로 4
 
1.5%
삽교읍 3
 
1.1%
벚꽃로 3
 
1.1%
Other values (74) 93
34.7%
2024-01-10T04:53:53.867935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
214
18.7%
115
 
10.0%
76
 
6.6%
61
 
5.3%
55
 
4.8%
55
 
4.8%
55
 
4.8%
54
 
4.7%
1 45
 
3.9%
43
 
3.7%
Other values (63) 374
32.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 760
66.3%
Space Separator 214
 
18.7%
Decimal Number 158
 
13.8%
Dash Punctuation 15
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
15.1%
76
10.0%
61
 
8.0%
55
 
7.2%
55
 
7.2%
55
 
7.2%
54
 
7.1%
43
 
5.7%
38
 
5.0%
35
 
4.6%
Other values (51) 173
22.8%
Decimal Number
ValueCountFrequency (%)
1 45
28.5%
4 22
13.9%
5 20
12.7%
3 19
12.0%
0 13
 
8.2%
2 12
 
7.6%
9 9
 
5.7%
6 7
 
4.4%
7 7
 
4.4%
8 4
 
2.5%
Space Separator
ValueCountFrequency (%)
214
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 760
66.3%
Common 387
33.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
15.1%
76
10.0%
61
 
8.0%
55
 
7.2%
55
 
7.2%
55
 
7.2%
54
 
7.1%
43
 
5.7%
38
 
5.0%
35
 
4.6%
Other values (51) 173
22.8%
Common
ValueCountFrequency (%)
214
55.3%
1 45
 
11.6%
4 22
 
5.7%
5 20
 
5.2%
3 19
 
4.9%
- 15
 
3.9%
0 13
 
3.4%
2 12
 
3.1%
9 9
 
2.3%
6 7
 
1.8%
Other values (2) 11
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 760
66.3%
ASCII 387
33.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
214
55.3%
1 45
 
11.6%
4 22
 
5.7%
5 20
 
5.2%
3 19
 
4.9%
- 15
 
3.9%
0 13
 
3.4%
2 12
 
3.1%
9 9
 
2.3%
6 7
 
1.8%
Other values (2) 11
 
2.8%
Hangul
ValueCountFrequency (%)
115
15.1%
76
10.0%
61
 
8.0%
55
 
7.2%
55
 
7.2%
55
 
7.2%
54
 
7.1%
43
 
5.7%
38
 
5.0%
35
 
4.6%
Other values (51) 173
22.8%

Interactions

2024-01-10T04:53:51.851727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:53:51.729495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:53:51.912516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T04:53:51.789217image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T04:53:53.941817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명전화번호객실수주차대수주소
업소명1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
객실수1.0001.0001.0000.9330.000
주차대수1.0001.0000.9331.0000.000
주소1.0001.0000.0000.0001.000
2024-01-10T04:53:54.013441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
객실수주차대수
객실수1.0000.316
주차대수0.3161.000

Missing values

2024-01-10T04:53:51.996702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T04:53:52.063320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명전화번호객실수주차대수주소
0S드라이브무인텔041-333-1112950충청남도 예산군 예산읍 벚꽃로 504-11
1ZAZA호텔041-331-91143823충청남도 예산군 예산읍 예산산업단지로 15-19
2가야관광호텔041-337-010148110충청남도 예산군 덕산면 신평1길 14
3그랜드모텔041-334-89342434충청남도 예산군 예산읍 충서로 1344
4궁전파크장(여관)041-337-78931720충청남도 예산군 덕산면 덕산온천로 14
5덕산온천타워호텔041-338-11555450충청남도 예산군 덕산면 온천단지3로 69
6덕산파크(여관)041-337-57861910충청남도 예산군 덕산면 윤봉길로 455
7덕원장여관041-338-33602050충청남도 예산군 덕산면 신평1길 14
8덕화온천장041-338-36752550충청남도 예산군 덕산면 덕산온천로 293
9도영펜션041-338-11187<NA>충청남도 예산군 덕산면 덕산향교길 100-3
업소명전화번호객실수주차대수주소
44초원파크(여관)041-338-62003310충청남도 예산군 덕산면 덕산향교길 30-13
45코렉스여관041-334-66621915충청남도 예산군 예산읍 벚꽃로 338
46킴스파크041-331-03012210충청남도 예산군 예산읍 아리랑로 15
47태화장(여관)041-333-12871915충청남도 예산군 삽교읍 삽교로4길 3
48티모텔(여관)041-338-34003535충청남도 예산군 덕산면 온천단지2로 55
49퍼스트모텔(여관)041-338-10773019충청남도 예산군 덕산면 온천단지1로 20-16
50펜션허브041-337-600912<NA>충청남도 예산군 덕산면 대치남길 15-5
51한라장(여관)041-338-36111815충청남도 예산군 덕산면 도청대로 1163
52041-337-307011<NA>충청남도 예산군 덕산면 온천단지1로 101
53힐파크(여관)041-338-36211810충청남도 예산군 덕산면 예덕로 24-24