Overview

Dataset statistics

Number of variables: 4
Number of observations: 124
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.8%
Total size in memory: 4.1 KiB
Average record size in memory: 34.1 B
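The percentage figures above follow directly from the counts. A minimal sketch of that arithmetic, assuming the total cell count is variables × observations; the helper name `pct` is hypothetical and not part of ydata-profiling:

```python
# Counts copied from the dataset statistics above.
n_variables = 4
n_observations = 124
missing_cells = 0
duplicate_rows = 1


def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


total_cells = n_variables * n_observations  # every row contributes one cell per variable
missing_pct = pct(missing_cells, total_cells)
duplicate_pct = pct(duplicate_rows, n_observations)

print(f"Missing cells (%): {missing_pct}%")
print(f"Duplicate rows (%): {duplicate_pct}%")
```

Both results round to the values the report shows (0.0% and 0.8%).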

Variable types

Text: 3
Numeric: 1

Dataset

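Description: Provides information on hotels and inns in Yesan-gun (business name, phone number, number of rooms, address). Publishing this lodging information is intended to make daily life more convenient for Yesan-gun residents.
Author: Yesan-gun, Chungcheongnam-do (충청남도 예산군)
URL: https://www.data.go.kr/data/15049861/fileData.do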

Alerts

Dataset has 1 (0.8%) duplicate row (alert type: Duplicates)

Reproduction

Analysis started: 2023-12-12 23:55:50.192673
Analysis finished: 2023-12-12 23:55:50.559137
Duration: 0.37 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
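The reported duration is the difference between the two timestamps above. A short sketch that recomputes it with the standard library; the format string is an assumption based on how the timestamps are printed here:

```python
from datetime import datetime

# Format inferred from the timestamps printed in this report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-12 23:55:50.192673", FMT)
finished = datetime.strptime("2023-12-12 23:55:50.559137", FMT)

# timedelta.total_seconds() keeps the microsecond precision.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```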

Variables

업소명 (business name)
Text

Distinct: 123
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 17
Median length: 15
Mean length: 5.1370968
Min length: 1
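These length statistics are character counts per value, so the mean equals total characters divided by observations (637 / 124 ≈ 5.137). With 124 values and a minimum of 1, a median of 15 would force the mean above 8, so the reported median is hard to reconcile with the mean, and recomputing from the raw column is a reasonable cross-check. A minimal sketch, using only the five sample values shown in this report's Sample subsection as an illustrative subset:

```python
from statistics import mean, median

# First five 업소명 values from the report's Sample subsection (illustrative subset only).
names = ["인천", "제일여관", "로또", "한양여관", "청수장"]

# len() counts Unicode code points, so each Hangul syllable counts as one character.
lengths = [len(name) for name in names]

summary = {
    "max_length": max(lengths),
    "median_length": median(lengths),
    "mean_length": mean(lengths),
    "min_length": min(lengths),
}
print(summary)

# Whole-column check using the totals printed in the report.
print(round(637 / 124, 7))
```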

Characters and Unicode

Total characters: 637
Distinct characters: 188
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
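As a rough illustration of how such category counts can be derived, the sketch below uses Python's `unicodedata.category`, which returns the two-letter general-category code for a character. The mapping to long names is hand-written here to match the labels used in this report, and the function name `category_counts` is hypothetical:

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that occur in this example.
# The wording follows the labels used in the report's tables.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "Sm": "Math Symbol",
}


def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category."""
    codes = (unicodedata.category(ch) for ch in text)
    # Fall back to the raw two-letter code for categories not listed above.
    return Counter(CATEGORY_NAMES.get(code, code) for code in codes)


# A business name taken from the Sample table in this report.
counts = category_counts("예당관광농원 디에이치(글램핑)")
print(counts)
```

Hangul syllables fall under "Other Letter" because the script has no upper or lower case.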

Unique

Unique: 122
Unique (%): 98.4%

Sample

1st row: 인천
2nd row: 제일여관
3rd row: 로또
4th row: 한양여관
5th row: 청수장
Value | Count | Frequency (%)
예당관광농원 | 4 | 3.1%
디에이치(글램핑 | 2 | 1.6%
신라호텔 | 1 | 0.8%
gogo무인텔(a동 | 1 | 0.8%
코코무인텔 | 1 | 0.8%
jj무인텔 | 1 | 0.8%
s드라이브인무인텔 | 1 | 0.8%
zaza호텔 | 1 | 0.8%
현미장 | 1 | 0.8%
에이투호텔디자이너스주식회사 | 1 | 0.8%
Other values (114) | 114 | 89.1%
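This table counts whitespace-separated tokens rather than whole values, which is why 예당관광농원 appears four times even though almost every business name is distinct. The displayed tokens also look lowercased with trailing parentheses removed (for example "디에이치(글램핑"). The sketch below approximates that behaviour; the `tokenize` rules are an assumption inferred from the table, not the tool's documented tokenizer:

```python
from collections import Counter

# Four 업소명 values from the report's Sample table (rows 120-123).
names = [
    "예당관광농원 디에이치(풀빌라)",
    "예당관광농원 디에이치(단독펜션)",
    "예당관광농원 디에이치(글램핑)",
    "예당관광농원 디에이치(글램핑)",
]


def tokenize(value: str) -> list[str]:
    """Split on whitespace, lowercase, and strip parentheses at token edges (assumed rules)."""
    return [token.lower().strip("()") for token in value.split()]


word_counts = Counter(token for name in names for token in tokenize(name))
for word, count in word_counts.most_common():
    print(word, count)
```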

Most occurring characters

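Value | Count | Frequency (%)
(not preserved) | 48 | 7.5%
(not preserved) | 23 | 3.6%
(not preserved) | 22 | 3.5%
(not preserved) | 21 | 3.3%
(not preserved) | 20 | 3.1%
(not preserved) | 19 | 3.0%
(not preserved) | 17 | 2.7%
(not preserved) | 17 | 2.7%
(not preserved) | 14 | 2.2%
(not preserved) | 14 | 2.2%
Other values (178) | 422 | 66.2%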

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 572 | 89.8%
Uppercase Letter | 30 | 4.7%
Close Punctuation | 13 | 2.0%
Open Punctuation | 13 | 2.0%
Space Separator | 5 | 0.8%
Lowercase Letter | 3 | 0.5%
Math Symbol | 1 | 0.2%

Most frequent character per category

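Other Letter
Value | Count | Frequency (%)
(not preserved) | 48 | 8.4%
(not preserved) | 23 | 4.0%
(not preserved) | 22 | 3.8%
(not preserved) | 21 | 3.7%
(not preserved) | 20 | 3.5%
(not preserved) | 19 | 3.3%
(not preserved) | 17 | 3.0%
(not preserved) | 17 | 3.0%
(not preserved) | 14 | 2.4%
(not preserved) | 14 | 2.4%
Other values (156) | 357 | 62.4%

Uppercase Letter
Value | Count | Frequency (%)
G | 6 | 20.0%
O | 5 | 16.7%
A | 4 | 13.3%
B | 2 | 6.7%
Z | 2 | 6.7%
J | 2 | 6.7%
K | 1 | 3.3%
S | 1 | 3.3%
C | 1 | 3.3%
M | 1 | 3.3%
Other values (5) | 5 | 16.7%

Lowercase Letter
Value | Count | Frequency (%)
a | 1 | 33.3%
p | 1 | 33.3%
s | 1 | 33.3%

Close Punctuation
Value | Count | Frequency (%)
) | 13 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 13 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 5 | 100.0%

Math Symbol
Value | Count | Frequency (%)
+ | 1 | 100.0%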

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 572 | 89.8%
Latin | 33 | 5.2%
Common | 32 | 5.0%

Most frequent character per script

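Hangul
Value | Count | Frequency (%)
(not preserved) | 48 | 8.4%
(not preserved) | 23 | 4.0%
(not preserved) | 22 | 3.8%
(not preserved) | 21 | 3.7%
(not preserved) | 20 | 3.5%
(not preserved) | 19 | 3.3%
(not preserved) | 17 | 3.0%
(not preserved) | 17 | 3.0%
(not preserved) | 14 | 2.4%
(not preserved) | 14 | 2.4%
Other values (156) | 357 | 62.4%

Latin
Value | Count | Frequency (%)
G | 6 | 18.2%
O | 5 | 15.2%
A | 4 | 12.1%
B | 2 | 6.1%
Z | 2 | 6.1%
J | 2 | 6.1%
K | 1 | 3.0%
S | 1 | 3.0%
C | 1 | 3.0%
a | 1 | 3.0%
Other values (8) | 8 | 24.2%

Common
Value | Count | Frequency (%)
) | 13 | 40.6%
( | 13 | 40.6%
(space) | 5 | 15.6%
+ | 1 | 3.1%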

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 572 | 89.8%
ASCII | 65 | 10.2%
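A Unicode block is a contiguous range of code points, so block membership can be checked with simple range comparisons. The sketch below covers only the two blocks listed in this table. It assumes the report's "Hangul" label corresponds to the Hangul Syllables block (U+AC00 to U+D7AF) and "ASCII" to Basic Latin (U+0000 to U+007F); the function name `block_of` is hypothetical:

```python
from collections import Counter


def block_of(ch: str) -> str:
    """Classify a single character into one of the two blocks seen in this report."""
    code_point = ord(ch)
    if code_point <= 0x7F:            # Basic Latin, labelled "ASCII" in the report
        return "ASCII"
    if 0xAC00 <= code_point <= 0xD7AF:  # Hangul Syllables (assumed meaning of "Hangul")
        return "Hangul"
    return "Other"


# A business name taken from the Sample table in this report.
block_counts = Counter(block_of(ch) for ch in "예당관광농원 디에이치(글램핑)")
print(block_counts)
```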

Most frequent character per block

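Hangul
Value | Count | Frequency (%)
(not preserved) | 48 | 8.4%
(not preserved) | 23 | 4.0%
(not preserved) | 22 | 3.8%
(not preserved) | 21 | 3.7%
(not preserved) | 20 | 3.5%
(not preserved) | 19 | 3.3%
(not preserved) | 17 | 3.0%
(not preserved) | 17 | 3.0%
(not preserved) | 14 | 2.4%
(not preserved) | 14 | 2.4%
Other values (156) | 357 | 62.4%

ASCII
Value | Count | Frequency (%)
) | 13 | 20.0%
( | 13 | 20.0%
G | 6 | 9.2%
(space) | 5 | 7.7%
O | 5 | 7.7%
A | 4 | 6.2%
B | 2 | 3.1%
Z | 2 | 3.1%
J | 2 | 3.1%
+ | 1 | 1.5%
Other values (12) | 12 | 18.5%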

소재지전화 (business phone number)
Text

Distinct: 110
Distinct (%): 88.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.604839
Min length: 7

Characters and Unicode

Total characters: 1439
Distinct characters: 18
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 106
Unique (%): 85.5%

Sample

1st row: 041-334-5077
2nd row: 041-332-2513
3rd row: 041-335-8896
4th row: 041-334-3262
5th row: 041-334-6696
Value | Count | Frequency (%)
데이터 | 10 | 7.5%
미집계 | 10 | 7.5%
041-331-0901 | 4 | 3.0%
041-338-0067 | 2 | 1.5%
041-331-4343 | 2 | 1.5%
041-337-6748 | 1 | 0.7%
041-337-7131 | 1 | 0.7%
041-337-5553 | 1 | 0.7%
041-338-1155 | 1 | 0.7%
041-334-0209 | 1 | 0.7%
Other values (101) | 101 | 75.4%
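The two most frequent tokens, 데이터 and 미집계, are the halves of the placeholder string "데이터 미집계" ("data not collected") that appears in the Sample table. Because it is a non-empty string, the profiler counts it as present, which is why this variable reports 0 missing values. The sketch below separates well-formed numbers from the placeholder; the regular expression is an assumed pattern based on the values visible in this report, and `classify` is a hypothetical helper:

```python
import re

PLACEHOLDER = "데이터 미집계"  # "data not collected", as it appears in the Sample table
# Assumed shape: 2-3 digit prefix starting with 0, 3-4 digit exchange, 4 digit line number.
PHONE_RE = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")


def classify(value: str) -> str:
    """Label a 소재지전화 value as 'phone', 'placeholder', or 'other'."""
    if value == PLACEHOLDER:
        return "placeholder"
    if PHONE_RE.fullmatch(value):
        return "phone"
    return "other"


# Values copied from the Sample table in this report.
values = ["041-334-5077", "000-2882-2728", "데이터 미집계", "041-331-0901"]
labels = [classify(v) for v in values]
print(list(zip(values, labels)))
```

Treating the placeholder as missing before profiling would make the missing-value counts reflect the actual gaps in the phone column.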

Most occurring characters

Value | Count | Frequency (%)
3 | 295 | 20.5%
- | 228 | 15.8%
0 | 203 | 14.1%
1 | 186 | 12.9%
4 | 164 | 11.4%
8 | 66 | 4.6%
7 | 58 | 4.0%
5 | 48 | 3.3%
6 | 45 | 3.1%
2 | 44 | 3.1%
Other values (8) | 102 | 7.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1141 | 79.3%
Dash Punctuation | 228 | 15.8%
Other Letter | 60 | 4.2%
Space Separator | 10 | 0.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 295 | 25.9%
0 | 203 | 17.8%
1 | 186 | 16.3%
4 | 164 | 14.4%
8 | 66 | 5.8%
7 | 58 | 5.1%
5 | 48 | 4.2%
6 | 45 | 3.9%
2 | 44 | 3.9%
9 | 32 | 2.8%

Other Letter
Value | Count | Frequency (%)
데 | 10 | 16.7%
이 | 10 | 16.7%
터 | 10 | 16.7%
미 | 10 | 16.7%
집 | 10 | 16.7%
계 | 10 | 16.7%

Dash Punctuation
Value | Count | Frequency (%)
- | 228 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 10 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1379 | 95.8%
Hangul | 60 | 4.2%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 295 | 21.4%
- | 228 | 16.5%
0 | 203 | 14.7%
1 | 186 | 13.5%
4 | 164 | 11.9%
8 | 66 | 4.8%
7 | 58 | 4.2%
5 | 48 | 3.5%
6 | 45 | 3.3%
2 | 44 | 3.2%
Other values (2) | 42 | 3.0%

Hangul
Value | Count | Frequency (%)
데 | 10 | 16.7%
이 | 10 | 16.7%
터 | 10 | 16.7%
미 | 10 | 16.7%
집 | 10 | 16.7%
계 | 10 | 16.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1379 | 95.8%
Hangul | 60 | 4.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 295 | 21.4%
- | 228 | 16.5%
0 | 203 | 14.7%
1 | 186 | 13.5%
4 | 164 | 11.9%
8 | 66 | 4.8%
7 | 58 | 4.2%
5 | 48 | 3.5%
6 | 45 | 3.3%
2 | 44 | 3.2%
Other values (2) | 42 | 3.0%

Hangul
Value | Count | Frequency (%)
데 | 10 | 16.7%
이 | 10 | 16.7%
터 | 10 | 16.7%
미 | 10 | 16.7%
집 | 10 | 16.7%
계 | 10 | 16.7%

객실수 (number of rooms)
Real number (ℝ)

Distinct: 39
Distinct (%): 31.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20.991935
Minimum: 1
Maximum: 407
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6
Q1: 9
Median: 17
Q3: 23
95-th percentile: 40.55
Maximum: 407
Range: 406
Interquartile range (IQR): 14
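The range and IQR above are simple differences of the other quantile figures. The sketch below recomputes them and, as an additional illustration that the report itself does not perform, applies Tukey's 1.5 × IQR rule to the largest values listed later in this section to show how far the maximum sits from the bulk of the data:

```python
# Quantile figures copied from the report.
minimum, maximum = 1, 407
q1, q3 = 9, 23

value_range = maximum - minimum
iqr = q3 - q1

# Tukey's rule (an added illustration, not part of the report's output).
upper_fence = q3 + 1.5 * iqr

# The distinct largest 객실수 values listed later in this section.
largest = [407, 97, 52, 48, 44, 42, 41, 38, 36, 35]
flagged = [value for value in largest if value > upper_fence]

print(f"Range: {value_range}, IQR: {iqr}, upper fence: {upper_fence}")
print(f"Values above the fence: {flagged}")
```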

Descriptive statistics

Standard deviation: 37.076826
Coefficient of variation (CV): 1.7662414
Kurtosis: 97.384972
Mean: 20.991935
Median Absolute Deviation (MAD): 7
Skewness: 9.3918917
Sum: 2603
Variance: 1374.691
Monotonicity: Not monotonic
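Several of these figures are tied together by standard definitions: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the mean is the sum divided by the number of observations. A quick consistency check using only the numbers printed in this report:

```python
# Figures copied from the report.
std = 37.076826
mean = 20.991935
total = 2603
n_observations = 124

cv = std / mean                      # coefficient of variation
variance = std ** 2                  # variance as the square of the standard deviation
mean_from_sum = total / n_observations

print(f"CV: {cv:.7f}")
print(f"Variance: {variance:.3f}")
print(f"Mean from sum: {mean_from_sum:.6f}")
```

A CV well above 1 together with a skewness above 9 is consistent with a small number of very large values, here the single 407-room entry, dominating the spread.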
Histogram with fixed size bins (bins=39)
Common values
Value | Count | Frequency (%)
9 | 10 | 8.1%
19 | 9 | 7.3%
10 | 8 | 6.5%
7 | 7 | 5.6%
20 | 7 | 5.6%
6 | 7 | 5.6%
18 | 6 | 4.8%
8 | 6 | 4.8%
22 | 6 | 4.8%
12 | 5 | 4.0%
Other values (29) | 53 | 42.7%

Minimum 10 values
Value | Count | Frequency (%)
1 | 2 | 1.6%
4 | 2 | 1.6%
5 | 2 | 1.6%
6 | 7 | 5.6%
7 | 7 | 5.6%
8 | 6 | 4.8%
9 | 10 | 8.1%
10 | 8 | 6.5%
11 | 3 | 2.4%
12 | 5 | 4.0%

Maximum 10 values
Value | Count | Frequency (%)
407 | 1 | 0.8%
97 | 1 | 0.8%
52 | 1 | 0.8%
48 | 1 | 0.8%
44 | 1 | 0.8%
42 | 1 | 0.8%
41 | 1 | 0.8%
38 | 1 | 0.8%
36 | 1 | 0.8%
35 | 2 | 1.6%

영업소 주소(도로명) (business address, road name)
Text

Distinct: 119
Distinct (%): 96.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 35
Median length: 30
Mean length: 22.645161
Min length: 18

Characters and Unicode

Total characters: 2808
Distinct characters: 107
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 114
Unique (%): 91.9%

Sample

1st row: 충청남도 예산군 예산읍 신례원로212번길 116
2nd row: 충청남도 예산군 예산읍 주교로 65-20
3rd row: 충청남도 예산군 예산읍 아리랑로11번길 8
4th row: 충청남도 예산군 예산읍 아리랑로11번길 9-1
5th row: 충청남도 예산군 예산읍 창말로 4-8
Value | Count | Frequency (%)
충청남도 | 124 | 19.6%
예산군 | 124 | 19.6%
예산읍 | 53 | 8.4%
덕산면 | 47 | 7.4%
주교로 | 12 | 1.9%
응봉면 | 9 | 1.4%
온천단지1로 | 9 | 1.4%
아리랑로11번길 | 8 | 1.3%
삽교읍 | 7 | 1.1%
예당로 | 6 | 0.9%
Other values (166) | 234 | 37.0%
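Every address begins with the same province and county tokens (충청남도 and 예산군, 124 occurrences each), so the third token carries the first useful geographic split. A minimal sketch that extracts it, assuming each address follows a "province county district street ..." layout; the helper name `district` is hypothetical and the addresses are copied from this report's sample rows:

```python
from collections import Counter

# Addresses copied from the sample rows shown in this report.
addresses = [
    "충청남도 예산군 예산읍 주교로 65-20",
    "충청남도 예산군 예산읍 창말로 4-8",
    "충청남도 예산군 덕산면 노곡길 59",
    "충청남도 예산군 응봉면 예당로 1133",
]


def district(address: str) -> str:
    """Return the third whitespace-separated token (the eup/myeon), per the assumed layout."""
    return address.split()[2]


district_counts = Counter(district(address) for address in addresses)
print(district_counts.most_common())
```

Applied to the full column, the same grouping would reproduce the 예산읍 (53) and 덕산면 (47) counts shown in the table.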

Most occurring characters

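Value | Count | Frequency (%)
(space) | 509 | 18.1%
(not preserved) | 245 | 8.7%
(not preserved) | 190 | 6.8%
(not preserved) | 130 | 4.6%
1 | 127 | 4.5%
(not preserved) | 127 | 4.5%
(not preserved) | 126 | 4.5%
(not preserved) | 126 | 4.5%
(not preserved) | 124 | 4.4%
(not preserved) | 103 | 3.7%
Other values (97) | 1001 | 35.6%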

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1779 | 63.4%
Space Separator | 509 | 18.1%
Decimal Number | 444 | 15.8%
Dash Punctuation | 56 | 2.0%
Other Punctuation | 12 | 0.4%
Open Punctuation | 3 | 0.1%
Close Punctuation | 3 | 0.1%
Math Symbol | 1 | < 0.1%
Uppercase Letter | 1 | < 0.1%

Most frequent character per category

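Other Letter
Value | Count | Frequency (%)
(not preserved) | 245 | 13.8%
(not preserved) | 190 | 10.7%
(not preserved) | 130 | 7.3%
(not preserved) | 127 | 7.1%
(not preserved) | 126 | 7.1%
(not preserved) | 126 | 7.1%
(not preserved) | 124 | 7.0%
(not preserved) | 103 | 5.8%
(not preserved) | 64 | 3.6%
(not preserved) | 61 | 3.4%
Other values (80) | 483 | 27.2%

Decimal Number
Value | Count | Frequency (%)
1 | 127 | 28.6%
2 | 53 | 11.9%
3 | 45 | 10.1%
5 | 41 | 9.2%
4 | 38 | 8.6%
0 | 38 | 8.6%
6 | 35 | 7.9%
9 | 23 | 5.2%
7 | 22 | 5.0%
8 | 22 | 5.0%

Space Separator
Value | Count | Frequency (%)
(space) | 509 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 56 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 12 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 1 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
B | 1 | 100.0%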

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1779 | 63.4%
Common | 1028 | 36.6%
Latin | 1 | < 0.1%

Most frequent character per script

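Hangul
Value | Count | Frequency (%)
(not preserved) | 245 | 13.8%
(not preserved) | 190 | 10.7%
(not preserved) | 130 | 7.3%
(not preserved) | 127 | 7.1%
(not preserved) | 126 | 7.1%
(not preserved) | 126 | 7.1%
(not preserved) | 124 | 7.0%
(not preserved) | 103 | 5.8%
(not preserved) | 64 | 3.6%
(not preserved) | 61 | 3.4%
Other values (80) | 483 | 27.2%

Common
Value | Count | Frequency (%)
(space) | 509 | 49.5%
1 | 127 | 12.4%
- | 56 | 5.4%
2 | 53 | 5.2%
3 | 45 | 4.4%
5 | 41 | 4.0%
4 | 38 | 3.7%
0 | 38 | 3.7%
6 | 35 | 3.4%
9 | 23 | 2.2%
Other values (6) | 63 | 6.1%

Latin
Value | Count | Frequency (%)
B | 1 | 100.0%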

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1779 | 63.4%
ASCII | 1029 | 36.6%

Most frequent character per block

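ASCII
Value | Count | Frequency (%)
(space) | 509 | 49.5%
1 | 127 | 12.3%
- | 56 | 5.4%
2 | 53 | 5.2%
3 | 45 | 4.4%
5 | 41 | 4.0%
4 | 38 | 3.7%
0 | 38 | 3.7%
6 | 35 | 3.4%
9 | 23 | 2.2%
Other values (7) | 64 | 6.2%

Hangul
Value | Count | Frequency (%)
(not preserved) | 245 | 13.8%
(not preserved) | 190 | 10.7%
(not preserved) | 130 | 7.3%
(not preserved) | 127 | 7.1%
(not preserved) | 126 | 7.1%
(not preserved) | 126 | 7.1%
(not preserved) | 124 | 7.0%
(not preserved) | 103 | 5.8%
(not preserved) | 64 | 3.6%
(not preserved) | 61 | 3.4%
Other values (80) | 483 | 27.2%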

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 업소명 | 소재지전화 | 객실수 | 영업소 주소(도로명)
0 | 인천 | 041-334-5077 | 4 | 충청남도 예산군 예산읍 신례원로212번길 116
1 | 제일여관 | 041-332-2513 | 10 | 충청남도 예산군 예산읍 주교로 65-20
2 | 로또 | 041-335-8896 | 9 | 충청남도 예산군 예산읍 아리랑로11번길 8
3 | 한양여관 | 041-334-3262 | 6 | 충청남도 예산군 예산읍 아리랑로11번길 9-1
4 | 청수장 | 041-334-6696 | 12 | 충청남도 예산군 예산읍 창말로 4-8
5 | 일흥장 | 041-333-8000 | 15 | 충청남도 예산군 예산읍 아리랑로11번길 5
6 | 한양장모텔 | 041-334-6353 | 15 | 충청남도 예산군 예산읍 신례원로 219
7 | 대호장 | 041-334-5115 | 16 | 충청남도 예산군 예산읍 창말로 12-1
8 | 영신장 | 041-332-2584 | 17 | 충청남도 예산군 예산읍 아리랑로 11
9 | 예일장 | 041-335-2604 | 19 | 충청남도 예산군 예산읍 임성로7번길 5

Last rows
Index | 업소명 | 소재지전화 | 객실수 | 영업소 주소(도로명)
114 | 형제펜션 | 041-338-1118 | 10 | 충청남도 예산군 덕산면 덕산향교길 108-8
115 | 스파뷰호텔 | 041-337-1000 | 97 | 충청남도 예산군 덕산면 온천단지2로 77
116 | 덕산참숯랜드 | 041-337-6392 | 12 | 충청남도 예산군 덕산면 노곡길 59
117 | 온연프라이빗빌라 | 000-2882-2728 | 6 | 충청남도 예산군 덕산면 온천단지1로 69-5
118 | 해월펜션 | 데이터 미집계 | 6 | 충청남도 예산군 응봉면 예당관광로 61, 3층
119 | 티나의정원 | 데이터 미집계 | 1 | 충청남도 예산군 광시면 서초정2길 86-16, 숙박시설 2동
120 | 예당관광농원 디에이치(풀빌라) | 041-331-0901 | 6 | 충청남도 예산군 응봉면 예당로 1133, 2~4동
121 | 예당관광농원 디에이치(단독펜션) | 041-331-0901 | 1 | 충청남도 예산군 응봉면 예당로 1133, 28호
122 | 예당관광농원 디에이치(글램핑) | 041-331-0901 | 10 | 충청남도 예산군 응봉면 예당로 1133
123 | 예당관광농원 디에이치(글램핑) | 041-331-0901 | 10 | 충청남도 예산군 응봉면 예당로 1133

Duplicate rows

Most frequently occurring

Index | 업소명 | 소재지전화 | 객실수 | 영업소 주소(도로명) | # duplicates
0 | 예당관광농원 디에이치(글램핑) | 041-331-0901 | 10 | 충청남도 예산군 응봉면 예당로 1133 | 2
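A row counts as a duplicate only when all four fields match exactly. The sketch below reproduces that check on the last four rows of the Sample table using a `Counter` over row tuples; it illustrates the idea and is not necessarily how ydata-profiling computes it:

```python
from collections import Counter

# Rows 120-123 from the Sample table, as (업소명, 소재지전화, 객실수, 영업소 주소(도로명)) tuples.
rows = [
    ("예당관광농원 디에이치(풀빌라)", "041-331-0901", 6, "충청남도 예산군 응봉면 예당로 1133, 2~4동"),
    ("예당관광농원 디에이치(단독펜션)", "041-331-0901", 1, "충청남도 예산군 응봉면 예당로 1133, 28호"),
    ("예당관광농원 디에이치(글램핑)", "041-331-0901", 10, "충청남도 예산군 응봉면 예당로 1133"),
    ("예당관광농원 디에이치(글램핑)", "041-331-0901", 10, "충청남도 예산군 응봉면 예당로 1133"),
]

# Count identical rows; anything seen more than once is a duplicate group.
occurrences = Counter(rows)
duplicates = {row: count for row, count in occurrences.items() if count > 1}

# dict.fromkeys keeps the first occurrence of each row and preserves order.
deduplicated = list(dict.fromkeys(rows))

for row, count in duplicates.items():
    print(row[0], count)
print(len(deduplicated))
```

The three 예당관광농원 entries that differ in name, room count, or address suffix are kept; only the repeated 글램핑 row collapses.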