Overview

Dataset statistics

Number of variables4
Number of observations43
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory37.1 B

Variable types

Numeric2
Text2

Dataset

Description예산군에 있는 민박시설 정보(업소명, 전화번호, 객실수, 주소) 제공예산군에 있는 민박시설 정보 제공을 통해 관광성을 높힘
Author충청남도 예산군
URLhttps://www.data.go.kr/data/15049862/fileData.do

Alerts

연번 has unique valuesUnique
업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:12:23.933494
Analysis finished2023-12-12 21:12:24.803946
Duration0.87 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22
Minimum1
Maximum43
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-13T06:12:24.878433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.1
Q111.5
median22
Q332.5
95-th percentile40.9
Maximum43
Range42
Interquartile range (IQR)21

Descriptive statistics

Standard deviation12.556539
Coefficient of variation (CV)0.57075176
Kurtosis-1.2
Mean22
Median Absolute Deviation (MAD)11
Skewness0
Sum946
Variance157.66667
MonotonicityStrictly increasing
2023-12-13T06:12:25.009247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
1 1
 
2.3%
2 1
 
2.3%
25 1
 
2.3%
26 1
 
2.3%
27 1
 
2.3%
28 1
 
2.3%
29 1
 
2.3%
30 1
 
2.3%
31 1
 
2.3%
32 1
 
2.3%
Other values (33) 33
76.7%
ValueCountFrequency (%)
1 1
2.3%
2 1
2.3%
3 1
2.3%
4 1
2.3%
5 1
2.3%
6 1
2.3%
7 1
2.3%
8 1
2.3%
9 1
2.3%
10 1
2.3%
ValueCountFrequency (%)
43 1
2.3%
42 1
2.3%
41 1
2.3%
40 1
2.3%
39 1
2.3%
38 1
2.3%
37 1
2.3%
36 1
2.3%
35 1
2.3%
34 1
2.3%

업소명
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T06:12:25.227603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length9
Mean length4.8837209
Min length2

Characters and Unicode

Total characters210
Distinct characters113
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row고즈, Knock
2nd row파파홈예산
3rd row하이엘라
4th row송지펜션
5th row예당하늘빛펜션
ValueCountFrequency (%)
고즈 1
 
2.2%
예당공원 1
 
2.2%
뉴캐슬펜션 1
 
2.2%
피플앤도그힐링 1
 
2.2%
덕산힐링하우스 1
 
2.2%
정원동 1
 
2.2%
아뜰리에 1
 
2.2%
7 1
 
2.2%
빛트인(between 1
 
2.2%
채운산방 1
 
2.2%
Other values (36) 36
78.3%
2023-12-13T06:12:25.612778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8
 
3.8%
8
 
3.8%
7
 
3.3%
6
 
2.9%
5
 
2.4%
5
 
2.4%
5
 
2.4%
4
 
1.9%
4
 
1.9%
4
 
1.9%
Other values (103) 154
73.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 189
90.0%
Uppercase Letter 10
 
4.8%
Lowercase Letter 4
 
1.9%
Space Separator 3
 
1.4%
Decimal Number 1
 
0.5%
Open Punctuation 1
 
0.5%
Close Punctuation 1
 
0.5%
Other Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
4.2%
8
 
4.2%
7
 
3.7%
6
 
3.2%
5
 
2.6%
5
 
2.6%
5
 
2.6%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (86) 133
70.4%
Uppercase Letter
ValueCountFrequency (%)
E 3
30.0%
W 1
 
10.0%
T 1
 
10.0%
B 1
 
10.0%
N 1
 
10.0%
K 1
 
10.0%
M 1
 
10.0%
G 1
 
10.0%
Lowercase Letter
ValueCountFrequency (%)
n 1
25.0%
o 1
25.0%
c 1
25.0%
k 1
25.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Decimal Number
ValueCountFrequency (%)
7 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 189
90.0%
Latin 14
 
6.7%
Common 7
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
4.2%
8
 
4.2%
7
 
3.7%
6
 
3.2%
5
 
2.6%
5
 
2.6%
5
 
2.6%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (86) 133
70.4%
Latin
ValueCountFrequency (%)
E 3
21.4%
W 1
 
7.1%
T 1
 
7.1%
n 1
 
7.1%
B 1
 
7.1%
N 1
 
7.1%
o 1
 
7.1%
c 1
 
7.1%
k 1
 
7.1%
K 1
 
7.1%
Other values (2) 2
14.3%
Common
ValueCountFrequency (%)
3
42.9%
7 1
 
14.3%
( 1
 
14.3%
) 1
 
14.3%
, 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 189
90.0%
ASCII 21
 
10.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8
 
4.2%
8
 
4.2%
7
 
3.7%
6
 
3.2%
5
 
2.6%
5
 
2.6%
5
 
2.6%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (86) 133
70.4%
ASCII
ValueCountFrequency (%)
E 3
14.3%
3
14.3%
7 1
 
4.8%
W 1
 
4.8%
T 1
 
4.8%
n 1
 
4.8%
B 1
 
4.8%
( 1
 
4.8%
N 1
 
4.8%
o 1
 
4.8%
Other values (7) 7
33.3%

객실수
Real number (ℝ)

Distinct6
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.8604651
Minimum1
Maximum6
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-13T06:12:25.764202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q34
95-th percentile5
Maximum6
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.4071483
Coefficient of variation (CV)0.49192991
Kurtosis-0.75749247
Mean2.8604651
Median Absolute Deviation (MAD)1
Skewness0.42058943
Sum123
Variance1.9800664
MonotonicityNot monotonic
2023-12-13T06:12:25.883971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
3 12
27.9%
2 11
25.6%
1 8
18.6%
5 7
16.3%
4 4
 
9.3%
6 1
 
2.3%
ValueCountFrequency (%)
1 8
18.6%
2 11
25.6%
3 12
27.9%
4 4
 
9.3%
5 7
16.3%
6 1
 
2.3%
ValueCountFrequency (%)
6 1
 
2.3%
5 7
16.3%
4 4
 
9.3%
3 12
27.9%
2 11
25.6%
1 8
18.6%

주소
Text

Distinct40
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T06:12:26.148854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length24
Mean length21.883721
Min length18

Characters and Unicode

Total characters941
Distinct characters74
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)86.0%

Sample

1st row충청남도 예산군 덕산면 둔지미1길 27-4
2nd row충청남도 예산군 응봉면 응봉서로 186-8
3rd row충청남도 예산군 덕산면 남은들로 185-70
4th row충청남도 예산군 대흥면 송지못길 260
5th row충청남도 예산군 응봉면 신리길 3
ValueCountFrequency (%)
충청남도 43
19.8%
예산군 43
19.8%
덕산면 23
 
10.6%
응봉면 10
 
4.6%
대치남길 6
 
2.8%
대흥면 6
 
2.8%
예당관광로 6
 
2.8%
대치6길 4
 
1.8%
남은들로 4
 
1.8%
68 2
 
0.9%
Other values (60) 70
32.3%
2023-12-13T06:12:26.563874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
174
18.5%
71
 
7.5%
53
 
5.6%
53
 
5.6%
43
 
4.6%
43
 
4.6%
43
 
4.6%
43
 
4.6%
40
 
4.3%
28
 
3.0%
Other values (64) 350
37.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 607
64.5%
Space Separator 174
 
18.5%
Decimal Number 140
 
14.9%
Dash Punctuation 18
 
1.9%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
71
11.7%
53
 
8.7%
53
 
8.7%
43
 
7.1%
43
 
7.1%
43
 
7.1%
43
 
7.1%
40
 
6.6%
28
 
4.6%
22
 
3.6%
Other values (51) 168
27.7%
Decimal Number
ValueCountFrequency (%)
2 23
16.4%
1 19
13.6%
5 18
12.9%
6 18
12.9%
8 15
10.7%
3 14
10.0%
4 13
9.3%
7 8
 
5.7%
0 6
 
4.3%
9 6
 
4.3%
Space Separator
ValueCountFrequency (%)
174
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 607
64.5%
Common 334
35.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
71
11.7%
53
 
8.7%
53
 
8.7%
43
 
7.1%
43
 
7.1%
43
 
7.1%
43
 
7.1%
40
 
6.6%
28
 
4.6%
22
 
3.6%
Other values (51) 168
27.7%
Common
ValueCountFrequency (%)
174
52.1%
2 23
 
6.9%
1 19
 
5.7%
5 18
 
5.4%
- 18
 
5.4%
6 18
 
5.4%
8 15
 
4.5%
3 14
 
4.2%
4 13
 
3.9%
7 8
 
2.4%
Other values (3) 14
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 607
64.5%
ASCII 334
35.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
174
52.1%
2 23
 
6.9%
1 19
 
5.7%
5 18
 
5.4%
- 18
 
5.4%
6 18
 
5.4%
8 15
 
4.5%
3 14
 
4.2%
4 13
 
3.9%
7 8
 
2.4%
Other values (3) 14
 
4.2%
Hangul
ValueCountFrequency (%)
71
11.7%
53
 
8.7%
53
 
8.7%
43
 
7.1%
43
 
7.1%
43
 
7.1%
43
 
7.1%
40
 
6.6%
28
 
4.6%
22
 
3.6%
Other values (51) 168
27.7%

Interactions

2023-12-13T06:12:24.543536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:12:24.096290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:12:24.605187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T06:12:24.160085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:12:26.687665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업소명객실수주소
연번1.0001.0000.1820.809
업소명1.0001.0001.0001.000
객실수0.1821.0001.0000.957
주소0.8091.0000.9571.000
2023-12-13T06:12:26.783718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번객실수
연번1.0000.127
객실수0.1271.000

Missing values

2023-12-13T06:12:24.703309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:12:24.774525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명객실수주소
01고즈, Knock2충청남도 예산군 덕산면 둔지미1길 27-4
12파파홈예산1충청남도 예산군 응봉면 응봉서로 186-8
23하이엘라5충청남도 예산군 덕산면 남은들로 185-70
34송지펜션1충청남도 예산군 대흥면 송지못길 260
45예당하늘빛펜션2충청남도 예산군 응봉면 신리길 3
56노블레스펜션5충청남도 예산군 덕산면 남은들로 68
67황토흙벽돌집친환경민박1충청남도 예산군 예산읍 관작중앙길 51-31
78예솔정가2충청남도 예산군 덕산면 대치남길 5-38
89자연이야기3충청남도 예산군 대흥면 대률송림길 135-6
910덕산마루3충청남도 예산군 덕산면 덕산온천로 23-32, 3동
연번업소명객실수주소
3334뉴캐슬펜션4충청남도 예산군 덕산면 남은들로 185-68
3435용고랑4충청남도 예산군 응봉면 예당관광로 244
3536하늘정원1충청남도 예산군 덕산면 대동2길 49
3637숲속의정원4충청남도 예산군 덕산면 대치6길 46
3738숲속1충청남도 예산군 덕산면 대치6길 45, 펜션
3839하늘채펜션2충청남도 예산군 덕산면 대치5길 25
3940가야산노블레스5충청남도 예산군 덕산면 남은들로 68
4041양천3충청남도 예산군 응봉면 예당관광로 244
4142돌고래4충청남도 예산군 응봉면 예당관광로 180
4243예촌사랑2충청남도 예산군 응봉면 예당관광로 205