Overview

Dataset statistics

Number of variables6
Number of observations66
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.3 KiB
Average record size in memory51.0 B

Variable types

Text4
Numeric1
Categorical1

Dataset

Description당진시 우수 숙박업소 현황(업소명,대표자,주소,전화번호,객실현황)
Author충청남도 당진시
URLhttps://www.data.go.kr/data/15052876/fileData.do

Alerts

비고 has constant value ""Constant
업소명 has unique valuesUnique
주소 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:34:15.890369
Analysis finished2023-12-12 01:34:16.637831
Duration0.75 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업소명
Text

UNIQUE 

Distinct66
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size660.0 B
2023-12-12T10:34:16.913760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length9
Mean length4.8181818
Min length1

Characters and Unicode

Total characters318
Distinct characters121
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)100.0%

Sample

1st rowa 호텔
2nd rowM 모텔
3rd rowQ 호텔
4th rowS모텔
5th rowS무인텔
ValueCountFrequency (%)
호텔 2
 
2.8%
여관 2
 
2.8%
스타파크장여관 1
 
1.4%
인피니티호텔 1
 
1.4%
유락모텔 1
 
1.4%
위너스호텔 1
 
1.4%
워커힐 1
 
1.4%
용궁장여관 1
 
1.4%
왜목하우스모텔 1
 
1.4%
예다원여관 1
 
1.4%
Other values (60) 60
83.3%
2023-12-12T10:34:17.421450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
 
12.3%
25
 
7.9%
20
 
6.3%
20
 
6.3%
15
 
4.7%
11
 
3.5%
6
 
1.9%
6
 
1.9%
5
 
1.6%
5
 
1.6%
Other values (111) 166
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 296
93.1%
Lowercase Letter 7
 
2.2%
Space Separator 6
 
1.9%
Uppercase Letter 6
 
1.9%
Other Punctuation 1
 
0.3%
Close Punctuation 1
 
0.3%
Open Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
13.2%
25
 
8.4%
20
 
6.8%
20
 
6.8%
15
 
5.1%
11
 
3.7%
6
 
2.0%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (98) 145
49.0%
Lowercase Letter
ValueCountFrequency (%)
e 3
42.9%
a 1
 
14.3%
l 1
 
14.3%
t 1
 
14.3%
o 1
 
14.3%
Uppercase Letter
ValueCountFrequency (%)
S 2
33.3%
M 2
33.3%
Q 1
16.7%
L 1
16.7%
Space Separator
ValueCountFrequency (%)
6
100.0%
Other Punctuation
ValueCountFrequency (%)
? 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 296
93.1%
Latin 13
 
4.1%
Common 9
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
13.2%
25
 
8.4%
20
 
6.8%
20
 
6.8%
15
 
5.1%
11
 
3.7%
6
 
2.0%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (98) 145
49.0%
Latin
ValueCountFrequency (%)
e 3
23.1%
S 2
15.4%
M 2
15.4%
a 1
 
7.7%
Q 1
 
7.7%
l 1
 
7.7%
t 1
 
7.7%
o 1
 
7.7%
L 1
 
7.7%
Common
ValueCountFrequency (%)
6
66.7%
? 1
 
11.1%
) 1
 
11.1%
( 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 296
93.1%
ASCII 22
 
6.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
39
 
13.2%
25
 
8.4%
20
 
6.8%
20
 
6.8%
15
 
5.1%
11
 
3.7%
6
 
2.0%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (98) 145
49.0%
ASCII
ValueCountFrequency (%)
6
27.3%
e 3
13.6%
S 2
 
9.1%
M 2
 
9.1%
a 1
 
4.5%
Q 1
 
4.5%
? 1
 
4.5%
) 1
 
4.5%
l 1
 
4.5%
t 1
 
4.5%
Other values (3) 3
13.6%
Distinct64
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size660.0 B
2023-12-12T10:34:17.743399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.9848485
Min length2

Characters and Unicode

Total characters197
Distinct characters87
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique62 ?
Unique (%)93.9%

Sample

1st row이여리
2nd row정덕교
3rd row안계자
4th row임흥영
5th row유순례
ValueCountFrequency (%)
이여리 2
 
3.0%
김성민 2
 
3.0%
김영선 1
 
1.5%
김남회 1
 
1.5%
김연근 1
 
1.5%
박석근 1
 
1.5%
윤광수 1
 
1.5%
심현숙 1
 
1.5%
전홍주 1
 
1.5%
박동숙 1
 
1.5%
Other values (54) 54
81.8%
2023-12-12T10:34:18.198533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
 
9.1%
12
 
6.1%
8
 
4.1%
7
 
3.6%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
Other values (77) 123
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 197
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
9.1%
12
 
6.1%
8
 
4.1%
7
 
3.6%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
Other values (77) 123
62.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 197
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
9.1%
12
 
6.1%
8
 
4.1%
7
 
3.6%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
Other values (77) 123
62.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 197
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
18
 
9.1%
12
 
6.1%
8
 
4.1%
7
 
3.6%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
Other values (77) 123
62.4%

주소
Text

UNIQUE 

Distinct66
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size660.0 B
2023-12-12T10:34:18.511238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length25
Mean length22.5
Min length19

Characters and Unicode

Total characters1485
Distinct characters87
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)100.0%

Sample

1st row충청남도 당진시 송악읍 부곡공단로 68-16
2nd row충청남도 당진시 송악읍 부곡공단로 68-33
3rd row충청남도 당진시 송악읍 북부산업로 835-25
4th row충청남도 당진시 송악읍 북부산업로 714-2
5th row충청남도 당진시 시청2로 47 (수청동)
ValueCountFrequency (%)
충청남도 66
20.2%
당진시 66
20.2%
송악읍 27
 
8.3%
읍내동 11
 
3.4%
당진중앙2로 9
 
2.8%
반촌로 8
 
2.4%
신평면 7
 
2.1%
북부산업로 5
 
1.5%
석문면 5
 
1.5%
부곡공단로 5
 
1.5%
Other values (93) 118
36.1%
2023-12-12T10:34:18.957270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
262
17.6%
82
 
5.5%
79
 
5.3%
72
 
4.8%
69
 
4.6%
66
 
4.4%
66
 
4.4%
66
 
4.4%
1 53
 
3.6%
44
 
3.0%
Other values (77) 626
42.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 921
62.0%
Space Separator 262
 
17.6%
Decimal Number 237
 
16.0%
Dash Punctuation 32
 
2.2%
Close Punctuation 16
 
1.1%
Open Punctuation 16
 
1.1%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
82
 
8.9%
79
 
8.6%
72
 
7.8%
69
 
7.5%
66
 
7.2%
66
 
7.2%
66
 
7.2%
44
 
4.8%
42
 
4.6%
30
 
3.3%
Other values (62) 305
33.1%
Decimal Number
ValueCountFrequency (%)
1 53
22.4%
2 38
16.0%
3 34
14.3%
6 27
11.4%
5 21
 
8.9%
7 20
 
8.4%
8 16
 
6.8%
4 13
 
5.5%
0 8
 
3.4%
9 7
 
3.0%
Space Separator
ValueCountFrequency (%)
262
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 921
62.0%
Common 564
38.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
82
 
8.9%
79
 
8.6%
72
 
7.8%
69
 
7.5%
66
 
7.2%
66
 
7.2%
66
 
7.2%
44
 
4.8%
42
 
4.6%
30
 
3.3%
Other values (62) 305
33.1%
Common
ValueCountFrequency (%)
262
46.5%
1 53
 
9.4%
2 38
 
6.7%
3 34
 
6.0%
- 32
 
5.7%
6 27
 
4.8%
5 21
 
3.7%
7 20
 
3.5%
8 16
 
2.8%
) 16
 
2.8%
Other values (5) 45
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 921
62.0%
ASCII 564
38.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
262
46.5%
1 53
 
9.4%
2 38
 
6.7%
3 34
 
6.0%
- 32
 
5.7%
6 27
 
4.8%
5 21
 
3.7%
7 20
 
3.5%
8 16
 
2.8%
) 16
 
2.8%
Other values (5) 45
 
8.0%
Hangul
ValueCountFrequency (%)
82
 
8.9%
79
 
8.6%
72
 
7.8%
69
 
7.5%
66
 
7.2%
66
 
7.2%
66
 
7.2%
44
 
4.8%
42
 
4.6%
30
 
3.3%
Other values (62) 305
33.1%

전화번호
Text

UNIQUE 

Distinct66
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size660.0 B
2023-12-12T10:34:19.291132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.015152
Min length12

Characters and Unicode

Total characters793
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)100.0%

Sample

1st row041-358-7100
2nd row041-357-3766
3rd row041-358-7707
4th row041-357-5211
5th row041-357-0073
ValueCountFrequency (%)
041-358-7100 1
 
1.5%
041-357-0002 1
 
1.5%
041-357-4006 1
 
1.5%
041-353-3790 1
 
1.5%
041-352-2177 1
 
1.5%
041-357-1091 1
 
1.5%
041-363-2897 1
 
1.5%
041-363-5073 1
 
1.5%
041-355-7731 1
 
1.5%
041-354-2911 1
 
1.5%
Other values (56) 56
84.8%
2023-12-12T10:34:19.746486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 132
16.6%
0 112
14.1%
3 111
14.0%
1 93
11.7%
4 84
10.6%
5 76
9.6%
7 63
7.9%
2 39
 
4.9%
6 37
 
4.7%
8 27
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 661
83.4%
Dash Punctuation 132
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 112
16.9%
3 111
16.8%
1 93
14.1%
4 84
12.7%
5 76
11.5%
7 63
9.5%
2 39
 
5.9%
6 37
 
5.6%
8 27
 
4.1%
9 19
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 132
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 793
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 132
16.6%
0 112
14.1%
3 111
14.0%
1 93
11.7%
4 84
10.6%
5 76
9.6%
7 63
7.9%
2 39
 
4.9%
6 37
 
4.7%
8 27
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 793
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 132
16.6%
0 112
14.1%
3 111
14.0%
1 93
11.7%
4 84
10.6%
5 76
9.6%
7 63
7.9%
2 39
 
4.9%
6 37
 
4.7%
8 27
 
3.4%

객실현황
Real number (ℝ)

Distinct32
Distinct (%)48.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean27.318182
Minimum8
Maximum66
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size726.0 B
2023-12-12T10:34:19.922520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum8
5-th percentile12.25
Q118.25
median27
Q332.75
95-th percentile45.75
Maximum66
Range58
Interquartile range (IQR)14.5

Descriptive statistics

Standard deviation11.02467
Coefficient of variation (CV)0.4035653
Kurtosis1.1118587
Mean27.318182
Median Absolute Deviation (MAD)8
Skewness0.79119591
Sum1803
Variance121.54336
MonotonicityNot monotonic
2023-12-12T10:34:20.094417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
18 6
 
9.1%
30 4
 
6.1%
19 4
 
6.1%
31 3
 
4.5%
27 3
 
4.5%
32 3
 
4.5%
17 3
 
4.5%
28 3
 
4.5%
24 3
 
4.5%
12 2
 
3.0%
Other values (22) 32
48.5%
ValueCountFrequency (%)
8 2
 
3.0%
12 2
 
3.0%
13 1
 
1.5%
14 1
 
1.5%
15 1
 
1.5%
16 1
 
1.5%
17 3
4.5%
18 6
9.1%
19 4
6.1%
20 1
 
1.5%
ValueCountFrequency (%)
66 1
1.5%
48 2
3.0%
46 1
1.5%
45 2
3.0%
43 1
1.5%
42 1
1.5%
40 2
3.0%
39 1
1.5%
36 2
3.0%
34 2
3.0%

비고
Categorical

CONSTANT 

Distinct1
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size660.0 B
A등급
66 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA등급
2nd rowA등급
3rd rowA등급
4th rowA등급
5th rowA등급

Common Values

ValueCountFrequency (%)
A등급 66
100.0%

Length

2023-12-12T10:34:20.232430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:34:20.334858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
a등급 66
100.0%

Interactions

2023-12-12T10:34:16.264806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:34:20.415320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명대표자주소전화번호객실현황
업소명1.0001.0001.0001.0001.000
대표자1.0001.0001.0001.0000.908
주소1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
객실현황1.0000.9081.0001.0001.000

Missing values

2023-12-12T10:34:16.430004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:34:16.593703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명대표자주소전화번호객실현황비고
0a 호텔이여리충청남도 당진시 송악읍 부곡공단로 68-16041-358-710031A등급
1M 모텔정덕교충청남도 당진시 송악읍 부곡공단로 68-33041-357-376627A등급
2Q 호텔안계자충청남도 당진시 송악읍 북부산업로 835-25041-358-770766A등급
3S모텔임흥영충청남도 당진시 송악읍 북부산업로 714-2041-357-521132A등급
4S무인텔유순례충청남도 당진시 시청2로 47 (수청동)041-357-007346A등급
5국일장여관송흥섭충청남도 당진시 서부로 15 (채운동)041-354-088118A등급
6궁전여관고기택충청남도 당진시 송산면 상거중앙길 77041-357-777030A등급
7귀빈모텔김순이충청남도 당진시 당진중앙2로 117-7 (읍내동)041-355-856018A등급
8금화여관김미선충청남도 당진시 당진중앙2로 127 (읍내동)041-357-123723A등급
9꿈의궁전여관황효순충청남도 당진시 송악읍 반촌로 50041-355-060334A등급
업소명대표자주소전화번호객실현황비고
56칸호텔김성민충청남도 당진시 신평면 삽교천2길 11-25041-363-730743A등급
57하이힐모텔천영화충청남도 당진시 송악읍 반촌로 153041-355-079836A등급
58한나루 여관신현웅충청남도 당진시 당진중앙2로 71-7 (읍내동)041-352-428830A등급
59한라장여관이순이충청남도 당진시 당진중앙3로 21 (읍내동)041-352-490912A등급
60한진타운주식회사이우대충청남도 당진시 송악읍 한진포구길 50-3041-357-588712A등급
61행담도모텔김상고충청남도 당진시 송악읍 부곡공단로 68-41041-358-119028A등급
62황실장여관박창옥충청남도 당진시 송악읍 송악로 39041-357-477719A등급
63황토장여관황원현충청남도 당진시 송악읍 틀모시로 861041-356-981845A등급
64힐튼모텔박미화충청남도 당진시 송악읍 반촌로 168041-357-400622A등급
65힐하우스모텔김민호충청남도 당진시 석문면 통정3길 30041-352-356631A등급