Overview

Dataset statistics

Number of variables: 6
Number of observations: 57
Missing cells: 7
Missing cells (%): 2.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 50.3 B
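
The percentages in this block follow from the raw counts. The short check below is only a sketch: it assumes that "Missing cells (%)" is taken over all cells (variables times observations) and that the average record size is the rounded total size divided by the number of observations.

```python
# Figures copied from the Dataset statistics block.
n_variables = 6
n_observations = 57
missing_cells = 7
total_size_bytes = 2.8 * 1024  # "2.8 KiB" as reported (already rounded)

total_cells = n_variables * n_observations             # 342 cells in total
missing_pct = 100 * missing_cells / total_cells        # share of empty cells
avg_record_bytes = total_size_bytes / n_observations   # bytes per row

print(f"cells={total_cells}, missing={missing_pct:.1f}%, record={avg_record_bytes:.1f} B")
# prints: cells=342, missing=2.0%, record=50.3 B
```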

Variable types

Categorical: 2
Text: 3
DateTime: 1

Dataset

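Description: This dataset provides information on lodging businesses in Geumsan-gun, Chungcheongnam-do. Its columns are 업종구분 (business category), 업소명 (business name), 행정구역 (administrative district), 주소 (address), 전화번호 (phone number), and 데이터기준일자 (data reference date).
Author: 충청남도 금산군 (Geumsan-gun, Chungcheongnam-do)
URL: https://www.data.go.kr/data/15099798/fileData.do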

Alerts

데이터기준일자 has constant value "2024-03-27" [Constant]
업종구분 is highly imbalanced (78.1%) [Imbalance]
전화번호 has 7 (12.3%) missing values [Missing]
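
The 78.1% imbalance figure is consistent with an entropy-based score: one minus the Shannon entropy of the class shares, normalised by the maximum entropy for that number of classes. The sketch below reproduces the number from the 업종구분 counts (55 and 2). The function name `imbalance_score` is hypothetical, and whether ydata-profiling computes the score in exactly this way is an assumption.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

score = imbalance_score([55, 2])   # 일반: 55 rows, 생활: 2 rows
print(f"{score:.1%}")              # prints: 78.1%
```

A perfectly balanced column scores 0 under this definition, and the score approaches 1 as one class dominates.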

Reproduction

Analysis started: 2024-04-21 02:06:24.955812
Analysis finished: 2024-04-21 02:06:27.127498
Duration: 2.17 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종구분
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B
[Figure: category counts (일반 55, 생활 2)]

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 일반
2nd row: 일반
3rd row: 일반
4th row: 일반
5th row: 일반

Common Values

Value | Count | Frequency (%)
일반 | 55 | 96.5%
생활 | 2 | 3.5%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; same counts as the Common Values table]

업소명
Text

Distinct: 55
Distinct (%): 96.5%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B

Length

Max length: 17
Median length: 9
Mean length: 5.2807018
Min length: 1

Characters and Unicode

Total characters: 301
Distinct characters: 123
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
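
As a rough illustration of where the category counts come from, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below classifies the characters of two 업소명 values from the Sample section. It covers categories only, since the standard library has no script or block lookup, and it is not claimed to be how the report itself computes these figures.

```python
import unicodedata
from collections import Counter

# Two 업소명 values taken from the Sample section of this report.
names = ["연신여인숙", "신데렐라 파크"]

# Long names for the two-letter general-category codes that occur here.
LABELS = {"Lo": "Other Letter", "Zs": "Space Separator"}

category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)
for code, count in category_counts.items():
    print(LABELS.get(code, code), count)
# prints:
# Other Letter 11
# Space Separator 1
```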

Unique

Unique: 53
Unique (%): 93.0%

Sample

1st row: 연신여인숙
2nd row: 장수여인숙
3rd row: 산장모텔
4th row: 힐튼모텔
5th row: 거북장여관

Value | Count | Frequency (%)
자바라무인호텔 | 2 | 3.4%
나인모텔 | 2 | 3.4%
체리의향기 | 1 | 1.7%
비단골관광농원 | 1 | 1.7%
연신여인숙 | 1 | 1.7%
예스무인텔 | 1 | 1.7%
글로엘리트 | 1 | 1.7%
제이모텔 | 1 | 1.7%
월영산모텔 | 1 | 1.7%
스테이인터뷰금산 | 1 | 1.7%
Other values (46) | 46 | 79.3%

Most occurring characters

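Value | Count | Frequency (%)
(not preserved) | 27 | 9.0%
(not preserved) | 17 | 5.6%
(not preserved) | 15 | 5.0%
(not preserved) | 10 | 3.3%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 8 | 2.7%
(not preserved) | 7 | 2.3%
Other values (113) | 181 | 60.1%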

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 297 | 98.7%
Close Punctuation | 1 | 0.3%
Uppercase Letter | 1 | 0.3%
Open Punctuation | 1 | 0.3%
Space Separator | 1 | 0.3%

Most frequent character per category

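Other Letter
Value | Count | Frequency (%)
(not preserved) | 27 | 9.1%
(not preserved) | 17 | 5.7%
(not preserved) | 15 | 5.1%
(not preserved) | 10 | 3.4%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 8 | 2.7%
(not preserved) | 7 | 2.4%
Other values (109) | 177 | 59.6%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
Q | 1 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%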

Most occurring scripts

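Value | Count | Frequency (%)
Hangul | 297 | 98.7%
Common | 3 | 1.0%
Latin | 1 | 0.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 27 | 9.1%
(not preserved) | 17 | 5.7%
(not preserved) | 15 | 5.1%
(not preserved) | 10 | 3.4%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 8 | 2.7%
(not preserved) | 7 | 2.4%
Other values (109) | 177 | 59.6%

Common
Value | Count | Frequency (%)
) | 1 | 33.3%
( | 1 | 33.3%
(space) | 1 | 33.3%

Latin
Value | Count | Frequency (%)
Q | 1 | 100.0%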

Most occurring blocks

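Value | Count | Frequency (%)
Hangul | 297 | 98.7%
ASCII | 4 | 1.3%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not preserved) | 27 | 9.1%
(not preserved) | 17 | 5.7%
(not preserved) | 15 | 5.1%
(not preserved) | 10 | 3.4%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 9 | 3.0%
(not preserved) | 8 | 2.7%
(not preserved) | 7 | 2.4%
Other values (109) | 177 | 59.6%

ASCII
Value | Count | Frequency (%)
) | 1 | 25.0%
Q | 1 | 25.0%
( | 1 | 25.0%
(space) | 1 | 25.0%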

행정구역
Categorical

Distinct: 7
Distinct (%): 12.3%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B
[Figure: category counts (금산읍 16, 진산면 16, 복수면 11, 추부면, 금성면, Other values (2))]

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 1
Unique (%): 1.8%

Sample

1st row: 금산읍
2nd row: 금산읍
3rd row: 금산읍
4th row: 금산읍
5th row: 금산읍

Common Values

Value | Count | Frequency (%)
금산읍 | 16 | 28.1%
진산면 | 16 | 28.1%
복수면 | 11 | 19.3%
추부면 | 7 | 12.3%
금성면 | 3 | 5.3%
남일면 | 3 | 5.3%
제원면 | 1 | 1.8%
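
The report separates "Distinct" (how many different values occur) from "Unique" (how many values occur exactly once). A minimal sketch using the 행정구역 counts from the Common Values table shows how both figures can be derived; the variable names are illustrative only.

```python
from collections import Counter

# Counts copied from the 행정구역 Common Values table.
counts = Counter({
    "금산읍": 16, "진산면": 16, "복수면": 11, "추부면": 7,
    "금성면": 3, "남일면": 3, "제원면": 1,
})

n_rows = sum(counts.values())                        # total observations
distinct = len(counts)                               # values seen at least once
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once

print(distinct, f"{distinct / n_rows:.1%}")   # prints: 7 12.3%
print(unique, f"{unique / n_rows:.1%}")       # prints: 1 1.8%
```

Only 제원면 occurs once, which is why the section reports a single unique value among seven distinct ones.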

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; same counts as the Common Values table]

주소
Text

Distinct: 56
Distinct (%): 98.2%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B

Length

Max length: 34
Median length: 29
Mean length: 21.54386
Min length: 18

Characters and Unicode

Total characters: 1228
Distinct characters: 84
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 55
Unique (%): 96.5%

Sample

1st row: 충청남도 금산군 금산읍 뒷담말길 17-5
2nd row: 충청남도 금산군 금산읍 뒷담말길 9-4
3rd row: 충청남도 금산군 금산읍 비호로 76
4th row: 충청남도 금산군 금산읍 금산로 1514
5th row: 충청남도 금산군 금산읍 인삼로 109

Value | Count | Frequency (%)
충청남도 | 57 | 19.5%
금산군 | 57 | 19.5%
금산읍 | 16 | 5.5%
진산면 | 16 | 5.5%
복수로 | 12 | 4.1%
복수면 | 11 | 3.8%
추부면 | 7 | 2.4%
금산로 | 5 | 1.7%
대둔산로 | 5 | 1.7%
산내로 | 4 | 1.4%
Other values (85) | 102 | 34.9%
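
The word table for 주소 looks like the result of splitting each address on whitespace and counting the pieces. That reading is an assumption about the tool, but it agrees with the totals, since the counts sum to 292 words and 57 / 292 is 19.5%. The sketch below applies the same idea to the five sample addresses.

```python
from collections import Counter

# The five 주소 values listed in the Sample block.
addresses = [
    "충청남도 금산군 금산읍 뒷담말길 17-5",
    "충청남도 금산군 금산읍 뒷담말길 9-4",
    "충청남도 금산군 금산읍 비호로 76",
    "충청남도 금산군 금산읍 금산로 1514",
    "충청남도 금산군 금산읍 인삼로 109",
]

# str.split() with no argument splits on any run of whitespace.
word_counts = Counter(word for addr in addresses for word in addr.split())

for word, count in word_counts.most_common(4):
    print(word, count)
```

On this small sample the province, county, and town tokens each appear five times, which mirrors how 충청남도 and 금산군 dominate the full table.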

Most occurring characters

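Value | Count | Frequency (%)
(space) | 238 | 19.4%
(not preserved) | 109 | 8.9%
(not preserved) | 82 | 6.7%
(not preserved) | 60 | 4.9%
(not preserved) | 60 | 4.9%
(not preserved) | 58 | 4.7%
(not preserved) | 58 | 4.7%
(not preserved) | 57 | 4.6%
(not preserved) | 44 | 3.6%
(not preserved) | 41 | 3.3%
Other values (74) | 421 | 34.3%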

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 781 | 63.6%
Space Separator | 238 | 19.4%
Decimal Number | 181 | 14.7%
Dash Punctuation | 17 | 1.4%
Other Punctuation | 5 | 0.4%
Open Punctuation | 3 | 0.2%
Close Punctuation | 3 | 0.2%

Most frequent character per category

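Other Letter
Value | Count | Frequency (%)
(not preserved) | 109 | 14.0%
(not preserved) | 82 | 10.5%
(not preserved) | 60 | 7.7%
(not preserved) | 60 | 7.7%
(not preserved) | 58 | 7.4%
(not preserved) | 58 | 7.4%
(not preserved) | 57 | 7.3%
(not preserved) | 44 | 5.6%
(not preserved) | 41 | 5.2%
(not preserved) | 24 | 3.1%
Other values (59) | 188 | 24.1%

Decimal Number
Value | Count | Frequency (%)
1 | 35 | 19.3%
2 | 22 | 12.2%
4 | 22 | 12.2%
3 | 19 | 10.5%
5 | 18 | 9.9%
7 | 15 | 8.3%
9 | 14 | 7.7%
0 | 13 | 7.2%
6 | 13 | 7.2%
8 | 10 | 5.5%

Space Separator
Value | Count | Frequency (%)
(space) | 238 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 17 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 5 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%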

Most occurring scripts

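Value | Count | Frequency (%)
Hangul | 781 | 63.6%
Common | 447 | 36.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 109 | 14.0%
(not preserved) | 82 | 10.5%
(not preserved) | 60 | 7.7%
(not preserved) | 60 | 7.7%
(not preserved) | 58 | 7.4%
(not preserved) | 58 | 7.4%
(not preserved) | 57 | 7.3%
(not preserved) | 44 | 5.6%
(not preserved) | 41 | 5.2%
(not preserved) | 24 | 3.1%
Other values (59) | 188 | 24.1%

Common
Value | Count | Frequency (%)
(space) | 238 | 53.2%
1 | 35 | 7.8%
2 | 22 | 4.9%
4 | 22 | 4.9%
3 | 19 | 4.3%
5 | 18 | 4.0%
- | 17 | 3.8%
7 | 15 | 3.4%
9 | 14 | 3.1%
0 | 13 | 2.9%
Other values (5) | 34 | 7.6%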

Most occurring blocks

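Value | Count | Frequency (%)
Hangul | 781 | 63.6%
ASCII | 447 | 36.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 238 | 53.2%
1 | 35 | 7.8%
2 | 22 | 4.9%
4 | 22 | 4.9%
3 | 19 | 4.3%
5 | 18 | 4.0%
- | 17 | 3.8%
7 | 15 | 3.4%
9 | 14 | 3.1%
0 | 13 | 2.9%
Other values (5) | 34 | 7.6%

Hangul
Value | Count | Frequency (%)
(not preserved) | 109 | 14.0%
(not preserved) | 82 | 10.5%
(not preserved) | 60 | 7.7%
(not preserved) | 60 | 7.7%
(not preserved) | 58 | 7.4%
(not preserved) | 58 | 7.4%
(not preserved) | 57 | 7.3%
(not preserved) | 44 | 5.6%
(not preserved) | 41 | 5.2%
(not preserved) | 24 | 3.1%
Other values (59) | 188 | 24.1%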

전화번호
Text

MISSING 

Distinct: 48
Distinct (%): 96.0%
Missing: 7
Missing (%): 12.3%
Memory size: 588.0 B

Length

Max length: 14
Median length: 14
Mean length: 14
Min length: 14

Characters and Unicode

Total characters: 700
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 46
Unique (%): 92.0%
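
The percentages for 전화번호 use two different denominators. The figures are consistent with "Missing (%)" being taken over all 57 rows, while "Distinct (%)" and "Unique (%)" are taken over the 50 non-missing values. The check below is a sketch under that reading, using only numbers stated in this section.

```python
# Figures copied from the 전화번호 section.
n_rows = 57
n_missing = 7
n_distinct = 48
n_unique = 46
value_length = 14       # every non-missing value has length 14
total_characters = 700

n_present = n_rows - n_missing   # 50 non-missing values

print(f"missing:  {n_missing / n_rows:.1%}")       # prints: missing:  12.3%
print(f"distinct: {n_distinct / n_present:.1%}")   # prints: distinct: 96.0%
print(f"unique:   {n_unique / n_present:.1%}")     # prints: unique:   92.0%

# The character total also matches 50 values of 14 characters each.
assert n_present * value_length == total_characters
```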

Sample

1st row: 041- 754-2321
2nd row: 041-751 -0581
3rd row: 041- 752-1580
4th row: 041- 752-1107
5th row: 041- 753-2828

Value | Count | Frequency (%)
041 | 48 | 41.0%
752 | 6 | 5.1%
753 | 4 | 3.4%
751 | 3 | 2.6%
0010 | 2 | 1.7%
7552 | 2 | 1.7%
754 | 2 | 1.7%
750 | 1 | 0.9%
753-2662 | 1 | 0.9%
752-3770 | 1 | 0.9%
Other values (47) | 47 | 40.2%
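
The samples and the word table show that the phone numbers carry stray spaces in inconsistent positions, for example "041- 754-2321" and "041 -751 -6200". The reported length of 14 against 13 visible characters in some samples suggests a further leading or trailing space. A minimal cleanup sketch follows; `normalize_phone` is a hypothetical helper, not part of the dataset or of ydata-profiling, and it assumes that whitespace never carries meaning in this column.

```python
import re

def normalize_phone(raw):
    """Remove every whitespace character; pass missing values through as None."""
    if raw is None:
        return None
    return re.sub(r"\s+", "", raw)

# Spacing variants seen in this report, plus one missing value.
samples = [" 041- 754-2321", " 041-751 -0581", "041 -751 -6200", None]
print([normalize_phone(s) for s in samples])
# prints: ['041-754-2321', '041-751-0581', '041-751-6200', None]
```

After this step every non-missing value would have the uniform shape of area code, exchange, and line number joined by hyphens, which makes duplicate detection more reliable.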

Most occurring characters

Value | Count | Frequency (%)
(space) | 100 | 14.3%
- | 100 | 14.3%
0 | 96 | 13.7%
1 | 83 | 11.9%
4 | 75 | 10.7%
5 | 66 | 9.4%
7 | 65 | 9.3%
2 | 31 | 4.4%
3 | 31 | 4.4%
8 | 23 | 3.3%
Other values (2) | 30 | 4.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 500 | 71.4%
Space Separator | 100 | 14.3%
Dash Punctuation | 100 | 14.3%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 96 | 19.2%
1 | 83 | 16.6%
4 | 75 | 15.0%
5 | 66 | 13.2%
7 | 65 | 13.0%
2 | 31 | 6.2%
3 | 31 | 6.2%
8 | 23 | 4.6%
6 | 16 | 3.2%
9 | 14 | 2.8%

Space Separator
Value | Count | Frequency (%)
(space) | 100 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 100 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 700 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
(space) | 100 | 14.3%
- | 100 | 14.3%
0 | 96 | 13.7%
1 | 83 | 11.9%
4 | 75 | 10.7%
5 | 66 | 9.4%
7 | 65 | 9.3%
2 | 31 | 4.4%
3 | 31 | 4.4%
8 | 23 | 3.3%
Other values (2) | 30 | 4.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 700 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 100 | 14.3%
- | 100 | 14.3%
0 | 96 | 13.7%
1 | 83 | 11.9%
4 | 75 | 10.7%
5 | 66 | 9.4%
7 | 65 | 9.3%
2 | 31 | 4.4%
3 | 31 | 4.4%
8 | 23 | 3.3%
Other values (2) | 30 | 4.3%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Memory size: 588.0 B
Minimum: 2024-03-27 00:00:00
Maximum: 2024-03-27 00:00:00
[Figure: Histogram with fixed size bins (bins=1)]

Correlations

[Figure: correlation heatmap]

Variable | 업종구분 | 업소명 | 행정구역 | 주소 | 전화번호
업종구분 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000
업소명 | 1.000 | 1.000 | 1.000 | 0.991 | 1.000
행정구역 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000
주소 | 1.000 | 0.991 | 1.000 | 1.000 | 1.000
전화번호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

[Figure: correlation heatmap]

Variable | 행정구역 | 업종구분
행정구역 | 1.000 | 0.000
업종구분 | 0.000 | 1.000

[Figure: correlation heatmap]

Variable | 업종구분 | 행정구역
업종구분 | 1.000 | 0.000
행정구역 | 0.000 | 1.000

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

First rows

Index | 업종구분 | 업소명 | 행정구역 | 주소 | 전화번호 | 데이터기준일자
0 | 일반 | 연신여인숙 | 금산읍 | 충청남도 금산군 금산읍 뒷담말길 17-5 | 041- 754-2321 | 2024-03-27
1 | 일반 | 장수여인숙 | 금산읍 | 충청남도 금산군 금산읍 뒷담말길 9-4 | <NA> | 2024-03-27
2 | 일반 | 산장모텔 | 금산읍 | 충청남도 금산군 금산읍 비호로 76 | 041-751 -0581 | 2024-03-27
3 | 일반 | 힐튼모텔 | 금산읍 | 충청남도 금산군 금산읍 금산로 1514 | 041- 752-1580 | 2024-03-27
4 | 일반 | 거북장여관 | 금산읍 | 충청남도 금산군 금산읍 인삼로 109 | 041- 752-1107 | 2024-03-27
5 | 일반 | 황금장여관 | 금산읍 | 충청남도 금산군 금산읍 인삼로 120 (,30) | 041- 753-2828 | 2024-03-27
6 | 일반 | 세종 | 금산읍 | 충청남도 금산군 금산읍 금산로 1542 | 041- 751-2400 | 2024-03-27
7 | 일반 | 물돌장여관 | 금산읍 | 충청남도 금산군 금산읍 향군길 9 | 041- 751-1810 | 2024-03-27
8 | 일반 | 호정장 | 금산읍 | 충청남도 금산군 금산읍 중도리 481 | 041- 751-0395 | 2024-03-27
9 | 일반 | 신데렐라 파크 | 추부면 | 충청남도 금산군 추부면 산내로 20 | 041- 752-2466 | 2024-03-27

Last rows

Index | 업종구분 | 업소명 | 행정구역 | 주소 | 전화번호 | 데이터기준일자
47 | 일반 | 금산인삼호텔 | 금산읍 | 충청남도 금산군 금산읍 인삼광장로 47 | 041 -751 -6200 | 2024-03-27
48 | 일반 | 애플무인텔 | 추부면 | 충청남도 금산군 추부면 산내로 46 | <NA> | 2024-03-27
49 | 일반 | 더숲 | 진산면 | 충청남도 금산군 진산면 대둔산로 75-20 | <NA> | 2024-03-27
50 | 일반 | 더숲엔젤 | 진산면 | 충청남도 금산군 진산면 대둔산로 75-20 | <NA> | 2024-03-27
51 | 일반 | 리치모텔 | 복수면 | 충청남도 금산군 복수면 복수로 462, 리치모텔 | 041 -734 -9103 | 2024-03-27
52 | 일반 | 꿈의궁전 | 남일면 | 충청남도 금산군 남일면 금산로 33 | 041 -753 -1188 | 2024-03-27
53 | 일반 | 하이힐무인호텔 | 추부면 | 충청남도 금산군 추부면 서대산로 161, 하이힐무인호텔 주1동 | <NA> | 2024-03-27
54 | 일반 | 자바라무인호텔 | 복수면 | 충청남도 금산군 복수면 복수로 851 | 041 -752 -7552 | 2024-03-27
55 | 생활 | 비단골관광농원 | 진산면 | 충청남도 금산군 진산면 태고사로 550-24, 비단골관광농원 | 041 -752 -4014 | 2024-03-27
56 | 생활 | 더슈나애견펜션 | 진산면 | 충청남도 금산군 진산면 살구정길 178-13 | <NA> | 2024-03-27