Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 39681 |
Missing cells (%) | 79.4% |
Duplicate rows | 1 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 468.8 KiB |
Average record size in memory | 48.0 B |
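The missing-cell totals above can be cross-checked against the per-variable missing counts that the report lists in its alerts. A minimal sketch, assuming those counts and that 업종명 contributes no missing cells (the variable names below are illustrative and not part of the report):

```python
# Per-column missing counts as listed in the report's alerts;
# 업종명 is reported with 0 missing values.
missing_per_column = {
    "업종명": 0,
    "업소명": 9918,
    "주소": 9918,
    "전화번호": 9927,
    "데이터기준일자": 9918,
}
n_rows = 10000
n_columns = len(missing_per_column)

# Total missing cells and their share of all cells (rows x columns).
missing_cells = sum(missing_per_column.values())
missing_share = missing_cells / (n_rows * n_columns)

print(missing_cells)           # 39681
print(f"{missing_share:.1%}")  # 79.4%
```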
Variable types
Categorical | 1 |
---|---|
Text | 3 |
DateTime | 1 |
Dataset
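Description | Provides data on the current status of lodging businesses, a category of public hygiene business, within Jeju City, Jeju Special Self-Governing Province. |
---|---|
Author | Jeju City, Jeju Special Self-Governing Province (제주특별자치도 제주시) |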
URL | https://www.data.go.kr/data/15056155/fileData.do |
데이터기준일자 has constant value "2021-02-15 00:00:00" | Constant |
Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
업종명 is highly imbalanced (95.3%) | Imbalance |
업소명 has 9918 (99.2%) missing values | Missing |
주소 has 9918 (99.2%) missing values | Missing |
전화번호 has 9927 (99.3%) missing values | Missing |
데이터기준일자 has 9918 (99.2%) missing values | Missing |
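The 95.3% imbalance figure for 업종명 is consistent with a normalised-entropy score. A minimal sketch, assuming the score is one minus the Shannon entropy of the class distribution divided by the maximum entropy for that number of classes (this definition is an assumption that happens to reproduce the reported value; the report itself does not state the formula):

```python
import math

# Value counts for 업종명 taken from its Common Values table.
counts = [9918, 64, 18]
total = sum(counts)

# Shannon entropy (base 2) of the observed class distribution.
entropy = -sum((c / total) * math.log2(c / total) for c in counts)

# Normalise by the maximum entropy for this many classes and invert,
# so 0 means perfectly balanced and values near 1 mean one dominant class.
imbalance = 1 - entropy / math.log2(len(counts))

print(f"{imbalance:.1%}")  # 95.3%
```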
Reproduction
Analysis started | 2023-12-12 08:22:12.873424 |
---|---|
Analysis finished | 2023-12-12 08:22:13.670169 |
Duration | 0.8 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
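A report of this kind can be regenerated with the `ProfileReport` class from ydata-profiling. The sketch below is illustrative only: the helper name `build_report`, the CSV path passed to it, the CP949 encoding, and the report title are all assumptions, and the settings stored in the original config.json are not reproduced here.

```python
def build_report(csv_path, output_path="report.html"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the helper can be defined even where
    # pandas and ydata-profiling are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the downloaded file.
    df = pd.read_csv(csv_path, encoding="cp949")

    # Build the profile with default settings and export it as HTML.
    report = ProfileReport(df, title="Jeju City lodging businesses")
    report.to_file(output_path)
    return output_path
```

Calling `build_report("jeju_lodging.csv")` (a hypothetical file name) would write `report.html` to the working directory.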
업종명 (business type)
Categorical
IMBALANCE
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count |
---|---|
<NA> | 9918 |
숙박업(일반) | 64 |
숙박업(생활) | 18 |
Length
Max length | 7 |
---|---|
Median length | 4 |
Mean length | 4.0246 |
Min length | 4 |
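The length statistics suggest that `<NA>` was profiled as a literal four-character string in this column, which would also explain why 업종명 reports 0 missing values. A minimal sketch of the arithmetic, assuming the counts from the Common Values table for this variable:

```python
# Counts from the 업종명 Common Values table; "<NA>" is treated
# as an ordinary string here, not as a missing value.
value_counts = {"<NA>": 9918, "숙박업(일반)": 64, "숙박업(생활)": 18}

total_rows = sum(value_counts.values())
total_chars = sum(len(value) * count for value, count in value_counts.items())

print(total_chars / total_rows)            # 4.0246 (mean length)
print(min(len(v) for v in value_counts))   # 4 (min length)
print(max(len(v) for v in value_counts))   # 7 (max length)
```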
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9918 | 99.2% |
숙박업(일반) | 64 | 0.6% |
숙박업(생활) | 18 | 0.2% |
업소명 (establishment name)
Text
MISSING
Distinct | 82 |
---|---|
Distinct (%) | 100.0% |
Missing | 9918 |
Missing (%) | 99.2% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
호텔엘린 | 1 | 1.2% |
삼해인 | 1 | 1.2% |
엠버서더 | 1 | 1.2% |
노형호텔 | 1 | 1.2% |
브라보 | 1 | 1.2% |
노노레타 | 1 | 1.2% |
산지물호텔 | 1 | 1.2% |
하버 | 1 | 1.2% |
케이모텔 | 1 | 1.2% |
라온골프클럽휴양콘도미니엄 | 1 | 1.2% |
Other values (75) | 75 | 88.2% |
Most occurring characters
Value | Count | Frequency (%) |
텔 | 26 | 6.9% |
호 | 20 | 5.3% |
스 | 13 | 3.4% |
주 | 12 | 3.2% |
제 | 11 | 2.9% |
라 | 8 | 2.1% |
하 | 7 | 1.9% |
아 | 7 | 1.9% |
우 | 6 | 1.6% |
오 | 6 | 1.6% |
Other values (159) | 262 | 69.3% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 369 | 97.6% |
Space Separator | 3 | 0.8% |
Decimal Number | 3 | 0.8% |
Close Punctuation | 1 | 0.3% |
Uppercase Letter | 1 | 0.3% |
Open Punctuation | 1 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
텔 | 26 | 7.0% |
호 | 20 | 5.4% |
스 | 13 | 3.5% |
주 | 12 | 3.3% |
제 | 11 | 3.0% |
라 | 8 | 2.2% |
하 | 7 | 1.9% |
아 | 7 | 1.9% |
우 | 6 | 1.6% |
오 | 6 | 1.6% |
Other values (153) | 253 | 68.6% |
Decimal Number
Value | Count | Frequency (%) |
2 | 2 | 66.7% |
9 | 1 | 33.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 3 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 | 100.0% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 1 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 369 | 97.6% |
Common | 8 | 2.1% |
Latin | 1 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
텔 | 26 | 7.0% |
호 | 20 | 5.4% |
스 | 13 | 3.5% |
주 | 12 | 3.3% |
제 | 11 | 3.0% |
라 | 8 | 2.2% |
하 | 7 | 1.9% |
아 | 7 | 1.9% |
우 | 6 | 1.6% |
오 | 6 | 1.6% |
Other values (153) | 253 | 68.6% |
Common
Value | Count | Frequency (%) |
(space) | 3 | 37.5% |
2 | 2 | 25.0% |
9 | 1 | 12.5% |
) | 1 | 12.5% |
( | 1 | 12.5% |
Latin
Value | Count | Frequency (%) |
T | 1 | 100.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 369 | 97.6% |
ASCII | 9 | 2.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
텔 | 26 | 7.0% |
호 | 20 | 5.4% |
스 | 13 | 3.5% |
주 | 12 | 3.3% |
제 | 11 | 3.0% |
라 | 8 | 2.2% |
하 | 7 | 1.9% |
아 | 7 | 1.9% |
우 | 6 | 1.6% |
오 | 6 | 1.6% |
Other values (153) | 253 | 68.6% |
ASCII
Value | Count | Frequency (%) |
(space) | 3 | 33.3% |
2 | 2 | 22.2% |
9 | 1 | 11.1% |
) | 1 | 11.1% |
T | 1 | 11.1% |
( | 1 | 11.1% |
주소 (address)
Text
MISSING
Distinct | 82 |
---|---|
Distinct (%) | 100.0% |
Missing | 9918 |
Missing (%) | 99.2% |
Memory size | 156.2 KiB |
Length
Max length | 29 |
---|---|
Median length | 26 |
Mean length | 21.073171 |
Min length | 17 |
Characters and Unicode
Total characters | 1728 |
---|---|
Distinct characters | 95 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 82 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 제주특별자치도 제주시 신광로4길 38 |
---|---|
2nd row | 제주특별자치도 제주시 동문로 138 |
3rd row | 제주특별자치도 제주시 조천읍 명림로 655-77 |
4th row | 제주특별자치도 제주시 우도면 영일길 156-32 |
5th row | 제주특별자치도 제주시 애월읍 애원로 74 |
Value | Count | Frequency (%) |
제주특별자치도 | 82 | 23.2% |
제주시 | 82 | 23.2% |
애월읍 | 9 | 2.5% |
조천읍 | 7 | 2.0% |
3 | 6 | 1.7% |
한림읍 | 5 | 1.4% |
남조로 | 4 | 1.1% |
구좌읍 | 3 | 0.8% |
12 | 3 | 0.8% |
도령로 | 3 | 0.8% |
Other values (128) | 150 | 42.4% |
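The table above appears to count whitespace-separated tokens rather than whole addresses, since 제주특별자치도 and 제주시 each occur once per non-missing row. A minimal sketch of that tally, assuming plain whitespace splitting and using only the five sample rows shown for 주소 (so the counts are much smaller than in the report):

```python
from collections import Counter

# The five sample addresses shown for 주소.
addresses = [
    "제주특별자치도 제주시 신광로4길 38",
    "제주특별자치도 제주시 동문로 138",
    "제주특별자치도 제주시 조천읍 명림로 655-77",
    "제주특별자치도 제주시 우도면 영일길 156-32",
    "제주특별자치도 제주시 애월읍 애원로 74",
]

# Split each address on whitespace and tally every token.
token_counts = Counter(
    token for address in addresses for token in address.split()
)

# The province and city tokens appear once per address.
for token, count in token_counts.most_common(2):
    print(token, count)
```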
Most occurring characters
Value | Count | Frequency (%) |
(space) | 272 | 15.7% |
제 | 166 | 9.6% |
주 | 164 | 9.5% |
도 | 87 | 5.0% |
특 | 82 | 4.7% |
별 | 82 | 4.7% |
자 | 82 | 4.7% |
치 | 82 | 4.7% |
시 | 82 | 4.7% |
1 | 66 | 3.8% |
Other values (85) | 563 | 32.6% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1181 | 68.3% |
Space Separator | 272 | 15.7% |
Decimal Number | 252 | 14.6% |
Dash Punctuation | 23 | 1.3% |
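The category names in this table correspond to Unicode general categories. A minimal sketch of how one address breaks down, using Python's standard `unicodedata` module on the third sample row; the mapping from short codes to the report's labels is written out by hand here and covers only the four categories listed above:

```python
import unicodedata
from collections import Counter

# Short Unicode category codes mapped to the labels used in the report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
}

address = "제주특별자치도 제주시 조천읍 명림로 655-77"  # third sample row

# Classify every character and count occurrences per category.
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for ch in address
)

for name, count in category_counts.most_common():
    print(name, count)
```

For this one address the tally is 16 Other Letter, 5 Decimal Number, 4 Space Separator, and 1 Dash Punctuation characters.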
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
제 | 166 | 14.1% |
주 | 164 | 13.9% |
도 | 87 | 7.4% |
특 | 82 | 6.9% |
별 | 82 | 6.9% |
자 | 82 | 6.9% |
치 | 82 | 6.9% |
시 | 82 | 6.9% |
로 | 56 | 4.7% |
길 | 45 | 3.8% |
Other values (73) | 253 | 21.4% |
Decimal Number
Value | Count | Frequency (%) |
1 | 66 | 26.2% |
3 | 36 | 14.3% |
2 | 33 | 13.1% |
4 | 26 | 10.3% |
7 | 20 | 7.9% |
6 | 17 | 6.7% |
5 | 16 | 6.3% |
8 | 15 | 6.0% |
9 | 13 | 5.2% |
0 | 10 | 4.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 272 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 23 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1181 | 68.3% |
Common | 547 | 31.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
제 | 166 | 14.1% |
주 | 164 | 13.9% |
도 | 87 | 7.4% |
특 | 82 | 6.9% |
별 | 82 | 6.9% |
자 | 82 | 6.9% |
치 | 82 | 6.9% |
시 | 82 | 6.9% |
로 | 56 | 4.7% |
길 | 45 | 3.8% |
Other values (73) | 253 | 21.4% |
Common
Value | Count | Frequency (%) |
(space) | 272 | 49.7% |
1 | 66 | 12.1% |
3 | 36 | 6.6% |
2 | 33 | 6.0% |
4 | 26 | 4.8% |
- | 23 | 4.2% |
7 | 20 | 3.7% |
6 | 17 | 3.1% |
5 | 16 | 2.9% |
8 | 15 | 2.7% |
Other values (2) | 23 | 4.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1181 | 68.3% |
ASCII | 547 | 31.7% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 272 | 49.7% |
1 | 66 | 12.1% |
3 | 36 | 6.6% |
2 | 33 | 6.0% |
4 | 26 | 4.8% |
- | 23 | 4.2% |
7 | 20 | 3.7% |
6 | 17 | 3.1% |
5 | 16 | 2.9% |
8 | 15 | 2.7% |
Other values (2) | 23 | 4.2% |
Hangul
Value | Count | Frequency (%) |
제 | 166 | 14.1% |
주 | 164 | 13.9% |
도 | 87 | 7.4% |
특 | 82 | 6.9% |
별 | 82 | 6.9% |
자 | 82 | 6.9% |
치 | 82 | 6.9% |
시 | 82 | 6.9% |
로 | 56 | 4.7% |
길 | 45 | 3.8% |
Other values (73) | 253 | 21.4% |
전화번호 (phone number)
Text
MISSING
Distinct | 73 |
---|---|
Distinct (%) | 100.0% |
Missing | 9927 |
Missing (%) | 99.3% |
Memory size | 156.2 KiB |
Length
Max length | 12 |
---|---|
Median length | 12 |
Mean length | 11.958904 |
Min length | 9 |
Characters and Unicode
Total characters | 873 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 73 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 064-748-2280 |
---|---|
2nd row | 064-753-8011 |
3rd row | 064-758-6565 |
4th row | 064-751-4988 |
5th row | 064-747-8933 |
Value | Count | Frequency (%) |
064-722-1444 | 1 | 1.4% |
064-757-6582 | 1 | 1.4% |
064-747-2263 | 1 | 1.4% |
064-758-7076 | 1 | 1.4% |
064-783-0804 | 1 | 1.4% |
064-748-2105 | 1 | 1.4% |
064-754-6000 | 1 | 1.4% |
064-756-8700 | 1 | 1.4% |
064-795-1000 | 1 | 1.4% |
064-742-7775 | 1 | 1.4% |
Other values (63) | 63 | 86.3% |
Most occurring characters
Value | Count | Frequency (%) |
- | 145 | 16.6% |
0 | 137 | 15.7% |
4 | 113 | 12.9% |
6 | 105 | 12.0% |
7 | 102 | 11.7% |
5 | 61 | 7.0% |
2 | 50 | 5.7% |
9 | 45 | 5.2% |
8 | 41 | 4.7% |
1 | 39 | 4.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 728 | 83.4% |
Dash Punctuation | 145 | 16.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 137 | 18.8% |
4 | 113 | 15.5% |
6 | 105 | 14.4% |
7 | 102 | 14.0% |
5 | 61 | 8.4% |
2 | 50 | 6.9% |
9 | 45 | 6.2% |
8 | 41 | 5.6% |
1 | 39 | 5.4% |
3 | 35 | 4.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 145 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 873 | 100.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 145 | 16.6% |
0 | 137 | 15.7% |
4 | 113 | 12.9% |
6 | 105 | 12.0% |
7 | 102 | 11.7% |
5 | 61 | 7.0% |
2 | 50 | 5.7% |
9 | 45 | 5.2% |
8 | 41 | 4.7% |
1 | 39 | 4.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 873 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 145 | 16.6% |
0 | 137 | 15.7% |
4 | 113 | 12.9% |
6 | 105 | 12.0% |
7 | 102 | 11.7% |
5 | 61 | 7.0% |
2 | 50 | 5.7% |
9 | 45 | 5.2% |
8 | 41 | 4.7% |
1 | 39 | 4.5% |
데이터기준일자 (data reference date)
Date
CONSTANT
MISSING
Distinct | 1 |
---|---|
Distinct (%) | 1.2% |
Missing | 9918 |
Missing (%) | 99.2% |
Memory size | 156.2 KiB |
Minimum | 2021-02-15 00:00:00 |
---|---|
Maximum | 2021-02-15 00:00:00 |
Variable | 업종명 | 업소명 | 주소 | 전화번호 |
---|---|---|---|---|
업종명 | 1.000 | 1.000 | 1.000 | 1.000 |
업소명 | 1.000 | 1.000 | 1.000 | 1.000 |
주소 | 1.000 | 1.000 | 1.000 | 1.000 |
전화번호 | 1.000 | 1.000 | 1.000 | 1.000 |
Index | 업종명 | 업소명 | 주소 | 전화번호 | 데이터기준일자 |
---|---|---|---|---|---|
71007 | <NA> | <NA> | <NA> | <NA> | <NA> |
46424 | <NA> | <NA> | <NA> | <NA> | <NA> |
31793 | <NA> | <NA> | <NA> | <NA> | <NA> |
37901 | <NA> | <NA> | <NA> | <NA> | <NA> |
27231 | <NA> | <NA> | <NA> | <NA> | <NA> |
963 | <NA> | <NA> | <NA> | <NA> | <NA> |
2594 | <NA> | <NA> | <NA> | <NA> | <NA> |
34978 | <NA> | <NA> | <NA> | <NA> | <NA> |
99616 | <NA> | <NA> | <NA> | <NA> | <NA> |
80863 | <NA> | <NA> | <NA> | <NA> | <NA> |
Index | 업종명 | 업소명 | 주소 | 전화번호 | 데이터기준일자 |
---|---|---|---|---|---|
76922 | <NA> | <NA> | <NA> | <NA> | <NA> |
99317 | <NA> | <NA> | <NA> | <NA> | <NA> |
96838 | <NA> | <NA> | <NA> | <NA> | <NA> |
58241 | <NA> | <NA> | <NA> | <NA> | <NA> |
37927 | <NA> | <NA> | <NA> | <NA> | <NA> |
40787 | <NA> | <NA> | <NA> | <NA> | <NA> |
19582 | <NA> | <NA> | <NA> | <NA> | <NA> |
65506 | <NA> | <NA> | <NA> | <NA> | <NA> |
42670 | <NA> | <NA> | <NA> | <NA> | <NA> |
37962 | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
Index | 업종명 | 업소명 | 주소 | 전화번호 | 데이터기준일자 | # duplicates |
---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | 9918 |