Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 29554 |
Missing cells (%) | 59.1% |
Duplicate rows | 1 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 468.8 KiB |
Average record size in memory | 48.0 B |
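The summary figures above are mutually consistent, which a few lines of arithmetic can confirm. The sketch below is only an illustration: the variable names are hypothetical, the per-column missing counts are copied from the alerts and variable sections of this report, and it assumes the two categorical columns contribute no missing cells because their `<NA>` entries are counted as an ordinary category.

```python
# Sanity check of the dataset statistics (illustrative sketch, not part of the report).
n_variables = 5
n_observations = 10_000

# Per-column missing counts as reported in the variable sections.
# Assumption: the two categorical columns report 0 missing values.
missing_per_column = {
    "업종명": 0,
    "업소명": 9826,
    "주소": 9826,
    "전화번호": 9902,
    "데이터기준일자": 0,
}

missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Average record size, derived from the reported total of 468.8 KiB.
total_bytes = 468.8 * 1024
avg_record_bytes = total_bytes / n_observations

print(missing_cells, f"{missing_pct:.1f}%", f"{avg_record_bytes:.1f} B")
```

The three printed values can be compared with the "Missing cells", "Missing cells (%)" and "Average record size in memory" rows.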
Variable types
Categorical | 2 |
---|---|
Text | 3 |
Dataset
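Description | Provides data on the status of beauty businesses, a category of public hygiene businesses, located in Jeju City, Jeju Special Self-Governing Province. |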
---|---|
Author | Jeju City, Jeju Special Self-Governing Province (제주특별자치도 제주시) |
URL | https://www.data.go.kr/data/15056159/fileData.do |
Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
데이터기준일자 is highly overall correlated with 업종명 | High correlation |
업종명 is highly overall correlated with 데이터기준일자 | High correlation |
업종명 is highly imbalanced (95.5%) | Imbalance |
데이터기준일자 is highly imbalanced (87.3%) | Imbalance |
업소명 has 9826 (98.3%) missing values | Missing |
주소 has 9826 (98.3%) missing values | Missing |
전화번호 has 9902 (99.0%) missing values | Missing |
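The two imbalance percentages in the alerts appear consistent with a score of one minus the Shannon entropy of the category counts, normalised by the logarithm of the number of categories. This is an inference from the numbers in this report, not something the report states, and the function name `imbalance_score` is hypothetical. The counts are taken from the Common Values tables; the row "Other values (3) | 4" for 업종명 can only split into 2 + 1 + 1.

```python
import math


def imbalance_score(counts):
    """Return 1 - H/log(k), where H is the Shannon entropy of the counts
    and k is the number of classes. 0 means balanced, 1 means one class dominates."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(n_classes)


# 업종명: 13 categories including <NA>, counts from the Common Values table.
industry_counts = [9826, 80, 35, 21, 16, 5, 4, 3, 3, 3, 2, 1, 1]
# 데이터기준일자: <NA> and 2021-02-15.
date_counts = [9826, 174]

industry_score = imbalance_score(industry_counts)
date_score = imbalance_score(date_counts)
print(f"{industry_score:.3f}", f"{date_score:.3f}")
```

Under this assumption both scores land close to the 95.5% and 87.3% shown in the alerts, which suggests the imbalance is driven almost entirely by the 9826 `<NA>` rows.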
Reproduction
Analysis started | 2023-12-12 08:11:37.579112 |
---|---|
Analysis finished | 2023-12-12 08:11:38.458253 |
Duration | 0.88 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
업종명
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 13 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | 9826 |
---|---|
일반미용업 | 80 |
피부미용업 | 35 |
미용업 | 21 |
네일미용업 | 16 |
Other values (8) | 22 |
Length
Max length | 21 |
---|---|
Median length | 4 |
Mean length | 4.0274 |
Min length | 3 |
Unique
Unique | 2 |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9826 | 98.3% |
일반미용업 | 80 | 0.8% |
피부미용업 | 35 | 0.4% |
미용업 | 21 | 0.2% |
네일미용업 | 16 | 0.2% |
화장ㆍ분장 미용업 | 5 | 0.1% |
일반미용업 화장ㆍ분장 미용업 | 4 | < 0.1% |
네일미용업 화장ㆍ분장 미용업 | 3 | < 0.1% |
종합미용업 | 3 | < 0.1% |
피부미용업 네일미용업 | 3 | < 0.1% |
Other values (3) | 4 | < 0.1% |
Length
Value | Count | Frequency (%) |
na | 9826 | |
일반미용업 | 87 | 0.9% |
피부미용업 | 41 | 0.4% |
미용업 | 34 | 0.3% |
네일미용업 | 24 | 0.2% |
화장ㆍ분장 | 13 | 0.1% |
종합미용업 | 3 | < 0.1% |
업소명
Text
MISSING
Distinct | 174 |
---|---|
Distinct (%) | 100.0% |
Missing | 9826 |
Missing (%) | 98.3% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
헤어아트 | 2 | 1.1% |
태후사랑 | 2 | 1.1% |
스킨존 | 1 | 0.5% |
쉼뷰티 | 1 | 0.5% |
헤어캄 | 1 | 0.5% |
고고살롱 | 1 | 0.5% |
깍쟁이헤어 | 1 | 0.5% |
라야롬에스테틱 | 1 | 0.5% |
미라인 | 1 | 0.5% |
설렘주의보 | 1 | 0.5% |
Other values (174) | 174 |
Most occurring characters
Value | Count | Frequency (%) |
어 | 73 | 8.1% |
헤 | 64 | 7.1% |
이 | 35 | 3.9% |
스 | 32 | 3.5% |
리 | 22 | 2.4% |
일 | 19 | 2.1% |
네 | 17 | 1.9% |
아 | 17 | 1.9% |
미 | 17 | 1.9% |
샵 | 15 | 1.7% |
Other values (226) | 591 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 886 | |
Space Separator | 12 | 1.3% |
Decimal Number | 4 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
어 | 73 | 8.2% |
헤 | 64 | 7.2% |
이 | 35 | 4.0% |
스 | 32 | 3.6% |
리 | 22 | 2.5% |
일 | 19 | 2.1% |
네 | 17 | 1.9% |
아 | 17 | 1.9% |
미 | 17 | 1.9% |
샵 | 15 | 1.7% |
Other values (222) | 575 |
Decimal Number
Value | Count | Frequency (%) |
0 | 2 | |
1 | 1 | |
9 | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 12 | |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 886 | |
Common | 16 | 1.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
어 | 73 | 8.2% |
헤 | 64 | 7.2% |
이 | 35 | 4.0% |
스 | 32 | 3.6% |
리 | 22 | 2.5% |
일 | 19 | 2.1% |
네 | 17 | 1.9% |
아 | 17 | 1.9% |
미 | 17 | 1.9% |
샵 | 15 | 1.7% |
Other values (222) | 575 |
Common
Value | Count | Frequency (%) |
(space) | 12 | |
0 | 2 | 12.5% |
1 | 1 | 6.2% |
9 | 1 | 6.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 886 | |
ASCII | 16 | 1.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
어 | 73 | 8.2% |
헤 | 64 | 7.2% |
이 | 35 | 4.0% |
스 | 32 | 3.6% |
리 | 22 | 2.5% |
일 | 19 | 2.1% |
네 | 17 | 1.9% |
아 | 17 | 1.9% |
미 | 17 | 1.9% |
샵 | 15 | 1.7% |
Other values (222) | 575 |
ASCII
Value | Count | Frequency (%) |
(space) | 12 | |
0 | 2 | 12.5% |
1 | 1 | 6.2% |
9 | 1 | 6.2% |
주소
Text
MISSING
Distinct | 172 |
---|---|
Distinct (%) | 98.9% |
Missing | 9826 |
Missing (%) | 98.3% |
Memory size | 156.2 KiB |
Length
Max length | 26 |
---|---|
Median length | 25 |
Mean length | 19.781609 |
Min length | 17 |
Characters and Unicode
Total characters | 3442 |
---|---|
Distinct characters | 113 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 170 |
---|---|
Unique (%) | 97.7% |
Sample
1st row | 제주특별자치도 제주시 우정로11길 18 |
---|---|
2nd row | 제주특별자치도 제주시 서광로 288 |
3rd row | 제주특별자치도 제주시 절물1길 32 |
4th row | 제주특별자치도 제주시 남광북3길 18 |
5th row | 제주특별자치도 제주시 중앙로26길 2 |
Value | Count | Frequency (%) |
제주특별자치도 | 174 | |
제주시 | 174 | |
한림읍 | 8 | 1.1% |
2 | 7 | 1.0% |
34 | 5 | 0.7% |
27 | 4 | 0.6% |
5 | 4 | 0.6% |
9 | 4 | 0.6% |
1 | 4 | 0.6% |
동문로 | 4 | 0.6% |
Other values (244) | 322 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 536 | |
주 | 352 | 10.2% |
제 | 348 | 10.1% |
도 | 178 | 5.2% |
특 | 174 | 5.1% |
별 | 174 | 5.1% |
자 | 174 | 5.1% |
치 | 174 | 5.1% |
시 | 174 | 5.1% |
길 | 111 | 3.2% |
Other values (103) | 1047 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2390 | |
Space Separator | 536 | 15.6% |
Decimal Number | 490 | 14.2% |
Dash Punctuation | 26 | 0.8% |
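The category names in this table correspond to Unicode general categories. As a rough illustration, the sketch below classifies the characters of the five sample addresses shown earlier in this section using Python's standard `unicodedata` module. The mapping of short codes to the long names used in the report (`Lo` = Other Letter, `Zs` = Space Separator, `Nd` = Decimal Number, `Pd` = Dash Punctuation) follows the Unicode standard; the variable names are hypothetical, and only the five sample rows are counted, not the full column.

```python
import unicodedata
from collections import Counter

# The five sample rows listed for the 주소 column.
sample_addresses = [
    "제주특별자치도 제주시 우정로11길 18",
    "제주특별자치도 제주시 서광로 288",
    "제주특별자치도 제주시 절물1길 32",
    "제주특별자치도 제주시 남광북3길 18",
    "제주특별자치도 제주시 중앙로26길 2",
]

# Count the Unicode general category of every character.
category_counts = Counter(
    unicodedata.category(ch) for address in sample_addresses for ch in address
)

for code, count in category_counts.most_common():
    print(code, count)
```

None of the five samples contains a hyphen, so `Pd` does not appear here even though the full column has 26 dash characters.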
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
주 | 352 | |
제 | 348 | |
도 | 178 | |
특 | 174 | |
별 | 174 | |
자 | 174 | |
치 | 174 | |
시 | 174 | |
길 | 111 | 4.6% |
로 | 109 | 4.6% |
Other values (91) | 422 |
Decimal Number
Value | Count | Frequency (%) |
1 | 102 | |
2 | 72 | |
3 | 62 | |
4 | 54 | |
5 | 44 | |
8 | 37 | 7.6% |
7 | 33 | 6.7% |
9 | 31 | 6.3% |
6 | 29 | 5.9% |
0 | 26 | 5.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 536 | |
Dash Punctuation
Value | Count | Frequency (%) |
- | 26 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 2390 | |
Common | 1052 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
주 | 352 | |
제 | 348 | |
도 | 178 | |
특 | 174 | |
별 | 174 | |
자 | 174 | |
치 | 174 | |
시 | 174 | |
길 | 111 | 4.6% |
로 | 109 | 4.6% |
Other values (91) | 422 |
Common
Value | Count | Frequency (%) |
(space) | 536 | |
1 | 102 | 9.7% |
2 | 72 | 6.8% |
3 | 62 | 5.9% |
4 | 54 | 5.1% |
5 | 44 | 4.2% |
8 | 37 | 3.5% |
7 | 33 | 3.1% |
9 | 31 | 2.9% |
6 | 29 | 2.8% |
Other values (2) | 52 | 4.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 2390 | |
ASCII | 1052 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 536 | |
1 | 102 | 9.7% |
2 | 72 | 6.8% |
3 | 62 | 5.9% |
4 | 54 | 5.1% |
5 | 44 | 4.2% |
8 | 37 | 3.5% |
7 | 33 | 3.1% |
9 | 31 | 2.9% |
6 | 29 | 2.8% |
Other values (2) | 52 | 4.9% |
Hangul
Value | Count | Frequency (%) |
주 | 352 | |
제 | 348 | |
도 | 178 | |
특 | 174 | |
별 | 174 | |
자 | 174 | |
치 | 174 | |
시 | 174 | |
길 | 111 | 4.6% |
로 | 109 | 4.6% |
Other values (91) | 422 |
전화번호
Text
MISSING
Distinct | 98 |
---|---|
Distinct (%) | 100.0% |
Missing | 9902 |
Missing (%) | 99.0% |
Memory size | 156.2 KiB |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 12.05102 |
Min length | 12 |
Characters and Unicode
Total characters | 1181 |
---|---|
Distinct characters | 11 |
Distinct categories | 2 |
Distinct scripts | 1 |
Distinct blocks | 1 |
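The length and character totals for 전화번호 can be cross-checked against each other. The sketch below is an illustrative calculation with hypothetical variable names; it relies on the reported minimum and maximum lengths (12 and 13), so every value is assumed to have one of those two lengths, and it uses the dash count of 196 from the character tables in this section.

```python
# Cross-check of the 전화번호 length statistics (illustrative sketch).
n_observations = 10_000
n_missing = 9902
n_values = n_observations - n_missing  # non-missing phone numbers

total_chars = 1181
mean_length = total_chars / n_values

# With lengths restricted to 12 or 13, each 13-character value adds one
# character beyond the 12-character baseline.
n_len13 = total_chars - 12 * n_values
n_len12 = n_values - n_len13

# Two hyphens per number would give 2 * n_values dashes in total.
dashes_per_value = 196 / n_values

print(n_values, round(mean_length, 5), n_len12, n_len13, dashes_per_value)
```

The handful of 13-character values would fit numbers such as the `070-8223-9099` entry listed in this section, while the rest follow the 12-character `064-xxx-xxxx` pattern.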
Unique
Unique | 98 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 064-742-3634 |
---|---|
2nd row | 064-721-8844 |
3rd row | 064-758-3304 |
4th row | 064-757-3088 |
5th row | 064-782-1062 |
Value | Count | Frequency (%) |
064-742-8611 | 1 | 1.0% |
064-758-6781 | 1 | 1.0% |
064-752-0176 | 1 | 1.0% |
070-8223-9099 | 1 | 1.0% |
064-743-0600 | 1 | 1.0% |
064-759-3003 | 1 | 1.0% |
064-725-1220 | 1 | 1.0% |
064-796-0246 | 1 | 1.0% |
064-758-0761 | 1 | 1.0% |
064-756-2861 | 1 | 1.0% |
Other values (88) | 88 |
Most occurring characters
Value | Count | Frequency (%) |
- | 196 | |
0 | 158 | |
4 | 152 | |
7 | 151 | |
6 | 146 | |
2 | 90 | |
5 | 79 | |
1 | 61 | 5.2% |
8 | 57 | 4.8% |
3 | 48 | 4.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 985 | |
Dash Punctuation | 196 | 16.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 158 | |
4 | 152 | |
7 | 151 | |
6 | 146 | |
2 | 90 | |
5 | 79 | |
1 | 61 | 6.2% |
8 | 57 | 5.8% |
3 | 48 | 4.9% |
9 | 43 | 4.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 196 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1181 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 196 | |
0 | 158 | |
4 | 152 | |
7 | 151 | |
6 | 146 | |
2 | 90 | |
5 | 79 | |
1 | 61 | 5.2% |
8 | 57 | 4.8% |
3 | 48 | 4.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1181 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 196 | |
0 | 158 | |
4 | 152 | |
7 | 151 | |
6 | 146 | |
2 | 90 | |
5 | 79 | |
1 | 61 | 5.2% |
8 | 57 | 4.8% |
3 | 48 | 4.1% |
데이터기준일자
Categorical
HIGH CORRELATION, IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | 9826 |
---|---|
2021-02-15 | 174 |
Length
Max length | 10 |
---|---|
Median length | 4 |
Mean length | 4.1044 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9826 | 98.3% |
2021-02-15 | 174 | 1.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9826 | |
2021-02-15 | 174 | 1.7% |
 | 업종명 | 전화번호 |
---|---|---|
업종명 | 1.000 | 1.000 |
전화번호 | 1.000 | 1.000 |
 | 데이터기준일자 | 업종명 |
---|---|---|
데이터기준일자 | 1.000 | 1.000 |
업종명 | 1.000 | 1.000 |
 | 업종명 | 데이터기준일자 |
---|---|---|
업종명 | 1.000 | 1.000 |
데이터기준일자 | 1.000 | 1.000 |
 | 업종명 | 업소명 | 주소 | 전화번호 | 데이터기준일자 |
---|---|---|---|---|---|
1951 | <NA> | <NA> | <NA> | <NA> | <NA> |
56891 | <NA> | <NA> | <NA> | <NA> | <NA> |
35125 | <NA> | <NA> | <NA> | <NA> | <NA> |
48212 | <NA> | <NA> | <NA> | <NA> | <NA> |
69842 | <NA> | <NA> | <NA> | <NA> | <NA> |
23012 | <NA> | <NA> | <NA> | <NA> | <NA> |
20049 | <NA> | <NA> | <NA> | <NA> | <NA> |
66074 | <NA> | <NA> | <NA> | <NA> | <NA> |
75933 | <NA> | <NA> | <NA> | <NA> | <NA> |
48478 | <NA> | <NA> | <NA> | <NA> | <NA> |
 | 업종명 | 업소명 | 주소 | 전화번호 | 데이터기준일자 |
---|---|---|---|---|---|
93117 | <NA> | <NA> | <NA> | <NA> | <NA> |
76312 | <NA> | <NA> | <NA> | <NA> | <NA> |
11237 | <NA> | <NA> | <NA> | <NA> | <NA> |
11006 | <NA> | <NA> | <NA> | <NA> | <NA> |
40142 | <NA> | <NA> | <NA> | <NA> | <NA> |
67015 | <NA> | <NA> | <NA> | <NA> | <NA> |
77128 | <NA> | <NA> | <NA> | <NA> | <NA> |
52466 | <NA> | <NA> | <NA> | <NA> | <NA> |
55079 | <NA> | <NA> | <NA> | <NA> | <NA> |
60359 | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
 | 업종명 | 업소명 | 주소 | 전화번호 | 데이터기준일자 | # duplicates |
---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | 9826 |