Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 10000 |
Missing cells | 39956 |
Missing cells (%) | 79.9% |
Duplicate rows | 1 |
Duplicate rows (%) | < 0.1% |
Total size in memory | 488.3 KiB |
Average record size in memory | 50.0 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 1 |
Text | 2 |
Dataset
Description | 광주광역시에서 지정한 우수숙박업소(크린숙박업소)에 대한 현황자료입니다.(연번,업소명,소재지,객실수) |
---|---|
URL | https://www.data.go.kr/data/15055845/fileData.do |
Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
연번 is highly overall correlated with 자치구 | High correlation |
자치구 is highly overall correlated with 연번 | High correlation |
자치구 is highly imbalanced (99.3%) | Imbalance |
연번 has 9989 (99.9%) missing values | Missing |
업 소 명 has 9989 (99.9%) missing values | Missing |
소 재 지 has 9989 (99.9%) missing values | Missing |
객실수 has 9989 (99.9%) missing values | Missing |
Reproduction
Analysis started | 2023-12-12 12:36:10.288813 |
---|---|
Analysis finished | 2023-12-12 12:36:11.404923 |
Duration | 1.12 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
연번
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 11 |
---|---|
Distinct (%) | 100.0% |
Missing | 9989 |
Missing (%) | 99.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 36.727273 |
Minimum | 9 |
---|---|
Maximum | 68 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 9 |
---|---|
5-th percentile | 15 |
Q1 | 24 |
median | 39 |
Q3 | 44.5 |
95-th percentile | 65 |
Maximum | 68 |
Range | 59 |
Interquartile range (IQR) | 20.5 |
Descriptive statistics
Standard deviation | 17.900229 |
---|---|
Coefficient of variation (CV) | 0.48738246 |
Kurtosis | -0.42792545 |
Mean | 36.727273 |
Median Absolute Deviation (MAD) | 13 |
Skewness | 0.36675511 |
Sum | 404 |
Variance | 320.41818 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
22 | 1 | < 0.1% |
44 | 1 | < 0.1% |
62 | 1 | < 0.1% |
68 | 1 | < 0.1% |
21 | 1 | < 0.1% |
39 | 1 | < 0.1% |
26 | 1 | < 0.1% |
45 | 1 | < 0.1% |
41 | 1 | < 0.1% |
9 | 1 | < 0.1% |
(Missing) | 9989 |
Value | Count | Frequency (%) |
9 | 1 | |
21 | 1 | |
22 | 1 | |
26 | 1 | |
27 | 1 | |
39 | 1 | |
41 | 1 | |
44 | 1 | |
45 | 1 | |
62 | 1 |
Value | Count | Frequency (%) |
68 | 1 | |
62 | 1 | |
45 | 1 | |
44 | 1 | |
41 | 1 | |
39 | 1 | |
27 | 1 | |
26 | 1 | |
22 | 1 | |
21 | 1 |
자치구
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
<NA> | |
---|---|
서구 | 5 |
북구 | 4 |
광산구 | 2 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.998 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 9989 | |
서구 | 5 | 0.1% |
북구 | 4 | < 0.1% |
광산구 | 2 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
na | 9989 | |
서구 | 5 | < 0.1% |
북구 | 4 | < 0.1% |
광산구 | 2 | < 0.1% |
업 소 명
Text
MISSING
 
Distinct | 11 |
---|---|
Distinct (%) | 100.0% |
Missing | 9989 |
Missing (%) | 99.9% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
모텔캐슬 | 1 | 7.7% |
러브스토리모텔 | 1 | 7.7% |
호텔야자 | 1 | 7.7% |
샆모텔 | 1 | 7.7% |
선샤인 | 1 | 7.7% |
아리아모텔 | 1 | 7.7% |
호텔 | 1 | 7.7% |
수 | 1 | 7.7% |
베네치아모텔 | 1 | 7.7% |
주식회사 | 1 | 7.7% |
Other values (3) | 3 |
Most occurring characters
Value | Count | Frequency (%) |
텔 | 9 | 16.1% |
모 | 6 | 10.7% |
아 | 3 | 5.4% |
호 | 3 | 5.4% |
스 | 3 | 5.4% |
리 | 2 | 3.6% |
베 | 2 | 3.6% |
2 | 3.6% | |
토 | 1 | 1.8% |
브 | 1 | 1.8% |
Other values (24) | 24 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 54 | |
Space Separator | 2 | 3.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
텔 | 9 | 16.7% |
모 | 6 | 11.1% |
아 | 3 | 5.6% |
호 | 3 | 5.6% |
스 | 3 | 5.6% |
리 | 2 | 3.7% |
베 | 2 | 3.7% |
토 | 1 | 1.9% |
브 | 1 | 1.9% |
라 | 1 | 1.9% |
Other values (23) | 23 |
Space Separator
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 54 | |
Common | 2 | 3.6% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
텔 | 9 | 16.7% |
모 | 6 | 11.1% |
아 | 3 | 5.6% |
호 | 3 | 5.6% |
스 | 3 | 5.6% |
리 | 2 | 3.7% |
베 | 2 | 3.7% |
토 | 1 | 1.9% |
브 | 1 | 1.9% |
라 | 1 | 1.9% |
Other values (23) | 23 |
Common
Value | Count | Frequency (%) |
2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 54 | |
ASCII | 2 | 3.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
텔 | 9 | 16.7% |
모 | 6 | 11.1% |
아 | 3 | 5.6% |
호 | 3 | 5.6% |
스 | 3 | 5.6% |
리 | 2 | 3.7% |
베 | 2 | 3.7% |
토 | 1 | 1.9% |
브 | 1 | 1.9% |
라 | 1 | 1.9% |
Other values (23) | 23 |
ASCII
Value | Count | Frequency (%) |
2 |
소 재 지
Text
MISSING
 
Distinct | 11 |
---|---|
Distinct (%) | 100.0% |
Missing | 9989 |
Missing (%) | 99.9% |
Memory size | 156.2 KiB |
Length
Max length | 23 |
---|---|
Median length | 20 |
Mean length | 19 |
Min length | 14 |
Characters and Unicode
Total characters | 209 |
---|---|
Distinct characters | 51 |
Distinct categories | 6 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 11 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 서구 상무평화로 97-6 (치평동) |
---|---|
2nd row | 북구 무등로218번길 38 (신안동) |
3rd row | 광산구 사암로171번길 90-24(우산동) |
4th row | 광산구 용아로401번길 31 (하남동) |
5th row | 서구 금화로85번길 4-24 (금호동) |
Value | Count | Frequency (%) |
서구 | 5 | 12.5% |
북구 | 4 | 10.0% |
치평동 | 3 | 7.5% |
광산구 | 2 | 5.0% |
상무평화로 | 2 | 5.0% |
설죽로217번길 | 1 | 2.5% |
154 | 1 | 2.5% |
8 | 1 | 2.5% |
상무연하로 | 1 | 2.5% |
36(오룡동 | 1 | 2.5% |
Other values (19) | 19 |
Most occurring characters
Value | Count | Frequency (%) |
29 | 13.9% | |
( | 11 | 5.3% |
구 | 11 | 5.3% |
로 | 11 | 5.3% |
) | 11 | 5.3% |
동 | 11 | 5.3% |
1 | 9 | 4.3% |
번 | 7 | 3.3% |
서 | 6 | 2.9% |
4 | 6 | 2.9% |
Other values (41) | 97 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 110 | |
Decimal Number | 44 | 21.1% |
Space Separator | 29 | 13.9% |
Open Punctuation | 11 | 5.3% |
Close Punctuation | 11 | 5.3% |
Dash Punctuation | 4 | 1.9% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
구 | 11 | 10.0% |
로 | 11 | 10.0% |
동 | 11 | 10.0% |
번 | 7 | 6.4% |
서 | 6 | 5.5% |
길 | 6 | 5.5% |
무 | 5 | 4.5% |
평 | 5 | 4.5% |
치 | 4 | 3.6% |
상 | 4 | 3.6% |
Other values (27) | 40 |
Decimal Number
Value | Count | Frequency (%) |
1 | 9 | |
4 | 6 | |
3 | 5 | |
8 | 5 | |
2 | 4 | |
5 | 4 | |
7 | 3 | 6.8% |
0 | 3 | 6.8% |
6 | 3 | 6.8% |
9 | 2 | 4.5% |
Space Separator
Value | Count | Frequency (%) |
29 |
Open Punctuation
Value | Count | Frequency (%) |
( | 11 |
Close Punctuation
Value | Count | Frequency (%) |
) | 11 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 110 | |
Common | 99 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
구 | 11 | 10.0% |
로 | 11 | 10.0% |
동 | 11 | 10.0% |
번 | 7 | 6.4% |
서 | 6 | 5.5% |
길 | 6 | 5.5% |
무 | 5 | 4.5% |
평 | 5 | 4.5% |
치 | 4 | 3.6% |
상 | 4 | 3.6% |
Other values (27) | 40 |
Common
Value | Count | Frequency (%) |
29 | ||
( | 11 | 11.1% |
) | 11 | 11.1% |
1 | 9 | 9.1% |
4 | 6 | 6.1% |
3 | 5 | 5.1% |
8 | 5 | 5.1% |
2 | 4 | 4.0% |
5 | 4 | 4.0% |
- | 4 | 4.0% |
Other values (4) | 11 | 11.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 110 | |
ASCII | 99 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
29 | ||
( | 11 | 11.1% |
) | 11 | 11.1% |
1 | 9 | 9.1% |
4 | 6 | 6.1% |
3 | 5 | 5.1% |
8 | 5 | 5.1% |
2 | 4 | 4.0% |
5 | 4 | 4.0% |
- | 4 | 4.0% |
Other values (4) | 11 | 11.1% |
Hangul
Value | Count | Frequency (%) |
구 | 11 | 10.0% |
로 | 11 | 10.0% |
동 | 11 | 10.0% |
번 | 7 | 6.4% |
서 | 6 | 5.5% |
길 | 6 | 5.5% |
무 | 5 | 4.5% |
평 | 5 | 4.5% |
치 | 4 | 3.6% |
상 | 4 | 3.6% |
Other values (27) | 40 |
객실수
Real number (ℝ)
MISSING
 
Distinct | 8 |
---|---|
Distinct (%) | 72.7% |
Missing | 9989 |
Missing (%) | 99.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 37.090909 |
Minimum | 30 |
---|---|
Maximum | 48 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 30 |
---|---|
5-th percentile | 30.5 |
Q1 | 32.5 |
median | 36 |
Q3 | 40 |
95-th percentile | 45 |
Maximum | 48 |
Range | 18 |
Interquartile range (IQR) | 7.5 |
Descriptive statistics
Standard deviation | 5.4855181 |
---|---|
Coefficient of variation (CV) | 0.14789387 |
Kurtosis | -0.1540749 |
Mean | 37.090909 |
Median Absolute Deviation (MAD) | 4 |
Skewness | 0.52196395 |
Sum | 408 |
Variance | 30.090909 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
40 | 3 | < 0.1% |
36 | 2 | < 0.1% |
42 | 1 | < 0.1% |
32 | 1 | < 0.1% |
48 | 1 | < 0.1% |
30 | 1 | < 0.1% |
31 | 1 | < 0.1% |
33 | 1 | < 0.1% |
(Missing) | 9989 |
Value | Count | Frequency (%) |
30 | 1 | < 0.1% |
31 | 1 | < 0.1% |
32 | 1 | < 0.1% |
33 | 1 | < 0.1% |
36 | 2 | |
40 | 3 | |
42 | 1 | < 0.1% |
48 | 1 | < 0.1% |
Value | Count | Frequency (%) |
48 | 1 | < 0.1% |
42 | 1 | < 0.1% |
40 | 3 | |
36 | 2 | |
33 | 1 | < 0.1% |
32 | 1 | < 0.1% |
31 | 1 | < 0.1% |
30 | 1 | < 0.1% |
연번 | 자치구 | 업 소 명 | 소 재 지 | 객실수 | |
---|---|---|---|---|---|
연번 | 1.000 | 1.000 | 1.000 | 1.000 | 0.525 |
자치구 | 1.000 | 1.000 | 1.000 | 1.000 | 0.740 |
업 소 명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
소 재 지 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
객실수 | 0.525 | 0.740 | 1.000 | 1.000 | 1.000 |
연번 | 객실수 | 자치구 | |
---|---|---|---|
연번 | 1.000 | 0.326 | 0.707 |
객실수 | 0.326 | 1.000 | 0.250 |
자치구 | 0.707 | 0.250 | 1.000 |
연번 | 자치구 | 업 소 명 | 소 재 지 | 객실수 | |
---|---|---|---|---|---|
35413 | <NA> | <NA> | <NA> | <NA> | <NA> |
28049 | <NA> | <NA> | <NA> | <NA> | <NA> |
78654 | <NA> | <NA> | <NA> | <NA> | <NA> |
8770 | <NA> | <NA> | <NA> | <NA> | <NA> |
19934 | <NA> | <NA> | <NA> | <NA> | <NA> |
27884 | <NA> | <NA> | <NA> | <NA> | <NA> |
93443 | <NA> | <NA> | <NA> | <NA> | <NA> |
34793 | <NA> | <NA> | <NA> | <NA> | <NA> |
62723 | <NA> | <NA> | <NA> | <NA> | <NA> |
83351 | <NA> | <NA> | <NA> | <NA> | <NA> |
연번 | 자치구 | 업 소 명 | 소 재 지 | 객실수 | |
---|---|---|---|---|---|
35490 | <NA> | <NA> | <NA> | <NA> | <NA> |
59438 | <NA> | <NA> | <NA> | <NA> | <NA> |
83069 | <NA> | <NA> | <NA> | <NA> | <NA> |
89447 | <NA> | <NA> | <NA> | <NA> | <NA> |
13427 | <NA> | <NA> | <NA> | <NA> | <NA> |
5331 | <NA> | <NA> | <NA> | <NA> | <NA> |
98753 | <NA> | <NA> | <NA> | <NA> | <NA> |
95954 | <NA> | <NA> | <NA> | <NA> | <NA> |
64283 | <NA> | <NA> | <NA> | <NA> | <NA> |
71192 | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
연번 | 자치구 | 업 소 명 | 소 재 지 | 객실수 | # duplicates | |
---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | 9989 |