Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 2071 |
Duplicate rows (%) | 20.7% |
Total size in memory | 576.2 KiB |
Average record size in memory | 59.0 B |
Variable types
DateTime | 1 |
---|---|
Categorical | 1 |
Text | 1 |
Numeric | 3 |
Dataset
Description | 보흔휴양원에서 개방하는 객실 파손비품 내역 데이터로 파손일자, 파손구분, 파손비품, 비품단가, 비품수량, 파손금액이 포함된 데이터입니다. |
---|---|
URL | https://www.data.go.kr/data/15117117/fileData.do |
Dataset has 2071 (20.7%) duplicate rows | Duplicates |
비품단가 is highly overall correlated with 파손금액 | High correlation |
비품수량 is highly overall correlated with 파손금액 | High correlation |
파손금액 is highly overall correlated with 비품단가 and 1 other fields | High correlation |
파손구분 is highly imbalanced (73.6%) | Imbalance |
비품단가 is highly skewed (γ1 = 42.70658099) | Skewed |
Reproduction
Analysis started | 2023-12-12 00:44:37.997162 |
---|---|
Analysis finished | 2023-12-12 00:44:40.130244 |
Duration | 2.13 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
파손일자
Date
Distinct | 4409 |
---|---|
Distinct (%) | 44.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 1997-09-08 00:00:00 |
---|---|
Maximum | 2023-07-24 00:00:00 |
파손구분
Categorical
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
추가비품 | |
---|---|
파손비품 | 449 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 추가비품 |
---|---|
2nd row | 추가비품 |
3rd row | 파손비품 |
4th row | 추가비품 |
5th row | 추가비품 |
Common Values
Value | Count | Frequency (%) |
추가비품 | 9551 | |
파손비품 | 449 | 4.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
추가비품 | 9551 | |
파손비품 | 449 | 4.5% |
파손비품
Text
Distinct | 53 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
침구류 | 9551 | |
커피잔세트 | 117 | 1.2% |
물컵 | 33 | 0.3% |
밥공기 | 33 | 0.3% |
소주잔 | 33 | 0.3% |
접시 | 29 | 0.3% |
찬그릇 | 24 | 0.2% |
슬리퍼 | 20 | 0.2% |
국그릇 | 19 | 0.2% |
냉장고 | 13 | 0.1% |
Other values (46) | 143 | 1.4% |
Most occurring characters
Value | Count | Frequency (%) |
침 | 9564 | |
구 | 9555 | |
류 | 9551 | |
잔 | 150 | 0.5% |
세 | 119 | 0.4% |
트 | 119 | 0.4% |
커 | 118 | 0.4% |
피 | 117 | 0.4% |
기 | 45 | 0.1% |
릇 | 43 | 0.1% |
Other values (104) | 889 | 2.9% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 30235 | |
Space Separator | 15 | < 0.1% |
Open Punctuation | 8 | < 0.1% |
Close Punctuation | 8 | < 0.1% |
Lowercase Letter | 4 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
침 | 9564 | |
구 | 9555 | |
류 | 9551 | |
잔 | 150 | 0.5% |
세 | 119 | 0.4% |
트 | 119 | 0.4% |
커 | 118 | 0.4% |
피 | 117 | 0.4% |
기 | 45 | 0.1% |
릇 | 43 | 0.1% |
Other values (99) | 854 | 2.8% |
Lowercase Letter
Value | Count | Frequency (%) |
v | 2 | |
t | 2 |
Space Separator
Value | Count | Frequency (%) |
15 |
Open Punctuation
Value | Count | Frequency (%) |
( | 8 |
Close Punctuation
Value | Count | Frequency (%) |
) | 8 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 30231 | |
Common | 31 | 0.1% |
Han | 4 | < 0.1% |
Latin | 4 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
침 | 9564 | |
구 | 9555 | |
류 | 9551 | |
잔 | 150 | 0.5% |
세 | 119 | 0.4% |
트 | 119 | 0.4% |
커 | 118 | 0.4% |
피 | 117 | 0.4% |
기 | 45 | 0.1% |
릇 | 43 | 0.1% |
Other values (98) | 850 | 2.8% |
Common
Value | Count | Frequency (%) |
15 | ||
( | 8 | |
) | 8 |
Latin
Value | Count | Frequency (%) |
v | 2 | |
t | 2 |
Han
Value | Count | Frequency (%) |
中 | 4 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 30231 | |
ASCII | 35 | 0.1% |
CJK | 4 | < 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
침 | 9564 | |
구 | 9555 | |
류 | 9551 | |
잔 | 150 | 0.5% |
세 | 119 | 0.4% |
트 | 119 | 0.4% |
커 | 118 | 0.4% |
피 | 117 | 0.4% |
기 | 45 | 0.1% |
릇 | 43 | 0.1% |
Other values (98) | 850 | 2.8% |
ASCII
Value | Count | Frequency (%) |
15 | ||
( | 8 | |
) | 8 | |
v | 2 | 5.7% |
t | 2 | 5.7% |
CJK
Value | Count | Frequency (%) |
中 | 4 |
비품단가
Real number (ℝ)
HIGH CORRELATION
  SKEWED
 
Distinct | 31 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3344.183 |
Minimum | 550 |
---|---|
Maximum | 200000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 550 |
---|---|
5-th percentile | 2000 |
Q1 | 2000 |
median | 3000 |
Q3 | 5000 |
95-th percentile | 5000 |
Maximum | 200000 |
Range | 199450 |
Interquartile range (IQR) | 3000 |
Descriptive statistics
Standard deviation | 2662.7256 |
---|---|
Coefficient of variation (CV) | 0.79622603 |
Kurtosis | 3011.1489 |
Mean | 3344.183 |
Median Absolute Deviation (MAD) | 1000 |
Skewness | 42.706581 |
Sum | 33441830 |
Variance | 7090107.4 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2000 | 4311 | |
5000 | 3434 | |
3000 | 2000 | |
1000 | 64 | 0.6% |
4000 | 46 | 0.5% |
10000 | 38 | 0.4% |
6000 | 18 | 0.2% |
1650 | 14 | 0.1% |
2420 | 11 | 0.1% |
2200 | 10 | 0.1% |
Other values (21) | 54 | 0.5% |
Value | Count | Frequency (%) |
550 | 1 | < 0.1% |
660 | 1 | < 0.1% |
1000 | 64 | 0.6% |
1100 | 5 | 0.1% |
1650 | 14 | 0.1% |
2000 | 4311 | |
2200 | 10 | 0.1% |
2420 | 11 | 0.1% |
2500 | 3 | < 0.1% |
3000 | 2000 |
Value | Count | Frequency (%) |
200000 | 1 | < 0.1% |
60000 | 1 | < 0.1% |
46000 | 1 | < 0.1% |
35000 | 2 | < 0.1% |
33000 | 1 | < 0.1% |
30000 | 2 | < 0.1% |
29000 | 1 | < 0.1% |
20000 | 5 | |
19000 | 2 | < 0.1% |
14000 | 2 | < 0.1% |
비품수량
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 16 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.4719 |
Minimum | 1 |
---|---|
Maximum | 40 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 3 |
Maximum | 40 |
Range | 39 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.94037164 |
---|---|
Coefficient of variation (CV) | 0.63888283 |
Kurtosis | 380.7812 |
Mean | 1.4719 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 12.360625 |
Sum | 14719 |
Variance | 0.88429882 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 6464 | |
2 | 2762 | |
3 | 593 | 5.9% |
4 | 111 | 1.1% |
5 | 46 | 0.5% |
6 | 6 | 0.1% |
7 | 4 | < 0.1% |
10 | 3 | < 0.1% |
8 | 3 | < 0.1% |
17 | 2 | < 0.1% |
Other values (6) | 6 | 0.1% |
Value | Count | Frequency (%) |
1 | 6464 | |
2 | 2762 | |
3 | 593 | 5.9% |
4 | 111 | 1.1% |
5 | 46 | 0.5% |
6 | 6 | 0.1% |
7 | 4 | < 0.1% |
8 | 3 | < 0.1% |
10 | 3 | < 0.1% |
11 | 1 | < 0.1% |
Value | Count | Frequency (%) |
40 | 1 | < 0.1% |
28 | 1 | < 0.1% |
18 | 1 | < 0.1% |
17 | 2 | |
15 | 1 | < 0.1% |
14 | 1 | < 0.1% |
11 | 1 | < 0.1% |
10 | 3 | |
8 | 3 | |
7 | 4 |
파손금액
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 43 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4916.133 |
Minimum | 550 |
---|---|
Maximum | 200000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 550 |
---|---|
5-th percentile | 2000 |
Q1 | 2000 |
median | 4000 |
Q3 | 6000 |
95-th percentile | 10000 |
Maximum | 200000 |
Range | 199450 |
Interquartile range (IQR) | 4000 |
Descriptive statistics
Standard deviation | 4378.0003 |
---|---|
Coefficient of variation (CV) | 0.89053739 |
Kurtosis | 510.31011 |
Mean | 4916.133 |
Median Absolute Deviation (MAD) | 2000 |
Skewness | 14.444207 |
Sum | 49161330 |
Variance | 19166886 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2000 | 2979 | |
5000 | 2033 | |
10000 | 1245 | |
3000 | 1210 | |
4000 | 1009 | 10.1% |
6000 | 908 | 9.1% |
15000 | 186 | 1.9% |
9000 | 139 | 1.4% |
1000 | 61 | 0.6% |
8000 | 58 | 0.6% |
Other values (33) | 172 | 1.7% |
Value | Count | Frequency (%) |
550 | 1 | < 0.1% |
660 | 1 | < 0.1% |
1000 | 61 | 0.6% |
1100 | 5 | 0.1% |
1650 | 14 | 0.1% |
2000 | 2979 | |
2200 | 10 | 0.1% |
2420 | 11 | 0.1% |
2500 | 2 | < 0.1% |
3000 | 1210 |
Value | Count | Frequency (%) |
200000 | 1 | |
138000 | 1 | |
85000 | 1 | |
80000 | 1 | |
70000 | 1 | |
60000 | 1 | |
56000 | 1 | |
40000 | 1 | |
36000 | 1 | |
35000 | 2 |
파손구분 | 파손비품 | 비품단가 | 비품수량 | 파손금액 | |
---|---|---|---|---|---|
파손구분 | 1.000 | 1.000 | 0.207 | 0.000 | 0.078 |
파손비품 | 1.000 | 1.000 | 0.990 | 0.000 | 0.869 |
비품단가 | 0.207 | 0.990 | 1.000 | 0.000 | 0.876 |
비품수량 | 0.000 | 0.000 | 0.000 | 1.000 | 0.906 |
파손금액 | 0.078 | 0.869 | 0.876 | 0.906 | 1.000 |
비품단가 | 비품수량 | 파손금액 | 파손구분 | |
---|---|---|---|---|
비품단가 | 1.000 | 0.074 | 0.750 | 0.137 |
비품수량 | 0.074 | 1.000 | 0.694 | 0.000 |
파손금액 | 0.750 | 0.694 | 1.000 | 0.084 |
파손구분 | 0.137 | 0.000 | 0.084 | 1.000 |
파손일자 | 파손구분 | 파손비품 | 비품단가 | 비품수량 | 파손금액 | |
---|---|---|---|---|---|---|
10028 | 2009-12-25 | 추가비품 | 침구류 | 3000 | 2 | 6000 |
3989 | 1998-04-13 | 추가비품 | 침구류 | 2000 | 1 | 2000 |
18621 | 2019-08-18 | 파손비품 | 가위 | 2000 | 1 | 2000 |
9335 | 2010-11-05 | 추가비품 | 침구류 | 3000 | 1 | 3000 |
4479 | 1998-08-10 | 추가비품 | 침구류 | 2000 | 1 | 2000 |
9120 | 2010-10-16 | 추가비품 | 침구류 | 3000 | 2 | 6000 |
16747 | 2017-02-26 | 추가비품 | 침구류 | 5000 | 1 | 5000 |
10952 | 2011-08-15 | 추가비품 | 침구류 | 3000 | 2 | 6000 |
6400 | 2007-09-02 | 추가비품 | 침구류 | 2000 | 1 | 2000 |
15312 | 2015-08-07 | 추가비품 | 침구류 | 5000 | 1 | 5000 |
파손일자 | 파손구분 | 파손비품 | 비품단가 | 비품수량 | 파손금액 | |
---|---|---|---|---|---|---|
15709 | 2016-12-30 | 추가비품 | 침구류 | 5000 | 1 | 5000 |
14691 | 2014-08-30 | 추가비품 | 침구류 | 5000 | 2 | 10000 |
3114 | 1999-11-06 | 추가비품 | 침구류 | 2000 | 1 | 2000 |
4821 | 2005-09-26 | 추가비품 | 침구류 | 2000 | 1 | 2000 |
16918 | 2018-04-17 | 추가비품 | 침구류 | 5000 | 1 | 5000 |
3583 | 1998-06-11 | 추가비품 | 침구류 | 2000 | 1 | 2000 |
12979 | 2014-05-24 | 추가비품 | 침구류 | 5000 | 1 | 5000 |
11525 | 2012-01-13 | 추가비품 | 침구류 | 3000 | 1 | 3000 |
12184 | 2013-10-26 | 추가비품 | 침구류 | 5000 | 1 | 5000 |
18374 | 2019-11-08 | 추가비품 | 침구류 | 5000 | 1 | 5000 |
Most frequently occurring
파손일자 | 파손구분 | 파손비품 | 비품단가 | 비품수량 | 파손금액 | # duplicates | |
---|---|---|---|---|---|---|---|
65 | 1998-06-11 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 19 |
170 | 1999-04-27 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 17 |
299 | 2000-06-20 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 17 |
54 | 1998-05-06 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 16 |
114 | 1998-10-20 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 16 |
75 | 1998-06-30 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 14 |
35 | 1998-02-24 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 12 |
50 | 1998-04-20 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 12 |
121 | 1998-10-30 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 12 |
45 | 1998-04-07 | 추가비품 | 침구류 | 2000 | 1 | 2000 | 11 |