Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 959 |
Duplicate rows (%) | 9.6% |
Total size in memory | 742.2 KiB |
Average record size in memory | 76.0 B |
Variable types
Numeric | 3 |
---|---|
Categorical | 2 |
Boolean | 1 |
DateTime | 2 |
Dataset
Description | 연수번호, 호텔순번, 객실번호, 남녀구분 등 연수에 참가했던 인원의 숙박에 대한 정보 항목을 제공하기 위한 데이터 자료입니다. |
---|---|
URL | https://www.data.go.kr/data/15042286/fileData.do |
독실여부 has constant value "" | Constant |
Dataset has 959 (9.6%) duplicate rows | Duplicates |
객실번호 is highly overall correlated with 수용인원 | High correlation |
수용인원 is highly overall correlated with 객실번호 | High correlation |
신청인원 has 5418 (54.2%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 04:15:55.908788 |
---|---|
Analysis finished | 2023-12-12 04:15:58.232994 |
Duration | 2.32 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
호텔고유번호
Real number (ℝ)
Distinct | 28 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 261.6876 |
Minimum | 40 |
---|---|
Maximum | 840 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 40 |
---|---|
5-th percentile | 100 |
Q1 | 100 |
median | 160 |
Q3 | 480 |
95-th percentile | 520 |
Maximum | 840 |
Range | 800 |
Interquartile range (IQR) | 380 |
Descriptive statistics
Standard deviation | 179.09853 |
---|---|
Coefficient of variation (CV) | 0.68439825 |
Kurtosis | -0.39054226 |
Mean | 261.6876 |
Median Absolute Deviation (MAD) | 60 |
Skewness | 0.8067231 |
Sum | 2616876 |
Variance | 32076.285 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
100 | 3236 | |
500 | 1238 | 12.4% |
160 | 1119 | 11.2% |
480 | 996 | 10.0% |
280 | 769 | 7.7% |
120 | 490 | 4.9% |
302 | 489 | 4.9% |
180 | 351 | 3.5% |
240 | 216 | 2.2% |
800 | 165 | 1.7% |
Other values (18) | 931 | 9.3% |
Value | Count | Frequency (%) |
40 | 151 | 1.5% |
100 | 3236 | |
120 | 490 | 4.9% |
140 | 8 | 0.1% |
160 | 1119 | 11.2% |
180 | 351 | 3.5% |
200 | 10 | 0.1% |
220 | 67 | 0.7% |
240 | 216 | 2.2% |
280 | 769 | 7.7% |
Value | Count | Frequency (%) |
840 | 4 | < 0.1% |
800 | 165 | 1.7% |
660 | 21 | 0.2% |
640 | 14 | 0.1% |
620 | 31 | 0.3% |
600 | 59 | 0.6% |
540 | 78 | 0.8% |
520 | 155 | 1.6% |
501 | 18 | 0.2% |
500 | 1238 |
객실번호
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 391 |
---|---|
Distinct (%) | 3.9% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 43.2837 |
Minimum | 1 |
---|---|
Maximum | 600 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 13 |
median | 27 |
Q3 | 50 |
95-th percentile | 132 |
Maximum | 600 |
Range | 599 |
Interquartile range (IQR) | 37 |
Descriptive statistics
Standard deviation | 62.874097 |
---|---|
Coefficient of variation (CV) | 1.4526045 |
Kurtosis | 30.644257 |
Mean | 43.2837 |
Median Absolute Deviation (MAD) | 17 |
Skewness | 4.8566618 |
Sum | 432837 |
Variance | 3953.1521 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
6 | 218 | 2.2% |
5 | 215 | 2.1% |
18 | 211 | 2.1% |
19 | 208 | 2.1% |
10 | 208 | 2.1% |
4 | 208 | 2.1% |
16 | 208 | 2.1% |
9 | 207 | 2.1% |
13 | 205 | 2.1% |
1 | 205 | 2.1% |
Other values (381) | 7907 |
Value | Count | Frequency (%) |
1 | 205 | |
2 | 200 | |
3 | 198 | |
4 | 208 | |
5 | 215 | |
6 | 218 | |
7 | 200 | |
8 | 201 | |
9 | 207 | |
10 | 208 |
Value | Count | Frequency (%) |
600 | 1 | |
597 | 1 | |
596 | 1 | |
595 | 1 | |
593 | 1 | |
591 | 1 | |
590 | 1 | |
589 | 1 | |
587 | 1 | |
586 | 1 |
수용인원
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
2 | |
---|---|
1 | |
0 | 291 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
2 | 6107 | |
1 | 3602 | |
0 | 291 | 2.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2 | 6107 | |
1 | 3602 | |
0 | 291 | 2.9% |
신청인원
Real number (ℝ)
ZEROS
 
Distinct | 9 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.6344 |
Minimum | -6 |
---|---|
Maximum | 2 |
Zeros | 5418 |
Zeros (%) | 54.2% |
Negative | 35 |
Negative (%) | 0.4% |
Memory size | 166.0 KiB |
Quantile statistics
Minimum | -6 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 1 |
95-th percentile | 2 |
Maximum | 2 |
Range | 8 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 0.78879583 |
---|---|
Coefficient of variation (CV) | 1.243373 |
Kurtosis | -0.016734052 |
Mean | 0.6344 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.52997402 |
Sum | 6344 |
Variance | 0.62219886 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 5418 | |
1 | 2694 | |
2 | 1853 | 18.5% |
-1 | 25 | 0.2% |
-2 | 5 | 0.1% |
-3 | 2 | < 0.1% |
-6 | 1 | < 0.1% |
-5 | 1 | < 0.1% |
-4 | 1 | < 0.1% |
Value | Count | Frequency (%) |
-6 | 1 | < 0.1% |
-5 | 1 | < 0.1% |
-4 | 1 | < 0.1% |
-3 | 2 | < 0.1% |
-2 | 5 | 0.1% |
-1 | 25 | 0.2% |
0 | 5418 | |
1 | 2694 | |
2 | 1853 | 18.5% |
Value | Count | Frequency (%) |
2 | 1853 | 18.5% |
1 | 2694 | |
0 | 5418 | |
-1 | 25 | 0.2% |
-2 | 5 | 0.1% |
-3 | 2 | < 0.1% |
-4 | 1 | < 0.1% |
-5 | 1 | < 0.1% |
-6 | 1 | < 0.1% |
남녀구분
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
M | |
---|---|
F | |
<NA> |
Length
Max length | 4 |
---|---|
Median length | 1 |
Mean length | 1.1842 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | M |
---|---|
2nd row | F |
3rd row | M |
4th row | F |
5th row | M |
Common Values
Value | Count | Frequency (%) |
M | 4925 | |
F | 4461 | |
<NA> | 614 | 6.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
m | 4925 | |
f | 4461 | |
na | 614 | 6.1% |
독실여부
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 87.9 KiB |
False |
---|
Value | Count | Frequency (%) |
False | 10000 |
입력일
Date
Distinct | 454 |
---|---|
Distinct (%) | 4.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2007-02-28 00:00:00 |
---|---|
Maximum | 2023-07-03 00:00:00 |
수정일
Date
Distinct | 620 |
---|---|
Distinct (%) | 6.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Minimum | 2007-02-28 00:00:00 |
---|---|
Maximum | 2023-07-03 00:00:00 |
호텔고유번호 | 객실번호 | 수용인원 | 신청인원 | 남녀구분 | |
---|---|---|---|---|---|
호텔고유번호 | 1.000 | 0.329 | 0.598 | 0.423 | 0.015 |
객실번호 | 0.329 | 1.000 | 0.700 | 0.118 | 0.172 |
수용인원 | 0.598 | 0.700 | 1.000 | 0.547 | 0.025 |
신청인원 | 0.423 | 0.118 | 0.547 | 1.000 | 0.168 |
남녀구분 | 0.015 | 0.172 | 0.025 | 0.168 | 1.000 |
남녀구분 | 수용인원 | |
---|---|---|
남녀구분 | 1.000 | 0.016 |
수용인원 | 0.016 | 1.000 |
호텔고유번호 | 객실번호 | 신청인원 | 수용인원 | 남녀구분 | |
---|---|---|---|---|---|
호텔고유번호 | 1.000 | -0.293 | 0.334 | 0.323 | 0.015 |
객실번호 | -0.293 | 1.000 | -0.095 | 0.557 | 0.114 |
신청인원 | 0.334 | -0.095 | 1.000 | 0.286 | 0.126 |
수용인원 | 0.323 | 0.557 | 0.286 | 1.000 | 0.016 |
남녀구분 | 0.015 | 0.114 | 0.126 | 0.016 | 1.000 |
호텔고유번호 | 객실번호 | 수용인원 | 신청인원 | 남녀구분 | 독실여부 | 입력일 | 수정일 | |
---|---|---|---|---|---|---|---|---|
17819 | 520 | 16 | 2 | 1 | M | N | 2016-03-03 | 2016-03-03 |
13347 | 500 | 23 | 2 | 1 | F | N | 2014-05-13 | 2014-05-13 |
1735 | 100 | 41 | 2 | 1 | M | N | 2008-01-31 | 2008-01-31 |
90 | 100 | 86 | 2 | 0 | F | N | 2007-05-10 | 2007-05-10 |
19864 | 540 | 56 | 2 | 2 | M | N | 2016-04-18 | 2016-04-18 |
16379 | 480 | 12 | 2 | 2 | M | N | 2019-02-28 | 2019-02-28 |
6121 | 220 | 19 | 1 | 1 | M | N | 2009-07-22 | 2009-07-22 |
7365 | 100 | 49 | 2 | 0 | F | N | 2009-09-04 | 2009-09-04 |
19666 | 480 | 19 | 2 | 2 | M | N | 2014-05-27 | 2014-05-27 |
17583 | 480 | 7 | 1 | 1 | F | N | 2015-02-10 | 2015-02-10 |
호텔고유번호 | 객실번호 | 수용인원 | 신청인원 | 남녀구분 | 독실여부 | 입력일 | 수정일 | |
---|---|---|---|---|---|---|---|---|
15336 | 500 | 6 | 1 | 1 | F | N | 2018-04-27 | 2018-04-27 |
3717 | 180 | 538 | 0 | 0 | <NA> | N | 2008-05-02 | 2008-05-02 |
9238 | 100 | 8 | 1 | 0 | M | N | 2010-07-08 | 2010-07-08 |
5910 | 100 | 11 | 2 | -1 | M | N | 2009-04-20 | 2009-04-20 |
4993 | 100 | 33 | 1 | 0 | F | N | 2008-11-10 | 2008-11-10 |
4848 | 100 | 28 | 2 | 1 | M | N | 2008-10-10 | 2008-10-10 |
16682 | 480 | 13 | 2 | 1 | M | N | 2015-09-08 | 2015-10-08 |
11404 | 280 | 10 | 1 | 0 | F | N | 2011-12-19 | 2011-12-19 |
8013 | 240 | 20 | 2 | 1 | M | N | 2010-04-09 | 2010-04-09 |
5961 | 100 | 54 | 2 | 0 | F | N | 2009-05-11 | 2009-05-11 |
Most frequently occurring
호텔고유번호 | 객실번호 | 수용인원 | 신청인원 | 남녀구분 | 독실여부 | 입력일 | 수정일 | # duplicates | |
---|---|---|---|---|---|---|---|---|---|
253 | 100 | 36 | 1 | 0 | F | N | 2010-03-03 | 2010-03-03 | 7 |
144 | 100 | 19 | 2 | 0 | M | N | 2010-03-03 | 2010-03-03 | 6 |
277 | 100 | 39 | 1 | 0 | F | N | 2010-03-03 | 2010-03-03 | 6 |
322 | 100 | 46 | 2 | 0 | F | N | 2010-03-03 | 2010-03-03 | 6 |
355 | 100 | 50 | 2 | 0 | F | N | 2010-03-03 | 2010-03-03 | 6 |
35 | 100 | 4 | 1 | 0 | M | N | 2010-03-03 | 2010-03-03 | 5 |
37 | 100 | 4 | 1 | 0 | M | N | 2010-09-03 | 2010-09-03 | 5 |
44 | 100 | 5 | 1 | 0 | M | N | 2010-03-03 | 2010-03-03 | 5 |
55 | 100 | 6 | 1 | 0 | M | N | 2010-03-03 | 2010-03-03 | 5 |
71 | 100 | 8 | 1 | 0 | M | N | 2010-03-03 | 2010-03-03 | 5 |