Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.7 KiB |
Average record size in memory | 68.3 B |
Variable types
Categorical | 5 |
---|---|
Text | 1 |
Numeric | 2 |
Dataset
Description | Sample |
---|---|
Author | 레드타이 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=4b0fc890-484d-11ec-9c54-b54b4d3d7cd0 |
dmn_nm has constant value "" | Constant |
chatbot_ctgry_nm has constant value "" | Constant |
convrs_partn_cl_nm has constant value "" | Constant |
area_nm is highly overall correlated with hotel_grad_no | High correlation |
hotel_grad_no is highly overall correlated with area_nm | High correlation |
Reproduction
Analysis started | 2023-12-10 10:15:44.143453 |
---|---|
Analysis finished | 2023-12-10 10:15:45.686140 |
Duration | 1.54 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
dmn_nm
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
숙박 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 숙박 |
---|---|
2nd row | 숙박 |
3rd row | 숙박 |
4th row | 숙박 |
5th row | 숙박 |
Common Values
Value | Count | Frequency (%) |
숙박 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
숙박 | 100 |
chatbot_ctgry_nm
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
호텔 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 호텔 |
---|---|
2nd row | 호텔 |
3rd row | 호텔 |
4th row | 호텔 |
5th row | 호텔 |
Common Values
Value | Count | Frequency (%) |
호텔 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
호텔 | 100 |
convrs_partn_cl_nm
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
고객 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 고객 |
---|---|
2nd row | 고객 |
3rd row | 고객 |
4th row | 고객 |
5th row | 고객 |
Common Values
Value | Count | Frequency (%) |
고객 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
고객 | 100 |
convrs_ctgry_nm
Text
Distinct | 67 |
---|---|
Distinct (%) | 67.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
객실예약 | 7 | 7.0% |
기본정보 | 5 | 5.0% |
선톡 | 4 | 4.0% |
교통안내 | 3 | 3.0% |
디럭스더블 | 3 | 3.0% |
운항정보 | 3 | 3.0% |
객실타입 | 3 | 3.0% |
호텔주소 | 2 | 2.0% |
이원메뉴 | 2 | 2.0% |
욕조 | 2 | 2.0% |
Other values (57) | 66 |
Most occurring characters
Value | Count | Frequency (%) |
스 | 30 | 5.7% |
실 | 17 | 3.2% |
객 | 16 | 3.0% |
정 | 16 | 3.0% |
트 | 15 | 2.8% |
룸 | 13 | 2.5% |
타 | 13 | 2.5% |
원 | 11 | 2.1% |
기 | 11 | 2.1% |
메 | 10 | 1.9% |
Other values (130) | 377 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 515 | |
Decimal Number | 6 | 1.1% |
Lowercase Letter | 6 | 1.1% |
Open Punctuation | 1 | 0.2% |
Close Punctuation | 1 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 30 | 5.8% |
실 | 17 | 3.3% |
객 | 16 | 3.1% |
정 | 16 | 3.1% |
트 | 15 | 2.9% |
룸 | 13 | 2.5% |
타 | 13 | 2.5% |
원 | 11 | 2.1% |
기 | 11 | 2.1% |
메 | 10 | 1.9% |
Other values (118) | 363 |
Lowercase Letter
Value | Count | Frequency (%) |
a | 1 | |
i | 1 | |
r | 1 | |
e | 1 | |
s | 1 | |
t | 1 |
Decimal Number
Value | Count | Frequency (%) |
1 | 3 | |
9 | 1 | 16.7% |
2 | 1 | 16.7% |
5 | 1 | 16.7% |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 515 | |
Common | 8 | 1.5% |
Latin | 6 | 1.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 30 | 5.8% |
실 | 17 | 3.3% |
객 | 16 | 3.1% |
정 | 16 | 3.1% |
트 | 15 | 2.9% |
룸 | 13 | 2.5% |
타 | 13 | 2.5% |
원 | 11 | 2.1% |
기 | 11 | 2.1% |
메 | 10 | 1.9% |
Other values (118) | 363 |
Common
Value | Count | Frequency (%) |
1 | 3 | |
9 | 1 | 12.5% |
2 | 1 | 12.5% |
5 | 1 | 12.5% |
( | 1 | 12.5% |
) | 1 | 12.5% |
Latin
Value | Count | Frequency (%) |
a | 1 | |
i | 1 | |
r | 1 | |
e | 1 | |
s | 1 | |
t | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 515 | |
ASCII | 14 | 2.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
스 | 30 | 5.8% |
실 | 17 | 3.3% |
객 | 16 | 3.1% |
정 | 16 | 3.1% |
트 | 15 | 2.9% |
룸 | 13 | 2.5% |
타 | 13 | 2.5% |
원 | 11 | 2.1% |
기 | 11 | 2.1% |
메 | 10 | 1.9% |
Other values (118) | 363 |
ASCII
Value | Count | Frequency (%) |
1 | 3 | |
9 | 1 | 7.1% |
a | 1 | 7.1% |
i | 1 | 7.1% |
r | 1 | 7.1% |
e | 1 | 7.1% |
s | 1 | 7.1% |
t | 1 | 7.1% |
2 | 1 | 7.1% |
5 | 1 | 7.1% |
Other values (2) | 2 |
text_fq_rt
Real number (ℝ)
Distinct | 25 |
---|---|
Distinct (%) | 25.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 17.7055 |
Minimum | 0.5 |
---|---|
Maximum | 100 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0.5 |
---|---|
5-th percentile | 2.44 |
Q1 | 5.41 |
median | 9.09 |
Q3 | 20 |
95-th percentile | 50 |
Maximum | 100 |
Range | 99.5 |
Interquartile range (IQR) | 14.59 |
Descriptive statistics
Standard deviation | 20.372779 |
---|---|
Coefficient of variation (CV) | 1.1506469 |
Kurtosis | 6.2586445 |
Mean | 17.7055 |
Median Absolute Deviation (MAD) | 5.09 |
Skewness | 2.3590165 |
Sum | 1770.55 |
Variance | 415.05014 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9.09 | 13 | 13.0% |
4.0 | 10 | 10.0% |
50.0 | 9 | 9.0% |
14.29 | 7 | 7.0% |
2.44 | 6 | 6.0% |
20.0 | 5 | 5.0% |
8.0 | 5 | 5.0% |
5.41 | 4 | 4.0% |
10.0 | 4 | 4.0% |
6.1 | 4 | 4.0% |
Other values (15) | 33 |
Value | Count | Frequency (%) |
0.5 | 2 | 2.0% |
2.44 | 6 | |
3.66 | 1 | 1.0% |
4.0 | 10 | |
4.88 | 4 | 4.0% |
5.41 | 4 | 4.0% |
6.1 | 4 | 4.0% |
7.32 | 1 | 1.0% |
8.0 | 5 | |
8.11 | 3 | 3.0% |
Value | Count | Frequency (%) |
100.0 | 3 | 3.0% |
50.0 | 9 | |
45.45 | 1 | 1.0% |
40.0 | 2 | 2.0% |
33.33 | 4 | |
32.43 | 1 | 1.0% |
25.0 | 2 | 2.0% |
22.22 | 1 | 1.0% |
20.0 | 5 | |
18.18 | 2 | 2.0% |
hotel_grad_no
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 3.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
4 | |
---|---|
5 | |
3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 5 |
---|---|
2nd row | 5 |
3rd row | 5 |
4th row | 5 |
5th row | 5 |
Common Values
Value | Count | Frequency (%) |
4 | 48 | |
5 | 45 | |
3 | 7 | 7.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
4 | 48 | |
5 | 45 | |
3 | 7 | 7.0% |
area_nm
Categorical
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
서울 | |
---|---|
서울 강남구 | |
서울 중구 | |
인천 | |
제주 | 3 |
Length
Max length | 6 |
---|---|
Median length | 2 |
Mean length | 3.6 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
서울 | 42 | |
서울 강남구 | 25 | |
서울 중구 | 20 | |
인천 | 7 | 7.0% |
제주 | 3 | 3.0% |
부산 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
서울 | 87 | |
강남구 | 25 | 17.2% |
중구 | 20 | 13.8% |
인천 | 7 | 4.8% |
제주 | 3 | 2.1% |
부산 | 3 | 2.1% |
base_de
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 6.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 20210881 |
Minimum | 20210430 |
---|---|
Maximum | 20211231 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 20210430 |
---|---|
5-th percentile | 20210430 |
Q1 | 20210430 |
median | 20211031 |
Q3 | 20211130 |
95-th percentile | 20211231 |
Maximum | 20211231 |
Range | 801 |
Interquartile range (IQR) | 700 |
Descriptive statistics
Standard deviation | 318.46146 |
---|---|
Coefficient of variation (CV) | 1.5756931 × 10-5 |
Kurtosis | -1.4791617 |
Mean | 20210881 |
Median Absolute Deviation (MAD) | 200 |
Skewness | -0.44602783 |
Sum | 2.0210881 × 109 |
Variance | 101417.7 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
20211130 | 28 | |
20210430 | 28 | |
20211231 | 18 | |
20210731 | 12 | |
20210930 | 9 | 9.0% |
20211031 | 5 | 5.0% |
Value | Count | Frequency (%) |
20210430 | 28 | |
20210731 | 12 | |
20210930 | 9 | 9.0% |
20211031 | 5 | 5.0% |
20211130 | 28 | |
20211231 | 18 |
Value | Count | Frequency (%) |
20211231 | 18 | |
20211130 | 28 | |
20211031 | 5 | 5.0% |
20210930 | 9 | 9.0% |
20210731 | 12 | |
20210430 | 28 |
convrs_ctgry_nm | text_fq_rt | hotel_grad_no | area_nm | base_de | |
---|---|---|---|---|---|
convrs_ctgry_nm | 1.000 | 0.000 | 0.000 | 0.000 | 0.657 |
text_fq_rt | 0.000 | 1.000 | 0.000 | 0.748 | 0.535 |
hotel_grad_no | 0.000 | 0.000 | 1.000 | 1.000 | 0.405 |
area_nm | 0.000 | 0.748 | 1.000 | 1.000 | 0.640 |
base_de | 0.657 | 0.535 | 0.405 | 0.640 | 1.000 |
area_nm | hotel_grad_no | |
---|---|---|
area_nm | 1.000 | 0.984 |
hotel_grad_no | 0.984 | 1.000 |
text_fq_rt | base_de | hotel_grad_no | area_nm | |
---|---|---|---|---|
text_fq_rt | 1.000 | 0.254 | 0.000 | 0.357 |
base_de | 0.254 | 1.000 | 0.301 | 0.428 |
hotel_grad_no | 0.000 | 0.301 | 1.000 | 0.984 |
area_nm | 0.357 | 0.428 | 0.984 | 1.000 |
dmn_nm | chatbot_ctgry_nm | convrs_partn_cl_nm | convrs_ctgry_nm | text_fq_rt | hotel_grad_no | area_nm | base_de | |
---|---|---|---|---|---|---|---|---|
0 | 숙박 | 호텔 | 고객 | 레스토랑석식 | 9.09 | 5 | 서울 | 20211031 |
1 | 숙박 | 호텔 | 고객 | 셔틀버스 | 9.09 | 5 | 서울 | 20211031 |
2 | 숙박 | 호텔 | 고객 | 교통안내 | 18.18 | 5 | 서울 | 20211031 |
3 | 숙박 | 호텔 | 고객 | 선톡 | 45.45 | 5 | 서울 | 20211031 |
4 | 숙박 | 호텔 | 고객 | 지도 | 18.18 | 5 | 서울 | 20211031 |
5 | 숙박 | 호텔 | 고객 | 의료시설 | 100.0 | 5 | 제주 | 20210731 |
6 | 숙박 | 호텔 | 고객 | 투베드룸듀플렉스스위트 | 11.11 | 5 | 서울 | 20210731 |
7 | 숙박 | 호텔 | 고객 | 투베드룸듀플렉스스위트객실크기 | 33.33 | 5 | 서울 | 20210731 |
8 | 숙박 | 호텔 | 고객 | 투베드룸듀플렉스스위트객실전망 | 22.22 | 5 | 서울 | 20210731 |
9 | 숙박 | 호텔 | 고객 | 투베드룸듀플렉스스위트침대타입및정원 | 11.11 | 5 | 서울 | 20210731 |
dmn_nm | chatbot_ctgry_nm | convrs_partn_cl_nm | convrs_ctgry_nm | text_fq_rt | hotel_grad_no | area_nm | base_de | |
---|---|---|---|---|---|---|---|---|
90 | 숙박 | 호텔 | 고객 | 객실수 | 4.88 | 4 | 서울 강남구 | 20210430 |
91 | 숙박 | 호텔 | 고객 | 프리미어펫룸 | 4.88 | 4 | 서울 강남구 | 20210430 |
92 | 숙박 | 호텔 | 고객 | 프리미어펫룸침대타입및정원 | 4.88 | 4 | 서울 강남구 | 20210430 |
93 | 숙박 | 호텔 | 고객 | 다른객실보기 | 3.66 | 4 | 서울 강남구 | 20210430 |
94 | 숙박 | 호텔 | 고객 | 기본정보 | 2.44 | 4 | 서울 강남구 | 20210430 |
95 | 숙박 | 호텔 | 고객 | 디럭스트윈 | 2.44 | 4 | 서울 강남구 | 20210430 |
96 | 숙박 | 호텔 | 고객 | 디럭스트윈객실전망 | 2.44 | 4 | 서울 강남구 | 20210430 |
97 | 숙박 | 호텔 | 고객 | 디럭스트윈침대타입및정원 | 2.44 | 4 | 서울 강남구 | 20210430 |
98 | 숙박 | 호텔 | 고객 | 디럭스패밀리트윈 | 2.44 | 4 | 서울 강남구 | 20210430 |
99 | 숙박 | 호텔 | 고객 | 디럭스패밀리트윈침대타입및정원 | 2.44 | 4 | 서울 강남구 | 20210430 |