Dataset statistics
Number of variables | 8 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 6.7 KiB |
Average record size in memory | 68.3 B |
Variable types
Categorical | 6 |
---|---|
Text | 1 |
Numeric | 1 |
Dataset
Description | Sample |
---|---|
Author | 레드타이 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=3a5121c0-484d-11ec-9c54-b54b4d3d7cd0 |
dmn_nm has constant value "" | Constant |
convrs_partn_cl_nm has constant value "" | Constant |
base_de has constant value "" | Constant |
chatbot_ctgry_nm is highly overall correlated with hotel_grad_no and 1 other fields | High correlation |
hotel_grad_no is highly overall correlated with chatbot_ctgry_nm and 1 other fields | High correlation |
area_nm is highly overall correlated with chatbot_ctgry_nm and 1 other fields | High correlation |
chatbot_ctgry_nm is highly imbalanced (80.6%) | Imbalance |
hotel_grad_no is highly imbalanced (80.6%) | Imbalance |
area_nm is highly imbalanced (80.6%) | Imbalance |
convrs_ctgry_nm has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 10:14:48.531862 |
---|---|
Analysis finished | 2023-12-10 10:14:49.584055 |
Duration | 1.05 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
dmn_nm
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
숙박 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 숙박 |
---|---|
2nd row | 숙박 |
3rd row | 숙박 |
4th row | 숙박 |
5th row | 숙박 |
Common Values
Value | Count | Frequency (%) |
숙박 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
숙박 | 100 |
chatbot_ctgry_nm
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
호텔 | |
---|---|
펜션 | 3 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 호텔 |
---|---|
2nd row | 펜션 |
3rd row | 호텔 |
4th row | 호텔 |
5th row | 호텔 |
Common Values
Value | Count | Frequency (%) |
호텔 | 97 | |
펜션 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
호텔 | 97 | |
펜션 | 3 | 3.0% |
convrs_partn_cl_nm
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
고객 |
---|
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 고객 |
---|---|
2nd row | 고객 |
3rd row | 고객 |
4th row | 고객 |
5th row | 고객 |
Common Values
Value | Count | Frequency (%) |
고객 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
고객 | 100 |
convrs_ctgry_nm
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
선톡 | 1 | 1.0% |
편의점 | 1 | 1.0% |
운항정보 | 1 | 1.0% |
캐슬테라스 | 1 | 1.0% |
낙원 | 1 | 1.0% |
객실타입 | 1 | 1.0% |
액티비티 | 1 | 1.0% |
레스토랑조식 | 1 | 1.0% |
지도 | 1 | 1.0% |
룸서비스 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
스 | 30 | 6.1% |
메 | 12 | 2.4% |
레 | 10 | 2.0% |
랑 | 10 | 2.0% |
토 | 10 | 2.0% |
식 | 10 | 2.0% |
예 | 10 | 2.0% |
라 | 10 | 2.0% |
약 | 10 | 2.0% |
원 | 9 | 1.8% |
Other values (157) | 370 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 487 | |
Uppercase Letter | 3 | 0.6% |
Decimal Number | 1 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 30 | 6.2% |
메 | 12 | 2.5% |
레 | 10 | 2.1% |
랑 | 10 | 2.1% |
토 | 10 | 2.1% |
식 | 10 | 2.1% |
예 | 10 | 2.1% |
라 | 10 | 2.1% |
약 | 10 | 2.1% |
원 | 9 | 1.8% |
Other values (153) | 366 |
Uppercase Letter
Value | Count | Frequency (%) |
D | 1 | |
M | 1 | |
Z | 1 |
Decimal Number
Value | Count | Frequency (%) |
3 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 487 | |
Latin | 3 | 0.6% |
Common | 1 | 0.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 30 | 6.2% |
메 | 12 | 2.5% |
레 | 10 | 2.1% |
랑 | 10 | 2.1% |
토 | 10 | 2.1% |
식 | 10 | 2.1% |
예 | 10 | 2.1% |
라 | 10 | 2.1% |
약 | 10 | 2.1% |
원 | 9 | 1.8% |
Other values (153) | 366 |
Latin
Value | Count | Frequency (%) |
D | 1 | |
M | 1 | |
Z | 1 |
Common
Value | Count | Frequency (%) |
3 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 487 | |
ASCII | 4 | 0.8% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
스 | 30 | 6.2% |
메 | 12 | 2.5% |
레 | 10 | 2.1% |
랑 | 10 | 2.1% |
토 | 10 | 2.1% |
식 | 10 | 2.1% |
예 | 10 | 2.1% |
라 | 10 | 2.1% |
약 | 10 | 2.1% |
원 | 9 | 1.8% |
Other values (153) | 366 |
ASCII
Value | Count | Frequency (%) |
D | 1 | |
M | 1 | |
Z | 1 | |
3 | 1 |
text_fq_rt
Real number (ℝ)
Distinct | 38 |
---|---|
Distinct (%) | 38.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.9938 |
Minimum | 0.03 |
---|---|
Maximum | 79.57 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0.03 |
---|---|
5-th percentile | 0.03 |
Q1 | 0.04 |
median | 0.085 |
Q3 | 0.2025 |
95-th percentile | 0.903 |
Maximum | 79.57 |
Range | 79.54 |
Interquartile range (IQR) | 0.1625 |
Descriptive statistics
Standard deviation | 7.9448966 |
---|---|
Coefficient of variation (CV) | 7.9944622 |
Kurtosis | 99.590304 |
Mean | 0.9938 |
Median Absolute Deviation (MAD) | 0.045 |
Skewness | 9.9700045 |
Sum | 99.38 |
Variance | 63.121381 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.04 | 28 | |
0.05 | 8 | 8.0% |
0.03 | 6 | 6.0% |
0.13 | 4 | 4.0% |
0.12 | 4 | 4.0% |
0.08 | 4 | 4.0% |
0.11 | 3 | 3.0% |
0.06 | 3 | 3.0% |
0.1 | 3 | 3.0% |
0.16 | 3 | 3.0% |
Other values (28) | 34 |
Value | Count | Frequency (%) |
0.03 | 6 | 6.0% |
0.04 | 28 | |
0.05 | 8 | 8.0% |
0.06 | 3 | 3.0% |
0.07 | 1 | 1.0% |
0.08 | 4 | 4.0% |
0.09 | 2 | 2.0% |
0.1 | 3 | 3.0% |
0.11 | 3 | 3.0% |
0.12 | 4 | 4.0% |
Value | Count | Frequency (%) |
79.57 | 1 | |
2.82 | 1 | |
1.29 | 1 | |
1.27 | 1 | |
0.96 | 1 | |
0.9 | 1 | |
0.69 | 1 | |
0.57 | 1 | |
0.48 | 1 | |
0.46 | 1 |
hotel_grad_no
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
5 | |
---|---|
3 | 3 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 5 |
---|---|
2nd row | 3 |
3rd row | 5 |
4th row | 5 |
5th row | 5 |
Common Values
Value | Count | Frequency (%) |
5 | 97 | |
3 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
5 | 97 | |
3 | 3 | 3.0% |
area_nm
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
서울 | |
---|---|
서울 금천구 | 3 |
Length
Max length | 6 |
---|---|
Median length | 2 |
Mean length | 2.12 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 서울 |
---|---|
2nd row | 서울 금천구 |
3rd row | 서울 |
4th row | 서울 |
5th row | 서울 |
Common Values
Value | Count | Frequency (%) |
서울 | 97 | |
서울 금천구 | 3 | 3.0% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
서울 | 100 | |
금천구 | 3 | 2.9% |
base_de
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
20211031 |
---|
Length
Max length | 8 |
---|---|
Median length | 8 |
Mean length | 8 |
Min length | 8 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 20211031 |
---|---|
2nd row | 20211031 |
3rd row | 20211031 |
4th row | 20211031 |
5th row | 20211031 |
Common Values
Value | Count | Frequency (%) |
20211031 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
20211031 | 100 |
chatbot_ctgry_nm | convrs_ctgry_nm | text_fq_rt | hotel_grad_no | area_nm | |
---|---|---|---|---|---|
chatbot_ctgry_nm | 1.000 | 1.000 | 0.000 | 0.963 | 0.963 |
convrs_ctgry_nm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
text_fq_rt | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 |
hotel_grad_no | 0.963 | 1.000 | 0.000 | 1.000 | 0.963 |
area_nm | 0.963 | 1.000 | 0.000 | 0.963 | 1.000 |
area_nm | hotel_grad_no | chatbot_ctgry_nm | |
---|---|---|---|
area_nm | 1.000 | 0.826 | 0.826 |
hotel_grad_no | 0.826 | 1.000 | 0.826 |
chatbot_ctgry_nm | 0.826 | 0.826 | 1.000 |
text_fq_rt | chatbot_ctgry_nm | hotel_grad_no | area_nm | |
---|---|---|---|---|
text_fq_rt | 1.000 | 0.000 | 0.000 | 0.000 |
chatbot_ctgry_nm | 0.000 | 1.000 | 0.826 | 0.826 |
hotel_grad_no | 0.000 | 0.826 | 1.000 | 0.826 |
area_nm | 0.000 | 0.826 | 0.826 | 1.000 |
dmn_nm | chatbot_ctgry_nm | convrs_partn_cl_nm | convrs_ctgry_nm | text_fq_rt | hotel_grad_no | area_nm | base_de | |
---|---|---|---|---|---|---|---|---|
0 | 숙박 | 호텔 | 고객 | 선톡 | 79.57 | 5 | 서울 | 20211031 |
1 | 숙박 | 펜션 | 고객 | 청와삼대중식위치 | 0.43 | 3 | 서울 금천구 | 20211031 |
2 | 숙박 | 호텔 | 고객 | 체크인 | 0.04 | 5 | 서울 | 20211031 |
3 | 숙박 | 호텔 | 고객 | 부대시설 | 1.27 | 5 | 서울 | 20211031 |
4 | 숙박 | 호텔 | 고객 | 캐슬테라스요금 | 0.13 | 5 | 서울 | 20211031 |
5 | 숙박 | 호텔 | 고객 | 객실예약 | 2.82 | 5 | 서울 | 20211031 |
6 | 숙박 | 호텔 | 고객 | 투베드룸듀플렉스스위트 | 0.04 | 5 | 서울 | 20211031 |
7 | 숙박 | 펜션 | 고객 | 청와삼대조식예약 | 0.43 | 3 | 서울 금천구 | 20211031 |
8 | 숙박 | 호텔 | 고객 | 운동시설 | 0.21 | 5 | 서울 | 20211031 |
9 | 숙박 | 호텔 | 고객 | 레스토랑조식메뉴 | 0.12 | 5 | 서울 | 20211031 |
dmn_nm | chatbot_ctgry_nm | convrs_partn_cl_nm | convrs_ctgry_nm | text_fq_rt | hotel_grad_no | area_nm | base_de | |
---|---|---|---|---|---|---|---|---|
90 | 숙박 | 호텔 | 고객 | 캐슬테라스시간 | 0.06 | 5 | 서울 | 20211031 |
91 | 숙박 | 호텔 | 고객 | 로얄마일 | 0.05 | 5 | 서울 | 20211031 |
92 | 숙박 | 호텔 | 고객 | 상품권 | 0.05 | 5 | 서울 | 20211031 |
93 | 숙박 | 호텔 | 고객 | 봉래헌메뉴 | 0.05 | 5 | 서울 | 20211031 |
94 | 숙박 | 호텔 | 고객 | 프로모션안내 | 0.05 | 5 | 서울 | 20211031 |
95 | 숙박 | 호텔 | 고객 | 낙원예약 | 0.04 | 5 | 서울 | 20211031 |
96 | 숙박 | 호텔 | 고객 | 낙원요금 | 0.04 | 5 | 서울 | 20211031 |
97 | 숙박 | 호텔 | 고객 | 이원메뉴 | 0.04 | 5 | 서울 | 20211031 |
98 | 숙박 | 호텔 | 고객 | 메이필드돌잔치 | 0.04 | 5 | 서울 | 20211031 |
99 | 숙박 | 호텔 | 고객 | 모닝콜 | 0.04 | 5 | 서울 | 20211031 |