Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 90 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.3 KiB |
Average record size in memory | 60.5 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 4 |
Text | 1 |
Dataset
Description | Sample |
---|---|
Author | 데이터마케팅코리아 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=741d68f0-1e55-11eb-a4e6-a9a03a61580b |
UPPER_CTGRY_NM has constant value "" | Constant |
LWPRT_CTGRY_NM has constant value "" | Constant |
SEQ_NO is highly overall correlated with ANALS_YM and 1 other fields | High correlation |
ANALS_YM is highly overall correlated with SEQ_NO | High correlation |
SRCHWRD_NM is highly overall correlated with SEQ_NO | High correlation |
SEQ_NO has unique values | Unique |
Reproduction
Analysis started | 2024-04-17 13:30:21.886963 |
---|---|
Analysis finished | 2024-04-17 13:30:22.794353 |
Duration | 0.91 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
SEQ_NO
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 90 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9009.0556 |
Minimum | 2506 |
---|---|
Maximum | 14212 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 942.0 B |
Quantile statistics
Minimum | 2506 |
---|---|
5-th percentile | 3247.1 |
Q1 | 8213.25 |
median | 8563 |
Q3 | 10691.75 |
95-th percentile | 13463.25 |
Maximum | 14212 |
Range | 11706 |
Interquartile range (IQR) | 2478.5 |
Descriptive statistics
Standard deviation | 2677.7181 |
---|---|
Coefficient of variation (CV) | 0.29722517 |
Kurtosis | 0.75461479 |
Mean | 9009.0556 |
Median Absolute Deviation (MAD) | 1565 |
Skewness | -0.53384585 |
Sum | 810815 |
Variance | 7170174 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2506 | 1 | 1.1% |
8254 | 1 | 1.1% |
8252 | 1 | 1.1% |
8251 | 1 | 1.1% |
10714 | 1 | 1.1% |
10713 | 1 | 1.1% |
10712 | 1 | 1.1% |
10711 | 1 | 1.1% |
10710 | 1 | 1.1% |
8575 | 1 | 1.1% |
Other values (80) | 80 |
Value | Count | Frequency (%) |
2506 | 1 | |
2507 | 1 | |
2508 | 1 | |
2509 | 1 | |
2510 | 1 | |
4148 | 1 | |
4149 | 1 | |
4150 | 1 | |
4151 | 1 | |
4152 | 1 |
Value | Count | Frequency (%) |
14212 | 1 | |
14211 | 1 | |
14210 | 1 | |
14209 | 1 | |
14208 | 1 | |
12553 | 1 | |
12552 | 1 | |
12551 | 1 | |
12550 | 1 | |
12549 | 1 |
SRCHWRD_NM
Categorical
HIGH CORRELATION
 
Distinct | 3 |
---|---|
Distinct (%) | 3.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 852.0 B |
뮤지컬귀환 | |
---|---|
뮤지컬렌트 | |
뮤지컬리지 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 뮤지컬귀환 |
---|---|
2nd row | 뮤지컬귀환 |
3rd row | 뮤지컬귀환 |
4th row | 뮤지컬귀환 |
5th row | 뮤지컬귀환 |
Common Values
Value | Count | Frequency (%) |
뮤지컬귀환 | 30 | |
뮤지컬렌트 | 30 | |
뮤지컬리지 | 30 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
뮤지컬귀환 | 30 | |
뮤지컬렌트 | 30 | |
뮤지컬리지 | 30 |
UPPER_CTGRY_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 852.0 B |
문화공연 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 문화공연 |
---|---|
2nd row | 문화공연 |
3rd row | 문화공연 |
4th row | 문화공연 |
5th row | 문화공연 |
Common Values
Value | Count | Frequency (%) |
문화공연 | 90 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
문화공연 | 90 |
LWPRT_CTGRY_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 852.0 B |
뮤지컬 |
---|
Length
Max length | 3 |
---|---|
Median length | 3 |
Mean length | 3 |
Min length | 3 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 뮤지컬 |
---|---|
2nd row | 뮤지컬 |
3rd row | 뮤지컬 |
4th row | 뮤지컬 |
5th row | 뮤지컬 |
Common Values
Value | Count | Frequency (%) |
뮤지컬 | 90 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
뮤지컬 | 90 |
ALL_KWRD_RANK_CO
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 5.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 852.0 B |
16 | |
---|---|
17 | |
18 | |
19 | |
20 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 16 |
---|---|
2nd row | 17 |
3rd row | 18 |
4th row | 19 |
5th row | 20 |
Common Values
Value | Count | Frequency (%) |
16 | 18 | |
17 | 18 | |
18 | 18 | |
19 | 18 | |
20 | 18 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
16 | 18 | |
17 | 18 | |
18 | 18 | |
19 | 18 | |
20 | 18 |
ASKWRD_NM
Text
Distinct | 73 |
---|---|
Distinct (%) | 81.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 852.0 B |
Value | Count | Frequency (%) |
시간 | 4 | 4.4% |
노래 | 4 | 4.4% |
정다희 | 2 | 2.2% |
조앤 | 2 | 2.2% |
연극 | 2 | 2.2% |
이성열 | 2 | 2.2% |
프로그램 | 2 | 2.2% |
좌석 | 2 | 2.2% |
콘서트 | 2 | 2.2% |
출처 | 2 | 2.2% |
Other values (63) | 66 |
Most occurring characters
Value | Count | Frequency (%) |
미 | 7 | 3.3% |
정 | 5 | 2.4% |
래 | 5 | 2.4% |
이 | 5 | 2.4% |
시 | 4 | 1.9% |
노 | 4 | 1.9% |
나 | 4 | 1.9% |
간 | 4 | 1.9% |
모 | 4 | 1.9% |
지 | 3 | 1.4% |
Other values (112) | 165 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 210 |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
미 | 7 | 3.3% |
정 | 5 | 2.4% |
래 | 5 | 2.4% |
이 | 5 | 2.4% |
시 | 4 | 1.9% |
노 | 4 | 1.9% |
나 | 4 | 1.9% |
간 | 4 | 1.9% |
모 | 4 | 1.9% |
지 | 3 | 1.4% |
Other values (112) | 165 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 210 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
미 | 7 | 3.3% |
정 | 5 | 2.4% |
래 | 5 | 2.4% |
이 | 5 | 2.4% |
시 | 4 | 1.9% |
노 | 4 | 1.9% |
나 | 4 | 1.9% |
간 | 4 | 1.9% |
모 | 4 | 1.9% |
지 | 3 | 1.4% |
Other values (112) | 165 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 210 |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
미 | 7 | 3.3% |
정 | 5 | 2.4% |
래 | 5 | 2.4% |
이 | 5 | 2.4% |
시 | 4 | 1.9% |
노 | 4 | 1.9% |
나 | 4 | 1.9% |
간 | 4 | 1.9% |
모 | 4 | 1.9% |
지 | 3 | 1.4% |
Other values (112) | 165 |
ANALS_YM
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 6 |
---|---|
Distinct (%) | 6.7% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 202009.5 |
Minimum | 202007 |
---|---|
Maximum | 202012 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 942.0 B |
Quantile statistics
Minimum | 202007 |
---|---|
5-th percentile | 202007 |
Q1 | 202008 |
median | 202009.5 |
Q3 | 202011 |
95-th percentile | 202012 |
Maximum | 202012 |
Range | 5 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 1.7173929 |
---|---|
Coefficient of variation (CV) | 8.501545 × 10-6 |
Kurtosis | -1.2722257 |
Mean | 202009.5 |
Median Absolute Deviation (MAD) | 1.5 |
Skewness | 0 |
Sum | 18180855 |
Variance | 2.9494382 |
Monotonicity | Increasing |
Value | Count | Frequency (%) |
202007 | 15 | |
202008 | 15 | |
202009 | 15 | |
202010 | 15 | |
202011 | 15 | |
202012 | 15 |
Value | Count | Frequency (%) |
202007 | 15 | |
202008 | 15 | |
202009 | 15 | |
202010 | 15 | |
202011 | 15 | |
202012 | 15 |
Value | Count | Frequency (%) |
202012 | 15 | |
202011 | 15 | |
202010 | 15 | |
202009 | 15 | |
202008 | 15 | |
202007 | 15 |
SEQ_NO | SRCHWRD_NM | ALL_KWRD_RANK_CO | ASKWRD_NM | ANALS_YM | |
---|---|---|---|---|---|
SEQ_NO | 1.000 | 1.000 | 0.000 | 0.000 | 0.583 |
SRCHWRD_NM | 1.000 | 1.000 | 0.000 | 0.808 | 0.000 |
ALL_KWRD_RANK_CO | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
ASKWRD_NM | 0.000 | 0.808 | 0.000 | 1.000 | 0.399 |
ANALS_YM | 0.583 | 0.000 | 0.000 | 0.399 | 1.000 |
SRCHWRD_NM | ALL_KWRD_RANK_CO | |
---|---|---|
SRCHWRD_NM | 1.000 | 0.000 |
ALL_KWRD_RANK_CO | 0.000 | 1.000 |
SEQ_NO | ANALS_YM | SRCHWRD_NM | ALL_KWRD_RANK_CO | |
---|---|---|---|---|
SEQ_NO | 1.000 | 0.798 | 0.971 | 0.000 |
ANALS_YM | 0.798 | 1.000 | 0.000 | 0.000 |
SRCHWRD_NM | 0.971 | 0.000 | 1.000 | 0.000 |
ALL_KWRD_RANK_CO | 0.000 | 0.000 | 0.000 | 1.000 |
SEQ_NO | SRCHWRD_NM | UPPER_CTGRY_NM | LWPRT_CTGRY_NM | ALL_KWRD_RANK_CO | ASKWRD_NM | ANALS_YM | |
---|---|---|---|---|---|---|---|
0 | 2506 | 뮤지컬귀환 | 문화공연 | 뮤지컬 | 16 | 미팅 | 202007 |
1 | 2507 | 뮤지컬귀환 | 문화공연 | 뮤지컬 | 17 | 경수 | 202007 |
2 | 2508 | 뮤지컬귀환 | 문화공연 | 뮤지컬 | 18 | 경력 | 202007 |
3 | 2509 | 뮤지컬귀환 | 문화공연 | 뮤지컬 | 19 | 미스터 | 202007 |
4 | 2510 | 뮤지컬귀환 | 문화공연 | 뮤지컬 | 20 | 제이미 | 202007 |
5 | 7520 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 16 | 영화 | 202007 |
6 | 7521 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 17 | 두훈 | 202007 |
7 | 7522 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 18 | 주년 | 202007 |
8 | 7523 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 19 | 정다희 | 202007 |
9 | 7524 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 20 | 시간 | 202007 |
SEQ_NO | SRCHWRD_NM | UPPER_CTGRY_NM | LWPRT_CTGRY_NM | ALL_KWRD_RANK_CO | ASKWRD_NM | ANALS_YM | |
---|---|---|---|---|---|---|---|
80 | 12549 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 16 | 레아 | 202012 |
81 | 12550 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 17 | 유진선 | 202012 |
82 | 12551 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 18 | 노래 | 202012 |
83 | 12552 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 19 | 시간 | 202012 |
84 | 12553 | 뮤지컬렌트 | 문화공연 | 뮤지컬 | 20 | 엔젤 | 202012 |
85 | 14208 | 뮤지컬리지 | 문화공연 | 뮤지컬 | 16 | 펀홈 | 202012 |
86 | 14209 | 뮤지컬리지 | 문화공연 | 뮤지컬 | 17 | 가격 | 202012 |
87 | 14210 | 뮤지컬리지 | 문화공연 | 뮤지컬 | 18 | 도끼 | 202012 |
88 | 14211 | 뮤지컬리지 | 문화공연 | 뮤지컬 | 19 | 그냥 | 202012 |
89 | 14212 | 뮤지컬리지 | 문화공연 | 뮤지컬 | 20 | 하나 | 202012 |