Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 5.8 KiB |
Average record size in memory | 59.3 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 2 |
Text | 3 |
Dataset
Description | Sample |
---|---|
Author | 데이터마케팅코리아 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=3f8d0300-2a23-11eb-af9a-4b03f0a582d6 |
sccnt_ym has constant value "" | Constant |
origin_ty has constant value "" | Constant |
seq has unique values | Unique |
origin_sn_id has unique values | Unique |
kwrd_nm has unique values | Unique |
srchwrd_nm has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 09:56:07.922256 |
---|---|
Analysis finished | 2023-12-10 09:56:09.965138 |
Duration | 2.04 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
seq
Real number (ℝ)
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 283148.32 |
Minimum | 282862 |
---|---|
Maximum | 285081 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 282862 |
---|---|
5-th percentile | 282953.9 |
Q1 | 282996 |
median | 283122 |
Q3 | 283243.5 |
95-th percentile | 283285.1 |
Maximum | 285081 |
Range | 2219 |
Interquartile range (IQR) | 247.5 |
Descriptive statistics
Standard deviation | 262.13386 |
---|---|
Coefficient of variation (CV) | 0.00092578285 |
Kurtosis | 31.33068 |
Mean | 283148.32 |
Median Absolute Deviation (MAD) | 124 |
Skewness | 4.773381 |
Sum | 28314832 |
Variance | 68714.159 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
283980 | 1 | 1.0% |
283147 | 1 | 1.0% |
283237 | 1 | 1.0% |
283235 | 1 | 1.0% |
283232 | 1 | 1.0% |
283230 | 1 | 1.0% |
283228 | 1 | 1.0% |
283157 | 1 | 1.0% |
283155 | 1 | 1.0% |
283153 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
282862 | 1 | |
282873 | 1 | |
282875 | 1 | |
282877 | 1 | |
282952 | 1 | |
282954 | 1 | |
282956 | 1 | |
282960 | 1 | |
282962 | 1 | |
282963 | 1 |
Value | Count | Frequency (%) |
285081 | 1 | |
284050 | 1 | |
283980 | 1 | |
283289 | 1 | |
283287 | 1 | |
283285 | 1 | |
283283 | 1 | |
283281 | 1 | |
283279 | 1 | |
283277 | 1 |
sccnt_ym
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
2021-11 |
---|
Length
Max length | 7 |
---|---|
Median length | 7 |
Mean length | 7 |
Min length | 7 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-11 |
---|---|
2nd row | 2021-11 |
3rd row | 2021-11 |
4th row | 2021-11 |
5th row | 2021-11 |
Common Values
Value | Count | Frequency (%) |
2021-11 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021-11 | 100 |
origin_sn_id
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 16 |
---|---|
Median length | 16 |
Mean length | 14.74 |
Min length | 8 |
Characters and Unicode
Total characters | 1474 |
---|---|
Distinct characters | 22 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 100 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | SAN_0080 |
---|---|
2nd row | KC498PP19N007924 |
3rd row | KC495PP19N014040 |
4th row | SAN_0086 |
5th row | KC495PP19N026268 |
Value | Count | Frequency (%) |
san_0080 | 1 | 1.0% |
kc495pp19n032283 | 1 | 1.0% |
kc498pp19n006280 | 1 | 1.0% |
kc498pp19n001768 | 1 | 1.0% |
kc498pp19n000936 | 1 | 1.0% |
kc498pp19n004635 | 1 | 1.0% |
kc495pp19n026407 | 1 | 1.0% |
kc498pp19n006689 | 1 | 1.0% |
culture_001211 | 1 | 1.0% |
kc498pp19n003369 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 225 | |
9 | 190 | |
P | 164 | |
1 | 127 | |
4 | 116 | |
N | 97 | |
8 | 92 | 6.2% |
C | 85 | 5.8% |
K | 82 | 5.6% |
7 | 58 | 3.9% |
Other values (12) | 238 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 980 | |
Uppercase Letter | 476 | |
Connector Punctuation | 18 | 1.2% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
P | 164 | |
N | 97 | |
C | 85 | |
K | 82 | |
S | 15 | 3.2% |
A | 15 | 3.2% |
U | 6 | 1.3% |
L | 3 | 0.6% |
T | 3 | 0.6% |
R | 3 | 0.6% |
Decimal Number
Value | Count | Frequency (%) |
0 | 225 | |
9 | 190 | |
1 | 127 | |
4 | 116 | |
8 | 92 | |
7 | 58 | 5.9% |
5 | 57 | 5.8% |
3 | 39 | 4.0% |
2 | 38 | 3.9% |
6 | 38 | 3.9% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 18 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 998 | |
Latin | 476 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 225 | |
9 | 190 | |
1 | 127 | |
4 | 116 | |
8 | 92 | |
7 | 58 | 5.8% |
5 | 57 | 5.7% |
3 | 39 | 3.9% |
2 | 38 | 3.8% |
6 | 38 | 3.8% |
Latin
Value | Count | Frequency (%) |
P | 164 | |
N | 97 | |
C | 85 | |
K | 82 | |
S | 15 | 3.2% |
A | 15 | 3.2% |
U | 6 | 1.3% |
L | 3 | 0.6% |
T | 3 | 0.6% |
R | 3 | 0.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1474 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 225 | |
9 | 190 | |
P | 164 | |
1 | 127 | |
4 | 116 | |
N | 97 | |
8 | 92 | 6.2% |
C | 85 | 5.8% |
K | 82 | 5.6% |
7 | 58 | 3.9% |
Other values (12) | 238 |
kwrd_nm
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
구미산 | 1 | 1.0% |
강당계곡 | 1 | 1.0% |
개롱공원 | 1 | 1.0% |
개나리어린이공원 | 1 | 1.0% |
개나리공원 | 1 | 1.0% |
개금테마공원 | 1 | 1.0% |
강문해수욕장 | 1 | 1.0% |
강릉통일공원 | 1 | 1.0% |
강릉임영관삼문 | 1 | 1.0% |
강릉남대천체육공원 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
원 | 65 | 12.5% |
공 | 61 | 11.7% |
가 | 31 | 6.0% |
산 | 25 | 4.8% |
개 | 17 | 3.3% |
강 | 13 | 2.5% |
감 | 13 | 2.5% |
장 | 10 | 1.9% |
거 | 10 | 1.9% |
수 | 10 | 1.9% |
Other values (129) | 265 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 509 | |
Decimal Number | 10 | 1.9% |
Other Punctuation | 1 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
원 | 65 | 12.8% |
공 | 61 | 12.0% |
가 | 31 | 6.1% |
산 | 25 | 4.9% |
개 | 17 | 3.3% |
강 | 13 | 2.6% |
감 | 13 | 2.6% |
장 | 10 | 2.0% |
거 | 10 | 2.0% |
수 | 10 | 2.0% |
Other values (123) | 254 |
Decimal Number
Value | Count | Frequency (%) |
5 | 3 | |
1 | 3 | |
8 | 2 | |
2 | 1 | 10.0% |
7 | 1 | 10.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 509 | |
Common | 11 | 2.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
원 | 65 | 12.8% |
공 | 61 | 12.0% |
가 | 31 | 6.1% |
산 | 25 | 4.9% |
개 | 17 | 3.3% |
강 | 13 | 2.6% |
감 | 13 | 2.6% |
장 | 10 | 2.0% |
거 | 10 | 2.0% |
수 | 10 | 2.0% |
Other values (123) | 254 |
Common
Value | Count | Frequency (%) |
5 | 3 | |
1 | 3 | |
8 | 2 | |
. | 1 | 9.1% |
2 | 1 | 9.1% |
7 | 1 | 9.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 509 | |
ASCII | 11 | 2.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
원 | 65 | 12.8% |
공 | 61 | 12.0% |
가 | 31 | 6.1% |
산 | 25 | 4.9% |
개 | 17 | 3.3% |
강 | 13 | 2.6% |
감 | 13 | 2.6% |
장 | 10 | 2.0% |
거 | 10 | 2.0% |
수 | 10 | 2.0% |
Other values (123) | 254 |
ASCII
Value | Count | Frequency (%) |
5 | 3 | |
1 | 3 | |
8 | 2 | |
. | 1 | 9.1% |
2 | 1 | 9.1% |
7 | 1 | 9.1% |
srchwrd_nm
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Value | Count | Frequency (%) |
구미산 | 1 | 1.0% |
강당계곡 | 1 | 1.0% |
개롱공원 | 1 | 1.0% |
개나리어린이공원 | 1 | 1.0% |
개나리공원 | 1 | 1.0% |
개금테마공원 | 1 | 1.0% |
강문해수욕장 | 1 | 1.0% |
강릉통일공원 | 1 | 1.0% |
강릉임영관삼문 | 1 | 1.0% |
강릉남대천체육공원 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
원 | 65 | 12.5% |
공 | 61 | 11.7% |
가 | 31 | 6.0% |
산 | 25 | 4.8% |
개 | 17 | 3.3% |
강 | 13 | 2.5% |
감 | 13 | 2.5% |
장 | 10 | 1.9% |
거 | 10 | 1.9% |
수 | 10 | 1.9% |
Other values (129) | 265 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 509 | |
Decimal Number | 10 | 1.9% |
Other Punctuation | 1 | 0.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
원 | 65 | 12.8% |
공 | 61 | 12.0% |
가 | 31 | 6.1% |
산 | 25 | 4.9% |
개 | 17 | 3.3% |
강 | 13 | 2.6% |
감 | 13 | 2.6% |
장 | 10 | 2.0% |
거 | 10 | 2.0% |
수 | 10 | 2.0% |
Other values (123) | 254 |
Decimal Number
Value | Count | Frequency (%) |
5 | 3 | |
1 | 3 | |
8 | 2 | |
2 | 1 | 10.0% |
7 | 1 | 10.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 509 | |
Common | 11 | 2.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
원 | 65 | 12.8% |
공 | 61 | 12.0% |
가 | 31 | 6.1% |
산 | 25 | 4.9% |
개 | 17 | 3.3% |
강 | 13 | 2.6% |
감 | 13 | 2.6% |
장 | 10 | 2.0% |
거 | 10 | 2.0% |
수 | 10 | 2.0% |
Other values (123) | 254 |
Common
Value | Count | Frequency (%) |
5 | 3 | |
1 | 3 | |
8 | 2 | |
. | 1 | 9.1% |
2 | 1 | 9.1% |
7 | 1 | 9.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 509 | |
ASCII | 11 | 2.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
원 | 65 | 12.8% |
공 | 61 | 12.0% |
가 | 31 | 6.1% |
산 | 25 | 4.9% |
개 | 17 | 3.3% |
강 | 13 | 2.6% |
감 | 13 | 2.6% |
장 | 10 | 2.0% |
거 | 10 | 2.0% |
수 | 10 | 2.0% |
Other values (123) | 254 |
ASCII
Value | Count | Frequency (%) |
5 | 3 | |
1 | 3 | |
8 | 2 | |
. | 1 | 9.1% |
2 | 1 | 9.1% |
7 | 1 | 9.1% |
sccnt
Real number (ℝ)
Distinct | 66 |
---|---|
Distinct (%) | 66.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16190.81 |
Minimum | 10 |
---|---|
Maximum | 1493600 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 10 |
---|---|
5-th percentile | 11 |
Q1 | 40 |
median | 90 |
Q3 | 462.5 |
95-th percentile | 9558 |
Maximum | 1493600 |
Range | 1493590 |
Interquartile range (IQR) | 422.5 |
Descriptive statistics
Standard deviation | 149291.11 |
---|---|
Coefficient of variation (CV) | 9.2207316 |
Kurtosis | 99.840289 |
Mean | 16190.81 |
Median Absolute Deviation (MAD) | 72 |
Skewness | 9.988245 |
Sum | 1619081 |
Variance | 2.2287837 × 1010 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
60 | 5 | 5.0% |
50 | 4 | 4.0% |
90 | 4 | 4.0% |
70 | 3 | 3.0% |
11 | 3 | 3.0% |
80 | 3 | 3.0% |
10 | 3 | 3.0% |
180 | 3 | 3.0% |
40 | 3 | 3.0% |
200 | 2 | 2.0% |
Other values (56) | 67 |
Value | Count | Frequency (%) |
10 | 3 | |
11 | 3 | |
13 | 1 | 1.0% |
14 | 2 | |
16 | 2 | |
20 | 2 | |
25 | 2 | |
26 | 1 | 1.0% |
27 | 1 | 1.0% |
28 | 1 | 1.0% |
Value | Count | Frequency (%) |
1493600 | 1 | |
27590 | 1 | |
24150 | 1 | |
12860 | 1 | |
11800 | 1 | |
9440 | 1 | |
9330 | 1 | |
4130 | 1 | |
2320 | 1 | |
1790 | 1 |
origin_ty
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
관광명소 |
---|
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 관광명소 |
---|---|
2nd row | 관광명소 |
3rd row | 관광명소 |
4th row | 관광명소 |
5th row | 관광명소 |
Common Values
Value | Count | Frequency (%) |
관광명소 | 100 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
관광명소 | 100 |
seq | origin_sn_id | kwrd_nm | srchwrd_nm | sccnt | |
---|---|---|---|---|---|
seq | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
origin_sn_id | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
kwrd_nm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
srchwrd_nm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
sccnt | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 |
seq | sccnt | |
---|---|---|
seq | 1.000 | -0.016 |
sccnt | -0.016 | 1.000 |
seq | sccnt_ym | origin_sn_id | kwrd_nm | srchwrd_nm | sccnt | origin_ty | |
---|---|---|---|---|---|---|---|
0 | 283980 | 2021-11 | SAN_0080 | 구미산 | 구미산 | 910 | 관광명소 |
1 | 282873 | 2021-11 | KC498PP19N007924 | 5.18자유공원 | 5.18자유공원 | 440 | 관광명소 |
2 | 282862 | 2021-11 | KC495PP19N014040 | 12폭포 | 12폭포 | 47 | 관광명소 |
3 | 284050 | 2021-11 | SAN_0086 | 구절산 | 구절산 | 850 | 관광명소 |
4 | 282952 | 2021-11 | KC495PP19N026268 | 가사해수욕장 | 가사해수욕장 | 26 | 관광명소 |
5 | 282954 | 2021-11 | SAN_0007 | 가산 | 가산 | 9440 | 관광명소 |
6 | 282956 | 2021-11 | KC498PP19N007314 | 가산공원 | 가산공원 | 180 | 관광명소 |
7 | 282875 | 2021-11 | KC498PP19N005217 | 518기념공원 | 518기념공원 | 930 | 관광명소 |
8 | 282960 | 2021-11 | KC498PP19N006181 | 가산수변공원 | 가산수변공원 | 270 | 관광명소 |
9 | 282962 | 2021-11 | SAN_0008 | 가섭산 | 가섭산 | 180 | 관광명소 |
seq | sccnt_ym | origin_sn_id | kwrd_nm | srchwrd_nm | sccnt | origin_ty | |
---|---|---|---|---|---|---|---|
90 | 283271 | 2021-11 | KC498PP19N003353 | 거금생태숲 | 거금생태숲 | 140 | 관광명소 |
91 | 283273 | 2021-11 | SAN_0030 | 거류산 | 거류산 | 1210 | 관광명소 |
92 | 283275 | 2021-11 | KC498PP19N000759 | 거류체육공원 | 거류체육공원 | 38 | 관광명소 |
93 | 283277 | 2021-11 | KC495PP19N014472 | 거림계곡 | 거림계곡 | 200 | 관광명소 |
94 | 283279 | 2021-11 | KC498PP19N004084 | 거마공원 | 거마공원 | 70 | 관광명소 |
95 | 283281 | 2021-11 | SAN_0031 | 거망산 | 거망산 | 90 | 관광명소 |
96 | 283283 | 2021-11 | KC495PP19N026269 | 거문도해수욕장 | 거문도해수욕장 | 14 | 관광명소 |
97 | 283285 | 2021-11 | SAN_0032 | 거문산 | 거문산 | 30 | 관광명소 |
98 | 283287 | 2021-11 | KC498PP19N007441 | 거북공원 | 거북공원 | 460 | 관광명소 |
99 | 283289 | 2021-11 | KC498PP19N007449 | 거북선공원 | 거북선공원 | 150 | 관광명소 |