Overview

Dataset statistics

Number of variables: 6
Number of observations: 60
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.1 KiB
Average record size in memory: 53.2 B

Variable types

Numeric: 3
Categorical: 3

Alerts

UPPER_CTGRY_NM has constant value "아이돌" (Constant)
SRCHWRD_NM is highly overall correlated with LWPRT_CTGRY_NM (High correlation)
LWPRT_CTGRY_NM is highly overall correlated with SRCHWRD_NM (High correlation)
SEQ_NO is highly overall correlated with DPI_YM (High correlation)
DPI_YM is highly overall correlated with SEQ_NO (High correlation)
SEQ_NO has unique values (Unique)
KWRD_DPI_VALUE has unique values (Unique)
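
The Constant and Unique alerts can be spot-checked by hand. The sketch below is not the profiler's own logic: it only uses the 20 rows shown in the report's Sample section (rows 0-9 and 50-59), and the helper names `is_constant`, `is_unique`, and `determines` are made up for illustration. It also checks that each SRCHWRD_NM value maps to a single LWPRT_CTGRY_NM value, which is one plausible reason for the High correlation alert on that pair.

```python
# The 20 rows listed in the Sample section (not the full 60-row dataset).
COLUMNS = ["SEQ_NO", "SRCHWRD_NM", "UPPER_CTGRY_NM",
           "LWPRT_CTGRY_NM", "KWRD_DPI_VALUE", "DPI_YM"]
ROWS = [
    (2364, "틴탑", "아이돌", "보이그룹", 2849743.37, 202007),
    (2359, "엑소", "아이돌", "보이그룹", 808015.38, 202007),
    (2340, "NCT", "아이돌", "보이그룹", 354679.39, 202007),
    (2386, "데이식스", "아이돌", "보이그룹", 382840.44, 202007),
    (2376, "소녀시대", "아이돌", "걸그룹", 451880.88, 202007),
    (2402, "비투비", "아이돌", "보이그룹", 479455.93, 202007),
    (2420, "여자아이들", "아이돌", "걸그룹", 496894.42, 202007),
    (2378, "초신성", "아이돌", "보이그룹", 41487.28, 202007),
    (2390, "투모로우바이투게더", "아이돌", "보이그룹", 687940.58, 202007),
    (2351, "B1A4", "아이돌", "보이그룹", 148172.76, 202007),
    (3356, "여자아이들", "아이돌", "걸그룹", 950352.01, 202012),
    (3230, "비투비", "아이돌", "보이그룹", 4311978.82, 202012),
    (3368, "투모로우바이투게더", "아이돌", "보이그룹", 1156585.8, 202012),
    (3278, "데이식스", "아이돌", "보이그룹", 485454.48, 202012),
    (3262, "초신성", "아이돌", "보이그룹", 33091.89, 202012),
    (3380, "B1A4", "아이돌", "보이그룹", 190356.15, 202012),
    (3392, "NCT", "아이돌", "보이그룹", 2036666.91, 202012),
    (3212, "틴탑", "아이돌", "보이그룹", 123688.48, 202012),
    (3204, "엑소", "아이돌", "보이그룹", 701523.21, 202012),
    (3300, "소녀시대", "아이돌", "걸그룹", 649889.99, 202012),
]


def column(name):
    """Return all values of one column, in row order."""
    i = COLUMNS.index(name)
    return [row[i] for row in ROWS]


def is_constant(values):
    """True when every row holds the same value."""
    return len(set(values)) == 1


def is_unique(values):
    """True when no value repeats."""
    return len(set(values)) == len(values)


def determines(a, b):
    """True when each value in a always pairs with the same value in b."""
    seen = {}
    for x, y in zip(a, b):
        if seen.setdefault(x, y) != y:
            return False
    return True


constant_cols = [c for c in COLUMNS if is_constant(column(c))]
unique_cols = [c for c in COLUMNS if is_unique(column(c))]
name_fixes_category = determines(column("SRCHWRD_NM"), column("LWPRT_CTGRY_NM"))

print(constant_cols)
print(unique_cols)
print(name_fixes_category)
```

On these 20 rows the constant and unique columns match the alerts above. A check on the full 60 rows would need the original data, which the report does not include.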

Reproduction

Analysis started: 2023-12-10 09:52:29.957650
Analysis finished: 2023-12-10 09:52:32.333750
Duration: 2.38 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

SEQ_NO
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 60
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2913.0833
Minimum: 2340
Maximum: 3393
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 672.0 B

Quantile statistics

Minimum: 2340
5-th percentile: 2363.75
Q1: 2715.75
Median: 2884
Q3: 3230.25
95-th percentile: 3380.05
Maximum: 3393
Range: 1053
Interquartile range (IQR): 514.5

Descriptive statistics

Standard deviation: 329.22875
Coefficient of variation (CV): 0.11301728
Kurtosis: -0.95676232
Mean: 2913.0833
Median Absolute Deviation (MAD): 257.5
Skewness: -0.18498515
Sum: 174785
Variance: 108391.57
Monotonicity: Not monotonic
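
Several of the SEQ_NO figures above are derived from the others, so they can be cross-checked with plain arithmetic. The snippet below is a small sanity check using only the values printed in this report. The variable names are illustrative, and the tolerances allow for the report's rounding.

```python
# Values as reported for SEQ_NO.
n = 60
total = 174785
minimum, maximum = 2340, 3393
q1, q3 = 2715.75, 3230.25
std = 329.22875

mean = total / n                 # reported mean: 2913.0833
value_range = maximum - minimum  # reported range: 1053
iqr = q3 - q1                    # reported IQR: 514.5
cv = std / mean                  # reported CV: 0.11301728
variance = std ** 2              # reported variance: 108391.57

print(f"mean={mean:.4f} range={value_range} iqr={iqr} "
      f"cv={cv:.8f} variance={variance:.2f}")
```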
Histogram with fixed size bins (bins=50)
Common values
Value              Count  Frequency (%)
2364               1      1.7%
2785               1      1.7%
2853               1      1.7%
2697               1      1.7%
2715               1      1.7%
2877               1      1.7%
2689               1      1.7%
2747               1      1.7%
2841               1      1.7%
3231               1      1.7%
Other values (50)  50     83.3%

Smallest values (ascending)
Value  Count  Frequency (%)
2340   1      1.7%
2351   1      1.7%
2359   1      1.7%
2364   1      1.7%
2376   1      1.7%
2378   1      1.7%
2386   1      1.7%
2390   1      1.7%
2402   1      1.7%
2420   1      1.7%

Largest values (descending)
Value  Count  Frequency (%)
3393   1      1.7%
3392   1      1.7%
3381   1      1.7%
3380   1      1.7%
3369   1      1.7%
3368   1      1.7%
3357   1      1.7%
3356   1      1.7%
3301   1      1.7%
3300   1      1.7%

SRCHWRD_NM
Categorical

HIGH CORRELATION

Distinct: 10
Distinct (%): 16.7%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B
Top categories (bar chart): 틴탑, 엑소, NCT, 데이식스, 소녀시대; other values (5): 30

Length

Max length: 9
Median length: 5
Mean length: 3.9
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 틴탑
2nd row: 엑소
3rd row: NCT
4th row: 데이식스
5th row: 소녀시대

Common Values

Value               Count  Frequency (%)
틴탑                6      10.0%
엑소                6      10.0%
NCT                 6      10.0%
데이식스            6      10.0%
소녀시대            6      10.0%
비투비              6      10.0%
여자아이들          6      10.0%
초신성              6      10.0%
투모로우바이투게더  6      10.0%
B1A4                6      10.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value               Count  Frequency (%)
틴탑                6      10.0%
엑소                6      10.0%
nct                 6      10.0%
데이식스            6      10.0%
소녀시대            6      10.0%
비투비              6      10.0%
여자아이들          6      10.0%
초신성              6      10.0%
투모로우바이투게더  6      10.0%
b1a4                6      10.0%

UPPER_CTGRY_NM
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B
Categories (bar chart): 아이돌: 60

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 아이돌
2nd row: 아이돌
3rd row: 아이돌
4th row: 아이돌
5th row: 아이돌

Common Values

Value   Count  Frequency (%)
아이돌  60     100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value   Count  Frequency (%)
아이돌  60     100.0%

LWPRT_CTGRY_NM
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 612.0 B
Categories (bar chart): 보이그룹: 48, 걸그룹: 12

Length

Max length: 4
Median length: 4
Mean length: 3.8
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보이그룹
2nd row: 보이그룹
3rd row: 보이그룹
4th row: 보이그룹
5th row: 걸그룹

Common Values

Value     Count  Frequency (%)
보이그룹  48     80.0%
걸그룹    12     20.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value     Count  Frequency (%)
보이그룹  48     80.0%
걸그룹    12     20.0%

KWRD_DPI_VALUE
Real number (ℝ)

UNIQUE

Distinct: 60
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 950809.91
Minimum: 33091.89
Maximum: 5037637.6
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 672.0 B

Quantile statistics

Minimum: 33091.89
5-th percentile: 65715.738
Q1: 434586.43
Median: 668915.28
Q3: 1071497.4
95-th percentile: 2922855.1
Maximum: 5037637.6
Range: 5004545.7
Interquartile range (IQR): 636910.98

Descriptive statistics

Standard deviation: 1034807.3
Coefficient of variation (CV): 1.0883429
Kurtosis: 7.541982
Mean: 950809.91
Median Absolute Deviation (MAD): 323239.78
Skewness: 2.6362159
Sum: 57048595
Variance: 1.0708261 × 10^12
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value              Count  Frequency (%)
2849743.37         1      1.7%
533482.13          1      1.7%
1228262.76         1      1.7%
486033.6           1      1.7%
483654.96          1      1.7%
1425978.86         1      1.7%
1723318.85         1      1.7%
41367.87           1      1.7%
559736.29          1      1.7%
1445477.38         1      1.7%
Other values (50)  50     83.3%

Smallest values (ascending)
Value      Count  Frequency (%)
33091.89   1      1.7%
41367.87   1      1.7%
41487.28   1      1.7%
66990.92   1      1.7%
123688.48  1      1.7%
148172.76  1      1.7%
174337.0   1      1.7%
190356.15  1      1.7%
192926.96  1      1.7%
250530.28  1      1.7%

Largest values (descending)
Value       Count  Frequency (%)
5037637.58  1      1.7%
4822250.5   1      1.7%
4311978.82  1      1.7%
2849743.37  1      1.7%
2036666.91  1      1.7%
1984832.5   1      1.7%
1723318.85  1      1.7%
1685639.09  1      1.7%
1588576.97  1      1.7%
1445477.38  1      1.7%
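
The positive skewness (2.64) and high kurtosis (7.54) of KWRD_DPI_VALUE point to a heavy right tail. One common way to make that concrete is Tukey's 1.5 × IQR rule. The report itself does not apply this rule, so the sketch below is an added illustration, and it uses only the quartiles and the ten largest values printed above.

```python
# Quartiles and IQR as reported for KWRD_DPI_VALUE.
q1, q3 = 434586.43, 1071497.4
iqr = 636910.98
reported_minimum = 33091.89

# The ten largest values listed in the report.
largest = [5037637.58, 4822250.5, 4311978.82, 2849743.37, 2036666.91,
           1984832.5, 1723318.85, 1685639.09, 1588576.97, 1445477.38]

# Tukey fences: values beyond 1.5 * IQR from the quartiles are flagged.
upper_fence = q3 + 1.5 * iqr
lower_fence = q1 - 1.5 * iqr

high_outliers = [v for v in largest if v > upper_fence]
has_low_outliers = reported_minimum < lower_fence

print(f"upper fence: {upper_fence:.2f}")
print(f"values above it: {high_outliers}")
print(f"any value below the lower fence: {has_low_outliers}")
```

Under this rule the five largest values sit above the upper fence, and nothing falls below the lower fence because that fence is negative while the minimum is positive.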

DPI_YM
Real number (ℝ)

HIGH CORRELATION

Distinct: 6
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 202009.5
Minimum: 202007
Maximum: 202012
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 672.0 B

Quantile statistics

Minimum: 202007
5-th percentile: 202007
Q1: 202008
Median: 202009.5
Q3: 202011
95-th percentile: 202012
Maximum: 202012
Range: 5
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.7222374
Coefficient of variation (CV): 8.5255268 × 10^-6
Kurtosis: -1.2739227
Mean: 202009.5
Median Absolute Deviation (MAD): 1.5
Skewness: 0
Sum: 12120570
Variance: 2.9661017
Monotonicity: Increasing
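
DPI_YM is profiled as a real number, but its values (202007 to 202012) look like year-month codes in YYYYMM form. That reading is an assumption, since the report does not document the column. Because the report lists each of the six values exactly 10 times, the column can be rebuilt and the statistics above rechecked with the standard library. The check suggests the reported variance uses the sample (n - 1) formula.

```python
import statistics

# Rebuild the column from the reported value counts: six codes, 10 rows each.
codes = [202007, 202008, 202009, 202010, 202011, 202012]
values = [code for code in codes for _ in range(10)]

mean = statistics.mean(values)                 # reported: 202009.5
median = statistics.median(values)             # reported: 202009.5
sample_variance = statistics.variance(values)  # n - 1 denominator
sample_std = statistics.stdev(values)
population_variance = statistics.pvariance(values)  # n denominator, for contrast

# Assumed YYYYMM encoding: split a code into (year, month).
year, month = divmod(codes[0], 100)

print(mean, median)
print(round(sample_variance, 7), round(sample_std, 7))
print(round(population_variance, 7))
print(year, month)
```

Treating the column as a date or ordered category instead of a real number would make statistics such as the mean and CV easier to interpret.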
Histogram with fixed size bins (bins=6)
Common values
Value   Count  Frequency (%)
202007  10     16.7%
202008  10     16.7%
202009  10     16.7%
202010  10     16.7%
202011  10     16.7%
202012  10     16.7%

Smallest values (ascending)
Value   Count  Frequency (%)
202007  10     16.7%
202008  10     16.7%
202009  10     16.7%
202010  10     16.7%
202011  10     16.7%
202012  10     16.7%

Largest values (descending)
Value   Count  Frequency (%)
202012  10     16.7%
202011  10     16.7%
202010  10     16.7%
202009  10     16.7%
202008  10     16.7%
202007  10     16.7%

Interactions

[9 interaction plots; images not preserved]

Correlations

                SEQ_NO  SRCHWRD_NM  LWPRT_CTGRY_NM  KWRD_DPI_VALUE  DPI_YM
SEQ_NO          1.000   0.000       0.262           0.000           0.744
SRCHWRD_NM      0.000   1.000       1.000           0.462           0.000
LWPRT_CTGRY_NM  0.262   1.000       1.000           0.000           0.000
KWRD_DPI_VALUE  0.000   0.462       0.000           1.000           0.000
DPI_YM          0.744   0.000       0.000           0.000           1.000

                SRCHWRD_NM  LWPRT_CTGRY_NM
SRCHWRD_NM      1.000       0.928
LWPRT_CTGRY_NM  0.928       1.000

                SEQ_NO  KWRD_DPI_VALUE  DPI_YM  SRCHWRD_NM  LWPRT_CTGRY_NM
SEQ_NO          1.000   0.159           0.755   0.000       0.266
KWRD_DPI_VALUE  0.159   1.000           0.111   0.231       0.000
DPI_YM          0.755   0.111           1.000   0.000       0.000
SRCHWRD_NM      0.000   0.231           0.000   1.000       0.928
LWPRT_CTGRY_NM  0.266   0.000           0.000   0.928       1.000
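
The report text does not say which association measure produced the 0.928 between SRCHWRD_NM and LWPRT_CTGRY_NM. One candidate that reproduces it is Cramér's V with the Bergsma bias correction. The sketch below is an illustration under two assumptions: that this is the measure used, and that each search word keeps the category shown in the Sample section for all six months (consistent with the 48/12 split). The function name `cramers_v_corrected` is made up for this example.

```python
import math
from collections import Counter

# Category of each search word, taken from the Sample section.
GROUP_CATEGORY = {
    "틴탑": "보이그룹", "엑소": "보이그룹", "NCT": "보이그룹",
    "데이식스": "보이그룹", "소녀시대": "걸그룹", "비투비": "보이그룹",
    "여자아이들": "걸그룹", "초신성": "보이그룹",
    "투모로우바이투게더": "보이그룹", "B1A4": "보이그룹",
}
MONTHS = 6  # each search word appears once per month (6 rows each)

# Rebuild the 60 (search word, category) pairs.
pairs = [(g, c) for g, c in GROUP_CATEGORY.items() for _ in range(MONTHS)]


def cramers_v_corrected(pairs):
    """Bias-corrected Cramér's V for a list of (a, b) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row_totals = Counter(a for a, _ in pairs)
    col_totals = Counter(b for _, b in pairs)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for a, row_total in row_totals.items():
        for b, col_total in col_totals.items():
            expected = row_total * col_total / n
            chi2 += (joint[(a, b)] - expected) ** 2 / expected

    r, k = len(row_totals), len(col_totals)
    phi2 = chi2 / n
    # Bias correction terms.
    phi2_corr = max(0.0, phi2 - (r - 1) * (k - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(r_corr - 1, k_corr - 1))


v = cramers_v_corrected(pairs)
print(round(v, 3))
```

The search word fully determines the category here, so the value falls short of 1 only because of the small-sample correction.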

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First 10 rows
    SEQ_NO  SRCHWRD_NM          UPPER_CTGRY_NM  LWPRT_CTGRY_NM  KWRD_DPI_VALUE  DPI_YM
0   2364    틴탑                아이돌          보이그룹        2849743.37      202007
1   2359    엑소                아이돌          보이그룹        808015.38       202007
2   2340    NCT                 아이돌          보이그룹        354679.39       202007
3   2386    데이식스            아이돌          보이그룹        382840.44       202007
4   2376    소녀시대            아이돌          걸그룹          451880.88       202007
5   2402    비투비              아이돌          보이그룹        479455.93       202007
6   2420    여자아이들          아이돌          걸그룹          496894.42       202007
7   2378    초신성              아이돌          보이그룹        41487.28        202007
8   2390    투모로우바이투게더  아이돌          보이그룹        687940.58       202007
9   2351    B1A4                아이돌          보이그룹        148172.76       202007

Last 10 rows
    SEQ_NO  SRCHWRD_NM          UPPER_CTGRY_NM  LWPRT_CTGRY_NM  KWRD_DPI_VALUE  DPI_YM
50  3356    여자아이들          아이돌          걸그룹          950352.01       202012
51  3230    비투비              아이돌          보이그룹        4311978.82      202012
52  3368    투모로우바이투게더  아이돌          보이그룹        1156585.8       202012
53  3278    데이식스            아이돌          보이그룹        485454.48       202012
54  3262    초신성              아이돌          보이그룹        33091.89        202012
55  3380    B1A4                아이돌          보이그룹        190356.15       202012
56  3392    NCT                 아이돌          보이그룹        2036666.91      202012
57  3212    틴탑                아이돌          보이그룹        123688.48       202012
58  3204    엑소                아이돌          보이그룹        701523.21       202012
59  3300    소녀시대            아이돌          걸그룹          649889.99       202012