Overview

Dataset statistics

Number of variables: 3
Number of observations: 79
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 KiB
Average record size in memory: 26.7 B

Variable types

Numeric: 1
Categorical: 1
Text: 1

Dataset

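Description: Monthly rankings of brand mentions for the JDC designated duty-free shop (JDC지정면세점), 2016.08 through 2017.04
Author: Jeju Free International City Development Center (제주국제자유도시개발센터)
URL: https://www.data.go.kr/data/15070427/fileData.do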

Reproduction

Analysis started: 2023-12-12 00:57:57.816341
Analysis finished: 2023-12-12 00:57:58.317459
Duration: 0.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
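
The reported duration can be checked against the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of the report.
started = datetime.fromisoformat("2023-12-12 00:57:57.816341")
finished = datetime.fromisoformat("2023-12-12 00:57:58.317459")

# Elapsed wall-clock time between start and finish, in seconds.
duration_s = (finished - started).total_seconds()

print(f"{duration_s:.6f} s, displayed as {duration_s:.1f} seconds")
```

The difference is about half a second, which the report rounds to one decimal place.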

Variables

순위 (rank)
Real number (ℝ)

Distinct: 10
Distinct (%): 12.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.443038
Minimum: 1
Maximum: 10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 843.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 5
Q3: 8
95-th percentile: 10
Maximum: 10
Range: 9
Interquartile range (IQR): 5
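
The quartiles can be re-derived from the value counts the report lists for this variable (ranks 1 to 9 occur 8 times each, rank 10 occurs 7 times). This is a sketch, not the profiler's own code: it assumes the quartiles are computed with linear interpolation between order statistics, which `statistics.quantiles` provides through `method="inclusive"`.

```python
import statistics

# Rebuild the 79 rank values from the report's value counts:
# ranks 1-9 occur 8 times each, rank 10 occurs 7 times.
counts = {rank: 8 for rank in range(1, 10)}
counts[10] = 7
ranks = sorted(r for r, n in counts.items() for _ in range(n))

# method="inclusive" treats the data as the full population and
# interpolates linearly between neighbouring order statistics.
q1, median, q3 = statistics.quantiles(ranks, n=4, method="inclusive")

summary = {
    "min": min(ranks),
    "q1": q1,
    "median": median,
    "q3": q3,
    "max": max(ranks),
    "range": max(ranks) - min(ranks),
    "iqr": q3 - q1,
}
print(summary)
```

Under that assumption the sketch gives the same Q1, median, Q3, range and IQR as the report.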

Descriptive statistics

Standard deviation: 2.8633262
Coefficient of variation (CV): 0.52605295
Kurtosis: -1.2134465
Mean: 5.443038
Median Absolute Deviation (MAD): 2
Skewness: 0.010180947
Sum: 430
Variance: 8.1986368
Monotonicity: Not monotonic
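
Several of these figures follow directly from the value counts of 순위 (ranks 1 to 9 occur 8 times each, rank 10 occurs 7 times). The sketch below recomputes them with the standard library. It assumes the reported variance and standard deviation are the sample versions (n - 1 denominator), and it leaves out skewness and kurtosis because their exact bias correction is not stated in the report.

```python
import statistics

# Rebuild the 79 rank values from the report's value counts.
counts = {rank: 8 for rank in range(1, 10)}
counts[10] = 7
ranks = [r for r, n in counts.items() for _ in range(n)]

total = sum(ranks)
mean = statistics.mean(ranks)
variance = statistics.variance(ranks)  # sample variance, n - 1 denominator
std = statistics.stdev(ranks)          # square root of the sample variance
cv = std / mean                        # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
med = statistics.median(ranks)
mad = statistics.median(abs(r - med) for r in ranks)

print(total, round(mean, 6), round(variance, 7), round(std, 7), round(cv, 8), mad)
```

The coefficient of variation is the standard deviation divided by the mean, here roughly 2.863 / 5.443.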
[Figure: histogram with fixed size bins (bins=10)]

Value | Count | Frequency (%)
1 | 8 | 10.1%
2 | 8 | 10.1%
3 | 8 | 10.1%
4 | 8 | 10.1%
5 | 8 | 10.1%
6 | 8 | 10.1%
7 | 8 | 10.1%
8 | 8 | 10.1%
9 | 8 | 10.1%
10 | 7 | 8.9%

품목 (item category)
Categorical

Distinct: 8
Distinct (%): 10.1%
Missing: 0
Missing (%): 0.0%
Memory size: 764.0 B

화장품: 10
식품/건강: 10
패션잡화: 10
주류: 10
향수: 10
Other values (3): 29

Length

Max length: 5
Median length: 4
Mean length: 3.0126582
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 화장품
2nd row: 화장품
3rd row: 화장품
4th row: 화장품
5th row: 화장품

Common Values

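Value | Count | Frequency (%)
화장품 (cosmetics) | 10 | 12.7%
식품/건강 (food/health) | 10 | 12.7%
패션잡화 (fashion accessories) | 10 | 12.7%
주류 (liquor) | 10 | 12.7%
향수 (perfume) | 10 | 12.7%
액세서리 (accessories) | 10 | 12.7%
시계 (watches) | 10 | 12.7%
담배 (tobacco) | 9 | 11.4%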
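
The percentages and the mean length reported for 품목 follow from these counts. A minimal sketch, assuming frequencies are computed against all 79 rows and that length is the number of characters in each value:

```python
# Category counts copied from the Common Values table for 품목.
counts = {
    "화장품": 10,
    "식품/건강": 10,
    "패션잡화": 10,
    "주류": 10,
    "향수": 10,
    "액세서리": 10,
    "시계": 10,
    "담배": 9,
}

n_rows = sum(counts.values())

# Share of rows per category, as a percentage rounded to one decimal.
freq_pct = {name: round(100 * n / n_rows, 1) for name, n in counts.items()}

# Distinct (%) is the number of distinct categories over the row count.
distinct_pct = round(100 * len(counts) / n_rows, 1)

# Mean length weights each category's character count by its frequency.
mean_length = sum(len(name) * n for name, n in counts.items()) / n_rows

print(n_rows, freq_pct, distinct_pct, round(mean_length, 7))
```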

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

키워드 (keyword)
Text

Distinct: 40
Distinct (%): 50.6%
Missing: 0
Missing (%): 0.0%
Memory size: 764.0 B

Length

Max length: 5
Median length: 4
Mean length: 3.3544304
Min length: 2

Characters and Unicode

Total characters: 265
Distinct characters: 83
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
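
As an illustration of such character properties, the sketch below looks up the general category of each character in the five sample keywords shown in this report. It is not the profiler's implementation: it uses the standard `unicodedata` module, covers only those five sample values rather than the full 79-row column, and checks the block with a simple code point range test for precomposed Hangul syllables (U+AC00 to U+D7A3).

```python
import unicodedata
from collections import Counter

# The five sample keywords shown in the report (not the full column).
keywords = ["예쁘다", "저렴하다", "촉촉하다", "발색", "선물"]
chars = [ch for word in keywords for ch in word]

# General category per character; "Lo" is the abbreviation of "Other Letter".
category_counts = Counter(unicodedata.category(ch) for ch in chars)

# Rough block check: precomposed Hangul syllables occupy U+AC00..U+D7A3.
in_hangul_syllables = all(0xAC00 <= ord(ch) <= 0xD7A3 for ch in chars)

# Character frequencies, analogous to a "most occurring characters" table.
char_counts = Counter(chars)

print(category_counts, in_hangul_syllables, char_counts.most_common(3))
```

For these samples every character falls in the single category Other Letter, consistent with the one distinct category reported for the whole column.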

Unique

Unique: 25
Unique (%): 31.6%

Sample

1st row: 예쁘다 (pretty)
2nd row: 저렴하다 (inexpensive)
3rd row: 촉촉하다 (moisturizing)
4th row: 발색 (color payoff)
5th row: 선물 (gift)

Value | Count | Frequency (%)
선물 (gift) | 8 | 10.1%
저렴하다 (inexpensive) | 8 | 10.1%
예쁘다 (pretty) | 7 | 8.9%
고급스럽다 (luxurious) | 5 | 6.3%
데일리 (daily) | 3 | 3.8%
할인 (discount) | 3 | 3.8%
심플하다 (simple) | 3 | 3.8%
만족스럽다 (satisfying) | 3 | 3.8%
맛있다 (tasty) | 2 | 2.5%
향기 (scent) | 2 | 2.5%
Other values (30) | 35 | 44.3%

Most occurring characters

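Value | Count | Frequency (%)
[missing] | 44 | 16.6%
[missing] | 20 | 7.5%
[missing] | 10 | 3.8%
[missing] | 10 | 3.8%
[missing] | 8 | 3.0%
[missing] | 8 | 3.0%
[missing] | 8 | 3.0%
[missing] | 8 | 3.0%
[missing] | 8 | 3.0%
[missing] | 7 | 2.6%
Other values (73) | 134 | 50.6%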

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 265 | 100.0%

Most frequent character per category

Other Letter
Same counts and frequencies as under "Most occurring characters", since all 265 characters belong to this category.

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 265 | 100.0%

Most frequent character per script

Hangul
Same counts and frequencies as under "Most occurring characters", since all 265 characters belong to this script.

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 265 | 100.0%

Most frequent character per block

Hangul
Same counts and frequencies as under "Most occurring characters", since all 265 characters belong to this block.

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation plot]

 | 순위 | 품목 | 키워드
순위 | 1.000 | 0.000 | 0.500
품목 | 0.000 | 1.000 | 0.000
키워드 | 0.500 | 0.000 | 1.000

[Figure: correlation plot]

 | 순위 | 품목
순위 | 1.000 | 0.000
품목 | 0.000 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

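First rows

Index | 순위 | 품목 | 키워드
0 | 1 | 화장품 | 예쁘다 (pretty)
1 | 2 | 화장품 | 저렴하다 (inexpensive)
2 | 3 | 화장품 | 촉촉하다 (moisturizing)
3 | 4 | 화장품 | 발색 (color payoff)
4 | 5 | 화장품 | 선물 (gift)
5 | 6 | 화장품 | 지속력 (staying power)
6 | 7 | 화장품 | 고급스럽다 (luxurious)
7 | 8 | 화장품 | 데일리 (daily)
8 | 9 | 화장품 | 향기 (scent)
9 | 10 | 화장품 | 색상 (color)

Last rows

Index | 순위 | 품목 | 키워드
69 | 1 | 시계 | 예쁘다 (pretty)
70 | 2 | 시계 | 만족스럽다 (satisfying)
71 | 3 | 시계 | 여성스럽다 (feminine)
72 | 4 | 시계 | 데일리 (daily)
73 | 5 | 시계 | 선물 (gift)
74 | 6 | 시계 | 할인 (discount)
75 | 7 | 시계 | 심플하다 (simple)
76 | 8 | 시계 | 클래식하다 (classic)
77 | 9 | 시계 | 서현진시계 (Seo Hyun-jin watch)
78 | 10 | 시계 | 저렴하다 (inexpensive)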