Overview

Dataset statistics

Number of variables: 12
Number of observations: 99
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.9 KiB
Average record size in memory: 102.3 B
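
The average record size is the total in-memory size divided by the number of observations. A minimal check of that arithmetic, assuming the reported 9.9 KiB is itself rounded to one decimal, so the result can differ from the reported 102.3 B in the last digit:

```python
# Figures copied from the "Dataset statistics" table.
total_size_kib = 9.9   # "Total size in memory" (rounded in the report)
n_observations = 99    # "Number of observations"

# 1 KiB = 1024 bytes; dividing by the row count gives bytes per record.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B per record")
```

The small gap between this figure and the reported value is consistent with rounding of the total size.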

Variable types

Categorical: 9
Numeric: 2
Text: 1

Dataset

Description: Sample
Author: (주)제로투원파트너스
URL: https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=ZTO010BSICGICIGARET

Alerts

MT has constant value "MT" (Constant)
201801 has constant value "201801" (Constant)
담배 has constant value "담배" (Constant)
전체 has constant value "전체" (Constant)
전체.1 has constant value "전체" (Constant)
2 has constant value "2" (Constant)
브리티쉬아메리칸토바코 던힐 has constant value "브리티쉬아메리칸토바코 던힐" (Constant)
93 is highly overall correlated with 15 (High correlation)
15 is highly overall correlated with 93 (High correlation)
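
The "Constant" alerts flag columns that hold a single distinct value. The sketch below illustrates that idea on five records transcribed from the Sample section of this report. It is not ydata-profiling's own implementation, and the helper name `constant_columns` is hypothetical.

```python
# Column labels as they appear in the report (a subset of the 12 columns).
columns = ["MT", "201801", "93", "15", "20"]

# Five records transcribed from the "Sample" section (rows 0, 1, 2, 95, 96).
records = [
    ("MT", "201801", 46, 20, "20"),
    ("MT", "201801", 77, 49, "20"),
    ("MT", "201801", 62, 7, "20"),
    ("MT", "201801", 98, 24, "30"),
    ("MT", "201801", 43, 9, "30"),
]


def constant_columns(names, rows):
    """Return the names of columns that hold exactly one distinct value."""
    constant = []
    for position, name in enumerate(names):
        distinct = {row[position] for row in rows}
        if len(distinct) == 1:
            constant.append(name)
    return constant


print(constant_columns(columns, records))  # ['MT', '201801']
```

On the full 99-row dataset the same check would also flag the other columns listed in the alerts.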

Reproduction

Analysis started: 2023-12-10 06:21:38.121742
Analysis finished: 2023-12-10 06:21:39.883619
Duration: 1.76 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
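
The reported duration can be checked directly from the two timestamps. A small sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2023-12-10 06:21:38.121742")
finished = datetime.fromisoformat("2023-12-10 06:21:39.883619")

# Subtracting two datetimes yields a timedelta.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # 1.76 seconds
```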

Variables

MT
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B
MT: 99

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: MT
2nd row: MT
3rd row: MT
4th row: MT
5th row: MT

Common Values

Value | Count | Frequency (%)
MT | 99 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
mt | 99 | 100.0%

201801
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B
201801: 99

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 201801
2nd row: 201801
3rd row: 201801
4th row: 201801
5th row: 201801

Common Values

Value | Count | Frequency (%)
201801 | 99 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
201801 | 99 | 100.0%

93
Real number (ℝ)

HIGH CORRELATION

Distinct: 77
Distinct (%): 77.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 107.27273
Minimum: 12
Maximum: 351
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1023.0 B

Quantile statistics

Minimum: 12
5-th percentile: 24.5
Q1: 49.5
Median: 94
Q3: 143.5
95-th percentile: 244.1
Maximum: 351
Range: 339
Interquartile range (IQR): 94

Descriptive statistics

Standard deviation: 74.83885
Coefficient of variation (CV): 0.69765029
Kurtosis: 1.0261081
Mean: 107.27273
Median Absolute Deviation (MAD): 46
Skewness: 1.1841187
Sum: 10620
Variance: 5600.8534
Monotonicity: Not monotonic
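
Several of the figures for column 93 are derived from one another: the mean is the sum divided by the row count, the variance is the squared standard deviation, the CV is the standard deviation over the mean, and the range and IQR are simple differences. The sketch below recomputes them from the reported values as a consistency check. It does not reproduce how the profiler computes them from the raw data.

```python
import math

# Values copied from the statistics reported for column "93".
n = 99
total = 10620
mean = 107.27273
std = 74.83885
variance = 5600.8534
cv = 0.69765029
minimum, maximum = 12, 351
q1, q3 = 49.5, 143.5

# Derived quantities, recomputed from the reported inputs.
derived = {
    "mean": total / n,
    "variance": std ** 2,
    "cv": std / mean,
    "range": maximum - minimum,
    "iqr": q3 - q1,
}

for name, value in derived.items():
    print(f"{name}: {value:.4f}")

# The recomputed values agree with the report up to rounding.
assert math.isclose(derived["mean"], mean, rel_tol=1e-6)
assert math.isclose(derived["variance"], variance, rel_tol=1e-6)
assert math.isclose(derived["cv"], cv, rel_tol=1e-6)
```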
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value | Count | Frequency (%)
108 | 4 | 4.0%
62 | 3 | 3.0%
163 | 3 | 3.0%
98 | 2 | 2.0%
30 | 2 | 2.0%
52 | 2 | 2.0%
94 | 2 | 2.0%
84 | 2 | 2.0%
133 | 2 | 2.0%
75 | 2 | 2.0%
Other values (67) | 75 | 75.8%

Minimum 10 values

Value | Count | Frequency (%)
12 | 1 | 1.0%
16 | 1 | 1.0%
19 | 2 | 2.0%
20 | 1 | 1.0%
25 | 1 | 1.0%
27 | 1 | 1.0%
29 | 1 | 1.0%
30 | 2 | 2.0%
34 | 2 | 2.0%
35 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
351 | 1 | 1.0%
319 | 1 | 1.0%
312 | 1 | 1.0%
298 | 1 | 1.0%
245 | 1 | 1.0%
244 | 1 | 1.0%
240 | 1 | 1.0%
236 | 1 | 1.0%
230 | 1 | 1.0%
225 | 1 | 1.0%

15
Real number (ℝ)

HIGH CORRELATION

Distinct: 45
Distinct (%): 45.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21.323232
Minimum: 1
Maximum: 88
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1023.0 B

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 9
Median: 17
Q3: 29
95-th percentile: 53.2
Maximum: 88
Range: 87
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 16.983683
Coefficient of variation (CV): 0.79648725
Kurtosis: 3.4317411
Mean: 21.323232
Median Absolute Deviation (MAD): 9
Skewness: 1.6376398
Sum: 2111
Variance: 288.44548
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=45)]

Common values

Value | Count | Frequency (%)
7 | 6 | 6.1%
4 | 5 | 5.1%
25 | 4 | 4.0%
11 | 4 | 4.0%
15 | 4 | 4.0%
9 | 4 | 4.0%
12 | 4 | 4.0%
26 | 4 | 4.0%
18 | 3 | 3.0%
16 | 3 | 3.0%
Other values (35) | 58 | 58.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.0%
2 | 2 | 2.0%
3 | 3 | 3.0%
4 | 5 | 5.1%
5 | 1 | 1.0%
6 | 2 | 2.0%
7 | 6 | 6.1%
8 | 3 | 3.0%
9 | 4 | 4.0%
10 | 3 | 3.0%

Maximum 10 values

Value | Count | Frequency (%)
88 | 1 | 1.0%
81 | 1 | 1.0%
76 | 1 | 1.0%
60 | 1 | 1.0%
55 | 1 | 1.0%
53 | 1 | 1.0%
50 | 1 | 1.0%
49 | 1 | 1.0%
45 | 1 | 1.0%
41 | 1 | 1.0%

담배
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B
담배: 99

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 담배
2nd row: 담배
3rd row: 담배
4th row: 담배
5th row: 담배

Common Values

Value | Count | Frequency (%)
담배 | 99 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
담배 | 99 | 100.0%

전체
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B
전체: 99

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전체
2nd row: 전체
3rd row: 전체
4th row: 전체
5th row: 전체

Common Values

Value | Count | Frequency (%)
전체 | 99 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
전체 | 99 | 100.0%

전체.1
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B
전체: 99

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전체
2nd row: 전체
3rd row: 전체
4th row: 전체
5th row: 전체

Common Values

Value | Count | Frequency (%)
전체 | 99 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
전체 | 99 | 100.0%

2
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B
2: 99

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 2
3rd row: 2
4th row: 2
5th row: 2

Common Values

Value | Count | Frequency (%)
2 | 99 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
2 | 99 | 100.0%

20
Categorical

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B
20: 81
30: 18

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20
2nd row: 20
3rd row: 20
4th row: 20
5th row: 20

Common Values

Value | Count | Frequency (%)
20 | 81 | 81.8%
30 | 18 | 18.2%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
20 | 81 | 81.8%
30 | 18 | 18.2%

전체.2
Categorical

Distinct: 18
Distinct (%): 18.2%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B
서울특별시: 20
경기도: 10
부산광역시: 10
인천광역시: 8
대구광역시: 7
Other values (13): 44

Length

Max length: 7
Median length: 5
Mean length: 4.4646465
Min length: 2

Unique

Unique: 2
Unique (%): 2.0%

Sample

1st row: 서울특별시
2nd row: 서울특별시
3rd row: 서울특별시
4th row: 서울특별시
5th row: 서울특별시

Common Values

Value | Count | Frequency (%)
서울특별시 | 20 | 20.2%
경기도 | 10 | 10.1%
부산광역시 | 10 | 10.1%
인천광역시 | 8 | 8.1%
대구광역시 | 7 | 7.1%
충청남도 | 7 | 7.1%
강원도 | 6 | 6.1%
경상북도 | 5 | 5.1%
경상남도 | 4 | 4.0%
광주광역시 | 4 | 4.0%
Other values (8) | 18 | 18.2%

Length

[Figure: histogram of lengths of the category]

Value | Count | Frequency (%)
서울특별시 | 20 | 20.2%
경기도 | 10 | 10.1%
부산광역시 | 10 | 10.1%
인천광역시 | 8 | 8.1%
대구광역시 | 7 | 7.1%
충청남도 | 7 | 7.1%
강원도 | 6 | 6.1%
경상북도 | 5 | 5.1%
광주광역시 | 4 | 4.0%
경상남도 | 4 | 4.0%
Other values (8) | 18 | 18.2%

전체.3
Text

Distinct: 71
Distinct (%): 71.7%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 4
Median length: 3
Mean length: 2.8181818
Min length: 2

Characters and Unicode

Total characters: 279
Distinct characters: 78
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
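
As a rough illustration of those character properties, the sketch below counts characters and Unicode general categories for the five sample values of this variable using Python's standard `unicodedata` module. The standard library exposes the general category (`Lo` is "Other Letter") and the character name, but not the script or block directly, so the Hangul check here relies on the character name, which is a simplification.

```python
import unicodedata
from collections import Counter

# The five sample values listed for this variable in the report.
sample_values = ["도봉구", "양천구", "동작구", "서초구", "강남구"]
text = "".join(sample_values)

# General category of every character ("Lo" means Other Letter).
category_counts = Counter(unicodedata.category(ch) for ch in text)

# Frequency of each individual character.
char_counts = Counter(text)

# Name-based check that every character is a precomposed Hangul syllable.
all_hangul = all(
    unicodedata.name(ch).startswith("HANGUL SYLLABLE") for ch in text
)

print(len(text))                  # 15
print(dict(category_counts))      # {'Lo': 15}
print(char_counts.most_common(1)) # [('구', 5)]
print(all_hangul)                 # True
```

The report's figures (279 characters, 78 distinct) come from all 99 rows, so they are not reproduced by this five-value sample.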

Unique

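Unique: 55
Unique (%): 55.6%

Sample

1st row: 도봉구
2nd row: 양천구
3rd row: 동작구
4th row: 서초구
5th row: 강남구

Value | Count | Frequency (%)
전체 | 9 | 9.1%
서구 | 4 | 4.0%
중구 | 4 | 4.0%
북구 | 3 | 3.0%
도봉구 | 2 | 2.0%
남동구 | 2 | 2.0%
양천구 | 2 | 2.0%
종로구 | 2 | 2.0%
성동구 | 2 | 2.0%
광진구 | 2 | 2.0%
Other values (61) | 67 | 67.7%

Most occurring characters

Value | Count | Frequency (%)
[character lost] | 50 | 17.9%
[character lost] | 30 | 10.8%
[character lost] | 13 | 4.7%
[character lost] | 10 | 3.6%
[character lost] | 10 | 3.6%
[character lost] | 10 | 3.6%
[character lost] | 9 | 3.2%
[character lost] | 8 | 2.9%
[character lost] | 8 | 2.9%
[character lost] | 8 | 2.9%
Other values (68) | 123 | 44.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 279 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[character lost] | 50 | 17.9%
[character lost] | 30 | 10.8%
[character lost] | 13 | 4.7%
[character lost] | 10 | 3.6%
[character lost] | 10 | 3.6%
[character lost] | 10 | 3.6%
[character lost] | 9 | 3.2%
[character lost] | 8 | 2.9%
[character lost] | 8 | 2.9%
[character lost] | 8 | 2.9%
Other values (68) | 123 | 44.1%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 279 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[character lost] | 50 | 17.9%
[character lost] | 30 | 10.8%
[character lost] | 13 | 4.7%
[character lost] | 10 | 3.6%
[character lost] | 10 | 3.6%
[character lost] | 10 | 3.6%
[character lost] | 9 | 3.2%
[character lost] | 8 | 2.9%
[character lost] | 8 | 2.9%
[character lost] | 8 | 2.9%
Other values (68) | 123 | 44.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 279 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
[character lost] | 50 | 17.9%
[character lost] | 30 | 10.8%
[character lost] | 13 | 4.7%
[character lost] | 10 | 3.6%
[character lost] | 10 | 3.6%
[character lost] | 10 | 3.6%
[character lost] | 9 | 3.2%
[character lost] | 8 | 2.9%
[character lost] | 8 | 2.9%
[character lost] | 8 | 2.9%
Other values (68) | 123 | 44.1%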

브리티쉬아메리칸토바코 던힐
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B
브리티쉬아메리칸토바코 던힐: 99

Length

Max length: 14
Median length: 14
Mean length: 14
Min length: 14

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 브리티쉬아메리칸토바코 던힐
2nd row: 브리티쉬아메리칸토바코 던힐
3rd row: 브리티쉬아메리칸토바코 던힐
4th row: 브리티쉬아메리칸토바코 던힐
5th row: 브리티쉬아메리칸토바코 던힐

Common Values

Value | Count | Frequency (%)
브리티쉬아메리칸토바코 던힐 | 99 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
브리티쉬아메리칸토바코 | 99 | 50.0%
던힐 | 99 | 50.0%

Interactions

[Figures: four interaction plots]

Correlations

[Figure: correlation heatmap]

Variable | 93 | 15 | 20 | 전체.2 | 전체.3
93 | 1.000 | 0.833 | 0.277 | 0.485 | 0.765
15 | 0.833 | 1.000 | 0.173 | 0.637 | 0.835
20 | 0.277 | 0.173 | 1.000 | 0.503 | 0.000
전체.2 | 0.485 | 0.637 | 0.503 | 1.000 | 0.854
전체.3 | 0.765 | 0.835 | 0.000 | 0.854 | 1.000

[Figure: correlation heatmap]

Variable | 20 | 전체.2
20 | 1.000 | 0.362
전체.2 | 0.362 | 1.000

[Figure: correlation heatmap]

Variable | 93 | 15 | 20 | 전체.2
93 | 1.000 | 0.667 | 0.265 | 0.166
15 | 0.667 | 1.000 | 0.164 | 0.247
20 | 0.265 | 0.164 | 1.000 | 0.362
전체.2 | 0.166 | 0.247 | 0.362 | 1.000
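
The extracted text does not say which coefficient each matrix uses. As a general illustration of how a correlation between the two numeric columns (93 and 15) can be computed, the sketch below implements Pearson and Spearman coefficients from scratch and applies them to the 20 value pairs visible in the Sample section. Because only 20 of the 99 rows are available here, the results are not expected to match the matrices above, and the function names are illustrative only.

```python
import math

# (column 93, column 15) pairs transcribed from the 20 rows in the Sample section.
pairs = [
    (46, 20), (77, 49), (62, 7), (27, 11), (42, 4),
    (49, 6), (19, 3), (99, 7), (106, 15), (20, 2),
    (173, 35), (50, 10), (29, 28), (79, 9), (108, 11),
    (312, 41), (98, 24), (43, 9), (133, 53), (82, 26),
]
col_93 = [a for a, _ in pairs]
col_15 = [b for _, b in pairs]


def pearson(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(xs), average_ranks(ys))


print(f"Pearson:  {pearson(col_93, col_15):.3f}")
print(f"Spearman: {spearman(col_93, col_15):.3f}")
```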

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

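First rows

Index | MT | 201801 | 93 | 15 | 담배 | 전체 | 전체.1 | 2 | 20 | 전체.2 | 전체.3 | 브리티쉬아메리칸토바코 던힐
0 | MT | 201801 | 46 | 20 | 담배 | 전체 | 전체 | 2 | 20 | 서울특별시 | 도봉구 | 브리티쉬아메리칸토바코 던힐
1 | MT | 201801 | 77 | 49 | 담배 | 전체 | 전체 | 2 | 20 | 서울특별시 | 양천구 | 브리티쉬아메리칸토바코 던힐
2 | MT | 201801 | 62 | 7 | 담배 | 전체 | 전체 | 2 | 20 | 서울특별시 | 동작구 | 브리티쉬아메리칸토바코 던힐
3 | MT | 201801 | 27 | 11 | 담배 | 전체 | 전체 | 2 | 20 | 서울특별시 | 서초구 | 브리티쉬아메리칸토바코 던힐
4 | MT | 201801 | 42 | 4 | 담배 | 전체 | 전체 | 2 | 20 | 서울특별시 | 강남구 | 브리티쉬아메리칸토바코 던힐
5 | MT | 201801 | 49 | 6 | 담배 | 전체 | 전체 | 2 | 20 | 부산광역시 | 부산진구 | 브리티쉬아메리칸토바코 던힐
6 | MT | 201801 | 19 | 3 | 담배 | 전체 | 전체 | 2 | 20 | 부산광역시 | 기장군 | 브리티쉬아메리칸토바코 던힐
7 | MT | 201801 | 99 | 7 | 담배 | 전체 | 전체 | 2 | 20 | 대구광역시 | 중구 | 브리티쉬아메리칸토바코 던힐
8 | MT | 201801 | 106 | 15 | 담배 | 전체 | 전체 | 2 | 20 | 대구광역시 | 북구 | 브리티쉬아메리칸토바코 던힐
9 | MT | 201801 | 20 | 2 | 담배 | 전체 | 전체 | 2 | 20 | 대구광역시 | 수성구 | 브리티쉬아메리칸토바코 던힐

Last rows

Index | MT | 201801 | 93 | 15 | 담배 | 전체 | 전체.1 | 2 | 20 | 전체.2 | 전체.3 | 브리티쉬아메리칸토바코 던힐
89 | MT | 201801 | 173 | 35 | 담배 | 전체 | 전체 | 2 | 20 | 전라남도 | 여수시 | 브리티쉬아메리칸토바코 던힐
90 | MT | 201801 | 50 | 10 | 담배 | 전체 | 전체 | 2 | 20 | 전라남도 | 강진군 | 브리티쉬아메리칸토바코 던힐
91 | MT | 201801 | 29 | 28 | 담배 | 전체 | 전체 | 2 | 20 | 전라남도 | 무안군 | 브리티쉬아메리칸토바코 던힐
92 | MT | 201801 | 79 | 9 | 담배 | 전체 | 전체 | 2 | 20 | 경상북도 | 구미시 | 브리티쉬아메리칸토바코 던힐
93 | MT | 201801 | 108 | 11 | 담배 | 전체 | 전체 | 2 | 20 | 경상북도 | 상주시 | 브리티쉬아메리칸토바코 던힐
94 | MT | 201801 | 312 | 41 | 담배 | 전체 | 전체 | 2 | 20 | 경상남도 | 진주시 | 브리티쉬아메리칸토바코 던힐
95 | MT | 201801 | 98 | 24 | 담배 | 전체 | 전체 | 2 | 30 | 서울특별시 | 전체 | 브리티쉬아메리칸토바코 던힐
96 | MT | 201801 | 43 | 9 | 담배 | 전체 | 전체 | 2 | 30 | 서울특별시 | 종로구 | 브리티쉬아메리칸토바코 던힐
97 | MT | 201801 | 133 | 53 | 담배 | 전체 | 전체 | 2 | 30 | 서울특별시 | 성동구 | 브리티쉬아메리칸토바코 던힐
98 | MT | 201801 | 82 | 26 | 담배 | 전체 | 전체 | 2 | 30 | 서울특별시 | 광진구 | 브리티쉬아메리칸토바코 던힐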
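
The column labels in this sample (MT, 201801, 93, 15, ...) look like a data record rather than field names, which suggests the source file may have no header line and that its first record was read as the header. That is an inference from the report, not something it states. The sketch below shows one way to read such a file with explicit field names so the first record is kept as data. The comma delimiter and the placeholder names `col_0` to `col_11` are assumptions.

```python
import csv
import io

# Two lines in the shape the report implies: the first is what the profiler
# treated as the header, the second is sample row 0.
# Assumption: comma-delimited text; the real file format is not shown in the report.
raw = (
    "MT,201801,93,15,담배,전체,전체,2,20,전체,전체,브리티쉬아메리칸토바코 던힐\n"
    "MT,201801,46,20,담배,전체,전체,2,20,서울특별시,도봉구,브리티쉬아메리칸토바코 던힐\n"
)

# Hypothetical placeholder names; the real field meanings are not given.
field_names = [f"col_{i}" for i in range(12)]

# Passing fieldnames explicitly makes DictReader treat every line as data.
reader = csv.DictReader(io.StringIO(raw), fieldnames=field_names)
records = list(reader)

print(len(records))                                # 2
print(records[0]["col_2"], records[0]["col_3"])    # 93 15
print(records[1]["col_9"], records[1]["col_10"])   # 서울특별시 도봉구
```

If the file really has no header, reading it this way would yield 100 records instead of 99 and would remove the need for suffixes such as 전체.1 on repeated labels.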