Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells1922
Missing cells (%)2.7%
Duplicate rows214
Duplicate rows (%)2.1%
Total size in memory654.3 KiB
Average record size in memory67.0 B

Variable types

Categorical2
Numeric1
Boolean3
Text1

Dataset

Description아임셀러 제휴카테고리 현황에 대한 데이터를 제공합니다. 기준연도, 기준월, 정렬순서, 최하위여부, 제휴카테고리 등을 제공합니다.
Author(주)중소기업유통센터
URLhttps://www.data.go.kr/data/15067217/fileData.do

Alerts

기준연도 has constant value ""Constant
기준월 has constant value ""Constant
Dataset has 214 (2.1%) duplicate rowsDuplicates
정렬순서 has 1922 (19.2%) missing valuesMissing
정렬순서 has 243 (2.4%) zerosZeros

Reproduction

Analysis started2023-12-12 04:08:01.596108
Analysis finished2023-12-12 04:08:03.284500
Duration1.69 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준연도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2020
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020
2nd row2020
3rd row2020
4th row2020
5th row2020

Common Values

ValueCountFrequency (%)
2020 10000
100.0%

Length

2023-12-12T13:08:03.360245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:08:03.466352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 10000
100.0%

기준월
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
9
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row9
2nd row9
3rd row9
4th row9
5th row9

Common Values

ValueCountFrequency (%)
9 10000
100.0%

Length

2023-12-12T13:08:03.563967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:08:03.683855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
9 10000
100.0%

정렬순서
Real number (ℝ)

MISSING  ZEROS 

Distinct2541
Distinct (%)31.5%
Missing1922
Missing (%)19.2%
Infinite0
Infinite (%)0.0%
Mean2446.9802
Minimum0
Maximum19811
Zeros243
Zeros (%)2.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T13:08:03.826347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q13
median7
Q32041.75
95-th percentile14614.75
Maximum19811
Range19811
Interquartile range (IQR)2038.75

Descriptive statistics

Standard deviation4738.265
Coefficient of variation (CV)1.9363724
Kurtosis3.1643803
Mean2446.9802
Median Absolute Deviation (MAD)6
Skewness2.0252354
Sum19766706
Variance22451155
MonotonicityNot monotonic
2023-12-12T13:08:03.983268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 800
 
8.0%
2 768
 
7.7%
3 659
 
6.6%
4 571
 
5.7%
5 484
 
4.8%
6 371
 
3.7%
7 311
 
3.1%
0 243
 
2.4%
8 227
 
2.3%
9 172
 
1.7%
Other values (2531) 3472
34.7%
(Missing) 1922
19.2%
ValueCountFrequency (%)
0 243
 
2.4%
1 800
8.0%
2 768
7.7%
3 659
6.6%
4 571
5.7%
5 484
4.8%
6 371
3.7%
7 311
 
3.1%
8 227
 
2.3%
9 172
 
1.7%
ValueCountFrequency (%)
19811 1
< 0.1%
19804 1
< 0.1%
19786 1
< 0.1%
19743 1
< 0.1%
19742 1
< 0.1%
19691 1
< 0.1%
19681 1
< 0.1%
19661 1
< 0.1%
19645 1
< 0.1%
19644 1
< 0.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
True
8384 
False
1616 
ValueCountFrequency (%)
True 8384
83.8%
False 1616
 
16.2%
2023-12-12T13:08:04.120770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
5865 
True
4135 
ValueCountFrequency (%)
False 5865
58.7%
True 4135
41.3%
2023-12-12T13:08:04.253945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct7773
Distinct (%)77.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T13:08:04.668819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length5.4857
Min length1

Characters and Unicode

Total characters54857
Distinct characters1007
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6597 ?
Unique (%)66.0%

Sample

1st row피규어/캐릭터
2nd row타자헬멧
3rd row드립서버
4th row전라도특산물
5th row기타브랜드
ValueCountFrequency (%)
기타 79
 
0.8%
기타브랜드 67
 
0.7%
삼성 47
 
0.5%
lg 33
 
0.3%
티셔츠 21
 
0.2%
소니 19
 
0.2%
브랜드기타 17
 
0.2%
여성용 16
 
0.2%
세트상품 16
 
0.2%
hp 16
 
0.2%
Other values (7761) 9669
96.7%
2023-12-12T13:08:05.449648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 3424
 
6.2%
1568
 
2.9%
1466
 
2.7%
1349
 
2.5%
978
 
1.8%
827
 
1.5%
747
 
1.4%
727
 
1.3%
662
 
1.2%
659
 
1.2%
Other values (997) 42450
77.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47368
86.3%
Other Punctuation 3487
 
6.4%
Uppercase Letter 2194
 
4.0%
Decimal Number 949
 
1.7%
Lowercase Letter 700
 
1.3%
Math Symbol 58
 
0.1%
Open Punctuation 41
 
0.1%
Close Punctuation 41
 
0.1%
Other Symbol 14
 
< 0.1%
Connector Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1568
 
3.3%
1466
 
3.1%
1349
 
2.8%
978
 
2.1%
827
 
1.7%
747
 
1.6%
727
 
1.5%
662
 
1.4%
659
 
1.4%
573
 
1.2%
Other values (924) 37812
79.8%
Uppercase Letter
ValueCountFrequency (%)
D 233
 
10.6%
G 219
 
10.0%
B 162
 
7.4%
S 139
 
6.3%
T 138
 
6.3%
L 138
 
6.3%
C 130
 
5.9%
P 118
 
5.4%
A 114
 
5.2%
V 102
 
4.6%
Other values (16) 701
32.0%
Lowercase Letter
ValueCountFrequency (%)
m 128
18.3%
c 125
17.9%
i 82
11.7%
e 53
7.6%
o 44
 
6.3%
a 35
 
5.0%
t 32
 
4.6%
l 28
 
4.0%
r 28
 
4.0%
n 25
 
3.6%
Other values (15) 120
17.1%
Decimal Number
ValueCountFrequency (%)
1 144
15.2%
0 137
14.4%
2 126
13.3%
3 103
10.9%
6 93
9.8%
5 92
9.7%
8 85
9.0%
4 83
8.7%
7 49
 
5.2%
9 37
 
3.9%
Other Punctuation
ValueCountFrequency (%)
/ 3424
98.2%
. 49
 
1.4%
& 7
 
0.2%
· 4
 
0.1%
\ 1
 
< 0.1%
: 1
 
< 0.1%
* 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 58
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 41
100.0%
Close Punctuation
ValueCountFrequency (%)
] 41
100.0%
Other Symbol
ValueCountFrequency (%)
14
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47368
86.3%
Common 4595
 
8.4%
Latin 2894
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1568
 
3.3%
1466
 
3.1%
1349
 
2.8%
978
 
2.1%
827
 
1.7%
747
 
1.6%
727
 
1.5%
662
 
1.4%
659
 
1.4%
573
 
1.2%
Other values (924) 37812
79.8%
Latin
ValueCountFrequency (%)
D 233
 
8.1%
G 219
 
7.6%
B 162
 
5.6%
S 139
 
4.8%
T 138
 
4.8%
L 138
 
4.8%
C 130
 
4.5%
m 128
 
4.4%
c 125
 
4.3%
P 118
 
4.1%
Other values (41) 1364
47.1%
Common
ValueCountFrequency (%)
/ 3424
74.5%
1 144
 
3.1%
0 137
 
3.0%
2 126
 
2.7%
3 103
 
2.2%
6 93
 
2.0%
5 92
 
2.0%
8 85
 
1.8%
4 83
 
1.8%
~ 58
 
1.3%
Other values (12) 250
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47367
86.3%
ASCII 7471
 
13.6%
CJK Compat 14
 
< 0.1%
None 4
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 3424
45.8%
D 233
 
3.1%
G 219
 
2.9%
B 162
 
2.2%
1 144
 
1.9%
S 139
 
1.9%
T 138
 
1.8%
L 138
 
1.8%
0 137
 
1.8%
C 130
 
1.7%
Other values (61) 2607
34.9%
Hangul
ValueCountFrequency (%)
1568
 
3.3%
1466
 
3.1%
1349
 
2.8%
978
 
2.1%
827
 
1.7%
747
 
1.6%
727
 
1.5%
662
 
1.4%
659
 
1.4%
573
 
1.2%
Other values (923) 37811
79.8%
CJK Compat
ValueCountFrequency (%)
14
100.0%
None
ValueCountFrequency (%)
· 4
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
8605 
True
1395 
ValueCountFrequency (%)
False 8605
86.1%
True 1395
 
14.0%
2023-12-12T13:08:05.719302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-12T13:08:02.544393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:08:05.846082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정렬순서최하위여부사용여부속성사용여부
정렬순서1.0000.1320.3200.317
최하위여부0.1321.0000.1050.273
사용여부0.3200.1051.0000.000
속성사용여부0.3170.2730.0001.000
2023-12-12T13:08:06.008817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용여부속성사용여부최하위여부
사용여부1.0000.0000.067
속성사용여부0.0001.0000.176
최하위여부0.0670.1761.000
2023-12-12T13:08:06.164306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
정렬순서최하위여부사용여부속성사용여부
정렬순서1.0000.1020.2450.243
최하위여부0.1021.0000.0670.176
사용여부0.2450.0671.0000.000
속성사용여부0.2430.1760.0001.000

Missing values

2023-12-12T13:08:03.097139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:08:03.228889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준연도기준월정렬순서최하위여부사용여부제휴카테고리명속성사용여부
562902020919544NY피규어/캐릭터N
566092020917187YY타자헬멧N
17070202092YN드립서버N
57262202098549YN전라도특산물Y
15715202095YN기타브랜드N
799120209<NA>YNDiorHommeN
60729202094810YN맨소래담로션N
51729202091237YY예초기N
39883202094YN멜로디언/키보드N
45602202091YY임산부스킨케어Y
기준연도기준월정렬순서최하위여부사용여부제휴카테고리명속성사용여부
3678820209<NA>YNALEXANDERWANGN
34044202092YN4GBN
37829202094YN스펀지샌들Y
945820209<NA>YN찌케이스N
455620209<NA>NNLG전자N
21016202092YN남성이너웨어N
632692020913960NY장갑N
1825202096NN브라더N
50837202090NY매트/발판/마루N
3076202095YN스프레이N

Duplicate rows

Most frequently occurring

기준연도기준월정렬순서최하위여부사용여부제휴카테고리명속성사용여부# duplicates
16920209<NA>YN기타N14
17220209<NA>YN남성용N12
19520209<NA>YN여성용N10
90202094YN등산조끼N9
17020209<NA>YN기타브랜드N9
18120209<NA>YN삼성N9
31202091YN자켓/점퍼N8
57202092YN삼성N7
27202091YN삼성N6
54202092YN다운/패딩N6