Overview

Dataset statistics

Number of variables9
Number of observations10000
Missing cells2181
Missing cells (%)2.4%
Duplicate rows342
Duplicate rows (%)3.4%
Total size in memory791.0 KiB
Average record size in memory81.0 B

Variable types

Text5
Categorical3
Numeric1

Dataset

Description사회적경제기업 상품 및 서비스에 대한 가격과 단위 정보기업명 / 상품유형 / 카테고리그룹 / 1차카테고리 / 2차카테고리 / 상품명 / 판매단위 / 판매가 / 기준일
Author한국사회적기업진흥원
URLhttps://www.data.go.kr/data/15038606/fileData.do

Alerts

기준일 has constant value ""Constant
Dataset has 342 (3.4%) duplicate rowsDuplicates
카테고리그룹 is highly overall correlated with 상품유형High correlation
상품유형 is highly overall correlated with 카테고리그룹High correlation
상품유형 is highly imbalanced (63.4%)Imbalance
판매단위 has 2181 (21.8%) missing valuesMissing
판매가 is highly skewed (γ1 = 20.92047656)Skewed
판매가 has 155 (1.6%) zerosZeros

Reproduction

Analysis started2024-04-29 22:28:08.056123
Analysis finished2024-04-29 22:28:11.351695
Duration3.3 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1272
Distinct (%)12.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T07:28:11.513463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length22
Mean length9.0694
Min length2

Characters and Unicode

Total characters90694
Distinct characters653
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique349 ?
Unique (%)3.5%

Sample

1st row발효명가협동조합
2nd row주식회사매직북스
3rd row주식회사메이커스핸즈
4th row영농조합법인 에듀팜
5th row아이밍키
ValueCountFrequency (%)
주식회사 1490
 
10.9%
아이밍키 762
 
5.6%
㈜멘퍼스 562
 
4.1%
사회적협동조합 345
 
2.5%
사단법인 300
 
2.2%
㈜아트앤크래프트 294
 
2.1%
농업회사법인 260
 
1.9%
우리들행복 242
 
1.8%
나눔종합가구사업단 242
 
1.8%
협동조합 163
 
1.2%
Other values (1326) 9063
66.0%
2024-04-30T07:28:11.956388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4706
 
5.2%
3957
 
4.4%
3758
 
4.1%
3688
 
4.1%
2421
 
2.7%
) 2312
 
2.5%
( 2277
 
2.5%
2202
 
2.4%
2167
 
2.4%
2007
 
2.2%
Other values (643) 61199
67.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 80464
88.7%
Space Separator 3758
 
4.1%
Close Punctuation 2312
 
2.5%
Open Punctuation 2277
 
2.5%
Other Symbol 1492
 
1.6%
Uppercase Letter 191
 
0.2%
Decimal Number 100
 
0.1%
Lowercase Letter 71
 
0.1%
Connector Punctuation 15
 
< 0.1%
Other Punctuation 14
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4706
 
5.8%
3957
 
4.9%
3688
 
4.6%
2421
 
3.0%
2202
 
2.7%
2167
 
2.7%
2007
 
2.5%
1987
 
2.5%
1882
 
2.3%
1787
 
2.2%
Other values (593) 53660
66.7%
Uppercase Letter
ValueCountFrequency (%)
M 38
19.9%
R 34
17.8%
O 33
17.3%
E 12
 
6.3%
G 10
 
5.2%
H 10
 
5.2%
B 10
 
5.2%
N 7
 
3.7%
P 7
 
3.7%
U 5
 
2.6%
Other values (8) 25
13.1%
Lowercase Letter
ValueCountFrequency (%)
n 13
18.3%
a 12
16.9%
p 10
14.1%
e 9
12.7%
r 5
 
7.0%
i 3
 
4.2%
d 3
 
4.2%
t 3
 
4.2%
h 3
 
4.2%
s 2
 
2.8%
Other values (4) 8
11.3%
Decimal Number
ValueCountFrequency (%)
2 22
22.0%
1 18
18.0%
0 15
15.0%
5 14
14.0%
9 12
12.0%
4 10
10.0%
3 3
 
3.0%
6 3
 
3.0%
8 2
 
2.0%
7 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
& 12
85.7%
: 1
 
7.1%
. 1
 
7.1%
Space Separator
ValueCountFrequency (%)
3758
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2312
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2277
100.0%
Other Symbol
ValueCountFrequency (%)
1492
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 81956
90.4%
Common 8476
 
9.3%
Latin 262
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4706
 
5.7%
3957
 
4.8%
3688
 
4.5%
2421
 
3.0%
2202
 
2.7%
2167
 
2.6%
2007
 
2.4%
1987
 
2.4%
1882
 
2.3%
1787
 
2.2%
Other values (594) 55152
67.3%
Latin
ValueCountFrequency (%)
M 38
14.5%
R 34
13.0%
O 33
12.6%
n 13
 
5.0%
a 12
 
4.6%
E 12
 
4.6%
G 10
 
3.8%
H 10
 
3.8%
B 10
 
3.8%
p 10
 
3.8%
Other values (22) 80
30.5%
Common
ValueCountFrequency (%)
3758
44.3%
) 2312
27.3%
( 2277
26.9%
2 22
 
0.3%
1 18
 
0.2%
_ 15
 
0.2%
0 15
 
0.2%
5 14
 
0.2%
9 12
 
0.1%
& 12
 
0.1%
Other values (7) 21
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 80464
88.7%
ASCII 8738
 
9.6%
None 1492
 
1.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4706
 
5.8%
3957
 
4.9%
3688
 
4.6%
2421
 
3.0%
2202
 
2.7%
2167
 
2.7%
2007
 
2.5%
1987
 
2.5%
1882
 
2.3%
1787
 
2.2%
Other values (593) 53660
66.7%
ASCII
ValueCountFrequency (%)
3758
43.0%
) 2312
26.5%
( 2277
26.1%
M 38
 
0.4%
R 34
 
0.4%
O 33
 
0.4%
2 22
 
0.3%
1 18
 
0.2%
_ 15
 
0.2%
0 15
 
0.2%
Other values (39) 216
 
2.5%
None
ValueCountFrequency (%)
1492
100.0%

상품유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
제품
9299 
서비스
 
701

Length

Max length3
Median length2
Mean length2.0701
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row제품
2nd row제품
3rd row제품
4th row제품
5th row제품

Common Values

ValueCountFrequency (%)
제품 9299
93.0%
서비스 701
 
7.0%

Length

2024-04-30T07:28:12.105176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:28:12.195836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제품 9299
93.0%
서비스 701
 
7.0%

카테고리그룹
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
식품
2316 
생활/취미
1342 
가구/홈데코
1221 
패션/잡화
918 
사무/교육
797 
Other values (20)
3406 

Length

Max length8
Median length7
Mean length4.3587
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식품
2nd row사무/교육
3rd row사무/교육
4th row가구/홈데코
5th row가구/홈데코

Common Values

ValueCountFrequency (%)
식품 2316
23.2%
생활/취미 1342
13.4%
가구/홈데코 1221
12.2%
패션/잡화 918
 
9.2%
사무/교육 797
 
8.0%
출산/육아 697
 
7.0%
컴퓨터/주변기기 473
 
4.7%
기타 315
 
3.1%
기계/전기/소방 285
 
2.9%
식물류 281
 
2.8%
Other values (15) 1355
13.6%

Length

2024-04-30T07:28:12.288098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
식품 2316
23.2%
생활/취미 1342
13.4%
가구/홈데코 1221
12.2%
패션/잡화 918
 
9.2%
사무/교육 797
 
8.0%
출산/육아 697
 
7.0%
컴퓨터/주변기기 473
 
4.7%
기타 315
 
3.1%
기계/전기/소방 285
 
2.9%
식물류 281
 
2.8%
Other values (15) 1355
13.6%
Distinct78
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T07:28:12.504524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length4.6909
Min length2

Characters and Unicode

Total characters46909
Distinct characters155
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row가공식품
2nd row교육용품
3rd row교육용품
4th row홈데코
5th row홈데코
ValueCountFrequency (%)
가공식품 1424
 
14.2%
서재가구 873
 
8.7%
잡화 679
 
6.8%
생활/위생용품 608
 
6.1%
농수산물/정육 601
 
6.0%
문구/사무용품 507
 
5.1%
주방용품 340
 
3.4%
건강식품 291
 
2.9%
교육용품 283
 
2.8%
취미용품 255
 
2.5%
Other values (70) 4154
41.5%
2024-04-30T07:28:12.838988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4294
 
9.2%
/ 3749
 
8.0%
2558
 
5.5%
2401
 
5.1%
1784
 
3.8%
1761
 
3.8%
1516
 
3.2%
1236
 
2.6%
1230
 
2.6%
1025
 
2.2%
Other values (145) 25355
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 43091
91.9%
Other Punctuation 3749
 
8.0%
Uppercase Letter 24
 
0.1%
Space Separator 15
 
< 0.1%
Close Punctuation 15
 
< 0.1%
Open Punctuation 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4294
 
10.0%
2558
 
5.9%
2401
 
5.6%
1784
 
4.1%
1761
 
4.1%
1516
 
3.5%
1236
 
2.9%
1230
 
2.9%
1025
 
2.4%
993
 
2.3%
Other values (138) 24293
56.4%
Uppercase Letter
ValueCountFrequency (%)
D 8
33.3%
I 8
33.3%
Y 8
33.3%
Other Punctuation
ValueCountFrequency (%)
/ 3749
100.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 43091
91.9%
Common 3794
 
8.1%
Latin 24
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4294
 
10.0%
2558
 
5.9%
2401
 
5.6%
1784
 
4.1%
1761
 
4.1%
1516
 
3.5%
1236
 
2.9%
1230
 
2.9%
1025
 
2.4%
993
 
2.3%
Other values (138) 24293
56.4%
Common
ValueCountFrequency (%)
/ 3749
98.8%
15
 
0.4%
) 15
 
0.4%
( 15
 
0.4%
Latin
ValueCountFrequency (%)
D 8
33.3%
I 8
33.3%
Y 8
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 43091
91.9%
ASCII 3818
 
8.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4294
 
10.0%
2558
 
5.9%
2401
 
5.6%
1784
 
4.1%
1761
 
4.1%
1516
 
3.5%
1236
 
2.9%
1230
 
2.9%
1025
 
2.4%
993
 
2.3%
Other values (138) 24293
56.4%
ASCII
ValueCountFrequency (%)
/ 3749
98.2%
15
 
0.4%
) 15
 
0.4%
( 15
 
0.4%
D 8
 
0.2%
I 8
 
0.2%
Y 8
 
0.2%
Distinct168
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T07:28:13.057260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length5.5925
Min length2

Characters and Unicode

Total characters55925
Distinct characters229
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st row기타 가공식품
2nd row교육용품/교구
3rd row교육용품/교구
4th row인테리어소품
5th row커튼/블라인드
ValueCountFrequency (%)
기타 1905
 
15.6%
사무가구 561
 
4.6%
가공식품 379
 
3.1%
패션소품 378
 
3.1%
커피/음료 339
 
2.8%
생활/위생용품 312
 
2.5%
서재가구 308
 
2.5%
가방 267
 
2.2%
관엽류 247
 
2.0%
청소/방역 222
 
1.8%
Other values (172) 7332
59.9%
2024-04-30T07:28:13.434977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 4906
 
8.8%
3127
 
5.6%
2702
 
4.8%
2250
 
4.0%
1979
 
3.5%
1919
 
3.4%
1889
 
3.4%
1804
 
3.2%
1201
 
2.1%
1177
 
2.1%
Other values (219) 32971
59.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 48680
87.0%
Other Punctuation 4906
 
8.8%
Space Separator 2250
 
4.0%
Uppercase Letter 59
 
0.1%
Close Punctuation 15
 
< 0.1%
Open Punctuation 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3127
 
6.4%
2702
 
5.6%
1979
 
4.1%
1919
 
3.9%
1889
 
3.9%
1804
 
3.7%
1201
 
2.5%
1177
 
2.4%
1016
 
2.1%
874
 
1.8%
Other values (209) 30992
63.7%
Uppercase Letter
ValueCountFrequency (%)
D 22
37.3%
I 8
 
13.6%
Y 8
 
13.6%
V 7
 
11.9%
W 7
 
11.9%
S 7
 
11.9%
Other Punctuation
ValueCountFrequency (%)
/ 4906
100.0%
Space Separator
ValueCountFrequency (%)
2250
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 48680
87.0%
Common 7186
 
12.8%
Latin 59
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3127
 
6.4%
2702
 
5.6%
1979
 
4.1%
1919
 
3.9%
1889
 
3.9%
1804
 
3.7%
1201
 
2.5%
1177
 
2.4%
1016
 
2.1%
874
 
1.8%
Other values (209) 30992
63.7%
Latin
ValueCountFrequency (%)
D 22
37.3%
I 8
 
13.6%
Y 8
 
13.6%
V 7
 
11.9%
W 7
 
11.9%
S 7
 
11.9%
Common
ValueCountFrequency (%)
/ 4906
68.3%
2250
31.3%
) 15
 
0.2%
( 15
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 48680
87.0%
ASCII 7245
 
13.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 4906
67.7%
2250
31.1%
D 22
 
0.3%
) 15
 
0.2%
( 15
 
0.2%
I 8
 
0.1%
Y 8
 
0.1%
V 7
 
0.1%
W 7
 
0.1%
S 7
 
0.1%
Hangul
ValueCountFrequency (%)
3127
 
6.4%
2702
 
5.6%
1979
 
4.1%
1919
 
3.9%
1889
 
3.9%
1804
 
3.7%
1201
 
2.5%
1177
 
2.4%
1016
 
2.1%
874
 
1.8%
Other values (209) 30992
63.7%
Distinct9138
Distinct (%)91.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-30T07:28:13.751689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length96
Median length73
Mean length21.111
Min length2

Characters and Unicode

Total characters211110
Distinct characters1219
Distinct categories17 ?
Distinct scripts4 ?
Distinct blocks9 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8335 ?
Unique (%)83.4%

Sample

1st row삼사미가 산삼주
2nd row역사북아트 동학농민혁명 팝업북 사회적경제기업
3rd row내맘대로 썸머 플레이 3종 세트
4th row1인 손발도장 석고 만들기 DIY 키트
5th row가리개커튼(가로형)_린넨그레이
ValueCountFrequency (%)
698
 
1.8%
선물세트 349
 
0.9%
세트 209
 
0.6%
시리즈 173
 
0.5%
넥타이 172
 
0.5%
의자 168
 
0.4%
친환경 156
 
0.4%
초록마당 155
 
0.4%
기타화초 155
 
0.4%
다년초 141
 
0.4%
Other values (15077) 35587
93.7%
2024-04-30T07:28:14.204986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28585
 
13.5%
0 5960
 
2.8%
1 3354
 
1.6%
2955
 
1.4%
2751
 
1.3%
2684
 
1.3%
( 2670
 
1.3%
2 2648
 
1.3%
) 2626
 
1.2%
- 2571
 
1.2%
Other values (1209) 154306
73.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 123291
58.4%
Space Separator 28585
 
13.5%
Decimal Number 19747
 
9.4%
Uppercase Letter 14348
 
6.8%
Lowercase Letter 6038
 
2.9%
Other Punctuation 4975
 
2.4%
Open Punctuation 4858
 
2.3%
Close Punctuation 4815
 
2.3%
Dash Punctuation 2571
 
1.2%
Connector Punctuation 1167
 
0.6%
Other values (7) 715
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2955
 
2.4%
2751
 
2.2%
2684
 
2.2%
2216
 
1.8%
1991
 
1.6%
1640
 
1.3%
1387
 
1.1%
1286
 
1.0%
1264
 
1.0%
1167
 
0.9%
Other values (1103) 103950
84.3%
Uppercase Letter
ValueCountFrequency (%)
A 1220
 
8.5%
C 1168
 
8.1%
D 1061
 
7.4%
S 1027
 
7.2%
I 931
 
6.5%
L 891
 
6.2%
R 820
 
5.7%
M 815
 
5.7%
P 674
 
4.7%
E 614
 
4.3%
Other values (17) 5127
35.7%
Lowercase Letter
ValueCountFrequency (%)
g 1138
18.8%
m 1002
16.6%
l 572
9.5%
c 451
 
7.5%
e 338
 
5.6%
o 300
 
5.0%
k 292
 
4.8%
i 281
 
4.7%
t 274
 
4.5%
a 221
 
3.7%
Other values (16) 1169
19.4%
Other Punctuation
ValueCountFrequency (%)
, 2195
44.1%
/ 1427
28.7%
. 395
 
7.9%
& 208
 
4.2%
% 184
 
3.7%
* 183
 
3.7%
; 144
 
2.9%
: 76
 
1.5%
# 73
 
1.5%
! 54
 
1.1%
Other values (5) 36
 
0.7%
Decimal Number
ValueCountFrequency (%)
0 5960
30.2%
1 3354
17.0%
2 2648
13.4%
5 2032
 
10.3%
3 1667
 
8.4%
4 1361
 
6.9%
6 878
 
4.4%
7 716
 
3.6%
8 709
 
3.6%
9 422
 
2.1%
Math Symbol
ValueCountFrequency (%)
+ 298
47.8%
~ 254
40.8%
× 52
 
8.3%
> 8
 
1.3%
< 8
 
1.3%
= 2
 
0.3%
| 1
 
0.2%
Letter Number
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
1
12.5%
1
12.5%
Open Punctuation
ValueCountFrequency (%)
( 2670
55.0%
[ 2172
44.7%
14
 
0.3%
2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 2626
54.5%
] 2173
45.1%
14
 
0.3%
2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
28585
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2571
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1167
100.0%
Control
ValueCountFrequency (%)
70
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 4
100.0%
Initial Punctuation
ValueCountFrequency (%)
1
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 123205
58.4%
Common 67425
31.9%
Latin 20394
 
9.7%
Han 86
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2955
 
2.4%
2751
 
2.2%
2684
 
2.2%
2216
 
1.8%
1991
 
1.6%
1640
 
1.3%
1387
 
1.1%
1286
 
1.0%
1264
 
1.0%
1167
 
0.9%
Other values (1079) 103864
84.3%
Latin
ValueCountFrequency (%)
A 1220
 
6.0%
C 1168
 
5.7%
g 1138
 
5.6%
D 1061
 
5.2%
S 1027
 
5.0%
m 1002
 
4.9%
I 931
 
4.6%
L 891
 
4.4%
R 820
 
4.0%
M 815
 
4.0%
Other values (48) 10321
50.6%
Common
ValueCountFrequency (%)
28585
42.4%
0 5960
 
8.8%
1 3354
 
5.0%
( 2670
 
4.0%
2 2648
 
3.9%
) 2626
 
3.9%
- 2571
 
3.8%
, 2195
 
3.3%
] 2173
 
3.2%
[ 2172
 
3.2%
Other values (38) 12471
18.5%
Han
ValueCountFrequency (%)
38
44.2%
8
 
9.3%
4
 
4.7%
4
 
4.7%
3
 
3.5%
3
 
3.5%
3
 
3.5%
3
 
3.5%
2
 
2.3%
2
 
2.3%
Other values (14) 16
18.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 123197
58.4%
ASCII 87686
41.5%
None 111
 
0.1%
CJK 84
 
< 0.1%
Misc Symbols 8
 
< 0.1%
Compat Jamo 8
 
< 0.1%
Number Forms 8
 
< 0.1%
Punctuation 6
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28585
32.6%
0 5960
 
6.8%
1 3354
 
3.8%
( 2670
 
3.0%
2 2648
 
3.0%
) 2626
 
3.0%
- 2571
 
2.9%
, 2195
 
2.5%
] 2173
 
2.5%
[ 2172
 
2.5%
Other values (79) 32732
37.3%
Hangul
ValueCountFrequency (%)
2955
 
2.4%
2751
 
2.2%
2684
 
2.2%
2216
 
1.8%
1991
 
1.6%
1640
 
1.3%
1387
 
1.1%
1286
 
1.0%
1264
 
1.0%
1167
 
0.9%
Other values (1074) 103856
84.3%
None
ValueCountFrequency (%)
× 52
46.8%
· 21
18.9%
14
 
12.6%
14
 
12.6%
Ø 4
 
3.6%
2
 
1.8%
2
 
1.8%
2
 
1.8%
CJK
ValueCountFrequency (%)
38
45.2%
8
 
9.5%
4
 
4.8%
4
 
4.8%
3
 
3.6%
3
 
3.6%
3
 
3.6%
3
 
3.6%
2
 
2.4%
2
 
2.4%
Other values (13) 14
 
16.7%
Misc Symbols
ValueCountFrequency (%)
8
100.0%
Punctuation
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
Compat Jamo
ValueCountFrequency (%)
4
50.0%
1
 
12.5%
1
 
12.5%
1
 
12.5%
1
 
12.5%
Number Forms
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
1
12.5%
1
12.5%
CJK Compat Ideographs
ValueCountFrequency (%)
2
100.0%

판매단위
Text

MISSING 

Distinct1173
Distinct (%)15.0%
Missing2181
Missing (%)21.8%
Memory size156.2 KiB
2024-04-30T07:28:14.446298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length2
Mean length4.1697148
Min length1

Characters and Unicode

Total characters32603
Distinct characters373
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique755 ?
Unique (%)9.7%

Sample

1st row1ea
2nd row없음
3rd row1세트
4th row1개
5th row1박스(4개입)
ValueCountFrequency (%)
1개 2766
29.8%
ea 612
 
6.6%
1 350
 
3.8%
없음 338
 
3.6%
1박스 316
 
3.4%
1세트 306
 
3.3%
1대 234
 
2.5%
1본 140
 
1.5%
1ea 117
 
1.3%
105
 
1.1%
Other values (1237) 3989
43.0%
2024-04-30T07:28:14.804658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 7000
21.5%
3851
 
11.8%
0 2272
 
7.0%
1513
 
4.6%
( 1449
 
4.4%
) 1446
 
4.4%
1156
 
3.5%
1134
 
3.5%
864
 
2.7%
831
 
2.5%
Other values (363) 11087
34.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12296
37.7%
Decimal Number 11828
36.3%
Uppercase Letter 1742
 
5.3%
Space Separator 1513
 
4.6%
Open Punctuation 1455
 
4.5%
Close Punctuation 1452
 
4.5%
Lowercase Letter 1319
 
4.0%
Other Punctuation 891
 
2.7%
Math Symbol 58
 
0.2%
Other Symbol 22
 
0.1%
Other values (3) 27
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3851
31.3%
1156
 
9.4%
1134
 
9.2%
864
 
7.0%
831
 
6.8%
349
 
2.8%
339
 
2.8%
257
 
2.1%
217
 
1.8%
211
 
1.7%
Other values (303) 3087
25.1%
Lowercase Letter
ValueCountFrequency (%)
g 548
41.5%
m 199
 
15.1%
x 131
 
9.9%
k 117
 
8.9%
l 108
 
8.2%
e 63
 
4.8%
a 56
 
4.2%
c 38
 
2.9%
s 13
 
1.0%
b 13
 
1.0%
Other values (7) 33
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
E 771
44.3%
A 741
42.5%
T 65
 
3.7%
X 38
 
2.2%
S 35
 
2.0%
B 26
 
1.5%
O 25
 
1.4%
L 17
 
1.0%
M 7
 
0.4%
K 5
 
0.3%
Other values (5) 12
 
0.7%
Decimal Number
ValueCountFrequency (%)
1 7000
59.2%
0 2272
 
19.2%
2 738
 
6.2%
5 696
 
5.9%
3 494
 
4.2%
4 279
 
2.4%
6 136
 
1.1%
8 103
 
0.9%
7 81
 
0.7%
9 29
 
0.2%
Other Punctuation
ValueCountFrequency (%)
, 507
56.9%
/ 198
 
22.2%
* 161
 
18.1%
. 23
 
2.6%
: 2
 
0.2%
Math Symbol
ValueCountFrequency (%)
+ 38
65.5%
~ 11
 
19.0%
= 6
 
10.3%
× 3
 
5.2%
Open Punctuation
ValueCountFrequency (%)
( 1449
99.6%
[ 6
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 1446
99.6%
] 6
 
0.4%
Space Separator
ValueCountFrequency (%)
1513
100.0%
Other Symbol
ValueCountFrequency (%)
22
100.0%
Letter Number
ValueCountFrequency (%)
12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 17234
52.9%
Hangul 12295
37.7%
Latin 3073
 
9.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3851
31.3%
1156
 
9.4%
1134
 
9.2%
864
 
7.0%
831
 
6.8%
349
 
2.8%
339
 
2.8%
257
 
2.1%
217
 
1.8%
211
 
1.7%
Other values (302) 3086
25.1%
Latin
ValueCountFrequency (%)
E 771
25.1%
A 741
24.1%
g 548
17.8%
m 199
 
6.5%
x 131
 
4.3%
k 117
 
3.8%
l 108
 
3.5%
T 65
 
2.1%
e 63
 
2.1%
a 56
 
1.8%
Other values (23) 274
 
8.9%
Common
ValueCountFrequency (%)
1 7000
40.6%
0 2272
 
13.2%
1513
 
8.8%
( 1449
 
8.4%
) 1446
 
8.4%
2 738
 
4.3%
5 696
 
4.0%
, 507
 
2.9%
3 494
 
2.9%
4 279
 
1.6%
Other values (17) 840
 
4.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20270
62.2%
Hangul 12294
37.7%
CJK Compat 22
 
0.1%
Number Forms 12
 
< 0.1%
None 3
 
< 0.1%
CJK 1
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 7000
34.5%
0 2272
 
11.2%
1513
 
7.5%
( 1449
 
7.1%
) 1446
 
7.1%
E 771
 
3.8%
A 741
 
3.7%
2 738
 
3.6%
5 696
 
3.4%
g 548
 
2.7%
Other values (47) 3096
15.3%
Hangul
ValueCountFrequency (%)
3851
31.3%
1156
 
9.4%
1134
 
9.2%
864
 
7.0%
831
 
6.8%
349
 
2.8%
339
 
2.8%
257
 
2.1%
217
 
1.8%
211
 
1.7%
Other values (301) 3085
25.1%
CJK Compat
ValueCountFrequency (%)
22
100.0%
Number Forms
ValueCountFrequency (%)
12
100.0%
None
ValueCountFrequency (%)
× 3
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

판매가
Real number (ℝ)

SKEWED  ZEROS 

Distinct1495
Distinct (%)14.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean383994.48
Minimum0
Maximum1 × 108
Zeros155
Zeros (%)1.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-30T07:28:14.936299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile100
Q18500
median24000
Q372000
95-th percentile650000
Maximum1 × 108
Range1 × 108
Interquartile range (IQR)63500

Descriptive statistics

Standard deviation4283837.4
Coefficient of variation (CV)11.155987
Kurtosis469.20117
Mean383994.48
Median Absolute Deviation (MAD)19500
Skewness20.920477
Sum3.8399448 × 109
Variance1.8351263 × 1013
MonotonicityNot monotonic
2024-04-30T07:28:15.066780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 248
 
2.5%
100 188
 
1.9%
15000 176
 
1.8%
10000 160
 
1.6%
0 155
 
1.6%
25000 141
 
1.4%
12000 131
 
1.3%
20000 124
 
1.2%
100000 123
 
1.2%
35000 122
 
1.2%
Other values (1485) 8432
84.3%
ValueCountFrequency (%)
0 155
1.6%
10 248
2.5%
11 1
 
< 0.1%
20 2
 
< 0.1%
100 188
1.9%
110 2
 
< 0.1%
150 1
 
< 0.1%
165 1
 
< 0.1%
180 1
 
< 0.1%
205 1
 
< 0.1%
ValueCountFrequency (%)
100000000 12
0.1%
99999999 1
 
< 0.1%
99100000 2
 
< 0.1%
98000000 1
 
< 0.1%
55000000 1
 
< 0.1%
49500000 1
 
< 0.1%
44406250 1
 
< 0.1%
40176400 1
 
< 0.1%
36058000 1
 
< 0.1%
30492000 1
 
< 0.1%

기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-23
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-04-23
2nd row2024-04-23
3rd row2024-04-23
4th row2024-04-23
5th row2024-04-23

Common Values

ValueCountFrequency (%)
2024-04-23 10000
100.0%

Length

2024-04-30T07:28:15.183885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-30T07:28:15.265711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-04-23 10000
100.0%

Interactions

2024-04-30T07:28:10.897503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-30T07:28:15.318552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상품유형카테고리그룹1차카테고리판매가
상품유형1.0001.0001.0000.032
카테고리그룹1.0001.0001.0000.320
1차카테고리1.0001.0001.0000.415
판매가0.0320.3200.4151.000
2024-04-30T07:28:15.404618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
카테고리그룹상품유형
카테고리그룹1.0000.999
상품유형0.9991.000
2024-04-30T07:28:15.478430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
판매가상품유형카테고리그룹
판매가1.0000.0350.142
상품유형0.0351.0000.999
카테고리그룹0.1420.9991.000

Missing values

2024-04-30T07:28:11.108683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-30T07:28:11.270884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기업명상품유형카테고리그룹1차카테고리2차카테고리상품명판매단위판매가기준일
10137발효명가협동조합제품식품가공식품기타 가공식품삼사미가 산삼주1ea850002024-04-23
18949주식회사매직북스제품사무/교육교육용품교육용품/교구역사북아트 동학농민혁명 팝업북 사회적경제기업없음39902024-04-23
19050주식회사메이커스핸즈제품사무/교육교육용품교육용품/교구내맘대로 썸머 플레이 3종 세트1세트350002024-04-23
14823영농조합법인 에듀팜제품가구/홈데코홈데코인테리어소품1인 손발도장 석고 만들기 DIY 키트<NA>100002024-04-23
12956아이밍키제품가구/홈데코홈데코커튼/블라인드가리개커튼(가로형)_린넨그레이1개79002024-04-23
15914제천인삼약초영농조합법인제품식품건강식품기타 건강식품고려홍삼정과GOLD(20g*4pc)1박스(4개입)160002024-04-23
14588아트액세서리 협동조합제품패션/잡화액세서리주얼리핑크 진주 은 팔찌1개310002024-04-23
20308파라서 주식회사제품패션/잡화잡화패션소품[스마트톡-단비1]보건소 치매파트너 단비 스마트톡 핸드폰거치톡 주문제작디자인 대량,소량 가격차등<NA>19802024-04-23
7684농업회사법인 디엠제트드림푸드주식회사제품식품가공식품과자/간식[DMZ] 오늘콩 초콜릿150g, 콩볶음180g 선물세트<NA>252002024-04-23
5723㈜소윤컴퍼니제품사무/교육문구/사무용품인쇄물배너없음02024-04-23
기업명상품유형카테고리그룹1차카테고리2차카테고리상품명판매단위판매가기준일
19069주식회사메이커스핸즈제품사무/교육교육용품교육용품/교구언택트 시리즈- 내 인생의 이정표10개1683002024-04-23
94(사)담쟁이제품생활/취미생활/위생용품수납/정리용품아이엠 천연 탈취제 40ml1개20002024-04-23
6931갓피플주식회사서비스환경청소/방역청소/방역건물소독<NA>102024-04-23
1435(주)마음챙김여행백락투어서비스여행서비스국내여행국내여행강원도 인제 자작나무 숲과 싱잉볼명상 여행 (당일) (4인이상소그룹만 신청가능)<NA>900002024-04-23
12535슬로푸드 주식회사 농업회사법인제품식품가공식품커피/음료생강품은 도라지배즙 100ml * 30포30포250002024-04-23
19785초록마당영농조합법인제품식물류화환관엽류기타화초, 초록마당, DY-081, 다년초, 레몬타임, 초장5~30cm, 8cm포트1본6002024-04-23
14975예쁜손공예협동조합제품가구/홈데코홈데코인테리어소품큰부엉이가족 부엉이소품 부엉이인테리어 부엉이인형 부엉이인형만들기 DIY패키지1개180002024-04-23
10803사단법인 우리들행복 나눔종합가구사업단제품가구/홈데코서재가구서재가구의자 시리즈 127J-11개850002024-04-23
20999한국안심기프트사회적협동조합제품식품농수산물/정육지역특산품[면세]명품 사각 청송사과 선물셋트 1박스 13-14과(5k)1박스480002024-04-23
7502논산발그래일터제품생활/취미주방용품기타 주방용품발그래 폴리실 동그리 수세미<NA>20002024-04-23

Duplicate rows

Most frequently occurring

기업명상품유형카테고리그룹1차카테고리2차카테고리상품명판매단위판매가기준일# duplicates
42(주)오티비컴퍼니서비스문화행사/전시행사/전시행사운영 및 인력대행<NA>102024-04-238
41(주)오티비컴퍼니서비스문화행사/전시행사/전시행사기획 및 운영대행<NA>102024-04-236
273입점사테스트제품공정무역상품패션/잡화패션/잡화[추석테스트] 테스트 1<NA>11552024-04-235
275입점사테스트제품공정무역상품패션/잡화패션/잡화[추석테스트] 테스트 3<NA>26252024-04-234
202사단법인 우리들행복 나눔종합가구사업단제품가구/홈데코서재가구서재가구의자 시리즈 203AZ1개550002024-04-233
225사회적협동조합 우리누리제품사무/교육문구/사무용품파일/바인더정부결재판-a4/검정/회색/진녹색/낱개판매1개이상25002024-04-233
274입점사테스트제품공정무역상품패션/잡화패션/잡화[추석테스트] 테스트 2<NA>26252024-04-233
276입점사테스트제품공정무역상품패션/잡화패션/잡화추석테스트1<NA>11552024-04-233
311카리타스보호작업장제품생활/취미욕실용품화장지포카포카 꽃무늬 두루마리1팩(30롤)129002024-04-233
328플라워이음_(사)글로벌투게더경산제품식물류화환관엽류화분식물(관엽)1개500002024-04-233