Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells391
Missing cells (%)0.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory546.9 KiB
Average record size in memory56.0 B

Variable types

Text5
Categorical1

Dataset

Description파주시에 등록된 통신판매업 현황 정보로서 업체명, 통신판매관리번호, 소재지 도로명주소, 인터넷(홈페이지) 도메인, 취급품목 등의 데이터를 제공합니다.
Author경기도 파주시
URLhttps://www.data.go.kr/data/15124692/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
인터넷(홈페이지) 도메인 has 246 (2.5%) missing valuesMissing
취급품목 has 145 (1.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 23:15:41.704217
Analysis finished2023-12-12 23:15:43.790220
Duration2.09 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9880
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:15:44.129319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length39
Mean length6.7915
Min length1

Characters and Unicode

Total characters67915
Distinct characters1096
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9771 ?
Unique (%)97.7%

Sample

1st row올런
2nd row주식회사 힐스기계
3rd row쇼핑쎄씨
4th row으니유리
5th row나눔 연구소
ValueCountFrequency (%)
주식회사 1045
 
8.0%
145
 
1.1%
co 37
 
0.3%
ltd 37
 
0.3%
컴퍼니 33
 
0.3%
company 27
 
0.2%
농업회사법인 26
 
0.2%
26
 
0.2%
도서출판 20
 
0.2%
디자인 19
 
0.1%
Other values (10852) 11700
89.2%
2023-12-13T08:15:44.766666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3151
 
4.6%
2453
 
3.6%
2098
 
3.1%
1810
 
2.7%
) 1644
 
2.4%
( 1640
 
2.4%
1476
 
2.2%
1201
 
1.8%
1174
 
1.7%
1154
 
1.7%
Other values (1086) 50114
73.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 50821
74.8%
Lowercase Letter 5005
 
7.4%
Uppercase Letter 4583
 
6.7%
Space Separator 3151
 
4.6%
Close Punctuation 1647
 
2.4%
Open Punctuation 1643
 
2.4%
Decimal Number 637
 
0.9%
Other Punctuation 316
 
0.5%
Other Symbol 51
 
0.1%
Dash Punctuation 48
 
0.1%
Other values (2) 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2453
 
4.8%
2098
 
4.1%
1810
 
3.6%
1476
 
2.9%
1201
 
2.4%
1174
 
2.3%
1154
 
2.3%
839
 
1.7%
724
 
1.4%
630
 
1.2%
Other values (1007) 37262
73.3%
Lowercase Letter
ValueCountFrequency (%)
o 610
12.2%
e 573
11.4%
a 414
 
8.3%
n 404
 
8.1%
i 350
 
7.0%
r 321
 
6.4%
t 284
 
5.7%
s 274
 
5.5%
l 259
 
5.2%
m 180
 
3.6%
Other values (16) 1336
26.7%
Uppercase Letter
ValueCountFrequency (%)
A 365
 
8.0%
O 352
 
7.7%
S 308
 
6.7%
E 291
 
6.3%
C 269
 
5.9%
L 257
 
5.6%
T 254
 
5.5%
N 248
 
5.4%
M 246
 
5.4%
I 229
 
5.0%
Other values (16) 1764
38.5%
Decimal Number
ValueCountFrequency (%)
1 129
20.3%
2 101
15.9%
3 78
12.2%
0 71
11.1%
9 49
 
7.7%
4 46
 
7.2%
5 45
 
7.1%
8 43
 
6.8%
7 40
 
6.3%
6 35
 
5.5%
Other Punctuation
ValueCountFrequency (%)
. 189
59.8%
& 90
28.5%
' 17
 
5.4%
! 5
 
1.6%
/ 5
 
1.6%
: 5
 
1.6%
# 4
 
1.3%
* 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 1644
99.8%
] 3
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 1640
99.8%
[ 3
 
0.2%
Space Separator
ValueCountFrequency (%)
3151
100.0%
Other Symbol
ValueCountFrequency (%)
51
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 48
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 12
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 50860
74.9%
Latin 9588
 
14.1%
Common 7455
 
11.0%
Han 11
 
< 0.1%
Hiragana 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2453
 
4.8%
2098
 
4.1%
1810
 
3.6%
1476
 
2.9%
1201
 
2.4%
1174
 
2.3%
1154
 
2.3%
839
 
1.6%
724
 
1.4%
630
 
1.2%
Other values (997) 37301
73.3%
Latin
ValueCountFrequency (%)
o 610
 
6.4%
e 573
 
6.0%
a 414
 
4.3%
n 404
 
4.2%
A 365
 
3.8%
O 352
 
3.7%
i 350
 
3.7%
r 321
 
3.3%
S 308
 
3.2%
E 291
 
3.0%
Other values (42) 5600
58.4%
Common
ValueCountFrequency (%)
3151
42.3%
) 1644
22.1%
( 1640
22.0%
. 189
 
2.5%
1 129
 
1.7%
2 101
 
1.4%
& 90
 
1.2%
3 78
 
1.0%
0 71
 
1.0%
9 49
 
0.7%
Other values (16) 313
 
4.2%
Han
ValueCountFrequency (%)
2
18.2%
槿 1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
1
9.1%
Hiragana
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 50809
74.8%
ASCII 17043
 
25.1%
None 51
 
0.1%
CJK 10
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%
Hiragana 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3151
18.5%
) 1644
 
9.6%
( 1640
 
9.6%
o 610
 
3.6%
e 573
 
3.4%
a 414
 
2.4%
n 404
 
2.4%
A 365
 
2.1%
O 352
 
2.1%
i 350
 
2.1%
Other values (68) 7540
44.2%
Hangul
ValueCountFrequency (%)
2453
 
4.8%
2098
 
4.1%
1810
 
3.6%
1476
 
2.9%
1201
 
2.4%
1174
 
2.3%
1154
 
2.3%
839
 
1.7%
724
 
1.4%
630
 
1.2%
Other values (996) 37250
73.3%
None
ValueCountFrequency (%)
51
100.0%
CJK
ValueCountFrequency (%)
2
20.0%
槿 1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Hiragana
ValueCountFrequency (%)
1
100.0%
Distinct9972
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:15:45.056461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length13.8213
Min length4

Characters and Unicode

Total characters138213
Distinct characters20
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9944 ?
Unique (%)99.4%

Sample

1st row2022-경기파주-1135
2nd row2022-경기파주-3715
3rd row2010-경기파주-2601
4th row2011-경기파주-2936
5th row2022-경기파주-3763
ValueCountFrequency (%)
2022-경기 4
 
< 0.1%
2017-경기파주-0450 2
 
< 0.1%
2020-경기파주-2060 2
 
< 0.1%
2016-경기파주-0976 2
 
< 0.1%
2021-경기파주-2199 2
 
< 0.1%
2016-경기파주-0105 2
 
< 0.1%
2020-경기파주-1785 2
 
< 0.1%
2013-경기파주-4763 2
 
< 0.1%
2020-경기파주-2910 2
 
< 0.1%
2021-경기파주-1962 2
 
< 0.1%
Other values (9963) 9982
99.8%
2023-12-13T08:15:45.462087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 22634
16.4%
- 19676
14.2%
0 17717
12.8%
1 10733
7.8%
9847
7.1%
9847
7.1%
9752
7.1%
9752
7.1%
3 6550
 
4.7%
5 3735
 
2.7%
Other values (10) 17970
13.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 79327
57.4%
Other Letter 39206
28.4%
Dash Punctuation 19676
 
14.2%
Space Separator 4
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 22634
28.5%
0 17717
22.3%
1 10733
13.5%
3 6550
 
8.3%
5 3735
 
4.7%
6 3721
 
4.7%
4 3699
 
4.7%
9 3635
 
4.6%
8 3454
 
4.4%
7 3449
 
4.3%
Other Letter
ValueCountFrequency (%)
9847
25.1%
9847
25.1%
9752
24.9%
9752
24.9%
3
 
< 0.1%
3
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 19676
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 99007
71.6%
Hangul 39206
 
28.4%

Most frequent character per script

Common
ValueCountFrequency (%)
2 22634
22.9%
- 19676
19.9%
0 17717
17.9%
1 10733
10.8%
3 6550
 
6.6%
5 3735
 
3.8%
6 3721
 
3.8%
4 3699
 
3.7%
9 3635
 
3.7%
8 3454
 
3.5%
Other values (2) 3453
 
3.5%
Hangul
ValueCountFrequency (%)
9847
25.1%
9847
25.1%
9752
24.9%
9752
24.9%
3
 
< 0.1%
3
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 99007
71.6%
Hangul 39206
 
28.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 22634
22.9%
- 19676
19.9%
0 17717
17.9%
1 10733
10.8%
3 6550
 
6.6%
5 3735
 
3.8%
6 3721
 
3.8%
4 3699
 
3.7%
9 3635
 
3.7%
8 3454
 
3.5%
Other values (2) 3453
 
3.5%
Hangul
ValueCountFrequency (%)
9847
25.1%
9847
25.1%
9752
24.9%
9752
24.9%
3
 
< 0.1%
3
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%
Distinct9409
Distinct (%)94.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T08:15:45.727820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length52
Mean length34.1146
Min length15

Characters and Unicode

Total characters341146
Distinct characters573
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9049 ?
Unique (%)90.5%

Sample

1st row경기도 파주시 심학산로 384, 128동 2003호 (동패동, 운정신도시 IPARK)
2nd row경기도 파주시 파평면 율곡로 1390-1, 3동
3rd row경기도 파주시 중앙로 280, 101동 505호 (아동동,장안초원아파트)
4th row경기도 파주시 조리읍 봉일천리 156번지 성원아파트 101동 901호
5th row경기도 파주시 교하로159번길 33, 목동프라자 3층 304호 (목동동)
ValueCountFrequency (%)
파주시 10003
 
14.3%
경기도 10000
 
14.3%
동패동 681
 
1.0%
조리읍 638
 
0.9%
탄현면 625
 
0.9%
청석로 571
 
0.8%
문산읍 567
 
0.8%
1층 564
 
0.8%
동패동, 563
 
0.8%
야당동 543
 
0.8%
Other values (7434) 45252
64.6%
2023-12-13T08:15:46.143419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
60082
 
17.6%
1 16464
 
4.8%
15309
 
4.5%
0 12562
 
3.7%
11904
 
3.5%
10616
 
3.1%
10606
 
3.1%
10352
 
3.0%
10305
 
3.0%
10270
 
3.0%
Other values (563) 172676
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 178769
52.4%
Decimal Number 71930
21.1%
Space Separator 60082
 
17.6%
Other Punctuation 10164
 
3.0%
Open Punctuation 7442
 
2.2%
Close Punctuation 7442
 
2.2%
Dash Punctuation 3640
 
1.1%
Uppercase Letter 1553
 
0.5%
Lowercase Letter 84
 
< 0.1%
Letter Number 21
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
15309
 
8.6%
11904
 
6.7%
10616
 
5.9%
10606
 
5.9%
10352
 
5.8%
10305
 
5.8%
10270
 
5.7%
7194
 
4.0%
6457
 
3.6%
3910
 
2.2%
Other values (500) 81846
45.8%
Uppercase Letter
ValueCountFrequency (%)
A 439
28.3%
B 308
19.8%
F 130
 
8.4%
R 94
 
6.1%
P 92
 
5.9%
I 78
 
5.0%
K 76
 
4.9%
E 67
 
4.3%
C 66
 
4.2%
D 43
 
2.8%
Other values (13) 160
 
10.3%
Lowercase Letter
ValueCountFrequency (%)
e 27
32.1%
a 16
19.0%
b 13
15.5%
c 11
13.1%
o 3
 
3.6%
s 2
 
2.4%
p 2
 
2.4%
h 2
 
2.4%
f 2
 
2.4%
k 1
 
1.2%
Other values (5) 5
 
6.0%
Decimal Number
ValueCountFrequency (%)
1 16464
22.9%
0 12562
17.5%
2 9900
13.8%
3 6934
9.6%
4 6020
 
8.4%
5 5353
 
7.4%
7 4803
 
6.7%
6 3970
 
5.5%
8 3063
 
4.3%
9 2861
 
4.0%
Other Punctuation
ValueCountFrequency (%)
10123
99.6%
. 15
 
0.1%
@ 12
 
0.1%
& 11
 
0.1%
/ 2
 
< 0.1%
# 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
19
90.5%
1
 
4.8%
1
 
4.8%
Math Symbol
ValueCountFrequency (%)
~ 18
94.7%
= 1
 
5.3%
Space Separator
ValueCountFrequency (%)
60082
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7442
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7442
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3640
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 178768
52.4%
Common 160719
47.1%
Latin 1658
 
0.5%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
15309
 
8.6%
11904
 
6.7%
10616
 
5.9%
10606
 
5.9%
10352
 
5.8%
10305
 
5.8%
10270
 
5.7%
7194
 
4.0%
6457
 
3.6%
3910
 
2.2%
Other values (499) 81845
45.8%
Latin
ValueCountFrequency (%)
A 439
26.5%
B 308
18.6%
F 130
 
7.8%
R 94
 
5.7%
P 92
 
5.5%
I 78
 
4.7%
K 76
 
4.6%
E 67
 
4.0%
C 66
 
4.0%
D 43
 
2.6%
Other values (31) 265
16.0%
Common
ValueCountFrequency (%)
60082
37.4%
1 16464
 
10.2%
0 12562
 
7.8%
10123
 
6.3%
2 9900
 
6.2%
( 7442
 
4.6%
) 7442
 
4.6%
3 6934
 
4.3%
4 6020
 
3.7%
5 5353
 
3.3%
Other values (12) 18397
 
11.4%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 178768
52.4%
ASCII 152233
44.6%
None 10123
 
3.0%
Number Forms 21
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
60082
39.5%
1 16464
 
10.8%
0 12562
 
8.3%
2 9900
 
6.5%
( 7442
 
4.9%
) 7442
 
4.9%
3 6934
 
4.6%
4 6020
 
4.0%
5 5353
 
3.5%
7 4803
 
3.2%
Other values (49) 15231
 
10.0%
Hangul
ValueCountFrequency (%)
15309
 
8.6%
11904
 
6.7%
10616
 
5.9%
10606
 
5.9%
10352
 
5.8%
10305
 
5.8%
10270
 
5.7%
7194
 
4.0%
6457
 
3.6%
3910
 
2.2%
Other values (499) 81845
45.8%
None
ValueCountFrequency (%)
10123
100.0%
Number Forms
ValueCountFrequency (%)
19
90.5%
1
 
4.8%
1
 
4.8%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct5711
Distinct (%)58.6%
Missing246
Missing (%)2.5%
Memory size156.2 KiB
2023-12-13T08:15:46.282409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length229
Median length92
Mean length17.843346
Min length2

Characters and Unicode

Total characters174044
Distinct characters504
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5546 ?
Unique (%)56.9%

Sample

1st rowhttps://smartstore.naver.com/checkbody
2nd rowhttps://smartstore.naver.com/hillsmachine
3rd rowwww.mejiya.com
4th rowG마켓.11번가.옥션
5th rowhttps://sell.smartstore.naver.com/#/seller/info
ValueCountFrequency (%)
네이버 1215
 
9.6%
쿠팡 750
 
5.9%
스토어 733
 
5.8%
스마트 716
 
5.6%
스마트스토어 696
 
5.5%
오픈마켓 651
 
5.1%
네이버스마트스토어 414
 
3.3%
옥션 309
 
2.4%
11번가 219
 
1.7%
g마켓 202
 
1.6%
Other values (5586) 6807
53.5%
2023-12-13T08:15:46.537381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 11629
 
6.7%
t 11480
 
6.6%
. 11390
 
6.5%
r 9762
 
5.6%
s 8890
 
5.1%
a 8508
 
4.9%
/ 8497
 
4.9%
e 8464
 
4.9%
m 7971
 
4.6%
w 6528
 
3.8%
Other values (494) 80925
46.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 113984
65.5%
Other Letter 28582
 
16.4%
Other Punctuation 22666
 
13.0%
Decimal Number 3426
 
2.0%
Space Separator 3043
 
1.7%
Uppercase Letter 1273
 
0.7%
Connector Punctuation 491
 
0.3%
Dash Punctuation 292
 
0.2%
Close Punctuation 130
 
0.1%
Open Punctuation 130
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4121
14.4%
3175
11.1%
2151
 
7.5%
2150
 
7.5%
2006
 
7.0%
1899
 
6.6%
1897
 
6.6%
1896
 
6.6%
1281
 
4.5%
828
 
2.9%
Other values (413) 7178
25.1%
Lowercase Letter
ValueCountFrequency (%)
o 11629
10.2%
t 11480
 
10.1%
r 9762
 
8.6%
s 8890
 
7.8%
a 8508
 
7.5%
e 8464
 
7.4%
m 7971
 
7.0%
w 6528
 
5.7%
c 6303
 
5.5%
n 5416
 
4.8%
Other values (16) 29033
25.5%
Uppercase Letter
ValueCountFrequency (%)
G 283
22.2%
A 69
 
5.4%
N 58
 
4.6%
C 56
 
4.4%
B 55
 
4.3%
M 54
 
4.2%
D 51
 
4.0%
E 51
 
4.0%
S 46
 
3.6%
K 44
 
3.5%
Other values (16) 506
39.7%
Other Punctuation
ValueCountFrequency (%)
. 11390
50.3%
/ 8497
37.5%
: 2608
 
11.5%
% 52
 
0.2%
@ 29
 
0.1%
# 21
 
0.1%
; 19
 
0.1%
" 17
 
0.1%
? 15
 
0.1%
& 14
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 1131
33.0%
2 447
 
13.0%
0 355
 
10.4%
4 271
 
7.9%
3 241
 
7.0%
7 236
 
6.9%
9 217
 
6.3%
8 195
 
5.7%
5 182
 
5.3%
6 151
 
4.4%
Close Punctuation
ValueCountFrequency (%)
) 128
98.5%
] 2
 
1.5%
Open Punctuation
ValueCountFrequency (%)
( 128
98.5%
[ 2
 
1.5%
Space Separator
ValueCountFrequency (%)
3043
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 491
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 292
100.0%
Math Symbol
ValueCountFrequency (%)
= 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 115257
66.2%
Common 30205
 
17.4%
Hangul 28582
 
16.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4121
14.4%
3175
11.1%
2151
 
7.5%
2150
 
7.5%
2006
 
7.0%
1899
 
6.6%
1897
 
6.6%
1896
 
6.6%
1281
 
4.5%
828
 
2.9%
Other values (413) 7178
25.1%
Latin
ValueCountFrequency (%)
o 11629
 
10.1%
t 11480
 
10.0%
r 9762
 
8.5%
s 8890
 
7.7%
a 8508
 
7.4%
e 8464
 
7.3%
m 7971
 
6.9%
w 6528
 
5.7%
c 6303
 
5.5%
n 5416
 
4.7%
Other values (42) 30306
26.3%
Common
ValueCountFrequency (%)
. 11390
37.7%
/ 8497
28.1%
3043
 
10.1%
: 2608
 
8.6%
1 1131
 
3.7%
_ 491
 
1.6%
2 447
 
1.5%
0 355
 
1.2%
- 292
 
1.0%
4 271
 
0.9%
Other values (19) 1680
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 145462
83.6%
Hangul 28582
 
16.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 11629
 
8.0%
t 11480
 
7.9%
. 11390
 
7.8%
r 9762
 
6.7%
s 8890
 
6.1%
a 8508
 
5.8%
/ 8497
 
5.8%
e 8464
 
5.8%
m 7971
 
5.5%
w 6528
 
4.5%
Other values (71) 52343
36.0%
Hangul
ValueCountFrequency (%)
4121
14.4%
3175
11.1%
2151
 
7.5%
2150
 
7.5%
2006
 
7.0%
1899
 
6.6%
1897
 
6.6%
1896
 
6.6%
1281
 
4.5%
828
 
2.9%
Other values (413) 7178
25.1%

취급품목
Text

MISSING 

Distinct538
Distinct (%)5.5%
Missing145
Missing (%)1.5%
Memory size156.2 KiB
2023-12-13T08:15:46.669851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length87
Median length84
Mean length8.8561136
Min length2

Characters and Unicode

Total characters87277
Distinct characters50
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique344 ?
Unique (%)3.5%

Sample

1st row종합몰 가구/수납용품 의류/패션/잡화/뷰티 가전 기타 건강/식품
2nd row기타
3rd row의류/패션/잡화/뷰티
4th row종합몰
5th row종합몰
ValueCountFrequency (%)
종합몰 4138
29.2%
의류/패션/잡화/뷰티 2810
19.9%
기타 2217
15.7%
건강/식품 1090
 
7.7%
교육/도서/완구/오락 942
 
6.7%
가구/수납용품 937
 
6.6%
컴퓨터/사무용품 582
 
4.1%
가전 540
 
3.8%
자동차/자동차용품 402
 
2.8%
레져/여행/공연 355
 
2.5%
Other values (2) 143
 
1.0%
2023-12-13T08:15:46.911777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 15033
 
17.2%
4301
 
4.9%
4138
 
4.7%
4138
 
4.7%
4138
 
4.7%
3154
 
3.6%
2810
 
3.2%
2810
 
3.2%
2810
 
3.2%
2810
 
3.2%
Other values (40) 41135
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 67943
77.8%
Other Punctuation 15033
 
17.2%
Space Separator 4301
 
4.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4138
 
6.1%
4138
 
6.1%
4138
 
6.1%
3154
 
4.6%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
Other values (38) 35515
52.3%
Other Punctuation
ValueCountFrequency (%)
/ 15033
100.0%
Space Separator
ValueCountFrequency (%)
4301
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 67943
77.8%
Common 19334
 
22.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4138
 
6.1%
4138
 
6.1%
4138
 
6.1%
3154
 
4.6%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
Other values (38) 35515
52.3%
Common
ValueCountFrequency (%)
/ 15033
77.8%
4301
 
22.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 67943
77.8%
ASCII 19334
 
22.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 15033
77.8%
4301
 
22.2%
Hangul
ValueCountFrequency (%)
4138
 
6.1%
4138
 
6.1%
4138
 
6.1%
3154
 
4.6%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
2810
 
4.1%
Other values (38) 35515
52.3%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-10-26
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-26
2nd row2023-10-26
3rd row2023-10-26
4th row2023-10-26
5th row2023-10-26

Common Values

ValueCountFrequency (%)
2023-10-26 10000
100.0%

Length

2023-12-13T08:15:47.207303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:15:47.276397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-26 10000
100.0%

Missing values

2023-12-13T08:15:43.424841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:15:43.572006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T08:15:43.706857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체명통신판매관리번호소재지 도로명주소인터넷(홈페이지) 도메인취급품목데이터기준일자
6527올런2022-경기파주-1135경기도 파주시 심학산로 384, 128동 2003호 (동패동, 운정신도시 IPARK)https://smartstore.naver.com/checkbody종합몰 가구/수납용품 의류/패션/잡화/뷰티 가전 기타 건강/식품2023-10-26
4339주식회사 힐스기계2022-경기파주-3715경기도 파주시 파평면 율곡로 1390-1, 3동https://smartstore.naver.com/hillsmachine기타2023-10-26
20111쇼핑쎄씨2010-경기파주-2601경기도 파주시 중앙로 280, 101동 505호 (아동동,장안초원아파트)www.mejiya.com의류/패션/잡화/뷰티2023-10-26
19952으니유리2011-경기파주-2936경기도 파주시 조리읍 봉일천리 156번지 성원아파트 101동 901호G마켓.11번가.옥션종합몰2023-10-26
4296나눔 연구소2022-경기파주-3763경기도 파주시 교하로159번길 33, 목동프라자 3층 304호 (목동동)https://sell.smartstore.naver.com/#/seller/info종합몰2023-10-26
13122매직인터내셔널2019-경기파주-1434경기도 파주시 가온로 245, 1007동 1702호 (와동동, 가람마을 10단지 동양엔파트 월드메르디앙)www.magicfilter.co.kr기타2023-10-26
6741이억조몰2022-경기파주-0878경기도 파주시 파평면 청송로432번길 47-2쿠팡종합몰2023-10-26
7903안초비스페이스2021-경기파주-2790경기도 파주시 광탄면 기산로186번길 66-11네이버 스마트 스토어가구/수납용품2023-10-26
15746KW코스메틱2017-경기파주-0692경기도 파주시 문산읍 문향로75번길 22, 4층 1호 (문산어린이집)http://storefarm.naver.om/yongprefume의류/패션/잡화/뷰티2023-10-26
9253레인보우2021-경기파주-1078경기도 파주시 탄현면 헤이리마을길 93-44, 운경재 2층네이버건강/식품 기타2023-10-26
업체명통신판매관리번호소재지 도로명주소인터넷(홈페이지) 도메인취급품목데이터기준일자
10489슬로우라이프2020-경기파주-2642경기도 파주시 월롱면 외도감길 24https://smartstore.naver.com/believe-buy종합몰2023-10-26
6711신남부부2022-경기파주-0898경기도 파주시 교하로159번길 33, 3층 304호 A413 (목동동)네이버스마트스토어종합몰2023-10-26
13336젠틀남2019-경기파주-1064경기도 파주시 재두루미길 70, 210호 (신촌동)오픈마켓의류/패션/잡화/뷰티2023-10-26
16831에뛰드하우스 금릉점2016-경기파주-0405경기도 파주시 금촌동 989번지 5호 굿닥터스빌딩 105호www.etude.co.kr의류/패션/잡화/뷰티2023-10-26
6317은별자리2022-경기파주-1382경기도 파주시 중앙로 226, 101동 4층 1호 (금촌동, 동현아파트)네이버 스마트 스토어종합몰2023-10-26
12611(주)러그홀릭2019-경기파주-2120경기도 파주시 하우고개길 172 (야당동)스마트스토어기타2023-10-26
6545글로리플레이스2022-경기파주-1117경기도 파주시 교하로 70, 306동 1703호 (목동동, 산내마을3단지)네이버스마트스토어종합몰2023-10-26
969울랄라마켓132023-경기파주-3369경기도 파주시 고봉로 755-27, 가동 B01층 E19호 (상지석동)스마트스토어종합몰2023-10-26
18348아토샵2014-경기파주-5430경기도 파주시 미래로 602, 212동 901호 (와동동,휴먼시아 가람마을)오픈마켓건강/식품2023-10-26
9311오네(one)2021-경기파주-1004경기도 파주시 동패로 100, 306동 1803호 (동패동, 한울마을 3단지)http://smartstore.naver.com/oney의류/패션/잡화/뷰티2023-10-26