Overview

Dataset statistics

Number of variables: 8
Number of observations: 3295
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 209.3 KiB
Average record size in memory: 65.0 B
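The average record size follows from the total memory size and the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes (the variable names are illustrative only):

```python
# Figures copied from the dataset statistics.
total_size_kib = 209.3
n_observations = 3295

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"{avg_record_bytes:.1f} B")
```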

Variable types

Numeric: 1
Text: 5
Categorical: 2

Dataset

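Description: Data on mail-order business (통신판매업) registrations in Buk-gu, Busan Metropolitan City, providing fields such as the registration number, representative name, and corporation or business name.
Author: Buk-gu, Busan Metropolitan City (부산광역시 북구)
URL: https://www.data.go.kr/data/15062064/fileData.do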

Alerts

법인구분 is highly imbalanced (76.8%) [Imbalance]
판매방식 is highly imbalanced (87.4%) [Imbalance]
번호 has unique values [Unique]
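The 76.8% figure for 법인구분 is consistent with an entropy-based imbalance score, 1 - H / log2(k), where H is the Shannon entropy of the class shares and k is the number of classes. That this is how the report derives the figure is an assumption, checked here only against the 법인구분 class counts (3070, 221, 4) listed in this report; the function name is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H/log2(k), with H the Shannon entropy (in bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k <= 1:
        return 0.0  # a single class has no meaningful imbalance in this sketch
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Class counts for 법인구분 as listed in this report: 개인, 법인, 미기재.
score = imbalance_score([3070, 221, 4])
print(f"{score:.1%}")
```

A score of 0 would mean perfectly balanced classes and a score near 1 means one class dominates.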

Reproduction

Analysis started: 2024-03-30 08:39:21.665528
Analysis finished: 2024-03-30 08:39:26.432957
Duration: 4.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
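The reported duration can be checked from the two timestamps. A small sketch using only the standard library; the format string is an assumption based on how the timestamps are printed here:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # assumed format of the timestamps shown in this report
started = datetime.strptime("2024-03-30 08:39:21.665528", FMT)
finished = datetime.strptime("2024-03-30 08:39:26.432957", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```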

Variables

번호 (number)
Real number (ℝ)

UNIQUE 

Distinct: 3295
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1648.0067
Minimum: 1
Maximum: 3297
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 29.1 KiB

Quantile statistics

Minimum: 1
5th percentile: 165.7
Q1: 824.5
Median: 1648
Q3: 2471.5
95th percentile: 3130.3
Maximum: 3297
Range: 3296
Interquartile range (IQR): 1647

Descriptive statistics

Standard deviation: 951.34043
Coefficient of variation (CV): 0.57726734
Kurtosis: -1.1999425
Mean: 1648.0067
Median Absolute Deviation (MAD): 824
Skewness: 4.1789242 × 10^-5
Sum: 5430182
Variance: 905048.61
Monotonicity: Strictly increasing
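Several of the figures for 번호 are derived from others, which gives a quick consistency check. The sketch below recomputes the coefficient of variation, variance, interquartile range, and range from the reported values; the variable names are illustrative.

```python
# Values copied from the statistics reported for 번호.
mean = 1648.0067
std = 951.34043
q1, q3 = 824.5, 2471.5
minimum, maximum = 1, 3297

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

print(round(cv, 5), round(variance, 1), iqr, value_range)
```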
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2202 | 1 | < 0.1%
2192 | 1 | < 0.1%
2193 | 1 | < 0.1%
2194 | 1 | < 0.1%
2195 | 1 | < 0.1%
2196 | 1 | < 0.1%
2197 | 1 | < 0.1%
2198 | 1 | < 0.1%
2199 | 1 | < 0.1%
Other values (3285) | 3285 | 99.7%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
3297 | 1 | < 0.1%
3296 | 1 | < 0.1%
3295 | 1 | < 0.1%
3294 | 1 | < 0.1%
3293 | 1 | < 0.1%
3292 | 1 | < 0.1%
3291 | 1 | < 0.1%
3290 | 1 | < 0.1%
3289 | 1 | < 0.1%
3288 | 1 | < 0.1%

관리번호 (management number)
Text

Distinct: 3290
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Memory size: 25.9 KiB

Length

Max length: 15
Median length: 14
Mean length: 13.981791
Min length: 7

Characters and Unicode

Total characters: 46070
Distinct characters: 17
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3285
Unique (%): 99.7%

Sample

1st row: 2024-부산북구-0413
2nd row: 2024-부산북구-0412
3rd row: 2024-부산북구-0411
4th row: 2024-부산북구-0410
5th row: 2024-부산북구-0409
Value | Count | Frequency (%)
2021-부산북구-0581 | 2 | 0.1%
2020-부산북구-0161 | 2 | 0.1%
2023-부산북구-0542 | 2 | 0.1%
북구 | 2 | 0.1%
2022-부산북구-0822 | 2 | 0.1%
2021-부산북구-0195 | 2 | 0.1%
2021-부산북구-0347 | 1 | < 0.1%
2021-부산북구-0009 | 1 | < 0.1%
2021-부산북구-0004 | 1 | < 0.1%
2021-부산북구-0008 | 1 | < 0.1%
Other values (3281) | 3281 | 99.5%
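The identifiers listed with a count of 2 suggest that a few registration identifiers occur more than once. A minimal sketch of how such repeats could be listed with `collections.Counter`; the short input list is illustrative and reuses a few identifiers shown in this report.

```python
from collections import Counter

def duplicated(values):
    """Return a dict of values that occur more than once, with their counts."""
    counts = Counter(values)
    return {value: n for value, n in counts.items() if n > 1}

# Illustrative input only.
ids = [
    "2021-부산북구-0581", "2021-부산북구-0581",
    "2020-부산북구-0161", "2020-부산북구-0161",
    "2021-부산북구-0347",
]
print(duplicated(ids))
```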

Most occurring characters

Value | Count | Frequency (%)
0 | 7978 | 17.3%
2 | 7609 | 16.5%
- | 6577 | 14.3%
(not preserved) | 3285 | 7.1%
(not preserved) | 3285 | 7.1%
(not preserved) | 3283 | 7.1%
(not preserved) | 3283 | 7.1%
1 | 2705 | 5.9%
3 | 1853 | 4.0%
4 | 1362 | 3.0%
Other values (7) | 4850 | 10.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 26354 | 57.2%
Other Letter | 13137 | 28.5%
Dash Punctuation | 6577 | 14.3%
Space Separator | 2 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 7978 | 30.3%
2 | 7609 | 28.9%
1 | 2705 | 10.3%
3 | 1853 | 7.0%
4 | 1362 | 5.2%
8 | 1031 | 3.9%
7 | 986 | 3.7%
9 | 974 | 3.7%
6 | 957 | 3.6%
5 | 899 | 3.4%
Other Letter
Value | Count | Frequency (%)
(not preserved) | 3285 | 25.0%
(not preserved) | 3285 | 25.0%
(not preserved) | 3283 | 25.0%
(not preserved) | 3283 | 25.0%
(not preserved) | 1 | < 0.1%
Dash Punctuation
Value | Count | Frequency (%)
- | 6577 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 32933 | 71.5%
Hangul | 13137 | 28.5%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 7978 | 24.2%
2 | 7609 | 23.1%
- | 6577 | 20.0%
1 | 2705 | 8.2%
3 | 1853 | 5.6%
4 | 1362 | 4.1%
8 | 1031 | 3.1%
7 | 986 | 3.0%
9 | 974 | 3.0%
6 | 957 | 2.9%
Other values (2) | 901 | 2.7%
Hangul
Value | Count | Frequency (%)
(not preserved) | 3285 | 25.0%
(not preserved) | 3285 | 25.0%
(not preserved) | 3283 | 25.0%
(not preserved) | 3283 | 25.0%
(not preserved) | 1 | < 0.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 32933 | 71.5%
Hangul | 13137 | 28.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 7978 | 24.2%
2 | 7609 | 23.1%
- | 6577 | 20.0%
1 | 2705 | 8.2%
3 | 1853 | 5.6%
4 | 1362 | 4.1%
8 | 1031 | 3.1%
7 | 986 | 3.0%
9 | 974 | 3.0%
6 | 957 | 2.9%
Other values (2) | 901 | 2.7%
Hangul
Value | Count | Frequency (%)
(not preserved) | 3285 | 25.0%
(not preserved) | 3285 | 25.0%
(not preserved) | 3283 | 25.0%
(not preserved) | 3283 | 25.0%
(not preserved) | 1 | < 0.1%

상호 (business name)
Text

Distinct: 3270
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 25.9 KiB

Length

Max length: 40
Median length: 31
Mean length: 6.0288316
Min length: 1

Characters and Unicode

Total characters: 19865
Distinct characters: 872
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 4

Unique

Unique: 3246
Unique (%): 98.5%

Sample

1st row: 아정당 이사
2nd row: 페이글로
3rd row: 210별상점1
4th row: 네빈
5th row: 겜방24
Value | Count | Frequency (%)
주식회사 | 120 | 3.1%
컴퍼니 | 9 | 0.2%
(not preserved) | 8 | 0.2%
덕천점 | 7 | 0.2%
화명점 | 7 | 0.2%
co | 6 | 0.2%
인셀덤 | 6 | 0.2%
(not preserved) | 6 | 0.2%
company | 5 | 0.1%
ltd | 5 | 0.1%
Other values (3612) | 3717 | 95.4%

Most occurring characters

Value | Count | Frequency (%)
(not preserved) | 735 | 3.7%
(space) | 608 | 3.1%
(not preserved) | 588 | 3.0%
) | 428 | 2.2%
( | 427 | 2.1%
(not preserved) | 365 | 1.8%
(not preserved) | 237 | 1.2%
(not preserved) | 233 | 1.2%
(not preserved) | 232 | 1.2%
(not preserved) | 228 | 1.1%
Other values (862) | 15784 | 79.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 15403 | 77.5%
Lowercase Letter | 1407 | 7.1%
Uppercase Letter | 1273 | 6.4%
Space Separator | 608 | 3.1%
Close Punctuation | 429 | 2.2%
Open Punctuation | 428 | 2.2%
Decimal Number | 227 | 1.1%
Other Punctuation | 69 | 0.3%
Dash Punctuation | 15 | 0.1%
Other Symbol | 3 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 735 | 4.8%
(not preserved) | 588 | 3.8%
(not preserved) | 365 | 2.4%
(not preserved) | 237 | 1.5%
(not preserved) | 233 | 1.5%
(not preserved) | 232 | 1.5%
(not preserved) | 228 | 1.5%
(not preserved) | 225 | 1.5%
(not preserved) | 221 | 1.4%
(not preserved) | 210 | 1.4%
Other values (786) | 12129 | 78.7%
Lowercase Letter
Value | Count | Frequency (%)
e | 182 | 12.9%
o | 159 | 11.3%
a | 119 | 8.5%
n | 104 | 7.4%
i | 94 | 6.7%
r | 86 | 6.1%
t | 83 | 5.9%
l | 78 | 5.5%
m | 56 | 4.0%
s | 52 | 3.7%
Other values (16) | 394 | 28.0%
Uppercase Letter
Value | Count | Frequency (%)
E | 100 | 7.9%
O | 100 | 7.9%
A | 97 | 7.6%
S | 81 | 6.4%
N | 81 | 6.4%
R | 70 | 5.5%
M | 69 | 5.4%
T | 69 | 5.4%
I | 67 | 5.3%
L | 64 | 5.0%
Other values (16) | 475 | 37.3%
Decimal Number
Value | Count | Frequency (%)
1 | 48 | 21.1%
2 | 35 | 15.4%
3 | 28 | 12.3%
0 | 25 | 11.0%
4 | 23 | 10.1%
5 | 22 | 9.7%
6 | 16 | 7.0%
8 | 12 | 5.3%
9 | 10 | 4.4%
7 | 8 | 3.5%
Other Punctuation
Value | Count | Frequency (%)
. | 33 | 47.8%
& | 22 | 31.9%
' | 8 | 11.6%
(not preserved) | 2 | 2.9%
· | 2 | 2.9%
: | 2 | 2.9%
Close Punctuation
Value | Count | Frequency (%)
) | 428 | 99.8%
] | 1 | 0.2%
Open Punctuation
Value | Count | Frequency (%)
( | 427 | 99.8%
[ | 1 | 0.2%
Space Separator
Value | Count | Frequency (%)
(space) | 608 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 15 | 100.0%
Other Symbol
Value | Count | Frequency (%)
(not preserved) | 3 | 100.0%
Connector Punctuation
Value | Count | Frequency (%)
_ | 3 | 100.0%

Most occurring scripts

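Value | Count | Frequency (%)
Hangul | 15402 | 77.5%
Latin | 2680 | 13.5%
Common | 1779 | 9.0%
Han | 4 | < 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 735 | 4.8%
(not preserved) | 588 | 3.8%
(not preserved) | 365 | 2.4%
(not preserved) | 237 | 1.5%
(not preserved) | 233 | 1.5%
(not preserved) | 232 | 1.5%
(not preserved) | 228 | 1.5%
(not preserved) | 225 | 1.5%
(not preserved) | 221 | 1.4%
(not preserved) | 210 | 1.4%
Other values (783) | 12128 | 78.7%
Latin
Value | Count | Frequency (%)
e | 182 | 6.8%
o | 159 | 5.9%
a | 119 | 4.4%
n | 104 | 3.9%
E | 100 | 3.7%
O | 100 | 3.7%
A | 97 | 3.6%
i | 94 | 3.5%
r | 86 | 3.2%
t | 83 | 3.1%
Other values (42) | 1556 | 58.1%
Common
Value | Count | Frequency (%)
(space) | 608 | 34.2%
) | 428 | 24.1%
( | 427 | 24.0%
1 | 48 | 2.7%
2 | 35 | 2.0%
. | 33 | 1.9%
3 | 28 | 1.6%
0 | 25 | 1.4%
4 | 23 | 1.3%
& | 22 | 1.2%
Other values (13) | 102 | 5.7%
Han
Value | Count | Frequency (%)
(not preserved) | 1 | 25.0%
(not preserved) | 1 | 25.0%
(not preserved) | 1 | 25.0%
(not preserved) | 1 | 25.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 15399 | 77.5%
ASCII | 4455 | 22.4%
None | 7 | < 0.1%
CJK | 4 | < 0.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not preserved) | 735 | 4.8%
(not preserved) | 588 | 3.8%
(not preserved) | 365 | 2.4%
(not preserved) | 237 | 1.5%
(not preserved) | 233 | 1.5%
(not preserved) | 232 | 1.5%
(not preserved) | 228 | 1.5%
(not preserved) | 225 | 1.5%
(not preserved) | 221 | 1.4%
(not preserved) | 210 | 1.4%
Other values (782) | 12125 | 78.7%
ASCII
Value | Count | Frequency (%)
(space) | 608 | 13.6%
) | 428 | 9.6%
( | 427 | 9.6%
e | 182 | 4.1%
o | 159 | 3.6%
a | 119 | 2.7%
n | 104 | 2.3%
E | 100 | 2.2%
O | 100 | 2.2%
A | 97 | 2.2%
Other values (63) | 2131 | 47.8%
None
Value | Count | Frequency (%)
(not preserved) | 3 | 42.9%
(not preserved) | 2 | 28.6%
· | 2 | 28.6%
CJK
Value | Count | Frequency (%)
(not preserved) | 1 | 25.0%
(not preserved) | 1 | 25.0%
(not preserved) | 1 | 25.0%
(not preserved) | 1 | 25.0%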

대표자명 (representative name)
Text

Distinct: 2854
Distinct (%): 86.6%
Missing: 0
Missing (%): 0.0%
Memory size: 25.9 KiB

Length

Max length: 16
Median length: 3
Mean length: 3.047041
Min length: 2

Characters and Unicode

Total characters: 10040
Distinct characters: 284
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3

Unique

Unique: 2545
Unique (%): 77.2%

Sample

1st row: 김민기
2nd row: 정민수
3rd row: 김지현
4th row: 최신희
5th row: 문성진
Value | Count | Frequency (%)
최병국 | 17 | 0.5%
김지현 | 7 | 0.2%
정지영 | 6 | 0.2%
정혜정 | 6 | 0.2%
이창민 | 5 | 0.2%
김지영 | 5 | 0.2%
김경민 | 5 | 0.2%
김건우 | 5 | 0.2%
김도형 | 5 | 0.2%
김선영 | 4 | 0.1%
Other values (2853) | 3241 | 98.0%

Most occurring characters

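Value | Count | Frequency (%)
(not preserved) | 691 | 6.9%
(not preserved) | 473 | 4.7%
(not preserved) | 449 | 4.5%
(not preserved) | 315 | 3.1%
(not preserved) | 292 | 2.9%
(not preserved) | 259 | 2.6%
(not preserved) | 238 | 2.4%
(not preserved) | 235 | 2.3%
(not preserved) | 217 | 2.2%
(not preserved) | 197 | 2.0%
Other values (274) | 6674 | 66.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9941 | 99.0%
Uppercase Letter | 53 | 0.5%
Other Punctuation | 22 | 0.2%
Space Separator | 11 | 0.1%
Decimal Number | 5 | < 0.1%
Open Punctuation | 4 | < 0.1%
Close Punctuation | 4 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 691 | 7.0%
(not preserved) | 473 | 4.8%
(not preserved) | 449 | 4.5%
(not preserved) | 315 | 3.2%
(not preserved) | 292 | 2.9%
(not preserved) | 259 | 2.6%
(not preserved) | 238 | 2.4%
(not preserved) | 235 | 2.4%
(not preserved) | 217 | 2.2%
(not preserved) | 197 | 2.0%
Other values (252) | 6575 | 66.1%
Uppercase Letter
Value | Count | Frequency (%)
I | 13 | 24.5%
N | 7 | 13.2%
A | 4 | 7.5%
E | 4 | 7.5%
U | 4 | 7.5%
J | 3 | 5.7%
L | 3 | 5.7%
G | 3 | 5.7%
Y | 3 | 5.7%
Q | 2 | 3.8%
Other values (5) | 7 | 13.2%
Decimal Number
Value | Count | Frequency (%)
1 | 3 | 60.0%
2 | 1 | 20.0%
0 | 1 | 20.0%
Other Punctuation
Value | Count | Frequency (%)
(not preserved) | 22 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 11 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 4 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 4 | 100.0%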

Most occurring scripts

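Value | Count | Frequency (%)
Hangul | 9941 | 99.0%
Latin | 53 | 0.5%
Common | 46 | 0.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 691 | 7.0%
(not preserved) | 473 | 4.8%
(not preserved) | 449 | 4.5%
(not preserved) | 315 | 3.2%
(not preserved) | 292 | 2.9%
(not preserved) | 259 | 2.6%
(not preserved) | 238 | 2.4%
(not preserved) | 235 | 2.4%
(not preserved) | 217 | 2.2%
(not preserved) | 197 | 2.0%
Other values (252) | 6575 | 66.1%
Latin
Value | Count | Frequency (%)
I | 13 | 24.5%
N | 7 | 13.2%
A | 4 | 7.5%
E | 4 | 7.5%
U | 4 | 7.5%
J | 3 | 5.7%
L | 3 | 5.7%
G | 3 | 5.7%
Y | 3 | 5.7%
Q | 2 | 3.8%
Other values (5) | 7 | 13.2%
Common
Value | Count | Frequency (%)
(not preserved) | 22 | 47.8%
(space) | 11 | 23.9%
( | 4 | 8.7%
) | 4 | 8.7%
1 | 3 | 6.5%
2 | 1 | 2.2%
0 | 1 | 2.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9941 | 99.0%
ASCII | 77 | 0.8%
None | 22 | 0.2%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not preserved) | 691 | 7.0%
(not preserved) | 473 | 4.8%
(not preserved) | 449 | 4.5%
(not preserved) | 315 | 3.2%
(not preserved) | 292 | 2.9%
(not preserved) | 259 | 2.6%
(not preserved) | 238 | 2.4%
(not preserved) | 235 | 2.4%
(not preserved) | 217 | 2.2%
(not preserved) | 197 | 2.0%
Other values (252) | 6575 | 66.1%
None
Value | Count | Frequency (%)
(not preserved) | 22 | 100.0%
ASCII
Value | Count | Frequency (%)
I | 13 | 16.9%
(space) | 11 | 14.3%
N | 7 | 9.1%
( | 4 | 5.2%
) | 4 | 5.2%
A | 4 | 5.2%
E | 4 | 5.2%
U | 4 | 5.2%
J | 3 | 3.9%
L | 3 | 3.9%
Other values (11) | 20 | 26.0%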

법인구분 (corporate classification)
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 25.9 KiB
개인: 3070
법인: 221
미기재: 4

Length

Max length: 3
Median length: 2
Mean length: 2.001214
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 개인
4th row: 개인
5th row: 개인

Common Values

Value | Count | Frequency (%)
개인 (individual) | 3070 | 93.2%
법인 (corporation) | 221 | 6.7%
미기재 (not stated) | 4 | 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
개인 | 3070 | 93.2%
법인 | 221 | 6.7%
미기재 | 4 | 0.1%

판매방식 (sales method)
Categorical

IMBALANCE 

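Distinct: 27
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 25.9 KiB
인터넷: 3048
인터넷 기타: 121
기타: 35
인터넷 카다로그: 13
인터넷 카다로그 기타: 13
Other values (22): 65

Length

Max length: 22
Median length: 3
Mean length: 3.2889226
Min length: 2

Unique

Unique: 11
Unique (%): 0.3%

Sample

1st row: 인터넷
2nd row: 인터넷
3rd row: 인터넷
4th row: 인터넷
5th row: 인터넷

Common Values

Value | Count | Frequency (%)
인터넷 (internet) | 3048 | 92.5%
인터넷 기타 (internet, other) | 121 | 3.7%
기타 (other) | 35 | 1.1%
인터넷 카다로그 (internet, catalog) | 13 | 0.4%
인터넷 카다로그 기타 (internet, catalog, other) | 13 | 0.4%
TV홈쇼핑 (TV home shopping) | 12 | 0.4%
TV홈쇼핑 인터넷 (TV home shopping, internet) | 10 | 0.3%
인터넷 TV홈쇼핑 (internet, TV home shopping) | 7 | 0.2%
기타 인터넷 (other, internet) | 4 | 0.1%
인터넷 TV홈쇼핑 기타 (internet, TV home shopping, other) | 4 | 0.1%
Other values (17) | 28 | 0.8%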

Length

Histogram of lengths of the category
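Value | Count | Frequency (%)
인터넷 (internet) | 3242 | 91.5%
기타 (other) | 190 | 5.4%
tv홈쇼핑 (TV home shopping) | 47 | 1.3%
카다로그 (catalog) | 44 | 1.2%
신문잡지 (newspapers and magazines) | 16 | 0.5%
미기재 (not stated) | 3 | 0.1%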
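The per-word counts above differ from the Common Values counts because a single 판매방식 value can list several methods. A rough sketch of how word counts could be derived from value counts, assuming whitespace tokenization and lowercasing (suggested by the lowercase `tv홈쇼핑` in the table, but still an assumption). Only the ten values listed under Common Values are used, so the totals come out lower than the report's, which also include the 17 unlisted values.

```python
from collections import Counter

def token_counts(value_counts):
    """Aggregate per-value counts into per-token counts (lowercased, whitespace-split)."""
    tokens = Counter()
    for value, n in value_counts.items():
        for token in value.lower().split():
            tokens[token] += n
    return tokens

# The ten most common 판매방식 values and their counts, as listed in this report.
listed = {
    "인터넷": 3048, "인터넷 기타": 121, "기타": 35,
    "인터넷 카다로그": 13, "인터넷 카다로그 기타": 13,
    "TV홈쇼핑": 12, "TV홈쇼핑 인터넷": 10, "인터넷 TV홈쇼핑": 7,
    "기타 인터넷": 4, "인터넷 TV홈쇼핑 기타": 4,
}
tokens = token_counts(listed)
print(tokens.most_common())
```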

취급품목 (items handled)
Text

Distinct: 214
Distinct (%): 6.5%
Missing: 0
Missing (%): 0.0%
Memory size: 25.9 KiB

Length

Max length: 84
Median length: 80
Mean length: 8.5122914
Min length: 2

Characters and Unicode

Total characters: 28048
Distinct characters: 52
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 135
Unique (%): 4.1%

Sample

1st row: 기타
2nd row: 종합몰
3rd row: 종합몰
4th row: 종합몰
5th row: 종합몰
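Value | Count | Frequency (%)
종합몰 (general mall) | 1587 | 34.8%
의류/패션/잡화/뷰티 (clothing/fashion/accessories/beauty) | 1070 | 23.5%
기타 (other) | 566 | 12.4%
건강/식품 (health/food) | 402 | 8.8%
교육/도서/완구/오락 (education/books/toys/entertainment) | 198 | 4.3%
가구/수납용품 (furniture/storage goods) | 193 | 4.2%
컴퓨터/사무용품 (computers/office supplies) | 158 | 3.5%
가전 (home appliances) | 123 | 2.7%
레져/여행/공연 (leisure/travel/performances) | 116 | 2.5%
자동차/자동차용품 (cars/car accessories) | 109 | 2.4%
Other values (3) | 36 | 0.8%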

Most occurring characters

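Value | Count | Frequency (%)
/ | 4912 | 17.5%
(not preserved) | 1587 | 5.7%
(not preserved) | 1587 | 5.7%
(not preserved) | 1587 | 5.7%
(space) | 1263 | 4.5%
(not preserved) | 1070 | 3.8%
(not preserved) | 1070 | 3.8%
(not preserved) | 1070 | 3.8%
(not preserved) | 1070 | 3.8%
(not preserved) | 1070 | 3.8%
Other values (42) | 11762 | 41.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 21873 | 78.0%
Other Punctuation | 4912 | 17.5%
Space Separator | 1263 | 4.5%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 1587 | 7.3%
(not preserved) | 1587 | 7.3%
(not preserved) | 1587 | 7.3%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
Other values (40) | 9622 | 44.0%
Other Punctuation
Value | Count | Frequency (%)
/ | 4912 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 1263 | 100.0%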

Most occurring scripts

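Value | Count | Frequency (%)
Hangul | 21873 | 78.0%
Common | 6175 | 22.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 1587 | 7.3%
(not preserved) | 1587 | 7.3%
(not preserved) | 1587 | 7.3%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
Other values (40) | 9622 | 44.0%
Common
Value | Count | Frequency (%)
/ | 4912 | 79.5%
(space) | 1263 | 20.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 21873 | 78.0%
ASCII | 6175 | 22.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
/ | 4912 | 79.5%
(space) | 1263 | 20.5%
Hangul
Value | Count | Frequency (%)
(not preserved) | 1587 | 7.3%
(not preserved) | 1587 | 7.3%
(not preserved) | 1587 | 7.3%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
(not preserved) | 1070 | 4.9%
Other values (40) | 9622 | 44.0%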

도메인명 (domain name)
Text

Distinct: 1349
Distinct (%): 40.9%
Missing: 0
Missing (%): 0.0%
Memory size: 25.9 KiB

Length

Max length: 150
Median length: 59
Mean length: 13.972989
Min length: 2

Characters and Unicode

Total characters: 46041
Distinct characters: 313
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 1298
Unique (%): 39.4%

Sample

1st row: http://www.샘플.kr
2nd row: payglo
3rd row: 스마트스토어
4th row: 쿠팡 외 오픈마켓 전체
5th row: https://smartstore.naver.com/gameland24
Value | Count | Frequency (%)
미기재 (not stated) | 1081 | 29.7%
스마트스토어 | 296 | 8.1%
쿠팡 | 226 | 6.2%
네이버 | 155 | 4.3%
네이버스마트스토어 | 131 | 3.6%
옥션 | 64 | 1.8%
오픈마켓 | 57 | 1.6%
11번가 | 51 | 1.4%
g마켓 | 42 | 1.2%
smartstore.naver.com | 31 | 0.9%
Other values (1303) | 1500 | 41.3%

Most occurring characters

Value | Count | Frequency (%)
o | 3061 | 6.6%
t | 2967 | 6.4%
s | 2741 | 6.0%
r | 2670 | 5.8%
. | 2646 | 5.7%
a | 2368 | 5.1%
m | 2317 | 5.0%
e | 2314 | 5.0%
/ | 2067 | 4.5%
c | 1476 | 3.2%
Other values (303) | 21414 | 46.5%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 29585 | 64.3%
Other Letter | 8797 | 19.1%
Other Punctuation | 5324 | 11.6%
Decimal Number | 981 | 2.1%
Uppercase Letter | 641 | 1.4%
Space Separator | 347 | 0.8%
Connector Punctuation | 253 | 0.5%
Dash Punctuation | 73 | 0.2%
Close Punctuation | 16 | < 0.1%
Open Punctuation | 16 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 1083 | 12.3%
(not preserved) | 1082 | 12.3%
(not preserved) | 1081 | 12.3%
(not preserved) | 942 | 10.7%
(not preserved) | 586 | 6.7%
(not preserved) | 481 | 5.5%
(not preserved) | 479 | 5.4%
(not preserved) | 446 | 5.1%
(not preserved) | 355 | 4.0%
(not preserved) | 321 | 3.6%
Other values (226) | 1941 | 22.1%
Lowercase Letter
Value | Count | Frequency (%)
o | 3061 | 10.3%
t | 2967 | 10.0%
s | 2741 | 9.3%
r | 2670 | 9.0%
a | 2368 | 8.0%
m | 2317 | 7.8%
e | 2314 | 7.8%
c | 1476 | 5.0%
n | 1460 | 4.9%
p | 1070 | 3.6%
Other values (16) | 7141 | 24.1%
Uppercase Letter
Value | Count | Frequency (%)
G | 62 | 9.7%
N | 35 | 5.5%
E | 31 | 4.8%
A | 30 | 4.7%
J | 28 | 4.4%
C | 28 | 4.4%
T | 27 | 4.2%
V | 27 | 4.2%
P | 26 | 4.1%
R | 26 | 4.1%
Other values (16) | 321 | 50.1%
Decimal Number
Value | Count | Frequency (%)
1 | 274 | 27.9%
2 | 142 | 14.5%
0 | 106 | 10.8%
4 | 85 | 8.7%
8 | 73 | 7.4%
3 | 71 | 7.2%
9 | 64 | 6.5%
5 | 63 | 6.4%
6 | 52 | 5.3%
7 | 51 | 5.2%
Other Punctuation
Value | Count | Frequency (%)
. | 2646 | 49.7%
/ | 2067 | 38.8%
: | 562 | 10.6%
% | 28 | 0.5%
# | 10 | 0.2%
@ | 7 | 0.1%
? | 3 | 0.1%
& | 1 | < 0.1%
Math Symbol
Value | Count | Frequency (%)
= | 6 | 75.0%
> | 2 | 25.0%
Space Separator
Value | Count | Frequency (%)
(space) | 347 | 100.0%
Connector Punctuation
Value | Count | Frequency (%)
_ | 253 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 73 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 16 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 16 | 100.0%

Most occurring scripts

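Value | Count | Frequency (%)
Latin | 30226 | 65.7%
Hangul | 8797 | 19.1%
Common | 7018 | 15.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 1083 | 12.3%
(not preserved) | 1082 | 12.3%
(not preserved) | 1081 | 12.3%
(not preserved) | 942 | 10.7%
(not preserved) | 586 | 6.7%
(not preserved) | 481 | 5.5%
(not preserved) | 479 | 5.4%
(not preserved) | 446 | 5.1%
(not preserved) | 355 | 4.0%
(not preserved) | 321 | 3.6%
Other values (226) | 1941 | 22.1%
Latin
Value | Count | Frequency (%)
o | 3061 | 10.1%
t | 2967 | 9.8%
s | 2741 | 9.1%
r | 2670 | 8.8%
a | 2368 | 7.8%
m | 2317 | 7.7%
e | 2314 | 7.7%
c | 1476 | 4.9%
n | 1460 | 4.8%
p | 1070 | 3.5%
Other values (42) | 7782 | 25.7%
Common
Value | Count | Frequency (%)
. | 2646 | 37.7%
/ | 2067 | 29.5%
: | 562 | 8.0%
(space) | 347 | 4.9%
1 | 274 | 3.9%
_ | 253 | 3.6%
2 | 142 | 2.0%
0 | 106 | 1.5%
4 | 85 | 1.2%
- | 73 | 1.0%
Other values (15) | 463 | 6.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 37244 | 80.9%
Hangul | 8797 | 19.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
o | 3061 | 8.2%
t | 2967 | 8.0%
s | 2741 | 7.4%
r | 2670 | 7.2%
. | 2646 | 7.1%
a | 2368 | 6.4%
m | 2317 | 6.2%
e | 2314 | 6.2%
/ | 2067 | 5.5%
c | 1476 | 4.0%
Other values (67) | 12617 | 33.9%
Hangul
Value | Count | Frequency (%)
(not preserved) | 1083 | 12.3%
(not preserved) | 1082 | 12.3%
(not preserved) | 1081 | 12.3%
(not preserved) | 942 | 10.7%
(not preserved) | 586 | 6.7%
(not preserved) | 481 | 5.5%
(not preserved) | 479 | 5.4%
(not preserved) | 446 | 5.1%
(not preserved) | 355 | 4.0%
(not preserved) | 321 | 3.6%
Other values (226) | 1941 | 22.1%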

Interactions

[Figure: interactions plot]

Correlations

Variable | 번호 | 법인구분 | 판매방식
번호 | 1.000 | 0.178 | 0.076
법인구분 | 0.178 | 1.000 | 0.223
판매방식 | 0.076 | 0.223 | 1.000

Variable | 판매방식 | 법인구분
판매방식 | 1.000 | 0.104
법인구분 | 0.104 | 1.000

Variable | 번호 | 법인구분 | 판매방식
번호 | 1.000 | 0.107 | 0.027
법인구분 | 0.107 | 1.000 | 0.104
판매방식 | 0.027 | 0.104 | 1.000
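The report text does not say which coefficient each matrix uses. For two categorical columns such as 법인구분 and 판매방식, one widely used association measure is Cramér's V. The sketch below is a generic, uncorrected version run on hypothetical counts; it is not a reproduction of the report's method or numbers.

```python
import math

def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 and n > 0 else 0.0

# Hypothetical 2x2 tables: perfect association, then no association.
perfect = cramers_v([[10, 0], [0, 10]])
independent = cramers_v([[5, 5], [5, 5]])
print(perfect, independent)
```

Values close to 0, like the 0.104 reported between 판매방식 and 법인구분, indicate a weak association under such a measure.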

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
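The report counts 0 missing cells, yet the literal string 미기재 ("not stated") appears as a value in several columns (for example, 4 times in 법인구분). Treating that string as a missing-value marker is an assumption about the data's conventions, not something the report states. A minimal sketch of counting such placeholders per column, with a hypothetical helper name and a few illustrative rows:

```python
PLACEHOLDER = "미기재"  # "not stated"; assumed here to mark a missing entry

def placeholder_counts(rows, columns):
    """Count, per column, the cells whose value equals the placeholder string."""
    counts = {col: 0 for col in columns}
    for row in rows:
        for col, value in zip(columns, row):
            if value.strip() == PLACEHOLDER:
                counts[col] += 1
    return counts

columns = ["법인구분", "판매방식", "취급품목"]
rows = [
    ["개인", "인터넷", "미기재"],
    ["개인", "미기재", "미기재"],
    ["법인", "인터넷 기타", "종합몰"],
]
result = placeholder_counts(rows, columns)
print(result)
```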

Sample

First rows
Index | 번호 | 관리번호 | 상호 | 대표자명 | 법인구분 | 판매방식 | 취급품목 | 도메인명
0 | 1 | 2024-부산북구-0413 | 아정당 이사 | 김민기 | 개인 | 인터넷 | 기타 | http://www.샘플.kr
1 | 2 | 2024-부산북구-0412 | 페이글로 | 정민수 | 개인 | 인터넷 | 종합몰 | payglo
2 | 3 | 2024-부산북구-0411 | 210별상점1 | 김지현 | 개인 | 인터넷 | 종합몰 | 스마트스토어
3 | 4 | 2024-부산북구-0410 | 네빈 | 최신희 | 개인 | 인터넷 | 종합몰 | 쿠팡 외 오픈마켓 전체
4 | 5 | 2024-부산북구-0409 | 겜방24 | 문성진 | 개인 | 인터넷 | 종합몰 | https://smartstore.naver.com/gameland24
5 | 6 | 2024-부산북구-0408 | 청이바리 | 김강주 | 개인 | 인터넷 | 종합몰 교육/도서/완구/오락 가전 컴퓨터/사무용품 가구/수납용품 의류/패션/잡화/뷰티 자동차/자동차용품 | 네이버스토어팜
6 | 7 | 2024-부산북구-0407 | 아이무드잇(I MOOD IT) | 김주영 | 개인 | 인터넷 | 의류/패션/잡화/뷰티 | https://smartstore.naver.com/imoodit
7 | 8 | 2024-부산북구-0406 | 신사몰 | 박정태 | 개인 | 인터넷 | 종합몰 가구/수납용품 가전 | https://buy.tosspayments.com/shops/mob_L1LjYruGUh
8 | 9 | 2024-부산북구-0405 | 이스타일 | 박충만 | 개인 | 인터넷 | 종합몰 | 스마트스토어
9 | 10 | 2024-부산북구-0404 | 강원피디 | 최대용 | 개인 | 인터넷 | 종합몰 | https://buy.tosspayments.com/shops/mob_CnspfZVcXK

Last rows
Index | 번호 | 관리번호 | 상호 | 대표자명 | 법인구분 | 판매방식 | 취급품목 | 도메인명
3285 | 3288 | 2007-부산북구-0027 | 씨큐리더 | 김상호 | 개인 | 인터넷 | 미기재 | scrd.kr
3286 | 3289 | 2007-00007 | 큐필드 | 정영문 | 개인 | 인터넷 | 종합몰 | epost.go.kr
3287 | 3290 | 2006-부산북구-0096 | 에이비이알 | 서홍걸 | 개인 | 인터넷 | 종합몰 | kodette.com
3288 | 3291 | 2006-00046 | 유신상사 | 박태헌 | 개인 | 인터넷 | 종합몰 | Yshopping.net
3289 | 3292 | 2005-00094 | ㈜한세계와미래 | 박성률 | 개인 | 인터넷 | 종합몰 | 옥션
3290 | 3293 | 2004-부산북구-0060 | 코코(CoCo) | 이유로 | 개인 | 인터넷 | 의류/패션/잡화/뷰티 | 오픈마켓
3291 | 3294 | 2004-부산북구-0044 | 비와치코리아 | 백성권 | 개인 | 미기재 | 미기재 | wachmall365.combrandwatch25.com
3292 | 3295 | 2004-00003 | ㈜서원유통 | 김병찬,이윤서 | 법인 | 인터넷 기타 | 종합몰 | etopmart.co.kr
3293 | 3296 | 2003-00066 | 정인식품 | 강소현 | 개인 | 인터넷 | 건강/식품 종합몰 | jeong-in.co.kr
3294 | 3297 | 2002-00009 | 친한친구 | 최현진 | 개인 | 인터넷 | 종합몰 | 87man.com
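The sample rows show at least two shapes for 관리번호: a district-coded form such as `2024-부산북구-0413` and an older numeric form such as `2007-00007`. The sketch below classifies identifiers with regular expressions; both patterns are inferred only from the rows shown here and are assumptions, as are the label names.

```python
import re

# Patterns inferred from the sample rows only (assumptions, not an official format).
DISTRICT_CODED = re.compile(r"\d{4}-부산북구-\d{4}")
NUMERIC_ONLY = re.compile(r"\d{4}-\d{5}")

def classify(reg_id):
    """Label an identifier by which inferred pattern it matches in full."""
    if DISTRICT_CODED.fullmatch(reg_id):
        return "district-coded"
    if NUMERIC_ONLY.fullmatch(reg_id):
        return "numeric-only"
    return "other"

labels = [classify(x) for x in ["2024-부산북구-0413", "2007-00007", "북구"]]
print(labels)
```

Values labelled "other" would be candidates for manual review before using 관리번호 as a key.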