Overview

Dataset statistics

Number of variables3
Number of observations4076
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory99.6 KiB
Average record size in memory25.0 B

Variable types

Numeric1
Text2

Dataset

Description부산광역시_수영구_통신판매업현황_20230324
Author부산광역시 수영구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3044036

Alerts

번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:18:11.763018
Analysis finished2023-12-10 17:18:14.184226
Duration2.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct4076
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2038.5
Minimum1
Maximum4076
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size36.0 KiB
2023-12-11T02:18:14.350798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile204.75
Q11019.75
median2038.5
Q33057.25
95-th percentile3872.25
Maximum4076
Range4075
Interquartile range (IQR)2037.5

Descriptive statistics

Standard deviation1176.7842
Coefficient of variation (CV)0.57727946
Kurtosis-1.2
Mean2038.5
Median Absolute Deviation (MAD)1019
Skewness0
Sum8308926
Variance1384821
MonotonicityStrictly increasing
2023-12-11T02:18:14.656040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
2724 1
 
< 0.1%
2711 1
 
< 0.1%
2712 1
 
< 0.1%
2713 1
 
< 0.1%
2714 1
 
< 0.1%
2715 1
 
< 0.1%
2716 1
 
< 0.1%
2717 1
 
< 0.1%
2718 1
 
< 0.1%
Other values (4066) 4066
99.8%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
4076 1
< 0.1%
4075 1
< 0.1%
4074 1
< 0.1%
4073 1
< 0.1%
4072 1
< 0.1%
4071 1
< 0.1%
4070 1
< 0.1%
4069 1
< 0.1%
4068 1
< 0.1%
4067 1
< 0.1%
Distinct4026
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size32.0 KiB
2023-12-11T02:18:15.238694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length34
Mean length6.7090285
Min length1

Characters and Unicode

Total characters27346
Distinct characters914
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3979 ?
Unique (%)97.6%

Sample

1st row감사해(海)
2nd row윤교D
3rd row마리나스
4th row제이랑
5th row자르댕
ValueCountFrequency (%)
주식회사 172
 
3.4%
컴퍼니 14
 
0.3%
스튜디오 14
 
0.3%
11
 
0.2%
10
 
0.2%
코리아 9
 
0.2%
디자인 8
 
0.2%
company 7
 
0.1%
co 7
 
0.1%
ltd 6
 
0.1%
Other values (4605) 4857
95.0%
2023-12-11T02:18:16.384027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1044
 
3.8%
963
 
3.5%
( 773
 
2.8%
) 773
 
2.8%
765
 
2.8%
466
 
1.7%
o 364
 
1.3%
e 329
 
1.2%
327
 
1.2%
317
 
1.2%
Other values (904) 21225
77.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19288
70.5%
Lowercase Letter 2881
 
10.5%
Uppercase Letter 2188
 
8.0%
Space Separator 1044
 
3.8%
Open Punctuation 774
 
2.8%
Close Punctuation 774
 
2.8%
Decimal Number 234
 
0.9%
Other Punctuation 120
 
0.4%
Dash Punctuation 26
 
0.1%
Connector Punctuation 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
963
 
5.0%
765
 
4.0%
466
 
2.4%
327
 
1.7%
317
 
1.6%
312
 
1.6%
295
 
1.5%
271
 
1.4%
268
 
1.4%
265
 
1.4%
Other values (826) 15039
78.0%
Lowercase Letter
ValueCountFrequency (%)
o 364
12.6%
e 329
11.4%
a 258
 
9.0%
i 207
 
7.2%
n 207
 
7.2%
r 181
 
6.3%
l 168
 
5.8%
t 163
 
5.7%
s 135
 
4.7%
m 124
 
4.3%
Other values (16) 745
25.9%
Uppercase Letter
ValueCountFrequency (%)
A 199
 
9.1%
O 151
 
6.9%
E 143
 
6.5%
S 142
 
6.5%
T 130
 
5.9%
I 130
 
5.9%
N 128
 
5.9%
M 124
 
5.7%
L 114
 
5.2%
D 109
 
5.0%
Other values (16) 818
37.4%
Decimal Number
ValueCountFrequency (%)
2 42
17.9%
1 40
17.1%
0 27
11.5%
5 21
9.0%
4 21
9.0%
3 20
8.5%
9 20
8.5%
7 17
7.3%
8 15
 
6.4%
6 11
 
4.7%
Other Punctuation
ValueCountFrequency (%)
. 61
50.8%
& 23
 
19.2%
' 12
 
10.0%
: 9
 
7.5%
? 7
 
5.8%
/ 5
 
4.2%
# 2
 
1.7%
% 1
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 773
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 773
99.9%
] 1
 
0.1%
Space Separator
ValueCountFrequency (%)
1044
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 26
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 12
100.0%
Other Symbol
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19284
70.5%
Latin 5069
 
18.5%
Common 2984
 
10.9%
Han 9
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
963
 
5.0%
765
 
4.0%
466
 
2.4%
327
 
1.7%
317
 
1.6%
312
 
1.6%
295
 
1.5%
271
 
1.4%
268
 
1.4%
265
 
1.4%
Other values (819) 15035
78.0%
Latin
ValueCountFrequency (%)
o 364
 
7.2%
e 329
 
6.5%
a 258
 
5.1%
i 207
 
4.1%
n 207
 
4.1%
A 199
 
3.9%
r 181
 
3.6%
l 168
 
3.3%
t 163
 
3.2%
O 151
 
3.0%
Other values (42) 2842
56.1%
Common
ValueCountFrequency (%)
1044
35.0%
( 773
25.9%
) 773
25.9%
. 61
 
2.0%
2 42
 
1.4%
1 40
 
1.3%
0 27
 
0.9%
- 26
 
0.9%
& 23
 
0.8%
5 21
 
0.7%
Other values (15) 154
 
5.2%
Han
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19279
70.5%
ASCII 8053
29.4%
CJK 8
 
< 0.1%
None 5
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1044
 
13.0%
( 773
 
9.6%
) 773
 
9.6%
o 364
 
4.5%
e 329
 
4.1%
a 258
 
3.2%
i 207
 
2.6%
n 207
 
2.6%
A 199
 
2.5%
r 181
 
2.2%
Other values (67) 3718
46.2%
Hangul
ValueCountFrequency (%)
963
 
5.0%
765
 
4.0%
466
 
2.4%
327
 
1.7%
317
 
1.6%
312
 
1.6%
295
 
1.5%
271
 
1.4%
268
 
1.4%
265
 
1.4%
Other values (818) 15030
78.0%
None
ValueCountFrequency (%)
5
100.0%
CJK
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct3811
Distinct (%)93.5%
Missing1
Missing (%)< 0.1%
Memory size32.0 KiB
2023-12-11T02:18:16.989244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length66
Median length51
Mean length36.371779
Min length3

Characters and Unicode

Total characters148215
Distinct characters405
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3601 ?
Unique (%)88.4%

Sample

1st row부산광역시 수영구 황령대로473번길 25, 목연 마이텔 703호 (남천동)
2nd row부산광역시 수영구 광안해변로 141, 2동 1106호 (남천동, 협진태양아파트)
3rd row부산광역시 수영구 광안해변로 100, 101동 1001호 (남천동, 비치아파트)
4th row부산광역시 수영구 수영로 389, 102동 2103호 (남천동, 더샵 남천프레스티지)
5th row부산광역시 수영구 남천동로108번길 20, 4층 다15호 (남천동)
ValueCountFrequency (%)
부산광역시 4074
 
14.7%
수영구 4074
 
14.7%
광안동 776
 
2.8%
광안동, 740
 
2.7%
남천동 502
 
1.8%
1층 480
 
1.7%
수영로 405
 
1.5%
망미동 376
 
1.4%
민락동 356
 
1.3%
남천동, 338
 
1.2%
Other values (2859) 15630
56.3%
2023-12-11T02:18:18.139840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23681
 
16.0%
7062
 
4.8%
6039
 
4.1%
1 5993
 
4.0%
5786
 
3.9%
5614
 
3.8%
5254
 
3.5%
0 4534
 
3.1%
4332
 
2.9%
4171
 
2.8%
Other values (395) 75749
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 82451
55.6%
Decimal Number 27420
 
18.5%
Space Separator 23681
 
16.0%
Other Punctuation 5265
 
3.6%
Open Punctuation 4065
 
2.7%
Close Punctuation 4065
 
2.7%
Dash Punctuation 635
 
0.4%
Uppercase Letter 470
 
0.3%
Lowercase Letter 130
 
0.1%
Math Symbol 17
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7062
 
8.6%
6039
 
7.3%
5786
 
7.0%
5614
 
6.8%
4332
 
5.3%
4171
 
5.1%
4170
 
5.1%
4160
 
5.0%
4154
 
5.0%
4081
 
4.9%
Other values (345) 32882
39.9%
Uppercase Letter
ValueCountFrequency (%)
S 53
11.3%
V 50
10.6%
K 46
9.8%
E 46
9.8%
B 46
9.8%
W 43
9.1%
I 43
9.1%
A 34
7.2%
C 32
6.8%
D 26
5.5%
Other values (10) 51
10.9%
Lowercase Letter
ValueCountFrequency (%)
e 111
85.4%
c 4
 
3.1%
b 3
 
2.3%
i 3
 
2.3%
w 2
 
1.5%
o 2
 
1.5%
l 1
 
0.8%
d 1
 
0.8%
n 1
 
0.8%
u 1
 
0.8%
Decimal Number
ValueCountFrequency (%)
1 5993
21.9%
0 4534
16.5%
2 3718
13.6%
3 2665
9.7%
4 2476
9.0%
5 2137
 
7.8%
6 2009
 
7.3%
7 1463
 
5.3%
8 1254
 
4.6%
9 1171
 
4.3%
Other Punctuation
ValueCountFrequency (%)
5254
99.8%
? 10
 
0.2%
& 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
23681
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4065
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4065
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 635
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%
Letter Number
ValueCountFrequency (%)
16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 82450
55.6%
Common 65148
44.0%
Latin 616
 
0.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7062
 
8.6%
6039
 
7.3%
5786
 
7.0%
5614
 
6.8%
4332
 
5.3%
4171
 
5.1%
4170
 
5.1%
4160
 
5.0%
4154
 
5.0%
4081
 
4.9%
Other values (344) 32881
39.9%
Latin
ValueCountFrequency (%)
e 111
18.0%
S 53
8.6%
V 50
8.1%
K 46
7.5%
E 46
7.5%
B 46
7.5%
W 43
 
7.0%
I 43
 
7.0%
A 34
 
5.5%
C 32
 
5.2%
Other values (22) 112
18.2%
Common
ValueCountFrequency (%)
23681
36.3%
1 5993
 
9.2%
5254
 
8.1%
0 4534
 
7.0%
( 4065
 
6.2%
) 4065
 
6.2%
2 3718
 
5.7%
3 2665
 
4.1%
4 2476
 
3.8%
5 2137
 
3.3%
Other values (8) 6560
 
10.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 82450
55.6%
ASCII 60494
40.8%
None 5254
 
3.5%
Number Forms 16
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
23681
39.1%
1 5993
 
9.9%
0 4534
 
7.5%
( 4065
 
6.7%
) 4065
 
6.7%
2 3718
 
6.1%
3 2665
 
4.4%
4 2476
 
4.1%
5 2137
 
3.5%
6 2009
 
3.3%
Other values (38) 5151
 
8.5%
Hangul
ValueCountFrequency (%)
7062
 
8.6%
6039
 
7.3%
5786
 
7.0%
5614
 
6.8%
4332
 
5.3%
4171
 
5.1%
4170
 
5.1%
4160
 
5.0%
4154
 
5.0%
4081
 
4.9%
Other values (344) 32881
39.9%
None
ValueCountFrequency (%)
5254
100.0%
Number Forms
ValueCountFrequency (%)
16
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-11T02:18:13.564313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T02:18:13.921050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:18:14.110557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호법인또는상호소재지주소
01감사해(海)부산광역시 수영구 황령대로473번길 25, 목연 마이텔 703호 (남천동)
12윤교D부산광역시 수영구 광안해변로 141, 2동 1106호 (남천동, 협진태양아파트)
23마리나스부산광역시 수영구 광안해변로 100, 101동 1001호 (남천동, 비치아파트)
34제이랑부산광역시 수영구 수영로 389, 102동 2103호 (남천동, 더샵 남천프레스티지)
45자르댕부산광역시 수영구 남천동로108번길 20, 4층 다15호 (남천동)
56플럼눅(plumnook)부산광역시 수영구 광남로223번길 31, 104동 902호 (민락동, 광안현대하이페리온)
67구쯔독부산광역시 수영구 수영로 776, 102동 1502호 (민락동, 부산 더샵 센텀포레)
78육인치누시부산광역시 수영구 남천동로9번길 10-10, 302호 (남천동, 명당모란빌라)
89아원일(AWONIL)부산광역시 수영구 광안해변로326번길 31, 401동 501호 (민락동, e편한세상 오션테라스 4단지)
910제이제이부산광역시 수영구 수영로636번길 56(광안동)
번호법인또는상호소재지주소
40664067주식회사 정암정보넷부산광역시 수영구 남천2동 148번지 4호 삼익비치아파트 107동 405호
40674068모터매니아부산광역시 수영구 남천바다로21번길 69-3 (광안동)
40684069인디고서원부산광역시 수영구 수영로408번길 28 (남천동)
40694070패브릭홈부산광역시 수영구 망미배산로70번길 45, 102동 203호 (망미동, 한신아파트)
40704071코리아보아부산광역시 수영구 구락로 23 (수영동)
40714072반쪽이서점부산광역시 수영구 광서로 55 (광안동,2층)
40724073엘리스부산광역시 수영구 광안동 197번지 19호 인타워빌딩 지하
40734074다에누리닷컴부산광역시 수영구 남천서로25번길 6 (남천동)
40744075라잇하우스부산광역시 수영구 민락동 399
40754076㈜메가마트 남천점부산광역시 수영구 황령대로 521 (남천동)