Overview

Dataset statistics

Number of variables11
Number of observations1847
Missing cells79
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory160.7 KiB
Average record size in memory89.1 B

Variable types

Numeric1
DateTime2
Text5
Categorical3

Dataset

Description경상남도 거제시 통신판매업 현황(법인 또는 상호명, 운영상태, 법인구분, 우편번호, 주소, 도메인명, 기준일자)등에 대한 정보를 제공합니다.
Author경상남도 거제시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15047551

Alerts

기준일자 has constant value ""Constant
영업상태명 is highly imbalanced (97.9%)Imbalance
판매방식명 is highly imbalanced (92.9%)Imbalance
소재지전체주소 has 40 (2.2%) missing valuesMissing
도로명전체주소 has 39 (2.1%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-11 01:01:59.477324
Analysis finished2023-12-11 01:02:01.030422
Duration1.55 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct1847
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean924
Minimum1
Maximum1847
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size16.4 KiB
2023-12-11T10:02:01.111385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile93.3
Q1462.5
median924
Q31385.5
95-th percentile1754.7
Maximum1847
Range1846
Interquartile range (IQR)923

Descriptive statistics

Standard deviation533.32729
Coefficient of variation (CV)0.57719404
Kurtosis-1.2
Mean924
Median Absolute Deviation (MAD)462
Skewness0
Sum1706628
Variance284438
MonotonicityStrictly increasing
2023-12-11T10:02:01.245382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1242 1
 
0.1%
1240 1
 
0.1%
1239 1
 
0.1%
1238 1
 
0.1%
1237 1
 
0.1%
1236 1
 
0.1%
1235 1
 
0.1%
1234 1
 
0.1%
1233 1
 
0.1%
Other values (1837) 1837
99.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1847 1
0.1%
1846 1
0.1%
1845 1
0.1%
1844 1
0.1%
1843 1
0.1%
1842 1
0.1%
1841 1
0.1%
1840 1
0.1%
1839 1
0.1%
1838 1
0.1%
Distinct1091
Distinct (%)59.1%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
Minimum2003-03-25 00:00:00
Maximum2021-07-30 00:00:00
2023-12-11T10:02:01.376447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T10:02:01.499506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1834
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
2023-12-11T10:02:01.802946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length28
Mean length6.5430428
Min length1

Characters and Unicode

Total characters12085
Distinct characters776
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1821 ?
Unique (%)98.6%

Sample

1st row냥이어멈
2nd row해광9호
3rd row리우
4th row제이씨마켓(JCmarket)
5th row
ValueCountFrequency (%)
주식회사 58
 
2.6%
거제 10
 
0.4%
농업회사법인 8
 
0.4%
펜션 7
 
0.3%
거제점 5
 
0.2%
company 4
 
0.2%
you 4
 
0.2%
the 4
 
0.2%
store 3
 
0.1%
핸드메이드 3
 
0.1%
Other values (2076) 2140
95.3%
2023-12-11T10:02:02.429239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
400
 
3.3%
351
 
2.9%
278
 
2.3%
) 254
 
2.1%
( 253
 
2.1%
236
 
2.0%
178
 
1.5%
168
 
1.4%
151
 
1.2%
145
 
1.2%
Other values (766) 9671
80.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9363
77.5%
Lowercase Letter 900
 
7.4%
Uppercase Letter 722
 
6.0%
Space Separator 400
 
3.3%
Close Punctuation 254
 
2.1%
Open Punctuation 253
 
2.1%
Decimal Number 124
 
1.0%
Other Punctuation 50
 
0.4%
Dash Punctuation 10
 
0.1%
Connector Punctuation 4
 
< 0.1%
Other values (2) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
351
 
3.7%
278
 
3.0%
236
 
2.5%
178
 
1.9%
168
 
1.8%
151
 
1.6%
145
 
1.5%
139
 
1.5%
129
 
1.4%
121
 
1.3%
Other values (692) 7467
79.8%
Uppercase Letter
ValueCountFrequency (%)
A 64
 
8.9%
S 61
 
8.4%
E 57
 
7.9%
O 50
 
6.9%
M 42
 
5.8%
N 41
 
5.7%
C 41
 
5.7%
B 39
 
5.4%
L 36
 
5.0%
I 32
 
4.4%
Other values (15) 259
35.9%
Lowercase Letter
ValueCountFrequency (%)
e 116
12.9%
o 82
 
9.1%
n 78
 
8.7%
a 77
 
8.6%
l 69
 
7.7%
i 66
 
7.3%
r 50
 
5.6%
t 48
 
5.3%
y 36
 
4.0%
s 35
 
3.9%
Other values (14) 243
27.0%
Decimal Number
ValueCountFrequency (%)
2 31
25.0%
1 21
16.9%
3 14
11.3%
0 12
 
9.7%
7 11
 
8.9%
9 11
 
8.9%
5 9
 
7.3%
6 7
 
5.6%
8 4
 
3.2%
4 4
 
3.2%
Other Punctuation
ValueCountFrequency (%)
, 17
34.0%
. 16
32.0%
& 12
24.0%
' 2
 
4.0%
· 1
 
2.0%
# 1
 
2.0%
/ 1
 
2.0%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
400
100.0%
Close Punctuation
ValueCountFrequency (%)
) 254
100.0%
Open Punctuation
ValueCountFrequency (%)
( 253
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9364
77.5%
Latin 1622
 
13.4%
Common 1097
 
9.1%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
351
 
3.7%
278
 
3.0%
236
 
2.5%
178
 
1.9%
168
 
1.8%
151
 
1.6%
145
 
1.5%
139
 
1.5%
129
 
1.4%
121
 
1.3%
Other values (691) 7468
79.8%
Latin
ValueCountFrequency (%)
e 116
 
7.2%
o 82
 
5.1%
n 78
 
4.8%
a 77
 
4.7%
l 69
 
4.3%
i 66
 
4.1%
A 64
 
3.9%
S 61
 
3.8%
E 57
 
3.5%
r 50
 
3.1%
Other values (39) 902
55.6%
Common
ValueCountFrequency (%)
400
36.5%
) 254
23.2%
( 253
23.1%
2 31
 
2.8%
1 21
 
1.9%
, 17
 
1.5%
. 16
 
1.5%
3 14
 
1.3%
& 12
 
1.1%
0 12
 
1.1%
Other values (14) 67
 
6.1%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9361
77.5%
ASCII 2718
 
22.5%
None 4
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
400
 
14.7%
) 254
 
9.3%
( 253
 
9.3%
e 116
 
4.3%
o 82
 
3.0%
n 78
 
2.9%
a 77
 
2.8%
l 69
 
2.5%
i 66
 
2.4%
A 64
 
2.4%
Other values (62) 1259
46.3%
Hangul
ValueCountFrequency (%)
351
 
3.7%
278
 
3.0%
236
 
2.5%
178
 
1.9%
168
 
1.8%
151
 
1.6%
145
 
1.5%
139
 
1.5%
129
 
1.4%
121
 
1.3%
Other values (690) 7465
79.7%
None
ValueCountFrequency (%)
3
75.0%
· 1
 
25.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

소재지전체주소
Text

MISSING 

Distinct927
Distinct (%)51.3%
Missing40
Missing (%)2.2%
Memory size14.6 KiB
2023-12-11T10:02:02.729270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length43
Mean length26.629773
Min length10

Characters and Unicode

Total characters48120
Distinct characters324
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique616 ?
Unique (%)34.1%

Sample

1st row경상남도 거제시 장목면 관포리 **번지 **호
2nd row경상남도 거제시 상동동 **번지 거제신현에스케이뷰 ***동 ***호
3rd row경상남도 거제시 장평동 **번지 **호
4th row경상남도 거제시 옥포동 ***번지 **호
5th row경상남도 거제시 아주동 ****-** ***호
ValueCountFrequency (%)
경상남도 1798
16.9%
거제시 1798
16.9%
1160
 
10.9%
번지 1100
 
10.4%
702
 
6.6%
327
 
3.1%
고현동 252
 
2.4%
옥포동 206
 
1.9%
아주동 182
 
1.7%
일운면 173
 
1.6%
Other values (417) 2919
27.5%
2023-12-11T10:02:03.167441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 9673
20.1%
8810
18.3%
2052
 
4.3%
2051
 
4.3%
2010
 
4.2%
1872
 
3.9%
1860
 
3.9%
1848
 
3.8%
1818
 
3.8%
1815
 
3.8%
Other values (314) 14311
29.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29014
60.3%
Other Punctuation 9683
 
20.1%
Space Separator 8810
 
18.3%
Dash Punctuation 411
 
0.9%
Uppercase Letter 111
 
0.2%
Lowercase Letter 86
 
0.2%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2052
 
7.1%
2051
 
7.1%
2010
 
6.9%
1872
 
6.5%
1860
 
6.4%
1848
 
6.4%
1818
 
6.3%
1815
 
6.3%
1266
 
4.4%
1222
 
4.2%
Other values (281) 11200
38.6%
Uppercase Letter
ValueCountFrequency (%)
C 23
20.7%
K 20
18.0%
A 16
14.4%
I 11
9.9%
R 10
9.0%
P 10
9.0%
B 7
 
6.3%
S 3
 
2.7%
H 2
 
1.8%
N 2
 
1.8%
Other values (5) 7
 
6.3%
Lowercase Letter
ValueCountFrequency (%)
e 63
73.3%
l 10
 
11.6%
i 3
 
3.5%
v 2
 
2.3%
y 2
 
2.3%
r 2
 
2.3%
s 2
 
2.3%
u 1
 
1.2%
a 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
* 9673
99.9%
, 7
 
0.1%
. 2
 
< 0.1%
· 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
8810
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 411
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29014
60.3%
Common 18908
39.3%
Latin 197
 
0.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2052
 
7.1%
2051
 
7.1%
2010
 
6.9%
1872
 
6.5%
1860
 
6.4%
1848
 
6.4%
1818
 
6.3%
1815
 
6.3%
1266
 
4.4%
1222
 
4.2%
Other values (281) 11200
38.6%
Latin
ValueCountFrequency (%)
e 63
32.0%
C 23
 
11.7%
K 20
 
10.2%
A 16
 
8.1%
I 11
 
5.6%
R 10
 
5.1%
P 10
 
5.1%
l 10
 
5.1%
B 7
 
3.6%
i 3
 
1.5%
Other values (14) 24
 
12.2%
Common
ValueCountFrequency (%)
* 9673
51.2%
8810
46.6%
- 411
 
2.2%
, 7
 
< 0.1%
. 2
 
< 0.1%
) 2
 
< 0.1%
( 2
 
< 0.1%
· 1
 
< 0.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29013
60.3%
ASCII 19104
39.7%
None 2
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 9673
50.6%
8810
46.1%
- 411
 
2.2%
e 63
 
0.3%
C 23
 
0.1%
K 20
 
0.1%
A 16
 
0.1%
I 11
 
0.1%
R 10
 
0.1%
P 10
 
0.1%
Other values (21) 57
 
0.3%
Hangul
ValueCountFrequency (%)
2052
 
7.1%
2051
 
7.1%
2010
 
6.9%
1872
 
6.5%
1860
 
6.4%
1848
 
6.4%
1818
 
6.3%
1815
 
6.3%
1266
 
4.4%
1222
 
4.2%
Other values (280) 11199
38.6%
None
ValueCountFrequency (%)
· 1
50.0%
1
50.0%
CJK
ValueCountFrequency (%)
1
100.0%

도로명전체주소
Text

MISSING 

Distinct1133
Distinct (%)62.7%
Missing39
Missing (%)2.1%
Memory size14.6 KiB
2023-12-11T10:02:03.411735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length47
Mean length32.611173
Min length15

Characters and Unicode

Total characters58961
Distinct characters351
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique849 ?
Unique (%)47.0%

Sample

1st row경상남도 거제시 장평*로*길 **-* (장평동)
2nd row경상남도 거제시 장목면 관포길 **-*
3rd row경상남도 거제시 상동*길 **, ***동 ***호 (상동동, 거제신현에스케이뷰)
4th row경상남도 거제시 장평*로*길 **-*, *층 ***호 (장평동)
5th row경상남도 거제시 옥포대첩로*길 **, ***호 (옥포동)
ValueCountFrequency (%)
경상남도 1808
14.9%
1807
14.8%
거제시 1806
14.8%
880
 
7.2%
587
 
4.8%
307
 
2.5%
고현동 256
 
2.1%
옥포동 203
 
1.7%
아주동 182
 
1.5%
일운면 181
 
1.5%
Other values (643) 4155
34.1%
2023-12-11T10:02:03.810231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 11249
19.1%
10368
17.6%
2374
 
4.0%
2361
 
4.0%
2340
 
4.0%
2158
 
3.7%
1924
 
3.3%
1847
 
3.1%
1837
 
3.1%
1833
 
3.1%
Other values (341) 20670
35.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32393
54.9%
Other Punctuation 12979
22.0%
Space Separator 10368
 
17.6%
Open Punctuation 1210
 
2.1%
Close Punctuation 1210
 
2.1%
Dash Punctuation 575
 
1.0%
Uppercase Letter 122
 
0.2%
Lowercase Letter 99
 
0.2%
Math Symbol 4
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2374
 
7.3%
2361
 
7.3%
2340
 
7.2%
2158
 
6.7%
1924
 
5.9%
1847
 
5.7%
1837
 
5.7%
1833
 
5.7%
1232
 
3.8%
949
 
2.9%
Other values (307) 13538
41.8%
Lowercase Letter
ValueCountFrequency (%)
e 69
69.7%
l 10
 
10.1%
i 3
 
3.0%
s 3
 
3.0%
c 3
 
3.0%
u 2
 
2.0%
v 2
 
2.0%
y 2
 
2.0%
r 2
 
2.0%
a 1
 
1.0%
Other values (2) 2
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
K 23
18.9%
C 23
18.9%
B 18
14.8%
A 18
14.8%
I 10
8.2%
P 10
8.2%
R 10
8.2%
S 3
 
2.5%
T 2
 
1.6%
V 2
 
1.6%
Other values (2) 3
 
2.5%
Other Punctuation
ValueCountFrequency (%)
* 11249
86.7%
, 1726
 
13.3%
. 3
 
< 0.1%
· 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
10368
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1210
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1210
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 575
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32393
54.9%
Common 26346
44.7%
Latin 221
 
0.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2374
 
7.3%
2361
 
7.3%
2340
 
7.2%
2158
 
6.7%
1924
 
5.9%
1847
 
5.7%
1837
 
5.7%
1833
 
5.7%
1232
 
3.8%
949
 
2.9%
Other values (307) 13538
41.8%
Latin
ValueCountFrequency (%)
e 69
31.2%
K 23
 
10.4%
C 23
 
10.4%
B 18
 
8.1%
A 18
 
8.1%
l 10
 
4.5%
I 10
 
4.5%
P 10
 
4.5%
R 10
 
4.5%
i 3
 
1.4%
Other values (14) 27
 
12.2%
Common
ValueCountFrequency (%)
* 11249
42.7%
10368
39.4%
, 1726
 
6.6%
( 1210
 
4.6%
) 1210
 
4.6%
- 575
 
2.2%
~ 4
 
< 0.1%
. 3
 
< 0.1%
· 1
 
< 0.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32392
54.9%
ASCII 26566
45.1%
None 2
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 11249
42.3%
10368
39.0%
, 1726
 
6.5%
( 1210
 
4.6%
) 1210
 
4.6%
- 575
 
2.2%
e 69
 
0.3%
K 23
 
0.1%
C 23
 
0.1%
B 18
 
0.1%
Other values (22) 95
 
0.4%
Hangul
ValueCountFrequency (%)
2374
 
7.3%
2361
 
7.3%
2340
 
7.2%
2158
 
6.7%
1924
 
5.9%
1847
 
5.7%
1837
 
5.7%
1833
 
5.7%
1232
 
3.8%
949
 
2.9%
Other values (306) 13537
41.8%
None
ValueCountFrequency (%)
· 1
50.0%
1
50.0%
CJK
ValueCountFrequency (%)
1
100.0%

업태구분명
Categorical

Distinct10
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
기타
480 
의류/패션/잡화/뷰티
419 
종합몰
392 
건강/식품
279 
레져/여행/공연
210 
Other values (5)
67 

Length

Max length11
Median length9
Mean length5.6540336
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의류/패션/잡화/뷰티
2nd row기타
3rd row의류/패션/잡화/뷰티
4th row의류/패션/잡화/뷰티
5th row의류/패션/잡화/뷰티

Common Values

ValueCountFrequency (%)
기타 480
26.0%
의류/패션/잡화/뷰티 419
22.7%
종합몰 392
21.2%
건강/식품 279
15.1%
레져/여행/공연 210
11.4%
교육/도서/완구/오락 35
 
1.9%
가구/수납용품 13
 
0.7%
컴퓨터/사무용품 10
 
0.5%
자동차/자동차용품 7
 
0.4%
가전 2
 
0.1%

Length

2023-12-11T10:02:03.976136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T10:02:04.094898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 480
26.0%
의류/패션/잡화/뷰티 419
22.7%
종합몰 392
21.2%
건강/식품 279
15.1%
레져/여행/공연 210
11.4%
교육/도서/완구/오락 35
 
1.9%
가구/수납용품 13
 
0.7%
컴퓨터/사무용품 10
 
0.5%
자동차/자동차용품 7
 
0.4%
가전 2
 
0.1%

영업상태명
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
영업/정상
1841 
휴업
 
5
<NA>
 
1

Length

Max length5
Median length5
Mean length4.9913373
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row휴업
2nd row휴업
3rd row휴업
4th row휴업
5th row휴업

Common Values

ValueCountFrequency (%)
영업/정상 1841
99.7%
휴업 5
 
0.3%
<NA> 1
 
0.1%

Length

2023-12-11T10:02:04.250931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T10:02:04.347683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업/정상 1841
99.7%
휴업 5
 
0.3%
na 1
 
0.1%

위도
Text

Distinct1299
Distinct (%)70.3%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
2023-12-11T10:02:04.614427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length8.870601
Min length6

Characters and Unicode

Total characters16384
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1090 ?
Unique (%)59.0%

Sample

1st row34.672914
2nd row35.493462
3rd row34.961393
4th row34.729614
5th row35.474795
ValueCountFrequency (%)
34.894728 21
 
1.1%
35.057288 16
 
0.9%
35.00839 16
 
0.9%
34.892695 15
 
0.8%
34.278695 15
 
0.8%
35.015073 14
 
0.8%
35.438745 13
 
0.7%
34.937927 12
 
0.6%
35.335208 12
 
0.6%
35.43055 10
 
0.5%
Other values (1289) 1703
92.2%
2023-12-11T10:02:05.101699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 3001
18.3%
4 2072
12.6%
5 2026
12.4%
. 1841
11.2%
8 1359
8.3%
9 1190
 
7.3%
6 1060
 
6.5%
7 1030
 
6.3%
2 990
 
6.0%
1 924
 
5.6%
Other values (8) 891
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 14501
88.5%
Other Punctuation 1853
 
11.3%
Uppercase Letter 30
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 3001
20.7%
4 2072
14.3%
5 2026
14.0%
8 1359
9.4%
9 1190
 
8.2%
6 1060
 
7.3%
7 1030
 
7.1%
2 990
 
6.8%
1 924
 
6.4%
0 849
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
V 6
20.0%
A 6
20.0%
L 6
20.0%
U 6
20.0%
E 6
20.0%
Other Punctuation
ValueCountFrequency (%)
. 1841
99.4%
# 6
 
0.3%
! 6
 
0.3%

Most occurring scripts

ValueCountFrequency (%)
Common 16354
99.8%
Latin 30
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
3 3001
18.4%
4 2072
12.7%
5 2026
12.4%
. 1841
11.3%
8 1359
8.3%
9 1190
 
7.3%
6 1060
 
6.5%
7 1030
 
6.3%
2 990
 
6.1%
1 924
 
5.6%
Other values (3) 861
 
5.3%
Latin
ValueCountFrequency (%)
V 6
20.0%
A 6
20.0%
L 6
20.0%
U 6
20.0%
E 6
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16384
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 3001
18.3%
4 2072
12.6%
5 2026
12.4%
. 1841
11.2%
8 1359
8.3%
9 1190
 
7.3%
6 1060
 
6.5%
7 1030
 
6.3%
2 990
 
6.0%
1 924
 
5.6%
Other values (8) 891
 
5.4%

경도
Text

Distinct1300
Distinct (%)70.4%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
2023-12-11T10:02:05.465147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.8906335
Min length7

Characters and Unicode

Total characters18268
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1096 ?
Unique (%)59.3%

Sample

1st row156.068569
2nd row167.022713
3rd row152.742003
4th row155.758118
5th row156.474929
ValueCountFrequency (%)
153.395303 21
 
1.1%
152.365609 16
 
0.9%
154.028876 16
 
0.9%
156.485749 16
 
0.9%
152.817701 15
 
0.8%
156.073565 14
 
0.8%
153.826305 13
 
0.7%
152.396095 13
 
0.7%
153.48767 12
 
0.6%
155.972661 10
 
0.5%
Other values (1290) 1701
92.1%
2023-12-11T10:02:05.914583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 2921
16.0%
5 2786
15.3%
. 1838
10.1%
6 1772
9.7%
4 1522
8.3%
3 1433
7.8%
2 1282
7.0%
8 1275
7.0%
7 1240
6.8%
9 1194
6.5%
Other values (8) 1005
 
5.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 16367
89.6%
Other Punctuation 1856
 
10.2%
Uppercase Letter 45
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2921
17.8%
5 2786
17.0%
6 1772
10.8%
4 1522
9.3%
3 1433
8.8%
2 1282
7.8%
8 1275
7.8%
7 1240
7.6%
9 1194
7.3%
0 942
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
V 9
20.0%
A 9
20.0%
L 9
20.0%
U 9
20.0%
E 9
20.0%
Other Punctuation
ValueCountFrequency (%)
. 1838
99.0%
# 9
 
0.5%
! 9
 
0.5%

Most occurring scripts

ValueCountFrequency (%)
Common 18223
99.8%
Latin 45
 
0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
1 2921
16.0%
5 2786
15.3%
. 1838
10.1%
6 1772
9.7%
4 1522
8.4%
3 1433
7.9%
2 1282
7.0%
8 1275
7.0%
7 1240
6.8%
9 1194
6.6%
Other values (3) 960
 
5.3%
Latin
ValueCountFrequency (%)
V 9
20.0%
A 9
20.0%
L 9
20.0%
U 9
20.0%
E 9
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18268
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 2921
16.0%
5 2786
15.3%
. 1838
10.1%
6 1772
9.7%
4 1522
8.3%
3 1433
7.8%
2 1282
7.0%
8 1275
7.0%
7 1240
6.8%
9 1194
6.5%
Other values (8) 1005
 
5.5%

판매방식명
Categorical

IMBALANCE 

Distinct15
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
인터넷
1793 
인터넷, 기타
 
23
TV홈쇼핑, 인터넷
 
6
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타
 
5
기타
 
4
Other values (10)
 
16

Length

Max length26
Median length3
Mean length3.1867894
Min length2

Unique

Unique6 ?
Unique (%)0.3%

Sample

1st row인터넷
2nd row인터넷
3rd row인터넷
4th row인터넷
5th row인터넷

Common Values

ValueCountFrequency (%)
인터넷 1793
97.1%
인터넷, 기타 23
 
1.2%
TV홈쇼핑, 인터넷 6
 
0.3%
TV홈쇼핑, 인터넷, 카다로그, 신문잡지, 기타 5
 
0.3%
기타 4
 
0.2%
<NA> 4
 
0.2%
인터넷, 카다로그 2
 
0.1%
TV홈쇼핑 2
 
0.1%
기타, 인터넷 2
 
0.1%
인터넷, 카다로그, 기타 1
 
0.1%
Other values (5) 5
 
0.3%

Length

2023-12-11T10:02:06.072634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
인터넷 1836
96.0%
기타 36
 
1.9%
tv홈쇼핑 16
 
0.8%
카다로그 12
 
0.6%
신문잡지 8
 
0.4%
na 4
 
0.2%

기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
Minimum2021-09-01 00:00:00
Maximum2021-09-01 00:00:00
2023-12-11T10:02:06.179191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T10:02:06.271609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-11T10:02:00.510450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T10:02:06.355483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업태구분명영업상태명판매방식명
번호1.0000.3780.1830.076
업태구분명0.3781.0000.0290.155
영업상태명0.1830.0291.0000.000
판매방식명0.0760.1550.0001.000
2023-12-11T10:02:06.481608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
판매방식명업태구분명영업상태명
판매방식명1.0000.0630.000
업태구분명0.0631.0000.022
영업상태명0.0000.0221.000
2023-12-11T10:02:06.608288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호업태구분명영업상태명판매방식명
번호1.0000.1240.1400.030
업태구분명0.1241.0000.0220.063
영업상태명0.1400.0221.0000.000
판매방식명0.0300.0630.0001.000

Missing values

2023-12-11T10:02:00.667762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T10:02:00.836033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T10:02:00.967971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호인허가일자사업장명소재지전체주소도로명전체주소업태구분명영업상태명위도경도판매방식명기준일자
012013-12-24냥이어멈<NA>경상남도 거제시 장평*로*길 **-* (장평동)의류/패션/잡화/뷰티휴업34.672914156.068569인터넷2021-09-01
122019-01-16해광9호경상남도 거제시 장목면 관포리 **번지 **호경상남도 거제시 장목면 관포길 **-*기타휴업35.493462167.022713인터넷2021-09-01
232020-03-25리우경상남도 거제시 상동동 **번지 거제신현에스케이뷰 ***동 ***호경상남도 거제시 상동*길 **, ***동 ***호 (상동동, 거제신현에스케이뷰)의류/패션/잡화/뷰티휴업34.961393152.742003인터넷2021-09-01
342018-09-06제이씨마켓(JCmarket)경상남도 거제시 장평동 **번지 **호경상남도 거제시 장평*로*길 **-*, *층 ***호 (장평동)의류/패션/잡화/뷰티휴업34.729614155.758118인터넷2021-09-01
452016-10-14경상남도 거제시 옥포동 ***번지 **호경상남도 거제시 옥포대첩로*길 **, ***호 (옥포동)의류/패션/잡화/뷰티휴업35.474795156.474929인터넷2021-09-01
562021-03-10포유(For you)경상남도 거제시 아주동 ****-** ***호경상남도 거제시 아주*로*길 **, ***호 (아주동)종합몰<NA>35.437045153.21922인터넷2021-09-01
672021-01-04에프에프엘 스토어(FFL Store)경상남도 거제시 능포동 ***-**경상남도 거제시 옥수로*길 ** (능포동)종합몰영업/정상35.822993154.632878인터넷2021-09-01
782021-01-04올니스경상남도 거제시 옥포동 ***-* 영진자이온 ***동 ***호경상남도 거제시 옥포대첩로 ***, ***동 ***호 (옥포동, 영진자이온)종합몰영업/정상35.52911156.487898인터넷2021-09-01
892021-01-04에르인(Er-in)경상남도 거제시 상동동 ** 거제신현에스케이뷰 ***동 ****호경상남도 거제시 상동*길 **, ***동 ****호 (상동동, 거제신현에스케이뷰)의류/패션/잡화/뷰티영업/정상34.961393152.742003인터넷2021-09-01
9102021-01-04몽돌수산경상남도 거제시 장목면 송진포리 **-*경상남도 거제시 장목면 거제북로 ****, *층건강/식품영업/정상35.518413168.646134인터넷2021-09-01
번호인허가일자사업장명소재지전체주소도로명전체주소업태구분명영업상태명위도경도판매방식명기준일자
183718382019-02-01린스토어(linstore)경상남도 거제시 일운면 지세포리 ****번지 거제 코아루 파크드림경상남도 거제시 일운면 지세포*길 **, ***동 **층 *호 (거제 코아루 파크드림)기타영업/정상34.825849#VALUE!인터넷2021-09-01
183818392019-01-08생각속의집3경상남도 거제시 장목면 시방리 ***번지 **호경상남도 거제시 장목면 시방*길 **-*기타영업/정상35.623568164.214217인터넷2021-09-01
183918402019-01-09푸른아이웨어경상남도 거제시 고현동 ***번지 **호 남영상가경상남도 거제시 중곡로*길 * (고현동)의류/패션/잡화/뷰티영업/정상34.889683156.498078인터넷2021-09-01
184018412019-01-09타구대경상남도 거제시 옥포동 ***번지 **호 e편한세상 옥포아파트 *단지경상남도 거제시 성산로 **, ***동 **층 ****호 (옥포동, e편한세상 옥포아파트 *단지)종합몰영업/정상35.491837156.917497인터넷2021-09-01
184118422019-01-14민서현 디저트(dessert)경상남도 거제시 고현동 ***번지 *호경상남도 거제시 거제중앙로*길 *, *층 ***호 (고현동)기타영업/정상34.862331154.76126인터넷2021-09-01
184218432019-02-21퓌쉬케경상남도 거제시 사등면 사등리 ****번지 *호경상남도 거제시 사등면 대리*길 **-**종합몰영업/정상34.113612157.312781인터넷2021-09-01
184318442019-02-22소풍경상남도 거제시 둔덕면 산방리 ***번지 *호경상남도 거제시 둔덕면 산방*길 **, *층기타영업/정상33.883729151.899709인터넷2021-09-01
184418452019-02-12무늬마켓경상남도 거제시 장평동 **번지 대한*차아파트경상남도 거제시 장평로*길 **, ***동 ****호 (장평동, 대한*차아파트)의류/패션/잡화/뷰티영업/정상34.74456155.994633인터넷2021-09-01
184518462019-02-14블라썸데이경상남도 거제시 고현동 ****번지 거제양정 대동피렌체아파트경상남도 거제시 중곡로*길 **, ***동 ***호 (고현동, 거제양정 대동피렌체아파트)종합몰영업/정상34.930848156.174727인터넷2021-09-01
184618472019-01-10알로하잡화점경상남도 거제시 장목면 대금리 ***번지 *호경상남도 거제시 장목면 율천대금길 **-**, 회색벽돌집기타영업/정상35.509724164.443787인터넷2021-09-01