Overview

Dataset statistics

Number of variables9
Number of observations1842
Missing cells7424
Missing cells (%)44.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory136.8 KiB
Average record size in memory76.1 B

Variable types

Numeric1
Text5
Unsupported3

Dataset

Description울산광역시 구군별(남구, 중구) 담배소매업에 대한 정보(업소명, 대표자명, 도로명주소, 등)를 제공하고 있습니다.
Author울산광역시
URLhttps://www.data.go.kr/data/15091252/fileData.do

Alerts

대표자명 has 634 (34.4%) missing valuesMissing
지정번 has 632 (34.3%) missing valuesMissing
관리번 has 632 (34.3%) missing valuesMissing
Unnamed: 6 has 1842 (100.0%) missing valuesMissing
Unnamed: 7 has 1842 (100.0%) missing valuesMissing
Unnamed: 8 has 1842 (100.0%) missing valuesMissing
연번 has unique valuesUnique
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 11:26:01.035581
Analysis finished2023-12-12 11:26:02.883765
Duration1.85 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1842
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean921.5
Minimum1
Maximum1842
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size16.3 KiB
2023-12-12T20:26:03.021480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile93.05
Q1461.25
median921.5
Q31381.75
95-th percentile1749.95
Maximum1842
Range1841
Interquartile range (IQR)920.5

Descriptive statistics

Standard deviation531.88392
Coefficient of variation (CV)0.57719361
Kurtosis-1.2
Mean921.5
Median Absolute Deviation (MAD)460.5
Skewness0
Sum1697403
Variance282900.5
MonotonicityStrictly increasing
2023-12-12T20:26:03.290399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1239 1
 
0.1%
1237 1
 
0.1%
1236 1
 
0.1%
1235 1
 
0.1%
1234 1
 
0.1%
1233 1
 
0.1%
1232 1
 
0.1%
1231 1
 
0.1%
1230 1
 
0.1%
Other values (1832) 1832
99.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1842 1
0.1%
1841 1
0.1%
1840 1
0.1%
1839 1
0.1%
1838 1
0.1%
1837 1
0.1%
1836 1
0.1%
1835 1
0.1%
1834 1
0.1%
1833 1
0.1%
Distinct1693
Distinct (%)91.9%
Missing0
Missing (%)0.0%
Memory size14.5 KiB
2023-12-12T20:26:03.834282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length19
Mean length7.2969598
Min length1

Characters and Unicode

Total characters13441
Distinct characters579
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1613 ?
Unique (%)87.6%

Sample

1st row대현분식
2nd row더진국
3rd row예스마트
4th row신흥
5th row공원복권방
ValueCountFrequency (%)
씨유 87
 
3.7%
이마트24 52
 
2.2%
세븐일레븐 46
 
2.0%
지에스(gs)25 34
 
1.4%
gs25 30
 
1.3%
미니스톱 22
 
0.9%
주)코리아세븐 15
 
0.6%
주식회사 11
 
0.5%
지에스25 11
 
0.5%
없음 11
 
0.5%
Other values (1775) 2030
86.4%
2023-12-12T20:26:04.628374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
689
 
5.1%
539
 
4.0%
497
 
3.7%
485
 
3.6%
467
 
3.5%
394
 
2.9%
273
 
2.0%
2 250
 
1.9%
230
 
1.7%
196
 
1.5%
Other values (569) 9421
70.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11496
85.5%
Decimal Number 549
 
4.1%
Space Separator 539
 
4.0%
Uppercase Letter 450
 
3.3%
Open Punctuation 169
 
1.3%
Close Punctuation 169
 
1.3%
Lowercase Letter 43
 
0.3%
Other Punctuation 17
 
0.1%
Dash Punctuation 5
 
< 0.1%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
689
 
6.0%
497
 
4.3%
485
 
4.2%
467
 
4.1%
394
 
3.4%
273
 
2.4%
230
 
2.0%
196
 
1.7%
190
 
1.7%
187
 
1.6%
Other values (510) 7888
68.6%
Uppercase Letter
ValueCountFrequency (%)
S 146
32.4%
G 135
30.0%
K 19
 
4.2%
E 17
 
3.8%
C 15
 
3.3%
D 13
 
2.9%
R 13
 
2.9%
T 12
 
2.7%
L 10
 
2.2%
N 9
 
2.0%
Other values (13) 61
13.6%
Lowercase Letter
ValueCountFrequency (%)
e 8
18.6%
u 4
9.3%
a 4
9.3%
c 3
 
7.0%
p 3
 
7.0%
i 3
 
7.0%
n 3
 
7.0%
y 2
 
4.7%
s 2
 
4.7%
o 2
 
4.7%
Other values (8) 9
20.9%
Decimal Number
ValueCountFrequency (%)
2 250
45.5%
5 161
29.3%
4 93
 
16.9%
1 13
 
2.4%
3 8
 
1.5%
7 7
 
1.3%
0 7
 
1.3%
8 6
 
1.1%
6 4
 
0.7%
Other Punctuation
ValueCountFrequency (%)
. 8
47.1%
& 7
41.2%
# 2
 
11.8%
Space Separator
ValueCountFrequency (%)
539
100.0%
Open Punctuation
ValueCountFrequency (%)
( 169
100.0%
Close Punctuation
ValueCountFrequency (%)
) 169
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11499
85.6%
Common 1449
 
10.8%
Latin 493
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
689
 
6.0%
497
 
4.3%
485
 
4.2%
467
 
4.1%
394
 
3.4%
273
 
2.4%
230
 
2.0%
196
 
1.7%
190
 
1.7%
187
 
1.6%
Other values (511) 7891
68.6%
Latin
ValueCountFrequency (%)
S 146
29.6%
G 135
27.4%
K 19
 
3.9%
E 17
 
3.4%
C 15
 
3.0%
D 13
 
2.6%
R 13
 
2.6%
T 12
 
2.4%
L 10
 
2.0%
N 9
 
1.8%
Other values (31) 104
21.1%
Common
ValueCountFrequency (%)
539
37.2%
2 250
17.3%
( 169
 
11.7%
) 169
 
11.7%
5 161
 
11.1%
4 93
 
6.4%
1 13
 
0.9%
3 8
 
0.6%
. 8
 
0.6%
7 7
 
0.5%
Other values (7) 32
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11496
85.5%
ASCII 1942
 
14.4%
None 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
689
 
6.0%
497
 
4.3%
485
 
4.2%
467
 
4.1%
394
 
3.4%
273
 
2.4%
230
 
2.0%
196
 
1.7%
190
 
1.7%
187
 
1.6%
Other values (510) 7888
68.6%
ASCII
ValueCountFrequency (%)
539
27.8%
2 250
12.9%
( 169
 
8.7%
) 169
 
8.7%
5 161
 
8.3%
S 146
 
7.5%
G 135
 
7.0%
4 93
 
4.8%
K 19
 
1.0%
E 17
 
0.9%
Other values (48) 244
12.6%
None
ValueCountFrequency (%)
3
100.0%

대표자명
Text

MISSING 

Distinct1126
Distinct (%)93.2%
Missing634
Missing (%)34.4%
Memory size14.5 KiB
2023-12-12T20:26:05.183319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length3
Mean length3.5273179
Min length2

Characters and Unicode

Total characters4261
Distinct characters296
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1068 ?
Unique (%)88.4%

Sample

1st row최경심
2nd row이유미
3rd row박수임
4th row황정숙
5th row정숙자
ValueCountFrequency (%)
주)코리아세븐 16
 
1.2%
주식회사 15
 
1.2%
10
 
0.8%
정승인 8
 
0.6%
김미영 6
 
0.5%
1 6
 
0.5%
주)아이비푸드 5
 
0.4%
반용호 5
 
0.4%
코리아세븐 4
 
0.3%
이정숙 3
 
0.2%
Other values (1140) 1203
93.9%
2023-12-12T20:26:06.014751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
256
 
6.0%
190
 
4.5%
170
 
4.0%
130
 
3.1%
122
 
2.9%
90
 
2.1%
89
 
2.1%
88
 
2.1%
80
 
1.9%
77
 
1.8%
Other values (286) 2969
69.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4052
95.1%
Space Separator 73
 
1.7%
Open Punctuation 49
 
1.1%
Close Punctuation 49
 
1.1%
Decimal Number 18
 
0.4%
Uppercase Letter 18
 
0.4%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
256
 
6.3%
190
 
4.7%
170
 
4.2%
130
 
3.2%
122
 
3.0%
90
 
2.2%
89
 
2.2%
88
 
2.2%
80
 
2.0%
77
 
1.9%
Other values (268) 2760
68.1%
Uppercase Letter
ValueCountFrequency (%)
N 3
16.7%
I 3
16.7%
E 2
11.1%
G 2
11.1%
D 2
11.1%
K 1
 
5.6%
S 1
 
5.6%
H 1
 
5.6%
X 1
 
5.6%
Z 1
 
5.6%
Decimal Number
ValueCountFrequency (%)
1 11
61.1%
2 4
 
22.2%
4 3
 
16.7%
Space Separator
ValueCountFrequency (%)
73
100.0%
Open Punctuation
ValueCountFrequency (%)
( 49
100.0%
Close Punctuation
ValueCountFrequency (%)
) 49
100.0%
Other Punctuation
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4052
95.1%
Common 191
 
4.5%
Latin 18
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
256
 
6.3%
190
 
4.7%
170
 
4.2%
130
 
3.2%
122
 
3.0%
90
 
2.2%
89
 
2.2%
88
 
2.2%
80
 
2.0%
77
 
1.9%
Other values (268) 2760
68.1%
Latin
ValueCountFrequency (%)
N 3
16.7%
I 3
16.7%
E 2
11.1%
G 2
11.1%
D 2
11.1%
K 1
 
5.6%
S 1
 
5.6%
H 1
 
5.6%
X 1
 
5.6%
Z 1
 
5.6%
Common
ValueCountFrequency (%)
73
38.2%
( 49
25.7%
) 49
25.7%
1 11
 
5.8%
2 4
 
2.1%
4 3
 
1.6%
2
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4052
95.1%
ASCII 207
 
4.9%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
256
 
6.3%
190
 
4.7%
170
 
4.2%
130
 
3.2%
122
 
3.0%
90
 
2.2%
89
 
2.2%
88
 
2.2%
80
 
2.0%
77
 
1.9%
Other values (268) 2760
68.1%
ASCII
ValueCountFrequency (%)
73
35.3%
( 49
23.7%
) 49
23.7%
1 11
 
5.3%
2 4
 
1.9%
4 3
 
1.4%
N 3
 
1.4%
I 3
 
1.4%
E 2
 
1.0%
G 2
 
1.0%
Other values (7) 8
 
3.9%
None
ValueCountFrequency (%)
2
100.0%
Distinct1488
Distinct (%)80.8%
Missing0
Missing (%)0.0%
Memory size14.5 KiB
2023-12-12T20:26:06.630183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length52
Mean length19.771987
Min length1

Characters and Unicode

Total characters36420
Distinct characters302
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1480 ?
Unique (%)80.3%

Sample

1st row울산광역시 남구 왕생로86번길 30. 현대홈타운스위트 상가동 102호 (달동)
2nd row울산광역시 남구 두왕로34번길 32 (선암동)
3rd row울산광역시 남구 정광로 17. 1호 (무거동)
4th row울산광역시 남구 중앙로204번길 32 (신정동)
5th row울산광역시 남구 대학로 60. 에이동 1층 (무거동)
ValueCountFrequency (%)
울산광역시 1494
19.8%
남구 863
 
11.4%
중구 631
 
8.4%
1층 258
 
3.4%
신정동 229
 
3.0%
달동 140
 
1.9%
삼산동 131
 
1.7%
무거동 119
 
1.6%
야음동 95
 
1.3%
반구동 88
 
1.2%
Other values (1611) 3496
46.3%
2023-12-12T20:26:07.495264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6677
18.3%
1873
 
5.1%
1 1685
 
4.6%
1618
 
4.4%
1599
 
4.4%
1518
 
4.2%
1502
 
4.1%
1497
 
4.1%
1495
 
4.1%
982
 
2.7%
Other values (292) 15974
43.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20278
55.7%
Space Separator 6677
 
18.3%
Decimal Number 6463
 
17.7%
Open Punctuation 877
 
2.4%
Close Punctuation 877
 
2.4%
Dash Punctuation 687
 
1.9%
Other Punctuation 490
 
1.3%
Uppercase Letter 63
 
0.2%
Lowercase Letter 4
 
< 0.1%
Math Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1873
 
9.2%
1618
 
8.0%
1599
 
7.9%
1518
 
7.5%
1502
 
7.4%
1497
 
7.4%
1495
 
7.4%
982
 
4.8%
869
 
4.3%
739
 
3.6%
Other values (259) 6586
32.5%
Uppercase Letter
ValueCountFrequency (%)
B 23
36.5%
N 12
19.0%
A 7
 
11.1%
K 6
 
9.5%
S 5
 
7.9%
C 3
 
4.8%
P 2
 
3.2%
I 1
 
1.6%
T 1
 
1.6%
H 1
 
1.6%
Other values (2) 2
 
3.2%
Decimal Number
ValueCountFrequency (%)
1 1685
26.1%
2 854
13.2%
3 641
 
9.9%
4 591
 
9.1%
0 527
 
8.2%
5 503
 
7.8%
8 448
 
6.9%
6 431
 
6.7%
7 428
 
6.6%
9 355
 
5.5%
Lowercase Letter
ValueCountFrequency (%)
w 1
25.0%
e 1
25.0%
k 1
25.0%
s 1
25.0%
Space Separator
ValueCountFrequency (%)
6677
100.0%
Open Punctuation
ValueCountFrequency (%)
( 877
100.0%
Close Punctuation
ValueCountFrequency (%)
) 877
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 687
100.0%
Other Punctuation
ValueCountFrequency (%)
. 490
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20278
55.7%
Common 16074
44.1%
Latin 68
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1873
 
9.2%
1618
 
8.0%
1599
 
7.9%
1518
 
7.5%
1502
 
7.4%
1497
 
7.4%
1495
 
7.4%
982
 
4.8%
869
 
4.3%
739
 
3.6%
Other values (259) 6586
32.5%
Latin
ValueCountFrequency (%)
B 23
33.8%
N 12
17.6%
A 7
 
10.3%
K 6
 
8.8%
S 5
 
7.4%
C 3
 
4.4%
P 2
 
2.9%
w 1
 
1.5%
I 1
 
1.5%
T 1
 
1.5%
Other values (7) 7
 
10.3%
Common
ValueCountFrequency (%)
6677
41.5%
1 1685
 
10.5%
( 877
 
5.5%
) 877
 
5.5%
2 854
 
5.3%
- 687
 
4.3%
3 641
 
4.0%
4 591
 
3.7%
0 527
 
3.3%
5 503
 
3.1%
Other values (6) 2155
 
13.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20278
55.7%
ASCII 16141
44.3%
Number Forms 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6677
41.4%
1 1685
 
10.4%
( 877
 
5.4%
) 877
 
5.4%
2 854
 
5.3%
- 687
 
4.3%
3 641
 
4.0%
4 591
 
3.7%
0 527
 
3.3%
5 503
 
3.1%
Other values (22) 2222
 
13.8%
Hangul
ValueCountFrequency (%)
1873
 
9.2%
1618
 
8.0%
1599
 
7.9%
1518
 
7.5%
1502
 
7.4%
1497
 
7.4%
1495
 
7.4%
982
 
4.8%
869
 
4.3%
739
 
3.6%
Other values (259) 6586
32.5%
Number Forms
ValueCountFrequency (%)
1
100.0%

지정번
Text

MISSING 

Distinct1210
Distinct (%)100.0%
Missing632
Missing (%)34.3%
Memory size14.5 KiB
2023-12-12T20:26:07.869570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length23
Mean length23
Min length23

Characters and Unicode

Total characters27830
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1210 ?
Unique (%)100.0%

Sample

1st row2021-3700160-05-6-00015
2nd row2021-3700160-05-6-00014
3rd row2021-3700160-05-6-00013
4th row2021-3700160-05-6-00012
5th row2021-3700160-05-6-00011
ValueCountFrequency (%)
2020-3700160-05-6-00022 1
 
0.1%
2007-3700053-05-6-00172 1
 
0.1%
2007-3700053-05-6-00183 1
 
0.1%
2007-3700053-05-6-00185 1
 
0.1%
2008-3700053-05-6-00001 1
 
0.1%
2008-3700053-05-6-00006 1
 
0.1%
2008-3700053-05-6-00012 1
 
0.1%
2008-3700053-05-6-00036 1
 
0.1%
2008-3700053-05-6-00053 1
 
0.1%
2008-3700053-05-6-00065 1
 
0.1%
Other values (1200) 1200
99.2%
2023-12-12T20:26:08.461980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10051
36.1%
- 4840
17.4%
1 2076
 
7.5%
6 1796
 
6.5%
7 1703
 
6.1%
3 1665
 
6.0%
5 1651
 
5.9%
2 1589
 
5.7%
9 1124
 
4.0%
4 802
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 22990
82.6%
Dash Punctuation 4840
 
17.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10051
43.7%
1 2076
 
9.0%
6 1796
 
7.8%
7 1703
 
7.4%
3 1665
 
7.2%
5 1651
 
7.2%
2 1589
 
6.9%
9 1124
 
4.9%
4 802
 
3.5%
8 533
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 4840
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 27830
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10051
36.1%
- 4840
17.4%
1 2076
 
7.5%
6 1796
 
6.5%
7 1703
 
6.1%
3 1665
 
6.0%
5 1651
 
5.9%
2 1589
 
5.7%
9 1124
 
4.0%
4 802
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 27830
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10051
36.1%
- 4840
17.4%
1 2076
 
7.5%
6 1796
 
6.5%
7 1703
 
6.1%
3 1665
 
6.0%
5 1651
 
5.9%
2 1589
 
5.7%
9 1124
 
4.0%
4 802
 
2.9%

관리번
Text

MISSING 

Distinct617
Distinct (%)51.0%
Missing632
Missing (%)34.3%
Memory size14.5 KiB
2023-12-12T20:26:08.847708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length7.6289256
Min length1

Characters and Unicode

Total characters9231
Distinct characters17
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique614 ?
Unique (%)50.7%

Sample

1st row2021-울산남구-0015
2nd row2021-울산남구-0014
3rd row2021-울산남구-0013
4th row2021-울산남구-0012
5th row2021-울산남구-0011
ValueCountFrequency (%)
2020-울산남구-0034 2
 
0.3%
2020-울산남구-0035 2
 
0.3%
2017-울산남구-0052 1
 
0.2%
2017-울산남구-0033 1
 
0.2%
2017-울산남구-0062 1
 
0.2%
2017-울산남구-0019 1
 
0.2%
2017-울산남구-0032 1
 
0.2%
2017-울산남구-0029 1
 
0.2%
2017-울산남구-0028 1
 
0.2%
2017-울산남구-0027 1
 
0.2%
Other values (606) 606
98.1%
2023-12-12T20:26:09.382818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1955
21.2%
- 1234
13.4%
2 903
9.8%
1 836
9.1%
616
 
6.7%
616
 
6.7%
616
 
6.7%
616
 
6.7%
592
 
6.4%
8 227
 
2.5%
Other values (7) 1020
11.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4940
53.5%
Other Letter 2464
26.7%
Dash Punctuation 1234
 
13.4%
Space Separator 592
 
6.4%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1955
39.6%
2 903
18.3%
1 836
16.9%
8 227
 
4.6%
9 207
 
4.2%
7 182
 
3.7%
6 167
 
3.4%
5 167
 
3.4%
4 156
 
3.2%
3 140
 
2.8%
Other Letter
ValueCountFrequency (%)
616
25.0%
616
25.0%
616
25.0%
616
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 1234
100.0%
Space Separator
ValueCountFrequency (%)
592
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6767
73.3%
Hangul 2464
 
26.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1955
28.9%
- 1234
18.2%
2 903
13.3%
1 836
12.4%
592
 
8.7%
8 227
 
3.4%
9 207
 
3.1%
7 182
 
2.7%
6 167
 
2.5%
5 167
 
2.5%
Other values (3) 297
 
4.4%
Hangul
ValueCountFrequency (%)
616
25.0%
616
25.0%
616
25.0%
616
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6767
73.3%
Hangul 2464
 
26.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1955
28.9%
- 1234
18.2%
2 903
13.3%
1 836
12.4%
592
 
8.7%
8 227
 
3.4%
9 207
 
3.1%
7 182
 
2.7%
6 167
 
2.5%
5 167
 
2.5%
Other values (3) 297
 
4.4%
Hangul
ValueCountFrequency (%)
616
25.0%
616
25.0%
616
25.0%
616
25.0%

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1842
Missing (%)100.0%
Memory size16.3 KiB

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1842
Missing (%)100.0%
Memory size16.3 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1842
Missing (%)100.0%
Memory size16.3 KiB

Interactions

2023-12-12T20:26:02.123100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T20:26:02.370892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:26:02.597756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T20:26:02.779169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번업소명대표자명업소도로명주소지정번관리번Unnamed: 6Unnamed: 7Unnamed: 8
01대현분식최경심울산광역시 남구 왕생로86번길 30. 현대홈타운스위트 상가동 102호 (달동)2021-3700160-05-6-000152021-울산남구-0015<NA><NA><NA>
12더진국이유미울산광역시 남구 두왕로34번길 32 (선암동)2021-3700160-05-6-000142021-울산남구-0014<NA><NA><NA>
23예스마트박수임울산광역시 남구 정광로 17. 1호 (무거동)2021-3700160-05-6-000132021-울산남구-0013<NA><NA><NA>
34신흥황정숙울산광역시 남구 중앙로204번길 32 (신정동)2021-3700160-05-6-000122021-울산남구-0012<NA><NA><NA>
45공원복권방정숙자울산광역시 남구 대학로 60. 에이동 1층 (무거동)2021-3700160-05-6-000112021-울산남구-0011<NA><NA><NA>
56동희산업 복지매점차갑석울산광역시 남구 처용로 675. 동희산업(주) (황성동)2021-3700160-05-6-000102021-울산남구-0009<NA><NA><NA>
67원플러스 옥현이소희울산광역시 남구 옥현로 92-17. 102호 (무거동)2021-3700160-05-6-000092021-울산남구-0010<NA><NA><NA>
78브레드데이이성열울산광역시 남구 옥현로 92-18. 옥현주공2단지아파트 상가 103호 (무거동)2021-3700160-05-6-000082021-울산남구-0008<NA><NA><NA>
89울산남구지역자활센터(씨유울산신정우리점)울산남구지역자활센터(씨유울산신정우리점) 전우창울산광역시 남구 돋질로 40. 제2호 (신정동)2021-3700160-05-6-000072021-울산남구-0007<NA><NA><NA>
910씨유울산남구청점박용희울산광역시 남구 돋질로 253 (삼산동)2021-3700160-05-6-000062021-울산남구-0006<NA><NA><NA>
연번업소명대표자명업소도로명주소지정번관리번Unnamed: 6Unnamed: 7Unnamed: 8
18321833베스토아<NA>울산광역시 중구 반구동 456-13<NA><NA><NA><NA><NA>
18331834한마음<NA>울산광역시 중구 반구동 444-10<NA><NA><NA><NA><NA>
18341835우리들슈퍼<NA>울산광역시 중구 반구동 178-6<NA><NA><NA><NA><NA>
18351836오뚜기슈퍼<NA>울산광역시 중구 반구동 453-15<NA><NA><NA><NA><NA>
18361837영남상회<NA>울산광역시 중구 반구동 호 67B3-5N<NA><NA><NA><NA><NA>
18371838대성미니슈퍼<NA>울산광역시 중구 반구동 305-1<NA><NA><NA><NA><NA>
18381839남운스포렉스 구내<NA>울산광역시 중구 학성동 397-5<NA><NA><NA><NA><NA>
18391840쌍용상회<NA>울산광역시 중구 학성동 21-5<NA><NA><NA><NA><NA>
18401841없음<NA>울산광역시 중구 학성동 189-7<NA><NA><NA><NA><NA>
18411842공판장슈퍼<NA>울산광역시 중구 학성동 364-6<NA><NA><NA><NA><NA>