Overview

Dataset statistics

Number of variables: 8
Number of observations: 1262
Missing cells: 1137
Missing cells (%): 11.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 80.2 KiB
Average record size in memory: 65.1 B
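
The derived figures above follow from the raw counts. A minimal sketch of the arithmetic, assuming the missing-cell percentage is taken over every cell (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative and not part of the report:

```python
# Figures copied from the dataset statistics above.
n_variables = 8
n_observations = 1262
missing_cells = 1137
total_size_kib = 80.2

# Assumption: the denominator is every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Under these assumptions both values round to the figures the report shows.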

Variable types

Numeric: 1
Text: 6
Categorical: 1

Dataset

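Description: Nationwide status of Baeknyeon Gage (백년가게, "Hundred-Year Stores") published by the Small Enterprise and Market Service (소상공인시장진흥공단). The fields provided are business name (업체명), contact number (연락처), province or metropolitan city (시도), city/county/district (시군구), base address (기본주소), detailed address (상세주소), and main business (주요사업).
Author: 소상공인시장진흥공단 (Small Enterprise and Market Service)
URL: https://www.data.go.kr/data/15102255/fileData.do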

Alerts

상세주소 has 1131 (89.6%) missing values (Missing)
연번 has unique values (Unique)
기본주소 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 15:32:54.788344
Analysis finished: 2023-12-12 15:32:56.355592
Duration: 1.57 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
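
A report of this kind could be regenerated roughly as follows. This is a sketch, not the configuration actually used: the file name, the `cp949` encoding, and the report title are assumptions, and default settings are used where the linked config.json would pin the exact options.

```python
def build_report(csv_path: str, output_path: str = "report.html") -> None:
    """Profile a CSV file and write the HTML report to output_path."""
    # Imported lazily so this module loads even where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the CSV is encoded as cp949, which is common for Korean
    # public-data files; change the encoding if decoding fails.
    df = pd.read_csv(csv_path, encoding="cp949")

    # Assumption: illustrative title; default profiling settings.
    report = ProfileReport(df, title="전국 백년가게 현황")
    report.to_file(output_path)


# Usage (hypothetical file name for the CSV downloaded from the dataset URL):
# build_report("baeknyeon_stores.csv")
```

Timings and memory figures will differ between runs and library versions.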

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 1262
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 631.5
Minimum: 1
Maximum: 1262
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 11.2 KiB

Quantile statistics

Minimum: 1
5th percentile: 64.05
Q1: 316.25
Median: 631.5
Q3: 946.75
95th percentile: 1198.95
Maximum: 1262
Range: 1261
Interquartile range (IQR): 630.5
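
With 1262 distinct values, a minimum of 1, a maximum of 1262, and a sum of 796953, 연번 is consistent with the running index 1 through 1262. Under that assumption the quantiles above can be reproduced with linear interpolation. The sketch uses the standard library's `statistics.quantiles` with `method="inclusive"`, chosen here for illustration; it is not necessarily what the profiler runs internally.

```python
import statistics

# Assumption: 연번 holds the integers 1..1262.
values = list(range(1, 1263))

# method="inclusive" interpolates linearly between the two nearest ranks.
quartiles = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")

q1, median, q3 = quartiles
p5, p95 = ventiles[0], ventiles[-1]

print("5th percentile:", p5)
print("Q1, median, Q3:", q1, median, q3)
print("95th percentile:", p95)
print("Range:", max(values) - min(values))
print("IQR:", q3 - q1)
```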

Descriptive statistics

Standard deviation: 364.45233
Coefficient of variation (CV): 0.57712166
Kurtosis: -1.2
Mean: 631.5
Median Absolute Deviation (MAD): 315.5
Skewness: 0
Sum: 796953
Variance: 132825.5
Monotonicity: Strictly increasing
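
The descriptive statistics can be checked the same way. The sketch below assumes 연번 is the sequence 1 through 1262 and uses only the standard library. The reported variance matches the sample variance (n - 1 denominator) of that sequence; the function choices are illustrative.

```python
import math
import statistics

# Assumption: 연번 holds the integers 1..1262.
values = [float(v) for v in range(1, 1263)]

total = sum(values)
mean = statistics.fmean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

print(f"Sum: {total:.0f}")
print(f"Mean: {mean}")
print(f"Variance: {variance}")
print(f"Standard deviation: {std_dev:.5f}")
print(f"CV: {cv:.8f}")
print(f"MAD: {mad}")
```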
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
1 | 1 | 0.1%
840 | 1 | 0.1%
847 | 1 | 0.1%
846 | 1 | 0.1%
845 | 1 | 0.1%
844 | 1 | 0.1%
843 | 1 | 0.1%
842 | 1 | 0.1%
841 | 1 | 0.1%
839 | 1 | 0.1%
Other values (1252) | 1252 | 99.2%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
1262 | 1 | 0.1%
1261 | 1 | 0.1%
1260 | 1 | 0.1%
1259 | 1 | 0.1%
1258 | 1 | 0.1%
1257 | 1 | 0.1%
1256 | 1 | 0.1%
1255 | 1 | 0.1%
1254 | 1 | 0.1%
1253 | 1 | 0.1%

업체명
Text

Distinct: 1241
Distinct (%): 98.3%
Missing: 0
Missing (%): 0.0%
Memory size: 10.0 KiB

Length

Max length: 20
Median length: 19
Mean length: 5.3518225
Min length: 2

Characters and Unicode

Total characters: 6754
Distinct characters: 577
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
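
As an illustration of the character properties mentioned above, the sketch below counts Unicode general categories with the standard library's `unicodedata` module. The two-letter codes map to the category names in this report (`Lo` Other Letter, `Lu` Uppercase Letter, `Zs` Space Separator, `Ps` Open Punctuation, `Pe` Close Punctuation). Scripts and blocks are left out because `unicodedata` does not expose them. The helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count Unicode general categories (e.g. 'Lo', 'Lu') of the characters in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# Business names taken from the sample rows of this report.
for name in ["을지OB베어", "(주)제일레져", "정미소 김치관"]:
    print(name, dict(category_counts(name)))
```

Summing such counters over a whole column would give a table like "Most occurring categories".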

Unique

Unique: 1224
Unique (%): 97.0%

Sample

1st row: 박성구과자점
2nd row: 덕성이발관
3rd row: 팡페이장제과점
4th row: 배영숙산야초밥상
5th row: 신내보리밥

Value | Count | Frequency (%)
주식회사 | 15 | 1.1%
진미식당 | 4 | 0.3%
두꺼비집 | 3 | 0.2%
이태리안경 | 3 | 0.2%
부일식당 | 2 | 0.2%
농업회사법인 | 2 | 0.2%
하동집 | 2 | 0.2%
단골집 | 2 | 0.2%
시골집 | 2 | 0.2%
형제상회 | 2 | 0.2%
Other values (1282) | 1291 | 97.2%

Most occurring characters

ValueCountFrequency (%)
197
 
2.9%
175
 
2.6%
131
 
1.9%
106
 
1.6%
96
 
1.4%
95
 
1.4%
89
 
1.3%
88
 
1.3%
76
 
1.1%
74
 
1.1%
Other values (567) 5627
83.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6492 | 96.1%
Space Separator | 68 | 1.0%
Open Punctuation | 39 | 0.6%
Close Punctuation | 39 | 0.6%
Uppercase Letter | 37 | 0.5%
Other Symbol | 27 | 0.4%
Decimal Number | 25 | 0.4%
Other Punctuation | 17 | 0.3%
Lowercase Letter | 10 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
197
 
3.0%
175
 
2.7%
131
 
2.0%
106
 
1.6%
96
 
1.5%
95
 
1.5%
89
 
1.4%
88
 
1.4%
76
 
1.2%
74
 
1.1%
Other values (527) 5365
82.6%
Uppercase Letter
ValueCountFrequency (%)
G 5
13.5%
B 4
10.8%
O 4
10.8%
T 3
 
8.1%
S 3
 
8.1%
A 2
 
5.4%
M 2
 
5.4%
L 2
 
5.4%
E 2
 
5.4%
J 2
 
5.4%
Other values (7) 8
21.6%
Decimal Number
ValueCountFrequency (%)
1 6
24.0%
8 6
24.0%
9 4
16.0%
7 3
12.0%
6 2
 
8.0%
3 2
 
8.0%
5 1
 
4.0%
0 1
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
o 2
20.0%
l 2
20.0%
e 1
10.0%
a 1
10.0%
r 1
10.0%
y 1
10.0%
m 1
10.0%
b 1
10.0%
Other Punctuation
ValueCountFrequency (%)
. 8
47.1%
& 5
29.4%
, 4
23.5%
Space Separator
ValueCountFrequency (%)
68
100.0%
Open Punctuation
ValueCountFrequency (%)
( 39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 39
100.0%
Other Symbol
ValueCountFrequency (%)
27
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6519 | 96.5%
Common | 188 | 2.8%
Latin | 47 | 0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
197
 
3.0%
175
 
2.7%
131
 
2.0%
106
 
1.6%
96
 
1.5%
95
 
1.5%
89
 
1.4%
88
 
1.3%
76
 
1.2%
74
 
1.1%
Other values (528) 5392
82.7%
Latin
ValueCountFrequency (%)
G 5
 
10.6%
B 4
 
8.5%
O 4
 
8.5%
T 3
 
6.4%
S 3
 
6.4%
o 2
 
4.3%
A 2
 
4.3%
M 2
 
4.3%
L 2
 
4.3%
l 2
 
4.3%
Other values (15) 18
38.3%
Common
ValueCountFrequency (%)
68
36.2%
( 39
20.7%
) 39
20.7%
. 8
 
4.3%
1 6
 
3.2%
8 6
 
3.2%
& 5
 
2.7%
9 4
 
2.1%
, 4
 
2.1%
7 3
 
1.6%
Other values (4) 6
 
3.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6492 | 96.1%
ASCII | 235 | 3.5%
None | 27 | 0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
197
 
3.0%
175
 
2.7%
131
 
2.0%
106
 
1.6%
96
 
1.5%
95
 
1.5%
89
 
1.4%
88
 
1.4%
76
 
1.2%
74
 
1.1%
Other values (527) 5365
82.6%
ASCII
ValueCountFrequency (%)
68
28.9%
( 39
16.6%
) 39
16.6%
. 8
 
3.4%
1 6
 
2.6%
8 6
 
2.6%
G 5
 
2.1%
& 5
 
2.1%
B 4
 
1.7%
9 4
 
1.7%
Other values (29) 51
21.7%
None
ValueCountFrequency (%)
27
100.0%

연락처
Text

Distinct: 1235
Distinct (%): 98.3%
Missing: 6
Missing (%): 0.5%
Memory size: 10.0 KiB

Length

Max length: 14
Median length: 12
Mean length: 11.910032
Min length: 9

Characters and Unicode

Total characters: 14959
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1218
Unique (%): 97.0%

Sample

1st row: 031-595-0098
2nd row: 031-5592-5685
3rd row: 054-248-6163
4th row: 043-543-1136
5th row: 031-771-9199

Value | Count | Frequency (%)
063-856-4422 | 4 | 0.3%
02-888-4849 | 3 | 0.2%
031-483-2379 | 3 | 0.2%
02-2642-9399 | 2 | 0.2%
055-741-1410 | 2 | 0.2%
032-503-8054 | 2 | 0.2%
041-338-2654 | 2 | 0.2%
02-2279-3152 | 2 | 0.2%
032-522-1700 | 2 | 0.2%
033-255-2069 | 2 | 0.2%
Other values (1225) | 1232 | 98.1%

Most occurring characters

Value | Count | Frequency (%)
- | 2510 | 16.8%
0 | 1984 | 13.3%
3 | 1691 | 11.3%
2 | 1530 | 10.2%
5 | 1507 | 10.1%
4 | 1306 | 8.7%
6 | 1124 | 7.5%
1 | 1100 | 7.4%
7 | 838 | 5.6%
8 | 791 | 5.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 12449 | 83.2%
Dash Punctuation | 2510 | 16.8%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 1984 | 15.9%
3 | 1691 | 13.6%
2 | 1530 | 12.3%
5 | 1507 | 12.1%
4 | 1306 | 10.5%
6 | 1124 | 9.0%
1 | 1100 | 8.8%
7 | 838 | 6.7%
8 | 791 | 6.4%
9 | 578 | 4.6%
Dash Punctuation
Value | Count | Frequency (%)
- | 2510 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 14959 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 2510 | 16.8%
0 | 1984 | 13.3%
3 | 1691 | 11.3%
2 | 1530 | 10.2%
5 | 1507 | 10.1%
4 | 1306 | 8.7%
6 | 1124 | 7.5%
1 | 1100 | 7.4%
7 | 838 | 5.6%
8 | 791 | 5.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 14959 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 2510 | 16.8%
0 | 1984 | 13.3%
3 | 1691 | 11.3%
2 | 1530 | 10.2%
5 | 1507 | 10.1%
4 | 1306 | 8.7%
6 | 1124 | 7.5%
1 | 1100 | 7.4%
7 | 838 | 5.6%
8 | 791 | 5.3%

시도
Categorical

Distinct: 18
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 10.0 KiB

경기: 186
서울: 154
경북: 113
경남: 103
충북: 83
Other values (13): 623

Length

Max length: 3
Median length: 2
Mean length: 2.0007924
Min length: 2

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 경기
2nd row: 경기
3rd row: 경북
4th row: 충북
5th row: 경기

Common Values

Value | Count | Frequency (%)
경기 | 186 | 14.7%
서울 | 154 | 12.2%
경북 | 113 | 9.0%
경남 | 103 | 8.2%
충북 | 83 | 6.6%
강원 | 83 | 6.6%
부산 | 80 | 6.3%
전남 | 74 | 5.9%
전북 | 74 | 5.9%
대구 | 67 | 5.3%
Other values (8) | 245 | 19.4%

Length

Histogram of lengths of the category

시군구
Text

Distinct: 202
Distinct (%): 16.0%
Missing: 0
Missing (%): 0.0%
Memory size: 10.0 KiB

Length

Max length: 5
Median length: 3
Mean length: 2.9064976
Min length: 2

Characters and Unicode

Total characters: 3668
Distinct characters: 133
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 43
Unique (%): 3.4%

Sample

1st row: 남양주시
2nd row: 남양주시
3rd row: 포항시
4th row: 보은군
5th row: 양평군

Value | Count | Frequency (%)
중구 | 64 | 5.1%
동구 | 53 | 4.2%
남구 | 30 | 2.4%
포항시 | 26 | 2.1%
창원시 | 26 | 2.1%
전주시 | 24 | 1.9%
청주시 | 23 | 1.8%
북구 | 21 | 1.7%
수원시 | 20 | 1.6%
순천시 | 18 | 1.4%
Other values (192) | 957 | 75.8%

Most occurring characters

ValueCountFrequency (%)
646
 
17.6%
457
 
12.5%
192
 
5.2%
156
 
4.3%
119
 
3.2%
112
 
3.1%
104
 
2.8%
96
 
2.6%
84
 
2.3%
73
 
2.0%
Other values (123) 1629
44.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3668
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
646
 
17.6%
457
 
12.5%
192
 
5.2%
156
 
4.3%
119
 
3.2%
112
 
3.1%
104
 
2.8%
96
 
2.6%
84
 
2.3%
73
 
2.0%
Other values (123) 1629
44.4%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3668
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
646
 
17.6%
457
 
12.5%
192
 
5.2%
156
 
4.3%
119
 
3.2%
112
 
3.1%
104
 
2.8%
96
 
2.6%
84
 
2.3%
73
 
2.0%
Other values (123) 1629
44.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3668
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
646
 
17.6%
457
 
12.5%
192
 
5.2%
156
 
4.3%
119
 
3.2%
112
 
3.1%
104
 
2.8%
96
 
2.6%
84
 
2.3%
73
 
2.0%
Other values (123) 1629
44.4%

기본주소
Text

UNIQUE 

Distinct: 1262
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 10.0 KiB

Length

Max length: 60
Median length: 45
Mean length: 23.66878
Min length: 12

Characters and Unicode

Total characters: 29870
Distinct characters: 418
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1262
Unique (%): 100.0%

Sample

1st row: 경기 남양주시 금곡동 404-41
2nd row: 경기 남양주시 금곡동 430-40
3rd row: 경북 포항시 북구 대안길 69 (용흥동)
4th row: 충북 보은군 속리산면 법주사로 253 (사내리, 배영숙 산야초밥상)
5th row: 경기 양평군 개군면 공서울길 39 (공세리)

Value | Count | Frequency (%)
경기도 | 121 | 2.0%
서울특별시 | 91 | 1.5%
경상북도 | 68 | 1.1%
1층 | 66 | 1.1%
경기 | 65 | 1.1%
중구 | 64 | 1.0%
서울 | 62 | 1.0%
경상남도 | 59 | 1.0%
동구 | 53 | 0.9%
충청북도 | 50 | 0.8%
Other values (2984) | 5454 | 88.6%

Most occurring characters

ValueCountFrequency (%)
4901
 
16.4%
1 1228
 
4.1%
1046
 
3.5%
998
 
3.3%
983
 
3.3%
( 812
 
2.7%
) 810
 
2.7%
747
 
2.5%
2 679
 
2.3%
608
 
2.0%
Other values (408) 17058
57.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 17628 | 59.0%
Decimal Number | 4975 | 16.7%
Space Separator | 4901 | 16.4%
Open Punctuation | 812 | 2.7%
Close Punctuation | 810 | 2.7%
Dash Punctuation | 363 | 1.2%
Other Punctuation | 362 | 1.2%
Uppercase Letter | 14 | < 0.1%
Math Symbol | 4 | < 0.1%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1046
 
5.9%
998
 
5.7%
983
 
5.6%
747
 
4.2%
608
 
3.4%
541
 
3.1%
462
 
2.6%
446
 
2.5%
395
 
2.2%
363
 
2.1%
Other values (384) 11039
62.6%
Decimal Number
ValueCountFrequency (%)
1 1228
24.7%
2 679
13.6%
3 588
11.8%
5 446
 
9.0%
4 420
 
8.4%
6 364
 
7.3%
0 354
 
7.1%
7 325
 
6.5%
8 301
 
6.1%
9 270
 
5.4%
Uppercase Letter
ValueCountFrequency (%)
B 5
35.7%
A 4
28.6%
C 3
21.4%
O 1
 
7.1%
H 1
 
7.1%
Other Punctuation
ValueCountFrequency (%)
, 357
98.6%
. 4
 
1.1%
/ 1
 
0.3%
Space Separator
ValueCountFrequency (%)
4901
100.0%
Open Punctuation
ValueCountFrequency (%)
( 812
100.0%
Close Punctuation
ValueCountFrequency (%)
) 810
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 363
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 17628 | 59.0%
Common | 12227 | 40.9%
Latin | 15 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1046
 
5.9%
998
 
5.7%
983
 
5.6%
747
 
4.2%
608
 
3.4%
541
 
3.1%
462
 
2.6%
446
 
2.5%
395
 
2.2%
363
 
2.1%
Other values (384) 11039
62.6%
Common
ValueCountFrequency (%)
4901
40.1%
1 1228
 
10.0%
( 812
 
6.6%
) 810
 
6.6%
2 679
 
5.6%
3 588
 
4.8%
5 446
 
3.6%
4 420
 
3.4%
6 364
 
3.0%
- 363
 
3.0%
Other values (8) 1616
 
13.2%
Latin
ValueCountFrequency (%)
B 5
33.3%
A 4
26.7%
C 3
20.0%
O 1
 
6.7%
b 1
 
6.7%
H 1
 
6.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 17628 | 59.0%
ASCII | 12242 | 41.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4901
40.0%
1 1228
 
10.0%
( 812
 
6.6%
) 810
 
6.6%
2 679
 
5.5%
3 588
 
4.8%
5 446
 
3.6%
4 420
 
3.4%
6 364
 
3.0%
- 363
 
3.0%
Other values (14) 1631
 
13.3%
Hangul
ValueCountFrequency (%)
1046
 
5.9%
998
 
5.7%
983
 
5.6%
747
 
4.2%
608
 
3.4%
541
 
3.1%
462
 
2.6%
446
 
2.5%
395
 
2.2%
363
 
2.1%
Other values (384) 11039
62.6%

상세주소
Text

MISSING 

Distinct: 95
Distinct (%): 72.5%
Missing: 1131
Missing (%): 89.6%
Memory size: 10.0 KiB

Length

Max length: 30
Median length: 19
Mean length: 5.9465649
Min length: 1

Characters and Unicode

Total characters: 779
Distinct characters: 204
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 91
Unique (%): 69.5%

Sample

1st row: 1층 박성구과자점
2nd row: 금곡로28
3rd row: 가동 101호 (용흥동, 제일상가)
4th row: 배영숙 산야초밥상
5th row: 1층

Value | Count | Frequency (%)
1층 | 50 | 25.9%
2층 | 10 | 5.2%
102호 | 3 | 1.6%
1 | 3 | 1.6%
가동 | 2 | 1.0%
일부 | 2 | 1.0%
5층 | 2 | 1.0%
101호 | 2 | 1.0%
3층 | 2 | 1.0%
두꺼비집부대찌개 | 1 | 0.5%
Other values (116) | 116 | 60.1%

Most occurring characters

ValueCountFrequency (%)
1 85
 
10.9%
70
 
9.0%
70
 
9.0%
2 28
 
3.6%
26
 
3.3%
, 19
 
2.4%
0 18
 
2.3%
16
 
2.1%
3 11
 
1.4%
11
 
1.4%
Other values (194) 425
54.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 503 | 64.6%
Decimal Number | 166 | 21.3%
Space Separator | 70 | 9.0%
Other Punctuation | 19 | 2.4%
Dash Punctuation | 7 | 0.9%
Close Punctuation | 6 | 0.8%
Open Punctuation | 5 | 0.6%
Uppercase Letter | 2 | 0.3%
Math Symbol | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
70
 
13.9%
26
 
5.2%
16
 
3.2%
11
 
2.2%
10
 
2.0%
9
 
1.8%
7
 
1.4%
7
 
1.4%
7
 
1.4%
6
 
1.2%
Other values (177) 334
66.4%
Decimal Number
ValueCountFrequency (%)
1 85
51.2%
2 28
 
16.9%
0 18
 
10.8%
3 11
 
6.6%
4 7
 
4.2%
5 6
 
3.6%
7 5
 
3.0%
6 4
 
2.4%
8 2
 
1.2%
Uppercase Letter
ValueCountFrequency (%)
D 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
70
100.0%
Other Punctuation
ValueCountFrequency (%)
, 19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 503 | 64.6%
Common | 274 | 35.2%
Latin | 2 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
70
 
13.9%
26
 
5.2%
16
 
3.2%
11
 
2.2%
10
 
2.0%
9
 
1.8%
7
 
1.4%
7
 
1.4%
7
 
1.4%
6
 
1.2%
Other values (177) 334
66.4%
Common
ValueCountFrequency (%)
1 85
31.0%
70
25.5%
2 28
 
10.2%
, 19
 
6.9%
0 18
 
6.6%
3 11
 
4.0%
- 7
 
2.6%
4 7
 
2.6%
) 6
 
2.2%
5 6
 
2.2%
Other values (5) 17
 
6.2%
Latin
ValueCountFrequency (%)
D 1
50.0%
B 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 503 | 64.6%
ASCII | 276 | 35.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 85
30.8%
70
25.4%
2 28
 
10.1%
, 19
 
6.9%
0 18
 
6.5%
3 11
 
4.0%
- 7
 
2.5%
4 7
 
2.5%
) 6
 
2.2%
5 6
 
2.2%
Other values (7) 19
 
6.9%
Hangul
ValueCountFrequency (%)
70
 
13.9%
26
 
5.2%
16
 
3.2%
11
 
2.2%
10
 
2.0%
9
 
1.8%
7
 
1.4%
7
 
1.4%
7
 
1.4%
6
 
1.2%
Other values (177) 334
66.4%

주요사업
Text

Distinct: 784
Distinct (%): 62.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.0 KiB

Length

Max length: 19
Median length: 17
Mean length: 5.118859
Min length: 1

Characters and Unicode

Total characters: 6460
Distinct characters: 416
Distinct categories: 6
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 642
Unique (%): 50.9%

Sample

1st row: 제과
2nd row: 이·미용
3rd row: 제과,제빵
4th row: 산채정식
5th row: 보리밥

Value | Count | Frequency (%)
한식 | 44 | 2.5%
미용 | 37 | 2.1%
제과 | 28 | 1.6%
한정식 | 25 | 1.4%
냉면 | 25 | 1.4%
안경 | 21 | 1.2%
서적 | 19 | 1.1%
중화요리 | 17 | 1.0%
(not shown) | 17 | 1.0%
부대찌개 | 16 | 0.9%
Other values (813) | 1487 | 85.7%

Most occurring characters

ValueCountFrequency (%)
481
 
7.4%
, 415
 
6.4%
187
 
2.9%
147
 
2.3%
123
 
1.9%
121
 
1.9%
117
 
1.8%
110
 
1.7%
108
 
1.7%
97
 
1.5%
Other values (406) 4554
70.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5498 | 85.1%
Space Separator | 481 | 7.4%
Other Punctuation | 423 | 6.5%
Close Punctuation | 28 | 0.4%
Open Punctuation | 28 | 0.4%
Uppercase Letter | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
187
 
3.4%
147
 
2.7%
123
 
2.2%
121
 
2.2%
117
 
2.1%
110
 
2.0%
108
 
2.0%
97
 
1.8%
93
 
1.7%
89
 
1.6%
Other values (397) 4306
78.3%
Other Punctuation
ValueCountFrequency (%)
, 415
98.1%
· 5
 
1.2%
/ 2
 
0.5%
. 1
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
B 1
50.0%
O 1
50.0%
Space Separator
ValueCountFrequency (%)
481
100.0%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5497 | 85.1%
Common | 960 | 14.9%
Latin | 2 | < 0.1%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
187
 
3.4%
147
 
2.7%
123
 
2.2%
121
 
2.2%
117
 
2.1%
110
 
2.0%
108
 
2.0%
97
 
1.8%
93
 
1.7%
89
 
1.6%
Other values (396) 4305
78.3%
Common
ValueCountFrequency (%)
481
50.1%
, 415
43.2%
) 28
 
2.9%
( 28
 
2.9%
· 5
 
0.5%
/ 2
 
0.2%
. 1
 
0.1%
Latin
ValueCountFrequency (%)
B 1
50.0%
O 1
50.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5497 | 85.1%
ASCII | 957 | 14.8%
None | 5 | 0.1%
CJK | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
481
50.3%
, 415
43.4%
) 28
 
2.9%
( 28
 
2.9%
/ 2
 
0.2%
. 1
 
0.1%
B 1
 
0.1%
O 1
 
0.1%
Hangul
ValueCountFrequency (%)
187
 
3.4%
147
 
2.7%
123
 
2.2%
121
 
2.2%
117
 
2.1%
110
 
2.0%
108
 
2.0%
97
 
1.8%
93
 
1.7%
89
 
1.6%
Other values (396) 4305
78.3%
None
ValueCountFrequency (%)
· 5
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

Interactions


Correlations

Variable | 연번 | 시도 | 상세주소
연번 | 1.000 | 0.406 | 0.856
시도 | 0.406 | 1.000 | 0.652
상세주소 | 0.856 | 0.652 | 1.000

Variable | 연번 | 시도
연번 | 1.000 | 0.167
시도 | 0.167 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
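
One common way to quantify nullity correlation is the Pearson correlation between per-column presence indicators (1 where a value is present, 0 where it is missing). The sketch below takes that approach as an assumption; the report does not state which formula it uses. The function name and the toy columns are hypothetical.

```python
import math


def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a = [0.0 if v is None else 1.0 for v in col_a]
    b = [0.0 if v is None else 1.0 for v in col_b]
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A column that is always present (or always missing) has no variation.
        return math.nan
    return cov / math.sqrt(var_a * var_b)


# Toy data (hypothetical, not taken from the dataset); None marks a missing cell.
phone = ["031-1", None, "02-2", None, "054-3", "063-4"]
detail = ["1층", None, "2층", None, None, "1층"]
print(nullity_correlation(phone, detail))
```

A value near 1 means the two columns tend to be missing in the same rows, a value near -1 means one tends to be present where the other is missing, and columns with no missing values yield no defined correlation.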

Sample

First rows

Index | 연번 | 업체명 | 연락처 | 시도 | 시군구 | 기본주소 | 상세주소 | 주요사업
0 | 1 | 박성구과자점 | 031-595-0098 | 경기 | 남양주시 | 경기 남양주시 금곡동 404-41 | 1층 박성구과자점 | 제과
1 | 2 | 덕성이발관 | 031-5592-5685 | 경기 | 남양주시 | 경기 남양주시 금곡동 430-40 | 금곡로28 | 이·미용
2 | 3 | 팡페이장제과점 | 054-248-6163 | 경북 | 포항시 | 경북 포항시 북구 대안길 69 (용흥동) | 가동 101호 (용흥동, 제일상가) | 제과,제빵
3 | 4 | 배영숙산야초밥상 | 043-543-1136 | 충북 | 보은군 | 충북 보은군 속리산면 법주사로 253 (사내리, 배영숙 산야초밥상) | 배영숙 산야초밥상 | 산채정식
4 | 5 | 신내보리밥 | 031-771-9199 | 경기 | 양평군 | 경기 양평군 개군면 공서울길 39 (공세리) | 1층 | 보리밥
5 | 6 | 옥천면옥 | 031-772-5187 | 경기 | 양평군 | 경기 양평군 옥천면 옥천길 13 (옥천리) | 옥천면옥 | 냉면
6 | 7 | 정미소 김치관 | 031-261-0042 | 경기 | 용인시 | 경기 용인시 수지구 신봉1로 302 (신봉동) | 1층 | 김치만두전골
7 | 8 | 장수촌 | 031-262-7711 | 경기 | 용인시 | 경기 용인시 수지구 동천로 631 (고기동) | 1층 | 백숙
8 | 9 | (주)제일레져 | 053-554-5524 | 대구 | 동구 | 대구 동구 팔공로 535 (지묘동, 제일레져) | 제일레져 | 철물구조
9 | 10 | 미세스고등어본점 | 054-291-9018 | 경북 | 포항시 | 경북 포항시 남구 정몽주로 873 (청림동) | 미세스고등어 본점 | 고등어구이

Last rows

Index | 연번 | 업체명 | 연락처 | 시도 | 시군구 | 기본주소 | 상세주소 | 주요사업
1252 | 1253 | 진미양념통닭 | 033-746-6896 | 강원 | 원주시 | 강원도 원주시 우산로 66(우산동) | <NA> | 치킨
1253 | 1254 | ㈜백년가게국제의료기 | 053-426-7579 | 대구 | 달서구 | 대구광역시 달서구 성당로 273(두류동) | <NA> | 의료소모품, 의료기기 도소매
1254 | 1255 | 제일스포츠 | 063-535-9146 | 전북 | 정읍시 | 전라북도 정읍시 중앙로 83(수성동) | <NA> | 운동용품, 운동구, 체육시설물 등
1255 | 1256 | 정우상사 | 02-2272-2688 | 서울 | 종로구 | 서울특별시 종로구 창경궁로 109(인의동) | 1043호 세원스퀘어 | 시계 도소매
1256 | 1257 | 을지OB베어 | 02-2264-1597 | 서울 | 중구 | 서울특별시 중구 충무로9길 12(을지로3가) | 1층 104호 | 노가리, 번데기, OB생맥주
1257 | 1258 | 스미센 | 053-752-0980 | 대구 | 동구 | 대구광역시 동구 동부로30길 15(신천동) | <NA> | 민물장어구이, 초밥
1258 | 1259 | 선천집 | 02-734-1970 | 서울 | 종로구 | 서울특별시 종로구 인사동14길 5(관훈동) | 100-3, 110-4 | 한정식
1259 | 1260 | 만석장 | 02-385-2093 | 서울 | 은평구 | 서울특별시 은평구 대서문길 43-10(진관동) | 1, 2층 | 두부요리, 쌈밥전문점
1260 | 1261 | 대림동삼거리먼지막순대국 | 02-848-2469 | 서울 | 영등포구 | 서울특별시 영등포구 시흥대로185길 11(대림동) | 1층,(지하층) | 순대국
1261 | 1262 | 늘채움 | 063-272-5737 | 전북 | 전주시 | 전라북도 전주시 덕진구 덕진연못3길 6(덕진동2가) | <NA> | 늘채움정식, 생선구이 등