Overview

Dataset statistics

Number of variables10
Number of observations2213
Missing cells477
Missing cells (%)2.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory177.3 KiB
Average record size in memory82.1 B

Variable types

Categorical3
Numeric2
Text5

Dataset

Description경상북도 칠곡군 음식점 현황입니다. 업종명, 인허가일자, 업소명, 영업자, 소재지주소, 영업장면적, 전화번호, 업태명 등이 등록되어있습니다.
Author경상북도 칠곡군
URLhttps://www.data.go.kr/data/15069093/fileData.do

Alerts

업종명 is highly overall correlated with 업태명High correlation
업태명 is highly overall correlated with 업종명High correlation
법인명 is highly imbalanced (94.8%)Imbalance
소재지전화 has 477 (21.6%) missing valuesMissing
영업장면적 has 107 (4.8%) zerosZeros

Reproduction

Analysis started2023-12-12 18:58:23.958003
Analysis finished2023-12-12 18:58:26.666227
Duration2.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size17.4 KiB
일반음식점
1962 
휴게음식점
251 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row일반음식점

Common Values

ValueCountFrequency (%)
일반음식점 1962
88.7%
휴게음식점 251
 
11.3%

Length

2023-12-13T03:58:26.776652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:58:26.971707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 1962
88.7%
휴게음식점 251
 
11.3%

인허가일자
Real number (ℝ)

Distinct1642
Distinct (%)74.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38677.11
Minimum26565
Maximum42528
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size19.6 KiB
2023-12-13T03:58:27.199352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum26565
5-th percentile33390
Q136671
median39127
Q341487
95-th percentile42334.4
Maximum42528
Range15963
Interquartile range (IQR)4816

Descriptive statistics

Standard deviation3100.3287
Coefficient of variation (CV)0.080159264
Kurtosis0.43660711
Mean38677.11
Median Absolute Deviation (MAD)2416
Skewness-0.8445174
Sum85592445
Variance9612038.1
MonotonicityNot monotonic
2023-12-13T03:58:27.461629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
42284 9
 
0.4%
38253 6
 
0.3%
41835 6
 
0.3%
29826 5
 
0.2%
37265 5
 
0.2%
41984 5
 
0.2%
42116 5
 
0.2%
41981 5
 
0.2%
40037 5
 
0.2%
42475 5
 
0.2%
Other values (1632) 2157
97.5%
ValueCountFrequency (%)
26565 1
 
< 0.1%
27806 2
0.1%
27810 3
0.1%
27816 1
 
< 0.1%
27915 1
 
< 0.1%
27962 1
 
< 0.1%
27979 1
 
< 0.1%
28229 1
 
< 0.1%
28241 1
 
< 0.1%
28296 1
 
< 0.1%
ValueCountFrequency (%)
42528 1
 
< 0.1%
42523 2
0.1%
42522 1
 
< 0.1%
42521 2
0.1%
42520 1
 
< 0.1%
42516 1
 
< 0.1%
42514 3
0.1%
42510 1
 
< 0.1%
42509 1
 
< 0.1%
42507 2
0.1%
Distinct2141
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size17.4 KiB
2023-12-13T03:58:27.861456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length27
Mean length5.9114324
Min length1

Characters and Unicode

Total characters13082
Distinct characters728
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2077 ?
Unique (%)93.9%

Sample

1st row서울식당
2nd row영진식당
3rd row약목식육식당
4th row밴프
5th row남성식당
ValueCountFrequency (%)
왜관점 10
 
0.4%
칠곡(서울방향)휴게소 8
 
0.3%
칠곡(하)휴게소 5
 
0.2%
석적중리점 4
 
0.2%
석적점 4
 
0.2%
세븐일레븐 4
 
0.2%
북삼점 4
 
0.2%
대성식당 3
 
0.1%
구내식당 3
 
0.1%
중리점 3
 
0.1%
Other values (2208) 2302
98.0%
2023-12-13T03:58:28.485425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
443
 
3.4%
358
 
2.7%
296
 
2.3%
235
 
1.8%
216
 
1.7%
165
 
1.3%
140
 
1.1%
138
 
1.1%
135
 
1.0%
134
 
1.0%
Other values (718) 10822
82.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12388
94.7%
Uppercase Letter 173
 
1.3%
Space Separator 138
 
1.1%
Decimal Number 118
 
0.9%
Lowercase Letter 81
 
0.6%
Close Punctuation 69
 
0.5%
Open Punctuation 68
 
0.5%
Other Punctuation 45
 
0.3%
Math Symbol 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
443
 
3.6%
358
 
2.9%
296
 
2.4%
235
 
1.9%
216
 
1.7%
165
 
1.3%
140
 
1.1%
135
 
1.1%
134
 
1.1%
133
 
1.1%
Other values (655) 10133
81.8%
Uppercase Letter
ValueCountFrequency (%)
B 22
12.7%
O 16
 
9.2%
E 16
 
9.2%
G 13
 
7.5%
C 13
 
7.5%
P 10
 
5.8%
T 10
 
5.8%
S 9
 
5.2%
H 8
 
4.6%
A 8
 
4.6%
Other values (13) 48
27.7%
Lowercase Letter
ValueCountFrequency (%)
e 14
17.3%
o 11
13.6%
c 7
8.6%
f 7
8.6%
h 6
7.4%
a 6
7.4%
r 5
 
6.2%
m 4
 
4.9%
n 4
 
4.9%
i 3
 
3.7%
Other values (8) 14
17.3%
Decimal Number
ValueCountFrequency (%)
2 28
23.7%
5 23
19.5%
0 19
16.1%
3 13
11.0%
1 11
 
9.3%
8 9
 
7.6%
9 5
 
4.2%
4 4
 
3.4%
6 3
 
2.5%
7 3
 
2.5%
Other Punctuation
ValueCountFrequency (%)
& 25
55.6%
. 9
 
20.0%
, 4
 
8.9%
# 3
 
6.7%
! 2
 
4.4%
' 1
 
2.2%
? 1
 
2.2%
Space Separator
ValueCountFrequency (%)
138
100.0%
Close Punctuation
ValueCountFrequency (%)
) 69
100.0%
Open Punctuation
ValueCountFrequency (%)
( 68
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12380
94.6%
Common 440
 
3.4%
Latin 254
 
1.9%
Han 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
443
 
3.6%
358
 
2.9%
296
 
2.4%
235
 
1.9%
216
 
1.7%
165
 
1.3%
140
 
1.1%
135
 
1.1%
134
 
1.1%
133
 
1.1%
Other values (647) 10125
81.8%
Latin
ValueCountFrequency (%)
B 22
 
8.7%
O 16
 
6.3%
E 16
 
6.3%
e 14
 
5.5%
G 13
 
5.1%
C 13
 
5.1%
o 11
 
4.3%
P 10
 
3.9%
T 10
 
3.9%
S 9
 
3.5%
Other values (31) 120
47.2%
Common
ValueCountFrequency (%)
138
31.4%
) 69
15.7%
( 68
15.5%
2 28
 
6.4%
& 25
 
5.7%
5 23
 
5.2%
0 19
 
4.3%
3 13
 
3.0%
1 11
 
2.5%
. 9
 
2.0%
Other values (12) 37
 
8.4%
Han
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12380
94.6%
ASCII 694
 
5.3%
CJK 7
 
0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
443
 
3.6%
358
 
2.9%
296
 
2.4%
235
 
1.9%
216
 
1.7%
165
 
1.3%
140
 
1.1%
135
 
1.1%
134
 
1.1%
133
 
1.1%
Other values (647) 10125
81.8%
ASCII
ValueCountFrequency (%)
138
19.9%
) 69
 
9.9%
( 68
 
9.8%
2 28
 
4.0%
& 25
 
3.6%
5 23
 
3.3%
B 22
 
3.2%
0 19
 
2.7%
O 16
 
2.3%
E 16
 
2.3%
Other values (53) 270
38.9%
CJK
ValueCountFrequency (%)
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct1954
Distinct (%)88.3%
Missing0
Missing (%)0.0%
Memory size17.4 KiB
2023-12-13T03:58:29.010990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length3
Mean length3.0244013
Min length2

Characters and Unicode

Total characters6693
Distinct characters250
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1780 ?
Unique (%)80.4%

Sample

1st row이신자
2nd row박정숙
3rd row김창순
4th row김순연
5th row이순옥
ValueCountFrequency (%)
고재욱 12
 
0.5%
이기진 8
 
0.4%
성대현 8
 
0.4%
이영자 5
 
0.2%
박정숙 5
 
0.2%
김미경 5
 
0.2%
김영숙 5
 
0.2%
김정희 4
 
0.2%
김인숙 4
 
0.2%
이은주 4
 
0.2%
Other values (1952) 2161
97.3%
2023-12-13T03:58:29.758797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
476
 
7.1%
348
 
5.2%
286
 
4.3%
221
 
3.3%
210
 
3.1%
202
 
3.0%
180
 
2.7%
173
 
2.6%
150
 
2.2%
137
 
2.0%
Other values (240) 4310
64.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6614
98.8%
Uppercase Letter 68
 
1.0%
Space Separator 8
 
0.1%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
476
 
7.2%
348
 
5.3%
286
 
4.3%
221
 
3.3%
210
 
3.2%
202
 
3.1%
180
 
2.7%
173
 
2.6%
150
 
2.3%
137
 
2.1%
Other values (216) 4231
64.0%
Uppercase Letter
ValueCountFrequency (%)
A 13
19.1%
N 8
11.8%
I 7
10.3%
O 6
 
8.8%
U 4
 
5.9%
X 4
 
5.9%
G 4
 
5.9%
M 3
 
4.4%
P 2
 
2.9%
R 2
 
2.9%
Other values (10) 15
22.1%
Space Separator
ValueCountFrequency (%)
8
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6614
98.8%
Latin 68
 
1.0%
Common 11
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
476
 
7.2%
348
 
5.3%
286
 
4.3%
221
 
3.3%
210
 
3.2%
202
 
3.1%
180
 
2.7%
173
 
2.6%
150
 
2.3%
137
 
2.1%
Other values (216) 4231
64.0%
Latin
ValueCountFrequency (%)
A 13
19.1%
N 8
11.8%
I 7
10.3%
O 6
 
8.8%
U 4
 
5.9%
X 4
 
5.9%
G 4
 
5.9%
M 3
 
4.4%
P 2
 
2.9%
R 2
 
2.9%
Other values (10) 15
22.1%
Common
ValueCountFrequency (%)
8
72.7%
) 1
 
9.1%
( 1
 
9.1%
1 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6614
98.8%
ASCII 79
 
1.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
476
 
7.2%
348
 
5.3%
286
 
4.3%
221
 
3.3%
210
 
3.2%
202
 
3.1%
180
 
2.7%
173
 
2.6%
150
 
2.3%
137
 
2.1%
Other values (216) 4231
64.0%
ASCII
ValueCountFrequency (%)
A 13
16.5%
N 8
 
10.1%
8
 
10.1%
I 7
 
8.9%
O 6
 
7.6%
U 4
 
5.1%
X 4
 
5.1%
G 4
 
5.1%
M 3
 
3.8%
P 2
 
2.5%
Other values (14) 20
25.3%
Distinct1763
Distinct (%)79.7%
Missing0
Missing (%)0.0%
Memory size17.4 KiB
2023-12-13T03:58:30.273994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length47
Mean length20.895165
Min length1

Characters and Unicode

Total characters46241
Distinct characters174
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1491 ?
Unique (%)67.4%

Sample

1st row-
2nd row경상북도 칠곡군 지천면 신동로7길 2
3rd row-
4th row경상북도 칠곡군 왜관읍 중앙로 205-9
5th row-
ValueCountFrequency (%)
칠곡군 2128
19.3%
경상북도 2128
19.3%
왜관읍 635
 
5.8%
석적읍 504
 
4.6%
북삼읍 409
 
3.7%
동명면 255
 
2.3%
약목면 127
 
1.2%
가산면 113
 
1.0%
중앙로 103
 
0.9%
유학로 98
 
0.9%
Other values (1163) 4501
40.9%
2023-12-13T03:58:31.053866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8788
19.0%
2869
 
6.2%
2223
 
4.8%
2216
 
4.8%
2197
 
4.8%
2188
 
4.7%
2180
 
4.7%
2146
 
4.6%
1 1741
 
3.8%
1548
 
3.3%
Other values (164) 18145
39.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29607
64.0%
Space Separator 8788
 
19.0%
Decimal Number 6933
 
15.0%
Dash Punctuation 585
 
1.3%
Other Punctuation 141
 
0.3%
Close Punctuation 77
 
0.2%
Open Punctuation 77
 
0.2%
Uppercase Letter 33
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2869
 
9.7%
2223
 
7.5%
2216
 
7.5%
2197
 
7.4%
2188
 
7.4%
2180
 
7.4%
2146
 
7.2%
1548
 
5.2%
1312
 
4.4%
1199
 
4.0%
Other values (144) 9529
32.2%
Decimal Number
ValueCountFrequency (%)
1 1741
25.1%
2 1078
15.5%
3 710
10.2%
5 577
 
8.3%
4 533
 
7.7%
6 507
 
7.3%
8 479
 
6.9%
0 471
 
6.8%
7 442
 
6.4%
9 395
 
5.7%
Uppercase Letter
ValueCountFrequency (%)
A 18
54.5%
B 10
30.3%
C 3
 
9.1%
L 1
 
3.0%
D 1
 
3.0%
Space Separator
ValueCountFrequency (%)
8788
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 585
100.0%
Other Punctuation
ValueCountFrequency (%)
, 141
100.0%
Close Punctuation
ValueCountFrequency (%)
) 77
100.0%
Open Punctuation
ValueCountFrequency (%)
( 77
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29607
64.0%
Common 16601
35.9%
Latin 33
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2869
 
9.7%
2223
 
7.5%
2216
 
7.5%
2197
 
7.4%
2188
 
7.4%
2180
 
7.4%
2146
 
7.2%
1548
 
5.2%
1312
 
4.4%
1199
 
4.0%
Other values (144) 9529
32.2%
Common
ValueCountFrequency (%)
8788
52.9%
1 1741
 
10.5%
2 1078
 
6.5%
3 710
 
4.3%
- 585
 
3.5%
5 577
 
3.5%
4 533
 
3.2%
6 507
 
3.1%
8 479
 
2.9%
0 471
 
2.8%
Other values (5) 1132
 
6.8%
Latin
ValueCountFrequency (%)
A 18
54.5%
B 10
30.3%
C 3
 
9.1%
L 1
 
3.0%
D 1
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29607
64.0%
ASCII 16634
36.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8788
52.8%
1 1741
 
10.5%
2 1078
 
6.5%
3 710
 
4.3%
- 585
 
3.5%
5 577
 
3.5%
4 533
 
3.2%
6 507
 
3.0%
8 479
 
2.9%
0 471
 
2.8%
Other values (10) 1165
 
7.0%
Hangul
ValueCountFrequency (%)
2869
 
9.7%
2223
 
7.5%
2216
 
7.5%
2197
 
7.4%
2188
 
7.4%
2180
 
7.4%
2146
 
7.2%
1548
 
5.2%
1312
 
4.4%
1199
 
4.0%
Other values (144) 9529
32.2%
Distinct1857
Distinct (%)83.9%
Missing0
Missing (%)0.0%
Memory size17.4 KiB
2023-12-13T03:58:31.558105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length54
Mean length27.756439
Min length4

Characters and Unicode

Total characters61425
Distinct characters171
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1603 ?
Unique (%)72.4%

Sample

1st row경상북도 칠곡군 왜관읍 석전리 424번지
2nd row경상북도 칠곡군 지천면 신리 399번지
3rd row경상북도 칠곡군 왜관읍 왜관리 212번지
4th row경상북도 칠곡군 왜관읍 왜관리 235번지 4호
5th row경상북도 칠곡군 왜관읍 왜관리 785번지
ValueCountFrequency (%)
경상북도 2206
 
16.2%
칠곡군 2206
 
16.2%
왜관읍 665
 
4.9%
석적읍 521
 
3.8%
왜관리 446
 
3.3%
북삼읍 422
 
3.1%
중리 409
 
3.0%
인평리 302
 
2.2%
1호 282
 
2.1%
동명면 261
 
1.9%
Other values (1112) 5883
43.2%
2023-12-13T03:58:32.370211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15449
25.2%
2640
 
4.3%
2473
 
4.0%
2259
 
3.7%
2233
 
3.6%
2220
 
3.6%
2216
 
3.6%
2208
 
3.6%
2208
 
3.6%
2206
 
3.6%
Other values (161) 25313
41.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35791
58.3%
Space Separator 15449
25.2%
Decimal Number 9974
 
16.2%
Other Punctuation 70
 
0.1%
Uppercase Letter 64
 
0.1%
Dash Punctuation 47
 
0.1%
Close Punctuation 15
 
< 0.1%
Open Punctuation 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2640
 
7.4%
2473
 
6.9%
2259
 
6.3%
2233
 
6.2%
2220
 
6.2%
2216
 
6.2%
2208
 
6.2%
2208
 
6.2%
2206
 
6.2%
2206
 
6.2%
Other values (139) 12922
36.1%
Decimal Number
ValueCountFrequency (%)
1 1982
19.9%
2 1483
14.9%
7 922
9.2%
3 904
9.1%
0 885
8.9%
4 810
8.1%
6 804
8.1%
8 767
 
7.7%
5 727
 
7.3%
9 690
 
6.9%
Uppercase Letter
ValueCountFrequency (%)
B 25
39.1%
A 19
29.7%
L 15
23.4%
C 3
 
4.7%
F 1
 
1.6%
D 1
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 65
92.9%
. 5
 
7.1%
Space Separator
ValueCountFrequency (%)
15449
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35791
58.3%
Common 25570
41.6%
Latin 64
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2640
 
7.4%
2473
 
6.9%
2259
 
6.3%
2233
 
6.2%
2220
 
6.2%
2216
 
6.2%
2208
 
6.2%
2208
 
6.2%
2206
 
6.2%
2206
 
6.2%
Other values (139) 12922
36.1%
Common
ValueCountFrequency (%)
15449
60.4%
1 1982
 
7.8%
2 1483
 
5.8%
7 922
 
3.6%
3 904
 
3.5%
0 885
 
3.5%
4 810
 
3.2%
6 804
 
3.1%
8 767
 
3.0%
5 727
 
2.8%
Other values (6) 837
 
3.3%
Latin
ValueCountFrequency (%)
B 25
39.1%
A 19
29.7%
L 15
23.4%
C 3
 
4.7%
F 1
 
1.6%
D 1
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35791
58.3%
ASCII 25634
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15449
60.3%
1 1982
 
7.7%
2 1483
 
5.8%
7 922
 
3.6%
3 904
 
3.5%
0 885
 
3.5%
4 810
 
3.2%
6 804
 
3.1%
8 767
 
3.0%
5 727
 
2.8%
Other values (12) 901
 
3.5%
Hangul
ValueCountFrequency (%)
2640
 
7.4%
2473
 
6.9%
2259
 
6.3%
2233
 
6.2%
2220
 
6.2%
2216
 
6.2%
2208
 
6.2%
2208
 
6.2%
2206
 
6.2%
2206
 
6.2%
Other values (139) 12922
36.1%

영업장면적
Real number (ℝ)

ZEROS 

Distinct1801
Distinct (%)81.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean91.18822
Minimum0
Maximum890.5
Zeros107
Zeros (%)4.8%
Negative0
Negative (%)0.0%
Memory size19.6 KiB
2023-12-13T03:58:32.637824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1.936
Q141.23
median72.46
Q3119.34
95-th percentile218.956
Maximum890.5
Range890.5
Interquartile range (IQR)78.11

Descriptive statistics

Standard deviation82.652938
Coefficient of variation (CV)0.90639929
Kurtosis20.67222
Mean91.18822
Median Absolute Deviation (MAD)35.15
Skewness3.3396036
Sum201799.53
Variance6831.5081
MonotonicityNot monotonic
2023-12-13T03:58:33.379330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 107
 
4.8%
98.0 9
 
0.4%
32.0 9
 
0.4%
44.0 5
 
0.2%
104.86 5
 
0.2%
45.0 5
 
0.2%
68.15 5
 
0.2%
33.75 4
 
0.2%
58.8 4
 
0.2%
40.5 4
 
0.2%
Other values (1791) 2056
92.9%
ValueCountFrequency (%)
0.0 107
4.8%
1.5 1
 
< 0.1%
1.76 1
 
< 0.1%
1.8 1
 
< 0.1%
1.9 1
 
< 0.1%
1.96 1
 
< 0.1%
2.13 1
 
< 0.1%
2.16 1
 
< 0.1%
2.38 1
 
< 0.1%
2.52 2
 
0.1%
ValueCountFrequency (%)
890.5 1
< 0.1%
890.2 1
< 0.1%
824.0 1
< 0.1%
780.0 1
< 0.1%
760.8 1
< 0.1%
702.95 1
< 0.1%
682.32 1
< 0.1%
628.3 1
< 0.1%
626.2 1
< 0.1%
561.4 1
< 0.1%

소재지전화
Text

MISSING 

Distinct1671
Distinct (%)96.3%
Missing477
Missing (%)21.6%
Memory size17.4 KiB
2023-12-13T03:58:33.839369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.000576
Min length12

Characters and Unicode

Total characters20833
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1624 ?
Unique (%)93.5%

Sample

1st row054-973-3336
2nd row054-972-2177
3rd row054-974-0232
4th row054-974-0370
5th row054-974-0947
ValueCountFrequency (%)
054-977-6901 9
 
0.5%
054-975-2277 7
 
0.4%
054-975-1883 6
 
0.3%
054-979-1071 4
 
0.2%
054-976-9998 2
 
0.1%
054-977-0205 2
 
0.1%
054-975-7171 2
 
0.1%
054-971-8999 2
 
0.1%
054-971-1238 2
 
0.1%
054-977-8800 2
 
0.1%
Other values (1661) 1698
97.8%
2023-12-13T03:58:34.481105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 3472
16.7%
5 2768
13.3%
7 2718
13.0%
0 2566
12.3%
9 2520
12.1%
4 2463
11.8%
2 984
 
4.7%
3 954
 
4.6%
1 867
 
4.2%
6 801
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 17361
83.3%
Dash Punctuation 3472
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 2768
15.9%
7 2718
15.7%
0 2566
14.8%
9 2520
14.5%
4 2463
14.2%
2 984
 
5.7%
3 954
 
5.5%
1 867
 
5.0%
6 801
 
4.6%
8 720
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 3472
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 20833
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 3472
16.7%
5 2768
13.3%
7 2718
13.0%
0 2566
12.3%
9 2520
12.1%
4 2463
11.8%
2 984
 
4.7%
3 954
 
4.6%
1 867
 
4.2%
6 801
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20833
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3472
16.7%
5 2768
13.3%
7 2718
13.0%
0 2566
12.3%
9 2520
12.1%
4 2463
11.8%
2 984
 
4.7%
3 954
 
4.6%
1 867
 
4.2%
6 801
 
3.8%

법인명
Categorical

IMBALANCE 

Distinct22
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size17.4 KiB
-
2164 
(주)대현하이웨이
 
12
대신기업(주)칠곡지점
 
8
(주)대주산업
 
8
(주)코리아세븐
 
2
Other values (17)
 
19

Length

Max length19
Median length1
Mean length1.1825576
Min length1

Unique

Unique15 ?
Unique (%)0.7%

Sample

1st row-
2nd row-
3rd row-
4th row-
5th row-

Common Values

ValueCountFrequency (%)
- 2164
97.8%
(주)대현하이웨이 12
 
0.5%
대신기업(주)칠곡지점 8
 
0.4%
(주)대주산업 8
 
0.4%
(주)코리아세븐 2
 
0.1%
(주)탑앤양지 2
 
0.1%
(주)대교디앤에스 2
 
0.1%
우원에스오씨 주식회사 1
 
< 0.1%
(주)금성홀딩스 1
 
< 0.1%
(주)더간강한나눔밥상된장과김치찌개 1
 
< 0.1%
Other values (12) 12
 
0.5%

Length

2023-12-13T03:58:34.771178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2164
97.6%
주)대현하이웨이 12
 
0.5%
대신기업(주)칠곡지점 8
 
0.4%
주)대주산업 8
 
0.4%
주식회사 3
 
0.1%
주)코리아세븐 2
 
0.1%
주)탑앤양지 2
 
0.1%
주)대교디앤에스 2
 
0.1%
주)학하 1
 
< 0.1%
왜관농업협동조합 1
 
< 0.1%
Other values (15) 15
 
0.7%

업태명
Categorical

HIGH CORRELATION 

Distinct30
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size17.4 KiB
한식
979 
식육(숯불구이)
202 
호프/통닭
141 
통닭(치킨)
115 
탕류(보신용)
109 
Other values (25)
667 

Length

Max length15
Median length2
Mean length3.6958879
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row식육(숯불구이)
2nd row식육(숯불구이)
3rd row식육(숯불구이)
4th row한식
5th row정종/대포집/소주방

Common Values

ValueCountFrequency (%)
한식 979
44.2%
식육(숯불구이) 202
 
9.1%
호프/통닭 141
 
6.4%
통닭(치킨) 115
 
5.2%
탕류(보신용) 109
 
4.9%
분식 95
 
4.3%
중국식 75
 
3.4%
커피숍 72
 
3.3%
경양식 58
 
2.6%
패스트푸드 54
 
2.4%
Other values (20) 313
 
14.1%

Length

2023-12-13T03:58:35.011937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한식 979
43.8%
식육(숯불구이 202
 
9.0%
호프/통닭 141
 
6.3%
통닭(치킨 115
 
5.1%
탕류(보신용 109
 
4.9%
분식 95
 
4.2%
중국식 75
 
3.4%
커피숍 72
 
3.2%
경양식 58
 
2.6%
패스트푸드 54
 
2.4%
Other values (20) 337
 
15.1%

Interactions

2023-12-13T03:58:25.920175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:58:25.591444image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:58:26.131339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T03:58:25.772185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:58:35.190709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명인허가일자영업장면적법인명업태명
업종명1.0000.3170.2030.3190.999
인허가일자0.3171.0000.2400.0000.386
영업장면적0.2030.2401.0000.7810.255
법인명0.3190.0000.7811.0000.590
업태명0.9990.3860.2550.5901.000
2023-12-13T03:58:35.396208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법인명업종명업태명
법인명1.0000.2510.185
업종명0.2511.0000.963
업태명0.1850.9631.000
2023-12-13T03:58:35.567826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가일자영업장면적업종명법인명업태명
인허가일자1.0000.0620.2430.0000.133
영업장면적0.0621.0000.1550.4280.084
업종명0.2430.1551.0000.2510.963
법인명0.0000.4280.2511.0000.185
업태명0.1330.0840.9630.1851.000

Missing values

2023-12-13T03:58:26.338836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:58:26.558824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명인허가일자업소명영업자소재지(도로명)소재지(지번)영업장면적소재지전화법인명업태명
0일반음식점27810서울식당이신자-경상북도 칠곡군 왜관읍 석전리 424번지0.0054-973-3336-식육(숯불구이)
1일반음식점27962영진식당박정숙경상북도 칠곡군 지천면 신동로7길 2경상북도 칠곡군 지천면 신리 399번지0.0054-972-2177-식육(숯불구이)
2일반음식점27816약목식육식당김창순-경상북도 칠곡군 왜관읍 왜관리 212번지49.0054-974-0232-식육(숯불구이)
3일반음식점27810밴프김순연경상북도 칠곡군 왜관읍 중앙로 205-9경상북도 칠곡군 왜관읍 왜관리 235번지 4호0.0054-974-0370-한식
4일반음식점27810남성식당이순옥-경상북도 칠곡군 왜관읍 왜관리 785번지0.0054-974-0947-정종/대포집/소주방
5일반음식점27806북경반점박명숙경상북도 칠곡군 동명면 금암중앙길 49-1경상북도 칠곡군 동명면 금암리 232번지 1호38.15054-976-6334-중국식
6일반음식점27979초전식당이순덕경상북도 칠곡군 약목면 약목로2길 2-2경상북도 칠곡군 약목면 복성리 949번지 14호0.0054-974-6701-한식
7일반음식점27915참한우마당이재옥경상북도 칠곡군 약목면 약목로2길 2-1경상북도 칠곡군 약목면 복성리 949번지 5호64.34054-974-0712-식육(숯불구이)
8일반음식점28229첫집매운탕홍득식-경상북도 칠곡군 약목면 관호리 64번지0.0054-974-1963-탕류(보신용)
9일반음식점28241우돈나라강성진-경상북도 칠곡군 약목면 관호리 896번지0.0054-973-6551-한식
업종명인허가일자업소명영업자소재지(도로명)소재지(지번)영업장면적소재지전화법인명업태명
2203휴게음식점42500연꽃다방SU XIUJUAN경상북도 칠곡군 왜관읍 중앙로5길 5-1경상북도 칠곡군 왜관읍 왜관리 232번지 5호81.2<NA>-다방
2204휴게음식점42501금순이떡볶이조현정경상북도 칠곡군 석적읍 동중리10길 26-4경상북도 칠곡군 석적읍 중리 227번지 2호29.15<NA>-기타 휴게음식점
2205휴게음식점42503콩다방강선미경상북도 칠곡군 북삼읍 안산3길 33, 2동경상북도 칠곡군 북삼읍 숭오리 17번지 15호44.5054-973-3223-다방
2206휴게음식점42507홍대리만화카페정귀연경상북도 칠곡군 석적읍 북중리3길 10경상북도 칠곡군 석적읍 중리 137번지 1호16.7054-971-3992-기타 휴게음식점
2207휴게음식점42514세븐일레븐칠곡왜관점정승인경상북도 칠곡군 왜관읍 2번도로길 83경상북도 칠곡군 왜관읍 왜관리 210번지 19호3.6<NA>(주)코리아세븐편의점
2208휴게음식점42516바이킹PC카페김혜영경상북도 칠곡군 석적읍 서중리1길 20경상북도 칠곡군 석적읍 중리 168번지 20호10.26<NA>-기타 휴게음식점
2209휴게음식점42520핫다방석적점김미영경상북도 칠곡군 석적읍 유학로 36경상북도 칠곡군 석적읍 중리 203번지 1호31.9<NA>-커피숍
2210휴게음식점42521리치가플라워카페박건경상북도 칠곡군 가산면 인동가산로 735경상북도 칠곡군 가산면 학하리 524번지 2호71.28<NA>-커피숍
2211휴게음식점42523달달한공간박병희경상북도 칠곡군 석적읍 서중리6길 44경상북도 칠곡군 석적읍 중리 170번지 25호40.06<NA>-커피숍
2212휴게음식점42528미니스톱칠곡북삼점이혜란경상북도 칠곡군 북삼읍 북삼로 98경상북도 칠곡군 북삼읍 인평리 658번지 12호11.7<NA>-편의점