Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

Categorical2
Numeric1
Text2

Dataset

Description경상북도 내 음식점(일반음식점 및 휴게음식점) 현황 데이터로 관할기관, 인허가번호, 업소명, 업종, 주소 항목으로 구성되어 있습니다
URLhttps://www.data.go.kr/data/15101688/fileData.do

Alerts

인허가번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:41:32.260322
Analysis finished2023-12-12 17:41:33.559593
Duration1.3 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
경상북도 구미시
1368 
경상북도 경주시
1238 
경상북도 포항시 남구
907 
경상북도 포항시 북구
901 
경상북도 경산시
840 
Other values (19)
4746 

Length

Max length11
Median length8
Mean length8.5424
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상북도 상주시
2nd row경상북도 문경시
3rd row경상북도 경산시
4th row경상북도 경산시
5th row경상북도 안동시

Common Values

ValueCountFrequency (%)
경상북도 구미시 1368
13.7%
경상북도 경주시 1238
12.4%
경상북도 포항시 남구 907
 
9.1%
경상북도 포항시 북구 901
 
9.0%
경상북도 경산시 840
 
8.4%
경상북도 안동시 612
 
6.1%
경상북도 김천시 539
 
5.4%
경상북도 칠곡군 429
 
4.3%
경상북도 영주시 413
 
4.1%
경상북도 영천시 386
 
3.9%
Other values (14) 2367
23.7%

Length

2023-12-13T02:41:33.629994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경상북도 10000
45.9%
포항시 1808
 
8.3%
구미시 1368
 
6.3%
경주시 1238
 
5.7%
남구 907
 
4.2%
북구 901
 
4.1%
경산시 840
 
3.9%
안동시 612
 
2.8%
김천시 539
 
2.5%
칠곡군 429
 
2.0%
Other values (16) 3166
 
14.5%

인허가번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0111876 × 1010
Minimum1.9610541 × 1010
Maximum2.0230838 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T02:41:33.778603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.9610541 × 1010
5-th percentile1.9910563 × 1010
Q12.0040541 × 1010
median2.0150541 × 1010
Q32.0200541 × 1010
95-th percentile2.0220822 × 1010
Maximum2.0230838 × 1010
Range6.2029702 × 108
Interquartile range (IQR)1.5999954 × 108

Descriptive statistics

Standard deviation1.0583177 × 108
Coefficient of variation (CV)0.0052621531
Kurtosis0.50425366
Mean2.0111876 × 1010
Median Absolute Deviation (MAD)60008986
Skewness-1.0392978
Sum2.0111876 × 1014
Variance1.1200364 × 1016
MonotonicityNot monotonic
2023-12-13T02:41:33.919668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20100556037 1
 
< 0.1%
20190552810 1
 
< 0.1%
20170558068 1
 
< 0.1%
20220773397 1
 
< 0.1%
20210538814 1
 
< 0.1%
20000536464 1
 
< 0.1%
20210571111 1
 
< 0.1%
20230794682 1
 
< 0.1%
20230834019 1
 
< 0.1%
20220770644 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
19610541001 1
< 0.1%
19640538001 1
< 0.1%
19660567001 1
< 0.1%
19670536002 1
< 0.1%
19680573003 1
< 0.1%
19690534002 1
< 0.1%
19690548001 1
< 0.1%
19700541006 1
< 0.1%
19710534003 1
< 0.1%
19710538016 1
< 0.1%
ValueCountFrequency (%)
20230838019 1
< 0.1%
20230838018 1
< 0.1%
20230836062 1
< 0.1%
20230836058 1
< 0.1%
20230836051 1
< 0.1%
20230836049 1
< 0.1%
20230836046 1
< 0.1%
20230836040 1
< 0.1%
20230836039 1
< 0.1%
20230836035 1
< 0.1%
Distinct9504
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T02:41:34.296581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length29
Mean length6.3772
Min length1

Characters and Unicode

Total characters63772
Distinct characters1068
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9152 ?
Unique (%)91.5%

Sample

1st row꼬꼬통닭
2nd row상주식당
3rd row북성로 불고기
4th row대가온족발
5th row투썸플레이스 경북신도청중앙점
ValueCountFrequency (%)
카페 50
 
0.4%
세븐일레븐 41
 
0.3%
씨유 37
 
0.3%
gs25 23
 
0.2%
식당 21
 
0.2%
하양점 20
 
0.2%
경북도청점 17
 
0.1%
경산점 17
 
0.1%
지에스(gs)25 15
 
0.1%
안동점 14
 
0.1%
Other values (10189) 11598
97.8%
2023-12-13T02:41:34.848204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2190
 
3.4%
1857
 
2.9%
1718
 
2.7%
1408
 
2.2%
1068
 
1.7%
799
 
1.3%
742
 
1.2%
643
 
1.0%
) 613
 
1.0%
613
 
1.0%
Other values (1058) 52121
81.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 57259
89.8%
Space Separator 1857
 
2.9%
Uppercase Letter 1339
 
2.1%
Decimal Number 971
 
1.5%
Lowercase Letter 927
 
1.5%
Close Punctuation 613
 
1.0%
Open Punctuation 611
 
1.0%
Other Punctuation 175
 
0.3%
Dash Punctuation 16
 
< 0.1%
Modifier Symbol 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2190
 
3.8%
1718
 
3.0%
1408
 
2.5%
1068
 
1.9%
799
 
1.4%
742
 
1.3%
643
 
1.1%
613
 
1.1%
611
 
1.1%
599
 
1.0%
Other values (978) 46868
81.9%
Uppercase Letter
ValueCountFrequency (%)
C 150
 
11.2%
S 131
 
9.8%
E 108
 
8.1%
G 103
 
7.7%
A 99
 
7.4%
O 72
 
5.4%
B 70
 
5.2%
P 57
 
4.3%
F 52
 
3.9%
N 52
 
3.9%
Other values (16) 445
33.2%
Lowercase Letter
ValueCountFrequency (%)
e 117
12.6%
a 97
 
10.5%
o 76
 
8.2%
r 60
 
6.5%
t 54
 
5.8%
n 51
 
5.5%
c 49
 
5.3%
s 48
 
5.2%
l 45
 
4.9%
i 43
 
4.6%
Other values (15) 287
31.0%
Other Punctuation
ValueCountFrequency (%)
& 75
42.9%
. 40
22.9%
, 37
21.1%
' 5
 
2.9%
# 4
 
2.3%
· 4
 
2.3%
4
 
2.3%
! 2
 
1.1%
: 1
 
0.6%
; 1
 
0.6%
Other values (2) 2
 
1.1%
Decimal Number
ValueCountFrequency (%)
2 251
25.8%
5 168
17.3%
1 128
13.2%
0 89
 
9.2%
3 80
 
8.2%
9 69
 
7.1%
8 56
 
5.8%
4 49
 
5.0%
7 44
 
4.5%
6 37
 
3.8%
Space Separator
ValueCountFrequency (%)
1857
100.0%
Close Punctuation
ValueCountFrequency (%)
) 613
100.0%
Open Punctuation
ValueCountFrequency (%)
( 611
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 57234
89.7%
Common 4246
 
6.7%
Latin 2267
 
3.6%
Han 25
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2190
 
3.8%
1718
 
3.0%
1408
 
2.5%
1068
 
1.9%
799
 
1.4%
742
 
1.3%
643
 
1.1%
613
 
1.1%
611
 
1.1%
599
 
1.0%
Other values (957) 46843
81.8%
Latin
ValueCountFrequency (%)
C 150
 
6.6%
S 131
 
5.8%
e 117
 
5.2%
E 108
 
4.8%
G 103
 
4.5%
A 99
 
4.4%
a 97
 
4.3%
o 76
 
3.4%
O 72
 
3.2%
B 70
 
3.1%
Other values (42) 1244
54.9%
Common
ValueCountFrequency (%)
1857
43.7%
) 613
 
14.4%
( 611
 
14.4%
2 251
 
5.9%
5 168
 
4.0%
1 128
 
3.0%
0 89
 
2.1%
3 80
 
1.9%
& 75
 
1.8%
9 69
 
1.6%
Other values (18) 305
 
7.2%
Han
ValueCountFrequency (%)
3
 
12.0%
2
 
8.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other values (11) 11
44.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 57231
89.7%
ASCII 6504
 
10.2%
CJK 23
 
< 0.1%
None 8
 
< 0.1%
Compat Jamo 3
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2190
 
3.8%
1718
 
3.0%
1408
 
2.5%
1068
 
1.9%
799
 
1.4%
742
 
1.3%
643
 
1.1%
613
 
1.1%
611
 
1.1%
599
 
1.0%
Other values (956) 46840
81.8%
ASCII
ValueCountFrequency (%)
1857
28.6%
) 613
 
9.4%
( 611
 
9.4%
2 251
 
3.9%
5 168
 
2.6%
C 150
 
2.3%
S 131
 
2.0%
1 128
 
2.0%
e 117
 
1.8%
E 108
 
1.7%
Other values (67) 2370
36.4%
None
ValueCountFrequency (%)
· 4
50.0%
4
50.0%
CJK
ValueCountFrequency (%)
3
 
13.0%
2
 
8.7%
2
 
8.7%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
1
 
4.3%
Other values (9) 9
39.1%
Compat Jamo
ValueCountFrequency (%)
3
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
50.0%
1
50.0%
Number Forms
ValueCountFrequency (%)
1
100.0%

업종
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반음식점
7723 
휴게음식점
2277 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반음식점
2nd row일반음식점
3rd row일반음식점
4th row일반음식점
5th row휴게음식점

Common Values

ValueCountFrequency (%)
일반음식점 7723
77.2%
휴게음식점 2277
 
22.8%

Length

2023-12-13T02:41:34.990151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:41:35.086998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반음식점 7723
77.2%
휴게음식점 2277
 
22.8%
Distinct9794
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T02:41:35.393322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length52
Mean length25.4188
Min length2

Characters and Unicode

Total characters254188
Distinct characters575
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9612 ?
Unique (%)96.1%

Sample

1st row경상북도 상주시 낙동면 영남제일로 91
2nd row경상북도 문경시 점촌동 158
3rd row경상북도 경산시 둥지로 4(조영동)
4th row경상북도 경산시 선비길 37(사정동)
5th row경상북도 안동시 풍천면 검무로 10-19(1,2층)
ValueCountFrequency (%)
경상북도 10000
 
19.4%
포항시 1808
 
3.5%
구미시 1368
 
2.7%
경주시 1237
 
2.4%
남구 907
 
1.8%
북구 901
 
1.7%
경산시 840
 
1.6%
안동시 612
 
1.2%
김천시 540
 
1.0%
1층 532
 
1.0%
Other values (10671) 32838
63.7%
2023-12-13T02:41:35.948632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41588
 
16.4%
12798
 
5.0%
1 12590
 
5.0%
11423
 
4.5%
11297
 
4.4%
10951
 
4.3%
8289
 
3.3%
8243
 
3.2%
( 6900
 
2.7%
) 6900
 
2.7%
Other values (565) 123209
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 153362
60.3%
Space Separator 41588
 
16.4%
Decimal Number 41253
 
16.2%
Open Punctuation 6904
 
2.7%
Close Punctuation 6904
 
2.7%
Dash Punctuation 2903
 
1.1%
Other Punctuation 912
 
0.4%
Uppercase Letter 268
 
0.1%
Math Symbol 63
 
< 0.1%
Lowercase Letter 20
 
< 0.1%
Other values (2) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12798
 
8.3%
11423
 
7.4%
11297
 
7.4%
10951
 
7.1%
8289
 
5.4%
8243
 
5.4%
6801
 
4.4%
5202
 
3.4%
3827
 
2.5%
3703
 
2.4%
Other values (510) 70828
46.2%
Uppercase Letter
ValueCountFrequency (%)
A 73
27.2%
B 66
24.6%
C 28
 
10.4%
M 13
 
4.9%
L 13
 
4.9%
G 8
 
3.0%
D 8
 
3.0%
S 7
 
2.6%
K 7
 
2.6%
W 7
 
2.6%
Other values (10) 38
14.2%
Decimal Number
ValueCountFrequency (%)
1 12590
30.5%
2 6021
14.6%
3 4065
 
9.9%
4 3125
 
7.6%
0 3118
 
7.6%
5 3012
 
7.3%
6 2702
 
6.5%
7 2389
 
5.8%
8 2207
 
5.3%
9 2024
 
4.9%
Lowercase Letter
ValueCountFrequency (%)
e 7
35.0%
o 4
20.0%
c 2
 
10.0%
h 2
 
10.0%
r 1
 
5.0%
w 1
 
5.0%
b 1
 
5.0%
i 1
 
5.0%
j 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 879
96.4%
. 20
 
2.2%
/ 7
 
0.8%
* 4
 
0.4%
& 1
 
0.1%
· 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 6900
99.9%
[ 4
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 6900
99.9%
] 4
 
0.1%
Letter Number
ValueCountFrequency (%)
6
60.0%
4
40.0%
Space Separator
ValueCountFrequency (%)
41588
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2903
100.0%
Math Symbol
ValueCountFrequency (%)
~ 63
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 153362
60.3%
Common 100528
39.5%
Latin 298
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12798
 
8.3%
11423
 
7.4%
11297
 
7.4%
10951
 
7.1%
8289
 
5.4%
8243
 
5.4%
6801
 
4.4%
5202
 
3.4%
3827
 
2.5%
3703
 
2.4%
Other values (510) 70828
46.2%
Latin
ValueCountFrequency (%)
A 73
24.5%
B 66
22.1%
C 28
 
9.4%
M 13
 
4.4%
L 13
 
4.4%
G 8
 
2.7%
D 8
 
2.7%
e 7
 
2.3%
S 7
 
2.3%
K 7
 
2.3%
Other values (21) 68
22.8%
Common
ValueCountFrequency (%)
41588
41.4%
1 12590
 
12.5%
( 6900
 
6.9%
) 6900
 
6.9%
2 6021
 
6.0%
3 4065
 
4.0%
4 3125
 
3.1%
0 3118
 
3.1%
5 3012
 
3.0%
- 2903
 
2.9%
Other values (14) 10306
 
10.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 153362
60.3%
ASCII 100815
39.7%
Number Forms 10
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
41588
41.3%
1 12590
 
12.5%
( 6900
 
6.8%
) 6900
 
6.8%
2 6021
 
6.0%
3 4065
 
4.0%
4 3125
 
3.1%
0 3118
 
3.1%
5 3012
 
3.0%
- 2903
 
2.9%
Other values (42) 10593
 
10.5%
Hangul
ValueCountFrequency (%)
12798
 
8.3%
11423
 
7.4%
11297
 
7.4%
10951
 
7.1%
8289
 
5.4%
8243
 
5.4%
6801
 
4.4%
5202
 
3.4%
3827
 
2.5%
3703
 
2.4%
Other values (510) 70828
46.2%
Number Forms
ValueCountFrequency (%)
6
60.0%
4
40.0%
None
ValueCountFrequency (%)
· 1
100.0%

Interactions

2023-12-13T02:41:33.259303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:41:36.054718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가관할기관인허가번호업종
인허가관할기관1.0000.2870.080
인허가번호0.2871.0000.354
업종0.0800.3541.000
2023-12-13T02:41:36.165837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가관할기관업종
인허가관할기관1.0000.063
업종0.0631.000
2023-12-13T02:41:36.255229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인허가번호인허가관할기관업종
인허가번호1.0000.1090.272
인허가관할기관0.1091.0000.063
업종0.2720.0631.000

Missing values

2023-12-13T02:41:33.396969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:41:33.501236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인허가관할기관인허가번호업소명업종업소주소
40088경상북도 상주시20100556037꼬꼬통닭일반음식점경상북도 상주시 낙동면 영남제일로 91
41943경상북도 문경시19930558014상주식당일반음식점경상북도 문경시 점촌동 158
44132경상북도 경산시20040560001북성로 불고기일반음식점경상북도 경산시 둥지로 4(조영동)
43323경상북도 경산시20120560145대가온족발일반음식점경상북도 경산시 선비길 37(사정동)
5451경상북도 안동시20200547049투썸플레이스 경북신도청중앙점휴게음식점경상북도 안동시 풍천면 검무로 10-19(1,2층)
37350경상북도 영주시20140554137송꼬치일반음식점경상북도 영주시 대동로31번길 33(가흥동)
2718경상북도 경주시20210542241감찻집휴게음식점경상북도 경주시 안강읍 비화동길 23(1층)
29901경상북도 안동시20120543351우리집애가서먹자일반음식점경상북도 안동시 강남6길 31(정하동)
24430경상북도 경주시20140538199주현소주방일반음식점경상북도 경주시 감포읍 감포로6길 8
5894경상북도 구미시20170548255메가엠지씨커피구미도량점휴게음식점경상북도 구미시 도봉로 82(1층 105호 도량동, GMG프라자)
인허가관할기관인허가번호업소명업종업소주소
40959경상북도 상주시20050556019전통옛날손짜장일반음식점경상북도 상주시 영남제일로 1281(외답동,411-24)
40757경상북도 상주시19940556049영아식당일반음식점경상북도 상주시 풍물시장길 45-10(남성동)
41104경상북도 상주시20150556082츠츠일반음식점경상북도 상주시 동수4길 119-3(무양동)
11617경상북도 칠곡군20170570459씨유왜관타운점휴게음식점경상북도 칠곡군 왜관읍 중앙로 108
23463경상북도 경주시20180542296엄마손맛일반음식점경상북도 경주시 초당길 39-4(1층 동천동)
32520경상북도 구미시20060548585명가굴국밥본점일반음식점경상북도 구미시 3공단1로 289-24(임수동,외1필지 102호)
20916경상북도 경주시20080538299굽네치킨충효탑정점일반음식점경상북도 경주시 충효녹지길 6-17(충효동)
37807경상북도 영주시20120554025정희네소머리국밥일반음식점경상북도 영주시 원당로 370(상망동)
38114경상북도 영주시20140554042피자헛영주본점일반음식점경상북도 영주시 중앙로 50(103호 영주동)
46688경상북도 군위군20140562006플랫폼분식일반음식점경상북도 군위군 부계면 한티로 2130