Overview

Dataset statistics

Number of variables4
Number of observations787
Missing cells427
Missing cells (%)13.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.7 KiB
Average record size in memory32.2 B

Variable types

Categorical1
Text3

Dataset

Description안양시 즉석판매제조가공업(즉석판매 업종명,즉석판매제조가공업 업소명,즉석판매제조가공업 소재지,즉석판매제조가공업 소재지전화번호)
URLhttps://www.data.go.kr/data/15118246/fileData.do

Alerts

업종명 has constant value ""Constant
소재지전화 has 427 (54.3%) missing valuesMissing

Reproduction

Analysis started2023-12-12 17:32:34.395874
Analysis finished2023-12-12 17:32:35.128851
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
즉석판매제조가공업
787 

Length

Max length9
Median length9
Mean length9
Min length9

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row즉석판매제조가공업
2nd row즉석판매제조가공업
3rd row즉석판매제조가공업
4th row즉석판매제조가공업
5th row즉석판매제조가공업

Common Values

ValueCountFrequency (%)
즉석판매제조가공업 787
100.0%

Length

2023-12-13T02:32:35.237157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:32:35.372306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
즉석판매제조가공업 787
100.0%
Distinct751
Distinct (%)95.4%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
2023-12-13T02:32:35.636112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length6.1067344
Min length1

Characters and Unicode

Total characters4806
Distinct characters558
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique727 ?
Unique (%)92.4%

Sample

1st row경북상회
2nd row대림상회
3rd row삼성기름집
4th row화성기름집
5th row금산고추기름집
ValueCountFrequency (%)
주식회사 12
 
1.3%
평촌점 6
 
0.7%
낙원떡집 5
 
0.5%
장수건강원 5
 
0.5%
땅스부대찌개 4
 
0.4%
담꾹 4
 
0.4%
김준호의 3
 
0.3%
오늘쉐프 3
 
0.3%
반찬 3
 
0.3%
coffee 3
 
0.3%
Other values (822) 868
94.8%
2023-12-13T02:32:36.122095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
130
 
2.7%
95
 
2.0%
91
 
1.9%
87
 
1.8%
83
 
1.7%
69
 
1.4%
) 69
 
1.4%
( 69
 
1.4%
66
 
1.4%
64
 
1.3%
Other values (548) 3983
82.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4316
89.8%
Space Separator 130
 
2.7%
Uppercase Letter 108
 
2.2%
Lowercase Letter 82
 
1.7%
Close Punctuation 70
 
1.5%
Open Punctuation 70
 
1.5%
Decimal Number 19
 
0.4%
Other Punctuation 11
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
95
 
2.2%
91
 
2.1%
87
 
2.0%
83
 
1.9%
69
 
1.6%
66
 
1.5%
64
 
1.5%
62
 
1.4%
62
 
1.4%
58
 
1.3%
Other values (491) 3579
82.9%
Lowercase Letter
ValueCountFrequency (%)
e 13
15.9%
o 11
13.4%
a 9
11.0%
n 6
 
7.3%
s 5
 
6.1%
r 4
 
4.9%
t 4
 
4.9%
m 4
 
4.9%
c 4
 
4.9%
i 3
 
3.7%
Other values (11) 19
23.2%
Uppercase Letter
ValueCountFrequency (%)
O 14
13.0%
E 11
10.2%
F 10
 
9.3%
B 8
 
7.4%
A 8
 
7.4%
C 8
 
7.4%
S 7
 
6.5%
I 6
 
5.6%
M 6
 
5.6%
R 5
 
4.6%
Other values (10) 25
23.1%
Decimal Number
ValueCountFrequency (%)
2 5
26.3%
1 4
21.1%
4 3
15.8%
0 2
 
10.5%
9 2
 
10.5%
8 2
 
10.5%
3 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
& 6
54.5%
' 2
 
18.2%
. 2
 
18.2%
, 1
 
9.1%
Close Punctuation
ValueCountFrequency (%)
) 69
98.6%
] 1
 
1.4%
Open Punctuation
ValueCountFrequency (%)
( 69
98.6%
[ 1
 
1.4%
Space Separator
ValueCountFrequency (%)
130
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4316
89.8%
Common 300
 
6.2%
Latin 190
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
95
 
2.2%
91
 
2.1%
87
 
2.0%
83
 
1.9%
69
 
1.6%
66
 
1.5%
64
 
1.5%
62
 
1.4%
62
 
1.4%
58
 
1.3%
Other values (491) 3579
82.9%
Latin
ValueCountFrequency (%)
O 14
 
7.4%
e 13
 
6.8%
o 11
 
5.8%
E 11
 
5.8%
F 10
 
5.3%
a 9
 
4.7%
B 8
 
4.2%
A 8
 
4.2%
C 8
 
4.2%
S 7
 
3.7%
Other values (31) 91
47.9%
Common
ValueCountFrequency (%)
130
43.3%
) 69
23.0%
( 69
23.0%
& 6
 
2.0%
2 5
 
1.7%
1 4
 
1.3%
4 3
 
1.0%
0 2
 
0.7%
' 2
 
0.7%
. 2
 
0.7%
Other values (6) 8
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4316
89.8%
ASCII 490
 
10.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
130
26.5%
) 69
14.1%
( 69
14.1%
O 14
 
2.9%
e 13
 
2.7%
o 11
 
2.2%
E 11
 
2.2%
F 10
 
2.0%
a 9
 
1.8%
B 8
 
1.6%
Other values (47) 146
29.8%
Hangul
ValueCountFrequency (%)
95
 
2.2%
91
 
2.1%
87
 
2.0%
83
 
1.9%
69
 
1.6%
66
 
1.5%
64
 
1.5%
62
 
1.4%
62
 
1.4%
58
 
1.3%
Other values (491) 3579
82.9%
Distinct743
Distinct (%)94.4%
Missing0
Missing (%)0.0%
Memory size6.3 KiB
2023-12-13T02:32:36.507205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length50
Mean length36.377382
Min length23

Characters and Unicode

Total characters28629
Distinct characters293
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique720 ?
Unique (%)91.5%

Sample

1st row경기도 안양시 만안구 안양로291번길 34 (안양동)
2nd row경기도 안양시 만안구 냉천로 196 (안양동)
3rd row경기도 안양시 만안구 안양로291번길 34, 1층 (안양동)
4th row경기도 안양시 만안구 냉천로 190 (안양동)
5th row경기도 안양시 만안구 안양로258번길 34 (안양동)
ValueCountFrequency (%)
안양시 790
 
13.1%
경기도 787
 
13.1%
만안구 394
 
6.5%
동안구 393
 
6.5%
1층 283
 
4.7%
안양동 249
 
4.1%
호계동 139
 
2.3%
관양동 130
 
2.2%
지상1층 66
 
1.1%
지하1층 57
 
0.9%
Other values (882) 2742
45.5%
2023-12-13T02:32:37.069414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5243
 
18.3%
2098
 
7.3%
1378
 
4.8%
1 1375
 
4.8%
1321
 
4.6%
862
 
3.0%
845
 
3.0%
802
 
2.8%
797
 
2.8%
795
 
2.8%
Other values (283) 13113
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16373
57.2%
Space Separator 5243
 
18.3%
Decimal Number 4424
 
15.5%
Other Punctuation 808
 
2.8%
Close Punctuation 794
 
2.8%
Open Punctuation 794
 
2.8%
Uppercase Letter 117
 
0.4%
Dash Punctuation 69
 
0.2%
Lowercase Letter 5
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2098
 
12.8%
1378
 
8.4%
1321
 
8.1%
862
 
5.3%
845
 
5.2%
802
 
4.9%
797
 
4.9%
795
 
4.9%
789
 
4.8%
543
 
3.3%
Other values (241) 6143
37.5%
Uppercase Letter
ValueCountFrequency (%)
B 13
11.1%
S 11
9.4%
E 11
9.4%
G 10
8.5%
Q 10
8.5%
U 10
8.5%
R 10
8.5%
W 8
 
6.8%
K 7
 
6.0%
A 6
 
5.1%
Other values (7) 21
17.9%
Decimal Number
ValueCountFrequency (%)
1 1375
31.1%
2 640
14.5%
3 498
 
11.3%
0 400
 
9.0%
4 309
 
7.0%
5 293
 
6.6%
8 241
 
5.4%
6 239
 
5.4%
9 221
 
5.0%
7 208
 
4.7%
Lowercase Letter
ValueCountFrequency (%)
e 1
20.0%
o 1
20.0%
m 1
20.0%
t 1
20.0%
s 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 793
98.1%
. 13
 
1.6%
& 1
 
0.1%
' 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 1
50.0%
+ 1
50.0%
Space Separator
ValueCountFrequency (%)
5243
100.0%
Close Punctuation
ValueCountFrequency (%)
) 794
100.0%
Open Punctuation
ValueCountFrequency (%)
( 794
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 69
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16373
57.2%
Common 12134
42.4%
Latin 122
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2098
 
12.8%
1378
 
8.4%
1321
 
8.1%
862
 
5.3%
845
 
5.2%
802
 
4.9%
797
 
4.9%
795
 
4.9%
789
 
4.8%
543
 
3.3%
Other values (241) 6143
37.5%
Latin
ValueCountFrequency (%)
B 13
10.7%
S 11
9.0%
E 11
9.0%
G 10
 
8.2%
Q 10
 
8.2%
U 10
 
8.2%
R 10
 
8.2%
W 8
 
6.6%
K 7
 
5.7%
A 6
 
4.9%
Other values (12) 26
21.3%
Common
ValueCountFrequency (%)
5243
43.2%
1 1375
 
11.3%
) 794
 
6.5%
( 794
 
6.5%
, 793
 
6.5%
2 640
 
5.3%
3 498
 
4.1%
0 400
 
3.3%
4 309
 
2.5%
5 293
 
2.4%
Other values (10) 995
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16373
57.2%
ASCII 12256
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5243
42.8%
1 1375
 
11.2%
) 794
 
6.5%
( 794
 
6.5%
, 793
 
6.5%
2 640
 
5.2%
3 498
 
4.1%
0 400
 
3.3%
4 309
 
2.5%
5 293
 
2.4%
Other values (32) 1117
 
9.1%
Hangul
ValueCountFrequency (%)
2098
 
12.8%
1378
 
8.4%
1321
 
8.1%
862
 
5.3%
845
 
5.2%
802
 
4.9%
797
 
4.9%
795
 
4.9%
789
 
4.8%
543
 
3.3%
Other values (241) 6143
37.5%

소재지전화
Text

MISSING 

Distinct357
Distinct (%)99.2%
Missing427
Missing (%)54.3%
Memory size6.3 KiB
2023-12-13T02:32:37.371212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.013889
Min length9

Characters and Unicode

Total characters4325
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique354 ?
Unique (%)98.3%

Sample

1st row031-449-7948
2nd row031-442-6565
3rd row031-466-5193
4th row031-449-6682
5th row031-449-8232
ValueCountFrequency (%)
031-444-6891 2
 
0.6%
031-449-1007 2
 
0.6%
031-380-3350 2
 
0.6%
031-382-5289 1
 
0.3%
070-8888-8090 1
 
0.3%
031-472-2817 1
 
0.3%
031-456-0805 1
 
0.3%
031-466-6880 1
 
0.3%
031-429-6979 1
 
0.3%
031-427-6047 1
 
0.3%
Other values (347) 347
96.4%
2023-12-13T02:32:37.850538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 720
16.6%
3 594
13.7%
0 560
12.9%
4 539
12.5%
1 516
11.9%
8 294
6.8%
2 256
 
5.9%
5 248
 
5.7%
7 222
 
5.1%
6 210
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3605
83.4%
Dash Punctuation 720
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 594
16.5%
0 560
15.5%
4 539
15.0%
1 516
14.3%
8 294
8.2%
2 256
7.1%
5 248
6.9%
7 222
 
6.2%
6 210
 
5.8%
9 166
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 720
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4325
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 720
16.6%
3 594
13.7%
0 560
12.9%
4 539
12.5%
1 516
11.9%
8 294
6.8%
2 256
 
5.9%
5 248
 
5.7%
7 222
 
5.1%
6 210
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4325
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 720
16.6%
3 594
13.7%
0 560
12.9%
4 539
12.5%
1 516
11.9%
8 294
6.8%
2 256
 
5.9%
5 248
 
5.7%
7 222
 
5.1%
6 210
 
4.9%

Missing values

2023-12-13T02:32:34.986189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:32:35.081125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지소재지전화
0즉석판매제조가공업경북상회경기도 안양시 만안구 안양로291번길 34 (안양동)031-449-7948
1즉석판매제조가공업대림상회경기도 안양시 만안구 냉천로 196 (안양동)031-442-6565
2즉석판매제조가공업삼성기름집경기도 안양시 만안구 안양로291번길 34, 1층 (안양동)031-466-5193
3즉석판매제조가공업화성기름집경기도 안양시 만안구 냉천로 190 (안양동)031-449-6682
4즉석판매제조가공업금산고추기름집경기도 안양시 만안구 안양로258번길 34 (안양동)031-449-8232
5즉석판매제조가공업제일기름집경기도 안양시 만안구 양화로136번길 9 (박달동)031-466-4418
6즉석판매제조가공업청원방아간경기도 안양시 만안구 안양로360번길 16 (안양동)031-442-8142
7즉석판매제조가공업안산방아간경기도 안양시 만안구 석천로211번길 56 (석수동)<NA>
8즉석판매제조가공업선주기름집경기도 안양시 만안구 냉천로 196 (안양동)031-446-7446
9즉석판매제조가공업형제방앗간경기도 안양시 동안구 경수대로556번길 17 (호계동)031-452-5254
업종명업소명소재지소재지전화
777즉석판매제조가공업백남옥손만두경기도 안양시 동안구 시민대로 300, 이마트평촌점 1층 (관양동)<NA>
778즉석판매제조가공업주식회사 미래식품경기도 안양시 동안구 시민대로 300, 이마트평촌점 1층 (관양동)<NA>
779즉석판매제조가공업수라원경기도 안양시 동안구 시민대로 300, 이마트평촌점 지하2층 (관양동)<NA>
780즉석판매제조가공업(주)동명에스티유경기도 안양시 동안구 엘에스로 76, 국제유통단지 가동 지하1층 (호계동)<NA>
781즉석판매제조가공업신라푸드경기도 안양시 동안구 동안로 162, 홈플러스 1층 (비산동)<NA>
782즉석판매제조가공업감동푸드경기도 안양시 동안구 시민대로 180, G.SQURE, 롯데백화점 평촌점 지하1층 (호계동)<NA>
783즉석판매제조가공업주식회사 월드푸드경기도 안양시 만안구 연현로79번길 19, 석수프라자 1층 (석수동)<NA>
784즉석판매제조가공업백두산천지인경기도 안양시 만안구 안양로258번길 34, 성우상가 2층 (안양동)<NA>
785즉석판매제조가공업우리찬경기도 안양시 동안구 시민대로 300, 이마트평촌점 지하2층 (관양동)<NA>
786즉석판매제조가공업(주)팜덕 다향오리경기도 안양시 동안구 시민대로 180, G.SQURE, 롯데백화점 평촌점 지하1층 일부호 (호계동)<NA>