Overview

Dataset statistics

Number of variables: 5
Number of observations: 1195
Missing cells: 591
Missing cells (%): 9.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 46.8 KiB
Average record size in memory: 40.1 B
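
The missing-cell percentage follows from the other figures in this table. A minimal sketch of the arithmetic, using only the values reported above (the variable names are illustrative, not part of the report):

```python
# Values taken from the dataset statistics above.
n_variables = 5
n_observations = 1195
missing_cells = 591

total_cells = n_variables * n_observations       # number of cells in the table
missing_pct = 100 * missing_cells / total_cells  # share of cells that are missing

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```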

Variable types

Categorical: 1
Text: 4

Dataset

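Description: Provides data on instant sales manufacturing and processing businesses (즉석판매제조가공업) located in Bucheon-si, Gyeonggi-do, including the business type (업종명), business name (업소명), road-name address (소재지(도로명)), phone number (소재지전화), and road-name postal code (우편번호(도로명)).
URL: https://www.data.go.kr/data/15055465/fileData.do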

Alerts

업종명 has constant value "즉석판매제조가공업" (Constant)
소재지전화 has 591 (49.5%) missing values (Missing)

Reproduction

Analysis started: 2023-12-12 06:46:08.110853
Analysis finished: 2023-12-12 06:46:08.875744
Duration: 0.76 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업종명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 9.5 KiB
즉석판매제조가공업: 1195

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 즉석판매제조가공업
2nd row: 즉석판매제조가공업
3rd row: 즉석판매제조가공업
4th row: 즉석판매제조가공업
5th row: 즉석판매제조가공업

Common Values

Value | Count | Frequency (%)
즉석판매제조가공업 | 1195 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; 즉석판매제조가공업 1195 (100.0%)]

업소명
Text

Distinct: 1134
Distinct (%): 94.9%
Missing: 0
Missing (%): 0.0%
Memory size: 9.5 KiB

Length

Max length: 29
Median length: 25
Mean length: 6.3322176
Min length: 1

Characters and Unicode

Total characters: 7567
Distinct characters: 627
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
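
As a rough illustration of how such per-character properties can be tallied, the sketch below counts characters by Unicode general category with Python's standard `unicodedata` module. It returns two-letter codes (for example `Lo` for "Other Letter" and `Zs` for "Space Separator") and does not cover scripts or blocks. The helper name `category_counts` and the sample string are hypothetical, and this is not necessarily how the profiling tool computes its figures.

```python
import unicodedata
from collections import Counter

def category_counts(text: str) -> Counter:
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# Hypothetical value in the style of this column.
sample = "(주)마켓인 A-1"
counts = category_counts(sample)

# Print each category code with its count, most frequent first.
for code, n in counts.most_common():
    print(code, n)
```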

Unique

Unique: 1088
Unique (%): 91.0%

Sample

1st row: 자유방앗간
2nd row: 대성방앗간
3rd row: 성심방앗간
4th row: 태양떡방앗간
5th row: 맛나방앗간

Value | Count | Frequency (%)
주식회사 | 22 | 1.5%
담꾹 | 7 | 0.5%
부천옥길점 | 7 | 0.5%
땅스부대찌개 | 6 | 0.4%
종로떡집 | 5 | 0.4%
반찬 | 5 | 0.4%
집어가 | 4 | 0.3%
스타필드시티 | 4 | 0.3%
홀세일클럽 | 4 | 0.3%
트레이더스 | 4 | 0.3%
Other values (1246) | 1357 | 95.2%

Most occurring characters

ValueCountFrequency (%)
231
 
3.1%
138
 
1.8%
126
 
1.7%
) 125
 
1.7%
124
 
1.6%
124
 
1.6%
( 123
 
1.6%
119
 
1.6%
114
 
1.5%
108
 
1.4%
Other values (617) 6235
82.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6668 | 88.1%
Space Separator | 231 | 3.1%
Uppercase Letter | 207 | 2.7%
Lowercase Letter | 157 | 2.1%
Close Punctuation | 125 | 1.7%
Open Punctuation | 123 | 1.6%
Decimal Number | 29 | 0.4%
Other Punctuation | 20 | 0.3%
Dash Punctuation | 4 | 0.1%
Connector Punctuation | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
138
 
2.1%
126
 
1.9%
124
 
1.9%
124
 
1.9%
119
 
1.8%
114
 
1.7%
108
 
1.6%
96
 
1.4%
92
 
1.4%
91
 
1.4%
Other values (547) 5536
83.0%
Uppercase Letter
ValueCountFrequency (%)
E 23
 
11.1%
A 21
 
10.1%
O 18
 
8.7%
N 16
 
7.7%
B 15
 
7.2%
C 14
 
6.8%
T 10
 
4.8%
I 10
 
4.8%
R 9
 
4.3%
D 8
 
3.9%
Other values (15) 63
30.4%
Lowercase Letter
ValueCountFrequency (%)
e 23
14.6%
i 16
10.2%
a 15
9.6%
o 14
8.9%
t 14
8.9%
r 12
 
7.6%
s 12
 
7.6%
l 7
 
4.5%
n 6
 
3.8%
y 5
 
3.2%
Other values (12) 33
21.0%
Decimal Number
ValueCountFrequency (%)
8 5
17.2%
1 5
17.2%
2 4
13.8%
4 3
10.3%
6 3
10.3%
9 2
 
6.9%
3 2
 
6.9%
5 2
 
6.9%
7 2
 
6.9%
0 1
 
3.4%
Other Punctuation
ValueCountFrequency (%)
& 6
30.0%
. 5
25.0%
, 5
25.0%
# 1
 
5.0%
' 1
 
5.0%
: 1
 
5.0%
· 1
 
5.0%
Space Separator
ValueCountFrequency (%)
231
100.0%
Close Punctuation
ValueCountFrequency (%)
) 125
100.0%
Open Punctuation
ValueCountFrequency (%)
( 123
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Other Symbol
ValueCountFrequency (%)
° 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6667 | 88.1%
Common | 535 | 7.1%
Latin | 364 | 4.8%
Han | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
138
 
2.1%
126
 
1.9%
124
 
1.9%
124
 
1.9%
119
 
1.8%
114
 
1.7%
108
 
1.6%
96
 
1.4%
92
 
1.4%
91
 
1.4%
Other values (546) 5535
83.0%
Latin
ValueCountFrequency (%)
e 23
 
6.3%
E 23
 
6.3%
A 21
 
5.8%
O 18
 
4.9%
N 16
 
4.4%
i 16
 
4.4%
a 15
 
4.1%
B 15
 
4.1%
o 14
 
3.8%
t 14
 
3.8%
Other values (37) 189
51.9%
Common
ValueCountFrequency (%)
231
43.2%
) 125
23.4%
( 123
23.0%
& 6
 
1.1%
. 5
 
0.9%
8 5
 
0.9%
, 5
 
0.9%
1 5
 
0.9%
2 4
 
0.7%
- 4
 
0.7%
Other values (13) 22
 
4.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6667 | 88.1%
ASCII | 897 | 11.9%
None | 2 | < 0.1%
CJK | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
231
25.8%
) 125
13.9%
( 123
13.7%
e 23
 
2.6%
E 23
 
2.6%
A 21
 
2.3%
O 18
 
2.0%
N 16
 
1.8%
i 16
 
1.8%
a 15
 
1.7%
Other values (58) 286
31.9%
Hangul
ValueCountFrequency (%)
138
 
2.1%
126
 
1.9%
124
 
1.9%
124
 
1.9%
119
 
1.8%
114
 
1.7%
108
 
1.6%
96
 
1.4%
92
 
1.4%
91
 
1.4%
Other values (546) 5535
83.0%
None
ValueCountFrequency (%)
° 1
50.0%
· 1
50.0%
CJK
ValueCountFrequency (%)
1
100.0%

소재지(도로명)
Text

Distinct: 1159
Distinct (%): 97.0%
Missing: 0
Missing (%): 0.0%
Memory size: 9.5 KiB

Length

Max length: 56
Median length: 47
Mean length: 32.893724
Min length: 19

Characters and Unicode

Total characters: 39308
Distinct characters: 337
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1128
Unique (%): 94.4%

Sample

1st row: 경기도 부천시 부천로47번길 9 (심곡동)
2nd row: 경기도 부천시 장말로294번길 22 (심곡동)
3rd row: 경기도 부천시 부일로 673-1 (역곡동)
4th row: 경기도 부천시 원미로144번길 8-3 (원미동 삼성그린빌 101호)
5th row: 경기도 부천시 지봉로 148 (역곡동)

Value | Count | Frequency (%)
경기도 | 1195 | 14.4%
부천시 | 1195 | 14.4%
1층 | 446 | 5.4%
일부호 | 276 | 3.3%
중동 | 196 | 2.4%
상동 | 179 | 2.2%
일부 | 141 | 1.7%
심곡본동 | 100 | 1.2%
소사본동 | 91 | 1.1%
원종동 | 79 | 1.0%
Other values (1349) | 4386 | 52.9%

Most occurring characters

ValueCountFrequency (%)
8152
20.7%
1 2008
 
5.1%
1959
 
5.0%
1443
 
3.7%
1395
 
3.5%
) 1268
 
3.2%
( 1268
 
3.2%
1259
 
3.2%
1256
 
3.2%
1230
 
3.1%
Other values (327) 18070
46.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 21605 | 55.0%
Space Separator | 8152 | 20.7%
Decimal Number | 6693 | 17.0%
Close Punctuation | 1268 | 3.2%
Open Punctuation | 1268 | 3.2%
Dash Punctuation | 190 | 0.5%
Uppercase Letter | 108 | 0.3%
Lowercase Letter | 9 | < 0.1%
Letter Number | 7 | < 0.1%
Other Punctuation | 7 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1959
 
9.1%
1443
 
6.7%
1395
 
6.5%
1259
 
5.8%
1256
 
5.8%
1230
 
5.7%
1209
 
5.6%
1195
 
5.5%
862
 
4.0%
732
 
3.4%
Other values (280) 9065
42.0%
Uppercase Letter
ValueCountFrequency (%)
B 23
21.3%
A 17
15.7%
O 9
 
8.3%
T 8
 
7.4%
C 8
 
7.4%
S 7
 
6.5%
L 6
 
5.6%
E 6
 
5.6%
P 4
 
3.7%
U 4
 
3.7%
Other values (8) 16
14.8%
Decimal Number
ValueCountFrequency (%)
1 2008
30.0%
2 922
13.8%
0 682
 
10.2%
3 566
 
8.5%
4 518
 
7.7%
7 443
 
6.6%
5 428
 
6.4%
8 387
 
5.8%
6 379
 
5.7%
9 360
 
5.4%
Lowercase Letter
ValueCountFrequency (%)
e 3
33.3%
y 2
22.2%
c 1
 
11.1%
t 1
 
11.1%
i 1
 
11.1%
b 1
 
11.1%
Other Punctuation
ValueCountFrequency (%)
& 3
42.9%
/ 1
 
14.3%
' 1
 
14.3%
. 1
 
14.3%
· 1
 
14.3%
Letter Number
ValueCountFrequency (%)
3
42.9%
2
28.6%
2
28.6%
Space Separator
ValueCountFrequency (%)
8152
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1268
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1268
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 190
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 21605 | 55.0%
Common | 17579 | 44.7%
Latin | 124 | 0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1959
 
9.1%
1443
 
6.7%
1395
 
6.5%
1259
 
5.8%
1256
 
5.8%
1230
 
5.7%
1209
 
5.6%
1195
 
5.5%
862
 
4.0%
732
 
3.4%
Other values (280) 9065
42.0%
Latin
ValueCountFrequency (%)
B 23
18.5%
A 17
13.7%
O 9
 
7.3%
T 8
 
6.5%
C 8
 
6.5%
S 7
 
5.6%
L 6
 
4.8%
E 6
 
4.8%
P 4
 
3.2%
U 4
 
3.2%
Other values (17) 32
25.8%
Common
ValueCountFrequency (%)
8152
46.4%
1 2008
 
11.4%
) 1268
 
7.2%
( 1268
 
7.2%
2 922
 
5.2%
0 682
 
3.9%
3 566
 
3.2%
4 518
 
2.9%
7 443
 
2.5%
5 428
 
2.4%
Other values (10) 1324
 
7.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 21605 | 55.0%
ASCII | 17695 | 45.0%
Number Forms | 7 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8152
46.1%
1 2008
 
11.3%
) 1268
 
7.2%
( 1268
 
7.2%
2 922
 
5.2%
0 682
 
3.9%
3 566
 
3.2%
4 518
 
2.9%
7 443
 
2.5%
5 428
 
2.4%
Other values (33) 1440
 
8.1%
Hangul
ValueCountFrequency (%)
1959
 
9.1%
1443
 
6.7%
1395
 
6.5%
1259
 
5.8%
1256
 
5.8%
1230
 
5.7%
1209
 
5.6%
1195
 
5.5%
862
 
4.0%
732
 
3.4%
Other values (280) 9065
42.0%
Number Forms
ValueCountFrequency (%)
3
42.9%
2
28.6%
2
28.6%
None
ValueCountFrequency (%)
· 1
100.0%

소재지전화
Text

MISSING 

Distinct: 595
Distinct (%): 98.5%
Missing: 591
Missing (%): 49.5%
Memory size: 9.5 KiB

Length

Max length: 14
Median length: 12
Mean length: 11.930464
Min length: 9

Characters and Unicode

Total characters: 7206
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
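
The mean length reported for 소재지전화 can be cross-checked against its total character count. A minimal sketch, assuming the mean is taken over non-missing values only (an assumption about the tool's behaviour that the reported figures happen to agree with); all numbers come from this section:

```python
# Figures reported for 소재지전화 in this section.
n_observations = 1195
n_missing = 591
total_characters = 7206

n_present = n_observations - n_missing      # values that are not missing
mean_length = total_characters / n_present  # average characters per present value

print(n_present, round(mean_length, 6))
```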

Unique

Unique: 589
Unique (%): 97.5%

Sample

1st row: 032-664-0125
2nd row: 032-654-4914
3rd row: 032-341-6054
4th row: 032-678-2922
5th row: 032-345-0604

Value | Count | Frequency (%)
055-863-1633 | 4 | 0.7%
02-6256-1234 | 3 | 0.5%
032-610-5052 | 2 | 0.3%
032-343-0009 | 2 | 0.3%
02-2290-5739 | 2 | 0.3%
032-343-1052 | 2 | 0.3%
032-677-2933 | 1 | 0.2%
070-8810-1642 | 1 | 0.2%
032-210-9455 | 1 | 0.2%
032-678-6668 | 1 | 0.2%
Other values (585) | 585 | 96.9%

Most occurring characters

Value | Count | Frequency (%)
- | 1208 | 16.8%
3 | 1070 | 14.8%
2 | 992 | 13.8%
0 | 934 | 13.0%
6 | 721 | 10.0%
7 | 458 | 6.4%
5 | 445 | 6.2%
4 | 407 | 5.6%
1 | 390 | 5.4%
8 | 325 | 4.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 5998 | 83.2%
Dash Punctuation | 1208 | 16.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1070
17.8%
2 992
16.5%
0 934
15.6%
6 721
12.0%
7 458
7.6%
5 445
7.4%
4 407
 
6.8%
1 390
 
6.5%
8 325
 
5.4%
9 256
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 1208
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7206
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1208
16.8%
3 1070
14.8%
2 992
13.8%
0 934
13.0%
6 721
10.0%
7 458
 
6.4%
5 445
 
6.2%
4 407
 
5.6%
1 390
 
5.4%
8 325
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7206
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1208
16.8%
3 1070
14.8%
2 992
13.8%
0 934
13.0%
6 721
10.0%
7 458
 
6.4%
5 445
 
6.2%
4 407
 
5.6%
1 390
 
5.4%
8 325
 
4.5%

우편번호(도로명)
Text

Distinct: 261
Distinct (%): 21.8%
Missing: 0
Missing (%): 0.0%
Memory size: 9.5 KiB

Length

Max length: 9
Median length: 5
Mean length: 5.0066946
Min length: 5

Characters and Unicode

Total characters: 5983
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 83
Unique (%): 6.9%

Sample

1st row: 14633
2nd row: 14614
3rd row: 14662
4th row: 14565
5th row: 14669

Value | Count | Frequency (%)
14709 | 46 | 3.8%
14548 | 31 | 2.6%
14670 | 31 | 2.6%
14546 | 29 | 2.4%
14786 | 29 | 2.4%
14545 | 26 | 2.2%
14621 | 24 | 2.0%
14571 | 21 | 1.8%
14568 | 21 | 1.8%
14589 | 20 | 1.7%
Other values (251) | 917 | 76.7%

Most occurring characters

ValueCountFrequency (%)
4 1716
28.7%
1 1396
23.3%
5 587
 
9.8%
7 543
 
9.1%
6 507
 
8.5%
8 303
 
5.1%
0 256
 
4.3%
3 245
 
4.1%
9 216
 
3.6%
2 196
 
3.3%
Other values (8) 18
 
0.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 5965 | 99.7%
Uppercase Letter | 14 | 0.2%
Math Symbol | 4 | 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 1716
28.8%
1 1396
23.4%
5 587
 
9.8%
7 543
 
9.1%
6 507
 
8.5%
8 303
 
5.1%
0 256
 
4.3%
3 245
 
4.1%
9 216
 
3.6%
2 196
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
E 4
28.6%
R 2
14.3%
P 2
14.3%
L 2
14.3%
A 2
14.3%
C 2
14.3%
Math Symbol
ValueCountFrequency (%)
< 2
50.0%
> 2
50.0%
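
The uppercase letters and the `<` and `>` symbols counted above, together with a maximum length of 9 against a median of 5, suggest that a few entries in this column are not plain postal codes. A minimal sketch for flagging such values, assuming a valid postal code is exactly five ASCII digits. The function name is hypothetical, and the `<REPLACE>` entry is an illustrative placeholder rather than a value confirmed by the report.

```python
import re

# Assumption: a valid postal code is exactly five ASCII digits.
POSTAL_CODE = re.compile(r"[0-9]{5}")

def invalid_postal_codes(values):
    """Return the values that are not exactly five digits."""
    return [v for v in values if not POSTAL_CODE.fullmatch(v)]

# The first three values appear in the sample rows;
# the last is a hypothetical 9-character placeholder.
codes = ["14633", "14614", "14662", "<REPLACE>"]
print(invalid_postal_codes(codes))
```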

Most occurring scripts

ValueCountFrequency (%)
Value | Count | Frequency (%)
Common | 5969 | 99.8%
Latin | 14 | 0.2%

Most frequent character per script

Common
ValueCountFrequency (%)
4 1716
28.7%
1 1396
23.4%
5 587
 
9.8%
7 543
 
9.1%
6 507
 
8.5%
8 303
 
5.1%
0 256
 
4.3%
3 245
 
4.1%
9 216
 
3.6%
2 196
 
3.3%
Other values (2) 4
 
0.1%
Latin
ValueCountFrequency (%)
E 4
28.6%
R 2
14.3%
P 2
14.3%
L 2
14.3%
A 2
14.3%
C 2
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5983
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 1716
28.7%
1 1396
23.3%
5 587
 
9.8%
7 543
 
9.1%
6 507
 
8.5%
8 303
 
5.1%
0 256
 
4.3%
3 245
 
4.1%
9 216
 
3.6%
2 196
 
3.3%
Other values (8) 18
 
0.3%

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.]
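
The per-column nullity summarised by these figures can be approximated with a simple count. A minimal sketch in plain Python, using three rows from the sample table restricted to three columns, with `None` standing in for `<NA>`; the helper name `missing_by_column` is hypothetical.

```python
# Rows 0, 1 and 1193 of the sample table, restricted to three columns.
# None marks a missing value (<NA> in the report).
rows = [
    {"업소명": "자유방앗간", "소재지전화": "032-664-0125", "우편번호(도로명)": "14633"},
    {"업소명": "대성방앗간", "소재지전화": "032-654-4914", "우편번호(도로명)": "14614"},
    {"업소명": "삼부자", "소재지전화": None, "우편번호(도로명)": "14703"},
]

def missing_by_column(records):
    """Count None values per column across a list of dict records."""
    counts = {}
    for record in records:
        for column, value in record.items():
            counts[column] = counts.get(column, 0) + (value is None)
    return counts

print(missing_by_column(rows))
```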

Sample

First rows

index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화 | 우편번호(도로명)
0 | 즉석판매제조가공업 | 자유방앗간 | 경기도 부천시 부천로47번길 9 (심곡동) | 032-664-0125 | 14633
1 | 즉석판매제조가공업 | 대성방앗간 | 경기도 부천시 장말로294번길 22 (심곡동) | 032-654-4914 | 14614
2 | 즉석판매제조가공업 | 성심방앗간 | 경기도 부천시 부일로 673-1 (역곡동) | 032-341-6054 | 14662
3 | 즉석판매제조가공업 | 태양떡방앗간 | 경기도 부천시 원미로144번길 8-3 (원미동 삼성그린빌 101호) | 032-678-2922 | 14565
4 | 즉석판매제조가공업 | 맛나방앗간 | 경기도 부천시 지봉로 148 (역곡동) | 032-345-0604 | 14669
5 | 즉석판매제조가공업 | 제일떡집 | 경기도 부천시 자유로 37-1 (심곡본동) | 032-653-0751 | 14709
6 | 즉석판매제조가공업 | 풍년방앗간 | 경기도 부천시 심곡로 32-1 (심곡본동) | 032-654-5093 | 14738
7 | 즉석판매제조가공업 | 원종방앗간 | 경기도 부천시 소사로 829-1 (원종동) | 032-672-8501 | 14425
8 | 즉석판매제조가공업 | 심곡방앗간 | 경기도 부천시 부흥로355번길 38 (심곡동) | 032-653-8163 | 14577
9 | 즉석판매제조가공업 | 충남기름집 | 경기도 부천시 부일로571번길 20 (소사동) | 032-347-1634 | 14647

Last rows

index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화 | 우편번호(도로명)
1185 | 즉석판매제조가공업 | 장원에프엔비 | 경기도 부천시 길주로 118 홈플러스부천상동점 1층일부 (상동) | <NA> | 14545
1186 | 즉석판매제조가공업 | 장원에프엔비 | 경기도 부천시 경인로 532 HOME PLUS 소사점 지하1층일부 (괴안동) | <NA> | 14676
1187 | 즉석판매제조가공업 | (주)햇살드림 | 경기도 부천시 길주로 180 현대백화점 중동점 지하1층 일부호 (중동) | <NA> | 14546
1188 | 즉석판매제조가공업 | 아리랑에프앤비 | 경기도 부천시 길주로 180 현대백화점 중동점 지하1층일부 (중동) | <NA> | 14546
1189 | 즉석판매제조가공업 | (주)마켓인 | 경기도 부천시 부천로 1 부천역사 이마트 3층 행사매대 (심곡본동) | <NA> | 14637
1190 | 즉석판매제조가공업 | 주식회사 월드푸드 | 경기도 부천시 수도로 48 1층 일부 (약대동) | <NA> | 14518
1191 | 즉석판매제조가공업 | 복덩이순대국 | 경기도 부천시 범안로7번길 79 1층 일부호 (괴안동) | <NA> | 14682
1192 | 즉석판매제조가공업 | 카페 구들장 | 경기도 부천시 은성로67번길 41 소사본동 156-33 단독주택 지하1층 일부호 (소사본동) | <NA> | 14702
1193 | 즉석판매제조가공업 | 삼부자 | 경기도 부천시 소사로 202 지하1층 일부호 (소사본동) | <NA> | 14703
1194 | 즉석판매제조가공업 | 명류당티에프 | 경기도 부천시 소사로 663 홈플러스 부천여월점 지하2층일부 (여월동) | <NA> | 14485