Overview

Dataset statistics

Number of variables3
Number of observations122
Missing cells30
Missing cells (%)8.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory25.1 B

Variable types

Text3

Dataset

Description인천광역시 부평구_가스난방공사업 현황 데이터는 부평구 내에 가스난방공업사업체의 업체명, 도로명 주소, 전화번호 정보를 제공하고 있습니다.
Author인천광역시 부평구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15117915&srcSe=7661IVAWM27C61E190

Alerts

전화번호 has 30 (24.6%) missing valuesMissing
업체명 has unique valuesUnique
도로명주소 has unique valuesUnique

Reproduction

Analysis started2024-01-28 17:29:19.722505
Analysis finished2024-01-28 17:29:19.996023
Duration0.27 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct122
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-01-29T02:29:20.206328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length5.7459016
Min length2

Characters and Unicode

Total characters701
Distinct characters162
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique122 ?
Unique (%)100.0%

Sample

1st row(주)경원
2nd row(주)일신자동차
3rd row365에너시스
4th row911보일러할인마트
5th rowLG에어컨종합보일러
ValueCountFrequency (%)
주)경원 1
 
0.8%
차이나yo 1
 
0.8%
중앙상사 1
 
0.8%
제일종합설비 1
 
0.8%
제이엠보일러 1
 
0.8%
정진설비 1
 
0.8%
정성엔지니어링 1
 
0.8%
정도eng 1
 
0.8%
재일설비 1
 
0.8%
일성산업 1
 
0.8%
Other values (112) 112
91.8%
2024-01-29T02:29:20.583460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
62
 
8.8%
61
 
8.7%
21
 
3.0%
20
 
2.9%
19
 
2.7%
16
 
2.3%
16
 
2.3%
15
 
2.1%
14
 
2.0%
14
 
2.0%
Other values (152) 443
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 671
95.7%
Uppercase Letter 11
 
1.6%
Decimal Number 6
 
0.9%
Open Punctuation 5
 
0.7%
Close Punctuation 5
 
0.7%
Lowercase Letter 2
 
0.3%
Other Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
9.2%
61
 
9.1%
21
 
3.1%
20
 
3.0%
19
 
2.8%
16
 
2.4%
16
 
2.4%
15
 
2.2%
14
 
2.1%
14
 
2.1%
Other values (138) 413
61.5%
Decimal Number
ValueCountFrequency (%)
1 2
33.3%
5 1
16.7%
9 1
16.7%
3 1
16.7%
6 1
16.7%
Uppercase Letter
ValueCountFrequency (%)
G 4
36.4%
N 3
27.3%
E 3
27.3%
L 1
 
9.1%
Lowercase Letter
ValueCountFrequency (%)
o 1
50.0%
y 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 672
95.9%
Common 16
 
2.3%
Latin 13
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
9.2%
61
 
9.1%
21
 
3.1%
20
 
3.0%
19
 
2.8%
16
 
2.4%
16
 
2.4%
15
 
2.2%
14
 
2.1%
14
 
2.1%
Other values (139) 414
61.6%
Common
ValueCountFrequency (%)
( 5
31.2%
) 5
31.2%
1 2
 
12.5%
5 1
 
6.2%
9 1
 
6.2%
3 1
 
6.2%
6 1
 
6.2%
Latin
ValueCountFrequency (%)
G 4
30.8%
N 3
23.1%
E 3
23.1%
o 1
 
7.7%
y 1
 
7.7%
L 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 671
95.7%
ASCII 29
 
4.1%
None 1
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
62
 
9.2%
61
 
9.1%
21
 
3.1%
20
 
3.0%
19
 
2.8%
16
 
2.4%
16
 
2.4%
15
 
2.2%
14
 
2.1%
14
 
2.1%
Other values (138) 413
61.5%
ASCII
ValueCountFrequency (%)
( 5
17.2%
) 5
17.2%
G 4
13.8%
N 3
10.3%
E 3
10.3%
1 2
 
6.9%
o 1
 
3.4%
y 1
 
3.4%
5 1
 
3.4%
L 1
 
3.4%
Other values (3) 3
10.3%
None
ValueCountFrequency (%)
1
100.0%

도로명주소
Text

UNIQUE 

Distinct122
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.1 KiB
2024-01-29T02:29:20.829882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length34.5
Mean length29.221311
Min length22

Characters and Unicode

Total characters3565
Distinct characters113
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique122 ?
Unique (%)100.0%

Sample

1st row인천광역시 부평구 장제로373번길 37-8 (갈산동)
2nd row인천광역시 부평구 서촌로 23 (일신동)
3rd row인천광역시 부평구 부평북로 136 (청천동)
4th row인천광역시 부평구 장제로28번길 31 (부평동)
5th row인천광역시 부평구 충선로 191 111호 (부개동,뉴서울상가)
ValueCountFrequency (%)
부평구 122
 
17.4%
인천광역시 119
 
17.0%
부평동 31
 
4.4%
1층 30
 
4.3%
십정동 23
 
3.3%
부개동 19
 
2.7%
갈산동 14
 
2.0%
산곡동 13
 
1.9%
청천동 12
 
1.7%
마장로 9
 
1.3%
Other values (231) 310
44.2%
2024-01-29T02:29:21.201966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
580
 
16.3%
204
 
5.7%
1 193
 
5.4%
167
 
4.7%
142
 
4.0%
131
 
3.7%
129
 
3.6%
122
 
3.4%
122
 
3.4%
( 122
 
3.4%
Other values (103) 1653
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2065
57.9%
Decimal Number 627
 
17.6%
Space Separator 580
 
16.3%
Open Punctuation 122
 
3.4%
Close Punctuation 121
 
3.4%
Dash Punctuation 31
 
0.9%
Other Punctuation 18
 
0.5%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
204
 
9.9%
167
 
8.1%
142
 
6.9%
131
 
6.3%
129
 
6.2%
122
 
5.9%
122
 
5.9%
122
 
5.9%
120
 
5.8%
119
 
5.8%
Other values (86) 687
33.3%
Decimal Number
ValueCountFrequency (%)
1 193
30.8%
2 79
12.6%
3 72
 
11.5%
0 69
 
11.0%
4 42
 
6.7%
9 37
 
5.9%
5 37
 
5.9%
6 36
 
5.7%
8 33
 
5.3%
7 29
 
4.6%
Other Punctuation
ValueCountFrequency (%)
, 10
55.6%
8
44.4%
Space Separator
ValueCountFrequency (%)
580
100.0%
Open Punctuation
ValueCountFrequency (%)
( 122
100.0%
Close Punctuation
ValueCountFrequency (%)
) 121
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 31
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2065
57.9%
Common 1499
42.0%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
204
 
9.9%
167
 
8.1%
142
 
6.9%
131
 
6.3%
129
 
6.2%
122
 
5.9%
122
 
5.9%
122
 
5.9%
120
 
5.8%
119
 
5.8%
Other values (86) 687
33.3%
Common
ValueCountFrequency (%)
580
38.7%
1 193
 
12.9%
( 122
 
8.1%
) 121
 
8.1%
2 79
 
5.3%
3 72
 
4.8%
0 69
 
4.6%
4 42
 
2.8%
9 37
 
2.5%
5 37
 
2.5%
Other values (6) 147
 
9.8%
Latin
ValueCountFrequency (%)
C 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2065
57.9%
ASCII 1492
41.9%
None 8
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
580
38.9%
1 193
 
12.9%
( 122
 
8.2%
) 121
 
8.1%
2 79
 
5.3%
3 72
 
4.8%
0 69
 
4.6%
4 42
 
2.8%
9 37
 
2.5%
5 37
 
2.5%
Other values (6) 140
 
9.4%
Hangul
ValueCountFrequency (%)
204
 
9.9%
167
 
8.1%
142
 
6.9%
131
 
6.3%
129
 
6.2%
122
 
5.9%
122
 
5.9%
122
 
5.9%
120
 
5.8%
119
 
5.8%
Other values (86) 687
33.3%
None
ValueCountFrequency (%)
8
100.0%

전화번호
Text

MISSING 

Distinct91
Distinct (%)98.9%
Missing30
Missing (%)24.6%
Memory size1.1 KiB
2024-01-29T02:29:21.407631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.043478
Min length12

Characters and Unicode

Total characters1108
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique90 ?
Unique (%)97.8%

Sample

1st row032-543-9004
2nd row032-527-3011
3rd row032-281-9000
4th row032-502-9110
5th row032-527-1545
ValueCountFrequency (%)
032-000-0000 2
 
2.2%
032-504-9670 1
 
1.1%
032-543-9004 1
 
1.1%
032-437-6734 1
 
1.1%
032-516-3651 1
 
1.1%
032-503-0404 1
 
1.1%
031-222-9996 1
 
1.1%
032-503-0578 1
 
1.1%
032-362-9000 1
 
1.1%
032-524-8111 1
 
1.1%
Other values (81) 81
88.0%
2024-01-29T02:29:21.717139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 202
18.2%
- 184
16.6%
2 166
15.0%
3 151
13.6%
5 102
9.2%
4 67
 
6.0%
1 65
 
5.9%
7 50
 
4.5%
9 47
 
4.2%
6 37
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 924
83.4%
Dash Punctuation 184
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 202
21.9%
2 166
18.0%
3 151
16.3%
5 102
11.0%
4 67
 
7.3%
1 65
 
7.0%
7 50
 
5.4%
9 47
 
5.1%
6 37
 
4.0%
8 37
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 184
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1108
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 202
18.2%
- 184
16.6%
2 166
15.0%
3 151
13.6%
5 102
9.2%
4 67
 
6.0%
1 65
 
5.9%
7 50
 
4.5%
9 47
 
4.2%
6 37
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1108
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 202
18.2%
- 184
16.6%
2 166
15.0%
3 151
13.6%
5 102
9.2%
4 67
 
6.0%
1 65
 
5.9%
7 50
 
4.5%
9 47
 
4.2%
6 37
 
3.3%

Missing values

2024-01-29T02:29:19.900645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-29T02:29:19.969835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명도로명주소전화번호
0(주)경원인천광역시 부평구 장제로373번길 37-8 (갈산동)032-543-9004
1(주)일신자동차인천광역시 부평구 서촌로 23 (일신동)032-527-3011
2365에너시스인천광역시 부평구 부평북로 136 (청천동)032-281-9000
3911보일러할인마트인천광역시 부평구 장제로28번길 31 (부평동)032-502-9110
4LG에어컨종합보일러인천광역시 부평구 충선로 191 111호 (부개동,뉴서울상가)032-527-1545
5㈜금천엔지니어링인천광역시 부평구 배곶로 32-1 1층 (십정동)032-469-3604
6가가설비인천광역시 부평구 마장로 41 (십정동)032-466-2905
7가경삼천리인천광역시 부평구 주부토로 236 C동 514호 (갈산동)<NA>
8가나설비인천광역시 부평구 충선로 79 1층 (부개동)032-529-0404
9가성시스템인천광역시 부평구 백범로487번길 12-1 101호 (십정동)032-433-3338
업체명도로명주소전화번호
112현대설비공사인천광역시 부평구 부흥로375번길 13 1층 102호 (부평동)032-523-6808
113현대종합설비인천시 부평구 부일로113번길 29-15, 1층032-515-5535
114현대철물설비인천광역시 부평구 주부토로 267 (갈산동)032-524-1979
115형제보일러인천광역시 부평구 경인로980번길 4 104호 (부평동)032-518-4848
116형제설비인천광역시 부평구 신촌로 50 (부평동)<NA>
117혜린건축설비인천광역시 부평구 마분로35번길 1 (부개동)032-524-7379
118호설비인천광역시 부평구 장제로231번길 26 (부평동)032-518-6441
119화성종합건축공사인천광역시 부평구 장제로373번길 8 (갈산동)032-525-2972
120효창종합설비인천광역시 부평구 길주남로66번길 9 (부평동)032-504-8204
121히람엔지니어링인천광역시 부평구 마장로319번길 21 (산곡동)032-672-2344