Overview

Dataset statistics

Number of variables3
Number of observations344
Missing cells3
Missing cells (%)0.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.2 KiB
Average record size in memory24.4 B

Variable types

Text3

Dataset

Description인천광역시 미추홀구의 건설업현황 데이터 입니다. 데이터 세부내역에는 상호명, 전화번호, 도로명 주소를 포함하여 데이터를 제공하고 있습니다.
Author인천광역시 미추홀구
URLhttps://www.data.go.kr/data/15099994/fileData.do

Reproduction

Analysis started2024-03-23 06:52:55.192307
Analysis finished2024-03-23 06:52:57.863387
Duration2.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct343
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2024-03-23T06:52:58.233572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length7.0901163
Min length2

Characters and Unicode

Total characters2439
Distinct characters264
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique342 ?
Unique (%)99.4%

Sample

1st row(주)가람건설산업
2nd row(주)가야이엔씨
3rd row(주)가인엔지니어링
4th row(주)건우건축설비
5th row(주)경유산업개발
ValueCountFrequency (%)
삼성건축설비 2
 
0.6%
신현산업개발(주 1
 
0.3%
송도종합서비스 1
 
0.3%
스마트보일러 1
 
0.3%
수창산업개발(주 1
 
0.3%
수엔지니어링 1
 
0.3%
수아이엔씨주식회사 1
 
0.3%
수복설비 1
 
0.3%
수도종합상사 1
 
0.3%
수(水 1
 
0.3%
Other values (333) 333
96.8%
2024-03-23T06:52:59.430192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
230
 
9.4%
( 203
 
8.3%
) 203
 
8.3%
117
 
4.8%
104
 
4.3%
67
 
2.7%
55
 
2.3%
40
 
1.6%
40
 
1.6%
35
 
1.4%
Other values (254) 1345
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1993
81.7%
Open Punctuation 203
 
8.3%
Close Punctuation 203
 
8.3%
Uppercase Letter 28
 
1.1%
Lowercase Letter 7
 
0.3%
Decimal Number 2
 
0.1%
Other Punctuation 2
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
230
 
11.5%
117
 
5.9%
104
 
5.2%
67
 
3.4%
55
 
2.8%
40
 
2.0%
40
 
2.0%
35
 
1.8%
33
 
1.7%
32
 
1.6%
Other values (236) 1240
62.2%
Uppercase Letter
ValueCountFrequency (%)
N 8
28.6%
E 8
28.6%
G 8
28.6%
S 2
 
7.1%
D 1
 
3.6%
M 1
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
o 2
28.6%
l 1
14.3%
k 1
14.3%
g 1
14.3%
n 1
14.3%
e 1
14.3%
Other Punctuation
ValueCountFrequency (%)
· 1
50.0%
. 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 203
100.0%
Close Punctuation
ValueCountFrequency (%)
) 203
100.0%
Decimal Number
ValueCountFrequency (%)
3 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1992
81.7%
Common 411
 
16.9%
Latin 35
 
1.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
230
 
11.5%
117
 
5.9%
104
 
5.2%
67
 
3.4%
55
 
2.8%
40
 
2.0%
40
 
2.0%
35
 
1.8%
33
 
1.7%
32
 
1.6%
Other values (235) 1239
62.2%
Latin
ValueCountFrequency (%)
N 8
22.9%
E 8
22.9%
G 8
22.9%
o 2
 
5.7%
S 2
 
5.7%
D 1
 
2.9%
l 1
 
2.9%
k 1
 
2.9%
g 1
 
2.9%
n 1
 
2.9%
Other values (2) 2
 
5.7%
Common
ValueCountFrequency (%)
( 203
49.4%
) 203
49.4%
3 2
 
0.5%
· 1
 
0.2%
- 1
 
0.2%
. 1
 
0.2%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1992
81.7%
ASCII 445
 
18.2%
None 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
230
 
11.5%
117
 
5.9%
104
 
5.2%
67
 
3.4%
55
 
2.8%
40
 
2.0%
40
 
2.0%
35
 
1.8%
33
 
1.7%
32
 
1.6%
Other values (235) 1239
62.2%
ASCII
ValueCountFrequency (%)
( 203
45.6%
) 203
45.6%
N 8
 
1.8%
E 8
 
1.8%
G 8
 
1.8%
3 2
 
0.4%
o 2
 
0.4%
S 2
 
0.4%
D 1
 
0.2%
- 1
 
0.2%
Other values (7) 7
 
1.6%
None
ValueCountFrequency (%)
· 1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct305
Distinct (%)89.4%
Missing3
Missing (%)0.9%
Memory size2.8 KiB
2024-03-23T06:53:00.302237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.093842
Min length11

Characters and Unicode

Total characters4124
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique293 ?
Unique (%)85.9%

Sample

1st row032-433-0406
2nd row032-288-0482
3rd row032-863-6900
4th row032-431-0991
5th row032-564-1809
ValueCountFrequency (%)
000-0000-0000 23
 
6.7%
032-000-0000 3
 
0.9%
00-000-0000 3
 
0.9%
032-1544-3002 3
 
0.9%
032-873-4688 2
 
0.6%
032-887-8428 2
 
0.6%
032-883-0888 2
 
0.6%
032-000-000 2
 
0.6%
032-437-9790 2
 
0.6%
032-467-1114 2
 
0.6%
Other values (295) 297
87.1%
2024-03-23T06:53:01.682729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 854
20.7%
- 682
16.5%
2 515
12.5%
3 509
12.3%
8 364
8.8%
4 263
 
6.4%
7 230
 
5.6%
6 215
 
5.2%
1 183
 
4.4%
5 168
 
4.1%
Other values (2) 141
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3441
83.4%
Dash Punctuation 682
 
16.5%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 854
24.8%
2 515
15.0%
3 509
14.8%
8 364
10.6%
4 263
 
7.6%
7 230
 
6.7%
6 215
 
6.2%
1 183
 
5.3%
5 168
 
4.9%
9 140
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 682
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4124
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 854
20.7%
- 682
16.5%
2 515
12.5%
3 509
12.3%
8 364
8.8%
4 263
 
6.4%
7 230
 
5.6%
6 215
 
5.2%
1 183
 
4.4%
5 168
 
4.1%
Other values (2) 141
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4124
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 854
20.7%
- 682
16.5%
2 515
12.5%
3 509
12.3%
8 364
8.8%
4 263
 
6.4%
7 230
 
5.6%
6 215
 
5.2%
1 183
 
4.4%
5 168
 
4.1%
Other values (2) 141
 
3.4%
Distinct341
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2024-03-23T06:53:02.541739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length44
Mean length31.738372
Min length20

Characters and Unicode

Total characters10918
Distinct characters192
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique338 ?
Unique (%)98.3%

Sample

1st row인천광역시 미추홀구 수봉로45번길 56 1층 (숭의동)
2nd row인천광역시 미추홀구 경원대로658번길 21-4 (관교동)
3rd row인천광역시 미추홀구 인하로 201-3 , 3층 (주안동)
4th row인천광역시 미추홀구 방축로 190 , 312호 (도화동)
5th row인천광역시 미추홀구 인중로 22, 6층(숭의동, 용운빌딩)
ValueCountFrequency (%)
인천광역시 344
 
16.4%
미추홀구 342
 
16.3%
주안동 136
 
6.5%
66
 
3.1%
1층 61
 
2.9%
도화동 47
 
2.2%
숭의동 37
 
1.8%
용현동 34
 
1.6%
2층 30
 
1.4%
학익동 27
 
1.3%
Other values (564) 974
46.4%
2024-03-23T06:53:03.901957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1754
 
16.1%
439
 
4.0%
1 408
 
3.7%
379
 
3.5%
371
 
3.4%
) 368
 
3.4%
( 368
 
3.4%
361
 
3.3%
361
 
3.3%
350
 
3.2%
Other values (182) 5759
52.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6367
58.3%
Decimal Number 1849
 
16.9%
Space Separator 1754
 
16.1%
Close Punctuation 368
 
3.4%
Open Punctuation 368
 
3.4%
Other Punctuation 129
 
1.2%
Dash Punctuation 79
 
0.7%
Uppercase Letter 3
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
439
 
6.9%
379
 
6.0%
371
 
5.8%
361
 
5.7%
361
 
5.7%
350
 
5.5%
348
 
5.5%
348
 
5.5%
345
 
5.4%
345
 
5.4%
Other values (163) 2720
42.7%
Decimal Number
ValueCountFrequency (%)
1 408
22.1%
2 267
14.4%
3 220
11.9%
0 191
10.3%
4 183
9.9%
6 151
 
8.2%
5 143
 
7.7%
9 96
 
5.2%
8 95
 
5.1%
7 95
 
5.1%
Other Punctuation
ValueCountFrequency (%)
, 114
88.4%
14
 
10.9%
/ 1
 
0.8%
Space Separator
ValueCountFrequency (%)
1754
100.0%
Close Punctuation
ValueCountFrequency (%)
) 368
100.0%
Open Punctuation
ValueCountFrequency (%)
( 368
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 79
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6367
58.3%
Common 4548
41.7%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
439
 
6.9%
379
 
6.0%
371
 
5.8%
361
 
5.7%
361
 
5.7%
350
 
5.5%
348
 
5.5%
348
 
5.5%
345
 
5.4%
345
 
5.4%
Other values (163) 2720
42.7%
Common
ValueCountFrequency (%)
1754
38.6%
1 408
 
9.0%
) 368
 
8.1%
( 368
 
8.1%
2 267
 
5.9%
3 220
 
4.8%
0 191
 
4.2%
4 183
 
4.0%
6 151
 
3.3%
5 143
 
3.1%
Other values (8) 495
 
10.9%
Latin
ValueCountFrequency (%)
B 3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6367
58.3%
ASCII 4537
41.6%
None 14
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1754
38.7%
1 408
 
9.0%
) 368
 
8.1%
( 368
 
8.1%
2 267
 
5.9%
3 220
 
4.8%
0 191
 
4.2%
4 183
 
4.0%
6 151
 
3.3%
5 143
 
3.2%
Other values (8) 484
 
10.7%
Hangul
ValueCountFrequency (%)
439
 
6.9%
379
 
6.0%
371
 
5.8%
361
 
5.7%
361
 
5.7%
350
 
5.5%
348
 
5.5%
348
 
5.5%
345
 
5.4%
345
 
5.4%
Other values (163) 2720
42.7%
None
ValueCountFrequency (%)
14
100.0%

Missing values

2024-03-23T06:52:57.502804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-23T06:52:57.753373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호전화번호도로명주소
0(주)가람건설산업032-433-0406인천광역시 미추홀구 수봉로45번길 56 1층 (숭의동)
1(주)가야이엔씨032-288-0482인천광역시 미추홀구 경원대로658번길 21-4 (관교동)
2(주)가인엔지니어링032-863-6900인천광역시 미추홀구 인하로 201-3 , 3층 (주안동)
3(주)건우건축설비032-431-0991인천광역시 미추홀구 방축로 190 , 312호 (도화동)
4(주)경유산업개발032-564-1809인천광역시 미추홀구 인중로 22, 6층(숭의동, 용운빌딩)
5(주)경인이앤씨032-467-1114인천광역시 미추홀구 경원대로716번길 42-1 , 1층 2호 (관교동)
6(주)경인조경건설032-467-1114인천광역시 미추홀구 경원대로716번길 42-1 1층 (관교동)
7(주)경인종합설비032-886-9994인천광역시 미추홀구 장천로14번길 28 102호 (숭의동)
8(주)경호토건032-423-2292인천광역시 미추홀구 주안로 116, 5층506호(주안동, 주안리가스퀘어)
9(주)계양건설032-541-8898인천광역시 미추홀구 인주대로224번길 6 208호(수봉오피스텔) (용현동)
상호전화번호도로명주소
334현대조경개발032-437-9790인천광역시 미추홀구 경인로 384 , 2층 79호 (주안동)
335현돈건설(주)032-432-9974인천광역시 미추홀구 주안로205번길 12-12 , 1003호(제이앤케이하베스트) (주안동)
336현진건설(주)032-813-2285인천광역시 미추홀구 매소홀로 592 3층 (문학동)
337협동보일러032-865-4343인천광역시 미추홀구 인하로236번길 40 (주안동)
338형제ENG0000-0000-0000인천광역시 미추홀구 염전로168번길 28, 도화두손지젤시티 제비동 901호
339형제공사032-425-6206인천광역시 미추홀구 동주길 37 (주안동)
340효성종합건축설비032-544-1574인천광역시 미추홀구 수봉남로17번길 24 1층 (용현동)
341흥운건설(주)032-589-6641인천광역시 미추홀구 방축로 190, 2-318 (도화동)
342흥일전문건설(주)032-889-0300인천광역시 미추홀구 인중로 7 2층 (숭의동)
343희망ENG032-885-6111인천광역시 미추홀구 참외전로 349 (숭의동)