Overview

Dataset statistics

Number of variables3
Number of observations353
Missing cells38
Missing cells (%)3.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.4 KiB
Average record size in memory24.4 B

Variable types

Text3

Dataset

Description인천광역시 미추홀구의 건설업현황 데이터 입니다. 데이터 세부내역에는 상호명, 전화번호, 도로명 주소를 포함하여 데이터를 제공하고 있습니다.<br/>
Author인천광역시 미추홀구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15099994&srcSe=7661IVAWM27C61E190

Alerts

전화번호 has 37 (10.5%) missing valuesMissing

Reproduction

Analysis started2024-04-06 09:46:24.158646
Analysis finished2024-04-06 09:46:25.095139
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct352
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-04-06T18:46:25.412720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length6.9745042
Min length2

Characters and Unicode

Total characters2462
Distinct characters255
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique351 ?
Unique (%)99.4%

Sample

1st row(주)가람건설산업
2nd row(주)가야이엔씨
3rd row(주)가인엔지니어링
4th row(주)건우건축설비
5th row(주)경유산업개발
ValueCountFrequency (%)
삼성건축설비 2
 
0.6%
삼오설비 1
 
0.3%
수복설비 1
 
0.3%
수도종합상사 1
 
0.3%
수(水 1
 
0.3%
송연가스산업 1
 
0.3%
송도종합서비스 1
 
0.3%
세진종합공사 1
 
0.3%
세진공조시스템 1
 
0.3%
세원아이디에스(주 1
 
0.3%
Other values (342) 342
96.9%
2024-04-06T18:46:26.103430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
236
 
9.6%
( 216
 
8.8%
) 216
 
8.8%
126
 
5.1%
112
 
4.5%
67
 
2.7%
59
 
2.4%
39
 
1.6%
36
 
1.5%
32
 
1.3%
Other values (245) 1323
53.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1997
81.1%
Open Punctuation 216
 
8.8%
Close Punctuation 216
 
8.8%
Uppercase Letter 26
 
1.1%
Lowercase Letter 3
 
0.1%
Decimal Number 2
 
0.1%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
236
 
11.8%
126
 
6.3%
112
 
5.6%
67
 
3.4%
59
 
3.0%
39
 
2.0%
36
 
1.8%
32
 
1.6%
31
 
1.6%
31
 
1.6%
Other values (232) 1228
61.5%
Uppercase Letter
ValueCountFrequency (%)
N 8
30.8%
E 8
30.8%
G 8
30.8%
S 1
 
3.8%
M 1
 
3.8%
Lowercase Letter
ValueCountFrequency (%)
g 1
33.3%
n 1
33.3%
e 1
33.3%
Other Punctuation
ValueCountFrequency (%)
· 1
50.0%
. 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 216
100.0%
Close Punctuation
ValueCountFrequency (%)
) 216
100.0%
Decimal Number
ValueCountFrequency (%)
3 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1996
81.1%
Common 436
 
17.7%
Latin 29
 
1.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
236
 
11.8%
126
 
6.3%
112
 
5.6%
67
 
3.4%
59
 
3.0%
39
 
2.0%
36
 
1.8%
32
 
1.6%
31
 
1.6%
31
 
1.6%
Other values (231) 1227
61.5%
Latin
ValueCountFrequency (%)
N 8
27.6%
E 8
27.6%
G 8
27.6%
g 1
 
3.4%
n 1
 
3.4%
e 1
 
3.4%
S 1
 
3.4%
M 1
 
3.4%
Common
ValueCountFrequency (%)
( 216
49.5%
) 216
49.5%
3 2
 
0.5%
· 1
 
0.2%
. 1
 
0.2%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1996
81.1%
ASCII 464
 
18.8%
None 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
236
 
11.8%
126
 
6.3%
112
 
5.6%
67
 
3.4%
59
 
3.0%
39
 
2.0%
36
 
1.8%
32
 
1.6%
31
 
1.6%
31
 
1.6%
Other values (231) 1227
61.5%
ASCII
ValueCountFrequency (%)
( 216
46.6%
) 216
46.6%
N 8
 
1.7%
E 8
 
1.7%
G 8
 
1.7%
3 2
 
0.4%
g 1
 
0.2%
n 1
 
0.2%
e 1
 
0.2%
. 1
 
0.2%
Other values (2) 2
 
0.4%
None
ValueCountFrequency (%)
· 1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct307
Distinct (%)97.2%
Missing37
Missing (%)10.5%
Memory size2.9 KiB
2024-04-06T18:46:26.541733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.987342
Min length9

Characters and Unicode

Total characters3788
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique298 ?
Unique (%)94.3%

Sample

1st row032-433-0406
2nd row032-288-0482
3rd row032-863-6900
4th row032-431-0991
5th row032-564-1809
ValueCountFrequency (%)
032-887-8428 2
 
0.6%
1544-3002 2
 
0.6%
032-873-4688 2
 
0.6%
032-437-9790 2
 
0.6%
032-467-1114 2
 
0.6%
032-437-8720 2
 
0.6%
032-719-7380 2
 
0.6%
032-875-9238 2
 
0.6%
032-883-0888 2
 
0.6%
032-831-5599 1
 
0.3%
Other values (297) 297
94.0%
2024-04-06T18:46:27.275598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 629
16.6%
2 532
14.0%
0 520
13.7%
3 510
13.5%
8 379
10.0%
4 265
7.0%
7 234
 
6.2%
6 226
 
6.0%
1 190
 
5.0%
5 169
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3159
83.4%
Dash Punctuation 629
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 532
16.8%
0 520
16.5%
3 510
16.1%
8 379
12.0%
4 265
8.4%
7 234
7.4%
6 226
7.2%
1 190
 
6.0%
5 169
 
5.3%
9 134
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 629
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3788
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 629
16.6%
2 532
14.0%
0 520
13.7%
3 510
13.5%
8 379
10.0%
4 265
7.0%
7 234
 
6.2%
6 226
 
6.0%
1 190
 
5.0%
5 169
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3788
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 629
16.6%
2 532
14.0%
0 520
13.7%
3 510
13.5%
8 379
10.0%
4 265
7.0%
7 234
 
6.2%
6 226
 
6.0%
1 190
 
5.0%
5 169
 
4.5%
Distinct349
Distinct (%)99.1%
Missing1
Missing (%)0.3%
Memory size2.9 KiB
2024-04-06T18:46:27.693762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length46
Mean length31.741477
Min length22

Characters and Unicode

Total characters11173
Distinct characters191
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique346 ?
Unique (%)98.3%

Sample

1st row인천광역시 미추홀구 수봉로45번길 56 1층 (숭의동)
2nd row인천광역시 미추홀구 경원대로658번길 21-4 (관교동)
3rd row인천광역시 미추홀구 인하로 201-3 , 3층 (주안동)
4th row인천광역시 미추홀구 방축로 190 , 312호 (도화동)
5th row인천광역시 미추홀구 인중로 22, 6층(숭의동, 용운빌딩)
ValueCountFrequency (%)
인천광역시 352
 
16.4%
미추홀구 350
 
16.3%
주안동 136
 
6.3%
82
 
3.8%
1층 59
 
2.7%
도화동 50
 
2.3%
숭의동 42
 
2.0%
2층 33
 
1.5%
용현동 29
 
1.3%
학익동 24
 
1.1%
Other values (556) 995
46.2%
2024-04-06T18:46:28.871197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1800
 
16.1%
459
 
4.1%
1 417
 
3.7%
386
 
3.5%
381
 
3.4%
( 380
 
3.4%
) 380
 
3.4%
370
 
3.3%
370
 
3.3%
359
 
3.2%
Other values (181) 5871
52.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6517
58.3%
Decimal Number 1864
 
16.7%
Space Separator 1800
 
16.1%
Open Punctuation 380
 
3.4%
Close Punctuation 380
 
3.4%
Other Punctuation 148
 
1.3%
Dash Punctuation 80
 
0.7%
Uppercase Letter 3
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
459
 
7.0%
386
 
5.9%
381
 
5.8%
370
 
5.7%
370
 
5.7%
359
 
5.5%
356
 
5.5%
356
 
5.5%
354
 
5.4%
353
 
5.4%
Other values (162) 2773
42.6%
Decimal Number
ValueCountFrequency (%)
1 417
22.4%
2 260
13.9%
3 228
12.2%
0 187
10.0%
4 185
9.9%
5 145
 
7.8%
6 144
 
7.7%
7 103
 
5.5%
8 101
 
5.4%
9 94
 
5.0%
Other Punctuation
ValueCountFrequency (%)
, 130
87.8%
17
 
11.5%
/ 1
 
0.7%
Space Separator
ValueCountFrequency (%)
1800
100.0%
Open Punctuation
ValueCountFrequency (%)
( 380
100.0%
Close Punctuation
ValueCountFrequency (%)
) 380
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6517
58.3%
Common 4653
41.6%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
459
 
7.0%
386
 
5.9%
381
 
5.8%
370
 
5.7%
370
 
5.7%
359
 
5.5%
356
 
5.5%
356
 
5.5%
354
 
5.4%
353
 
5.4%
Other values (162) 2773
42.6%
Common
ValueCountFrequency (%)
1800
38.7%
1 417
 
9.0%
( 380
 
8.2%
) 380
 
8.2%
2 260
 
5.6%
3 228
 
4.9%
0 187
 
4.0%
4 185
 
4.0%
5 145
 
3.1%
6 144
 
3.1%
Other values (8) 527
 
11.3%
Latin
ValueCountFrequency (%)
B 3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6517
58.3%
ASCII 4639
41.5%
None 17
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1800
38.8%
1 417
 
9.0%
( 380
 
8.2%
) 380
 
8.2%
2 260
 
5.6%
3 228
 
4.9%
0 187
 
4.0%
4 185
 
4.0%
5 145
 
3.1%
6 144
 
3.1%
Other values (8) 513
 
11.1%
Hangul
ValueCountFrequency (%)
459
 
7.0%
386
 
5.9%
381
 
5.8%
370
 
5.7%
370
 
5.7%
359
 
5.5%
356
 
5.5%
356
 
5.5%
354
 
5.4%
353
 
5.4%
Other values (162) 2773
42.6%
None
ValueCountFrequency (%)
17
100.0%

Missing values

2024-04-06T18:46:24.598572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-06T18:46:24.728064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-06T18:46:24.988432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

상호전화번호도로명주소
0(주)가람건설산업032-433-0406인천광역시 미추홀구 수봉로45번길 56 1층 (숭의동)
1(주)가야이엔씨032-288-0482인천광역시 미추홀구 경원대로658번길 21-4 (관교동)
2(주)가인엔지니어링032-863-6900인천광역시 미추홀구 인하로 201-3 , 3층 (주안동)
3(주)건우건축설비032-431-0991인천광역시 미추홀구 방축로 190 , 312호 (도화동)
4(주)경유산업개발032-564-1809인천광역시 미추홀구 인중로 22, 6층(숭의동, 용운빌딩)
5(주)경인이앤씨032-467-1114인천광역시 미추홀구 경원대로716번길 42-1 , 1층 2호 (관교동)
6(주)경인조경건설032-467-1114인천광역시 미추홀구 경원대로716번길 42-1 1층 (관교동)
7(주)경인종합설비032-886-9994인천광역시 미추홀구 독정이로 71 (숭의동)
8(주)경호토건032-423-2292인천광역시 미추홀구 주안로 116, 5층506호(주안동, 주안리가스퀘어)
9(주)계양건설032-541-8898인천광역시 미추홀구 인주대로224번길 6 208호(수봉오피스텔) (용현동)
상호전화번호도로명주소
343현대조경개발032-437-9790인천광역시 미추홀구 경인로 384 , 2층 79호 (주안동)
344현돈건설(주)032-432-9974인천광역시 미추홀구 주안로205번길 12-12 , 1003호(제이앤케이하베스트) (주안동)
345현진건설(주)032-813-2285인천광역시 미추홀구 매소홀로 592 3층 (문학동)
346협동보일러032-865-4343인천광역시 미추홀구 인하로236번길 40 (주안동)
347형제ENG<NA>인천광역시 미추홀구 염전로168번길 28, 도화두손지젤시티 제비동 901호
348형제공사032-425-6206인천광역시 미추홀구 동주길 37 (주안동)
349효성종합건축설비032-544-1574인천광역시 미추홀구 수봉남로17번길 24 1층 (용현동)
350흥운건설(주)032-589-6641인천광역시 미추홀구 방축로 190, 2-318 (도화동)
351흥일전문건설(주)032-889-0300인천광역시 미추홀구 인중로 7 2층 (숭의동)
352희망ENG032-885-6111인천광역시 미추홀구 참외전로 349 (숭의동)