Overview

Dataset statistics

Number of variables3
Number of observations42
Missing cells2
Missing cells (%)1.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 KiB
Average record size in memory27.1 B

Variable types

Text3

Dataset

Description인천광역시 부평구_도장_습식_방수_석공사업 현황 데이터는 부평구 내에 도장_습식_방수_석공사업체의 업체명, 도로명 주소, 전화번호 정보를 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15117922/fileData.do

Alerts

전화번호 has 2 (4.8%) missing valuesMissing
업체명 has unique valuesUnique
도로명주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:38:43.046412
Analysis finished2023-12-12 03:38:43.363585
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct42
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size468.0 B
2023-12-12T12:38:43.541710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length11
Mean length8.047619
Min length5

Characters and Unicode

Total characters338
Distinct characters101
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)100.0%

Sample

1st row(주)다온건설
2nd row(주)대신건설
3rd row(주)대하건설
4th row(주)동우건설
5th row(주)보성씨앤아이
ValueCountFrequency (%)
주)다온건설 1
 
2.4%
에이치에스도장건설(주 1
 
2.4%
하나월드텍건설(주 1
 
2.4%
도형이엔씨(주 1
 
2.4%
리더스하우징 1
 
2.4%
반석티티건설(주 1
 
2.4%
백상건설(주 1
 
2.4%
선강건설(주 1
 
2.4%
심일건설(주 1
 
2.4%
어반이엔씨(주 1
 
2.4%
Other values (32) 32
76.2%
2023-12-12T12:38:44.094630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
 
11.5%
( 35
 
10.4%
) 35
 
10.4%
25
 
7.4%
21
 
6.2%
11
 
3.3%
8
 
2.4%
7
 
2.1%
6
 
1.8%
6
 
1.8%
Other values (91) 145
42.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 266
78.7%
Open Punctuation 35
 
10.4%
Close Punctuation 35
 
10.4%
Other Symbol 2
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
14.7%
25
 
9.4%
21
 
7.9%
11
 
4.1%
8
 
3.0%
7
 
2.6%
6
 
2.3%
6
 
2.3%
4
 
1.5%
4
 
1.5%
Other values (88) 135
50.8%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 268
79.3%
Common 70
 
20.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
14.6%
25
 
9.3%
21
 
7.8%
11
 
4.1%
8
 
3.0%
7
 
2.6%
6
 
2.2%
6
 
2.2%
4
 
1.5%
4
 
1.5%
Other values (89) 137
51.1%
Common
ValueCountFrequency (%)
( 35
50.0%
) 35
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 266
78.7%
ASCII 70
 
20.7%
None 2
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
39
 
14.7%
25
 
9.4%
21
 
7.9%
11
 
4.1%
8
 
3.0%
7
 
2.6%
6
 
2.3%
6
 
2.3%
4
 
1.5%
4
 
1.5%
Other values (88) 135
50.8%
ASCII
ValueCountFrequency (%)
( 35
50.0%
) 35
50.0%
None
ValueCountFrequency (%)
2
100.0%

도로명주소
Text

UNIQUE 

Distinct42
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size468.0 B
2023-12-12T12:38:44.474276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length34.5
Mean length30.404762
Min length25

Characters and Unicode

Total characters1277
Distinct characters88
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)100.0%

Sample

1st row인천시 부평구 부평대로167번길 54, 401호 (청천동)
2nd row인천광역시 부평구 길주로 420-5, 1층 (산곡동)
3rd row인천광역시 부평구 대정로 66 406호 (부평동)
4th row인천광역시 부평구 장제로221번길 19 1층 (부평동)
5th row인천광역시 부평구 부평대로313번길 84 (청천동)
ValueCountFrequency (%)
부평구 42
 
16.2%
인천광역시 40
 
15.4%
부평동 11
 
4.2%
1층 7
 
2.7%
청천동 7
 
2.7%
십정동 6
 
2.3%
부개동 5
 
1.9%
산곡동 5
 
1.9%
삼산동 5
 
1.9%
길주로 3
 
1.2%
Other values (105) 128
49.4%
2023-12-12T12:38:45.093678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
217
 
17.0%
69
 
5.4%
61
 
4.8%
50
 
3.9%
50
 
3.9%
1 49
 
3.8%
( 44
 
3.4%
) 44
 
3.4%
2 44
 
3.4%
43
 
3.4%
Other values (78) 606
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 711
55.7%
Decimal Number 238
 
18.6%
Space Separator 217
 
17.0%
Open Punctuation 44
 
3.4%
Close Punctuation 44
 
3.4%
Other Punctuation 9
 
0.7%
Dash Punctuation 8
 
0.6%
Uppercase Letter 6
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
 
9.7%
61
 
8.6%
50
 
7.0%
50
 
7.0%
43
 
6.0%
43
 
6.0%
42
 
5.9%
42
 
5.9%
40
 
5.6%
40
 
5.6%
Other values (56) 231
32.5%
Decimal Number
ValueCountFrequency (%)
1 49
20.6%
2 44
18.5%
3 26
10.9%
4 26
10.9%
0 24
10.1%
9 20
8.4%
6 16
 
6.7%
8 13
 
5.5%
7 11
 
4.6%
5 9
 
3.8%
Uppercase Letter
ValueCountFrequency (%)
H 1
16.7%
I 1
16.7%
Z 1
16.7%
O 1
16.7%
N 1
16.7%
E 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 7
77.8%
2
 
22.2%
Space Separator
ValueCountFrequency (%)
217
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 711
55.7%
Common 560
43.9%
Latin 6
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
 
9.7%
61
 
8.6%
50
 
7.0%
50
 
7.0%
43
 
6.0%
43
 
6.0%
42
 
5.9%
42
 
5.9%
40
 
5.6%
40
 
5.6%
Other values (56) 231
32.5%
Common
ValueCountFrequency (%)
217
38.8%
1 49
 
8.8%
( 44
 
7.9%
) 44
 
7.9%
2 44
 
7.9%
3 26
 
4.6%
4 26
 
4.6%
0 24
 
4.3%
9 20
 
3.6%
6 16
 
2.9%
Other values (6) 50
 
8.9%
Latin
ValueCountFrequency (%)
H 1
16.7%
I 1
16.7%
Z 1
16.7%
O 1
16.7%
N 1
16.7%
E 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 711
55.7%
ASCII 564
44.2%
None 2
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
217
38.5%
1 49
 
8.7%
( 44
 
7.8%
) 44
 
7.8%
2 44
 
7.8%
3 26
 
4.6%
4 26
 
4.6%
0 24
 
4.3%
9 20
 
3.5%
6 16
 
2.8%
Other values (11) 54
 
9.6%
Hangul
ValueCountFrequency (%)
69
 
9.7%
61
 
8.6%
50
 
7.0%
50
 
7.0%
43
 
6.0%
43
 
6.0%
42
 
5.9%
42
 
5.9%
40
 
5.6%
40
 
5.6%
Other values (56) 231
32.5%
None
ValueCountFrequency (%)
2
100.0%

전화번호
Text

MISSING 

Distinct40
Distinct (%)100.0%
Missing2
Missing (%)4.8%
Memory size468.0 B
2023-12-12T12:38:45.356091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.025
Min length12

Characters and Unicode

Total characters481
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)100.0%

Sample

1st row032-836-5550
2nd row032-246-0123
3rd row053-562-7728
4th row032-566-1867
5th row032-526-0485
ValueCountFrequency (%)
032-932-5101 1
 
2.5%
032-503-7181 1
 
2.5%
032-327-9572 1
 
2.5%
070-7514-2772 1
 
2.5%
032-515-2341 1
 
2.5%
032-330-6327 1
 
2.5%
032-522-9994 1
 
2.5%
032-710-7061 1
 
2.5%
032-330-0922 1
 
2.5%
032-511-0482 1
 
2.5%
Other values (30) 30
75.0%
2023-12-12T12:38:45.798199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 81
16.8%
2 81
16.8%
- 80
16.6%
3 67
13.9%
5 39
8.1%
1 33
6.9%
7 27
 
5.6%
4 22
 
4.6%
6 20
 
4.2%
9 18
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 401
83.4%
Dash Punctuation 80
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 81
20.2%
2 81
20.2%
3 67
16.7%
5 39
9.7%
1 33
8.2%
7 27
 
6.7%
4 22
 
5.5%
6 20
 
5.0%
9 18
 
4.5%
8 13
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 481
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 81
16.8%
2 81
16.8%
- 80
16.6%
3 67
13.9%
5 39
8.1%
1 33
6.9%
7 27
 
5.6%
4 22
 
4.6%
6 20
 
4.2%
9 18
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 481
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 81
16.8%
2 81
16.8%
- 80
16.6%
3 67
13.9%
5 39
8.1%
1 33
6.9%
7 27
 
5.6%
4 22
 
4.6%
6 20
 
4.2%
9 18
 
3.7%

Correlations

2023-12-12T12:38:45.927030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명도로명주소전화번호
업체명1.0001.0001.000
도로명주소1.0001.0001.000
전화번호1.0001.0001.000

Missing values

2023-12-12T12:38:43.262349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:38:43.334061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명도로명주소전화번호
0(주)다온건설인천시 부평구 부평대로167번길 54, 401호 (청천동)032-836-5550
1(주)대신건설인천광역시 부평구 길주로 420-5, 1층 (산곡동)032-246-0123
2(주)대하건설인천광역시 부평구 대정로 66 406호 (부평동)053-562-7728
3(주)동우건설인천광역시 부평구 장제로221번길 19 1층 (부평동)<NA>
4(주)보성씨앤아이인천광역시 부평구 부평대로313번길 84 (청천동)032-566-1867
5(주)서영이엔씨인천광역시 부평구 주부토로172번길 34 2층 (갈산동)032-526-0485
6(주)신영석재건설인천광역시 부평구 동암남로34번길 12 (십정동)032-422-2353
7(주)엔에스플랜인천광역시 부평구 길주로 659 301호 (삼산동)032-508-1017
8(주)예현석건인천광역시 부평구 대정로 7 22층 2239호 (부평동)032-673-1770
9(주)와이구조엔지니어링인천광역시 부평구 부평대로 301 5층 501호 (청천동)032-519-1971
업체명도로명주소전화번호
32유청건설(주)인천광역시 부평구 체육관로 32 하이존(HI-ZONE) 403호 (삼산동)032-327-9572
33장흥토건(주)인천광역시 부평구 경인로 707 602호 (십정동)032-330-0006
34주식회사라인원이앤씨인천광역시 부평구 동암산로37번길 1 1층 102호 (십정동)032-439-1002
35주식회사유현건설인천광역시 부평구 주부토로81번길 50 3동 214호 (부평동)032-330-1640
36케스켐(주)인천광역시 부평구 부평대로 283 씨동 6층 611-2호 (청천동)032-505-0520
37케이에이치종합건설㈜인천광역시 부평구 배곶남로9번길 12 1층 2호 (십정동)032-426-4111
38태림산업개발(주)인천광역시 부평구 평천로 296 경연빌딩 4층 (갈산동)032-526-6704
39티씨에스이앤씨주식회사인천광역시 부평구 새벌로 29, 8층 813호 (청천동)032-724-8929
40하나월드텍건설(주)인천광역시 부평구 마장로 368-1 104호 (청천동)032-504-0457
41한양전문건설(주)인천광역시 부평구 마장로179번길 9 (산곡동)032-523-3337