Overview

Dataset statistics

Number of variables: 3
Number of observations: 31
Missing cells: 1
Missing cells (%): 1.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 876.0 B
Average record size in memory: 28.3 B
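
The two derived figures in these statistics follow from the raw counts. A minimal sketch of the arithmetic, assuming the missing percentage is taken over all cells (variables × observations) and the average record size is total memory divided by the number of rows:

```python
# Raw values copied from the dataset statistics of this report.
n_variables = 3
n_observations = 31
missing_cells = 1
total_bytes = 876.0

# Assumption: the percentage is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size is total memory divided by row count.
avg_record_bytes = total_bytes / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")          # Missing cells (%): 1.1%
print(f"Average record size: {avg_record_bytes:.1f} B")  # Average record size: 28.3 B
```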

Variable types

Text: 3

Dataset

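Description: The Incheon Metropolitan City Bupyeong-gu water supply and sewerage facility construction business dataset provides the company name, road-name address, and telephone number of water supply and sewerage facility contractors located in Bupyeong-gu.
Author: Incheon Metropolitan City, Bupyeong-gu (인천광역시 부평구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15117924&srcSe=7661IVAWM27C61E190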

Alerts

Missing: 전화번호 has 1 (3.2%) missing value
Unique: 업체명 has unique values

Reproduction

Analysis started: 2024-03-18 03:30:28.776156
Analysis finished: 2024-03-18 03:30:30.261159
Duration: 1.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업체명 (company name)
Text

UNIQUE

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 10
Median length: 9
Mean length: 7.6129032
Min length: 4

Characters and Unicode

Total characters: 236
Distinct characters: 76
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
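
As a rough illustration of how such per-character properties can be read, the sketch below uses Python's standard `unicodedata` module to tally general categories for one company name taken from this dataset. It is not the profiler's own implementation, the helper name `category_counts` is hypothetical, and script and block lookups (which the standard library does not expose) are left out.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count Unicode general categories (e.g. Lo, Ps, Pe) in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


# One company name from the sample rows of this report.
sample = "(주)국토산업개발"
counts = category_counts(sample)

# Lo = Other Letter, Ps = Open Punctuation, Pe = Close Punctuation.
print(counts["Lo"], counts["Ps"], counts["Pe"])  # 7 1 1

# The single-code-point company mark is a symbol, not a letter.
print(unicodedata.category("㈜"))  # So (Other Symbol)
```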

Unique

Unique: 31
Unique (%): 100.0%

Sample

1st row: (주)국토산업개발
2nd row: (주)대맥이엔씨
3rd row: (주)대성
4th row: (주)보은건설
5th row: (주)삼선건설
Value | Count | Frequency (%)
주)국토산업개발 | 1 | 3.2%
선강건설(주 | 1 | 3.2%
한양전문건설(주 | 1 | 3.2%
태인산업개발주식회사 | 1 | 3.2%
청우건설(주 | 1 | 3.2%
창운건설(주 | 1 | 3.2%
주식회사유현건설 | 1 | 3.2%
주식회사엘림건설 | 1 | 3.2%
제이에프이앤씨㈜ | 1 | 3.2%
일흥건설(주 | 1 | 3.2%
Other values (21) | 21 | 67.7%

Most occurring characters

ValueCountFrequency (%)
27
 
11.4%
( 21
 
8.9%
) 21
 
8.9%
19
 
8.1%
17
 
7.2%
7
 
3.0%
6
 
2.5%
6
 
2.5%
6
 
2.5%
6
 
2.5%
Other values (66) 100
42.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 192 | 81.4%
Open Punctuation | 21 | 8.9%
Close Punctuation | 21 | 8.9%
Other Symbol | 2 | 0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
14.1%
19
 
9.9%
17
 
8.9%
7
 
3.6%
6
 
3.1%
6
 
3.1%
6
 
3.1%
6
 
3.1%
5
 
2.6%
5
 
2.6%
Other values (63) 88
45.8%
Open Punctuation
Value | Count | Frequency (%)
( | 21 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 21 | 100.0%
Other Symbol
Value | Count | Frequency (%)
㈜ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 194 | 82.2%
Common | 42 | 17.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
13.9%
19
 
9.8%
17
 
8.8%
7
 
3.6%
6
 
3.1%
6
 
3.1%
6
 
3.1%
6
 
3.1%
5
 
2.6%
5
 
2.6%
Other values (64) 90
46.4%
Common
Value | Count | Frequency (%)
( | 21 | 50.0%
) | 21 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 192 | 81.4%
ASCII | 42 | 17.8%
None | 2 | 0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
27
 
14.1%
19
 
9.9%
17
 
8.9%
7
 
3.6%
6
 
3.1%
6
 
3.1%
6
 
3.1%
6
 
3.1%
5
 
2.6%
5
 
2.6%
Other values (63) 88
45.8%
ASCII
Value | Count | Frequency (%)
( | 21 | 50.0%
) | 21 | 50.0%
None
Value | Count | Frequency (%)
㈜ | 2 | 100.0%

도로명주소 (road-name address)
Text

Distinct: 30
Distinct (%): 96.8%
Missing: 0
Missing (%): 0.0%
Memory size: 380.0 B

Length

Max length: 47
Median length: 39
Mean length: 30.870968
Min length: 22

Characters and Unicode

Total characters: 957
Distinct characters: 91
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3

Unique

Unique: 29
Unique (%): 93.5%

Sample

1st row: 인천광역시 부평구 경인로 795 2층 (십정동)
2nd row: 인천광역시 부평구 백범로577번길 20 관리동 405호(십정동, 경인센타) (십정동)
3rd row: 인천광역시 부평구 영성로 46 402호 (삼산동)
4th row: 인천광역시 부평구 일신로 85 (일신동)
5th row: 인천광역시 부평구 경인로 727 영보빌딩3층 (십정동)
Value | Count | Frequency (%)
인천광역시 | 31 | 15.7%
부평구 | 31 | 15.7%
삼산동 | 8 | 4.0%
십정동 | 8 | 4.0%
2층 | 7 | 3.5%
46 | 4 | 2.0%
부개동 | 4 | 2.0%
경인로 | 4 | 2.0%
부평동 | 3 | 1.5%
1층 | 2 | 1.0%
Other values (81) | 96 | 48.5%

Most occurring characters

ValueCountFrequency (%)
167
 
17.5%
43
 
4.5%
42
 
4.4%
38
 
4.0%
36
 
3.8%
33
 
3.4%
33
 
3.4%
32
 
3.3%
) 32
 
3.3%
( 32
 
3.3%
Other values (81) 469
49.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 550 | 57.5%
Space Separator | 167 | 17.5%
Decimal Number | 161 | 16.8%
Close Punctuation | 32 | 3.3%
Open Punctuation | 32 | 3.3%
Other Punctuation | 9 | 0.9%
Dash Punctuation | 4 | 0.4%
Uppercase Letter | 2 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
 
7.8%
42
 
7.6%
38
 
6.9%
36
 
6.5%
33
 
6.0%
33
 
6.0%
32
 
5.8%
31
 
5.6%
31
 
5.6%
31
 
5.6%
Other values (62) 200
36.4%
Decimal Number
Value | Count | Frequency (%)
0 | 25 | 15.5%
2 | 23 | 14.3%
4 | 23 | 14.3%
1 | 22 | 13.7%
3 | 16 | 9.9%
7 | 15 | 9.3%
6 | 11 | 6.8%
5 | 10 | 6.2%
9 | 10 | 6.2%
8 | 6 | 3.7%
Other Punctuation
ValueCountFrequency (%)
, 5
55.6%
3
33.3%
/ 1
 
11.1%
Uppercase Letter
Value | Count | Frequency (%)
D | 1 | 50.0%
B | 1 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 167 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 32 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 32 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 550 | 57.5%
Common | 405 | 42.3%
Latin | 2 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
 
7.8%
42
 
7.6%
38
 
6.9%
36
 
6.5%
33
 
6.0%
33
 
6.0%
32
 
5.8%
31
 
5.6%
31
 
5.6%
31
 
5.6%
Other values (62) 200
36.4%
Common
Value | Count | Frequency (%)
(space) | 167 | 41.2%
) | 32 | 7.9%
( | 32 | 7.9%
0 | 25 | 6.2%
2 | 23 | 5.7%
4 | 23 | 5.7%
1 | 22 | 5.4%
3 | 16 | 4.0%
7 | 15 | 3.7%
6 | 11 | 2.7%
Other values (7) | 39 | 9.6%
Latin
Value | Count | Frequency (%)
D | 1 | 50.0%
B | 1 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 550 | 57.5%
ASCII | 404 | 42.2%
None | 3 | 0.3%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 167 | 41.3%
) | 32 | 7.9%
( | 32 | 7.9%
0 | 25 | 6.2%
2 | 23 | 5.7%
4 | 23 | 5.7%
1 | 22 | 5.4%
3 | 16 | 4.0%
7 | 15 | 3.7%
6 | 11 | 2.7%
Other values (8) | 38 | 9.4%
Hangul
ValueCountFrequency (%)
43
 
7.8%
42
 
7.6%
38
 
6.9%
36
 
6.5%
33
 
6.0%
33
 
6.0%
32
 
5.8%
31
 
5.6%
31
 
5.6%
31
 
5.6%
Other values (62) 200
36.4%
None
ValueCountFrequency (%)
3
100.0%

전화번호 (telephone number)
Text

MISSING

Distinct: 29
Distinct (%): 96.7%
Missing: 1
Missing (%): 3.2%
Memory size: 380.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.033333
Min length: 11

Characters and Unicode

Total characters: 361
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 28
Unique (%): 93.3%
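
The report separates distinct values (how many different values are present) from unique values (values that occur exactly once), and both percentages appear to be taken over the non-missing rows. A small sketch of that reading, using a handful of phone numbers from this column plus one missing entry as an illustrative subset; the helper name `distinct_and_unique` is hypothetical:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (non-missing count, distinct count, unique count)."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    distinct = len(counts)                              # different values seen
    unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
    return len(present), distinct, unique


# Illustrative subset only, not the full column.
phones = ["032-433-8133", "032-433-8133", "032-522-9994", "031-981-2712", None]
n, distinct, unique = distinct_and_unique(phones)
print(n, distinct, unique)  # 4 3 2

# Checking the reported percentages against 30 non-missing rows.
print(round(100 * 29 / 30, 1), round(100 * 28 / 30, 1))  # 96.7 93.3
```

With 30 non-missing rows, 29 distinct and 28 unique values are consistent with exactly one phone number appearing twice.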

Sample

1st row: 032-507-2113
2nd row: 031-981-2712
3rd row: 032-867-2206
4th row: 032-504-0044
5th row: 032-0513-9474
Value | Count | Frequency (%)
032-433-8133 | 2 | 6.7%
032-522-9994 | 1 | 3.3%
031-981-2712 | 1 | 3.3%
032-523-3337 | 1 | 3.3%
032-504-5782 | 1 | 3.3%
032-524-0088 | 1 | 3.3%
032-330-1640 | 1 | 3.3%
032-867-7589 | 1 | 3.3%
032-502-1131 | 1 | 3.3%
032-529-8100 | 1 | 3.3%
Other values (19) | 19 | 63.3%

Most occurring characters

Value | Count | Frequency (%)
- | 60 | 16.6%
0 | 56 | 15.5%
3 | 56 | 15.5%
2 | 45 | 12.5%
1 | 32 | 8.9%
5 | 31 | 8.6%
4 | 21 | 5.8%
6 | 17 | 4.7%
8 | 15 | 4.2%
9 | 15 | 4.2%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 301 | 83.4%
Dash Punctuation | 60 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 56 | 18.6%
3 | 56 | 18.6%
2 | 45 | 15.0%
1 | 32 | 10.6%
5 | 31 | 10.3%
4 | 21 | 7.0%
6 | 17 | 5.6%
8 | 15 | 5.0%
9 | 15 | 5.0%
7 | 13 | 4.3%
Dash Punctuation
Value | Count | Frequency (%)
- | 60 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 361 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 60 | 16.6%
0 | 56 | 15.5%
3 | 56 | 15.5%
2 | 45 | 12.5%
1 | 32 | 8.9%
5 | 31 | 8.6%
4 | 21 | 5.8%
6 | 17 | 4.7%
8 | 15 | 4.2%
9 | 15 | 4.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 361 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 60 | 16.6%
0 | 56 | 15.5%
3 | 56 | 15.5%
2 | 45 | 12.5%
1 | 32 | 8.9%
5 | 31 | 8.6%
4 | 21 | 5.8%
6 | 17 | 4.7%
8 | 15 | 4.2%
9 | 15 | 4.2%

Correlations

Variable | 업체명 | 도로명주소 | 전화번호
업체명 | 1.000 | 1.000 | 1.000
도로명주소 | 1.000 | 1.000 | 0.990
전화번호 | 1.000 | 0.990 | 1.000
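
The report does not say which association measure fills this matrix. One common choice for text or categorical columns is Cramér's V, and the sketch below computes the uncorrected version under that assumption. When nearly every value in both columns occurs once, each value of one column determines the other, so the statistic sits at or near 1, which may be why these entries carry little information. The function name `cramers_v` and the toy inputs are illustrative only.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))  # observed cell counts
    row = Counter(xs)             # marginal counts for xs
    col = Counter(ys)             # marginal counts for ys

    # Pearson chi-squared over every cell, including empty ones.
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row), len(col)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Toy data: every value is unique in both columns, as with 업체명 here.
names = ["a", "b", "c", "d"]
phones = ["p1", "p2", "p3", "p4"]
print(round(cramers_v(names, phones), 3))  # 1.0
```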

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
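
A plain-Python sketch of the per-column nullity counts behind such charts, using three rows copied from the Sample section of this report, with `None` standing in for `<NA>`. The row subset and variable names are for illustration only, and this is not how the report itself draws its figures.

```python
columns = ["업체명", "도로명주소", "전화번호"]

# Three rows from the sample; None represents a missing value (<NA>).
rows = [
    ("(주)태흥아스콘포장", "인천광역시 부평구 부평대로 230 풍진 B/D 203호 (갈산동)", "032-501-6856"),
    ("㈜태울환경", "인천광역시 부평구 백범로 500 2층 (십정동)", None),
    ("우진산업건설주식회사", "인천광역시 부평구 안남로 440, 301호 (청천동)", "032-511-1951"),
]

# Non-missing values per column: the quantity a nullity bar chart plots.
present = {
    name: sum(row[i] is not None for row in rows)
    for i, name in enumerate(columns)
}

# Text version of a nullity matrix: "#" is present, "." is missing.
matrix = ["".join("#" if v is not None else "." for v in row) for row in rows]

for name, count in present.items():
    print(f"{name}: {count}/{len(rows)} present")
print("\n".join(matrix))
```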

Sample

Index | 업체명 | 도로명주소 | 전화번호
0 | (주)국토산업개발 | 인천광역시 부평구 경인로 795 2층 (십정동) | 032-507-2113
1 | (주)대맥이엔씨 | 인천광역시 부평구 백범로577번길 20 관리동 405호(십정동, 경인센타) (십정동) | 031-981-2712
2 | (주)대성 | 인천광역시 부평구 영성로 46 402호 (삼산동) | 032-867-2206
3 | (주)보은건설 | 인천광역시 부평구 일신로 85 (일신동) | 032-504-0044
4 | (주)삼선건설 | 인천광역시 부평구 경인로 727 영보빌딩3층 (십정동) | 032-0513-9474
5 | (주)앞선산업개발 | 인천광역시 부평구 충선로209번길 49 삼산노블시티프라자 2층 205호 (삼산동) | 02-539-7094
6 | (주)엘에스폼웍 | 인천광역시 부평구 경인로1046번길 7 동아빌딩 2층 (부개동) | 032-508-6750
7 | (주)인정산업개발 | 인천광역시 부평구 일신로 85 2층 (일신동) | 070-4466-6997
8 | (주)태흥아스콘포장 | 인천광역시 부평구 부평대로 230 풍진 B/D 203호 (갈산동) | 032-501-6856
9 | ㈜태울환경 | 인천광역시 부평구 백범로 500 2층 (십정동) | <NA>

Index | 업체명 | 도로명주소 | 전화번호
21 | 우진산업건설주식회사 | 인천광역시 부평구 안남로 440, 301호 (청천동) | 032-511-1951
22 | 일흥건설(주) | 인천광역시 부평구 부흥로365번길 3, 701호(부평동, 상인빌딩) | 032-529-8100
23 | 제이에프이앤씨㈜ | 인천광역시 부평구 충선로 169 3층 1호 (부개동) | 032-502-1131
24 | 주식회사엘림건설 | 인천광역시 부평구 영성로 46 4층 401호 (삼산동) | 032-867-7589
25 | 주식회사유현건설 | 인천광역시 부평구 주부토로81번길 50 3동 214호 (부평동) | 032-330-1640
26 | 창운건설(주) | 인천광역시 부평구 함봉로 24 2층 (십정동) | 032-433-8133
27 | 청우건설(주) | 인천광역시 부평구 경인로 737 (십정동) | 032-524-0088
28 | 태인산업개발주식회사 | 인천광역시 부평구 동수천로 45-11 2층 (부개동) | 032-504-5782
29 | 한양전문건설(주) | 인천광역시 부평구 마장로179번길 9 (산곡동) | 032-523-3337
30 | 호건종합건설(주) | 인천광역시 부평구 동수로 69 301호 (부평동) | 032-513-3211