Overview

Dataset statistics

Number of variables: 2
Number of observations: 116
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.9 KiB
Average record size in memory: 17.1 B

Variable types

Text: 2

Dataset

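Description: Provides information on groundwater development and construction contractors nationwide, with the fields listed below. (Telephone numbers are not provided.) Fields provided: company name (상호명), address (주소).
Author: Korea Water Resources Corporation (한국수자원공사)
URL: https://www.data.go.kr/data/15054554/fileData.do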

Reproduction

Analysis started: 2024-05-11 10:48:02.767266
Analysis finished: 2024-05-11 10:48:03.997871
Duration: 1.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
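A report of this kind can presumably be regenerated along the following lines. This is a minimal sketch, not the invocation actually used: the function name `build_report`, the CSV file name, its encoding, and the report title are all assumptions. Only `pandas.read_csv`, the `ProfileReport` constructor, and its `to_file` method are relied on.

```python
def build_report(csv_path, out_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write the HTML report to out_path."""
    # Imported lazily so this module still loads where the
    # third-party packages are not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the source file is a CSV. If utf-8 decoding fails,
    # a Korean legacy encoding such as "cp949" may be worth trying.
    df = pd.read_csv(csv_path, encoding=encoding)

    # The title is illustrative only.
    report = ProfileReport(df, title="Groundwater contractors")
    report.to_file(out_path)
    return out_path
```

A call such as `build_report("groundwater_contractors.csv")` (hypothetical file name) would write `report.html` to the working directory.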

Variables

상호명 (company name)
Text

Distinct: 107
Distinct (%): 92.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 18
Median length: 14
Mean length: 8.7155172
Min length: 4
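The length figures are per-value character counts. The reported mean agrees with the totals elsewhere in this report (1011 characters over 116 observations). The sketch below shows one way such statistics could be computed, using the five sample rows of this variable rather than the full column. The helper name `length_stats` is illustrative and not part of ydata-profiling.

```python
from statistics import mean, median

def length_stats(values):
    """Return max, median, mean and min of the string lengths."""
    # len() counts Unicode code points, so each Hangul syllable counts as 1.
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }

# The five sample rows shown for this variable.
sample = ["(주)그린비즈", "(주)금산기술환경", "(주)금송ENG", "(주)다산컨설턴트", "(주)대우건설"]
stats = length_stats(sample)
```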

Characters and Unicode

Total characters: 1011
Distinct characters: 155
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
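As a rough illustration of how per-category counts can be derived, the sketch below tallies the Unicode general category of every character with Python's standard `unicodedata` module. It runs on the five sample rows only. The helper name `category_counts` is an assumption. The standard library exposes general categories but not scripts or blocks, so those tables are not reproduced here.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count characters by Unicode general category (two-letter code)."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)

# The five sample rows shown for this variable.
sample = ["(주)그린비즈", "(주)금산기술환경", "(주)금송ENG", "(주)다산컨설턴트", "(주)대우건설"]
counts = category_counts(sample)

# Two-letter codes map to the names used in the report:
#   Lo = Other Letter, Ps = Open Punctuation,
#   Pe = Close Punctuation, Lu = Uppercase Letter
```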

Unique

Unique: 98
Unique (%): 84.5%
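"Distinct" and "Unique" measure different things: distinct counts the different values present, while unique counts the values that occur exactly once. The figures for this variable (116 observations, 107 distinct, 98 unique) imply 98 values occurring once and 9 values occurring twice. The sketch below reproduces those numbers on a synthetic column of that shape. The placeholder strings and the helper name `distinct_and_unique` are illustrative.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Synthetic column: 98 values that occur once and 9 values that occur twice.
values = [f"single-{i}" for i in range(98)] + [f"pair-{i}" for i in range(9)] * 2
result = distinct_and_unique(values)
```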

Sample

1st row: (주)그린비즈
2nd row: (주)금산기술환경
3rd row: (주)금송ENG
4th row: (주)다산컨설턴트
5th row: (주)대우건설

Value    Count    Frequency (%)
주식회사    11    8.6%
주)이데아이엔에스    2    1.6%
주)한국종합기술    2    1.6%
벽산엔지니어링(주    2    1.6%
주)동해종합기술공사    2    1.6%
유성삼정개발(주    2    1.6%
현대엔지니어링(주    2    1.6%
㈜한서엔지니어링    2    1.6%
주)동명엔터프라이즈    2    1.6%
서정엔지니어링㈜    2    1.6%
Other values (99)    99    77.3%
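Tokens in the word-frequency table such as "주)이데아이엔에스" and "벽산엔지니어링(주" lack one parenthesis. That pattern is consistent with splitting each value on whitespace and then stripping ASCII punctuation from both ends of each token. The sketch below illustrates that behaviour. It is an assumption about the tokenization, not a copy of the tool's code, and the first two input strings are reconstructed from the tokens rather than taken from the data.

```python
import string
from collections import Counter

def word_counts(values):
    """Split on whitespace, strip ASCII punctuation from token ends, count."""
    strip_chars = string.punctuation + string.whitespace
    words = Counter()
    for value in values:
        for token in value.split():
            token = token.strip(strip_chars)
            if token:  # skip tokens that were punctuation only
                words[token] += 1
    return words

# "㈜" is a single non-ASCII symbol, so it is not stripped.
sample = ["(주)이데아이엔에스", "벽산엔지니어링(주)", "㈜한서엔지니어링", "환경관리 주식회사"]
counts = word_counts(sample)
```

Stripping only at the token ends explains why the inner parenthesis survives: the Hangul character next to it stops the strip.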

Most occurring characters

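Value    Count    Frequency (%)
[not preserved]    102    10.1%
(    87    8.6%
)    87    8.6%
[not preserved]    42    4.2%
[not preserved]    36    3.6%
[not preserved]    33    3.3%
[not preserved]    25    2.5%
[not preserved]    25    2.5%
[not preserved]    24    2.4%
[not preserved]    22    2.2%
Other values (145)    528    52.2%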

Most occurring categories

Value    Count    Frequency (%)
Other Letter    811    80.2%
Open Punctuation    87    8.6%
Close Punctuation    87    8.6%
Space Separator    12    1.2%
Other Symbol    8    0.8%
Uppercase Letter    3    0.3%
Decimal Number    2    0.2%
Other Punctuation    1    0.1%

Most frequent character per category

Other Letter
Value    Count    Frequency (%)
[not preserved]    102    12.6%
[not preserved]    42    5.2%
[not preserved]    36    4.4%
[not preserved]    33    4.1%
[not preserved]    25    3.1%
[not preserved]    25    3.1%
[not preserved]    24    3.0%
[not preserved]    22    2.7%
[not preserved]    21    2.6%
[not preserved]    20    2.5%
Other values (135)    461    56.8%

Uppercase Letter
Value    Count    Frequency (%)
E    1    33.3%
N    1    33.3%
G    1    33.3%

Decimal Number
Value    Count    Frequency (%)
2    1    50.0%
1    1    50.0%

Open Punctuation
Value    Count    Frequency (%)
(    87    100.0%

Close Punctuation
Value    Count    Frequency (%)
)    87    100.0%

Space Separator
Value    Count    Frequency (%)
(space)    12    100.0%

Other Symbol
Value    Count    Frequency (%)
[not preserved]    8    100.0%

Other Punctuation
Value    Count    Frequency (%)
.    1    100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    819    81.0%
Common    189    18.7%
Latin    3    0.3%

Most frequent character per script

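Hangul
Value    Count    Frequency (%)
[not preserved]    102    12.5%
[not preserved]    42    5.1%
[not preserved]    36    4.4%
[not preserved]    33    4.0%
[not preserved]    25    3.1%
[not preserved]    25    3.1%
[not preserved]    24    2.9%
[not preserved]    22    2.7%
[not preserved]    21    2.6%
[not preserved]    20    2.4%
Other values (136)    469    57.3%

Common
Value    Count    Frequency (%)
(    87    46.0%
)    87    46.0%
(space)    12    6.3%
2    1    0.5%
.    1    0.5%
1    1    0.5%

Latin
Value    Count    Frequency (%)
E    1    33.3%
N    1    33.3%
G    1    33.3%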

Most occurring blocks

Value    Count    Frequency (%)
Hangul    811    80.2%
ASCII    192    19.0%
None    8    0.8%

Most frequent character per block

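Hangul
Value    Count    Frequency (%)
[not preserved]    102    12.6%
[not preserved]    42    5.2%
[not preserved]    36    4.4%
[not preserved]    33    4.1%
[not preserved]    25    3.1%
[not preserved]    25    3.1%
[not preserved]    24    3.0%
[not preserved]    22    2.7%
[not preserved]    21    2.6%
[not preserved]    20    2.5%
Other values (135)    461    56.8%

ASCII
Value    Count    Frequency (%)
(    87    45.3%
)    87    45.3%
(space)    12    6.2%
2    1    0.5%
.    1    0.5%
1    1    0.5%
E    1    0.5%
N    1    0.5%
G    1    0.5%

None
Value    Count    Frequency (%)
[not preserved]    8    100.0%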

주소 (address)
Text

Distinct: 113
Distinct (%): 97.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.0 KiB

Length

Max length: 62
Median length: 42
Mean length: 31.12931
Min length: 18

Characters and Unicode

Total characters: 3611
Distinct characters: 272
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 110
Unique (%): 94.8%

Sample

1st row: 경기도 용인시 기흥구 기흥단지로 81-13 (고매동)
2nd row: 경기도 안양시 만안구 안양로 424, 4층 401호 (안양동)
3rd row: 부산 해운대구 APEC로 17, 센텀 리더스마크 2702호
4th row: 경상북도 구미시 구미중앙로42길 5-66, 4층 (송정동)
5th row: 서울특별시 종로구 새문안로 75 (신문로1가)

Value    Count    Frequency (%)
서울특별시    45    6.2%
경기도    35    4.8%
성남시    12    1.7%
안양시    9    1.2%
송파구    9    1.2%
강남구    8    1.1%
종로구    7    1.0%
분당구    7    1.0%
2층    6    0.8%
동안구    6    0.8%
Other values (468)    583    80.2%

Most occurring characters

Value    Count    Frequency (%)
(space)    611    16.9%
[not preserved]    127    3.5%
[not preserved]    112    3.1%
[not preserved]    108    3.0%
1    108    3.0%
)    93    2.6%
(    93    2.6%
[not preserved]    91    2.5%
2    90    2.5%
,    78    2.2%
Other values (262)    2100    58.2%

Most occurring categories

Value    Count    Frequency (%)
Other Letter    2119    58.7%
Space Separator    611    16.9%
Decimal Number    561    15.5%
Close Punctuation    93    2.6%
Open Punctuation    93    2.6%
Other Punctuation    78    2.2%
Uppercase Letter    25    0.7%
Dash Punctuation    22    0.6%
Lowercase Letter    9    0.2%

Most frequent character per category

Other Letter
Value    Count    Frequency (%)
[not preserved]    127    6.0%
[not preserved]    112    5.3%
[not preserved]    108    5.1%
[not preserved]    91    4.3%
[not preserved]    69    3.3%
[not preserved]    65    3.1%
[not preserved]    50    2.4%
[not preserved]    46    2.2%
[not preserved]    46    2.2%
[not preserved]    45    2.1%
Other values (225)    1360    64.2%

Uppercase Letter
Value    Count    Frequency (%)
A    4    16.0%
S    4    16.0%
T    3    12.0%
K    3    12.0%
E    2    8.0%
W    1    4.0%
O    1    4.0%
V    1    4.0%
X    1    4.0%
R    1    4.0%
Other values (4)    4    16.0%

Decimal Number
Value    Count    Frequency (%)
1    108    19.3%
2    90    16.0%
0    62    11.1%
4    54    9.6%
3    53    9.4%
6    48    8.6%
8    44    7.8%
5    38    6.8%
7    34    6.1%
9    30    5.3%

Lowercase Letter
Value    Count    Frequency (%)
r    2    22.2%
e    1    11.1%
w    1    11.1%
o    1    11.1%
t    1    11.1%
s    1    11.1%
i    1    11.1%
n    1    11.1%

Space Separator
Value    Count    Frequency (%)
(space)    611    100.0%

Close Punctuation
Value    Count    Frequency (%)
)    93    100.0%

Open Punctuation
Value    Count    Frequency (%)
(    93    100.0%

Other Punctuation
Value    Count    Frequency (%)
,    78    100.0%

Dash Punctuation
Value    Count    Frequency (%)
-    22    100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    2119    58.7%
Common    1458    40.4%
Latin    34    0.9%

Most frequent character per script

Hangul
Value    Count    Frequency (%)
[not preserved]    127    6.0%
[not preserved]    112    5.3%
[not preserved]    108    5.1%
[not preserved]    91    4.3%
[not preserved]    69    3.3%
[not preserved]    65    3.1%
[not preserved]    50    2.4%
[not preserved]    46    2.2%
[not preserved]    46    2.2%
[not preserved]    45    2.1%
Other values (225)    1360    64.2%

Latin
Value    Count    Frequency (%)
A    4    11.8%
S    4    11.8%
T    3    8.8%
K    3    8.8%
r    2    5.9%
E    2    5.9%
W    1    2.9%
O    1    2.9%
V    1    2.9%
X    1    2.9%
Other values (12)    12    35.3%

Common
Value    Count    Frequency (%)
(space)    611    41.9%
1    108    7.4%
)    93    6.4%
(    93    6.4%
2    90    6.2%
,    78    5.3%
0    62    4.3%
4    54    3.7%
3    53    3.6%
6    48    3.3%
Other values (5)    168    11.5%

Most occurring blocks

Value    Count    Frequency (%)
Hangul    2119    58.7%
ASCII    1492    41.3%
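A Unicode block is a contiguous range of code points, so a block tally reduces to range checks on each character's code point. The sketch below is a simplified illustration covering only the two blocks that appear for this variable. It assumes the report's "Hangul" label corresponds to the Hangul Syllables range U+AC00 to U+D7AF. The function names and the "Other" fallback label are assumptions, and a real implementation would consult the full Unicode block table.

```python
from collections import Counter

def block_of(ch):
    """Classify a character into a coarse Unicode block label."""
    cp = ord(ch)
    if cp < 0x80:
        return "ASCII"          # Basic Latin, U+0000..U+007F
    if 0xAC00 <= cp <= 0xD7AF:
        return "Hangul"         # Hangul Syllables block (assumed mapping)
    return "Other"              # anything this sketch does not cover

def block_counts(text):
    """Count the characters of text per block label."""
    return Counter(block_of(ch) for ch in text)

# Fifth sample row of this variable.
counts = block_counts("서울특별시 종로구 새문안로 75 (신문로1가)")
```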

Most frequent character per block

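ASCII
Value    Count    Frequency (%)
(space)    611    41.0%
1    108    7.2%
)    93    6.2%
(    93    6.2%
2    90    6.0%
,    78    5.2%
0    62    4.2%
4    54    3.6%
3    53    3.6%
6    48    3.2%
Other values (27)    202    13.5%

Hangul
Value    Count    Frequency (%)
[not preserved]    127    6.0%
[not preserved]    112    5.3%
[not preserved]    108    5.1%
[not preserved]    91    4.3%
[not preserved]    69    3.3%
[not preserved]    65    3.1%
[not preserved]    50    2.4%
[not preserved]    46    2.2%
[not preserved]    46    2.2%
[not preserved]    45    2.1%
Other values (225)    1360    64.2%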

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.
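A nullity matrix is essentially a grid of present/absent flags, one per cell. The sketch below builds such a grid and derives per-column missing counts from it. It is illustrative only: the helper names are assumptions, empty strings are treated as missing by assumption, and the second input row is hypothetical (the actual dataset reports no missing cells).

```python
def nullity_matrix(rows, columns):
    """Return one list of booleans per row; True means the cell is present."""
    return [[row.get(col) not in (None, "") for col in columns] for row in rows]

def missing_per_column(rows, columns):
    """Count absent cells per column from the nullity matrix."""
    matrix = nullity_matrix(rows, columns)
    return {
        col: sum(1 for flags in matrix if not flags[i])
        for i, col in enumerate(columns)
    }

columns = ["상호명", "주소"]
rows = [
    {"상호명": "(주)대우건설", "주소": "서울특별시 종로구 새문안로 75 (신문로1가)"},
    {"상호명": "example-company", "주소": ""},  # hypothetical row with a missing address
]
missing = missing_per_column(rows, columns)
```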

Sample

First rows

Index    상호명 (company name)    주소 (address)
0    (주)그린비즈    경기도 용인시 기흥구 기흥단지로 81-13 (고매동)
1    (주)금산기술환경    경기도 안양시 만안구 안양로 424, 4층 401호 (안양동)
2    (주)금송ENG    부산 해운대구 APEC로 17, 센텀 리더스마크 2702호
3    (주)다산컨설턴트    경상북도 구미시 구미중앙로42길 5-66, 4층 (송정동)
4    (주)대우건설    서울특별시 종로구 새문안로 75 (신문로1가)
5    (주)대일이앤씨    경기도 여주시 북내면 외재로 176
6    (주)동명엔터프라이즈    서울특별시 강남구 도곡로 131 (역삼동)
7    (주)동명엔터프라이즈    서울특별시 서초구 남부순환로 2471, 동명빌딩 (서초동)
8    (주)동해종합기술공사    서울특별시 성동구 광나루로6길 35, 성수동 우림 이비즈센터 502호 (성수동2가)
9    (주)동해종합기술공사    서울특별시 성동구 광나루로6길 35, 610호 (성수동2가, 성수동 우림 이비즈센터)

Last rows

Index    상호명 (company name)    주소 (address)
106    한국수자원공사    대전광역시 대덕구 연축동 산6번지 2호
107    한솔이엠이주식회사    경기도 성남시 분당구 분당로 55 (서현동,First Tower 7층(분당구 분당로 29))
108    해림기술단(주)    서울특별시 성동구 왕십리로10길 9-14 (성수동1가)
109    현대건설(주)    서울특별시 종로구 율곡로 75 (계동)
110    현대엔지니어링(주)    서울특별시 종로구 율곡로 75 (계동)
111    현대엔지니어링(주)    서울특별시 양천구 목동동로 293 (목동)
112    형제건설    경기도 화성시 동탄면 오산리 886번지 4호
113    화성산업(주)    대구광역시 수성구 동대구로 111, 황금빌딩 (황금동)
114    환경관리 주식회사    인천광역시 연수구 송도과학로 32 (송도동)
115    환경시설관리 주식회사    경기도 안양시 만안구 일직로 88 (석수동)