Overview

Dataset statistics

Number of variables5
Number of observations1928
Missing cells3
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory75.4 KiB
Average record size in memory40.1 B

Variable types

Categorical2
Text3

Dataset

Description지하수 측정망 제원에 대한 내용 입니다. - 구분, 관측소명, 주소, 원시자료명, 원시자료 기관명 등을 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15104446/fileData.do

Alerts

구분 is highly overall correlated with 원시자료기관명High correlation
원시자료기관명 is highly overall correlated with 구분High correlation

Reproduction

Analysis started2023-12-12 13:22:52.059184
Analysis finished2023-12-12 13:22:52.648693
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
국가지하수관리측정망
918 
농촌지하수관측망
505 
해수침투망
259 
국가지하수오염측정망
246 

Length

Max length10
Median length10
Mean length8.8044606
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국가지하수관리측정망
2nd row국가지하수관리측정망
3rd row국가지하수관리측정망
4th row국가지하수관리측정망
5th row국가지하수관리측정망

Common Values

ValueCountFrequency (%)
국가지하수관리측정망 918
47.6%
농촌지하수관측망 505
26.2%
해수침투망 259
 
13.4%
국가지하수오염측정망 246
 
12.8%

Length

2023-12-12T22:22:52.715515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:22:52.814972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국가지하수관리측정망 918
47.6%
농촌지하수관측망 505
26.2%
해수침투망 259
 
13.4%
국가지하수오염측정망 246
 
12.8%
Distinct1791
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T22:22:53.163454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length4.7090249
Min length3

Characters and Unicode

Total characters9079
Distinct characters282
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1654 ?
Unique (%)85.8%

Sample

1st row가평가평
2nd row가평달전_심부2
3rd row가평달전_천부
4th row가평북면
5th row가평상면
ValueCountFrequency (%)
흥왕2 2
 
0.1%
포항1 2
 
0.1%
대포1 2
 
0.1%
덕호1 2
 
0.1%
포항5 2
 
0.1%
포항4 2
 
0.1%
포항3 2
 
0.1%
포항2 2
 
0.1%
마량2 2
 
0.1%
사등1 2
 
0.1%
Other values (1782) 1910
99.0%
2023-12-12T22:22:53.732551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
649
 
7.1%
_ 592
 
6.5%
567
 
6.2%
2 489
 
5.4%
1 299
 
3.3%
293
 
3.2%
275
 
3.0%
246
 
2.7%
206
 
2.3%
152
 
1.7%
Other values (272) 5311
58.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7409
81.6%
Decimal Number 1072
 
11.8%
Connector Punctuation 592
 
6.5%
Space Separator 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
649
 
8.8%
567
 
7.7%
293
 
4.0%
275
 
3.7%
246
 
3.3%
206
 
2.8%
152
 
2.1%
145
 
2.0%
126
 
1.7%
114
 
1.5%
Other values (259) 4636
62.6%
Decimal Number
ValueCountFrequency (%)
2 489
45.6%
1 299
27.9%
3 114
 
10.6%
4 80
 
7.5%
5 48
 
4.5%
6 27
 
2.5%
7 10
 
0.9%
8 4
 
0.4%
9 1
 
0.1%
Connector Punctuation
ValueCountFrequency (%)
_ 592
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7409
81.6%
Common 1670
 
18.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
649
 
8.8%
567
 
7.7%
293
 
4.0%
275
 
3.7%
246
 
3.3%
206
 
2.8%
152
 
2.1%
145
 
2.0%
126
 
1.7%
114
 
1.5%
Other values (259) 4636
62.6%
Common
ValueCountFrequency (%)
_ 592
35.4%
2 489
29.3%
1 299
17.9%
3 114
 
6.8%
4 80
 
4.8%
5 48
 
2.9%
6 27
 
1.6%
7 10
 
0.6%
8 4
 
0.2%
2
 
0.1%
Other values (3) 5
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7409
81.6%
ASCII 1670
 
18.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
649
 
8.8%
567
 
7.7%
293
 
4.0%
275
 
3.7%
246
 
3.3%
206
 
2.8%
152
 
2.1%
145
 
2.0%
126
 
1.7%
114
 
1.5%
Other values (259) 4636
62.6%
ASCII
ValueCountFrequency (%)
_ 592
35.4%
2 489
29.3%
1 299
17.9%
3 114
 
6.8%
4 80
 
4.8%
5 48
 
2.9%
6 27
 
1.6%
7 10
 
0.6%
8 4
 
0.2%
2
 
0.1%
Other values (3) 5
 
0.3%

주소
Text

Distinct1621
Distinct (%)84.1%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
2023-12-12T22:22:54.200093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length34
Mean length21.789938
Min length14

Characters and Unicode

Total characters42011
Distinct characters346
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1315 ?
Unique (%)68.2%

Sample

1st row경기도 가평군 가평읍 읍내리 624-1
2nd row경기도 가평군 가평읍 달전리 산59-6
3rd row경기도 가평군 가평읍 달전리 산59-6
4th row경기도 가평군 북면 목동리 866-2
5th row경기도 가평군 상면 임초리 643
ValueCountFrequency (%)
전라남도 335
 
3.5%
경상북도 296
 
3.1%
경상남도 247
 
2.6%
경기도 213
 
2.2%
충청남도 196
 
2.0%
전라북도 170
 
1.8%
강원특별자치도 148
 
1.5%
충청북도 123
 
1.3%
강원도 78
 
0.8%
인천광역시 32
 
0.3%
Other values (3674) 7730
80.8%
2023-12-12T22:22:54.772221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7955
 
18.9%
1970
 
4.7%
1674
 
4.0%
1 1489
 
3.5%
1283
 
3.1%
- 1154
 
2.7%
1087
 
2.6%
1002
 
2.4%
963
 
2.3%
2 867
 
2.1%
Other values (336) 22567
53.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25719
61.2%
Space Separator 7955
 
18.9%
Decimal Number 7177
 
17.1%
Dash Punctuation 1154
 
2.7%
Open Punctuation 3
 
< 0.1%
Close Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1970
 
7.7%
1674
 
6.5%
1283
 
5.0%
1087
 
4.2%
1002
 
3.9%
963
 
3.7%
808
 
3.1%
708
 
2.8%
670
 
2.6%
656
 
2.6%
Other values (322) 14898
57.9%
Decimal Number
ValueCountFrequency (%)
1 1489
20.7%
2 867
12.1%
3 746
10.4%
4 699
9.7%
5 646
9.0%
6 616
8.6%
7 575
 
8.0%
8 521
 
7.3%
0 519
 
7.2%
9 499
 
7.0%
Space Separator
ValueCountFrequency (%)
7955
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1154
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25719
61.2%
Common 16292
38.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1970
 
7.7%
1674
 
6.5%
1283
 
5.0%
1087
 
4.2%
1002
 
3.9%
963
 
3.7%
808
 
3.1%
708
 
2.8%
670
 
2.6%
656
 
2.6%
Other values (322) 14898
57.9%
Common
ValueCountFrequency (%)
7955
48.8%
1 1489
 
9.1%
- 1154
 
7.1%
2 867
 
5.3%
3 746
 
4.6%
4 699
 
4.3%
5 646
 
4.0%
6 616
 
3.8%
7 575
 
3.5%
8 521
 
3.2%
Other values (4) 1024
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25719
61.2%
ASCII 16292
38.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7955
48.8%
1 1489
 
9.1%
- 1154
 
7.1%
2 867
 
5.3%
3 746
 
4.6%
4 699
 
4.3%
5 646
 
4.0%
6 616
 
3.8%
7 575
 
3.5%
8 521
 
3.2%
Other values (4) 1024
 
6.3%
Hangul
ValueCountFrequency (%)
1970
 
7.7%
1674
 
6.5%
1283
 
5.0%
1087
 
4.2%
1002
 
3.9%
963
 
3.7%
808
 
3.1%
708
 
2.8%
670
 
2.6%
656
 
2.6%
Other values (322) 14898
57.9%
Distinct59
Distinct (%)3.1%
Missing3
Missing (%)0.2%
Memory size15.2 KiB
2023-12-12T22:22:55.092864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length28
Mean length15.597922
Min length7

Characters and Unicode

Total characters30026
Distinct characters64
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)0.6%

Sample

1st row95지하수관측망설치공사 준공보고서
2nd row2017국가지하수수질전용측정망설치보고서
3rd row2017국가지하수수질전용측정망설치보고서
4th row97지하수 관측망 설치공사 준공보고서
5th row2005 지하수 관측망 설치공사 준공보고서
ValueCountFrequency (%)
농촌지하수관측망 505
16.0%
준공보고서 371
 
11.8%
설치공사 299
 
9.5%
관측망 284
 
9.0%
해수침투관측망 259
 
8.2%
2017국가지하수수질전용측정망설치보고서 83
 
2.6%
2014국가지하수수질전용측정망설치보고서 73
 
2.3%
2016국가지하수수질전용측정망설치보고서 73
 
2.3%
2018국가지하수수질전용측정망설치보고서 72
 
2.3%
2011국가지하수수질전용측정망설치보고서 70
 
2.2%
Other values (64) 1065
33.8%
2023-12-12T22:22:55.543739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2549
 
8.5%
1909
 
6.4%
1906
 
6.3%
1648
 
5.5%
1647
 
5.5%
0 1318
 
4.4%
1230
 
4.1%
1227
 
4.1%
2 1205
 
4.0%
1181
 
3.9%
Other values (54) 14206
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24450
81.4%
Decimal Number 4336
 
14.4%
Space Separator 1230
 
4.1%
Close Punctuation 5
 
< 0.1%
Open Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2549
 
10.4%
1909
 
7.8%
1906
 
7.8%
1648
 
6.7%
1647
 
6.7%
1227
 
5.0%
1181
 
4.8%
1160
 
4.7%
1146
 
4.7%
1146
 
4.7%
Other values (41) 8931
36.5%
Decimal Number
ValueCountFrequency (%)
0 1318
30.4%
2 1205
27.8%
1 829
19.1%
9 218
 
5.0%
7 171
 
3.9%
8 148
 
3.4%
6 116
 
2.7%
3 112
 
2.6%
4 111
 
2.6%
5 108
 
2.5%
Space Separator
ValueCountFrequency (%)
1230
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24450
81.4%
Common 5576
 
18.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2549
 
10.4%
1909
 
7.8%
1906
 
7.8%
1648
 
6.7%
1647
 
6.7%
1227
 
5.0%
1181
 
4.8%
1160
 
4.7%
1146
 
4.7%
1146
 
4.7%
Other values (41) 8931
36.5%
Common
ValueCountFrequency (%)
0 1318
23.6%
1230
22.1%
2 1205
21.6%
1 829
14.9%
9 218
 
3.9%
7 171
 
3.1%
8 148
 
2.7%
6 116
 
2.1%
3 112
 
2.0%
4 111
 
2.0%
Other values (3) 118
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24450
81.4%
ASCII 5576
 
18.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2549
 
10.4%
1909
 
7.8%
1906
 
7.8%
1648
 
6.7%
1647
 
6.7%
1227
 
5.0%
1181
 
4.8%
1160
 
4.7%
1146
 
4.7%
1146
 
4.7%
Other values (41) 8931
36.5%
ASCII
ValueCountFrequency (%)
0 1318
23.6%
1230
22.1%
2 1205
21.6%
1 829
14.9%
9 218
 
3.9%
7 171
 
3.1%
8 148
 
2.7%
6 116
 
2.1%
3 112
 
2.0%
4 111
 
2.0%
Other values (3) 118
 
2.1%

원시자료기관명
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size15.2 KiB
한국농어촌공사
754 
환경부, 한국환경공단
673 
환경부, 한국수자원공사
491 
개인
 
3
거제시
 
3
Other values (4)
 
4

Length

Max length12
Median length11
Mean length9.6473029
Min length2

Unique

Unique4 ?
Unique (%)0.2%

Sample

1st row환경부, 한국수자원공사
2nd row환경부, 한국환경공단
3rd row환경부, 한국환경공단
4th row환경부, 한국수자원공사
5th row환경부, 한국수자원공사

Common Values

ValueCountFrequency (%)
한국농어촌공사 754
39.1%
환경부, 한국환경공단 673
34.9%
환경부, 한국수자원공사 491
25.5%
개인 3
 
0.2%
거제시 3
 
0.2%
남해군 1
 
0.1%
신안군 1
 
0.1%
강화군 1
 
0.1%
완도군 1
 
0.1%

Length

2023-12-12T22:22:55.705174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T22:22:55.845285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
환경부 1164
37.6%
한국농어촌공사 754
24.4%
한국환경공단 673
21.8%
한국수자원공사 491
15.9%
개인 3
 
0.1%
거제시 3
 
0.1%
남해군 1
 
< 0.1%
신안군 1
 
< 0.1%
강화군 1
 
< 0.1%
완도군 1
 
< 0.1%

Correlations

2023-12-12T22:22:55.964943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분원시자료명원시자료기관명
구분1.0000.9830.771
원시자료명0.9831.0000.834
원시자료기관명0.7710.8341.000
2023-12-12T22:22:56.073375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분원시자료기관명
구분1.0000.620
원시자료기관명0.6201.000
2023-12-12T22:22:56.162126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분원시자료기관명
구분1.0000.620
원시자료기관명0.6201.000

Missing values

2023-12-12T22:22:52.500917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:22:52.604018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분관측소명주소원시자료명원시자료기관명
0국가지하수관리측정망가평가평경기도 가평군 가평읍 읍내리 624-195지하수관측망설치공사 준공보고서환경부, 한국수자원공사
1국가지하수관리측정망가평달전_심부2경기도 가평군 가평읍 달전리 산59-62017국가지하수수질전용측정망설치보고서환경부, 한국환경공단
2국가지하수관리측정망가평달전_천부경기도 가평군 가평읍 달전리 산59-62017국가지하수수질전용측정망설치보고서환경부, 한국환경공단
3국가지하수관리측정망가평북면경기도 가평군 북면 목동리 866-297지하수 관측망 설치공사 준공보고서환경부, 한국수자원공사
4국가지하수관리측정망가평상면경기도 가평군 상면 임초리 6432005 지하수 관측망 설치공사 준공보고서환경부, 한국수자원공사
5국가지하수관리측정망가평외서경기도 가평군 외서면 고성리 552004지하수관측망설치준공보고서환경부, 한국수자원공사
6국가지하수관리측정망가평현리_심부2경기도 가평군 조종면 현리 438 조종중고등학교2017국가지하수수질전용측정망설치보고서환경부, 한국환경공단
7국가지하수관리측정망가평현리_천부경기도 가평군 조종면 현리 438 조종중고등학교2017국가지하수수질전용측정망설치보고서환경부, 한국환경공단
8국가지하수관리측정망강릉강동강원특별자치도 강릉시 강동면 언별리 1336-32016년 국가 지하수관측망 설치공사(1권역) 준공보고서환경부, 한국수자원공사
9국가지하수관리측정망강릉모전_심부2강원특별자치도 강릉시 강동면 모전리 384 강동초등학교2016국가지하수수질전용측정망설치보고서환경부, 한국환경공단
구분관측소명주소원시자료명원시자료기관명
1918해수침투망화흥1전라남도 완도군 완도읍 정도리 980-2해수침투관측망한국농어촌공사
1919해수침투망화흥2전라남도 완도군 완도읍 대신리 1234-1해수침투관측망완도군
1920해수침투망화흥2전라남도 완도군 완도읍 대신리 1245해수침투관측망한국농어촌공사
1921해수침투망화흥3전라남도 완도군 완도읍 대신리 1264해수침투관측망한국농어촌공사
1922해수침투망화흥3전라남도 완도군 완도읍 대신리 1264(1189주변)해수침투관측망한국농어촌공사
1923해수침투망효지1전라남도 신안군 지도읍 자동리 1383해수침투관측망한국농어촌공사
1924해수침투망후포1경상북도 울진군 후포면 삼율리 872해수침투관측망한국농어촌공사
1925해수침투망흥왕1인천광역시 강화군 화도면 흥왕리 1314해수침투관측망한국농어촌공사
1926해수침투망흥왕2인천광역시 강화군 화도면 흥왕리 1013해수침투관측망한국농어촌공사
1927해수침투망흥왕2인천광역시 강화군 화도면 흥왕리 1128-7해수침투관측망한국농어촌공사