Overview

Dataset statistics

Number of variables13
Number of observations25
Missing cells100
Missing cells (%)30.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.8 KiB
Average record size in memory114.3 B

Variable types

Text2
Unsupported4
Categorical7

Dataset

Description파일 다운로드
Author강남구
URLhttps://data.seoul.go.kr/dataList/OA-15010/S/1/datasetView.do

Alerts

수질검사일자 has constant value ""Constant
수질검사결과구분 has constant value ""Constant
관리기관전화번호 has constant value ""Constant
관리기관명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
소재지도로명주소 has 25 (100.0%) missing valuesMissing
위도 has 25 (100.0%) missing valuesMissing
경도 has 25 (100.0%) missing valuesMissing
지정일자 has 25 (100.0%) missing valuesMissing
약수터명 has unique valuesUnique
소재지도로명주소 is an unsupported type, check if it needs cleaning or further analysisUnsupported
위도 is an unsupported type, check if it needs cleaning or further analysisUnsupported
경도 is an unsupported type, check if it needs cleaning or further analysisUnsupported
지정일자 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 05:29:26.089262
Analysis finished2023-12-11 05:29:26.616562
Duration0.53 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

약수터명
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-11T14:29:26.770611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length2
Mean length2.56
Min length2

Characters and Unicode

Total characters64
Distinct characters40
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row개암
2nd row천의
3rd row대룡
4th row대천
5th row구룡산
ValueCountFrequency (%)
개암 1
 
4.0%
대모천 1
 
4.0%
불국사 1
 
4.0%
용두천 1
 
4.0%
인수천 1
 
4.0%
임록천 1
 
4.0%
못골 1
 
4.0%
옥수천 1
 
4.0%
실로암 1
 
4.0%
성지 1
 
4.0%
Other values (15) 15
60.0%
2023-12-11T14:29:27.166219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
 
14.1%
4
 
6.2%
3
 
4.7%
3
 
4.7%
3
 
4.7%
3
 
4.7%
2
 
3.1%
2
 
3.1%
2
 
3.1%
1 2
 
3.1%
Other values (30) 31
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60
93.8%
Decimal Number 4
 
6.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
15.0%
4
 
6.7%
3
 
5.0%
3
 
5.0%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
1
 
1.7%
Other values (28) 28
46.7%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
2 2
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60
93.8%
Common 4
 
6.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9
 
15.0%
4
 
6.7%
3
 
5.0%
3
 
5.0%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
1
 
1.7%
Other values (28) 28
46.7%
Common
ValueCountFrequency (%)
1 2
50.0%
2 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60
93.8%
ASCII 4
 
6.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
9
 
15.0%
4
 
6.7%
3
 
5.0%
3
 
5.0%
3
 
5.0%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
1
 
1.7%
Other values (28) 28
46.7%
ASCII
ValueCountFrequency (%)
1 2
50.0%
2 2
50.0%

소재지도로명주소
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B
Distinct23
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-11T14:29:27.384206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length20
Mean length19.32
Min length17

Characters and Unicode

Total characters483
Distinct characters34
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)84.0%

Sample

1st row서울특별시 강남구 개포동 산53-20
2nd row서울특별시 강남구 개포동 산53-30
3rd row서울특별시 강남구 개포동 118-21
4th row서울특별시 강남구 개포동 산53-31
5th row서울특별시 강남구 개포동 1017-7
ValueCountFrequency (%)
서울특별시 25
25.0%
강남구 25
25.0%
개포동 11
11.0%
일원동 7
 
7.0%
산53-28 2
 
2.0%
산63-32 2
 
2.0%
자곡동 2
 
2.0%
개포당 1
 
1.0%
산63-51 1
 
1.0%
산53-32 1
 
1.0%
Other values (23) 23
23.0%
2023-12-11T14:29:27.797565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
75
15.5%
26
 
5.4%
25
 
5.2%
25
 
5.2%
25
 
5.2%
25
 
5.2%
25
 
5.2%
25
 
5.2%
25
 
5.2%
24
 
5.0%
Other values (24) 183
37.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 294
60.9%
Decimal Number 93
 
19.3%
Space Separator 75
 
15.5%
Dash Punctuation 21
 
4.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
8.8%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
24
8.2%
19
 
6.5%
Other values (12) 50
17.0%
Decimal Number
ValueCountFrequency (%)
3 23
24.7%
1 17
18.3%
2 14
15.1%
5 12
12.9%
4 7
 
7.5%
6 7
 
7.5%
0 4
 
4.3%
9 3
 
3.2%
8 3
 
3.2%
7 3
 
3.2%
Space Separator
ValueCountFrequency (%)
75
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 294
60.9%
Common 189
39.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
8.8%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
24
8.2%
19
 
6.5%
Other values (12) 50
17.0%
Common
ValueCountFrequency (%)
75
39.7%
3 23
 
12.2%
- 21
 
11.1%
1 17
 
9.0%
2 14
 
7.4%
5 12
 
6.3%
4 7
 
3.7%
6 7
 
3.7%
0 4
 
2.1%
9 3
 
1.6%
Other values (2) 6
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 294
60.9%
ASCII 189
39.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
75
39.7%
3 23
 
12.2%
- 21
 
11.1%
1 17
 
9.0%
2 14
 
7.4%
5 12
 
6.3%
4 7
 
3.7%
6 7
 
3.7%
0 4
 
2.1%
9 3
 
1.6%
Other values (2) 6
 
3.2%
Hangul
ValueCountFrequency (%)
26
8.8%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
25
8.5%
24
8.2%
19
 
6.5%
Other values (12) 50
17.0%

위도
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B

경도
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B

지정일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B
Distinct5
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
150
100
200
300
250

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row150
2nd row200
3rd row150
4th row100
5th row150

Common Values

ValueCountFrequency (%)
150 7
28.0%
100 7
28.0%
200 6
24.0%
300 3
12.0%
250 2
 
8.0%

Length

2023-12-11T14:29:27.932074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T14:29:28.076455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
150 7
28.0%
100 7
28.0%
200 6
24.0%
300 3
12.0%
250 2
 
8.0%

수질검사일자
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2019-08-12
25 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-08-12
2nd row2019-08-12
3rd row2019-08-12
4th row2019-08-12
5th row2019-08-12

Common Values

ValueCountFrequency (%)
2019-08-12 25
100.0%

Length

2023-12-11T14:29:28.196437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T14:29:28.314028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-08-12 25
100.0%

수질검사결과구분
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
부적합
25 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부적합
2nd row부적합
3rd row부적합
4th row부적합
5th row부적합

Common Values

ValueCountFrequency (%)
부적합 25
100.0%

Length

2023-12-11T14:29:28.418637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T14:29:28.513352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부적합 25
100.0%

부적합항목
Categorical

Distinct4
Distinct (%)16.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
총대장균군, 대장균군 검출
12 
총대장균군 검출
10 
일반세균,대장균군,총대장균군검출
미검사
 
1

Length

Max length17
Median length14
Mean length11.4
Min length3

Unique

Unique1 ?
Unique (%)4.0%

Sample

1st row총대장균군, 대장균군 검출
2nd row총대장균군, 대장균군 검출
3rd row총대장균군, 대장균군 검출
4th row총대장균군 검출
5th row총대장균군 검출

Common Values

ValueCountFrequency (%)
총대장균군, 대장균군 검출 12
48.0%
총대장균군 검출 10
40.0%
일반세균,대장균군,총대장균군검출 2
 
8.0%
미검사 1
 
4.0%

Length

2023-12-11T14:29:28.617055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T14:29:29.017289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
총대장균군 22
37.3%
검출 22
37.3%
대장균군 12
20.3%
일반세균,대장균군,총대장균군검출 2
 
3.4%
미검사 1
 
1.7%

관리기관전화번호
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
02-3423-6259
25 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row02-3423-6259
2nd row02-3423-6259
3rd row02-3423-6259
4th row02-3423-6259
5th row02-3423-6259

Common Values

ValueCountFrequency (%)
02-3423-6259 25
100.0%

Length

2023-12-11T14:29:29.143549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T14:29:29.257946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
02-3423-6259 25
100.0%

관리기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
서울특별시 강남구청
25 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시 강남구청
2nd row서울특별시 강남구청
3rd row서울특별시 강남구청
4th row서울특별시 강남구청
5th row서울특별시 강남구청

Common Values

ValueCountFrequency (%)
서울특별시 강남구청 25
100.0%

Length

2023-12-11T14:29:29.401978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T14:29:29.530234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 25
50.0%
강남구청 25
50.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2019-08-23
25 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-08-23
2nd row2019-08-23
3rd row2019-08-23
4th row2019-08-23
5th row2019-08-23

Common Values

ValueCountFrequency (%)
2019-08-23 25
100.0%

Length

2023-12-11T14:29:29.642031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T14:29:29.740759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-08-23 25
100.0%

Correlations

2023-12-11T14:29:29.801744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
약수터명소재지지번주소일평균이용인구수부적합항목
약수터명1.0001.0001.0001.000
소재지지번주소1.0001.0001.0001.000
일평균이용인구수1.0001.0001.0000.298
부적합항목1.0001.0000.2981.000
2023-12-11T14:29:29.909267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일평균이용인구수부적합항목
일평균이용인구수1.0000.226
부적합항목0.2261.000
2023-12-11T14:29:30.003523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일평균이용인구수부적합항목
일평균이용인구수1.0000.226
부적합항목0.2261.000

Missing values

2023-12-11T14:29:26.331918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T14:29:26.533823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

약수터명소재지도로명주소소재지지번주소위도경도지정일자일평균이용인구수수질검사일자수질검사결과구분부적합항목관리기관전화번호관리기관명데이터기준일자
0개암<NA>서울특별시 강남구 개포동 산53-20<NA><NA><NA>1502019-08-12부적합총대장균군, 대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
1천의<NA>서울특별시 강남구 개포동 산53-30<NA><NA><NA>2002019-08-12부적합총대장균군, 대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
2대룡<NA>서울특별시 강남구 개포동 118-21<NA><NA><NA>1502019-08-12부적합총대장균군, 대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
3대천<NA>서울특별시 강남구 개포동 산53-31<NA><NA><NA>1002019-08-12부적합총대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
4구룡산<NA>서울특별시 강남구 개포동 1017-7<NA><NA><NA>1502019-08-12부적합총대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
5습지원<NA>서울특별시 강남구 개포동 산53-42<NA><NA><NA>1002019-08-12부적합총대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
6율암<NA>서울특별시 강남구 세곡동 산52-44<NA><NA><NA>1002019-08-12부적합총대장균군, 대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
7만수정<NA>서울특별시 강남구 자곡동 산15<NA><NA><NA>2002019-08-12부적합미검사02-3423-6259서울특별시 강남구청2019-08-23
8구룡천2<NA>서울특별시 강남구 개포동 산53-28<NA><NA><NA>3002019-08-12부적합총대장균군, 대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
9매봉<NA>서울특별시 강남구 도곡동 산31-3<NA><NA><NA>2002019-08-12부적합총대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
약수터명소재지도로명주소소재지지번주소위도경도지정일자일평균이용인구수수질검사일자수질검사결과구분부적합항목관리기관전화번호관리기관명데이터기준일자
15옛2<NA>서울특별시 강남구 일원동 산63-32<NA><NA><NA>2502019-08-12부적합총대장균군, 대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
16성지<NA>서울특별시 강남구 일원동 산63-32<NA><NA><NA>2502019-08-12부적합총대장균군, 대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
17실로암<NA>서울특별시 강남구 일원동 산63-51<NA><NA><NA>1002019-08-12부적합총대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
18옥수천<NA>서울특별시 강남구 개포동 산53-32<NA><NA><NA>1002019-08-12부적합총대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
19못골<NA>서울특별시 강남구 자곡동 산39-10<NA><NA><NA>1502019-08-12부적합일반세균,대장균군,총대장균군검출02-3423-6259서울특별시 강남구청2019-08-23
20임록천<NA>서울특별시 강남구 개포당 산53-41<NA><NA><NA>1002019-08-12부적합총대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
21인수천<NA>서울특별시 강남구 일원동 산63-1<NA><NA><NA>3002019-08-12부적합총대장균군, 대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
22용두천<NA>서울특별시 강남구 개포동 192<NA><NA><NA>1502019-08-12부적합총대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
23불국사<NA>서울특별시 강남구 일원동 441<NA><NA><NA>2002019-08-12부적합총대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23
24구룡천1<NA>서울특별시 강남구 개포동 산53-28<NA><NA><NA>3002019-08-12부적합총대장균군, 대장균군 검출02-3423-6259서울특별시 강남구청2019-08-23