Overview

Dataset statistics

Number of variables5
Number of observations26
Missing cells7
Missing cells (%)5.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory46.1 B

Variable types

Text4
Numeric1

Dataset

Description전라북도 군산시 소재한 하폐수처리시설 현황(시설물명, 도로명주소, 지번주소, 시설용량, 처리방법). 2023년 6월 16일 현재 사용개시공고된 처리시설.
URLhttps://www.data.go.kr/data/3080409/fileData.do

Alerts

도로명주소 has 7 (26.9%) missing valuesMissing
시설물명 has unique valuesUnique

Reproduction

Analysis started2023-12-13 00:32:00.787607
Analysis finished2023-12-13 00:32:01.228320
Duration0.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시설물명
Text

UNIQUE 

Distinct26
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T09:32:01.343666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length8.9615385
Min length7

Characters and Unicode

Total characters233
Distinct characters59
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)100.0%

Sample

1st row공공하수처리장
2nd row폐수종말처리장
3rd row하수슬러지자원화시설
4th row대야하수처리장
5th row옥서하수처리장
ValueCountFrequency (%)
하수처리시설 9
22.5%
폐수종말처리장 2
 
5.0%
공공하수처리장 1
 
2.5%
대위 1
 
2.5%
어은하수처리시설 1
 
2.5%
선유도하수처리시설 1
 
2.5%
무녀도하수처리시설 1
 
2.5%
신시도하수처리시설 1
 
2.5%
창오하수처리시설 1
 
2.5%
가산하수처리시설 1
 
2.5%
Other values (21) 21
52.5%
2023-12-13T09:32:01.596398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28
12.0%
25
 
10.7%
25
 
10.7%
24
 
10.3%
19
 
8.2%
19
 
8.2%
14
 
6.0%
8
 
3.4%
4
 
1.7%
4
 
1.7%
Other values (49) 63
27.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 218
93.6%
Space Separator 14
 
6.0%
Decimal Number 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
28
12.8%
25
11.5%
25
11.5%
24
11.0%
19
 
8.7%
19
 
8.7%
8
 
3.7%
4
 
1.8%
4
 
1.8%
3
 
1.4%
Other values (47) 59
27.1%
Space Separator
ValueCountFrequency (%)
14
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 218
93.6%
Common 15
 
6.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
28
12.8%
25
11.5%
25
11.5%
24
11.0%
19
 
8.7%
19
 
8.7%
8
 
3.7%
4
 
1.8%
4
 
1.8%
3
 
1.4%
Other values (47) 59
27.1%
Common
ValueCountFrequency (%)
14
93.3%
1 1
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 218
93.6%
ASCII 15
 
6.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
28
12.8%
25
11.5%
25
11.5%
24
11.0%
19
 
8.7%
19
 
8.7%
8
 
3.7%
4
 
1.8%
4
 
1.8%
3
 
1.4%
Other values (47) 59
27.1%
ASCII
ValueCountFrequency (%)
14
93.3%
1 1
 
6.7%

도로명주소
Text

MISSING 

Distinct17
Distinct (%)89.5%
Missing7
Missing (%)26.9%
Memory size340.0 B
2023-12-13T09:32:01.758486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length25
Mean length21.526316
Min length16

Characters and Unicode

Total characters409
Distinct characters70
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)78.9%

Sample

1st row전라북도 군산시 서해로 289(소룡동)
2nd row전라북도 군산시 외항로1350(비응도동)
3rd row전라북도 군산시 서해로 289(소룡동)
4th row전라북도 군산시 대야면 석화들길 178
5th row전라북도 군산시 옥서면 옥구저수지로 205-40
ValueCountFrequency (%)
전라북도 19
21.8%
군산시 19
21.8%
회현면 4
 
4.6%
임피면 2
 
2.3%
나포면 2
 
2.3%
289(소룡동 2
 
2.3%
외항로1350(비응도동 2
 
2.3%
서해로 2
 
2.3%
무도도4길 1
 
1.1%
옥도면 1
 
1.1%
Other values (33) 33
37.9%
2023-12-13T09:32:02.006843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
77
18.8%
24
 
5.9%
21
 
5.1%
20
 
4.9%
19
 
4.6%
19
 
4.6%
19
 
4.6%
19
 
4.6%
14
 
3.4%
13
 
3.2%
Other values (60) 164
40.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 255
62.3%
Space Separator 77
 
18.8%
Decimal Number 64
 
15.6%
Dash Punctuation 5
 
1.2%
Open Punctuation 4
 
1.0%
Close Punctuation 4
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
9.4%
21
 
8.2%
20
 
7.8%
19
 
7.5%
19
 
7.5%
19
 
7.5%
19
 
7.5%
14
 
5.5%
13
 
5.1%
6
 
2.4%
Other values (46) 81
31.8%
Decimal Number
ValueCountFrequency (%)
1 10
15.6%
3 9
14.1%
2 8
12.5%
4 8
12.5%
7 7
10.9%
0 6
9.4%
8 5
7.8%
9 4
 
6.2%
6 4
 
6.2%
5 3
 
4.7%
Space Separator
ValueCountFrequency (%)
77
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 255
62.3%
Common 154
37.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
9.4%
21
 
8.2%
20
 
7.8%
19
 
7.5%
19
 
7.5%
19
 
7.5%
19
 
7.5%
14
 
5.5%
13
 
5.1%
6
 
2.4%
Other values (46) 81
31.8%
Common
ValueCountFrequency (%)
77
50.0%
1 10
 
6.5%
3 9
 
5.8%
2 8
 
5.2%
4 8
 
5.2%
7 7
 
4.5%
0 6
 
3.9%
- 5
 
3.2%
8 5
 
3.2%
( 4
 
2.6%
Other values (4) 15
 
9.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 255
62.3%
ASCII 154
37.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
77
50.0%
1 10
 
6.5%
3 9
 
5.8%
2 8
 
5.2%
4 8
 
5.2%
7 7
 
4.5%
0 6
 
3.9%
- 5
 
3.2%
8 5
 
3.2%
( 4
 
2.6%
Other values (4) 15
 
9.7%
Hangul
ValueCountFrequency (%)
24
 
9.4%
21
 
8.2%
20
 
7.8%
19
 
7.5%
19
 
7.5%
19
 
7.5%
19
 
7.5%
14
 
5.5%
13
 
5.1%
6
 
2.4%
Other values (46) 81
31.8%
Distinct24
Distinct (%)92.3%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T09:32:02.192334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length21.461538
Min length16

Characters and Unicode

Total characters558
Distinct characters69
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)84.6%

Sample

1st row전라북도 군산시 소룡동 1584
2nd row전라북도 군산시 비응도동 21
3rd row전라북도 군산시 소룡동 1584
4th row전라북도 군산시 대야면 산원리 27-15
5th row전라북도 군산시 옥서면 옥봉리 1809-1
ValueCountFrequency (%)
전라북도 26
20.8%
군산시 26
20.8%
회현면 5
 
4.0%
옥도면 4
 
3.2%
나포면 4
 
3.2%
임피면 2
 
1.6%
옥구읍 2
 
1.6%
서포리 2
 
1.6%
월연리 2
 
1.6%
비응도동 2
 
1.6%
Other values (47) 50
40.0%
2023-12-13T09:32:02.468530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
99
17.7%
36
 
6.5%
31
 
5.6%
27
 
4.8%
26
 
4.7%
26
 
4.7%
26
 
4.7%
26
 
4.7%
1 22
 
3.9%
21
 
3.8%
Other values (59) 218
39.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 339
60.8%
Decimal Number 103
 
18.5%
Space Separator 99
 
17.7%
Dash Punctuation 17
 
3.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
36
 
10.6%
31
 
9.1%
27
 
8.0%
26
 
7.7%
26
 
7.7%
26
 
7.7%
26
 
7.7%
21
 
6.2%
20
 
5.9%
10
 
2.9%
Other values (47) 90
26.5%
Decimal Number
ValueCountFrequency (%)
1 22
21.4%
2 14
13.6%
7 13
12.6%
5 11
10.7%
8 10
9.7%
0 9
8.7%
4 7
 
6.8%
9 7
 
6.8%
3 6
 
5.8%
6 4
 
3.9%
Space Separator
ValueCountFrequency (%)
99
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 339
60.8%
Common 219
39.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
36
 
10.6%
31
 
9.1%
27
 
8.0%
26
 
7.7%
26
 
7.7%
26
 
7.7%
26
 
7.7%
21
 
6.2%
20
 
5.9%
10
 
2.9%
Other values (47) 90
26.5%
Common
ValueCountFrequency (%)
99
45.2%
1 22
 
10.0%
- 17
 
7.8%
2 14
 
6.4%
7 13
 
5.9%
5 11
 
5.0%
8 10
 
4.6%
0 9
 
4.1%
4 7
 
3.2%
9 7
 
3.2%
Other values (2) 10
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 339
60.8%
ASCII 219
39.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
99
45.2%
1 22
 
10.0%
- 17
 
7.8%
2 14
 
6.4%
7 13
 
5.9%
5 11
 
5.0%
8 10
 
4.6%
0 9
 
4.1%
4 7
 
3.2%
9 7
 
3.2%
Other values (2) 10
 
4.6%
Hangul
ValueCountFrequency (%)
36
 
10.6%
31
 
9.1%
27
 
8.0%
26
 
7.7%
26
 
7.7%
26
 
7.7%
26
 
7.7%
21
 
6.2%
20
 
5.9%
10
 
2.9%
Other values (47) 90
26.5%
Distinct20
Distinct (%)76.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean9641.1538
Minimum30
Maximum200000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size366.0 B
2023-12-13T09:32:02.560724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile30
Q157.5
median150
Q3550
95-th percentile25750
Maximum200000
Range199970
Interquartile range (IQR)492.5

Descriptive statistics

Standard deviation39324.476
Coefficient of variation (CV)4.0788142
Kurtosis24.542421
Mean9641.1538
Median Absolute Deviation (MAD)120
Skewness4.9080459
Sum250670
Variance1.5464144 × 109
MonotonicityNot monotonic
2023-12-13T09:32:02.648319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
30 4
 
15.4%
150 3
 
11.5%
550 2
 
7.7%
200000 1
 
3.8%
100 1
 
3.8%
210 1
 
3.8%
390 1
 
3.8%
120 1
 
3.8%
92 1
 
3.8%
330 1
 
3.8%
Other values (10) 10
38.5%
ValueCountFrequency (%)
30 4
15.4%
40 1
 
3.8%
48 1
 
3.8%
50 1
 
3.8%
80 1
 
3.8%
90 1
 
3.8%
92 1
 
3.8%
100 1
 
3.8%
120 1
 
3.8%
150 3
11.5%
ValueCountFrequency (%)
200000 1
3.8%
30000 1
3.8%
13000 1
3.8%
1900 1
3.8%
1600 1
3.8%
950 1
3.8%
550 2
7.7%
390 1
3.8%
330 1
3.8%
210 1
3.8%
Distinct14
Distinct (%)53.8%
Missing0
Missing (%)0.0%
Memory size340.0 B
2023-12-13T09:32:02.804470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length14.384615
Min length3

Characters and Unicode

Total characters374
Distinct characters65
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)42.3%

Sample

1st row4-Stage BNR
2nd rowCSBR
3rd row건조연료화
4th row간헐방류식 장기포기공정(KIDEA공법)
5th row간헐방류식 장기포기공정(KIDEA공법)
ValueCountFrequency (%)
분뇨 8
10.1%
8
10.1%
고농도 8
10.1%
유기 8
10.1%
오폐수 8
10.1%
고도처리 8
10.1%
간헐방류식 5
 
6.3%
장기포기공정(kidea공법 5
 
6.3%
공법 2
 
2.5%
선회와류식 2
 
2.5%
Other values (16) 17
21.5%
2023-12-13T09:32:03.060536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
53
 
14.2%
18
 
4.8%
16
 
4.3%
16
 
4.3%
R 14
 
3.7%
13
 
3.5%
S 12
 
3.2%
9
 
2.4%
B 9
 
2.4%
8
 
2.1%
Other values (55) 206
55.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 214
57.2%
Uppercase Letter 85
 
22.7%
Space Separator 53
 
14.2%
Close Punctuation 5
 
1.3%
Open Punctuation 5
 
1.3%
Dash Punctuation 5
 
1.3%
Lowercase Letter 4
 
1.1%
Decimal Number 2
 
0.5%
Math Symbol 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
8.4%
16
 
7.5%
16
 
7.5%
13
 
6.1%
9
 
4.2%
8
 
3.7%
8
 
3.7%
8
 
3.7%
8
 
3.7%
8
 
3.7%
Other values (27) 102
47.7%
Uppercase Letter
ValueCountFrequency (%)
R 14
16.5%
S 12
14.1%
B 9
10.6%
A 7
8.2%
E 6
7.1%
K 6
7.1%
I 6
7.1%
D 5
 
5.9%
M 4
 
4.7%
H 3
 
3.5%
Other values (7) 13
15.3%
Lowercase Letter
ValueCountFrequency (%)
t 1
25.0%
a 1
25.0%
e 1
25.0%
g 1
25.0%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
4 1
50.0%
Space Separator
ValueCountFrequency (%)
53
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 214
57.2%
Latin 89
23.8%
Common 71
 
19.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
8.4%
16
 
7.5%
16
 
7.5%
13
 
6.1%
9
 
4.2%
8
 
3.7%
8
 
3.7%
8
 
3.7%
8
 
3.7%
8
 
3.7%
Other values (27) 102
47.7%
Latin
ValueCountFrequency (%)
R 14
15.7%
S 12
13.5%
B 9
10.1%
A 7
7.9%
E 6
 
6.7%
K 6
 
6.7%
I 6
 
6.7%
D 5
 
5.6%
M 4
 
4.5%
H 3
 
3.4%
Other values (11) 17
19.1%
Common
ValueCountFrequency (%)
53
74.6%
) 5
 
7.0%
( 5
 
7.0%
- 5
 
7.0%
+ 1
 
1.4%
2 1
 
1.4%
4 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 214
57.2%
ASCII 160
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
53
33.1%
R 14
 
8.8%
S 12
 
7.5%
B 9
 
5.6%
A 7
 
4.4%
E 6
 
3.8%
K 6
 
3.8%
I 6
 
3.8%
) 5
 
3.1%
D 5
 
3.1%
Other values (18) 37
23.1%
Hangul
ValueCountFrequency (%)
18
 
8.4%
16
 
7.5%
16
 
7.5%
13
 
6.1%
9
 
4.2%
8
 
3.7%
8
 
3.7%
8
 
3.7%
8
 
3.7%
8
 
3.7%
Other values (27) 102
47.7%

Interactions

2023-12-13T09:32:01.016127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T09:32:03.131780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설물명도로명주소지번주소시설용량(세제곱미터_일)처리방법
시설물명1.0001.0001.0001.0001.000
도로명주소1.0001.0001.0000.0000.000
지번주소1.0001.0001.0000.0000.000
시설용량(세제곱미터_일)1.0000.0000.0001.0001.000
처리방법1.0000.0000.0001.0001.000

Missing values

2023-12-13T09:32:01.119035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T09:32:01.197998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설물명도로명주소지번주소시설용량(세제곱미터_일)처리방법
0공공하수처리장전라북도 군산시 서해로 289(소룡동)전라북도 군산시 소룡동 15842000004-Stage BNR
1폐수종말처리장전라북도 군산시 외항로1350(비응도동)전라북도 군산시 비응도동 2130000CSBR
2하수슬러지자원화시설전라북도 군산시 서해로 289(소룡동)전라북도 군산시 소룡동 1584150건조연료화
3대야하수처리장전라북도 군산시 대야면 석화들길 178전라북도 군산시 대야면 산원리 27-151900간헐방류식 장기포기공정(KIDEA공법)
4옥서하수처리장전라북도 군산시 옥서면 옥구저수지로 205-40전라북도 군산시 옥서면 옥봉리 1809-11600간헐방류식 장기포기공정(KIDEA공법)
5서수하수처리장전라북도 군산시 임피면 탑천로 248-37전라북도 군산시 임피면 술산리 668-10950간헐방류식 장기포기공정(KIDEA공법)
6회현하수처리장전라북도 군산시 회현면 회미로 196전라북도 군산시 회현면 대정리 762-1550간헐방류식 장기포기공정(KIDEA공법)
7임피하수처리장전라북도 군산시 임피면 수반들길 49-4전라북도 군산시 임피면 미원리 778550간헐방류식 장기포기공정(KIDEA공법)
8옥곤 하수처리시설전라북도 군산시 철새로 712전라북도 군산시 나포면 옥곤리 955-27150분뇨 및 고농도 유기 오폐수 고도처리
9원우 하수처리시설전라북도 군산시 회현면 남내로 238전라북도 군산시 회현면 원우리 550-190HBR-2
시설물명도로명주소지번주소시설용량(세제곱미터_일)처리방법
16옥산 남내 하수처리시설<NA>전라북도 군산시 옥산면 남내리795-330분뇨 및 고농도 유기 오폐수 고도처리
17나포 뜰아름 하수처리시설<NA>전라북도 군산시 나포면 주곡리 1131-930SMMIAR
18폐수종말처리장 1단계 증설전라북도 군산시 외항로1350(비응도동)전라북도 군산시 비응도동 2113000KUMHO-MBR + URC 공법
19가산하수처리시설<NA>전라북도 군산시 옥구읍 수산리 748330H-SBR
20창오하수처리시설전라북도 군산시 성산면 동군산로 403전라북도 군산시 성산면 창오리 529-292회분식활성슬러지
21신시도하수처리시설<NA>전라북도 군산시 옥도면 신시도리 146-2120선회와류식 SBR
22무녀도하수처리시설전라북도 군산시 옥도면 무도도4길 74전라북도 군산시 옥도면 무녀도리 223-11150선회와류식 SBR
23선유도하수처리시설<NA>전라북도 군산시 옥도면 선유도리 279-3390분뇨 및 고농도 유기 오폐수 고도처리
24어은하수처리시설전라북도 군산시 어은동로 276전라북도 군산시 옥구읍 어은리 107-2210SBR공업
25어청도하수처리시설<NA>전라북도 군산시 옥도면 어청도리 387-10100JASSFR PROCESS-SBR 공법