Overview

Dataset statistics

Number of variables6
Number of observations50
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.5 KiB
Average record size in memory51.6 B

Variable types

Categorical3
Text2
Numeric1

Dataset

Description경기도 남양주시의 빗물이용시설의 시설종류, 시설명, 위치(주소), 저수조용량, 이용용도 등에 대한 자료입니다.
URLhttps://www.data.go.kr/data/15114301/fileData.do

Alerts

시설종류 is highly overall correlated with 이용용도 and 1 other fieldsHigh correlation
이용용도 is highly overall correlated with 시설종류High correlation
비고 is highly overall correlated with 시설종류High correlation
시설명 has unique valuesUnique
위치(주소) has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:57:32.612781
Analysis finished2023-12-12 18:57:33.501230
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시설종류
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
학교
25 
공공청사
11 
공동주택
10 
실내체육관
 
2
대규모점포
 
1

Length

Max length5
Median length2
Mean length3.02
Min length2

Unique

Unique2 ?
Unique (%)4.0%

Sample

1st row실내체육관
2nd row실내체육관
3rd row공공청사
4th row공공청사
5th row공공청사

Common Values

ValueCountFrequency (%)
학교 25
50.0%
공공청사 11
22.0%
공동주택 10
 
20.0%
실내체육관 2
 
4.0%
대규모점포 1
 
2.0%
기타 1
 
2.0%

Length

2023-12-13T03:57:33.611030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:57:33.781046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학교 25
50.0%
공공청사 11
22.0%
공동주택 10
 
20.0%
실내체육관 2
 
4.0%
대규모점포 1
 
2.0%
기타 1
 
2.0%

시설명
Text

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-13T03:57:34.048299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length8.76
Min length4

Characters and Unicode

Total characters438
Distinct characters151
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)100.0%

Sample

1st row화도체육문화센터
2nd row오남체육문화센터
3rd row경기도구리남양주교육지원청
4th row남양주남부경찰서
5th row남양주북부경찰서
ValueCountFrequency (%)
화도체육문화센터 1
 
2.0%
퇴계원고등학교 1
 
2.0%
호평고등학교 1
 
2.0%
구룡초등학교 1
 
2.0%
가운초등학교 1
 
2.0%
가운중학교 1
 
2.0%
도농고등학교 1
 
2.0%
덕송초등학교 1
 
2.0%
별가람중학교 1
 
2.0%
화접초등학교 1
 
2.0%
Other values (40) 40
80.0%
2023-12-13T03:57:34.479917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
26
 
5.9%
25
 
5.7%
18
 
4.1%
11
 
2.5%
11
 
2.5%
10
 
2.3%
10
 
2.3%
10
 
2.3%
9
 
2.1%
9
 
2.1%
Other values (141) 299
68.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 426
97.3%
Decimal Number 10
 
2.3%
Open Punctuation 1
 
0.2%
Close Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
26
 
6.1%
25
 
5.9%
18
 
4.2%
11
 
2.6%
11
 
2.6%
10
 
2.3%
10
 
2.3%
10
 
2.3%
9
 
2.1%
9
 
2.1%
Other values (135) 287
67.4%
Decimal Number
ValueCountFrequency (%)
1 5
50.0%
2 2
 
20.0%
9 2
 
20.0%
3 1
 
10.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 426
97.3%
Common 12
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
26
 
6.1%
25
 
5.9%
18
 
4.2%
11
 
2.6%
11
 
2.6%
10
 
2.3%
10
 
2.3%
10
 
2.3%
9
 
2.1%
9
 
2.1%
Other values (135) 287
67.4%
Common
ValueCountFrequency (%)
1 5
41.7%
2 2
 
16.7%
9 2
 
16.7%
( 1
 
8.3%
) 1
 
8.3%
3 1
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 426
97.3%
ASCII 12
 
2.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
26
 
6.1%
25
 
5.9%
18
 
4.2%
11
 
2.6%
11
 
2.6%
10
 
2.3%
10
 
2.3%
10
 
2.3%
9
 
2.1%
9
 
2.1%
Other values (135) 287
67.4%
ASCII
ValueCountFrequency (%)
1 5
41.7%
2 2
 
16.7%
9 2
 
16.7%
( 1
 
8.3%
) 1
 
8.3%
3 1
 
8.3%

위치(주소)
Text

UNIQUE 

Distinct50
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
2023-12-13T03:57:34.753466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length21
Mean length18.7
Min length16

Characters and Unicode

Total characters935
Distinct characters55
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)100.0%

Sample

1st row경기도 남양주시 화도읍 창현리 529-5
2nd row경기도 남양주시 오남읍 오남리 538-1
3rd row경기도 남양주시 다산동 3158-3
4th row경기도 남양주시 다산동 3020-15
5th row경기도 남양주시 진접읍 연평리 134-23
ValueCountFrequency (%)
경기도 50
22.8%
남양주시 50
22.8%
다산동 18
 
8.2%
별내동 9
 
4.1%
진접읍 7
 
3.2%
금곡리 5
 
2.3%
와부읍 4
 
1.8%
화도읍 4
 
1.8%
덕소리 3
 
1.4%
호평동 3
 
1.4%
Other values (64) 66
30.1%
2023-12-13T03:57:35.347600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
169
18.1%
55
 
5.9%
52
 
5.6%
50
 
5.3%
50
 
5.3%
50
 
5.3%
50
 
5.3%
50
 
5.3%
1 32
 
3.4%
32
 
3.4%
Other values (45) 345
36.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 560
59.9%
Decimal Number 190
 
20.3%
Space Separator 169
 
18.1%
Dash Punctuation 16
 
1.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
55
9.8%
52
 
9.3%
50
 
8.9%
50
 
8.9%
50
 
8.9%
50
 
8.9%
50
 
8.9%
32
 
5.7%
19
 
3.4%
18
 
3.2%
Other values (33) 134
23.9%
Decimal Number
ValueCountFrequency (%)
1 32
16.8%
3 23
12.1%
5 22
11.6%
2 22
11.6%
0 22
11.6%
6 18
9.5%
7 14
7.4%
9 14
7.4%
4 12
 
6.3%
8 11
 
5.8%
Space Separator
ValueCountFrequency (%)
169
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 560
59.9%
Common 375
40.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
55
9.8%
52
 
9.3%
50
 
8.9%
50
 
8.9%
50
 
8.9%
50
 
8.9%
50
 
8.9%
32
 
5.7%
19
 
3.4%
18
 
3.2%
Other values (33) 134
23.9%
Common
ValueCountFrequency (%)
169
45.1%
1 32
 
8.5%
3 23
 
6.1%
5 22
 
5.9%
2 22
 
5.9%
0 22
 
5.9%
6 18
 
4.8%
- 16
 
4.3%
7 14
 
3.7%
9 14
 
3.7%
Other values (2) 23
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 560
59.9%
ASCII 375
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
169
45.1%
1 32
 
8.5%
3 23
 
6.1%
5 22
 
5.9%
2 22
 
5.9%
0 22
 
5.9%
6 18
 
4.8%
- 16
 
4.3%
7 14
 
3.7%
9 14
 
3.7%
Other values (2) 23
 
6.1%
Hangul
ValueCountFrequency (%)
55
9.8%
52
 
9.3%
50
 
8.9%
50
 
8.9%
50
 
8.9%
50
 
8.9%
50
 
8.9%
32
 
5.7%
19
 
3.4%
18
 
3.2%
Other values (33) 134
23.9%
Distinct32
Distinct (%)64.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean159.06
Minimum5
Maximum1764
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size582.0 B
2023-12-13T03:57:35.556941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum5
5-th percentile20
Q126.5
median100
Q3150
95-th percentile524.35
Maximum1764
Range1759
Interquartile range (IQR)123.5

Descriptive statistics

Standard deviation271.23756
Coefficient of variation (CV)1.7052531
Kurtosis25.521657
Mean159.06
Median Absolute Deviation (MAD)66.5
Skewness4.6030419
Sum7953
Variance73569.813
MonotonicityNot monotonic
2023-12-13T03:57:35.744832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
20 8
 
16.0%
120 6
 
12.0%
100 4
 
8.0%
60 2
 
4.0%
23 2
 
4.0%
150 2
 
4.0%
66 1
 
2.0%
25 1
 
2.0%
31 1
 
2.0%
168 1
 
2.0%
Other values (22) 22
44.0%
ValueCountFrequency (%)
5 1
 
2.0%
20 8
16.0%
23 2
 
4.0%
24 1
 
2.0%
25 1
 
2.0%
31 1
 
2.0%
35 1
 
2.0%
36 1
 
2.0%
39 1
 
2.0%
45 1
 
2.0%
ValueCountFrequency (%)
1764 1
2.0%
635 1
2.0%
550 1
2.0%
493 1
2.0%
461 1
2.0%
292 1
2.0%
270 1
2.0%
254 1
2.0%
220 1
2.0%
176 1
2.0%

이용용도
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
조경
29 
옥외,조경
10 
조경,청소
청소
 
2
조경,소방
 
2
Other values (2)

Length

Max length6
Median length2
Mean length3.16
Min length2

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st row청소
2nd row조경
3rd row조경
4th row조경
5th row조경

Common Values

ValueCountFrequency (%)
조경 29
58.0%
옥외,조경 10
 
20.0%
조경,청소 4
 
8.0%
청소 2
 
4.0%
조경,소방 2
 
4.0%
조경,옥외 2
 
4.0%
조경,화장실 1
 
2.0%

Length

2023-12-13T03:57:35.973111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:57:36.143206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
조경 29
58.0%
옥외,조경 10
 
20.0%
조경,청소 4
 
8.0%
청소 2
 
4.0%
조경,소방 2
 
4.0%
조경,옥외 2
 
4.0%
조경,화장실 1
 
2.0%

비고
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size532.0 B
비의무대상시설
35 
의무대상시설
15 

Length

Max length7
Median length7
Mean length6.7
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row의무대상시설
2nd row의무대상시설
3rd row의무대상시설
4th row의무대상시설
5th row의무대상시설

Common Values

ValueCountFrequency (%)
비의무대상시설 35
70.0%
의무대상시설 15
30.0%

Length

2023-12-13T03:57:36.331844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:57:36.475175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
비의무대상시설 35
70.0%
의무대상시설 15
30.0%

Interactions

2023-12-13T03:57:33.049187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:57:36.576489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설종류시설명위치(주소)저류조용량(세제곱미터)이용용도비고
시설종류1.0001.0001.0000.3370.7450.872
시설명1.0001.0001.0001.0001.0001.000
위치(주소)1.0001.0001.0001.0001.0001.000
저류조용량(세제곱미터)0.3371.0001.0001.0000.0000.083
이용용도0.7451.0001.0000.0001.0000.258
비고0.8721.0001.0000.0830.2581.000
2023-12-13T03:57:36.738014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설종류이용용도비고
시설종류1.0000.5550.653
이용용도0.5551.0000.257
비고0.6530.2571.000
2023-12-13T03:57:37.377533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
저류조용량(세제곱미터)시설종류이용용도비고
저류조용량(세제곱미터)1.0000.2280.0000.089
시설종류0.2281.0000.5550.653
이용용도0.0000.5551.0000.257
비고0.0890.6530.2571.000

Missing values

2023-12-13T03:57:33.266537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:57:33.441286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시설종류시설명위치(주소)저류조용량(세제곱미터)이용용도비고
0실내체육관화도체육문화센터경기도 남양주시 화도읍 창현리 529-5115청소의무대상시설
1실내체육관오남체육문화센터경기도 남양주시 오남읍 오남리 538-191조경의무대상시설
2공공청사경기도구리남양주교육지원청경기도 남양주시 다산동 3158-380조경의무대상시설
3공공청사남양주남부경찰서경기도 남양주시 다산동 3020-15220조경의무대상시설
4공공청사남양주북부경찰서경기도 남양주시 진접읍 연평리 134-23150조경의무대상시설
5공공청사다산행정복지센터경기도 남양주시 다산동 6150270조경의무대상시설
6공공청사수동면종합행정타운경기도 남양주시 수동면 운수리 95-260조경의무대상시설
7공공청사한국전력공사남양주지사경기도 남양주시 다산동 6246100조경의무대상시설
8공공청사의정부지방검찰청남양주지청경기도 남양주시 다산동 3232160조경의무대상시설
9공동주택별내효성해링턴코트경기도 남양주시 별내동 919120조경의무대상시설
시설종류시설명위치(주소)저류조용량(세제곱미터)이용용도비고
40학교샛별초등학교경기도 남양주시 별내동 870124조경비의무대상시설
41학교한별중학교경기도 남양주시 별내동 1055100옥외,조경비의무대상시설
42학교경은학교경기도 남양주시 별내면 광전리 145-531옥외,조경비의무대상시설
43학교가운고등학교경기도 남양주시 다산동 68466조경,옥외비의무대상시설
44학교예봉초등학교경기도 남양주시 와부읍 덕소리 25220옥외,조경비의무대상시설
45학교예봉중학교경기도 남양주시 와부읍 덕소리 25920조경비의무대상시설
46학교도심초등학교경기도 남양주시 와부읍 도곡리 1051-725옥외,조경비의무대상시설
47학교광릉중학교경기도 남양주시 진접읍 진벌리 61120조경비의무대상시설
48학교호평고등학교경기도 남양주시 호평동 67720조경비의무대상시설
49기타정약용도서관경기도 남양주시 다산동 3207-4176조경,화장실비의무대상시설