Overview

Dataset statistics

Number of variables7
Number of observations153
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.6 KiB
Average record size in memory57.9 B

Variable types

Categorical3
Text2
Numeric1
DateTime1

Dataset

Description성남시 비상급수시설 현황에 대한 데이터로 구별,시설명,위치,규모,수질구분,유형 등의 항목으로 구성되어 있습니다.
URLhttps://www.data.go.kr/data/15000625/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
구별 is highly overall correlated with 유형High correlation
수질구분 is highly overall correlated with 유형High correlation
유형 is highly overall correlated with 구별 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 01:27:04.316309
Analysis finished2023-12-12 01:27:05.123398
Duration0.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구별
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
분당구
67 
수정구
53 
중원구
33 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수정구
2nd row수정구
3rd row수정구
4th row수정구
5th row수정구

Common Values

ValueCountFrequency (%)
분당구 67
43.8%
수정구 53
34.6%
중원구 33
21.6%

Length

2023-12-12T10:27:05.202723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:27:05.344096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
분당구 67
43.8%
수정구 53
34.6%
중원구 33
21.6%
Distinct136
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T10:27:05.711516image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length12
Mean length7.6993464
Min length3

Characters and Unicode

Total characters1178
Distinct characters262
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique132 ?
Unique (%)86.3%

Sample

1st row산성공원
2nd row복우물공원
3rd row제15특수임무비행단(가)
4th row제15특수임무비행단(나)
5th row제15특수임무비행단(다)
ValueCountFrequency (%)
남서울cc(경원건설 14
 
8.2%
라이프원코리아 3
 
1.8%
스파밸리골프연습장(한백찬 2
 
1.2%
송림중고등학교(박형규 2
 
1.2%
이수선생 1
 
0.6%
분당메모리얼파크(이규만 1
 
0.6%
든든한교회 1
 
0.6%
복합화력발전처 1
 
0.6%
갈보리교회(이필재 1
 
0.6%
꿈동산어린이공원 1
 
0.6%
Other values (143) 143
84.1%
2023-12-12T10:27:06.336186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 63
 
5.3%
( 62
 
5.3%
35
 
3.0%
C 29
 
2.5%
21
 
1.8%
20
 
1.7%
20
 
1.7%
19
 
1.6%
18
 
1.5%
18
 
1.5%
Other values (252) 873
74.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 967
82.1%
Close Punctuation 63
 
5.3%
Open Punctuation 62
 
5.3%
Uppercase Letter 35
 
3.0%
Decimal Number 22
 
1.9%
Space Separator 17
 
1.4%
Other Symbol 6
 
0.5%
Lowercase Letter 3
 
0.3%
Dash Punctuation 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
3.6%
21
 
2.2%
20
 
2.1%
20
 
2.1%
19
 
2.0%
18
 
1.9%
18
 
1.9%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (230) 768
79.4%
Uppercase Letter
ValueCountFrequency (%)
C 29
82.9%
M 1
 
2.9%
E 1
 
2.9%
G 1
 
2.9%
P 1
 
2.9%
L 1
 
2.9%
A 1
 
2.9%
Decimal Number
ValueCountFrequency (%)
1 8
36.4%
3 5
22.7%
5 3
 
13.6%
2 2
 
9.1%
6 2
 
9.1%
8 2
 
9.1%
Lowercase Letter
ValueCountFrequency (%)
s 1
33.3%
k 1
33.3%
e 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Open Punctuation
ValueCountFrequency (%)
( 62
100.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 973
82.6%
Common 167
 
14.2%
Latin 38
 
3.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
3.6%
21
 
2.2%
20
 
2.1%
20
 
2.1%
19
 
2.0%
18
 
1.8%
18
 
1.8%
17
 
1.7%
16
 
1.6%
15
 
1.5%
Other values (231) 774
79.5%
Common
ValueCountFrequency (%)
) 63
37.7%
( 62
37.1%
17
 
10.2%
1 8
 
4.8%
3 5
 
3.0%
5 3
 
1.8%
2 2
 
1.2%
6 2
 
1.2%
8 2
 
1.2%
- 2
 
1.2%
Latin
ValueCountFrequency (%)
C 29
76.3%
s 1
 
2.6%
k 1
 
2.6%
M 1
 
2.6%
E 1
 
2.6%
G 1
 
2.6%
P 1
 
2.6%
L 1
 
2.6%
e 1
 
2.6%
A 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 967
82.1%
ASCII 205
 
17.4%
None 6
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 63
30.7%
( 62
30.2%
C 29
14.1%
17
 
8.3%
1 8
 
3.9%
3 5
 
2.4%
5 3
 
1.5%
2 2
 
1.0%
6 2
 
1.0%
8 2
 
1.0%
Other values (11) 12
 
5.9%
Hangul
ValueCountFrequency (%)
35
 
3.6%
21
 
2.2%
20
 
2.1%
20
 
2.1%
19
 
2.0%
18
 
1.9%
18
 
1.9%
17
 
1.8%
16
 
1.7%
15
 
1.6%
Other values (230) 768
79.4%
None
ValueCountFrequency (%)
6
100.0%

위치
Text

Distinct142
Distinct (%)92.8%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T10:27:06.757959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length25
Mean length22.431373
Min length14

Characters and Unicode

Total characters3432
Distinct characters107
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique138 ?
Unique (%)90.2%

Sample

1st row성남 수정구 수정로 456번길 19(산성동)
2nd row성남 수정구 복정로20번길 16(복정동)
3rd row성남 수정구 고등동 353
4th row성남 수정구 고등동 336
5th row성남 수정구 고등동 273-1
ValueCountFrequency (%)
성남시 101
 
14.0%
경기도 100
 
13.8%
분당구 66
 
9.1%
수정구 53
 
7.3%
성남 52
 
7.2%
중원구 33
 
4.6%
하산운동 13
 
1.8%
266-34(운중동 8
 
1.1%
둔촌대로 8
 
1.1%
갈현동 6
 
0.8%
Other values (210) 284
39.2%
2023-12-12T10:27:07.290246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
571
 
16.6%
180
 
5.2%
167
 
4.9%
164
 
4.8%
156
 
4.5%
114
 
3.3%
107
 
3.1%
1 104
 
3.0%
103
 
3.0%
) 100
 
2.9%
Other values (97) 1666
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2025
59.0%
Decimal Number 575
 
16.8%
Space Separator 571
 
16.6%
Close Punctuation 100
 
2.9%
Open Punctuation 100
 
2.9%
Dash Punctuation 60
 
1.7%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
180
 
8.9%
167
 
8.2%
164
 
8.1%
156
 
7.7%
114
 
5.6%
107
 
5.3%
103
 
5.1%
100
 
4.9%
69
 
3.4%
68
 
3.4%
Other values (82) 797
39.4%
Decimal Number
ValueCountFrequency (%)
1 104
18.1%
2 90
15.7%
3 82
14.3%
6 65
11.3%
4 56
9.7%
5 46
8.0%
8 40
 
7.0%
7 36
 
6.3%
0 30
 
5.2%
9 26
 
4.5%
Space Separator
ValueCountFrequency (%)
571
100.0%
Close Punctuation
ValueCountFrequency (%)
) 100
100.0%
Open Punctuation
ValueCountFrequency (%)
( 100
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 60
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2025
59.0%
Common 1407
41.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
180
 
8.9%
167
 
8.2%
164
 
8.1%
156
 
7.7%
114
 
5.6%
107
 
5.3%
103
 
5.1%
100
 
4.9%
69
 
3.4%
68
 
3.4%
Other values (82) 797
39.4%
Common
ValueCountFrequency (%)
571
40.6%
1 104
 
7.4%
) 100
 
7.1%
( 100
 
7.1%
2 90
 
6.4%
3 82
 
5.8%
6 65
 
4.6%
- 60
 
4.3%
4 56
 
4.0%
5 46
 
3.3%
Other values (5) 133
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2025
59.0%
ASCII 1407
41.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
571
40.6%
1 104
 
7.4%
) 100
 
7.1%
( 100
 
7.1%
2 90
 
6.4%
3 82
 
5.8%
6 65
 
4.6%
- 60
 
4.3%
4 56
 
4.0%
5 46
 
3.3%
Other values (5) 133
 
9.5%
Hangul
ValueCountFrequency (%)
180
 
8.9%
167
 
8.2%
164
 
8.1%
156
 
7.7%
114
 
5.6%
107
 
5.3%
103
 
5.1%
100
 
4.9%
69
 
3.4%
68
 
3.4%
Other values (82) 797
39.4%

규모(톤_일)
Real number (ℝ)

Distinct66
Distinct (%)43.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean97.313725
Minimum15
Maximum1359
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.5 KiB
2023-12-12T10:27:07.460813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile23
Q145
median60
Q398
95-th percentile269.6
Maximum1359
Range1344
Interquartile range (IQR)53

Descriptive statistics

Standard deviation157.27601
Coefficient of variation (CV)1.616175
Kurtosis39.339955
Mean97.313725
Median Absolute Deviation (MAD)20
Skewness5.9140663
Sum14889
Variance24735.743
MonotonicityNot monotonic
2023-12-12T10:27:07.649279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
60 19
 
12.4%
50 12
 
7.8%
100 10
 
6.5%
45 10
 
6.5%
80 8
 
5.2%
30 8
 
5.2%
20 7
 
4.6%
40 7
 
4.6%
98 3
 
2.0%
110 3
 
2.0%
Other values (56) 66
43.1%
ValueCountFrequency (%)
15 1
 
0.7%
20 7
4.6%
25 2
 
1.3%
28 1
 
0.7%
30 8
5.2%
32 1
 
0.7%
35 1
 
0.7%
39 1
 
0.7%
40 7
4.6%
41 2
 
1.3%
ValueCountFrequency (%)
1359 1
0.7%
1044 1
0.7%
910 1
0.7%
361 1
0.7%
350 1
0.7%
321 1
0.7%
310 1
0.7%
299 1
0.7%
250 1
0.7%
240 1
0.7%

수질구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
생활용수
128 
음용수
25 

Length

Max length4
Median length4
Mean length3.8366013
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row음용수
2nd row음용수
3rd row음용수
4th row음용수
5th row음용수

Common Values

ValueCountFrequency (%)
생활용수 128
83.7%
음용수 25
 
16.3%

Length

2023-12-12T10:27:07.833822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:27:07.986833image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
생활용수 128
83.7%
음용수 25
 
16.3%

유형
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
민간지정
88 
민간시설
45 
공공지원
 
7
정부지원
 
6
공공시설
 
4

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정부지원
2nd row정부지원
3rd row공공시설
4th row공공시설
5th row공공시설

Common Values

ValueCountFrequency (%)
민간지정 88
57.5%
민간시설 45
29.4%
공공지원 7
 
4.6%
정부지원 6
 
3.9%
공공시설 4
 
2.6%
정부시설 3
 
2.0%

Length

2023-12-12T10:27:08.139324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:27:08.282880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간지정 88
57.5%
민간시설 45
29.4%
공공지원 7
 
4.6%
정부지원 6
 
3.9%
공공시설 4
 
2.6%
정부시설 3
 
2.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
Minimum2023-06-09 00:00:00
Maximum2023-06-09 00:00:00
2023-12-12T10:27:08.419161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:27:08.538922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T10:27:04.767039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T10:27:08.627510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구별규모(톤_일)수질구분유형
구별1.0000.1460.0000.944
규모(톤_일)0.1461.0000.0000.000
수질구분0.0000.0001.0000.963
유형0.9440.0000.9631.000
2023-12-12T10:27:08.740971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구별유형수질구분
구별1.0000.7070.000
유형0.7071.0000.818
수질구분0.0000.8181.000
2023-12-12T10:27:08.870877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
규모(톤_일)구별수질구분유형
규모(톤_일)1.0000.0580.0000.000
구별0.0581.0000.0000.707
수질구분0.0000.0001.0000.818
유형0.0000.7070.8181.000

Missing values

2023-12-12T10:27:04.931903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:27:05.075954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구별시설명위치규모(톤_일)수질구분유형데이터기준일자
0수정구산성공원성남 수정구 수정로 456번길 19(산성동)52음용수정부지원2023-06-09
1수정구복우물공원성남 수정구 복정로20번길 16(복정동)60음용수정부지원2023-06-09
2수정구제15특수임무비행단(가)성남 수정구 고등동 35398음용수공공시설2023-06-09
3수정구제15특수임무비행단(나)성남 수정구 고등동 33698음용수공공시설2023-06-09
4수정구제15특수임무비행단(다)성남 수정구 고등동 273-198음용수공공시설2023-06-09
5수정구시흥동체육공원성남 수정구 시흥동 220-755음용수정부지원2023-06-09
6수정구위례근린공원성남 수정구 위례동이로 24(위례동)62음용수정부지원2023-06-09
7수정구옛골 제2호 어린이공원성남 수정구 상적동 291-1615음용수정부지원2023-06-09
8수정구왕남초등학교성남시 수정구 고등동 464-3105음용수정부지원2023-06-09
9수정구나성목욕탕성남 수정구 탄리로 58(신흥동)80생활용수민간시설2023-06-09
구별시설명위치규모(톤_일)수질구분유형데이터기준일자
143분당구남서울CC(경원건설)경기도 성남시 분당구 하산운동 266-34(운중동)168생활용수민간지정2023-06-09
144분당구남서울CC(경원건설)경기도 성남시 분당구 하산운동 266-34(운중동)361생활용수민간지정2023-06-09
145분당구남서울CC(경원건설)경기도 성남시 분당구 하산운동 266-35(운중동)209생활용수민간지정2023-06-09
146분당구남서울CC(경원건설)경기도 성남시 분당구 하산운동 266-35(운중동)321생활용수민간지정2023-06-09
147분당구남서울CC(경원건설)경기도 성남시 분당구 하산운동 372(운중동)110생활용수민간지정2023-06-09
148분당구남서울CC(경원건설)경기도 성남시 분당구 하산운동221-7(운중동)910생활용수민간지정2023-06-09
149분당구남서울CC(경원건설)경기도 성남시 분당구 하산운동 266-34(운중동)82생활용수민간지정2023-06-09
150분당구남서울CC(경원건설)경기도 성남시 분당구 하산운동 266-34(운중동)299생활용수민간지정2023-06-09
151분당구sk케미컬㈜경기도 성남시 분당구 판교로 310(삼평동)87생활용수민간지정2023-06-09
152분당구(주)새서울석유판교경기도 성남시 분당구 판교동 587 (판교동)83생활용수민간지정2023-06-09