Overview

Dataset statistics

Number of variables17
Number of observations393
Missing cells341
Missing cells (%)5.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory52.7 KiB
Average record size in memory137.3 B

Variable types

Numeric1
Categorical7
Text5
Boolean4

Dataset

Description경상북도 김천시의 공중화장실 현황으로 화장실 위치, 개방시간, 비상벨설치유무, 기저귀교환대장소유무 정보 등을 제공합니다.
Author경상북도 김천시
URLhttps://www.data.go.kr/data/15091984/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
기저귀교환대장소 is highly overall correlated with 비상벨설치여부 and 1 other fieldsHigh correlation
기저귀교환대유무 is highly overall correlated with 기저귀교환대장소High correlation
연번 is highly overall correlated with 화장실소유구분High correlation
화장실구분 is highly overall correlated with 개방시간High correlation
개방시간 is highly overall correlated with 화장실구분 and 2 other fieldsHigh correlation
화장실소유구분 is highly overall correlated with 연번 and 2 other fieldsHigh correlation
화장실설치장소유형 is highly overall correlated with 화장실소유구분 and 1 other fieldsHigh correlation
오물처리방식 is highly overall correlated with 개방시간High correlation
화장실정보공개여부 is highly overall correlated with 화장실소유구분 and 1 other fieldsHigh correlation
비상벨설치여부 is highly overall correlated with 개방시간 and 1 other fieldsHigh correlation
화장실구분 is highly imbalanced (76.9%)Imbalance
개방시간 is highly imbalanced (59.3%)Imbalance
오물처리방식 is highly imbalanced (85.7%)Imbalance
기저귀교환대유무 is highly imbalanced (52.5%)Imbalance
기저귀교환대장소 is highly imbalanced (73.6%)Imbalance
소재지도로명주소 has 6 (1.5%) missing valuesMissing
소재지지번주소 has 328 (83.5%) missing valuesMissing
관리기관전화번호 has 7 (1.8%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:31:32.091145
Analysis finished2023-12-12 23:31:33.772742
Duration1.68 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct393
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean197
Minimum1
Maximum393
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-13T08:31:33.842140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile20.6
Q199
median197
Q3295
95-th percentile373.4
Maximum393
Range392
Interquartile range (IQR)196

Descriptive statistics

Standard deviation113.59357
Coefficient of variation (CV)0.57661713
Kurtosis-1.2
Mean197
Median Absolute Deviation (MAD)98
Skewness0
Sum77421
Variance12903.5
MonotonicityStrictly increasing
2023-12-13T08:31:33.972943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
271 1
 
0.3%
269 1
 
0.3%
268 1
 
0.3%
267 1
 
0.3%
266 1
 
0.3%
265 1
 
0.3%
264 1
 
0.3%
263 1
 
0.3%
262 1
 
0.3%
Other values (383) 383
97.5%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
393 1
0.3%
392 1
0.3%
391 1
0.3%
390 1
0.3%
389 1
0.3%
388 1
0.3%
387 1
0.3%
386 1
0.3%
385 1
0.3%
384 1
0.3%

화장실구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
공중화장실
361 
개방화장실
 
28
이동화장실
 
2
간이화장실
 
2

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개방화장실
2nd row개방화장실
3rd row공중화장실
4th row공중화장실
5th row공중화장실

Common Values

ValueCountFrequency (%)
공중화장실 361
91.9%
개방화장실 28
 
7.1%
이동화장실 2
 
0.5%
간이화장실 2
 
0.5%

Length

2023-12-13T08:31:34.093697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:31:34.279009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공중화장실 361
91.9%
개방화장실 28
 
7.1%
이동화장실 2
 
0.5%
간이화장실 2
 
0.5%
Distinct384
Distinct (%)97.7%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-13T08:31:34.467092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length15
Mean length10.521628
Min length4

Characters and Unicode

Total characters4135
Distinct characters270
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique376 ?
Unique (%)95.7%

Sample

1st row김천(부산)주유소(1층)
2nd row김천(서울)주유소(1층)
3rd row감천제방공중화장실
4th row황금시장공중화장실
5th row황금시장플랫폼 화장실
ValueCountFrequency (%)
화장실 117
 
16.1%
김천시 50
 
6.9%
1층 32
 
4.4%
2층 31
 
4.3%
3층 12
 
1.7%
김천지원 8
 
1.1%
김천제일병원 7
 
1.0%
화장실(1층 5
 
0.7%
2층화장실 4
 
0.6%
화장실(2층 4
 
0.6%
Other values (364) 455
62.8%
2023-12-13T08:31:34.797758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
332
 
8.0%
215
 
5.2%
198
 
4.8%
181
 
4.4%
176
 
4.3%
142
 
3.4%
132
 
3.2%
130
 
3.1%
108
 
2.6%
93
 
2.2%
Other values (260) 2428
58.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3451
83.5%
Space Separator 332
 
8.0%
Decimal Number 157
 
3.8%
Close Punctuation 85
 
2.1%
Open Punctuation 85
 
2.1%
Uppercase Letter 23
 
0.6%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
215
 
6.2%
198
 
5.7%
181
 
5.2%
176
 
5.1%
142
 
4.1%
132
 
3.8%
130
 
3.8%
108
 
3.1%
93
 
2.7%
84
 
2.4%
Other values (241) 1992
57.7%
Decimal Number
ValueCountFrequency (%)
1 68
43.3%
2 55
35.0%
3 25
 
15.9%
9 4
 
2.5%
4 2
 
1.3%
7 1
 
0.6%
6 1
 
0.6%
5 1
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
I 5
21.7%
C 5
21.7%
G 3
13.0%
P 3
13.0%
L 3
13.0%
K 2
 
8.7%
S 2
 
8.7%
Space Separator
ValueCountFrequency (%)
332
100.0%
Close Punctuation
ValueCountFrequency (%)
) 85
100.0%
Open Punctuation
ValueCountFrequency (%)
( 85
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3451
83.5%
Common 661
 
16.0%
Latin 23
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
215
 
6.2%
198
 
5.7%
181
 
5.2%
176
 
5.1%
142
 
4.1%
132
 
3.8%
130
 
3.8%
108
 
3.1%
93
 
2.7%
84
 
2.4%
Other values (241) 1992
57.7%
Common
ValueCountFrequency (%)
332
50.2%
) 85
 
12.9%
( 85
 
12.9%
1 68
 
10.3%
2 55
 
8.3%
3 25
 
3.8%
9 4
 
0.6%
- 2
 
0.3%
4 2
 
0.3%
7 1
 
0.2%
Other values (2) 2
 
0.3%
Latin
ValueCountFrequency (%)
I 5
21.7%
C 5
21.7%
G 3
13.0%
P 3
13.0%
L 3
13.0%
K 2
 
8.7%
S 2
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3451
83.5%
ASCII 684
 
16.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
332
48.5%
) 85
 
12.4%
( 85
 
12.4%
1 68
 
9.9%
2 55
 
8.0%
3 25
 
3.7%
I 5
 
0.7%
C 5
 
0.7%
9 4
 
0.6%
G 3
 
0.4%
Other values (9) 17
 
2.5%
Hangul
ValueCountFrequency (%)
215
 
6.2%
198
 
5.7%
181
 
5.2%
176
 
5.1%
142
 
4.1%
132
 
3.8%
130
 
3.8%
108
 
3.1%
93
 
2.7%
84
 
2.4%
Other values (241) 1992
57.7%
Distinct296
Distinct (%)76.5%
Missing6
Missing (%)1.5%
Memory size3.2 KiB
2023-12-13T08:31:35.084114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length34
Mean length25.382429
Min length11

Characters and Unicode

Total characters9823
Distinct characters198
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique244 ?
Unique (%)63.0%

Sample

1st row경상북도 김천시 농소면 경부고속도로 193
2nd row경상북도 김천시 농소면 경부고속도로 94-2
3rd row경상북도 김천시 양천동 1774-47
4th row경상북도 김천시 황금시장3길 15
5th row경상북도 김천시 황금시장5길 4
ValueCountFrequency (%)
경상북도 384
 
19.1%
김천시 384
 
19.1%
남김천대로 25
 
1.2%
영남대로 25
 
1.2%
봉산면 22
 
1.1%
감문면 18
 
0.9%
어모면 17
 
0.8%
대항면 16
 
0.8%
율곡동 16
 
0.8%
구성면 15
 
0.7%
Other values (511) 1086
54.1%
2023-12-13T08:31:35.553622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1621
 
16.5%
469
 
4.8%
450
 
4.6%
424
 
4.3%
414
 
4.2%
409
 
4.2%
402
 
4.1%
395
 
4.0%
1 320
 
3.3%
2 219
 
2.2%
Other values (188) 4700
47.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6277
63.9%
Space Separator 1621
 
16.5%
Decimal Number 1284
 
13.1%
Close Punctuation 200
 
2.0%
Open Punctuation 200
 
2.0%
Other Punctuation 155
 
1.6%
Dash Punctuation 86
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
469
 
7.5%
450
 
7.2%
424
 
6.8%
414
 
6.6%
409
 
6.5%
402
 
6.4%
395
 
6.3%
195
 
3.1%
191
 
3.0%
182
 
2.9%
Other values (173) 2746
43.7%
Decimal Number
ValueCountFrequency (%)
1 320
24.9%
2 219
17.1%
3 139
10.8%
4 106
 
8.3%
5 103
 
8.0%
9 100
 
7.8%
0 98
 
7.6%
6 77
 
6.0%
7 72
 
5.6%
8 50
 
3.9%
Space Separator
ValueCountFrequency (%)
1621
100.0%
Close Punctuation
ValueCountFrequency (%)
) 200
100.0%
Open Punctuation
ValueCountFrequency (%)
( 200
100.0%
Other Punctuation
ValueCountFrequency (%)
, 155
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6277
63.9%
Common 3546
36.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
469
 
7.5%
450
 
7.2%
424
 
6.8%
414
 
6.6%
409
 
6.5%
402
 
6.4%
395
 
6.3%
195
 
3.1%
191
 
3.0%
182
 
2.9%
Other values (173) 2746
43.7%
Common
ValueCountFrequency (%)
1621
45.7%
1 320
 
9.0%
2 219
 
6.2%
) 200
 
5.6%
( 200
 
5.6%
, 155
 
4.4%
3 139
 
3.9%
4 106
 
3.0%
5 103
 
2.9%
9 100
 
2.8%
Other values (5) 383
 
10.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6277
63.9%
ASCII 3546
36.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1621
45.7%
1 320
 
9.0%
2 219
 
6.2%
) 200
 
5.6%
( 200
 
5.6%
, 155
 
4.4%
3 139
 
3.9%
4 106
 
3.0%
5 103
 
2.9%
9 100
 
2.8%
Other values (5) 383
 
10.8%
Hangul
ValueCountFrequency (%)
469
 
7.5%
450
 
7.2%
424
 
6.8%
414
 
6.6%
409
 
6.5%
402
 
6.4%
395
 
6.3%
195
 
3.1%
191
 
3.0%
182
 
2.9%
Other values (173) 2746
43.7%

소재지지번주소
Text

MISSING 

Distinct38
Distinct (%)58.5%
Missing328
Missing (%)83.5%
Memory size3.2 KiB
2023-12-13T08:31:35.859326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length30
Mean length25.415385
Min length16

Characters and Unicode

Total characters1652
Distinct characters119
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)41.5%

Sample

1st row경상북도 김천시 양천동 1774-47
2nd row경상북도 김천시 황금동 31-10
3rd row경상북도 김천시 황금동 20-9
4th row경상북도 김천시 율곡동 6-1566
5th row경상북도 김천시 율곡동 804
ValueCountFrequency (%)
경상북도 65
18.5%
김천시 65
18.5%
봉산면 13
 
3.7%
신음동 11
 
3.1%
대항면 8
 
2.3%
103 7
 
2.0%
구성면 7
 
2.0%
운수리 7
 
2.0%
사명대사공원 6
 
1.7%
하강리 6
 
1.7%
Other values (81) 156
44.4%
2023-12-13T08:31:36.355581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
286
 
17.3%
88
 
5.3%
81
 
4.9%
72
 
4.4%
68
 
4.1%
66
 
4.0%
66
 
4.0%
65
 
3.9%
1 58
 
3.5%
45
 
2.7%
Other values (109) 757
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1091
66.0%
Space Separator 286
 
17.3%
Decimal Number 240
 
14.5%
Dash Punctuation 35
 
2.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
88
 
8.1%
81
 
7.4%
72
 
6.6%
68
 
6.2%
66
 
6.0%
66
 
6.0%
65
 
6.0%
45
 
4.1%
38
 
3.5%
27
 
2.5%
Other values (97) 475
43.5%
Decimal Number
ValueCountFrequency (%)
1 58
24.2%
4 28
11.7%
8 25
10.4%
2 25
10.4%
3 22
 
9.2%
6 18
 
7.5%
7 17
 
7.1%
0 17
 
7.1%
5 16
 
6.7%
9 14
 
5.8%
Space Separator
ValueCountFrequency (%)
286
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 35
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1091
66.0%
Common 561
34.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
88
 
8.1%
81
 
7.4%
72
 
6.6%
68
 
6.2%
66
 
6.0%
66
 
6.0%
65
 
6.0%
45
 
4.1%
38
 
3.5%
27
 
2.5%
Other values (97) 475
43.5%
Common
ValueCountFrequency (%)
286
51.0%
1 58
 
10.3%
- 35
 
6.2%
4 28
 
5.0%
8 25
 
4.5%
2 25
 
4.5%
3 22
 
3.9%
6 18
 
3.2%
7 17
 
3.0%
0 17
 
3.0%
Other values (2) 30
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1091
66.0%
ASCII 561
34.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
286
51.0%
1 58
 
10.3%
- 35
 
6.2%
4 28
 
5.0%
8 25
 
4.5%
2 25
 
4.5%
3 22
 
3.9%
6 18
 
3.2%
7 17
 
3.0%
0 17
 
3.0%
Other values (2) 30
 
5.3%
Hangul
ValueCountFrequency (%)
88
 
8.1%
81
 
7.4%
72
 
6.6%
68
 
6.2%
66
 
6.0%
66
 
6.0%
65
 
6.0%
45
 
4.1%
38
 
3.5%
27
 
2.5%
Other values (97) 475
43.5%
Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size525.0 B
False
332 
True
61 
ValueCountFrequency (%)
False 332
84.5%
True 61
 
15.5%
2023-12-13T08:31:36.503787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct225
Distinct (%)57.3%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-13T08:31:36.715697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length7.6463104
Min length3

Characters and Unicode

Total characters3005
Distinct characters208
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique162 ?
Unique (%)41.2%

Sample

1st row김천(부산)주유소
2nd row김천(서울)주유소
3rd row김천시 양금동
4th row김천시 양금동
5th row김천시 원도심재생과
ValueCountFrequency (%)
김천시 88
 
15.1%
김천시청 53
 
9.1%
산림녹지과 25
 
4.3%
농업기술센터 15
 
2.6%
행정복지센터 10
 
1.7%
관광진흥과 10
 
1.7%
서무계 8
 
1.4%
농촌지도과 8
 
1.4%
김천지원 8
 
1.4%
투자유치과 8
 
1.4%
Other values (226) 348
59.9%
2023-12-13T08:31:37.030215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
214
 
7.1%
193
 
6.4%
188
 
6.3%
176
 
5.9%
99
 
3.3%
84
 
2.8%
82
 
2.7%
65
 
2.2%
65
 
2.2%
58
 
1.9%
Other values (198) 1781
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2772
92.2%
Space Separator 188
 
6.3%
Open Punctuation 17
 
0.6%
Close Punctuation 14
 
0.5%
Decimal Number 13
 
0.4%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
214
 
7.7%
193
 
7.0%
176
 
6.3%
99
 
3.6%
84
 
3.0%
82
 
3.0%
65
 
2.3%
65
 
2.3%
58
 
2.1%
56
 
2.0%
Other values (191) 1680
60.6%
Decimal Number
ValueCountFrequency (%)
1 8
61.5%
9 4
30.8%
2 1
 
7.7%
Space Separator
ValueCountFrequency (%)
188
100.0%
Open Punctuation
ValueCountFrequency (%)
( 17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2772
92.2%
Common 233
 
7.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
214
 
7.7%
193
 
7.0%
176
 
6.3%
99
 
3.6%
84
 
3.0%
82
 
3.0%
65
 
2.3%
65
 
2.3%
58
 
2.1%
56
 
2.0%
Other values (191) 1680
60.6%
Common
ValueCountFrequency (%)
188
80.7%
( 17
 
7.3%
) 14
 
6.0%
1 8
 
3.4%
9 4
 
1.7%
, 1
 
0.4%
2 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2772
92.2%
ASCII 233
 
7.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
214
 
7.7%
193
 
7.0%
176
 
6.3%
99
 
3.6%
84
 
3.0%
82
 
3.0%
65
 
2.3%
65
 
2.3%
58
 
2.1%
56
 
2.0%
Other values (191) 1680
60.6%
ASCII
ValueCountFrequency (%)
188
80.7%
( 17
 
7.3%
) 14
 
6.0%
1 8
 
3.4%
9 4
 
1.7%
, 1
 
0.4%
2 1
 
0.4%
Distinct230
Distinct (%)59.6%
Missing7
Missing (%)1.8%
Memory size3.2 KiB
2023-12-13T08:31:37.269089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.005181
Min length12

Characters and Unicode

Total characters4634
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique169 ?
Unique (%)43.8%

Sample

1st row054-435-0266
2nd row054-431-4477
3rd row054-421-2354
4th row054-421-2354
5th row054-420-6847
ValueCountFrequency (%)
054-420-6757 25
 
6.5%
054-420-2033 8
 
2.1%
054-432-2517 7
 
1.8%
054-421-1633 7
 
1.8%
054-434-2565 7
 
1.8%
054-433-4000 6
 
1.6%
054-421-1661 6
 
1.6%
054-420-9300 5
 
1.3%
054-431-6301 5
 
1.3%
054-430-5301 5
 
1.3%
Other values (220) 305
79.0%
2023-12-13T08:31:37.634231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 878
18.9%
- 772
16.7%
0 736
15.9%
5 609
13.1%
2 423
9.1%
3 403
8.7%
1 278
 
6.0%
6 202
 
4.4%
7 147
 
3.2%
8 95
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3862
83.3%
Dash Punctuation 772
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 878
22.7%
0 736
19.1%
5 609
15.8%
2 423
11.0%
3 403
10.4%
1 278
 
7.2%
6 202
 
5.2%
7 147
 
3.8%
8 95
 
2.5%
9 91
 
2.4%
Dash Punctuation
ValueCountFrequency (%)
- 772
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4634
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 878
18.9%
- 772
16.7%
0 736
15.9%
5 609
13.1%
2 423
9.1%
3 403
8.7%
1 278
 
6.0%
6 202
 
4.4%
7 147
 
3.2%
8 95
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4634
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 878
18.9%
- 772
16.7%
0 736
15.9%
5 609
13.1%
2 423
9.1%
3 403
8.7%
1 278
 
6.0%
6 202
 
4.4%
7 147
 
3.2%
8 95
 
2.1%

개방시간
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct22
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
09:00~18:00
278 
00:00~00:00
52 
24시간
 
7
09:00~21:00
 
7
상시개방
 
6
Other values (17)
43 

Length

Max length25
Median length11
Mean length10.631043
Min length2

Unique

Unique5 ?
Unique (%)1.3%

Sample

1st row24시간
2nd row24시간
3rd row상시개방
4th row상시개방
5th row상시개방

Common Values

ValueCountFrequency (%)
09:00~18:00 278
70.7%
00:00~00:00 52
 
13.2%
24시간 7
 
1.8%
09:00~21:00 7
 
1.8%
상시개방 6
 
1.5%
09:00~17:00 5
 
1.3%
08:00~23:00 4
 
1.0%
06:00~21:00 4
 
1.0%
24시 4
 
1.0%
08:00~22:00 4
 
1.0%
Other values (12) 22
 
5.6%

Length

2023-12-13T08:31:37.779466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
09:00~18:00 278
70.4%
00:00~00:00 52
 
13.2%
24시간 7
 
1.8%
09:00~21:00 7
 
1.8%
상시개방 6
 
1.5%
09:00~17:00 5
 
1.3%
08:00~23:00 4
 
1.0%
06:00~21:00 4
 
1.0%
24시 4
 
1.0%
08:00~22:00 4
 
1.0%
Other values (14) 24
 
6.1%

화장실소유구분
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
지방자치단체
231 
민간
116 
공공기관
29 
국가
 
17

Length

Max length6
Median length6
Mean length4.4987277
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공공기관
2nd row공공기관
3rd row지방자치단체
4th row지방자치단체
5th row지방자치단체

Common Values

ValueCountFrequency (%)
지방자치단체 231
58.8%
민간 116
29.5%
공공기관 29
 
7.4%
국가 17
 
4.3%

Length

2023-12-13T08:31:37.900580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:31:38.005825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지방자치단체 231
58.8%
민간 116
29.5%
공공기관 29
 
7.4%
국가 17
 
4.3%

화장실설치장소유형
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
공공용시설
220 
주유소·충전소
90 
관광·체육시설
44 
기타
 
21
교통시설
 
9

Length

Max length7
Median length5
Mean length5.475827
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주유소·충전소
2nd row주유소·충전소
3rd row공공용시설
4th row공공용시설
5th row공공용시설

Common Values

ValueCountFrequency (%)
공공용시설 220
56.0%
주유소·충전소 90
22.9%
관광·체육시설 44
 
11.2%
기타 21
 
5.3%
교통시설 9
 
2.3%
상업시설 9
 
2.3%

Length

2023-12-13T08:31:38.130757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:31:38.264687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공공용시설 220
56.0%
주유소·충전소 90
22.9%
관광·체육시설 44
 
11.2%
기타 21
 
5.3%
교통시설 9
 
2.3%
상업시설 9
 
2.3%

오물처리방식
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
수세식
385 
수거식
 
8

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수세식
2nd row수세식
3rd row수세식
4th row수세식
5th row수세식

Common Values

ValueCountFrequency (%)
수세식 385
98.0%
수거식 8
 
2.0%

Length

2023-12-13T08:31:38.399992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:31:38.784616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수세식 385
98.0%
수거식 8
 
2.0%

화장실정보공개여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size525.0 B
True
214 
False
179 
ValueCountFrequency (%)
True 214
54.5%
False 179
45.5%
2023-12-13T08:31:38.887568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

비상벨설치여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size525.0 B
False
334 
True
59 
ValueCountFrequency (%)
False 334
85.0%
True 59
 
15.0%
2023-12-13T08:31:38.980559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

기저귀교환대유무
Boolean

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size525.0 B
False
353 
True
40 
ValueCountFrequency (%)
False 353
89.8%
True 40
 
10.2%
2023-12-13T08:31:39.082734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

기저귀교환대장소
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
<NA>
353 
여자화장실
 
25
남자화장실+여자화장실
 
11
여자화장실+남자화장실
 
2
남자화장실
 
2

Length

Max length11
Median length4
Mean length4.3002545
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 353
89.8%
여자화장실 25
 
6.4%
남자화장실+여자화장실 11
 
2.8%
여자화장실+남자화장실 2
 
0.5%
남자화장실 2
 
0.5%

Length

2023-12-13T08:31:39.207276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:31:39.323196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 353
89.8%
여자화장실 25
 
6.4%
남자화장실+여자화장실 11
 
2.8%
여자화장실+남자화장실 2
 
0.5%
남자화장실 2
 
0.5%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-09-27
393 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-09-27
2nd row2023-09-27
3rd row2023-09-27
4th row2023-09-27
5th row2023-09-27

Common Values

ValueCountFrequency (%)
2023-09-27 393
100.0%

Length

2023-12-13T08:31:39.437411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:31:39.530899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-09-27 393
100.0%

Interactions

2023-12-13T08:31:33.125101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:31:39.605043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번화장실구분소재지지번주소남녀공용화장실여부개방시간화장실소유구분화장실설치장소유형오물처리방식화장실정보공개여부비상벨설치여부기저귀교환대유무기저귀교환대장소
연번1.0000.5591.0000.0860.5950.7370.5550.0510.5660.1940.0990.457
화장실구분0.5591.0001.0000.0810.8230.6730.1890.6940.1140.0010.0000.000
소재지지번주소1.0001.0001.0000.8800.9830.9961.0001.0001.0000.8240.6531.000
남녀공용화장실여부0.0860.0810.8801.0000.0000.1210.2960.0000.0000.1550.1520.000
개방시간0.5950.8230.9830.0001.0000.5270.7750.6620.4160.7870.1990.484
화장실소유구분0.7370.6730.9960.1210.5271.0000.7140.0000.8860.3420.2940.676
화장실설치장소유형0.5550.1891.0000.2960.7750.7141.0000.0000.8560.6620.4640.516
오물처리방식0.0510.6941.0000.0000.6620.0000.0001.0000.0000.0000.0000.000
화장실정보공개여부0.5660.1141.0000.0000.4160.8860.8560.0001.0000.2180.1880.000
비상벨설치여부0.1940.0010.8240.1550.7870.3420.6620.0000.2181.0000.4060.727
기저귀교환대유무0.0990.0000.6530.1520.1990.2940.4640.0000.1880.4061.000NaN
기저귀교환대장소0.4570.0001.0000.0000.4840.6760.5160.0000.0000.727NaN1.000
2023-12-13T08:31:39.776663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
비상벨설치여부화장실설치장소유형기저귀교환대장소화장실소유구분화장실정보공개여부기저귀교환대유무남녀공용화장실여부개방시간오물처리방식화장실구분
비상벨설치여부1.0000.4820.5050.2280.1400.2660.0990.6280.0000.000
화장실설치장소유형0.4821.0000.2170.5450.6610.3330.2120.4840.0000.122
기저귀교환대장소0.5050.2171.0000.3210.0001.0000.0000.3330.0000.000
화장실소유구분0.2280.5450.3211.0000.6930.1960.0800.3010.0000.322
화장실정보공개여부0.1400.6610.0000.6931.0000.1200.0000.3210.0000.075
기저귀교환대유무0.2660.3331.0000.1960.1201.0000.0970.1530.0000.000
남녀공용화장실여부0.0990.2120.0000.0800.0000.0971.0000.0000.0000.053
개방시간0.6280.4840.3330.3010.3210.1530.0001.0000.5190.587
오물처리방식0.0000.0000.0000.0000.0000.0000.0000.5191.0000.490
화장실구분0.0000.1220.0000.3220.0750.0000.0530.5870.4901.000
2023-12-13T08:31:39.939902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번화장실구분남녀공용화장실여부개방시간화장실소유구분화장실설치장소유형오물처리방식화장실정보공개여부비상벨설치여부기저귀교환대유무기저귀교환대장소
연번1.0000.3660.0640.2620.5390.3310.0380.4320.1470.0750.253
화장실구분0.3661.0000.0530.5870.3220.1220.4900.0750.0000.0000.000
남녀공용화장실여부0.0640.0531.0000.0000.0800.2120.0000.0000.0990.0970.000
개방시간0.2620.5870.0001.0000.3010.4840.5190.3210.6280.1530.333
화장실소유구분0.5390.3220.0800.3011.0000.5450.0000.6930.2280.1960.321
화장실설치장소유형0.3310.1220.2120.4840.5451.0000.0000.6610.4820.3330.217
오물처리방식0.0380.4900.0000.5190.0000.0001.0000.0000.0000.0000.000
화장실정보공개여부0.4320.0750.0000.3210.6930.6610.0001.0000.1400.1200.000
비상벨설치여부0.1470.0000.0990.6280.2280.4820.0000.1401.0000.2660.505
기저귀교환대유무0.0750.0000.0970.1530.1960.3330.0000.1200.2661.0001.000
기저귀교환대장소0.2530.0000.0000.3330.3210.2170.0000.0000.5051.0001.000

Missing values

2023-12-13T08:31:33.296322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:31:33.540569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T08:31:33.702218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번화장실구분화장실명소재지도로명주소소재지지번주소남녀공용화장실여부관리기관명관리기관전화번호개방시간화장실소유구분화장실설치장소유형오물처리방식화장실정보공개여부비상벨설치여부기저귀교환대유무기저귀교환대장소데이터기준일자
01개방화장실김천(부산)주유소(1층)경상북도 김천시 농소면 경부고속도로 193<NA>N김천(부산)주유소054-435-026624시간공공기관주유소·충전소수세식YYN<NA>2023-09-27
12개방화장실김천(서울)주유소(1층)경상북도 김천시 농소면 경부고속도로 94-2<NA>N김천(서울)주유소054-431-447724시간공공기관주유소·충전소수세식YYN<NA>2023-09-27
23공중화장실감천제방공중화장실경상북도 김천시 양천동 1774-47경상북도 김천시 양천동 1774-47N김천시 양금동054-421-2354상시개방지방자치단체공공용시설수세식YNN<NA>2023-09-27
34공중화장실황금시장공중화장실경상북도 김천시 황금시장3길 15경상북도 김천시 황금동 31-10N김천시 양금동054-421-2354상시개방지방자치단체공공용시설수세식YYN<NA>2023-09-27
45공중화장실황금시장플랫폼 화장실경상북도 김천시 황금시장5길 4경상북도 김천시 황금동 20-9N김천시 원도심재생과054-420-6847상시개방지방자치단체공공용시설수세식YNN<NA>2023-09-27
56개방화장실공공조달역량개발원 화장실경상북도 김천시 혁신로 316-20(율곡동)경상북도 김천시 율곡동 6-1566N공공조달역량개발원 교육운영과054-716-152309:00~18:00국가공공용시설수세식YNN<NA>2023-09-27
67개방화장실김천지원 사무동4층경상북도 김천시 물망골길 39<NA>N김천지원 서무계054-420-203309:00~18:00국가공공용시설수세식NNN<NA>2023-09-27
78개방화장실김천지원 사무동3층경상북도 김천시 물망골길 39<NA>N김천지원 서무계054-420-203309:00~18:00국가공공용시설수세식NNN<NA>2023-09-27
89개방화장실김천지원 사무동2층경상북도 김천시 물망골길 39<NA>N김천지원 서무계054-420-203309:00~18:00국가공공용시설수세식NNN<NA>2023-09-27
910개방화장실김천지원 사무동1층경상북도 김천시 물망골길 39<NA>N김천지원 서무계054-420-203309:00~18:00국가공공용시설수세식NNN<NA>2023-09-27
연번화장실구분화장실명소재지도로명주소소재지지번주소남녀공용화장실여부관리기관명관리기관전화번호개방시간화장실소유구분화장실설치장소유형오물처리방식화장실정보공개여부비상벨설치여부기저귀교환대유무기저귀교환대장소데이터기준일자
383384공중화장실개령면보건지소 화장실경상북도 김천시 개령면 서부3길 107 (동부리, 개령면보건지소)<NA>N김천시 개령면보건지소054-430-546509:00~18:00지방자치단체공공용시설수세식YNN<NA>2023-09-27
384385공중화장실김천시 개령면사무소 화장실경상북도 김천시 개령면 동부길 59-14 (동부리, 개령면사무소)<NA>N김천시 개령면054-436-530109:00~18:00지방자치단체공공용시설수세식YNN<NA>2023-09-27
385386공중화장실강변공원화장실경상북도 김천시 강변공원길 169 (교동)<NA>N김천시청 산림녹지과054-420-675700:00~00:00지방자치단체관광·체육시설수세식YYN<NA>2023-09-27
386387공중화장실김천시 감천면사무소 (2층)경상북도 김천시 감천면 금감로 1552 (광기리, 감천면사무소)<NA>N김천시 감천면사무소054-421-218209:00~18:00지방자치단체공공용시설수세식YNN<NA>2023-09-27
387388공중화장실김천시 감천면보건지소 화장실경상북도 김천시 감천면 금감로 1552 (광기리, 감천면사무소)<NA>N김천시 감천면 보건지소054-431-030109:00~18:00지방자치단체공공용시설수거식YNN<NA>2023-09-27
388389공중화장실김천시 감천면사무소 (1층)경상북도 김천시 감천면 금감로 1552 (광기리, 감천면사무소)<NA>N김천시 감천면사무소054-421-218209:00~18:00지방자치단체공공용시설수거식YNN<NA>2023-09-27
389390공중화장실김천감문휴요양병원 화장실경상북도 김천시 감문면 문화로 252 (남곡리)<NA>N휴요양병원 원무과054-437-270009:00~17:00민간기타수세식NYN<NA>2023-09-27
390391공중화장실감문면 족구장경상북도 김천시 감문면 감문로 975-6<NA>N김천시 감문면사무소054-430-530109:00~18:00지방자치단체관광·체육시설수세식YNN<NA>2023-09-27
391392개방화장실감문농협 화장실경상북도 김천시 감문면 감문로 1142 (삼성리)<NA>N감문농협054-430-500909:00~16:00민간상업시설수세식NNN<NA>2023-09-27
392393공중화장실김천시 삼성보건진료소 화장실경상북도 김천시 감문면 감문1로 218 (삼성리, 삼성보건진료소)<NA>Y김천시 삼성보건진료소054-433-566309:00~18:00지방자치단체공공용시설수세식YNN<NA>2023-09-27