Overview

Dataset statistics

Number of variables: 3
Number of observations: 211
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 5.1 KiB
Average record size in memory: 24.6 B

Variable types

Categorical: 1
Text: 2

Dataset

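Description: Data on religious facilities located in Goseong-gun, Gyeongsangnam-do, with fields such as category (구분), facility name (시설명), and address (주소). * Some values may be blank because the data was not collected, contained personal information, or for similar reasons.
Author: 공공데이터포털 (Public Data Portal)
URL: https://www.data.go.kr/data/15117743/fileData.do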

Alerts

Imbalance: 구분 (category) is highly imbalanced (55.3%)
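
The 55.3% figure is consistent with one minus the normalized Shannon entropy of the category counts. The sketch below recomputes it from the counts in the Common Values table of 구분. The function name `imbalance_score` is illustrative, and the formula is an assumption about how the profiler derives the score, checked only against the number in this report.

```python
import math

# Category counts for 구분, copied from the Common Values table.
counts = [119, 76, 4, 4, 3, 1, 1, 1, 1, 1]


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy in bits and
    k the number of classes (assumed definition of the score)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class carries no entropy to normalize
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

A score of 0 would mean perfectly balanced classes and a score of 1 a single dominant class. Here two of the ten classes hold 195 of the 211 rows, which pushes the score above one half.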

Reproduction

Analysis started: 2024-04-18 05:21:22.370362
Analysis finished: 2024-04-18 05:21:24.507864
Duration: 2.14 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

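구분 (category)
Categorical

IMBALANCE

Distinct: 10
Distinct (%): 4.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Value | Count
불교 (Buddhism) | 119
개신교 (Protestantism) | 76
천주교 (Catholicism) | 4
기타 (Other) | 4
천리교 (Tenrikyo) | 3
Other values (5) | 5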

Length

Max length: 6
Median length: 2
Mean length: 2.4549763
Min length: 2
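
The length statistics can be checked from the value counts alone. The following sketch expands the counts from the Common Values table of 구분 into one string length per observation; it assumes length means the number of Unicode code points, which is what Python's `len` returns for these precomposed Hangul strings.

```python
import statistics

# Value counts for 구분, copied from the Common Values table.
value_counts = {
    "불교": 119, "개신교": 76, "천주교": 4, "기타": 4, "천리교": 3,
    "원불교": 1, "통일교": 1, "여호와의증인": 1, "하나님의교회": 1, "대순진리교": 1,
}

# One length per observation (211 values in total).
lengths = [len(value) for value, n in value_counts.items() for _ in range(n)]

summary = {
    "max": max(lengths),
    "median": statistics.median(lengths),
    "mean": statistics.mean(lengths),
    "min": min(lengths),
}
print(summary)
```

The median is 2 because 123 of the 211 values (불교 and 기타) are two characters long.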

Unique

Unique: 5
Unique (%): 2.4%
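
In this report "distinct" and "unique" are different counts: the numbers fit a reading where distinct is the number of different values and unique is the number of values that occur exactly once. A minimal sketch of that reading, using the 구분 counts from the Common Values table (the helper name `distinct_and_unique` is illustrative):

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Rebuild the 구분 column from its value counts.
value_counts = [
    ("불교", 119), ("개신교", 76), ("천주교", 4), ("기타", 4), ("천리교", 3),
    ("원불교", 1), ("통일교", 1), ("여호와의증인", 1), ("하나님의교회", 1), ("대순진리교", 1),
]
column = [value for value, n in value_counts for _ in range(n)]

distinct, unique = distinct_and_unique(column)
print(distinct, unique)  # 10 5
print(f"{distinct / len(column):.1%} {unique / len(column):.1%}")
```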

Sample

1st row: 개신교
2nd row: 개신교
3rd row: 개신교
4th row: 개신교
5th row: 개신교

Common Values

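Value | Count | Frequency (%)
불교 (Buddhism) | 119 | 56.4%
개신교 (Protestantism) | 76 | 36.0%
천주교 (Catholicism) | 4 | 1.9%
기타 (Other) | 4 | 1.9%
천리교 (Tenrikyo) | 3 | 1.4%
원불교 (Won Buddhism) | 1 | 0.5%
통일교 (Unification Church) | 1 | 0.5%
여호와의증인 (Jehovah's Witnesses) | 1 | 0.5%
하나님의교회 (Church of God) | 1 | 0.5%
대순진리교 (Daesun Jinrihoe) | 1 | 0.5%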

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values]

시설명 (facility name)
Text

Distinct: 205
Distinct (%): 97.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 13
Median length: 3
Mean length: 4.1279621
Min length: 3

Characters and Unicode

Total characters: 871
Distinct characters: 189
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 199
Unique (%): 94.3%

Sample

1st row: 고성교회
2nd row: 고성제일교회
3rd row: 고성순복음교회
4th row: 고성중앙성결교회
5th row: 고성침례교회

Value | Count | Frequency (%)
성산교회 | 2 | 0.9%
보현사 | 2 | 0.9%
감로사 | 2 | 0.9%
샬롬교회 | 2 | 0.9%
해동사 | 2 | 0.9%
청룡사 | 2 | 0.9%
금곡사 | 1 | 0.5%
청암사 | 1 | 0.5%
폭포암 | 1 | 0.5%
일출암 | 1 | 0.5%
Other values (195) | 195 | 92.4%

Most occurring characters

Value | Count | Frequency (%)
[?] | 94 | 10.8%
[?] | 89 | 10.2%
[?] | 87 | 10.0%
[?] | 32 | 3.7%
[?] | 31 | 3.6%
[?] | 16 | 1.8%
[?] | 16 | 1.8%
[?] | 13 | 1.5%
[?] | 13 | 1.5%
[?] | 12 | 1.4%
Other values (179) | 468 | 53.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 866 | 99.4%
Open Punctuation | 2 | 0.2%
Close Punctuation | 2 | 0.2%
Decimal Number | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 94 | 10.9%
[?] | 89 | 10.3%
[?] | 87 | 10.0%
[?] | 32 | 3.7%
[?] | 31 | 3.6%
[?] | 16 | 1.8%
[?] | 16 | 1.8%
[?] | 13 | 1.5%
[?] | 13 | 1.5%
[?] | 12 | 1.4%
Other values (176) | 463 | 53.5%

Open Punctuation
Value | Count | Frequency (%)
( | 2 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 2 | 100.0%

Decimal Number
Value | Count | Frequency (%)
7 | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 866 | 99.4%
Common | 5 | 0.6%

Most frequent character per script

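Hangul
Value | Count | Frequency (%)
[?] | 94 | 10.9%
[?] | 89 | 10.3%
[?] | 87 | 10.0%
[?] | 32 | 3.7%
[?] | 31 | 3.6%
[?] | 16 | 1.8%
[?] | 16 | 1.8%
[?] | 13 | 1.5%
[?] | 13 | 1.5%
[?] | 12 | 1.4%
Other values (176) | 463 | 53.5%

Common
Value | Count | Frequency (%)
( | 2 | 40.0%
) | 2 | 40.0%
7 | 1 | 20.0%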

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 866 | 99.4%
ASCII | 5 | 0.6%

Most frequent character per block

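Hangul
Value | Count | Frequency (%)
[?] | 94 | 10.9%
[?] | 89 | 10.3%
[?] | 87 | 10.0%
[?] | 32 | 3.7%
[?] | 31 | 3.6%
[?] | 16 | 1.8%
[?] | 16 | 1.8%
[?] | 13 | 1.5%
[?] | 13 | 1.5%
[?] | 12 | 1.4%
Other values (176) | 463 | 53.5%

ASCII
Value | Count | Frequency (%)
( | 2 | 40.0%
) | 2 | 40.0%
7 | 1 | 20.0%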

주소 (address)
Text

Distinct: 208
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 27
Median length: 25
Mean length: 22.137441
Min length: 19

Characters and Unicode

Total characters: 4671
Distinct characters: 126
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
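
As a small illustration of those character properties, the sketch below classifies the characters of one address from the sample rows with Python's standard `unicodedata` module. The two-letter codes it returns correspond to the category names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Nd` is Decimal Number, and `Pd` is Dash Punctuation. The standard library has no script property, so the Hangul count here uses the character name as a rough proxy; that proxy is an assumption of this sketch, not how the report was produced.

```python
import unicodedata
from collections import Counter

# One address taken from the sample rows of this report (row 7).
address = "경상남도 고성군 고성읍 동외로 214-4"

# General category of every character, e.g. "Lo" for a Hangul syllable.
category_counts = Counter(unicodedata.category(ch) for ch in address)

# Rough script proxy: Hangul syllable names start with "HANGUL".
hangul_count = sum(
    1 for ch in address if unicodedata.name(ch, "").startswith("HANGUL")
)
other_count = len(address) - hangul_count

print(category_counts)
print(hangul_count, other_count)
```

Applied to all 211 addresses, the same per-character tally would yield the category table for this variable.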

Unique

Unique: 206
Unique (%): 97.6%

Sample

1st row: 경상남도 고성군 고성읍 중앙로25번길 12
2nd row: 경상남도 고성군 고성읍 남포로140번길 42
3rd row: 경상남도 고성군 고성읍 우산2길 312
4th row: 경상남도 고성군 고성읍 남산로 69
5th row: 경상남도 고성군 고성읍 동외로27번길 57

Value | Count | Frequency (%)
경상남도 | 211 | 20.2%
고성군 | 211 | 20.2%
고성읍 | 44 | 4.2%
대가면 | 25 | 2.4%
동해면 | 19 | 1.8%
상리면 | 16 | 1.5%
회화면 | 15 | 1.4%
거류면 | 15 | 1.4%
개천면 | 14 | 1.3%
마암면 | 12 | 1.1%
Other values (344) | 464 | 44.4%

Most occurring characters

Value | Count | Frequency (%)
(space) | 894 | 19.1%
[?] | 260 | 5.6%
[?] | 258 | 5.5%
[?] | 230 | 4.9%
[?] | 222 | 4.8%
[?] | 214 | 4.6%
[?] | 212 | 4.5%
[?] | 211 | 4.5%
1 | 192 | 4.1%
[?] | 167 | 3.6%
Other values (116) | 1811 | 38.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2807 | 60.1%
Space Separator | 894 | 19.1%
Decimal Number | 873 | 18.7%
Dash Punctuation | 97 | 2.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[?] | 260 | 9.3%
[?] | 258 | 9.2%
[?] | 230 | 8.2%
[?] | 222 | 7.9%
[?] | 214 | 7.6%
[?] | 212 | 7.6%
[?] | 211 | 7.5%
[?] | 167 | 5.9%
[?] | 135 | 4.8%
[?] | 82 | 2.9%
Other values (104) | 816 | 29.1%

Decimal Number
Value | Count | Frequency (%)
1 | 192 | 22.0%
2 | 119 | 13.6%
3 | 103 | 11.8%
4 | 103 | 11.8%
5 | 72 | 8.2%
6 | 67 | 7.7%
8 | 57 | 6.5%
7 | 56 | 6.4%
9 | 53 | 6.1%
0 | 51 | 5.8%

Space Separator
Value | Count | Frequency (%)
(space) | 894 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 97 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2807 | 60.1%
Common | 1864 | 39.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
[?] | 260 | 9.3%
[?] | 258 | 9.2%
[?] | 230 | 8.2%
[?] | 222 | 7.9%
[?] | 214 | 7.6%
[?] | 212 | 7.6%
[?] | 211 | 7.5%
[?] | 167 | 5.9%
[?] | 135 | 4.8%
[?] | 82 | 2.9%
Other values (104) | 816 | 29.1%

Common
Value | Count | Frequency (%)
(space) | 894 | 48.0%
1 | 192 | 10.3%
2 | 119 | 6.4%
3 | 103 | 5.5%
4 | 103 | 5.5%
- | 97 | 5.2%
5 | 72 | 3.9%
6 | 67 | 3.6%
8 | 57 | 3.1%
7 | 56 | 3.0%
Other values (2) | 104 | 5.6%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2807 | 60.1%
ASCII | 1864 | 39.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 894 | 48.0%
1 | 192 | 10.3%
2 | 119 | 6.4%
3 | 103 | 5.5%
4 | 103 | 5.5%
- | 97 | 5.2%
5 | 72 | 3.9%
6 | 67 | 3.6%
8 | 57 | 3.1%
7 | 56 | 3.0%
Other values (2) | 104 | 5.6%

Hangul
Value | Count | Frequency (%)
[?] | 260 | 9.3%
[?] | 258 | 9.2%
[?] | 230 | 8.2%
[?] | 222 | 7.9%
[?] | 214 | 7.6%
[?] | 212 | 7.6%
[?] | 211 | 7.5%
[?] | 167 | 5.9%
[?] | 135 | 4.8%
[?] | 82 | 2.9%
Other values (104) | 816 | 29.1%

Missing values

[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completeness.
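
To make the idea concrete, the sketch below builds a tiny nullity matrix in plain Python: one boolean per cell, True where a value is present. The first two rows come from the sample rows of this report; the third row is hypothetical and exists only to show what a gap looks like. Treating empty strings as missing is also an assumption of this sketch.

```python
rows = [
    ("개신교", "고성교회", "경상남도 고성군 고성읍 중앙로25번길 12"),
    ("기타", "옥산기도원", "경상남도 고성군 회화면 구만로 1215"),
    ("불교", None, ""),  # hypothetical row with gaps, not part of the dataset
]


def is_missing(cell):
    """Treat None and blank strings as missing (assumed rule)."""
    return cell is None or (isinstance(cell, str) and cell.strip() == "")


# True means the cell is filled, False means it is missing.
nullity = [[not is_missing(cell) for cell in row] for row in rows]

missing_cells = sum(row.count(False) for row in nullity)
total_cells = sum(len(row) for row in nullity)

# Text rendering: "#" for present, "." for missing.
for row in nullity:
    print("".join("#" if present else "." for present in row))
print(f"Missing cells: {missing_cells} ({missing_cells / total_cells:.1%})")
```

For the dataset profiled here every cell is filled, so the real matrix is solid and the missing-cell count is 0.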

Sample

First rows

Index | 구분 | 시설명 | 주소
0 | 개신교 | 고성교회 | 경상남도 고성군 고성읍 중앙로25번길 12
1 | 개신교 | 고성제일교회 | 경상남도 고성군 고성읍 남포로140번길 42
2 | 개신교 | 고성순복음교회 | 경상남도 고성군 고성읍 우산2길 312
3 | 개신교 | 고성중앙성결교회 | 경상남도 고성군 고성읍 남산로 69
4 | 개신교 | 고성침례교회 | 경상남도 고성군 고성읍 동외로27번길 57
5 | 개신교 | 천성교회 | 경상남도 고성군 고성읍 성내로 137
6 | 개신교 | 덕선교회 | 경상남도 고성군 고성읍 선동안길 36
7 | 개신교 | 동산교회 | 경상남도 고성군 고성읍 동외로 214-4
8 | 개신교 | 샘물교회 | 경상남도 고성군 고성읍 공룡로 3045-1
9 | 개신교 | 성실교회 | 경상남도 고성군 고성읍 중앙로80번길 9

Last rows

Index | 구분 | 시설명 | 주소
201 | 천리교 | 천리교교성교회 | 경상남도 고성군 고성읍 성내로76번길 13-7
202 | 천리교 | 천리교배둔교회 | 경상남도 고성군 회화면 배둔로 19번길 59-2
203 | 천리교 | 천리교구만교회 | 경상남도 고성군 구만면 구만로 814
204 | 여호와의증인 | 여호와의증인의왕국 | 경상남도 고성군 고성읍 동외리164번길 36
205 | 하나님의교회 | 하나님의교회 | 경상남도 고성군 고성읍 동외로 156번길 13
206 | 대순진리교 | 대순진리회부전우산회관 | 경상남도 고성군 고성읍 우산3길 30-8
207 | 기타 | 옥산기도원 | 경상남도 고성군 회화면 구만로 1215
208 | 기타 | 은혜기도원 | 경상남도 고성군 마암면 금호로 435-194
209 | 기타 | 천도교고성교구 | 경상남도 고성군 고성읍 동외로 118-9
210 | 기타 | 고성교회교육관 | 경상남도 고성군 고성읍 교사1길 29