Overview

Dataset statistics

Number of variables: 3
Number of observations: 399
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.3%
Total size in memory: 9.5 KiB
Average record size in memory: 24.3 B

Variable types

Text: 2
Categorical: 1

Dataset

Description: Religious facility data for Seosan-si, Chungcheongnam-do. The fields are facility name (시설명), address (주소), and religion category (종교구분). For inquiries, call 041-660-2224.
URL: https://www.data.go.kr/data/15117738/fileData.do

Alerts

Dataset has 1 (0.3%) duplicate rows [Duplicates]
종교구분 is highly imbalanced (57.7%) [Imbalance]
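
The 57.7% imbalance figure can be checked against the category counts listed for 종교구분 under Common Values. The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by log2 of the number of classes. That definition is an assumption that happens to match the reported value, and the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform distribution, near 1 when one class dominates.

    Assumed definition (normalized Shannon entropy); not confirmed by the report itself.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Counts for 종교구분 as listed under Common Values in this report.
counts = [306, 62, 20, 9, 1, 1]
score = imbalance_score(counts)
print(f"{score:.1%}")  # prints 57.7%
```

A perfectly balanced column would score 0 under this definition, so 57.7% reflects how strongly the 306 기독교 rows dominate the 399 observations.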

Reproduction

Analysis started: 2023-12-12 06:10:26.660144
Analysis finished: 2023-12-12 06:10:27.084504
Duration: 0.42 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시설명
Text

Distinct: 390
Distinct (%): 97.7%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 23
Median length: 19
Mean length: 8.2406015
Min length: 2
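
As a rough illustration of how such length statistics can be computed, the sketch below uses only the five sample rows shown in this report, so its numbers differ from the full-column figures above. The helper name `length_stats` is hypothetical, and length is assumed to mean the number of Unicode code points, which is what Python's `len` returns.

```python
import statistics

def length_stats(values):
    """Summarize string lengths (in code points) for a list of values."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": statistics.median(lengths),
        "mean": statistics.mean(lengths),
        "min": min(lengths),
    }

# The five sample rows of 시설명 shown in this report (not the full column).
sample = [
    "삼마기도원",
    "로뎀수양관",
    "대한연합기도원",
    "대한기독교 하나님의 성회 임마누엘수양원",
    "예수교대한성결교회 서울반석교회 수양관",
]
stats = length_stats(sample)
print(stats)
```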

Characters and Unicode

Total characters: 3288
Distinct characters: 261
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
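
A minimal sketch of this kind of analysis, using Python's standard `unicodedata` module to count characters by general category. The helper name `category_counts` is hypothetical, and the example string is one facility name taken from the Sample section. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Ps` is Open Punctuation, and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count characters by Unicode general category (two-letter code)."""
    return Counter(unicodedata.category(ch) for ch in text)

# One facility name from the Sample section of this report.
name = "라마나욧 기도원(예람교회)"
counts = category_counts(name)
print(counts)
```

Script and block statistics need data that `unicodedata` does not expose directly, so they are left out of this sketch.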

Unique

Unique: 381
Unique (%): 95.5%

Sample

1st row: 삼마기도원
2nd row: 로뎀수양관
3rd row: 대한연합기도원
4th row: 대한기독교 하나님의 성회 임마누엘수양원
5th row: 예수교대한성결교회 서울반석교회 수양관

Value | Count | Frequency (%)
기독교대한감리회 | 27 | 4.7%
대한예수교장로회 | 22 | 3.9%
기독교대한성결교회 | 21 | 3.7%
교회 | 11 | 1.9%
서산교회 | 9 | 1.6%
예수교대한성결교회 | 6 | 1.1%
서산 | 4 | 0.7%
벧엘교회 | 3 | 0.5%
임마누엘교회 | 3 | 0.5%
사랑의 | 3 | 0.5%
Other values (427) | 460 | 80.8%
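
A word-frequency table like the one above can be approximated by splitting each value on whitespace and counting tokens. This is a sketch under that assumption; the report does not state how words were tokenized, and `word_frequencies` is a hypothetical helper. The input rows are three facility names from the Sample section.

```python
from collections import Counter

def word_frequencies(values):
    """Return (word, count, share) tuples, most frequent first, using whitespace tokens."""
    words = Counter(word for value in values for word in value.split())
    total = sum(words.values())
    return [(word, count, count / total) for word, count in words.most_common()]

# Three facility names from the Sample section of this report.
names = [
    "천주교 대전교구 해미성당",
    "천주교 대전교구 서산동문교회",
    "천주교 대전교구 성연성당",
]
table = word_frequencies(names)
for word, count, share in table:
    print(f"{word} | {count} | {share:.1%}")
```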

Most occurring characters

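Value | Count | Frequency (%)
(not preserved) | 463 | 14.1%
(not preserved) | 411 | 12.5%
(space) | 172 | 5.2%
(not preserved) | 130 | 4.0%
(not preserved) | 120 | 3.6%
(not preserved) | 88 | 2.7%
(not preserved) | 82 | 2.5%
(not preserved) | 79 | 2.4%
(not preserved) | 79 | 2.4%
(not preserved) | 67 | 2.0%
Other values (251) | 1597 | 48.6%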

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3078 | 93.6%
Space Separator | 172 | 5.2%
Close Punctuation | 16 | 0.5%
Open Punctuation | 16 | 0.5%
Lowercase Letter | 4 | 0.1%
Decimal Number | 1 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

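Other Letter
Value | Count | Frequency (%)
(not preserved) | 463 | 15.0%
(not preserved) | 411 | 13.4%
(not preserved) | 130 | 4.2%
(not preserved) | 120 | 3.9%
(not preserved) | 88 | 2.9%
(not preserved) | 82 | 2.7%
(not preserved) | 79 | 2.6%
(not preserved) | 79 | 2.6%
(not preserved) | 67 | 2.2%
(not preserved) | 65 | 2.1%
Other values (242) | 1494 | 48.5%

Lowercase Letter
Value | Count | Frequency (%)
t | 1 | 25.0%
u | 1 | 25.0%
r | 1 | 25.0%
c | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 172 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 16 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 16 | 100.0%

Decimal Number
Value | Count | Frequency (%)
7 | 1 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%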

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3078 | 93.6%
Common | 206 | 6.3%
Latin | 4 | 0.1%

Most frequent character per script

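Hangul
Value | Count | Frequency (%)
(not preserved) | 463 | 15.0%
(not preserved) | 411 | 13.4%
(not preserved) | 130 | 4.2%
(not preserved) | 120 | 3.9%
(not preserved) | 88 | 2.9%
(not preserved) | 82 | 2.7%
(not preserved) | 79 | 2.6%
(not preserved) | 79 | 2.6%
(not preserved) | 67 | 2.2%
(not preserved) | 65 | 2.1%
Other values (242) | 1494 | 48.5%

Common
Value | Count | Frequency (%)
(space) | 172 | 83.5%
) | 16 | 7.8%
( | 16 | 7.8%
7 | 1 | 0.5%
, | 1 | 0.5%

Latin
Value | Count | Frequency (%)
t | 1 | 25.0%
u | 1 | 25.0%
r | 1 | 25.0%
c | 1 | 25.0%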

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3078 | 93.6%
ASCII | 210 | 6.4%
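
The split between the two blocks can be illustrated by classifying code points by range. The sketch below assumes that the block labelled "Hangul" in this report corresponds to the Hangul Syllables block (U+AC00 to U+D7AF) and that "ASCII" means code points below U+0080. The function `block_of` is a hypothetical helper, and the example string is an address from the Sample section.

```python
from collections import Counter

def block_of(ch):
    """Classify a character into a coarse Unicode block by code point range."""
    cp = ord(ch)
    if cp < 0x80:
        return "ASCII"
    if 0xAC00 <= cp <= 0xD7AF:
        # Assumption: the report's "Hangul" block is the Hangul Syllables block.
        return "Hangul"
    return "Other"

# One address from the Sample section of this report.
address = "충청남도 서산시 고북면 신성로 330(공군부대내)"
blocks = Counter(block_of(ch) for ch in address)
print(blocks)
```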

Most frequent character per block

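Hangul
Value | Count | Frequency (%)
(not preserved) | 463 | 15.0%
(not preserved) | 411 | 13.4%
(not preserved) | 130 | 4.2%
(not preserved) | 120 | 3.9%
(not preserved) | 88 | 2.9%
(not preserved) | 82 | 2.7%
(not preserved) | 79 | 2.6%
(not preserved) | 79 | 2.6%
(not preserved) | 67 | 2.2%
(not preserved) | 65 | 2.1%
Other values (242) | 1494 | 48.5%

ASCII
Value | Count | Frequency (%)
(space) | 172 | 81.9%
) | 16 | 7.6%
( | 16 | 7.6%
t | 1 | 0.5%
u | 1 | 0.5%
r | 1 | 0.5%
7 | 1 | 0.5%
c | 1 | 0.5%
, | 1 | 0.5%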

주소
Text

Distinct: 393
Distinct (%): 98.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 43
Median length: 36
Mean length: 21.20802
Min length: 15

Characters and Unicode

Total characters: 8462
Distinct characters: 205
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 387
Unique (%): 97.0%

Sample

1st row: 충청남도 서산시 해미면 삼송장승길 38-96
2nd row: 충청남도 서산시 해미면 산수1길 210
3rd row: 충청남도 서산시 해미면 반양2길 33-13
4th row: 충청남도 서산시 해미면 관터로 79-1
5th row: 충청남도 서산시 해미면 관터로 355

Value | Count | Frequency (%)
충청남도 | 399 | 20.9%
서산시 | 399 | 20.9%
대산읍 | 36 | 1.9%
해미면 | 36 | 1.9%
음암면 | 33 | 1.7%
운산면 | 33 | 1.7%
부석면 | 26 | 1.4%
인지면 | 24 | 1.3%
팔봉면 | 22 | 1.2%
고북면 | 21 | 1.1%
Other values (627) | 876 | 46.0%

Most occurring characters

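Value | Count | Frequency (%)
(space) | 1543 | 18.2%
(not preserved) | 519 | 6.1%
(not preserved) | 416 | 4.9%
(not preserved) | 415 | 4.9%
(not preserved) | 414 | 4.9%
(not preserved) | 407 | 4.8%
(not preserved) | 403 | 4.8%
(not preserved) | 401 | 4.7%
1 | 394 | 4.7%
(not preserved) | 226 | 2.7%
Other values (195) | 3324 | 39.3%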

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5180 | 61.2%
Space Separator | 1543 | 18.2%
Decimal Number | 1478 | 17.5%
Dash Punctuation | 173 | 2.0%
Close Punctuation | 38 | 0.4%
Open Punctuation | 38 | 0.4%
Other Punctuation | 12 | 0.1%

Most frequent character per category

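Other Letter
Value | Count | Frequency (%)
(not preserved) | 519 | 10.0%
(not preserved) | 416 | 8.0%
(not preserved) | 415 | 8.0%
(not preserved) | 414 | 8.0%
(not preserved) | 407 | 7.9%
(not preserved) | 403 | 7.8%
(not preserved) | 401 | 7.7%
(not preserved) | 226 | 4.4%
(not preserved) | 216 | 4.2%
(not preserved) | 180 | 3.5%
Other values (180) | 1583 | 30.6%

Decimal Number
Value | Count | Frequency (%)
1 | 394 | 26.7%
2 | 203 | 13.7%
3 | 161 | 10.9%
4 | 142 | 9.6%
5 | 124 | 8.4%
6 | 104 | 7.0%
7 | 93 | 6.3%
9 | 90 | 6.1%
8 | 85 | 5.8%
0 | 82 | 5.5%

Space Separator
Value | Count | Frequency (%)
(space) | 1543 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 173 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 38 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 38 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 12 | 100.0%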

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5180 | 61.2%
Common | 3282 | 38.8%

Most frequent character per script

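Hangul
Value | Count | Frequency (%)
(not preserved) | 519 | 10.0%
(not preserved) | 416 | 8.0%
(not preserved) | 415 | 8.0%
(not preserved) | 414 | 8.0%
(not preserved) | 407 | 7.9%
(not preserved) | 403 | 7.8%
(not preserved) | 401 | 7.7%
(not preserved) | 226 | 4.4%
(not preserved) | 216 | 4.2%
(not preserved) | 180 | 3.5%
Other values (180) | 1583 | 30.6%

Common
Value | Count | Frequency (%)
(space) | 1543 | 47.0%
1 | 394 | 12.0%
2 | 203 | 6.2%
- | 173 | 5.3%
3 | 161 | 4.9%
4 | 142 | 4.3%
5 | 124 | 3.8%
6 | 104 | 3.2%
7 | 93 | 2.8%
9 | 90 | 2.7%
Other values (5) | 255 | 7.8%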

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5180 | 61.2%
ASCII | 3282 | 38.8%

Most frequent character per block

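ASCII
Value | Count | Frequency (%)
(space) | 1543 | 47.0%
1 | 394 | 12.0%
2 | 203 | 6.2%
- | 173 | 5.3%
3 | 161 | 4.9%
4 | 142 | 4.3%
5 | 124 | 3.8%
6 | 104 | 3.2%
7 | 93 | 2.8%
9 | 90 | 2.7%
Other values (5) | 255 | 7.8%

Hangul
Value | Count | Frequency (%)
(not preserved) | 519 | 10.0%
(not preserved) | 416 | 8.0%
(not preserved) | 415 | 8.0%
(not preserved) | 414 | 8.0%
(not preserved) | 407 | 7.9%
(not preserved) | 403 | 7.8%
(not preserved) | 401 | 7.7%
(not preserved) | 226 | 4.4%
(not preserved) | 216 | 4.2%
(not preserved) | 180 | 3.5%
Other values (180) | 1583 | 30.6%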

종교구분
Categorical

IMBALANCE 

Distinct: 6
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.2 KiB

Length

Max length: 3
Median length: 3
Mean length: 2.8446115
Min length: 2

Unique

Unique: 2
Unique (%): 0.5%

Sample

1st row: 기도원
2nd row: 기도원
3rd row: 기도원
4th row: 기도원
5th row: 기도원

Common Values

Value | Count | Frequency (%)
기독교 | 306 | 76.7%
불교 | 62 | 15.5%
기도원 | 20 | 5.0%
천주교 | 9 | 2.3%
신천지 | 1 | 0.3%
이슬람 | 1 | 0.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values of 종교구분]

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

Index | 시설명 | 주소 | 종교구분
0 | 삼마기도원 | 충청남도 서산시 해미면 삼송장승길 38-96 | 기도원
1 | 로뎀수양관 | 충청남도 서산시 해미면 산수1길 210 | 기도원
2 | 대한연합기도원 | 충청남도 서산시 해미면 반양2길 33-13 | 기도원
3 | 대한기독교 하나님의 성회 임마누엘수양원 | 충청남도 서산시 해미면 관터로 79-1 | 기도원
4 | 예수교대한성결교회 서울반석교회 수양관 | 충청남도 서산시 해미면 관터로 355 | 기도원
5 | 임마누엘기도원 | 충청남도 서산시 팔봉면 흑석중앙길 40-49 | 기도원
6 | 한영장로교회수양관 | 충청남도 서산시 팔봉면 범머리길 507 | 기도원
7 | 라마나욧 기도원(예람교회) | 충청남도 서산시 음암면 충청남도 서산시 음암면 황금터길 172-13 | 기도원
8 | 예수피난처기도원 | 충청남도 서산시 운산면 해운로 1191 | 기도원
9 | 엘림하우스 | 충청남도 서산시 운산면 장생동로 181-54 | 기도원

Index | 시설명 | 주소 | 종교구분
389 | 이슬람 예배당 | 충청남도 서산시 시장4길 16 2층 | 이슬람
390 | 천주교해미성지 | 충청남도 서산시 해미면 성지1로 13 | 천주교
391 | 천주교 대전교구 해미성당 | 충청남도 서산시 해미면 남문5로 30-10 | 천주교
392 | 석림성당 | 충청남도 서산시 중앙로 210 | 천주교
393 | 운산성당 | 충청남도 서산시 운산면 해운로 1125 | 천주교
394 | 천주교 대전교구 서산동문교회 | 충청남도 서산시 서령로 53 | 천주교
395 | 천주교서산예천동성당 | 충청남도 서산시 무학로 1864-8 | 천주교
396 | 대산성당 | 충청남도 서산시 대산읍 충의로 1882-5 | 천주교
397 | 천주교 대전교구 성연성당 | 충청남도 서산시 성연면 명천1길 119-3 | 천주교
398 | 용성대성당 | 충청남도 서산시 고북면 신성로 330(공군부대내) | 천주교

Duplicate rows

Most frequently occurring

Index | 시설명 | 주소 | 종교구분 | # duplicates
0 | 재림예수교 서산교회 | 충청남도 서산시 주을2길 24 | 기독교 | 2
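
Duplicate rows of this kind can be found by counting identical (시설명, 주소, 종교구분) tuples. The sketch below is a minimal illustration with a hypothetical helper `duplicate_rows`. The three-row input is built from rows shown in this report, with the duplicated row entered twice, and is not the full dataset.

```python
from collections import Counter

def duplicate_rows(rows):
    """Return each row that occurs more than once, mapped to its occurrence count."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

# Illustrative input: the duplicated row from this report plus one unrelated row.
rows = [
    ("재림예수교 서산교회", "충청남도 서산시 주을2길 24", "기독교"),
    ("석림성당", "충청남도 서산시 중앙로 210", "천주교"),
    ("재림예수교 서산교회", "충청남도 서산시 주을2길 24", "기독교"),
]
dups = duplicate_rows(rows)
# Redundant rows: occurrences beyond the first for each duplicated row.
redundant = sum(n - 1 for n in dups.values())
print(dups, redundant)
```

A row that occurs twice contributes one redundant row, which is consistent with the report listing 1 duplicate row (1 of 399, about 0.3%) alongside a "# duplicates" value of 2.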