Overview

Dataset statistics

Number of variables5
Number of observations522
Missing cells81
Missing cells (%)3.1%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory20.5 KiB
Average record size in memory40.3 B

Variable types

Categorical2
Text3

Dataset

Description강원특별자치도 원주시에 있는 종교시설 현황에 대한 데이터이며, 종교분류, 시설명칭, 주소, 전화번호를 포함합니다.
URLhttps://www.data.go.kr/data/15117521/fileData.do

Alerts

데이터기준일 has constant value ""Constant
Dataset has 1 (0.2%) duplicate rowsDuplicates
종교구분 is highly imbalanced (63.1%)Imbalance
전화번호 has 81 (15.5%) missing valuesMissing

Reproduction

Analysis started2023-12-12 01:55:12.065432
Analysis finished2023-12-12 01:55:12.689879
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

종교구분
Categorical

IMBALANCE 

Distinct6
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
개신교
409 
불교
88 
천주교
 
22
원불교
 
1
천도교
 
1

Length

Max length3
Median length3
Mean length2.8295019
Min length2

Unique

Unique3 ?
Unique (%)0.6%

Sample

1st row개신교
2nd row개신교
3rd row개신교
4th row개신교
5th row개신교

Common Values

ValueCountFrequency (%)
개신교 409
78.4%
불교 88
 
16.9%
천주교 22
 
4.2%
원불교 1
 
0.2%
천도교 1
 
0.2%
향교 1
 
0.2%

Length

2023-12-12T10:55:12.794614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:55:12.953133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개신교 409
78.4%
불교 88
 
16.9%
천주교 22
 
4.2%
원불교 1
 
0.2%
천도교 1
 
0.2%
향교 1
 
0.2%
Distinct511
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T10:55:13.245542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length7.98659
Min length3

Characters and Unicode

Total characters4169
Distinct characters283
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique501 ?
Unique (%)96.0%

Sample

1st row흰돌감리교회
2nd row문막순복음교회(기하성)
3rd row문막제일감리교회
4th row함께하는감리교회
5th row성산감리교회
ValueCountFrequency (%)
관음사 3
 
0.6%
교회 3
 
0.6%
제칠일안식일예수재림교회 2
 
0.4%
보문사 2
 
0.4%
하나님의 2
 
0.4%
약사암 2
 
0.4%
보현사 2
 
0.4%
샘물교회 2
 
0.4%
예수마을장로교회(합동 2
 
0.4%
석경사 2
 
0.4%
Other values (513) 514
95.9%
2023-12-12T10:55:13.758606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
426
 
10.2%
412
 
9.9%
) 232
 
5.6%
( 232
 
5.6%
165
 
4.0%
149
 
3.6%
146
 
3.5%
132
 
3.2%
111
 
2.7%
103
 
2.5%
Other values (273) 2061
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3688
88.5%
Close Punctuation 232
 
5.6%
Open Punctuation 232
 
5.6%
Space Separator 15
 
0.4%
Decimal Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
426
 
11.6%
412
 
11.2%
165
 
4.5%
149
 
4.0%
146
 
4.0%
132
 
3.6%
111
 
3.0%
103
 
2.8%
94
 
2.5%
93
 
2.5%
Other values (268) 1857
50.4%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
7 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 232
100.0%
Open Punctuation
ValueCountFrequency (%)
( 232
100.0%
Space Separator
ValueCountFrequency (%)
15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3688
88.5%
Common 481
 
11.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
426
 
11.6%
412
 
11.2%
165
 
4.5%
149
 
4.0%
146
 
4.0%
132
 
3.6%
111
 
3.0%
103
 
2.8%
94
 
2.5%
93
 
2.5%
Other values (268) 1857
50.4%
Common
ValueCountFrequency (%)
) 232
48.2%
( 232
48.2%
15
 
3.1%
1 1
 
0.2%
7 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3688
88.5%
ASCII 481
 
11.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
426
 
11.6%
412
 
11.2%
165
 
4.5%
149
 
4.0%
146
 
4.0%
132
 
3.6%
111
 
3.0%
103
 
2.8%
94
 
2.5%
93
 
2.5%
Other values (268) 1857
50.4%
ASCII
ValueCountFrequency (%)
) 232
48.2%
( 232
48.2%
15
 
3.1%
1 1
 
0.2%
7 1
 
0.2%

주소
Text

Distinct519
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2023-12-12T10:55:14.223443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length25
Mean length17.337165
Min length10

Characters and Unicode

Total characters9050
Distinct characters247
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique516 ?
Unique (%)98.9%

Sample

1st row원주시 문막읍 임동길 20(취병리)
2nd row원주시 문막읍 천마산길 46-1
3rd row원주시 문막읍 왕건로 96-14
4th row원주시 문막읍 왕건로 143-7
5th row원주시 문막읍 비두네미2길 2(비두리)
ValueCountFrequency (%)
원주시 522
27.9%
치악로 34
 
1.8%
문막읍 29
 
1.5%
소초면 28
 
1.5%
판부면 26
 
1.4%
신림면 25
 
1.3%
원문로 23
 
1.2%
흥업면 22
 
1.2%
남원로 19
 
1.0%
호저면 19
 
1.0%
Other values (811) 1127
60.1%
2023-12-12T10:55:14.766629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1357
 
15.0%
621
 
6.9%
531
 
5.9%
528
 
5.8%
1 396
 
4.4%
( 304
 
3.4%
) 303
 
3.3%
2 276
 
3.0%
275
 
3.0%
268
 
3.0%
Other values (237) 4191
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5032
55.6%
Decimal Number 1823
 
20.1%
Space Separator 1357
 
15.0%
Open Punctuation 304
 
3.4%
Close Punctuation 303
 
3.3%
Dash Punctuation 198
 
2.2%
Other Punctuation 29
 
0.3%
Uppercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
621
 
12.3%
531
 
10.6%
528
 
10.5%
275
 
5.5%
268
 
5.3%
249
 
4.9%
159
 
3.2%
107
 
2.1%
102
 
2.0%
89
 
1.8%
Other values (217) 2103
41.8%
Decimal Number
ValueCountFrequency (%)
1 396
21.7%
2 276
15.1%
3 191
10.5%
4 176
9.7%
6 155
 
8.5%
5 150
 
8.2%
0 136
 
7.5%
7 127
 
7.0%
8 121
 
6.6%
9 95
 
5.2%
Other Punctuation
ValueCountFrequency (%)
, 26
89.7%
/ 2
 
6.9%
@ 1
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
A 2
50.0%
S 1
25.0%
B 1
25.0%
Space Separator
ValueCountFrequency (%)
1357
100.0%
Open Punctuation
ValueCountFrequency (%)
( 304
100.0%
Close Punctuation
ValueCountFrequency (%)
) 303
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 198
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5032
55.6%
Common 4014
44.4%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
621
 
12.3%
531
 
10.6%
528
 
10.5%
275
 
5.5%
268
 
5.3%
249
 
4.9%
159
 
3.2%
107
 
2.1%
102
 
2.0%
89
 
1.8%
Other values (217) 2103
41.8%
Common
ValueCountFrequency (%)
1357
33.8%
1 396
 
9.9%
( 304
 
7.6%
) 303
 
7.5%
2 276
 
6.9%
- 198
 
4.9%
3 191
 
4.8%
4 176
 
4.4%
6 155
 
3.9%
5 150
 
3.7%
Other values (7) 508
 
12.7%
Latin
ValueCountFrequency (%)
A 2
50.0%
S 1
25.0%
B 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5032
55.6%
ASCII 4018
44.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1357
33.8%
1 396
 
9.9%
( 304
 
7.6%
) 303
 
7.5%
2 276
 
6.9%
- 198
 
4.9%
3 191
 
4.8%
4 176
 
4.4%
6 155
 
3.9%
5 150
 
3.7%
Other values (10) 512
 
12.7%
Hangul
ValueCountFrequency (%)
621
 
12.3%
531
 
10.6%
528
 
10.5%
275
 
5.5%
268
 
5.3%
249
 
4.9%
159
 
3.2%
107
 
2.1%
102
 
2.0%
89
 
1.8%
Other values (217) 2103
41.8%

전화번호
Text

MISSING 

Distinct435
Distinct (%)98.6%
Missing81
Missing (%)15.5%
Memory size4.2 KiB
2023-12-12T10:55:15.170759image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length12
Mean length13.945578
Min length12

Characters and Unicode

Total characters6150
Distinct characters14
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique429 ?
Unique (%)97.3%

Sample

1st row033-731-7270
2nd row033-734-3989
3rd row033-745-9438
4th row033-734-4832, 033-734-6911
5th row033-734-7579, 033-734-7010
ValueCountFrequency (%)
033-761-8528 2
 
0.4%
033-734-0691 2
 
0.4%
033-744-3859 2
 
0.4%
033-731-3284 2
 
0.4%
033-734-3989 2
 
0.4%
033-747-1525 2
 
0.4%
033-745-0061 1
 
0.2%
033-743-9038 1
 
0.2%
033-734-0791 1
 
0.2%
033-746-0251 1
 
0.2%
Other values (484) 484
96.8%
2023-12-12T10:55:15.711733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 1329
21.6%
- 1000
16.3%
0 766
12.5%
7 716
11.6%
6 430
 
7.0%
4 423
 
6.9%
1 373
 
6.1%
2 321
 
5.2%
5 255
 
4.1%
9 229
 
3.7%
Other values (4) 308
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5027
81.7%
Dash Punctuation 1000
 
16.3%
Space Separator 60
 
1.0%
Other Punctuation 59
 
1.0%
Math Symbol 4
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1329
26.4%
0 766
15.2%
7 716
14.2%
6 430
 
8.6%
4 423
 
8.4%
1 373
 
7.4%
2 321
 
6.4%
5 255
 
5.1%
9 229
 
4.6%
8 185
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 1000
100.0%
Space Separator
ValueCountFrequency (%)
60
100.0%
Other Punctuation
ValueCountFrequency (%)
, 59
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6150
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 1329
21.6%
- 1000
16.3%
0 766
12.5%
7 716
11.6%
6 430
 
7.0%
4 423
 
6.9%
1 373
 
6.1%
2 321
 
5.2%
5 255
 
4.1%
9 229
 
3.7%
Other values (4) 308
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6150
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 1329
21.6%
- 1000
16.3%
0 766
12.5%
7 716
11.6%
6 430
 
7.0%
4 423
 
6.9%
1 373
 
6.1%
2 321
 
5.2%
5 255
 
4.1%
9 229
 
3.7%
Other values (4) 308
 
5.0%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.2 KiB
2021-12-31
522 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-12-31
2nd row2021-12-31
3rd row2021-12-31
4th row2021-12-31
5th row2021-12-31

Common Values

ValueCountFrequency (%)
2021-12-31 522
100.0%

Length

2023-12-12T10:55:15.864357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:55:15.974598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-12-31 522
100.0%

Missing values

2023-12-12T10:55:12.494057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:55:12.630327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

종교구분시설명주소전화번호데이터기준일
0개신교흰돌감리교회원주시 문막읍 임동길 20(취병리)033-731-72702021-12-31
1개신교문막순복음교회(기하성)원주시 문막읍 천마산길 46-1033-734-39892021-12-31
2개신교문막제일감리교회원주시 문막읍 왕건로 96-14033-745-94382021-12-31
3개신교함께하는감리교회원주시 문막읍 왕건로 143-7<NA>2021-12-31
4개신교성산감리교회원주시 문막읍 비두네미2길 2(비두리)<NA>2021-12-31
5개신교드림장로교회(합동)원주시 문막읍 덕난이길 8033-734-4832, 033-734-69112021-12-31
6개신교문막감리교회원주시 문막읍 원문로 1821033-734-7579, 033-734-70102021-12-31
7개신교문막평화장로교회(백석)원주시 문막읍 동화택지길 9033-734-06912021-12-31
8개신교문막중앙침례교회(기침)원주시 문막읍 큰애니길 16-13033-734-06912021-12-31
9개신교문호감리교회원주시 문막읍 벌무내기길 55033-731-77092021-12-31
종교구분시설명주소전화번호데이터기준일
512불교관음사원주시 행구로 533033-747-13112021-12-31
513불교석경사원주시 석경길 83033-747-11442021-12-31
514불교원각사원주시 덕성길 80033-735-97112021-12-31
515불교보문사원주시 행구동 산 5033-747-15252021-12-31
516불교세명선원원주시 행구로 486-7033-735-16082021-12-31
517불교무량사원주시 반곡동 1213-8033-741-31622021-12-31
518불교정주사원주시 한가터길 211033-734-10012021-12-31
519원불교원불교 원주교당원주시 남산로 151(명륜동)070-7011-94062021-12-31
520천도교천도교 원주교구원주시 중앙로 58-3(원동)033-746-11962021-12-31
521향교원주향교원주시 향교길 37-1(명륜동)033-764-82202021-12-31

Duplicate rows

Most frequently occurring

종교구분시설명주소전화번호데이터기준일# duplicates
0불교약사암원주시 신림면 연봉정길 161-3033-744-38592021-12-312