Overview

Dataset statistics

Number of variables5
Number of observations145
Missing cells19
Missing cells (%)2.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.9 KiB
Average record size in memory41.9 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description전라남도 장성군 관내에 설치된 종교시설 정보입니다. 데이터 세부 항목은 종교시설명, 도로명주소, 유선전화번호, 종교구분(천주교, 원불교, 불교, 개신교)으로 구성되어 있습니다.
URLhttps://www.data.go.kr/data/15117729/fileData.do

Alerts

전화번호 has 19 (13.1%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:49:35.825073
Analysis finished2023-12-12 09:49:36.437257
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct145
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean73
Minimum1
Maximum145
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-12T18:49:36.552989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.2
Q137
median73
Q3109
95-th percentile137.8
Maximum145
Range144
Interquartile range (IQR)72

Descriptive statistics

Standard deviation42.001984
Coefficient of variation (CV)0.57536964
Kurtosis-1.2
Mean73
Median Absolute Deviation (MAD)36
Skewness0
Sum10585
Variance1764.1667
MonotonicityStrictly increasing
2023-12-12T18:49:36.717285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
110 1
 
0.7%
94 1
 
0.7%
95 1
 
0.7%
96 1
 
0.7%
97 1
 
0.7%
98 1
 
0.7%
99 1
 
0.7%
100 1
 
0.7%
101 1
 
0.7%
Other values (135) 135
93.1%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
145 1
0.7%
144 1
0.7%
143 1
0.7%
142 1
0.7%
141 1
0.7%
140 1
0.7%
139 1
0.7%
138 1
0.7%
137 1
0.7%
136 1
0.7%
Distinct143
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T18:49:37.047098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length12
Mean length4.7517241
Min length3

Characters and Unicode

Total characters689
Distinct characters160
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)97.2%

Sample

1st row성산교회
2nd row순복음교회
3rd row남부교회
4th row장성예수중심교회
5th row장성초대장로교회
ValueCountFrequency (%)
중앙교회 2
 
1.3%
국제도덕협회 2
 
1.3%
평강교회 2
 
1.3%
장성교회 2
 
1.3%
천주교 2
 
1.3%
열린문교회 1
 
0.6%
용천사 1
 
0.6%
라파엘쉼터교회 1
 
0.6%
용화정사 1
 
0.6%
신정교회 1
 
0.6%
Other values (139) 139
90.3%
2023-12-12T18:49:37.508214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
100
 
14.5%
96
 
13.9%
38
 
5.5%
29
 
4.2%
19
 
2.8%
14
 
2.0%
12
 
1.7%
10
 
1.5%
9
 
1.3%
9
 
1.3%
Other values (150) 353
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 675
98.0%
Space Separator 9
 
1.3%
Close Punctuation 2
 
0.3%
Open Punctuation 2
 
0.3%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
100
 
14.8%
96
 
14.2%
38
 
5.6%
29
 
4.3%
19
 
2.8%
14
 
2.1%
12
 
1.8%
10
 
1.5%
9
 
1.3%
9
 
1.3%
Other values (146) 339
50.2%
Space Separator
ValueCountFrequency (%)
9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Decimal Number
ValueCountFrequency (%)
7 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 675
98.0%
Common 14
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
100
 
14.8%
96
 
14.2%
38
 
5.6%
29
 
4.3%
19
 
2.8%
14
 
2.1%
12
 
1.8%
10
 
1.5%
9
 
1.3%
9
 
1.3%
Other values (146) 339
50.2%
Common
ValueCountFrequency (%)
9
64.3%
) 2
 
14.3%
( 2
 
14.3%
7 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 675
98.0%
ASCII 14
 
2.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
100
 
14.8%
96
 
14.2%
38
 
5.6%
29
 
4.3%
19
 
2.8%
14
 
2.1%
12
 
1.8%
10
 
1.5%
9
 
1.3%
9
 
1.3%
Other values (146) 339
50.2%
ASCII
ValueCountFrequency (%)
9
64.3%
) 2
 
14.3%
( 2
 
14.3%
7 1
 
7.1%
Distinct143
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-12T18:49:37.983356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length20.737931
Min length18

Characters and Unicode

Total characters3007
Distinct characters134
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)97.2%

Sample

1st row전라남도 장성군 장성읍 성산1길 45
2nd row전라남도 장성군 장성읍 충무4길 14-3
3rd row전라남도 장성군 장성읍 청운2길 15-1
4th row전라남도 장성군 장성읍 영천로 178
5th row전라남도 장성군 장성읍 매화1길30
ValueCountFrequency (%)
전라남도 145
20.1%
장성군 145
20.1%
장성읍 36
 
5.0%
북하면 19
 
2.6%
황룡면 15
 
2.1%
북이면 15
 
2.1%
삼계면 13
 
1.8%
진원면 9
 
1.2%
동화면 9
 
1.2%
남면 9
 
1.2%
Other values (243) 308
42.6%
2023-12-12T18:49:38.665694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
578
19.2%
188
 
6.3%
186
 
6.2%
160
 
5.3%
147
 
4.9%
145
 
4.8%
145
 
4.8%
145
 
4.8%
109
 
3.6%
1 101
 
3.4%
Other values (124) 1103
36.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1908
63.5%
Space Separator 578
 
19.2%
Decimal Number 463
 
15.4%
Dash Punctuation 58
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
188
 
9.9%
186
 
9.7%
160
 
8.4%
147
 
7.7%
145
 
7.6%
145
 
7.6%
145
 
7.6%
109
 
5.7%
76
 
4.0%
65
 
3.4%
Other values (112) 542
28.4%
Decimal Number
ValueCountFrequency (%)
1 101
21.8%
3 63
13.6%
2 60
13.0%
4 46
9.9%
6 42
9.1%
7 36
 
7.8%
5 36
 
7.8%
0 33
 
7.1%
8 25
 
5.4%
9 21
 
4.5%
Space Separator
ValueCountFrequency (%)
578
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1908
63.5%
Common 1099
36.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
188
 
9.9%
186
 
9.7%
160
 
8.4%
147
 
7.7%
145
 
7.6%
145
 
7.6%
145
 
7.6%
109
 
5.7%
76
 
4.0%
65
 
3.4%
Other values (112) 542
28.4%
Common
ValueCountFrequency (%)
578
52.6%
1 101
 
9.2%
3 63
 
5.7%
2 60
 
5.5%
- 58
 
5.3%
4 46
 
4.2%
6 42
 
3.8%
7 36
 
3.3%
5 36
 
3.3%
0 33
 
3.0%
Other values (2) 46
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1908
63.5%
ASCII 1099
36.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
578
52.6%
1 101
 
9.2%
3 63
 
5.7%
2 60
 
5.5%
- 58
 
5.3%
4 46
 
4.2%
6 42
 
3.8%
7 36
 
3.3%
5 36
 
3.3%
0 33
 
3.0%
Other values (2) 46
 
4.2%
Hangul
ValueCountFrequency (%)
188
 
9.9%
186
 
9.7%
160
 
8.4%
147
 
7.7%
145
 
7.6%
145
 
7.6%
145
 
7.6%
109
 
5.7%
76
 
4.0%
65
 
3.4%
Other values (112) 542
28.4%

전화번호
Text

MISSING 

Distinct125
Distinct (%)99.2%
Missing19
Missing (%)13.1%
Memory size1.3 KiB
2023-12-12T18:49:38.985603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length12
Mean length12.222222
Min length12

Characters and Unicode

Total characters1540
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique124 ?
Unique (%)98.4%

Sample

1st row061-393-2771
2nd row061-392-4766
3rd row061-392-1913
4th row061-393-3996
5th row061-393-0463
ValueCountFrequency (%)
061-393-6082 2
 
1.6%
061-393-5478 1
 
0.8%
061-393-6788 1
 
0.8%
061-392-9009 1
 
0.8%
061-392-8051 1
 
0.8%
061-393-4555 1
 
0.8%
061-393-0793 1
 
0.8%
061-393-0884 1
 
0.8%
061-393-0655 1
 
0.8%
061-393-5135 1
 
0.8%
Other values (117) 117
91.4%
2023-12-12T18:49:39.433362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 256
16.6%
3 217
14.1%
0 193
12.5%
1 189
12.3%
9 181
11.8%
6 177
11.5%
2 95
 
6.2%
4 72
 
4.7%
7 60
 
3.9%
5 53
 
3.4%
Other values (3) 47
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1280
83.1%
Dash Punctuation 256
 
16.6%
Other Punctuation 2
 
0.1%
Space Separator 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 217
17.0%
0 193
15.1%
1 189
14.8%
9 181
14.1%
6 177
13.8%
2 95
7.4%
4 72
 
5.6%
7 60
 
4.7%
5 53
 
4.1%
8 43
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 256
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1540
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 256
16.6%
3 217
14.1%
0 193
12.5%
1 189
12.3%
9 181
11.8%
6 177
11.5%
2 95
 
6.2%
4 72
 
4.7%
7 60
 
3.9%
5 53
 
3.4%
Other values (3) 47
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1540
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 256
16.6%
3 217
14.1%
0 193
12.5%
1 189
12.3%
9 181
11.8%
6 177
11.5%
2 95
 
6.2%
4 72
 
4.7%
7 60
 
3.9%
5 53
 
3.4%
Other values (3) 47
 
3.1%

종교구분
Categorical

Distinct4
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
개신교
93 
불교
42 
천주교
 
8
원불교
 
2

Length

Max length3
Median length3
Mean length2.7103448
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개신교
2nd row개신교
3rd row개신교
4th row개신교
5th row개신교

Common Values

ValueCountFrequency (%)
개신교 93
64.1%
불교 42
29.0%
천주교 8
 
5.5%
원불교 2
 
1.4%

Length

2023-12-12T18:49:39.569649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:49:39.668527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개신교 93
64.1%
불교 42
29.0%
천주교 8
 
5.5%
원불교 2
 
1.4%

Interactions

2023-12-12T18:49:36.103526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:49:39.736209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번종교구분
연번1.0000.318
종교구분0.3181.000
2023-12-12T18:49:39.819628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번종교구분
연번1.0000.192
종교구분0.1921.000

Missing values

2023-12-12T18:49:36.256267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:49:36.370374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설명도로명주소전화번호종교구분
01성산교회전라남도 장성군 장성읍 성산1길 45061-393-2771개신교
12순복음교회전라남도 장성군 장성읍 충무4길 14-3061-392-4766개신교
23남부교회전라남도 장성군 장성읍 청운2길 15-1061-392-1913개신교
34장성예수중심교회전라남도 장성군 장성읍 영천로 178<NA>개신교
45장성초대장로교회전라남도 장성군 장성읍 매화1길30<NA>개신교
56장성충성교회전라남도 장성군 장성읍 봉암로 88<NA>개신교
67성글라라 수도원전라남도 장성군 장성읍 상오길 63-100061-393-3996천주교
78관음사전라남도 장성군 장성읍 백계길31-32061-393-0463불교
89정불사전라남도 장성군 장성읍 상오3리 465061-394-8032불교
910수정사전라남도 장성군 장성읍 부흥신흥길 117-16061-393-8527불교
연번시설명도로명주소전화번호종교구분
135136신성제일교회전라남도 장성군 북하면 남창로 12061-393-7884개신교
136137백양사전라남도 장성군 북하면 백양로 1239061-392-0281불교
137138천진암전라남도 장성군 북하면 백양로 1239-1061-392-0533불교
138139홍연암전라남도 장성군 북하면 가인길 59-40061-392-7737불교
139140약사암전라남도 장성군 북하면 백양로 1239-3061-392-7791불교
140141운문암전라남도 장성군 북하면 백양로 1239061-392-7706불교
141142청류암전라남도 장성군 북하면 가인길 157061-392-7506불교
142143청량원전라남도 장성군 북하면 백양로 1239-2061-393-2732불교
143144무량선원전라남도 장성군 북하면 단전리 615-1061-394-7121불교
144145금계사전라남도 장성군 북하면 병풍로 1047061-394-1524불교