Overview

Dataset statistics

Number of variables5
Number of observations250
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.1 KiB
Average record size in memory41.5 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description나주시 소재에 있는 종교 단체에 대한 데이터로 시설명, 주소, 종교구분(천주교, 기독교, 원불교, 불교, 개신교) 자료를 제공합니다.
URLhttps://www.data.go.kr/data/15117638/fileData.do

Alerts

연번 is highly overall correlated with 읍면동High correlation
읍면동 is highly overall correlated with 연번High correlation
구분 is highly imbalanced (55.3%)Imbalance
연번 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 05:41:11.284798
Analysis finished2023-12-12 05:41:12.090758
Duration0.81 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct250
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.004
Minimum1
Maximum251
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-12T14:41:12.192258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.45
Q163.25
median126.5
Q3188.75
95-th percentile238.55
Maximum251
Range250
Interquartile range (IQR)125.5

Descriptive statistics

Standard deviation72.747266
Coefficient of variation (CV)0.57734092
Kurtosis-1.2073136
Mean126.004
Median Absolute Deviation (MAD)63
Skewness-0.00016627442
Sum31501
Variance5292.1646
MonotonicityStrictly increasing
2023-12-12T14:41:12.380985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
174 1
 
0.4%
161 1
 
0.4%
162 1
 
0.4%
163 1
 
0.4%
164 1
 
0.4%
165 1
 
0.4%
166 1
 
0.4%
167 1
 
0.4%
168 1
 
0.4%
Other values (240) 240
96.0%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
251 1
0.4%
250 1
0.4%
249 1
0.4%
248 1
0.4%
247 1
0.4%
246 1
0.4%
245 1
0.4%
244 1
0.4%
243 1
0.4%
242 1
0.4%

구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
기독교
190 
불교
44 
천주교
 
14
원불교
 
1
<NA>
 
1

Length

Max length4
Median length3
Mean length2.828
Min length2

Unique

Unique2 ?
Unique (%)0.8%

Sample

1st row기독교
2nd row기독교
3rd row기독교
4th row기독교
5th row기독교

Common Values

ValueCountFrequency (%)
기독교 190
76.0%
불교 44
 
17.6%
천주교 14
 
5.6%
원불교 1
 
0.4%
<NA> 1
 
0.4%

Length

2023-12-12T14:41:12.572087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:41:12.711097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기독교 190
76.0%
불교 44
 
17.6%
천주교 14
 
5.6%
원불교 1
 
0.4%
na 1
 
0.4%

읍면동
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
빛가람동
38 
성북동
18 
금남동
17 
남평읍
16 
다시면
16 
Other values (15)
145 

Length

Max length4
Median length3
Mean length3.152
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공산면
2nd row공산면
3rd row공산면
4th row공산면
5th row공산면

Common Values

ValueCountFrequency (%)
빛가람동 38
15.2%
성북동 18
 
7.2%
금남동 17
 
6.8%
남평읍 16
 
6.4%
다시면 16
 
6.4%
영산동 13
 
5.2%
노안면 13
 
5.2%
이창동 12
 
4.8%
금천면 12
 
4.8%
산포면 12
 
4.8%
Other values (10) 83
33.2%

Length

2023-12-12T14:41:12.832602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
빛가람동 38
15.2%
성북동 18
 
7.2%
금남동 17
 
6.8%
남평읍 16
 
6.4%
다시면 16
 
6.4%
영산동 13
 
5.2%
노안면 13
 
5.2%
산포면 12
 
4.8%
봉황면 12
 
4.8%
금천면 12
 
4.8%
Other values (10) 83
33.2%
Distinct245
Distinct (%)98.4%
Missing1
Missing (%)0.4%
Memory size2.1 KiB
2023-12-12T14:41:13.151420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length5.3293173
Min length1

Characters and Unicode

Total characters1327
Distinct characters205
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique241 ?
Unique (%)96.8%

Sample

1st row공산교회
2nd row화성교회
3rd row덕음교회
4th row중포리교회
5th row상복교회
ValueCountFrequency (%)
교회 4
 
1.5%
나주교회 3
 
1.1%
태고종 2
 
0.7%
빛가람중앙교회 2
 
0.7%
대송교회 2
 
0.7%
남평성당 2
 
0.7%
나주 2
 
0.7%
천주교 2
 
0.7%
사랑교회 1
 
0.4%
등수중앙교회 1
 
0.4%
Other values (249) 249
92.2%
2023-12-12T14:41:13.676385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
199
 
15.0%
197
 
14.8%
40
 
3.0%
40
 
3.0%
31
 
2.3%
29
 
2.2%
28
 
2.1%
21
 
1.6%
20
 
1.5%
18
 
1.4%
Other values (195) 704
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1297
97.7%
Space Separator 21
 
1.6%
Open Punctuation 4
 
0.3%
Close Punctuation 4
 
0.3%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
199
 
15.3%
197
 
15.2%
40
 
3.1%
40
 
3.1%
31
 
2.4%
29
 
2.2%
28
 
2.2%
20
 
1.5%
18
 
1.4%
18
 
1.4%
Other values (191) 677
52.2%
Space Separator
ValueCountFrequency (%)
21
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1297
97.7%
Common 30
 
2.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
199
 
15.3%
197
 
15.2%
40
 
3.1%
40
 
3.1%
31
 
2.4%
29
 
2.2%
28
 
2.2%
20
 
1.5%
18
 
1.4%
18
 
1.4%
Other values (191) 677
52.2%
Common
ValueCountFrequency (%)
21
70.0%
( 4
 
13.3%
) 4
 
13.3%
- 1
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1297
97.7%
ASCII 30
 
2.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
199
 
15.3%
197
 
15.2%
40
 
3.1%
40
 
3.1%
31
 
2.4%
29
 
2.2%
28
 
2.2%
20
 
1.5%
18
 
1.4%
18
 
1.4%
Other values (191) 677
52.2%
ASCII
ValueCountFrequency (%)
21
70.0%
( 4
 
13.3%
) 4
 
13.3%
- 1
 
3.3%

주소
Text

UNIQUE 

Distinct250
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T14:41:13.980922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length23
Mean length11.876
Min length3

Characters and Unicode

Total characters2969
Distinct characters186
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique250 ?
Unique (%)100.0%

Sample

1st row공산면 덕음로 25-1
2nd row공산면 성남길 70
3rd row공산면 덕음로 508-7
4th row공산면 수양길 77
5th row공산면 흥복길 14-128
ValueCountFrequency (%)
나주시 82
 
11.5%
다시면 16
 
2.2%
봉황면 12
 
1.7%
금천면 12
 
1.7%
노안면 11
 
1.5%
이창동 8
 
1.1%
세지면 8
 
1.1%
2층 7
 
1.0%
그린로 7
 
1.0%
공산면 7
 
1.0%
Other values (431) 544
76.2%
2023-12-12T14:41:14.470512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
466
 
15.7%
1 177
 
6.0%
- 126
 
4.2%
2 122
 
4.1%
3 117
 
3.9%
106
 
3.6%
104
 
3.5%
99
 
3.3%
93
 
3.1%
93
 
3.1%
Other values (176) 1466
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1451
48.9%
Decimal Number 885
29.8%
Space Separator 466
 
15.7%
Dash Punctuation 126
 
4.2%
Other Punctuation 24
 
0.8%
Uppercase Letter 7
 
0.2%
Close Punctuation 5
 
0.2%
Open Punctuation 5
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
 
7.3%
104
 
7.2%
99
 
6.8%
93
 
6.4%
93
 
6.4%
85
 
5.9%
58
 
4.0%
40
 
2.8%
31
 
2.1%
23
 
1.6%
Other values (156) 719
49.6%
Decimal Number
ValueCountFrequency (%)
1 177
20.0%
2 122
13.8%
3 117
13.2%
5 87
9.8%
4 83
9.4%
7 69
 
7.8%
0 66
 
7.5%
8 64
 
7.2%
6 63
 
7.1%
9 37
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
A 3
42.9%
B 2
28.6%
C 1
 
14.3%
S 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
, 23
95.8%
* 1
 
4.2%
Space Separator
ValueCountFrequency (%)
466
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 126
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1511
50.9%
Hangul 1451
48.9%
Latin 7
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
 
7.3%
104
 
7.2%
99
 
6.8%
93
 
6.4%
93
 
6.4%
85
 
5.9%
58
 
4.0%
40
 
2.8%
31
 
2.1%
23
 
1.6%
Other values (156) 719
49.6%
Common
ValueCountFrequency (%)
466
30.8%
1 177
 
11.7%
- 126
 
8.3%
2 122
 
8.1%
3 117
 
7.7%
5 87
 
5.8%
4 83
 
5.5%
7 69
 
4.6%
0 66
 
4.4%
8 64
 
4.2%
Other values (6) 134
 
8.9%
Latin
ValueCountFrequency (%)
A 3
42.9%
B 2
28.6%
C 1
 
14.3%
S 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1518
51.1%
Hangul 1451
48.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
466
30.7%
1 177
 
11.7%
- 126
 
8.3%
2 122
 
8.0%
3 117
 
7.7%
5 87
 
5.7%
4 83
 
5.5%
7 69
 
4.5%
0 66
 
4.3%
8 64
 
4.2%
Other values (10) 141
 
9.3%
Hangul
ValueCountFrequency (%)
106
 
7.3%
104
 
7.2%
99
 
6.8%
93
 
6.4%
93
 
6.4%
85
 
5.9%
58
 
4.0%
40
 
2.8%
31
 
2.1%
23
 
1.6%
Other values (156) 719
49.6%

Interactions

2023-12-12T14:41:11.707130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:41:14.576241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분읍면동
연번1.0000.2740.998
구분0.2741.0000.228
읍면동0.9980.2281.000
2023-12-12T14:41:14.680415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
읍면동구분
읍면동1.0000.104
구분0.1041.000
2023-12-12T14:41:14.772642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분읍면동
연번1.0000.1520.889
구분0.1521.0000.104
읍면동0.8890.1041.000

Missing values

2023-12-12T14:41:11.903952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:41:12.031109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구분읍면동단체명주소
01기독교공산면공산교회공산면 덕음로 25-1
12기독교공산면화성교회공산면 성남길 70
23기독교공산면덕음교회공산면 덕음로 508-7
34기독교공산면중포리교회공산면 수양길 77
45기독교공산면상복교회공산면 흥복길 14-128
56기독교공산면남도중앙교회공산면 가송리 299
67기독교공산면공산목양교회공산면 가송로 125
78기독교금남동제칠일안식일교회금계동 47
89기독교금남동나주중부교회금계동 103-3
910기독교금남동한국예수고 새예루살렘교회금계동 34-20
연번구분읍면동단체명주소
240242기독교이창동한소망교회이창동 747-4
241243기독교이창동소명교회이창동 473-3
242244기독교이창동나주교회이창동 712-8
243245기독교이창동여호와의증인왕국회관이창동 601
244246기독교이창동가야제일교회동수동 48-148
245247불교이창동운암사운곡동 150-46
246248불교이창동영천사가야길 177
247249불교이창동존제산 일월사이창동 175-1
248250기독교이창동순복음꽃동산교회대기동 190
249251기독교이창동찬송교회이창동 74-3