Overview

Dataset statistics

Number of variables5
Number of observations252
Missing cells64
Missing cells (%)5.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.2 KiB
Average record size in memory41.5 B

Variable types

Numeric1
Categorical1
Text3

Dataset

Description부산광역시동래구_종교단체현황_20220913
Author부산광역시 동래구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15062399

Alerts

연번 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 연번High correlation
전화번호 has 64 (25.4%) missing valuesMissing
연번 has unique valuesUnique
소재지 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:46:23.520815
Analysis finished2023-12-10 16:46:24.314257
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct252
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.5
Minimum1
Maximum252
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-11T01:46:24.468225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.55
Q163.75
median126.5
Q3189.25
95-th percentile239.45
Maximum252
Range251
Interquartile range (IQR)125.5

Descriptive statistics

Standard deviation72.890329
Coefficient of variation (CV)0.57620813
Kurtosis-1.2
Mean126.5
Median Absolute Deviation (MAD)63
Skewness0
Sum31878
Variance5313
MonotonicityStrictly increasing
2023-12-11T01:46:24.763892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
175 1
 
0.4%
162 1
 
0.4%
163 1
 
0.4%
164 1
 
0.4%
165 1
 
0.4%
166 1
 
0.4%
167 1
 
0.4%
168 1
 
0.4%
169 1
 
0.4%
Other values (242) 242
96.0%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
252 1
0.4%
251 1
0.4%
250 1
0.4%
249 1
0.4%
248 1
0.4%
247 1
0.4%
246 1
0.4%
245 1
0.4%
244 1
0.4%
243 1
0.4%

구분
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
교회
164 
사찰
76 
기도원
 
5
성당
 
4
대순진리회
 
3

Length

Max length5
Median length2
Mean length2.0555556
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교회
2nd row교회
3rd row교회
4th row교회
5th row교회

Common Values

ValueCountFrequency (%)
교회 164
65.1%
사찰 76
30.2%
기도원 5
 
2.0%
성당 4
 
1.6%
대순진리회 3
 
1.2%

Length

2023-12-11T01:46:24.981383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:46:25.168610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
교회 164
65.1%
사찰 76
30.2%
기도원 5
 
2.0%
성당 4
 
1.6%
대순진리회 3
 
1.2%
Distinct239
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-11T01:46:25.486059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length5.484127
Min length3

Characters and Unicode

Total characters1382
Distinct characters228
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique227 ?
Unique (%)90.1%

Sample

1st row동래중앙교회
2nd row예람농아교회
3rd row복천교회
4th row신부산제일교회
5th row동래제일교회
ValueCountFrequency (%)
교회 8
 
2.9%
소림사 3
 
1.1%
원불교 3
 
1.1%
대순진리회 3
 
1.1%
연화사 2
 
0.7%
동래교회 2
 
0.7%
여호와의증인의 2
 
0.7%
영광교회 2
 
0.7%
아름다운 2
 
0.7%
안락교회 2
 
0.7%
Other values (241) 250
89.6%
2023-12-11T01:46:26.013820image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
173
 
12.5%
171
 
12.4%
78
 
5.6%
29
 
2.1%
29
 
2.1%
28
 
2.0%
27
 
2.0%
24
 
1.7%
20
 
1.4%
19
 
1.4%
Other values (218) 784
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1322
95.7%
Space Separator 27
 
2.0%
Open Punctuation 15
 
1.1%
Close Punctuation 15
 
1.1%
Uppercase Letter 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
173
 
13.1%
171
 
12.9%
78
 
5.9%
29
 
2.2%
29
 
2.2%
28
 
2.1%
24
 
1.8%
20
 
1.5%
19
 
1.4%
16
 
1.2%
Other values (212) 735
55.6%
Uppercase Letter
ValueCountFrequency (%)
I 1
33.3%
S 1
33.3%
G 1
33.3%
Space Separator
ValueCountFrequency (%)
27
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1322
95.7%
Common 57
 
4.1%
Latin 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
173
 
13.1%
171
 
12.9%
78
 
5.9%
29
 
2.2%
29
 
2.2%
28
 
2.1%
24
 
1.8%
20
 
1.5%
19
 
1.4%
16
 
1.2%
Other values (212) 735
55.6%
Common
ValueCountFrequency (%)
27
47.4%
( 15
26.3%
) 15
26.3%
Latin
ValueCountFrequency (%)
I 1
33.3%
S 1
33.3%
G 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1322
95.7%
ASCII 60
 
4.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
173
 
13.1%
171
 
12.9%
78
 
5.9%
29
 
2.2%
29
 
2.2%
28
 
2.1%
24
 
1.8%
20
 
1.5%
19
 
1.4%
16
 
1.2%
Other values (212) 735
55.6%
ASCII
ValueCountFrequency (%)
27
45.0%
( 15
25.0%
) 15
25.0%
I 1
 
1.7%
S 1
 
1.7%
G 1
 
1.7%

소재지
Text

UNIQUE 

Distinct252
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-11T01:46:26.442066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length28
Mean length20.781746
Min length15

Characters and Unicode

Total characters5237
Distinct characters91
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique252 ?
Unique (%)100.0%

Sample

1st row부산광역시 동래구 충렬대로202번가길 24
2nd row부산광역시 동래구 명륜로75번길 11
3rd row부산광역시 동래구 충렬대로272번길 8
4th row부산광역시 동래구 수안로 17
5th row부산광역시 동래구 충렬대로322번길19-12
ValueCountFrequency (%)
부산광역시 253
24.6%
동래구 253
24.6%
2층 12
 
1.2%
충렬대로 8
 
0.8%
명륜로 8
 
0.8%
11 7
 
0.7%
쇠미로129번길 7
 
0.7%
쇠미로 6
 
0.6%
6 6
 
0.6%
아시아드대로 6
 
0.6%
Other values (329) 463
45.0%
2023-12-11T01:46:27.011385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
790
 
15.1%
284
 
5.4%
268
 
5.1%
263
 
5.0%
255
 
4.9%
254
 
4.9%
253
 
4.8%
253
 
4.8%
253
 
4.8%
246
 
4.7%
Other values (81) 2118
40.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3304
63.1%
Decimal Number 1048
 
20.0%
Space Separator 790
 
15.1%
Dash Punctuation 58
 
1.1%
Other Punctuation 27
 
0.5%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
284
 
8.6%
268
 
8.1%
263
 
8.0%
255
 
7.7%
254
 
7.7%
253
 
7.7%
253
 
7.7%
253
 
7.7%
246
 
7.4%
152
 
4.6%
Other values (63) 823
24.9%
Decimal Number
ValueCountFrequency (%)
1 230
21.9%
2 174
16.6%
3 123
11.7%
5 94
9.0%
7 85
 
8.1%
6 84
 
8.0%
9 69
 
6.6%
8 67
 
6.4%
0 66
 
6.3%
4 56
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 26
96.3%
/ 1
 
3.7%
Uppercase Letter
ValueCountFrequency (%)
K 1
50.0%
S 1
50.0%
Space Separator
ValueCountFrequency (%)
790
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 58
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3304
63.1%
Common 1931
36.9%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
284
 
8.6%
268
 
8.1%
263
 
8.0%
255
 
7.7%
254
 
7.7%
253
 
7.7%
253
 
7.7%
253
 
7.7%
246
 
7.4%
152
 
4.6%
Other values (63) 823
24.9%
Common
ValueCountFrequency (%)
790
40.9%
1 230
 
11.9%
2 174
 
9.0%
3 123
 
6.4%
5 94
 
4.9%
7 85
 
4.4%
6 84
 
4.4%
9 69
 
3.6%
8 67
 
3.5%
0 66
 
3.4%
Other values (6) 149
 
7.7%
Latin
ValueCountFrequency (%)
K 1
50.0%
S 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3304
63.1%
ASCII 1933
36.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
790
40.9%
1 230
 
11.9%
2 174
 
9.0%
3 123
 
6.4%
5 94
 
4.9%
7 85
 
4.4%
6 84
 
4.3%
9 69
 
3.6%
8 67
 
3.5%
0 66
 
3.4%
Other values (8) 151
 
7.8%
Hangul
ValueCountFrequency (%)
284
 
8.6%
268
 
8.1%
263
 
8.0%
255
 
7.7%
254
 
7.7%
253
 
7.7%
253
 
7.7%
253
 
7.7%
246
 
7.4%
152
 
4.6%
Other values (63) 823
24.9%

전화번호
Text

MISSING 

Distinct188
Distinct (%)100.0%
Missing64
Missing (%)25.4%
Memory size2.1 KiB
2023-12-11T01:46:27.400341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.989362
Min length8

Characters and Unicode

Total characters2254
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique188 ?
Unique (%)100.0%

Sample

1st row051-558-1191
2nd row051-554-3737
3rd row051-556-1009
4th row051-556-6302
5th row051-554-7714
ValueCountFrequency (%)
051-554-9011 1
 
0.5%
051-558-4352 1
 
0.5%
051-555-5529 1
 
0.5%
051-532-9191 1
 
0.5%
051-531-9191 1
 
0.5%
051-555-2295 1
 
0.5%
051-506-0213 1
 
0.5%
051-501-5986 1
 
0.5%
051-526-4210 1
 
0.5%
051-528-4177 1
 
0.5%
Other values (178) 178
94.7%
2023-12-11T01:46:28.472076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 517
22.9%
- 375
16.6%
0 353
15.7%
1 334
14.8%
2 152
 
6.7%
3 107
 
4.7%
4 99
 
4.4%
9 93
 
4.1%
7 83
 
3.7%
6 76
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1879
83.4%
Dash Punctuation 375
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 517
27.5%
0 353
18.8%
1 334
17.8%
2 152
 
8.1%
3 107
 
5.7%
4 99
 
5.3%
9 93
 
4.9%
7 83
 
4.4%
6 76
 
4.0%
8 65
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 375
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2254
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 517
22.9%
- 375
16.6%
0 353
15.7%
1 334
14.8%
2 152
 
6.7%
3 107
 
4.7%
4 99
 
4.4%
9 93
 
4.1%
7 83
 
3.7%
6 76
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2254
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 517
22.9%
- 375
16.6%
0 353
15.7%
1 334
14.8%
2 152
 
6.7%
3 107
 
4.7%
4 99
 
4.4%
9 93
 
4.1%
7 83
 
3.7%
6 76
 
3.4%

Interactions

2023-12-11T01:46:23.877508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:46:28.607296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분
연번1.0000.871
구분0.8711.000
2023-12-11T01:46:28.723027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번구분
연번1.0000.539
구분0.5391.000

Missing values

2023-12-11T01:46:24.075328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:46:24.241088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번구분종교단체명소재지전화번호
01교회동래중앙교회부산광역시 동래구 충렬대로202번가길 24051-558-1191
12교회예람농아교회부산광역시 동래구 명륜로75번길 11051-554-3737
23교회복천교회부산광역시 동래구 충렬대로272번길 8051-556-1009
34교회신부산제일교회부산광역시 동래구 수안로 17<NA>
45교회동래제일교회부산광역시 동래구 충렬대로322번길19-12051-556-6302
56교회낙민교회부산광역시 동래구 온천천로319번나길 12051-554-7714
67교회바른길교회부산광역시 동래구 충렬대로202번길 14<NA>
78교회수안교회부산광역시 동래구 충렬대로237번길 57051-555-4017
89교회반석교회부산광역시 동래구 수안로8번길 19-10<NA>
910교회천리교 동래교회부산광역시 동래구 명륜로112번길 60/충렬대로237번가길 7051-555-2476
연번구분종교단체명소재지전화번호
242243사찰지장 본원사부산광역시 동래구 시실로107번길27051-524-5240
243244사찰정토사부산광역시 동래구 명서로110번길 11051-755-0831
244245대순진리회대순진리회 부산회관부산광역시 동래구 여고로 51<NA>
245246대순진리회대순진리회 부전회관부산광역시 동래구 충렬대로 122051-501-7116
246247대순진리회대순진리회 부전회관부산광역시 동래구 아시아드대로 129<NA>
247248기도원새소망기도원부산광역시 동래구 금정마을로 63<NA>
248249기도원겟세네마기도원부산광역시 동래구 아시아드대로231번길 30<NA>
249250기도원큰기쁨기도원부산광역시 동래구 충렬대로140번길 56<NA>
250251기도원임마누엘기도원부산광역시 동래구 안연로98번길 6<NA>
251252기도원소망기도원부산광역시 동래구 명안로26번길 4, 2층<NA>