Overview

Dataset statistics

Number of variables4
Number of observations253
Missing cells241
Missing cells (%)23.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.0 KiB
Average record size in memory32.5 B

Variable types

Text3
DateTime1

Dataset

Description부산광역시_사하구_종교시설현황_20230126
Author부산광역시 사하구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15042775

Alerts

데이터기준일자 has constant value ""Constant
연락처 has 241 (95.3%) missing valuesMissing
소재지 has unique valuesUnique

Reproduction

Analysis started2023-12-10 15:59:34.412686
Analysis finished2023-12-10 15:59:35.496052
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

이름
Text

Distinct232
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-11T00:59:35.861697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length4.8774704
Min length3

Characters and Unicode

Total characters1234
Distinct characters197
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique212 ?
Unique (%)83.8%

Sample

1st row부산서부교회
2nd row시온성교회
3rd row서진교회
4th row감천제일교회
5th row감천중앙교회
ValueCountFrequency (%)
새에덴교회 3
 
1.2%
은혜로교회 2
 
0.8%
충만한교회 2
 
0.8%
행복한교회 2
 
0.8%
괴정교회 2
 
0.8%
보광사 2
 
0.8%
동산교회 2
 
0.8%
주사랑교회 2
 
0.8%
열린문교회 2
 
0.8%
은혜교회 2
 
0.8%
Other values (225) 235
91.8%
2023-12-11T00:59:36.544638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
201
 
16.3%
195
 
15.8%
49
 
4.0%
25
 
2.0%
22
 
1.8%
22
 
1.8%
20
 
1.6%
18
 
1.5%
16
 
1.3%
15
 
1.2%
Other values (187) 651
52.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1222
99.0%
Space Separator 4
 
0.3%
Uppercase Letter 3
 
0.2%
Close Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
201
 
16.4%
195
 
16.0%
49
 
4.0%
25
 
2.0%
22
 
1.8%
22
 
1.8%
20
 
1.6%
18
 
1.5%
16
 
1.3%
15
 
1.2%
Other values (180) 639
52.3%
Uppercase Letter
ValueCountFrequency (%)
G 1
33.3%
I 1
33.3%
S 1
33.3%
Space Separator
ValueCountFrequency (%)
4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1222
99.0%
Common 9
 
0.7%
Latin 3
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
201
 
16.4%
195
 
16.0%
49
 
4.0%
25
 
2.0%
22
 
1.8%
22
 
1.8%
20
 
1.6%
18
 
1.5%
16
 
1.3%
15
 
1.2%
Other values (180) 639
52.3%
Common
ValueCountFrequency (%)
4
44.4%
) 2
22.2%
( 2
22.2%
2 1
 
11.1%
Latin
ValueCountFrequency (%)
G 1
33.3%
I 1
33.3%
S 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1222
99.0%
ASCII 12
 
1.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
201
 
16.4%
195
 
16.0%
49
 
4.0%
25
 
2.0%
22
 
1.8%
22
 
1.8%
20
 
1.6%
18
 
1.5%
16
 
1.3%
15
 
1.2%
Other values (180) 639
52.3%
ASCII
ValueCountFrequency (%)
4
33.3%
) 2
16.7%
( 2
16.7%
G 1
 
8.3%
I 1
 
8.3%
S 1
 
8.3%
2 1
 
8.3%

소재지
Text

UNIQUE 

Distinct253
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-11T00:59:36.972461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length26
Mean length18.241107
Min length10

Characters and Unicode

Total characters4615
Distinct characters122
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique253 ?
Unique (%)100.0%

Sample

1st row사하구 장평로 111-6 (장림동)
2nd row사하구 두송로 57 상가동 (장림동)
3rd row사하구 다대로277번길 36 2층 (장림동)
4th row사하구 옥천로75번길 14(감천동)
5th row사하구 감천로117번길 9(감천동)
ValueCountFrequency (%)
사하구 253
30.0%
다대로 24
 
2.8%
2층 10
 
1.2%
하신중앙로 10
 
1.2%
윤공단로 9
 
1.1%
상가 9
 
1.1%
장림동 7
 
0.8%
장평로 7
 
0.8%
낙동대로 7
 
0.8%
윤공단로14번길 5
 
0.6%
Other values (409) 503
59.6%
2023-12-11T00:59:37.586464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
591
 
12.8%
316
 
6.8%
272
 
5.9%
258
 
5.6%
246
 
5.3%
1 213
 
4.6%
184
 
4.0%
( 161
 
3.5%
) 160
 
3.5%
149
 
3.2%
Other values (112) 2065
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2571
55.7%
Decimal Number 1053
22.8%
Space Separator 591
 
12.8%
Open Punctuation 161
 
3.5%
Close Punctuation 160
 
3.5%
Dash Punctuation 42
 
0.9%
Other Punctuation 35
 
0.8%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
316
12.3%
272
 
10.6%
258
 
10.0%
246
 
9.6%
184
 
7.2%
149
 
5.8%
141
 
5.5%
109
 
4.2%
84
 
3.3%
55
 
2.1%
Other values (95) 757
29.4%
Decimal Number
ValueCountFrequency (%)
1 213
20.2%
2 138
13.1%
3 133
12.6%
4 110
10.4%
5 98
9.3%
0 81
 
7.7%
7 80
 
7.6%
6 79
 
7.5%
9 66
 
6.3%
8 55
 
5.2%
Other Punctuation
ValueCountFrequency (%)
, 34
97.1%
/ 1
 
2.9%
Space Separator
ValueCountFrequency (%)
591
100.0%
Open Punctuation
ValueCountFrequency (%)
( 161
100.0%
Close Punctuation
ValueCountFrequency (%)
) 160
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2571
55.7%
Common 2042
44.2%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
316
12.3%
272
 
10.6%
258
 
10.0%
246
 
9.6%
184
 
7.2%
149
 
5.8%
141
 
5.5%
109
 
4.2%
84
 
3.3%
55
 
2.1%
Other values (95) 757
29.4%
Common
ValueCountFrequency (%)
591
28.9%
1 213
 
10.4%
( 161
 
7.9%
) 160
 
7.8%
2 138
 
6.8%
3 133
 
6.5%
4 110
 
5.4%
5 98
 
4.8%
0 81
 
4.0%
7 80
 
3.9%
Other values (6) 277
13.6%
Latin
ValueCountFrequency (%)
A 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2571
55.7%
ASCII 2044
44.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
591
28.9%
1 213
 
10.4%
( 161
 
7.9%
) 160
 
7.8%
2 138
 
6.8%
3 133
 
6.5%
4 110
 
5.4%
5 98
 
4.8%
0 81
 
4.0%
7 80
 
3.9%
Other values (7) 279
13.6%
Hangul
ValueCountFrequency (%)
316
12.3%
272
 
10.6%
258
 
10.0%
246
 
9.6%
184
 
7.2%
149
 
5.8%
141
 
5.5%
109
 
4.2%
84
 
3.3%
55
 
2.1%
Other values (95) 757
29.4%

연락처
Text

MISSING 

Distinct12
Distinct (%)100.0%
Missing241
Missing (%)95.3%
Memory size2.1 KiB
2023-12-11T00:59:37.808669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters144
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)100.0%

Sample

1st row051-908-5672
2nd row051-205-2195
3rd row051-253-3143
4th row051-206-0561
5th row051-204-1148
ValueCountFrequency (%)
051-908-5672 1
8.3%
051-205-2195 1
8.3%
051-253-3143 1
8.3%
051-206-0561 1
8.3%
051-204-1148 1
8.3%
051-248-3446 1
8.3%
051-292-4292 1
8.3%
051-265-7622 1
8.3%
051-292-4008 1
8.3%
051-291-1176 1
8.3%
Other values (2) 2
16.7%
2023-12-11T00:59:38.229214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 24
16.7%
5 22
15.3%
1 21
14.6%
2 21
14.6%
0 20
13.9%
6 10
6.9%
4 8
 
5.6%
9 6
 
4.2%
3 5
 
3.5%
8 4
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 120
83.3%
Dash Punctuation 24
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 22
18.3%
1 21
17.5%
2 21
17.5%
0 20
16.7%
6 10
8.3%
4 8
 
6.7%
9 6
 
5.0%
3 5
 
4.2%
8 4
 
3.3%
7 3
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 144
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 24
16.7%
5 22
15.3%
1 21
14.6%
2 21
14.6%
0 20
13.9%
6 10
6.9%
4 8
 
5.6%
9 6
 
4.2%
3 5
 
3.5%
8 4
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 144
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 24
16.7%
5 22
15.3%
1 21
14.6%
2 21
14.6%
0 20
13.9%
6 10
6.9%
4 8
 
5.6%
9 6
 
4.2%
3 5
 
3.5%
8 4
 
2.8%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
Minimum2023-01-26 00:00:00
Maximum2023-01-26 00:00:00
2023-12-11T00:59:38.399646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:59:38.547216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2023-12-11T00:59:35.288468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T00:59:35.439016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

이름소재지연락처데이터기준일자
0부산서부교회사하구 장평로 111-6 (장림동)<NA>2023-01-26
1시온성교회사하구 두송로 57 상가동 (장림동)<NA>2023-01-26
2서진교회사하구 다대로277번길 36 2층 (장림동)<NA>2023-01-26
3감천제일교회사하구 옥천로75번길 14(감천동)<NA>2023-01-26
4감천중앙교회사하구 감천로117번길 9(감천동)<NA>2023-01-26
5괴정제일교회사하구 장평로449번길17(괴정동)<NA>2023-01-26
6구평제일교회사하구 을숙도대로745번길21(구평동)<NA>2023-01-26
7다대교회사하구 다대로529번길38-2(다대동)<NA>2023-01-26
8다대로교회사하구 윤공단로14번길 87,2층<NA>2023-01-26
9당리성산교회사하구 제석로 127, 3층((혜성A, 상가)<NA>2023-01-26
이름소재지연락처데이터기준일자
243신평교당사하구 장평로299번안길 6<NA>2023-01-26
244원불교하단성적지사하구 괴정로15번길 46<NA>2023-01-26
245하단성당사하구 동매로 82-6<NA>2023-01-26
246장림성당사하구 하신중앙로54번길 62<NA>2023-01-26
247아미성당사하구 옥천로 131<NA>2023-01-26
248다대성당사하구 다송로 53<NA>2023-01-26
249사하성당사하구 사하로 179<NA>2023-01-26
250괴정성당사하구 승학로299번길 11<NA>2023-01-26
251몰운대성당사하구 다대낙조2길 70051-265-55312023-01-26
252당리성당사하구 승학로71번길 115051-205-22662023-01-26