Overview

Dataset statistics

Number of variables6
Number of observations96
Missing cells36
Missing cells (%)6.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.7 KiB
Average record size in memory50.4 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description인천광역시 동구에 있는 종교시설현황 데이터로, 시설명, 도로명주소, 전화번호, 종교구분, 데이터기준일자 등 항목을 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15117193/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
종교구분 is highly imbalanced (71.2%)Imbalance
전화번호 has 36 (37.5%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:23:50.141083
Analysis finished2023-12-12 17:23:51.534979
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct96
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48.5
Minimum1
Maximum96
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size996.0 B
2023-12-13T02:23:51.654018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.75
Q124.75
median48.5
Q372.25
95-th percentile91.25
Maximum96
Range95
Interquartile range (IQR)47.5

Descriptive statistics

Standard deviation27.856777
Coefficient of variation (CV)0.57436653
Kurtosis-1.2
Mean48.5
Median Absolute Deviation (MAD)24
Skewness0
Sum4656
Variance776
MonotonicityStrictly increasing
2023-12-13T02:23:51.865870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
50 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
67 1
 
1.0%
66 1
 
1.0%
65 1
 
1.0%
Other values (86) 86
89.6%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%
90 1
1.0%
89 1
1.0%
88 1
1.0%
87 1
1.0%
Distinct94
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size900.0 B
2023-12-13T02:23:52.171313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length6.2291667
Min length3

Characters and Unicode

Total characters598
Distinct characters139
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)96.9%

Sample

1st row만석중앙교회
2nd row만석감리교회
3rd row보아스교회
4th row예장성서교회
5th row동광장로교회
ValueCountFrequency (%)
새소망교회 3
 
2.9%
교회 2
 
1.9%
송림교회(성결 1
 
1.0%
구세군송림교회 1
 
1.0%
송림4동성당 1
 
1.0%
하나님의 1
 
1.0%
하늘빛교회 1
 
1.0%
초원교회 1
 
1.0%
새빛교회 1
 
1.0%
주사랑양문교회 1
 
1.0%
Other values (90) 90
87.4%
2023-12-13T02:23:52.667486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
91
 
15.2%
90
 
15.1%
16
 
2.7%
15
 
2.5%
13
 
2.2%
13
 
2.2%
12
 
2.0%
9
 
1.5%
9
 
1.5%
8
 
1.3%
Other values (129) 322
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 575
96.2%
Open Punctuation 7
 
1.2%
Space Separator 7
 
1.2%
Close Punctuation 7
 
1.2%
Decimal Number 1
 
0.2%
Other Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
91
 
15.8%
90
 
15.7%
16
 
2.8%
15
 
2.6%
13
 
2.3%
13
 
2.3%
12
 
2.1%
9
 
1.6%
9
 
1.6%
8
 
1.4%
Other values (124) 299
52.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Space Separator
ValueCountFrequency (%)
7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Decimal Number
ValueCountFrequency (%)
4 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 573
95.8%
Common 23
 
3.8%
Han 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
91
 
15.9%
90
 
15.7%
16
 
2.8%
15
 
2.6%
13
 
2.3%
13
 
2.3%
12
 
2.1%
9
 
1.6%
9
 
1.6%
8
 
1.4%
Other values (123) 297
51.8%
Common
ValueCountFrequency (%)
( 7
30.4%
7
30.4%
) 7
30.4%
4 1
 
4.3%
& 1
 
4.3%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 573
95.8%
ASCII 23
 
3.8%
CJK 2
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
91
 
15.9%
90
 
15.7%
16
 
2.8%
15
 
2.6%
13
 
2.3%
13
 
2.3%
12
 
2.1%
9
 
1.6%
9
 
1.6%
8
 
1.4%
Other values (123) 297
51.8%
ASCII
ValueCountFrequency (%)
( 7
30.4%
7
30.4%
) 7
30.4%
4 1
 
4.3%
& 1
 
4.3%
CJK
ValueCountFrequency (%)
2
100.0%
Distinct95
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size900.0 B
2023-12-13T02:23:53.014370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length25
Mean length18.84375
Min length14

Characters and Unicode

Total characters1809
Distinct characters101
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)97.9%

Sample

1st row인천광역시 동구 제물량로 404
2nd row인천광역시 동구 화도진로186번길 65
3rd row인천광역시 동구 화도진로 187, 만석비치A 103-1903
4th row인천광역시 동구 화도진로 187, 만석비치A 101-1103
5th row인천광역시 동구 어촌로5번길 8
ValueCountFrequency (%)
인천광역시 96
23.0%
동구 96
23.0%
송림로 9
 
2.2%
화도진로 9
 
2.2%
2층 9
 
2.2%
화수로 7
 
1.7%
동산로 6
 
1.4%
15 5
 
1.2%
3층 5
 
1.2%
수문통로 5
 
1.2%
Other values (136) 171
40.9%
2023-12-13T02:23:53.601332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
322
17.8%
108
 
6.0%
97
 
5.4%
97
 
5.4%
97
 
5.4%
96
 
5.3%
96
 
5.3%
96
 
5.3%
96
 
5.3%
1 74
 
4.1%
Other values (91) 630
34.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1132
62.6%
Space Separator 322
 
17.8%
Decimal Number 305
 
16.9%
Other Punctuation 23
 
1.3%
Dash Punctuation 13
 
0.7%
Close Punctuation 6
 
0.3%
Open Punctuation 6
 
0.3%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
108
 
9.5%
97
 
8.6%
97
 
8.6%
97
 
8.6%
96
 
8.5%
96
 
8.5%
96
 
8.5%
96
 
8.5%
30
 
2.7%
27
 
2.4%
Other values (75) 292
25.8%
Decimal Number
ValueCountFrequency (%)
1 74
24.3%
2 42
13.8%
3 35
11.5%
6 33
10.8%
4 29
 
9.5%
0 22
 
7.2%
5 20
 
6.6%
8 18
 
5.9%
7 17
 
5.6%
9 15
 
4.9%
Space Separator
ValueCountFrequency (%)
322
100.0%
Other Punctuation
ValueCountFrequency (%)
, 23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1132
62.6%
Common 675
37.3%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
108
 
9.5%
97
 
8.6%
97
 
8.6%
97
 
8.6%
96
 
8.5%
96
 
8.5%
96
 
8.5%
96
 
8.5%
30
 
2.7%
27
 
2.4%
Other values (75) 292
25.8%
Common
ValueCountFrequency (%)
322
47.7%
1 74
 
11.0%
2 42
 
6.2%
3 35
 
5.2%
6 33
 
4.9%
4 29
 
4.3%
, 23
 
3.4%
0 22
 
3.3%
5 20
 
3.0%
8 18
 
2.7%
Other values (5) 57
 
8.4%
Latin
ValueCountFrequency (%)
A 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1132
62.6%
ASCII 677
37.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
322
47.6%
1 74
 
10.9%
2 42
 
6.2%
3 35
 
5.2%
6 33
 
4.9%
4 29
 
4.3%
, 23
 
3.4%
0 22
 
3.2%
5 20
 
3.0%
8 18
 
2.7%
Other values (6) 59
 
8.7%
Hangul
ValueCountFrequency (%)
108
 
9.5%
97
 
8.6%
97
 
8.6%
97
 
8.6%
96
 
8.5%
96
 
8.5%
96
 
8.5%
96
 
8.5%
30
 
2.7%
27
 
2.4%
Other values (75) 292
25.8%

전화번호
Text

MISSING 

Distinct59
Distinct (%)98.3%
Missing36
Missing (%)37.5%
Memory size900.0 B
2023-12-13T02:23:53.927017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.033333
Min length12

Characters and Unicode

Total characters722
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)96.7%

Sample

1st row032-777-5386
2nd row032-773-7455
3rd row032-762-1209
4th row032-761-1009
5th row032-763-6630
ValueCountFrequency (%)
032-764-4066 2
 
3.3%
032-766-5276 1
 
1.7%
032-777-5386 1
 
1.7%
032-873-9585 1
 
1.7%
032-763-2700 1
 
1.7%
032-766-5550 1
 
1.7%
032-762-3667 1
 
1.7%
032-763-1961 1
 
1.7%
032-777-8511 1
 
1.7%
032-762-5512 1
 
1.7%
Other values (49) 49
81.7%
2023-12-13T02:23:54.380136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 120
16.6%
2 102
14.1%
3 98
13.6%
7 91
12.6%
0 88
12.2%
6 78
10.8%
1 38
 
5.3%
5 33
 
4.6%
8 30
 
4.2%
4 23
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 602
83.4%
Dash Punctuation 120
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 102
16.9%
3 98
16.3%
7 91
15.1%
0 88
14.6%
6 78
13.0%
1 38
 
6.3%
5 33
 
5.5%
8 30
 
5.0%
4 23
 
3.8%
9 21
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 120
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 722
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 120
16.6%
2 102
14.1%
3 98
13.6%
7 91
12.6%
0 88
12.2%
6 78
10.8%
1 38
 
5.3%
5 33
 
4.6%
8 30
 
4.2%
4 23
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 722
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 120
16.6%
2 102
14.1%
3 98
13.6%
7 91
12.6%
0 88
12.2%
6 78
10.8%
1 38
 
5.3%
5 33
 
4.6%
8 30
 
4.2%
4 23
 
3.2%

종교구분
Categorical

IMBALANCE 

Distinct4
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size900.0 B
기독교
87 
천주교
 
5
불교
 
3
기타
 
1

Length

Max length3
Median length3
Mean length2.9583333
Min length2

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row기독교
2nd row기독교
3rd row기독교
4th row기독교
5th row기독교

Common Values

ValueCountFrequency (%)
기독교 87
90.6%
천주교 5
 
5.2%
불교 3
 
3.1%
기타 1
 
1.0%

Length

2023-12-13T02:23:54.560492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:23:54.687719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기독교 87
90.6%
천주교 5
 
5.2%
불교 3
 
3.1%
기타 1
 
1.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size900.0 B
2023-07-26
96 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-07-26
2nd row2023-07-26
3rd row2023-07-26
4th row2023-07-26
5th row2023-07-26

Common Values

ValueCountFrequency (%)
2023-07-26 96
100.0%

Length

2023-12-13T02:23:54.834074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:23:54.955952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-07-26 96
100.0%

Interactions

2023-12-13T02:23:51.123336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:23:55.045720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번시설명도로명주소전화번호종교구분
연번1.0000.8721.0000.9360.162
시설명0.8721.0000.9970.9971.000
도로명주소1.0000.9971.0001.0001.000
전화번호0.9360.9971.0001.0001.000
종교구분0.1621.0001.0001.0001.000
2023-12-13T02:23:55.183160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번종교구분
연번1.0000.040
종교구분0.0401.000

Missing values

2023-12-13T02:23:51.316265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:23:51.472054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번시설명도로명주소전화번호종교구분데이터기준일자
01만석중앙교회인천광역시 동구 제물량로 404032-777-5386기독교2023-07-26
12만석감리교회인천광역시 동구 화도진로186번길 65032-773-7455기독교2023-07-26
23보아스교회인천광역시 동구 화도진로 187, 만석비치A 103-1903<NA>기독교2023-07-26
34예장성서교회인천광역시 동구 화도진로 187, 만석비치A 101-1103<NA>기독교2023-07-26
45동광장로교회인천광역시 동구 어촌로5번길 8<NA>기독교2023-07-26
56고신교회인천광역시 동구 제물량로341번길 25-1032-762-1209기독교2023-07-26
67만석성결교회인천광역시 동구 만석로 9032-761-1009기독교2023-07-26
78동인교회인천광역시 동구 석수로 15032-763-6630기독교2023-07-26
89인천교회인천광역시 동구 화도진로 80032-764-3223기독교2023-07-26
910인천방주교회인천광역시 동구 화도진로 124032-773-6936기독교2023-07-26
연번시설명도로명주소전화번호종교구분데이터기준일자
8687새앎교회인천광역시 동구 금곡로 42032-764-8559기독교2023-07-26
8788인천성서침례교회인천광역시 동구 금창로36번길 10-16032-773-3795기독교2023-07-26
8889인천중앙교회인천광역시 동구 금곡로 60-1032-773-6473기독교2023-07-26
8990사랑&섬김교회인천광역시 동구 금곡로 64-1070-8288-2692기독교2023-07-26
9091참이웃교회인천광역시 동구 송림로 24, 5층032-435-4541기독교2023-07-26
9192반석교회(반석기도원)인천광역시 동구 수문통로 5-1, 2층<NA>기독교2023-07-26
9293동인천예배당(동인천교회)인천광역시 동구 금창로 39(금곡동)<NA>기독교2023-07-26
9394참된교회인천광역시 동구 송림로 26032-203-2091기독교2023-07-26
9495능력장로교회인천광역시 동구 송림로 12, 6층(금곡동, 솔빛메티칼센터)<NA>기독교2023-07-26
9596방주교회인천광역시 동구 금곡로 55<NA>기독교2023-07-26