Overview

Dataset statistics

Number of variables: 6
Number of observations: 569
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 27.4 KiB
Average record size in memory: 49.2 B

Variable types

Numeric: 1
Categorical: 3
Text: 2

Dataset

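Description: Data on religious facilities in Gwangsan-gu, Gwangju Metropolitan City, as of 2022-06-01 (Christian, Buddhist, Catholic, Islamic, and others), including facility name, administrative dong, and location.
URL: https://www.data.go.kr/data/15117799/fileData.do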

Alerts

데이터기준일자 has constant value "2022-06-01" (Constant)
연번 is highly overall correlated with 행정동 (High correlation)
행정동 is highly overall correlated with 연번 (High correlation)
종교구분 is highly imbalanced (71.7%) (Imbalance)
연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 05:49:01.771469
Analysis finished: 2023-12-12 05:49:03.349070
Duration: 1.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
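
The reported duration can be cross-checked from the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the "Analysis started" and "Analysis finished" fields.
started = datetime.fromisoformat("2023-12-12 05:49:01.771469")
finished = datetime.fromisoformat("2023-12-12 05:49:03.349070")

# Elapsed wall-clock time in seconds, rounded the way the report shows it.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

The difference rounds to the 1.58 seconds shown in the Duration field.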

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 569
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 285
Minimum: 1
Maximum: 569
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 5.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 29.4
Q1: 143
Median: 285
Q3: 427
95-th percentile: 540.6
Maximum: 569
Range: 568
Interquartile range (IQR): 284

Descriptive statistics

Standard deviation: 164.40043
Coefficient of variation (CV): 0.5768436
Kurtosis: -1.2
Mean: 285
Median Absolute Deviation (MAD): 142
Skewness: 0
Sum: 162165
Variance: 27027.5
Monotonicity: Strictly increasing
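
The quantile and descriptive statistics above are what one would expect from a plain row counter. The sketch below recomputes them under the assumption that 연번 holds the integers 1 through 569, which is consistent with the reported minimum, maximum, distinct count, and sum but is not stated outright in the report. The `quantile` helper is hypothetical and uses linear interpolation between order statistics, an assumed convention that happens to reproduce the reported percentiles.

```python
import math
import statistics

# Assumption: 연번 is the sequence 1..569 with no gaps.
values = list(range(1, 570))

def quantile(sorted_values, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)
    lo = math.floor(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)          # sample standard deviation
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
q05, q1, q3, q95 = (quantile(values, q) for q in (0.05, 0.25, 0.75, 0.95))
iqr = q3 - q1

print(total, mean, variance, round(std, 5), round(cv, 7))
print(round(q05, 1), q1, median, q3, round(q95, 1), iqr, mad)
```

Under that assumption the results agree with the figures listed above (sum 162165, variance 27027.5, MAD 142, and so on), which supports reading 연번 as a serial number rather than a measured quantity.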
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.2%
383 | 1 | 0.2%
377 | 1 | 0.2%
378 | 1 | 0.2%
379 | 1 | 0.2%
380 | 1 | 0.2%
381 | 1 | 0.2%
382 | 1 | 0.2%
384 | 1 | 0.2%
375 | 1 | 0.2%
Other values (559) | 559 | 98.2%

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Value | Count | Frequency (%)
569 | 1 | 0.2%
568 | 1 | 0.2%
567 | 1 | 0.2%
566 | 1 | 0.2%
565 | 1 | 0.2%
564 | 1 | 0.2%
563 | 1 | 0.2%
562 | 1 | 0.2%
561 | 1 | 0.2%
560 | 1 | 0.2%

행정동
Categorical

HIGH CORRELATION 

Distinct: 21
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

첨단2동: 70
신가동: 57
어룡동: 47
월곡2동: 43
수완동: 39
Other values (16): 313

Length

Max length: 4
Median length: 3
Mean length: 3.3268893
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 하남동
2nd row: 하남동
3rd row: 하남동
4th row: 하남동
5th row: 하남동

Common Values

Value | Count | Frequency (%)
첨단2동 | 70 | 12.3%
신가동 | 57 | 10.0%
어룡동 | 47 | 8.3%
월곡2동 | 43 | 7.6%
수완동 | 39 | 6.9%
우산동 | 37 | 6.5%
신창동 | 36 | 6.3%
월곡1동 | 33 | 5.8%
운남동 | 27 | 4.7%
비아동 | 26 | 4.6%
Other values (11) | 154 | 27.1%
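
As a quick consistency check on the table above, each percentage is the count divided by the 569 observations, and the "Other values" row is whatever remains after the ten most frequent categories. The sketch below redoes that arithmetic; the counts are copied from the table, and the variable names are illustrative.

```python
# Counts for the ten most frequent 행정동 values, copied from the table above.
top_counts = {
    "첨단2동": 70, "신가동": 57, "어룡동": 47, "월곡2동": 43, "수완동": 39,
    "우산동": 37, "신창동": 36, "월곡1동": 33, "운남동": 27, "비아동": 26,
}
n_observations = 569  # from the dataset statistics

# Share of each listed category, formatted to one decimal place like the report.
shares = {name: f"{100 * count / n_observations:.1f}%" for name, count in top_counts.items()}

# Rows not covered by the top ten fall into "Other values".
other_count = n_observations - sum(top_counts.values())
other_share = f"{100 * other_count / n_observations:.1f}%"

print(shares)
print(other_count, other_share)
```

The remainder comes out to 154 rows (27.1%), matching the "Other values (11)" row.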

Length

Histogram of lengths of the category

시설명
Text

Distinct: 524
Distinct (%): 92.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 22
Median length: 19
Mean length: 5.9050967
Min length: 3

Characters and Unicode

Total characters: 3360
Distinct characters: 306
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 488
Unique (%): 85.8%

Sample

1st row: 세움교회
2nd row: 화평교회
3rd row: 광주새암교회
4th row: 가나안교회
5th row: 여호와의증인 신가, 월곡, 하남 회중교회

Value | Count | Frequency (%)
포교원 | 5 | 0.8%
참빛교회 | 4 | 0.7%
원불교 | 4 | 0.7%
주사랑교회 | 4 | 0.7%
은혜교회 | 3 | 0.5%
열린벧엘교회 | 3 | 0.5%
생명나무교회 | 3 | 0.5%
주은혜교회 | 3 | 0.5%
교회 | 3 | 0.5%
행복한교회 | 3 | 0.5%
Other values (537) | 580 | 94.3%

Most occurring characters

ValueCountFrequency (%)
519
 
15.4%
505
 
15.0%
100
 
3.0%
89
 
2.6%
78
 
2.3%
47
 
1.4%
46
 
1.4%
43
 
1.3%
42
 
1.2%
42
 
1.2%
Other values (296) 1849
55.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3267 | 97.2%
Space Separator | 46 | 1.4%
Uppercase Letter | 16 | 0.5%
Close Punctuation | 12 | 0.4%
Open Punctuation | 12 | 0.4%
Other Punctuation | 4 | 0.1%
Dash Punctuation | 3 | 0.1%
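
The category names in this table correspond to Unicode general categories (for example, Hangul syllables are "Other Letter", code `Lo`). As an illustration, the sketch below classifies the characters of one facility name taken from the sample rows of this report, using Python's standard `unicodedata` module. The `LABELS` mapping is written out here for readability and is not taken from the profiling tool.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general-category codes mapped to the names used in the report.
LABELS = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}

# One 시설명 value from the report's sample rows.
name = "하남 앗따우바(AT-TAUBAH) 예배소"

# Count characters per category, falling back to the raw code if unmapped.
category_counts = Counter(
    LABELS.get(unicodedata.category(ch), unicodedata.category(ch)) for ch in name
)
print(len(name), dict(category_counts))
```

This single name is 22 characters long, the same as the reported maximum length for the variable, and contributes 8 of the 16 uppercase letters counted above.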

Most frequent character per category

Other Letter
ValueCountFrequency (%)
519
 
15.9%
505
 
15.5%
100
 
3.1%
89
 
2.7%
78
 
2.4%
47
 
1.4%
43
 
1.3%
42
 
1.3%
42
 
1.3%
34
 
1.0%
Other values (281) 1768
54.1%
Uppercase Letter
ValueCountFrequency (%)
A 5
31.2%
H 3
18.8%
L 2
 
12.5%
T 2
 
12.5%
B 1
 
6.2%
I 1
 
6.2%
U 1
 
6.2%
S 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 2
50.0%
. 1
25.0%
: 1
25.0%
Space Separator
ValueCountFrequency (%)
46
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 3267 | 97.2%
Common | 77 | 2.3%
Latin | 16 | 0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
519
 
15.9%
505
 
15.5%
100
 
3.1%
89
 
2.7%
78
 
2.4%
47
 
1.4%
43
 
1.3%
42
 
1.3%
42
 
1.3%
34
 
1.0%
Other values (281) 1768
54.1%
Latin
ValueCountFrequency (%)
A 5
31.2%
H 3
18.8%
L 2
 
12.5%
T 2
 
12.5%
B 1
 
6.2%
I 1
 
6.2%
U 1
 
6.2%
S 1
 
6.2%
Common
ValueCountFrequency (%)
46
59.7%
) 12
 
15.6%
( 12
 
15.6%
- 3
 
3.9%
, 2
 
2.6%
. 1
 
1.3%
: 1
 
1.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 3267 | 97.2%
ASCII | 93 | 2.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
519
 
15.9%
505
 
15.5%
100
 
3.1%
89
 
2.7%
78
 
2.4%
47
 
1.4%
43
 
1.3%
42
 
1.3%
42
 
1.3%
34
 
1.0%
Other values (281) 1768
54.1%
ASCII
ValueCountFrequency (%)
46
49.5%
) 12
 
12.9%
( 12
 
12.9%
A 5
 
5.4%
H 3
 
3.2%
- 3
 
3.2%
L 2
 
2.2%
T 2
 
2.2%
, 2
 
2.2%
B 1
 
1.1%
Other values (5) 5
 
5.4%

소재지
Text

Distinct: 565
Distinct (%): 99.3%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

Length

Max length: 39
Median length: 31
Mean length: 16.938489
Min length: 9

Characters and Unicode

Total characters: 9638
Distinct characters: 152
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 561
Unique (%): 98.6%

Sample

1st row: 광산구 목련로16
2nd row: 광산구 왕버들로45
3rd row: 광산구 왕버들로33-3
4th row: 광산구 사암로 423-1
5th row: 광산구 수남길 52

Value | Count | Frequency (%)
광산구 | 573 | 29.9%
2층 | 29 | 1.5%
사암로 | 20 | 1.0%
3층 | 15 | 0.8%
월계로 | 15 | 0.8%
월곡산정로 | 14 | 0.7%
목련로 | 13 | 0.7%
수등로 | 10 | 0.5%
상가동 | 10 | 0.5%
산정공원로 | 9 | 0.5%
Other values (781) | 1211 | 63.1%

Most occurring characters

ValueCountFrequency (%)
1413
 
14.7%
658
 
6.8%
590
 
6.1%
577
 
6.0%
498
 
5.2%
1 487
 
5.1%
392
 
4.1%
2 379
 
3.9%
329
 
3.4%
3 285
 
3.0%
Other values (142) 4030
41.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5189 | 53.8%
Decimal Number | 2465 | 25.6%
Space Separator | 1413 | 14.7%
Dash Punctuation | 210 | 2.2%
Open Punctuation | 133 | 1.4%
Close Punctuation | 133 | 1.4%
Other Punctuation | 94 | 1.0%
Uppercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
658
 
12.7%
590
 
11.4%
577
 
11.1%
498
 
9.6%
392
 
7.6%
329
 
6.3%
176
 
3.4%
76
 
1.5%
74
 
1.4%
73
 
1.4%
Other values (125) 1746
33.6%
Decimal Number
ValueCountFrequency (%)
1 487
19.8%
2 379
15.4%
3 285
11.6%
4 224
9.1%
7 206
8.4%
5 198
8.0%
0 195
7.9%
6 187
 
7.6%
8 166
 
6.7%
9 138
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 90
95.7%
@ 4
 
4.3%
Space Separator
ValueCountFrequency (%)
1413
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 210
100.0%
Open Punctuation
ValueCountFrequency (%)
( 133
100.0%
Close Punctuation
ValueCountFrequency (%)
) 133
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5189 | 53.8%
Common | 4448 | 46.2%
Latin | 1 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
658
 
12.7%
590
 
11.4%
577
 
11.1%
498
 
9.6%
392
 
7.6%
329
 
6.3%
176
 
3.4%
76
 
1.5%
74
 
1.4%
73
 
1.4%
Other values (125) 1746
33.6%
Common
ValueCountFrequency (%)
1413
31.8%
1 487
 
10.9%
2 379
 
8.5%
3 285
 
6.4%
4 224
 
5.0%
- 210
 
4.7%
7 206
 
4.6%
5 198
 
4.5%
0 195
 
4.4%
6 187
 
4.2%
Other values (6) 664
14.9%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5189 | 53.8%
ASCII | 4449 | 46.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1413
31.8%
1 487
 
10.9%
2 379
 
8.5%
3 285
 
6.4%
4 224
 
5.0%
- 210
 
4.7%
7 206
 
4.6%
5 198
 
4.5%
0 195
 
4.4%
6 187
 
4.2%
Other values (7) 665
14.9%
Hangul
ValueCountFrequency (%)
658
 
12.7%
590
 
11.4%
577
 
11.1%
498
 
9.6%
392
 
7.6%
329
 
6.3%
176
 
3.4%
76
 
1.5%
74
 
1.4%
73
 
1.4%
Other values (125) 1746
33.6%

종교구분
Categorical

IMBALANCE 

Distinct: 8
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

기독교: 490
불교: 45
천주교: 15
기도원: 6
원불교: 4
Other values (3): 9

Length

Max length: 5
Median length: 3
Mean length: 2.9420035
Min length: 2

Unique

Unique: 1
Unique (%): 0.2%
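
"Distinct" and "Unique" measure different things in this report: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below illustrates the distinction with the 종교구분 counts listed in this section's Common Values table; the variable names are illustrative.

```python
# Value counts for 종교구분, copied from the Common Values table in this section.
value_counts = {
    "기독교": 490, "불교": 45, "천주교": 15, "기도원": 6,
    "원불교": 4, "대순진리교": 4, "이슬람교": 4, "수녀원": 1,
}

n_rows = sum(value_counts.values())

# Distinct: number of different values that appear at least once.
distinct = len(value_counts)

# Unique: number of values that appear exactly once.
unique = sum(1 for count in value_counts.values() if count == 1)

print(n_rows, distinct, f"{100 * distinct / n_rows:.1f}%")
print(unique, f"{100 * unique / n_rows:.1f}%")
```

Only 수녀원 appears a single time, which is why the variable has 8 distinct values but just 1 unique value.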

Sample

1st row: 기독교
2nd row: 기독교
3rd row: 기독교
4th row: 기독교
5th row: 기독교

Common Values

Value | Count | Frequency (%)
기독교 | 490 | 86.1%
불교 | 45 | 7.9%
천주교 | 15 | 2.6%
기도원 | 6 | 1.1%
원불교 | 4 | 0.7%
대순진리교 | 4 | 0.7%
이슬람교 | 4 | 0.7%
수녀원 | 1 | 0.2%
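
The 71.7% imbalance figure in the Alerts section is consistent with one minus the normalized Shannon entropy of the class distribution. The sketch below computes that score from the counts in the table above. Whether the profiling tool uses exactly this formula is an assumption here; the point is only that this definition reproduces the reported number.

```python
import math

# 종교구분 class counts, copied from the Common Values table above.
counts = [490, 45, 15, 6, 4, 4, 4, 1]
total = sum(counts)

# Shannon entropy (base 2) of the observed class distribution.
entropy = -sum((c / total) * math.log2(c / total) for c in counts)

# Normalize by the maximum possible entropy for this many classes, then invert:
# 0 means perfectly balanced, 1 means a single class dominates completely.
imbalance = 1 - entropy / math.log2(len(counts))

print(f"{100 * imbalance:.1f}%")
```

A score this high mainly reflects that a single class (기독교, 86.1% of rows) dominates the variable.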

Length

Histogram of lengths of the category

Common Values (Plot)


데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.6 KiB

2022-06-01: 569

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022-06-01
2nd row: 2022-06-01
3rd row: 2022-06-01
4th row: 2022-06-01
5th row: 2022-06-01

Common Values

Value | Count | Frequency (%)
2022-06-01 | 569 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)


Interactions


Correlations

 | 연번 | 행정동 | 종교구분
연번 | 1.000 | 0.942 | 0.657
행정동 | 0.942 | 1.000 | 0.338
종교구분 | 0.657 | 0.338 | 1.000

 | 행정동 | 종교구분
행정동 | 1.000 | 0.145
종교구분 | 0.145 | 1.000

 | 연번 | 행정동 | 종교구분
연번 | 1.000 | 0.724 | 0.388
행정동 | 0.724 | 1.000 | 0.145
종교구분 | 0.388 | 0.145 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 연번 | 행정동 | 시설명 | 소재지 | 종교구분 | 데이터기준일자
0 | 1 | 하남동 | 세움교회 | 광산구 목련로16 | 기독교 | 2022-06-01
1 | 2 | 하남동 | 화평교회 | 광산구 왕버들로45 | 기독교 | 2022-06-01
2 | 3 | 하남동 | 광주새암교회 | 광산구 왕버들로33-3 | 기독교 | 2022-06-01
3 | 4 | 하남동 | 가나안교회 | 광산구 사암로 423-1 | 기독교 | 2022-06-01
4 | 5 | 하남동 | 여호와의증인 신가, 월곡, 하남 회중교회 | 광산구 수남길 52 | 기독교 | 2022-06-01
5 | 6 | 하남동 | 하남장수교회 | 광산구 장수길 96-4 | 기독교 | 2022-06-01
6 | 7 | 하남동 | 순복음호남교회 | 광산구 하남대로54번안길 82 | 기독교 | 2022-06-01
7 | 8 | 하남동 | 하남교회 | 광산구 하남대로76번길87 | 기독교 | 2022-06-01
8 | 9 | 하남동 | 수완행복한교회 | 광산구 하남산단5번로 42 | 기독교 | 2022-06-01
9 | 10 | 평동 | 축복받는교회 | 광산구 기곡길 190 | 기독교 | 2022-06-01

Index | 연번 | 행정동 | 시설명 | 소재지 | 종교구분 | 데이터기준일자
559 | 560 | 월곡2동 | 산정동 이슬람 예배소 | 광산구 산정공원로 43(지하 1층) | 이슬람교 | 2022-06-01
560 | 561 | 비아동 | 하남 앗따우바(AT-TAUBAH) 예배소 | 광산구 하남산단8번로 172(2층) | 이슬람교 | 2022-06-01
561 | 562 | 평동 | 평동 알이슬라(AL-ISHLAH) 예배소 | 광산구 평동로 851, 선양빌딩 5층 | 이슬람교 | 2022-06-01
562 | 563 | 어룡동 | 하늘문기도원 | 광산구 송정공원로47번길 15 | 기도원 | 2022-06-01
563 | 564 | 월곡2동 | 갈보리 기도원 | 광산구 산정공원로10번길 7 | 기도원 | 2022-06-01
564 | 565 | 신가동 | 엘벧엘 기도원 | 광산구 목련로397번안길 5 | 기도원 | 2022-06-01
565 | 566 | 첨단1동 | 광주비손수양관 | 광산구 북문대로 902-45 | 기도원 | 2022-06-01
566 | 567 | 비아동 | 드림기도원 | 광산구 비아중앙로 38 | 기도원 | 2022-06-01
567 | 568 | 도산동 | 사랑의기도원 | 광산구 송도로 177 | 기도원 | 2022-06-01
568 | 569 | 삼도동 | 이사벨레떼 영성원 | 광산구 삼도송계길 51 | 수녀원 | 2022-06-01