Overview

Dataset statistics

Number of variables7
Number of observations190
Missing cells78
Missing cells (%)5.9%
Duplicate rows1
Duplicate rows (%)0.5%
Total size in memory10.5 KiB
Average record size in memory56.7 B

Variable types

Categorical2
Text3
Unsupported2

Dataset

Description태안군 마을회관 정보
Author충청남도 태안군
URLhttps://www.data.go.kr/data/15004505/fileData.do

Alerts

Dataset has 1 (0.5%) duplicate rowsDuplicates
태안군 마을회관 현황(2015년) is highly overall correlated with Unnamed: 6High correlation
Unnamed: 6 is highly overall correlated with 태안군 마을회관 현황(2015년)High correlation
Unnamed: 6 is highly imbalanced (55.0%)Imbalance
Unnamed: 1 has 2 (1.1%) missing valuesMissing
Unnamed: 2 has 8 (4.2%) missing valuesMissing
Unnamed: 3 has 53 (27.9%) missing valuesMissing
Unnamed: 4 has 8 (4.2%) missing valuesMissing
Unnamed: 5 has 7 (3.7%) missing valuesMissing
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 00:22:54.320816
Analysis finished2023-12-12 00:22:54.962104
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

태안군 마을회관 현황(2015년)
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
태안읍
45 
안면읍
28 
원북면
24 
소원면
23 
근흥면
20 
Other values (5)
50 

Length

Max length5
Median length3
Mean length2.9368421
Min length2

Unique

Unique1 ?
Unique (%)0.5%

Sample

1st row<NA>
2nd row읍 · 면
3rd row<NA>
4th row태안읍
5th row태안읍

Common Values

ValueCountFrequency (%)
태안읍 45
23.7%
안면읍 28
14.7%
원북면 24
12.6%
소원면 23
12.1%
근흥면 20
10.5%
남면 16
 
8.4%
이원면 16
 
8.4%
고남면 15
 
7.9%
<NA> 2
 
1.1%
읍 · 면 1
 
0.5%

Length

2023-12-12T09:22:55.028216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:22:55.156158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
태안읍 45
23.4%
안면읍 28
14.6%
원북면 24
12.5%
소원면 23
12.0%
근흥면 20
10.4%
남면 16
 
8.3%
이원면 16
 
8.3%
고남면 15
 
7.8%
na 2
 
1.0%
1
 
0.5%
Other values (2) 2
 
1.0%

Unnamed: 1
Text

MISSING 

Distinct188
Distinct (%)100.0%
Missing2
Missing (%)1.1%
Memory size1.6 KiB
2023-12-12T09:22:55.552084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length8.9521277
Min length6

Characters and Unicode

Total characters1683
Distinct characters87
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique188 ?
Unique (%)100.0%

Sample

1st row회 관 명
2nd row동문1리 마을회관
3rd row동문2리 마을회관
4th row동문3리 마을회관
5th row동문4리 마을회관
ValueCountFrequency (%)
마을회관 187
49.6%
신장1리 1
 
0.3%
달산1리 1
 
0.3%
신덕2리 1
 
0.3%
정죽4리 1
 
0.3%
정죽5리 1
 
0.3%
신진도1리 1
 
0.3%
신진도2리 1
 
0.3%
가의도리 1
 
0.3%
시목1리 1
 
0.3%
Other values (181) 181
48.0%
2023-12-12T09:22:56.094919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
192
11.4%
192
11.4%
191
11.3%
188
11.2%
187
11.1%
187
11.1%
1 59
 
3.5%
2 59
 
3.5%
3 29
 
1.7%
26
 
1.5%
Other values (77) 373
22.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1309
77.8%
Space Separator 192
 
11.4%
Decimal Number 182
 
10.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
192
14.7%
191
14.6%
188
14.4%
187
14.3%
187
14.3%
26
 
2.0%
17
 
1.3%
16
 
1.2%
16
 
1.2%
12
 
0.9%
Other values (68) 277
21.2%
Decimal Number
ValueCountFrequency (%)
1 59
32.4%
2 59
32.4%
3 29
15.9%
4 14
 
7.7%
5 9
 
4.9%
6 7
 
3.8%
7 4
 
2.2%
8 1
 
0.5%
Space Separator
ValueCountFrequency (%)
192
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1309
77.8%
Common 374
 
22.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
192
14.7%
191
14.6%
188
14.4%
187
14.3%
187
14.3%
26
 
2.0%
17
 
1.3%
16
 
1.2%
16
 
1.2%
12
 
0.9%
Other values (68) 277
21.2%
Common
ValueCountFrequency (%)
192
51.3%
1 59
 
15.8%
2 59
 
15.8%
3 29
 
7.8%
4 14
 
3.7%
5 9
 
2.4%
6 7
 
1.9%
7 4
 
1.1%
8 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1309
77.8%
ASCII 374
 
22.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
192
51.3%
1 59
 
15.8%
2 59
 
15.8%
3 29
 
7.8%
4 14
 
3.7%
5 9
 
2.4%
6 7
 
1.9%
7 4
 
1.1%
8 1
 
0.3%
Hangul
ValueCountFrequency (%)
192
14.7%
191
14.6%
188
14.4%
187
14.3%
187
14.3%
26
 
2.0%
17
 
1.3%
16
 
1.2%
16
 
1.2%
12
 
0.9%
Other values (68) 277
21.2%

Unnamed: 2
Text

MISSING 

Distinct181
Distinct (%)99.5%
Missing8
Missing (%)4.2%
Memory size1.6 KiB
2023-12-12T09:22:56.532952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length14
Mean length11.840659
Min length8

Characters and Unicode

Total characters2155
Distinct characters162
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique180 ?
Unique (%)98.9%

Sample

1st row주 소 (충남 태안군)
2nd row태안읍 재경미길 8-5
3rd row태안읍 경이정6길
4th row태안읍 시장5길 18-14
5th row태안읍 재경미길 64
ValueCountFrequency (%)
태안읍 35
 
6.4%
안면읍 28
 
5.1%
원북면 24
 
4.4%
소원면 23
 
4.2%
근흥면 20
 
3.7%
이원면 16
 
2.9%
고남면 15
 
2.7%
남면 15
 
2.7%
원이로 10
 
1.8%
태안군 6
 
1.1%
Other values (307) 354
64.8%
2023-12-12T09:22:57.205132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
364
 
16.9%
146
 
6.8%
1 116
 
5.4%
111
 
5.2%
2 85
 
3.9%
81
 
3.8%
76
 
3.5%
- 73
 
3.4%
3 71
 
3.3%
70
 
3.2%
Other values (152) 962
44.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1139
52.9%
Decimal Number 577
26.8%
Space Separator 364
 
16.9%
Dash Punctuation 73
 
3.4%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
146
 
12.8%
111
 
9.7%
81
 
7.1%
76
 
6.7%
70
 
6.1%
63
 
5.5%
42
 
3.7%
33
 
2.9%
32
 
2.8%
27
 
2.4%
Other values (138) 458
40.2%
Decimal Number
ValueCountFrequency (%)
1 116
20.1%
2 85
14.7%
3 71
12.3%
5 58
10.1%
4 55
9.5%
6 51
8.8%
7 40
 
6.9%
9 37
 
6.4%
8 33
 
5.7%
0 31
 
5.4%
Space Separator
ValueCountFrequency (%)
364
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 73
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1139
52.9%
Common 1016
47.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
146
 
12.8%
111
 
9.7%
81
 
7.1%
76
 
6.7%
70
 
6.1%
63
 
5.5%
42
 
3.7%
33
 
2.9%
32
 
2.8%
27
 
2.4%
Other values (138) 458
40.2%
Common
ValueCountFrequency (%)
364
35.8%
1 116
 
11.4%
2 85
 
8.4%
- 73
 
7.2%
3 71
 
7.0%
5 58
 
5.7%
4 55
 
5.4%
6 51
 
5.0%
7 40
 
3.9%
9 37
 
3.6%
Other values (4) 66
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1139
52.9%
ASCII 1016
47.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
364
35.8%
1 116
 
11.4%
2 85
 
8.4%
- 73
 
7.2%
3 71
 
7.0%
5 58
 
5.7%
4 55
 
5.4%
6 51
 
5.0%
7 40
 
3.9%
9 37
 
3.6%
Other values (4) 66
 
6.5%
Hangul
ValueCountFrequency (%)
146
 
12.8%
111
 
9.7%
81
 
7.1%
76
 
6.7%
70
 
6.1%
63
 
5.5%
42
 
3.7%
33
 
2.9%
32
 
2.8%
27
 
2.4%
Other values (138) 458
40.2%

Unnamed: 3
Text

MISSING 

Distinct137
Distinct (%)100.0%
Missing53
Missing (%)27.9%
Memory size1.6 KiB
2023-12-12T09:22:57.576247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length8
Mean length8.0510949
Min length8

Characters and Unicode

Total characters1103
Distinct characters21
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique137 ?
Unique (%)100.0%

Sample

1st row전화번호 (지역번호 041)
2nd row674-8848
3rd row675-4225
4th row675-6107
5th row673-2633
ValueCountFrequency (%)
673-7098 1
 
0.7%
672-7051 1
 
0.7%
672-3887 1
 
0.7%
675-8856 1
 
0.7%
672-3959 1
 
0.7%
672-1063 1
 
0.7%
674-5763 1
 
0.7%
674-2795 1
 
0.7%
672-0511 1
 
0.7%
전화번호 1
 
0.7%
Other values (129) 129
92.8%
2023-12-12T09:22:58.191908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7 191
17.3%
6 188
17.0%
- 136
12.3%
2 100
9.1%
3 94
8.5%
5 84
7.6%
4 72
 
6.5%
0 59
 
5.3%
8 58
 
5.3%
1 57
 
5.2%
Other values (11) 64
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 955
86.6%
Dash Punctuation 136
 
12.3%
Other Letter 8
 
0.7%
Control 1
 
0.1%
Open Punctuation 1
 
0.1%
Space Separator 1
 
0.1%
Close Punctuation 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
7 191
20.0%
6 188
19.7%
2 100
10.5%
3 94
9.8%
5 84
8.8%
4 72
 
7.5%
0 59
 
6.2%
8 58
 
6.1%
1 57
 
6.0%
9 52
 
5.4%
Other Letter
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Dash Punctuation
ValueCountFrequency (%)
- 136
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1095
99.3%
Hangul 8
 
0.7%

Most frequent character per script

Common
ValueCountFrequency (%)
7 191
17.4%
6 188
17.2%
- 136
12.4%
2 100
9.1%
3 94
8.6%
5 84
7.7%
4 72
 
6.6%
0 59
 
5.4%
8 58
 
5.3%
1 57
 
5.2%
Other values (5) 56
 
5.1%
Hangul
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1095
99.3%
Hangul 8
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7 191
17.4%
6 188
17.2%
- 136
12.4%
2 100
9.1%
3 94
8.6%
5 84
7.7%
4 72
 
6.6%
0 59
 
5.4%
8 58
 
5.3%
1 57
 
5.2%
Other values (5) 56
 
5.1%
Hangul
ValueCountFrequency (%)
2
25.0%
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing8
Missing (%)4.2%
Memory size1.6 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing7
Missing (%)3.7%
Memory size1.6 KiB

Unnamed: 6
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size1.6 KiB
<NA>
138 
회관전화 없음
45 
마을회관 없음
 
5
비 고
 
1
신축 공사중
 
1

Length

Max length7
Median length4
Mean length4.7947368
Min length3

Unique

Unique2 ?
Unique (%)1.1%

Sample

1st row<NA>
2nd row비 고
3rd row<NA>
4th row회관전화 없음
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 138
72.6%
회관전화 없음 45
 
23.7%
마을회관 없음 5
 
2.6%
비 고 1
 
0.5%
신축 공사중 1
 
0.5%

Length

2023-12-12T09:22:58.359956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:22:58.494474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 138
57.0%
없음 50
 
20.7%
회관전화 45
 
18.6%
마을회관 5
 
2.1%
1
 
0.4%
1
 
0.4%
신축 1
 
0.4%
공사중 1
 
0.4%

Correlations

2023-12-12T09:22:58.581482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
태안군 마을회관 현황(2015년)Unnamed: 6
태안군 마을회관 현황(2015년)1.0000.865
Unnamed: 60.8651.000
2023-12-12T09:22:58.672006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
태안군 마을회관 현황(2015년)Unnamed: 6
태안군 마을회관 현황(2015년)1.0000.772
Unnamed: 60.7721.000
2023-12-12T09:22:58.792298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
태안군 마을회관 현황(2015년)Unnamed: 6
태안군 마을회관 현황(2015년)1.0000.772
Unnamed: 60.7721.000

Missing values

2023-12-12T09:22:54.561536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:22:54.697164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T09:22:54.861195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

태안군 마을회관 현황(2015년)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
0<NA><NA><NA><NA>NaN기준일 : '15. 1. 9<NA>
1읍 · 면회 관 명주 소 (충남 태안군)전화번호 (지역번호 041)건축연도규모(㎡)비 고
2<NA><NA><NA><NA>NaNNaN<NA>
3태안읍동문1리 마을회관태안읍 재경미길 8-5<NA>1987341.1회관전화 없음
4태안읍동문2리 마을회관태안읍 경이정6길674-88482006179.34<NA>
5태안읍동문3리 마을회관태안읍 시장5길 18-14675-42251974343<NA>
6태안읍동문4리 마을회관태안읍 재경미길 64<NA>2006144.77회관전화 없음
7태안읍동문5리 마을회관<NA><NA>NaNNaN마을회관 없음
8태안읍동문6리 마을회관<NA><NA>NaNNaN마을회관 없음
9태안읍남문1리 마을회관태안읍 서문5길 21<NA>1997270.9회관전화 없음
태안군 마을회관 현황(2015년)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
180이원면당산1리 마을회관이원면 원이로 1678-1672-83201998151.2<NA>
181이원면당산2리 마을회관이원면 당산2길 143672-4356199899.8<NA>
182이원면당산3리 마을회관이원면 사관로 111-47672-2115200980.52<NA>
183이원면당산4리 마을회관이원면 사관로 576-2672-3417201091.35<NA>
184이원면포지1리 마을회관이원면 분지길 60672-98712012239.79<NA>
185이원면포지2리 마을회관이원면 원이로 1404-7672-31531998166.63<NA>
186이원면포지3리 마을회관이원면 굴항1길 338-2672-70161998179.46<NA>
187이원면사창1리 마을회관이원면 장작길 166-3672-1839199889.1<NA>
188이원면사창2리 마을회관이원면 원이로 1224672-55161995187.26<NA>
189이원면사창3리 마을회관이원면 태포길 79-65672-0485199993.75<NA>

Duplicate rows

Most frequently occurring

태안군 마을회관 현황(2015년)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 6# duplicates
0<NA><NA><NA><NA><NA>2