Overview

Dataset statistics

Number of variables14
Number of observations1070
Missing cells10012
Missing cells (%)66.8%
Duplicate rows15
Duplicate rows (%)1.4%
Total size in memory117.2 KiB
Average record size in memory112.1 B

Variable types

Categorical3
Unsupported9
Text2

Dataset

Description의회홈페이지시스템 DB 내용으로, 회원등급관리, 회원리스트관리, 전체카테고리, 포토갤러리, 자료실, 의사일정목록 등 정보입니다.
Author전라남도 영광군
URLhttps://www.data.go.kr/data/15039835/fileData.do

Alerts

Dataset has 15 (1.4%) duplicate rowsDuplicates
Unnamed: 7 is highly overall correlated with DB명 and 1 other fieldsHigh correlation
Unnamed: 12 is highly overall correlated with DB명 and 1 other fieldsHigh correlation
DB명 is highly overall correlated with Unnamed: 7 and 1 other fieldsHigh correlation
Unnamed: 7 is highly imbalanced (94.6%)Imbalance
Unnamed: 12 is highly imbalanced (94.5%)Imbalance
영문테이블명 has 423 (39.5%) missing valuesMissing
한글테이블명 has 489 (45.7%) missing valuesMissing
Unnamed: 3 has 864 (80.7%) missing valuesMissing
Unnamed: 4 has 944 (88.2%) missing valuesMissing
Unnamed: 5 has 968 (90.5%) missing valuesMissing
Unnamed: 6 has 1054 (98.5%) missing valuesMissing
Unnamed: 8 has 1054 (98.5%) missing valuesMissing
Unnamed: 9 has 1054 (98.5%) missing valuesMissing
Unnamed: 10 has 1054 (98.5%) missing valuesMissing
Unnamed: 11 has 1054 (98.5%) missing valuesMissing
Unnamed: 13 has 1054 (98.5%) missing valuesMissing
영문테이블명 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 6 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 03:35:55.353855
Analysis finished2023-12-12 03:35:56.743764
Duration1.39 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

DB명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
<NA>
536 
ygcouncil
107 
공개 컬럼명
107 
영문명
107 
한글명
107 

Length

Max length9
Median length4
Mean length4.4009346
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowygcouncil
2nd row<NA>
3rd row공개 컬럼명
4th row영문명
5th row한글명

Common Values

ValueCountFrequency (%)
<NA> 536
50.1%
ygcouncil 107
 
10.0%
공개 컬럼명 107
 
10.0%
영문명 107
 
10.0%
한글명 107
 
10.0%
DB명 106
 
9.9%

Length

2023-12-12T12:35:56.852044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:35:57.018160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 536
45.5%
ygcouncil 107
 
9.1%
공개 107
 
9.1%
컬럼명 107
 
9.1%
영문명 107
 
9.1%
한글명 107
 
9.1%
db명 106
 
9.0%

영문테이블명
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing423
Missing (%)39.5%
Memory size8.5 KiB

한글테이블명
Text

MISSING 

Distinct219
Distinct (%)37.7%
Missing489
Missing (%)45.7%
Memory size8.5 KiB
2023-12-12T12:35:57.471363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length6.0826162
Min length1

Characters and Unicode

Total characters3534
Distinct characters125
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique206 ?
Unique (%)35.5%

Sample

1st row회원등급관리
2nd rowgrade_name
3rd row등급
4th row최고관리자
5th row일반회원
ValueCountFrequency (%)
한글테이블명 106
15.3%
카테고리 97
14.0%
board_index 52
 
7.5%
catename 52
 
7.5%
댓글 52
 
7.5%
일반 51
 
7.4%
게시판타입 36
 
5.2%
카테고리이름 16
 
2.3%
게시판인덱스 7
 
1.0%
포토갤러리11 3
 
0.4%
Other values (166) 219
31.7%
2023-12-12T12:35:58.568362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
221
 
6.3%
e 162
 
4.6%
a 161
 
4.6%
160
 
4.5%
158
 
4.5%
122
 
3.5%
122
 
3.5%
115
 
3.3%
110
 
3.1%
d 106
 
3.0%
Other values (115) 2097
59.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2060
58.3%
Lowercase Letter 967
27.4%
Uppercase Letter 236
 
6.7%
Space Separator 110
 
3.1%
Decimal Number 108
 
3.1%
Connector Punctuation 53
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
221
 
10.7%
160
 
7.8%
158
 
7.7%
122
 
5.9%
122
 
5.9%
115
 
5.6%
106
 
5.1%
106
 
5.1%
106
 
5.1%
56
 
2.7%
Other values (72) 788
38.3%
Uppercase Letter
ValueCountFrequency (%)
A 65
27.5%
B 40
16.9%
C 36
15.3%
D 26
 
11.0%
E 16
 
6.8%
F 16
 
6.8%
H 11
 
4.7%
G 6
 
2.5%
I 5
 
2.1%
M 5
 
2.1%
Other values (6) 10
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
e 162
16.8%
a 161
16.6%
d 106
11.0%
n 105
10.9%
m 55
 
5.7%
r 55
 
5.7%
c 54
 
5.6%
i 54
 
5.6%
t 53
 
5.5%
o 53
 
5.5%
Other values (5) 109
11.3%
Decimal Number
ValueCountFrequency (%)
1 46
42.6%
3 10
 
9.3%
2 10
 
9.3%
4 8
 
7.4%
5 8
 
7.4%
6 6
 
5.6%
7 6
 
5.6%
8 6
 
5.6%
0 4
 
3.7%
9 4
 
3.7%
Space Separator
ValueCountFrequency (%)
110
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 53
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2060
58.3%
Latin 1203
34.0%
Common 271
 
7.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
221
 
10.7%
160
 
7.8%
158
 
7.7%
122
 
5.9%
122
 
5.9%
115
 
5.6%
106
 
5.1%
106
 
5.1%
106
 
5.1%
56
 
2.7%
Other values (72) 788
38.3%
Latin
ValueCountFrequency (%)
e 162
13.5%
a 161
13.4%
d 106
 
8.8%
n 105
 
8.7%
A 65
 
5.4%
m 55
 
4.6%
r 55
 
4.6%
c 54
 
4.5%
i 54
 
4.5%
t 53
 
4.4%
Other values (21) 333
27.7%
Common
ValueCountFrequency (%)
110
40.6%
_ 53
19.6%
1 46
17.0%
3 10
 
3.7%
2 10
 
3.7%
4 8
 
3.0%
5 8
 
3.0%
6 6
 
2.2%
7 6
 
2.2%
8 6
 
2.2%
Other values (2) 8
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2060
58.3%
ASCII 1474
41.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
221
 
10.7%
160
 
7.8%
158
 
7.7%
122
 
5.9%
122
 
5.9%
115
 
5.6%
106
 
5.1%
106
 
5.1%
106
 
5.1%
56
 
2.7%
Other values (72) 788
38.3%
ASCII
ValueCountFrequency (%)
e 162
 
11.0%
a 161
 
10.9%
110
 
7.5%
d 106
 
7.2%
n 105
 
7.1%
A 65
 
4.4%
m 55
 
3.7%
r 55
 
3.7%
c 54
 
3.7%
i 54
 
3.7%
Other values (33) 547
37.1%

Unnamed: 3
Text

MISSING 

Distinct87
Distinct (%)42.2%
Missing864
Missing (%)80.7%
Memory size8.5 KiB
2023-12-12T12:35:58.896241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length5.8058252
Min length2

Characters and Unicode

Total characters1196
Distinct characters166
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)35.0%

Sample

1st rowsmsser
2nd rowsmsser
3rd rowcatename
4th row카테고리이름
5th row의회안내
ValueCountFrequency (%)
작성자구분코드 52
24.6%
writer 52
24.6%
의사일정 3
 
1.4%
구성과조직 3
 
1.4%
구성및소개 3
 
1.4%
사무과안내 3
 
1.4%
활동게시판 3
 
1.4%
의원소개 2
 
0.9%
의회연혁 2
 
0.9%
찾아오시는길 2
 
0.9%
Other values (80) 86
40.8%
2023-12-12T12:35:59.321526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
r 106
 
8.9%
62
 
5.2%
58
 
4.8%
57
 
4.8%
e 56
 
4.7%
t 53
 
4.4%
53
 
4.4%
i 52
 
4.3%
52
 
4.3%
52
 
4.3%
Other values (156) 595
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 838
70.1%
Lowercase Letter 341
28.5%
Decimal Number 8
 
0.7%
Space Separator 5
 
0.4%
Other Punctuation 4
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
7.4%
58
 
6.9%
57
 
6.8%
53
 
6.3%
52
 
6.2%
52
 
6.2%
52
 
6.2%
52
 
6.2%
25
 
3.0%
21
 
2.5%
Other values (136) 354
42.2%
Lowercase Letter
ValueCountFrequency (%)
r 106
31.1%
e 56
16.4%
t 53
15.5%
i 52
15.2%
w 52
15.2%
a 11
 
3.2%
s 6
 
1.8%
m 3
 
0.9%
n 1
 
0.3%
c 1
 
0.3%
Decimal Number
ValueCountFrequency (%)
5 2
25.0%
7 1
12.5%
4 1
12.5%
2 1
12.5%
3 1
12.5%
1 1
12.5%
6 1
12.5%
Other Punctuation
ValueCountFrequency (%)
/ 3
75.0%
· 1
 
25.0%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 838
70.1%
Latin 341
28.5%
Common 17
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
7.4%
58
 
6.9%
57
 
6.8%
53
 
6.3%
52
 
6.2%
52
 
6.2%
52
 
6.2%
52
 
6.2%
25
 
3.0%
21
 
2.5%
Other values (136) 354
42.2%
Latin
ValueCountFrequency (%)
r 106
31.1%
e 56
16.4%
t 53
15.5%
i 52
15.2%
w 52
15.2%
a 11
 
3.2%
s 6
 
1.8%
m 3
 
0.9%
n 1
 
0.3%
c 1
 
0.3%
Common
ValueCountFrequency (%)
5
29.4%
/ 3
17.6%
5 2
 
11.8%
7 1
 
5.9%
· 1
 
5.9%
4 1
 
5.9%
2 1
 
5.9%
3 1
 
5.9%
1 1
 
5.9%
6 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 836
69.9%
ASCII 357
29.8%
Compat Jamo 2
 
0.2%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r 106
29.7%
e 56
15.7%
t 53
14.8%
i 52
14.6%
w 52
14.6%
a 11
 
3.1%
s 6
 
1.7%
5
 
1.4%
/ 3
 
0.8%
m 3
 
0.8%
Other values (9) 10
 
2.8%
Hangul
ValueCountFrequency (%)
62
 
7.4%
58
 
6.9%
57
 
6.8%
53
 
6.3%
52
 
6.2%
52
 
6.2%
52
 
6.2%
52
 
6.2%
25
 
3.0%
21
 
2.5%
Other values (135) 352
42.1%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
· 1
100.0%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing944
Missing (%)88.2%
Memory size8.5 KiB

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing968
Missing (%)90.5%
Memory size8.5 KiB

Unnamed: 6
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1054
Missing (%)98.5%
Memory size8.5 KiB

Unnamed: 7
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
<NA>
1054 
210.99.208.106
 
9
59.150.136.98
 
3
lasttip
 
2
211.253.124.56
 
1

Length

Max length14
Median length4
Mean length4.1336449
Min length4

Unique

Unique2 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1054
98.5%
210.99.208.106 9
 
0.8%
59.150.136.98 3
 
0.3%
lasttip 2
 
0.2%
211.253.124.56 1
 
0.1%
211.253.124.55 1
 
0.1%

Length

2023-12-12T12:35:59.466971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:35:59.594001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 1054
98.5%
210.99.208.106 9
 
0.8%
59.150.136.98 3
 
0.3%
lasttip 2
 
0.2%
211.253.124.56 1
 
0.1%
211.253.124.55 1
 
0.1%

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1054
Missing (%)98.5%
Memory size8.5 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1054
Missing (%)98.5%
Memory size8.5 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1054
Missing (%)98.5%
Memory size8.5 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1054
Missing (%)98.5%
Memory size8.5 KiB

Unnamed: 12
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
<NA>
1054 
5
 
13
memgrade
 
1
회원등급
 
1
1
 
1

Length

Max length8
Median length4
Mean length3.964486
Min length1

Unique

Unique3 ?
Unique (%)0.3%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 1054
98.5%
5 13
 
1.2%
memgrade 1
 
0.1%
회원등급 1
 
0.1%
1 1
 
0.1%

Length

2023-12-12T12:35:59.715158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:35:59.825465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 1054
98.5%
5 13
 
1.2%
memgrade 1
 
0.1%
회원등급 1
 
0.1%
1 1
 
0.1%

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing1054
Missing (%)98.5%
Memory size8.5 KiB

Correlations

2023-12-12T12:35:59.911311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
DB명Unnamed: 3Unnamed: 7Unnamed: 12
DB명1.0000.898NaN0.000
Unnamed: 30.8981.000NaNNaN
Unnamed: 7NaNNaN1.0000.789
Unnamed: 120.000NaN0.7891.000
2023-12-12T12:36:00.049973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 7Unnamed: 12DB명
Unnamed: 71.0000.7071.000
Unnamed: 120.7071.0001.000
DB명1.0001.0001.000
2023-12-12T12:36:00.153735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
DB명Unnamed: 7Unnamed: 12
DB명1.0001.0001.000
Unnamed: 71.0001.0000.707
Unnamed: 121.0000.7071.000

Missing values

2023-12-12T12:35:55.895819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:35:56.235699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T12:35:56.508858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

DB명영문테이블명한글테이블명Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13
0ygcouncilsite_member_grade회원등급관리<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1<NA>NaN<NA><NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
2공개 컬럼명NaN<NA><NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
3영문명index_nograde_name<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
4한글명인덱스번호등급<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
5<NA>1최고관리자<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
6<NA>5일반회원<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
7<NA>9의원<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
8<NA>NaN<NA><NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
9<NA>NaN<NA><NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
DB명영문테이블명한글테이블명Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13
1060<NA>데이터 없음<NA><NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1061<NA>NaN<NA><NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1062<NA>NaN<NA><NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1063DB명영문테이블명한글테이블명<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1064ygcouncilsite_board_15_cate의장동정 카테고리<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1065<NA>NaN<NA><NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1066공개 컬럼명NaN<NA><NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1067영문명index_nocatename<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1068한글명인덱스번호카테고리이름<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN
1069<NA>1일반<NA>NaNNaNNaN<NA>NaNNaNNaNNaN<NA>NaN

Duplicate rows

Most frequently occurring

DB명한글테이블명Unnamed: 3Unnamed: 7Unnamed: 12# duplicates
14<NA><NA><NA><NA><NA>368
4공개 컬럼명<NA><NA><NA><NA>107
0DB명한글테이블명<NA><NA><NA>106
5영문명board_indexwriter<NA><NA>52
6영문명catename<NA><NA><NA>52
11<NA>일반<NA><NA><NA>51
9한글명카테고리작성자구분코드<NA><NA>45
8한글명게시판타입<NA><NA><NA>36
10한글명카테고리이름<NA><NA><NA>16
12<NA><NA><NA>210.99.208.10659