Overview

Dataset statistics

Number of variables9
Number of observations135
Missing cells399
Missing cells (%)32.8%
Duplicate rows1
Duplicate rows (%)0.7%
Total size in memory9.6 KiB
Average record size in memory73.0 B

Variable types

Categorical3
Text2
Unsupported4

Dataset

Description계룡시의회 지방의회 의안정보 목록으로 의안번호,회기등 정보가 포함되어 있습니다.(의안번호, 제안일자, 제안자 등)
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=413&beforeMenuCd=DOM_000000201001001000&publicdatapk=15038421

Alerts

Dataset has 1 (0.7%) duplicate rowsDuplicates
2003년도 계룡시의회 의안 현황 is highly overall correlated with Unnamed: 6High correlation
Unnamed: 6 is highly overall correlated with 2003년도 계룡시의회 의안 현황 and 1 other fieldsHigh correlation
Unnamed: 8 is highly overall correlated with Unnamed: 6High correlation
Unnamed: 6 is highly imbalanced (64.0%)Imbalance
Unnamed: 8 is highly imbalanced (61.7%)Imbalance
Unnamed: 1 has 119 (88.1%) missing valuesMissing
Unnamed: 2 has 125 (92.6%) missing valuesMissing
Unnamed: 3 has 16 (11.9%) missing valuesMissing
Unnamed: 4 has 15 (11.1%) missing valuesMissing
Unnamed: 5 has 16 (11.9%) missing valuesMissing
Unnamed: 7 has 108 (80.0%) missing valuesMissing
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-01-09 22:28:09.993526
Analysis finished2024-01-09 22:28:10.583266
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

2003년도 계룡시의회 의안 현황
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
제2회
79 
제3회
39 
<NA>
16 
회기
 
1

Length

Max length4
Median length3
Mean length3.4
Min length2

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row<NA>
2nd row회기
3rd row제2회
4th row제2회
5th row제2회

Common Values

ValueCountFrequency (%)
제2회 79
58.5%
제3회 39
28.9%
<NA> 16
 
11.9%
회기 1
 
0.7%

Length

2024-01-10T07:28:10.642779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:28:10.749981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
제2회 79
58.5%
제3회 39
28.9%
na 16
 
11.9%
회기 1
 
0.7%

Unnamed: 1
Text

MISSING 

Distinct16
Distinct (%)100.0%
Missing119
Missing (%)88.1%
Memory size1.2 KiB
2024-01-10T07:28:10.876460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length5.3125
Min length2

Characters and Unicode

Total characters85
Distinct characters42
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)100.0%

Sample

1st row기간
2nd row03.11.12~03.11.26
3rd row2003.12.1~2003.12.20
4th row구분(계)
5th row조례 제정
ValueCountFrequency (%)
조례 2
 
10.0%
기간 1
 
5.0%
폐지 1
 
5.0%
실시 1
 
5.0%
결의문 1
 
5.0%
구성 1
 
5.0%
집회요구 1
 
5.0%
선출 1
 
5.0%
예산 1
 
5.0%
의견제시 1
 
5.0%
Other values (9) 9
45.0%
2024-01-10T07:28:11.138768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 8
 
9.4%
1 8
 
9.4%
0 7
 
8.2%
2 7
 
8.2%
5
 
5.9%
3 4
 
4.7%
3
 
3.5%
3
 
3.5%
2
 
2.4%
2
 
2.4%
Other values (32) 36
42.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41
48.2%
Decimal Number 27
31.8%
Other Punctuation 8
 
9.4%
Space Separator 5
 
5.9%
Math Symbol 2
 
2.4%
Close Punctuation 1
 
1.2%
Open Punctuation 1
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3
 
7.3%
3
 
7.3%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (22) 22
53.7%
Decimal Number
ValueCountFrequency (%)
1 8
29.6%
0 7
25.9%
2 7
25.9%
3 4
14.8%
6 1
 
3.7%
Other Punctuation
ValueCountFrequency (%)
. 8
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 44
51.8%
Hangul 41
48.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3
 
7.3%
3
 
7.3%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (22) 22
53.7%
Common
ValueCountFrequency (%)
. 8
18.2%
1 8
18.2%
0 7
15.9%
2 7
15.9%
5
11.4%
3 4
9.1%
~ 2
 
4.5%
) 1
 
2.3%
( 1
 
2.3%
6 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 44
51.8%
Hangul 41
48.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 8
18.2%
1 8
18.2%
0 7
15.9%
2 7
15.9%
5
11.4%
3 4
9.1%
~ 2
 
4.5%
) 1
 
2.3%
( 1
 
2.3%
6 1
 
2.3%
Hangul
ValueCountFrequency (%)
3
 
7.3%
3
 
7.3%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
2
 
4.9%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (22) 22
53.7%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing125
Missing (%)92.6%
Memory size1.2 KiB

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing16
Missing (%)11.9%
Memory size1.2 KiB

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing15
Missing (%)11.1%
Memory size1.2 KiB

Unnamed: 5
Text

MISSING 

Distinct118
Distinct (%)99.2%
Missing16
Missing (%)11.9%
Memory size1.2 KiB
2024-01-10T07:28:11.366181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length32
Mean length20.226891
Min length4

Characters and Unicode

Total characters2407
Distinct characters205
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique117 ?
Unique (%)98.3%

Sample

1st row안건명
2nd row2003년 계룡시 일반회계 및 특별회계 세입세출예산안
3rd row계룡시의회 공인조례
4th row계룡시의회 사무기구 설치 및 직원정수 조례
5th row계룡시 조례.규칙 등 공포에 관한 조례
ValueCountFrequency (%)
계룡시 91
 
16.0%
조례안 70
 
12.3%
관한 31
 
5.4%
28
 
4.9%
설치 16
 
2.8%
계룡시의회 16
 
2.8%
운영에 8
 
1.4%
관리 7
 
1.2%
규칙안 5
 
0.9%
지급 5
 
0.9%
Other values (244) 292
51.3%
2024-01-10T07:28:11.702981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
459
19.1%
128
 
5.3%
124
 
5.2%
110
 
4.6%
108
 
4.5%
103
 
4.3%
97
 
4.0%
63
 
2.6%
58
 
2.4%
49
 
2.0%
Other values (195) 1108
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1924
79.9%
Space Separator 459
 
19.1%
Decimal Number 18
 
0.7%
Other Punctuation 6
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
128
 
6.7%
124
 
6.4%
110
 
5.7%
108
 
5.6%
103
 
5.4%
97
 
5.0%
63
 
3.3%
58
 
3.0%
49
 
2.5%
39
 
2.0%
Other values (187) 1045
54.3%
Decimal Number
ValueCountFrequency (%)
0 8
44.4%
2 5
27.8%
4 2
 
11.1%
3 2
 
11.1%
1 1
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 4
66.7%
, 2
33.3%
Space Separator
ValueCountFrequency (%)
459
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1924
79.9%
Common 483
 
20.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
128
 
6.7%
124
 
6.4%
110
 
5.7%
108
 
5.6%
103
 
5.4%
97
 
5.0%
63
 
3.3%
58
 
3.0%
49
 
2.5%
39
 
2.0%
Other values (187) 1045
54.3%
Common
ValueCountFrequency (%)
459
95.0%
0 8
 
1.7%
2 5
 
1.0%
. 4
 
0.8%
4 2
 
0.4%
, 2
 
0.4%
3 2
 
0.4%
1 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1924
79.9%
ASCII 483
 
20.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
459
95.0%
0 8
 
1.7%
2 5
 
1.0%
. 4
 
0.8%
4 2
 
0.4%
, 2
 
0.4%
3 2
 
0.4%
1 1
 
0.2%
Hangul
ValueCountFrequency (%)
128
 
6.7%
124
 
6.4%
110
 
5.7%
108
 
5.6%
103
 
5.4%
97
 
5.0%
63
 
3.3%
58
 
3.0%
49
 
2.5%
39
 
2.0%
Other values (187) 1045
54.3%

Unnamed: 6
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct11
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
제정
107 
<NA>
14 
구성
 
4
예산
 
3
구분
 
1
Other values (6)
 
6

Length

Max length12
Median length2
Mean length2.3333333
Min length2

Unique

Unique7 ?
Unique (%)5.2%

Sample

1st row<NA>
2nd row구분
3rd row예산
4th row제정
5th row제정

Common Values

ValueCountFrequency (%)
제정 107
79.3%
<NA> 14
 
10.4%
구성 4
 
3.0%
예산 3
 
2.2%
구분 1
 
0.7%
집회요구 1
 
0.7%
결의문 1
 
0.7%
승인 1
 
0.7%
※ 조례 제개정 총 수 1
 
0.7%
계룡시장 1
 
0.7%

Length

2024-01-10T07:28:11.823196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제정 107
77.0%
na 14
 
10.1%
구성 4
 
2.9%
예산 3
 
2.2%
구분 1
 
0.7%
집회요구 1
 
0.7%
결의문 1
 
0.7%
승인 1
 
0.7%
1
 
0.7%
조례 1
 
0.7%
Other values (5) 5
 
3.6%

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing108
Missing (%)80.0%
Memory size1.2 KiB

Unnamed: 8
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
원안가결
117 
<NA>
17 
수정가결
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row<NA>
2nd row<NA>
3rd row원안가결
4th row원안가결
5th row원안가결

Common Values

ValueCountFrequency (%)
원안가결 117
86.7%
<NA> 17
 
12.6%
수정가결 1
 
0.7%

Length

2024-01-10T07:28:11.918676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:28:12.000313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
원안가결 117
86.7%
na 17
 
12.6%
수정가결 1
 
0.7%

Correlations

2024-01-10T07:28:12.057608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2003년도 계룡시의회 의안 현황Unnamed: 1Unnamed: 6Unnamed: 8
2003년도 계룡시의회 의안 현황1.0001.0000.7710.000
Unnamed: 11.0001.0001.0000.000
Unnamed: 60.7711.0001.0000.736
Unnamed: 80.0000.0000.7361.000
2024-01-10T07:28:12.146599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2003년도 계룡시의회 의안 현황Unnamed: 8Unnamed: 6
2003년도 계룡시의회 의안 현황1.0000.0000.698
Unnamed: 80.0001.0000.536
Unnamed: 60.6980.5361.000
2024-01-10T07:28:12.219568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2003년도 계룡시의회 의안 현황Unnamed: 6Unnamed: 8
2003년도 계룡시의회 의안 현황1.0000.6980.000
Unnamed: 60.6981.0000.536
Unnamed: 80.0000.5361.000

Missing values

2024-01-10T07:28:10.278480image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:28:10.388110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T07:28:10.497550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

2003년도 계룡시의회 의안 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8
0<NA><NA>NaNNaNNaN<NA><NA>NaN<NA>
1회기기간일수의안번호발의자안건명구분비고<NA>
2제2회03.11.12~03.11.26151계룡시장2003년 계룡시 일반회계 및 특별회계 세입세출예산안예산NaN원안가결
3제2회<NA>NaN2계룡시장계룡시의회 공인조례제정NaN원안가결
4제2회<NA>NaN3계룡시장계룡시의회 사무기구 설치 및 직원정수 조례제정NaN원안가결
5제2회<NA>NaN4계룡시장계룡시 조례.규칙 등 공포에 관한 조례제정NaN원안가결
6제2회<NA>NaN5계룡시장계룡시 행정기구 설치조례제정NaN원안가결
7제2회<NA>NaN6계룡시장계룡시 지방공무원 정원조례제정NaN원안가결
8제2회<NA>NaN7계룡시장계룡시 공인조례제정NaN원안가결
9제2회<NA>NaN8계룡시장계룡시 행정동리의 명칭 관할 구역 및 동리장 정수에 관한 조례제정NaN원안가결
2003년도 계룡시의회 의안 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8
125<NA>동의 및 승인1NaNNaN<NA>계룡시장90<NA>
126<NA>폐지NaNNaNNaN<NA>의원발의17<NA>
127<NA>의견제시NaNNaNNaN<NA><NA>NaN<NA>
128<NA>예산3NaNNaN<NA><NA>NaN<NA>
129<NA>선출NaNNaNNaN<NA><NA>NaN<NA>
130<NA>집회요구NaNNaNNaN<NA><NA>NaN<NA>
131<NA>구성4NaNNaN<NA><NA>NaN<NA>
132<NA>결의문1NaNNaN<NA><NA>NaN<NA>
133<NA>실시1NaNNaN<NA><NA>NaN<NA>
134<NA>보고NaNNaNNaN<NA><NA>NaN<NA>

Duplicate rows

Most frequently occurring

2003년도 계룡시의회 의안 현황Unnamed: 1Unnamed: 5Unnamed: 6Unnamed: 8# duplicates
0<NA><NA><NA><NA><NA>3