Overview

Dataset statistics

Number of variables6
Number of observations1718
Missing cells95
Missing cells (%)0.9%
Duplicate rows5
Duplicate rows (%)0.3%
Total size in memory84.0 KiB
Average record size in memory50.1 B

Variable types

Numeric2
Text2
DateTime1
Categorical1

Dataset

Description대구광역시_수성구_의안정보_20190814
Author대구광역시 수성구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15037972&dataSetDetailId=150379722bc4256f00ac7&provdMethod=FILE

Alerts

Dataset has 5 (0.3%) duplicate rowsDuplicates
의안번호 is highly overall correlated with 회수High correlation
회수 is highly overall correlated with 의안번호High correlation
결과 is highly imbalanced (52.1%)Imbalance
처리일 has 90 (5.2%) missing valuesMissing
회수 has 415 (24.2%) zerosZeros

Reproduction

Analysis started2024-04-19 05:47:11.579176
Analysis finished2024-04-19 05:47:12.686389
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

의안번호
Real number (ℝ)

HIGH CORRELATION 

Distinct1694
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean850.21886
Minimum1
Maximum1715
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size15.2 KiB
2024-04-19T14:47:12.773886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile86.85
Q1423.25
median846
Q31272.75
95-th percentile1628.15
Maximum1715
Range1714
Interquartile range (IQR)849.5

Descriptive statistics

Standard deviation492.87487
Coefficient of variation (CV)0.57970353
Kurtosis-1.1900111
Mean850.21886
Median Absolute Deviation (MAD)425
Skewness0.025269429
Sum1460676
Variance242925.64
MonotonicityNot monotonic
2024-04-19T14:47:12.921688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
355 2
 
0.1%
274 2
 
0.1%
468 2
 
0.1%
992 2
 
0.1%
591 2
 
0.1%
294 2
 
0.1%
573 2
 
0.1%
646 2
 
0.1%
272 2
 
0.1%
282 2
 
0.1%
Other values (1684) 1698
98.8%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1715 1
0.1%
1714 1
0.1%
1713 1
0.1%
1712 1
0.1%
1711 1
0.1%
1710 1
0.1%
1709 1
0.1%
1708 1
0.1%
1707 1
0.1%
1706 1
0.1%
Distinct1382
Distinct (%)80.4%
Missing0
Missing (%)0.0%
Memory size13.6 KiB
2024-04-19T14:47:13.196083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length46
Mean length27.336438
Min length5

Characters and Unicode

Total characters46964
Distinct characters394
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1231 ?
Unique (%)71.7%

Sample

1st row대구광역시수성구행정기구설치조례개정조례
2nd row대구광역시수성구지방공무원정원조례중개정조례안
3rd row대구광역시수성구공중화장실설치및관리조례중개정조례안
4th row대구광역시수성구오수.분뇨및축산폐수의처리에관한조례중개정조례안
5th row1997년도세입세출결산승인안
ValueCountFrequency (%)
대구광역시 674
 
9.3%
수성구 654
 
9.0%
조례 401
 
5.5%
일부개정조례안 389
 
5.4%
258
 
3.6%
관한 257
 
3.6%
조례안 186
 
2.6%
설치 106
 
1.5%
운영 97
 
1.3%
66
 
0.9%
Other values (1685) 4144
57.3%
2024-04-19T14:47:13.728530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5528
 
11.8%
2645
 
5.6%
2029
 
4.3%
1950
 
4.2%
1542
 
3.3%
1502
 
3.2%
1373
 
2.9%
1322
 
2.8%
1294
 
2.8%
1278
 
2.7%
Other values (384) 26501
56.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39359
83.8%
Space Separator 5528
 
11.8%
Decimal Number 1548
 
3.3%
Open Punctuation 182
 
0.4%
Close Punctuation 182
 
0.4%
Other Punctuation 144
 
0.3%
Uppercase Letter 11
 
< 0.1%
Dash Punctuation 6
 
< 0.1%
Modifier Symbol 3
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2645
 
6.7%
2029
 
5.2%
1950
 
5.0%
1542
 
3.9%
1502
 
3.8%
1373
 
3.5%
1322
 
3.4%
1294
 
3.3%
1278
 
3.2%
1122
 
2.9%
Other values (351) 23302
59.2%
Decimal Number
ValueCountFrequency (%)
0 412
26.6%
2 380
24.5%
1 259
16.7%
9 156
 
10.1%
3 77
 
5.0%
6 58
 
3.7%
7 56
 
3.6%
4 55
 
3.6%
5 48
 
3.1%
8 47
 
3.0%
Other Punctuation
ValueCountFrequency (%)
. 82
56.9%
· 30
 
20.8%
, 21
 
14.6%
; 3
 
2.1%
& 2
 
1.4%
' 2
 
1.4%
# 2
 
1.4%
: 1
 
0.7%
* 1
 
0.7%
Uppercase Letter
ValueCountFrequency (%)
A 3
27.3%
I 3
27.3%
P 2
18.2%
U 1
 
9.1%
H 1
 
9.1%
T 1
 
9.1%
Open Punctuation
ValueCountFrequency (%)
( 181
99.5%
1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 181
99.5%
1
 
0.5%
Space Separator
ValueCountFrequency (%)
5528
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 3
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39359
83.8%
Common 7594
 
16.2%
Latin 11
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2645
 
6.7%
2029
 
5.2%
1950
 
5.0%
1542
 
3.9%
1502
 
3.8%
1373
 
3.5%
1322
 
3.4%
1294
 
3.3%
1278
 
3.2%
1122
 
2.9%
Other values (351) 23302
59.2%
Common
ValueCountFrequency (%)
5528
72.8%
0 412
 
5.4%
2 380
 
5.0%
1 259
 
3.4%
( 181
 
2.4%
) 181
 
2.4%
9 156
 
2.1%
. 82
 
1.1%
3 77
 
1.0%
6 58
 
0.8%
Other values (17) 280
 
3.7%
Latin
ValueCountFrequency (%)
A 3
27.3%
I 3
27.3%
P 2
18.2%
U 1
 
9.1%
H 1
 
9.1%
T 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39359
83.8%
ASCII 7573
 
16.1%
None 32
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5528
73.0%
0 412
 
5.4%
2 380
 
5.0%
1 259
 
3.4%
( 181
 
2.4%
) 181
 
2.4%
9 156
 
2.1%
. 82
 
1.1%
3 77
 
1.0%
6 58
 
0.8%
Other values (20) 259
 
3.4%
Hangul
ValueCountFrequency (%)
2645
 
6.7%
2029
 
5.2%
1950
 
5.0%
1542
 
3.9%
1502
 
3.8%
1373
 
3.5%
1322
 
3.4%
1294
 
3.3%
1278
 
3.2%
1122
 
2.9%
Other values (351) 23302
59.2%
None
ValueCountFrequency (%)
· 30
93.8%
1
 
3.1%
1
 
3.1%

회수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct166
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.09895
Minimum0
Maximum2016
Zeros415
Zeros (%)24.2%
Negative0
Negative (%)0.0%
Memory size15.2 KiB
2024-04-19T14:47:13.863051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q17
median152
Q3202
95-th percentile225
Maximum2016
Range2016
Interquartile range (IQR)195

Descriptive statistics

Standard deviation103.02193
Coefficient of variation (CV)0.81699272
Kurtosis89.191206
Mean126.09895
Median Absolute Deviation (MAD)59
Skewness5.0470834
Sum216638
Variance10613.517
MonotonicityNot monotonic
2024-04-19T14:47:14.006764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 415
 
24.2%
227 45
 
2.6%
213 33
 
1.9%
212 30
 
1.7%
221 25
 
1.5%
206 21
 
1.2%
7 21
 
1.2%
205 21
 
1.2%
160 20
 
1.2%
225 20
 
1.2%
Other values (156) 1067
62.1%
ValueCountFrequency (%)
0 415
24.2%
2 4
 
0.2%
3 3
 
0.2%
4 4
 
0.2%
5 1
 
0.1%
6 2
 
0.1%
7 21
 
1.2%
8 11
 
0.6%
9 6
 
0.3%
10 5
 
0.3%
ValueCountFrequency (%)
2016 1
 
0.1%
1611 1
 
0.1%
228 19
1.1%
227 45
2.6%
226 19
1.1%
225 20
1.2%
224 2
 
0.1%
223 17
 
1.0%
222 7
 
0.4%
221 25
1.5%
Distinct355
Distinct (%)20.7%
Missing5
Missing (%)0.3%
Memory size13.6 KiB
2024-04-19T14:47:14.246398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length20
Mean length5.7203736
Min length2

Characters and Unicode

Total characters9799
Distinct characters149
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique281 ?
Unique (%)16.4%

Sample

1st row수성구청장
2nd row수성구청장
3rd row수성구청장
4th row수성구청장
5th row수성구청장
ValueCountFrequency (%)
수성구청장 779
31.4%
구청장 400
16.1%
242
 
9.7%
의원 182
 
7.3%
의장 32
 
1.3%
조례정비특별위원장 31
 
1.2%
7인 31
 
1.2%
5인 28
 
1.1%
5 25
 
1.0%
6인 24
 
1.0%
Other values (252) 709
28.6%
2024-04-19T14:47:14.587956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1266
12.9%
1193
12.2%
1179
12.0%
801
 
8.2%
790
 
8.1%
770
 
7.9%
503
 
5.1%
465
 
4.7%
400
 
4.1%
215
 
2.2%
Other values (139) 2217
22.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8378
85.5%
Space Separator 770
 
7.9%
Decimal Number 611
 
6.2%
Other Punctuation 31
 
0.3%
Close Punctuation 4
 
< 0.1%
Open Punctuation 4
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1266
15.1%
1193
14.2%
1179
14.1%
801
9.6%
790
9.4%
503
 
6.0%
465
 
5.6%
400
 
4.8%
215
 
2.6%
115
 
1.4%
Other values (122) 1451
17.3%
Decimal Number
ValueCountFrequency (%)
1 147
24.1%
5 97
15.9%
6 89
14.6%
7 76
12.4%
0 48
 
7.9%
9 43
 
7.0%
2 38
 
6.2%
8 32
 
5.2%
4 31
 
5.1%
3 10
 
1.6%
Other Punctuation
ValueCountFrequency (%)
. 24
77.4%
, 6
 
19.4%
/ 1
 
3.2%
Space Separator
ValueCountFrequency (%)
770
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8378
85.5%
Common 1421
 
14.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1266
15.1%
1193
14.2%
1179
14.1%
801
9.6%
790
9.4%
503
 
6.0%
465
 
5.6%
400
 
4.8%
215
 
2.6%
115
 
1.4%
Other values (122) 1451
17.3%
Common
ValueCountFrequency (%)
770
54.2%
1 147
 
10.3%
5 97
 
6.8%
6 89
 
6.3%
7 76
 
5.3%
0 48
 
3.4%
9 43
 
3.0%
2 38
 
2.7%
8 32
 
2.3%
4 31
 
2.2%
Other values (7) 50
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8378
85.5%
ASCII 1421
 
14.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1266
15.1%
1193
14.2%
1179
14.1%
801
9.6%
790
9.4%
503
 
6.0%
465
 
5.6%
400
 
4.8%
215
 
2.6%
115
 
1.4%
Other values (122) 1451
17.3%
ASCII
ValueCountFrequency (%)
770
54.2%
1 147
 
10.3%
5 97
 
6.8%
6 89
 
6.3%
7 76
 
5.3%
0 48
 
3.4%
9 43
 
3.0%
2 38
 
2.7%
8 32
 
2.3%
4 31
 
2.2%
Other values (7) 50
 
3.5%

처리일
Date

MISSING 

Distinct337
Distinct (%)20.7%
Missing90
Missing (%)5.2%
Memory size13.6 KiB
Minimum1991-02-27 00:00:00
Maximum2019-03-18 00:00:00
2024-04-19T14:47:14.709928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:47:14.892895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

결과
Categorical

IMBALANCE 

Distinct14
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size13.6 KiB
처리-원안가결
1092 
처리-수정가결
265 
처리-원안의결
160 
접수
 
76
찬성의견
 
35
Other values (9)
 
90

Length

Max length8
Median length7
Mean length6.5989523
Min length2

Unique

Unique3 ?
Unique (%)0.2%

Sample

1st row처리-원안가결
2nd row처리-원안가결
3rd row처리-원안가결
4th row처리-원안가결
5th row처리-원안가결

Common Values

ValueCountFrequency (%)
처리-원안가결 1092
63.6%
처리-수정가결 265
 
15.4%
처리-원안의결 160
 
9.3%
접수 76
 
4.4%
찬성의견 35
 
2.0%
처리-수정의결 21
 
1.2%
원안채택 21
 
1.2%
처리-부결 18
 
1.0%
보류 11
 
0.6%
<NA> 10
 
0.6%
Other values (4) 9
 
0.5%

Length

2024-04-19T14:47:15.024775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
처리-원안가결 1092
63.5%
처리-수정가결 265
 
15.4%
처리-원안의결 160
 
9.3%
접수 76
 
4.4%
찬성의견 35
 
2.0%
처리-수정의결 21
 
1.2%
원안채택 21
 
1.2%
처리-부결 18
 
1.0%
보류 11
 
0.6%
na 10
 
0.6%
Other values (5) 10
 
0.6%

Interactions

2024-04-19T14:47:12.146819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:47:11.956812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:47:12.265856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-19T14:47:12.049361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-19T14:47:15.104623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의안번호회수결과
의안번호1.0000.7180.578
회수0.7181.0000.231
결과0.5780.2311.000
2024-04-19T14:47:15.186424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
의안번호회수결과
의안번호1.0000.8660.283
회수0.8661.0000.136
결과0.2830.1361.000

Missing values

2024-04-19T14:47:12.417416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-19T14:47:12.538971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-19T14:47:12.632073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

의안번호의안명회수발의자처리일결과
0355대구광역시수성구행정기구설치조례개정조례63수성구청장1998-08-29처리-원안가결
1356대구광역시수성구지방공무원정원조례중개정조례안63수성구청장1998-08-29처리-원안가결
2357대구광역시수성구공중화장실설치및관리조례중개정조례안63수성구청장1998-08-29처리-원안가결
3358대구광역시수성구오수.분뇨및축산폐수의처리에관한조례중개정조례안63수성구청장1998-08-29처리-원안가결
43591997년도세입세출결산승인안64수성구청장1998-09-29처리-원안가결
53601998년도제3회추가경정예산안64수성구청장1998-09-29처리-원안가결
6406대구광역시수성구지방공무원정원조례중개정조례안0구청장1999-10-20처리-원안가결
7362대구광역시수성구별정직공무원의임용등에관한조례중개정조례안67수성구청장1998-12-05처리-원안가결
8363대구광역시수성구지방고용직공무원임용등에관한조례중개정조례안67수성구청장1998-12-05처리-원안가결
9425대구광역시수성구주민의감사청구에관한조례안0구청장2000-02-24처리-원안가결
의안번호의안명회수발의자처리일결과
17081706대구광역시 수성구 공동주택 관리비용 지원에 관한 조례 일부개정조례안228이성오의원외 8명2019-03-18처리-원안가결
17091707대구광역시 수성구 도로점용 허가 및 점용료 등 징수조례 일부개정조례안228이성오의원외 8명2019-03-18처리-원안가결
17101708대구광역시 수성구 주민참여형 미래어린이공원·어린이놀이터 조성 및 운영에 관한 조례안228김두현의원외 6명2019-03-18처리-수정가결
17111709대구광역시 수성구 금연환경조성 및 간접흡연 피해방지 조례 일부개정조례안228전영태의원외 8명2019-03-18처리-원안가결
17121710대구광역시 수성구 어린이와 청소년 보호를 위한 금주·금연구역 지정 등에 관한 조례 일부개정조례안228전영태의원외 8명2019-03-18처리-수정가결
17131711대구광역시 수성구 출산장려금 지원 등에 관한 조례 일부개정조례안228박정권의원외 8명2019-03-18처리-원안가결
17141712대구광역시 수성구 자살예방 및 생명존중문화 조성을 위한 조례 일부개정조례안228홍경임의원외 6명2019-03-18처리-원안가결
17151713대구광역시 수성구 정신건강복지센터 설치 및 운영에 관한 조례안228조규화의원외 11명2019-03-18처리-원안가결
17161714수성구 미래전략산업 추진 특별위원회 구성안228류지호의원외 8명2019-03-12원안채택
17171715대구광역시 수성구 도시공원 살리기 특별위원회 구성안228박정권의원외 8명2019-03-12원안채택

Duplicate rows

Most frequently occurring

의안번호의안명회수발의자처리일결과# duplicates
034698년도제1회추가경정일반회계및특별회계세입.세출예산안0구청장1998-04-01처리-원안가결2
134798년도제2회일반회계추가경정세입.세출안0구청장1998-04-30처리-원안가결2
2573대구광역시수성구영구임대주택단지내보안등전기요금지원조례안116양문환의원외9인2004-05-31처리-원안가결2
36372005년도제2회추가경정세입.세출예산안127구청장2005-09-13처리-수정가결2
46772006년도제2회추가경정세입.세출예산안133구청장2006-04-21처리-원안가결2