Overview

Dataset statistics

Number of variables13
Number of observations10000
Missing cells60358
Missing cells (%)46.4%
Duplicate rows1537
Duplicate rows (%)15.4%
Total size in memory1.1 MiB
Average record size in memory118.0 B

Variable types

DateTime3
Text1
Categorical3
Unsupported6

Dataset

Description전라남도 신안군 새올 민원의 접수일자, 민원사무명, 민원구분, 처리결과, 처리일자, 처리예정일, 처리부서 정보입니다.
Author전라남도 신안군
URLhttps://www.data.go.kr/data/15091725/fileData.do

Alerts

Dataset has 1537 (15.4%) duplicate rowsDuplicates
처리결과 is highly imbalanced (79.7%)Imbalance
처리일자 has 358 (3.6%) missing valuesMissing
Unnamed: 7 has 10000 (100.0%) missing valuesMissing
Unnamed: 8 has 10000 (100.0%) missing valuesMissing
Unnamed: 9 has 10000 (100.0%) missing valuesMissing
Unnamed: 10 has 10000 (100.0%) missing valuesMissing
Unnamed: 11 has 10000 (100.0%) missing valuesMissing
Unnamed: 12 has 10000 (100.0%) missing valuesMissing
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 00:16:21.744609
Analysis finished2023-12-12 00:16:22.414532
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct748
Distinct (%)7.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2017-12-04 00:00:00
Maximum2021-10-08 00:00:00
2023-12-12T09:16:22.472790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:16:22.592783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct113
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T09:16:22.789026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length50
Mean length23.6515
Min length4

Characters and Unicode

Total characters236515
Distinct characters197
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)0.2%

Sample

1st row개발행위허가(토지형질변경, 토석채취, 공작물설치, 토지분할, 물건적치)
2nd row건축허가-2층이하또는1천제곱미터미만 건축물-기타(건축사 업무대행 건축물)
3rd row유해야생동물 포획허가
4th row연안어업허가
5th row개발행위허가(토지형질변경, 토석채취, 공작물설치, 토지분할, 물건적치)
ValueCountFrequency (%)
토석채취 2965
10.9%
토지분할 2964
10.9%
물건적치 2964
10.9%
개발행위허가(토지형질변경 2964
10.9%
공작물설치 2964
10.9%
3천킬로와트 1686
 
6.2%
옥외광고물등의표시신고 980
 
3.6%
이하 863
 
3.2%
연안어업허가 804
 
3.0%
전기사업허가-시설용량 765
 
2.8%
Other values (185) 7233
26.6%
2023-12-12T09:16:23.132113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17152
 
7.3%
, 12875
 
5.4%
9343
 
4.0%
9020
 
3.8%
8904
 
3.8%
7671
 
3.2%
6221
 
2.6%
6177
 
2.6%
5940
 
2.5%
4796
 
2.0%
Other values (187) 148416
62.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 190005
80.3%
Space Separator 17152
 
7.3%
Other Punctuation 13588
 
5.7%
Close Punctuation 4165
 
1.8%
Open Punctuation 4165
 
1.8%
Dash Punctuation 3767
 
1.6%
Decimal Number 2723
 
1.2%
Connector Punctuation 822
 
0.3%
Lowercase Letter 80
 
< 0.1%
Other Symbol 48
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9343
 
4.9%
9020
 
4.7%
8904
 
4.7%
7671
 
4.0%
6221
 
3.3%
6177
 
3.3%
5940
 
3.1%
4796
 
2.5%
4679
 
2.5%
4670
 
2.5%
Other values (167) 122584
64.5%
Decimal Number
ValueCountFrequency (%)
3 1797
66.0%
1 466
 
17.1%
2 261
 
9.6%
0 110
 
4.0%
5 86
 
3.2%
6 3
 
0.1%
Other Punctuation
ValueCountFrequency (%)
, 12875
94.8%
. 539
 
4.0%
· 172
 
1.3%
/ 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 4052
97.3%
] 113
 
2.7%
Open Punctuation
ValueCountFrequency (%)
( 4052
97.3%
[ 113
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
a 40
50.0%
h 40
50.0%
Space Separator
ValueCountFrequency (%)
17152
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3767
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 822
100.0%
Other Symbol
ValueCountFrequency (%)
48
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 190005
80.3%
Common 46430
 
19.6%
Latin 80
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9343
 
4.9%
9020
 
4.7%
8904
 
4.7%
7671
 
4.0%
6221
 
3.3%
6177
 
3.3%
5940
 
3.1%
4796
 
2.5%
4679
 
2.5%
4670
 
2.5%
Other values (167) 122584
64.5%
Common
ValueCountFrequency (%)
17152
36.9%
, 12875
27.7%
) 4052
 
8.7%
( 4052
 
8.7%
- 3767
 
8.1%
3 1797
 
3.9%
_ 822
 
1.8%
. 539
 
1.2%
1 466
 
1.0%
2 261
 
0.6%
Other values (8) 647
 
1.4%
Latin
ValueCountFrequency (%)
a 40
50.0%
h 40
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 190005
80.3%
ASCII 46290
 
19.6%
None 172
 
0.1%
CJK Compat 48
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17152
37.1%
, 12875
27.8%
) 4052
 
8.8%
( 4052
 
8.8%
- 3767
 
8.1%
3 1797
 
3.9%
_ 822
 
1.8%
. 539
 
1.2%
1 466
 
1.0%
2 261
 
0.6%
Other values (8) 507
 
1.1%
Hangul
ValueCountFrequency (%)
9343
 
4.9%
9020
 
4.7%
8904
 
4.7%
7671
 
4.0%
6221
 
3.3%
6177
 
3.3%
5940
 
3.1%
4796
 
2.5%
4679
 
2.5%
4670
 
2.5%
Other values (167) 122584
64.5%
None
ValueCountFrequency (%)
· 172
100.0%
CJK Compat
ValueCountFrequency (%)
48
100.0%

민원구분
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
단순민원
5482 
복합민원
4518 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row복합민원
2nd row복합민원
3rd row단순민원
4th row단순민원
5th row복합민원

Common Values

ValueCountFrequency (%)
단순민원 5482
54.8%
복합민원 4518
45.2%

Length

2023-12-12T09:16:23.254993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:16:23.336291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
단순민원 5482
54.8%
복합민원 4518
45.2%

처리결과
Categorical

IMBALANCE 

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
해결
8912 
취하
 
588
처리중
 
358
착오
 
86
법적불가
 
25
Other values (5)
 
31

Length

Max length10
Median length2
Mean length2.051
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row해결
2nd row해결
3rd row해결
4th row해결
5th row해결

Common Values

ValueCountFrequency (%)
해결 8912
89.1%
취하 588
 
5.9%
처리중 358
 
3.6%
착오 86
 
0.9%
법적불가 25
 
0.2%
반려 12
 
0.1%
중복접수 시스템장애 11
 
0.1%
기타불가 5
 
0.1%
일부해결 2
 
< 0.1%
이송 1
 
< 0.1%

Length

2023-12-12T09:16:23.436512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:16:23.545081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
해결 8912
89.0%
취하 588
 
5.9%
처리중 358
 
3.6%
착오 86
 
0.9%
법적불가 25
 
0.2%
반려 12
 
0.1%
중복접수 11
 
0.1%
시스템장애 11
 
0.1%
기타불가 5
 
< 0.1%
일부해결 2
 
< 0.1%

처리일자
Date

MISSING 

Distinct645
Distinct (%)6.7%
Missing358
Missing (%)3.6%
Memory size156.2 KiB
Minimum2018-04-05 00:00:00
Maximum2021-10-08 00:00:00
2023-12-12T09:16:23.664349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:16:23.783358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct770
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2018-02-20 00:00:00
Maximum2022-08-11 00:00:00
2023-12-12T09:16:23.901108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:16:24.024699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

처리부서
Categorical

Distinct44
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
행정복지국 민원봉사과
3200 
산업건설국 지역경제과
2953 
압해읍
620 
지역경제과
555 
산업건설국 경제에너지과
429 
Other values (39)
2243 

Length

Max length13
Median length11
Mean length9.2553
Min length3

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row산업건설국 지역경제과
2nd row민원봉사실
3rd row행정복지국 세계유산과
4th row행정복지국 민원봉사과
5th row행정복지국 민원봉사과

Common Values

ValueCountFrequency (%)
행정복지국 민원봉사과 3200
32.0%
산업건설국 지역경제과 2953
29.5%
압해읍 620
 
6.2%
지역경제과 555
 
5.5%
산업건설국 경제에너지과 429
 
4.3%
민원봉사실 363
 
3.6%
산업건설국 해양수산과 223
 
2.2%
산업건설국 신재생에너지과 173
 
1.7%
산업건설국 안전건설과 154
 
1.5%
안좌면 130
 
1.3%
Other values (34) 1200
 
12.0%

Length

2023-12-12T09:16:24.142015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
산업건설국 4056
23.2%
지역경제과 3508
20.1%
행정복지국 3363
19.3%
민원봉사과 3200
18.3%
압해읍 620
 
3.6%
경제에너지과 429
 
2.5%
민원봉사실 363
 
2.1%
해양수산과 231
 
1.3%
신재생에너지과 173
 
1.0%
안전건설과 154
 
0.9%
Other values (30) 1358
 
7.8%

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

Correlations

2023-12-12T09:16:24.210978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
민원구분처리결과처리부서
민원구분1.0000.1770.493
처리결과0.1771.0000.333
처리부서0.4930.3331.000
2023-12-12T09:16:24.292626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
처리부서민원구분처리결과
처리부서1.0000.3940.120
민원구분0.3941.0000.135
처리결과0.1200.1351.000
2023-12-12T09:16:24.374103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
민원구분처리결과처리부서
민원구분1.0000.1350.394
처리결과0.1351.0000.120
처리부서0.3940.1201.000

Missing values

2023-12-12T09:16:22.194303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:16:22.342049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

접수일자민원사무명민원구분처리결과처리일자처리예정일처리부서Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12
16542018-07-17개발행위허가(토지형질변경, 토석채취, 공작물설치, 토지분할, 물건적치)복합민원해결2019-06-262020-02-03산업건설국 지역경제과<NA><NA><NA><NA><NA><NA>
9182018-05-23건축허가-2층이하또는1천제곱미터미만 건축물-기타(건축사 업무대행 건축물)복합민원해결2018-06-012018-06-12민원봉사실<NA><NA><NA><NA><NA><NA>
86972020-01-13유해야생동물 포획허가단순민원해결2020-01-142020-01-16행정복지국 세계유산과<NA><NA><NA><NA><NA><NA>
37922018-12-14연안어업허가단순민원해결2018-12-172018-12-19행정복지국 민원봉사과<NA><NA><NA><NA><NA><NA>
86012020-01-02개발행위허가(토지형질변경, 토석채취, 공작물설치, 토지분할, 물건적치)복합민원해결2020-01-142020-01-22행정복지국 민원봉사과<NA><NA><NA><NA><NA><NA>
38102018-12-14연안어업허가단순민원해결2018-12-172018-12-19행정복지국 민원봉사과<NA><NA><NA><NA><NA><NA>
54312019-04-01전기사업양수.양도인가-시설용량 3천킬로와트 이하단순민원해결2019-05-012019-05-13산업건설국 지역경제과<NA><NA><NA><NA><NA><NA>
18952018-08-03개발행위허가(토지형질변경, 토석채취, 공작물설치, 토지분할, 물건적치)복합민원해결2018-08-242018-08-24산업건설국 지역경제과<NA><NA><NA><NA><NA><NA>
85582019-11-14옥외광고물등의표시신고복합민원해결2019-11-142019-11-19압해읍<NA><NA><NA><NA><NA><NA>
51582019-03-11육상해수양식어업 허가-유형없음단순민원해결2019-03-112019-03-18행정복지국 민원봉사과<NA><NA><NA><NA><NA><NA>
접수일자민원사무명민원구분처리결과처리일자처리예정일처리부서Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12
53682019-03-28개발행위허가(토지형질변경, 토석채취, 공작물설치, 토지분할, 물건적치)복합민원해결2019-04-092019-04-17산업건설국 지역경제과<NA><NA><NA><NA><NA><NA>
101972020-06-18어업허가사항변경허가-구획어업,연안어업단순민원해결2020-06-192020-06-22행정복지국 민원봉사과<NA><NA><NA><NA><NA><NA>
92672020-03-06어선개조(개조발주)허가-연안어업어선단순민원해결2020-03-062020-03-10행정복지국 민원봉사과<NA><NA><NA><NA><NA><NA>
85182019-09-25옥외광고물등의표시신고복합민원해결2019-09-252019-09-30압해읍<NA><NA><NA><NA><NA><NA>
101442020-06-11전기사업허가사항변경-시설용량 3천킬로와트 이하단순민원해결2020-06-192020-07-22산업건설국 지역경제과<NA><NA><NA><NA><NA><NA>
4642018-04-11개발행위허가(토지형질변경, 토석채취, 공작물설치, 토지분할, 물건적치)복합민원해결2018-07-262019-09-04지역경제과<NA><NA><NA><NA><NA><NA>
39682018-12-24구획어업허가단순민원해결2018-12-262018-12-28행정복지국 민원봉사과<NA><NA><NA><NA><NA><NA>
14542018-07-06어선건조(건조발주)허가-연안어업어선단순민원해결2018-07-062018-07-11민원봉사실<NA><NA><NA><NA><NA><NA>
23882018-10-18어선건조(건조발주)허가-연안어업어선단순민원해결2018-10-182018-10-23행정복지국 민원봉사과<NA><NA><NA><NA><NA><NA>
115872021-05-03공유수면점.사용변경허가-기타의 공유수면단순민원해결2021-05-032021-06-21산업건설국 해양수산과<NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

접수일자민원사무명민원구분처리결과처리일자처리예정일처리부서# duplicates
3642018-11-15연안어업허가단순민원해결2018-11-192018-11-20행정복지국 민원봉사과56
4072018-12-07연안어업허가단순민원해결2018-12-102018-12-12행정복지국 민원봉사과44
3552018-11-12연안어업허가단순민원해결2018-11-132018-11-15행정복지국 민원봉사과43
3562018-11-12연안어업허가단순민원해결2018-11-142018-11-15행정복지국 민원봉사과43
4862019-01-24개발행위허가(토지형질변경, 토석채취, 공작물설치, 토지분할, 물건적치)복합민원처리중<NA>2019-02-18산업건설국 지역경제과43
3352018-11-02연안어업허가단순민원해결2018-11-052018-11-07행정복지국 민원봉사과39
4132018-12-12연안어업허가단순민원해결2018-12-142018-12-17행정복지국 민원봉사과38
14992021-08-11전기사업허가사항변경-시설용량 3천킬로와트 이하단순민원해결2021-08-262021-09-27산업건설국 신재생에너지과34
4302018-12-19연안어업허가단순민원해결2018-12-202018-12-24행정복지국 민원봉사과33
1512018-06-14개발행위허가(토지형질변경, 토석채취, 공작물설치, 토지분할, 물건적치)복합민원해결2019-06-262020-01-13산업건설국 지역경제과32