Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1547
Duplicate rows (%)15.5%
Total size in memory634.8 KiB
Average record size in memory65.0 B

Variable types

Numeric1
Categorical2
Text2
DateTime2

Dataset

Description광주광역시 동구에 설치된 무인민원발급기에 접수된 민원 현황입니다. 2023년 데이터를 포함하고 있습니다. 발급기고유번호, 민원접수번호, 발급기명, 민원사무분류번호, 민원사무분류명, 접수일자, 접수요일 등으로 구성되어 있습니다.
Author광주광역시 동구
URLhttps://www.data.go.kr/data/15103415/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1547 (15.5%) duplicate rowsDuplicates
발급기고유번호 is highly overall correlated with 발급기명High correlation
발급기명 is highly overall correlated with 발급기고유번호High correlation
민원접수번호 is highly imbalanced (83.6%)Imbalance

Reproduction

Analysis started2024-03-14 22:44:56.332422
Analysis finished2024-03-14 22:44:58.034597
Duration1.7 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

발급기고유번호
Real number (ℝ)

HIGH CORRELATION 

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean480.4923
Minimum401
Maximum516
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T07:44:58.215596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum401
5-th percentile401
Q1405
median504
Q3508
95-th percentile514
Maximum516
Range115
Interquartile range (IQR)103

Descriptive statistics

Standard deviation45.75445
Coefficient of variation (CV)0.095224106
Kurtosis-0.70748099
Mean480.4923
Median Absolute Deviation (MAD)4
Skewness-1.1221926
Sum4804923
Variance2093.4697
MonotonicityNot monotonic
2024-03-15T07:44:58.518787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
503 1579
15.8%
401 1469
14.7%
504 1087
10.9%
507 824
 
8.2%
511 726
 
7.3%
506 658
 
6.6%
402 465
 
4.7%
510 392
 
3.9%
508 343
 
3.4%
513 323
 
3.2%
Other values (11) 2134
21.3%
ValueCountFrequency (%)
401 1469
14.7%
402 465
 
4.7%
403 130
 
1.3%
404 211
 
2.1%
405 246
 
2.5%
501 137
 
1.4%
502 319
 
3.2%
503 1579
15.8%
504 1087
10.9%
505 112
 
1.1%
ValueCountFrequency (%)
516 252
 
2.5%
515 186
 
1.9%
514 187
 
1.9%
513 323
 
3.2%
512 183
 
1.8%
511 726
7.3%
510 392
3.9%
509 171
 
1.7%
508 343
3.4%
507 824
8.2%

민원접수번호
Categorical

IMBALANCE 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
'202000000000000000
9156 
'202335900200008000
 
132
'202335900000039000
 
83
'202335900000041000
 
72
'202335900000040000
 
71
Other values (20)
 
486

Length

Max length19
Median length19
Mean length19
Min length19

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row'202000000000000000
2nd row'202000000000000000
3rd row'202000000000000000
4th row'202000000000000000
5th row'202000000000000000

Common Values

ValueCountFrequency (%)
'202000000000000000 9156
91.6%
'202335900200008000 132
 
1.3%
'202335900000039000 83
 
0.8%
'202335900000041000 72
 
0.7%
'202335900000040000 71
 
0.7%
'202335900310004000 65
 
0.7%
'202335900000042000 63
 
0.6%
'202335900440002000 47
 
0.5%
'202335900000038000 40
 
0.4%
'202335900260002000 35
 
0.4%
Other values (15) 236
 
2.4%

Length

2024-03-15T07:44:58.739909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
202000000000000000 9156
91.6%
202335900200008000 132
 
1.3%
202335900000039000 83
 
0.8%
202335900000041000 72
 
0.7%
202335900000040000 71
 
0.7%
202335900310004000 65
 
0.7%
202335900000042000 63
 
0.6%
202335900440002000 47
 
0.5%
202335900000038000 40
 
0.4%
202335900260002000 35
 
0.4%
Other values (15) 236
 
2.4%

발급기명
Categorical

HIGH CORRELATION 

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
동구청민원실
1579 
전남대병원 1병동
1469 
조선대학교병원
1087 
학운동 주민센터
824 
충장동 행정복지센타
726 
Other values (16)
4315 

Length

Max length12
Median length9
Mean length8.0916
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row조선대학교병원
2nd row학운동 주민센터
3rd row충장동 행정복지센타
4th row전남대병원 1병동
5th row광주세무서

Common Values

ValueCountFrequency (%)
동구청민원실 1579
15.8%
전남대병원 1병동 1469
14.7%
조선대학교병원 1087
10.9%
학운동 주민센터 824
 
8.2%
충장동 행정복지센타 726
 
7.3%
광주세무서 658
 
6.6%
지원2동행정복지센터 465
 
4.7%
지산1동 주민센터 392
 
3.9%
계림1동주민센터 343
 
3.4%
계림2동행정복지센터 323
 
3.2%
Other values (11) 2134
21.3%

Length

2024-03-15T07:44:59.060635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
동구청민원실 1579
11.5%
전남대병원 1469
10.7%
1병동 1469
10.7%
주민센터 1216
 
8.9%
조선대학교병원 1087
 
7.9%
행정복지센타 912
 
6.7%
학운동 824
 
6.0%
충장동 726
 
5.3%
광주세무서 658
 
4.8%
지원2동행정복지센터 465
 
3.4%
Other values (15) 3304
24.1%
Distinct72
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T07:44:59.826966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length13.9712
Min length13

Characters and Unicode

Total characters139712
Distinct characters18
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)0.1%

Sample

1st row'9740000000000
2nd row'1310000000000
3rd row'1310000000000
4th row'9740000000000
5th row'1210000000000
ValueCountFrequency (%)
1310000000000 3824
38.2%
9740000000000 3362
33.6%
1210000000000 915
 
9.2%
sg4cadm203401 384
 
3.8%
9740000000401 307
 
3.1%
1310000001501 156
 
1.6%
1500000000000 135
 
1.4%
1460000000000 121
 
1.2%
1310000001502 72
 
0.7%
1340000000000 61
 
0.6%
Other values (50) 663
 
6.6%
2024-03-15T07:45:00.954546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 93045
66.6%
1 12536
 
9.0%
' 9712
 
7.0%
4 5401
 
3.9%
3 4985
 
3.6%
9 3757
 
2.7%
7 3738
 
2.7%
2 1851
 
1.3%
A 664
 
0.5%
M 640
 
0.5%
Other values (8) 3383
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 126136
90.3%
Other Punctuation 9712
 
7.0%
Uppercase Letter 3864
 
2.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 93045
73.8%
1 12536
 
9.9%
4 5401
 
4.3%
3 4985
 
4.0%
9 3757
 
3.0%
7 3738
 
3.0%
2 1851
 
1.5%
5 497
 
0.4%
6 249
 
0.2%
8 77
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
A 664
17.2%
M 640
16.6%
D 640
16.6%
G 640
16.6%
C 640
16.6%
S 593
15.3%
P 47
 
1.2%
Other Punctuation
ValueCountFrequency (%)
' 9712
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 135848
97.2%
Latin 3864
 
2.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 93045
68.5%
1 12536
 
9.2%
' 9712
 
7.1%
4 5401
 
4.0%
3 4985
 
3.7%
9 3757
 
2.8%
7 3738
 
2.8%
2 1851
 
1.4%
5 497
 
0.4%
6 249
 
0.2%
Latin
ValueCountFrequency (%)
A 664
17.2%
M 640
16.6%
D 640
16.6%
G 640
16.6%
C 640
16.6%
S 593
15.3%
P 47
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 139712
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 93045
66.6%
1 12536
 
9.0%
' 9712
 
7.0%
4 5401
 
3.9%
3 4985
 
3.6%
9 3757
 
2.7%
7 3738
 
2.7%
2 1851
 
1.3%
A 664
 
0.5%
M 640
 
0.5%
Other values (8) 3383
 
2.4%
Distinct65
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T07:45:01.855591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length25
Mean length8.4023
Min length4

Characters and Unicode

Total characters84023
Distinct characters124
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st row가족관계증명서
2nd row등기사항증명서
3rd row주민등록표(등본)
4th row가족관계증명서
5th row소득금액증명
ValueCountFrequency (%)
가족관계증명서 3469
30.4%
주민등록표(등본 2109
18.5%
주민등록표(초본 845
 
7.4%
등기사항증명서 718
 
6.3%
건강보험 443
 
3.9%
자격득실확인서 384
 
3.4%
소득금액증명 306
 
2.7%
납세증명서(국세완납증명 231
 
2.0%
지방세납세증명 226
 
2.0%
부가가치세 207
 
1.8%
Other values (77) 2474
21.7%
2024-03-15T07:45:03.226978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6252
 
7.4%
6252
 
7.4%
6008
 
7.2%
5789
 
6.9%
3989
 
4.7%
3633
 
4.3%
3608
 
4.3%
( 3504
 
4.2%
) 3504
 
4.2%
3473
 
4.1%
Other values (114) 38011
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 75495
89.9%
Open Punctuation 3504
 
4.2%
Close Punctuation 3504
 
4.2%
Space Separator 1412
 
1.7%
Other Punctuation 108
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6252
 
8.3%
6252
 
8.3%
6008
 
8.0%
5789
 
7.7%
3989
 
5.3%
3633
 
4.8%
3608
 
4.8%
3473
 
4.6%
3174
 
4.2%
3161
 
4.2%
Other values (108) 30156
39.9%
Other Punctuation
ValueCountFrequency (%)
, 89
82.4%
/ 17
 
15.7%
· 2
 
1.9%
Open Punctuation
ValueCountFrequency (%)
( 3504
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3504
100.0%
Space Separator
ValueCountFrequency (%)
1412
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 75495
89.9%
Common 8528
 
10.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6252
 
8.3%
6252
 
8.3%
6008
 
8.0%
5789
 
7.7%
3989
 
5.3%
3633
 
4.8%
3608
 
4.8%
3473
 
4.6%
3174
 
4.2%
3161
 
4.2%
Other values (108) 30156
39.9%
Common
ValueCountFrequency (%)
( 3504
41.1%
) 3504
41.1%
1412
16.6%
, 89
 
1.0%
/ 17
 
0.2%
· 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 75495
89.9%
ASCII 8526
 
10.1%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6252
 
8.3%
6252
 
8.3%
6008
 
8.0%
5789
 
7.7%
3989
 
5.3%
3633
 
4.8%
3608
 
4.8%
3473
 
4.6%
3174
 
4.2%
3161
 
4.2%
Other values (108) 30156
39.9%
ASCII
ValueCountFrequency (%)
( 3504
41.1%
) 3504
41.1%
1412
16.6%
, 89
 
1.0%
/ 17
 
0.2%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct353
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-01-01 00:00:00
Maximum2023-12-31 00:00:00
2024-03-15T07:45:03.464504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T07:45:03.864525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2024-02-28 00:00:00
Maximum2024-02-28 00:00:00
2024-03-15T07:45:04.228930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-15T07:45:04.530199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-15T07:44:57.360486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-15T07:45:04.714175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발급기고유번호민원접수번호발급기명민원사무분류번호민원사무분류명
발급기고유번호1.0000.4551.0000.4300.511
민원접수번호0.4551.0000.6550.8310.274
발급기명1.0000.6551.0000.6230.689
민원사무분류번호0.4300.8310.6231.0000.992
민원사무분류명0.5110.2740.6890.9921.000
2024-03-15T07:45:04.878903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
민원접수번호발급기명
민원접수번호1.0000.231
발급기명0.2311.000
2024-03-15T07:45:05.020364image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발급기고유번호민원접수번호발급기명
발급기고유번호1.0000.2170.999
민원접수번호0.2171.0000.231
발급기명0.9990.2311.000

Missing values

2024-03-15T07:44:57.716205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T07:44:57.936898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발급기고유번호민원접수번호발급기명민원사무분류번호민원사무분류명접수일자데이터기준일자
8039504'202000000000000000조선대학교병원'9740000000000가족관계증명서2023-02-012024-02-28
59377507'202000000000000000학운동 주민센터'1310000000000등기사항증명서2023-09-122024-02-28
78973511'202000000000000000충장동 행정복지센타'1310000000000주민등록표(등본)2023-12-192024-02-28
7529401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-01-312024-02-28
41026506'202000000000000000광주세무서'1210000000000소득금액증명2023-06-212024-02-28
37364515'202000000000000000학동 행정복지센타'SG4CADM203401건강보험 자격득실확인서2023-06-022024-02-28
50326506'202335900200007000광주세무서'1210000002101소득금액증명2023-08-032024-02-28
32803511'202000000000000000충장동 행정복지센타'SG4CADM203401건강보험 자격득실확인서2023-05-122024-02-28
12634501'202000000000000000계림동삼성홈플러스'SG4CADM203601지역 국민연금보험료 납부확인서2023-02-182024-02-28
6812501'202000000000000000계림동삼성홈플러스'1310000000000주민등록표(등본)2023-01-272024-02-28
발급기고유번호민원접수번호발급기명민원사무분류번호민원사무분류명접수일자데이터기준일자
30129503'202000000000000000동구청민원실'9740000000000가족관계증명서2023-05-012024-02-28
24678504'202000000000000000조선대학교병원'1310000000000주민등록표(등본)2023-04-062024-02-28
34632401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-05-222024-02-28
28936506'202000000000000000광주세무서'1210000000000납세증명서(국세완납증명)2023-04-252024-02-28
68128504'202000000000000000조선대학교병원'9740000000000가족관계증명서2023-10-292024-02-28
67697401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-10-262024-02-28
10276503'202000000000000000동구청민원실'1310000000000지방세 세목별 과세증명서2023-02-092024-02-28
57387507'202000000000000000학운동 주민센터'1210000000000소득금액증명2023-09-042024-02-28
75722405'202000000000000000서남동행정복지센터'1310000000000주민등록표(등본)2023-12-052024-02-28
3200506'202000000000000000광주세무서'1210000000000연금보험료 등 소득,세액공제확인서2023-01-132024-02-28

Duplicate rows

Most frequently occurring

발급기고유번호민원접수번호발급기명민원사무분류번호민원사무분류명접수일자데이터기준일자# duplicates
1155507'202000000000000000학운동 주민센터'1310000000000등기사항증명서2023-12-202024-02-2814
260401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-12-122024-02-2812
82401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-02-032024-02-2811
278401'202335900000039000전남대병원 1병동'9740000000401가족관계증명서2023-08-092024-02-2811
13401'202000000000000000전남대병원 1병동'1310000000000주민등록표(등본)2023-02-172024-02-2810
59401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-01-022024-02-2810
78401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-01-302024-02-2810
108401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-03-212024-02-2810
190401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-07-252024-02-2810
226401'202000000000000000전남대병원 1병동'9740000000000가족관계증명서2023-10-242024-02-2810