Overview

Dataset statistics

Number of variables6
Number of observations78
Missing cells69
Missing cells (%)14.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory49.6 B

Variable types

Categorical4
Text2

Dataset

Description경기도 성남시 무인민원발급기 수수료에 대한 데이터로, 경기도 성남시 무인민원 발급기 수수료에 관한 업무, 민원증명, 발급수수료 등의 항목을 제공합니다.
Author경기도 성남시
URLhttps://www.data.go.kr/data/3039817/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
업무 is highly overall correlated with 발급수수료_관내(원) and 1 other fieldsHigh correlation
발급수수료_관내(원) is highly overall correlated with 업무 and 1 other fieldsHigh correlation
발급수수료_관외(원) is highly overall correlated with 업무 and 1 other fieldsHigh correlation
발급수수료_관외(원) is highly imbalanced (55.2%)Imbalance
비고 has 69 (88.5%) missing valuesMissing
민원증명 has unique valuesUnique

Reproduction

Analysis started2024-03-14 11:37:01.200213
Analysis finished2024-03-14 11:37:01.916668
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업무
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)21.8%
Missing0
Missing (%)0.0%
Memory size752.0 B
국세청 증명
13 
고용보험/산재
국민연금 증명
교육제증명
여권
Other values (12)
34 

Length

Max length10
Median length7
Mean length5.1923077
Min length2

Unique

Unique2 ?
Unique (%)2.6%

Sample

1st row주민등록
2nd row주민등록
3rd row토지지적건축
4th row토지지적건축
5th row토지지적건축

Common Values

ValueCountFrequency (%)
국세청 증명 13
16.7%
고용보험/산재 9
11.5%
국민연금 증명 8
10.3%
교육제증명 8
10.3%
여권 6
7.7%
토지지적건축 6
7.7%
건강보험 증명 5
 
6.4%
차량 4
 
5.1%
농촌 3
 
3.8%
교통(경찰청) 3
 
3.8%
Other values (7) 13
16.7%

Length

2024-03-14T20:37:02.051926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
증명 26
25.0%
국세청 13
12.5%
고용보험/산재 9
 
8.7%
국민연금 8
 
7.7%
교육제증명 8
 
7.7%
여권 6
 
5.8%
토지지적건축 6
 
5.8%
건강보험 5
 
4.8%
차량 4
 
3.8%
보건복지 3
 
2.9%
Other values (8) 16
15.4%

민원증명
Text

UNIQUE 

Distinct78
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size752.0 B
2024-03-14T20:37:02.724485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length19
Mean length12.448718
Min length4

Characters and Unicode

Total characters971
Distinct characters141
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique78 ?
Unique (%)100.0%

Sample

1st row주민등록등본
2nd row주민등록초본
3rd row개별공시지가확인원
4th row토지이용계획확인원
5th row토지대장등본
ValueCountFrequency (%)
납부확인서 6
 
4.7%
국민연금보험료 4
 
3.1%
건강장기요양보험료 3
 
2.3%
지역가입자 3
 
2.3%
고용보험 3
 
2.3%
농업경영체 2
 
1.6%
부가가치세 2
 
1.6%
검정고시 2
 
1.6%
직장가입자 2
 
1.6%
자격이력내역서(근로자용 2
 
1.6%
Other values (94) 99
77.3%
2024-03-14T20:37:03.889647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
50
 
5.1%
43
 
4.4%
40
 
4.1%
39
 
4.0%
32
 
3.3%
( 30
 
3.1%
) 30
 
3.1%
27
 
2.8%
21
 
2.2%
21
 
2.2%
Other values (131) 638
65.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 844
86.9%
Space Separator 50
 
5.1%
Open Punctuation 30
 
3.1%
Close Punctuation 30
 
3.1%
Other Punctuation 17
 
1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
43
 
5.1%
40
 
4.7%
39
 
4.6%
32
 
3.8%
27
 
3.2%
21
 
2.5%
21
 
2.5%
19
 
2.3%
19
 
2.3%
18
 
2.1%
Other values (125) 565
66.9%
Other Punctuation
ValueCountFrequency (%)
, 9
52.9%
· 5
29.4%
/ 3
 
17.6%
Space Separator
ValueCountFrequency (%)
50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 844
86.9%
Common 127
 
13.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
43
 
5.1%
40
 
4.7%
39
 
4.6%
32
 
3.8%
27
 
3.2%
21
 
2.5%
21
 
2.5%
19
 
2.3%
19
 
2.3%
18
 
2.1%
Other values (125) 565
66.9%
Common
ValueCountFrequency (%)
50
39.4%
( 30
23.6%
) 30
23.6%
, 9
 
7.1%
· 5
 
3.9%
/ 3
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 844
86.9%
ASCII 122
 
12.6%
None 5
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
50
41.0%
( 30
24.6%
) 30
24.6%
, 9
 
7.4%
/ 3
 
2.5%
Hangul
ValueCountFrequency (%)
43
 
5.1%
40
 
4.7%
39
 
4.6%
32
 
3.8%
27
 
3.2%
21
 
2.5%
21
 
2.5%
19
 
2.3%
19
 
2.3%
18
 
2.1%
Other values (125) 565
66.9%
None
ValueCountFrequency (%)
· 5
100.0%

발급수수료_관내(원)
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)7.7%
Missing0
Missing (%)0.0%
Memory size752.0 B
0
59 
500
1000
 
3
300
 
3
무료
 
2

Length

Max length4
Median length1
Mean length1.5
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row무료
2nd row무료
3rd row800
4th row1000
5th row500

Common Values

ValueCountFrequency (%)
0 59
75.6%
500 9
 
11.5%
1000 3
 
3.8%
300 3
 
3.8%
무료 2
 
2.6%
800 2
 
2.6%

Length

2024-03-14T20:37:04.319051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:37:04.671519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 59
75.6%
500 9
 
11.5%
1000 3
 
3.8%
300 3
 
3.8%
무료 2
 
2.6%
800 2
 
2.6%

발급수수료_관외(원)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct7
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Memory size752.0 B
0
61 
500
300
 
3
무료
 
2
1000
 
2
Other values (2)
 
3

Length

Max length4
Median length1
Mean length1.4615385
Min length1

Unique

Unique1 ?
Unique (%)1.3%

Sample

1st row무료
2nd row무료
3rd row800
4th row1000
5th row500

Common Values

ValueCountFrequency (%)
0 61
78.2%
500 7
 
9.0%
300 3
 
3.8%
무료 2
 
2.6%
1000 2
 
2.6%
1500 2
 
2.6%
800 1
 
1.3%

Length

2024-03-14T20:37:05.076198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:37:05.438837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 61
78.2%
500 7
 
9.0%
300 3
 
3.8%
무료 2
 
2.6%
1000 2
 
2.6%
1500 2
 
2.6%
800 1
 
1.3%

비고
Text

MISSING 

Distinct6
Distinct (%)66.7%
Missing69
Missing (%)88.5%
Memory size752.0 B
2024-03-14T20:37:06.337033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length18
Mean length14.111111
Min length5

Characters and Unicode

Total characters127
Distinct characters49
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)44.4%

Sample

1st row이해관계는 창구에서만 발급(수수료 500원)
2nd row20장 초과시 1장당초과 100원
3rd row20장 초과시 1장당초과 100원
4th row20장 초과시 1장당초과 100원
5th row도면추가시 장당 100원
ValueCountFrequency (%)
100원 4
15.4%
20장 3
11.5%
초과시 3
11.5%
1장당초과 3
11.5%
관외 2
 
7.7%
불가 2
 
7.7%
이해관계는 1
 
3.8%
창구에서만 1
 
3.8%
발급(수수료 1
 
3.8%
500원 1
 
3.8%
Other values (5) 5
19.2%
2024-03-14T20:37:07.139494image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
13.4%
0 13
 
10.2%
1 8
 
6.3%
7
 
5.5%
6
 
4.7%
6
 
4.7%
5
 
3.9%
4
 
3.1%
4
 
3.1%
, 4
 
3.1%
Other values (39) 53
41.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 79
62.2%
Decimal Number 25
 
19.7%
Space Separator 17
 
13.4%
Other Punctuation 4
 
3.1%
Close Punctuation 1
 
0.8%
Open Punctuation 1
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
8.9%
6
 
7.6%
6
 
7.6%
5
 
6.3%
4
 
5.1%
4
 
5.1%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
Other values (31) 37
46.8%
Decimal Number
ValueCountFrequency (%)
0 13
52.0%
1 8
32.0%
2 3
 
12.0%
5 1
 
4.0%
Space Separator
ValueCountFrequency (%)
17
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 79
62.2%
Common 48
37.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
8.9%
6
 
7.6%
6
 
7.6%
5
 
6.3%
4
 
5.1%
4
 
5.1%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
Other values (31) 37
46.8%
Common
ValueCountFrequency (%)
17
35.4%
0 13
27.1%
1 8
16.7%
, 4
 
8.3%
2 3
 
6.2%
) 1
 
2.1%
5 1
 
2.1%
( 1
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 79
62.2%
ASCII 48
37.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
17
35.4%
0 13
27.1%
1 8
16.7%
, 4
 
8.3%
2 3
 
6.2%
) 1
 
2.1%
5 1
 
2.1%
( 1
 
2.1%
Hangul
ValueCountFrequency (%)
7
 
8.9%
6
 
7.6%
6
 
7.6%
5
 
6.3%
4
 
5.1%
4
 
5.1%
3
 
3.8%
3
 
3.8%
2
 
2.5%
2
 
2.5%
Other values (31) 37
46.8%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size752.0 B
2024-01-26
78 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024-01-26
2nd row2024-01-26
3rd row2024-01-26
4th row2024-01-26
5th row2024-01-26

Common Values

ValueCountFrequency (%)
2024-01-26 78
100.0%

Length

2024-03-14T20:37:07.418917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:37:07.585042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024-01-26 78
100.0%

Correlations

2024-03-14T20:37:07.708822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업무민원증명발급수수료_관내(원)발급수수료_관외(원)비고
업무1.0001.0000.9000.8960.970
민원증명1.0001.0001.0001.0001.000
발급수수료_관내(원)0.9001.0001.0000.9510.850
발급수수료_관외(원)0.8961.0000.9511.0001.000
비고0.9701.0000.8501.0001.000
2024-03-14T20:37:07.878778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
발급수수료_관외(원)발급수수료_관내(원)업무
발급수수료_관외(원)1.0000.8970.636
발급수수료_관내(원)0.8971.0000.647
업무0.6360.6471.000
2024-03-14T20:37:08.031046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업무발급수수료_관내(원)발급수수료_관외(원)
업무1.0000.6470.636
발급수수료_관내(원)0.6471.0000.897
발급수수료_관외(원)0.6360.8971.000

Missing values

2024-03-14T20:37:01.654572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T20:37:01.842143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업무민원증명발급수수료_관내(원)발급수수료_관외(원)비고데이터기준일자
0주민등록주민등록등본무료무료<NA>2024-01-26
1주민등록주민등록초본무료무료이해관계는 창구에서만 발급(수수료 500원)2024-01-26
2토지지적건축개별공시지가확인원800800<NA>2024-01-26
3토지지적건축토지이용계획확인원10001000<NA>2024-01-26
4토지지적건축토지대장등본50050020장 초과시 1장당초과 100원2024-01-26
5토지지적건축임야대장등본50050020장 초과시 1장당초과 100원2024-01-26
6토지지적건축대지권등록부50050020장 초과시 1장당초과 100원2024-01-26
7토지지적건축건축물대장500500도면추가시 장당 100원2024-01-26
8차량건설기계등록원부(갑)5001500<NA>2024-01-26
9차량건설기계등록원부(을)5001500<NA>2024-01-26
업무민원증명발급수수료_관내(원)발급수수료_관외(원)비고데이터기준일자
68여권여권정보증명서00<NA>2024-01-26
69국민연금 증명국민연금 가입자 가입증명00<NA>2024-01-26
70국민연금 증명국민연금 수급증명(지급내역)00<NA>2024-01-26
71국민연금 증명연금소득원천징수영수증00<NA>2024-01-26
72국민연금 증명연금산정용 가입내역 확인서00<NA>2024-01-26
73국민연금 증명국민연금보험료 소득공제용 납부확인서00<NA>2024-01-26
74국민연금 증명국민연금보험료 납부확인서00<NA>2024-01-26
75교통(경찰청)운전경력증명서(국문)00<NA>2024-01-26
76교통(경찰청)운전경력증명서(영문)00<NA>2024-01-26
77교통(경찰청)교통사고사실확인원00<NA>2024-01-26