Overview

Dataset statistics

Number of variables4
Number of observations692
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory22.4 KiB
Average record size in memory33.2 B

Variable types

Numeric1
Text2
Categorical1

Dataset

Description서울특별시 용산구 민원사무편람 현황에 대한 데이터로 연번, 민원사무명, 소관부서, 수수료에 대한 항목을 제공합니다.
URLhttps://www.data.go.kr/data/15066452/fileData.do

Alerts

연번 is highly overall correlated with 소관부서High correlation
소관부서 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:42:09.409905
Analysis finished2023-12-12 04:42:09.979148
Duration0.57 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct692
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean346.5
Minimum1
Maximum692
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.2 KiB
2023-12-12T13:42:10.053454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile35.55
Q1173.75
median346.5
Q3519.25
95-th percentile657.45
Maximum692
Range691
Interquartile range (IQR)345.5

Descriptive statistics

Standard deviation199.90748
Coefficient of variation (CV)0.57693356
Kurtosis-1.2
Mean346.5
Median Absolute Deviation (MAD)173
Skewness0
Sum239778
Variance39963
MonotonicityStrictly increasing
2023-12-12T13:42:10.192140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
467 1
 
0.1%
459 1
 
0.1%
460 1
 
0.1%
461 1
 
0.1%
462 1
 
0.1%
463 1
 
0.1%
464 1
 
0.1%
465 1
 
0.1%
466 1
 
0.1%
Other values (682) 682
98.6%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
692 1
0.1%
691 1
0.1%
690 1
0.1%
689 1
0.1%
688 1
0.1%
687 1
0.1%
686 1
0.1%
685 1
0.1%
684 1
0.1%
683 1
0.1%
Distinct689
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-12T13:42:10.438785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length68
Median length40
Mean length17.236994
Min length5

Characters and Unicode

Total characters11928
Distinct characters337
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique686 ?
Unique (%)99.1%

Sample

1st row대수선·용도변경허가신청서
2nd row등록증, 허가증 재발급 신청서
3rd row사실증명 발급 신청서
4th row특정토양오염관리대상시설 설치신고서
5th row특정토양오염관리대상시설 설치 변경(폐쇄)신고서
ValueCountFrequency (%)
신청서 108
 
5.6%
신고서 78
 
4.0%
35
 
1.8%
변경신고서 30
 
1.6%
등록 17
 
0.9%
변경 14
 
0.7%
14
 
0.7%
신고 13
 
0.7%
등록신청서 11
 
0.6%
위임장 10
 
0.5%
Other values (1118) 1597
82.9%
2023-12-12T13:42:10.969334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1237
 
10.4%
710
 
6.0%
684
 
5.7%
370
 
3.1%
365
 
3.1%
343
 
2.9%
( 242
 
2.0%
) 241
 
2.0%
, 233
 
2.0%
214
 
1.8%
Other values (327) 7289
61.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9883
82.9%
Space Separator 1237
 
10.4%
Other Punctuation 282
 
2.4%
Open Punctuation 242
 
2.0%
Close Punctuation 242
 
2.0%
Decimal Number 13
 
0.1%
Modifier Symbol 11
 
0.1%
Math Symbol 7
 
0.1%
Connector Punctuation 4
 
< 0.1%
Dash Punctuation 3
 
< 0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
710
 
7.2%
684
 
6.9%
370
 
3.7%
365
 
3.7%
343
 
3.5%
214
 
2.2%
195
 
2.0%
194
 
2.0%
187
 
1.9%
178
 
1.8%
Other values (303) 6443
65.2%
Decimal Number
ValueCountFrequency (%)
1 4
30.8%
6 2
15.4%
9 2
15.4%
3 2
15.4%
2 1
 
7.7%
0 1
 
7.7%
7 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 233
82.6%
· 32
 
11.3%
/ 17
 
6.0%
Math Symbol
ValueCountFrequency (%)
> 3
42.9%
= 3
42.9%
1
 
14.3%
Close Punctuation
ValueCountFrequency (%)
) 241
99.6%
] 1
 
0.4%
Uppercase Letter
ValueCountFrequency (%)
Z 1
50.0%
A 1
50.0%
Lowercase Letter
ValueCountFrequency (%)
o 1
50.0%
t 1
50.0%
Space Separator
ValueCountFrequency (%)
1237
100.0%
Open Punctuation
ValueCountFrequency (%)
( 242
100.0%
Modifier Symbol
ValueCountFrequency (%)
¸ 11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9883
82.9%
Common 2041
 
17.1%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
710
 
7.2%
684
 
6.9%
370
 
3.7%
365
 
3.7%
343
 
3.5%
214
 
2.2%
195
 
2.0%
194
 
2.0%
187
 
1.9%
178
 
1.8%
Other values (303) 6443
65.2%
Common
ValueCountFrequency (%)
1237
60.6%
( 242
 
11.9%
) 241
 
11.8%
, 233
 
11.4%
· 32
 
1.6%
/ 17
 
0.8%
¸ 11
 
0.5%
_ 4
 
0.2%
1 4
 
0.2%
- 3
 
0.1%
Other values (10) 17
 
0.8%
Latin
ValueCountFrequency (%)
Z 1
25.0%
o 1
25.0%
t 1
25.0%
A 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9883
82.9%
ASCII 2001
 
16.8%
None 43
 
0.4%
Arrows 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1237
61.8%
( 242
 
12.1%
) 241
 
12.0%
, 233
 
11.6%
/ 17
 
0.8%
_ 4
 
0.2%
1 4
 
0.2%
- 3
 
0.1%
> 3
 
0.1%
= 3
 
0.1%
Other values (11) 14
 
0.7%
Hangul
ValueCountFrequency (%)
710
 
7.2%
684
 
6.9%
370
 
3.7%
365
 
3.7%
343
 
3.5%
214
 
2.2%
195
 
2.0%
194
 
2.0%
187
 
1.9%
178
 
1.8%
Other values (303) 6443
65.2%
None
ValueCountFrequency (%)
· 32
74.4%
¸ 11
 
25.6%
Arrows
ValueCountFrequency (%)
1
100.0%

소관부서
Categorical

HIGH CORRELATION 

Distinct34
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
교통행정과
54 
공원녹지과
47 
관광체육과
 
43
맑은환경과
 
42
부동산정보과
 
42
Other values (29)
464 

Length

Max length8
Median length5
Mean length4.9812139
Min length3

Unique

Unique2 ?
Unique (%)0.3%

Sample

1st row재정비사업과
2nd row보건의료과
3rd row민원여권과
4th row맑은환경과
5th row맑은환경과

Common Values

ValueCountFrequency (%)
교통행정과 54
 
7.8%
공원녹지과 47
 
6.8%
관광체육과 43
 
6.2%
맑은환경과 42
 
6.1%
부동산정보과 42
 
6.1%
민원여권과 41
 
5.9%
보건의료과 39
 
5.6%
보건위생과 35
 
5.1%
지역경제과 31
 
4.5%
문화진흥과 29
 
4.2%
Other values (24) 289
41.8%

Length

2023-12-12T13:42:11.164067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
교통행정과 54
 
7.8%
공원녹지과 47
 
6.8%
관광체육과 43
 
6.2%
맑은환경과 42
 
6.1%
부동산정보과 42
 
6.1%
민원여권과 41
 
5.9%
보건의료과 39
 
5.6%
보건위생과 35
 
5.1%
지역경제과 31
 
4.5%
문화진흥과 29
 
4.2%
Other values (24) 289
41.8%
Distinct120
Distinct (%)17.3%
Missing0
Missing (%)0.0%
Memory size5.5 KiB
2023-12-12T13:42:11.390718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length201
Median length2
Mean length5.9118497
Min length1

Characters and Unicode

Total characters4091
Distinct characters214
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique92 ?
Unique (%)13.3%

Sample

1st row4000원~1620000원
2nd row2000원
3rd row2000원
4th row무료
5th row무료
ValueCountFrequency (%)
무료 490
49.1%
31
 
3.1%
10000원 23
 
2.3%
20000원 21
 
2.1%
5000원 16
 
1.6%
1000원 13
 
1.3%
3000원 9
 
0.9%
30000원 8
 
0.8%
대당 7
 
0.7%
변경 7
 
0.7%
Other values (240) 372
37.3%
2023-12-12T13:42:12.184518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 721
17.6%
500
 
12.2%
494
 
12.1%
305
 
7.5%
257
 
6.3%
1 133
 
3.3%
5 89
 
2.2%
2 80
 
2.0%
: 68
 
1.7%
3 43
 
1.1%
Other values (204) 1401
34.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2193
53.6%
Decimal Number 1142
27.9%
Space Separator 305
 
7.5%
Lowercase Letter 192
 
4.7%
Other Punctuation 162
 
4.0%
Close Punctuation 28
 
0.7%
Open Punctuation 28
 
0.7%
Math Symbol 22
 
0.5%
Uppercase Letter 16
 
0.4%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
500
22.8%
494
22.5%
257
 
11.7%
33
 
1.5%
31
 
1.4%
30
 
1.4%
28
 
1.3%
24
 
1.1%
23
 
1.0%
23
 
1.0%
Other values (155) 750
34.2%
Lowercase Letter
ValueCountFrequency (%)
o 24
12.5%
t 20
10.4%
n 20
10.4%
w 16
 
8.3%
d 16
 
8.3%
s 12
 
6.2%
g 12
 
6.2%
a 8
 
4.2%
r 8
 
4.2%
b 8
 
4.2%
Other values (10) 48
25.0%
Decimal Number
ValueCountFrequency (%)
0 721
63.1%
1 133
 
11.6%
5 89
 
7.8%
2 80
 
7.0%
3 43
 
3.8%
4 31
 
2.7%
6 19
 
1.7%
8 10
 
0.9%
7 8
 
0.7%
9 8
 
0.7%
Other Punctuation
ValueCountFrequency (%)
: 68
42.0%
/ 41
25.3%
, 24
 
14.8%
. 16
 
9.9%
& 8
 
4.9%
? 4
 
2.5%
1
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
C 4
25.0%
N 4
25.0%
B 4
25.0%
I 4
25.0%
Close Punctuation
ValueCountFrequency (%)
) 27
96.4%
1
 
3.6%
Open Punctuation
ValueCountFrequency (%)
( 27
96.4%
1
 
3.6%
Math Symbol
ValueCountFrequency (%)
= 12
54.5%
~ 10
45.5%
Space Separator
ValueCountFrequency (%)
305
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2193
53.6%
Common 1690
41.3%
Latin 208
 
5.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
500
22.8%
494
22.5%
257
 
11.7%
33
 
1.5%
31
 
1.4%
30
 
1.4%
28
 
1.3%
24
 
1.1%
23
 
1.0%
23
 
1.0%
Other values (155) 750
34.2%
Common
ValueCountFrequency (%)
0 721
42.7%
305
18.0%
1 133
 
7.9%
5 89
 
5.3%
2 80
 
4.7%
: 68
 
4.0%
3 43
 
2.5%
/ 41
 
2.4%
4 31
 
1.8%
) 27
 
1.6%
Other values (15) 152
 
9.0%
Latin
ValueCountFrequency (%)
o 24
 
11.5%
t 20
 
9.6%
n 20
 
9.6%
w 16
 
7.7%
d 16
 
7.7%
s 12
 
5.8%
g 12
 
5.8%
a 8
 
3.8%
r 8
 
3.8%
b 8
 
3.8%
Other values (14) 64
30.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2193
53.6%
ASCII 1895
46.3%
None 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 721
38.0%
305
16.1%
1 133
 
7.0%
5 89
 
4.7%
2 80
 
4.2%
: 68
 
3.6%
3 43
 
2.3%
/ 41
 
2.2%
4 31
 
1.6%
) 27
 
1.4%
Other values (36) 357
18.8%
Hangul
ValueCountFrequency (%)
500
22.8%
494
22.5%
257
 
11.7%
33
 
1.5%
31
 
1.4%
30
 
1.4%
28
 
1.3%
24
 
1.1%
23
 
1.0%
23
 
1.0%
Other values (155) 750
34.2%
None
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Interactions

2023-12-12T13:42:09.719432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:42:12.297785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소관부서
연번1.0000.940
소관부서0.9401.000
2023-12-12T13:42:12.390026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번소관부서
연번1.0000.689
소관부서0.6891.000

Missing values

2023-12-12T13:42:09.852590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:42:09.944587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번민원사무명소관부서수수료
01대수선·용도변경허가신청서재정비사업과4000원~1620000원
12등록증, 허가증 재발급 신청서보건의료과2000원
23사실증명 발급 신청서민원여권과2000원
34특정토양오염관리대상시설 설치신고서맑은환경과무료
45특정토양오염관리대상시설 설치 변경(폐쇄)신고서맑은환경과무료
56이(미)용사면허 발급신청서민원여권과5500원
67이(미)용사면허 재발급신청서민원여권과3000원
78조리사면허증 기재사항 변경 신청서민원여권과890원
89조리사면허 발급/재발급 신청서민원여권과신규:5500원 재발급:3000원
910식품 영업신고(허가)사항 변경신고서 (영업소명칭변경, 법인대표자변경시)민원여권과상호변경 : 9300원 법인대표이사변경 : 무료
연번민원사무명소관부서수수료
682683토지취득자금 조달 및 토지이용계획서, 매뉴얼부동산정보과무료
683684대중문화예술기획업 등록증 휴업, 폐업, 영업재개 신청서문화진흥과무료
684685정화조 청소확인서 (6개월에 1회, 9개월에 1회)청소행정과무료
685686전문건설업 주력분야 추가 등록건설관리과무료
686687무허가건축물 해체(철거) 신고(허가)주택과무료
687688국내(국제)결혼중개업 변경신고(등록)여성가족과무료
688689주민세(사업소분)신고서세무2과무료
689690이륜자동차정기검사 유효기간 연장(유예)신청서맑은환경과무료
690691전기차 전용주차구역 불법주차 및 충전방해 행위 위반 과태료 처분에 관한 의견제출서맑은환경과무료
691692전기차 전용주차구역 불법주차 및 충전방해 행위 위반 과태료 이의신청서맑은환경과무료