Overview

Dataset statistics

Number of variables5
Number of observations46
Missing cells9
Missing cells (%)3.9%
Duplicate rows1
Duplicate rows (%)2.2%
Total size in memory1.9 KiB
Average record size in memory42.9 B

Variable types

Text3
Categorical2

Dataset

Description광주광역시 동구에 있는 행정사(다른 사람의 위임을 받아 행정 기관에 낼 서류, 주민의 권리ㆍ의무와 사실 증명에 관한 서류의 작성)업을 영위하는 사무소입니다.(업체명, 주소, 전화번호 등)
URLhttps://www.data.go.kr/data/15028760/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (2.2%) duplicate rowsDuplicates
전화번호 has 9 (19.6%) missing valuesMissing

Reproduction

Analysis started2023-12-12 07:22:41.128092
Analysis finished2023-12-12 07:22:41.543239
Duration0.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct45
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-12T16:22:41.707055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length11
Mean length11.282609
Min length5

Characters and Unicode

Total characters519
Distinct characters88
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)95.7%

Sample

1st row행정사이선동사무소
2nd row행정사 신연호 사무소
3rd row행정사 양용무 사무소
4th row행정사 김동진 사무소
5th row행정사사무소 가승
ValueCountFrequency (%)
사무소 40
29.2%
행정사 38
27.7%
행정사사무소 3
 
2.2%
최진이 2
 
1.5%
일반행정사 2
 
1.5%
번역 2
 
1.5%
행정심판 1
 
0.7%
김유곤사무소 1
 
0.7%
나창국 1
 
0.7%
김종민 1
 
0.7%
Other values (46) 46
33.6%
2023-12-12T16:22:42.160972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
92
17.7%
91
17.5%
55
10.6%
48
9.2%
47
9.1%
46
8.9%
12
 
2.3%
6
 
1.2%
4
 
0.8%
4
 
0.8%
Other values (78) 114
22.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 426
82.1%
Space Separator 91
 
17.5%
Uppercase Letter 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
92
21.6%
55
12.9%
48
11.3%
47
11.0%
46
10.8%
12
 
2.8%
6
 
1.4%
4
 
0.9%
4
 
0.9%
3
 
0.7%
Other values (75) 109
25.6%
Uppercase Letter
ValueCountFrequency (%)
L 1
50.0%
J 1
50.0%
Space Separator
ValueCountFrequency (%)
91
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 426
82.1%
Common 91
 
17.5%
Latin 2
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
92
21.6%
55
12.9%
48
11.3%
47
11.0%
46
10.8%
12
 
2.8%
6
 
1.4%
4
 
0.9%
4
 
0.9%
3
 
0.7%
Other values (75) 109
25.6%
Latin
ValueCountFrequency (%)
L 1
50.0%
J 1
50.0%
Common
ValueCountFrequency (%)
91
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 426
82.1%
ASCII 93
 
17.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
92
21.6%
55
12.9%
48
11.3%
47
11.0%
46
10.8%
12
 
2.8%
6
 
1.4%
4
 
0.9%
4
 
0.9%
3
 
0.7%
Other values (75) 109
25.6%
ASCII
ValueCountFrequency (%)
91
97.8%
L 1
 
1.1%
J 1
 
1.1%
Distinct41
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-12T16:22:42.499938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length31
Mean length26.434783
Min length20

Characters and Unicode

Total characters1216
Distinct characters75
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)80.4%

Sample

1st row광주광역시 동구 금남로 229, 306호(금남로2가)
2nd row광주광역시 동구 구성로204번길 20, 1115호(대인동)
3rd row광주광역시 동구 동명로 110, 315호(지산동)
4th row광주광역시 동구 지산로 82-1(지산동)
5th row광주광역시 동구 동명로 114, 4층(지산동)
ValueCountFrequency (%)
광주광역시 46
18.0%
동구 46
18.0%
지산동 23
 
9.0%
동명로 10
 
3.9%
지산로 7
 
2.7%
114, 4
 
1.6%
예술길 4
 
1.6%
83-2 3
 
1.2%
금남로 3
 
1.2%
대의동 3
 
1.2%
Other values (87) 106
41.6%
2023-12-12T16:22:42.968921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
209
17.2%
107
 
8.8%
94
 
7.7%
1 51
 
4.2%
48
 
3.9%
47
 
3.9%
47
 
3.9%
47
 
3.9%
46
 
3.8%
) 46
 
3.8%
Other values (65) 474
39.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 693
57.0%
Space Separator 209
 
17.2%
Decimal Number 189
 
15.5%
Close Punctuation 46
 
3.8%
Open Punctuation 46
 
3.8%
Other Punctuation 23
 
1.9%
Dash Punctuation 10
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
107
15.4%
94
13.6%
48
 
6.9%
47
 
6.8%
47
 
6.8%
47
 
6.8%
46
 
6.6%
45
 
6.5%
40
 
5.8%
15
 
2.2%
Other values (50) 157
22.7%
Decimal Number
ValueCountFrequency (%)
1 51
27.0%
2 31
16.4%
3 27
14.3%
4 20
 
10.6%
0 16
 
8.5%
6 12
 
6.3%
8 12
 
6.3%
7 9
 
4.8%
5 6
 
3.2%
9 5
 
2.6%
Space Separator
ValueCountFrequency (%)
209
100.0%
Close Punctuation
ValueCountFrequency (%)
) 46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 46
100.0%
Other Punctuation
ValueCountFrequency (%)
23
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 693
57.0%
Common 523
43.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
107
15.4%
94
13.6%
48
 
6.9%
47
 
6.8%
47
 
6.8%
47
 
6.8%
46
 
6.6%
45
 
6.5%
40
 
5.8%
15
 
2.2%
Other values (50) 157
22.7%
Common
ValueCountFrequency (%)
209
40.0%
1 51
 
9.8%
) 46
 
8.8%
( 46
 
8.8%
2 31
 
5.9%
3 27
 
5.2%
23
 
4.4%
4 20
 
3.8%
0 16
 
3.1%
6 12
 
2.3%
Other values (5) 42
 
8.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 693
57.0%
ASCII 500
41.1%
None 23
 
1.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
209
41.8%
1 51
 
10.2%
) 46
 
9.2%
( 46
 
9.2%
2 31
 
6.2%
3 27
 
5.4%
4 20
 
4.0%
0 16
 
3.2%
6 12
 
2.4%
8 12
 
2.4%
Other values (4) 30
 
6.0%
Hangul
ValueCountFrequency (%)
107
15.4%
94
13.6%
48
 
6.9%
47
 
6.8%
47
 
6.8%
47
 
6.8%
46
 
6.6%
45
 
6.5%
40
 
5.8%
15
 
2.2%
Other values (50) 157
22.7%
None
ValueCountFrequency (%)
23
100.0%

전화번호
Text

MISSING 

Distinct37
Distinct (%)100.0%
Missing9
Missing (%)19.6%
Memory size500.0 B
2023-12-12T16:22:43.237962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.756757
Min length1

Characters and Unicode

Total characters435
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row062-374-1345
2nd row062-419-8885
3rd row062-226-5193
4th row
5th row062-227-2427
ValueCountFrequency (%)
062-224-1535 1
 
2.8%
062-232-2287 1
 
2.8%
070-7745-6860 1
 
2.8%
062-229-0577 1
 
2.8%
062-434-0031 1
 
2.8%
062-223-1474 1
 
2.8%
062-674-9969 1
 
2.8%
062-225-7845 1
 
2.8%
062-226-0089 1
 
2.8%
062-226-3800 1
 
2.8%
Other values (26) 26
72.2%
2023-12-12T16:22:43.590276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 108
24.8%
- 72
16.6%
0 59
13.6%
6 48
11.0%
3 27
 
6.2%
4 26
 
6.0%
7 25
 
5.7%
9 19
 
4.4%
5 18
 
4.1%
1 17
 
3.9%
Other values (2) 16
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 362
83.2%
Dash Punctuation 72
 
16.6%
Space Separator 1
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 108
29.8%
0 59
16.3%
6 48
13.3%
3 27
 
7.5%
4 26
 
7.2%
7 25
 
6.9%
9 19
 
5.2%
5 18
 
5.0%
1 17
 
4.7%
8 15
 
4.1%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 435
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 108
24.8%
- 72
16.6%
0 59
13.6%
6 48
11.0%
3 27
 
6.2%
4 26
 
6.0%
7 25
 
5.7%
9 19
 
4.4%
5 18
 
4.1%
1 17
 
3.9%
Other values (2) 16
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 435
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 108
24.8%
- 72
16.6%
0 59
13.6%
6 48
11.0%
3 27
 
6.2%
4 26
 
6.0%
7 25
 
5.7%
9 19
 
4.4%
5 18
 
4.1%
1 17
 
3.9%
Other values (2) 16
 
3.7%

비고
Categorical

Distinct2
Distinct (%)4.3%
Missing0
Missing (%)0.0%
Memory size500.0 B
<NA>
36 
개인정보로 인한 전화번호 미수집
10 

Length

Max length17
Median length4
Mean length6.826087
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row개인정보로 인한 전화번호 미수집
3rd row<NA>
4th row개인정보로 인한 전화번호 미수집
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 36
78.3%
개인정보로 인한 전화번호 미수집 10
 
21.7%

Length

2023-12-12T16:22:43.734609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:22:43.844434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 36
47.4%
개인정보로 10
 
13.2%
인한 10
 
13.2%
전화번호 10
 
13.2%
미수집 10
 
13.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-04-12
46 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-04-12
2nd row2023-04-12
3rd row2023-04-12
4th row2023-04-12
5th row2023-04-12

Common Values

ValueCountFrequency (%)
2023-04-12 46
100.0%

Length

2023-12-12T16:22:43.946964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:22:44.030499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-04-12 46
100.0%

Correlations

2023-12-12T16:22:44.083115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정사 사무소소재지도로명주소전화번호
행정사 사무소1.0001.0001.000
소재지도로명주소1.0001.0001.000
전화번호1.0001.0001.000

Missing values

2023-12-12T16:22:41.402650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:22:41.499094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

행정사 사무소소재지도로명주소전화번호비고데이터기준일자
0행정사이선동사무소광주광역시 동구 금남로 229, 306호(금남로2가)062-374-1345<NA>2023-04-12
1행정사 신연호 사무소광주광역시 동구 구성로204번길 20, 1115호(대인동)<NA>개인정보로 인한 전화번호 미수집2023-04-12
2행정사 양용무 사무소광주광역시 동구 동명로 110, 315호(지산동)062-419-8885<NA>2023-04-12
3행정사 김동진 사무소광주광역시 동구 지산로 82-1(지산동)<NA>개인정보로 인한 전화번호 미수집2023-04-12
4행정사사무소 가승광주광역시 동구 동명로 114, 4층(지산동)062-226-5193<NA>2023-04-12
5수 행정사사무소광주광역시 동구 필문대로 232-1, 3층(동명동)<NA>개인정보로 인한 전화번호 미수집2023-04-12
6스타 마크 탐정 행정사마크 이정남 행정사사무소광주광역시 동구 무등로 434, 1층 1호 102호(산수동)개인정보로 인한 전화번호 미수집2023-04-12
7김용선 행정사 사무소광주광역시 동구 동명로 115, 1층(지산동)062-227-2427<NA>2023-04-12
8최진이 번역 일반행정사 사무소광주광역시 동구 지산로63번길 3, 요천빌딩 4층 (지산동)<NA>개인정보로 인한 전화번호 미수집2023-04-12
9최진이 번역 일반행정사 사무소광주광역시 동구 지산로63번길 3, 요천빌딩 4층 (지산동)<NA>개인정보로 인한 전화번호 미수집2023-04-12
행정사 사무소소재지도로명주소전화번호비고데이터기준일자
36행정사 박종선 사무소광주광역시 동구 동계천로 89 (동명동)070-7745-6860<NA>2023-04-12
37행정사 김금호 사무소광주광역시 동구 참판로6번길 2 (계림동)062-251-2505<NA>2023-04-12
38행정사 정상우 사무소광주광역시 동구 예술길 33, 광주동부경찰서 (대의동)062-227-7401<NA>2023-04-12
39행정사 권석희 사무소광주광역시 동구 동명로110번길 3 (지산동)062-224-3171<NA>2023-04-12
40행정사 류종남 사무소광주광역시 동구 예술길 33, 광주동부경찰서 (대의동)062-222-7407<NA>2023-04-12
41행정사 서충열 사무소광주광역시 동구 필문대로205번길 12-1 (지산동)070-4609-9292<NA>2023-04-12
42행정사 김인호 사무소광주광역시 동구 동명로 114, 208호 (지산동)062-223-3498<NA>2023-04-12
43행정사 박봉조 사무소광주광역시 동구 동명로 114 (지산동)062-224-2191<NA>2023-04-12
44행정사 김재섭 사무소광주광역시 동구 예술길 29 (대의동)062-222-7153<NA>2023-04-12
45행정사 서승부 사무소광주광역시 동구 지산로 72, 14호 (지산동)062-225-2249<NA>2023-04-12

Duplicate rows

Most frequently occurring

행정사 사무소소재지도로명주소전화번호비고데이터기준일자# duplicates
0최진이 번역 일반행정사 사무소광주광역시 동구 지산로63번길 3, 요천빌딩 4층 (지산동)<NA>개인정보로 인한 전화번호 미수집2023-04-122