Overview

Dataset statistics

Number of variables3
Number of observations146
Missing cells40
Missing cells (%)9.1%
Duplicate rows3
Duplicate rows (%)2.1%
Total size in memory3.6 KiB
Average record size in memory24.9 B

Variable types

Text2
DateTime1

Dataset

Description대구광역시 수성구_행정사 사무소 현황_20201207
Author대구광역시 수성구
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15028852&dataSetDetailId=150288521a071b1a57265&provdMethod=FILE

Alerts

Dataset has 3 (2.1%) duplicate rowsDuplicates
전화번호 has 40 (27.4%) missing valuesMissing

Reproduction

Analysis started2024-04-17 15:03:26.790847
Analysis finished2024-04-17 15:03:27.000350
Duration0.21 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

명칭
Text

Distinct140
Distinct (%)95.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2024-04-18T00:03:27.133495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length12
Mean length9.3972603
Min length5

Characters and Unicode

Total characters1372
Distinct characters165
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique136 ?
Unique (%)93.2%

Sample

1st row도담행정사사무소
2nd row반도 행정사 사무소
3rd row전상훈 행정사사무소
4th row동반행정사사무소
5th row청호행정사사무소
ValueCountFrequency (%)
행정사 35
 
15.1%
사무소 25
 
10.8%
행정사사무소 8
 
3.4%
행정사무소 7
 
3.0%
토탈행정사사무소 3
 
1.3%
행정사합동사무소 3
 
1.3%
영일 3
 
1.3%
법무사사무소 2
 
0.9%
킹덤합동사무소 2
 
0.9%
전갑삼일반행정서사 2
 
0.9%
Other values (142) 142
61.2%
2024-04-18T00:03:27.417506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
242
17.6%
148
 
10.8%
137
 
10.0%
123
 
9.0%
117
 
8.5%
86
 
6.3%
44
 
3.2%
37
 
2.7%
26
 
1.9%
22
 
1.6%
Other values (155) 390
28.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1280
93.3%
Space Separator 86
 
6.3%
Uppercase Letter 5
 
0.4%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
242
18.9%
148
11.6%
137
 
10.7%
123
 
9.6%
117
 
9.1%
44
 
3.4%
37
 
2.9%
26
 
2.0%
22
 
1.7%
21
 
1.6%
Other values (148) 363
28.4%
Uppercase Letter
ValueCountFrequency (%)
S 1
20.0%
J 1
20.0%
K 1
20.0%
M 1
20.0%
E 1
20.0%
Space Separator
ValueCountFrequency (%)
86
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1280
93.3%
Common 87
 
6.3%
Latin 5
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
242
18.9%
148
11.6%
137
 
10.7%
123
 
9.6%
117
 
9.1%
44
 
3.4%
37
 
2.9%
26
 
2.0%
22
 
1.7%
21
 
1.6%
Other values (148) 363
28.4%
Latin
ValueCountFrequency (%)
S 1
20.0%
J 1
20.0%
K 1
20.0%
M 1
20.0%
E 1
20.0%
Common
ValueCountFrequency (%)
86
98.9%
& 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1280
93.3%
ASCII 92
 
6.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
242
18.9%
148
11.6%
137
 
10.7%
123
 
9.6%
117
 
9.1%
44
 
3.4%
37
 
2.9%
26
 
2.0%
22
 
1.7%
21
 
1.6%
Other values (148) 363
28.4%
ASCII
ValueCountFrequency (%)
86
93.5%
S 1
 
1.1%
J 1
 
1.1%
& 1
 
1.1%
K 1
 
1.1%
M 1
 
1.1%
E 1
 
1.1%
Distinct116
Distinct (%)79.5%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
Minimum1982-03-30 00:00:00
Maximum2020-12-01 00:00:00
2024-04-18T00:03:27.528311image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-18T00:03:27.636377image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

전화번호
Text

MISSING 

Distinct101
Distinct (%)95.3%
Missing40
Missing (%)27.4%
Memory size1.3 KiB
2024-04-18T00:03:27.836530image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.04717
Min length12

Characters and Unicode

Total characters1277
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)91.5%

Sample

1st row053-242-8030
2nd row053-743-4989
3rd row053-794-2004
4th row053-783-7900
5th row053-252-5300
ValueCountFrequency (%)
053-782-3581 3
 
2.8%
053-753-3300 2
 
1.9%
053-745-3577 2
 
1.9%
053-741-2941 2
 
1.9%
053-765-5543 1
 
0.9%
053-763-8645 1
 
0.9%
053-753-2800 1
 
0.9%
053-741-1738 1
 
0.9%
053-763-9148 1
 
0.9%
053-744-3160 1
 
0.9%
Other values (91) 91
85.8%
2024-04-18T00:03:28.117204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 212
16.6%
5 200
15.7%
0 170
13.3%
3 170
13.3%
7 124
9.7%
4 88
6.9%
1 74
 
5.8%
2 67
 
5.2%
8 65
 
5.1%
6 60
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1065
83.4%
Dash Punctuation 212
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 200
18.8%
0 170
16.0%
3 170
16.0%
7 124
11.6%
4 88
8.3%
1 74
 
6.9%
2 67
 
6.3%
8 65
 
6.1%
6 60
 
5.6%
9 47
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 212
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1277
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 212
16.6%
5 200
15.7%
0 170
13.3%
3 170
13.3%
7 124
9.7%
4 88
6.9%
1 74
 
5.8%
2 67
 
5.2%
8 65
 
5.1%
6 60
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1277
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 212
16.6%
5 200
15.7%
0 170
13.3%
3 170
13.3%
7 124
9.7%
4 88
6.9%
1 74
 
5.8%
2 67
 
5.2%
8 65
 
5.1%
6 60
 
4.7%

Missing values

2024-04-18T00:03:26.910580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-18T00:03:26.976091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

명칭신고연월일전화번호
0도담행정사사무소2020-12-01<NA>
1반도 행정사 사무소2017-10-11053-242-8030
2전상훈 행정사사무소2020-06-23<NA>
3동반행정사사무소2017-04-03<NA>
4청호행정사사무소2020-08-05<NA>
5요한행정사사무소2020-07-29<NA>
6에펠탑행정사사무소2020-07-29053-743-4989
7무학행정사사무소2019-08-09<NA>
8티케이행정사사무소2020-07-15053-794-2004
9부동산전문 행정사사무소2020-07-06053-783-7900
명칭신고연월일전화번호
136일반행정서사조덕래사무소1996-02-16053-555-9987
137일반행정서사이범태사무소1996-02-16053-653-8934
138일반행정서사김명석사무소1996-02-16053-586-0664
139전갑삼일반행정서사1996-02-16053-741-2941
140일반행정서사이태근사무소1996-02-16053-753-1129
141전갑삼일반행정서사1996-02-16053-741-2941
142일반행정서사구일회사무소1996-02-16053-743-1360
143일반행정서사권동원사무소1996-02-16053-743-9985
144일반행정서사정지경사무소1995-11-27054-246-0141
145일방행정서사임장규사무소1995-11-06053-794-1964

Duplicate rows

Most frequently occurring

명칭신고연월일전화번호# duplicates
0영일 행정사합동사무소2018-05-11053-782-35813
1전갑삼일반행정서사1996-02-16053-741-29412
2킹덤합동사무소2012-12-07053-753-33002