Overview

Dataset statistics

Number of variables6
Number of observations57
Missing cells32
Missing cells (%)9.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.9 KiB
Average record size in memory51.3 B

Variable types

Numeric1
Text4
Categorical1

Dataset

Description인천광역시 연수구 소재 행정사 현황의 데이터에서 사무소명칭, 주소, 전화번호의 목록- 행정사 종류, 사무소명칭, 소재지, 전화번호로 구분
Author인천광역시 연수구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15028749&srcSe=7661IVAWM27C61E190

Alerts

행정사종류 is highly imbalanced (62.7%)Imbalance
사무소전화번호 has 32 (56.1%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-01-28 08:43:25.908988
Analysis finished2024-01-28 08:43:26.526406
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct57
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29
Minimum1
Maximum57
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size645.0 B
2024-01-28T17:43:26.593055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.8
Q115
median29
Q343
95-th percentile54.2
Maximum57
Range56
Interquartile range (IQR)28

Descriptive statistics

Standard deviation16.598193
Coefficient of variation (CV)0.57235147
Kurtosis-1.2
Mean29
Median Absolute Deviation (MAD)14
Skewness0
Sum1653
Variance275.5
MonotonicityStrictly increasing
2024-01-28T17:43:26.730189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.8%
44 1
 
1.8%
32 1
 
1.8%
33 1
 
1.8%
34 1
 
1.8%
35 1
 
1.8%
36 1
 
1.8%
37 1
 
1.8%
38 1
 
1.8%
39 1
 
1.8%
Other values (47) 47
82.5%
ValueCountFrequency (%)
1 1
1.8%
2 1
1.8%
3 1
1.8%
4 1
1.8%
5 1
1.8%
6 1
1.8%
7 1
1.8%
8 1
1.8%
9 1
1.8%
10 1
1.8%
ValueCountFrequency (%)
57 1
1.8%
56 1
1.8%
55 1
1.8%
54 1
1.8%
53 1
1.8%
52 1
1.8%
51 1
1.8%
50 1
1.8%
49 1
1.8%
48 1
1.8%
Distinct49
Distinct (%)86.0%
Missing0
Missing (%)0.0%
Memory size588.0 B
2024-01-28T17:43:26.940768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length17
Mean length9.5614035
Min length5

Characters and Unicode

Total characters545
Distinct characters130
Distinct categories8 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)80.7%

Sample

1st row윈윈행정사사무소
2nd row행정사법인 태평양
3rd row행정사법인 태평양
4th row행정사법인 태평양
5th row행정사법인 태평양
ValueCountFrequency (%)
행정사법인 9
 
9.2%
행정사사무소 8
 
8.2%
행정사 7
 
7.1%
사무소 7
 
7.1%
해사인 5
 
5.1%
태평양 4
 
4.1%
행정사무소 3
 
3.1%
인천송도 2
 
2.0%
외번/일반 2
 
2.0%
공감 2
 
2.0%
Other values (49) 49
50.0%
2024-01-28T17:43:27.317177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
96
17.6%
58
 
10.6%
55
 
10.1%
41
 
7.5%
39
 
7.2%
39
 
7.2%
22
 
4.0%
9
 
1.7%
6
 
1.1%
5
 
0.9%
Other values (120) 175
32.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 464
85.1%
Space Separator 41
 
7.5%
Lowercase Letter 16
 
2.9%
Uppercase Letter 13
 
2.4%
Open Punctuation 3
 
0.6%
Close Punctuation 3
 
0.6%
Other Punctuation 3
 
0.6%
Decimal Number 2
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
20.7%
58
12.5%
55
11.9%
39
 
8.4%
39
 
8.4%
22
 
4.7%
9
 
1.9%
6
 
1.3%
5
 
1.1%
5
 
1.1%
Other values (92) 130
28.0%
Lowercase Letter
ValueCountFrequency (%)
l 4
25.0%
n 2
12.5%
o 2
12.5%
g 1
 
6.2%
i 1
 
6.2%
t 1
 
6.2%
u 1
 
6.2%
s 1
 
6.2%
y 1
 
6.2%
a 1
 
6.2%
Uppercase Letter
ValueCountFrequency (%)
G 2
15.4%
S 2
15.4%
K 2
15.4%
A 1
7.7%
I 1
7.7%
V 1
7.7%
B 1
7.7%
D 1
7.7%
C 1
7.7%
O 1
7.7%
Other Punctuation
ValueCountFrequency (%)
/ 2
66.7%
& 1
33.3%
Decimal Number
ValueCountFrequency (%)
4 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
41
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 463
85.0%
Common 52
 
9.5%
Latin 29
 
5.3%
Han 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
20.7%
58
12.5%
55
11.9%
39
 
8.4%
39
 
8.4%
22
 
4.8%
9
 
1.9%
6
 
1.3%
5
 
1.1%
5
 
1.1%
Other values (91) 129
27.9%
Latin
ValueCountFrequency (%)
l 4
 
13.8%
G 2
 
6.9%
n 2
 
6.9%
S 2
 
6.9%
o 2
 
6.9%
K 2
 
6.9%
A 1
 
3.4%
I 1
 
3.4%
V 1
 
3.4%
B 1
 
3.4%
Other values (11) 11
37.9%
Common
ValueCountFrequency (%)
41
78.8%
( 3
 
5.8%
) 3
 
5.8%
/ 2
 
3.8%
4 1
 
1.9%
2 1
 
1.9%
& 1
 
1.9%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 463
85.0%
ASCII 81
 
14.9%
CJK 1
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
96
20.7%
58
12.5%
55
11.9%
39
 
8.4%
39
 
8.4%
22
 
4.8%
9
 
1.9%
6
 
1.3%
5
 
1.1%
5
 
1.1%
Other values (91) 129
27.9%
ASCII
ValueCountFrequency (%)
41
50.6%
l 4
 
4.9%
( 3
 
3.7%
) 3
 
3.7%
G 2
 
2.5%
n 2
 
2.5%
S 2
 
2.5%
o 2
 
2.5%
K 2
 
2.5%
/ 2
 
2.5%
Other values (18) 18
22.2%
CJK
ValueCountFrequency (%)
1
100.0%

행정사종류
Categorical

IMBALANCE 

Distinct3
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size588.0 B
일반행정사
51 
해사행정사
 
3
외국어번역행정사(영어)
 
3

Length

Max length12
Median length5
Mean length5.3684211
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반행정사
2nd row일반행정사
3rd row일반행정사
4th row일반행정사
5th row일반행정사

Common Values

ValueCountFrequency (%)
일반행정사 51
89.5%
해사행정사 3
 
5.3%
외국어번역행정사(영어) 3
 
5.3%

Length

2024-01-28T17:43:27.438506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-28T17:43:27.528145image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반행정사 51
89.5%
해사행정사 3
 
5.3%
외국어번역행정사(영어 3
 
5.3%
Distinct53
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size588.0 B
2024-01-28T17:43:27.727137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters171
Distinct characters75
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)86.0%

Sample

1st row고성자
2nd row이종일
3rd row류길호
4th row함현수
5th row윤기현
ValueCountFrequency (%)
이동주 2
 
3.5%
김동진 2
 
3.5%
박인수 2
 
3.5%
조형진 2
 
3.5%
이상호 1
 
1.8%
간수웅 1
 
1.8%
정우택 1
 
1.8%
오명훈 1
 
1.8%
김창환 1
 
1.8%
이복훈 1
 
1.8%
Other values (43) 43
75.4%
2024-01-28T17:43:28.026856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
7.6%
8
 
4.7%
7
 
4.1%
7
 
4.1%
7
 
4.1%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
5
 
2.9%
Other values (65) 102
59.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 171
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
7.6%
8
 
4.7%
7
 
4.1%
7
 
4.1%
7
 
4.1%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
5
 
2.9%
Other values (65) 102
59.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 171
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
7.6%
8
 
4.7%
7
 
4.1%
7
 
4.1%
7
 
4.1%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
5
 
2.9%
Other values (65) 102
59.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 171
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
7.6%
8
 
4.7%
7
 
4.1%
7
 
4.1%
7
 
4.1%
6
 
3.5%
6
 
3.5%
5
 
2.9%
5
 
2.9%
5
 
2.9%
Other values (65) 102
59.6%
Distinct48
Distinct (%)84.2%
Missing0
Missing (%)0.0%
Memory size588.0 B
2024-01-28T17:43:28.274747image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length45
Mean length35.052632
Min length24

Characters and Unicode

Total characters1998
Distinct characters135
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)77.2%

Sample

1st row인천광역시 연수구 앵고개로 242, 503-1호 (동춘동)
2nd row인천광역시 연수구 센트럴로 313, C동 814호 (송도동)
3rd row인천광역시 연수구 센트럴로 313, C동 814호 (송도동)
4th row인천광역시 연수구 센트럴로 313, C동 814호 (송도동)
5th row인천광역시 연수구 센트럴로 313, C동 814호 (송도동)
ValueCountFrequency (%)
인천광역시 57
 
15.2%
연수구 57
 
15.2%
센트럴로 17
 
4.5%
송도동 16
 
4.3%
313, 10
 
2.7%
263, 7
 
1.9%
c동 6
 
1.6%
11층 5
 
1.3%
5~6호 5
 
1.3%
동춘동, 5
 
1.3%
Other values (148) 190
50.7%
2024-01-28T17:43:28.625685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
318
 
15.9%
89
 
4.5%
1 88
 
4.4%
73
 
3.7%
73
 
3.7%
66
 
3.3%
66
 
3.3%
61
 
3.1%
60
 
3.0%
3 59
 
3.0%
Other values (125) 1045
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1094
54.8%
Decimal Number 372
 
18.6%
Space Separator 318
 
15.9%
Other Punctuation 66
 
3.3%
Open Punctuation 57
 
2.9%
Close Punctuation 57
 
2.9%
Uppercase Letter 23
 
1.2%
Dash Punctuation 6
 
0.3%
Math Symbol 5
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
 
8.1%
73
 
6.7%
73
 
6.7%
66
 
6.0%
61
 
5.6%
60
 
5.5%
58
 
5.3%
57
 
5.2%
57
 
5.2%
57
 
5.2%
Other values (101) 443
40.5%
Decimal Number
ValueCountFrequency (%)
1 88
23.7%
3 59
15.9%
2 50
13.4%
0 47
12.6%
4 32
 
8.6%
5 30
 
8.1%
6 24
 
6.5%
8 17
 
4.6%
7 17
 
4.6%
9 8
 
2.2%
Uppercase Letter
ValueCountFrequency (%)
C 8
34.8%
B 7
30.4%
D 2
 
8.7%
A 2
 
8.7%
T 1
 
4.3%
E 1
 
4.3%
R 1
 
4.3%
M 1
 
4.3%
Space Separator
ValueCountFrequency (%)
318
100.0%
Other Punctuation
ValueCountFrequency (%)
66
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1094
54.8%
Common 881
44.1%
Latin 23
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
8.1%
73
 
6.7%
73
 
6.7%
66
 
6.0%
61
 
5.6%
60
 
5.5%
58
 
5.3%
57
 
5.2%
57
 
5.2%
57
 
5.2%
Other values (101) 443
40.5%
Common
ValueCountFrequency (%)
318
36.1%
1 88
 
10.0%
66
 
7.5%
3 59
 
6.7%
( 57
 
6.5%
) 57
 
6.5%
2 50
 
5.7%
0 47
 
5.3%
4 32
 
3.6%
5 30
 
3.4%
Other values (6) 77
 
8.7%
Latin
ValueCountFrequency (%)
C 8
34.8%
B 7
30.4%
D 2
 
8.7%
A 2
 
8.7%
T 1
 
4.3%
E 1
 
4.3%
R 1
 
4.3%
M 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1094
54.8%
ASCII 838
41.9%
None 66
 
3.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
318
37.9%
1 88
 
10.5%
3 59
 
7.0%
( 57
 
6.8%
) 57
 
6.8%
2 50
 
6.0%
0 47
 
5.6%
4 32
 
3.8%
5 30
 
3.6%
6 24
 
2.9%
Other values (13) 76
 
9.1%
Hangul
ValueCountFrequency (%)
89
 
8.1%
73
 
6.7%
73
 
6.7%
66
 
6.0%
61
 
5.6%
60
 
5.5%
58
 
5.3%
57
 
5.2%
57
 
5.2%
57
 
5.2%
Other values (101) 443
40.5%
None
ValueCountFrequency (%)
66
100.0%

사무소전화번호
Text

MISSING 

Distinct18
Distinct (%)72.0%
Missing32
Missing (%)56.1%
Memory size588.0 B
2024-01-28T17:43:28.797973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length8.56
Min length1

Characters and Unicode

Total characters214
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)68.0%

Sample

1st row
2nd row
3rd row
4th row
5th row032-832-8258
ValueCountFrequency (%)
032-280-9000 1
 
5.9%
032-875-8110 1
 
5.9%
032-858-6553 1
 
5.9%
032-831-2857 1
 
5.9%
032-858-2260 1
 
5.9%
032-552-0069 1
 
5.9%
032-576-6777 1
 
5.9%
032-832-0937 1
 
5.9%
032-773-3115 1
 
5.9%
032-832-8258 1
 
5.9%
Other values (7) 7
41.2%
2024-01-28T17:43:29.080337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 34
15.9%
0 32
15.0%
3 27
12.6%
2 26
12.1%
8 22
10.3%
7 18
8.4%
5 16
7.5%
1 12
 
5.6%
6 10
 
4.7%
8
 
3.7%
Other values (2) 9
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 172
80.4%
Dash Punctuation 34
 
15.9%
Space Separator 8
 
3.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 32
18.6%
3 27
15.7%
2 26
15.1%
8 22
12.8%
7 18
10.5%
5 16
9.3%
1 12
 
7.0%
6 10
 
5.8%
9 5
 
2.9%
4 4
 
2.3%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 214
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 34
15.9%
0 32
15.0%
3 27
12.6%
2 26
12.1%
8 22
10.3%
7 18
8.4%
5 16
7.5%
1 12
 
5.6%
6 10
 
4.7%
8
 
3.7%
Other values (2) 9
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 214
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 34
15.9%
0 32
15.0%
3 27
12.6%
2 26
12.1%
8 22
10.3%
7 18
8.4%
5 16
7.5%
1 12
 
5.6%
6 10
 
4.7%
8
 
3.7%
Other values (2) 9
 
4.2%

Interactions

2024-01-28T17:43:26.295347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-28T17:43:29.167208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사무소명칭행정사종류행정사 성명사무소소재지사무소전화번호
연번1.0000.9900.3020.9870.9900.911
사무소명칭0.9901.0000.0000.9981.0001.000
행정사종류0.3020.0001.0000.0000.0000.571
행정사 성명0.9870.9980.0001.0001.0000.989
사무소소재지0.9901.0000.0001.0001.0000.993
사무소전화번호0.9111.0000.5710.9890.9931.000
2024-01-28T17:43:29.261414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정사종류
연번1.0000.169
행정사종류0.1691.000

Missing values

2024-01-28T17:43:26.398124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-28T17:43:26.483832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사무소명칭행정사종류행정사 성명사무소소재지사무소전화번호
01윈윈행정사사무소일반행정사고성자인천광역시 연수구 앵고개로 242, 503-1호 (동춘동)<NA>
12행정사법인 태평양일반행정사이종일인천광역시 연수구 센트럴로 313, C동 814호 (송도동)<NA>
23행정사법인 태평양일반행정사류길호인천광역시 연수구 센트럴로 313, C동 814호 (송도동)<NA>
34행정사법인 태평양일반행정사함현수인천광역시 연수구 센트럴로 313, C동 814호 (송도동)<NA>
45행정사법인 태평양일반행정사윤기현인천광역시 연수구 센트럴로 313, C동 814호 (송도동)<NA>
56OK행정사 사무소일반행정사김만호인천광역시 연수구 센트럴로 313, B동 523호 (송도동)<NA>
67불꽃마음행정사일반행정사김연인인천광역시 연수구 센트럴로 313, B동 606호 (송도동)<NA>
78행정사법인 해사인해사행정사조형진인천광역시 연수구 센트럴로 263, 11층 5~6호 (송도동)
89행정사법인 해사인일반행정사김동진인천광역시 연수구 센트럴로 263, 11층 5~6호 (송도동)
910행정사법인 해사인일반행정사이윤우인천광역시 연수구 센트럴로 263, 11층 5~6호 (송도동)
연번사무소명칭행정사종류행정사 성명사무소소재지사무소전화번호
4748인천종합행정사일반행정사홍성덕인천광역시 연수구 먼우금로83번길 49, 302동 501호 (동춘동, 대림3차아파트)032-576-6777
4849행정사백인석사무소일반행정사백인석인천광역시 연수구 컨벤시아대로130번길 58, 106동 1201호 (송도동, 송도자이하버뷰1단지)032-552-0069
4950장종호행정사사무소일반행정사장종호인천광역시 연수구 인천타워대로 323, D동 115호 (송도동)032-858-2260
5051(주) 세온행정사 사무소일반행정사황성남인천광역시 연수구 인천타워대로 323, 씨동 1702호 (송도동, 송도 센트로드)<NA>
5152박재필행정사사무소일반행정사박재필인천광역시 연수구 원인재로 124, 306호 (동춘동, 한양1차상가)<NA>
5253오명종행정사일반행정사오명종인천광역시 연수구 원인재로 124, B25호 (동춘동, 한양1차상가)<NA>
5354김광식행정사일반행정사김광식인천광역시 연수구 새말로 20, 지하5호 (연수동, 상가)<NA>
5455소백행정사사무소일반행정사간수웅인천광역시 연수구 청량로109번길 37 (옥련동)032-831-2857
5556이종원행정사일반행정사이종원인천광역시 연수구 동곡재로 182 (동춘동)<NA>
5657홍정희번역행정사사무소외국어번역행정사(영어)홍정희인천광역시 연수구 컨벤시아대로42번길 95, 1005동 1301호 (송도동, 더샵 엑스포)070-4205-7785