Overview

Dataset statistics

Number of variables7
Number of observations116
Missing cells38
Missing cells (%)4.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.6 KiB
Average record size in memory58.1 B

Variable types

Numeric1
DateTime1
Text3
Categorical2

Dataset

Description경기도_의정부시_행정사 사무소 현황 데이터로 순번, 신고일자, 상호명, 소재지도로명주소, 전화번호, 영업상태 등의 항목으로 구성되어 있습니다.
Author경기도 의정부시
URLhttps://www.data.go.kr/data/15039893/fileData.do

Alerts

엉업상태 has constant value ""Constant
번호 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 번호High correlation
전화번호 has 38 (32.8%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:11:23.887338
Analysis finished2023-12-12 10:11:24.898034
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct116
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean58.5
Minimum1
Maximum116
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-12T19:11:24.992902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile6.75
Q129.75
median58.5
Q387.25
95-th percentile110.25
Maximum116
Range115
Interquartile range (IQR)57.5

Descriptive statistics

Standard deviation33.630343
Coefficient of variation (CV)0.57487767
Kurtosis-1.2
Mean58.5
Median Absolute Deviation (MAD)29
Skewness0
Sum6786
Variance1131
MonotonicityStrictly increasing
2023-12-12T19:11:25.177442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.9%
75 1
 
0.9%
87 1
 
0.9%
86 1
 
0.9%
85 1
 
0.9%
84 1
 
0.9%
83 1
 
0.9%
82 1
 
0.9%
81 1
 
0.9%
80 1
 
0.9%
Other values (106) 106
91.4%
ValueCountFrequency (%)
1 1
0.9%
2 1
0.9%
3 1
0.9%
4 1
0.9%
5 1
0.9%
6 1
0.9%
7 1
0.9%
8 1
0.9%
9 1
0.9%
10 1
0.9%
ValueCountFrequency (%)
116 1
0.9%
115 1
0.9%
114 1
0.9%
113 1
0.9%
112 1
0.9%
111 1
0.9%
110 1
0.9%
109 1
0.9%
108 1
0.9%
107 1
0.9%
Distinct107
Distinct (%)92.2%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
Minimum1995-12-19 00:00:00
Maximum2022-08-24 00:00:00
2023-12-12T19:11:25.332219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:11:25.489585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct115
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T19:11:25.793077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length17
Mean length8.4655172
Min length3

Characters and Unicode

Total characters982
Distinct characters171
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique114 ?
Unique (%)98.3%

Sample

1st row정범진 행정사
2nd row김종하 행정사
3rd row한승웅행정사
4th row김진규행정사
5th row김주환 행정사
ValueCountFrequency (%)
행정사 32
 
16.9%
사무소 23
 
12.2%
행정사사무소 5
 
2.6%
행정사무소 4
 
2.1%
sl 2
 
1.1%
행운경영행정컨설팅 1
 
0.5%
정석화 1
 
0.5%
이정삼 1
 
0.5%
김보석 1
 
0.5%
심경섭행정사 1
 
0.5%
Other values (118) 118
62.4%
2023-12-12T19:11:26.262796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
176
17.9%
121
 
12.3%
113
 
11.5%
74
 
7.5%
73
 
7.4%
73
 
7.4%
17
 
1.7%
11
 
1.1%
8
 
0.8%
8
 
0.8%
Other values (161) 308
31.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 887
90.3%
Space Separator 73
 
7.4%
Uppercase Letter 12
 
1.2%
Lowercase Letter 4
 
0.4%
Close Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
176
19.8%
121
13.6%
113
12.7%
74
 
8.3%
73
 
8.2%
17
 
1.9%
11
 
1.2%
8
 
0.9%
8
 
0.9%
7
 
0.8%
Other values (146) 279
31.5%
Uppercase Letter
ValueCountFrequency (%)
S 3
25.0%
P 2
16.7%
L 2
16.7%
F 2
16.7%
B 1
 
8.3%
G 1
 
8.3%
D 1
 
8.3%
Lowercase Letter
ValueCountFrequency (%)
m 1
25.0%
a 1
25.0%
e 1
25.0%
r 1
25.0%
Space Separator
ValueCountFrequency (%)
73
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 885
90.1%
Common 79
 
8.0%
Latin 16
 
1.6%
Han 2
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
176
19.9%
121
13.7%
113
12.8%
74
 
8.4%
73
 
8.2%
17
 
1.9%
11
 
1.2%
8
 
0.9%
8
 
0.9%
7
 
0.8%
Other values (144) 277
31.3%
Latin
ValueCountFrequency (%)
S 3
18.8%
P 2
12.5%
L 2
12.5%
F 2
12.5%
B 1
 
6.2%
G 1
 
6.2%
m 1
 
6.2%
a 1
 
6.2%
e 1
 
6.2%
r 1
 
6.2%
Common
ValueCountFrequency (%)
73
92.4%
) 2
 
2.5%
( 2
 
2.5%
& 2
 
2.5%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 885
90.1%
ASCII 95
 
9.7%
CJK 2
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
176
19.9%
121
13.7%
113
12.8%
74
 
8.4%
73
 
8.2%
17
 
1.9%
11
 
1.2%
8
 
0.9%
8
 
0.9%
7
 
0.8%
Other values (144) 277
31.3%
ASCII
ValueCountFrequency (%)
73
76.8%
S 3
 
3.2%
) 2
 
2.1%
P 2
 
2.1%
( 2
 
2.1%
L 2
 
2.1%
F 2
 
2.1%
& 2
 
2.1%
B 1
 
1.1%
G 1
 
1.1%
Other values (5) 5
 
5.3%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct109
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
2023-12-12T19:11:26.706777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length42
Mean length34.112069
Min length20

Characters and Unicode

Total characters3957
Distinct characters180
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique104 ?
Unique (%)89.7%

Sample

1st row경기도 의정부시 녹양로34번길 22 (가능동)
2nd row경기도 의정부시 녹양로34번길 22 (가능동)
3rd row경기도 의정부시 범골로 137 (의정부동)
4th row경기도 의정부시 동일로 451-25 (의정부동)
5th row경기도 의정부시 호동로 56 (호원동)
ValueCountFrequency (%)
경기도 115
 
15.2%
의정부시 115
 
15.2%
가능동 23
 
3.0%
의정부동 19
 
2.5%
녹양로34번길 16
 
2.1%
호원동, 11
 
1.5%
호국로 10
 
1.3%
시민로 9
 
1.2%
의정부동, 8
 
1.1%
1층 6
 
0.8%
Other values (304) 425
56.1%
2023-12-12T19:11:27.407436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
642
 
16.2%
155
 
3.9%
154
 
3.9%
154
 
3.9%
1 154
 
3.9%
150
 
3.8%
129
 
3.3%
121
 
3.1%
121
 
3.1%
2 121
 
3.1%
Other values (170) 2056
52.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2192
55.4%
Decimal Number 752
 
19.0%
Space Separator 642
 
16.2%
Other Punctuation 118
 
3.0%
Open Punctuation 116
 
2.9%
Close Punctuation 116
 
2.9%
Dash Punctuation 17
 
0.4%
Lowercase Letter 2
 
0.1%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
155
 
7.1%
154
 
7.0%
154
 
7.0%
150
 
6.8%
129
 
5.9%
121
 
5.5%
121
 
5.5%
117
 
5.3%
115
 
5.2%
99
 
4.5%
Other values (152) 877
40.0%
Decimal Number
ValueCountFrequency (%)
1 154
20.5%
2 121
16.1%
0 114
15.2%
3 88
11.7%
4 72
9.6%
5 66
8.8%
6 40
 
5.3%
8 40
 
5.3%
7 31
 
4.1%
9 26
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
A 1
50.0%
B 1
50.0%
Space Separator
ValueCountFrequency (%)
642
100.0%
Other Punctuation
ValueCountFrequency (%)
118
100.0%
Open Punctuation
ValueCountFrequency (%)
( 116
100.0%
Close Punctuation
ValueCountFrequency (%)
) 116
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2192
55.4%
Common 1761
44.5%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
155
 
7.1%
154
 
7.0%
154
 
7.0%
150
 
6.8%
129
 
5.9%
121
 
5.5%
121
 
5.5%
117
 
5.3%
115
 
5.2%
99
 
4.5%
Other values (152) 877
40.0%
Common
ValueCountFrequency (%)
642
36.5%
1 154
 
8.7%
2 121
 
6.9%
118
 
6.7%
( 116
 
6.6%
) 116
 
6.6%
0 114
 
6.5%
3 88
 
5.0%
4 72
 
4.1%
5 66
 
3.7%
Other values (5) 154
 
8.7%
Latin
ValueCountFrequency (%)
e 2
50.0%
A 1
25.0%
B 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2192
55.4%
ASCII 1647
41.6%
None 118
 
3.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
642
39.0%
1 154
 
9.4%
2 121
 
7.3%
( 116
 
7.0%
) 116
 
7.0%
0 114
 
6.9%
3 88
 
5.3%
4 72
 
4.4%
5 66
 
4.0%
6 40
 
2.4%
Other values (7) 118
 
7.2%
Hangul
ValueCountFrequency (%)
155
 
7.1%
154
 
7.0%
154
 
7.0%
150
 
6.8%
129
 
5.9%
121
 
5.5%
121
 
5.5%
117
 
5.3%
115
 
5.2%
99
 
4.5%
Other values (152) 877
40.0%
None
ValueCountFrequency (%)
118
100.0%

전화번호
Text

MISSING 

Distinct76
Distinct (%)97.4%
Missing38
Missing (%)32.8%
Memory size1.0 KiB
2023-12-12T19:11:27.803797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.910256
Min length1

Characters and Unicode

Total characters929
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)94.9%

Sample

1st row031-875-7621
2nd row031-872-9675
3rd row031-875-1291
4th row031-843-1810
5th row070-7559-1756
ValueCountFrequency (%)
031-876-0336 2
 
2.5%
070-4866-5174 2
 
2.5%
031-878-5390 1
 
1.3%
031-873-2200 1
 
1.3%
031-871-8202 1
 
1.3%
031-878-4830 1
 
1.3%
031-841-1900 1
 
1.3%
031-874-5201 1
 
1.3%
031-827-9134 1
 
1.3%
02-552-5587 1
 
1.3%
Other values (67) 67
84.8%
2023-12-12T19:11:28.344514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 152
16.4%
0 130
14.0%
3 122
13.1%
1 119
12.8%
8 100
10.8%
7 69
7.4%
2 56
 
6.0%
6 51
 
5.5%
5 46
 
5.0%
4 45
 
4.8%
Other values (2) 39
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 774
83.3%
Dash Punctuation 152
 
16.4%
Space Separator 3
 
0.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 130
16.8%
3 122
15.8%
1 119
15.4%
8 100
12.9%
7 69
8.9%
2 56
7.2%
6 51
 
6.6%
5 46
 
5.9%
4 45
 
5.8%
9 36
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 152
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 929
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 152
16.4%
0 130
14.0%
3 122
13.1%
1 119
12.8%
8 100
10.8%
7 69
7.4%
2 56
 
6.0%
6 51
 
5.5%
5 46
 
5.0%
4 45
 
4.8%
Other values (2) 39
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 929
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 152
16.4%
0 130
14.0%
3 122
13.1%
1 119
12.8%
8 100
10.8%
7 69
7.4%
2 56
 
6.0%
6 51
 
5.5%
5 46
 
5.0%
4 45
 
4.8%
Other values (2) 39
 
4.2%

엉업상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
영업중
116 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영업중
2nd row영업중
3rd row영업중
4th row영업중
5th row영업중

Common Values

ValueCountFrequency (%)
영업중 116
100.0%

Length

2023-12-12T19:11:28.545712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:11:28.690242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
영업중 116
100.0%

비고
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
<NA>
78 
전화번호 미기재 또는 휴대전화번호 기재
38 

Length

Max length21
Median length4
Mean length9.5689655
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 78
67.2%
전화번호 미기재 또는 휴대전화번호 기재 38
32.8%

Length

2023-12-12T19:11:28.809751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:11:28.923841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 78
29.1%
전화번호 38
14.2%
미기재 38
14.2%
또는 38
14.2%
휴대전화번호 38
14.2%
기재 38
14.2%

Interactions

2023-12-12T19:11:24.232740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:11:29.003711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호전화번호
번호1.0000.983
전화번호0.9831.000
2023-12-12T19:11:29.113716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호비고
번호1.0001.000
비고1.0001.000

Missing values

2023-12-12T19:11:24.710730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:11:24.848698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호신고일자상호명소재지도로명주소전화번호엉업상태비고
011995-12-19정범진 행정사경기도 의정부시 녹양로34번길 22 (가능동)031-875-7621영업중<NA>
121996-01-10김종하 행정사경기도 의정부시 녹양로34번길 22 (가능동)031-872-9675영업중<NA>
231999-04-26한승웅행정사경기도 의정부시 범골로 137 (의정부동)031-875-1291영업중<NA>
342000-03-22김진규행정사경기도 의정부시 동일로 451-25 (의정부동)031-843-1810영업중<NA>
452000-04-27김주환 행정사경기도 의정부시 호동로 56 (호원동)070-7559-1756영업중<NA>
562000-06-19박기영행정사경기도 의정부시 범골로107번길 117-21 (의정부동)<NA>영업중전화번호 미기재 또는 휴대전화번호 기재
672000-10-02이효봉행정사경기도 의정부시 평화로 12, 107동 1402호 (호원동, 겅영아파트)031-873-7932영업중<NA>
782000-12-04미래행정사경기도 의정부시 호국로1310번길 7 (의정부동)031-821-1447영업중<NA>
892001-02-13전용철행정사경기도 의정부시 신흥로168번길 23 (의정부동)031-878-3806영업중<NA>
9102001-07-14김영호행정사경기도 의정부시 녹양로34번길 3 (녹양동)031-821-3025영업중<NA>
번호신고일자상호명소재지도로명주소전화번호엉업상태비고
1061072021-10-06재림행정사사무소경기도 의정부시 시민로 287, 110동 1502호(신곡동, e편한세상 신곡 파크비스타)031-874-8189영업중<NA>
1071082021-10-13코리아 행정심판청구 전문행정사 유청길 사무소경기도 의정부시 신흥로258번길 25, 해태프라자 8층 A18호(의정부동)<NA>영업중전화번호 미기재 또는 휴대전화번호 기재
1081092021-11-02드링크 행정사 사무소경기도 의정부시 용민로 441, 1404동 1905호(민락동, 엘에이치브라운빌리지)<NA>영업중전화번호 미기재 또는 휴대전화번호 기재
1091102021-11-18이은태 행정사 사무소경기도 의정부시 천보로416번길 16, 3층(금오동)<NA>영업중전화번호 미기재 또는 휴대전화번호 기재
1101112022-01-07을지행정사사무소경기도 의정부시 시민로 80, 센트럴타워 5층 512호(의정부동)031-871-7808영업중<NA>
1111122022-01-28행정사 김이원사무소경기도 의정부시 시민로19번길 22-6, 3층(의정부동)031-876-1636영업중<NA>
1121132022-02-18행정사무소 수경기도 의정부시 천보로 14, 2층 15호 그레이스모나코 2015호(민락동)02-432-6633영업중<NA>
1131142022-05-12제이S더함 행정사 정석화 사무소경기도 의정부시 부용로95번길 18, 9층 901호 해피플러스 3(금오동)<NA>영업중전화번호 미기재 또는 휴대전화번호 기재
1141152022-08-11행운경영행정컨설팅경기도 의정부시 범골로107번길 128, 103동 302호(의정부동, 삼익 리베리움)<NA>영업중전화번호 미기재 또는 휴대전화번호 기재
1151162022-08-24마이썬 행정사 사무소경기도 의정부시 오목로 150, 201동 2003호(민락동, 민락주공2단지)<NA>영업중전화번호 미기재 또는 휴대전화번호 기재