Overview

Dataset statistics

Number of variables: 4
Number of observations: 266
Missing cells: 80
Missing cells (%): 7.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.4 KiB
Average record size in memory: 32.5 B
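
The missing-cell percentage follows directly from the counts in this report: 80 missing cells out of 266 observations times 4 variables. A minimal sketch of that arithmetic, where `missing_pct` is an illustrative helper name and not part of ydata-profiling:

```python
def missing_pct(n_missing: int, n_rows: int, n_cols: int = 1) -> float:
    """Share of missing cells, in percent, for a table of n_rows x n_cols."""
    total_cells = n_rows * n_cols
    if total_cells == 0:
        raise ValueError("table has no cells")
    return 100.0 * n_missing / total_cells


# Whole dataset: 80 missing cells across 266 rows and 4 columns.
dataset_pct = missing_pct(80, 266, 4)
# Single column (소재지전화번호): 80 missing values across 266 rows.
column_pct = missing_pct(80, 266)

print(f"{dataset_pct:.1f}% {column_pct:.1f}%")  # 7.5% 30.1%
```

The same 80 missing values account for both figures: 7.5% at dataset level and 30.1% for the one column that contains them.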

Variable types

Text: 3
DateTime: 1

Dataset

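Description: Data on the status of door-to-door sales businesses (방문판매업) in Seongnam City, consisting of fields such as business name (사업장명), license date (인허가일자), road-name address (소재지도로명주소), and telephone number (소재지전화번호).
URL: https://www.data.go.kr/data/15032413/fileData.do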

Alerts

소재지전화번호 has 80 (30.1%) missing values [Missing]
사업장명 has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 13:12:39.925241
Analysis finished: 2023-12-12 13:12:40.374219
Duration: 0.45 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

사업장명
Text

UNIQUE 

Distinct: 266
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 36
Median length: 28
Mean length: 9.4097744
Min length: 2

Characters and Unicode

Total characters: 2503
Distinct characters: 374
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
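
The general-category property behind the "Distinct categories" figure can be inspected with Python's standard `unicodedata` module. The sketch below is illustrative only: `category_counts` is a hypothetical helper, and the report's own counts may come from a different Unicode library or version.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category (e.g. 'Lo', 'Zs')."""
    # Normalize to NFC so Hangul syllables are counted as single code points.
    normalized = unicodedata.normalize("NFC", text)
    return Counter(unicodedata.category(ch) for ch in normalized)


# Second sample row of 사업장명 in this report.
counts = category_counts("엘뤼프 엘스타 대림지사점")
print(counts["Lo"], counts["Zs"])  # 11 2
```

Here `Lo` corresponds to the report's "Other Letter" and `Zs` to "Space Separator".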

Unique

Unique: 266
Unique (%): 100.0%

Sample

1st row: 이지라이프
2nd row: 엘뤼프 엘스타 대림지사점
3rd row: ㈜인더코어비즈니스플랫폼
4th row: 닥터손
5th row: 가온
Value | Count | Frequency (%)
주식회사 | 47 | 10.8%
(not recovered) | 9 | 2.1%
마임 | 6 | 1.4%
인셀덤 | 6 | 1.4%
에치와이 | 6 | 1.4%
기아자동차 | 4 | 0.9%
마임분당 | 3 | 0.7%
co | 3 | 0.7%
성남지사 | 3 | 0.7%
분당 | 3 | 0.7%
Other values (334) | 347 | 79.4%

Most occurring characters

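Value | Count | Frequency (%)
(space) | 173 | 6.9%
(not recovered) | 87 | 3.5%
(not recovered) | 84 | 3.4%
(not recovered) | 73 | 2.9%
(not recovered) | 63 | 2.5%
( | 56 | 2.2%
) | 56 | 2.2%
(not recovered) | 53 | 2.1%
(not recovered) | 51 | 2.0%
(not recovered) | 50 | 2.0%
Other values (364) | 1757 | 70.2%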

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1961 | 78.3%
Space Separator | 173 | 6.9%
Lowercase Letter | 124 | 5.0%
Uppercase Letter | 109 | 4.4%
Open Punctuation | 56 | 2.2%
Close Punctuation | 56 | 2.2%
Other Symbol | 12 | 0.5%
Other Punctuation | 6 | 0.2%
Decimal Number | 5 | 0.2%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not recovered) | 87 | 4.4%
(not recovered) | 84 | 4.3%
(not recovered) | 73 | 3.7%
(not recovered) | 63 | 3.2%
(not recovered) | 53 | 2.7%
(not recovered) | 51 | 2.6%
(not recovered) | 50 | 2.5%
(not recovered) | 41 | 2.1%
(not recovered) | 39 | 2.0%
(not recovered) | 36 | 1.8%
Other values (308) | 1384 | 70.6%

Uppercase Letter
Value | Count | Frequency (%)
N | 9 | 8.3%
S | 8 | 7.3%
E | 8 | 7.3%
R | 8 | 7.3%
O | 7 | 6.4%
C | 7 | 6.4%
K | 7 | 6.4%
B | 6 | 5.5%
H | 6 | 5.5%
T | 6 | 5.5%
Other values (13) | 37 | 33.9%

Lowercase Letter
Value | Count | Frequency (%)
o | 15 | 12.1%
e | 15 | 12.1%
n | 12 | 9.7%
a | 12 | 9.7%
r | 10 | 8.1%
l | 9 | 7.3%
t | 9 | 7.3%
i | 8 | 6.5%
u | 5 | 4.0%
h | 5 | 4.0%
Other values (12) | 24 | 19.4%

Decimal Number
Value | Count | Frequency (%)
0 | 2 | 40.0%
2 | 1 | 20.0%
1 | 1 | 20.0%
3 | 1 | 20.0%

Other Punctuation
Value | Count | Frequency (%)
. | 5 | 83.3%
& | 1 | 16.7%

Space Separator
Value | Count | Frequency (%)
(space) | 173 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 56 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 56 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not recovered) | 12 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1971 | 78.7%
Common | 297 | 11.9%
Latin | 233 | 9.3%
Han | 2 | 0.1%

Most frequent character per script

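Hangul
Value | Count | Frequency (%)
(not recovered) | 87 | 4.4%
(not recovered) | 84 | 4.3%
(not recovered) | 73 | 3.7%
(not recovered) | 63 | 3.2%
(not recovered) | 53 | 2.7%
(not recovered) | 51 | 2.6%
(not recovered) | 50 | 2.5%
(not recovered) | 41 | 2.1%
(not recovered) | 39 | 2.0%
(not recovered) | 36 | 1.8%
Other values (307) | 1394 | 70.7%

Latin
Value | Count | Frequency (%)
o | 15 | 6.4%
e | 15 | 6.4%
n | 12 | 5.2%
a | 12 | 5.2%
r | 10 | 4.3%
l | 9 | 3.9%
N | 9 | 3.9%
t | 9 | 3.9%
i | 8 | 3.4%
S | 8 | 3.4%
Other values (35) | 126 | 54.1%

Common
Value | Count | Frequency (%)
(space) | 173 | 58.2%
( | 56 | 18.9%
) | 56 | 18.9%
. | 5 | 1.7%
0 | 2 | 0.7%
- | 1 | 0.3%
2 | 1 | 0.3%
1 | 1 | 0.3%
& | 1 | 0.3%
3 | 1 | 0.3%

Han
Value | Count | Frequency (%)
(not recovered) | 1 | 50.0%
(not recovered) | 1 | 50.0%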

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1959 | 78.3%
ASCII | 530 | 21.2%
None | 12 | 0.5%
CJK | 2 | 0.1%

Most frequent character per block

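ASCII
Value | Count | Frequency (%)
(space) | 173 | 32.6%
( | 56 | 10.6%
) | 56 | 10.6%
o | 15 | 2.8%
e | 15 | 2.8%
n | 12 | 2.3%
a | 12 | 2.3%
r | 10 | 1.9%
l | 9 | 1.7%
N | 9 | 1.7%
Other values (45) | 163 | 30.8%

Hangul
Value | Count | Frequency (%)
(not recovered) | 87 | 4.4%
(not recovered) | 84 | 4.3%
(not recovered) | 73 | 3.7%
(not recovered) | 63 | 3.2%
(not recovered) | 53 | 2.7%
(not recovered) | 51 | 2.6%
(not recovered) | 50 | 2.6%
(not recovered) | 41 | 2.1%
(not recovered) | 39 | 2.0%
(not recovered) | 36 | 1.8%
Other values (306) | 1382 | 70.5%

None
Value | Count | Frequency (%)
(not recovered) | 12 | 100.0%

CJK
Value | Count | Frequency (%)
(not recovered) | 1 | 50.0%
(not recovered) | 1 | 50.0%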

인허가일자
DateTime

Distinct: 248
Distinct (%): 93.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB
Minimum: 1996-11-08 00:00:00
Maximum: 2023-05-12 00:00:00
Histogram with fixed size bins (bins=50)
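
Fixed-size binning splits the range between the minimum and maximum dates into equal-width intervals. A rough sketch using the range reported for this variable, 1996-11-08 to 2023-05-12, with 50 bins; `fixed_bin_edges` is a hypothetical helper, and the profiling tool may compute its bins differently:

```python
from datetime import datetime
from typing import List


def fixed_bin_edges(start: datetime, end: datetime, bins: int) -> List[datetime]:
    """Return bins + 1 equally spaced edges from start to end inclusive."""
    if bins < 1:
        raise ValueError("bins must be positive")
    width = (end - start) / bins  # timedelta divided by int
    # Append `end` explicitly so the last edge is exact.
    return [start + i * width for i in range(bins)] + [end]


edges = fixed_bin_edges(datetime(1996, 11, 8), datetime(2023, 5, 12), 50)
bin_width_days = (edges[1] - edges[0]).days
print(len(edges), bin_width_days)
```

With these inputs each bin spans a little over six months, so a single bar in the histogram aggregates roughly half a year of license dates.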

소재지도로명주소
Text

Distinct: 252
Distinct (%): 94.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.2 KiB

Length

Max length: 60
Median length: 49
Mean length: 39.090226
Min length: 23

Characters and Unicode

Total characters: 10398
Distinct characters: 288
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 244
Unique (%): 91.7%

Sample

1st row: 경기도 성남시 중원구 산성대로 496, 지하층 (은행동)
2nd row: 경기도 성남시 중원구 도촌로8번길 23, 쌍용오피스텔 5층 502호 (도촌동)
3rd row: 경기도 성남시 분당구 판교역로 231, 에이치스퀘어 에스동 710호 (삼평동)
4th row: 경기도 성남시 분당구 안골로48번길 12 (서현동)
5th row: 대구광역시 남구 안지랑로20길 24, 202호 (대명동)
Value | Count | Frequency (%)
경기도 | 265 | 12.6%
성남시 | 265 | 12.6%
분당구 | 143 | 6.8%
중원구 | 69 | 3.3%
수정구 | 53 | 2.5%
야탑동 | 34 | 1.6%
1층 | 34 | 1.6%
상대원동 | 24 | 1.1%
성남대로 | 16 | 0.8%
성남동 | 15 | 0.7%
Other values (637) | 1192 | 56.5%

Most occurring characters

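Value | Count | Frequency (%)
(space) | 1844 | 17.7%
1 | 402 | 3.9%
(not recovered) | 360 | 3.5%
(not recovered) | 346 | 3.3%
(not recovered) | 313 | 3.0%
(not recovered) | 308 | 3.0%
(not recovered) | 282 | 2.7%
(not recovered) | 279 | 2.7%
(not recovered) | 273 | 2.6%
(not recovered) | 272 | 2.6%
Other values (278) | 5719 | 55.0%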

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5905 | 56.8%
Space Separator | 1844 | 17.7%
Decimal Number | 1697 | 16.3%
Other Punctuation | 311 | 3.0%
Close Punctuation | 269 | 2.6%
Open Punctuation | 269 | 2.6%
Uppercase Letter | 53 | 0.5%
Dash Punctuation | 38 | 0.4%
Lowercase Letter | 9 | 0.1%
Math Symbol | 3 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not recovered) | 360 | 6.1%
(not recovered) | 346 | 5.9%
(not recovered) | 313 | 5.3%
(not recovered) | 282 | 4.8%
(not recovered) | 279 | 4.7%
(not recovered) | 273 | 4.6%
(not recovered) | 272 | 4.6%
(not recovered) | 272 | 4.6%
(not recovered) | 269 | 4.6%
(not recovered) | 172 | 2.9%
Other values (236) | 3067 | 51.9%

Uppercase Letter
Value | Count | Frequency (%)
B | 9 | 17.0%
A | 9 | 17.0%
K | 6 | 11.3%
R | 4 | 7.5%
D | 4 | 7.5%
S | 4 | 7.5%
E | 4 | 7.5%
Z | 3 | 5.7%
T | 2 | 3.8%
C | 2 | 3.8%
Other values (6) | 6 | 11.3%

Decimal Number
Value | Count | Frequency (%)
1 | 402 | 23.7%
2 | 218 | 12.8%
0 | 215 | 12.7%
3 | 186 | 11.0%
4 | 150 | 8.8%
5 | 142 | 8.4%
6 | 117 | 6.9%
7 | 105 | 6.2%
9 | 93 | 5.5%
8 | 69 | 4.1%

Lowercase Letter
Value | Count | Frequency (%)
n | 3 | 33.3%
b | 1 | 11.1%
r | 1 | 11.1%
e | 1 | 11.1%
w | 1 | 11.1%
o | 1 | 11.1%
t | 1 | 11.1%

Other Punctuation
Value | Count | Frequency (%)
(not recovered) | 308 | 99.0%
/ | 1 | 0.3%
. | 1 | 0.3%
& | 1 | 0.3%

Space Separator
Value | Count | Frequency (%)
(space) | 1844 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 269 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 269 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 38 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5905 | 56.8%
Common | 4431 | 42.6%
Latin | 62 | 0.6%

Most frequent character per script

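Hangul
Value | Count | Frequency (%)
(not recovered) | 360 | 6.1%
(not recovered) | 346 | 5.9%
(not recovered) | 313 | 5.3%
(not recovered) | 282 | 4.8%
(not recovered) | 279 | 4.7%
(not recovered) | 273 | 4.6%
(not recovered) | 272 | 4.6%
(not recovered) | 272 | 4.6%
(not recovered) | 269 | 4.6%
(not recovered) | 172 | 2.9%
Other values (236) | 3067 | 51.9%

Latin
Value | Count | Frequency (%)
B | 9 | 14.5%
A | 9 | 14.5%
K | 6 | 9.7%
R | 4 | 6.5%
D | 4 | 6.5%
S | 4 | 6.5%
E | 4 | 6.5%
n | 3 | 4.8%
Z | 3 | 4.8%
T | 2 | 3.2%
Other values (13) | 14 | 22.6%

Common
Value | Count | Frequency (%)
(space) | 1844 | 41.6%
1 | 402 | 9.1%
(not recovered) | 308 | 7.0%
) | 269 | 6.1%
( | 269 | 6.1%
2 | 218 | 4.9%
0 | 215 | 4.9%
3 | 186 | 4.2%
4 | 150 | 3.4%
5 | 142 | 3.2%
Other values (9) | 428 | 9.7%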

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5905 | 56.8%
ASCII | 4185 | 40.2%
None | 308 | 3.0%

Most frequent character per block

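ASCII
Value | Count | Frequency (%)
(space) | 1844 | 44.1%
1 | 402 | 9.6%
) | 269 | 6.4%
( | 269 | 6.4%
2 | 218 | 5.2%
0 | 215 | 5.1%
3 | 186 | 4.4%
4 | 150 | 3.6%
5 | 142 | 3.4%
6 | 117 | 2.8%
Other values (31) | 373 | 8.9%

Hangul
Value | Count | Frequency (%)
(not recovered) | 360 | 6.1%
(not recovered) | 346 | 5.9%
(not recovered) | 313 | 5.3%
(not recovered) | 282 | 4.8%
(not recovered) | 279 | 4.7%
(not recovered) | 273 | 4.6%
(not recovered) | 272 | 4.6%
(not recovered) | 272 | 4.6%
(not recovered) | 269 | 4.6%
(not recovered) | 172 | 2.9%
Other values (236) | 3067 | 51.9%

None
Value | Count | Frequency (%)
(not recovered) | 308 | 100.0%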

소재지전화번호
Text

MISSING 

Distinct: 180
Distinct (%): 96.8%
Missing: 80
Missing (%): 30.1%
Memory size: 2.2 KiB

Length

Max length: 17
Median length: 12
Mean length: 12.005376
Min length: 9

Characters and Unicode

Total characters: 2233
Distinct characters: 13
Distinct categories: 4
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 175
Unique (%): 94.1%

Sample

1st row: 031-778-6788
2nd row: 031-8006-6211
3rd row: 031-718-6341
4th row: 02-515-9879
5th row: 1566-5276
Value | Count | Frequency (%)
031 | 9 | 4.2%
(not recovered) | 6 | 2.8%
031-703-7452 | 3 | 1.4%
031-736-1672 | 2 | 0.9%
031-726-4648 | 2 | 0.9%
031-8092-3700 | 2 | 0.9%
031-705-5866 | 2 | 0.9%
031-705-1919 | 1 | 0.5%
1899-4442 | 1 | 0.5%
031-754-2153 | 1 | 0.5%
Other values (183) | 183 | 86.3%

Most occurring characters

Value | Count | Frequency (%)
0 | 377 | 16.9%
- | 350 | 15.7%
1 | 291 | 13.0%
7 | 255 | 11.4%
3 | 249 | 11.2%
5 | 133 | 6.0%
8 | 125 | 5.6%
2 | 115 | 5.2%
6 | 112 | 5.0%
4 | 103 | 4.6%
Other values (3) | 123 | 5.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1855 | 83.1%
Dash Punctuation | 350 | 15.7%
Space Separator | 26 | 1.2%
Math Symbol | 2 | 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 377 | 20.3%
1 | 291 | 15.7%
7 | 255 | 13.7%
3 | 249 | 13.4%
5 | 133 | 7.2%
8 | 125 | 6.7%
2 | 115 | 6.2%
6 | 112 | 6.0%
4 | 103 | 5.6%
9 | 95 | 5.1%

Dash Punctuation
Value | Count | Frequency (%)
- | 350 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 26 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2233 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 377 | 16.9%
- | 350 | 15.7%
1 | 291 | 13.0%
7 | 255 | 11.4%
3 | 249 | 11.2%
5 | 133 | 6.0%
8 | 125 | 5.6%
2 | 115 | 5.2%
6 | 112 | 5.0%
4 | 103 | 4.6%
Other values (3) | 123 | 5.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2233 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 377 | 16.9%
- | 350 | 15.7%
1 | 291 | 13.0%
7 | 255 | 11.4%
3 | 249 | 11.2%
5 | 133 | 6.0%
8 | 125 | 5.6%
2 | 115 | 5.2%
6 | 112 | 5.0%
4 | 103 | 4.6%
Other values (3) | 123 | 5.5%

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 사업장명 | 인허가일자 | 소재지도로명주소 | 소재지전화번호
0 | 이지라이프 | 2018-12-04 | 경기도 성남시 중원구 산성대로 496, 지하층 (은행동) | <NA>
1 | 엘뤼프 엘스타 대림지사점 | 2023-05-12 | 경기도 성남시 중원구 도촌로8번길 23, 쌍용오피스텔 5층 502호 (도촌동) | <NA>
2 | ㈜인더코어비즈니스플랫폼 | 2023-05-08 | 경기도 성남시 분당구 판교역로 231, 에이치스퀘어 에스동 710호 (삼평동) | 031-778-6788
3 | 닥터손 | 2023-04-27 | 경기도 성남시 분당구 안골로48번길 12 (서현동) | <NA>
4 | 가온 | 2023-04-24 | 대구광역시 남구 안지랑로20길 24, 202호 (대명동) | <NA>
5 | 은성 | 2023-04-17 | 경기도 성남시 중원구 박석로 9-2 (상대원동) | <NA>
6 | 에이치디현대건설기계(주) | 2017-04-28 | 경기도 성남시 분당구 분당수서로 477 (정자동) | 031-8006-6211
7 | 탑셀바이오뱅크 성남고객센타 | 2023-04-11 | 경기도 성남시 중원구 성남대로 1142, 4층 (성남동) | <NA>
8 | 메디칼허브헬스케어 | 2023-04-10 | 경기도 성남시 중원구 도촌로8번길 23, 505호 (도촌동) | <NA>
9 | 주식회사 300텔레콤 | 2023-03-13 | 경기도 성남시 분당구 대왕판교로606번길 58, 판교푸르지오월드마크 1-105호 (삼평동) | <NA>

Last rows

Index | 사업장명 | 인허가일자 | 소재지도로명주소 | 소재지전화번호
256 | 이롬황성주생식 성남사업단 | 2001-11-10 | 경기도 성남시 수정구 공원로 322, 117호 (신흥동, 신동아파라디움) | 031-745-7487~8
257 | 사임당화장품 | 2001-02-21 | 경기도 성남시 분당구 장미로48번길 10, 르네상스분당오피스텔 646호 (야탑동) | 031 733 2550
258 | 케이지모빌리티 성남판매대리점 | 2001-01-04 | 경기도 성남시 수정구 성남대로 1169 (수진동, 남영빌딩) | 031 758 8484
259 | 기아자동차온누리대리점주식회사 | 1999-07-30 | 경기도 성남시 중원구 산성대로 386 (금광동) | 031-733-3977
260 | 현대제일로판매대리점 | 1999-06-30 | 경기도 성남시 중원구 둔촌대로 140(하대원동) | 031 756 3500
261 | 현대상원판매대리점 | 1999-06-17 | 경기도 성남시 수정구 산성대로 465, 1층 (단대동, 농협) | 031 735 9100
262 | 기아자동차 오리대리점 | 1999-05-17 | 경기도 성남시 분당구 금곡로7번길 2 (구미동) | 031-717-0084
263 | 기아 태평역대리점 | 1999-05-13 | 경기도 성남시 수정구 성남대로 1195, 세실빌딩 1층 (수진동) | 031-754-6600
264 | 기아자동차 대원대리점 | 1996-11-18 | 경기도 성남시 중원구 갈마치로 186 (상대원동, 반포테크노피아) | 031-747-4500
265 | 고려자동차매매상사 | 1996-11-08 | 경기도 성남시 수정구 성남대로 1309 (태평동) | 031-732-3000
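
The sample rows show 소재지전화번호 written with hyphens, with spaces (`031 733 2550`), and with a range suffix (`031-745-7487~8`), which is consistent with the Space Separator and Math Symbol counts reported for that variable. A minimal cleanup sketch, assuming hyphen-separated output is wanted and that missing values arrive as `None`; `normalize_phone` is a hypothetical helper, not part of the dataset or of ydata-profiling:

```python
import re
from typing import Optional


def normalize_phone(raw: Optional[str]) -> Optional[str]:
    """Collapse runs of spaces/hyphens into single hyphens; keep None as None."""
    if raw is None:
        return None
    cleaned = raw.strip()
    if not cleaned:
        return None
    # Any run of whitespace and/or hyphens becomes one hyphen.
    # A range suffix such as "~8" is left untouched.
    return re.sub(r"[\s-]+", "-", cleaned)


print(normalize_phone("031 733 2550"))    # 031-733-2550
print(normalize_phone("031-745-7487~8"))  # 031-745-7487~8
```

The range suffix is kept as-is because expanding it into two numbers would be a modelling decision rather than a formatting fix.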