Overview

Dataset statistics

Number of variables4
Number of observations1000
Missing cells28
Missing cells (%)0.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory31.4 KiB
Average record size in memory32.1 B

Variable types

Text3
DateTime1

Dataset

Description서울특별시 성동구에 위치한 공장들에 대한 현황 정보입니다. 회사명, 공장 주소, 업종명, 생산품 목록 등의 정보를 포함합니다.
URLhttps://www.data.go.kr/data/15034124/fileData.do

Alerts

전화번호 has 28 (2.8%) missing valuesMissing

Reproduction

Analysis started2023-12-12 13:40:34.207647
Analysis finished2023-12-12 13:40:34.940551
Duration0.73 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct991
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-12T22:40:35.162449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length23
Mean length7.183
Min length2

Characters and Unicode

Total characters7183
Distinct characters474
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique982 ?
Unique (%)98.2%

Sample

1st row(주) 세원정밀
2nd row윌전기공업(주)
3rd row경보전기(주)
4th row(주)어나더필
5th row기호알엔디(주)
ValueCountFrequency (%)
주식회사 62
 
5.6%
7
 
0.6%
주)선명제본 2
 
0.2%
발렌시아(주 2
 
0.2%
사회복지법인 2
 
0.2%
korea 2
 
0.2%
신일정밀 2
 
0.2%
사)한국지체장애인협회 2
 
0.2%
동아시험기 2
 
0.2%
주)다산에이디 2
 
0.2%
Other values (1017) 1021
92.3%
2023-12-12T22:40:35.629894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
623
 
8.7%
) 554
 
7.7%
( 553
 
7.7%
192
 
2.7%
182
 
2.5%
166
 
2.3%
114
 
1.6%
113
 
1.6%
108
 
1.5%
106
 
1.5%
Other values (464) 4472
62.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5846
81.4%
Close Punctuation 554
 
7.7%
Open Punctuation 553
 
7.7%
Space Separator 106
 
1.5%
Uppercase Letter 76
 
1.1%
Other Punctuation 25
 
0.3%
Lowercase Letter 12
 
0.2%
Decimal Number 10
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
623
 
10.7%
192
 
3.3%
182
 
3.1%
166
 
2.8%
114
 
2.0%
113
 
1.9%
108
 
1.8%
102
 
1.7%
94
 
1.6%
89
 
1.5%
Other values (420) 4063
69.5%
Uppercase Letter
ValueCountFrequency (%)
E 8
 
10.5%
N 7
 
9.2%
T 6
 
7.9%
C 6
 
7.9%
A 5
 
6.6%
O 5
 
6.6%
K 5
 
6.6%
G 5
 
6.6%
S 4
 
5.3%
I 4
 
5.3%
Other values (10) 21
27.6%
Lowercase Letter
ValueCountFrequency (%)
d 2
16.7%
p 1
8.3%
f 1
8.3%
t 1
8.3%
o 1
8.3%
b 1
8.3%
s 1
8.3%
h 1
8.3%
e 1
8.3%
n 1
8.3%
Decimal Number
ValueCountFrequency (%)
2 4
40.0%
1 3
30.0%
5 1
 
10.0%
0 1
 
10.0%
8 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
. 11
44.0%
& 7
28.0%
@ 6
24.0%
, 1
 
4.0%
Close Punctuation
ValueCountFrequency (%)
) 554
100.0%
Open Punctuation
ValueCountFrequency (%)
( 553
100.0%
Space Separator
ValueCountFrequency (%)
106
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5846
81.4%
Common 1249
 
17.4%
Latin 88
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
623
 
10.7%
192
 
3.3%
182
 
3.1%
166
 
2.8%
114
 
2.0%
113
 
1.9%
108
 
1.8%
102
 
1.7%
94
 
1.6%
89
 
1.5%
Other values (420) 4063
69.5%
Latin
ValueCountFrequency (%)
E 8
 
9.1%
N 7
 
8.0%
T 6
 
6.8%
C 6
 
6.8%
A 5
 
5.7%
O 5
 
5.7%
K 5
 
5.7%
G 5
 
5.7%
S 4
 
4.5%
I 4
 
4.5%
Other values (21) 33
37.5%
Common
ValueCountFrequency (%)
) 554
44.4%
( 553
44.3%
106
 
8.5%
. 11
 
0.9%
& 7
 
0.6%
@ 6
 
0.5%
2 4
 
0.3%
1 3
 
0.2%
, 1
 
0.1%
5 1
 
0.1%
Other values (3) 3
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5846
81.4%
ASCII 1337
 
18.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
623
 
10.7%
192
 
3.3%
182
 
3.1%
166
 
2.8%
114
 
2.0%
113
 
1.9%
108
 
1.8%
102
 
1.7%
94
 
1.6%
89
 
1.5%
Other values (420) 4063
69.5%
ASCII
ValueCountFrequency (%)
) 554
41.4%
( 553
41.4%
106
 
7.9%
. 11
 
0.8%
E 8
 
0.6%
& 7
 
0.5%
N 7
 
0.5%
T 6
 
0.4%
@ 6
 
0.4%
C 6
 
0.4%
Other values (34) 73
 
5.5%
Distinct917
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
2023-12-12T22:40:35.983409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length86
Median length52
Mean length34.605
Min length20

Characters and Unicode

Total characters34605
Distinct characters310
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique862 ?
Unique (%)86.2%

Sample

1st row서울특별시 성동구 성수이로20길 43 (성수동2가, (주)G.K문화인쇄)
2nd row서울특별시 성동구 아차산로 144, 304호 (성수동2가)
3rd row서울특별시 성동구 성수일로12가길 5, 경보전기(주) (성수동2가)
4th row서울특별시 성동구 아차산로11길 18 (성수동2가, (주)부래당, (주)언어더필)
5th row서울특별시 성동구 아차산로5길 24-45 (성수동2가)
ValueCountFrequency (%)
서울특별시 1000
 
15.9%
성동구 1000
 
15.9%
성수동2가 670
 
10.6%
성수동1가 235
 
3.7%
아차산로 92
 
1.5%
성수일로 78
 
1.2%
1층 68
 
1.1%
성수이로 60
 
1.0%
2층 59
 
0.9%
55 45
 
0.7%
Other values (983) 2985
47.4%
2023-12-12T22:40:36.494883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5544
 
16.0%
2356
 
6.8%
2103
 
6.1%
1 1484
 
4.3%
2 1363
 
3.9%
1307
 
3.8%
1055
 
3.0%
( 1052
 
3.0%
) 1052
 
3.0%
1047
 
3.0%
Other values (300) 16242
46.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 19773
57.1%
Decimal Number 5838
 
16.9%
Space Separator 5544
 
16.0%
Open Punctuation 1052
 
3.0%
Close Punctuation 1052
 
3.0%
Other Punctuation 994
 
2.9%
Uppercase Letter 156
 
0.5%
Dash Punctuation 153
 
0.4%
Lowercase Letter 27
 
0.1%
Math Symbol 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2356
 
11.9%
2103
 
10.6%
1307
 
6.6%
1055
 
5.3%
1047
 
5.3%
1020
 
5.2%
1001
 
5.1%
1001
 
5.1%
1000
 
5.1%
984
 
5.0%
Other values (245) 6899
34.9%
Uppercase Letter
ValueCountFrequency (%)
B 61
39.1%
S 18
 
11.5%
K 17
 
10.9%
I 14
 
9.0%
T 14
 
9.0%
A 8
 
5.1%
C 4
 
2.6%
M 3
 
1.9%
G 3
 
1.9%
N 2
 
1.3%
Other values (10) 12
 
7.7%
Lowercase Letter
ValueCountFrequency (%)
b 10
37.0%
e 4
 
14.8%
o 2
 
7.4%
s 1
 
3.7%
h 1
 
3.7%
n 1
 
3.7%
z 1
 
3.7%
i 1
 
3.7%
a 1
 
3.7%
m 1
 
3.7%
Other values (4) 4
 
14.8%
Decimal Number
ValueCountFrequency (%)
1 1484
25.4%
2 1363
23.3%
0 562
 
9.6%
4 509
 
8.7%
3 467
 
8.0%
5 440
 
7.5%
7 308
 
5.3%
6 292
 
5.0%
8 253
 
4.3%
9 160
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 977
98.3%
. 14
 
1.4%
& 2
 
0.2%
" 1
 
0.1%
Letter Number
ValueCountFrequency (%)
7
87.5%
1
 
12.5%
Space Separator
ValueCountFrequency (%)
5544
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1052
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1052
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 153
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 19773
57.1%
Common 14641
42.3%
Latin 191
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2356
 
11.9%
2103
 
10.6%
1307
 
6.6%
1055
 
5.3%
1047
 
5.3%
1020
 
5.2%
1001
 
5.1%
1001
 
5.1%
1000
 
5.1%
984
 
5.0%
Other values (245) 6899
34.9%
Latin
ValueCountFrequency (%)
B 61
31.9%
S 18
 
9.4%
K 17
 
8.9%
I 14
 
7.3%
T 14
 
7.3%
b 10
 
5.2%
A 8
 
4.2%
7
 
3.7%
C 4
 
2.1%
e 4
 
2.1%
Other values (26) 34
17.8%
Common
ValueCountFrequency (%)
5544
37.9%
1 1484
 
10.1%
2 1363
 
9.3%
( 1052
 
7.2%
) 1052
 
7.2%
, 977
 
6.7%
0 562
 
3.8%
4 509
 
3.5%
3 467
 
3.2%
5 440
 
3.0%
Other values (9) 1191
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 19773
57.1%
ASCII 14824
42.8%
Number Forms 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5544
37.4%
1 1484
 
10.0%
2 1363
 
9.2%
( 1052
 
7.1%
) 1052
 
7.1%
, 977
 
6.6%
0 562
 
3.8%
4 509
 
3.4%
3 467
 
3.2%
5 440
 
3.0%
Other values (43) 1374
 
9.3%
Hangul
ValueCountFrequency (%)
2356
 
11.9%
2103
 
10.6%
1307
 
6.6%
1055
 
5.3%
1047
 
5.3%
1020
 
5.2%
1001
 
5.1%
1001
 
5.1%
1000
 
5.1%
984
 
5.0%
Other values (245) 6899
34.9%
Number Forms
ValueCountFrequency (%)
7
87.5%
1
 
12.5%

전화번호
Text

MISSING 

Distinct952
Distinct (%)97.9%
Missing28
Missing (%)2.8%
Memory size7.9 KiB
2023-12-12T22:40:36.831130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length11
Mean length11.314815
Min length9

Characters and Unicode

Total characters10998
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique932 ?
Unique (%)95.9%

Sample

1st row02-461-3327
2nd row02-466-5620
3rd row02-465-1138
4th row02-460-0135
5th row02-499-9568
ValueCountFrequency (%)
02-499-2011 2
 
0.2%
02-467-4545 2
 
0.2%
02-461-4477 2
 
0.2%
02-498-3270 2
 
0.2%
02-469-7213 2
 
0.2%
02-466-3053 2
 
0.2%
02-497-7974 2
 
0.2%
02-469-4080 2
 
0.2%
02-468-9900 2
 
0.2%
02-2254-0786 2
 
0.2%
Other values (947) 957
98.0%
2023-12-12T22:40:37.253769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1939
17.6%
2 1767
16.1%
0 1703
15.5%
4 1165
10.6%
6 917
8.3%
1 697
 
6.3%
9 621
 
5.6%
7 597
 
5.4%
5 541
 
4.9%
3 533
 
4.8%
Other values (2) 518
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9051
82.3%
Dash Punctuation 1939
 
17.6%
Space Separator 8
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1767
19.5%
0 1703
18.8%
4 1165
12.9%
6 917
10.1%
1 697
 
7.7%
9 621
 
6.9%
7 597
 
6.6%
5 541
 
6.0%
3 533
 
5.9%
8 510
 
5.6%
Dash Punctuation
ValueCountFrequency (%)
- 1939
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10998
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1939
17.6%
2 1767
16.1%
0 1703
15.5%
4 1165
10.6%
6 917
8.3%
1 697
 
6.3%
9 621
 
5.6%
7 597
 
5.4%
5 541
 
4.9%
3 533
 
4.8%
Other values (2) 518
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10998
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1939
17.6%
2 1767
16.1%
0 1703
15.5%
4 1165
10.6%
6 917
8.3%
1 697
 
6.3%
9 621
 
5.6%
7 597
 
5.4%
5 541
 
4.9%
3 533
 
4.8%
Other values (2) 518
 
4.7%
Distinct853
Distinct (%)85.3%
Missing0
Missing (%)0.0%
Memory size7.9 KiB
Minimum1966-05-30 00:00:00
Maximum2020-02-11 00:00:00
2023-12-12T22:40:37.440451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T22:40:37.587678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2023-12-12T22:40:34.786977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:40:34.888796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회사명공장대표주소전화번호최초공장등록일
0(주) 세원정밀서울특별시 성동구 성수이로20길 43 (성수동2가, (주)G.K문화인쇄)02-461-33271999-01-09
1윌전기공업(주)서울특별시 성동구 아차산로 144, 304호 (성수동2가)02-466-56202000-12-23
2경보전기(주)서울특별시 성동구 성수일로12가길 5, 경보전기(주) (성수동2가)02-465-11381978-09-01
3(주)어나더필서울특별시 성동구 아차산로11길 18 (성수동2가, (주)부래당, (주)언어더필)02-460-01351999-06-08
4기호알엔디(주)서울특별시 성동구 아차산로5길 24-45 (성수동2가)02-499-95681996-05-03
5을지인쇄(주)서울특별시 성동구 성수이로 144-29, 205호 (성수동2가)02-467-37001989-11-15
6(주)비츠로컴서울특별시 성동구 성덕정길 151 (성수동2가, (주)비츠로시스)02-460-22282000-09-25
7(주)경신켄프라서울특별시 성동구 성수이로26길 25 (성수동2가)02-469-15761995-09-01
8(주)토피아넷서울특별시 성동구 아차산로11길 27, 701호 (성수동2가)02-466-60811996-02-03
9뉴론에스(주)서울특별시 성동구 성수이로24길 38 (성수동2가, 신흥아파트형공장)02-466-66631996-08-09
회사명공장대표주소전화번호최초공장등록일
990에스엔에스이서비스(주)서울특별시 성동구 아차산로7길 28, 219호(2층) (성수동2가, 성수쇼핑센타)02-539-36512014-11-27
991(주)카이저솔루션서울특별시 성동구 성수이로22길 37, 4층 404 (성수동2가, 성수동 아크밸리)02-971-09542015-02-06
992경성애드컴서울특별시 성동구 뚝섬로15길 17-19, 1층 (성수동2가)070-4283-42002015-07-03
993세원상사서울특별시 성동구 연무장길 35, 지층,1층 (성수동2가)02-466-79352015-08-04
994태광종합기획서울특별시 성동구 뚝섬로15길 16 (성수동2가) 1층02-2265-52902015-09-04
995국민프린텍서울특별시 성동구 성수일로8길 39, 4층~8층 (성수동2가)02-0469-93322018-04-23
996(주)인투필라테스(1공장)서울특별시 성동구 연무장18길 4, 공장동 2층 (성수동2가)070-776 -03612016-10-07
997주식회사 미소모형서울특별시 성동구 성수일로12길 34, 2층 (성수동2가)02-497-87522017-02-09
998일진서울특별시 성동구 아차산로 163 (성수동2가)02-464-39662017-03-23
999에프엔씨(f&c)서울특별시 성동구 용답중앙21길 2, 2층 (용답동)070-7613-34022017-05-12