Overview

Dataset statistics

Number of variables8
Number of observations6623
Missing cells5934
Missing cells (%)11.2%
Duplicate rows11
Duplicate rows (%)0.2%
Total size in memory420.5 KiB
Average record size in memory65.0 B

Variable types

Text6
Numeric1
Categorical1

Dataset

Description경남 창원시 관내(의창구, 성산구, 마산합포구, 마산회원구, 진해구)에 소재하고 있으며 제조업 공장으로 등록된 현황을 제공합니다.(2021년 8월 31일 기준)
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3066436

Alerts

Dataset has 11 (0.2%) duplicate rowsDuplicates
단지명 is highly imbalanced (60.2%)Imbalance
대표자 has 3006 (45.4%) missing valuesMissing
전화번호 has 568 (8.6%) missing valuesMissing
팩스번호 has 1270 (19.2%) missing valuesMissing
업종번호 has 1085 (16.4%) missing valuesMissing

Reproduction

Analysis started2023-12-10 22:59:31.622976
Analysis finished2023-12-10 22:59:32.987665
Duration1.36 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct4275
Distinct (%)64.5%
Missing0
Missing (%)0.0%
Memory size51.9 KiB
2023-12-11T07:59:33.180866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length23
Mean length6.2518496
Min length1

Characters and Unicode

Total characters41406
Distinct characters575
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2378 ?
Unique (%)35.9%

Sample

1st row(유)리앤리테크
2nd row(유)비엔에스
3rd row(유)삼송 창원공장
4th row(유)티에스
5th row(주)A4
ValueCountFrequency (%)
주식회사 172
 
2.4%
창원공장 34
 
0.5%
2공장 28
 
0.4%
tech 17
 
0.2%
제2공장 17
 
0.2%
1공장 16
 
0.2%
11
 
0.2%
성진정밀 9
 
0.1%
창원지점 9
 
0.1%
유한회사 9
 
0.1%
Other values (4285) 6780
95.5%
2023-12-11T07:59:33.576430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3062
 
7.4%
) 2867
 
6.9%
( 2864
 
6.9%
1110
 
2.7%
1081
 
2.6%
1049
 
2.5%
960
 
2.3%
834
 
2.0%
824
 
2.0%
783
 
1.9%
Other values (565) 25972
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33482
80.9%
Close Punctuation 2867
 
6.9%
Open Punctuation 2864
 
6.9%
Uppercase Letter 1303
 
3.1%
Space Separator 491
 
1.2%
Decimal Number 174
 
0.4%
Other Punctuation 111
 
0.3%
Lowercase Letter 80
 
0.2%
Dash Punctuation 34
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3062
 
9.1%
1110
 
3.3%
1081
 
3.2%
1049
 
3.1%
960
 
2.9%
834
 
2.5%
824
 
2.5%
783
 
2.3%
753
 
2.2%
716
 
2.1%
Other values (508) 22310
66.6%
Uppercase Letter
ValueCountFrequency (%)
T 169
13.0%
E 161
12.4%
C 135
10.4%
S 131
10.1%
M 92
 
7.1%
N 86
 
6.6%
H 85
 
6.5%
G 83
 
6.4%
A 52
 
4.0%
K 44
 
3.4%
Other values (15) 265
20.3%
Lowercase Letter
ValueCountFrequency (%)
e 17
21.2%
c 10
12.5%
h 9
11.2%
n 8
10.0%
g 5
 
6.2%
i 5
 
6.2%
s 5
 
6.2%
w 4
 
5.0%
u 3
 
3.8%
l 3
 
3.8%
Other values (7) 11
13.8%
Decimal Number
ValueCountFrequency (%)
2 92
52.9%
1 47
27.0%
3 21
 
12.1%
4 10
 
5.7%
8 2
 
1.1%
5 2
 
1.1%
Other Punctuation
ValueCountFrequency (%)
. 69
62.2%
& 33
29.7%
? 4
 
3.6%
, 3
 
2.7%
/ 2
 
1.8%
Close Punctuation
ValueCountFrequency (%)
) 2867
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2864
100.0%
Space Separator
ValueCountFrequency (%)
491
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 34
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33482
80.9%
Common 6541
 
15.8%
Latin 1383
 
3.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3062
 
9.1%
1110
 
3.3%
1081
 
3.2%
1049
 
3.1%
960
 
2.9%
834
 
2.5%
824
 
2.5%
783
 
2.3%
753
 
2.2%
716
 
2.1%
Other values (508) 22310
66.6%
Latin
ValueCountFrequency (%)
T 169
12.2%
E 161
11.6%
C 135
 
9.8%
S 131
 
9.5%
M 92
 
6.7%
N 86
 
6.2%
H 85
 
6.1%
G 83
 
6.0%
A 52
 
3.8%
K 44
 
3.2%
Other values (32) 345
24.9%
Common
ValueCountFrequency (%)
) 2867
43.8%
( 2864
43.8%
491
 
7.5%
2 92
 
1.4%
. 69
 
1.1%
1 47
 
0.7%
- 34
 
0.5%
& 33
 
0.5%
3 21
 
0.3%
4 10
 
0.2%
Other values (5) 13
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33482
80.9%
ASCII 7924
 
19.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3062
 
9.1%
1110
 
3.3%
1081
 
3.2%
1049
 
3.1%
960
 
2.9%
834
 
2.5%
824
 
2.5%
783
 
2.3%
753
 
2.2%
716
 
2.1%
Other values (508) 22310
66.6%
ASCII
ValueCountFrequency (%)
) 2867
36.2%
( 2864
36.1%
491
 
6.2%
T 169
 
2.1%
E 161
 
2.0%
C 135
 
1.7%
S 131
 
1.7%
2 92
 
1.2%
M 92
 
1.2%
N 86
 
1.1%
Other values (47) 836
 
10.6%

대표자
Text

MISSING 

Distinct2883
Distinct (%)79.7%
Missing3006
Missing (%)45.4%
Memory size51.9 KiB
2023-12-11T07:59:33.897662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length3
Mean length3.1598009
Min length2

Characters and Unicode

Total characters11429
Distinct characters298
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2383 ?
Unique (%)65.9%

Sample

1st row이애경
2nd row임진태
3rd row이형찬
4th row권일
5th row장창민
ValueCountFrequency (%)
9
 
0.2%
이귀현 9
 
0.2%
최동윤 8
 
0.2%
황규복 8
 
0.2%
김병준 7
 
0.2%
박종길 7
 
0.2%
이수태 7
 
0.2%
김종덕 6
 
0.2%
이태용 6
 
0.2%
김두영 6
 
0.2%
Other values (2920) 3646
98.0%
2023-12-11T07:59:34.385577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
772
 
6.8%
540
 
4.7%
370
 
3.2%
351
 
3.1%
304
 
2.7%
239
 
2.1%
234
 
2.0%
214
 
1.9%
204
 
1.8%
193
 
1.7%
Other values (288) 8008
70.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11184
97.9%
Space Separator 105
 
0.9%
Other Punctuation 85
 
0.7%
Decimal Number 29
 
0.3%
Uppercase Letter 20
 
0.2%
Close Punctuation 3
 
< 0.1%
Open Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
772
 
6.9%
540
 
4.8%
370
 
3.3%
351
 
3.1%
304
 
2.7%
239
 
2.1%
234
 
2.1%
214
 
1.9%
204
 
1.8%
193
 
1.7%
Other values (272) 7763
69.4%
Uppercase Letter
ValueCountFrequency (%)
H 4
20.0%
N 2
10.0%
A 2
10.0%
W 2
10.0%
U 2
10.0%
Y 2
10.0%
K 2
10.0%
O 2
10.0%
C 2
10.0%
Decimal Number
ValueCountFrequency (%)
1 27
93.1%
2 1
 
3.4%
3 1
 
3.4%
Space Separator
ValueCountFrequency (%)
105
100.0%
Other Punctuation
ValueCountFrequency (%)
, 85
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11184
97.9%
Common 225
 
2.0%
Latin 20
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
772
 
6.9%
540
 
4.8%
370
 
3.3%
351
 
3.1%
304
 
2.7%
239
 
2.1%
234
 
2.1%
214
 
1.9%
204
 
1.8%
193
 
1.7%
Other values (272) 7763
69.4%
Latin
ValueCountFrequency (%)
H 4
20.0%
N 2
10.0%
A 2
10.0%
W 2
10.0%
U 2
10.0%
Y 2
10.0%
K 2
10.0%
O 2
10.0%
C 2
10.0%
Common
ValueCountFrequency (%)
105
46.7%
, 85
37.8%
1 27
 
12.0%
) 3
 
1.3%
( 3
 
1.3%
2 1
 
0.4%
3 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11184
97.9%
ASCII 245
 
2.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
772
 
6.9%
540
 
4.8%
370
 
3.3%
351
 
3.1%
304
 
2.7%
239
 
2.1%
234
 
2.1%
214
 
1.9%
204
 
1.8%
193
 
1.7%
Other values (272) 7763
69.4%
ASCII
ValueCountFrequency (%)
105
42.9%
, 85
34.7%
1 27
 
11.0%
H 4
 
1.6%
) 3
 
1.2%
( 3
 
1.2%
N 2
 
0.8%
A 2
 
0.8%
W 2
 
0.8%
U 2
 
0.8%
Other values (6) 10
 
4.1%

전화번호
Text

MISSING 

Distinct3892
Distinct (%)64.3%
Missing568
Missing (%)8.6%
Memory size51.9 KiB
2023-12-11T07:59:34.629534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.028076
Min length11

Characters and Unicode

Total characters72830
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2144 ?
Unique (%)35.4%

Sample

1st row055-603-3106
2nd row055-276-6685
3rd row055-284-8661
4th row055-276-9281
5th row055-274-0365
ValueCountFrequency (%)
055-275-7930 9
 
0.1%
055-271-1980 9
 
0.1%
055-551-6601 8
 
0.1%
055-268-8000 8
 
0.1%
055-239-4700 8
 
0.1%
055-551-9200 8
 
0.1%
055-291-0471 6
 
0.1%
055-262-8916 6
 
0.1%
055-551-6685 6
 
0.1%
055-551-8320 6
 
0.1%
Other values (3882) 5981
98.8%
2023-12-11T07:59:35.014515image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 15945
21.9%
- 12110
16.6%
0 10148
13.9%
2 8247
11.3%
6 4541
 
6.2%
7 4062
 
5.6%
1 4026
 
5.5%
8 4012
 
5.5%
9 3420
 
4.7%
3 3396
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 60720
83.4%
Dash Punctuation 12110
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 15945
26.3%
0 10148
16.7%
2 8247
13.6%
6 4541
 
7.5%
7 4062
 
6.7%
1 4026
 
6.6%
8 4012
 
6.6%
9 3420
 
5.6%
3 3396
 
5.6%
4 2923
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 12110
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 72830
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 15945
21.9%
- 12110
16.6%
0 10148
13.9%
2 8247
11.3%
6 4541
 
6.2%
7 4062
 
5.6%
1 4026
 
5.5%
8 4012
 
5.5%
9 3420
 
4.7%
3 3396
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 72830
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 15945
21.9%
- 12110
16.6%
0 10148
13.9%
2 8247
11.3%
6 4541
 
6.2%
7 4062
 
5.6%
1 4026
 
5.5%
8 4012
 
5.5%
9 3420
 
4.7%
3 3396
 
4.7%

팩스번호
Text

MISSING 

Distinct3330
Distinct (%)62.2%
Missing1270
Missing (%)19.2%
Memory size51.9 KiB
2023-12-11T07:59:35.251204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.007472
Min length8

Characters and Unicode

Total characters64276
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1713 ?
Unique (%)32.0%

Sample

1st row055-603-3114
2nd row055-276-6686
3rd row055-284-8660
4th row055-288-9281
5th row055-274-0362
ValueCountFrequency (%)
055-275-7931 10
 
0.2%
055-239-3682 8
 
0.1%
055-265-3667 8
 
0.1%
055-296-6441 8
 
0.1%
055-282-9320 8
 
0.1%
055-294-5155 7
 
0.1%
055-267-1630 6
 
0.1%
055-267-2768 6
 
0.1%
055-291-0470 6
 
0.1%
055-551-6604 6
 
0.1%
Other values (3320) 5281
98.6%
2023-12-11T07:59:35.606713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 14367
22.4%
- 10706
16.7%
0 8178
12.7%
2 7476
11.6%
6 4301
 
6.7%
8 3589
 
5.6%
9 3456
 
5.4%
7 3270
 
5.1%
3 3229
 
5.0%
1 2938
 
4.6%
Other values (2) 2766
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 53564
83.3%
Dash Punctuation 10706
 
16.7%
Space Separator 6
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 14367
26.8%
0 8178
15.3%
2 7476
14.0%
6 4301
 
8.0%
8 3589
 
6.7%
9 3456
 
6.5%
7 3270
 
6.1%
3 3229
 
6.0%
1 2938
 
5.5%
4 2760
 
5.2%
Dash Punctuation
ValueCountFrequency (%)
- 10706
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 64276
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 14367
22.4%
- 10706
16.7%
0 8178
12.7%
2 7476
11.6%
6 4301
 
6.7%
8 3589
 
5.6%
9 3456
 
5.4%
7 3270
 
5.1%
3 3229
 
5.0%
1 2938
 
4.6%
Other values (2) 2766
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 64276
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 14367
22.4%
- 10706
16.7%
0 8178
12.7%
2 7476
11.6%
6 4301
 
6.7%
8 3589
 
5.6%
9 3456
 
5.4%
7 3270
 
5.1%
3 3229
 
5.0%
1 2938
 
4.6%
Other values (2) 2766
 
4.3%

주소
Text

Distinct3614
Distinct (%)54.6%
Missing0
Missing (%)0.0%
Memory size51.9 KiB
2023-12-11T07:59:35.869187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length59
Mean length34.5496
Min length18

Characters and Unicode

Total characters228822
Distinct characters447
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1909 ?
Unique (%)28.8%

Sample

1st row경상남도 창원시 성산구 반월로 16 (신촌동)
2nd row경상남도 창원시 성산구 완암로 50, 테크동 4층 410호 (성산동, SK테크노파크)
3rd row경상남도 창원시 성산구 정동로62번길 30 (성주동) (총 2 필지) (총 2 필지)
4th row경상남도 창원시 의창구 차룡단지로 75-2 (팔용동)
5th row경상남도 창원시 의창구 반계동 1484번지 경남테크노파크시험생산동 203
ValueCountFrequency (%)
경상남도 6624
 
14.0%
창원시 6624
 
14.0%
성산구 3381
 
7.1%
의창구 1371
 
2.9%
성산동 1195
 
2.5%
완암로 1133
 
2.4%
50 1004
 
2.1%
sk테크노파크 973
 
2.1%
마산회원구 858
 
1.8%
신촌동 851
 
1.8%
Other values (3030) 23391
49.3%
2023-12-11T07:59:36.257215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
40922
 
17.9%
8722
 
3.8%
8413
 
3.7%
7911
 
3.5%
7432
 
3.2%
6934
 
3.0%
6771
 
3.0%
6664
 
2.9%
6663
 
2.9%
6657
 
2.9%
Other values (437) 121733
53.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 137451
60.1%
Space Separator 40922
 
17.9%
Decimal Number 29216
 
12.8%
Close Punctuation 6278
 
2.7%
Open Punctuation 6277
 
2.7%
Other Punctuation 4705
 
2.1%
Uppercase Letter 2514
 
1.1%
Dash Punctuation 1425
 
0.6%
Lowercase Letter 29
 
< 0.1%
Other Symbol 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8722
 
6.3%
8413
 
6.1%
7911
 
5.8%
7432
 
5.4%
6934
 
5.0%
6771
 
4.9%
6664
 
4.8%
6663
 
4.8%
6657
 
4.8%
6642
 
4.8%
Other values (379) 64642
47.0%
Uppercase Letter
ValueCountFrequency (%)
S 1030
41.0%
K 1011
40.2%
B 145
 
5.8%
C 52
 
2.1%
T 48
 
1.9%
N 33
 
1.3%
A 32
 
1.3%
E 31
 
1.2%
G 24
 
1.0%
M 17
 
0.7%
Other values (12) 91
 
3.6%
Lowercase Letter
ValueCountFrequency (%)
c 5
17.2%
e 3
10.3%
a 2
 
6.9%
n 2
 
6.9%
y 2
 
6.9%
g 2
 
6.9%
l 2
 
6.9%
i 2
 
6.9%
r 2
 
6.9%
x 2
 
6.9%
Other values (3) 5
17.2%
Decimal Number
ValueCountFrequency (%)
1 5945
20.3%
2 3855
13.2%
5 3213
11.0%
0 3194
10.9%
3 3142
10.8%
4 2520
8.6%
6 2303
 
7.9%
7 2020
 
6.9%
8 1532
 
5.2%
9 1492
 
5.1%
Other Punctuation
ValueCountFrequency (%)
, 4647
98.8%
· 25
 
0.5%
. 22
 
0.5%
& 8
 
0.2%
/ 3
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 6277
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 6276
> 99.9%
[ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
40922
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1425
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 137455
60.1%
Common 88824
38.8%
Latin 2543
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8722
 
6.3%
8413
 
6.1%
7911
 
5.8%
7432
 
5.4%
6934
 
5.0%
6771
 
4.9%
6664
 
4.8%
6663
 
4.8%
6657
 
4.8%
6642
 
4.8%
Other values (380) 64646
47.0%
Latin
ValueCountFrequency (%)
S 1030
40.5%
K 1011
39.8%
B 145
 
5.7%
C 52
 
2.0%
T 48
 
1.9%
N 33
 
1.3%
A 32
 
1.3%
E 31
 
1.2%
G 24
 
0.9%
M 17
 
0.7%
Other values (25) 120
 
4.7%
Common
ValueCountFrequency (%)
40922
46.1%
) 6277
 
7.1%
( 6276
 
7.1%
1 5945
 
6.7%
, 4647
 
5.2%
2 3855
 
4.3%
5 3213
 
3.6%
0 3194
 
3.6%
3 3142
 
3.5%
4 2520
 
2.8%
Other values (12) 8833
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 137447
60.1%
ASCII 91342
39.9%
None 29
 
< 0.1%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
40922
44.8%
) 6277
 
6.9%
( 6276
 
6.9%
1 5945
 
6.5%
, 4647
 
5.1%
2 3855
 
4.2%
5 3213
 
3.5%
0 3194
 
3.5%
3 3142
 
3.4%
4 2520
 
2.8%
Other values (46) 11351
 
12.4%
Hangul
ValueCountFrequency (%)
8722
 
6.3%
8413
 
6.1%
7911
 
5.8%
7432
 
5.4%
6934
 
5.0%
6771
 
4.9%
6664
 
4.8%
6663
 
4.8%
6657
 
4.8%
6642
 
4.8%
Other values (376) 64638
47.0%
None
ValueCountFrequency (%)
· 25
86.2%
4
 
13.8%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

업종번호
Real number (ℝ)

MISSING 

Distinct299
Distinct (%)5.4%
Missing1085
Missing (%)16.4%
Infinite0
Infinite (%)0.0%
Mean26944.784
Minimum10121
Maximum58221
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size58.3 KiB
2023-12-11T07:59:36.572141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10121
5-th percentile20421
Q125924
median27307
Q329223
95-th percentile30399
Maximum58221
Range48100
Interquartile range (IQR)3299

Descriptive statistics

Standard deviation3761.6382
Coefficient of variation (CV)0.13960543
Kurtosis9.7876686
Mean26944.784
Median Absolute Deviation (MAD)1834
Skewness-1.9207255
Sum1.4922022 × 108
Variance14149922
MonotonicityNot monotonic
2023-12-11T07:59:36.684608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
25924 935
 
14.1%
29294 376
 
5.7%
28123 315
 
4.8%
29223 291
 
4.4%
25929 183
 
2.8%
29229 167
 
2.5%
30399 130
 
2.0%
29120 114
 
1.7%
31114 86
 
1.3%
29142 79
 
1.2%
Other values (289) 2862
43.2%
(Missing) 1085
 
16.4%
ValueCountFrequency (%)
10121 4
 
0.1%
10129 12
0.2%
10211 7
0.1%
10212 12
0.2%
10213 9
0.1%
10219 7
0.1%
10220 6
0.1%
10301 6
0.1%
10309 9
0.1%
10403 1
 
< 0.1%
ValueCountFrequency (%)
58221 2
 
< 0.1%
58121 1
 
< 0.1%
41225 1
 
< 0.1%
38321 4
 
0.1%
34309 1
 
< 0.1%
34011 1
 
< 0.1%
33999 3
 
< 0.1%
33932 9
0.1%
33920 2
 
< 0.1%
33910 20
0.3%
Distinct838
Distinct (%)12.7%
Missing5
Missing (%)0.1%
Memory size51.9 KiB
2023-12-11T07:59:36.923692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length30
Mean length16.045029
Min length1

Characters and Unicode

Total characters106186
Distinct characters316
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique308 ?
Unique (%)4.7%

Sample

1st row그 외 자동차용 신품 부품 제조업 외 3 종
2nd row배전반 및 전기 자동제어반 제조업 외 12 종
3rd row그 외 자동차용 신품 부품 제조업 외 3 종
4th row절삭가공 및 유사처리업
5th row절삭가공 및 유사처리업 외 1 종
ValueCountFrequency (%)
제조업 4952
 
14.6%
3367
 
9.9%
3025
 
8.9%
2419
 
7.1%
기타 1410
 
4.2%
1 1237
 
3.7%
절삭가공 967
 
2.9%
유사처리업 967
 
2.9%
948
 
2.8%
금속 744
 
2.2%
Other values (579) 13836
40.8%
2023-12-11T07:59:37.302658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27292
25.7%
6840
 
6.4%
5984
 
5.6%
5598
 
5.3%
4398
 
4.1%
3415
 
3.2%
3027
 
2.9%
2427
 
2.3%
1926
 
1.8%
1615
 
1.5%
Other values (306) 43664
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 75773
71.4%
Space Separator 27292
 
25.7%
Decimal Number 2657
 
2.5%
Other Punctuation 432
 
0.4%
Open Punctuation 16
 
< 0.1%
Close Punctuation 16
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6840
 
9.0%
5984
 
7.9%
5598
 
7.4%
4398
 
5.8%
3415
 
4.5%
3027
 
4.0%
2427
 
3.2%
1926
 
2.5%
1615
 
2.1%
1605
 
2.1%
Other values (291) 38938
51.4%
Decimal Number
ValueCountFrequency (%)
1 1461
55.0%
2 387
 
14.6%
3 371
 
14.0%
4 161
 
6.1%
5 94
 
3.5%
6 57
 
2.1%
7 50
 
1.9%
8 31
 
1.2%
0 31
 
1.2%
9 14
 
0.5%
Other Punctuation
ValueCountFrequency (%)
, 426
98.6%
. 6
 
1.4%
Space Separator
ValueCountFrequency (%)
27292
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 75773
71.4%
Common 30413
28.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6840
 
9.0%
5984
 
7.9%
5598
 
7.4%
4398
 
5.8%
3415
 
4.5%
3027
 
4.0%
2427
 
3.2%
1926
 
2.5%
1615
 
2.1%
1605
 
2.1%
Other values (291) 38938
51.4%
Common
ValueCountFrequency (%)
27292
89.7%
1 1461
 
4.8%
, 426
 
1.4%
2 387
 
1.3%
3 371
 
1.2%
4 161
 
0.5%
5 94
 
0.3%
6 57
 
0.2%
7 50
 
0.2%
8 31
 
0.1%
Other values (5) 83
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 75760
71.3%
ASCII 30413
28.6%
Compat Jamo 13
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27292
89.7%
1 1461
 
4.8%
, 426
 
1.4%
2 387
 
1.3%
3 371
 
1.2%
4 161
 
0.5%
5 94
 
0.3%
6 57
 
0.2%
7 50
 
0.2%
8 31
 
0.1%
Other values (5) 83
 
0.3%
Hangul
ValueCountFrequency (%)
6840
 
9.0%
5984
 
7.9%
5598
 
7.4%
4398
 
5.8%
3415
 
4.5%
3027
 
4.0%
2427
 
3.2%
1926
 
2.5%
1615
 
2.1%
1605
 
2.1%
Other values (290) 38925
51.4%
Compat Jamo
ValueCountFrequency (%)
13
100.0%

단지명
Categorical

IMBALANCE 

Distinct22
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size51.9 KiB
창원국가산업단지
4124 
<NA>
1606 
 
189
마천일반산업단지
 
133
진해마천지방산업단지
 
133
Other values (17)
438 

Length

Max length16
Median length8
Mean length6.9279783
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row창원국가산업단지
2nd row창원국가산업단지
3rd row창원국가산업단지
4th row창원국가산업단지
5th row창원국가산업단지

Common Values

ValueCountFrequency (%)
창원국가산업단지 4124
62.3%
<NA> 1606
 
24.2%
189
 
2.9%
마천일반산업단지 133
 
2.0%
진해마천지방산업단지 133
 
2.0%
마산자유무역지역 130
 
2.0%
진북일반산업단지 49
 
0.7%
남양일반산업단지 46
 
0.7%
진해남양지방산업단지 46
 
0.7%
창원일반산업단지 39
 
0.6%
Other values (12) 128
 
1.9%

Length

2023-12-11T07:59:37.424894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
창원국가산업단지 4124
64.1%
na 1606
 
25.0%
마천일반산업단지 133
 
2.1%
진해마천지방산업단지 133
 
2.1%
마산자유무역지역 130
 
2.0%
진북일반산업단지 49
 
0.8%
남양일반산업단지 46
 
0.7%
진해남양지방산업단지 46
 
0.7%
창원일반산업단지 39
 
0.6%
진북농공단지 20
 
0.3%
Other values (11) 108
 
1.7%

Interactions

2023-12-11T07:59:32.566846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:59:37.492203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종번호단지명
업종번호1.0000.748
단지명0.7481.000
2023-12-11T07:59:37.558260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종번호단지명
업종번호1.0000.427
단지명0.4271.000

Missing values

2023-12-11T07:59:32.687558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:59:32.797162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:59:32.920224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체명대표자전화번호팩스번호주소업종번호업종명단지명
0(유)리앤리테크이애경055-603-3106055-603-3114경상남도 창원시 성산구 반월로 16 (신촌동)30399그 외 자동차용 신품 부품 제조업 외 3 종창원국가산업단지
1(유)비엔에스임진태055-276-6685055-276-6686경상남도 창원시 성산구 완암로 50, 테크동 4층 410호 (성산동, SK테크노파크)28123배전반 및 전기 자동제어반 제조업 외 12 종창원국가산업단지
2(유)삼송 창원공장이형찬055-284-8661055-284-8660경상남도 창원시 성산구 정동로62번길 30 (성주동) (총 2 필지) (총 2 필지)30399그 외 자동차용 신품 부품 제조업 외 3 종창원국가산업단지
3(유)티에스권일055-276-9281055-288-9281경상남도 창원시 의창구 차룡단지로 75-2 (팔용동)25924절삭가공 및 유사처리업창원국가산업단지
4(주)A4장창민055-274-0365055-274-0362경상남도 창원시 의창구 반계동 1484번지 경남테크노파크시험생산동 20325924절삭가공 및 유사처리업 외 1 종창원국가산업단지
5(주)S&TC정원휘055-212-6500055-212-6525경상남도 창원시 성산구 완암로 12 (성산동)29176증류기,열교환기 및 가스발생기 제조업 외 2 종창원국가산업단지
6(주)DH케미칼최강055-283-3241055-283-3245경상남도 창원시 성산구 성주로137번길 3 (남산동, 동호물산(주)) (총 2 필지)19221윤활유 및 그리스 제조업 외 3 종창원국가산업단지
7(주)KMT여정석055-297-8101055-297-8104경상남도 창원시 성산구 공단로166번길 13-14 (신촌동)29223금속 절삭기계 제조업창원국가산업단지
8(주)가앤온오혜정055-607-0738055-607-0739경상남도 창원시 성산구 완암로 50, 넥스동 11층 1119 (성산동, SK테크노파크)28901전기경보 및 신호장치 제조업창원국가산업단지
9(주)강림테크이훈055-297-7970055-294-5815경상남도 창원시 성산구 공단로 228-32 (웅남동)29111내연기관 제조업창원국가산업단지
업체명대표자전화번호팩스번호주소업종번호업종명단지명
6613한황산업(주)박준흠055-548-0700055-551-6780경상남도 창원시 진해구 남의로43번길 28 (남양동, 한황산업(주))<NA>그 외 자동차용 신품 부품 제조업 외 3 종진해마천지방산업단지
6614한황산업(주)박준흠055-546-6781<NA>경상남도 창원시 진해구 남영로527번길 87 (남양동)<NA>그 외 기타 분류 안된 금속 가공 제품 제조업 외 1 종진해남양지방산업단지
6615혜성사이희정055-543-2218055-542-8963경상남도 창원시 진해구 이동로40번길 3 (이동)<NA>남자용 겉옷 제조업 외 25 종
6616호진산업(주)이수용055-551-6767055-551-6770경상남도 창원시 진해구 남영로552번길 31 (남양동)<NA>그 외 기타 분류 안된 비금속 광물제품 제조업 외 3 종진해마천지방산업단지
6617화성금속김두영055-551-9123055-551-9128경상남도 창원시 진해구 남의로43번길 13 (남양동)24311선철주물 주조업진해마천지방산업단지
6618화성스텐남응식055-543-0495055-543-0498경상남도 창원시 진해구 안골로 75 (안골동)24290기타 1차 비철금속 제조업
6619화신산업(주)조준택055-543-4107055-543-4109경상남도 창원시 진해구 남영로574번길 28 (남양동)24199그 외 기타 1차 철강 제조업진해마천지방산업단지
6620화영수지김종천055-547-2401055-547-2462경상남도 창원시 진해구 남의로21번길 79 (남양동)<NA>폴리스티렌 발포 성형제품 제조업 외 1 종진해마천지방산업단지
6621환인금속심환석055-547-6235055-547-6237경상남도 창원시 진해구 남의로 33 (남양동, 화인금속)24311선철주물 주조업진해마천지방산업단지
6622환인금속공업(주)이인자055-547-6235055-547-6237경상남도 창원시 진해구 남의로 33 (남양동, 화인금속)24311선철주물 주조업진해마천지방산업단지

Duplicate rows

Most frequently occurring

업체명대표자전화번호팩스번호주소업종번호업종명단지명# duplicates
10진영전기(주)<NA>055-271-8838<NA>경상남도 창원시 마산합포구 진북면28121전기회로 개폐, 보호장치 제조업 외 1 종<NA>3
0(주)동국알앤에스<NA>055-271-3450<NA>경상남도 창원시 마산합포구 진전면23211정형 내화 요업제품 제조업 외 1 종<NA>2
1(주)비티엑스코리아 창원공장박성준055-281-4301055-281-4340경상남도 창원시 성산구 웅남로 760 (성주동)30399그 외 자동차용 신품 부품 제조업 외 3 종창원국가산업단지2
2(주)비티엑스코리아 창원공장<NA>055-281-4301055-281-4340경상남도 창원시 성산구 웅남로 760 (성주동)<NA>그 외 자동차용 신품 부품 제조업 외 3 종창원국가산업단지2
3(주)오리엔탈마린텍(1공장)김진호055-545-8300055-545-4144경상남도 창원시 진해구 명제로 164 (명동, 오리엔탈정공(주)) (총 5 필지)31114선박 구성 부분품 제조업진해국가산업단지2
4(주)오리엔탈마린텍(2공장)김진호055-545-8300055-545-4144경상남도 창원시 진해구 명제로 186 (명동, 오리엔탈정공(주)) (총 19 필지) (총 19 필지)31114선박 구성 부분품 제조업진해국가산업단지2
5(주)중앙오션 진해공장전병철055-545-3622055-545-3623경상남도 창원시 진해구 명제로 184 (명동)31114선박 구성 부분품 제조업진해국가산업단지2
6동우기계공업(주)신동국055-295-3261055-295-3260경상남도 창원시 의창구 평산로38번길 13 (팔용동)29241건설 및 채광용 기계장비 제조업 외 1 종창원국가산업단지2
7제이에이치산업<NA>055-271-7704<NA>경상남도 창원시 마산합포구 진북면25994금속 표시판 제조업<NA>2
8진성금속양진희055-552-2208055-552-0208경상남도 창원시 진해구 남영로522번길 21 (남양동, 쌍화흥조조창)24321알루미늄주물 주조업마천일반산업단지2