Overview

Dataset statistics

Number of variables7
Number of observations4734
Missing cells1021
Missing cells (%)3.1%
Duplicate rows3
Duplicate rows (%)0.1%
Total size in memory259.0 KiB
Average record size in memory56.0 B

Variable types

Text6
Categorical1

Dataset

Description창원시 공장등록 현황
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3066436

Alerts

데이터제공일자 has constant value ""Constant
Dataset has 3 (0.1%) duplicate rowsDuplicates
전화번호 has 390 (8.2%) missing valuesMissing
팩스번호 has 631 (13.3%) missing valuesMissing

Reproduction

Analysis started2023-12-10 22:59:23.091235
Analysis finished2023-12-10 22:59:24.466798
Duration1.38 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct4304
Distinct (%)90.9%
Missing0
Missing (%)0.0%
Memory size37.1 KiB
2023-12-11T07:59:24.655085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length20
Mean length6.3111534
Min length1

Characters and Unicode

Total characters29877
Distinct characters594
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3967 ?
Unique (%)83.8%

Sample

1st row(사)한국장애인사회복지회사회복지부
2nd row(유)경남라이팅
3rd row(유)국제보링1급정비
4th row(유)두루산업사
5th row(유)리앤리테크
ValueCountFrequency (%)
주식회사 146
 
2.9%
창원공장 23
 
0.4%
2공장 20
 
0.4%
제2공장 13
 
0.3%
1공장 12
 
0.2%
tech 11
 
0.2%
10
 
0.2%
창원지점 7
 
0.1%
성진정밀 6
 
0.1%
명성테크 5
 
0.1%
Other values (4322) 4864
95.1%
2023-12-11T07:59:25.033126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2224
 
7.4%
) 2065
 
6.9%
( 2062
 
6.9%
811
 
2.7%
757
 
2.5%
730
 
2.4%
668
 
2.2%
599
 
2.0%
583
 
2.0%
576
 
1.9%
Other values (584) 18802
62.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24155
80.8%
Close Punctuation 2065
 
6.9%
Open Punctuation 2062
 
6.9%
Uppercase Letter 920
 
3.1%
Space Separator 388
 
1.3%
Decimal Number 115
 
0.4%
Other Punctuation 79
 
0.3%
Lowercase Letter 58
 
0.2%
Dash Punctuation 22
 
0.1%
Other Symbol 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2224
 
9.2%
811
 
3.4%
757
 
3.1%
730
 
3.0%
668
 
2.8%
599
 
2.5%
583
 
2.4%
576
 
2.4%
524
 
2.2%
484
 
2.0%
Other values (524) 16199
67.1%
Uppercase Letter
ValueCountFrequency (%)
E 111
12.1%
T 108
11.7%
S 99
10.8%
C 87
9.5%
M 62
 
6.7%
H 59
 
6.4%
G 56
 
6.1%
N 56
 
6.1%
A 41
 
4.5%
K 32
 
3.5%
Other values (15) 209
22.7%
Lowercase Letter
ValueCountFrequency (%)
e 11
19.0%
c 7
12.1%
n 6
10.3%
h 6
10.3%
i 5
8.6%
s 4
 
6.9%
d 3
 
5.2%
g 3
 
5.2%
u 2
 
3.4%
t 2
 
3.4%
Other values (7) 9
15.5%
Decimal Number
ValueCountFrequency (%)
2 62
53.9%
1 30
26.1%
3 13
 
11.3%
4 6
 
5.2%
8 2
 
1.7%
9 1
 
0.9%
5 1
 
0.9%
Other Punctuation
ValueCountFrequency (%)
. 50
63.3%
& 24
30.4%
, 3
 
3.8%
/ 1
 
1.3%
1
 
1.3%
Close Punctuation
ValueCountFrequency (%)
) 2065
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2062
100.0%
Space Separator
ValueCountFrequency (%)
388
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Other Symbol
ValueCountFrequency (%)
12
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24157
80.9%
Common 4732
 
15.8%
Latin 978
 
3.3%
Han 10
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2224
 
9.2%
811
 
3.4%
757
 
3.1%
730
 
3.0%
668
 
2.8%
599
 
2.5%
583
 
2.4%
576
 
2.4%
524
 
2.2%
484
 
2.0%
Other values (515) 16201
67.1%
Latin
ValueCountFrequency (%)
E 111
11.3%
T 108
11.0%
S 99
 
10.1%
C 87
 
8.9%
M 62
 
6.3%
H 59
 
6.0%
G 56
 
5.7%
N 56
 
5.7%
A 41
 
4.2%
K 32
 
3.3%
Other values (32) 267
27.3%
Common
ValueCountFrequency (%)
) 2065
43.6%
( 2062
43.6%
388
 
8.2%
2 62
 
1.3%
. 50
 
1.1%
1 30
 
0.6%
& 24
 
0.5%
- 22
 
0.5%
3 13
 
0.3%
4 6
 
0.1%
Other values (7) 10
 
0.2%
Han
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24145
80.8%
ASCII 5709
 
19.1%
None 13
 
< 0.1%
CJK 10
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2224
 
9.2%
811
 
3.4%
757
 
3.1%
730
 
3.0%
668
 
2.8%
599
 
2.5%
583
 
2.4%
576
 
2.4%
524
 
2.2%
484
 
2.0%
Other values (514) 16189
67.0%
ASCII
ValueCountFrequency (%)
) 2065
36.2%
( 2062
36.1%
388
 
6.8%
E 111
 
1.9%
T 108
 
1.9%
S 99
 
1.7%
C 87
 
1.5%
M 62
 
1.1%
2 62
 
1.1%
H 59
 
1.0%
Other values (48) 606
 
10.6%
None
ValueCountFrequency (%)
12
92.3%
1
 
7.7%
CJK
ValueCountFrequency (%)
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%
1
10.0%

전화번호
Text

MISSING 

Distinct3917
Distinct (%)90.2%
Missing390
Missing (%)8.2%
Memory size37.1 KiB
2023-12-11T07:59:25.266896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.029466
Min length9

Characters and Unicode

Total characters52256
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3569 ?
Unique (%)82.2%

Sample

1st row055-547-1438
2nd row055-287-6661
3rd row055-232-4442
4th row055-251-3040
5th row055-603-3106
ValueCountFrequency (%)
055-271-1980 8
 
0.2%
055-275-7930 5
 
0.1%
055-294-9438 4
 
0.1%
055-282-8080 4
 
0.1%
055-231-4455 4
 
0.1%
055-256-9261 4
 
0.1%
055-551-6601 4
 
0.1%
055-271-1891 4
 
0.1%
055-239-4700 4
 
0.1%
055-295-4401 4
 
0.1%
Other values (3907) 4299
99.0%
2023-12-11T07:59:25.619966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 11517
22.0%
- 8685
16.6%
0 7232
13.8%
2 6035
11.5%
6 3040
 
5.8%
7 2914
 
5.6%
1 2887
 
5.5%
8 2683
 
5.1%
9 2655
 
5.1%
3 2492
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 43571
83.4%
Dash Punctuation 8685
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 11517
26.4%
0 7232
16.6%
2 6035
13.9%
6 3040
 
7.0%
7 2914
 
6.7%
1 2887
 
6.6%
8 2683
 
6.2%
9 2655
 
6.1%
3 2492
 
5.7%
4 2116
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 8685
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 52256
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 11517
22.0%
- 8685
16.6%
0 7232
13.8%
2 6035
11.5%
6 3040
 
5.8%
7 2914
 
5.6%
1 2887
 
5.5%
8 2683
 
5.1%
9 2655
 
5.1%
3 2492
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 52256
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 11517
22.0%
- 8685
16.6%
0 7232
13.8%
2 6035
11.5%
6 3040
 
5.8%
7 2914
 
5.6%
1 2887
 
5.5%
8 2683
 
5.1%
9 2655
 
5.1%
3 2492
 
4.8%

팩스번호
Text

MISSING 

Distinct3659
Distinct (%)89.2%
Missing631
Missing (%)13.3%
Memory size37.1 KiB
2023-12-11T07:59:25.830200image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.02096
Min length11

Characters and Unicode

Total characters49322
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3286 ?
Unique (%)80.1%

Sample

1st row055-263-6665
2nd row055-232-3663
3rd row055-251-4619
4th row055-603-3114
5th row055-715-3923
ValueCountFrequency (%)
055-275-7931 7
 
0.2%
055-271-1982 5
 
0.1%
055-294-5155 4
 
0.1%
055-286-9322 4
 
0.1%
055-282-9320 4
 
0.1%
055-296-6441 4
 
0.1%
055-239-3682 4
 
0.1%
055-256-9167 4
 
0.1%
055-265-3667 4
 
0.1%
055-295-8907 4
 
0.1%
Other values (3649) 4059
98.9%
2023-12-11T07:59:26.415992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 10935
22.2%
- 8206
16.6%
0 6315
12.8%
2 5848
11.9%
6 2966
 
6.0%
9 2801
 
5.7%
7 2627
 
5.3%
8 2532
 
5.1%
3 2501
 
5.1%
1 2377
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 41116
83.4%
Dash Punctuation 8206
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 10935
26.6%
0 6315
15.4%
2 5848
14.2%
6 2966
 
7.2%
9 2801
 
6.8%
7 2627
 
6.4%
8 2532
 
6.2%
3 2501
 
6.1%
1 2377
 
5.8%
4 2214
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 8206
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 49322
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 10935
22.2%
- 8206
16.6%
0 6315
12.8%
2 5848
11.9%
6 2966
 
6.0%
9 2801
 
5.7%
7 2627
 
5.3%
8 2532
 
5.1%
3 2501
 
5.1%
1 2377
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 49322
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 10935
22.2%
- 8206
16.6%
0 6315
12.8%
2 5848
11.9%
6 2966
 
6.0%
9 2801
 
5.7%
7 2627
 
5.3%
8 2532
 
5.1%
3 2501
 
5.1%
1 2377
 
4.8%

주소
Text

Distinct3989
Distinct (%)84.3%
Missing0
Missing (%)0.0%
Memory size37.1 KiB
2023-12-11T07:59:26.726670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length73
Median length59
Mean length35.164977
Min length21

Characters and Unicode

Total characters166471
Distinct characters469
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3516 ?
Unique (%)74.3%

Sample

1st row경상남도 창원시 진해구 웅천동로 204 (성내동)
2nd row경상남도 창원시 성산구 완암로 50, 테크동 4층 410호 (성산동, SK테크노파크)
3rd row경상남도 창원시 마산회원구 내서읍 중리공단로 149 (현대자동차서비스)
4th row경상남도 창원시 마산회원구 봉암공단9길 10 (봉암동, (유)두루산업사)
5th row경상남도 창원시 성산구 반월로 16 (신촌동)
ValueCountFrequency (%)
경상남도 4735
 
13.8%
창원시 4735
 
13.8%
성산구 1727
 
5.0%
의창구 1363
 
4.0%
마산회원구 872
 
2.5%
팔용동 795
 
2.3%
성산동 589
 
1.7%
완암로 556
 
1.6%
50 503
 
1.5%
sk테크노파크 483
 
1.4%
Other values (3358) 18074
52.5%
2023-12-11T07:59:27.186721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29789
 
17.9%
6454
 
3.9%
6008
 
3.6%
5959
 
3.6%
5026
 
3.0%
4886
 
2.9%
4782
 
2.9%
4771
 
2.9%
4763
 
2.9%
4760
 
2.9%
Other values (459) 89273
53.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 100029
60.1%
Space Separator 29789
 
17.9%
Decimal Number 21904
 
13.2%
Close Punctuation 4635
 
2.8%
Open Punctuation 4633
 
2.8%
Other Punctuation 3033
 
1.8%
Uppercase Letter 1345
 
0.8%
Dash Punctuation 1077
 
0.6%
Lowercase Letter 23
 
< 0.1%
Other Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6454
 
6.5%
6008
 
6.0%
5959
 
6.0%
5026
 
5.0%
4886
 
4.9%
4782
 
4.8%
4771
 
4.8%
4763
 
4.8%
4760
 
4.8%
4625
 
4.6%
Other values (400) 47995
48.0%
Uppercase Letter
ValueCountFrequency (%)
S 516
38.4%
K 500
37.2%
B 91
 
6.8%
C 37
 
2.8%
T 29
 
2.2%
A 27
 
2.0%
N 20
 
1.5%
E 18
 
1.3%
G 16
 
1.2%
D 13
 
1.0%
Other values (13) 78
 
5.8%
Lowercase Letter
ValueCountFrequency (%)
a 5
21.7%
n 4
17.4%
c 3
13.0%
e 2
 
8.7%
x 1
 
4.3%
t 1
 
4.3%
s 1
 
4.3%
h 1
 
4.3%
y 1
 
4.3%
r 1
 
4.3%
Other values (3) 3
13.0%
Decimal Number
ValueCountFrequency (%)
1 4560
20.8%
2 3067
14.0%
3 2560
11.7%
5 2220
10.1%
0 2117
9.7%
4 1868
8.5%
6 1628
 
7.4%
7 1532
 
7.0%
8 1178
 
5.4%
9 1174
 
5.4%
Other Punctuation
ValueCountFrequency (%)
, 2982
98.3%
· 28
 
0.9%
. 14
 
0.5%
& 7
 
0.2%
/ 2
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 4634
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 4632
> 99.9%
[ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
29789
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1077
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 100031
60.1%
Common 65072
39.1%
Latin 1368
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6454
 
6.5%
6008
 
6.0%
5959
 
6.0%
5026
 
5.0%
4886
 
4.9%
4782
 
4.8%
4771
 
4.8%
4763
 
4.8%
4760
 
4.8%
4625
 
4.6%
Other values (401) 47997
48.0%
Latin
ValueCountFrequency (%)
S 516
37.7%
K 500
36.5%
B 91
 
6.7%
C 37
 
2.7%
T 29
 
2.1%
A 27
 
2.0%
N 20
 
1.5%
E 18
 
1.3%
G 16
 
1.2%
D 13
 
1.0%
Other values (26) 101
 
7.4%
Common
ValueCountFrequency (%)
29789
45.8%
) 4634
 
7.1%
( 4632
 
7.1%
1 4560
 
7.0%
2 3067
 
4.7%
, 2982
 
4.6%
3 2560
 
3.9%
5 2220
 
3.4%
0 2117
 
3.3%
4 1868
 
2.9%
Other values (12) 6643
 
10.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 100025
60.1%
ASCII 66412
39.9%
None 30
 
< 0.1%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29789
44.9%
) 4634
 
7.0%
( 4632
 
7.0%
1 4560
 
6.9%
2 3067
 
4.6%
, 2982
 
4.5%
3 2560
 
3.9%
5 2220
 
3.3%
0 2117
 
3.2%
4 1868
 
2.8%
Other values (47) 7983
 
12.0%
Hangul
ValueCountFrequency (%)
6454
 
6.5%
6008
 
6.0%
5959
 
6.0%
5026
 
5.0%
4886
 
4.9%
4782
 
4.8%
4771
 
4.8%
4763
 
4.8%
4760
 
4.8%
4625
 
4.6%
Other values (397) 47991
48.0%
None
ValueCountFrequency (%)
· 28
93.3%
2
 
6.7%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Distinct878
Distinct (%)18.5%
Missing0
Missing (%)0.0%
Memory size37.1 KiB
2023-12-11T07:59:27.489690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length30
Mean length16.203211
Min length3

Characters and Unicode

Total characters76706
Distinct characters323
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique467 ?
Unique (%)9.9%

Sample

1st row구조용 금속 판제품 및 공작물 제조업 외 1 종
2nd row배전반 및 전기 자동제어반 제조업 외 12 종
3rd row자동차 구조 및 장치 변경업 외 1 종
4th row그 외 기타 전자부품 제조업 외 1 종
5th row그 외 자동차용 신품 부품 제조업 외 3 종
ValueCountFrequency (%)
제조업 3601
 
14.7%
2451
 
10.0%
2123
 
8.7%
1791
 
7.3%
기타 1056
 
4.3%
1 910
 
3.7%
659
 
2.7%
절삭가공 635
 
2.6%
유사처리업 635
 
2.6%
금속 523
 
2.1%
Other values (607) 10150
41.4%
2023-12-11T07:59:27.942628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
19809
25.8%
4910
 
6.4%
4350
 
5.7%
4088
 
5.3%
3221
 
4.2%
2491
 
3.2%
2125
 
2.8%
1799
 
2.3%
1331
 
1.7%
1192
 
1.6%
Other values (313) 31390
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54590
71.2%
Space Separator 19809
 
25.8%
Decimal Number 1955
 
2.5%
Other Punctuation 320
 
0.4%
Close Punctuation 16
 
< 0.1%
Open Punctuation 16
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4910
 
9.0%
4350
 
8.0%
4088
 
7.5%
3221
 
5.9%
2491
 
4.6%
2125
 
3.9%
1799
 
3.3%
1331
 
2.4%
1192
 
2.2%
1152
 
2.1%
Other values (298) 27931
51.2%
Decimal Number
ValueCountFrequency (%)
1 1064
54.4%
2 299
 
15.3%
3 256
 
13.1%
4 127
 
6.5%
5 70
 
3.6%
6 49
 
2.5%
7 48
 
2.5%
8 20
 
1.0%
0 15
 
0.8%
9 7
 
0.4%
Other Punctuation
ValueCountFrequency (%)
, 312
97.5%
. 8
 
2.5%
Space Separator
ValueCountFrequency (%)
19809
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 54590
71.2%
Common 22116
28.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4910
 
9.0%
4350
 
8.0%
4088
 
7.5%
3221
 
5.9%
2491
 
4.6%
2125
 
3.9%
1799
 
3.3%
1331
 
2.4%
1192
 
2.2%
1152
 
2.1%
Other values (298) 27931
51.2%
Common
ValueCountFrequency (%)
19809
89.6%
1 1064
 
4.8%
, 312
 
1.4%
2 299
 
1.4%
3 256
 
1.2%
4 127
 
0.6%
5 70
 
0.3%
6 49
 
0.2%
7 48
 
0.2%
8 20
 
0.1%
Other values (5) 62
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 54577
71.2%
ASCII 22116
28.8%
Compat Jamo 13
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
19809
89.6%
1 1064
 
4.8%
, 312
 
1.4%
2 299
 
1.4%
3 256
 
1.2%
4 127
 
0.6%
5 70
 
0.3%
6 49
 
0.2%
7 48
 
0.2%
8 20
 
0.1%
Other values (5) 62
 
0.3%
Hangul
ValueCountFrequency (%)
4910
 
9.0%
4350
 
8.0%
4088
 
7.5%
3221
 
5.9%
2491
 
4.6%
2125
 
3.9%
1799
 
3.3%
1331
 
2.4%
1192
 
2.2%
1152
 
2.1%
Other values (297) 27918
51.2%
Compat Jamo
ValueCountFrequency (%)
13
100.0%
Distinct3649
Distinct (%)77.1%
Missing0
Missing (%)0.0%
Memory size37.1 KiB
2023-12-11T07:59:28.222947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length44
Mean length9.3132657
Min length1

Characters and Unicode

Total characters44089
Distinct characters695
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3359 ?
Unique (%)71.0%

Sample

1st row울타리,난간
2nd row분전반, 태양광구조물,가로등
3rd row자동차정비
4th row산업용가습기
5th row자동차부품
ValueCountFrequency (%)
304
 
3.4%
부품 269
 
3.0%
253
 
2.8%
자동차부품 176
 
2.0%
기계부품 167
 
1.9%
159
 
1.8%
공작기계부품 129
 
1.4%
금형 97
 
1.1%
공작기계 96
 
1.1%
가공 83
 
0.9%
Other values (4044) 7270
80.8%
2023-12-11T07:59:28.639023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4301
 
9.8%
2125
 
4.8%
2087
 
4.7%
1817
 
4.1%
, 1784
 
4.0%
1062
 
2.4%
934
 
2.1%
859
 
1.9%
812
 
1.8%
669
 
1.5%
Other values (685) 27639
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33781
76.6%
Space Separator 4301
 
9.8%
Uppercase Letter 2267
 
5.1%
Other Punctuation 1903
 
4.3%
Lowercase Letter 1169
 
2.7%
Open Punctuation 302
 
0.7%
Close Punctuation 301
 
0.7%
Decimal Number 45
 
0.1%
Dash Punctuation 18
 
< 0.1%
Control 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2125
 
6.3%
2087
 
6.2%
1817
 
5.4%
1062
 
3.1%
934
 
2.8%
859
 
2.5%
812
 
2.4%
669
 
2.0%
593
 
1.8%
586
 
1.7%
Other values (613) 22237
65.8%
Uppercase Letter
ValueCountFrequency (%)
C 225
 
9.9%
E 195
 
8.6%
L 170
 
7.5%
S 155
 
6.8%
A 152
 
6.7%
D 140
 
6.2%
R 137
 
6.0%
P 132
 
5.8%
O 126
 
5.6%
T 120
 
5.3%
Other values (16) 715
31.5%
Lowercase Letter
ValueCountFrequency (%)
e 145
12.4%
r 104
 
8.9%
o 94
 
8.0%
l 83
 
7.1%
t 82
 
7.0%
a 80
 
6.8%
c 74
 
6.3%
i 70
 
6.0%
s 63
 
5.4%
n 60
 
5.1%
Other values (14) 314
26.9%
Decimal Number
ValueCountFrequency (%)
3 12
26.7%
1 10
22.2%
2 8
17.8%
8 3
 
6.7%
5 3
 
6.7%
4 3
 
6.7%
7 2
 
4.4%
0 2
 
4.4%
9 1
 
2.2%
6 1
 
2.2%
Other Punctuation
ValueCountFrequency (%)
, 1784
93.7%
/ 50
 
2.6%
. 40
 
2.1%
' 17
 
0.9%
& 10
 
0.5%
; 1
 
0.1%
1
 
0.1%
Space Separator
ValueCountFrequency (%)
4301
100.0%
Open Punctuation
ValueCountFrequency (%)
( 302
100.0%
Close Punctuation
ValueCountFrequency (%)
) 301
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33781
76.6%
Common 6872
 
15.6%
Latin 3436
 
7.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2125
 
6.3%
2087
 
6.2%
1817
 
5.4%
1062
 
3.1%
934
 
2.8%
859
 
2.5%
812
 
2.4%
669
 
2.0%
593
 
1.8%
586
 
1.7%
Other values (613) 22237
65.8%
Latin
ValueCountFrequency (%)
C 225
 
6.5%
E 195
 
5.7%
L 170
 
4.9%
S 155
 
4.5%
A 152
 
4.4%
e 145
 
4.2%
D 140
 
4.1%
R 137
 
4.0%
P 132
 
3.8%
O 126
 
3.7%
Other values (40) 1859
54.1%
Common
ValueCountFrequency (%)
4301
62.6%
, 1784
26.0%
( 302
 
4.4%
) 301
 
4.4%
/ 50
 
0.7%
. 40
 
0.6%
- 18
 
0.3%
' 17
 
0.2%
3 12
 
0.2%
1 10
 
0.1%
Other values (12) 37
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33781
76.6%
ASCII 10307
 
23.4%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4301
41.7%
, 1784
17.3%
( 302
 
2.9%
) 301
 
2.9%
C 225
 
2.2%
E 195
 
1.9%
L 170
 
1.6%
S 155
 
1.5%
A 152
 
1.5%
e 145
 
1.4%
Other values (61) 2577
25.0%
Hangul
ValueCountFrequency (%)
2125
 
6.3%
2087
 
6.2%
1817
 
5.4%
1062
 
3.1%
934
 
2.8%
859
 
2.5%
812
 
2.4%
669
 
2.0%
593
 
1.8%
586
 
1.7%
Other values (613) 22237
65.8%
None
ValueCountFrequency (%)
1
100.0%

데이터제공일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size37.1 KiB
2019-01-31
4734 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-01-31
2nd row2019-01-31
3rd row2019-01-31
4th row2019-01-31
5th row2019-01-31

Common Values

ValueCountFrequency (%)
2019-01-31 4734
100.0%

Length

2023-12-11T07:59:28.760699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:59:28.848788image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-01-31 4734
100.0%

Missing values

2023-12-11T07:59:24.229513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:59:24.332394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:59:24.419140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체명전화번호팩스번호주소업종명생산품데이터제공일자
0(사)한국장애인사회복지회사회복지부055-547-1438<NA>경상남도 창원시 진해구 웅천동로 204 (성내동)구조용 금속 판제품 및 공작물 제조업 외 1 종울타리,난간2019-01-31
1(유)경남라이팅055-287-6661055-263-6665경상남도 창원시 성산구 완암로 50, 테크동 4층 410호 (성산동, SK테크노파크)배전반 및 전기 자동제어반 제조업 외 12 종분전반, 태양광구조물,가로등2019-01-31
2(유)국제보링1급정비055-232-4442055-232-3663경상남도 창원시 마산회원구 내서읍 중리공단로 149 (현대자동차서비스)자동차 구조 및 장치 변경업 외 1 종자동차정비2019-01-31
3(유)두루산업사055-251-3040055-251-4619경상남도 창원시 마산회원구 봉암공단9길 10 (봉암동, (유)두루산업사)그 외 기타 전자부품 제조업 외 1 종산업용가습기2019-01-31
4(유)리앤리테크055-603-3106055-603-3114경상남도 창원시 성산구 반월로 16 (신촌동)그 외 자동차용 신품 부품 제조업 외 3 종자동차부품2019-01-31
5(유)반디엘이디055-715-3921055-715-3923경상남도 창원시 마산합포구 진북면 정현리 789-1번지일반용 전기 조명장치 제조업조명기구,기기2019-01-31
6(유)범아산전055-277-0061055-277-0062경상남도 창원시 의창구 태복산로7번길 1 (도계동)배전반 및 전기 자동제어반 제조업전기제어부분품2019-01-31
7(유)부길055-271-5080<NA>경상남도 창원시 마산합포구 진북면 부평리 342번지그 외 기타 플라스틱 제품 제조업플라스틱칩2019-01-31
8(유)삼송 창원공장055-284-8661055-284-8660경상남도 창원시 성산구 정동로62번길 30 (성주동) (총 2 필지) (총 2 필지)그 외 자동차용 신품 부품 제조업 외 3 종자동차용 안전밸브2019-01-31
9(유)송원산업055-271-8791<NA>경상남도 창원시 마산합포구 진북면 망곡리 83-3번지선박 구성 부분품 제조업선박의장부품2019-01-31
업체명전화번호팩스번호주소업종명생산품데이터제공일자
4724휘문정판지기공업사055-292-6701055-292-6700경상남도 창원시 의창구 팔용로345번길 25 (팔용동)금속 절삭기계 제조업 외 1 종기계부품2019-01-31
4725흥일제재소<NA><NA>경상남도 창원시 의창구 대산면 주남로382번길 49 (총 2 필지)일반 제재업각재 및 판재2019-01-31
4726흥일파티션055-256-9707055-252-9707경상남도 창원시 의창구 대산면 주남로382번길 49기타 목재가구 제조업 외 2 종파티션2019-01-31
4727흥진기업070-8871-0444<NA>경상남도 창원시 성산구 성주로137번길 21 (남산동)절연 코드세트 및 기타 도체 제조업에어컨전선제조2019-01-31
4728흥진테크055-267-5642055-261-5642경상남도 창원시 성산구 웅남로 502 (성산동)절삭가공 및 유사처리업금형부품2019-01-31
4729흥진항공산업055-255-9827055-255-9828경상남도 창원시 의창구 사화로 222-20 (팔용동, 흥진기계)육상 금속 골조 구조재 제조업 외 3 종항공기기체부품 등2019-01-31
4730희성테크055-286-8450055-286-8453경상남도 창원시 성산구 월림로67번길 5 (신촌동, 신촌동공장)경 인쇄업 외 1 종라벨류2019-01-31
4731희창정밀<NA>055-261-5298경상남도 창원시 성산구 월림로25번길 9 (신촌동)절삭가공 및 유사처리업기계부품2019-01-31
4732히팅플러스<NA><NA>경상남도 창원시 성산구 완암로 50, 넥스동 7층 705 (성산동, SK테크노파크)그 외 기타 금속가공업금속 구조물2019-01-31
4733힘멜테크055-276-5764055-901-0260경상남도 창원시 성산구 완암로 50, 테크동 9층 913 (성산동, SK테크노파크)기타 인쇄업 외 1 종인쇄물2019-01-31

Duplicate rows

Most frequently occurring

업체명전화번호팩스번호주소업종명생산품데이터제공일자# duplicates
0(주)가현055-295-3261055-295-3260경상남도 창원시 의창구 평산로38번길 13 (팔용동)건설 및 채광용 기계장비 제조업 외 1 종조선기자재, 중장비부품2019-01-312
1(주)비티엑스코리아 창원공장055-281-4301055-281-4340경상남도 창원시 성산구 웅남로 760 (성주동)그 외 자동차용 신품 부품 제조업 외 3 종자동차부품(차유리조립,브레이크파이프 등)2019-01-312
2진성금속055-552-2208055-552-0208경상남도 창원시 진해구 남영로522번길 21 (남양동, 쌍화흥조조창)알루미늄주물 주조업선박용전기부품2019-01-312