Overview

Dataset statistics

Number of variables: 9
Number of observations: 52
Missing cells: 6
Missing cells (%): 1.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.9 KiB
Average record size in memory: 76.5 B
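
The missing-cell percentage can be checked from the counts above. This is a minimal sketch, assuming the percentage is simply missing cells divided by the total number of cells (variables × observations); the variable names are illustrative only.

```python
# Counts copied from the Dataset statistics table above.
n_variables = 9
n_observations = 52
missing_cells = 6

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

Rounded to one decimal place, the result agrees with the reported 1.3%.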

Variable types

Numeric: 2
Text: 6
Categorical: 1

Dataset

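Description: Status of factories registered in Dong-gu, Busan Metropolitan City. Provides factory registration data (company name, address, contact details, etc.).
Author: Dong-gu, Busan Metropolitan City (부산광역시 동구)
URL: https://www.data.go.kr/data/3076852/fileData.do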

Alerts

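사업유형 (business type) has constant value "제조업" (manufacturing) [Constant]
업종명 (industry name) has 2 (3.8%) missing values [Missing]
전화번호 (phone number) has 4 (7.7%) missing values [Missing]
순번 (serial number) has unique values [Unique]
공장대표주소(도로명) (factory main address, road name) has unique values [Unique]
종업원수 (number of employees) has 4 (7.7%) zeros [Zeros]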
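
The alerts above can be re-derived from the per-column counts reported later in this document. The sketch below uses a deliberately simplified rule set (constant, missing, unique, zeros); it is an assumption for illustration and not ydata-profiling's actual alert logic. The `alerts_for` helper is hypothetical.

```python
# Per-column counts copied from the variable sections of this report.
N_ROWS = 52

columns = {
    "사업유형": {"distinct": 1, "missing": 0},
    "업종명": {"distinct": 44, "missing": 2},
    "전화번호": {"distinct": 45, "missing": 4},
    "순번": {"distinct": 52, "missing": 0, "zeros": 0},
    "공장대표주소(도로명)": {"distinct": 52, "missing": 0},
    "종업원수": {"distinct": 23, "missing": 0, "zeros": 4},
}


def alerts_for(name, stats, n_rows=N_ROWS):
    """Return alert strings for one column using simplified, assumed rules."""
    found = []
    if stats["distinct"] == 1:
        found.append(f"{name} has constant value")
    if stats["missing"] > 0:
        pct = 100 * stats["missing"] / n_rows
        found.append(f"{name} has {stats['missing']} ({pct:.1f}%) missing values")
    if stats["distinct"] == n_rows:
        found.append(f"{name} has unique values")
    if stats.get("zeros"):  # text columns carry no zero count
        pct = 100 * stats["zeros"] / n_rows
        found.append(f"{name} has {stats['zeros']} ({pct:.1f}%) zeros")
    return found


all_alerts = [a for name, s in columns.items() for a in alerts_for(name, s)]
for alert in all_alerts:
    print(alert)
```

The percentages are computed against all 52 observations, which matches the 3.8% and 7.7% figures in the alert list.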

Reproduction

Analysis started: 2023-12-12 00:49:44.399717
Analysis finished: 2023-12-12 00:49:45.947500
Duration: 1.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 52
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26.5
Minimum: 1
Maximum: 52
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 600.0 B

Quantile statistics

Minimum: 1
5th percentile: 3.55
Q1: 13.75
Median: 26.5
Q3: 39.25
95th percentile: 49.45
Maximum: 52
Range: 51
Interquartile range (IQR): 25.5

Descriptive statistics

Standard deviation: 15.154757
Coefficient of variation (CV): 0.57187763
Kurtosis: -1.2
Mean: 26.5
Median Absolute Deviation (MAD): 13
Skewness: 0
Sum: 1378
Variance: 229.66667
Monotonicity: Strictly increasing

Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 1.9%
28 | 1 | 1.9%
30 | 1 | 1.9%
31 | 1 | 1.9%
32 | 1 | 1.9%
33 | 1 | 1.9%
34 | 1 | 1.9%
35 | 1 | 1.9%
36 | 1 | 1.9%
37 | 1 | 1.9%
Other values (42) | 42 | 80.8%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.9%
2 | 1 | 1.9%
3 | 1 | 1.9%
4 | 1 | 1.9%
5 | 1 | 1.9%
6 | 1 | 1.9%
7 | 1 | 1.9%
8 | 1 | 1.9%
9 | 1 | 1.9%
10 | 1 | 1.9%

Maximum 10 values

Value | Count | Frequency (%)
52 | 1 | 1.9%
51 | 1 | 1.9%
50 | 1 | 1.9%
49 | 1 | 1.9%
48 | 1 | 1.9%
47 | 1 | 1.9%
46 | 1 | 1.9%
45 | 1 | 1.9%
44 | 1 | 1.9%
43 | 1 | 1.9%
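
The 순번 statistics can be reproduced with the Python standard library. This is a sketch under one assumption: that 순번 holds the integers 1 through 52, which is consistent with 52 distinct values, a minimum of 1, a maximum of 52 and a sum of 1378. The quantiles use linear interpolation between order statistics (`method="inclusive"`), which is an assumption about how the report computed them.

```python
import statistics

# Assumption: 순번 is the gap-free sequence 1..52.
values = list(range(1, 53))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation

# Quartiles and 5th/95th percentiles with linear interpolation.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(f"sum={sum(values)} mean={mean} variance={variance:.5f} std={std:.6f}")
print(f"cv={cv:.8f} q1={q1} median={median} q3={q3} iqr={q3 - q1}")
print(f"p5={p5:.2f} p95={p95:.2f} mad={mad}")
```

Under these assumptions the computed values agree with the quantile and descriptive statistics listed for 순번.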
회사명 (company name)
Text

Distinct: 50
Distinct (%): 96.2%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 18
Median length: 12
Mean length: 7.3846154
Min length: 4

Characters and Unicode

Total characters: 384
Distinct characters: 147
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 49
Unique (%): 94.2%

Sample

1st row: (주)대림테크
2nd row: (주)디엘시스템
3rd row: (주)무한손
4th row: (주)미화합동
5th row: (주)삼우이머션
ValueCountFrequency (%)
주식회사 8
 
12.3%
제이솔루션 3
 
4.6%
주)우협텍스타일 1
 
1.5%
삼보제약 1
 
1.5%
삼화텔콤 1
 
1.5%
세종어패럴 1
 
1.5%
신우어패럴 1
 
1.5%
씨케이섬유 1
 
1.5%
아티스린넨(주 1
 
1.5%
영진식품 1
 
1.5%
Other values (46) 46
70.8%

Most occurring characters

ValueCountFrequency (%)
33
 
8.6%
) 24
 
6.2%
( 20
 
5.2%
16
 
4.2%
13
 
3.4%
12
 
3.1%
11
 
2.9%
9
 
2.3%
8
 
2.1%
6
 
1.6%
Other values (137) 232
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 304
79.2%
Close Punctuation 24
 
6.2%
Uppercase Letter 21
 
5.5%
Open Punctuation 20
 
5.2%
Space Separator 13
 
3.4%
Other Punctuation 1
 
0.3%
Lowercase Letter 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
10.9%
16
 
5.3%
12
 
3.9%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
6
 
2.0%
6
 
2.0%
5
 
1.6%
Other values (119) 192
63.2%
Uppercase Letter
ValueCountFrequency (%)
E 4
19.0%
P 3
14.3%
D 2
9.5%
N 2
9.5%
U 2
9.5%
L 1
 
4.8%
O 1
 
4.8%
T 1
 
4.8%
J 1
 
4.8%
G 1
 
4.8%
Other values (3) 3
14.3%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Lowercase Letter
ValueCountFrequency (%)
r 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 304
79.2%
Common 58
 
15.1%
Latin 22
 
5.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
10.9%
16
 
5.3%
12
 
3.9%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
6
 
2.0%
6
 
2.0%
5
 
1.6%
Other values (119) 192
63.2%
Latin
ValueCountFrequency (%)
E 4
18.2%
P 3
13.6%
D 2
9.1%
N 2
9.1%
U 2
9.1%
L 1
 
4.5%
O 1
 
4.5%
T 1
 
4.5%
J 1
 
4.5%
G 1
 
4.5%
Other values (4) 4
18.2%
Common
ValueCountFrequency (%)
) 24
41.4%
( 20
34.5%
13
22.4%
. 1
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 304
79.2%
ASCII 80
 
20.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
33
 
10.9%
16
 
5.3%
12
 
3.9%
11
 
3.6%
9
 
3.0%
8
 
2.6%
6
 
2.0%
6
 
2.0%
6
 
2.0%
5
 
1.6%
Other values (119) 192
63.2%
ASCII
ValueCountFrequency (%)
) 24
30.0%
( 20
25.0%
13
16.2%
E 4
 
5.0%
P 3
 
3.8%
D 2
 
2.5%
N 2
 
2.5%
U 2
 
2.5%
L 1
 
1.2%
O 1
 
1.2%
Other values (8) 8
 
10.0%
대표자명 (representative name)
Text

Distinct: 50
Distinct (%): 96.2%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 6
Median length: 3
Mean length: 3.0384615
Min length: 2

Characters and Unicode

Total characters: 158
Distinct characters: 77
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 49
Unique (%): 94.2%

Sample

1st row: 김영호
2nd row: 윤숙자
3rd row: 정학봉
4th row: 이성진
5th row: 김대희
ValueCountFrequency (%)
장홍석 3
 
5.8%
조인석 1
 
1.9%
인성진 1
 
1.9%
김장규 1
 
1.9%
강영민 1
 
1.9%
최호남 1
 
1.9%
조성주 1
 
1.9%
조정옥 1
 
1.9%
이미숙 1
 
1.9%
김태규 1
 
1.9%
Other values (40) 40
76.9%

Most occurring characters

ValueCountFrequency (%)
13
 
8.2%
7
 
4.4%
6
 
3.8%
6
 
3.8%
5
 
3.2%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (67) 101
63.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 157
99.4%
Other Punctuation 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
8.3%
7
 
4.5%
6
 
3.8%
6
 
3.8%
5
 
3.2%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (66) 100
63.7%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 157
99.4%
Common 1
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
8.3%
7
 
4.5%
6
 
3.8%
6
 
3.8%
5
 
3.2%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (66) 100
63.7%
Common
ValueCountFrequency (%)
, 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 157
99.4%
ASCII 1
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
8.3%
7
 
4.5%
6
 
3.8%
6
 
3.8%
5
 
3.2%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
4
 
2.5%
Other values (66) 100
63.7%
ASCII
ValueCountFrequency (%)
, 1
100.0%
공장대표주소(도로명) (factory main address, road name)
Text

UNIQUE

Distinct: 52
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 48
Median length: 35
Mean length: 30.461538
Min length: 22

Characters and Unicode

Total characters: 1584
Distinct characters: 93
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 52
Unique (%): 100.0%

Sample

1st row: 부산광역시 동구 성남로49번길 9 (좌천동, 세방기업) 홍익빌딩 4층
2nd row: 부산광역시 동구 성남로49번길 9 (좌천동) 홍익무선 빌딩 1층
3rd row: 부산광역시 동구 범일일길11번가길 19, 1층 (범일동)
4th row: 부산광역시 동구 중앙대로 243-3 (초량동) 외 1필지
5th row: 부산광역시 동구 중앙대로274번길 7-3, 상가 104호(초량동, 부산역 유림 줄리엣)
ValueCountFrequency (%)
부산광역시 52
 
16.0%
동구 52
 
16.0%
초량동 20
 
6.2%
범일동 16
 
4.9%
좌천동 9
 
2.8%
중앙대로 7
 
2.2%
3층 5
 
1.5%
초량중로 4
 
1.2%
수정동 4
 
1.2%
4
 
1.2%
Other values (115) 151
46.6%

Most occurring characters

ValueCountFrequency (%)
272
 
17.2%
108
 
6.8%
) 57
 
3.6%
( 57
 
3.6%
55
 
3.5%
54
 
3.4%
53
 
3.3%
52
 
3.3%
52
 
3.3%
52
 
3.3%
Other values (83) 772
48.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 896
56.6%
Space Separator 272
 
17.2%
Decimal Number 243
 
15.3%
Close Punctuation 57
 
3.6%
Open Punctuation 57
 
3.6%
Other Punctuation 42
 
2.7%
Dash Punctuation 15
 
0.9%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
108
 
12.1%
55
 
6.1%
54
 
6.0%
53
 
5.9%
52
 
5.8%
52
 
5.8%
52
 
5.8%
51
 
5.7%
30
 
3.3%
30
 
3.3%
Other values (67) 359
40.1%
Decimal Number
ValueCountFrequency (%)
1 50
20.6%
3 44
18.1%
2 35
14.4%
4 24
9.9%
9 22
9.1%
0 20
 
8.2%
5 13
 
5.3%
8 12
 
4.9%
6 12
 
4.9%
7 11
 
4.5%
Space Separator
ValueCountFrequency (%)
272
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%
Other Punctuation
ValueCountFrequency (%)
, 42
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 896
56.6%
Common 686
43.3%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
108
 
12.1%
55
 
6.1%
54
 
6.0%
53
 
5.9%
52
 
5.8%
52
 
5.8%
52
 
5.8%
51
 
5.7%
30
 
3.3%
30
 
3.3%
Other values (67) 359
40.1%
Common
ValueCountFrequency (%)
272
39.7%
) 57
 
8.3%
( 57
 
8.3%
1 50
 
7.3%
3 44
 
6.4%
, 42
 
6.1%
2 35
 
5.1%
4 24
 
3.5%
9 22
 
3.2%
0 20
 
2.9%
Other values (5) 63
 
9.2%
Latin
ValueCountFrequency (%)
B 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 896
56.6%
ASCII 688
43.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
272
39.5%
) 57
 
8.3%
( 57
 
8.3%
1 50
 
7.3%
3 44
 
6.4%
, 42
 
6.1%
2 35
 
5.1%
4 24
 
3.5%
9 22
 
3.2%
0 20
 
2.9%
Other values (6) 65
 
9.4%
Hangul
ValueCountFrequency (%)
108
 
12.1%
55
 
6.1%
54
 
6.0%
53
 
5.9%
52
 
5.8%
52
 
5.8%
52
 
5.8%
51
 
5.7%
30
 
3.3%
30
 
3.3%
Other values (67) 359
40.1%

업종명 (industry name)
Text

MISSING

Distinct: 44
Distinct (%): 88.0%
Missing: 2
Missing (%): 3.8%
Memory size: 548.0 B

Length

Max length: 30
Median length: 22.5
Mean length: 16.68
Min length: 6

Characters and Unicode

Total characters: 834
Distinct characters: 120
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 38
Unique (%): 76.0%

Sample

1st row: 유선 통신장비 제조업 외 3 종
2nd row: 방송장비 제조업 외 3 종
3rd row: 한복 제조업 외 5 종
4th row: 장류 제조업
5th row: 기타 무선 통신장비 제조업 외 2 종
ValueCountFrequency (%)
제조업 42
 
15.1%
33
 
11.8%
31
 
11.1%
17
 
6.1%
기타 12
 
4.3%
2 8
 
2.9%
통신장비 7
 
2.5%
4 7
 
2.5%
1 6
 
2.2%
무선 5
 
1.8%
Other values (75) 111
39.8%

Most occurring characters

ValueCountFrequency (%)
229
27.5%
53
 
6.4%
52
 
6.2%
48
 
5.8%
35
 
4.2%
32
 
3.8%
25
 
3.0%
17
 
2.0%
17
 
2.0%
12
 
1.4%
Other values (110) 314
37.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 565
67.7%
Space Separator 229
27.5%
Decimal Number 34
 
4.1%
Other Punctuation 6
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
53
 
9.4%
52
 
9.2%
48
 
8.5%
35
 
6.2%
32
 
5.7%
25
 
4.4%
17
 
3.0%
17
 
3.0%
12
 
2.1%
12
 
2.1%
Other values (100) 262
46.4%
Decimal Number
ValueCountFrequency (%)
1 10
29.4%
2 8
23.5%
4 7
20.6%
3 3
 
8.8%
9 2
 
5.9%
5 2
 
5.9%
0 1
 
2.9%
6 1
 
2.9%
Space Separator
ValueCountFrequency (%)
229
100.0%
Other Punctuation
ValueCountFrequency (%)
, 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 565
67.7%
Common 269
32.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
53
 
9.4%
52
 
9.2%
48
 
8.5%
35
 
6.2%
32
 
5.7%
25
 
4.4%
17
 
3.0%
17
 
3.0%
12
 
2.1%
12
 
2.1%
Other values (100) 262
46.4%
Common
ValueCountFrequency (%)
229
85.1%
1 10
 
3.7%
2 8
 
3.0%
4 7
 
2.6%
, 6
 
2.2%
3 3
 
1.1%
9 2
 
0.7%
5 2
 
0.7%
0 1
 
0.4%
6 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 565
67.7%
ASCII 269
32.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
229
85.1%
1 10
 
3.7%
2 8
 
3.0%
4 7
 
2.6%
, 6
 
2.2%
3 3
 
1.1%
9 2
 
0.7%
5 2
 
0.7%
0 1
 
0.4%
6 1
 
0.4%
Hangul
ValueCountFrequency (%)
53
 
9.4%
52
 
9.2%
48
 
8.5%
35
 
6.2%
32
 
5.7%
25
 
4.4%
17
 
3.0%
17
 
3.0%
12
 
2.1%
12
 
2.1%
Other values (100) 262
46.4%

사업유형 (business type)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 제조업
2nd row: 제조업
3rd row: 제조업
4th row: 제조업
5th row: 제조업

Common Values

Value | Count | Frequency (%)
제조업 (manufacturing) | 52 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
제조업 (manufacturing) | 52 | 100.0%

전화번호 (phone number)
Text

MISSING

Distinct: 45
Distinct (%): 93.8%
Missing: 4
Missing (%): 7.7%
Memory size: 548.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.020833
Min length: 12

Characters and Unicode

Total characters: 577
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 43
Unique (%): 89.6%

Sample

1st row: 051-441-5151
2nd row: 051-441-4700
3rd row: 051-647-4488
4th row: 051-468-2675
5th row: 051-977-0301
ValueCountFrequency (%)
051-466-1980 3
 
6.2%
051-644-6938 2
 
4.2%
051-467-6002 1
 
2.1%
051-636-8304 1
 
2.1%
051-467-8888 1
 
2.1%
051-467-1049 1
 
2.1%
051-464-8500 1
 
2.1%
051-311-2907 1
 
2.1%
051-630-3114 1
 
2.1%
051-466-7400 1
 
2.1%
Other values (35) 35
72.9%

Most occurring characters

ValueCountFrequency (%)
- 96
16.6%
1 83
14.4%
0 80
13.9%
5 68
11.8%
6 55
9.5%
4 52
9.0%
8 35
 
6.1%
7 35
 
6.1%
2 29
 
5.0%
3 24
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 481
83.4%
Dash Punctuation 96
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 83
17.3%
0 80
16.6%
5 68
14.1%
6 55
11.4%
4 52
10.8%
8 35
7.3%
7 35
7.3%
2 29
 
6.0%
3 24
 
5.0%
9 20
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 96
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 577
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 96
16.6%
1 83
14.4%
0 80
13.9%
5 68
11.8%
6 55
9.5%
4 52
9.0%
8 35
 
6.1%
7 35
 
6.1%
2 29
 
5.0%
3 24
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 577
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 96
16.6%
1 83
14.4%
0 80
13.9%
5 68
11.8%
6 55
9.5%
4 52
9.0%
8 35
 
6.1%
7 35
 
6.1%
2 29
 
5.0%
3 24
 
4.2%
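
The 전화번호 statistics above (only digits and hyphens, 96 hyphens across 48 non-missing values, lengths of 12 to 13) suggest a uniform hyphenated format. The sketch below checks values against an assumed pattern; the regular expression and the `is_valid_phone` helper are hypothetical and inferred from the sample rows, not taken from the dataset's documentation.

```python
import re

# Assumed pattern: area code, exchange, subscriber number, separated by hyphens.
PHONE_PATTERN = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")

# Sample rows copied from the 전화번호 section of this report.
samples = [
    "051-441-5151",
    "051-441-4700",
    "051-647-4488",
    "051-468-2675",
    "051-977-0301",
]


def is_valid_phone(value):
    """Return True if value is a string matching the assumed phone format."""
    return isinstance(value, str) and PHONE_PATTERN.fullmatch(value) is not None


for number in samples:
    print(number, is_valid_phone(number))

# Missing entries (shown as <NA> in the report) are rejected, not treated as valid.
print(None, is_valid_phone(None))
```

A check like this keeps the four missing phone numbers visible as failures rather than silently passing them.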

종업원수 (number of employees)
Real number (ℝ)

ZEROS

Distinct: 23
Distinct (%): 44.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.826923
Minimum: 0
Maximum: 92
Zeros: 4
Zeros (%): 7.7%
Negative: 0
Negative (%): 0.0%
Memory size: 600.0 B

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 3
Median: 5
Q3: 11.25
95th percentile: 61
Maximum: 92
Range: 92
Interquartile range (IQR): 8.25

Descriptive statistics

Standard deviation: 18.780174
Coefficient of variation (CV): 1.4641215
Kurtosis: 7.0023627
Mean: 12.826923
Median Absolute Deviation (MAD): 3
Skewness: 2.6133758
Sum: 667
Variance: 352.69495
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=23)

Common values

Value | Count | Frequency (%)
4 | 7 | 13.5%
3 | 6 | 11.5%
5 | 6 | 11.5%
0 | 4 | 7.7%
2 | 4 | 7.7%
7 | 3 | 5.8%
10 | 3 | 5.8%
16 | 2 | 3.8%
61 | 2 | 3.8%
11 | 2 | 3.8%
Other values (13) | 13 | 25.0%

Minimum 10 values

Value | Count | Frequency (%)
0 | 4 | 7.7%
1 | 1 | 1.9%
2 | 4 | 7.7%
3 | 6 | 11.5%
4 | 7 | 13.5%
5 | 6 | 11.5%
6 | 1 | 1.9%
7 | 3 | 5.8%
8 | 1 | 1.9%
9 | 1 | 1.9%

Maximum 10 values

Value | Count | Frequency (%)
92 | 1 | 1.9%
63 | 1 | 1.9%
61 | 2 | 3.8%
40 | 1 | 1.9%
32 | 1 | 1.9%
26 | 1 | 1.9%
25 | 1 | 1.9%
22 | 1 | 1.9%
20 | 1 | 1.9%
16 | 2 | 3.8%
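
The derived statistics for 종업원수 can be cross-checked against each other. The full column is not listed in this report, so the sketch below only verifies internal consistency of the published figures (mean from sum and count, CV from standard deviation and mean, IQR from the quartiles, variance from the standard deviation); it is not a recomputation from raw data.

```python
# Figures copied from the 종업원수 section of this report.
n = 52
total = 667
std = 18.780174
q1, q3 = 3, 11.25
median = 5

mean = total / n        # reported as 12.826923
cv = std / mean         # reported as 1.4641215
iqr = q3 - q1           # reported as 8.25
variance = std ** 2     # reported as 352.69495

print(f"mean={mean:.6f} cv={cv:.7f} iqr={iqr} variance={variance:.5f}")

# A mean well above the median is consistent with the positive skewness reported.
print("mean exceeds median:", mean > median)
```

The small difference in the last digits of the variance comes from squaring the already rounded standard deviation.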
생산품 (products)
Text

Distinct: 48
Distinct (%): 92.3%
Missing: 0
Missing (%): 0.0%
Memory size: 548.0 B

Length

Max length: 53
Median length: 18
Mean length: 8.9038462
Min length: 2

Characters and Unicode

Total characters: 463
Distinct characters: 143
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 45
Unique (%): 86.5%

Sample

1st row: 무선방송수신기, CCTV, 위성중계기, 동보장치, 무선방송송신기, 위성안테나, CCTV용 전원부
2nd row: 무선방송수신기, CCTV, 동보장치 등
3rd row: 한복
4th row: 된장
5th row: 포터블 VR KIT
ValueCountFrequency (%)
cctv 4
 
3.7%
4
 
3.7%
유니폼 3
 
2.8%
3
 
2.8%
의류 3
 
2.8%
인쇄물 3
 
2.8%
전광판 3
 
2.8%
화장품 2
 
1.9%
무선방송수신기 2
 
1.9%
동보장치 2
 
1.9%
Other values (74) 78
72.9%

Most occurring characters

ValueCountFrequency (%)
55
 
11.9%
, 33
 
7.1%
14
 
3.0%
11
 
2.4%
C 10
 
2.2%
9
 
1.9%
9
 
1.9%
8
 
1.7%
8
 
1.7%
8
 
1.7%
Other values (133) 298
64.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 346
74.7%
Space Separator 55
 
11.9%
Other Punctuation 33
 
7.1%
Uppercase Letter 25
 
5.4%
Lowercase Letter 4
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
4.0%
11
 
3.2%
9
 
2.6%
9
 
2.6%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
6
 
1.7%
6
 
1.7%
Other values (122) 260
75.1%
Uppercase Letter
ValueCountFrequency (%)
C 10
40.0%
T 6
24.0%
V 6
24.0%
I 1
 
4.0%
K 1
 
4.0%
R 1
 
4.0%
Lowercase Letter
ValueCountFrequency (%)
c 2
50.0%
v 1
25.0%
t 1
25.0%
Space Separator
ValueCountFrequency (%)
55
100.0%
Other Punctuation
ValueCountFrequency (%)
, 33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 346
74.7%
Common 88
 
19.0%
Latin 29
 
6.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
4.0%
11
 
3.2%
9
 
2.6%
9
 
2.6%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
6
 
1.7%
6
 
1.7%
Other values (122) 260
75.1%
Latin
ValueCountFrequency (%)
C 10
34.5%
T 6
20.7%
V 6
20.7%
c 2
 
6.9%
I 1
 
3.4%
K 1
 
3.4%
R 1
 
3.4%
v 1
 
3.4%
t 1
 
3.4%
Common
ValueCountFrequency (%)
55
62.5%
, 33
37.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 346
74.7%
ASCII 117
 
25.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
55
47.0%
, 33
28.2%
C 10
 
8.5%
T 6
 
5.1%
V 6
 
5.1%
c 2
 
1.7%
I 1
 
0.9%
K 1
 
0.9%
R 1
 
0.9%
v 1
 
0.9%
Hangul
ValueCountFrequency (%)
14
 
4.0%
11
 
3.2%
9
 
2.6%
9
 
2.6%
8
 
2.3%
8
 
2.3%
8
 
2.3%
7
 
2.0%
6
 
1.7%
6
 
1.7%
Other values (122) 260
75.1%

Interactions

[Figures: pairwise interaction plots for the numeric variables 순번 and 종업원수]

Correlations

Variable | 순번 | 회사명 | 대표자명 | 공장대표주소(도로명) | 업종명 | 전화번호 | 종업원수 | 생산품
순번 | 1.000 | 0.965 | 0.965 | 1.000 | 0.838 | 0.959 | 0.265 | 0.683
회사명 | 0.965 | 1.000 | 1.000 | 1.000 | 0.954 | 1.000 | 0.000 | 0.967
대표자명 | 0.965 | 1.000 | 1.000 | 1.000 | 0.954 | 1.000 | 0.000 | 0.967
공장대표주소(도로명) | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
업종명 | 0.838 | 0.954 | 0.954 | 1.000 | 1.000 | 0.981 | 0.930 | 0.984
전화번호 | 0.959 | 1.000 | 1.000 | 1.000 | 0.981 | 1.000 | 0.543 | 0.966
종업원수 | 0.265 | 0.000 | 0.000 | 1.000 | 0.930 | 0.543 | 1.000 | 0.943
생산품 | 0.683 | 0.967 | 0.967 | 1.000 | 0.984 | 0.966 | 0.943 | 1.000

Variable | 순번 | 종업원수
순번 | 1.000 | -0.095
종업원수 | -0.095 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

index | 순번 | 회사명 | 대표자명 | 공장대표주소(도로명) | 업종명 | 사업유형 | 전화번호 | 종업원수 | 생산품
0 | 1 | (주)대림테크 | 김영호 | 부산광역시 동구 성남로49번길 9 (좌천동, 세방기업) 홍익빌딩 4층 | 유선 통신장비 제조업 외 3 종 | 제조업 | 051-441-5151 | 7 | 무선방송수신기, CCTV, 위성중계기, 동보장치, 무선방송송신기, 위성안테나, CCTV용 전원부
1 | 2 | (주)디엘시스템 | 윤숙자 | 부산광역시 동구 성남로49번길 9 (좌천동) 홍익무선 빌딩 1층 | 방송장비 제조업 외 3 종 | 제조업 | 051-441-4700 | 4 | 무선방송수신기, CCTV, 동보장치 등
2 | 3 | (주)무한손 | 정학봉 | 부산광역시 동구 범일일길11번가길 19, 1층 (범일동) | 한복 제조업 외 5 종 | 제조업 | 051-647-4488 | 10 | 한복
3 | 4 | (주)미화합동 | 이성진 | 부산광역시 동구 중앙대로 243-3 (초량동) 외 1필지 | 장류 제조업 | 제조업 | 051-468-2675 | 16 | 된장
4 | 5 | (주)삼우이머션 | 김대희 | 부산광역시 동구 중앙대로274번길 7-3, 상가 104호(초량동, 부산역 유림 줄리엣) | 기타 무선 통신장비 제조업 외 2 종 | 제조업 | 051-977-0301 | 63 | 포터블 VR KIT
5 | 6 | (주)상록수 | 박옥남 | 부산광역시 동구 자성로 110, (3층,4층,5층) (범일동) | 근무복, 작업복 및 유사의복 제조업 외 11 종 | 제조업 | 051-632-7654 | 16 | 유니폼,근무복
6 | 7 | (주)수성하이텍 | 이기현 | 부산광역시 동구 고관로 164, 일신빌딩 503호 (좌천동) | 배전반 및 전기 자동제어반 제조업 | 제조업 | 051-417-2508 | 3 | 전기판넬
7 | 8 | (주)신일디엔피 | 권봉재 | 부산광역시 동구 자성로133번길 31, 1층 (범일동) | 경 인쇄업 외 5 종 | 제조업 | 051-469-8275 | 22 | 인쇄물
8 | 9 | (주)에디넷 | 이병걸 | 부산광역시 동구 중앙대로308번길 3-6, 지하1층 (초량동) | 컴퓨터 제조업 외 4 종 | 제조업 | 051-465-0573 | 9 | 관제장비
9 | 10 | (주)이오코리아 | 임승미 | 부산광역시 동구 중앙대로 290, 3층(초량동, 한서빌딩) (초량동) | 화장품 제조업 | 제조업 | 051-256-2525 | 11 | 화장품

Last rows

index | 순번 | 회사명 | 대표자명 | 공장대표주소(도로명) | 업종명 | 사업유형 | 전화번호 | 종업원수 | 생산품
42 | 43 | 주식회사 누리아이코리아 | 안병호 | 부산광역시 동구 중앙대로251번길 11 (초량동) | 기타 음향기기 제조업 외 10 종 | 제조업 | 051-466-6666 | 10 | CCTV, 방송장비 등
43 | 44 | 주식회사 센트프로 | 조미내 | 부산광역시 동구 중앙대로296번길 3-3, 301호 (초량동) | 표면광택제 및 실내가향제 제조업 | 제조업 | <NA> | 0 | 방향제
44 | 45 | 주식회사 제이솔루션 | 장홍석 | 부산광역시 동구 초량로13번길 87, 3층 (초량동) | 전시 및 광고용 조명장치 제조업 외 1 종 | 제조업 | 051-466-1980 | 61 | 전광판
45 | 46 | 주식회사 제이솔루션 | 장홍석 | 부산광역시 동구 초량중로 14, 102호 (초량동, 애뜰안) | 육상 금속 골조 구조재 제조업 외 6 종 | 제조업 | 051-466-1980 | 61 | 전원공급장치
46 | 47 | 주식회사 제이솔루션 | 장홍석 | 부산광역시 동구 초량중로 18, 1층 (초량동) | 전시 및 광고용 조명장치 제조업 외 9 종 | 제조업 | 051-466-1980 | 25 | 전광판, 구내방송장치 제조
47 | 48 | 주식회사 조방 | 변재하 | 부산광역시 동구 자성로108번길 7 (범일동) 201 | 근무복, 작업복 및 유사의복 제조업 | 제조업 | <NA> | 5 | 근무복, 작업복 및 유사의복 제조업
48 | 49 | 초량식품 | 윤민자 | 부산광역시 동구 초량로13번길 35 (초량동) | 기타 수산동물 가공 및 저장 처리업 | 제조업 | 051-467-6002 | 5 | 어묵
49 | 50 | 탑 TOP LEE EUN JUNG | 이은정 | 부산광역시 동구 자성공원로 7, 우제빌딩 4층(범일동) | 근무복, 작업복 및 유사의복 제조업 외 4 종 | 제조업 | <NA> | 0 | 의류, 오버로크
50 | 51 | 토마스코퍼레이션(주) | 인성진 | 부산광역시 동구 중앙대로286번길 7-3, 1,2층 (초량동) | 선박 구성 부분품 제조업 | 제조업 | 051-441-8088 | 11 | 선박엔진부품
51 | 52 | 현대산업사 | 최명호 | 부산광역시 동구 중앙대로 498 (범일동) | 그 외 기타 플라스틱 제품 제조업 | 제조업 | 051-644-7770 | 5 | 자동차 부품, 콘테이너부품