Overview

Dataset statistics

Number of variables12
Number of observations145
Missing cells17
Missing cells (%)1.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.1 KiB
Average record size in memory99.9 B

Variable types

Numeric3
Text6
Categorical3

Dataset

Description경상남도 거제시 공장등록현황(회사명, 단지명, 설립구분, 전화번호, 생산품, 우편번호, 공장주소, 업종명, 위도, 경도, 기준일자)에 대한 정보를 제공합니다.
Author경상남도 거제시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15034978

Alerts

경도 is highly overall correlated with 위도 and 1 other fieldsHigh correlation
위도 is highly overall correlated with 경도 and 1 other fieldsHigh correlation
단지명 is highly overall correlated with 설립구분High correlation
설립구분 is highly overall correlated with 단지명High correlation
공장우편번호 is highly overall correlated with 경도 and 1 other fieldsHigh correlation
단지명 is highly imbalanced (81.3%)Imbalance
설립구분 is highly imbalanced (61.7%)Imbalance
전화번호 has 17 (11.7%) missing valuesMissing
순번 has unique valuesUnique
회사명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 23:44:37.912578
Analysis finished2023-12-10 23:44:40.027778
Duration2.12 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct145
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean73
Minimum1
Maximum145
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-11T08:44:40.106604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile8.2
Q137
median73
Q3109
95-th percentile137.8
Maximum145
Range144
Interquartile range (IQR)72

Descriptive statistics

Standard deviation42.001984
Coefficient of variation (CV)0.57536964
Kurtosis-1.2
Mean73
Median Absolute Deviation (MAD)36
Skewness0
Sum10585
Variance1764.1667
MonotonicityStrictly increasing
2023-12-11T08:44:40.563206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
110 1
 
0.7%
94 1
 
0.7%
95 1
 
0.7%
96 1
 
0.7%
97 1
 
0.7%
98 1
 
0.7%
99 1
 
0.7%
100 1
 
0.7%
101 1
 
0.7%
Other values (135) 135
93.1%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
145 1
0.7%
144 1
0.7%
143 1
0.7%
142 1
0.7%
141 1
0.7%
140 1
0.7%
139 1
0.7%
138 1
0.7%
137 1
0.7%
136 1
0.7%

회사명
Text

UNIQUE 

Distinct145
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T08:44:40.827251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length14
Mean length8.0413793
Min length2

Characters and Unicode

Total characters1166
Distinct characters204
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)100.0%

Sample

1st row(주)거제전통메주
2nd row(주)건화
3rd row(주)건화 성포공장
4th row(주)금화
5th row(주)남명
ValueCountFrequency (%)
주식회사 13
 
7.2%
제2공장 3
 
1.7%
주)신풍 2
 
1.1%
어업회사법인 2
 
1.1%
주)장한 2
 
1.1%
신화기업(주 2
 
1.1%
농업회사법인 2
 
1.1%
서진중공업 2
 
1.1%
광신기계산업(주 2
 
1.1%
삼녹eng 2
 
1.1%
Other values (146) 148
82.2%
2023-12-11T08:44:41.199876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
104
 
8.9%
( 86
 
7.4%
) 86
 
7.4%
41
 
3.5%
35
 
3.0%
32
 
2.7%
32
 
2.7%
26
 
2.2%
25
 
2.1%
25
 
2.1%
Other values (194) 674
57.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 927
79.5%
Open Punctuation 86
 
7.4%
Close Punctuation 86
 
7.4%
Space Separator 35
 
3.0%
Uppercase Letter 23
 
2.0%
Decimal Number 7
 
0.6%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
104
 
11.2%
41
 
4.4%
32
 
3.5%
32
 
3.5%
26
 
2.8%
25
 
2.7%
25
 
2.7%
21
 
2.3%
20
 
2.2%
18
 
1.9%
Other values (174) 583
62.9%
Uppercase Letter
ValueCountFrequency (%)
G 3
13.0%
E 3
13.0%
M 2
8.7%
P 2
8.7%
N 2
8.7%
S 2
8.7%
U 2
8.7%
F 2
8.7%
I 1
 
4.3%
K 1
 
4.3%
Other values (3) 3
13.0%
Decimal Number
ValueCountFrequency (%)
2 6
85.7%
1 1
 
14.3%
Other Punctuation
ValueCountFrequency (%)
& 1
50.0%
, 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 86
100.0%
Close Punctuation
ValueCountFrequency (%)
) 86
100.0%
Space Separator
ValueCountFrequency (%)
35
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 927
79.5%
Common 216
 
18.5%
Latin 23
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
104
 
11.2%
41
 
4.4%
32
 
3.5%
32
 
3.5%
26
 
2.8%
25
 
2.7%
25
 
2.7%
21
 
2.3%
20
 
2.2%
18
 
1.9%
Other values (174) 583
62.9%
Latin
ValueCountFrequency (%)
G 3
13.0%
E 3
13.0%
M 2
8.7%
P 2
8.7%
N 2
8.7%
S 2
8.7%
U 2
8.7%
F 2
8.7%
I 1
 
4.3%
K 1
 
4.3%
Other values (3) 3
13.0%
Common
ValueCountFrequency (%)
( 86
39.8%
) 86
39.8%
35
16.2%
2 6
 
2.8%
& 1
 
0.5%
, 1
 
0.5%
1 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 927
79.5%
ASCII 239
 
20.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
104
 
11.2%
41
 
4.4%
32
 
3.5%
32
 
3.5%
26
 
2.8%
25
 
2.7%
25
 
2.7%
21
 
2.3%
20
 
2.2%
18
 
1.9%
Other values (174) 583
62.9%
ASCII
ValueCountFrequency (%)
( 86
36.0%
) 86
36.0%
35
14.6%
2 6
 
2.5%
G 3
 
1.3%
E 3
 
1.3%
M 2
 
0.8%
P 2
 
0.8%
N 2
 
0.8%
S 2
 
0.8%
Other values (10) 12
 
5.0%

단지명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
135 
거제오비일반산업단지
 
6
거제모사산업단지
 
1
옥포국가산업단지
 
1
죽도국가산업단지
 
1

Length

Max length12
Median length1
Mean length1.5931034
Min length1

Unique

Unique4 ?
Unique (%)2.8%

Sample

1st row
2nd row거제모사산업단지
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
135
93.1%
거제오비일반산업단지 6
 
4.1%
거제모사산업단지 1
 
0.7%
옥포국가산업단지 1
 
0.7%
죽도국가산업단지 1
 
0.7%
거제한내조선특화농공단지 1
 
0.7%

Length

2023-12-11T08:44:41.364460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:44:41.477696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
거제오비일반산업단지 6
60.0%
거제모사산업단지 1
 
10.0%
옥포국가산업단지 1
 
10.0%
죽도국가산업단지 1
 
10.0%
거제한내조선특화농공단지 1
 
10.0%

설립구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct5
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
일반
121 
창업
14 
일반산업단지
 
7
국가산업단지
 
2
농공단지
 
1

Length

Max length6
Median length2
Mean length2.262069
Min length2

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row일반
2nd row일반산업단지
3rd row일반
4th row창업
5th row일반

Common Values

ValueCountFrequency (%)
일반 121
83.4%
창업 14
 
9.7%
일반산업단지 7
 
4.8%
국가산업단지 2
 
1.4%
농공단지 1
 
0.7%

Length

2023-12-11T08:44:41.629048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T08:44:41.768925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 121
83.4%
창업 14
 
9.7%
일반산업단지 7
 
4.8%
국가산업단지 2
 
1.4%
농공단지 1
 
0.7%

전화번호
Text

MISSING 

Distinct116
Distinct (%)90.6%
Missing17
Missing (%)11.7%
Memory size1.3 KiB
2023-12-11T08:44:42.002867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.007812
Min length12

Characters and Unicode

Total characters1537
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique104 ?
Unique (%)81.2%

Sample

1st row055-633-2270
2nd row055-639-5300
3rd row055-680-5600
4th row055-638-2974
5th row055-633-8230
ValueCountFrequency (%)
055-633-4960 2
 
1.6%
055-633-0560 2
 
1.6%
055-633-3340 2
 
1.6%
055-636-6434 2
 
1.6%
055-633-8200 2
 
1.6%
055-632-4060 2
 
1.6%
055-633-5104 2
 
1.6%
055-636-2155 2
 
1.6%
055-633-0034 2
 
1.6%
055-630-9895 2
 
1.6%
Other values (106) 108
84.4%
2023-12-11T08:44:42.398931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 315
20.5%
- 256
16.7%
0 222
14.4%
6 199
12.9%
3 194
12.6%
2 88
 
5.7%
4 65
 
4.2%
1 60
 
3.9%
8 50
 
3.3%
7 47
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1281
83.3%
Dash Punctuation 256
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 315
24.6%
0 222
17.3%
6 199
15.5%
3 194
15.1%
2 88
 
6.9%
4 65
 
5.1%
1 60
 
4.7%
8 50
 
3.9%
7 47
 
3.7%
9 41
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 256
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1537
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 315
20.5%
- 256
16.7%
0 222
14.4%
6 199
12.9%
3 194
12.6%
2 88
 
5.7%
4 65
 
4.2%
1 60
 
3.9%
8 50
 
3.3%
7 47
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1537
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 315
20.5%
- 256
16.7%
0 222
14.4%
6 199
12.9%
3 194
12.6%
2 88
 
5.7%
4 65
 
4.2%
1 60
 
3.9%
8 50
 
3.3%
7 47
 
3.1%
Distinct106
Distinct (%)73.1%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T08:44:42.676049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length24
Mean length8.4206897
Min length2

Characters and Unicode

Total characters1221
Distinct characters241
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique93 ?
Unique (%)64.1%

Sample

1st row메주, 두부, 장류
2nd row선박구성부분품,도장및기타피막처리업
3rd row조선기자재
4th row선박구성부분품(계측관)
5th row콘크리트폰툰
ValueCountFrequency (%)
선박구성부분품 14
 
6.3%
철의장품 13
 
5.9%
4
 
1.8%
레미콘 4
 
1.8%
굴가공류 4
 
1.8%
액젖류 3
 
1.4%
3
 
1.4%
pipe 3
 
1.4%
강선 3
 
1.4%
선박 3
 
1.4%
Other values (152) 167
75.6%
2023-12-11T08:44:43.180806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
77
 
6.3%
62
 
5.1%
49
 
4.0%
, 48
 
3.9%
36
 
2.9%
36
 
2.9%
32
 
2.6%
32
 
2.6%
26
 
2.1%
24
 
2.0%
Other values (231) 799
65.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 965
79.0%
Uppercase Letter 90
 
7.4%
Space Separator 77
 
6.3%
Other Punctuation 52
 
4.3%
Open Punctuation 12
 
1.0%
Close Punctuation 12
 
1.0%
Lowercase Letter 8
 
0.7%
Decimal Number 5
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
6.4%
49
 
5.1%
36
 
3.7%
36
 
3.7%
32
 
3.3%
32
 
3.3%
26
 
2.7%
24
 
2.5%
22
 
2.3%
19
 
2.0%
Other values (195) 627
65.0%
Uppercase Letter
ValueCountFrequency (%)
E 10
11.1%
C 10
11.1%
O 10
11.1%
P 9
10.0%
S 8
 
8.9%
L 7
 
7.8%
A 5
 
5.6%
B 4
 
4.4%
R 3
 
3.3%
T 3
 
3.3%
Other values (9) 21
23.3%
Lowercase Letter
ValueCountFrequency (%)
r 2
25.0%
o 2
25.0%
e 1
12.5%
l 1
12.5%
c 1
12.5%
i 1
12.5%
Decimal Number
ValueCountFrequency (%)
6 1
20.0%
9 1
20.0%
7 1
20.0%
0 1
20.0%
1 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 48
92.3%
. 3
 
5.8%
/ 1
 
1.9%
Space Separator
ValueCountFrequency (%)
77
100.0%
Open Punctuation
ValueCountFrequency (%)
( 12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 965
79.0%
Common 158
 
12.9%
Latin 98
 
8.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
6.4%
49
 
5.1%
36
 
3.7%
36
 
3.7%
32
 
3.3%
32
 
3.3%
26
 
2.7%
24
 
2.5%
22
 
2.3%
19
 
2.0%
Other values (195) 627
65.0%
Latin
ValueCountFrequency (%)
E 10
 
10.2%
C 10
 
10.2%
O 10
 
10.2%
P 9
 
9.2%
S 8
 
8.2%
L 7
 
7.1%
A 5
 
5.1%
B 4
 
4.1%
R 3
 
3.1%
T 3
 
3.1%
Other values (15) 29
29.6%
Common
ValueCountFrequency (%)
77
48.7%
, 48
30.4%
( 12
 
7.6%
) 12
 
7.6%
. 3
 
1.9%
6 1
 
0.6%
9 1
 
0.6%
7 1
 
0.6%
0 1
 
0.6%
1 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 965
79.0%
ASCII 256
 
21.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
77
30.1%
, 48
18.8%
( 12
 
4.7%
) 12
 
4.7%
E 10
 
3.9%
C 10
 
3.9%
O 10
 
3.9%
P 9
 
3.5%
S 8
 
3.1%
L 7
 
2.7%
Other values (26) 53
20.7%
Hangul
ValueCountFrequency (%)
62
 
6.4%
49
 
5.1%
36
 
3.7%
36
 
3.7%
32
 
3.3%
32
 
3.3%
26
 
2.7%
24
 
2.5%
22
 
2.3%
19
 
2.0%
Other values (195) 627
65.0%

공장우편번호
Categorical

HIGH CORRELATION 

Distinct35
Distinct (%)24.1%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
53207
19 
53206
16 
53276
11 
53275
53202
 
7
Other values (30)
83 

Length

Max length7
Median length5
Mean length5.0137931
Min length5

Unique

Unique13 ?
Unique (%)9.0%

Sample

1st row53331
2nd row53206
3rd row53276
4th row53275
5th row53208

Common Values

ValueCountFrequency (%)
53207 19
 
13.1%
53206 16
 
11.0%
53276 11
 
7.6%
53275 9
 
6.2%
53202 7
 
4.8%
53277 7
 
4.8%
53278 7
 
4.8%
53279 7
 
4.8%
53205 6
 
4.1%
53274 6
 
4.1%
Other values (25) 50
34.5%

Length

2023-12-11T08:44:43.369306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
53207 19
 
13.1%
53206 16
 
11.0%
53276 11
 
7.6%
53275 9
 
6.2%
53202 7
 
4.8%
53277 7
 
4.8%
53278 7
 
4.8%
53279 7
 
4.8%
53274 6
 
4.1%
53205 6
 
4.1%
Other values (25) 50
34.5%
Distinct140
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T08:44:43.565604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length49
Mean length30.110345
Min length18

Characters and Unicode

Total characters4366
Distinct characters162
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)93.1%

Sample

1st row경상남도 거제시 동부면 삼거림1길 20 (총 2 필지)
2nd row경상남도 거제시 연초면 연하해안로 841-54 (연초면)
3rd row경상남도 거제시 사등면 성포로 303 (사등면) (총 12 필지)
4th row경상남도 거제시 사등면 거제대로 6025-17 (㈜정화) (총 3 필지)
5th row경상남도 거제시 연초면 소오비길 26
ValueCountFrequency (%)
경상남도 145
 
14.8%
거제시 145
 
14.8%
필지 51
 
5.2%
51
 
5.2%
연초면 49
 
5.0%
사등면 48
 
4.9%
연하해안로 28
 
2.9%
2 21
 
2.1%
거제대로 16
 
1.6%
둔덕면 14
 
1.4%
Other values (275) 412
42.0%
2023-12-11T08:44:43.903108image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
837
 
19.2%
185
 
4.2%
185
 
4.2%
154
 
3.5%
151
 
3.5%
148
 
3.4%
146
 
3.3%
145
 
3.3%
145
 
3.3%
( 130
 
3.0%
Other values (152) 2140
49.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2512
57.5%
Space Separator 837
 
19.2%
Decimal Number 651
 
14.9%
Open Punctuation 130
 
3.0%
Close Punctuation 130
 
3.0%
Dash Punctuation 68
 
1.6%
Other Punctuation 28
 
0.6%
Other Symbol 6
 
0.1%
Uppercase Letter 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
185
 
7.4%
185
 
7.4%
154
 
6.1%
151
 
6.0%
148
 
5.9%
146
 
5.8%
145
 
5.8%
145
 
5.8%
96
 
3.8%
80
 
3.2%
Other values (131) 1077
42.9%
Decimal Number
ValueCountFrequency (%)
1 118
18.1%
2 95
14.6%
3 86
13.2%
4 73
11.2%
5 73
11.2%
0 57
8.8%
7 45
 
6.9%
9 36
 
5.5%
6 35
 
5.4%
8 33
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
A 1
25.0%
G 1
25.0%
M 1
25.0%
P 1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 21
75.0%
: 7
 
25.0%
Space Separator
ValueCountFrequency (%)
837
100.0%
Open Punctuation
ValueCountFrequency (%)
( 130
100.0%
Close Punctuation
ValueCountFrequency (%)
) 130
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 68
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2518
57.7%
Common 1844
42.2%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
185
 
7.3%
185
 
7.3%
154
 
6.1%
151
 
6.0%
148
 
5.9%
146
 
5.8%
145
 
5.8%
145
 
5.8%
96
 
3.8%
80
 
3.2%
Other values (132) 1083
43.0%
Common
ValueCountFrequency (%)
837
45.4%
( 130
 
7.0%
) 130
 
7.0%
1 118
 
6.4%
2 95
 
5.2%
3 86
 
4.7%
4 73
 
4.0%
5 73
 
4.0%
- 68
 
3.7%
0 57
 
3.1%
Other values (6) 177
 
9.6%
Latin
ValueCountFrequency (%)
A 1
25.0%
G 1
25.0%
M 1
25.0%
P 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2512
57.5%
ASCII 1848
42.3%
None 6
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
837
45.3%
( 130
 
7.0%
) 130
 
7.0%
1 118
 
6.4%
2 95
 
5.1%
3 86
 
4.7%
4 73
 
4.0%
5 73
 
4.0%
- 68
 
3.7%
0 57
 
3.1%
Other values (10) 181
 
9.8%
Hangul
ValueCountFrequency (%)
185
 
7.4%
185
 
7.4%
154
 
6.1%
151
 
6.0%
148
 
5.9%
146
 
5.8%
145
 
5.8%
145
 
5.8%
96
 
3.8%
80
 
3.2%
Other values (131) 1077
42.9%
None
ValueCountFrequency (%)
6
100.0%
Distinct142
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T08:44:44.187090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length49
Mean length26.462069
Min length18

Characters and Unicode

Total characters3837
Distinct characters124
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique139 ?
Unique (%)95.9%

Sample

1st row경상남도 거제시 동부면 부춘리 781번지
2nd row경상남도 거제시 연초면 한내리 777번지
3rd row경상남도 거제시 사등면 성포리 1번지 외 11 필지
4th row경상남도 거제시 사등면 청곡리 204번지 외 2 필지
5th row경상남도 거제시 연초면 오비리 133-8번지
ValueCountFrequency (%)
경상남도 145
16.5%
거제시 145
16.5%
연초면 45
 
5.1%
사등면 44
 
5.0%
필지 39
 
4.4%
39
 
4.4%
오비리 22
 
2.5%
한내리 17
 
1.9%
둔덕면 14
 
1.6%
1 13
 
1.5%
Other values (228) 358
40.6%
2023-12-11T08:44:44.687461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
739
19.3%
197
 
5.1%
158
 
4.1%
157
 
4.1%
150
 
3.9%
148
 
3.9%
1 146
 
3.8%
146
 
3.8%
145
 
3.8%
145
 
3.8%
Other values (114) 1706
44.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2393
62.4%
Space Separator 739
 
19.3%
Decimal Number 590
 
15.4%
Dash Punctuation 76
 
2.0%
Close Punctuation 14
 
0.4%
Open Punctuation 14
 
0.4%
Other Punctuation 7
 
0.2%
Uppercase Letter 3
 
0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
197
 
8.2%
158
 
6.6%
157
 
6.6%
150
 
6.3%
148
 
6.2%
146
 
6.1%
145
 
6.1%
145
 
6.1%
145
 
6.1%
145
 
6.1%
Other values (95) 857
35.8%
Decimal Number
ValueCountFrequency (%)
1 146
24.7%
2 77
13.1%
7 55
 
9.3%
0 54
 
9.2%
5 52
 
8.8%
3 50
 
8.5%
4 49
 
8.3%
9 37
 
6.3%
6 36
 
6.1%
8 34
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
G 1
33.3%
M 1
33.3%
P 1
33.3%
Space Separator
ValueCountFrequency (%)
739
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Other Punctuation
ValueCountFrequency (%)
: 7
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2394
62.4%
Common 1440
37.5%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
197
 
8.2%
158
 
6.6%
157
 
6.6%
150
 
6.3%
148
 
6.2%
146
 
6.1%
145
 
6.1%
145
 
6.1%
145
 
6.1%
145
 
6.1%
Other values (96) 858
35.8%
Common
ValueCountFrequency (%)
739
51.3%
1 146
 
10.1%
2 77
 
5.3%
- 76
 
5.3%
7 55
 
3.8%
0 54
 
3.8%
5 52
 
3.6%
3 50
 
3.5%
4 49
 
3.4%
9 37
 
2.6%
Other values (5) 105
 
7.3%
Latin
ValueCountFrequency (%)
G 1
33.3%
M 1
33.3%
P 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2393
62.4%
ASCII 1443
37.6%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
739
51.2%
1 146
 
10.1%
2 77
 
5.3%
- 76
 
5.3%
7 55
 
3.8%
0 54
 
3.7%
5 52
 
3.6%
3 50
 
3.5%
4 49
 
3.4%
9 37
 
2.6%
Other values (8) 108
 
7.5%
Hangul
ValueCountFrequency (%)
197
 
8.2%
158
 
6.6%
157
 
6.6%
150
 
6.3%
148
 
6.2%
146
 
6.1%
145
 
6.1%
145
 
6.1%
145
 
6.1%
145
 
6.1%
Other values (95) 857
35.8%
None
ValueCountFrequency (%)
1
100.0%
Distinct68
Distinct (%)46.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-11T08:44:44.966749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length27
Mean length16.855172
Min length6

Characters and Unicode

Total characters2444
Distinct characters138
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)33.8%

Sample

1st row장류 제조업 외 1 종
2nd row선박 구성 부분품 제조업 외 1 종
3rd row선박 구성부분품 제조업 외 1 종
4th row선박 구성 부분품 제조업
5th row그 외 기타 콘크리트 제품 및 유사제품 제조업
ValueCountFrequency (%)
제조업 122
15.5%
58
 
7.4%
54
 
6.9%
53
 
6.7%
선박 44
 
5.6%
구성 42
 
5.3%
부분품 42
 
5.3%
1 28
 
3.6%
수산동물 26
 
3.3%
기타 25
 
3.2%
Other values (106) 293
37.2%
2023-12-11T08:44:45.401313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
642
26.3%
175
 
7.2%
149
 
6.1%
149
 
6.1%
83
 
3.4%
60
 
2.5%
59
 
2.4%
54
 
2.2%
53
 
2.2%
49
 
2.0%
Other values (128) 971
39.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1735
71.0%
Space Separator 642
 
26.3%
Decimal Number 56
 
2.3%
Other Punctuation 9
 
0.4%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
175
 
10.1%
149
 
8.6%
149
 
8.6%
83
 
4.8%
60
 
3.5%
59
 
3.4%
54
 
3.1%
53
 
3.1%
49
 
2.8%
44
 
2.5%
Other values (115) 860
49.6%
Decimal Number
ValueCountFrequency (%)
1 30
53.6%
2 10
 
17.9%
3 8
 
14.3%
7 3
 
5.4%
5 2
 
3.6%
8 1
 
1.8%
6 1
 
1.8%
4 1
 
1.8%
Other Punctuation
ValueCountFrequency (%)
, 8
88.9%
. 1
 
11.1%
Space Separator
ValueCountFrequency (%)
642
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1735
71.0%
Common 709
29.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
175
 
10.1%
149
 
8.6%
149
 
8.6%
83
 
4.8%
60
 
3.5%
59
 
3.4%
54
 
3.1%
53
 
3.1%
49
 
2.8%
44
 
2.5%
Other values (115) 860
49.6%
Common
ValueCountFrequency (%)
642
90.6%
1 30
 
4.2%
2 10
 
1.4%
, 8
 
1.1%
3 8
 
1.1%
7 3
 
0.4%
5 2
 
0.3%
8 1
 
0.1%
. 1
 
0.1%
6 1
 
0.1%
Other values (3) 3
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1732
70.9%
ASCII 709
29.0%
Compat Jamo 3
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
642
90.6%
1 30
 
4.2%
2 10
 
1.4%
, 8
 
1.1%
3 8
 
1.1%
7 3
 
0.4%
5 2
 
0.3%
8 1
 
0.1%
. 1
 
0.1%
6 1
 
0.1%
Other values (3) 3
 
0.4%
Hangul
ValueCountFrequency (%)
175
 
10.1%
149
 
8.6%
149
 
8.6%
83
 
4.8%
60
 
3.5%
59
 
3.4%
54
 
3.1%
53
 
3.1%
49
 
2.8%
44
 
2.5%
Other values (114) 857
49.5%
Compat Jamo
ValueCountFrequency (%)
3
100.0%

경도
Real number (ℝ)

HIGH CORRELATION 

Distinct135
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.58646
Minimum128.47485
Maximum128.71964
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-11T08:44:45.551152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum128.47485
5-th percentile128.48514
Q1128.53832
median128.59932
Q3128.61796
95-th percentile128.69125
Maximum128.71964
Range0.244783
Interquartile range (IQR)0.079636

Descriptive statistics

Standard deviation0.05931154
Coefficient of variation (CV)0.00046125806
Kurtosis-0.57561638
Mean128.58646
Median Absolute Deviation (MAD)0.041508
Skewness-0.014954125
Sum18645.037
Variance0.0035178588
MonotonicityNot monotonic
2023-12-11T08:44:45.734499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128.618997 3
 
2.1%
128.588134 2
 
1.4%
128.573149 2
 
1.4%
128.602097 2
 
1.4%
128.484175 2
 
1.4%
128.615266 2
 
1.4%
128.60434 2
 
1.4%
128.505699 2
 
1.4%
128.474854 2
 
1.4%
128.572744 1
 
0.7%
Other values (125) 125
86.2%
ValueCountFrequency (%)
128.474854 2
1.4%
128.476425 1
0.7%
128.479165 1
0.7%
128.47934 1
0.7%
128.484175 2
1.4%
128.484738 1
0.7%
128.486731 1
0.7%
128.490803 1
0.7%
128.496862 1
0.7%
128.496944 1
0.7%
ValueCountFrequency (%)
128.719637 1
0.7%
128.711268 1
0.7%
128.711124 1
0.7%
128.709298 1
0.7%
128.706611 1
0.7%
128.705054 1
0.7%
128.695721 1
0.7%
128.692161 1
0.7%
128.687628 1
0.7%
128.679851 1
0.7%

위도
Real number (ℝ)

HIGH CORRELATION 

Distinct135
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.90297
Minimum34.783723
Maximum34.999731
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2023-12-11T08:44:45.915450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum34.783723
5-th percentile34.820763
Q134.883589
median34.911891
Q334.926586
95-th percentile34.963502
Maximum34.999731
Range0.216008
Interquartile range (IQR)0.042997

Descriptive statistics

Standard deviation0.04172986
Coefficient of variation (CV)0.0011955963
Kurtosis0.1207956
Mean34.90297
Median Absolute Deviation (MAD)0.02127
Skewness-0.54550821
Sum5060.9306
Variance0.0017413812
MonotonicityNot monotonic
2023-12-11T08:44:46.081254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
34.917522 3
 
2.1%
34.858856 2
 
1.4%
34.885426 2
 
1.4%
34.927189 2
 
1.4%
34.8557 2
 
1.4%
34.919564 2
 
1.4%
34.925665 2
 
1.4%
34.820136 2
 
1.4%
34.878423 2
 
1.4%
34.884526 1
 
0.7%
Other values (125) 125
86.2%
ValueCountFrequency (%)
34.783723 1
0.7%
34.799896 1
0.7%
34.808668 1
0.7%
34.810037 1
0.7%
34.8141 1
0.7%
34.817371 1
0.7%
34.820136 2
1.4%
34.823269 1
0.7%
34.826439 1
0.7%
34.831545 1
0.7%
ValueCountFrequency (%)
34.999731 1
0.7%
34.985651 1
0.7%
34.977414 1
0.7%
34.97589 1
0.7%
34.973874 1
0.7%
34.963833 1
0.7%
34.963758 1
0.7%
34.963559 1
0.7%
34.963275 1
0.7%
34.962857 1
0.7%

Interactions

2023-12-11T08:44:39.396025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:38.812432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:39.096701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:39.500495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:38.921099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:39.187262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:39.582551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:39.006179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T08:44:39.285324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T08:44:46.194696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번단지명설립구분공장우편번호업종명경도위도
순번1.0000.0000.3220.5220.4730.0250.473
단지명0.0001.0000.9100.7410.9380.2350.000
설립구분0.3220.9101.0000.6050.8590.0000.000
공장우편번호0.5220.7410.6051.0000.9290.9800.979
업종명0.4730.9380.8590.9291.0000.7570.090
경도0.0250.2350.0000.9800.7571.0000.827
위도0.4730.0000.0000.9790.0900.8271.000
2023-12-11T08:44:46.317832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공장우편번호단지명설립구분
공장우편번호1.0000.3840.271
단지명0.3841.0000.858
설립구분0.2710.8581.000
2023-12-11T08:44:46.430170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번경도위도단지명설립구분공장우편번호
순번1.000-0.031-0.0670.0000.0450.184
경도-0.0311.0000.5190.1260.0000.763
위도-0.0670.5191.0000.0000.0000.756
단지명0.0000.1260.0001.0000.8580.384
설립구분0.0450.0000.0000.8581.0000.271
공장우편번호0.1840.7630.7560.3840.2711.000

Missing values

2023-12-11T08:44:39.737463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T08:44:39.952926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번회사명단지명설립구분전화번호생산품공장우편번호공장대표주소공장대표주소(지번)업종명경도위도
01(주)거제전통메주일반055-633-2270메주, 두부, 장류53331경상남도 거제시 동부면 삼거림1길 20 (총 2 필지)경상남도 거제시 동부면 부춘리 781번지장류 제조업 외 1 종128.59830734.810037
12(주)건화거제모사산업단지일반산업단지055-639-5300선박구성부분품,도장및기타피막처리업53206경상남도 거제시 연초면 연하해안로 841-54 (연초면)경상남도 거제시 연초면 한내리 777번지선박 구성 부분품 제조업 외 1 종128.59714334.94207
23(주)건화 성포공장일반055-680-5600조선기자재53276경상남도 거제시 사등면 성포로 303 (사등면) (총 12 필지)경상남도 거제시 사등면 성포리 1번지 외 11 필지선박 구성부분품 제조업 외 1 종128.53407934.91511
34(주)금화창업055-638-2974선박구성부분품(계측관)53275경상남도 거제시 사등면 거제대로 6025-17 (㈜정화) (총 3 필지)경상남도 거제시 사등면 청곡리 204번지 외 2 필지선박 구성 부분품 제조업128.50781934.895723
45(주)남명일반055-633-8230콘크리트폰툰53208경상남도 거제시 연초면 소오비길 26경상남도 거제시 연초면 오비리 133-8번지그 외 기타 콘크리트 제품 및 유사제품 제조업128.630734.900488
56(주)네오하이텍일반055-637-4622LED 조명53276경상남도 거제시 사등면 성포로 92-19, 상가동 101호 (삼우비취맨션)경상남도 거제시 사등면 성포리 367-6번지 삼우비취맨션 상가동 101호일반용 전기 조명장치 제조업128.52301134.91915
67(주)대기공업일반055-636-1462선박구성부분품53277경상남도 거제시 사등면 거제대로 5330-20 (총 3 필지)경상남도 거제시 사등면 사등리 1-3번지 외 2 필지선박 구성 부분품 제조업128.55764334.903387
78(주)대성쏠라일반055-638-1006태양광 구조물53244경상남도 거제시 수양로 180 (양정동, 아주상사)경상남도 거제시 양정동 100번지 아주상사육상 금속 골조 구조재 제조업 외 2 종128.65377234.872089
89(주)대흥일반055-632-6141선박구성부분품53276경상남도 거제시 사등면 성포로 350 (사등면) (총 2 필지)경상남도 거제시 사등면 사등리 2074-1번지 외 1 필지선박 구성 부분품 제조업128.53756534.915209
910(주)동림수산일반055-633-5103굴가공류53279경상남도 거제시 둔덕면 녹산1길 30 (동림수산)경상남도 거제시 둔덕면 술역리 30-18번지기타 수산동물 가공 및 저장 처리업128.49686234.833518
순번회사명단지명설립구분전화번호생산품공장우편번호공장대표주소공장대표주소(지번)업종명경도위도
135136하나단열일반055-636-0470암면제품53286경상남도 거제시 거제면 산촌명진길 72경상남도 거제시 거제면 명진리 267-1번지암면 및 유사제품 제조업 외 1 종128.60717734.838346
136137하이에어코리아(주)일반055-346-3500원형통풍관및연결부분품53275경상남도 거제시 사등면 지석로 26-1 (한국하이프레스거제공장)경상남도 거제시 사등면 지석리 628번지그 외 기타 일반목적용 기계 제조업 외 1 종128.51529534.901022
137138한려농산일반055-632-4717유자청53274경상남도 거제시 사등면 거제대로 6221-1 (한려농산)경상남도 거제시 사등면 오량리 1005번지기타 과실ㆍ채소 가공 및 저장 처리업 외 2 종128.49080334.886148
138139한미산업(주)일반<NA>광고물,금속구조물53278경상남도 거제시 사등면 모래실길 58-21경상남도 거제시 사등면 사곡리 697번지구조용 금속 판제품 및 공작물 제조업 외 3 종128.57784834.900464
139140해금강수산식품일반<NA>멸치액젓53281경상남도 거제시 둔덕면 법동어구로 702 (총 2 필지)경상남도 거제시 둔덕면 하둔리 580번지 외 1 필지수산동물 건조 및 염장품 제조업128.51036834.831545
140141해왕거제오비일반산업단지일반산업단지055-633-7300선박 구성부분품53207경상남도 거제시 연초면 연하해안로 473-41경상남도 거제시 연초면 오비리 1209번지선박 구성 부분품 제조업128.61899734.917522
141142해원일반055-633-4960배관용 파이프53275경상남도 거제시 사등면 지석로 34 (청유식품)경상남도 거제시 사등면 지석리 651번지선박 구성 부분품 제조업128.51636834.901538
142143호진산업(주)일반055-633-2708철의장품53206경상남도 거제시 연초면 연하해안로 725-2 (호진산업(주))경상남도 거제시 연초면 한내리 120-8번지선박 구성 부분품 제조업128.60527134.926586
143144홍근버섯영농조합법인일반<NA>건강기능식품53331경상남도 거제시 동부면 산촌리 412-1번지 외 4필지경상남도 거제시 동부면 산촌리 412-1번지 외 4 필지건강기능식품 제조업128.60740834.836067
144145효진수산일반<NA>수산물절임 젓갈53201경상남도 거제시 장목면 거제북로 1315-18, (구지번:장목리 368-17) (총 3 필지)경상남도 거제시 장목면 장목리 368-17번지 (구지번:장목리 368-17) 외 2 필지수산동물 건조 및 염장품 제조업128.67985134.999731