Overview

Dataset statistics

Number of variables: 9
Number of observations: 2020
Missing cells: 614
Missing cells (%): 3.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 146.1 KiB
Average record size in memory: 74.1 B
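
The overview figures can be cross-checked against the per-variable missing counts reported further down. The sketch below is only a consistency check, not how the report itself was produced. It assumes that the text variable with 2 missing values is 생산품 (its heading did not survive extraction, so this is inferred from the column order of the sample table) and that KiB means 1024 bytes.

```python
# Figures copied from the report; the column attribution of the
# "2 missing" block to 생산품 is an assumption (see lead-in).
n_variables = 9
n_observations = 2020
missing_by_column = {
    "공장대표주소(도로명)": 28,
    "생산품": 2,
    "주원자재": 584,
}

total_cells = n_variables * n_observations
missing_cells = sum(missing_by_column.values())
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
total_bytes = 146.1 * 1024
avg_record_bytes = total_bytes / n_observations

print(missing_cells)               # 614
print(round(missing_pct, 1))       # 3.4
print(round(avg_record_bytes, 1))  # 74.1
```

All three values agree with the overview, so the missing cells are fully accounted for by these three columns.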

Variable types

Numeric: 2
Categorical: 2
Text: 5

Dataset

Description: Current status of the factories registered in Haman-gun (함안군).
Author: Haman-gun, Gyeongsangnam-do (경상남도 함안군)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=3068626

Alerts

대표업종번호 is highly correlated overall with 사업유형 [High correlation]
사업유형 is highly correlated overall with 대표업종번호 [High correlation]
단지명 is highly imbalanced (62.8%) [Imbalance]
사업유형 is highly imbalanced (99.4%) [Imbalance]
공장대표주소(도로명) has 28 (1.4%) missing values [Missing]
주원자재 has 584 (28.9%) missing values [Missing]
순번 has unique values [Unique]
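
The imbalance percentages are not defined in the report. One common definition is one minus the normalized Shannon entropy of the class counts; the sketch below assumes that definition and applies it to the 사업유형 counts given later in the report (2019 and 1). The function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for perfectly balanced classes,
    values near 1 when a single class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Undefined for fewer than two classes; 0 is a convention of this sketch.
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

score = imbalance_score([2019, 1])
print(f"{score:.1%}")  # 99.4%
```

Under this assumed definition the result matches the 99.4% in the alert. The 62.8% for 단지명 cannot be checked the same way because the report lists only the ten most frequent of its 21 categories.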

Reproduction

Analysis started: 2023-12-11 00:34:05.245876
Analysis finished: 2023-12-11 00:34:06.825875
Duration: 1.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
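
The reported duration follows directly from the two timestamps. A minimal check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-11 00:34:05.245876")
finished = datetime.fromisoformat("2023-12-11 00:34:06.825875")

duration = (finished - started).total_seconds()
print(round(duration, 2))  # 1.58
```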

Variables

순번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 2020
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1010.5
Minimum: 1
Maximum: 2020
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 17.9 KiB

Quantile statistics

Minimum: 1
5th percentile: 101.95
Q1: 505.75
Median: 1010.5
Q3: 1515.25
95th percentile: 1919.05
Maximum: 2020
Range: 2019
Interquartile range (IQR): 1009.5
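
The fractional quantiles are consistent with linear interpolation between order statistics, which is the default in NumPy and pandas. Whether the report used exactly this method is an assumption; the sketch below simply reproduces the figures for the sequence 1 to 2020. The helper name `quantile` is hypothetical.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 2021))  # 순번 runs from 1 to 2020 without gaps

p05 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1

print(round(p05, 2), q1, median, q3, round(p95, 2), iqr)
```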

Descriptive statistics

Standard deviation: 583.26809
Coefficient of variation (CV): 0.57720741
Kurtosis: -1.2
Mean: 1010.5
Median Absolute Deviation (MAD): 505
Skewness: 0
Sum: 2041210
Variance: 340201.67
Monotonicity: Strictly increasing
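
Because 순번 is simply the integers 1 to 2020, every descriptive statistic above can be recomputed from scratch. The sketch below uses the standard library; the variance uses the sample (n - 1) denominator, which matches the reported value. The kurtosis line is a plain moment-based estimate, whereas the report may apply a bias correction, so only agreement to one decimal place is expected.

```python
import statistics

values = list(range(1, 2021))
n = len(values)

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)

# Moment-based excess kurtosis (no bias correction; an assumption of this sketch).
m2 = sum((v - mean) ** 2 for v in values) / n
m4 = sum((v - mean) ** 4 for v in values) / n
excess_kurtosis = m4 / m2 ** 2 - 3

print(total, mean, round(variance, 2), round(std, 5), round(cv, 8), mad)
print(round(excess_kurtosis, 1))
```

The flat, symmetric distribution explains the skewness of 0 and the kurtosis of about -1.2, the value expected for a uniform distribution.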
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
1344 | 1 | < 0.1%
1357 | 1 | < 0.1%
1356 | 1 | < 0.1%
1355 | 1 | < 0.1%
1354 | 1 | < 0.1%
1353 | 1 | < 0.1%
1352 | 1 | < 0.1%
1351 | 1 | < 0.1%
1350 | 1 | < 0.1%
Other values (2010) | 2010 | 99.5%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
2020 | 1 | < 0.1%
2019 | 1 | < 0.1%
2018 | 1 | < 0.1%
2017 | 1 | < 0.1%
2016 | 1 | < 0.1%
2015 | 1 | < 0.1%
2014 | 1 | < 0.1%
2013 | 1 | < 0.1%
2012 | 1 | < 0.1%
2011 | 1 | < 0.1%

단지명 (industrial complex name)
Categorical

IMBALANCE

Distinct: 21
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB

Value counts
<NA>: 1531
함안일반산업단지: 122
칠서일반산업단지: 109
함안파수농공단지: 37
함안칠원운서농공단지: 33
Other values (16): 188

Length

Max length: 12
Median length: 4
Mean length: 5.1118812
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 함안일반산업단지
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 1531 | 75.8%
함안일반산업단지 | 122 | 6.0%
칠서일반산업단지 | 109 | 5.4%
함안파수농공단지 | 37 | 1.8%
함안칠원운서농공단지 | 33 | 1.6%
함안산인농공단지 | 26 | 1.3%
함안법수농공단지 | 23 | 1.1%
함안용산농공단지 | 22 | 1.1%
함안군북농공단지 | 20 | 1.0%
함안대산대사일반산업단지 | 17 | 0.8%
Other values (11) | 80 | 4.0%
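
The frequency column is each count divided by the 2020 observations. The sketch below recomputes the shares and also shows how the picture changes if the literal value <NA> is treated as missing. That reading is an inference: the column reports 0 missing values while 75.8% of rows hold <NA>, and the minimum length of 4 equals the length of that placeholder, which suggests it was profiled as an ordinary string.

```python
# Counts copied from the Common Values table for 단지명.
common_values = {
    "<NA>": 1531,
    "함안일반산업단지": 122,
    "칠서일반산업단지": 109,
    "함안파수농공단지": 37,
    "함안칠원운서농공단지": 33,
    "함안산인농공단지": 26,
    "함안법수농공단지": 23,
    "함안용산농공단지": 22,
    "함안군북농공단지": 20,
    "함안대산대사일반산업단지": 17,
    "Other values (11)": 80,
}
n_rows = sum(common_values.values())

shares = {value: count / n_rows for value, count in common_values.items()}
for value, share in shares.items():
    print(f"{value}: {share:.1%}")

# Assumption: "<NA>" marks factories that are not located in any complex.
placeholder = "<NA>"
rows_in_a_complex = n_rows - common_values[placeholder]
print(rows_in_a_complex, f"{rows_in_a_complex / n_rows:.1%}")
```

Under that assumption only 489 of the 2020 factories (24.2%) are located in a named complex.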

Length

Histogram of lengths of the category
ValueCountFrequency (%)
na 1531
75.8%
함안일반산업단지 122
 
6.0%
칠서일반산업단지 109
 
5.4%
함안파수농공단지 37
 
1.8%
함안칠원운서농공단지 33
 
1.6%
함안산인농공단지 26
 
1.3%
함안법수농공단지 23
 
1.1%
함안용산농공단지 22
 
1.1%
함안군북농공단지 20
 
1.0%
함안대산대사일반산업단지 17
 
0.8%
Other values (11) 80
 
4.0%

회사명 (company name)
Text

Distinct: 1932
Distinct (%): 95.6%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB

Length

Max length: 22
Median length: 17
Mean length: 6.7821782
Min length: 2

Characters and Unicode

Total characters: 13700
Distinct characters: 471
Distinct categories: 8
Distinct scripts: 4
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1855
Unique (%): 91.8%
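
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. The sketch below illustrates the difference on a small hypothetical list (the names are placeholders, not data from the report) and then applies the same arithmetic to the figures above.

```python
from collections import Counter

# Hypothetical toy values, for illustration only.
names = ["A", "B", "B", "C", "C", "C", "D"]
counts = Counter(names)

distinct = len(counts)                               # different values
unique = sum(1 for c in counts.values() if c == 1)   # values seen exactly once
print(distinct, unique)  # 4 2

# Figures from the report for this text variable.
n_rows, n_distinct, n_unique = 2020, 1932, 1855
repeated_values = n_distinct - n_unique      # values that occur more than once
rows_with_repeats = n_rows - n_unique        # rows holding such values
print(repeated_values, rows_with_repeats)    # 77 165
```

So 77 values appear on more than one row, and together they account for 165 of the 2020 rows.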

Sample

1st row: YSM
2nd row: 대흥중자
3rd row: 코오롱데크컴퍼지트(주)
4th row: (사)경남신체장애인복지회
5th row: (사)환경사랑나눔회 경남희망세상제작단
ValueCountFrequency (%)
주식회사 80
 
3.6%
2공장 21
 
0.9%
제2공장 12
 
0.5%
함안공장 10
 
0.4%
함안지점 8
 
0.4%
농업회사법인 7
 
0.3%
삼영엠텍(주 5
 
0.2%
주)에이스엔지니어링 4
 
0.2%
주)케이씨피 4
 
0.2%
칠서공장 4
 
0.2%
Other values (1936) 2075
93.0%

Most occurring characters

ValueCountFrequency (%)
1166
 
8.5%
) 1073
 
7.8%
( 1072
 
7.8%
419
 
3.1%
393
 
2.9%
318
 
2.3%
266
 
1.9%
264
 
1.9%
250
 
1.8%
248
 
1.8%
Other values (461) 8231
60.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11009
80.4%
Close Punctuation 1073
 
7.8%
Open Punctuation 1072
 
7.8%
Space Separator 224
 
1.6%
Uppercase Letter 224
 
1.6%
Decimal Number 83
 
0.6%
Other Punctuation 14
 
0.1%
Dash Punctuation 1
 
< 0.1%
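
The category names in this table are Unicode general categories. As a rough illustration of how such counts arise, the sketch below classifies characters with Python's `unicodedata` module and applies it to the five sample rows shown earlier for this variable. The mapping from two-letter codes to the report's labels is written out by hand and covers only the categories seen here; it is not taken from the profiling tool.

```python
import unicodedata
from collections import Counter

# Hand-written mapping from Unicode category codes to the report's labels.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}

def category_counts(strings):
    """Count characters per Unicode general category across all strings."""
    counter = Counter()
    for text in strings:
        for ch in text:
            code = unicodedata.category(ch)
            counter[CATEGORY_NAMES.get(code, code)] += 1
    return counter

sample = [
    "YSM",
    "대흥중자",
    "코오롱데크컴퍼지트(주)",
    "(사)경남신체장애인복지회",
    "(사)환경사랑나눔회 경남희망세상제작단",
]
counts = category_counts(sample)
for name, count in counts.most_common():
    print(name, count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates, and the "(주)" and "(사)" markers explain the nearly equal counts of open and close punctuation.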

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1166
 
10.6%
419
 
3.8%
393
 
3.6%
318
 
2.9%
266
 
2.4%
264
 
2.4%
250
 
2.3%
248
 
2.3%
232
 
2.1%
223
 
2.0%
Other values (423) 7230
65.7%
Uppercase Letter
ValueCountFrequency (%)
E 32
14.3%
N 26
11.6%
S 25
11.2%
G 24
10.7%
T 17
 
7.6%
C 15
 
6.7%
M 13
 
5.8%
K 8
 
3.6%
H 8
 
3.6%
J 7
 
3.1%
Other values (13) 49
21.9%
Decimal Number
ValueCountFrequency (%)
2 51
61.4%
1 13
 
15.7%
3 10
 
12.0%
4 5
 
6.0%
0 3
 
3.6%
5 1
 
1.2%
Other Punctuation
ValueCountFrequency (%)
& 6
42.9%
. 5
35.7%
: 1
 
7.1%
, 1
 
7.1%
/ 1
 
7.1%
Close Punctuation
ValueCountFrequency (%)
) 1073
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1072
100.0%
Space Separator
ValueCountFrequency (%)
224
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11008
80.4%
Common 2467
 
18.0%
Latin 224
 
1.6%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1166
 
10.6%
419
 
3.8%
393
 
3.6%
318
 
2.9%
266
 
2.4%
264
 
2.4%
250
 
2.3%
248
 
2.3%
232
 
2.1%
223
 
2.0%
Other values (422) 7229
65.7%
Latin
ValueCountFrequency (%)
E 32
14.3%
N 26
11.6%
S 25
11.2%
G 24
10.7%
T 17
 
7.6%
C 15
 
6.7%
M 13
 
5.8%
K 8
 
3.6%
H 8
 
3.6%
J 7
 
3.1%
Other values (13) 49
21.9%
Common
ValueCountFrequency (%)
) 1073
43.5%
( 1072
43.5%
224
 
9.1%
2 51
 
2.1%
1 13
 
0.5%
3 10
 
0.4%
& 6
 
0.2%
4 5
 
0.2%
. 5
 
0.2%
0 3
 
0.1%
Other values (5) 5
 
0.2%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11008
80.4%
ASCII 2691
 
19.6%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1166
 
10.6%
419
 
3.8%
393
 
3.6%
318
 
2.9%
266
 
2.4%
264
 
2.4%
250
 
2.3%
248
 
2.3%
232
 
2.1%
223
 
2.0%
Other values (422) 7229
65.7%
ASCII
ValueCountFrequency (%)
) 1073
39.9%
( 1072
39.8%
224
 
8.3%
2 51
 
1.9%
E 32
 
1.2%
N 26
 
1.0%
S 25
 
0.9%
G 24
 
0.9%
T 17
 
0.6%
C 15
 
0.6%
Other values (28) 132
 
4.9%
CJK
ValueCountFrequency (%)
1
100.0%

공장대표주소(도로명) (main factory address, road-name format)
Text

MISSING

Distinct: 1833
Distinct (%): 92.0%
Missing: 28
Missing (%): 1.4%
Memory size: 15.9 KiB

Length

Max length: 54
Median length: 45
Mean length: 26.677711
Min length: 6

Characters and Unicode

Total characters: 53142
Distinct characters: 348
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1696
Unique (%): 85.1%

Sample

1st row: 경상남도 함안군 법수면 윤외리 1555번지
2nd row: 경상남도 함안군 칠서면 함의로 209-40 (대흥중자) 외 1필지
3rd row: 경상남도 함안군 군북면 함안산단1길 26-23
4th row: 경상남도 함안군 대산면 하기리 415-4번지 외6필
5th row: 경상남도 함안군 군북면 국우로 95 외 1필지
ValueCountFrequency (%)
경상남도 1991
 
16.5%
함안군 1991
 
16.5%
칠원읍 630
 
5.2%
559
 
4.6%
군북면 342
 
2.8%
칠서면 339
 
2.8%
1필지 228
 
1.9%
법수면 208
 
1.7%
칠북면 147
 
1.2%
산인면 139
 
1.1%
Other values (2003) 5518
45.6%

Most occurring characters

ValueCountFrequency (%)
10100
19.0%
2335
 
4.4%
2276
 
4.3%
2200
 
4.1%
2060
 
3.9%
2029
 
3.8%
2007
 
3.8%
2000
 
3.8%
1 1736
 
3.3%
1363
 
2.6%
Other values (338) 25036
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32406
61.0%
Space Separator 10100
 
19.0%
Decimal Number 7735
 
14.6%
Open Punctuation 938
 
1.8%
Close Punctuation 937
 
1.8%
Dash Punctuation 805
 
1.5%
Uppercase Letter 117
 
0.2%
Other Punctuation 104
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2335
 
7.2%
2276
 
7.0%
2200
 
6.8%
2060
 
6.4%
2029
 
6.3%
2007
 
6.2%
2000
 
6.2%
1363
 
4.2%
1172
 
3.6%
1033
 
3.2%
Other values (302) 13931
43.0%
Uppercase Letter
ValueCountFrequency (%)
S 15
12.8%
E 14
12.0%
G 11
9.4%
T 11
9.4%
N 10
8.5%
H 10
8.5%
M 9
7.7%
C 8
6.8%
D 6
 
5.1%
A 5
 
4.3%
Other values (8) 18
15.4%
Decimal Number
ValueCountFrequency (%)
1 1736
22.4%
2 1160
15.0%
3 937
12.1%
5 698
9.0%
4 680
 
8.8%
6 568
 
7.3%
7 527
 
6.8%
0 495
 
6.4%
9 491
 
6.3%
8 443
 
5.7%
Other Punctuation
ValueCountFrequency (%)
, 85
81.7%
. 11
 
10.6%
& 7
 
6.7%
/ 1
 
1.0%
Space Separator
ValueCountFrequency (%)
10100
100.0%
Open Punctuation
ValueCountFrequency (%)
( 938
100.0%
Close Punctuation
ValueCountFrequency (%)
) 937
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 805
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32406
61.0%
Common 20619
38.8%
Latin 117
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2335
 
7.2%
2276
 
7.0%
2200
 
6.8%
2060
 
6.4%
2029
 
6.3%
2007
 
6.2%
2000
 
6.2%
1363
 
4.2%
1172
 
3.6%
1033
 
3.2%
Other values (302) 13931
43.0%
Common
ValueCountFrequency (%)
10100
49.0%
1 1736
 
8.4%
2 1160
 
5.6%
( 938
 
4.5%
3 937
 
4.5%
) 937
 
4.5%
- 805
 
3.9%
5 698
 
3.4%
4 680
 
3.3%
6 568
 
2.8%
Other values (8) 2060
 
10.0%
Latin
ValueCountFrequency (%)
S 15
12.8%
E 14
12.0%
G 11
9.4%
T 11
9.4%
N 10
8.5%
H 10
8.5%
M 9
7.7%
C 8
6.8%
D 6
 
5.1%
A 5
 
4.3%
Other values (8) 18
15.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32406
61.0%
ASCII 20736
39.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10100
48.7%
1 1736
 
8.4%
2 1160
 
5.6%
( 938
 
4.5%
3 937
 
4.5%
) 937
 
4.5%
- 805
 
3.9%
5 698
 
3.4%
4 680
 
3.3%
6 568
 
2.7%
Other values (26) 2177
 
10.5%
Hangul
ValueCountFrequency (%)
2335
 
7.2%
2276
 
7.0%
2200
 
6.8%
2060
 
6.4%
2029
 
6.3%
2007
 
6.2%
2000
 
6.2%
1363
 
4.2%
1172
 
3.6%
1033
 
3.2%
Other values (302) 13931
43.0%

대표업종번호 (primary industry code)
Real number (ℝ)

HIGH CORRELATION

Distinct: 261
Distinct (%): 12.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 25787.403
Minimum: 10121
Maximum: 68112
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 17.9 KiB

Quantile statistics

Minimum: 10121
5th percentile: 16101
Q1: 24327.25
Median: 25924
Q3: 29223
95th percentile: 31114
Maximum: 68112
Range: 57991
Interquartile range (IQR): 4895.75

Descriptive statistics

Standard deviation: 4932.2988
Coefficient of variation (CV): 0.19126776
Kurtosis: 4.5475153
Mean: 25787.403
Median Absolute Deviation (MAD): 3213
Skewness: -0.99470763
Sum: 52090554
Variance: 24327572
Monotonicity: Not monotonic
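
Several of these figures are derived from others, so they can be checked against each other without access to the raw data. The sketch below uses only the rounded values printed in the report, which is why the comparisons allow a small tolerance.

```python
import math

# Rounded values copied from the report.
n = 2020
mean, std = 25787.403, 4932.2988
minimum, maximum = 10121, 68112
q1, q3 = 24327.25, 29223

value_range = maximum - minimum  # reported: 57991
iqr = q3 - q1                    # reported: 4895.75
cv = std / mean                  # reported: 0.19126776
variance = std ** 2              # reported: 24327572
total = mean * n                 # reported: 52090554

print(value_range, iqr, round(cv, 8), round(variance), round(total))
print(math.isclose(variance, 24327572, rel_tol=1e-6))
```

Note that the variable is an industry classification code stored as a number, so statistics such as the mean or standard deviation describe the distribution of codes rather than any physical quantity.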
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
25113 | 175 | 8.7%
29223 | 108 | 5.3%
30399 | 98 | 4.9%
25924 | 91 | 4.5%
31114 | 80 | 4.0%
25929 | 54 | 2.7%
25112 | 45 | 2.2%
25119 | 40 | 2.0%
25923 | 40 | 2.0%
29229 | 33 | 1.6%
Other values (251) | 1256 | 62.2%

Minimum 10 values
Value | Count | Frequency (%)
10121 | 3 | 0.1%
10129 | 2 | 0.1%
10211 | 2 | 0.1%
10212 | 3 | 0.1%
10219 | 4 | 0.2%
10301 | 6 | 0.3%
10302 | 1 | < 0.1%
10309 | 12 | 0.6%
10403 | 1 | < 0.1%
10501 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
68112 | 1 | < 0.1%
38321 | 5 | 0.2%
38312 | 1 | < 0.1%
38311 | 2 | 0.1%
34019 | 2 | 0.1%
34011 | 2 | 0.1%
33993 | 1 | < 0.1%
33992 | 4 | 0.2%
33910 | 1 | < 0.1%
33303 | 3 | 0.1%

업종명 (industry name)
Text

Distinct: 538
Distinct (%): 26.6%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB

Length

Max length: 33
Median length: 27
Mean length: 17.161881
Min length: 3

Characters and Unicode

Total characters: 34667
Distinct characters: 291
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 287
Unique (%): 14.2%

Sample

1st row: 기타 비철금속 제련, 정련 및 합금 제조업
2nd row: 주형 및 금형 제조업
3rd row: 항공기용 부품 제조업 외 2 종
4th row: 위생용 종이제품 제조업
5th row: 라이터, 연소물 및 흡연용품 제조업
ValueCountFrequency (%)
제조업 1686
 
14.9%
1144
 
10.1%
865
 
7.6%
721
 
6.4%
1 476
 
4.2%
기타 467
 
4.1%
금속 446
 
3.9%
279
 
2.5%
골조 178
 
1.6%
구조재 178
 
1.6%
Other values (481) 4901
43.2%

Most occurring characters

ValueCountFrequency (%)
9322
26.9%
2257
 
6.5%
2100
 
6.1%
2071
 
6.0%
1147
 
3.3%
1124
 
3.2%
879
 
2.5%
726
 
2.1%
721
 
2.1%
658
 
1.9%
Other values (281) 13662
39.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24266
70.0%
Space Separator 9322
 
26.9%
Decimal Number 911
 
2.6%
Other Punctuation 164
 
0.5%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2257
 
9.3%
2100
 
8.7%
2071
 
8.5%
1147
 
4.7%
1124
 
4.6%
879
 
3.6%
726
 
3.0%
721
 
3.0%
658
 
2.7%
602
 
2.5%
Other values (266) 11981
49.4%
Decimal Number
ValueCountFrequency (%)
1 510
56.0%
2 141
 
15.5%
3 130
 
14.3%
4 51
 
5.6%
5 30
 
3.3%
6 23
 
2.5%
7 14
 
1.5%
9 5
 
0.5%
8 4
 
0.4%
0 3
 
0.3%
Other Punctuation
ValueCountFrequency (%)
, 161
98.2%
. 3
 
1.8%
Space Separator
ValueCountFrequency (%)
9322
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24266
70.0%
Common 10401
30.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2257
 
9.3%
2100
 
8.7%
2071
 
8.5%
1147
 
4.7%
1124
 
4.6%
879
 
3.6%
726
 
3.0%
721
 
3.0%
658
 
2.7%
602
 
2.5%
Other values (266) 11981
49.4%
Common
ValueCountFrequency (%)
9322
89.6%
1 510
 
4.9%
, 161
 
1.5%
2 141
 
1.4%
3 130
 
1.2%
4 51
 
0.5%
5 30
 
0.3%
6 23
 
0.2%
7 14
 
0.1%
9 5
 
< 0.1%
Other values (5) 14
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24254
70.0%
ASCII 10401
30.0%
Compat Jamo 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9322
89.6%
1 510
 
4.9%
, 161
 
1.5%
2 141
 
1.4%
3 130
 
1.2%
4 51
 
0.5%
5 30
 
0.3%
6 23
 
0.2%
7 14
 
0.1%
9 5
 
< 0.1%
Other values (5) 14
 
0.1%
Hangul
ValueCountFrequency (%)
2257
 
9.3%
2100
 
8.7%
2071
 
8.5%
1147
 
4.7%
1124
 
4.6%
879
 
3.6%
726
 
3.0%
721
 
3.0%
658
 
2.7%
602
 
2.5%
Other values (265) 11969
49.3%
Compat Jamo
ValueCountFrequency (%)
12
100.0%

사업유형 (business type)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB

Value counts
제조업: 2019
비제조업: 1

Length

Max length: 4
Median length: 3
Mean length: 3.000495
Min length: 3

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 제조업
2nd row: 제조업
3rd row: 제조업
4th row: 제조업
5th row: 제조업

Common Values

Value | Count | Frequency (%)
제조업 | 2019 | > 99.9%
비제조업 | 1 | < 0.1%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
제조업 2019
> 99.9%
비제조업 1
 
< 0.1%

생산품 (products)
Text

Distinct: 1548
Distinct (%): 76.7%
Missing: 2
Missing (%): 0.1%
Memory size: 15.9 KiB

Length

Max length: 53
Median length: 45
Mean length: 8.7324083
Min length: 1

Characters and Unicode

Total characters: 17622
Distinct characters: 586
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1428
Unique (%): 70.8%

Sample

1st row: 마그네슘 인고트(괴)
2nd row: 산업용로봇
3rd row: 항공기 부품
4th row: 화장지
5th row: 폐합성수지류 비성형 SRF
ValueCountFrequency (%)
128
 
3.7%
자동차부품 110
 
3.2%
철구조물 95
 
2.8%
부품 83
 
2.4%
자동차 45
 
1.3%
45
 
1.3%
공작기계부품 42
 
1.2%
40
 
1.2%
기계부품 30
 
0.9%
공작기계 25
 
0.7%
Other values (1941) 2782
81.2%

Most occurring characters

ValueCountFrequency (%)
1427
 
8.1%
819
 
4.6%
, 771
 
4.4%
695
 
3.9%
630
 
3.6%
393
 
2.2%
377
 
2.1%
300
 
1.7%
300
 
1.7%
294
 
1.7%
Other values (576) 11616
65.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14404
81.7%
Space Separator 1427
 
8.1%
Other Punctuation 794
 
4.5%
Uppercase Letter 480
 
2.7%
Lowercase Letter 307
 
1.7%
Open Punctuation 87
 
0.5%
Close Punctuation 87
 
0.5%
Decimal Number 30
 
0.2%
Dash Punctuation 4
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
819
 
5.7%
695
 
4.8%
630
 
4.4%
393
 
2.7%
377
 
2.6%
300
 
2.1%
300
 
2.1%
294
 
2.0%
282
 
2.0%
275
 
1.9%
Other values (509) 10039
69.7%
Uppercase Letter
ValueCountFrequency (%)
E 48
 
10.0%
A 41
 
8.5%
L 40
 
8.3%
C 40
 
8.3%
S 37
 
7.7%
R 30
 
6.2%
P 28
 
5.8%
T 28
 
5.8%
O 25
 
5.2%
D 20
 
4.2%
Other values (13) 143
29.8%
Lowercase Letter
ValueCountFrequency (%)
e 42
13.7%
a 29
 
9.4%
r 29
 
9.4%
s 25
 
8.1%
t 20
 
6.5%
c 19
 
6.2%
n 17
 
5.5%
l 16
 
5.2%
o 16
 
5.2%
d 14
 
4.6%
Other values (12) 80
26.1%
Decimal Number
ValueCountFrequency (%)
0 7
23.3%
1 7
23.3%
2 4
13.3%
5 3
10.0%
8 3
10.0%
7 2
 
6.7%
3 1
 
3.3%
4 1
 
3.3%
9 1
 
3.3%
6 1
 
3.3%
Other Punctuation
ValueCountFrequency (%)
, 771
97.1%
/ 11
 
1.4%
. 8
 
1.0%
' 2
 
0.3%
& 1
 
0.1%
% 1
 
0.1%
Space Separator
ValueCountFrequency (%)
1427
100.0%
Open Punctuation
ValueCountFrequency (%)
( 87
100.0%
Close Punctuation
ValueCountFrequency (%)
) 87
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14404
81.7%
Common 2431
 
13.8%
Latin 787
 
4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
819
 
5.7%
695
 
4.8%
630
 
4.4%
393
 
2.7%
377
 
2.6%
300
 
2.1%
300
 
2.1%
294
 
2.0%
282
 
2.0%
275
 
1.9%
Other values (509) 10039
69.7%
Latin
ValueCountFrequency (%)
E 48
 
6.1%
e 42
 
5.3%
A 41
 
5.2%
L 40
 
5.1%
C 40
 
5.1%
S 37
 
4.7%
R 30
 
3.8%
a 29
 
3.7%
r 29
 
3.7%
P 28
 
3.6%
Other values (35) 423
53.7%
Common
ValueCountFrequency (%)
1427
58.7%
, 771
31.7%
( 87
 
3.6%
) 87
 
3.6%
/ 11
 
0.5%
. 8
 
0.3%
0 7
 
0.3%
1 7
 
0.3%
2 4
 
0.2%
- 4
 
0.2%
Other values (12) 18
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14403
81.7%
ASCII 3218
 
18.3%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1427
44.3%
, 771
24.0%
( 87
 
2.7%
) 87
 
2.7%
E 48
 
1.5%
e 42
 
1.3%
A 41
 
1.3%
L 40
 
1.2%
C 40
 
1.2%
S 37
 
1.1%
Other values (57) 598
18.6%
Hangul
ValueCountFrequency (%)
819
 
5.7%
695
 
4.8%
630
 
4.4%
393
 
2.7%
377
 
2.6%
300
 
2.1%
300
 
2.1%
294
 
2.0%
282
 
2.0%
275
 
1.9%
Other values (508) 10038
69.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

주원자재 (main raw materials)
Text

MISSING

Distinct: 855
Distinct (%): 59.5%
Missing: 584
Missing (%): 28.9%
Memory size: 15.9 KiB

Length

Max length: 46
Median length: 39
Mean length: 6.29039
Min length: 1

Characters and Unicode

Total characters: 9033
Distinct characters: 432
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 734
Unique (%): 51.1%

Sample

1st row: 복합재
2nd row: 종이
3rd row: 철강
4th row: 스텐레스, 승강기 부품
5th row: (blank)
ValueCountFrequency (%)
철판 155
 
7.2%
철강재 90
 
4.2%
알루미늄 73
 
3.4%
철강 64
 
3.0%
58
 
2.7%
40
 
1.9%
36
 
1.7%
환봉 32
 
1.5%
특수강 28
 
1.3%
형강류 28
 
1.3%
Other values (925) 1549
71.9%

Most occurring characters

ValueCountFrequency (%)
, 788
 
8.7%
747
 
8.3%
659
 
7.3%
425
 
4.7%
355
 
3.9%
281
 
3.1%
191
 
2.1%
128
 
1.4%
121
 
1.3%
118
 
1.3%
Other values (422) 5220
57.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6331
70.1%
Other Punctuation 818
 
9.1%
Space Separator 747
 
8.3%
Uppercase Letter 667
 
7.4%
Lowercase Letter 218
 
2.4%
Decimal Number 112
 
1.2%
Close Punctuation 55
 
0.6%
Open Punctuation 55
 
0.6%
Dash Punctuation 29
 
0.3%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
659
 
10.4%
425
 
6.7%
355
 
5.6%
281
 
4.4%
191
 
3.0%
128
 
2.0%
121
 
1.9%
118
 
1.9%
114
 
1.8%
113
 
1.8%
Other values (358) 3826
60.4%
Uppercase Letter
ValueCountFrequency (%)
P 95
14.2%
S 85
12.7%
E 66
9.9%
C 51
 
7.6%
A 48
 
7.2%
L 42
 
6.3%
T 41
 
6.1%
B 34
 
5.1%
R 26
 
3.9%
M 25
 
3.7%
Other values (14) 154
23.1%
Lowercase Letter
ValueCountFrequency (%)
e 42
19.3%
s 25
11.5%
p 22
10.1%
t 18
8.3%
l 17
7.8%
a 17
7.8%
i 14
 
6.4%
c 10
 
4.6%
n 9
 
4.1%
r 8
 
3.7%
Other values (10) 36
16.5%
Decimal Number
ValueCountFrequency (%)
0 28
25.0%
4 17
15.2%
1 16
14.3%
5 12
10.7%
2 10
 
8.9%
3 10
 
8.9%
6 7
 
6.2%
8 7
 
6.2%
9 3
 
2.7%
7 2
 
1.8%
Other Punctuation
ValueCountFrequency (%)
, 788
96.3%
. 18
 
2.2%
/ 9
 
1.1%
: 2
 
0.2%
% 1
 
0.1%
Space Separator
ValueCountFrequency (%)
747
100.0%
Close Punctuation
ValueCountFrequency (%)
) 55
100.0%
Open Punctuation
ValueCountFrequency (%)
( 55
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 29
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6331
70.1%
Common 1817
 
20.1%
Latin 885
 
9.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
659
 
10.4%
425
 
6.7%
355
 
5.6%
281
 
4.4%
191
 
3.0%
128
 
2.0%
121
 
1.9%
118
 
1.9%
114
 
1.8%
113
 
1.8%
Other values (358) 3826
60.4%
Latin
ValueCountFrequency (%)
P 95
 
10.7%
S 85
 
9.6%
E 66
 
7.5%
C 51
 
5.8%
A 48
 
5.4%
e 42
 
4.7%
L 42
 
4.7%
T 41
 
4.6%
B 34
 
3.8%
R 26
 
2.9%
Other values (34) 355
40.1%
Common
ValueCountFrequency (%)
, 788
43.4%
747
41.1%
) 55
 
3.0%
( 55
 
3.0%
- 29
 
1.6%
0 28
 
1.5%
. 18
 
1.0%
4 17
 
0.9%
1 16
 
0.9%
5 12
 
0.7%
Other values (10) 52
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6331
70.1%
ASCII 2702
29.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 788
29.2%
747
27.6%
P 95
 
3.5%
S 85
 
3.1%
E 66
 
2.4%
) 55
 
2.0%
( 55
 
2.0%
C 51
 
1.9%
A 48
 
1.8%
e 42
 
1.6%
Other values (54) 670
24.8%
Hangul
ValueCountFrequency (%)
659
 
10.4%
425
 
6.7%
355
 
5.6%
281
 
4.4%
191
 
3.0%
128
 
2.0%
121
 
1.9%
118
 
1.9%
114
 
1.8%
113
 
1.8%
Other values (358) 3826
60.4%

Interactions

[Four interaction plots; the images were not preserved in this text extraction.]

Correlations

Variable | 순번 | 단지명 | 대표업종번호 | 사업유형
순번 | 1.000 | 0.197 | 0.039 | 0.002
단지명 | 0.197 | 1.000 | 0.430 | 0.000
대표업종번호 | 0.039 | 0.430 | 1.000 | 1.000
사업유형 | 0.002 | 0.000 | 1.000 | 1.000

Variable | 사업유형 | 단지명
사업유형 | 1.000 | 0.000
단지명 | 0.000 | 1.000

Variable | 순번 | 대표업종번호 | 단지명 | 사업유형
순번 | 1.000 | -0.013 | 0.062 | 0.000
대표업종번호 | -0.013 | 1.000 | 0.212 | 0.999
단지명 | 0.062 | 0.212 | 1.000 | 0.000
사업유형 | 0.000 | 0.999 | 0.000 | 1.000
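
The report does not state which association measure fills these matrices. A common choice when a categorical variable is involved is Cramér's V, computed on a contingency table (with a numeric variable first cut into bins). Assuming that measure, the sketch below shows how a single atypical row can produce a coefficient of 1.000, as seen between 대표업종번호 and 사업유형. The bin labels are hypothetical, and the assumption that the one 비제조업 row falls into a bin of its own is not confirmed by the report.

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    x_totals = Counter(xs)
    y_totals = Counter(ys)
    chi2 = 0.0
    for x, nx in x_totals.items():
        for y, ny in y_totals.items():
            expected = nx * ny / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(x_totals), len(y_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0

# 2019 manufacturing rows plus one non-manufacturing row (counts from the report).
business_type = ["제조업"] * 2019 + ["비제조업"]
# Hypothetical binning of the industry code: the single odd row sits alone in its bin.
code_bin = ["bin_a"] * 2019 + ["bin_b"]

v_single_outlier = cramers_v(code_bin, business_type)
v_independent = cramers_v(["a", "a", "b", "b"], ["p", "q", "p", "q"])
print(round(v_single_outlier, 3), round(v_independent, 3))
```

If this reading is right, the "High correlation" alert rests on one observation out of 2020 and says little about the relationship between industry code and business type in general.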

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

Index | 순번 | 단지명 | 회사명 | 공장대표주소(도로명) | 대표업종번호 | 업종명 | 사업유형 | 생산품 | 주원자재
0 | 1 | <NA> | YSM | 경상남도 함안군 법수면 윤외리 1555번지 | 24219 | 기타 비철금속 제련, 정련 및 합금 제조업 | 제조업 | 마그네슘 인고트(괴) | <NA>
1 | 2 | <NA> | 대흥중자 | 경상남도 함안군 칠서면 함의로 209-40 (대흥중자) 외 1필지 | 29294 | 주형 및 금형 제조업 | 제조업 | 산업용로봇 | <NA>
2 | 3 | 함안일반산업단지 | 코오롱데크컴퍼지트(주) | 경상남도 함안군 군북면 함안산단1길 26-23 | 31322 | 항공기용 부품 제조업 외 2 종 | 제조업 | 항공기 부품 | 복합재
3 | 4 | <NA> | (사)경남신체장애인복지회 | 경상남도 함안군 대산면 하기리 415-4번지 외6필 | 17902 | 위생용 종이제품 제조업 | 제조업 | 화장지 | 종이
4 | 5 | <NA> | (사)환경사랑나눔회 경남희망세상제작단 | 경상남도 함안군 군북면 국우로 95 외 1필지 | 33992 | 라이터, 연소물 및 흡연용품 제조업 | 제조업 | 폐합성수지류 비성형 SRF | <NA>
5 | 6 | <NA> | (유)국제케미칼 | 경상남도 함안군 여항면 내곡1길 93 (국제케미칼) 외 3필지 | 25913 | 자동차용 금속 압형제품 제조업 외 1 종 | 제조업 | 우레탄 금형제품 | <NA>
6 | 7 | <NA> | (유)부강기계 | 경상남도 함안군 칠원읍 무기로 89 (식당) | 30310 | 자동차 엔진용 신품 부품 제조업 외 7 종 | 제조업 | 기어,전동축 | 철강
7 | 8 | 칠서일반산업단지 | (유)실버엘레베이터코리아 | 경상남도 함안군 칠서면 공단동4길 75 (칠서면) | 29162 | 승강기 제조업 | 제조업 | 승강기, 승강기 부품 | 스텐레스, 승강기 부품
8 | 9 | <NA> | (유)씨지푸드 | 경상남도 함안군 군북면 현포로 29 | 10121 | 가금류 가공 및 저장 처리업 | 제조업 | 닭가공품 | (blank)
9 | 10 | <NA> | (유)액슬코리아 2공장 | 경상남도 함안군 칠북면 화천1길 128 외 1필지 | 25912 | 금속 단조제품 제조업 | 제조업 | AXLE SHAFT | 환봉

Last rows

Index | 순번 | 단지명 | 회사명 | 공장대표주소(도로명) | 대표업종번호 | 업종명 | 사업유형 | 생산품 | 주원자재
2010 | 2011 | <NA> | 효일테크 | 경상남도 함안군 칠서면 함의로 349-3 외 2필지 | 29223 | 금속 절삭기계 제조업 외 1 종 | 제조업 | 산업기계, 선반부품 | 금속, 철강류
2011 | 2012 | <NA> | 효찬테크 | 경상남도 함안군 칠원읍 용정리 854-21번지 | 29229 | 기타 가공 공작기계 제조업 | 제조업 | 가공공작기계 | 반제품
2012 | 2013 | 함안일반산업단지 | 후소산기(주) | 경상남도 함안군 군북면 함안산단2길 25-29 | 25119 | 기타 구조용 금속제품 제조업 외 1 종 | 제조업 | 철박스 , 금속가공 제품 | 철판 , 형광류
2013 | 2014 | <NA> | 훈스틸 | 경상남도 함안군 법수면 장백로 566 (법수면) 외 2필지 | 25112 | 구조용 금속 판제품 및 공작물 제조업 외 3 종 | 제조업 | 철골, 철구조물,철의장품 | 선박의장품
2014 | 2015 | 함안일반산업단지 | 훌루테크(주) | 경상남도 함안군 군북면 함안산단7길 13 | 24311 | 선철주물 주조업 외 2 종 | 제조업 | 주물품 및 유압기기, 절삭가공 | 선철 및 고철, 주철소재(FC,FCD,FCV)
2015 | 2016 | <NA> | 훌루테크머시닝 | 경상남도 함안군 칠원읍 쇠만이길 5-50 외 2필지 | 25924 | 절삭가공 및 유사처리업 | 제조업 | 조타장치(스티어링기어) | 철강재
2016 | 2017 | <NA> | 휴먼중공업(주) | 경상남도 함안군 칠서면 계내리 12번지 외 1필지 | 25200 | 무기 및 총포탄 제조업 외 16 종 | 제조업 | 방위산업용 부품 | 알미늄판
2017 | 2018 | <NA> | 흥일공업사 | 경상남도 함안군 칠원읍 광려천북로 242-10 (태광산업) | 29223 | 금속 절삭기계 제조업 | 제조업 | 소형선반 | 철판,FRP
2018 | 2019 | <NA> | 희영정공 | 경상남도 함안군 법수면 장백로 585 (에이치디시에스(주)) | 25921 | 금속 열처리업 | 제조업 | 주강품 | <NA>
2019 | 2020 | <NA> | 히트산업 | 경상남도 함안군 법수면 법수로 407, 외2필지 | 13999 | 그 외 기타 분류 안된 섬유제품 제조업 | 제조업 | 위생타올 | 부직포