Overview

Dataset statistics

Number of variables8
Number of observations1140
Missing cells222
Missing cells (%)2.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory72.5 KiB
Average record size in memory65.1 B

Variable types

Numeric1
Text6
DateTime1

Dataset

Description경상북도 포항시 제조업공장현황에 관한 데이터로 제조업공장 회사명,대표자,전화번호,팩스번호,생산품,업종명,공장주소 등의 정보를 제공합니다.
Author경상북도 포항시
URLhttps://www.data.go.kr/data/15034912/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 87 (7.6%) missing valuesMissing
팩스번호 has 135 (11.8%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 03:49:21.647280
Analysis finished2023-12-12 03:49:23.372983
Duration1.73 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct1140
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean570.5
Minimum1
Maximum1140
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.1 KiB
2023-12-12T12:49:23.492840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile57.95
Q1285.75
median570.5
Q3855.25
95-th percentile1083.05
Maximum1140
Range1139
Interquartile range (IQR)569.5

Descriptive statistics

Standard deviation329.23396
Coefficient of variation (CV)0.57709721
Kurtosis-1.2
Mean570.5
Median Absolute Deviation (MAD)285
Skewness0
Sum650370
Variance108395
MonotonicityStrictly increasing
2023-12-12T12:49:23.715828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
760 1
 
0.1%
766 1
 
0.1%
765 1
 
0.1%
764 1
 
0.1%
763 1
 
0.1%
762 1
 
0.1%
761 1
 
0.1%
759 1
 
0.1%
768 1
 
0.1%
Other values (1130) 1130
99.1%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1140 1
0.1%
1139 1
0.1%
1138 1
0.1%
1137 1
0.1%
1136 1
0.1%
1135 1
0.1%
1134 1
0.1%
1133 1
0.1%
1132 1
0.1%
1131 1
0.1%
Distinct1102
Distinct (%)96.7%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2023-12-12T12:49:24.083908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length20
Mean length7.5912281
Min length2

Characters and Unicode

Total characters8654
Distinct characters410
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1069 ?
Unique (%)93.8%

Sample

1st row(자)경화계전
2nd row(주)A.T.C
3rd row(주)OK산업
4th row(주)가산식품
5th row(주)거명
ValueCountFrequency (%)
주식회사 25
 
2.0%
제2공장 14
 
1.1%
제1공장 10
 
0.8%
2공장 8
 
0.6%
포항공장 6
 
0.5%
농업회사법인 5
 
0.4%
제3공장 5
 
0.4%
주)에스피네이처 4
 
0.3%
주)융진 4
 
0.3%
대혁산업(주 4
 
0.3%
Other values (1101) 1192
93.3%
2023-12-12T12:49:24.757130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
812
 
9.4%
( 765
 
8.8%
) 765
 
8.8%
216
 
2.5%
199
 
2.3%
193
 
2.2%
174
 
2.0%
164
 
1.9%
156
 
1.8%
135
 
1.6%
Other values (400) 5075
58.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6728
77.7%
Open Punctuation 765
 
8.8%
Close Punctuation 765
 
8.8%
Space Separator 174
 
2.0%
Uppercase Letter 118
 
1.4%
Decimal Number 72
 
0.8%
Other Punctuation 23
 
0.3%
Lowercase Letter 8
 
0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
812
 
12.1%
216
 
3.2%
199
 
3.0%
193
 
2.9%
164
 
2.4%
156
 
2.3%
135
 
2.0%
125
 
1.9%
118
 
1.8%
114
 
1.7%
Other values (360) 4496
66.8%
Uppercase Letter
ValueCountFrequency (%)
E 18
15.3%
C 14
11.9%
N 11
9.3%
T 9
 
7.6%
G 9
 
7.6%
P 7
 
5.9%
H 7
 
5.9%
O 6
 
5.1%
F 6
 
5.1%
I 5
 
4.2%
Other values (10) 26
22.0%
Lowercase Letter
ValueCountFrequency (%)
a 2
25.0%
i 1
12.5%
g 1
12.5%
s 1
12.5%
r 1
12.5%
o 1
12.5%
e 1
12.5%
Decimal Number
ValueCountFrequency (%)
2 42
58.3%
1 18
25.0%
3 9
 
12.5%
5 2
 
2.8%
4 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 10
43.5%
, 7
30.4%
& 5
21.7%
/ 1
 
4.3%
Open Punctuation
ValueCountFrequency (%)
( 765
100.0%
Close Punctuation
ValueCountFrequency (%)
) 765
100.0%
Space Separator
ValueCountFrequency (%)
174
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6728
77.7%
Common 1800
 
20.8%
Latin 126
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
812
 
12.1%
216
 
3.2%
199
 
3.0%
193
 
2.9%
164
 
2.4%
156
 
2.3%
135
 
2.0%
125
 
1.9%
118
 
1.8%
114
 
1.7%
Other values (360) 4496
66.8%
Latin
ValueCountFrequency (%)
E 18
14.3%
C 14
11.1%
N 11
 
8.7%
T 9
 
7.1%
G 9
 
7.1%
P 7
 
5.6%
H 7
 
5.6%
O 6
 
4.8%
F 6
 
4.8%
I 5
 
4.0%
Other values (17) 34
27.0%
Common
ValueCountFrequency (%)
( 765
42.5%
) 765
42.5%
174
 
9.7%
2 42
 
2.3%
1 18
 
1.0%
. 10
 
0.6%
3 9
 
0.5%
, 7
 
0.4%
& 5
 
0.3%
5 2
 
0.1%
Other values (3) 3
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6728
77.7%
ASCII 1926
 
22.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
812
 
12.1%
216
 
3.2%
199
 
3.0%
193
 
2.9%
164
 
2.4%
156
 
2.3%
135
 
2.0%
125
 
1.9%
118
 
1.8%
114
 
1.7%
Other values (360) 4496
66.8%
ASCII
ValueCountFrequency (%)
( 765
39.7%
) 765
39.7%
174
 
9.0%
2 42
 
2.2%
1 18
 
0.9%
E 18
 
0.9%
C 14
 
0.7%
N 11
 
0.6%
. 10
 
0.5%
T 9
 
0.5%
Other values (30) 100
 
5.2%
Distinct1075
Distinct (%)94.3%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2023-12-12T12:49:25.212800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length50
Mean length30.096491
Min length19

Characters and Unicode

Total characters34310
Distinct characters285
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1015 ?
Unique (%)89.0%

Sample

1st row경상북도 포항시 남구 연일읍 철강로107번길 70-1
2nd row경상북도 포항시 남구 연일읍 동문로94번길 42
3rd row경상북도 포항시 남구 연일읍 동문로76번길 66
4th row경상북도 포항시 북구 기계면 지가리 1089-52번지 외 3필지
5th row경상북도 포항시 북구 청하면 미남길 62, , 미남길 70
ValueCountFrequency (%)
경상북도 1140
 
14.7%
포항시 1140
 
14.7%
남구 793
 
10.2%
북구 347
 
4.5%
연일읍 231
 
3.0%
필지 191
 
2.5%
191
 
2.5%
대송면 180
 
2.3%
흥해읍 116
 
1.5%
청하면 110
 
1.4%
Other values (1157) 3329
42.9%
2023-12-12T12:49:25.826608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6639
 
19.4%
1496
 
4.4%
1212
 
3.5%
1193
 
3.5%
1173
 
3.4%
1168
 
3.4%
1167
 
3.4%
1143
 
3.3%
1142
 
3.3%
1 990
 
2.9%
Other values (275) 16987
49.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20676
60.3%
Space Separator 6639
 
19.4%
Decimal Number 5052
 
14.7%
Close Punctuation 700
 
2.0%
Open Punctuation 700
 
2.0%
Dash Punctuation 305
 
0.9%
Other Punctuation 135
 
0.4%
Uppercase Letter 95
 
0.3%
Lowercase Letter 5
 
< 0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1496
 
7.2%
1212
 
5.9%
1193
 
5.8%
1173
 
5.7%
1168
 
5.6%
1167
 
5.6%
1143
 
5.5%
1142
 
5.5%
881
 
4.3%
868
 
4.2%
Other values (232) 9233
44.7%
Uppercase Letter
ValueCountFrequency (%)
C 13
13.7%
E 10
10.5%
T 8
 
8.4%
P 7
 
7.4%
A 7
 
7.4%
S 6
 
6.3%
O 6
 
6.3%
I 5
 
5.3%
N 5
 
5.3%
F 5
 
5.3%
Other values (9) 23
24.2%
Decimal Number
ValueCountFrequency (%)
1 990
19.6%
2 740
14.6%
3 549
10.9%
4 532
10.5%
5 428
8.5%
6 390
 
7.7%
0 382
 
7.6%
7 375
 
7.4%
8 365
 
7.2%
9 301
 
6.0%
Lowercase Letter
ValueCountFrequency (%)
s 2
40.0%
b 1
20.0%
t 1
20.0%
u 1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 124
91.9%
. 7
 
5.2%
& 4
 
3.0%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Space Separator
ValueCountFrequency (%)
6639
100.0%
Close Punctuation
ValueCountFrequency (%)
) 700
100.0%
Open Punctuation
ValueCountFrequency (%)
( 700
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 305
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20677
60.3%
Common 13533
39.4%
Latin 100
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1496
 
7.2%
1212
 
5.9%
1193
 
5.8%
1173
 
5.7%
1168
 
5.6%
1167
 
5.6%
1143
 
5.5%
1142
 
5.5%
881
 
4.3%
868
 
4.2%
Other values (233) 9234
44.7%
Latin
ValueCountFrequency (%)
C 13
13.0%
E 10
 
10.0%
T 8
 
8.0%
P 7
 
7.0%
A 7
 
7.0%
S 6
 
6.0%
O 6
 
6.0%
I 5
 
5.0%
N 5
 
5.0%
F 5
 
5.0%
Other values (13) 28
28.0%
Common
ValueCountFrequency (%)
6639
49.1%
1 990
 
7.3%
2 740
 
5.5%
) 700
 
5.2%
( 700
 
5.2%
3 549
 
4.1%
4 532
 
3.9%
5 428
 
3.2%
6 390
 
2.9%
0 382
 
2.8%
Other values (9) 1483
 
11.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20676
60.3%
ASCII 13633
39.7%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6639
48.7%
1 990
 
7.3%
2 740
 
5.4%
) 700
 
5.1%
( 700
 
5.1%
3 549
 
4.0%
4 532
 
3.9%
5 428
 
3.1%
6 390
 
2.9%
0 382
 
2.8%
Other values (32) 1583
 
11.6%
Hangul
ValueCountFrequency (%)
1496
 
7.2%
1212
 
5.9%
1193
 
5.8%
1173
 
5.7%
1168
 
5.6%
1167
 
5.6%
1143
 
5.5%
1142
 
5.5%
881
 
4.3%
868
 
4.2%
Other values (232) 9233
44.7%
None
ValueCountFrequency (%)
1
100.0%

전화번호
Text

MISSING 

Distinct957
Distinct (%)90.9%
Missing87
Missing (%)7.6%
Memory size9.0 KiB
2023-12-12T12:49:26.102645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.00095
Min length9

Characters and Unicode

Total characters12637
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique878 ?
Unique (%)83.4%

Sample

1st row054-286-6492
2nd row054-286-0764
3rd row054-285-2671
4th row054-243-1470
5th row054-256-5658
ValueCountFrequency (%)
054-231-6280 5
 
0.5%
054-278-2789 4
 
0.4%
054-278-6100 4
 
0.4%
054-271-1252 4
 
0.4%
054-271-4114 3
 
0.3%
054-284-8040 3
 
0.3%
054-223-3200 3
 
0.3%
054-275-7575 3
 
0.3%
054-285-1600 3
 
0.3%
054-286-3400 3
 
0.3%
Other values (947) 1018
96.7%
2023-12-12T12:49:26.623778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2103
16.6%
0 1828
14.5%
5 1687
13.3%
2 1614
12.8%
4 1530
12.1%
8 871
6.9%
7 737
 
5.8%
1 735
 
5.8%
6 599
 
4.7%
3 547
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10534
83.4%
Dash Punctuation 2103
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1828
17.4%
5 1687
16.0%
2 1614
15.3%
4 1530
14.5%
8 871
8.3%
7 737
7.0%
1 735
7.0%
6 599
 
5.7%
3 547
 
5.2%
9 386
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 2103
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12637
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2103
16.6%
0 1828
14.5%
5 1687
13.3%
2 1614
12.8%
4 1530
12.1%
8 871
6.9%
7 737
 
5.8%
1 735
 
5.8%
6 599
 
4.7%
3 547
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12637
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2103
16.6%
0 1828
14.5%
5 1687
13.3%
2 1614
12.8%
4 1530
12.1%
8 871
6.9%
7 737
 
5.8%
1 735
 
5.8%
6 599
 
4.7%
3 547
 
4.3%

팩스번호
Text

MISSING 

Distinct900
Distinct (%)89.6%
Missing135
Missing (%)11.8%
Memory size9.0 KiB
2023-12-12T12:49:26.968907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.010945
Min length11

Characters and Unicode

Total characters12071
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique807 ?
Unique (%)80.3%

Sample

1st row054-286-6494
2nd row054-286-0762
3rd row054-285-2673
4th row054-255-5658
5th row054-261-4835
ValueCountFrequency (%)
054-278-7893 5
 
0.5%
054-271-1241 4
 
0.4%
054-285-1700 3
 
0.3%
054-286-6270 3
 
0.3%
054-232-6074 3
 
0.3%
054-231-6279 3
 
0.3%
054-285-2525 3
 
0.3%
054-246-6888 3
 
0.3%
054-285-6401 3
 
0.3%
054-242-2346 2
 
0.2%
Other values (890) 973
96.8%
2023-12-12T12:49:27.605218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2010
16.7%
0 1602
13.3%
5 1588
13.2%
2 1560
12.9%
4 1515
12.6%
8 867
7.2%
7 700
 
5.8%
6 636
 
5.3%
1 581
 
4.8%
3 531
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10061
83.3%
Dash Punctuation 2010
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1602
15.9%
5 1588
15.8%
2 1560
15.5%
4 1515
15.1%
8 867
8.6%
7 700
7.0%
6 636
 
6.3%
1 581
 
5.8%
3 531
 
5.3%
9 481
 
4.8%
Dash Punctuation
ValueCountFrequency (%)
- 2010
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12071
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2010
16.7%
0 1602
13.3%
5 1588
13.2%
2 1560
12.9%
4 1515
12.6%
8 867
7.2%
7 700
 
5.8%
6 636
 
5.3%
1 581
 
4.8%
3 531
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12071
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2010
16.7%
0 1602
13.3%
5 1588
13.2%
2 1560
12.9%
4 1515
12.6%
8 867
7.2%
7 700
 
5.8%
6 636
 
5.3%
1 581
 
4.8%
3 531
 
4.4%
Distinct918
Distinct (%)80.5%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2023-12-12T12:49:28.078563image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length34
Mean length8.3140351
Min length1

Characters and Unicode

Total characters9478
Distinct characters532
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique851 ?
Unique (%)74.6%

Sample

1st row전기판넬
2nd row실험설비,공장자동화설비
3rd row철구조물
4th row알콩메주, 전통메주
5th row부정형내화물(캐스타블)
ValueCountFrequency (%)
철구조물 66
 
3.5%
56
 
3.0%
산업기계 38
 
2.0%
29
 
1.5%
28
 
1.5%
배전반 27
 
1.4%
전기자동제어반 17
 
0.9%
레미콘 16
 
0.9%
기계부품 14
 
0.7%
산업용 12
 
0.6%
Other values (1168) 1579
83.9%
2023-12-12T12:49:28.738188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
773
 
8.2%
, 474
 
5.0%
388
 
4.1%
208
 
2.2%
188
 
2.0%
182
 
1.9%
152
 
1.6%
147
 
1.6%
147
 
1.6%
144
 
1.5%
Other values (522) 6675
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7361
77.7%
Space Separator 773
 
8.2%
Uppercase Letter 494
 
5.2%
Other Punctuation 484
 
5.1%
Lowercase Letter 239
 
2.5%
Open Punctuation 54
 
0.6%
Close Punctuation 54
 
0.6%
Dash Punctuation 10
 
0.1%
Decimal Number 8
 
0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
388
 
5.3%
208
 
2.8%
188
 
2.6%
182
 
2.5%
152
 
2.1%
147
 
2.0%
147
 
2.0%
144
 
2.0%
128
 
1.7%
124
 
1.7%
Other values (462) 5553
75.4%
Lowercase Letter
ValueCountFrequency (%)
e 35
14.6%
l 27
11.3%
r 25
10.5%
o 21
8.8%
i 18
 
7.5%
a 17
 
7.1%
n 16
 
6.7%
t 15
 
6.3%
c 10
 
4.2%
b 9
 
3.8%
Other values (15) 46
19.2%
Uppercase Letter
ValueCountFrequency (%)
E 53
 
10.7%
C 52
 
10.5%
T 46
 
9.3%
L 35
 
7.1%
A 32
 
6.5%
P 30
 
6.1%
R 29
 
5.9%
S 28
 
5.7%
I 26
 
5.3%
H 21
 
4.3%
Other values (12) 142
28.7%
Other Punctuation
ValueCountFrequency (%)
, 474
97.9%
/ 6
 
1.2%
. 3
 
0.6%
* 1
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 3
37.5%
2 3
37.5%
9 1
 
12.5%
6 1
 
12.5%
Space Separator
ValueCountFrequency (%)
773
100.0%
Open Punctuation
ValueCountFrequency (%)
( 54
100.0%
Close Punctuation
ValueCountFrequency (%)
) 54
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7361
77.7%
Common 1384
 
14.6%
Latin 733
 
7.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
388
 
5.3%
208
 
2.8%
188
 
2.6%
182
 
2.5%
152
 
2.1%
147
 
2.0%
147
 
2.0%
144
 
2.0%
128
 
1.7%
124
 
1.7%
Other values (462) 5553
75.4%
Latin
ValueCountFrequency (%)
E 53
 
7.2%
C 52
 
7.1%
T 46
 
6.3%
L 35
 
4.8%
e 35
 
4.8%
A 32
 
4.4%
P 30
 
4.1%
R 29
 
4.0%
S 28
 
3.8%
l 27
 
3.7%
Other values (37) 366
49.9%
Common
ValueCountFrequency (%)
773
55.9%
, 474
34.2%
( 54
 
3.9%
) 54
 
3.9%
- 10
 
0.7%
/ 6
 
0.4%
1 3
 
0.2%
2 3
 
0.2%
. 3
 
0.2%
_ 1
 
0.1%
Other values (3) 3
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7361
77.7%
ASCII 2117
 
22.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
773
36.5%
, 474
22.4%
( 54
 
2.6%
) 54
 
2.6%
E 53
 
2.5%
C 52
 
2.5%
T 46
 
2.2%
L 35
 
1.7%
e 35
 
1.7%
A 32
 
1.5%
Other values (50) 509
24.0%
Hangul
ValueCountFrequency (%)
388
 
5.3%
208
 
2.8%
188
 
2.6%
182
 
2.5%
152
 
2.1%
147
 
2.0%
147
 
2.0%
144
 
2.0%
128
 
1.7%
124
 
1.7%
Other values (462) 5553
75.4%
Distinct368
Distinct (%)32.3%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
2023-12-12T12:49:29.189028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length27
Mean length16.872807
Min length3

Characters and Unicode

Total characters19235
Distinct characters279
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique201 ?
Unique (%)17.6%

Sample

1st row배전반 및 전기 자동제어반 제조업
2nd row그 외 기타 특수목적용 기계 제조업 외6 종
3rd row육상 금속 골조 구조재 제조업 외1 종
4th row장류 제조업
5th row비금속광물 분쇄물 생산업
ValueCountFrequency (%)
제조업 957
 
16.0%
489
 
8.2%
439
 
7.3%
외1 319
 
5.3%
기타 274
 
4.6%
금속 195
 
3.3%
158
 
2.6%
158
 
2.6%
육상 126
 
2.1%
구조재 126
 
2.1%
Other values (410) 2750
45.9%
2023-12-12T12:49:29.820639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4852
25.2%
1320
 
6.9%
1256
 
6.5%
1189
 
6.2%
653
 
3.4%
646
 
3.4%
495
 
2.6%
439
 
2.3%
1 368
 
1.9%
328
 
1.7%
Other values (269) 7689
40.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13741
71.4%
Space Separator 4852
 
25.2%
Decimal Number 539
 
2.8%
Other Punctuation 93
 
0.5%
Close Punctuation 5
 
< 0.1%
Open Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1320
 
9.6%
1256
 
9.1%
1189
 
8.7%
653
 
4.8%
646
 
4.7%
495
 
3.6%
439
 
3.2%
328
 
2.4%
295
 
2.1%
280
 
2.0%
Other values (255) 6840
49.8%
Decimal Number
ValueCountFrequency (%)
1 368
68.3%
2 89
 
16.5%
3 43
 
8.0%
4 14
 
2.6%
5 10
 
1.9%
6 9
 
1.7%
7 4
 
0.7%
8 1
 
0.2%
9 1
 
0.2%
Other Punctuation
ValueCountFrequency (%)
, 89
95.7%
. 4
 
4.3%
Space Separator
ValueCountFrequency (%)
4852
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13741
71.4%
Common 5494
 
28.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1320
 
9.6%
1256
 
9.1%
1189
 
8.7%
653
 
4.8%
646
 
4.7%
495
 
3.6%
439
 
3.2%
328
 
2.4%
295
 
2.1%
280
 
2.0%
Other values (255) 6840
49.8%
Common
ValueCountFrequency (%)
4852
88.3%
1 368
 
6.7%
2 89
 
1.6%
, 89
 
1.6%
3 43
 
0.8%
4 14
 
0.3%
5 10
 
0.2%
6 9
 
0.2%
) 5
 
0.1%
( 5
 
0.1%
Other values (4) 10
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13739
71.4%
ASCII 5494
 
28.6%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4852
88.3%
1 368
 
6.7%
2 89
 
1.6%
, 89
 
1.6%
3 43
 
0.8%
4 14
 
0.3%
5 10
 
0.2%
6 9
 
0.2%
) 5
 
0.1%
( 5
 
0.1%
Other values (4) 10
 
0.2%
Hangul
ValueCountFrequency (%)
1320
 
9.6%
1256
 
9.1%
1189
 
8.7%
653
 
4.8%
646
 
4.7%
495
 
3.6%
439
 
3.2%
328
 
2.4%
295
 
2.1%
280
 
2.0%
Other values (254) 6838
49.8%
Compat Jamo
ValueCountFrequency (%)
2
100.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size9.0 KiB
Minimum2020-11-19 00:00:00
Maximum2020-11-19 00:00:00
2023-12-12T12:49:29.991616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:49:30.438464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T12:49:22.727518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T12:49:22.921274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:49:23.135746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T12:49:23.300089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번회사명공장소재지주소전화번호팩스번호생산품업종명데이터기준일자
01(자)경화계전경상북도 포항시 남구 연일읍 철강로107번길 70-1054-286-6492054-286-6494전기판넬배전반 및 전기 자동제어반 제조업2020-11-19
12(주)A.T.C경상북도 포항시 남구 연일읍 동문로94번길 42054-286-0764054-286-0762실험설비,공장자동화설비그 외 기타 특수목적용 기계 제조업 외6 종2020-11-19
23(주)OK산업경상북도 포항시 남구 연일읍 동문로76번길 66054-285-2671054-285-2673철구조물육상 금속 골조 구조재 제조업 외1 종2020-11-19
34(주)가산식품경상북도 포항시 북구 기계면 지가리 1089-52번지 외 3필지054-243-1470<NA>알콩메주, 전통메주장류 제조업2020-11-19
45(주)거명경상북도 포항시 북구 청하면 미남길 62, , 미남길 70054-256-5658054-255-5658부정형내화물(캐스타블)비금속광물 분쇄물 생산업2020-11-19
56(주)거양1공장경상북도 포항시 북구 흥해읍 도음로 888054-261-3801054-261-4835여과기연 및 아연 제련, 정련 및 합금 제조업 외1 종2020-11-19
67(주)거양3공장경상북도 포항시 북구 청하면 동해대로2315번길 56054-261-3801054-232-8919파형강관강관 제조업 외1 종2020-11-19
78(주)거양이엔지경상북도 포항시 남구 연일읍 형산강남로418번길 52-10054-255-1104054-255-1106철구조물육상 금속 골조 구조재 제조업 외1 종2020-11-19
89(주)경도공업경상북도 포항시 북구 기계면 성계길 226054-246-4251054-246-4254단순가공품금속 절삭기계 제조업 외1 종2020-11-19
910(주)경북외식산업경상북도 포항시 북구 문화로 39 (동빈1가)054-248-1761054-248-7998도시락도시락류 제조업 외1 종2020-11-19
순번회사명공장소재지주소전화번호팩스번호생산품업종명데이터기준일자
11301131홍덕산업(주) 제3공장경상북도 포항시 남구 장흥로39번길 51 (장흥동)054-278-3686054-278-3689스텐레스,와이어로프철강선 제조업2020-11-19
11311132홍덕산업(주)포항SC공장경상북도 포항시 남구 대송면 철강산단로66번길 80 (대송면)054-278-1411054-278-4720스틸코드,비드와이어철강선 제조업2020-11-19
11321133홍익통상경상북도 포항시 남구 대송면 철강로 228054-272-0175054-282-1597실린더, 가스켓유압기기 제조업2020-11-19
11331134홍천산업(주)경상북도 포항시 남구 대송로101번길 27 (장흥동)054-285-5548054-285-2127고철가공금속류 해체 및 선별업 외1 종2020-11-19
11341135화신이엘티(주)경상북도 포항시 남구 대송면 송덕로212번길 226054-285-3400054-285-4140강재절단가공, 중장비부품, 압력용기그 외 기타 1차 철강 제조업 외6 종2020-11-19
11351136효명경상북도 포항시 남구 상대로 151-2, , 2층 (대도동)054-273-0900054-249-4301LED 조명기구일반용 전기 조명장치 제조업2020-11-19
11361137효성중전기경상북도 포항시 남구 연일읍 동문로94번길 51054-285-0613054-285-1695전동기, 발전기전동기 및 발전기 제조업2020-11-19
11371138흥국제선(주) 포항공장경상북도 포항시 남구 장흥로39번길 70 (장흥동)054-285-1350054-285-3350용접재료그 외 기타 분류 안된 금속 가공 제품 제조업 외1 종2020-11-19
11381139흥해농업협동조합경상북도 포항시 북구 흥해읍 흥해로 120 (총 2 필지)054-261-5010054-262-2185일반미곡물 도정업2020-11-19
11391140흥해농업협동조합 제2공장경상북도 포항시 북구 흥해읍 약성리 45-1번지054-261-5001<NA>도정 쌀곡물 도정업2020-11-19