Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

Numeric1
Text4

Dataset

Description전국의 공장등록현황 자료입니다. 공장설립온라인지원시스템에 등록된 공장의 회사명, 산업단지명, 생산품, 공장주소를 포함하고 있습니다. 월 마지막 일을 기점으로 현황 데이터를 출력합니다.
Author한국산업단지공단
URLhttps://www.data.go.kr/data/15105482/fileData.do

Alerts

순번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 19:13:15.662990
Analysis finished2024-03-14 19:13:18.934767
Duration3.27 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49248.05
Minimum14
Maximum98217
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T04:13:19.095627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum14
5-th percentile4817.4
Q124775
median49169.5
Q373770
95-th percentile93261.6
Maximum98217
Range98203
Interquartile range (IQR)48995

Descriptive statistics

Standard deviation28301.18
Coefficient of variation (CV)0.574666
Kurtosis-1.1956374
Mean49248.05
Median Absolute Deviation (MAD)24500
Skewness0.00062039976
Sum4.924805 × 108
Variance8.0095681 × 108
MonotonicityNot monotonic
2024-03-15T04:13:19.358651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14188 1
 
< 0.1%
94476 1
 
< 0.1%
84277 1
 
< 0.1%
34896 1
 
< 0.1%
86533 1
 
< 0.1%
33361 1
 
< 0.1%
95218 1
 
< 0.1%
17900 1
 
< 0.1%
23156 1
 
< 0.1%
82072 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
14 1
< 0.1%
43 1
< 0.1%
53 1
< 0.1%
69 1
< 0.1%
80 1
< 0.1%
86 1
< 0.1%
95 1
< 0.1%
99 1
< 0.1%
114 1
< 0.1%
124 1
< 0.1%
ValueCountFrequency (%)
98217 1
< 0.1%
98209 1
< 0.1%
98202 1
< 0.1%
98194 1
< 0.1%
98191 1
< 0.1%
98184 1
< 0.1%
98183 1
< 0.1%
98182 1
< 0.1%
98181 1
< 0.1%
98166 1
< 0.1%
Distinct9748
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T04:13:20.558246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length23
Mean length7.0406
Min length1

Characters and Unicode

Total characters70406
Distinct characters823
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9534 ?
Unique (%)95.3%

Sample

1st row흥진산업(주)
2nd row(주)삼공씨앤피
3rd row(주)쁘띠아미
4th row영인공영(주)
5th row(주)메가클라우드
ValueCountFrequency (%)
주식회사 639
 
5.8%
제2공장 32
 
0.3%
21
 
0.2%
2공장 21
 
0.2%
농업회사법인 19
 
0.2%
유한회사 9
 
0.1%
tech 8
 
0.1%
사단법인 7
 
0.1%
eng 6
 
0.1%
제1공장 6
 
0.1%
Other values (9918) 10273
93.0%
2024-03-15T04:13:21.971914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6157
 
8.7%
) 5501
 
7.8%
( 5496
 
7.8%
2401
 
3.4%
1992
 
2.8%
1205
 
1.7%
1177
 
1.7%
1168
 
1.7%
1076
 
1.5%
966
 
1.4%
Other values (813) 43267
61.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 56658
80.5%
Close Punctuation 5502
 
7.8%
Open Punctuation 5497
 
7.8%
Space Separator 1168
 
1.7%
Uppercase Letter 1024
 
1.5%
Decimal Number 179
 
0.3%
Lowercase Letter 171
 
0.2%
Other Punctuation 130
 
0.2%
Other Symbol 56
 
0.1%
Dash Punctuation 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6157
 
10.9%
2401
 
4.2%
1992
 
3.5%
1205
 
2.1%
1177
 
2.1%
1076
 
1.9%
966
 
1.7%
871
 
1.5%
863
 
1.5%
843
 
1.5%
Other values (741) 39107
69.0%
Uppercase Letter
ValueCountFrequency (%)
E 133
13.0%
S 88
 
8.6%
N 88
 
8.6%
T 81
 
7.9%
C 77
 
7.5%
G 70
 
6.8%
M 52
 
5.1%
H 47
 
4.6%
K 46
 
4.5%
D 40
 
3.9%
Other values (16) 302
29.5%
Lowercase Letter
ValueCountFrequency (%)
e 20
11.7%
n 19
11.1%
o 17
 
9.9%
s 15
 
8.8%
c 14
 
8.2%
i 10
 
5.8%
r 9
 
5.3%
a 8
 
4.7%
t 8
 
4.7%
h 8
 
4.7%
Other values (12) 43
25.1%
Decimal Number
ValueCountFrequency (%)
2 96
53.6%
1 33
 
18.4%
3 18
 
10.1%
0 10
 
5.6%
5 6
 
3.4%
9 5
 
2.8%
4 5
 
2.8%
6 5
 
2.8%
7 1
 
0.6%
Other Punctuation
ValueCountFrequency (%)
. 86
66.2%
& 31
 
23.8%
, 8
 
6.2%
/ 2
 
1.5%
@ 1
 
0.8%
1
 
0.8%
· 1
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 5501
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 5496
> 99.9%
[ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
1168
100.0%
Other Symbol
ValueCountFrequency (%)
56
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 56712
80.5%
Common 12497
 
17.7%
Latin 1195
 
1.7%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6157
 
10.9%
2401
 
4.2%
1992
 
3.5%
1205
 
2.1%
1177
 
2.1%
1076
 
1.9%
966
 
1.7%
871
 
1.5%
863
 
1.5%
843
 
1.5%
Other values (740) 39161
69.1%
Latin
ValueCountFrequency (%)
E 133
 
11.1%
S 88
 
7.4%
N 88
 
7.4%
T 81
 
6.8%
C 77
 
6.4%
G 70
 
5.9%
M 52
 
4.4%
H 47
 
3.9%
K 46
 
3.8%
D 40
 
3.3%
Other values (38) 473
39.6%
Common
ValueCountFrequency (%)
) 5501
44.0%
( 5496
44.0%
1168
 
9.3%
2 96
 
0.8%
. 86
 
0.7%
1 33
 
0.3%
& 31
 
0.2%
- 19
 
0.2%
3 18
 
0.1%
0 10
 
0.1%
Other values (13) 39
 
0.3%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 56656
80.5%
ASCII 13690
 
19.4%
None 58
 
0.1%
CJK 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6157
 
10.9%
2401
 
4.2%
1992
 
3.5%
1205
 
2.1%
1177
 
2.1%
1076
 
1.9%
966
 
1.7%
871
 
1.5%
863
 
1.5%
843
 
1.5%
Other values (739) 39105
69.0%
ASCII
ValueCountFrequency (%)
) 5501
40.2%
( 5496
40.1%
1168
 
8.5%
E 133
 
1.0%
2 96
 
0.7%
S 88
 
0.6%
N 88
 
0.6%
. 86
 
0.6%
T 81
 
0.6%
C 77
 
0.6%
Other values (59) 876
 
6.4%
None
ValueCountFrequency (%)
56
96.6%
1
 
1.7%
· 1
 
1.7%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct156
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T04:13:22.794365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length1
Mean length5.2131
Min length1

Characters and Unicode

Total characters52131
Distinct characters208
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)0.2%

Sample

1st row부산신평장림일반산업단지(협업단지)
2nd row
3rd row
4th row
5th row
ValueCountFrequency (%)
시화국가산업단지 632
 
12.7%
남동국가산업단지 534
 
10.8%
반월국가산업단지 509
 
10.2%
서울디지털국가산업단지 347
 
7.0%
성남일반산업단지 272
 
5.5%
성서지방산업단지 266
 
5.4%
명지녹산국가산업단지 135
 
2.7%
대덕테크노밸리 98
 
2.0%
대구제3일반산업단지 94
 
1.9%
한국수출산업(주안)국가산업단지 90
 
1.8%
Other values (148) 1989
40.1%
2024-03-15T04:13:23.751939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5698
 
10.9%
5346
 
10.3%
5136
 
9.9%
4853
 
9.3%
4786
 
9.2%
2771
 
5.3%
2556
 
4.9%
2025
 
3.9%
1499
 
2.9%
901
 
1.7%
Other values (198) 16560
31.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45867
88.0%
Space Separator 5136
 
9.9%
Decimal Number 351
 
0.7%
Close Punctuation 314
 
0.6%
Open Punctuation 314
 
0.6%
Other Punctuation 66
 
0.1%
Uppercase Letter 55
 
0.1%
Lowercase Letter 24
 
< 0.1%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5698
 
12.4%
5346
 
11.7%
4853
 
10.6%
4786
 
10.4%
2771
 
6.0%
2556
 
5.6%
2025
 
4.4%
1499
 
3.3%
901
 
2.0%
828
 
1.8%
Other values (173) 14604
31.8%
Uppercase Letter
ValueCountFrequency (%)
I 15
27.3%
P 14
25.5%
H 10
18.2%
C 5
 
9.1%
F 4
 
7.3%
W 2
 
3.6%
G 2
 
3.6%
K 2
 
3.6%
T 1
 
1.8%
Decimal Number
ValueCountFrequency (%)
3 117
33.3%
2 101
28.8%
1 91
25.9%
4 34
 
9.7%
5 8
 
2.3%
Lowercase Letter
ValueCountFrequency (%)
o 8
33.3%
k 4
16.7%
r 4
16.7%
a 4
16.7%
d 4
16.7%
Other Punctuation
ValueCountFrequency (%)
, 48
72.7%
. 18
 
27.3%
Space Separator
ValueCountFrequency (%)
5136
100.0%
Close Punctuation
ValueCountFrequency (%)
) 314
100.0%
Open Punctuation
ValueCountFrequency (%)
( 314
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45867
88.0%
Common 6185
 
11.9%
Latin 79
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5698
 
12.4%
5346
 
11.7%
4853
 
10.6%
4786
 
10.4%
2771
 
6.0%
2556
 
5.6%
2025
 
4.4%
1499
 
3.3%
901
 
2.0%
828
 
1.8%
Other values (173) 14604
31.8%
Latin
ValueCountFrequency (%)
I 15
19.0%
P 14
17.7%
H 10
12.7%
o 8
10.1%
C 5
 
6.3%
k 4
 
5.1%
r 4
 
5.1%
a 4
 
5.1%
d 4
 
5.1%
F 4
 
5.1%
Other values (4) 7
8.9%
Common
ValueCountFrequency (%)
5136
83.0%
) 314
 
5.1%
( 314
 
5.1%
3 117
 
1.9%
2 101
 
1.6%
1 91
 
1.5%
, 48
 
0.8%
4 34
 
0.5%
. 18
 
0.3%
5 8
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 45867
88.0%
ASCII 6264
 
12.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5698
 
12.4%
5346
 
11.7%
4853
 
10.6%
4786
 
10.4%
2771
 
6.0%
2556
 
5.6%
2025
 
4.4%
1499
 
3.3%
901
 
2.0%
828
 
1.8%
Other values (173) 14604
31.8%
ASCII
ValueCountFrequency (%)
5136
82.0%
) 314
 
5.0%
( 314
 
5.0%
3 117
 
1.9%
2 101
 
1.6%
1 91
 
1.5%
, 48
 
0.8%
4 34
 
0.5%
. 18
 
0.3%
I 15
 
0.2%
Other values (15) 76
 
1.2%
Distinct8010
Distinct (%)80.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T04:13:25.055478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length168
Median length64
Mean length8.9359
Min length1

Characters and Unicode

Total characters89359
Distinct characters885
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7405 ?
Unique (%)74.1%

Sample

1st row철구제작 외
2nd row반질크림
3rd row쌀빵
4th row아파트용 소화전
5th rowCCTV, 주차관제, 조명기구
ValueCountFrequency (%)
492
 
2.7%
480
 
2.6%
부품 202
 
1.1%
금형 174
 
1.0%
170
 
0.9%
자동차부품 138
 
0.8%
자동차 110
 
0.6%
산업용 99
 
0.5%
전자부품 96
 
0.5%
기계부품 92
 
0.5%
Other values (8979) 16243
88.8%
2024-03-15T04:13:26.738840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8513
 
9.5%
, 4350
 
4.9%
3664
 
4.1%
2095
 
2.3%
1518
 
1.7%
1517
 
1.7%
1427
 
1.6%
1424
 
1.6%
1408
 
1.6%
1379
 
1.5%
Other values (875) 62064
69.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 69703
78.0%
Space Separator 8514
 
9.5%
Other Punctuation 4546
 
5.1%
Uppercase Letter 3495
 
3.9%
Lowercase Letter 1851
 
2.1%
Close Punctuation 514
 
0.6%
Open Punctuation 514
 
0.6%
Decimal Number 167
 
0.2%
Dash Punctuation 52
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3664
 
5.3%
2095
 
3.0%
1518
 
2.2%
1517
 
2.2%
1427
 
2.0%
1424
 
2.0%
1408
 
2.0%
1379
 
2.0%
1298
 
1.9%
1067
 
1.5%
Other values (796) 52906
75.9%
Uppercase Letter
ValueCountFrequency (%)
C 423
12.1%
E 365
10.4%
D 343
9.8%
L 331
9.5%
P 300
 
8.6%
T 252
 
7.2%
S 188
 
5.4%
V 168
 
4.8%
A 151
 
4.3%
R 148
 
4.2%
Other values (16) 826
23.6%
Lowercase Letter
ValueCountFrequency (%)
e 238
12.9%
o 142
 
7.7%
t 136
 
7.3%
l 135
 
7.3%
r 131
 
7.1%
i 127
 
6.9%
a 121
 
6.5%
s 116
 
6.3%
n 101
 
5.5%
c 93
 
5.0%
Other values (14) 511
27.6%
Decimal Number
ValueCountFrequency (%)
3 35
21.0%
2 31
18.6%
0 25
15.0%
1 25
15.0%
4 17
10.2%
6 8
 
4.8%
5 8
 
4.8%
8 8
 
4.8%
9 7
 
4.2%
7 3
 
1.8%
Other Punctuation
ValueCountFrequency (%)
, 4350
95.7%
. 99
 
2.2%
/ 69
 
1.5%
' 11
 
0.2%
5
 
0.1%
& 4
 
0.1%
? 4
 
0.1%
% 2
 
< 0.1%
· 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
8513
> 99.9%
  1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 513
99.8%
] 1
 
0.2%
Open Punctuation
ValueCountFrequency (%)
( 513
99.8%
[ 1
 
0.2%
Math Symbol
ValueCountFrequency (%)
~ 1
50.0%
+ 1
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 69698
78.0%
Common 14310
 
16.0%
Latin 5346
 
6.0%
Han 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3664
 
5.3%
2095
 
3.0%
1518
 
2.2%
1517
 
2.2%
1427
 
2.0%
1424
 
2.0%
1408
 
2.0%
1379
 
2.0%
1298
 
1.9%
1067
 
1.5%
Other values (795) 52901
75.9%
Latin
ValueCountFrequency (%)
C 423
 
7.9%
E 365
 
6.8%
D 343
 
6.4%
L 331
 
6.2%
P 300
 
5.6%
T 252
 
4.7%
e 238
 
4.5%
S 188
 
3.5%
V 168
 
3.1%
A 151
 
2.8%
Other values (40) 2587
48.4%
Common
ValueCountFrequency (%)
8513
59.5%
, 4350
30.4%
) 513
 
3.6%
( 513
 
3.6%
. 99
 
0.7%
/ 69
 
0.5%
- 52
 
0.4%
3 35
 
0.2%
2 31
 
0.2%
0 25
 
0.2%
Other values (19) 110
 
0.8%
Han
ValueCountFrequency (%)
5
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 69697
78.0%
ASCII 19648
 
22.0%
None 8
 
< 0.1%
CJK 5
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8513
43.3%
, 4350
22.1%
) 513
 
2.6%
( 513
 
2.6%
C 423
 
2.2%
E 365
 
1.9%
D 343
 
1.7%
L 331
 
1.7%
P 300
 
1.5%
T 252
 
1.3%
Other values (66) 3745
19.1%
Hangul
ValueCountFrequency (%)
3664
 
5.3%
2095
 
3.0%
1518
 
2.2%
1517
 
2.2%
1427
 
2.0%
1424
 
2.0%
1408
 
2.0%
1379
 
2.0%
1298
 
1.9%
1067
 
1.5%
Other values (794) 52900
75.9%
None
ValueCountFrequency (%)
5
62.5%
· 2
 
25.0%
  1
 
12.5%
CJK
ValueCountFrequency (%)
5
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
Distinct9839
Distinct (%)98.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T04:13:28.060581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length110
Median length66
Mean length33.2375
Min length1

Characters and Unicode

Total characters332375
Distinct characters672
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks8 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9715 ?
Unique (%)97.2%

Sample

1st row부산광역시 사하구 다산로 125 (다대동)
2nd row경기도 용인시 기흥구 용구대로1855번길 15, (하갈동 145-5번지) (하갈동, 삼공제약)
3rd row경기도 남양주시 화도읍 소래비로 121-12
4th row경기도 파주시 탄현면 한산로 62-5 ((주)영인공영)
5th row경기도 의왕시 경수대로 209, 5층 504-1호 (고천동, 의왕월드비젼)
ValueCountFrequency (%)
경기도 4258
 
6.5%
인천광역시 1386
 
2.1%
부산광역시 1130
 
1.7%
서울특별시 1103
 
1.7%
대구광역시 894
 
1.4%
799
 
1.2%
안산시 716
 
1.1%
시흥시 677
 
1.0%
단원구 668
 
1.0%
남동구 593
 
0.9%
Other values (12280) 53658
81.4%
2024-03-15T04:13:29.934203image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
56024
 
16.9%
1 13269
 
4.0%
11930
 
3.6%
11226
 
3.4%
9546
 
2.9%
) 9480
 
2.9%
( 9479
 
2.9%
2 8721
 
2.6%
8695
 
2.6%
, 7084
 
2.1%
Other values (662) 186921
56.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 185343
55.8%
Decimal Number 59311
 
17.8%
Space Separator 56024
 
16.9%
Close Punctuation 9508
 
2.9%
Open Punctuation 9507
 
2.9%
Other Punctuation 7145
 
2.1%
Dash Punctuation 2910
 
0.9%
Uppercase Letter 2410
 
0.7%
Lowercase Letter 102
 
< 0.1%
Math Symbol 69
 
< 0.1%
Other values (2) 46
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11930
 
6.4%
11226
 
6.1%
9546
 
5.2%
8695
 
4.7%
5735
 
3.1%
5677
 
3.1%
5106
 
2.8%
4855
 
2.6%
4799
 
2.6%
4574
 
2.5%
Other values (589) 113200
61.1%
Uppercase Letter
ValueCountFrequency (%)
B 737
30.6%
A 327
13.6%
L 286
 
11.9%
T 183
 
7.6%
I 130
 
5.4%
C 102
 
4.2%
K 88
 
3.7%
S 83
 
3.4%
M 65
 
2.7%
V 61
 
2.5%
Other values (15) 348
14.4%
Lowercase Letter
ValueCountFrequency (%)
l 38
37.3%
e 13
 
12.7%
r 7
 
6.9%
n 6
 
5.9%
t 6
 
5.9%
c 6
 
5.9%
a 5
 
4.9%
b 5
 
4.9%
o 4
 
3.9%
w 3
 
2.9%
Other values (7) 9
 
8.8%
Decimal Number
ValueCountFrequency (%)
1 13269
22.4%
2 8721
14.7%
3 6987
11.8%
0 5469
9.2%
4 5314
9.0%
5 4848
 
8.2%
6 4342
 
7.3%
7 3730
 
6.3%
8 3587
 
6.0%
9 3044
 
5.1%
Other Punctuation
ValueCountFrequency (%)
, 7084
99.1%
/ 29
 
0.4%
. 18
 
0.3%
& 9
 
0.1%
· 2
 
< 0.1%
: 1
 
< 0.1%
* 1
 
< 0.1%
; 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
22
64.7%
7
 
20.6%
5
 
14.7%
Close Punctuation
ValueCountFrequency (%)
) 9480
99.7%
] 28
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 9479
99.7%
[ 28
 
0.3%
Math Symbol
ValueCountFrequency (%)
~ 64
92.8%
5
 
7.2%
Other Symbol
ValueCountFrequency (%)
11
91.7%
1
 
8.3%
Space Separator
ValueCountFrequency (%)
56024
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2910
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 185343
55.8%
Common 144485
43.5%
Latin 2546
 
0.8%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11930
 
6.4%
11226
 
6.1%
9546
 
5.2%
8695
 
4.7%
5735
 
3.1%
5677
 
3.1%
5106
 
2.8%
4855
 
2.6%
4799
 
2.6%
4574
 
2.5%
Other values (589) 113200
61.1%
Latin
ValueCountFrequency (%)
B 737
28.9%
A 327
12.8%
L 286
 
11.2%
T 183
 
7.2%
I 130
 
5.1%
C 102
 
4.0%
K 88
 
3.5%
S 83
 
3.3%
M 65
 
2.6%
V 61
 
2.4%
Other values (35) 484
19.0%
Common
ValueCountFrequency (%)
56024
38.8%
1 13269
 
9.2%
) 9480
 
6.6%
( 9479
 
6.6%
2 8721
 
6.0%
, 7084
 
4.9%
3 6987
 
4.8%
0 5469
 
3.8%
4 5314
 
3.7%
5 4848
 
3.4%
Other values (17) 17810
 
12.3%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 185341
55.8%
ASCII 146979
44.2%
Number Forms 34
 
< 0.1%
Enclosed Alphanum 11
 
< 0.1%
Math Operators 5
 
< 0.1%
None 3
 
< 0.1%
Compat Jamo 1
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
56024
38.1%
1 13269
 
9.0%
) 9480
 
6.4%
( 9479
 
6.4%
2 8721
 
5.9%
, 7084
 
4.8%
3 6987
 
4.8%
0 5469
 
3.7%
4 5314
 
3.6%
5 4848
 
3.3%
Other values (56) 20304
 
13.8%
Hangul
ValueCountFrequency (%)
11930
 
6.4%
11226
 
6.1%
9546
 
5.2%
8695
 
4.7%
5735
 
3.1%
5677
 
3.1%
5106
 
2.8%
4855
 
2.6%
4799
 
2.6%
4574
 
2.5%
Other values (587) 113198
61.1%
Number Forms
ValueCountFrequency (%)
22
64.7%
7
 
20.6%
5
 
14.7%
Enclosed Alphanum
ValueCountFrequency (%)
11
100.0%
Math Operators
ValueCountFrequency (%)
5
100.0%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
100.0%

Interactions

2024-03-15T04:13:18.183424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-03-15T04:13:18.531652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T04:13:18.851094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번회사명단지명생산품공장주소
1418714188흥진산업(주)부산신평장림일반산업단지(협업단지)철구제작 외부산광역시 사하구 다산로 125 (다대동)
9253792538(주)삼공씨앤피반질크림경기도 용인시 기흥구 용구대로1855번길 15, (하갈동 145-5번지) (하갈동, 삼공제약)
7935979360(주)쁘띠아미쌀빵경기도 남양주시 화도읍 소래비로 121-12
9684796848영인공영(주)아파트용 소화전경기도 파주시 탄현면 한산로 62-5 ((주)영인공영)
8907489075(주)메가클라우드CCTV, 주차관제, 조명기구경기도 의왕시 경수대로 209, 5층 504-1호 (고천동, 의왕월드비젼)
9119791198오륜플라스틱사출품,PVC원단,스폰지경기도 용인시 처인구 모현읍 문형동림로 31 (성호기업)
7298972990서울금속(주)시화국가산업단지볼트,너트 및 와셔 등 화스너류경기도 안산시 단원구 번영1로 8, 4마 101-2호(성곡동)
7968079681(주)리오앤코조명장치, 시계, 기타 생활용품경기도 남양주시 화도읍 창현리 128-8번지
7803178032주식회사 세타전기식 교통신호장치경기도 고양시 일산서구 송산로 528, 103호, 119호(덕이동)
3525535256(주)영풍트란스포트엔지니어링남동국가산업단지운반기계인천광역시 남동구 남동서로83번길 52, 116블럭 4로트 (고잔동)
순번회사명단지명생산품공장주소
4725547256(주)엘렉스코팅광주하남일반산업단지전자제품 외관도장광주광역시 광산구 하남산단3번로 123 (장덕동)
3781237813피엠테크남동국가산업단지기계부품인천광역시 남동구 호구포로 139, 410호(고잔동)
8220982210라온라이팅LED교통신호등, 조명기구경기도 시흥시 서울대학로 59-21, 4층 411 (정왕동)
1196011961(주)아라트산업용기계,선박수리부산광역시 영도구 해양로 33-52 (청학동)
7717377174와이비소프트(주)자전거 및 환자용 차량경기도 고양시 일산동구 하늘마을로 158 씨동 101, 902호(중산동, 대방트리플라온 비즈니스 타워)
8176381764대림전자시흥매화일반산업단지전자저항기경기도 시흥시 매화산단3길 50, A3-4(매화동)
7595075951(주)팜한농 반월공장반월국가산업단지화학제품경기도 안산시 단원구 해안로 131, 목내동 433 (목내동)
2899828999세명침구침구류대구광역시 달성군 화원읍 비슬로495길 9
5749257493주식회사 퓨어그래스자동차용 매트경기도 수원시 팔달구 경수대로507번길 5, 지하층 102호 (인계동, 정수빌딩)
9105791058(주) 나노전기차단기경기도 용인시 처인구 모현읍 매산리 609-16 외 2필지