Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 556.6 KiB
Average record size in memory: 57.0 B
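
The derived figures in this table follow from the raw ones by simple arithmetic. The sketch below is a minimal check, not the profiler's own code; it assumes 1 KiB = 1024 bytes, and the helper names are hypothetical.

```python
# Reported values copied from the "Dataset statistics" table.
N_VARIABLES = 6
N_OBSERVATIONS = 10_000
MISSING_CELLS = 0
TOTAL_SIZE_KIB = 556.6


def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Average bytes per row, assuming 1 KiB = 1024 bytes."""
    return total_kib * 1024 / n_rows


def missing_cell_percent(missing: int, n_rows: int, n_cols: int) -> float:
    """Share of missing cells among all n_rows * n_cols cells, in percent."""
    return 100 * missing / (n_rows * n_cols)


avg_bytes = average_record_size_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS)
missing_pct = missing_cell_percent(MISSING_CELLS, N_OBSERVATIONS, N_VARIABLES)
print(f"{avg_bytes:.1f} B per record, {missing_pct:.1f}% missing cells")
```

Rounded to one decimal, the result matches the reported average record size.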

Variable types

Numeric: 1
Text: 5

Dataset

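Description: Data on the manufacturing industry in Hwaseong-si, Gyeonggi-do. It includes each company's name, telephone number, products, main factory address, and industry name.
Author: Hwaseong-si, Gyeonggi-do (경기도 화성시)
URL: https://www.data.go.kr/data/15112420/fileData.do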

Alerts

연번 has unique values (Unique)

Reproduction

Analysis started: 2024-05-11 10:14:34.319114
Analysis finished: 2024-05-11 10:14:38.900466
Duration: 4.58 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
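
The reported duration can be recomputed from the two timestamps. This is a small sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" section.
started = datetime.fromisoformat("2024-05-11 10:14:34.319114")
finished = datetime.fromisoformat("2024-05-11 10:14:38.900466")

# Elapsed wall-clock time between the start and end of the analysis.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.2f} seconds")
```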

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10394.448
Minimum: 1
Maximum: 20748
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1065.9
Q1: 5260.5
Median: 10356
Q3: 15591.25
95-th percentile: 19684.05
Maximum: 20748
Range: 20747
Interquartile range (IQR): 10330.75
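
Fractional quantiles such as 5260.5 are consistent with linear interpolation between neighbouring order statistics, which is an assumption here rather than something the report states. The sketch below implements that rule on a toy list (the ten smallest 연번 values listed in this report) and recomputes the range and IQR from the reported figures; `quantile_linear` is a hypothetical helper.

```python
def quantile_linear(values, q):
    """Return the q-th quantile (0 <= q <= 1) using linear interpolation."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    position = q * (len(ordered) - 1)   # fractional index into the sorted data
    lower = int(position)               # index of the order statistic below
    upper = min(lower + 1, len(ordered) - 1)
    fraction = position - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


# Toy input only: the ten smallest 연번 values shown in the report.
toy = [1, 2, 4, 5, 8, 10, 12, 16, 17, 18]
toy_q1 = quantile_linear(toy, 0.25)
toy_median = quantile_linear(toy, 0.50)
toy_q3 = quantile_linear(toy, 0.75)

# Derived statistics recomputed from the reported values.
reported_range = 20748 - 1
reported_iqr = 15591.25 - 5260.5
print(toy_q1, toy_median, toy_q3, reported_range, reported_iqr)
```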

Descriptive statistics

Standard deviation: 5973.586
Coefficient of variation (CV): 0.57469007
Kurtosis: -1.1990345
Mean: 10394.448
Median Absolute Deviation (MAD): 5167.5
Skewness: -0.0019023471
Sum: 1.0394448 × 10^8
Variance: 35683729
Monotonicity: Not monotonic
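
Several of these statistics are tied together by definition: CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of observations. The following sketch checks the reported values against each other; it is a consistency check under those definitions, not the profiler's implementation.

```python
# Reported values copied from the statistics for 연번.
mean = 10394.448
std = 5973.586
n_observations = 10_000

cv = std / mean                 # coefficient of variation
variance = std ** 2             # variance as squared standard deviation
total = mean * n_observations   # sum of all values

print(f"CV={cv:.8f}, variance={variance:.0f}, sum={total:.7e}")
```

The small gap between the recomputed and reported variance comes from the standard deviation being rounded to three decimals.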
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10272 1
 
< 0.1%
3835 1
 
< 0.1%
14687 1
 
< 0.1%
11148 1
 
< 0.1%
14512 1
 
< 0.1%
20181 1
 
< 0.1%
1849 1
 
< 0.1%
4887 1
 
< 0.1%
17464 1
 
< 0.1%
19790 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
8 1
< 0.1%
10 1
< 0.1%
12 1
< 0.1%
16 1
< 0.1%
17 1
< 0.1%
18 1
< 0.1%
ValueCountFrequency (%)
20748 1
< 0.1%
20746 1
< 0.1%
20743 1
< 0.1%
20742 1
< 0.1%
20741 1
< 0.1%
20737 1
< 0.1%
20736 1
< 0.1%
20732 1
< 0.1%
20731 1
< 0.1%
20728 1
< 0.1%

기업명
Text

Distinct: 9334
Distinct (%): 93.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 26
Median length: 24
Mean length: 5.0404
Min length: 1

Characters and Unicode

Total characters: 50404
Distinct characters: 779
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
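
As an illustration of the character properties mentioned above, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories over the five sample company names from this report; `category_counts` is a hypothetical helper, and the profiler may compute its tables differently.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character of every value."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Sample rows of the company-name column shown in this report.
sample_names = ["에스에스금속", "피케이정공", "제이디시스템", "로뎀바스", "티에이치텍"]
name_counts = category_counts(sample_names)

# "Lo" is the two-letter code for "Other Letter", which covers Hangul syllables.
print(dict(name_counts))
```

Hangul syllables fall in the "Other Letter" category, which is why that category dominates the tables for the text columns.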

Unique

Unique: 8798
Unique (%): 88.0%
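
"Distinct" and "Unique" measure different things: distinct counts how many different values occur, while unique counts the values that occur exactly once. A minimal sketch of the distinction, using a toy list and a hypothetical helper:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values occurring once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# Toy input: one name is repeated, so it is distinct but not unique.
toy_names = ["한텍시스템", "한텍시스템", "태성테크", "에스에스금속", "피케이정공"]
n_distinct, n_unique = distinct_and_unique(toy_names)

# Percentages as reported, relative to the 10000 observations.
distinct_pct = round(100 * 9334 / 10000, 1)
unique_pct = round(100 * 8798 / 10000, 1)
print(n_distinct, n_unique, distinct_pct, unique_pct)
```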

Sample

1st row: 에스에스금속
2nd row: 피케이정공
3rd row: 제이디시스템
4th row: 로뎀바스
5th row: 티에이치텍
ValueCountFrequency (%)
주식회사 125
 
1.2%
농업회사법인 16
 
0.2%
tech 13
 
0.1%
코리아 11
 
0.1%
테크 11
 
0.1%
한텍시스템 10
 
0.1%
9
 
0.1%
이엔지 7
 
0.1%
에스 6
 
0.1%
태성테크 6
 
0.1%
Other values (9428) 10173
97.9%

Most occurring characters

ValueCountFrequency (%)
3123
 
6.2%
2255
 
4.5%
1730
 
3.4%
1337
 
2.7%
1235
 
2.5%
1012
 
2.0%
895
 
1.8%
852
 
1.7%
849
 
1.7%
795
 
1.6%
Other values (769) 36321
72.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 48217
95.7%
Uppercase Letter 826
 
1.6%
Space Separator 403
 
0.8%
Lowercase Letter 263
 
0.5%
Close Punctuation 259
 
0.5%
Open Punctuation 257
 
0.5%
Other Punctuation 116
 
0.2%
Decimal Number 48
 
0.1%
Dash Punctuation 14
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3123
 
6.5%
2255
 
4.7%
1730
 
3.6%
1337
 
2.8%
1235
 
2.6%
1012
 
2.1%
895
 
1.9%
852
 
1.8%
849
 
1.8%
795
 
1.6%
Other values (700) 34134
70.8%
Uppercase Letter
ValueCountFrequency (%)
E 104
12.6%
T 80
 
9.7%
S 79
 
9.6%
N 64
 
7.7%
C 59
 
7.1%
G 57
 
6.9%
M 40
 
4.8%
H 37
 
4.5%
A 31
 
3.8%
I 31
 
3.8%
Other values (15) 244
29.5%
Lowercase Letter
ValueCountFrequency (%)
e 38
14.4%
o 29
11.0%
n 18
 
6.8%
l 18
 
6.8%
r 17
 
6.5%
a 17
 
6.5%
i 16
 
6.1%
c 15
 
5.7%
t 15
 
5.7%
s 12
 
4.6%
Other values (12) 68
25.9%
Decimal Number
ValueCountFrequency (%)
2 12
25.0%
1 6
12.5%
7 5
10.4%
3 5
10.4%
8 4
 
8.3%
5 4
 
8.3%
9 4
 
8.3%
4 4
 
8.3%
6 2
 
4.2%
0 2
 
4.2%
Other Punctuation
ValueCountFrequency (%)
. 76
65.5%
& 23
 
19.8%
15
 
12.9%
/ 2
 
1.7%
Space Separator
ValueCountFrequency (%)
402
99.8%
  1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 257
99.2%
2
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 255
99.2%
2
 
0.8%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 48211
95.6%
Common 1098
 
2.2%
Latin 1089
 
2.2%
Han 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3123
 
6.5%
2255
 
4.7%
1730
 
3.6%
1337
 
2.8%
1235
 
2.6%
1012
 
2.1%
895
 
1.9%
852
 
1.8%
849
 
1.8%
795
 
1.6%
Other values (694) 34128
70.8%
Latin
ValueCountFrequency (%)
E 104
 
9.6%
T 80
 
7.3%
S 79
 
7.3%
N 64
 
5.9%
C 59
 
5.4%
G 57
 
5.2%
M 40
 
3.7%
e 38
 
3.5%
H 37
 
3.4%
A 31
 
2.8%
Other values (37) 500
45.9%
Common
ValueCountFrequency (%)
402
36.6%
) 257
23.4%
( 255
23.2%
. 76
 
6.9%
& 23
 
2.1%
15
 
1.4%
- 14
 
1.3%
2 12
 
1.1%
1 6
 
0.5%
7 5
 
0.5%
Other values (12) 33
 
3.0%
Han
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
調 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 48211
95.6%
ASCII 2167
 
4.3%
None 20
 
< 0.1%
CJK 6
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3123
 
6.5%
2255
 
4.7%
1730
 
3.6%
1337
 
2.8%
1235
 
2.6%
1012
 
2.1%
895
 
1.9%
852
 
1.8%
849
 
1.8%
795
 
1.6%
Other values (694) 34128
70.8%
ASCII
ValueCountFrequency (%)
402
18.6%
) 257
 
11.9%
( 255
 
11.8%
E 104
 
4.8%
T 80
 
3.7%
S 79
 
3.6%
. 76
 
3.5%
N 64
 
3.0%
C 59
 
2.7%
G 57
 
2.6%
Other values (55) 734
33.9%
None
ValueCountFrequency (%)
15
75.0%
2
 
10.0%
2
 
10.0%
  1
 
5.0%
CJK
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
調 1
16.7%

전화번호
Text

Distinct: 6000
Distinct (%): 60.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 14
Median length: 12
Mean length: 10.2015
Min length: 7

Characters and Unicode

Total characters: 102015
Distinct characters: 19
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 5753
Unique (%): 57.5%

Sample

1st row: 031-356-1882
2nd row: 031-366-7242
3rd row: 데이터 미집계
4th row: 데이터 미집계
5th row: 031-236-3071
ValueCountFrequency (%)
데이터 3733
27.2%
미집계 3733
27.2%
031-357-9371 11
 
0.1%
031-434-6601 4
 
< 0.1%
031-8067-7182 4
 
< 0.1%
031-357-7075 3
 
< 0.1%
070-7770-5082 3
 
< 0.1%
031-356-6367 3
 
< 0.1%
031-488-9601 3
 
< 0.1%
031-227-7030 3
 
< 0.1%
Other values (5991) 6233
45.4%
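
The words 데이터 and 미집계 each appear 3733 times, which suggests that the placeholder string "데이터 미집계" ("data not collected") fills 3733 rows even though Missing is reported as 0. The sketch below measures the share of placeholder values; it assumes each word occurrence corresponds to one row holding exactly that placeholder, and `placeholder_ratio` is a hypothetical helper.

```python
PLACEHOLDER = "데이터 미집계"  # placeholder string seen in the report's sample rows


def placeholder_ratio(values, placeholder=PLACEHOLDER):
    """Fraction of values that consist only of the placeholder string."""
    if not values:
        return 0.0
    hits = sum(1 for value in values if value.strip() == placeholder)
    return hits / len(values)


# The five sample rows of the telephone column shown in this report.
sample_phones = [
    "031-356-1882",
    "031-366-7242",
    "데이터 미집계",
    "데이터 미집계",
    "031-236-3071",
]
sample_ratio = placeholder_ratio(sample_phones)

# Share implied by the reported word counts (assumption: one placeholder per row).
reported_ratio = 3733 / 10000
print(sample_ratio, reported_ratio)
```

Treating the placeholder as missing would put the effective missing rate for this column near 37%, under the stated assumption.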

Most occurring characters

ValueCountFrequency (%)
- 12488
12.2%
3 12452
12.2%
0 10620
 
10.4%
1 9219
 
9.0%
5 6208
 
6.1%
2 5149
 
5.0%
7 4201
 
4.1%
8 4156
 
4.1%
6 4107
 
4.0%
4 3986
 
3.9%
Other values (9) 29429
28.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 63394
62.1%
Other Letter 22398
 
22.0%
Dash Punctuation 12488
 
12.2%
Space Separator 3733
 
3.7%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 12452
19.6%
0 10620
16.8%
1 9219
14.5%
5 6208
9.8%
2 5149
8.1%
7 4201
 
6.6%
8 4156
 
6.6%
6 4107
 
6.5%
4 3986
 
6.3%
9 3296
 
5.2%
Other Letter
ValueCountFrequency (%)
3733
16.7%
3733
16.7%
3733
16.7%
3733
16.7%
3733
16.7%
3733
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 12488
100.0%
Space Separator
ValueCountFrequency (%)
3733
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 79617
78.0%
Hangul 22398
 
22.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 12488
15.7%
3 12452
15.6%
0 10620
13.3%
1 9219
11.6%
5 6208
7.8%
2 5149
6.5%
7 4201
 
5.3%
8 4156
 
5.2%
6 4107
 
5.2%
4 3986
 
5.0%
Other values (3) 7031
8.8%
Hangul
ValueCountFrequency (%)
3733
16.7%
3733
16.7%
3733
16.7%
3733
16.7%
3733
16.7%
3733
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 79617
78.0%
Hangul 22398
 
22.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 12488
15.7%
3 12452
15.6%
0 10620
13.3%
1 9219
11.6%
5 6208
7.8%
2 5149
6.5%
7 4201
 
5.3%
8 4156
 
5.2%
6 4107
 
5.2%
4 3986
 
5.0%
Other values (3) 7031
8.8%
Hangul
ValueCountFrequency (%)
3733
16.7%
3733
16.7%
3733
16.7%
3733
16.7%
3733
16.7%
3733
16.7%

주요상품
Text

Distinct: 4216
Distinct (%): 42.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 71
Median length: 7
Mean length: 8.8795
Min length: 1

Characters and Unicode

Total characters: 88795
Distinct characters: 743
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3872
Unique (%): 38.7%

Sample

1st row: 데이터 미집계
2nd row: 파이프배관자재
3rd row: 방송장치 제조
4th row: 데이터 미집계
5th row: 시험용설비
ValueCountFrequency (%)
데이터 4875
22.3%
미집계 4875
22.3%
1077
 
4.9%
539
 
2.5%
391
 
1.8%
부품 219
 
1.0%
금형 170
 
0.8%
반도체 149
 
0.7%
플라스틱 134
 
0.6%
자동차 116
 
0.5%
Other values (5196) 9309
42.6%

Most occurring characters

ValueCountFrequency (%)
12030
 
13.5%
5480
 
6.2%
5258
 
5.9%
5132
 
5.8%
4997
 
5.6%
4897
 
5.5%
4894
 
5.5%
, 2177
 
2.5%
1670
 
1.9%
1398
 
1.6%
Other values (733) 40862
46.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 69587
78.4%
Space Separator 12031
 
13.5%
Uppercase Letter 2755
 
3.1%
Other Punctuation 2308
 
2.6%
Lowercase Letter 1243
 
1.4%
Open Punctuation 401
 
0.5%
Close Punctuation 400
 
0.5%
Decimal Number 65
 
0.1%
Math Symbol 4
 
< 0.1%
Final Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5480
 
7.9%
5258
 
7.6%
5132
 
7.4%
4997
 
7.2%
4897
 
7.0%
4894
 
7.0%
1670
 
2.4%
1398
 
2.0%
1194
 
1.7%
970
 
1.4%
Other values (658) 33697
48.4%
Uppercase Letter
ValueCountFrequency (%)
C 282
 
10.2%
L 263
 
9.5%
E 260
 
9.4%
P 245
 
8.9%
D 224
 
8.1%
S 181
 
6.6%
A 169
 
6.1%
R 143
 
5.2%
T 136
 
4.9%
O 110
 
4.0%
Other values (16) 742
26.9%
Lowercase Letter
ValueCountFrequency (%)
e 174
14.0%
r 127
10.2%
o 107
 
8.6%
a 92
 
7.4%
t 90
 
7.2%
l 88
 
7.1%
i 75
 
6.0%
s 74
 
6.0%
n 71
 
5.7%
c 50
 
4.0%
Other values (15) 295
23.7%
Decimal Number
ValueCountFrequency (%)
2 15
23.1%
1 13
20.0%
0 11
16.9%
3 11
16.9%
4 6
 
9.2%
5 5
 
7.7%
7 2
 
3.1%
6 2
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 2177
94.3%
/ 65
 
2.8%
. 37
 
1.6%
' 12
 
0.5%
· 9
 
0.4%
& 7
 
0.3%
: 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
12030
> 99.9%
  1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 396
98.8%
[ 5
 
1.2%
Close Punctuation
ValueCountFrequency (%)
) 395
98.8%
] 5
 
1.2%
Math Symbol
ValueCountFrequency (%)
+ 3
75.0%
= 1
 
25.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 69575
78.4%
Common 15210
 
17.1%
Latin 3998
 
4.5%
Han 12
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5480
 
7.9%
5258
 
7.6%
5132
 
7.4%
4997
 
7.2%
4897
 
7.0%
4894
 
7.0%
1670
 
2.4%
1398
 
2.0%
1194
 
1.7%
970
 
1.4%
Other values (657) 33685
48.4%
Latin
ValueCountFrequency (%)
C 282
 
7.1%
L 263
 
6.6%
E 260
 
6.5%
P 245
 
6.1%
D 224
 
5.6%
S 181
 
4.5%
e 174
 
4.4%
A 169
 
4.2%
R 143
 
3.6%
T 136
 
3.4%
Other values (41) 1921
48.0%
Common
ValueCountFrequency (%)
12030
79.1%
, 2177
 
14.3%
( 396
 
2.6%
) 395
 
2.6%
/ 65
 
0.4%
. 37
 
0.2%
2 15
 
0.1%
1 13
 
0.1%
' 12
 
0.1%
0 11
 
0.1%
Other values (14) 59
 
0.4%
Han
ValueCountFrequency (%)
12
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 69573
78.4%
ASCII 19197
 
21.6%
CJK 12
 
< 0.1%
None 10
 
< 0.1%
Compat Jamo 2
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12030
62.7%
, 2177
 
11.3%
( 396
 
2.1%
) 395
 
2.1%
C 282
 
1.5%
L 263
 
1.4%
E 260
 
1.4%
P 245
 
1.3%
D 224
 
1.2%
S 181
 
0.9%
Other values (62) 2744
 
14.3%
Hangul
ValueCountFrequency (%)
5480
 
7.9%
5258
 
7.6%
5132
 
7.4%
4997
 
7.2%
4897
 
7.0%
4894
 
7.0%
1670
 
2.4%
1398
 
2.0%
1194
 
1.7%
970
 
1.4%
Other values (656) 33683
48.4%
CJK
ValueCountFrequency (%)
12
100.0%
None
ValueCountFrequency (%)
· 9
90.0%
  1
 
10.0%
Compat Jamo
ValueCountFrequency (%)
2
100.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

주소
Text

Distinct: 9091
Distinct (%): 90.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 96
Median length: 68
Mean length: 28.8543
Min length: 7

Characters and Unicode

Total characters: 288543
Distinct characters: 610
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 8357
Unique (%): 83.6%

Sample

1st row: 경기 화성시 마도면 청원로274번길 139 (청원리)
2nd row: 경기 화성시 마도면 석교남길 82-7 (석교리)
3rd row: 경기 화성시 동탄면 금곡로163번길 18, 다동 (금곡리)
4th row: 경기 화성시 북삼미로 259-16 (능동)
5th row: 경기 화성시 진안북길 83-1
ValueCountFrequency (%)
경기 9661
 
15.7%
화성시 9358
 
15.2%
팔탄면 1282
 
2.1%
정남면 1153
 
1.9%
향남읍 896
 
1.5%
장안면 692
 
1.1%
봉담읍 671
 
1.1%
마도면 558
 
0.9%
남양읍 524
 
0.9%
양감면 520
 
0.8%
Other values (8624) 36053
58.7%

Most occurring characters

ValueCountFrequency (%)
51728
 
17.9%
10566
 
3.7%
1 10544
 
3.7%
10175
 
3.5%
10053
 
3.5%
9954
 
3.4%
9775
 
3.4%
( 9346
 
3.2%
) 9346
 
3.2%
7478
 
2.6%
Other values (600) 149578
51.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 159281
55.2%
Space Separator 51728
 
17.9%
Decimal Number 48665
 
16.9%
Open Punctuation 9348
 
3.2%
Close Punctuation 9348
 
3.2%
Dash Punctuation 5356
 
1.9%
Other Punctuation 4036
 
1.4%
Uppercase Letter 693
 
0.2%
Lowercase Letter 68
 
< 0.1%
Math Symbol 10
 
< 0.1%
Other values (2) 10
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10566
 
6.6%
10175
 
6.4%
10053
 
6.3%
9954
 
6.2%
9775
 
6.1%
7478
 
4.7%
6642
 
4.2%
6022
 
3.8%
5441
 
3.4%
5163
 
3.2%
Other values (530) 78012
49.0%
Uppercase Letter
ValueCountFrequency (%)
B 182
26.3%
A 163
23.5%
C 56
 
8.1%
I 45
 
6.5%
S 37
 
5.3%
T 31
 
4.5%
E 30
 
4.3%
D 22
 
3.2%
X 19
 
2.7%
K 15
 
2.2%
Other values (15) 93
13.4%
Lowercase Letter
ValueCountFrequency (%)
e 11
16.2%
c 8
11.8%
n 7
10.3%
t 6
8.8%
a 6
8.8%
m 5
7.4%
r 4
 
5.9%
i 4
 
5.9%
s 3
 
4.4%
b 3
 
4.4%
Other values (6) 11
16.2%
Decimal Number
ValueCountFrequency (%)
1 10544
21.7%
2 6728
13.8%
3 5290
10.9%
0 4293
8.8%
4 4281
8.8%
5 4172
 
8.6%
6 4149
 
8.5%
7 3307
 
6.8%
8 3018
 
6.2%
9 2882
 
5.9%
Other Punctuation
ValueCountFrequency (%)
, 3907
96.8%
. 123
 
3.0%
/ 3
 
0.1%
: 2
 
< 0.1%
# 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 6
60.0%
2
 
20.0%
> 1
 
10.0%
< 1
 
10.0%
Open Punctuation
ValueCountFrequency (%)
( 9346
> 99.9%
[ 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 9346
> 99.9%
] 2
 
< 0.1%
Letter Number
ValueCountFrequency (%)
6
66.7%
3
33.3%
Space Separator
ValueCountFrequency (%)
51728
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5356
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 159282
55.2%
Common 128491
44.5%
Latin 770
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10566
 
6.6%
10175
 
6.4%
10053
 
6.3%
9954
 
6.2%
9775
 
6.1%
7478
 
4.7%
6642
 
4.2%
6022
 
3.8%
5441
 
3.4%
5163
 
3.2%
Other values (531) 78013
49.0%
Latin
ValueCountFrequency (%)
B 182
23.6%
A 163
21.2%
C 56
 
7.3%
I 45
 
5.8%
S 37
 
4.8%
T 31
 
4.0%
E 30
 
3.9%
D 22
 
2.9%
X 19
 
2.5%
K 15
 
1.9%
Other values (33) 170
22.1%
Common
ValueCountFrequency (%)
51728
40.3%
1 10544
 
8.2%
( 9346
 
7.3%
) 9346
 
7.3%
2 6728
 
5.2%
- 5356
 
4.2%
3 5290
 
4.1%
0 4293
 
3.3%
4 4281
 
3.3%
5 4172
 
3.2%
Other values (16) 17407
 
13.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 159281
55.2%
ASCII 129249
44.8%
Number Forms 9
 
< 0.1%
Math Operators 2
 
< 0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
51728
40.0%
1 10544
 
8.2%
( 9346
 
7.2%
) 9346
 
7.2%
2 6728
 
5.2%
- 5356
 
4.1%
3 5290
 
4.1%
0 4293
 
3.3%
4 4281
 
3.3%
5 4172
 
3.2%
Other values (55) 18165
 
14.1%
Hangul
ValueCountFrequency (%)
10566
 
6.6%
10175
 
6.4%
10053
 
6.3%
9954
 
6.2%
9775
 
6.1%
7478
 
4.7%
6642
 
4.2%
6022
 
3.8%
5441
 
3.4%
5163
 
3.2%
Other values (530) 78012
49.0%
Number Forms
ValueCountFrequency (%)
6
66.7%
3
33.3%
Math Operators
ValueCountFrequency (%)
2
100.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%

산업분류
Text

Distinct: 692
Distinct (%): 6.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 54
Median length: 35
Mean length: 15.6801
Min length: 2

Characters and Unicode

Total characters: 156801
Distinct characters: 415
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 310
Unique (%): 3.1%

Sample

1st row: 기타 1차 비철금속 제조업
2nd row: 기타 가공 공작기계 제조업
3rd row: 방송장비 제조업
4th row: 기타 건축용 나무제품 제조업
5th row: 물질 검사, 측정 및 분석기구 제조업
ValueCountFrequency (%)
제조업 9808
20.8%
3671
 
7.8%
기타 3669
 
7.8%
2639
 
5.6%
2633
 
5.6%
기계 1331
 
2.8%
플라스틱 1020
 
2.2%
제품 744
 
1.6%
금형 646
 
1.4%
주형 645
 
1.4%
Other values (957) 20337
43.1%

Most occurring characters

ValueCountFrequency (%)
37182
23.7%
13335
 
8.5%
11458
 
7.3%
10469
 
6.7%
7677
 
4.9%
3898
 
2.5%
3786
 
2.4%
3704
 
2.4%
3689
 
2.4%
2657
 
1.7%
Other values (405) 58946
37.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 118158
75.4%
Space Separator 37182
 
23.7%
Other Punctuation 1181
 
0.8%
Decimal Number 198
 
0.1%
Open Punctuation 31
 
< 0.1%
Close Punctuation 31
 
< 0.1%
Uppercase Letter 13
 
< 0.1%
Lowercase Letter 4
 
< 0.1%
Connector Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13335
 
11.3%
11458
 
9.7%
10469
 
8.9%
7677
 
6.5%
3898
 
3.3%
3786
 
3.2%
3704
 
3.1%
3689
 
3.1%
2657
 
2.2%
2651
 
2.2%
Other values (373) 54834
46.4%
Decimal Number
ValueCountFrequency (%)
1 159
80.3%
2 10
 
5.1%
9 9
 
4.5%
0 7
 
3.5%
3 6
 
3.0%
4 3
 
1.5%
5 2
 
1.0%
8 1
 
0.5%
7 1
 
0.5%
Uppercase Letter
ValueCountFrequency (%)
C 3
23.1%
T 2
15.4%
D 2
15.4%
S 1
 
7.7%
R 1
 
7.7%
I 1
 
7.7%
A 1
 
7.7%
E 1
 
7.7%
L 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
, 1088
92.1%
· 48
 
4.1%
/ 27
 
2.3%
. 13
 
1.1%
; 4
 
0.3%
& 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
o 2
50.0%
e 1
25.0%
m 1
25.0%
Space Separator
ValueCountFrequency (%)
37182
100.0%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 118158
75.4%
Common 38626
 
24.6%
Latin 17
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13335
 
11.3%
11458
 
9.7%
10469
 
8.9%
7677
 
6.5%
3898
 
3.3%
3786
 
3.2%
3704
 
3.1%
3689
 
3.1%
2657
 
2.2%
2651
 
2.2%
Other values (373) 54834
46.4%
Common
ValueCountFrequency (%)
37182
96.3%
, 1088
 
2.8%
1 159
 
0.4%
· 48
 
0.1%
( 31
 
0.1%
) 31
 
0.1%
/ 27
 
0.1%
. 13
 
< 0.1%
2 10
 
< 0.1%
9 9
 
< 0.1%
Other values (10) 28
 
0.1%
Latin
ValueCountFrequency (%)
C 3
17.6%
o 2
11.8%
T 2
11.8%
D 2
11.8%
S 1
 
5.9%
R 1
 
5.9%
I 1
 
5.9%
A 1
 
5.9%
E 1
 
5.9%
e 1
 
5.9%
Other values (2) 2
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 118130
75.3%
ASCII 38595
 
24.6%
None 48
 
< 0.1%
Compat Jamo 28
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
37182
96.3%
, 1088
 
2.8%
1 159
 
0.4%
( 31
 
0.1%
) 31
 
0.1%
/ 27
 
0.1%
. 13
 
< 0.1%
2 10
 
< 0.1%
9 9
 
< 0.1%
0 7
 
< 0.1%
Other values (21) 38
 
0.1%
Hangul
ValueCountFrequency (%)
13335
 
11.3%
11458
 
9.7%
10469
 
8.9%
7677
 
6.5%
3898
 
3.3%
3786
 
3.2%
3704
 
3.1%
3689
 
3.1%
2657
 
2.2%
2651
 
2.2%
Other values (372) 54806
46.4%
None
ValueCountFrequency (%)
· 48
100.0%
Compat Jamo
ValueCountFrequency (%)
28
100.0%

Interactions

[Figure: interactions plot, image not preserved]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 기업명 | 전화번호 | 주요상품 | 주소 | 산업분류
10271 | 10272 | 에스에스금속 | 031-356-1882 | 데이터 미집계 | 경기 화성시 마도면 청원로274번길 139 (청원리) | 기타 1차 비철금속 제조업
19208 | 19209 | 피케이정공 | 031-366-7242 | 파이프배관자재 | 경기 화성시 마도면 석교남길 82-7 (석교리) | 기타 가공 공작기계 제조업
15126 | 15127 | 제이디시스템 | 데이터 미집계 | 방송장치 제조 | 경기 화성시 동탄면 금곡로163번길 18, 다동 (금곡리) | 방송장비 제조업
4520 | 4521 | 로뎀바스 | 데이터 미집계 | 데이터 미집계 | 경기 화성시 북삼미로 259-16 (능동) | 기타 건축용 나무제품 제조업
18530 | 18531 | 티에이치텍 | 031-236-3071 | 시험용설비 | 경기 화성시 진안북길 83-1 | 물질 검사, 측정 및 분석기구 제조업
910 | 911 | 광명이엔지 | 031-357-4606 | 케이블 트레이, 덕트, 찬넬 외 | 경기 화성시 남양읍 장덕북길 110-59 (장덕리) | 기타 절연선 및 케이블 제조업
10248 | 10249 | 에스앤에스 코리아 | 데이터 미집계 | 데이터 미집계 | 경기 화성시 동탄기흥로 614, 705호 (영천동,더퍼스트타워투) | 기타 무선 통신장비 제조업
5503 | 5504 | 바오 | 데이터 미집계 | 데이터 미집계 | 경기 화성시 향남읍 만년로151번길 74-12 (증거리) | 그 외 기타 전자부품 제조업
12123 | 12124 | 영일이엔지 | 데이터 미집계 | 데이터 미집계 | 경기 화성시 남양읍 신남로 366-22 (신남리) | 산업용 냉장 및 냉동 장비 제조업
10558 | 10559 | 에스제이테크놀로지 | 데이터 미집계 | 데이터 미집계 | 경기 화성시 양감면 안요골길82번길 56 (사창리) | 합성수지 및 기타 플라스틱 물질 제조업

(index) | 연번 | 기업명 | 전화번호 | 주요상품 | 주소 | 산업분류
19862 | 19863 | 한스틸이엔지 | 데이터 미집계 | 철강재 | 경기 화성시 마도면 청원로 139-23 (청원리) | 그 외 기타 1차 철강 제조업
17612 | 17613 | 코스텍 | 055-275-4178 | 자동차 부품 | 경기 화성시 장안면 장안공단5길 9 (수촌리) | 자동차 차체용 신품 부품 제조업
4062 | 4063 | 디아테크 | 031-337-5789 | 네비게이션 등 | 경기 화성시 동탄대로21길 10, 1704호 (영천동,더퍼스트타워) | 레이더, 항행용 무선기기 및 측량기구 제조업
15033 | 15034 | 정진테크 | 031-357-0084 | 금형 | 경기 화성시 송산면 제부로 1169 (육일리) | 주형 및 금형 제조업
16441 | 16442 | 지투엔 | 031-8059-6240 | PCR PAD, 연마패드 등 | 세종 부강면 산수2길 11 (산수리) | 연마재 제조업
9176 | 9177 | 씨에스테크노 | 데이터 미집계 | 데이터 미집계 | 경기 화성시 정남면 정남동로 33-22 (덕절리) | 에너지 저장장치 제조업
4584 | 4585 | 루미플렉스 | 데이터 미집계 | 데이터 미집계 | 경기 화성시 향남읍 토성로 362-17 (요리) | 전시 및 광고용 조명장치 제조업
5566 | 5567 | 반석 | 데이터 미집계 | 데이터 미집계 | 경기 화성시 우정읍 버들로 985-1 (주곡리) | 금속 가구 제조업
17880 | 17881 | 태경케미칼 | 031-297-6937 | 폐플라스틱 | 경기 화성시 봉담읍 복만터길 86-1 (당하리) | 합성수지 및 기타 플라스틱 물질 제조업
3052 | 3053 | 대한프라스틱 | 031-351-8064 | 데이터 미집계 | 경기 화성시 장안면 포승장안로 1208, 나동 (독정리) | 그 외 기타 플라스틱 제품 제조업