Overview

Dataset statistics

Number of variables8
Number of observations2417
Missing cells4
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory153.6 KiB
Average record size in memory65.1 B

Variable types

Numeric1
Text7

Dataset

Description경상남도 양산시에 등록된 공장등록 현황으로 회사명, 대표자명, 도로명주소, 전화번호, 생산품 주원자재 등을 현황을 확인할 수 있습니다.
Author경상남도 양산시
URLhttps://www.data.go.kr/data/3065868/fileData.do

Alerts

순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 08:58:24.440480
Analysis finished2023-12-12 08:58:25.733492
Duration1.29 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct2417
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1209
Minimum1
Maximum2417
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.4 KiB
2023-12-12T17:58:25.812506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile121.8
Q1605
median1209
Q31813
95-th percentile2296.2
Maximum2417
Range2416
Interquartile range (IQR)1208

Descriptive statistics

Standard deviation697.87212
Coefficient of variation (CV)0.57723087
Kurtosis-1.2
Mean1209
Median Absolute Deviation (MAD)604
Skewness0
Sum2922153
Variance487025.5
MonotonicityStrictly increasing
2023-12-12T17:58:25.990862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1607 1
 
< 0.1%
1609 1
 
< 0.1%
1610 1
 
< 0.1%
1611 1
 
< 0.1%
1612 1
 
< 0.1%
1613 1
 
< 0.1%
1614 1
 
< 0.1%
1615 1
 
< 0.1%
1616 1
 
< 0.1%
Other values (2407) 2407
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2417 1
< 0.1%
2416 1
< 0.1%
2415 1
< 0.1%
2414 1
< 0.1%
2413 1
< 0.1%
2412 1
< 0.1%
2411 1
< 0.1%
2410 1
< 0.1%
2409 1
< 0.1%
2408 1
< 0.1%
Distinct2278
Distinct (%)94.2%
Missing0
Missing (%)0.0%
Memory size19.0 KiB
2023-12-12T17:58:26.309136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length6.8063715
Min length2

Characters and Unicode

Total characters16451
Distinct characters504
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2152 ?
Unique (%)89.0%

Sample

1st row(유)거명
2nd row(유)대상공업
3rd row(유)대상공업 제2공장
4th row(유)대성석재
5th row(유)보금
ValueCountFrequency (%)
주식회사 106
 
4.0%
제2공장 22
 
0.8%
양산공장 20
 
0.8%
2공장 8
 
0.3%
양산지점 7
 
0.3%
주)성우하이텍 7
 
0.3%
양산 5
 
0.2%
4
 
0.2%
주)블루인더스 4
 
0.2%
제3공장 4
 
0.2%
Other values (2276) 2467
93.0%
2023-12-12T17:58:26.711319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1492
 
9.1%
) 1347
 
8.2%
( 1343
 
8.2%
448
 
2.7%
387
 
2.4%
387
 
2.4%
361
 
2.2%
255
 
1.6%
244
 
1.5%
240
 
1.5%
Other values (494) 9947
60.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13196
80.2%
Close Punctuation 1347
 
8.2%
Open Punctuation 1343
 
8.2%
Space Separator 255
 
1.6%
Uppercase Letter 204
 
1.2%
Decimal Number 63
 
0.4%
Other Punctuation 24
 
0.1%
Lowercase Letter 17
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1492
 
11.3%
448
 
3.4%
387
 
2.9%
387
 
2.9%
361
 
2.7%
244
 
1.8%
240
 
1.8%
230
 
1.7%
226
 
1.7%
225
 
1.7%
Other values (445) 8956
67.9%
Uppercase Letter
ValueCountFrequency (%)
S 21
 
10.3%
E 20
 
9.8%
C 19
 
9.3%
N 18
 
8.8%
G 14
 
6.9%
T 13
 
6.4%
R 12
 
5.9%
M 12
 
5.9%
F 8
 
3.9%
A 8
 
3.9%
Other values (14) 59
28.9%
Lowercase Letter
ValueCountFrequency (%)
e 4
23.5%
a 2
11.8%
n 2
11.8%
s 2
11.8%
l 1
 
5.9%
t 1
 
5.9%
y 1
 
5.9%
r 1
 
5.9%
o 1
 
5.9%
x 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
2 47
74.6%
1 7
 
11.1%
3 7
 
11.1%
6 1
 
1.6%
4 1
 
1.6%
Other Punctuation
ValueCountFrequency (%)
. 13
54.2%
& 6
25.0%
, 2
 
8.3%
/ 2
 
8.3%
1
 
4.2%
Close Punctuation
ValueCountFrequency (%)
) 1347
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1343
100.0%
Space Separator
ValueCountFrequency (%)
255
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13196
80.2%
Common 3034
 
18.4%
Latin 221
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1492
 
11.3%
448
 
3.4%
387
 
2.9%
387
 
2.9%
361
 
2.7%
244
 
1.8%
240
 
1.8%
230
 
1.7%
226
 
1.7%
225
 
1.7%
Other values (445) 8956
67.9%
Latin
ValueCountFrequency (%)
S 21
 
9.5%
E 20
 
9.0%
C 19
 
8.6%
N 18
 
8.1%
G 14
 
6.3%
T 13
 
5.9%
R 12
 
5.4%
M 12
 
5.4%
F 8
 
3.6%
A 8
 
3.6%
Other values (25) 76
34.4%
Common
ValueCountFrequency (%)
) 1347
44.4%
( 1343
44.3%
255
 
8.4%
2 47
 
1.5%
. 13
 
0.4%
1 7
 
0.2%
3 7
 
0.2%
& 6
 
0.2%
, 2
 
0.1%
/ 2
 
0.1%
Other values (4) 5
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13196
80.2%
ASCII 3254
 
19.8%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1492
 
11.3%
448
 
3.4%
387
 
2.9%
387
 
2.9%
361
 
2.7%
244
 
1.8%
240
 
1.8%
230
 
1.7%
226
 
1.7%
225
 
1.7%
Other values (445) 8956
67.9%
ASCII
ValueCountFrequency (%)
) 1347
41.4%
( 1343
41.3%
255
 
7.8%
2 47
 
1.4%
S 21
 
0.6%
E 20
 
0.6%
C 19
 
0.6%
N 18
 
0.6%
G 14
 
0.4%
T 13
 
0.4%
Other values (38) 157
 
4.8%
None
ValueCountFrequency (%)
1
100.0%
Distinct2060
Distinct (%)85.2%
Missing0
Missing (%)0.0%
Memory size19.0 KiB
2023-12-12T17:58:27.085790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length3
Mean length3.2838229
Min length2

Characters and Unicode

Total characters7937
Distinct characters259
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1778 ?
Unique (%)73.6%

Sample

1st row김익
2nd row지말순
3rd row지말순
4th row김일봉
5th row김도형
ValueCountFrequency (%)
12
 
0.5%
1명 9
 
0.4%
김화석 6
 
0.2%
1인 6
 
0.2%
김상헌 5
 
0.2%
이명근 5
 
0.2%
김진곤 5
 
0.2%
정성원 5
 
0.2%
김성훈 4
 
0.2%
박수곤 4
 
0.2%
Other values (2097) 2471
97.6%
2023-12-12T17:58:27.738926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
530
 
6.7%
346
 
4.4%
254
 
3.2%
240
 
3.0%
220
 
2.8%
155
 
2.0%
143
 
1.8%
137
 
1.7%
131
 
1.7%
128
 
1.6%
Other values (249) 5653
71.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7654
96.4%
Space Separator 137
 
1.7%
Other Punctuation 113
 
1.4%
Decimal Number 25
 
0.3%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
530
 
6.9%
346
 
4.5%
254
 
3.3%
240
 
3.1%
220
 
2.9%
155
 
2.0%
143
 
1.9%
131
 
1.7%
128
 
1.7%
122
 
1.6%
Other values (243) 5385
70.4%
Decimal Number
ValueCountFrequency (%)
1 24
96.0%
4 1
 
4.0%
Space Separator
ValueCountFrequency (%)
137
100.0%
Other Punctuation
ValueCountFrequency (%)
, 113
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7654
96.4%
Common 283
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
530
 
6.9%
346
 
4.5%
254
 
3.3%
240
 
3.1%
220
 
2.9%
155
 
2.0%
143
 
1.9%
131
 
1.7%
128
 
1.7%
122
 
1.6%
Other values (243) 5385
70.4%
Common
ValueCountFrequency (%)
137
48.4%
, 113
39.9%
1 24
 
8.5%
( 4
 
1.4%
) 4
 
1.4%
4 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7654
96.4%
ASCII 283
 
3.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
530
 
6.9%
346
 
4.5%
254
 
3.3%
240
 
3.1%
220
 
2.9%
155
 
2.0%
143
 
1.9%
131
 
1.7%
128
 
1.7%
122
 
1.6%
Other values (243) 5385
70.4%
ASCII
ValueCountFrequency (%)
137
48.4%
, 113
39.9%
1 24
 
8.5%
( 4
 
1.4%
) 4
 
1.4%
4 1
 
0.4%
Distinct2203
Distinct (%)91.1%
Missing0
Missing (%)0.0%
Memory size19.0 KiB
2023-12-12T17:58:28.129095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length58
Median length51
Mean length26.709144
Min length7

Characters and Unicode

Total characters64556
Distinct characters400
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2025 ?
Unique (%)83.8%

Sample

1st row경상남도 양산시 유산공단4길 47-4 (유산동)
2nd row경상남도 양산시 산막동 147번지 외 7필지 외 2필지
3rd row경상남도 양산시 산막동 260번지 외 4필지
4th row경상남도 양산시 어실로 549 (어곡동, 대성석재)
5th row경상남도 양산시 소주공단5길 3 (주남동, 보금)
ValueCountFrequency (%)
경상남도 2405
 
18.0%
양산시 2405
 
18.0%
상북면 499
 
3.7%
어곡동 267
 
2.0%
250
 
1.9%
산막동 226
 
1.7%
북정동 170
 
1.3%
유산동 167
 
1.2%
소주동 167
 
1.2%
주남동 152
 
1.1%
Other values (2053) 6676
49.9%
2023-12-12T17:58:28.818498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10967
 
17.0%
3899
 
6.0%
2963
 
4.6%
2887
 
4.5%
2510
 
3.9%
2422
 
3.8%
2415
 
3.7%
2414
 
3.7%
( 2101
 
3.3%
) 2101
 
3.3%
Other values (390) 29877
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39147
60.6%
Space Separator 10967
 
17.0%
Decimal Number 8692
 
13.5%
Open Punctuation 2101
 
3.3%
Close Punctuation 2101
 
3.3%
Dash Punctuation 716
 
1.1%
Other Punctuation 658
 
1.0%
Uppercase Letter 166
 
0.3%
Lowercase Letter 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3899
 
10.0%
2963
 
7.6%
2887
 
7.4%
2510
 
6.4%
2422
 
6.2%
2415
 
6.2%
2414
 
6.2%
1964
 
5.0%
1515
 
3.9%
1164
 
3.0%
Other values (345) 14994
38.3%
Uppercase Letter
ValueCountFrequency (%)
B 20
12.0%
C 18
10.8%
S 16
9.6%
E 16
9.6%
T 15
9.0%
G 12
 
7.2%
L 12
 
7.2%
K 12
 
7.2%
M 7
 
4.2%
F 7
 
4.2%
Other values (11) 31
18.7%
Decimal Number
ValueCountFrequency (%)
1 2055
23.6%
2 1304
15.0%
3 1113
12.8%
4 854
9.8%
6 664
 
7.6%
5 653
 
7.5%
0 533
 
6.1%
7 530
 
6.1%
8 514
 
5.9%
9 472
 
5.4%
Lowercase Letter
ValueCountFrequency (%)
n 2
25.0%
d 1
12.5%
u 1
12.5%
t 1
12.5%
e 1
12.5%
s 1
12.5%
f 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 641
97.4%
& 9
 
1.4%
. 8
 
1.2%
Space Separator
ValueCountFrequency (%)
10967
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2101
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 716
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39147
60.6%
Common 25235
39.1%
Latin 174
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3899
 
10.0%
2963
 
7.6%
2887
 
7.4%
2510
 
6.4%
2422
 
6.2%
2415
 
6.2%
2414
 
6.2%
1964
 
5.0%
1515
 
3.9%
1164
 
3.0%
Other values (345) 14994
38.3%
Latin
ValueCountFrequency (%)
B 20
11.5%
C 18
10.3%
S 16
9.2%
E 16
9.2%
T 15
 
8.6%
G 12
 
6.9%
L 12
 
6.9%
K 12
 
6.9%
M 7
 
4.0%
F 7
 
4.0%
Other values (18) 39
22.4%
Common
ValueCountFrequency (%)
10967
43.5%
( 2101
 
8.3%
) 2101
 
8.3%
1 2055
 
8.1%
2 1304
 
5.2%
3 1113
 
4.4%
4 854
 
3.4%
- 716
 
2.8%
6 664
 
2.6%
5 653
 
2.6%
Other values (7) 2707
 
10.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39147
60.6%
ASCII 25409
39.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
10967
43.2%
( 2101
 
8.3%
) 2101
 
8.3%
1 2055
 
8.1%
2 1304
 
5.1%
3 1113
 
4.4%
4 854
 
3.4%
- 716
 
2.8%
6 664
 
2.6%
5 653
 
2.6%
Other values (35) 2881
 
11.3%
Hangul
ValueCountFrequency (%)
3899
 
10.0%
2963
 
7.6%
2887
 
7.4%
2510
 
6.4%
2422
 
6.2%
2415
 
6.2%
2414
 
6.2%
1964
 
5.0%
1515
 
3.9%
1164
 
3.0%
Other values (345) 14994
38.3%
Distinct1910
Distinct (%)79.0%
Missing0
Missing (%)0.0%
Memory size19.0 KiB
2023-12-12T17:58:29.125414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.368639
Min length7

Characters and Unicode

Total characters27478
Distinct characters21
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1737 ?
Unique (%)71.9%

Sample

1st row055-383-3233
2nd row055-388-3319
3rd row055-388-3319
4th row055-388-3456
5th row055-386-0622
ValueCountFrequency (%)
데이터 310
 
11.4%
미집게 310
 
11.4%
055-385-3671 6
 
0.2%
055-385-5805 4
 
0.1%
055-386-2224 3
 
0.1%
055-381-8309 3
 
0.1%
051-293-7781 3
 
0.1%
031-372-0404 3
 
0.1%
055-366-9991 3
 
0.1%
055-382-6916 3
 
0.1%
Other values (1901) 2079
76.2%
2023-12-12T17:58:29.614661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 5218
19.0%
- 4197
15.3%
0 3642
13.3%
3 2876
10.5%
8 1844
 
6.7%
1 1553
 
5.7%
7 1550
 
5.6%
6 1459
 
5.3%
2 1198
 
4.4%
4 1034
 
3.8%
Other values (11) 2907
10.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 21105
76.8%
Dash Punctuation 4197
 
15.3%
Other Letter 1860
 
6.8%
Space Separator 310
 
1.1%
Uppercase Letter 6
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 5218
24.7%
0 3642
17.3%
3 2876
13.6%
8 1844
 
8.7%
1 1553
 
7.4%
7 1550
 
7.3%
6 1459
 
6.9%
2 1198
 
5.7%
4 1034
 
4.9%
9 731
 
3.5%
Other Letter
ValueCountFrequency (%)
310
16.7%
310
16.7%
310
16.7%
310
16.7%
310
16.7%
310
16.7%
Uppercase Letter
ValueCountFrequency (%)
A 2
33.3%
R 2
33.3%
S 2
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 4197
100.0%
Space Separator
ValueCountFrequency (%)
310
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25612
93.2%
Hangul 1860
 
6.8%
Latin 6
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
5 5218
20.4%
- 4197
16.4%
0 3642
14.2%
3 2876
11.2%
8 1844
 
7.2%
1 1553
 
6.1%
7 1550
 
6.1%
6 1459
 
5.7%
2 1198
 
4.7%
4 1034
 
4.0%
Other values (2) 1041
 
4.1%
Hangul
ValueCountFrequency (%)
310
16.7%
310
16.7%
310
16.7%
310
16.7%
310
16.7%
310
16.7%
Latin
ValueCountFrequency (%)
A 2
33.3%
R 2
33.3%
S 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25618
93.2%
Hangul 1860
 
6.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 5218
20.4%
- 4197
16.4%
0 3642
14.2%
3 2876
11.2%
8 1844
 
7.2%
1 1553
 
6.1%
7 1550
 
6.1%
6 1459
 
5.7%
2 1198
 
4.7%
4 1034
 
4.0%
Other values (5) 1047
 
4.1%
Hangul
ValueCountFrequency (%)
310
16.7%
310
16.7%
310
16.7%
310
16.7%
310
16.7%
310
16.7%
Distinct2037
Distinct (%)84.4%
Missing4
Missing (%)0.2%
Memory size19.0 KiB
2023-12-12T17:58:30.043722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length48
Mean length8.6842105
Min length1

Characters and Unicode

Total characters20955
Distinct characters635
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1899 ?
Unique (%)78.7%

Sample

1st row끈, 망
2nd row밥솥부품
3rd row은나노테프론, 자동차부품 도장
4th row석제품(묘석)
5th row자동차 범퍼
ValueCountFrequency (%)
123
 
2.8%
104
 
2.4%
자동차부품 98
 
2.3%
부품 98
 
2.3%
자동차 89
 
2.1%
금형 43
 
1.0%
산업기계 33
 
0.8%
마스크 31
 
0.7%
자동차용 30
 
0.7%
기계부품 28
 
0.6%
Other values (2548) 3646
84.3%
2023-12-12T17:58:30.612041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1951
 
9.3%
, 875
 
4.2%
730
 
3.5%
658
 
3.1%
534
 
2.5%
526
 
2.5%
425
 
2.0%
365
 
1.7%
365
 
1.7%
333
 
1.6%
Other values (625) 14193
67.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 16491
78.7%
Space Separator 1951
 
9.3%
Other Punctuation 925
 
4.4%
Uppercase Letter 919
 
4.4%
Lowercase Letter 312
 
1.5%
Close Punctuation 149
 
0.7%
Open Punctuation 148
 
0.7%
Decimal Number 41
 
0.2%
Dash Punctuation 19
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
730
 
4.4%
658
 
4.0%
534
 
3.2%
526
 
3.2%
425
 
2.6%
365
 
2.2%
365
 
2.2%
333
 
2.0%
324
 
2.0%
249
 
1.5%
Other values (559) 11982
72.7%
Uppercase Letter
ValueCountFrequency (%)
P 99
 
10.8%
E 95
 
10.3%
C 85
 
9.2%
S 63
 
6.9%
A 61
 
6.6%
L 57
 
6.2%
T 53
 
5.8%
D 49
 
5.3%
O 47
 
5.1%
R 47
 
5.1%
Other values (16) 263
28.6%
Lowercase Letter
ValueCountFrequency (%)
e 47
15.1%
l 28
 
9.0%
s 24
 
7.7%
i 23
 
7.4%
r 23
 
7.4%
a 22
 
7.1%
t 21
 
6.7%
o 20
 
6.4%
c 18
 
5.8%
p 15
 
4.8%
Other values (12) 71
22.8%
Decimal Number
ValueCountFrequency (%)
0 12
29.3%
1 10
24.4%
2 7
17.1%
6 4
 
9.8%
3 3
 
7.3%
8 2
 
4.9%
7 2
 
4.9%
5 1
 
2.4%
Other Punctuation
ValueCountFrequency (%)
, 875
94.6%
. 30
 
3.2%
/ 12
 
1.3%
' 5
 
0.5%
· 2
 
0.2%
& 1
 
0.1%
Space Separator
ValueCountFrequency (%)
1951
100.0%
Close Punctuation
ValueCountFrequency (%)
) 149
100.0%
Open Punctuation
ValueCountFrequency (%)
( 148
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 16491
78.7%
Common 3233
 
15.4%
Latin 1231
 
5.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
730
 
4.4%
658
 
4.0%
534
 
3.2%
526
 
3.2%
425
 
2.6%
365
 
2.2%
365
 
2.2%
333
 
2.0%
324
 
2.0%
249
 
1.5%
Other values (559) 11982
72.7%
Latin
ValueCountFrequency (%)
P 99
 
8.0%
E 95
 
7.7%
C 85
 
6.9%
S 63
 
5.1%
A 61
 
5.0%
L 57
 
4.6%
T 53
 
4.3%
D 49
 
4.0%
e 47
 
3.8%
O 47
 
3.8%
Other values (38) 575
46.7%
Common
ValueCountFrequency (%)
1951
60.3%
, 875
27.1%
) 149
 
4.6%
( 148
 
4.6%
. 30
 
0.9%
- 19
 
0.6%
/ 12
 
0.4%
0 12
 
0.4%
1 10
 
0.3%
2 7
 
0.2%
Other values (8) 20
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 16491
78.7%
ASCII 4462
 
21.3%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1951
43.7%
, 875
19.6%
) 149
 
3.3%
( 148
 
3.3%
P 99
 
2.2%
E 95
 
2.1%
C 85
 
1.9%
S 63
 
1.4%
A 61
 
1.4%
L 57
 
1.3%
Other values (55) 879
19.7%
Hangul
ValueCountFrequency (%)
730
 
4.4%
658
 
4.0%
534
 
3.2%
526
 
3.2%
425
 
2.6%
365
 
2.2%
365
 
2.2%
333
 
2.0%
324
 
2.0%
249
 
1.5%
Other values (559) 11982
72.7%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct344
Distinct (%)14.2%
Missing0
Missing (%)0.0%
Memory size19.0 KiB
2023-12-12T17:58:31.008346image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.0008275
Min length5

Characters and Unicode

Total characters12087
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)3.7%

Sample

1st row13921
2nd row25923
3rd row25923
4th row23919
5th row30399
ValueCountFrequency (%)
30399 145
 
6.0%
25924 72
 
3.0%
25113 62
 
2.6%
29294 58
 
2.4%
29299 55
 
2.3%
25999 50
 
2.1%
31114 49
 
2.0%
28123 41
 
1.7%
25929 38
 
1.6%
22191 36
 
1.5%
Other values (335) 1812
74.9%
2023-12-12T17:58:31.600572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 3580
29.6%
1 2324
19.2%
9 2197
18.2%
3 1316
 
10.9%
0 899
 
7.4%
4 631
 
5.2%
5 567
 
4.7%
8 200
 
1.7%
7 183
 
1.5%
6 183
 
1.5%
Other values (7) 7
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12080
99.9%
Other Letter 6
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 3580
29.6%
1 2324
19.2%
9 2197
18.2%
3 1316
 
10.9%
0 899
 
7.4%
4 631
 
5.2%
5 567
 
4.7%
8 200
 
1.7%
7 183
 
1.5%
6 183
 
1.5%
Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12081
> 99.9%
Hangul 6
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
2 3580
29.6%
1 2324
19.2%
9 2197
18.2%
3 1316
 
10.9%
0 899
 
7.4%
4 631
 
5.2%
5 567
 
4.7%
8 200
 
1.7%
7 183
 
1.5%
6 183
 
1.5%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12081
> 99.9%
Hangul 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 3580
29.6%
1 2324
19.2%
9 2197
18.2%
3 1316
 
10.9%
0 899
 
7.4%
4 631
 
5.2%
5 567
 
4.7%
8 200
 
1.7%
7 183
 
1.5%
6 183
 
1.5%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Distinct690
Distinct (%)28.5%
Missing0
Missing (%)0.0%
Memory size19.0 KiB
2023-12-12T17:58:32.059002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length26
Mean length17.687216
Min length3

Characters and Unicode

Total characters42750
Distinct characters332
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique370 ?
Unique (%)15.3%

Sample

1st row끈 및 로프 제조업 외 1 종
2nd row도장 및 기타 피막처리업
3rd row도장 및 기타 피막처리업
4th row기타 석제품 제조업
5th row그 외 자동차용 신품 부품 제조업 외 3 종
ValueCountFrequency (%)
제조업 2083
 
14.8%
1481
 
10.6%
986
 
7.0%
955
 
6.8%
기타 705
 
5.0%
1 528
 
3.8%
495
 
3.5%
금속 256
 
1.8%
신품 206
 
1.5%
자동차용 197
 
1.4%
Other values (621) 6141
43.8%
2023-12-12T17:58:32.708277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11616
27.2%
2779
 
6.5%
2483
 
5.8%
2429
 
5.7%
1505
 
3.5%
1402
 
3.3%
1044
 
2.4%
1014
 
2.4%
955
 
2.2%
773
 
1.8%
Other values (322) 16750
39.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29788
69.7%
Space Separator 11616
 
27.2%
Decimal Number 1056
 
2.5%
Other Punctuation 250
 
0.6%
Open Punctuation 20
 
< 0.1%
Close Punctuation 20
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2779
 
9.3%
2483
 
8.3%
2429
 
8.2%
1505
 
5.1%
1402
 
4.7%
1044
 
3.5%
1014
 
3.4%
955
 
3.2%
773
 
2.6%
719
 
2.4%
Other values (307) 14685
49.3%
Decimal Number
ValueCountFrequency (%)
1 587
55.6%
2 170
 
16.1%
3 157
 
14.9%
4 55
 
5.2%
5 39
 
3.7%
6 16
 
1.5%
7 12
 
1.1%
0 7
 
0.7%
9 7
 
0.7%
8 6
 
0.6%
Other Punctuation
ValueCountFrequency (%)
, 245
98.0%
. 5
 
2.0%
Space Separator
ValueCountFrequency (%)
11616
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29788
69.7%
Common 12962
30.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2779
 
9.3%
2483
 
8.3%
2429
 
8.2%
1505
 
5.1%
1402
 
4.7%
1044
 
3.5%
1014
 
3.4%
955
 
3.2%
773
 
2.6%
719
 
2.4%
Other values (307) 14685
49.3%
Common
ValueCountFrequency (%)
11616
89.6%
1 587
 
4.5%
, 245
 
1.9%
2 170
 
1.3%
3 157
 
1.2%
4 55
 
0.4%
5 39
 
0.3%
( 20
 
0.2%
) 20
 
0.2%
6 16
 
0.1%
Other values (5) 37
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29784
69.7%
ASCII 12962
30.3%
Compat Jamo 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11616
89.6%
1 587
 
4.5%
, 245
 
1.9%
2 170
 
1.3%
3 157
 
1.2%
4 55
 
0.4%
5 39
 
0.3%
( 20
 
0.2%
) 20
 
0.2%
6 16
 
0.1%
Other values (5) 37
 
0.3%
Hangul
ValueCountFrequency (%)
2779
 
9.3%
2483
 
8.3%
2429
 
8.2%
1505
 
5.1%
1402
 
4.7%
1044
 
3.5%
1014
 
3.4%
955
 
3.2%
773
 
2.6%
719
 
2.4%
Other values (306) 14681
49.3%
Compat Jamo
ValueCountFrequency (%)
4
100.0%

Interactions

2023-12-12T17:58:25.411597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T17:58:25.534897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:58:25.680369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번회사명대표자명공장대표주소(도로명)전화번호생산품대표업종번호업종명
01(유)거명김익경상남도 양산시 유산공단4길 47-4 (유산동)055-383-3233끈, 망13921끈 및 로프 제조업 외 1 종
12(유)대상공업지말순경상남도 양산시 산막동 147번지 외 7필지 외 2필지055-388-3319밥솥부품25923도장 및 기타 피막처리업
23(유)대상공업 제2공장지말순경상남도 양산시 산막동 260번지 외 4필지055-388-3319은나노테프론, 자동차부품 도장25923도장 및 기타 피막처리업
34(유)대성석재김일봉경상남도 양산시 어실로 549 (어곡동, 대성석재)055-388-3456석제품(묘석)23919기타 석제품 제조업
45(유)보금김도형경상남도 양산시 소주공단5길 3 (주남동, 보금)055-386-0622자동차 범퍼30399그 외 자동차용 신품 부품 제조업 외 3 종
56(유)신원김태형경상남도 양산시 소주공단5길 13 (주남동)055-363-5344자동차범퍼 도장, 부품30399그 외 자동차용 신품 부품 제조업 외 3 종
67(유)진안화학김용안경상남도 양산시 소주공단2길 52 (주남동, 진안화학)055-386-2391플라스틱제품22241운송장비 조립용 플라스틱제품 제조업 외 1 종
78(유)창성정밀김도훈경상남도 양산시 소주공단3길 22 (소주동)055-364-2311브레이크27215기기용 자동측정 및 제어장치 제조업
89(유)파로마가구허현숙경상남도 양산시 평산동 66-4번지055-382-1100목재가구16299그 외 기타 나무제품 제조업
910(유한)세흥김정규경상남도 양산시 유산공단4길 47-4 (유산동, (주)대명)055-346-3232해태망,끈,기타사류13922어망 및 기타 끈 가공품 제조업
순번회사명대표자명공장대표주소(도로명)전화번호생산품대표업종번호업종명
24072408효림정밀이정희경상남도 양산시 상북면 소토2길 29-29데이터 미집게치구 및 금형29294주형 및 금형 제조업
24082409효신산업이상훈경상남도 양산시 충렬로 69 (교동)055-387-8814도어패킹22191고무패킹류 제조업 외 1 종
24092410효은전기 주식회사장은희경상남도 양산시 상북면 율리길 34055-374-1131전기자동제어반 등28123배전반 및 전기 자동제어반 제조업
24102411휘푸드권형근경상남도 양산시 산막공단남12길 142 (북정동)055-785-4900냉동가공육10122육류 포장육 및 냉동육 가공업 (가금류 제외)
24112412휴먼베이스 오토메이션이대유경상남도 양산시 명곡로 321 (명곡동)데이터 미집게계장제어장치,프로세스제어반28123배전반 및 전기 자동제어반 제조업 외 2 종
24122413흥아특수고무이용호경상남도 양산시 유산공단9길 8 (유산동, 금영직물공업사)055-381-6585특수고무22191고무패킹류 제조업 외 1 종
24132414흥욱상사김동흥경상남도 양산시 하북면 백록리 1305번지055-374-9863부직포,캐미시트13992부직포 및 펠트 제조업
24142415희망나라박호규경상남도 양산시 매곡4길 8 (매곡동)055-365-8055복사용지17901문구용 종이제품 제조업 외 1 종
24152416희원산기김종원경상남도 양산시 소주로 119-26 (소주동)055-362-6147열풍건조기(산업용), 컨베이어장치, 집진기29150산업용 오븐, 노 및 노용 버너 제조업 외 3 종
24162417희창섬유최희택경상남도 양산시 상북면 석계산단로 181데이터 미집게섬유원단13109기타 방적업