Overview

Dataset statistics

Number of variables7
Number of observations2772
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)0.1%
Total size in memory151.7 KiB
Average record size in memory56.0 B

Variable types

Text7

Dataset

Description경상북도 경산시 제조업 등록현황으로 회사명, 공장주소, 생산품, 업종명, 전화번호, 팩스번호 등의 자료를 제공합니다.
Author경상북도 경산시
URLhttps://www.data.go.kr/data/15034960/fileData.do

Alerts

Dataset has 2 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2024-03-15 00:58:44.072422
Analysis finished2024-03-15 00:58:46.544648
Duration2.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct2612
Distinct (%)94.2%
Missing0
Missing (%)0.0%
Memory size21.8 KiB
2024-03-15T09:58:47.241002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length19
Mean length6.3867244
Min length1

Characters and Unicode

Total characters17704
Distinct characters575
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2476 ?
Unique (%)89.3%

Sample

1st row (주) 레일온
2nd row(유)농업회사법인 삼미식품
3rd row(유)협신산자
4th row(주) 경도철강 가공센터
5th row(주) 썬로드
ValueCountFrequency (%)
주식회사 171
 
5.5%
경산지점 18
 
0.6%
경산공장 18
 
0.6%
농업회사법인 17
 
0.5%
2공장 10
 
0.3%
진성산업 5
 
0.2%
주)건화이엔지 5
 
0.2%
5
 
0.2%
대성테크 4
 
0.1%
주)신라공업 4
 
0.1%
Other values (2661) 2856
91.7%
2024-03-15T09:58:48.653687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1321
 
7.5%
( 1132
 
6.4%
) 1131
 
6.4%
520
 
2.9%
513
 
2.9%
405
 
2.3%
383
 
2.2%
372
 
2.1%
348
 
2.0%
288
 
1.6%
Other values (565) 11291
63.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14645
82.7%
Open Punctuation 1132
 
6.4%
Close Punctuation 1131
 
6.4%
Space Separator 348
 
2.0%
Uppercase Letter 325
 
1.8%
Decimal Number 56
 
0.3%
Other Punctuation 43
 
0.2%
Lowercase Letter 18
 
0.1%
Dash Punctuation 5
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1321
 
9.0%
520
 
3.6%
513
 
3.5%
405
 
2.8%
383
 
2.6%
372
 
2.5%
288
 
2.0%
276
 
1.9%
238
 
1.6%
234
 
1.6%
Other values (513) 10095
68.9%
Uppercase Letter
ValueCountFrequency (%)
E 38
 
11.7%
S 36
 
11.1%
N 31
 
9.5%
C 30
 
9.2%
T 19
 
5.8%
P 16
 
4.9%
G 16
 
4.9%
D 14
 
4.3%
A 14
 
4.3%
O 14
 
4.3%
Other values (14) 97
29.8%
Lowercase Letter
ValueCountFrequency (%)
c 3
16.7%
e 3
16.7%
o 2
11.1%
t 2
11.1%
x 1
 
5.6%
i 1
 
5.6%
l 1
 
5.6%
n 1
 
5.6%
r 1
 
5.6%
a 1
 
5.6%
Other values (2) 2
11.1%
Decimal Number
ValueCountFrequency (%)
2 40
71.4%
1 7
 
12.5%
3 5
 
8.9%
7 1
 
1.8%
0 1
 
1.8%
4 1
 
1.8%
6 1
 
1.8%
Other Punctuation
ValueCountFrequency (%)
. 31
72.1%
& 10
 
23.3%
, 1
 
2.3%
/ 1
 
2.3%
Open Punctuation
ValueCountFrequency (%)
( 1132
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1131
100.0%
Space Separator
ValueCountFrequency (%)
348
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14646
82.7%
Common 2715
 
15.3%
Latin 343
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1321
 
9.0%
520
 
3.6%
513
 
3.5%
405
 
2.8%
383
 
2.6%
372
 
2.5%
288
 
2.0%
276
 
1.9%
238
 
1.6%
234
 
1.6%
Other values (514) 10096
68.9%
Latin
ValueCountFrequency (%)
E 38
 
11.1%
S 36
 
10.5%
N 31
 
9.0%
C 30
 
8.7%
T 19
 
5.5%
P 16
 
4.7%
G 16
 
4.7%
D 14
 
4.1%
A 14
 
4.1%
O 14
 
4.1%
Other values (26) 115
33.5%
Common
ValueCountFrequency (%)
( 1132
41.7%
) 1131
41.7%
348
 
12.8%
2 40
 
1.5%
. 31
 
1.1%
& 10
 
0.4%
1 7
 
0.3%
- 5
 
0.2%
3 5
 
0.2%
, 1
 
< 0.1%
Other values (5) 5
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14645
82.7%
ASCII 3058
 
17.3%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1321
 
9.0%
520
 
3.6%
513
 
3.5%
405
 
2.8%
383
 
2.6%
372
 
2.5%
288
 
2.0%
276
 
1.9%
238
 
1.6%
234
 
1.6%
Other values (513) 10095
68.9%
ASCII
ValueCountFrequency (%)
( 1132
37.0%
) 1131
37.0%
348
 
11.4%
2 40
 
1.3%
E 38
 
1.2%
S 36
 
1.2%
N 31
 
1.0%
. 31
 
1.0%
C 30
 
1.0%
T 19
 
0.6%
Other values (41) 222
 
7.3%
None
ValueCountFrequency (%)
1
100.0%
Distinct2542
Distinct (%)91.7%
Missing0
Missing (%)0.0%
Memory size21.8 KiB
2024-03-15T09:58:49.841724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length53
Mean length25.155123
Min length3

Characters and Unicode

Total characters69730
Distinct characters375
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2391 ?
Unique (%)86.3%

Sample

1st row경상북도 경산시 하양읍 가마실길 50, 알앤디비센터 105, M102호
2nd row경상북도 경산시 자인면 자인공단2로4길 8
3rd row경상북도 경산시 압량읍 의송길 91
4th row경상북도 경산시 남산면 하대리 7번지 외 7필지 외 7필지
5th row경상북도 경산시 남산면 서원천로 260-17
ValueCountFrequency (%)
경상북도 2717
 
16.9%
경산시 2716
 
16.9%
진량읍 861
 
5.4%
481
 
3.0%
압량읍 403
 
2.5%
와촌면 350
 
2.2%
1필지 263
 
1.6%
하양읍 258
 
1.6%
자인면 250
 
1.6%
남천면 227
 
1.4%
Other values (2323) 7527
46.9%
2024-03-15T09:58:51.602668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13282
19.0%
5558
 
8.0%
3253
 
4.7%
2787
 
4.0%
2776
 
4.0%
2735
 
3.9%
2725
 
3.9%
1 2305
 
3.3%
2 1674
 
2.4%
1540
 
2.2%
Other values (365) 31095
44.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42416
60.8%
Space Separator 13282
 
19.0%
Decimal Number 10864
 
15.6%
Dash Punctuation 1042
 
1.5%
Open Punctuation 814
 
1.2%
Close Punctuation 814
 
1.2%
Other Punctuation 297
 
0.4%
Uppercase Letter 155
 
0.2%
Lowercase Letter 45
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5558
 
13.1%
3253
 
7.7%
2787
 
6.6%
2776
 
6.5%
2735
 
6.4%
2725
 
6.4%
1540
 
3.6%
1449
 
3.4%
1355
 
3.2%
1296
 
3.1%
Other values (311) 16942
39.9%
Uppercase Letter
ValueCountFrequency (%)
B 30
19.4%
C 19
12.3%
D 17
11.0%
R 16
10.3%
A 14
9.0%
M 9
 
5.8%
G 9
 
5.8%
E 7
 
4.5%
N 6
 
3.9%
T 6
 
3.9%
Other values (10) 22
14.2%
Lowercase Letter
ValueCountFrequency (%)
e 17
37.8%
i 6
 
13.3%
l 5
 
11.1%
c 4
 
8.9%
o 4
 
8.9%
g 2
 
4.4%
s 2
 
4.4%
d 1
 
2.2%
u 1
 
2.2%
r 1
 
2.2%
Other values (2) 2
 
4.4%
Decimal Number
ValueCountFrequency (%)
1 2305
21.2%
2 1674
15.4%
3 1196
11.0%
4 1111
10.2%
5 947
8.7%
6 847
 
7.8%
0 779
 
7.2%
8 731
 
6.7%
7 697
 
6.4%
9 577
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 263
88.6%
& 18
 
6.1%
. 14
 
4.7%
/ 1
 
0.3%
: 1
 
0.3%
Open Punctuation
ValueCountFrequency (%)
( 809
99.4%
[ 5
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 809
99.4%
] 5
 
0.6%
Space Separator
ValueCountFrequency (%)
13282
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1042
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42378
60.8%
Common 27114
38.9%
Latin 200
 
0.3%
Han 38
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5558
 
13.1%
3253
 
7.7%
2787
 
6.6%
2776
 
6.6%
2735
 
6.5%
2725
 
6.4%
1540
 
3.6%
1449
 
3.4%
1355
 
3.2%
1296
 
3.1%
Other values (308) 16904
39.9%
Latin
ValueCountFrequency (%)
B 30
15.0%
C 19
 
9.5%
D 17
 
8.5%
e 17
 
8.5%
R 16
 
8.0%
A 14
 
7.0%
M 9
 
4.5%
G 9
 
4.5%
E 7
 
3.5%
N 6
 
3.0%
Other values (22) 56
28.0%
Common
ValueCountFrequency (%)
13282
49.0%
1 2305
 
8.5%
2 1674
 
6.2%
3 1196
 
4.4%
4 1111
 
4.1%
- 1042
 
3.8%
5 947
 
3.5%
6 847
 
3.1%
( 809
 
3.0%
) 809
 
3.0%
Other values (12) 3092
 
11.4%
Han
ValueCountFrequency (%)
19
50.0%
16
42.1%
3
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42378
60.8%
ASCII 27314
39.2%
CJK 38
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13282
48.6%
1 2305
 
8.4%
2 1674
 
6.1%
3 1196
 
4.4%
4 1111
 
4.1%
- 1042
 
3.8%
5 947
 
3.5%
6 847
 
3.1%
( 809
 
3.0%
) 809
 
3.0%
Other values (44) 3292
 
12.1%
Hangul
ValueCountFrequency (%)
5558
 
13.1%
3253
 
7.7%
2787
 
6.6%
2776
 
6.6%
2735
 
6.5%
2725
 
6.4%
1540
 
3.6%
1449
 
3.4%
1355
 
3.2%
1296
 
3.1%
Other values (308) 16904
39.9%
CJK
ValueCountFrequency (%)
19
50.0%
16
42.1%
3
 
7.9%
Distinct2586
Distinct (%)93.3%
Missing0
Missing (%)0.0%
Memory size21.8 KiB
2024-03-15T09:58:52.776231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length62
Median length53
Mean length24.992063
Min length15

Characters and Unicode

Total characters69278
Distinct characters254
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2438 ?
Unique (%)88.0%

Sample

1st row경상북도 경산시 하양읍 부호리 33 경일대학교 알앤디비센터 105, M102호
2nd row경상북도 경산시 자인면 북사리 1092-2번지
3rd row경상북도 경산시 압량읍 의송리 208번지
4th row경상북도 경산시 남산면 하대리 7번지 외 7필지
5th row경상북도 경산시 남산면 경리 15번지
ValueCountFrequency (%)
경상북도 2772
18.1%
경산시 2771
18.0%
진량읍 862
 
5.6%
490
 
3.2%
압량읍 416
 
2.7%
와촌면 362
 
2.4%
하양읍 272
 
1.8%
1필지 270
 
1.8%
신상리 261
 
1.7%
자인면 248
 
1.6%
Other values (2648) 6631
43.2%
2024-03-15T09:58:54.164379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12584
18.2%
5642
 
8.1%
3156
 
4.6%
3091
 
4.5%
2886
 
4.2%
2810
 
4.1%
1 2790
 
4.0%
2783
 
4.0%
2667
 
3.8%
2500
 
3.6%
Other values (244) 28369
40.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42859
61.9%
Space Separator 12584
 
18.2%
Decimal Number 11455
 
16.5%
Dash Punctuation 1772
 
2.6%
Open Punctuation 191
 
0.3%
Close Punctuation 191
 
0.3%
Uppercase Letter 116
 
0.2%
Other Punctuation 68
 
0.1%
Lowercase Letter 41
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5642
13.2%
3156
 
7.4%
3091
 
7.2%
2886
 
6.7%
2810
 
6.6%
2783
 
6.5%
2667
 
6.2%
2500
 
5.8%
1907
 
4.4%
1577
 
3.7%
Other values (204) 13840
32.3%
Uppercase Letter
ValueCountFrequency (%)
C 32
27.6%
B 25
21.6%
D 14
12.1%
R 14
12.1%
A 10
 
8.6%
M 5
 
4.3%
G 3
 
2.6%
E 2
 
1.7%
O 2
 
1.7%
P 2
 
1.7%
Other values (5) 7
 
6.0%
Decimal Number
ValueCountFrequency (%)
1 2790
24.4%
2 1596
13.9%
3 1186
10.4%
0 1099
 
9.6%
4 924
 
8.1%
5 909
 
7.9%
8 839
 
7.3%
6 735
 
6.4%
7 707
 
6.2%
9 670
 
5.8%
Lowercase Letter
ValueCountFrequency (%)
e 32
78.0%
l 5
 
12.2%
c 3
 
7.3%
i 1
 
2.4%
Other Punctuation
ValueCountFrequency (%)
, 31
45.6%
. 23
33.8%
& 13
19.1%
/ 1
 
1.5%
Open Punctuation
ValueCountFrequency (%)
( 186
97.4%
[ 5
 
2.6%
Close Punctuation
ValueCountFrequency (%)
) 186
97.4%
] 5
 
2.6%
Space Separator
ValueCountFrequency (%)
12584
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1772
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42699
61.6%
Common 26262
37.9%
Han 160
 
0.2%
Latin 157
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5642
13.2%
3156
 
7.4%
3091
 
7.2%
2886
 
6.8%
2810
 
6.6%
2783
 
6.5%
2667
 
6.2%
2500
 
5.9%
1907
 
4.5%
1577
 
3.7%
Other values (201) 13680
32.0%
Common
ValueCountFrequency (%)
12584
47.9%
1 2790
 
10.6%
- 1772
 
6.7%
2 1596
 
6.1%
3 1186
 
4.5%
0 1099
 
4.2%
4 924
 
3.5%
5 909
 
3.5%
8 839
 
3.2%
6 735
 
2.8%
Other values (11) 1828
 
7.0%
Latin
ValueCountFrequency (%)
e 32
20.4%
C 32
20.4%
B 25
15.9%
D 14
8.9%
R 14
8.9%
A 10
 
6.4%
M 5
 
3.2%
l 5
 
3.2%
c 3
 
1.9%
G 3
 
1.9%
Other values (9) 14
8.9%
Han
ValueCountFrequency (%)
80
50.0%
74
46.2%
6
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42699
61.6%
ASCII 26419
38.1%
CJK 160
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12584
47.6%
1 2790
 
10.6%
- 1772
 
6.7%
2 1596
 
6.0%
3 1186
 
4.5%
0 1099
 
4.2%
4 924
 
3.5%
5 909
 
3.4%
8 839
 
3.2%
6 735
 
2.8%
Other values (30) 1985
 
7.5%
Hangul
ValueCountFrequency (%)
5642
13.2%
3156
 
7.4%
3091
 
7.2%
2886
 
6.8%
2810
 
6.6%
2783
 
6.5%
2667
 
6.2%
2500
 
5.9%
1907
 
4.5%
1577
 
3.7%
Other values (201) 13680
32.0%
CJK
ValueCountFrequency (%)
80
50.0%
74
46.2%
6
 
3.8%
Distinct2140
Distinct (%)77.2%
Missing0
Missing (%)0.0%
Memory size21.8 KiB
2024-03-15T09:58:55.194992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length48
Mean length8.2074315
Min length1

Characters and Unicode

Total characters22751
Distinct characters702
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1924 ?
Unique (%)69.4%

Sample

1st row철도안전용품 등
2nd row튀김가루,소스류
3rd row면직물
4th row건축자재가공철근
5th row가드레일,난간,금속재울타리
ValueCountFrequency (%)
92
 
2.1%
자동차부품 74
 
1.7%
70
 
1.6%
연사 67
 
1.5%
자동차 33
 
0.8%
직물 33
 
0.8%
플라스틱 32
 
0.7%
창호 31
 
0.7%
금형 30
 
0.7%
부품 28
 
0.6%
Other values (2669) 3873
88.8%
2024-03-15T09:58:56.620815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1604
 
7.1%
, 1027
 
4.5%
565
 
2.5%
524
 
2.3%
502
 
2.2%
452
 
2.0%
393
 
1.7%
383
 
1.7%
368
 
1.6%
352
 
1.5%
Other values (692) 16581
72.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 18656
82.0%
Space Separator 1604
 
7.1%
Other Punctuation 1067
 
4.7%
Uppercase Letter 581
 
2.6%
Close Punctuation 272
 
1.2%
Open Punctuation 272
 
1.2%
Lowercase Letter 263
 
1.2%
Decimal Number 25
 
0.1%
Dash Punctuation 7
 
< 0.1%
Control 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
565
 
3.0%
524
 
2.8%
502
 
2.7%
452
 
2.4%
393
 
2.1%
383
 
2.1%
368
 
2.0%
352
 
1.9%
302
 
1.6%
294
 
1.6%
Other values (623) 14521
77.8%
Uppercase Letter
ValueCountFrequency (%)
P 86
14.8%
E 79
13.6%
C 51
 
8.8%
L 43
 
7.4%
T 37
 
6.4%
A 33
 
5.7%
D 33
 
5.7%
S 27
 
4.6%
V 27
 
4.6%
R 26
 
4.5%
Other values (15) 139
23.9%
Lowercase Letter
ValueCountFrequency (%)
e 34
12.9%
p 28
 
10.6%
a 19
 
7.2%
r 19
 
7.2%
s 18
 
6.8%
o 15
 
5.7%
l 15
 
5.7%
n 14
 
5.3%
i 14
 
5.3%
t 14
 
5.3%
Other values (12) 73
27.8%
Other Punctuation
ValueCountFrequency (%)
, 1027
96.3%
. 24
 
2.2%
/ 10
 
0.9%
· 2
 
0.2%
' 1
 
0.1%
1
 
0.1%
? 1
 
0.1%
% 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
4 6
24.0%
1 5
20.0%
0 5
20.0%
2 5
20.0%
3 3
12.0%
8 1
 
4.0%
Close Punctuation
ValueCountFrequency (%)
) 271
99.6%
] 1
 
0.4%
Open Punctuation
ValueCountFrequency (%)
( 271
99.6%
[ 1
 
0.4%
Control
ValueCountFrequency (%)
2
50.0%
2
50.0%
Space Separator
ValueCountFrequency (%)
1604
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 18654
82.0%
Common 3251
 
14.3%
Latin 844
 
3.7%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
565
 
3.0%
524
 
2.8%
502
 
2.7%
452
 
2.4%
393
 
2.1%
383
 
2.1%
368
 
2.0%
352
 
1.9%
302
 
1.6%
294
 
1.6%
Other values (622) 14519
77.8%
Latin
ValueCountFrequency (%)
P 86
 
10.2%
E 79
 
9.4%
C 51
 
6.0%
L 43
 
5.1%
T 37
 
4.4%
e 34
 
4.0%
A 33
 
3.9%
D 33
 
3.9%
p 28
 
3.3%
S 27
 
3.2%
Other values (37) 393
46.6%
Common
ValueCountFrequency (%)
1604
49.3%
, 1027
31.6%
) 271
 
8.3%
( 271
 
8.3%
. 24
 
0.7%
/ 10
 
0.3%
- 7
 
0.2%
4 6
 
0.2%
1 5
 
0.2%
0 5
 
0.2%
Other values (12) 21
 
0.6%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 18654
82.0%
ASCII 4092
 
18.0%
None 3
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1604
39.2%
, 1027
25.1%
) 271
 
6.6%
( 271
 
6.6%
P 86
 
2.1%
E 79
 
1.9%
C 51
 
1.2%
L 43
 
1.1%
T 37
 
0.9%
e 34
 
0.8%
Other values (57) 589
 
14.4%
Hangul
ValueCountFrequency (%)
565
 
3.0%
524
 
2.8%
502
 
2.7%
452
 
2.4%
393
 
2.1%
383
 
2.1%
368
 
2.0%
352
 
1.9%
302
 
1.6%
294
 
1.6%
Other values (622) 14519
77.8%
None
ValueCountFrequency (%)
· 2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
2
100.0%
Distinct659
Distinct (%)23.8%
Missing0
Missing (%)0.0%
Memory size21.8 KiB
2024-03-15T09:58:57.687874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length28
Mean length16.959957
Min length5

Characters and Unicode

Total characters47013
Distinct characters329
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique310 ?
Unique (%)11.2%

Sample

1st row그 외 기타 분류 안된 금속 가공 제품 제조업
2nd row천연 및 혼합조제 조미료 제조업
3rd row면직물 직조업 외 2 종
4th row그 외 기타 금속가공업
5th row구조용 금속 판제품 및 공작물 제조업 외 7 종
ValueCountFrequency (%)
제조업 2308
 
15.2%
1343
 
8.8%
1220
 
8.0%
944
 
6.2%
기타 694
 
4.6%
1 563
 
3.7%
399
 
2.6%
금속 267
 
1.8%
신품 233
 
1.5%
부품 214
 
1.4%
Other values (592) 7000
46.1%
2024-03-15T09:58:59.162468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12413
26.4%
3048
 
6.5%
2928
 
6.2%
2858
 
6.1%
1370
 
2.9%
1234
 
2.6%
1220
 
2.6%
1147
 
2.4%
984
 
2.1%
920
 
2.0%
Other values (319) 18891
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 33258
70.7%
Space Separator 12413
 
26.4%
Decimal Number 960
 
2.0%
Other Punctuation 336
 
0.7%
Close Punctuation 23
 
< 0.1%
Open Punctuation 23
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3048
 
9.2%
2928
 
8.8%
2858
 
8.6%
1370
 
4.1%
1234
 
3.7%
1220
 
3.7%
1147
 
3.4%
984
 
3.0%
920
 
2.8%
713
 
2.1%
Other values (304) 16836
50.6%
Decimal Number
ValueCountFrequency (%)
1 579
60.3%
3 165
 
17.2%
2 123
 
12.8%
4 40
 
4.2%
6 22
 
2.3%
5 19
 
2.0%
7 5
 
0.5%
9 3
 
0.3%
8 3
 
0.3%
0 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
, 321
95.5%
. 15
 
4.5%
Space Separator
ValueCountFrequency (%)
12413
100.0%
Close Punctuation
ValueCountFrequency (%)
) 23
100.0%
Open Punctuation
ValueCountFrequency (%)
( 23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 33258
70.7%
Common 13755
29.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3048
 
9.2%
2928
 
8.8%
2858
 
8.6%
1370
 
4.1%
1234
 
3.7%
1220
 
3.7%
1147
 
3.4%
984
 
3.0%
920
 
2.8%
713
 
2.1%
Other values (304) 16836
50.6%
Common
ValueCountFrequency (%)
12413
90.2%
1 579
 
4.2%
, 321
 
2.3%
3 165
 
1.2%
2 123
 
0.9%
4 40
 
0.3%
) 23
 
0.2%
( 23
 
0.2%
6 22
 
0.2%
5 19
 
0.1%
Other values (5) 27
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 33224
70.7%
ASCII 13755
29.3%
Compat Jamo 34
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12413
90.2%
1 579
 
4.2%
, 321
 
2.3%
3 165
 
1.2%
2 123
 
0.9%
4 40
 
0.3%
) 23
 
0.2%
( 23
 
0.2%
6 22
 
0.2%
5 19
 
0.1%
Other values (5) 27
 
0.2%
Hangul
ValueCountFrequency (%)
3048
 
9.2%
2928
 
8.8%
2858
 
8.6%
1370
 
4.1%
1234
 
3.7%
1220
 
3.7%
1147
 
3.5%
984
 
3.0%
920
 
2.8%
713
 
2.1%
Other values (303) 16802
50.6%
Compat Jamo
ValueCountFrequency (%)
34
100.0%
Distinct1910
Distinct (%)68.9%
Missing0
Missing (%)0.0%
Memory size21.8 KiB
2024-03-15T09:59:00.004685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length9.8391053
Min length3

Characters and Unicode

Total characters27274
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1739 ?
Unique (%)62.7%

Sample

1st row053-852-8439
2nd row053-856-9928
3rd row053-811-4360
4th row053-851-8334
5th row052-257-6255
ValueCountFrequency (%)
부존재 669
 
24.1%
053-856-5101 5
 
0.2%
053-813-4518 4
 
0.1%
053-962-4839 4
 
0.1%
053-859-1100 4
 
0.1%
053-851-8600 3
 
0.1%
053-852-2601 3
 
0.1%
053-981-8806 3
 
0.1%
053-856-9811 3
 
0.1%
053-817-9113 3
 
0.1%
Other values (1900) 2071
74.7%
2024-03-15T09:59:01.129999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 4272
15.7%
- 4200
15.4%
0 3514
12.9%
3 3269
12.0%
8 2701
9.9%
1 1976
7.2%
7 1267
 
4.6%
2 1168
 
4.3%
6 1145
 
4.2%
4 941
 
3.5%
Other values (4) 2821
10.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 21067
77.2%
Dash Punctuation 4200
 
15.4%
Other Letter 2007
 
7.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 4272
20.3%
0 3514
16.7%
3 3269
15.5%
8 2701
12.8%
1 1976
9.4%
7 1267
 
6.0%
2 1168
 
5.5%
6 1145
 
5.4%
4 941
 
4.5%
9 814
 
3.9%
Other Letter
ValueCountFrequency (%)
669
33.3%
669
33.3%
669
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 4200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 25267
92.6%
Hangul 2007
 
7.4%

Most frequent character per script

Common
ValueCountFrequency (%)
5 4272
16.9%
- 4200
16.6%
0 3514
13.9%
3 3269
12.9%
8 2701
10.7%
1 1976
7.8%
7 1267
 
5.0%
2 1168
 
4.6%
6 1145
 
4.5%
4 941
 
3.7%
Hangul
ValueCountFrequency (%)
669
33.3%
669
33.3%
669
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25267
92.6%
Hangul 2007
 
7.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 4272
16.9%
- 4200
16.6%
0 3514
13.9%
3 3269
12.9%
8 2701
10.7%
1 1976
7.8%
7 1267
 
5.0%
2 1168
 
4.6%
6 1145
 
4.5%
4 941
 
3.7%
Hangul
ValueCountFrequency (%)
669
33.3%
669
33.3%
669
33.3%
Distinct1680
Distinct (%)60.6%
Missing0
Missing (%)0.0%
Memory size21.8 KiB
2024-03-15T09:59:01.969862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length8.994228
Min length3

Characters and Unicode

Total characters24932
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1543 ?
Unique (%)55.7%

Sample

1st row053-852-8440
2nd row053-856-3555
3rd row053-811-4361
4th row053-851-8335
5th row052-257-6233
ValueCountFrequency (%)
부존재 932
33.6%
053-856-8031 5
 
0.2%
053-813-8528 4
 
0.1%
053-811-7376 4
 
0.1%
053-859-1110 4
 
0.1%
053-856-1153 3
 
0.1%
053-382-0423 3
 
0.1%
053-852-2344 3
 
0.1%
053-856-7202 3
 
0.1%
053-964-3398 3
 
0.1%
Other values (1670) 1808
65.2%
2024-03-15T09:59:02.996154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 3752
15.0%
- 3678
14.8%
3 2874
11.5%
0 2829
11.3%
8 2289
9.2%
1 1518
6.1%
7 1179
 
4.7%
2 1082
 
4.3%
6 1069
 
4.3%
4 957
 
3.8%
Other values (4) 3705
14.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18404
73.8%
Dash Punctuation 3678
 
14.8%
Other Letter 2850
 
11.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 3752
20.4%
3 2874
15.6%
0 2829
15.4%
8 2289
12.4%
1 1518
8.2%
7 1179
 
6.4%
2 1082
 
5.9%
6 1069
 
5.8%
4 957
 
5.2%
9 855
 
4.6%
Other Letter
ValueCountFrequency (%)
950
33.3%
950
33.3%
950
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 3678
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 22082
88.6%
Hangul 2850
 
11.4%

Most frequent character per script

Common
ValueCountFrequency (%)
5 3752
17.0%
- 3678
16.7%
3 2874
13.0%
0 2829
12.8%
8 2289
10.4%
1 1518
6.9%
7 1179
 
5.3%
2 1082
 
4.9%
6 1069
 
4.8%
4 957
 
4.3%
Hangul
ValueCountFrequency (%)
950
33.3%
950
33.3%
950
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 22082
88.6%
Hangul 2850
 
11.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 3752
17.0%
- 3678
16.7%
3 2874
13.0%
0 2829
12.8%
8 2289
10.4%
1 1518
6.9%
7 1179
 
5.3%
2 1082
 
4.9%
6 1069
 
4.8%
4 957
 
4.3%
Hangul
ValueCountFrequency (%)
950
33.3%
950
33.3%
950
33.3%

Missing values

2024-03-15T09:58:46.058271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T09:58:46.419041image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회사명공장대표주소(도로명)공장대표주소(지번)생산품업종명전화번호팩스번호
0(주) 레일온경상북도 경산시 하양읍 가마실길 50, 알앤디비센터 105, M102호경상북도 경산시 하양읍 부호리 33 경일대학교 알앤디비센터 105, M102호철도안전용품 등그 외 기타 분류 안된 금속 가공 제품 제조업053-852-8439053-852-8440
1(유)농업회사법인 삼미식품경상북도 경산시 자인면 자인공단2로4길 8경상북도 경산시 자인면 북사리 1092-2번지튀김가루,소스류천연 및 혼합조제 조미료 제조업053-856-9928053-856-3555
2(유)협신산자경상북도 경산시 압량읍 의송길 91경상북도 경산시 압량읍 의송리 208번지면직물면직물 직조업 외 2 종053-811-4360053-811-4361
3(주) 경도철강 가공센터경상북도 경산시 남산면 하대리 7번지 외 7필지 외 7필지경상북도 경산시 남산면 하대리 7번지 외 7필지건축자재가공철근그 외 기타 금속가공업053-851-8334053-851-8335
4(주) 썬로드경상북도 경산시 남산면 서원천로 260-17경상북도 경산시 남산면 경리 15번지가드레일,난간,금속재울타리구조용 금속 판제품 및 공작물 제조업 외 7 종052-257-6255052-257-6233
5(주) 현진경상북도 경산시 남천면 대명길3길 24-37경상북도 경산시 남천면 대명리 309번지가로등주,LED조명,무대장치,태양광발전장치일반용 전기 조명장치 제조업 외 3 종053-751-1417053-742-1417
6(주) 화산경상북도 경산시 진량읍 일연로115길 18경상북도 경산시 진량읍 선화리 141-3번지일회용부탄가스캔금속 캔 및 기타 포장용기 제조업054-335-6666054-335-6683
7(주)E.V산업경상북도 경산시 진량읍 일연로 491-2 (유성ENG)경상북도 경산시 진량읍 가야리 40-0번지흡습제그 외 기타 분류 안된 화학제품 제조업 외 1 종053-816-1505053-816-1505
8(주)KDS경상북도 경산시 진량읍 공단6로 77 (에스엘주 진량공장)경상북도 경산시 진량읍 신상리 1208-6번지자동차문짝여닫음장치그 외 자동차용 신품 부품 제조업 외 3 종053-856-4500053-856-4504
9(주)KS레미콘경상북도 경산시 압량읍 가일길 79 (K.S레미콘)경상북도 경산시 압량읍 가일리 462-11번지레미콘레미콘 제조업053-818-8161부존재
회사명공장대표주소(도로명)공장대표주소(지번)생산품업종명전화번호팩스번호
2762휴먼플러스(주)CNC사업본부경상북도 경산시 진량읍 공단9로 6경상북도 경산시 진량읍 신제리 575번지자동차부품그 외 자동차용 신품 부품 제조업 외 1 종053-710-2030053-852-2031
2763휴몬트코리아경상북도 경산시 진량읍 공단4로 127경상북도 경산시 진량읍 신상리 1207-2등산용 스틱, 캠핑용 의자기타 운동 및 경기용구 제조업 외 1 종부존재053-626-1501
2764흥생농장경상북도 경산시 진량읍 선화리 227 외 2필지경상북도 경산시 진량읍 선화리 227 외 2필지구운계란그 외 기타 식료품 제조업부존재부존재
2765흥생산업경상북도 경산시 압량읍 정상지길42길 3경상북도 경산시 압량읍 당음리 435-32금속문,샤시,창틀금속 문, 창, 셔터 및 관련제품 제조업부존재부존재
2766흥생푸드경상북도 경산시 와촌면 계전길9길 87-21경상북도 경산시 와촌면 계전리 174가공란(알가공업)그 외 기타 식료품 제조업부존재부존재
2767흥성실업경상북도 경산시 남천면 신석길 46-7경상북도 경산시 남천면 신석리 459-13번지견및인조섬유직물화학섬유직물 직조업053-812-3331053-812-3331
2768흥욱산업경상북도 경산시 와촌면 박사강변로 162-4 외 1필지경상북도 경산시 와촌면 박사리 21-2 외 1필지선별기, 세척기, 건조기농업 및 임업용 기계 제조업부존재부존재
2769흥창스틸(주)경상북도 경산시 자인면 한장군로 412, (북사리 1084-3) 외 1필지경상북도 경산시 자인면 북사리 1084-3번지 (북사리 1084-3) 외 1필지철망,휀스금속선 가공제품 제조업 외 1 종053-851-8486053-851-1750
2770희승무역주식회사경상북도 경산시 진량읍 진성로 407-22 (총 2 필지) 외 1필지경상북도 경산시 진량읍 광석리 178번지 외 1필지기계자수자수제품 및 자수용재료 제조업 외 1 종053-853-7744053-853-8825
2771히아브 특장경상북도 경산시 와촌면 계당리 172-2 외 1필지경상북도 경산시 와촌면 계당리 172-2 외 1필지유압적하기산업용 트럭 및 적재기 제조업053-852-7708부존재

Duplicate rows

Most frequently occurring

회사명공장대표주소(도로명)공장대표주소(지번)생산품업종명전화번호팩스번호# duplicates
0남경산업경상북도 경산시 남천면 상대로 127 (주동제C&P) 외 1필지경상북도 경산시 남천면 송백리 159번지 외 1필지캐릭터카드기타 인쇄업 외 1 종053-813-6600부존재2
1대우연사경상북도 경산시 남산면 대왕로 60-9경상북도 경산시 남산면 산양리 288번지연사연사 및 가공사 제조업053-792-1751부존재2