Overview

Dataset statistics

Number of variables6
Number of observations855
Missing cells205
Missing cells (%)4.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory40.2 KiB
Average record size in memory48.2 B

Variable types

Text5
DateTime1

Dataset

Description제조업체 등록현황(기업명,대표자명,업종명,전화번호,사업장소재지)에대한제조업체현황을참고바랍니다.
Author경상북도 성주군
URLhttps://www.data.go.kr/data/15031119/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 205 (24.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 11:49:50.439535
Analysis finished2023-12-12 11:49:51.324909
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct838
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-12-12T20:49:51.542349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length6.3380117
Min length2

Characters and Unicode

Total characters5419
Distinct characters367
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique823 ?
Unique (%)96.3%

Sample

1st row(주)CK월드
2nd row(주)고려창호
3rd row(주)그린텍
4th row(주)대교
5th row(주)대림산업
ValueCountFrequency (%)
주식회사 30
 
3.3%
성주공장 7
 
0.8%
성주지점 5
 
0.5%
제2공장 4
 
0.4%
하나섬유 3
 
0.3%
현대산업 3
 
0.3%
영창케미칼(주 3
 
0.3%
우리산업 2
 
0.2%
주)청진이엔씨 2
 
0.2%
주)금성산업 2
 
0.2%
Other values (839) 857
93.4%
2023-12-12T20:49:52.185481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
453
 
8.4%
( 385
 
7.1%
) 385
 
7.1%
192
 
3.5%
174
 
3.2%
133
 
2.5%
122
 
2.3%
113
 
2.1%
95
 
1.8%
77
 
1.4%
Other values (357) 3290
60.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4485
82.8%
Open Punctuation 385
 
7.1%
Close Punctuation 385
 
7.1%
Uppercase Letter 77
 
1.4%
Space Separator 63
 
1.2%
Decimal Number 13
 
0.2%
Lowercase Letter 6
 
0.1%
Other Punctuation 5
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
453
 
10.1%
192
 
4.3%
174
 
3.9%
133
 
3.0%
122
 
2.7%
113
 
2.5%
95
 
2.1%
77
 
1.7%
68
 
1.5%
68
 
1.5%
Other values (321) 2990
66.7%
Uppercase Letter
ValueCountFrequency (%)
S 11
14.3%
E 9
11.7%
C 8
10.4%
M 6
 
7.8%
H 5
 
6.5%
G 5
 
6.5%
A 4
 
5.2%
F 4
 
5.2%
N 4
 
5.2%
K 3
 
3.9%
Other values (11) 18
23.4%
Lowercase Letter
ValueCountFrequency (%)
n 1
16.7%
a 1
16.7%
l 1
16.7%
p 1
16.7%
t 1
16.7%
m 1
16.7%
Decimal Number
ValueCountFrequency (%)
2 10
76.9%
4 1
 
7.7%
1 1
 
7.7%
3 1
 
7.7%
Other Punctuation
ValueCountFrequency (%)
& 4
80.0%
. 1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 385
100.0%
Close Punctuation
ValueCountFrequency (%)
) 385
100.0%
Space Separator
ValueCountFrequency (%)
63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4485
82.8%
Common 851
 
15.7%
Latin 83
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
453
 
10.1%
192
 
4.3%
174
 
3.9%
133
 
3.0%
122
 
2.7%
113
 
2.5%
95
 
2.1%
77
 
1.7%
68
 
1.5%
68
 
1.5%
Other values (321) 2990
66.7%
Latin
ValueCountFrequency (%)
S 11
13.3%
E 9
 
10.8%
C 8
 
9.6%
M 6
 
7.2%
H 5
 
6.0%
G 5
 
6.0%
A 4
 
4.8%
F 4
 
4.8%
N 4
 
4.8%
K 3
 
3.6%
Other values (17) 24
28.9%
Common
ValueCountFrequency (%)
( 385
45.2%
) 385
45.2%
63
 
7.4%
2 10
 
1.2%
& 4
 
0.5%
4 1
 
0.1%
1 1
 
0.1%
3 1
 
0.1%
. 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4485
82.8%
ASCII 934
 
17.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
453
 
10.1%
192
 
4.3%
174
 
3.9%
133
 
3.0%
122
 
2.7%
113
 
2.5%
95
 
2.1%
77
 
1.7%
68
 
1.5%
68
 
1.5%
Other values (321) 2990
66.7%
ASCII
ValueCountFrequency (%)
( 385
41.2%
) 385
41.2%
63
 
6.7%
S 11
 
1.2%
2 10
 
1.1%
E 9
 
1.0%
C 8
 
0.9%
M 6
 
0.6%
H 5
 
0.5%
G 5
 
0.5%
Other values (26) 47
 
5.0%
Distinct792
Distinct (%)92.6%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-12-12T20:49:52.597422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length3
Mean length3.1555556
Min length2

Characters and Unicode

Total characters2698
Distinct characters210
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique734 ?
Unique (%)85.8%

Sample

1st row조광섭
2nd row최상기
3rd row이정곤
4th row김칠연
5th row허만우
ValueCountFrequency (%)
이성일 4
 
0.5%
강병하 3
 
0.3%
강일규 3
 
0.3%
박종대 3
 
0.3%
유정희 3
 
0.3%
김영길 2
 
0.2%
김은진 2
 
0.2%
윤상억 2
 
0.2%
이구휘 2
 
0.2%
박치근 2
 
0.2%
Other values (799) 851
97.0%
2023-12-12T20:49:53.143798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
170
 
6.3%
134
 
5.0%
90
 
3.3%
80
 
3.0%
64
 
2.4%
62
 
2.3%
54
 
2.0%
45
 
1.7%
42
 
1.6%
40
 
1.5%
Other values (200) 1917
71.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2647
98.1%
Space Separator 25
 
0.9%
Other Punctuation 23
 
0.9%
Decimal Number 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
170
 
6.4%
134
 
5.1%
90
 
3.4%
80
 
3.0%
64
 
2.4%
62
 
2.3%
54
 
2.0%
45
 
1.7%
42
 
1.6%
40
 
1.5%
Other values (196) 1866
70.5%
Other Punctuation
ValueCountFrequency (%)
, 22
95.7%
. 1
 
4.3%
Space Separator
ValueCountFrequency (%)
25
100.0%
Decimal Number
ValueCountFrequency (%)
1 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2647
98.1%
Common 51
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
170
 
6.4%
134
 
5.1%
90
 
3.4%
80
 
3.0%
64
 
2.4%
62
 
2.3%
54
 
2.0%
45
 
1.7%
42
 
1.6%
40
 
1.5%
Other values (196) 1866
70.5%
Common
ValueCountFrequency (%)
25
49.0%
, 22
43.1%
1 3
 
5.9%
. 1
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2647
98.1%
ASCII 51
 
1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
170
 
6.4%
134
 
5.1%
90
 
3.4%
80
 
3.0%
64
 
2.4%
62
 
2.3%
54
 
2.0%
45
 
1.7%
42
 
1.6%
40
 
1.5%
Other values (196) 1866
70.5%
ASCII
ValueCountFrequency (%)
25
49.0%
, 22
43.1%
1 3
 
5.9%
. 1
 
2.0%

전화번호
Text

MISSING 

Distinct618
Distinct (%)95.1%
Missing205
Missing (%)24.0%
Memory size6.8 KiB
2023-12-12T20:49:53.436816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.998462
Min length11

Characters and Unicode

Total characters7799
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique589 ?
Unique (%)90.6%

Sample

1st row054-932-3898
2nd row054-933-4502
3rd row053-586-6222
4th row053-587-0062
5th row054-974-6006
ValueCountFrequency (%)
054-931-8907 3
 
0.5%
054-933-2519 3
 
0.5%
054-933-3991 3
 
0.5%
054-931-9567 2
 
0.3%
054-931-6240 2
 
0.3%
054-933-1771 2
 
0.3%
054-933-5102 2
 
0.3%
054-933-8954 2
 
0.3%
054-930-2500 2
 
0.3%
054-933-0686 2
 
0.3%
Other values (608) 627
96.5%
2023-12-12T20:49:54.014498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1300
16.7%
3 1085
13.9%
0 991
12.7%
5 960
12.3%
4 843
10.8%
9 796
10.2%
1 611
7.8%
2 362
 
4.6%
8 300
 
3.8%
6 281
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6499
83.3%
Dash Punctuation 1300
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1085
16.7%
0 991
15.2%
5 960
14.8%
4 843
13.0%
9 796
12.2%
1 611
9.4%
2 362
 
5.6%
8 300
 
4.6%
6 281
 
4.3%
7 270
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 1300
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7799
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1300
16.7%
3 1085
13.9%
0 991
12.7%
5 960
12.3%
4 843
10.8%
9 796
10.2%
1 611
7.8%
2 362
 
4.6%
8 300
 
3.8%
6 281
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7799
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1300
16.7%
3 1085
13.9%
0 991
12.7%
5 960
12.3%
4 843
10.8%
9 796
10.2%
1 611
7.8%
2 362
 
4.6%
8 300
 
3.8%
6 281
 
3.6%
Distinct789
Distinct (%)92.3%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-12-12T20:49:54.483932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length27
Mean length17.414035
Min length13

Characters and Unicode

Total characters14889
Distinct characters109
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique733 ?
Unique (%)85.7%

Sample

1st row성주군 성주읍 성주산업단지로1길 3
2nd row성주군 성주읍 성주산업단지로2길 144
3rd row성주군 성주읍 성주산업단지로1길 120
4th row성주군 성주읍 성주산업단지로 24
5th row성주군 성주읍 성주산업단지로2길 96
ValueCountFrequency (%)
성주군 854
24.8%
선남면 408
 
11.8%
성주읍 131
 
3.8%
월항면 126
 
3.7%
용암면 109
 
3.2%
선노로 52
 
1.5%
나선로 35
 
1.0%
초전면 34
 
1.0%
월항농공단지1길 31
 
0.9%
명관로 30
 
0.9%
Other values (776) 1637
47.5%
2023-12-12T20:49:55.576984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2594
17.4%
1160
 
7.8%
1117
 
7.5%
855
 
5.7%
724
 
4.9%
1 586
 
3.9%
536
 
3.6%
2 486
 
3.3%
421
 
2.8%
379
 
2.5%
Other values (99) 6031
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8919
59.9%
Decimal Number 3011
 
20.2%
Space Separator 2594
 
17.4%
Dash Punctuation 351
 
2.4%
Other Punctuation 13
 
0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1160
13.0%
1117
 
12.5%
855
 
9.6%
724
 
8.1%
536
 
6.0%
421
 
4.7%
379
 
4.2%
333
 
3.7%
325
 
3.6%
240
 
2.7%
Other values (85) 2829
31.7%
Decimal Number
ValueCountFrequency (%)
1 586
19.5%
2 486
16.1%
3 349
11.6%
4 304
10.1%
5 254
8.4%
7 230
 
7.6%
6 227
 
7.5%
9 201
 
6.7%
8 189
 
6.3%
0 185
 
6.1%
Space Separator
ValueCountFrequency (%)
2594
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 351
100.0%
Other Punctuation
ValueCountFrequency (%)
, 13
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8919
59.9%
Common 5969
40.1%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1160
13.0%
1117
 
12.5%
855
 
9.6%
724
 
8.1%
536
 
6.0%
421
 
4.7%
379
 
4.2%
333
 
3.7%
325
 
3.6%
240
 
2.7%
Other values (85) 2829
31.7%
Common
ValueCountFrequency (%)
2594
43.5%
1 586
 
9.8%
2 486
 
8.1%
- 351
 
5.9%
3 349
 
5.8%
4 304
 
5.1%
5 254
 
4.3%
7 230
 
3.9%
6 227
 
3.8%
9 201
 
3.4%
Other values (3) 387
 
6.5%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8919
59.9%
ASCII 5970
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2594
43.5%
1 586
 
9.8%
2 486
 
8.1%
- 351
 
5.9%
3 349
 
5.8%
4 304
 
5.1%
5 254
 
4.3%
7 230
 
3.9%
6 227
 
3.8%
9 201
 
3.4%
Other values (4) 388
 
6.5%
Hangul
ValueCountFrequency (%)
1160
13.0%
1117
 
12.5%
855
 
9.6%
724
 
8.1%
536
 
6.0%
421
 
4.7%
379
 
4.2%
333
 
3.7%
325
 
3.6%
240
 
2.7%
Other values (85) 2829
31.7%
Distinct291
Distinct (%)34.0%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
2023-12-12T20:49:55.989939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length26
Mean length14.977778
Min length4

Characters and Unicode

Total characters12806
Distinct characters261
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique170 ?
Unique (%)19.9%

Sample

1st row기체 여과기 제조업
2nd row금속 문, 창, 셔터 및 관련제품 제조업 외 1 종
3rd row그외 기타 자동차 부품 제조업
4th row그외 기타 일반목적용 기계 제조업
5th row그외 기타 플라스틱 제품 제조업 외 1 종
ValueCountFrequency (%)
제조업 687
 
18.3%
380
 
10.1%
기타 207
 
5.5%
130
 
3.5%
130
 
3.5%
그외 92
 
2.4%
1 84
 
2.2%
연사 71
 
1.9%
가공사 71
 
1.9%
플라스틱 66
 
1.8%
Other values (366) 1839
48.9%
2023-12-12T20:49:56.748990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2905
22.7%
904
 
7.1%
887
 
6.9%
886
 
6.9%
380
 
3.0%
374
 
2.9%
262
 
2.0%
233
 
1.8%
223
 
1.7%
213
 
1.7%
Other values (251) 5539
43.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9649
75.3%
Space Separator 2905
 
22.7%
Decimal Number 136
 
1.1%
Other Punctuation 116
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
904
 
9.4%
887
 
9.2%
886
 
9.2%
380
 
3.9%
374
 
3.9%
262
 
2.7%
233
 
2.4%
223
 
2.3%
213
 
2.2%
183
 
1.9%
Other values (241) 5104
52.9%
Decimal Number
ValueCountFrequency (%)
1 90
66.2%
2 22
 
16.2%
3 7
 
5.1%
4 6
 
4.4%
5 5
 
3.7%
0 4
 
2.9%
6 2
 
1.5%
Other Punctuation
ValueCountFrequency (%)
, 111
95.7%
· 5
 
4.3%
Space Separator
ValueCountFrequency (%)
2905
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9649
75.3%
Common 3157
 
24.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
904
 
9.4%
887
 
9.2%
886
 
9.2%
380
 
3.9%
374
 
3.9%
262
 
2.7%
233
 
2.4%
223
 
2.3%
213
 
2.2%
183
 
1.9%
Other values (241) 5104
52.9%
Common
ValueCountFrequency (%)
2905
92.0%
, 111
 
3.5%
1 90
 
2.9%
2 22
 
0.7%
3 7
 
0.2%
4 6
 
0.2%
5 5
 
0.2%
· 5
 
0.2%
0 4
 
0.1%
6 2
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9649
75.3%
ASCII 3152
 
24.6%
None 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2905
92.2%
, 111
 
3.5%
1 90
 
2.9%
2 22
 
0.7%
3 7
 
0.2%
4 6
 
0.2%
5 5
 
0.2%
0 4
 
0.1%
6 2
 
0.1%
Hangul
ValueCountFrequency (%)
904
 
9.4%
887
 
9.2%
886
 
9.2%
380
 
3.9%
374
 
3.9%
262
 
2.7%
233
 
2.4%
223
 
2.3%
213
 
2.2%
183
 
1.9%
Other values (241) 5104
52.9%
None
ValueCountFrequency (%)
· 5
100.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.8 KiB
Minimum2018-07-31 00:00:00
Maximum2018-07-31 00:00:00
2023-12-12T20:49:56.930123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:49:57.072249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2023-12-12T20:49:51.128751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:49:51.255826image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회사명대표자명전화번호공장대표주소업종명데이터기준일자
0(주)CK월드조광섭054-932-3898성주군 성주읍 성주산업단지로1길 3기체 여과기 제조업2018-07-31
1(주)고려창호최상기<NA>성주군 성주읍 성주산업단지로2길 144금속 문, 창, 셔터 및 관련제품 제조업 외 1 종2018-07-31
2(주)그린텍이정곤054-933-4502성주군 성주읍 성주산업단지로1길 120그외 기타 자동차 부품 제조업2018-07-31
3(주)대교김칠연053-586-6222성주군 성주읍 성주산업단지로 24그외 기타 일반목적용 기계 제조업2018-07-31
4(주)대림산업허만우053-587-0062성주군 성주읍 성주산업단지로2길 96그외 기타 플라스틱 제품 제조업 외 1 종2018-07-31
5(주)대일산업서상목054-974-6006성주군 성주읍 성주산업단지로 50금속 조립구조재 제조업2018-07-31
6(주)대창하이테크한원규054-932-1235성주군 성주읍 성주산업단지로1길 17금속 성형기계 제조업 외 1 종2018-07-31
7(주)동양엔지니어링백인국054-932-9842성주군 성주읍 성주산업단지로1길 54자동차 엔진용 부품 제조업2018-07-31
8(주)맥스로텍 성주공장김인환053-584-6540성주군 성주읍 성주산업단지로2길 131자동차 엔진용 부품 제조업 외 2 종2018-07-31
9(주)문루프곽민정<NA>성주군 성주읍 성주산업단지로 80그외 기타 자동차 부품 제조업2018-07-31
회사명대표자명전화번호공장대표주소업종명데이터기준일자
845현텍스박군호054-933-5142성주군 선남면 나선로 882화학섬유직물 직조업2018-07-31
846화남산업도재화054-931-9966성주군 대가면 도남2길 21부직포 및 펠트 제조업2018-07-31
847현진폴리머김향연<NA>성주군 선남면 선월로 29가공 및 재생 플라스틱원료 생산업2018-07-31
848호식이푸드김병훈<NA>성주군 선남면 용신리 1407번지그외 기타 식료품 제조업2018-07-31
849황금 ENG정찬기<NA>성주군 선남면 문방1길 61-11포장용 플라스틱 성형용기 제조업2018-07-31
850호명건설배복용054-277-1064성주군 선남면 명포1길 16금속 조립구조재 제조업2018-07-31
851화신코어김상현<NA>성주군 용암면 선송리 163-1번지주형 및 금형 제조업2018-07-31
852효명테크(주)이효환054-931-9948성주군 선남면 선노로 540-12그외 기타 자동차 부품 제조업2018-07-31
853희성섬유신명옥<NA>성주군 선남면 도성리 73-1번지연사 및 가공사 제조업2018-07-31
854흥일산업유일흥053-584-3272성주군 용암면 사곡길 190표면가공목재 및 특정 목적용 제재목 제조업2018-07-31