Overview

Dataset statistics

Number of variables: 4
Number of observations: 36
Missing cells: 4
Missing cells (%): 2.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.3 KiB
Average record size in memory: 35.7 B
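
The derived figures above can be checked with simple arithmetic. The sketch below assumes that the missing-cell percentage is taken over all cells (variables x observations) and that total memory is roughly the average record size times the row count; both are assumptions, checked only against the rounded values in this report.

```python
# Figures copied from the dataset statistics above.
n_variables = 4
n_observations = 36
missing_cells = 4
avg_record_bytes = 35.7

# Assumption: the percentage is computed over all cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size is approximately rows x average record size.
total_kib = avg_record_bytes * n_observations / 1024

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
print(f"about {total_kib:.1f} KiB in memory")
```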

Variable types

Text: 4

Dataset

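Description: Textile production factories located in Gyeongju-si (fiber, nonwoven fabric, cotton yarn, masks, cotton fabric, etc.), listing company name, address, contact details, products, and so on.
URL: https://www.data.go.kr/data/15062579/fileData.do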

Alerts

주원자재 (main raw materials) has 4 (11.1%) missing values [Missing]
회사명 (company name) has unique values [Unique]
생산품 (products) has unique values [Unique]

Reproduction

Analysis started: 2023-12-12 09:06:26.505241
Analysis finished: 2023-12-12 09:06:27.065834
Duration: 0.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
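
The reported duration can be recomputed from the two timestamps. A minimal sketch using only the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-12 09:06:26.505241")
finished = datetime.fromisoformat("2023-12-12 09:06:27.065834")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```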

Variables

회사명
Text

UNIQUE 

Distinct: 36
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

Length

Max length: 13
Median length: 9
Mean length: 6.8888889
Min length: 2

Characters and Unicode

Total characters: 248
Distinct characters: 100
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
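
As an illustration of such character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It runs on the five sample rows listed for this variable, not on the full column, so its counts differ from the report's totals; the helper name `category_counts` and the label mapping are illustrative assumptions, not part of ydata-profiling.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that occur in this variable.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "So": "Other Symbol",
    "Nd": "Decimal Number",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# The five sample rows shown for 회사명 (not the full 36-row column).
sample = ["(주)다산이노텍", "(주)다이유진코리아", "(주)디케이글로벌", "(주)삼우엠티엘", "(주)삼원"]
counts = category_counts(sample)
total = sum(counts.values())
for name, count in counts.most_common():
    print(f"{name}: {count} ({100 * count / total:.1f}%)")
```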

Unique

Unique: 36
Unique (%): 100.0%

Sample

1st row: (주)다산이노텍
2nd row: (주)다이유진코리아
3rd row: (주)디케이글로벌
4th row: (주)삼우엠티엘
5th row: (주)삼원
Value    Count    Frequency (%)
주식회사    3    7.3%
유림기업    1    2.4%
임고은    1    2.4%
[not recovered]    1    2.4%
프레임    1    2.4%
삼경    1    2.4%
삼화기업    1    2.4%
성웅텍스    1    2.4%
수인더스트리    1    2.4%
자광두류공장    1    2.4%
Other values (29)    29    70.7%
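
The frequency table above appears to count whitespace-separated tokens rather than whole values (an assumption; the report does not state its tokenizer). A minimal sketch of that kind of count, run on the ten company names visible in the last rows of the Sample section rather than on the full column:

```python
from collections import Counter

# Company names from the last rows shown in the Sample section.
names = [
    "임고은", "자광두류공장", "주식회사 대광하이텍", "주식회사 민투",
    "케이알에스티대한동방(주)", "태광산업(주)제1공장", "태광산업(주)제2공장",
    "태화방직(주)", "하나테크(주)", "혜명섬유(주)",
]

# Assumption: tokens are split on whitespace only, with no other normalization.
tokens = [token for name in names for token in name.split()]
token_counts = Counter(tokens)
n_tokens = len(tokens)

for token, count in token_counts.most_common(3):
    print(f"{token}    {count}    {100 * count / n_tokens:.1f}%")
```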

Most occurring characters

ValueCountFrequency (%)
25
 
10.1%
( 20
 
8.1%
) 20
 
8.1%
8
 
3.2%
8
 
3.2%
6
 
2.4%
6
 
2.4%
5
 
2.0%
5
 
2.0%
5
 
2.0%
Other values (90) 140
56.5%

Most occurring categories

Value    Count    Frequency (%)
Other Letter    199    80.2%
Open Punctuation    20    8.1%
Close Punctuation    20    8.1%
Space Separator    5    2.0%
Other Symbol    2    0.8%
Decimal Number    2    0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
12.6%
8
 
4.0%
8
 
4.0%
6
 
3.0%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
Other values (84) 122
61.3%
Decimal Number
Value    Count    Frequency (%)
2    1    50.0%
1    1    50.0%
Open Punctuation
Value    Count    Frequency (%)
(    20    100.0%
Close Punctuation
Value    Count    Frequency (%)
)    20    100.0%
Space Separator
Value    Count    Frequency (%)
(space)    5    100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    201    81.0%
Common    47    19.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
12.4%
8
 
4.0%
8
 
4.0%
6
 
3.0%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
Other values (85) 124
61.7%
Common
Value    Count    Frequency (%)
(    20    42.6%
)    20    42.6%
(space)    5    10.6%
2    1    2.1%
1    1    2.1%

Most occurring blocks

Value    Count    Frequency (%)
Hangul    199    80.2%
ASCII    47    19.0%
None    2    0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
 
12.6%
8
 
4.0%
8
 
4.0%
6
 
3.0%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
Other values (84) 122
61.3%
ASCII
Value    Count    Frequency (%)
(    20    42.6%
)    20    42.6%
(space)    5    10.6%
2    1    2.1%
1    1    2.1%
None
ValueCountFrequency (%)
2
100.0%

공장대표주소(지번)
Text

Distinct: 35
Distinct (%): 97.2%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

Length

Max length: 38
Median length: 29
Mean length: 25.083333
Min length: 18

Characters and Unicode

Total characters: 903
Distinct characters: 61
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 34
Unique (%): 94.4%
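
Distinct counts the different values in a column, while Unique counts the values that occur exactly once. That is why 35 distinct values can coexist with 34 unique ones across 36 rows: one address appears twice. The sketch below reproduces the two figures with hypothetical placeholder values, since the repeated address itself is not shown in this report.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values occurring exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Hypothetical column: 34 one-off values plus one value repeated twice (36 rows).
column = [f"address-{i}" for i in range(34)] + ["address-shared"] * 2
distinct, unique = distinct_and_unique(column)
n = len(column)

print(f"Distinct: {distinct} ({100 * distinct / n:.1f}%)")
print(f"Unique: {unique} ({100 * unique / n:.1f}%)")
```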

Sample

1st row: 경상북도 경주시 천북면 동산리 861-32번지 외 1필지
2nd row: 경상북도 경주시 천북면 신당리 12-5번지
3rd row: 경상북도 경주시 천북면 성지리 491-19
4th row: 경상북도 경주시 안강읍 두류리 370-1번지 외 5필지
5th row: 경상북도 경주시 외동읍 제내리 645-2번지 외 14필지
Value    Count    Frequency (%)
경상북도    36    17.9%
경주시    36    17.9%
외동읍    14    7.0%
[not recovered]    12    6.0%
1필지    9    4.5%
냉천리    7    3.5%
천북면    4    2.0%
황성동    4    2.0%
안강읍    4    2.0%
강동면    3    1.5%
Other values (60)    72    35.8%
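
Because every address starts with the province and city, the third whitespace-separated token identifies the eup, myeon, or dong. A minimal sketch of grouping by that token, run on the five sample rows above only; the helper name `town_of` and the fixed token position are assumptions based on the addresses shown here.

```python
from collections import Counter

def town_of(address):
    """Return the third whitespace-separated token (assumed eup/myeon/dong)."""
    parts = address.split()
    return parts[2] if len(parts) > 2 else None

# The five sample rows shown for this variable.
sample_addresses = [
    "경상북도 경주시 천북면 동산리 861-32번지 외 1필지",
    "경상북도 경주시 천북면 신당리 12-5번지",
    "경상북도 경주시 천북면 성지리 491-19",
    "경상북도 경주시 안강읍 두류리 370-1번지 외 5필지",
    "경상북도 경주시 외동읍 제내리 645-2번지 외 14필지",
]

town_counts = Counter(town_of(a) for a in sample_addresses)
for town, count in town_counts.most_common():
    print(town, count)
```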

Most occurring characters

ValueCountFrequency (%)
165
18.3%
72
 
8.0%
1 58
 
6.4%
41
 
4.5%
40
 
4.4%
36
 
4.0%
36
 
4.0%
36
 
4.0%
36
 
4.0%
30
 
3.3%
Other values (51) 353
39.1%

Most occurring categories

Value    Count    Frequency (%)
Other Letter    544    60.2%
Decimal Number    169    18.7%
Space Separator    165    18.3%
Dash Punctuation    23    2.5%
Open Punctuation    1    0.1%
Close Punctuation    1    0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
72
13.2%
41
 
7.5%
40
 
7.4%
36
 
6.6%
36
 
6.6%
36
 
6.6%
36
 
6.6%
30
 
5.5%
28
 
5.1%
26
 
4.8%
Other values (37) 163
30.0%
Decimal Number
Value    Count    Frequency (%)
1    58    34.3%
2    23    13.6%
4    15    8.9%
9    13    7.7%
0    13    7.7%
6    12    7.1%
5    11    6.5%
3    9    5.3%
7    8    4.7%
8    7    4.1%
Space Separator
Value    Count    Frequency (%)
(space)    165    100.0%
Dash Punctuation
Value    Count    Frequency (%)
-    23    100.0%
Open Punctuation
Value    Count    Frequency (%)
(    1    100.0%
Close Punctuation
Value    Count    Frequency (%)
)    1    100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    544    60.2%
Common    359    39.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
72
13.2%
41
 
7.5%
40
 
7.4%
36
 
6.6%
36
 
6.6%
36
 
6.6%
36
 
6.6%
30
 
5.5%
28
 
5.1%
26
 
4.8%
Other values (37) 163
30.0%
Common
Value    Count    Frequency (%)
(space)    165    46.0%
1    58    16.2%
-    23    6.4%
2    23    6.4%
4    15    4.2%
9    13    3.6%
0    13    3.6%
6    12    3.3%
5    11    3.1%
3    9    2.5%
Other values (4)    17    4.7%

Most occurring blocks

Value    Count    Frequency (%)
Hangul    544    60.2%
ASCII    359    39.8%

Most frequent character per block

ASCII
Value    Count    Frequency (%)
(space)    165    46.0%
1    58    16.2%
-    23    6.4%
2    23    6.4%
4    15    4.2%
9    13    3.6%
0    13    3.6%
6    12    3.3%
5    11    3.1%
3    9    2.5%
Other values (4)    17    4.7%
Hangul
ValueCountFrequency (%)
72
13.2%
41
 
7.5%
40
 
7.4%
36
 
6.6%
36
 
6.6%
36
 
6.6%
36
 
6.6%
30
 
5.5%
28
 
5.1%
26
 
4.8%
Other values (37) 163
30.0%

생산품
Text

UNIQUE 

Distinct: 36
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

Length

Max length: 22
Median length: 16
Mean length: 8.8055556
Min length: 2

Characters and Unicode

Total characters: 317
Distinct characters: 146
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 36
Unique (%): 100.0%

Sample

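1st row: 점착테이프 제조용 원단 제조 (manufacture of base fabric for adhesive tape)
2nd row: 방석 및 매트 (cushions and mats)
3rd row: 마스크(KF94, KF80, KF-AD) (masks: KF94, KF80, KF-AD)
4th row: 부직포 등 (nonwoven fabric, etc.)
5th row: 부직포, 펠트 (nonwoven fabric, felt)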
Value    Count    Frequency (%)
부직포    4    5.6%
[not recovered]    2    2.8%
매트    2    2.8%
면직물    2    2.8%
염색    2    2.8%
점착테이프    1    1.4%
흡음재    1    1.4%
휄트    1    1.4%
캔버스    1    1.4%
회화용    1    1.4%
Other values (54)    54    76.1%

Most occurring characters

ValueCountFrequency (%)
35
 
11.0%
, 16
 
5.0%
8
 
2.5%
7
 
2.2%
7
 
2.2%
6
 
1.9%
5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
Other values (136) 218
68.8%

Most occurring categories

Value    Count    Frequency (%)
Other Letter    215    67.8%
Space Separator    35    11.0%
Uppercase Letter    24    7.6%
Other Punctuation    17    5.4%
Lowercase Letter    12    3.8%
Decimal Number    5    1.6%
Close Punctuation    4    1.3%
Open Punctuation    4    1.3%
Dash Punctuation    1    0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
3.7%
7
 
3.3%
7
 
3.3%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
Other values (106) 158
73.5%
Uppercase Letter
Value    Count    Frequency (%)
F    4    16.7%
P    3    12.5%
C    3    12.5%
K    3    12.5%
B    2    8.3%
R    2    8.3%
E    2    8.3%
D    1    4.2%
A    1    4.2%
I    1    4.2%
Other values (2)    2    8.3%
Lowercase Letter
Value    Count    Frequency (%)
l    3    25.0%
a    2    16.7%
t    2    16.7%
p    1    8.3%
e    1    8.3%
o    1    8.3%
h    1    8.3%
g    1    8.3%
Decimal Number
Value    Count    Frequency (%)
4    2    40.0%
0    1    20.0%
8    1    20.0%
9    1    20.0%
Other Punctuation
Value    Count    Frequency (%)
,    16    94.1%
/    1    5.9%
Space Separator
Value    Count    Frequency (%)
(space)    35    100.0%
Close Punctuation
Value    Count    Frequency (%)
)    4    100.0%
Open Punctuation
Value    Count    Frequency (%)
(    4    100.0%
Dash Punctuation
Value    Count    Frequency (%)
-    1    100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    215    67.8%
Common    66    20.8%
Latin    36    11.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8
 
3.7%
7
 
3.3%
7
 
3.3%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
Other values (106) 158
73.5%
Latin
Value    Count    Frequency (%)
F    4    11.1%
P    3    8.3%
C    3    8.3%
K    3    8.3%
l    3    8.3%
a    2    5.6%
B    2    5.6%
t    2    5.6%
R    2    5.6%
E    2    5.6%
Other values (10)    10    27.8%
Common
Value    Count    Frequency (%)
(space)    35    53.0%
,    16    24.2%
)    4    6.1%
(    4    6.1%
4    2    3.0%
-    1    1.5%
/    1    1.5%
0    1    1.5%
8    1    1.5%
9    1    1.5%

Most occurring blocks

Value    Count    Frequency (%)
Hangul    215    67.8%
ASCII    102    32.2%

Most frequent character per block

ASCII
Value    Count    Frequency (%)
(space)    35    34.3%
,    16    15.7%
)    4    3.9%
F    4    3.9%
(    4    3.9%
P    3    2.9%
C    3    2.9%
K    3    2.9%
l    3    2.9%
a    2    2.0%
Other values (20)    25    24.5%
Hangul
ValueCountFrequency (%)
8
 
3.7%
7
 
3.3%
7
 
3.3%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
4
 
1.9%
Other values (106) 158
73.5%

주원자재
Text

MISSING 

Distinct: 31
Distinct (%): 96.9%
Missing: 4
Missing (%): 11.1%
Memory size: 420.0 B

Length

Max length: 25
Median length: 18
Mean length: 7.28125
Min length: 1

Characters and Unicode

Total characters: 233
Distinct characters: 106
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
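
The reported mean lengths are consistent with total characters divided by the number of non-missing values, which matters for this variable because 4 of its 36 rows are missing. The sketch below checks that reading against the totals printed in this report; the formula itself is an inference from those numbers, not something the report states.

```python
# Total characters and missing counts per variable, copied from this report.
n_rows = 36
variables = {
    "회사명": (248, 0),
    "공장대표주소(지번)": (903, 0),
    "생산품": (317, 0),
    "주원자재": (233, 4),
}

# Assumption: mean length = total characters / non-missing values.
mean_length = {
    name: total / (n_rows - missing)
    for name, (total, missing) in variables.items()
}

for name, value in mean_length.items():
    print(f"{name}: {value:.5f}")
```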

Unique

Unique: 30
Unique (%): 93.8%

Sample

1st row: 부직포, 비말필터, EARING, 코 심 (nonwoven fabric, droplet filter, EARING, nose wire)
2nd row: 섬유 (fiber)
3rd row: FIBER
4th row: 폴리머, 철 (polymer, iron)
5th row: PP/PE
Value    Count    Frequency (%)
부직포    5    8.8%
원사    4    7.0%
[not recovered]    2    3.5%
[not recovered]    2    3.5%
파이프    1    1.8%
[not recovered]    1    1.8%
n/felt    1    1.8%
펠트    1    1.8%
목화    1    1.8%
쥬트로프(황마로프    1    1.8%
Other values (38)    38    66.7%

Most occurring characters

ValueCountFrequency (%)
25
 
10.7%
, 18
 
7.7%
7
 
3.0%
6
 
2.6%
6
 
2.6%
6
 
2.6%
6
 
2.6%
P 6
 
2.6%
E 5
 
2.1%
5
 
2.1%
Other values (96) 143
61.4%

Most occurring categories

Value    Count    Frequency (%)
Other Letter    159    68.2%
Space Separator    25    10.7%
Uppercase Letter    25    10.7%
Other Punctuation    20    8.6%
Close Punctuation    2    0.9%
Open Punctuation    2    0.9%
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
 
4.4%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
3
 
1.9%
Other values (80) 107
67.3%
Uppercase Letter
Value    Count    Frequency (%)
P    6    24.0%
E    5    20.0%
I    2    8.0%
N    2    8.0%
T    2    8.0%
R    2    8.0%
F    2    8.0%
L    1    4.0%
G    1    4.0%
A    1    4.0%
Other Punctuation
Value    Count    Frequency (%)
,    18    90.0%
/    2    10.0%
Space Separator
Value    Count    Frequency (%)
(space)    25    100.0%
Close Punctuation
Value    Count    Frequency (%)
)    2    100.0%
Open Punctuation
Value    Count    Frequency (%)
(    2    100.0%

Most occurring scripts

Value    Count    Frequency (%)
Hangul    159    68.2%
Common    49    21.0%
Latin    25    10.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
 
4.4%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
3
 
1.9%
Other values (80) 107
67.3%
Latin
Value    Count    Frequency (%)
P    6    24.0%
E    5    20.0%
I    2    8.0%
N    2    8.0%
T    2    8.0%
R    2    8.0%
F    2    8.0%
L    1    4.0%
G    1    4.0%
A    1    4.0%
Common
Value    Count    Frequency (%)
(space)    25    51.0%
,    18    36.7%
)    2    4.1%
(    2    4.1%
/    2    4.1%

Most occurring blocks

Value    Count    Frequency (%)
Hangul    159    68.2%
ASCII    74    31.8%

Most frequent character per block

ASCII
Value    Count    Frequency (%)
(space)    25    33.8%
,    18    24.3%
P    6    8.1%
E    5    6.8%
)    2    2.7%
I    2    2.7%
(    2    2.7%
N    2    2.7%
/    2    2.7%
T    2    2.7%
Other values (6)    8    10.8%
Hangul
ValueCountFrequency (%)
7
 
4.4%
6
 
3.8%
6
 
3.8%
6
 
3.8%
6
 
3.8%
5
 
3.1%
5
 
3.1%
4
 
2.5%
4
 
2.5%
3
 
1.9%
Other values (80) 107
67.3%

Correlations

Variable    회사명    공장대표주소(지번)    생산품    주원자재
회사명    1.000    1.000    1.000    1.000
공장대표주소(지번)    1.000    1.000    1.000    0.996
생산품    1.000    1.000    1.000    1.000
주원자재    1.000    0.996    1.000    1.000
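
The report does not say how these coefficients were computed. For categorical or text columns a common choice is Cramér's V, and under that assumption columns made up almost entirely of unique values score at or near 1 whether or not they are really related, so the matrix above should be read with caution. The sketch below implements the uncorrected Cramér's V in pure Python on hypothetical placeholder data; the helper name `cramers_v` is illustrative.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    # Sum (observed - expected)^2 / expected over every cell of the contingency table.
    for x, rx in row_totals.items():
        for y, cy in col_totals.items():
            expected = rx * cy / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Hypothetical data: 36 rows in which both columns are entirely unique.
names = [f"company-{i}" for i in range(36)]
products = [f"product-{i}" for i in range(36)]
v_unique = cramers_v(names, products)

# For contrast, two independent binary columns give 0.
v_independent = cramers_v(["x", "x", "y", "y"], ["p", "q", "p", "q"])

print(round(v_unique, 3), round(v_independent, 3))
```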

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
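
The per-column nullity summary amounts to counting missing entries in each column. The sketch below uses hypothetical placeholder rows shaped like this dataset, with `None` standing in for `<NA>`; only the totals (36 rows, 4 missing values in 주원자재) are taken from the report, and the helper name `missing_by_column` is illustrative.

```python
def missing_by_column(rows, columns):
    """Return {column: (missing count, missing percent)} for a list of dict rows."""
    n = len(rows)
    summary = {}
    for col in columns:
        missing = sum(1 for row in rows if row.get(col) is None)
        summary[col] = (missing, 100 * missing / n if n else 0.0)
    return summary

columns = ["회사명", "공장대표주소(지번)", "생산품", "주원자재"]

# Hypothetical placeholder rows: 36 records, the first 4 lacking 주원자재.
rows = [
    {
        "회사명": f"company-{i}",
        "공장대표주소(지번)": f"address-{i}",
        "생산품": f"product-{i}",
        "주원자재": None if i < 4 else f"material-{i}",
    }
    for i in range(36)
]

summary = missing_by_column(rows, columns)
for col, (count, pct) in summary.items():
    print(f"{col}: {count} missing ({pct:.1f}%)")
```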

Sample

회사명공장대표주소(지번)생산품주원자재
0(주)다산이노텍경상북도 경주시 천북면 동산리 861-32번지 외 1필지점착테이프 제조용 원단 제조<NA>
1(주)다이유진코리아경상북도 경주시 천북면 신당리 12-5번지방석 및 매트<NA>
2(주)디케이글로벌경상북도 경주시 천북면 성지리 491-19마스크(KF94, KF80, KF-AD)부직포, 비말필터, EARING, 코 심
3(주)삼우엠티엘경상북도 경주시 안강읍 두류리 370-1번지 외 5필지부직포 등섬유
4(주)삼원경상북도 경주시 외동읍 제내리 645-2번지 외 14필지부직포, 펠트FIBER
5(주)삼원냉천공장경상북도 경주시 외동읍 냉천리 1156-2번지자동차용발판 , 발판고정용 금속환폴리머, 철
6(주)엠시피경상북도 경주시 외동읍 냉천리 1156-12번지PP/PE Cloth, FIBC BagPP/PE
7(주)월핀경상북도 경주시 외동읍 모화리 50-91번지지오그리드PET그리스원사, 코팅용잉크, 지관, PP제품
8(주)지음이공일공경상북도 경주시 외동읍 냉천리 432번지 외 1필지콘크리트 섬유보강재<NA>
9(주)청오산업경상북도 경주시 외동읍 모화리 209-4번지 외 1필지헤드레스트(베게포)각종커버천연가죽,인조가족,직물
회사명공장대표주소(지번)생산품주원자재
26임고은경상북도 경주시 하동 201-24번지천,의류 염색섬류
27자광두류공장경상북도 경주시 안강읍 양월리 691-2번지이중공간지(폴리에스터 매트)폴리에스터 원사
28주식회사 대광하이텍경상북도 경주시 외동읍 구어리 1041철제pallet, 대형텐트하우스파이프, 바퀴(캐스터), 내외부 포장재, 천막
29주식회사 민투경상북도 경주시 외동읍 녹동리 55-5 1층산업용 방진마스크부직포
30케이알에스티대한동방(주)경상북도 경주시 강동면 왕신리 398-5번지 외 1필지부직포, 브러쉬부직포, 세라믹디스크
31태광산업(주)제1공장경상북도 경주시 황성동 1069번지면직물면사
32태광산업(주)제2공장경상북도 경주시 황성동 1068번지순면사원면
33태화방직(주)경상북도 경주시 외동읍 모화리 1410번지섬유나일론
34하나테크(주)경상북도 경주시 구황동 226번지생사,실크잠견
35혜명섬유(주)경상북도 경주시 서면 아화리 997번지소모사, 혼방모사