Overview

Dataset statistics

Number of variables: 4
Number of observations: 730
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 1
Duplicate rows (%): 0.1%
Total size in memory: 22.9 KiB
Average record size in memory: 32.2 B

Variable types

Text: 4

Dataset

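Description: A system that computerizes the registration of heat treatment companies, the management of disinfection marks, and per-company heat treatment records (disinfection plans, heat treatment disinfection work reports, raw material requirement confirmations, etc.).
Author: Animal and Plant Quarantine Agency, Ministry of Agriculture, Food and Rural Affairs (농림축산식품부 농림축산검역본부)
URL: https://www.data.go.kr/data/3055529/fileData.do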

Alerts

Duplicates: Dataset has 1 (0.1%) duplicate row

Reproduction

Analysis started: 2023-12-12 00:47:14.168487
Analysis finished: 2023-12-12 00:47:15.079094
Duration: 0.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

업체명 (company name)
Text

Distinct: 704
Distinct (%): 96.4%
Missing: 0
Missing (%): 0.0%
Memory size: 5.8 KiB

Length

Max length: 21
Median length: 14
Mean length: 7.1219178
Min length: 2
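
The length figures are per-value character counts. The sketch below shows one way such figures could be computed, assuming the values are plain Python strings. It uses only the five sample rows listed for this variable, not the full 730-row column, so its numbers differ from the report; `length_stats` is a hypothetical helper name.

```python
from statistics import mean, median

# Five sample values from the 업체명 column (not the full dataset).
samples = [
    "진성산업㈜",
    "(주)뉴-그린",
    "(주)신흥지엔티 평택공장",
    "㈜한성플랜지",
    "대진산업(주)",
]


def length_stats(values):
    """Return max, median, mean and min character length of the given strings."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }


stats = length_stats(samples)
print(stats)
```

`len` counts Unicode code points, so the enclosed symbol ㈜ counts as one character while the spelled-out form (주) counts as three.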

Characters and Unicode

Total characters: 5199
Distinct characters: 293
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
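
As a rough illustration of these character properties, the sketch below counts the Unicode general category of each character in one sample value using Python's standard `unicodedata` module. The report prints long category names; `unicodedata` returns the two-letter codes (`Lo` = Other Letter, `Ps` = Open Punctuation, `Pe` = Close Punctuation, `Pd` = Dash Punctuation, `So` = Other Symbol). `category_counts` is a hypothetical helper, and this is not necessarily how the profiling tool computes its tables.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the Unicode general category code of every character in text."""
    return Counter(unicodedata.category(ch) for ch in text)


# One sample value from the 업체명 column.
counts = category_counts("(주)뉴-그린")
print(counts)

# The enclosed symbol that appears in several company names.
symbol_category = unicodedata.category("㈜")
print(symbol_category)
```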

Unique

Unique: 683
Unique (%): 93.6%
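
The two counts differ because "Distinct" counts how many different values occur, while "Unique" counts values that occur exactly once. A minimal sketch of that distinction, assuming this reading of the report's terms; the `toy` list is hypothetical illustration data, not taken from the dataset:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    n_distinct = len(counts)
    n_unique = sum(1 for c in counts.values() if c == 1)
    return n_distinct, n_unique


# Hypothetical toy column: B and C repeat, A and D occur once.
toy = ["A", "B", "B", "C", "C", "C", "D"]
n_distinct, n_unique = distinct_and_unique(toy)
print(n_distinct, n_unique)

# The percentages in the report are the counts divided by the 730 observations.
distinct_pct = round(100 * 704 / 730, 1)
unique_pct = round(100 * 683 / 730, 1)
print(distinct_pct, unique_pct)
```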

Sample

1st row: 진성산업㈜
2nd row: (주)뉴-그린
3rd row: (주)신흥지엔티 평택공장
4th row: ㈜한성플랜지
5th row: 대진산업(주)
Value | Count | Frequency (%)
주식회사 | 54 | 6.7%
주)한성목재 | 4 | 0.5%
대림수출포장 | 3 | 0.4%
한국수출포장 | 3 | 0.4%
벽암산업(주 | 3 | 0.4%
한성수출포장 | 3 | 0.4%
수출포장 | 3 | 0.4%
[not rendered] | 3 | 0.4%
주)성은글로벌 | 3 | 0.4%
정우수출포장 | 2 | 0.2%
Other values (713) | 731 | 90.0%

Most occurring characters

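Value | Count | Frequency (%)
[not rendered] | 356 | 6.8%
[not rendered] | 298 | 5.7%
( | 291 | 5.6%
) | 291 | 5.6%
[not rendered] | 290 | 5.6%
[not rendered] | 217 | 4.2%
[not rendered] | 208 | 4.0%
[not rendered] | 194 | 3.7%
[not rendered] | 190 | 3.7%
[not rendered] | 117 | 2.3%
Other values (283) | 2747 | 52.8%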

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4453 | 85.7%
Open Punctuation | 291 | 5.6%
Close Punctuation | 291 | 5.6%
Space Separator | 82 | 1.6%
Other Symbol | 37 | 0.7%
Uppercase Letter | 31 | 0.6%
Lowercase Letter | 7 | 0.1%
Decimal Number | 3 | 0.1%
Other Punctuation | 3 | 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[not rendered] | 356 | 8.0%
[not rendered] | 298 | 6.7%
[not rendered] | 290 | 6.5%
[not rendered] | 217 | 4.9%
[not rendered] | 208 | 4.7%
[not rendered] | 194 | 4.4%
[not rendered] | 190 | 4.3%
[not rendered] | 117 | 2.6%
[not rendered] | 108 | 2.4%
[not rendered] | 98 | 2.2%
Other values (252) | 2377 | 53.4%

Uppercase Letter
Value | Count | Frequency (%)
S | 5 | 16.1%
K | 4 | 12.9%
E | 2 | 6.5%
D | 2 | 6.5%
L | 2 | 6.5%
J | 2 | 6.5%
G | 2 | 6.5%
B | 2 | 6.5%
C | 2 | 6.5%
O | 2 | 6.5%
Other values (6) | 6 | 19.4%

Lowercase Letter
Value | Count | Frequency (%)
h | 1 | 14.3%
c | 1 | 14.3%
e | 1 | 14.3%
s | 1 | 14.3%
i | 1 | 14.3%
g | 1 | 14.3%
o | 1 | 14.3%

Other Punctuation
Value | Count | Frequency (%)
& | 2 | 66.7%
. | 1 | 33.3%

Open Punctuation
Value | Count | Frequency (%)
( | 291 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 291 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 82 | 100.0%

Other Symbol
Value | Count | Frequency (%)
[not rendered] | 37 | 100.0%

Decimal Number
Value | Count | Frequency (%)
2 | 3 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4490 | 86.4%
Common | 671 | 12.9%
Latin | 38 | 0.7%

Most frequent character per script

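Hangul
Value | Count | Frequency (%)
[not rendered] | 356 | 7.9%
[not rendered] | 298 | 6.6%
[not rendered] | 290 | 6.5%
[not rendered] | 217 | 4.8%
[not rendered] | 208 | 4.6%
[not rendered] | 194 | 4.3%
[not rendered] | 190 | 4.2%
[not rendered] | 117 | 2.6%
[not rendered] | 108 | 2.4%
[not rendered] | 98 | 2.2%
Other values (253) | 2414 | 53.8%

Latin
Value | Count | Frequency (%)
S | 5 | 13.2%
K | 4 | 10.5%
E | 2 | 5.3%
D | 2 | 5.3%
L | 2 | 5.3%
J | 2 | 5.3%
G | 2 | 5.3%
B | 2 | 5.3%
C | 2 | 5.3%
O | 2 | 5.3%
Other values (13) | 13 | 34.2%

Common
Value | Count | Frequency (%)
( | 291 | 43.4%
) | 291 | 43.4%
(space) | 82 | 12.2%
2 | 3 | 0.4%
& | 2 | 0.3%
. | 1 | 0.1%
- | 1 | 0.1%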

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4453 | 85.7%
ASCII | 709 | 13.6%
None | 37 | 0.7%

Most frequent character per block

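Hangul
Value | Count | Frequency (%)
[not rendered] | 356 | 8.0%
[not rendered] | 298 | 6.7%
[not rendered] | 290 | 6.5%
[not rendered] | 217 | 4.9%
[not rendered] | 208 | 4.7%
[not rendered] | 194 | 4.4%
[not rendered] | 190 | 4.3%
[not rendered] | 117 | 2.6%
[not rendered] | 108 | 2.4%
[not rendered] | 98 | 2.2%
Other values (252) | 2377 | 53.4%

ASCII
Value | Count | Frequency (%)
( | 291 | 41.0%
) | 291 | 41.0%
(space) | 82 | 11.6%
S | 5 | 0.7%
K | 4 | 0.6%
2 | 3 | 0.4%
E | 2 | 0.3%
D | 2 | 0.3%
L | 2 | 0.3%
J | 2 | 0.3%
Other values (20) | 25 | 3.5%

None
Value | Count | Frequency (%)
[not rendered] | 37 | 100.0%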

주소 (address)
Text

Distinct: 720
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 5.8 KiB

Length

Max length: 36
Median length: 30
Mean length: 21.70274
Min length: 14

Characters and Unicode

Total characters: 15843
Distinct characters: 317
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 710
Unique (%): 97.3%

Sample

1st row: 인천광역시 서구 북항로363번길 58
2nd row: 경기도 화성시 향남읍 토성로359번길 10
3rd row: 경기도 평택시 은실5길 90
4th row: 인천광역시 남동구 앵고개로 659
5th row: 전라남도 여수시 소라면 화양로 1938-1
Value | Count | Frequency (%)
경기도 | 175 | 5.0%
경상남도 | 132 | 3.8%
경상북도 | 93 | 2.6%
부산광역시 | 68 | 1.9%
인천광역시 | 64 | 1.8%
화성시 | 61 | 1.7%
강서구 | 57 | 1.6%
김해시 | 54 | 1.5%
충청남도 | 46 | 1.3%
서구 | 45 | 1.3%
Other values (1442) | 2722 | 77.4%

Most occurring characters

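Value | Count | Frequency (%)
(space) | 2794 | 17.6%
[not rendered] | 674 | 4.3%
1 | 593 | 3.7%
[not rendered] | 548 | 3.5%
[not rendered] | 544 | 3.4%
[not rendered] | 441 | 2.8%
2 | 404 | 2.6%
[not rendered] | 376 | 2.4%
[not rendered] | 353 | 2.2%
3 | 348 | 2.2%
Other values (307) | 8768 | 55.3%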

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 9849 | 62.2%
Decimal Number | 2932 | 18.5%
Space Separator | 2794 | 17.6%
Dash Punctuation | 251 | 1.6%
Uppercase Letter | 11 | 0.1%
Other Punctuation | 4 | < 0.1%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
[not rendered] | 674 | 6.8%
[not rendered] | 548 | 5.6%
[not rendered] | 544 | 5.5%
[not rendered] | 441 | 4.5%
[not rendered] | 376 | 3.8%
[not rendered] | 353 | 3.6%
[not rendered] | 343 | 3.5%
[not rendered] | 301 | 3.1%
[not rendered] | 293 | 3.0%
[not rendered] | 254 | 2.6%
Other values (286) | 5722 | 58.1%

Decimal Number
Value | Count | Frequency (%)
1 | 593 | 20.2%
2 | 404 | 13.8%
3 | 348 | 11.9%
4 | 274 | 9.3%
6 | 248 | 8.5%
7 | 235 | 8.0%
5 | 215 | 7.3%
9 | 215 | 7.3%
8 | 210 | 7.2%
0 | 190 | 6.5%

Uppercase Letter
Value | Count | Frequency (%)
B | 5 | 45.5%
L | 3 | 27.3%
A | 1 | 9.1%
D | 1 | 9.1%
E | 1 | 9.1%

Other Punctuation
Value | Count | Frequency (%)
, | 3 | 75.0%
/ | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 2794 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 251 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 9849 | 62.2%
Common | 5983 | 37.8%
Latin | 11 | 0.1%

Most frequent character per script

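Hangul
Value | Count | Frequency (%)
[not rendered] | 674 | 6.8%
[not rendered] | 548 | 5.6%
[not rendered] | 544 | 5.5%
[not rendered] | 441 | 4.5%
[not rendered] | 376 | 3.8%
[not rendered] | 353 | 3.6%
[not rendered] | 343 | 3.5%
[not rendered] | 301 | 3.1%
[not rendered] | 293 | 3.0%
[not rendered] | 254 | 2.6%
Other values (286) | 5722 | 58.1%

Common
Value | Count | Frequency (%)
(space) | 2794 | 46.7%
1 | 593 | 9.9%
2 | 404 | 6.8%
3 | 348 | 5.8%
4 | 274 | 4.6%
- | 251 | 4.2%
6 | 248 | 4.1%
7 | 235 | 3.9%
5 | 215 | 3.6%
9 | 215 | 3.6%
Other values (6) | 406 | 6.8%

Latin
Value | Count | Frequency (%)
B | 5 | 45.5%
L | 3 | 27.3%
A | 1 | 9.1%
D | 1 | 9.1%
E | 1 | 9.1%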

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 9849 | 62.2%
ASCII | 5994 | 37.8%

Most frequent character per block

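ASCII
Value | Count | Frequency (%)
(space) | 2794 | 46.6%
1 | 593 | 9.9%
2 | 404 | 6.7%
3 | 348 | 5.8%
4 | 274 | 4.6%
- | 251 | 4.2%
6 | 248 | 4.1%
7 | 235 | 3.9%
5 | 215 | 3.6%
9 | 215 | 3.6%
Other values (11) | 417 | 7.0%

Hangul
Value | Count | Frequency (%)
[not rendered] | 674 | 6.8%
[not rendered] | 548 | 5.6%
[not rendered] | 544 | 5.5%
[not rendered] | 441 | 4.5%
[not rendered] | 376 | 3.8%
[not rendered] | 353 | 3.6%
[not rendered] | 343 | 3.5%
[not rendered] | 301 | 3.1%
[not rendered] | 293 | 3.0%
[not rendered] | 254 | 2.6%
Other values (286) | 5722 | 58.1%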

전화번호 (phone number)
Text

Distinct: 726
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Memory size: 5.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.010959
Min length: 11

Characters and Unicode

Total characters: 8768
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 722
Unique (%): 98.9%

Sample

1st row: 032-575-7600
2nd row: 031-354-3100
3rd row: 031-652-5451
4th row: 032-813-6340
5th row: 061-683-6363
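
The sample values, the 11 distinct characters (ten digits plus the hyphen) and the 11 to 13 character length range all suggest a hyphen-separated phone format. The sketch below checks values against one assumed pattern: a 2 to 3 digit area code, a 3 to 4 digit exchange and a 4 digit line number. The pattern is an assumption inferred from this report, not a documented rule of the dataset, and `is_well_formed` is a hypothetical helper.

```python
import re

# Assumed format: 2-3 digit area code, 3-4 digit exchange, 4 digit line number.
# The shortest match is 11 characters and the longest is 13.
PHONE_RE = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")


def is_well_formed(phone):
    """Return True if the whole string matches the assumed phone pattern."""
    return PHONE_RE.fullmatch(phone) is not None


# The five sample rows of the phone number column.
samples = [
    "032-575-7600",
    "031-354-3100",
    "031-652-5451",
    "032-813-6340",
    "061-683-6363",
]
results = [is_well_formed(p) for p in samples]
print(results)

# A value without hyphens is rejected by this pattern.
no_hyphens_ok = is_well_formed("0325757600")
print(no_hyphens_ok)
```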
Value | Count | Frequency (%)
061-683-9171 | 2 | 0.3%
062-953-3873 | 2 | 0.3%
044-251-6221 | 2 | 0.3%
031-222-1251 | 2 | 0.3%
031-984-4304 | 1 | 0.1%
054-776-9421 | 1 | 0.1%
053-851-9566 | 1 | 0.1%
032-575-7600 | 1 | 0.1%
031-692-3233 | 1 | 0.1%
054-745-0510 | 1 | 0.1%
Other values (716) | 716 | 98.1%

Most occurring characters

Value | Count | Frequency (%)
- | 1460 | 16.7%
0 | 1189 | 13.6%
5 | 1065 | 12.1%
3 | 922 | 10.5%
1 | 782 | 8.9%
4 | 684 | 7.8%
2 | 677 | 7.7%
6 | 572 | 6.5%
7 | 506 | 5.8%
8 | 482 | 5.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 7308 | 83.3%
Dash Punctuation | 1460 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 1189 | 16.3%
5 | 1065 | 14.6%
3 | 922 | 12.6%
1 | 782 | 10.7%
4 | 684 | 9.4%
2 | 677 | 9.3%
6 | 572 | 7.8%
7 | 506 | 6.9%
8 | 482 | 6.6%
9 | 429 | 5.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 1460 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 8768 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 1460 | 16.7%
0 | 1189 | 13.6%
5 | 1065 | 12.1%
3 | 922 | 10.5%
1 | 782 | 8.9%
4 | 684 | 7.8%
2 | 677 | 7.7%
6 | 572 | 6.5%
7 | 506 | 5.8%
8 | 482 | 5.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 8768 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 1460 | 16.7%
0 | 1189 | 13.6%
5 | 1065 | 12.1%
3 | 922 | 10.5%
1 | 782 | 8.9%
4 | 684 | 7.8%
2 | 677 | 7.7%
6 | 572 | 6.5%
7 | 506 | 5.8%
8 | 482 | 5.5%

대표자명 (representative name)
Text

Distinct: 699
Distinct (%): 95.8%
Missing: 0
Missing (%): 0.0%
Memory size: 5.8 KiB

Length

Max length: 13
Median length: 3
Mean length: 3.1123288
Min length: 2

Characters and Unicode

Total characters: 2272
Distinct characters: 201
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 674
Unique (%): 92.3%

Sample

1st row: 홍진기
2nd row: 이윤기
3rd row: 구본재
4th row: 김진안
5th row: 안명욱
Value | Count | Frequency (%)
구재본 | 6 | 0.8%
1명 | 6 | 0.8%
[not rendered] | 4 | 0.5%
성원제 | 3 | 0.4%
이은학 | 3 | 0.4%
이기황 | 2 | 0.3%
송기현 | 2 | 0.3%
[not rendered] | 2 | 0.3%
박상복 | 2 | 0.3%
[not rendered] | 2 | 0.3%
Other values (704) | 725 | 95.8%

Most occurring characters

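Value | Count | Frequency (%)
[not rendered] | 132 | 5.8%
[not rendered] | 122 | 5.4%
[not rendered] | 81 | 3.6%
[not rendered] | 59 | 2.6%
[not rendered] | 56 | 2.5%
[not rendered] | 49 | 2.2%
[not rendered] | 42 | 1.8%
[not rendered] | 40 | 1.8%
[not rendered] | 39 | 1.7%
[not rendered] | 36 | 1.6%
Other values (191) | 1616 | 71.1%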

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2226 | 98.0%
Space Separator | 31 | 1.4%
Other Punctuation | 9 | 0.4%
Decimal Number | 6 | 0.3%

Most frequent character per category

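Other Letter
Value | Count | Frequency (%)
[not rendered] | 132 | 5.9%
[not rendered] | 122 | 5.5%
[not rendered] | 81 | 3.6%
[not rendered] | 59 | 2.7%
[not rendered] | 56 | 2.5%
[not rendered] | 49 | 2.2%
[not rendered] | 42 | 1.9%
[not rendered] | 40 | 1.8%
[not rendered] | 39 | 1.8%
[not rendered] | 36 | 1.6%
Other values (188) | 1570 | 70.5%

Space Separator
Value | Count | Frequency (%)
(space) | 31 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 9 | 100.0%

Decimal Number
Value | Count | Frequency (%)
1 | 6 | 100.0%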

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2226 | 98.0%
Common | 46 | 2.0%

Most frequent character per script

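Hangul
Value | Count | Frequency (%)
[not rendered] | 132 | 5.9%
[not rendered] | 122 | 5.5%
[not rendered] | 81 | 3.6%
[not rendered] | 59 | 2.7%
[not rendered] | 56 | 2.5%
[not rendered] | 49 | 2.2%
[not rendered] | 42 | 1.9%
[not rendered] | 40 | 1.8%
[not rendered] | 39 | 1.8%
[not rendered] | 36 | 1.6%
Other values (188) | 1570 | 70.5%

Common
Value | Count | Frequency (%)
(space) | 31 | 67.4%
, | 9 | 19.6%
1 | 6 | 13.0%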

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2226 | 98.0%
ASCII | 46 | 2.0%

Most frequent character per block

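Hangul
Value | Count | Frequency (%)
[not rendered] | 132 | 5.9%
[not rendered] | 122 | 5.5%
[not rendered] | 81 | 3.6%
[not rendered] | 59 | 2.7%
[not rendered] | 56 | 2.5%
[not rendered] | 49 | 2.2%
[not rendered] | 42 | 1.9%
[not rendered] | 40 | 1.8%
[not rendered] | 39 | 1.8%
[not rendered] | 36 | 1.6%
Other values (188) | 1570 | 70.5%

ASCII
Value | Count | Frequency (%)
(space) | 31 | 67.4%
, | 9 | 19.6%
1 | 6 | 13.0%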

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
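
The idea behind a nullity matrix can be sketched without any plotting: mark each cell as present or missing, then count the missing cells per column. The sketch below assumes rows are dictionaries keyed by the four column names and treats `None` and the empty string as missing (an assumption). The second row is hypothetical and added only to show a missing cell, since this dataset reports 0 missing cells.

```python
def nullity_matrix(rows, columns):
    """Return one list of booleans per row: True where the cell is present."""
    return [[row.get(col) not in (None, "") for col in columns] for row in rows]


columns = ["업체명", "주소", "전화번호", "대표자명"]
rows = [
    # First row of the dataset sample.
    {
        "업체명": "진성산업㈜",
        "주소": "인천광역시 서구 북항로363번길 58",
        "전화번호": "032-575-7600",
        "대표자명": "홍진기",
    },
    # Hypothetical row with a missing phone number, for illustration only.
    {"업체명": "예시업체", "주소": "예시 주소", "전화번호": None, "대표자명": "홍길동"},
]

matrix = nullity_matrix(rows, columns)

# Count missing cells per column by reading down each matrix column.
missing_per_column = {
    col: sum(not present[i] for present in matrix)
    for i, col in enumerate(columns)
}
print(matrix)
print(missing_per_column)
```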

Sample

Index | 업체명 | 주소 | 전화번호 | 대표자명
0 | 진성산업㈜ | 인천광역시 서구 북항로363번길 58 | 032-575-7600 | 홍진기
1 | (주)뉴-그린 | 경기도 화성시 향남읍 토성로359번길 10 | 031-354-3100 | 이윤기
2 | (주)신흥지엔티 평택공장 | 경기도 평택시 은실5길 90 | 031-652-5451 | 구본재
3 | ㈜한성플랜지 | 인천광역시 남동구 앵고개로 659 | 032-813-6340 | 김진안
4 | 대진산업(주) | 전라남도 여수시 소라면 화양로 1938-1 | 061-683-6363 | 안명욱
5 | 주식회사 신흥 구미공장 | 경상북도 칠곡군 기산면 행정2길 97-3 | 054-475-1800 | 박상복 외 1명
6 | (주)송덕패키징 | 경상북도 칠곡군 석적읍 중지3길 68 | 054-975-3242 | 이재필,이길희
7 | (주)형진목재 | 부산광역시 강서구 녹산산단381로86번길 14-21 | 051-831-0748 | 조희관
8 | (주)서울수출포장 | 경기도 화성시 양감면 초록로 660 | 031-352-8420 | 임경빈
9 | ㈜신영목재 | 전라북도 군산시 외항로 1148 | 063-464-9830 | 김종환

Index | 업체명 | 주소 | 전화번호 | 대표자명
720 | 부광산업 | 경상북도 칠곡군 석적읍 중지3길 48-12 | 054-975-7712 | 이홍규
721 | 원팩글로벌 | 부산광역시 강서구 범방2로9번길 11 | 051-971-4025 | 민병현
722 | (주)와이앤피수출포장 | 인천광역시 서구 원전로69번길 4 | 032-565-6967 | 박상연
723 | 한미수출포장 | 충청북도 청주시 청원구 오창읍 성산2길 47-9 | 043-218-4631 | 전현숙 외 1명
724 | (주)하이텍우드 | 전라남도 함평군 학교면 동함평산단길 34-11 | 061-322-0500 | 강동헌
725 | 동남목재수출포장 | 경상남도 김해시 상동면 상동로 297 | 055-329-0724 | 신용선
726 | (주)산호수출포장 | 경상남도 함안군 대산면 이산로 100-8 | 055-714-1070 | 최은수외 1명
727 | 송림산업 | 울산광역시 울주군 온양읍 용당내광로 361-5 | 052-264-0061 | 김재관
728 | 한영목재 | 경상남도 밀양시 삼랑진읍 미율로 394-16 | 055-355-2109 | 김병주
729 | 제일파렛트 | 충청북도 청주시 서원구 현도면 죽전2길 56 | 043-269-2167 | 한승우

Duplicate rows

Most frequently occurring

Index | 업체명 | 주소 | 전화번호 | 대표자명 | # duplicates
0 | 화성산업(주) | 전라남도 여수시 소라면 덕양로 377 | 061-683-9171 | 김철곤 | 2
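
A row counts as a duplicate when all four fields match another row exactly. The sketch below shows one way to find such rows with the standard library, assuming each row is a tuple of the four column values. The three-row list is an illustrative subset built from rows shown in this report, with the duplicated row deliberately entered twice; `duplicate_rows` is a hypothetical helper.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return {row: occurrence count} for rows that occur more than once."""
    counts = Counter(tuple(r) for r in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Illustrative subset: the first and third rows are identical in every field.
rows = [
    ("화성산업(주)", "전라남도 여수시 소라면 덕양로 377", "061-683-9171", "김철곤"),
    ("대진산업(주)", "전라남도 여수시 소라면 화양로 1938-1", "061-683-6363", "안명욱"),
    ("화성산업(주)", "전라남도 여수시 소라면 덕양로 377", "061-683-9171", "김철곤"),
]

dupes = duplicate_rows(rows)
for row, n in dupes.items():
    print(row, n)
```

Exact matching is strict: the same company written once as ㈜ and once as (주) would not be flagged, so normalizing such variants first may be worth considering.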