Overview

Dataset statistics

Number of variables4
Number of observations640
Missing cells2
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.1 KiB
Average record size in memory32.2 B

Variable types

Text4

Dataset

Description목재포장재에 대한 열처리업을 수행하고 있는 수출입목재 열처리업체 목록 입니다.
Author농림축산검역본부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220215000000001895

Reproduction

Analysis started2023-12-11 03:06:08.111606
Analysis finished2023-12-11 03:06:08.619913
Duration0.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct617
Distinct (%)96.4%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-12-11T12:06:08.796365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length15
Mean length6.85625
Min length2

Characters and Unicode

Total characters4388
Distinct characters277
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique600 ?
Unique (%)93.8%

Sample

1st row업체명
2nd row광일산업
3rd row(주)동남우드
4th row현대기업
5th row유민목재
ValueCountFrequency (%)
주식회사 35
 
5.0%
대림수출포장 4
 
0.6%
주)한성목재 4
 
0.6%
벽암산업(주 3
 
0.4%
한국수출포장 3
 
0.4%
3
 
0.4%
주)성은물류 3
 
0.4%
skc(주 2
 
0.3%
미소물산 2
 
0.3%
일겸목재 2
 
0.3%
Other values (624) 638
91.3%
2023-12-11T12:06:09.224244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
276
 
6.3%
256
 
5.8%
245
 
5.6%
( 231
 
5.3%
) 231
 
5.3%
183
 
4.2%
176
 
4.0%
172
 
3.9%
167
 
3.8%
120
 
2.7%
Other values (267) 2331
53.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3755
85.6%
Open Punctuation 231
 
5.3%
Close Punctuation 231
 
5.3%
Space Separator 59
 
1.3%
Other Symbol 49
 
1.1%
Uppercase Letter 37
 
0.8%
Lowercase Letter 19
 
0.4%
Other Punctuation 5
 
0.1%
Dash Punctuation 1
 
< 0.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
276
 
7.4%
256
 
6.8%
245
 
6.5%
183
 
4.9%
176
 
4.7%
172
 
4.6%
167
 
4.4%
120
 
3.2%
109
 
2.9%
85
 
2.3%
Other values (229) 1966
52.4%
Uppercase Letter
ValueCountFrequency (%)
S 6
16.2%
J 4
10.8%
M 3
8.1%
K 3
8.1%
C 3
8.1%
L 3
8.1%
P 3
8.1%
B 2
 
5.4%
T 2
 
5.4%
E 2
 
5.4%
Other values (5) 6
16.2%
Lowercase Letter
ValueCountFrequency (%)
o 2
 
10.5%
i 2
 
10.5%
g 2
 
10.5%
c 2
 
10.5%
e 1
 
5.3%
h 1
 
5.3%
s 1
 
5.3%
n 1
 
5.3%
p 1
 
5.3%
a 1
 
5.3%
Other values (5) 5
26.3%
Other Punctuation
ValueCountFrequency (%)
. 4
80.0%
& 1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 231
100.0%
Close Punctuation
ValueCountFrequency (%)
) 231
100.0%
Space Separator
ValueCountFrequency (%)
59
100.0%
Other Symbol
ValueCountFrequency (%)
49
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3804
86.7%
Common 528
 
12.0%
Latin 56
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
276
 
7.3%
256
 
6.7%
245
 
6.4%
183
 
4.8%
176
 
4.6%
172
 
4.5%
167
 
4.4%
120
 
3.2%
109
 
2.9%
85
 
2.2%
Other values (230) 2015
53.0%
Latin
ValueCountFrequency (%)
S 6
 
10.7%
J 4
 
7.1%
M 3
 
5.4%
K 3
 
5.4%
C 3
 
5.4%
L 3
 
5.4%
P 3
 
5.4%
o 2
 
3.6%
i 2
 
3.6%
B 2
 
3.6%
Other values (20) 25
44.6%
Common
ValueCountFrequency (%)
( 231
43.8%
) 231
43.8%
59
 
11.2%
. 4
 
0.8%
- 1
 
0.2%
& 1
 
0.2%
2 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3755
85.6%
ASCII 584
 
13.3%
None 49
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
276
 
7.4%
256
 
6.8%
245
 
6.5%
183
 
4.9%
176
 
4.7%
172
 
4.6%
167
 
4.4%
120
 
3.2%
109
 
2.9%
85
 
2.3%
Other values (229) 1966
52.4%
ASCII
ValueCountFrequency (%)
( 231
39.6%
) 231
39.6%
59
 
10.1%
S 6
 
1.0%
J 4
 
0.7%
. 4
 
0.7%
M 3
 
0.5%
K 3
 
0.5%
C 3
 
0.5%
L 3
 
0.5%
Other values (27) 37
 
6.3%
None
ValueCountFrequency (%)
49
100.0%
Distinct637
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-12-11T12:06:09.558692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length42
Mean length25.751562
Min length2

Characters and Unicode

Total characters16481
Distinct characters325
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique634 ?
Unique (%)99.1%

Sample

1st row주소
2nd row전라북도 익산시 오산면 광양1길 3-1
3rd row경기도 화성시 향남읍 토성로359번길 10
4th row부산광역시 영도구 해양로73번길 67
5th row경상남도 김해시 상동면 상동로 135
ValueCountFrequency (%)
경기도 84
 
2.4%
경기 62
 
1.8%
경상남도 58
 
1.7%
김해시 49
 
1.4%
경남 48
 
1.4%
경북 47
 
1.3%
경상북도 46
 
1.3%
서구 45
 
1.3%
화성시 44
 
1.3%
강서구 41
 
1.2%
Other values (1681) 2988
85.1%
2023-12-11T12:06:10.027384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2894
 
17.6%
1 585
 
3.5%
497
 
3.0%
473
 
2.9%
( 469
 
2.8%
) 469
 
2.8%
2 413
 
2.5%
384
 
2.3%
3 373
 
2.3%
367
 
2.2%
Other values (315) 9557
58.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9119
55.3%
Decimal Number 3113
 
18.9%
Space Separator 2894
 
17.6%
Open Punctuation 469
 
2.8%
Close Punctuation 469
 
2.8%
Dash Punctuation 295
 
1.8%
Other Punctuation 106
 
0.6%
Uppercase Letter 16
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
497
 
5.5%
473
 
5.2%
384
 
4.2%
367
 
4.0%
326
 
3.6%
311
 
3.4%
289
 
3.2%
272
 
3.0%
267
 
2.9%
250
 
2.7%
Other values (290) 5683
62.3%
Decimal Number
ValueCountFrequency (%)
1 585
18.8%
2 413
13.3%
3 373
12.0%
4 304
9.8%
7 256
8.2%
6 251
8.1%
5 247
7.9%
8 244
7.8%
0 236
7.6%
9 204
 
6.6%
Uppercase Letter
ValueCountFrequency (%)
B 4
25.0%
L 4
25.0%
S 2
12.5%
C 2
12.5%
R 1
 
6.2%
H 1
 
6.2%
A 1
 
6.2%
K 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 103
97.2%
. 2
 
1.9%
/ 1
 
0.9%
Space Separator
ValueCountFrequency (%)
2894
100.0%
Open Punctuation
ValueCountFrequency (%)
( 469
100.0%
Close Punctuation
ValueCountFrequency (%)
) 469
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 295
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9119
55.3%
Common 7346
44.6%
Latin 16
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
497
 
5.5%
473
 
5.2%
384
 
4.2%
367
 
4.0%
326
 
3.6%
311
 
3.4%
289
 
3.2%
272
 
3.0%
267
 
2.9%
250
 
2.7%
Other values (290) 5683
62.3%
Common
ValueCountFrequency (%)
2894
39.4%
1 585
 
8.0%
( 469
 
6.4%
) 469
 
6.4%
2 413
 
5.6%
3 373
 
5.1%
4 304
 
4.1%
- 295
 
4.0%
7 256
 
3.5%
6 251
 
3.4%
Other values (7) 1037
 
14.1%
Latin
ValueCountFrequency (%)
B 4
25.0%
L 4
25.0%
S 2
12.5%
C 2
12.5%
R 1
 
6.2%
H 1
 
6.2%
A 1
 
6.2%
K 1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9119
55.3%
ASCII 7362
44.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2894
39.3%
1 585
 
7.9%
( 469
 
6.4%
) 469
 
6.4%
2 413
 
5.6%
3 373
 
5.1%
4 304
 
4.1%
- 295
 
4.0%
7 256
 
3.5%
6 251
 
3.4%
Other values (15) 1053
 
14.3%
Hangul
ValueCountFrequency (%)
497
 
5.5%
473
 
5.2%
384
 
4.2%
367
 
4.0%
326
 
3.6%
311
 
3.4%
289
 
3.2%
272
 
3.0%
267
 
2.9%
250
 
2.7%
Other values (290) 5683
62.3%
Distinct638
Distinct (%)100.0%
Missing2
Missing (%)0.3%
Memory size5.1 KiB
2023-12-11T12:06:10.277996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length12
Mean length12.084639
Min length4

Characters and Unicode

Total characters7710
Distinct characters21
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique638 ?
Unique (%)100.0%

Sample

1st row전화번호
2nd row070-8886-2976
3rd row031-354-3103
4th row051-412-4074
5th row055-321-7376
ValueCountFrequency (%)
043-263-4774 1
 
0.2%
031-352-9268 1
 
0.2%
061-335-1835 1
 
0.2%
041-564-4320 1
 
0.2%
041-533-6318 1
 
0.2%
054-285-2085 1
 
0.2%
052-225-5425 1
 
0.2%
043-533-1254 1
 
0.2%
043-838-4600 1
 
0.2%
041-544-2950 1
 
0.2%
Other values (633) 633
98.4%
2023-12-11T12:06:10.663656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 1182
15.3%
0 1035
13.4%
5 939
12.2%
3 820
10.6%
1 676
8.8%
2 620
8.0%
4 603
7.8%
6 507
6.6%
7 434
 
5.6%
8 414
 
5.4%
Other values (11) 480
6.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6408
83.1%
Dash Punctuation 1182
 
15.3%
Close Punctuation 96
 
1.2%
Math Symbol 9
 
0.1%
Space Separator 5
 
0.1%
Other Punctuation 5
 
0.1%
Other Letter 4
 
0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1035
16.2%
5 939
14.7%
3 820
12.8%
1 676
10.5%
2 620
9.7%
4 603
9.4%
6 507
7.9%
7 434
6.8%
8 414
 
6.5%
9 360
 
5.6%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Other Punctuation
ValueCountFrequency (%)
, 3
60.0%
/ 2
40.0%
Dash Punctuation
ValueCountFrequency (%)
- 1182
100.0%
Close Punctuation
ValueCountFrequency (%)
) 96
100.0%
Math Symbol
ValueCountFrequency (%)
~ 9
100.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7706
99.9%
Hangul 4
 
0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 1182
15.3%
0 1035
13.4%
5 939
12.2%
3 820
10.6%
1 676
8.8%
2 620
8.0%
4 603
7.8%
6 507
6.6%
7 434
 
5.6%
8 414
 
5.4%
Other values (7) 476
6.2%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7706
99.9%
Hangul 4
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 1182
15.3%
0 1035
13.4%
5 939
12.2%
3 820
10.6%
1 676
8.8%
2 620
8.0%
4 603
7.8%
6 507
6.6%
7 434
 
5.6%
8 414
 
5.4%
Other values (7) 476
6.2%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct610
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Memory size5.1 KiB
2023-12-11T12:06:11.004488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length3
Mean length3.0703125
Min length2

Characters and Unicode

Total characters1965
Distinct characters199
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique587 ?
Unique (%)91.7%

Sample

1st row대표자명
2nd row안선오
3rd row고변상
4th row김염주
5th row김종민
ValueCountFrequency (%)
구민모 5
 
0.8%
이재필 4
 
0.6%
성원제 3
 
0.5%
이은학 3
 
0.5%
지인순 2
 
0.3%
2
 
0.3%
2
 
0.3%
2
 
0.3%
2
 
0.3%
2
 
0.3%
Other values (619) 638
95.9%
2023-12-11T12:06:11.481597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
109
 
5.5%
108
 
5.5%
71
 
3.6%
54
 
2.7%
53
 
2.7%
40
 
2.0%
35
 
1.8%
35
 
1.8%
33
 
1.7%
32
 
1.6%
Other values (189) 1395
71.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1931
98.3%
Space Separator 30
 
1.5%
Other Punctuation 4
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
109
 
5.6%
108
 
5.6%
71
 
3.7%
54
 
2.8%
53
 
2.7%
40
 
2.1%
35
 
1.8%
35
 
1.8%
33
 
1.7%
32
 
1.7%
Other values (187) 1361
70.5%
Space Separator
ValueCountFrequency (%)
30
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1931
98.3%
Common 34
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
109
 
5.6%
108
 
5.6%
71
 
3.7%
54
 
2.8%
53
 
2.7%
40
 
2.1%
35
 
1.8%
35
 
1.8%
33
 
1.7%
32
 
1.7%
Other values (187) 1361
70.5%
Common
ValueCountFrequency (%)
30
88.2%
, 4
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1931
98.3%
ASCII 34
 
1.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
109
 
5.6%
108
 
5.6%
71
 
3.7%
54
 
2.8%
53
 
2.7%
40
 
2.1%
35
 
1.8%
35
 
1.8%
33
 
1.7%
32
 
1.7%
Other values (187) 1361
70.5%
ASCII
ValueCountFrequency (%)
30
88.2%
, 4
 
11.8%

Missing values

2023-12-11T12:06:08.513510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:06:08.590011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Unnamed: 0Unnamed: 1기간 :2014년 12월
0업체명주소전화번호대표자명
1광일산업전라북도 익산시 오산면 광양1길 3-1070-8886-2976안선오
2(주)동남우드경기도 화성시 향남읍 토성로359번길 10031-354-3103고변상
3현대기업부산광역시 영도구 해양로73번길 67051-412-4074김염주
4유민목재경상남도 김해시 상동면 상동로 135055-321-7376김종민
5신한수출포장부산광역시 강서구 식만로85번길 22051-972-3463구정봉
6주식회사 성화수출포장경상북도 구미시 장천면 신장2길 54-7054-475-2242장홍수
7태송파렛트경기도 김포시 대곶면 대곶남로 340031-989-4651이정덕
8공단목재상사충청북도 청주시 흥덕구 사직대로 37043-263-4774박준식
9(주)명성수출포장경기도 시흥시 옥구천동로44번길 35 (시화공단 2바 204호)031-433-0285박용국
Unnamed: 0Unnamed: 1기간 :2014년 12월
630㈜신영목재전북 군산시 외항로 1148 (오식도동)063-464-9830김종환
631(주)서울수출포장경기 화성시 양감면 초록로 660 (사창리)031-352-8420임경빈
632(주)형진목재부산 강서구 녹산산단381로86번길 14-21 (송정동)051-831-0748 / 0749조희관
633(주)송덕패키징경북 칠곡군 석적읍 중지3길 68, (718-832) (중지리)054-975-3242이재필
634신흥글로벌㈜구미공장경북 구미시 3공단로 89-39, (730-340) (시미동)054-475-1800박상복
635대진산업(주)전라남도 여수시 소라면 화양로 1938-1061-683-6363안명욱
636㈜한성플랜지인천 남동구 앵고개로 659 (고잔동)032-813-6340김진안
637㈜신흥목재평택공장경기 평택시 은실5길 90 (세교동)031-652-5451구자만
638(주)뉴-그린경기도 화성시 향남읍 토성로 359번길 10031-354-3100이윤기
639진성산업㈜인천광역시 서구 북항로363번길 58032-575-7600홍진기