Overview

Dataset statistics

Number of variables4
Number of observations1852
Missing cells455
Missing cells (%)6.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory58.0 KiB
Average record size in memory32.1 B

Variable types

Text4

Dataset

Description충청남도에 본사 또는 공장이 소재한 중소수출 기업 현황에 대한 데이터로 기업명, 대표자, 수출품목의 항목을 제공합니다. 출처는 충남 온라인수출지원시스템 가입 기업입니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=425&beforeMenuCd=DOM_000000201001001000&publicdatapk=3062422

Alerts

공장 소재지 has 438 (23.7%) missing valuesMissing

Reproduction

Analysis started2024-01-09 20:24:15.223314
Analysis finished2024-01-09 20:24:16.266494
Duration1.04 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1831
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size14.6 KiB
2024-01-10T05:24:16.456880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length18
Mean length7.9033477
Min length1

Characters and Unicode

Total characters14637
Distinct characters616
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1810 ?
Unique (%)97.7%

Sample

1st row주식회사 대수하이테크
2nd row투반산업
3rd row베르상스퍼시픽
4th row주식회사 씨티에스엔지니어링
5th row(주)위스코
ValueCountFrequency (%)
주식회사 319
 
13.6%
38
 
1.6%
농업회사법인 32
 
1.4%
유한회사 6
 
0.3%
코리아 5
 
0.2%
영농조합법인 3
 
0.1%
어업회사법인 3
 
0.1%
협동조합 3
 
0.1%
바이오텍 3
 
0.1%
농업회사 3
 
0.1%
Other values (1889) 1934
82.3%
2024-01-10T05:24:16.831577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1323
 
9.0%
) 937
 
6.4%
( 928
 
6.3%
551
 
3.8%
517
 
3.5%
498
 
3.4%
489
 
3.3%
434
 
3.0%
418
 
2.9%
270
 
1.8%
Other values (606) 8272
56.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11898
81.3%
Close Punctuation 938
 
6.4%
Open Punctuation 929
 
6.3%
Space Separator 498
 
3.4%
Uppercase Letter 193
 
1.3%
Lowercase Letter 127
 
0.9%
Other Punctuation 27
 
0.2%
Decimal Number 20
 
0.1%
Dash Punctuation 3
 
< 0.1%
Other Symbol 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1323
 
11.1%
551
 
4.6%
517
 
4.3%
489
 
4.1%
434
 
3.6%
418
 
3.5%
270
 
2.3%
215
 
1.8%
173
 
1.5%
167
 
1.4%
Other values (546) 7341
61.7%
Uppercase Letter
ValueCountFrequency (%)
S 20
 
10.4%
A 20
 
10.4%
M 13
 
6.7%
K 13
 
6.7%
P 12
 
6.2%
C 12
 
6.2%
O 12
 
6.2%
R 10
 
5.2%
H 10
 
5.2%
B 9
 
4.7%
Other values (10) 62
32.1%
Lowercase Letter
ValueCountFrequency (%)
o 17
13.4%
e 14
11.0%
a 11
8.7%
t 11
8.7%
d 10
7.9%
r 10
7.9%
s 10
7.9%
m 9
 
7.1%
n 7
 
5.5%
h 5
 
3.9%
Other values (10) 23
18.1%
Decimal Number
ValueCountFrequency (%)
2 6
30.0%
1 5
25.0%
3 2
 
10.0%
6 2
 
10.0%
5 2
 
10.0%
7 1
 
5.0%
4 1
 
5.0%
0 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 15
55.6%
& 7
25.9%
, 4
 
14.8%
: 1
 
3.7%
Close Punctuation
ValueCountFrequency (%)
) 937
99.9%
] 1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 928
99.9%
[ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
498
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11900
81.3%
Common 2416
 
16.5%
Latin 320
 
2.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1323
 
11.1%
551
 
4.6%
517
 
4.3%
489
 
4.1%
434
 
3.6%
418
 
3.5%
270
 
2.3%
215
 
1.8%
173
 
1.5%
167
 
1.4%
Other values (546) 7343
61.7%
Latin
ValueCountFrequency (%)
S 20
 
6.2%
A 20
 
6.2%
o 17
 
5.3%
e 14
 
4.4%
M 13
 
4.1%
K 13
 
4.1%
P 12
 
3.8%
C 12
 
3.8%
O 12
 
3.8%
a 11
 
3.4%
Other values (30) 176
55.0%
Common
ValueCountFrequency (%)
) 937
38.8%
( 928
38.4%
498
20.6%
. 15
 
0.6%
& 7
 
0.3%
2 6
 
0.2%
1 5
 
0.2%
, 4
 
0.2%
- 3
 
0.1%
3 2
 
0.1%
Other values (9) 11
 
0.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11897
81.3%
ASCII 2736
 
18.7%
None 3
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1323
 
11.1%
551
 
4.6%
517
 
4.3%
489
 
4.1%
434
 
3.6%
418
 
3.5%
270
 
2.3%
215
 
1.8%
173
 
1.5%
167
 
1.4%
Other values (545) 7340
61.7%
ASCII
ValueCountFrequency (%)
) 937
34.2%
( 928
33.9%
498
18.2%
S 20
 
0.7%
A 20
 
0.7%
o 17
 
0.6%
. 15
 
0.5%
e 14
 
0.5%
M 13
 
0.5%
K 13
 
0.5%
Other values (49) 261
 
9.5%
None
ValueCountFrequency (%)
3
100.0%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct100
Distinct (%)5.4%
Missing1
Missing (%)0.1%
Memory size14.6 KiB
2024-01-10T05:24:17.012202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length6
Mean length6.6315505
Min length5

Characters and Unicode

Total characters12275
Distinct characters86
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)1.9%

Sample

1st row충남 천안시
2nd row충남 공주시
3rd row충남 서천군
4th row충남 천안시
5th row인천 남동구
ValueCountFrequency (%)
충남 1187
32.1%
천안시 655
17.7%
충청남도 501
13.5%
아산시 348
 
9.4%
금산군 125
 
3.4%
당진시 89
 
2.4%
논산시 85
 
2.3%
홍성군 69
 
1.9%
예산군 63
 
1.7%
공주시 54
 
1.5%
Other values (76) 526
14.2%
2024-01-10T05:24:17.330189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1851
15.1%
1725
14.1%
1691
13.8%
1432
11.7%
714
 
5.8%
677
 
5.5%
677
 
5.5%
522
 
4.3%
514
 
4.2%
356
 
2.9%
Other values (76) 2116
17.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10423
84.9%
Space Separator 1851
 
15.1%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1725
16.5%
1691
16.2%
1432
13.7%
714
6.9%
677
 
6.5%
677
 
6.5%
522
 
5.0%
514
 
4.9%
356
 
3.4%
348
 
3.3%
Other values (74) 1767
17.0%
Space Separator
ValueCountFrequency (%)
1851
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10423
84.9%
Common 1852
 
15.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1725
16.5%
1691
16.2%
1432
13.7%
714
6.9%
677
 
6.5%
677
 
6.5%
522
 
5.0%
514
 
4.9%
356
 
3.4%
348
 
3.3%
Other values (74) 1767
17.0%
Common
ValueCountFrequency (%)
1851
99.9%
3 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10423
84.9%
ASCII 1852
 
15.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1851
99.9%
3 1
 
0.1%
Hangul
ValueCountFrequency (%)
1725
16.5%
1691
16.2%
1432
13.7%
714
6.9%
677
 
6.5%
677
 
6.5%
522
 
5.0%
514
 
4.9%
356
 
3.4%
348
 
3.3%
Other values (74) 1767
17.0%

공장 소재지
Text

MISSING 

Distinct58
Distinct (%)4.1%
Missing438
Missing (%)23.7%
Memory size14.6 KiB
2024-01-10T05:24:17.511521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length6
Mean length6.6011315
Min length5

Characters and Unicode

Total characters9334
Distinct characters66
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)1.2%

Sample

1st row대전 유성구
2nd row충남 공주시
3rd row충남 서천군
4th row충남 천안시
5th row충남 서산시
ValueCountFrequency (%)
충남 963
34.1%
천안시 465
16.4%
충청남도 405
14.3%
아산시 274
 
9.7%
금산군 121
 
4.3%
당진시 82
 
2.9%
논산시 76
 
2.7%
예산군 65
 
2.3%
홍성군 57
 
2.0%
공주시 49
 
1.7%
Other values (41) 271
 
9.6%
2024-01-10T05:24:17.927138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1414
15.1%
1378
14.8%
1371
14.7%
1071
11.5%
586
6.3%
507
 
5.4%
482
 
5.2%
433
 
4.6%
415
 
4.4%
334
 
3.6%
Other values (56) 1343
14.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7920
84.9%
Space Separator 1414
 
15.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1378
17.4%
1371
17.3%
1071
13.5%
586
7.4%
507
 
6.4%
482
 
6.1%
433
 
5.5%
415
 
5.2%
334
 
4.2%
274
 
3.5%
Other values (55) 1069
13.5%
Space Separator
ValueCountFrequency (%)
1414
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7920
84.9%
Common 1414
 
15.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1378
17.4%
1371
17.3%
1071
13.5%
586
7.4%
507
 
6.4%
482
 
6.1%
433
 
5.5%
415
 
5.2%
334
 
4.2%
274
 
3.5%
Other values (55) 1069
13.5%
Common
ValueCountFrequency (%)
1414
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7920
84.9%
ASCII 1414
 
15.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1414
100.0%
Hangul
ValueCountFrequency (%)
1378
17.4%
1371
17.3%
1071
13.5%
586
7.4%
507
 
6.4%
482
 
6.1%
433
 
5.5%
415
 
5.2%
334
 
4.2%
274
 
3.5%
Other values (55) 1069
13.5%
Distinct740
Distinct (%)40.3%
Missing16
Missing (%)0.9%
Memory size14.6 KiB
2024-01-10T05:24:18.201247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length37
Mean length6.9918301
Min length1

Characters and Unicode

Total characters12837
Distinct characters501
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique615 ?
Unique (%)33.5%

Sample

1st row기타화학공업제품
2nd row기타직물
3rd row유리제품
4th row탱크.배관.계기류 등 자재
5th row철강관및철강선
ValueCountFrequency (%)
농산가공품 132
 
5.8%
플라스틱 90
 
3.9%
제품 87
 
3.8%
자동차부품 71
 
3.1%
수산가공품 59
 
2.6%
기타생활용품 49
 
2.1%
비누치약및화장품 46
 
2.0%
반도체제조용장비 43
 
1.9%
화장품 36
 
1.6%
기타철강금속제품 31
 
1.4%
Other values (904) 1651
71.9%
2024-01-10T05:24:18.559507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
944
 
7.4%
858
 
6.7%
459
 
3.6%
407
 
3.2%
369
 
2.9%
338
 
2.6%
320
 
2.5%
283
 
2.2%
, 269
 
2.1%
265
 
2.1%
Other values (491) 8325
64.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11390
88.7%
Space Separator 459
 
3.6%
Other Punctuation 350
 
2.7%
Lowercase Letter 344
 
2.7%
Uppercase Letter 219
 
1.7%
Decimal Number 34
 
0.3%
Open Punctuation 20
 
0.2%
Close Punctuation 20
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
944
 
8.3%
858
 
7.5%
407
 
3.6%
369
 
3.2%
338
 
3.0%
320
 
2.8%
283
 
2.5%
265
 
2.3%
226
 
2.0%
201
 
1.8%
Other values (433) 7179
63.0%
Lowercase Letter
ValueCountFrequency (%)
e 51
14.8%
t 33
9.6%
r 33
9.6%
i 29
8.4%
o 27
7.8%
l 24
 
7.0%
c 23
 
6.7%
a 23
 
6.7%
n 20
 
5.8%
s 14
 
4.1%
Other values (14) 67
19.5%
Uppercase Letter
ValueCountFrequency (%)
C 30
13.7%
P 22
10.0%
E 18
 
8.2%
L 17
 
7.8%
D 15
 
6.8%
S 14
 
6.4%
V 14
 
6.4%
A 14
 
6.4%
T 13
 
5.9%
B 10
 
4.6%
Other values (12) 52
23.7%
Decimal Number
ValueCountFrequency (%)
0 19
55.9%
2 8
23.5%
3 4
 
11.8%
1 2
 
5.9%
8 1
 
2.9%
Other Punctuation
ValueCountFrequency (%)
, 269
76.9%
. 69
 
19.7%
/ 12
 
3.4%
Space Separator
ValueCountFrequency (%)
459
100.0%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11390
88.7%
Common 884
 
6.9%
Latin 563
 
4.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
944
 
8.3%
858
 
7.5%
407
 
3.6%
369
 
3.2%
338
 
3.0%
320
 
2.8%
283
 
2.5%
265
 
2.3%
226
 
2.0%
201
 
1.8%
Other values (433) 7179
63.0%
Latin
ValueCountFrequency (%)
e 51
 
9.1%
t 33
 
5.9%
r 33
 
5.9%
C 30
 
5.3%
i 29
 
5.2%
o 27
 
4.8%
l 24
 
4.3%
c 23
 
4.1%
a 23
 
4.1%
P 22
 
3.9%
Other values (36) 268
47.6%
Common
ValueCountFrequency (%)
459
51.9%
, 269
30.4%
. 69
 
7.8%
( 20
 
2.3%
) 20
 
2.3%
0 19
 
2.1%
/ 12
 
1.4%
2 8
 
0.9%
3 4
 
0.5%
1 2
 
0.2%
Other values (2) 2
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11390
88.7%
ASCII 1447
 
11.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
944
 
8.3%
858
 
7.5%
407
 
3.6%
369
 
3.2%
338
 
3.0%
320
 
2.8%
283
 
2.5%
265
 
2.3%
226
 
2.0%
201
 
1.8%
Other values (433) 7179
63.0%
ASCII
ValueCountFrequency (%)
459
31.7%
, 269
18.6%
. 69
 
4.8%
e 51
 
3.5%
t 33
 
2.3%
r 33
 
2.3%
C 30
 
2.1%
i 29
 
2.0%
o 27
 
1.9%
l 24
 
1.7%
Other values (48) 423
29.2%

Correlations

2024-01-10T05:24:18.639241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
본사 소재지공장 소재지
본사 소재지1.0000.994
공장 소재지0.9941.000

Missing values

2024-01-10T05:24:16.070005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:24:16.137895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-01-10T05:24:16.217672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기업명본사 소재지공장 소재지수출품목
0주식회사 대수하이테크충남 천안시대전 유성구기타화학공업제품
1투반산업충남 공주시충남 공주시기타직물
2베르상스퍼시픽충남 서천군충남 서천군유리제품
3주식회사 씨티에스엔지니어링충남 천안시충남 천안시탱크.배관.계기류 등 자재
4(주)위스코인천 남동구충남 서산시철강관및철강선
5주식회사 이노렉스충남 천안시충남 천안시기타산업기계
6주식회사 뮤즈나인충남 천안시<NA>화장품 및 화장용품
7동우플라스틱(주)충남 아산시충남 아산시플라스틱 제품
8오디하이텍(주)충남 홍성군충남 홍성군Transparnet LCD,PANEL PC
9(주)씽크소프트충남 아산시충남 아산시소프트웨어
기업명본사 소재지공장 소재지수출품목
1842(주)보덕에프앤지충청남도 금산군충청남도 금산군농산가공품
1843(주)쓰리제이충청남도 논산시충청남도 논산시샤워용품
1844(주)쿠스코충청남도 아산시<NA>기타생활용품,기타생활용품
1845광천토굴전통식품충청남도 홍성군충청남도 홍성군농산가공품
1846주식회사 노블오카리나충청남도 홍성군충청남도 홍성군관악기,오카리나
1847(주)스킨렉스경기도 성남시충청남도 천안시의료용기기
1848(주)우창충청남도 서산시충청남도 서산시PVC안정제
1849갓바위식품(주)충남 보령시충남 보령시수산가공품,해조류
1850농업회사법인 하늘빛(주)충청남도 공주시충청남도 공주시농산가공품
1851빌드켐 주식회사충남 공주시충남 공주시도료및잉크,드라이몰탈