Overview

Dataset statistics

Number of variables6
Number of observations311
Missing cells8
Missing cells (%)0.4%
Duplicate rows1
Duplicate rows (%)0.3%
Total size in memory14.7 KiB
Average record size in memory48.4 B

Variable types

Categorical2
Text3
DateTime1

Dataset

Description마산, 군산, 대불, 동해, 율촌, 울산, 김제 등 자유무역지역 입주업체 현황자료 제공(업체명, 주소, 주생산품, 투자국, 입주허가일)
Author산업통상자원부
URLhttps://www.data.go.kr/data/15050350/fileData.do

Alerts

Dataset has 1 (0.3%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 12:15:06.338945
Analysis finished2023-12-12 12:15:07.678726
Duration1.34 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct8
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
마산
130 
울산
43 
김제
40 
대불
31 
군산
28 
Other values (3)
39 

Length

Max length4
Median length2
Mean length2.0128617
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row마산
2nd row마산
3rd row마산
4th row마산
5th row마산

Common Values

ValueCountFrequency (%)
마산 130
41.8%
울산 43
 
13.8%
김제 40
 
12.9%
대불 31
 
10.0%
군산 28
 
9.0%
율촌 20
 
6.4%
동해 17
 
5.5%
<NA> 2
 
0.6%

Length

2023-12-12T21:15:07.805269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:15:08.014627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
마산 130
41.8%
울산 43
 
13.8%
김제 40
 
12.9%
대불 31
 
10.0%
군산 28
 
9.0%
율촌 20
 
6.4%
동해 17
 
5.5%
na 2
 
0.6%
Distinct307
Distinct (%)99.4%
Missing2
Missing (%)0.6%
Memory size2.6 KiB
2023-12-12T21:15:08.312598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length15
Mean length6.2783172
Min length3

Characters and Unicode

Total characters1940
Distinct characters301
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique305 ?
Unique (%)98.7%

Sample

1st row한국소니전자㈜
2nd row한국중천전화산업㈜
3rd row티와이모듈코리아㈜
4th row한국성전㈜
5th row㈜아로텍
ValueCountFrequency (%)
㈜영광 2
 
0.6%
농업회사법인 2
 
0.6%
2
 
0.6%
㈜네오플라테크 2
 
0.6%
피에스씨㈜ 2
 
0.6%
㈜알파온 1
 
0.3%
㈜윤성 1
 
0.3%
주)한국체서피크 1
 
0.3%
㈜노르코아쿠아월드 1
 
0.3%
㈜지엠아이동해 1
 
0.3%
Other values (305) 305
95.3%
2023-12-12T21:15:08.771026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
240
 
12.4%
94
 
4.8%
77
 
4.0%
( 59
 
3.0%
) 59
 
3.0%
58
 
3.0%
39
 
2.0%
38
 
2.0%
33
 
1.7%
29
 
1.5%
Other values (291) 1214
62.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1530
78.9%
Other Symbol 240
 
12.4%
Open Punctuation 59
 
3.0%
Close Punctuation 59
 
3.0%
Uppercase Letter 31
 
1.6%
Space Separator 11
 
0.6%
Other Punctuation 7
 
0.4%
Lowercase Letter 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
94
 
6.1%
77
 
5.0%
58
 
3.8%
39
 
2.5%
38
 
2.5%
33
 
2.2%
29
 
1.9%
28
 
1.8%
28
 
1.8%
26
 
1.7%
Other values (270) 1080
70.6%
Uppercase Letter
ValueCountFrequency (%)
S 7
22.6%
M 5
16.1%
T 4
12.9%
E 2
 
6.5%
H 2
 
6.5%
C 2
 
6.5%
I 2
 
6.5%
P 2
 
6.5%
K 2
 
6.5%
A 1
 
3.2%
Other values (2) 2
 
6.5%
Lowercase Letter
ValueCountFrequency (%)
h 1
33.3%
c 1
33.3%
e 1
33.3%
Other Punctuation
ValueCountFrequency (%)
. 5
71.4%
& 2
 
28.6%
Other Symbol
ValueCountFrequency (%)
240
100.0%
Open Punctuation
ValueCountFrequency (%)
( 59
100.0%
Close Punctuation
ValueCountFrequency (%)
) 59
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1770
91.2%
Common 136
 
7.0%
Latin 34
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
240
 
13.6%
94
 
5.3%
77
 
4.4%
58
 
3.3%
39
 
2.2%
38
 
2.1%
33
 
1.9%
29
 
1.6%
28
 
1.6%
28
 
1.6%
Other values (271) 1106
62.5%
Latin
ValueCountFrequency (%)
S 7
20.6%
M 5
14.7%
T 4
11.8%
E 2
 
5.9%
H 2
 
5.9%
C 2
 
5.9%
I 2
 
5.9%
P 2
 
5.9%
K 2
 
5.9%
A 1
 
2.9%
Other values (5) 5
14.7%
Common
ValueCountFrequency (%)
( 59
43.4%
) 59
43.4%
11
 
8.1%
. 5
 
3.7%
& 2
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1530
78.9%
None 240
 
12.4%
ASCII 170
 
8.8%

Most frequent character per block

None
ValueCountFrequency (%)
240
100.0%
Hangul
ValueCountFrequency (%)
94
 
6.1%
77
 
5.0%
58
 
3.8%
39
 
2.5%
38
 
2.5%
33
 
2.2%
29
 
1.9%
28
 
1.8%
28
 
1.8%
26
 
1.7%
Other values (270) 1080
70.6%
ASCII
ValueCountFrequency (%)
( 59
34.7%
) 59
34.7%
11
 
6.5%
S 7
 
4.1%
M 5
 
2.9%
. 5
 
2.9%
T 4
 
2.4%
E 2
 
1.2%
H 2
 
1.2%
C 2
 
1.2%
Other values (10) 14
 
8.2%
Distinct302
Distinct (%)97.7%
Missing2
Missing (%)0.6%
Memory size2.6 KiB
2023-12-12T21:15:09.084657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length129
Median length85
Mean length38.161812
Min length27

Characters and Unicode

Total characters11792
Distinct characters99
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique295 ?
Unique (%)95.5%

Sample

1st row경상남도 창원시 마산회원구 자유무역2길 76 (24,905㎡)
2nd row경상남도 창원시 마산회원구 자유무역2길 26 (7,348㎡)
3rd row경상남도 창원시 마산회원구 자유무역3길 43 (13,868㎡)
4th row경상남도 창원시 마산회원구 자유무역3길 150 (5,289㎡)
5th row경상남도 창원시 마산회원구 자유무역3길 146 (6,643㎡)
ValueCountFrequency (%)
마산회원구 130
 
6.4%
경상남도 130
 
6.4%
창원시 130
 
6.4%
전라북도 68
 
3.4%
자유무역3길 65
 
3.2%
표준공장 61
 
3.0%
전라남도 53
 
2.6%
울산광역시 43
 
2.1%
울주군 43
 
2.1%
김제시 40
 
2.0%
Other values (553) 1256
62.2%
2023-12-12T21:15:09.501888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1711
 
14.5%
1 593
 
5.0%
3 453
 
3.8%
2 420
 
3.6%
, 347
 
2.9%
330
 
2.8%
) 325
 
2.8%
324
 
2.7%
( 322
 
2.7%
281
 
2.4%
Other values (89) 6686
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5377
45.6%
Decimal Number 3100
26.3%
Space Separator 1711
 
14.5%
Other Punctuation 506
 
4.3%
Close Punctuation 325
 
2.8%
Other Symbol 324
 
2.7%
Open Punctuation 322
 
2.7%
Dash Punctuation 103
 
0.9%
Uppercase Letter 21
 
0.2%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
330
 
6.1%
281
 
5.2%
277
 
5.2%
269
 
5.0%
224
 
4.2%
215
 
4.0%
184
 
3.4%
177
 
3.3%
172
 
3.2%
170
 
3.2%
Other values (62) 3078
57.2%
Decimal Number
ValueCountFrequency (%)
1 593
19.1%
3 453
14.6%
2 420
13.5%
4 257
8.3%
5 248
8.0%
0 244
7.9%
9 240
7.7%
6 227
 
7.3%
7 222
 
7.2%
8 196
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
B 8
38.1%
C 4
19.0%
A 4
19.0%
F 2
 
9.5%
D 1
 
4.8%
G 1
 
4.8%
E 1
 
4.8%
Other Punctuation
ValueCountFrequency (%)
, 347
68.6%
. 158
31.2%
/ 1
 
0.2%
Space Separator
ValueCountFrequency (%)
1711
100.0%
Close Punctuation
ValueCountFrequency (%)
) 325
100.0%
Other Symbol
ValueCountFrequency (%)
324
100.0%
Open Punctuation
ValueCountFrequency (%)
( 322
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 103
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6393
54.2%
Hangul 5377
45.6%
Latin 22
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
330
 
6.1%
281
 
5.2%
277
 
5.2%
269
 
5.0%
224
 
4.2%
215
 
4.0%
184
 
3.4%
177
 
3.3%
172
 
3.2%
170
 
3.2%
Other values (62) 3078
57.2%
Common
ValueCountFrequency (%)
1711
26.8%
1 593
 
9.3%
3 453
 
7.1%
2 420
 
6.6%
, 347
 
5.4%
) 325
 
5.1%
324
 
5.1%
( 322
 
5.0%
4 257
 
4.0%
5 248
 
3.9%
Other values (9) 1393
21.8%
Latin
ValueCountFrequency (%)
B 8
36.4%
C 4
18.2%
A 4
18.2%
F 2
 
9.1%
D 1
 
4.5%
c 1
 
4.5%
G 1
 
4.5%
E 1
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6091
51.7%
Hangul 5377
45.6%
CJK Compat 324
 
2.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1711
28.1%
1 593
 
9.7%
3 453
 
7.4%
2 420
 
6.9%
, 347
 
5.7%
) 325
 
5.3%
( 322
 
5.3%
4 257
 
4.2%
5 248
 
4.1%
0 244
 
4.0%
Other values (16) 1171
19.2%
Hangul
ValueCountFrequency (%)
330
 
6.1%
281
 
5.2%
277
 
5.2%
269
 
5.0%
224
 
4.2%
215
 
4.0%
184
 
3.4%
177
 
3.3%
172
 
3.2%
170
 
3.2%
Other values (62) 3078
57.2%
CJK Compat
ValueCountFrequency (%)
324
100.0%
Distinct272
Distinct (%)88.0%
Missing2
Missing (%)0.6%
Memory size2.6 KiB
2023-12-12T21:15:09.801462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length28
Mean length11.68932
Min length2

Characters and Unicode

Total characters3612
Distinct characters348
Distinct categories7 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique257 ?
Unique (%)83.2%

Sample

1st row방송용 유무선 송수신기
2nd rowTimer Switch Motor
3rd rowinductor condensor LCD용 인버터
4th row휴대폰 키패드
5th rowLCD용 인버터 보드
ValueCountFrequency (%)
55
 
6.7%
제조업 39
 
4.8%
부품 32
 
3.9%
자동차 17
 
2.1%
선박블럭 14
 
1.7%
기타 12
 
1.5%
열교환기 10
 
1.2%
7
 
0.9%
제조 7
 
0.9%
자동차용 6
 
0.7%
Other values (485) 620
75.7%
2023-12-12T21:15:10.214512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
510
 
14.1%
143
 
4.0%
104
 
2.9%
98
 
2.7%
94
 
2.6%
, 91
 
2.5%
84
 
2.3%
71
 
2.0%
62
 
1.7%
56
 
1.6%
Other values (338) 2299
63.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2628
72.8%
Space Separator 510
 
14.1%
Lowercase Letter 187
 
5.2%
Uppercase Letter 143
 
4.0%
Other Punctuation 96
 
2.7%
Open Punctuation 24
 
0.7%
Close Punctuation 24
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
143
 
5.4%
104
 
4.0%
98
 
3.7%
94
 
3.6%
84
 
3.2%
71
 
2.7%
62
 
2.4%
56
 
2.1%
52
 
2.0%
49
 
1.9%
Other values (292) 1815
69.1%
Lowercase Letter
ValueCountFrequency (%)
r 25
13.4%
e 25
13.4%
o 17
9.1%
t 15
8.0%
s 14
7.5%
c 14
7.5%
l 13
 
7.0%
a 12
 
6.4%
i 11
 
5.9%
n 7
 
3.7%
Other values (10) 34
18.2%
Uppercase Letter
ValueCountFrequency (%)
C 16
11.2%
S 13
 
9.1%
E 12
 
8.4%
T 12
 
8.4%
L 12
 
8.4%
D 10
 
7.0%
A 10
 
7.0%
P 10
 
7.0%
R 7
 
4.9%
I 7
 
4.9%
Other values (10) 34
23.8%
Other Punctuation
ValueCountFrequency (%)
, 91
94.8%
/ 4
 
4.2%
. 1
 
1.0%
Space Separator
ValueCountFrequency (%)
510
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2626
72.7%
Common 654
 
18.1%
Latin 330
 
9.1%
Han 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
143
 
5.4%
104
 
4.0%
98
 
3.7%
94
 
3.6%
84
 
3.2%
71
 
2.7%
62
 
2.4%
56
 
2.1%
52
 
2.0%
49
 
1.9%
Other values (291) 1813
69.0%
Latin
ValueCountFrequency (%)
r 25
 
7.6%
e 25
 
7.6%
o 17
 
5.2%
C 16
 
4.8%
t 15
 
4.5%
s 14
 
4.2%
c 14
 
4.2%
S 13
 
3.9%
l 13
 
3.9%
E 12
 
3.6%
Other values (30) 166
50.3%
Common
ValueCountFrequency (%)
510
78.0%
, 91
 
13.9%
( 24
 
3.7%
) 24
 
3.7%
/ 4
 
0.6%
. 1
 
0.2%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2626
72.7%
ASCII 984
 
27.2%
CJK Compat Ideographs 2
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
510
51.8%
, 91
 
9.2%
r 25
 
2.5%
e 25
 
2.5%
( 24
 
2.4%
) 24
 
2.4%
o 17
 
1.7%
C 16
 
1.6%
t 15
 
1.5%
s 14
 
1.4%
Other values (36) 223
22.7%
Hangul
ValueCountFrequency (%)
143
 
5.4%
104
 
4.0%
98
 
3.7%
94
 
3.6%
84
 
3.2%
71
 
2.7%
62
 
2.4%
56
 
2.1%
52
 
2.0%
49
 
1.9%
Other values (291) 1813
69.0%
CJK Compat Ideographs
ValueCountFrequency (%)
2
100.0%

투자국
Categorical

Distinct27
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
한국
123 
일본
47 
국내
44 
미국
28 
중국
27 
Other values (22)
42 

Length

Max length6
Median length2
Mean length2.1543408
Min length2

Unique

Unique13 ?
Unique (%)4.2%

Sample

1st row네덜란드
2nd row일본
3rd row한국
4th row일본
5th row한국

Common Values

ValueCountFrequency (%)
한국 123
39.5%
일본 47
 
15.1%
국내 44
 
14.1%
미국 28
 
9.0%
중국 27
 
8.7%
홍콩 6
 
1.9%
인도네시아 5
 
1.6%
캐나다 4
 
1.3%
싱가포르 3
 
1.0%
독일 3
 
1.0%
Other values (17) 21
 
6.8%

Length

2023-12-12T21:15:10.368214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한국 123
39.4%
일본 47
 
15.1%
국내 44
 
14.1%
미국 29
 
9.3%
중국 28
 
9.0%
홍콩 6
 
1.9%
인도네시아 5
 
1.6%
캐나다 4
 
1.3%
싱가포르 3
 
1.0%
독일 3
 
1.0%
Other values (16) 20
 
6.4%
Distinct290
Distinct (%)93.9%
Missing2
Missing (%)0.6%
Memory size2.6 KiB
Minimum1971-03-16 00:00:00
Maximum2023-09-22 00:00:00
2023-12-12T21:15:10.502004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:15:10.659837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Correlations

2023-12-12T21:15:10.754329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분투자국
구분1.0000.707
투자국0.7071.000
2023-12-12T21:15:10.848490image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
투자국구분
투자국1.0000.382
구분0.3821.000
2023-12-12T21:15:10.939340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분투자국
구분1.0000.382
투자국0.3821.000

Missing values

2023-12-12T21:15:07.237602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:15:07.385081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T21:15:07.559251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

구분업체명주소(임대면적)주생산품투자국입주허가일
0마산한국소니전자㈜경상남도 창원시 마산회원구 자유무역2길 76 (24,905㎡)방송용 유무선 송수신기네덜란드1972-08-24
1마산한국중천전화산업㈜경상남도 창원시 마산회원구 자유무역2길 26 (7,348㎡)Timer Switch Motor일본1972-11-01
2마산티와이모듈코리아㈜경상남도 창원시 마산회원구 자유무역3길 43 (13,868㎡)inductor condensor LCD용 인버터한국2019-01-28
3마산한국성전㈜경상남도 창원시 마산회원구 자유무역3길 150 (5,289㎡)휴대폰 키패드일본1972-12-21
4마산㈜아로텍경상남도 창원시 마산회원구 자유무역3길 146 (6,643㎡)LCD용 인버터 보드한국1972-12-21
5마산한일차단기㈜경상남도 창원시 마산회원구 자유무역3길 40 (6,982㎡)전력용 차단기한국1993-08-02
6마산㈜엔지피경상남도 창원시 마산회원구 자유무역3길 118 (4,719㎡)태양광 발전장치,LED조명 등미국1996-11-28
7마산엠에스이㈜경상남도 창원시 마산회원구 봉암공단13길 23-29 표준공장 9호동 (6,225㎡)PCB Assembly한국2007-03-28
8마산㈜테크노전자경상남도 창원시 마산회원구 봉암공단13길 23-23 표준공장 8호동 (3,119㎡)전기회로개폐 보호 및 접속장치한국2007-11-01
9마산인피니텍㈜경상남도 창원시 마산회원구 봉암공단13길 23-29 표준공장 9호동 (907.08㎡)전원용IC일본2009-03-19
구분업체명주소(임대면적)주생산품투자국입주허가일
301김제랜드솔루션㈜전라북도 김제시 백산면 부거리1576 (51,325.90㎡)기계국내2022-08-10
302김제㈜백제중공업전라북도 김제시 백산면 부거리 1576-6)16,498.20㎡)기계국내2022-10-21
303김제㈜지엔전라북도 김제시 백산면 부거리 1581-3(13,266.60㎡)운송장비국내2022-10-25
304김제㈜웰바이오텍피디알엔전라북도 김제시 백산면 자유무역길 195-20 A동 5층 (3,044.61㎡)석유화학미국2022-12-02
305김제미래클전라북도 김제시 백산면 부거리 1580-4(15,485.40㎡)운송장비러시아2023-03-16
306김제㈜석경에이티전라북도 김제시 백산면 부거리 1573(31,078.30㎡)화학국내2023-07-19
307김제㈜알파온전라북도 김제시 백산면 부거리 1576-7(18,320.20㎡)운송장비국내2023-08-18
308김제㈜에이치알 이엔아이전라북도 김제시 백산면 부거리 1572-1(27,055.30㎡)운송장비국내2023-08-29
309<NA><NA><NA><NA><NA><NA>
310<NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

구분업체명주소(임대면적)주생산품투자국입주허가일# duplicates
0<NA><NA><NA><NA><NA><NA>2