Overview

Dataset statistics

Number of variables22
Number of observations1507
Missing cells1118
Missing cells (%)3.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory265.0 KiB
Average record size in memory180.1 B

Variable types

Categorical11
Text7
Numeric4

Dataset

Description전국지식산업센터현황의 csv파일로 전국 시, 군, 구별 지식산업센터의 이름,규모현황 등을 데이터 기준일자로 제공하고 있습니다.
Author한국산업단지공단
URLhttps://www.data.go.kr/data/15117154/fileData.do

Alerts

입지구분 has constant value ""Constant
지목 is highly imbalanced (59.5%)Imbalance
용도지역1 is highly imbalanced (86.4%)Imbalance
설치자 is highly imbalanced (75.3%)Imbalance
단지명 has 924 (61.3%) missing valuesMissing
공장대표주소(도로명) has 194 (12.9%) missing valuesMissing
건축면적 is highly skewed (γ1 = 24.85902914)Skewed
제조면적 is highly skewed (γ1 = 29.472871)Skewed
용지면적 has 54 (3.6%) zerosZeros
건축면적 has 41 (2.7%) zerosZeros
제조면적 has 56 (3.7%) zerosZeros
부대면적 has 76 (5.0%) zerosZeros

Reproduction

Analysis started2023-12-12 07:17:00.616087
Analysis finished2023-12-12 07:17:02.132180
Duration1.52 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도
Categorical

Distinct17
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
경기도
688 
서울특별시
393 
인천광역시
81 
부산광역시
 
68
충청남도
 
38
Other values (12)
239 

Length

Max length7
Median length5
Mean length4.0172528
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원특별자치도
2nd row강원특별자치도
3rd row강원특별자치도
4th row강원특별자치도
5th row강원특별자치도

Common Values

ValueCountFrequency (%)
경기도 688
45.7%
서울특별시 393
26.1%
인천광역시 81
 
5.4%
부산광역시 68
 
4.5%
충청남도 38
 
2.5%
충청북도 35
 
2.3%
대구광역시 35
 
2.3%
경상남도 30
 
2.0%
광주광역시 30
 
2.0%
전라남도 21
 
1.4%
Other values (7) 88
 
5.8%

Length

2023-12-12T16:17:02.213936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 688
45.7%
서울특별시 393
26.1%
인천광역시 81
 
5.4%
부산광역시 68
 
4.5%
충청남도 38
 
2.5%
충청북도 35
 
2.3%
대구광역시 35
 
2.3%
광주광역시 30
 
2.0%
경상남도 30
 
2.0%
전라남도 21
 
1.4%
Other values (7) 88
 
5.8%
Distinct122
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2023-12-12T16:17:02.504521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.8168547
Min length1

Characters and Unicode

Total characters5752
Distinct characters106
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33 ?
Unique (%)2.2%

Sample

1st row동해시
2nd row동해시
3rd row동해시
4th row양양군
5th row양양군
ValueCountFrequency (%)
금천구 138
 
7.6%
시흥시 118
 
6.5%
성동구 88
 
4.9%
부천시 64
 
3.5%
화성시 56
 
3.1%
안양시 55
 
3.0%
성남시 54
 
3.0%
구로구 52
 
2.9%
영등포구 49
 
2.7%
동안구 44
 
2.4%
Other values (121) 1089
60.3%
2023-12-12T16:17:02.975232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
996
17.3%
939
16.3%
304
 
5.3%
265
 
4.6%
239
 
4.2%
187
 
3.3%
155
 
2.7%
153
 
2.7%
146
 
2.5%
144
 
2.5%
Other values (96) 2224
38.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5448
94.7%
Space Separator 304
 
5.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
996
18.3%
939
17.2%
265
 
4.9%
239
 
4.4%
187
 
3.4%
155
 
2.8%
153
 
2.8%
146
 
2.7%
144
 
2.6%
139
 
2.6%
Other values (95) 2085
38.3%
Space Separator
ValueCountFrequency (%)
304
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5448
94.7%
Common 304
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
996
18.3%
939
17.2%
265
 
4.9%
239
 
4.4%
187
 
3.4%
155
 
2.8%
153
 
2.8%
146
 
2.7%
144
 
2.6%
139
 
2.6%
Other values (95) 2085
38.3%
Common
ValueCountFrequency (%)
304
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5448
94.7%
ASCII 304
 
5.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
996
18.3%
939
17.2%
265
 
4.9%
239
 
4.4%
187
 
3.4%
155
 
2.8%
153
 
2.8%
146
 
2.7%
144
 
2.6%
139
 
2.6%
Other values (95) 2085
38.3%
ASCII
ValueCountFrequency (%)
304
100.0%
Distinct1466
Distinct (%)97.3%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2023-12-12T16:17:03.417927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length23
Mean length9.2992701
Min length2

Characters and Unicode

Total characters14014
Distinct characters524
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1440 ?
Unique (%)95.6%

Sample

1st row표준공장1동
2nd row표준공장2동
3rd row표준공장3동
4th row양양군 현남면 북분리 425-7, 460-2
5th row양양읍 연창리 180-106, 180-14
ValueCountFrequency (%)
지식산업센터 174
 
7.6%
주식회사 25
 
1.1%
sk 20
 
0.9%
v1 18
 
0.8%
서울숲 14
 
0.6%
center 13
 
0.6%
미정 12
 
0.5%
동탄 10
 
0.4%
tower 9
 
0.4%
타워 8
 
0.4%
Other values (1695) 1982
86.7%
2023-12-12T16:17:04.016132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
790
 
5.6%
427
 
3.0%
426
 
3.0%
398
 
2.8%
397
 
2.8%
396
 
2.8%
384
 
2.7%
342
 
2.4%
331
 
2.4%
319
 
2.3%
Other values (514) 9804
70.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11346
81.0%
Space Separator 790
 
5.6%
Uppercase Letter 752
 
5.4%
Decimal Number 529
 
3.8%
Open Punctuation 160
 
1.1%
Close Punctuation 159
 
1.1%
Lowercase Letter 148
 
1.1%
Dash Punctuation 92
 
0.7%
Other Punctuation 19
 
0.1%
Letter Number 15
 
0.1%
Other values (2) 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
427
 
3.8%
426
 
3.8%
398
 
3.5%
397
 
3.5%
396
 
3.5%
384
 
3.4%
342
 
3.0%
331
 
2.9%
319
 
2.8%
291
 
2.6%
Other values (445) 7635
67.3%
Uppercase Letter
ValueCountFrequency (%)
T 91
 
12.1%
I 88
 
11.7%
S 68
 
9.0%
K 57
 
7.6%
A 46
 
6.1%
C 45
 
6.0%
B 40
 
5.3%
E 33
 
4.4%
N 28
 
3.7%
R 27
 
3.6%
Other values (16) 229
30.5%
Lowercase Letter
ValueCountFrequency (%)
e 42
28.4%
r 23
15.5%
t 19
12.8%
n 15
 
10.1%
w 10
 
6.8%
c 10
 
6.8%
o 9
 
6.1%
i 4
 
2.7%
b 3
 
2.0%
a 3
 
2.0%
Other values (7) 10
 
6.8%
Decimal Number
ValueCountFrequency (%)
1 145
27.4%
2 118
22.3%
3 67
12.7%
5 36
 
6.8%
4 35
 
6.6%
6 34
 
6.4%
9 28
 
5.3%
7 25
 
4.7%
0 21
 
4.0%
8 20
 
3.8%
Other Punctuation
ValueCountFrequency (%)
, 7
36.8%
. 5
26.3%
& 4
21.1%
: 2
 
10.5%
/ 1
 
5.3%
Letter Number
ValueCountFrequency (%)
10
66.7%
4
 
26.7%
1
 
6.7%
Open Punctuation
ValueCountFrequency (%)
( 159
99.4%
[ 1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 158
99.4%
] 1
 
0.6%
Space Separator
ValueCountFrequency (%)
790
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 92
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11347
81.0%
Common 1750
 
12.5%
Latin 915
 
6.5%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
427
 
3.8%
426
 
3.8%
398
 
3.5%
397
 
3.5%
396
 
3.5%
384
 
3.4%
342
 
3.0%
331
 
2.9%
319
 
2.8%
291
 
2.6%
Other values (444) 7636
67.3%
Latin
ValueCountFrequency (%)
T 91
 
9.9%
I 88
 
9.6%
S 68
 
7.4%
K 57
 
6.2%
A 46
 
5.0%
C 45
 
4.9%
e 42
 
4.6%
B 40
 
4.4%
E 33
 
3.6%
N 28
 
3.1%
Other values (36) 377
41.2%
Common
ValueCountFrequency (%)
790
45.1%
( 159
 
9.1%
) 158
 
9.0%
1 145
 
8.3%
2 118
 
6.7%
- 92
 
5.3%
3 67
 
3.8%
5 36
 
2.1%
4 35
 
2.0%
6 34
 
1.9%
Other values (12) 116
 
6.6%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11344
80.9%
ASCII 2650
 
18.9%
Number Forms 15
 
0.1%
None 3
 
< 0.1%
CJK 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
790
29.8%
( 159
 
6.0%
) 158
 
6.0%
1 145
 
5.5%
2 118
 
4.5%
- 92
 
3.5%
T 91
 
3.4%
I 88
 
3.3%
S 68
 
2.6%
3 67
 
2.5%
Other values (55) 874
33.0%
Hangul
ValueCountFrequency (%)
427
 
3.8%
426
 
3.8%
398
 
3.5%
397
 
3.5%
396
 
3.5%
384
 
3.4%
342
 
3.0%
331
 
2.9%
319
 
2.8%
291
 
2.6%
Other values (443) 7633
67.3%
Number Forms
ValueCountFrequency (%)
10
66.7%
4
 
26.7%
1
 
6.7%
None
ValueCountFrequency (%)
3
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%

입지구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
지식산업센터
1507 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지식산업센터
2nd row지식산업센터
3rd row지식산업센터
4th row지식산업센터
5th row지식산업센터

Common Values

ValueCountFrequency (%)
지식산업센터 1507
100.0%

Length

2023-12-12T16:17:04.198639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:17:04.314559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
지식산업센터 1507
100.0%
Distinct1059
Distinct (%)70.3%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2023-12-12T16:17:04.597079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length24
Mean length8.8619774
Min length1

Characters and Unicode

Total characters13355
Distinct characters456
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique951 ?
Unique (%)63.1%

Sample

1st row(국 산업통상자원부)
2nd row(국 산업통상자원부)
3rd row(국)산업통상자원부
4th row스마트산업 주식회사 외 1
5th row(주)스마트라이프 외 1
ValueCountFrequency (%)
주식회사 187
 
10.3%
주)하나자산신탁 36
 
2.0%
케이비부동산신탁(주 35
 
1.9%
주)무궁화신탁 23
 
1.3%
아시아신탁(주 23
 
1.3%
코리아신탁(주 22
 
1.2%
신한자산신탁 21
 
1.2%
한국자산신탁(주 20
 
1.1%
하나자산신탁 19
 
1.0%
주)한국토지신탁 13
 
0.7%
Other values (1106) 1419
78.1%
2023-12-12T16:17:05.034085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1141
 
8.5%
( 906
 
6.8%
) 906
 
6.8%
462
 
3.5%
407
 
3.0%
390
 
2.9%
352
 
2.6%
322
 
2.4%
295
 
2.2%
290
 
2.2%
Other values (446) 7884
59.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10999
82.4%
Open Punctuation 906
 
6.8%
Close Punctuation 906
 
6.8%
Space Separator 322
 
2.4%
Decimal Number 90
 
0.7%
Uppercase Letter 64
 
0.5%
Other Symbol 36
 
0.3%
Other Punctuation 14
 
0.1%
Dash Punctuation 11
 
0.1%
Lowercase Letter 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1141
 
10.4%
462
 
4.2%
407
 
3.7%
390
 
3.5%
352
 
3.2%
295
 
2.7%
290
 
2.6%
270
 
2.5%
243
 
2.2%
233
 
2.1%
Other values (408) 6916
62.9%
Uppercase Letter
ValueCountFrequency (%)
K 12
18.8%
I 7
10.9%
B 7
10.9%
S 6
9.4%
N 6
9.4%
C 6
9.4%
E 4
 
6.2%
T 4
 
6.2%
G 3
 
4.7%
A 3
 
4.7%
Other values (5) 6
9.4%
Decimal Number
ValueCountFrequency (%)
1 26
28.9%
2 20
22.2%
5 9
 
10.0%
6 7
 
7.8%
8 7
 
7.8%
7 5
 
5.6%
0 5
 
5.6%
3 5
 
5.6%
4 4
 
4.4%
9 2
 
2.2%
Lowercase Letter
ValueCountFrequency (%)
e 2
33.3%
c 1
16.7%
m 1
16.7%
g 1
16.7%
n 1
16.7%
Other Punctuation
ValueCountFrequency (%)
, 10
71.4%
. 4
 
28.6%
Open Punctuation
ValueCountFrequency (%)
( 906
100.0%
Close Punctuation
ValueCountFrequency (%)
) 906
100.0%
Space Separator
ValueCountFrequency (%)
322
100.0%
Other Symbol
ValueCountFrequency (%)
36
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11035
82.6%
Common 2250
 
16.8%
Latin 70
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1141
 
10.3%
462
 
4.2%
407
 
3.7%
390
 
3.5%
352
 
3.2%
295
 
2.7%
290
 
2.6%
270
 
2.4%
243
 
2.2%
233
 
2.1%
Other values (409) 6952
63.0%
Latin
ValueCountFrequency (%)
K 12
17.1%
I 7
10.0%
B 7
10.0%
S 6
8.6%
N 6
8.6%
C 6
8.6%
E 4
 
5.7%
T 4
 
5.7%
G 3
 
4.3%
A 3
 
4.3%
Other values (10) 12
17.1%
Common
ValueCountFrequency (%)
( 906
40.3%
) 906
40.3%
322
 
14.3%
1 26
 
1.2%
2 20
 
0.9%
- 11
 
0.5%
, 10
 
0.4%
5 9
 
0.4%
6 7
 
0.3%
8 7
 
0.3%
Other values (7) 26
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10999
82.4%
ASCII 2320
 
17.4%
None 36
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1141
 
10.4%
462
 
4.2%
407
 
3.7%
390
 
3.5%
352
 
3.2%
295
 
2.7%
290
 
2.6%
270
 
2.5%
243
 
2.2%
233
 
2.1%
Other values (408) 6916
62.9%
ASCII
ValueCountFrequency (%)
( 906
39.1%
) 906
39.1%
322
 
13.9%
1 26
 
1.1%
2 20
 
0.9%
K 12
 
0.5%
- 11
 
0.5%
, 10
 
0.4%
5 9
 
0.4%
I 7
 
0.3%
Other values (27) 91
 
3.9%
None
ValueCountFrequency (%)
36
100.0%

등록구분
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
등록
983 
승인
524 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row승인
2nd row승인
3rd row등록
4th row승인
5th row승인

Common Values

ValueCountFrequency (%)
등록 983
65.2%
승인 524
34.8%

Length

2023-12-12T16:17:05.187459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:17:05.297684image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
등록 983
65.2%
승인 524
34.8%

단지명
Text

MISSING 

Distinct86
Distinct (%)14.8%
Missing924
Missing (%)61.3%
Memory size11.9 KiB
2023-12-12T16:17:05.535095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length10.560892
Min length4

Characters and Unicode

Total characters6157
Distinct characters157
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)6.3%

Sample

1st row동해자유무역지역
2nd row동해자유무역지역
3rd row동해자유무역지역
4th row춘천후평일반산업단지
5th row춘천후평일반산업단지
ValueCountFrequency (%)
서울디지털국가산업단지 160
27.2%
성남일반산업단지 42
 
7.1%
파주출판문화정보국가산업단지 36
 
6.1%
반월국가산업단지 26
 
4.4%
남동국가산업단지 23
 
3.9%
홍성내포도시첨단산업단지 21
 
3.6%
광주첨단과학국가산업단지 17
 
2.9%
부산센텀시티일반산업단지 17
 
2.9%
성서지방산업단지 16
 
2.7%
창원국가산업단지 16
 
2.7%
Other values (77) 214
36.4%
2023-12-12T16:17:05.997621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
767
 
12.5%
627
 
10.2%
619
 
10.1%
580
 
9.4%
346
 
5.6%
326
 
5.3%
184
 
3.0%
183
 
3.0%
164
 
2.7%
160
 
2.6%
Other values (147) 2201
35.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6040
98.1%
Decimal Number 35
 
0.6%
Close Punctuation 28
 
0.5%
Open Punctuation 28
 
0.5%
Space Separator 10
 
0.2%
Uppercase Letter 9
 
0.1%
Other Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
767
 
12.7%
627
 
10.4%
619
 
10.2%
580
 
9.6%
346
 
5.7%
326
 
5.4%
184
 
3.0%
183
 
3.0%
164
 
2.7%
160
 
2.6%
Other values (134) 2084
34.5%
Decimal Number
ValueCountFrequency (%)
2 24
68.6%
1 5
 
14.3%
4 3
 
8.6%
3 2
 
5.7%
5 1
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
H 3
33.3%
P 3
33.3%
I 3
33.3%
Other Punctuation
ValueCountFrequency (%)
. 4
57.1%
, 3
42.9%
Close Punctuation
ValueCountFrequency (%)
) 28
100.0%
Open Punctuation
ValueCountFrequency (%)
( 28
100.0%
Space Separator
ValueCountFrequency (%)
10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6040
98.1%
Common 108
 
1.8%
Latin 9
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
767
 
12.7%
627
 
10.4%
619
 
10.2%
580
 
9.6%
346
 
5.7%
326
 
5.4%
184
 
3.0%
183
 
3.0%
164
 
2.7%
160
 
2.6%
Other values (134) 2084
34.5%
Common
ValueCountFrequency (%)
) 28
25.9%
( 28
25.9%
2 24
22.2%
10
 
9.3%
1 5
 
4.6%
. 4
 
3.7%
4 3
 
2.8%
, 3
 
2.8%
3 2
 
1.9%
5 1
 
0.9%
Latin
ValueCountFrequency (%)
H 3
33.3%
P 3
33.3%
I 3
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6040
98.1%
ASCII 117
 
1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
767
 
12.7%
627
 
10.4%
619
 
10.2%
580
 
9.6%
346
 
5.7%
326
 
5.4%
184
 
3.0%
183
 
3.0%
164
 
2.7%
160
 
2.6%
Other values (134) 2084
34.5%
ASCII
ValueCountFrequency (%)
) 28
23.9%
( 28
23.9%
2 24
20.5%
10
 
8.5%
1 5
 
4.3%
. 4
 
3.4%
4 3
 
2.6%
H 3
 
2.6%
P 3
 
2.6%
I 3
 
2.6%
Other values (3) 6
 
5.1%
Distinct155
Distinct (%)10.3%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2023-12-12T16:17:06.301859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length10.186463
Min length3

Characters and Unicode

Total characters15351
Distinct characters146
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)3.5%

Sample

1st row자료없음
2nd row자료없음
3rd row자료없음
4th row강원특별자치도 양양군
5th row강원특별자치도 양양군
ValueCountFrequency (%)
경기도 512
17.9%
한국산업단지공단 291
 
10.2%
서울특별시 228
 
8.0%
서울지역본부 160
 
5.6%
시흥시 105
 
3.7%
성동구 87
 
3.0%
부천시 62
 
2.2%
안양시 55
 
1.9%
화성시 53
 
1.9%
영등포구 48
 
1.7%
Other values (155) 1256
44.0%
2023-12-12T16:17:06.788696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1350
 
8.8%
1078
 
7.0%
869
 
5.7%
779
 
5.1%
679
 
4.4%
661
 
4.3%
602
 
3.9%
545
 
3.6%
483
 
3.1%
458
 
3.0%
Other values (136) 7847
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13891
90.5%
Space Separator 1350
 
8.8%
Close Punctuation 53
 
0.3%
Open Punctuation 53
 
0.3%
Other Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1078
 
7.8%
869
 
6.3%
779
 
5.6%
679
 
4.9%
661
 
4.8%
602
 
4.3%
545
 
3.9%
483
 
3.5%
458
 
3.3%
451
 
3.2%
Other values (132) 7286
52.5%
Space Separator
ValueCountFrequency (%)
1350
100.0%
Close Punctuation
ValueCountFrequency (%)
) 53
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%
Other Punctuation
ValueCountFrequency (%)
· 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13891
90.5%
Common 1460
 
9.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1078
 
7.8%
869
 
6.3%
779
 
5.6%
679
 
4.9%
661
 
4.8%
602
 
4.3%
545
 
3.9%
483
 
3.5%
458
 
3.3%
451
 
3.2%
Other values (132) 7286
52.5%
Common
ValueCountFrequency (%)
1350
92.5%
) 53
 
3.6%
( 53
 
3.6%
· 4
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13891
90.5%
ASCII 1456
 
9.5%
None 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1350
92.7%
) 53
 
3.6%
( 53
 
3.6%
Hangul
ValueCountFrequency (%)
1078
 
7.8%
869
 
6.3%
779
 
5.6%
679
 
4.9%
661
 
4.8%
602
 
4.3%
545
 
3.9%
483
 
3.5%
458
 
3.3%
451
 
3.2%
Other values (132) 7286
52.5%
None
ValueCountFrequency (%)
· 4
100.0%

산단구분
Categorical

Distinct6
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
개별
924 
국가산업단지
329 
지방산업단지
197 
도시첨단산업단지
 
52
자유무역지역
 
4

Length

Max length8
Median length2
Mean length3.6151294
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row자유무역지역
2nd row자유무역지역
3rd row자유무역지역
4th row개별
5th row개별

Common Values

ValueCountFrequency (%)
개별 924
61.3%
국가산업단지 329
 
21.8%
지방산업단지 197
 
13.1%
도시첨단산업단지 52
 
3.5%
자유무역지역 4
 
0.3%
농공단지 1
 
0.1%

Length

2023-12-12T16:17:07.011817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:17:07.133544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개별 924
61.3%
국가산업단지 329
 
21.8%
지방산업단지 197
 
13.1%
도시첨단산업단지 52
 
3.5%
자유무역지역 4
 
0.3%
농공단지 1
 
0.1%

상태
Categorical

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
[완료신고]
745 
[신설]승인
409 
[변경완료신고]
238 
[신설변경]승인
 
65
[분양공고안]승인
 
50

Length

Max length9
Median length6
Mean length6.5016589
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row[신설]승인
2nd row[신설]승인
3rd row[완료신고]
4th row[신설]승인
5th row[신설]승인

Common Values

ValueCountFrequency (%)
[완료신고] 745
49.4%
[신설]승인 409
27.1%
[변경완료신고] 238
 
15.8%
[신설변경]승인 65
 
4.3%
[분양공고안]승인 50
 
3.3%

Length

2023-12-12T16:17:07.287582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:17:07.438576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
완료신고 745
49.4%
신설]승인 409
27.1%
변경완료신고 238
 
15.8%
신설변경]승인 65
 
4.3%
분양공고안]승인 50
 
3.3%

지목
Categorical

IMBALANCE 

Distinct46
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
공장용지
723 
507 
대, 대
 
50
공장용지, 공장용지
 
45
<NA>
 
38
Other values (41)
144 

Length

Max length110
Median length4
Mean length4.0630392
Min length1

Unique

Unique23 ?
Unique (%)1.5%

Sample

1st row공장용지
2nd row공장용지
3rd row공장용지
4th row대, 대
5th row잡종지, 잡종지

Common Values

ValueCountFrequency (%)
공장용지 723
48.0%
507
33.6%
대, 대 50
 
3.3%
공장용지, 공장용지 45
 
3.0%
<NA> 38
 
2.5%
공장용지, 공장용지, 공장용지 20
 
1.3%
대, 대, 대 17
 
1.1%
16
 
1.1%
잡종지 11
 
0.7%
11
 
0.7%
Other values (36) 69
 
4.6%

Length

2023-12-12T16:17:07.590084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
공장용지 973
49.0%
804
40.5%
na 38
 
1.9%
37
 
1.9%
잡종지 36
 
1.8%
35
 
1.8%
도로 33
 
1.7%
임야 16
 
0.8%
주차장 5
 
0.3%
창고용지 3
 
0.2%
Other values (4) 4
 
0.2%

용지면적
Real number (ℝ)

ZEROS 

Distinct1394
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8006.4879
Minimum0
Maximum234210
Zeros54
Zeros (%)3.6%
Negative0
Negative (%)0.0%
Memory size13.4 KiB
2023-12-12T16:17:07.764167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile507.8
Q12327.1
median5192
Q39486.35
95-th percentile25658.637
Maximum234210
Range234210
Interquartile range (IQR)7159.25

Descriptive statistics

Standard deviation12186.755
Coefficient of variation (CV)1.52211
Kurtosis110.80735
Mean8006.4879
Median Absolute Deviation (MAD)3311
Skewness8.1287155
Sum12065777
Variance1.4851701 × 108
MonotonicityNot monotonic
2023-12-12T16:17:07.915435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 54
 
3.6%
3306.0 6
 
0.4%
4959.0 5
 
0.3%
6086.0 4
 
0.3%
18231.3 3
 
0.2%
3305.0 3
 
0.2%
3012.0 2
 
0.1%
5271.0 2
 
0.1%
840.6 2
 
0.1%
1001.0 2
 
0.1%
Other values (1384) 1424
94.5%
ValueCountFrequency (%)
0.0 54
3.6%
14.81 1
 
0.1%
55.45 1
 
0.1%
56.21 1
 
0.1%
118.14 1
 
0.1%
147.1 1
 
0.1%
195.0 1
 
0.1%
232.32 2
 
0.1%
267.5 1
 
0.1%
324.0 1
 
0.1%
ValueCountFrequency (%)
234210.0 1
0.1%
165344.0 1
0.1%
122853.0 1
0.1%
101474.59 1
0.1%
99997.0 1
0.1%
99173.8 1
0.1%
90804.6 1
0.1%
73273.0 1
0.1%
70500.0 1
0.1%
67281.6 1
0.1%

건축면적
Real number (ℝ)

SKEWED  ZEROS 

Distinct1451
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean40989.053
Minimum0
Maximum2977625.5
Zeros41
Zeros (%)2.7%
Negative0
Negative (%)0.0%
Memory size13.4 KiB
2023-12-12T16:17:08.054410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1081.254
Q18695.38
median26129.38
Q353942.25
95-th percentile121021.52
Maximum2977625.5
Range2977625.5
Interquartile range (IQR)45246.87

Descriptive statistics

Standard deviation88228.54
Coefficient of variation (CV)2.1524903
Kurtosis816.08096
Mean40989.053
Median Absolute Deviation (MAD)20027.08
Skewness24.859029
Sum61770503
Variance7.7842752 × 109
MonotonicityNot monotonic
2023-12-12T16:17:08.228398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 41
 
2.7%
32343.279 2
 
0.1%
425.25 2
 
0.1%
4958.0 2
 
0.1%
16103.65 2
 
0.1%
64116.0 2
 
0.1%
64154.77 2
 
0.1%
238482.62 2
 
0.1%
60262.04 2
 
0.1%
39145.96 2
 
0.1%
Other values (1441) 1448
96.1%
ValueCountFrequency (%)
0.0 41
2.7%
70.65 1
 
0.1%
130.5 1
 
0.1%
140.89 1
 
0.1%
240.5 1
 
0.1%
243.64 1
 
0.1%
387.1 1
 
0.1%
417.66 1
 
0.1%
425.25 2
 
0.1%
431.06 1
 
0.1%
ValueCountFrequency (%)
2977625.47 1
0.1%
360107.56 1
0.1%
331601.67 1
0.1%
330282.14 1
0.1%
291184.06 1
0.1%
287024.48 1
0.1%
281713.42 1
0.1%
280707.71 1
0.1%
270405.8 1
0.1%
269190.25 1
0.1%

제조면적
Real number (ℝ)

SKEWED  ZEROS 

Distinct1436
Distinct (%)95.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean29434.781
Minimum0
Maximum2942430
Zeros56
Zeros (%)3.7%
Negative0
Negative (%)0.0%
Memory size13.4 KiB
2023-12-12T16:17:08.387106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile517.591
Q15068.67
median16874.91
Q337244.61
95-th percentile89929.078
Maximum2942430
Range2942430
Interquartile range (IQR)32175.94

Descriptive statistics

Standard deviation82505.689
Coefficient of variation (CV)2.803
Kurtosis1033.443
Mean29434.781
Median Absolute Deviation (MAD)13508.507
Skewness29.472871
Sum44358215
Variance6.8071887 × 109
MonotonicityNot monotonic
2023-12-12T16:17:08.520556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 56
 
3.7%
43211.23 2
 
0.1%
4759.498 2
 
0.1%
36249.86 2
 
0.1%
4775.06 2
 
0.1%
13473.02 2
 
0.1%
16103.65 2
 
0.1%
3306.0 2
 
0.1%
93591.91 2
 
0.1%
25911.467 2
 
0.1%
Other values (1426) 1433
95.1%
ValueCountFrequency (%)
0.0 56
3.7%
32.81 1
 
0.1%
39.87 1
 
0.1%
76.65 1
 
0.1%
103.2 1
 
0.1%
145.75 1
 
0.1%
147.73 1
 
0.1%
233.2 1
 
0.1%
372.75 1
 
0.1%
391.98 1
 
0.1%
ValueCountFrequency (%)
2942430.0 1
0.1%
346480.67 1
0.1%
315161.09 1
0.1%
233037.99 1
0.1%
231684.61 1
0.1%
223520.69 1
0.1%
219786.54 1
0.1%
219615.29 1
0.1%
213491.79 1
0.1%
208644.41 1
0.1%

부대면적
Real number (ℝ)

ZEROS 

Distinct1411
Distinct (%)93.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11554.272
Minimum0
Maximum169303.08
Zeros76
Zeros (%)5.0%
Negative0
Negative (%)0.0%
Memory size13.4 KiB
2023-12-12T16:17:08.928389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.003
Q11363.535
median5167.29
Q314685.25
95-th percentile45213.014
Maximum169303.08
Range169303.08
Interquartile range (IQR)13321.715

Descriptive statistics

Standard deviation17378.001
Coefficient of variation (CV)1.5040325
Kurtosis16.888022
Mean11554.272
Median Absolute Deviation (MAD)4579.33
Skewness3.4386826
Sum17412288
Variance3.0199493 × 108
MonotonicityNot monotonic
2023-12-12T16:17:09.053845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 76
 
5.0%
0.01 7
 
0.5%
1652.0 2
 
0.1%
1.0 2
 
0.1%
7745.05 2
 
0.1%
18650.0 2
 
0.1%
4811.34 2
 
0.1%
144890.71 2
 
0.1%
25672.94 2
 
0.1%
4321.15 2
 
0.1%
Other values (1401) 1408
93.4%
ValueCountFrequency (%)
0.0 76
5.0%
0.01 7
 
0.5%
0.02 1
 
0.1%
0.1 1
 
0.1%
1.0 2
 
0.1%
24.1 1
 
0.1%
27.3 1
 
0.1%
30.78 1
 
0.1%
42.12 1
 
0.1%
42.17 1
 
0.1%
ValueCountFrequency (%)
169303.08 1
0.1%
144890.71 2
0.1%
134599.6 1
0.1%
117228.31 1
0.1%
112821.58 1
0.1%
110393.58 1
0.1%
110138.81 1
0.1%
101627.25 1
0.1%
100877.49 1
0.1%
98563.68 1
0.1%
Distinct1282
Distinct (%)97.6%
Missing194
Missing (%)12.9%
Memory size11.9 KiB
2023-12-12T16:17:09.336608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length71
Median length52
Mean length30.671744
Min length14

Characters and Unicode

Total characters40272
Distinct characters464
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1260 ?
Unique (%)96.0%

Sample

1st row강원특별자치도 동해시 공단1로 177 (구호동)
2nd row강원특별자치도 동해시 공단1로 177 (구호동)
3rd row강원특별자치도 동해시 공단1로 177 (구호동)
4th row강원특별자치도 양양군 양양읍 동해대로 2662 외 1필지
5th row강원특별자치도 원주시 혁신로 19
ValueCountFrequency (%)
경기도 599
 
7.5%
서울특별시 377
 
4.7%
197
 
2.5%
금천구 138
 
1.7%
가산동 111
 
1.4%
시흥시 108
 
1.4%
1필지 99
 
1.2%
성동구 87
 
1.1%
인천광역시 77
 
1.0%
성수동2가 62
 
0.8%
Other values (2488) 6118
76.7%
2023-12-12T16:17:09.802338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6664
 
16.5%
1513
 
3.8%
1500
 
3.7%
1 1245
 
3.1%
1240
 
3.1%
) 1191
 
3.0%
( 1190
 
3.0%
1033
 
2.6%
2 878
 
2.2%
779
 
1.9%
Other values (454) 23039
57.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 24273
60.3%
Space Separator 6664
 
16.5%
Decimal Number 5660
 
14.1%
Close Punctuation 1197
 
3.0%
Open Punctuation 1196
 
3.0%
Other Punctuation 614
 
1.5%
Dash Punctuation 346
 
0.9%
Uppercase Letter 266
 
0.7%
Lowercase Letter 32
 
0.1%
Letter Number 19
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1513
 
6.2%
1500
 
6.2%
1240
 
5.1%
1033
 
4.3%
779
 
3.2%
678
 
2.8%
667
 
2.7%
666
 
2.7%
597
 
2.5%
554
 
2.3%
Other values (394) 15046
62.0%
Uppercase Letter
ValueCountFrequency (%)
I 37
13.9%
T 35
13.2%
B 25
 
9.4%
L 23
 
8.6%
E 17
 
6.4%
S 15
 
5.6%
K 15
 
5.6%
A 12
 
4.5%
R 12
 
4.5%
O 11
 
4.1%
Other values (14) 64
24.1%
Decimal Number
ValueCountFrequency (%)
1 1245
22.0%
2 878
15.5%
3 711
12.6%
5 522
9.2%
4 459
 
8.1%
6 452
 
8.0%
0 409
 
7.2%
7 367
 
6.5%
9 315
 
5.6%
8 302
 
5.3%
Lowercase Letter
ValueCountFrequency (%)
e 11
34.4%
r 5
15.6%
n 5
15.6%
t 4
 
12.5%
c 3
 
9.4%
u 1
 
3.1%
o 1
 
3.1%
w 1
 
3.1%
y 1
 
3.1%
Other Punctuation
ValueCountFrequency (%)
, 602
98.0%
/ 6
 
1.0%
. 5
 
0.8%
& 1
 
0.2%
Letter Number
ValueCountFrequency (%)
10
52.6%
5
26.3%
4
 
21.1%
Close Punctuation
ValueCountFrequency (%)
) 1191
99.5%
] 6
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 1190
99.5%
[ 6
 
0.5%
Other Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
6664
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 346
100.0%
Control
ValueCountFrequency (%)
2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 24273
60.3%
Common 15682
38.9%
Latin 317
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1513
 
6.2%
1500
 
6.2%
1240
 
5.1%
1033
 
4.3%
779
 
3.2%
678
 
2.8%
667
 
2.7%
666
 
2.7%
597
 
2.5%
554
 
2.3%
Other values (394) 15046
62.0%
Latin
ValueCountFrequency (%)
I 37
 
11.7%
T 35
 
11.0%
B 25
 
7.9%
L 23
 
7.3%
E 17
 
5.4%
S 15
 
4.7%
K 15
 
4.7%
A 12
 
3.8%
R 12
 
3.8%
e 11
 
3.5%
Other values (26) 115
36.3%
Common
ValueCountFrequency (%)
6664
42.5%
1 1245
 
7.9%
) 1191
 
7.6%
( 1190
 
7.6%
2 878
 
5.6%
3 711
 
4.5%
, 602
 
3.8%
5 522
 
3.3%
4 459
 
2.9%
6 452
 
2.9%
Other values (14) 1768
 
11.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 24273
60.3%
ASCII 15978
39.7%
Number Forms 19
 
< 0.1%
Enclosed Alphanum 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6664
41.7%
1 1245
 
7.8%
) 1191
 
7.5%
( 1190
 
7.4%
2 878
 
5.5%
3 711
 
4.4%
, 602
 
3.8%
5 522
 
3.3%
4 459
 
2.9%
6 452
 
2.8%
Other values (45) 2064
 
12.9%
Hangul
ValueCountFrequency (%)
1513
 
6.2%
1500
 
6.2%
1240
 
5.1%
1033
 
4.3%
779
 
3.2%
678
 
2.8%
667
 
2.7%
666
 
2.7%
597
 
2.5%
554
 
2.3%
Other values (394) 15046
62.0%
Number Forms
ValueCountFrequency (%)
10
52.6%
5
26.3%
4
 
21.1%
Enclosed Alphanum
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct1474
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2023-12-12T16:17:10.085451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length55
Mean length24.339084
Min length10

Characters and Unicode

Total characters36679
Distinct characters396
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1448 ?
Unique (%)96.1%

Sample

1st row강원특별자치도 동해시 구호동 247번지
2nd row강원특별자치도 동해시 구호동 247번지
3rd row강원특별자치도 동해시 구호동 247번지
4th row강원특별자치도 양양군 현남면 북분리 425-7 외 1필지
5th row강원특별자치도 양양군 양양읍 연창리 180-106 외 1필지
ValueCountFrequency (%)
경기도 685
 
9.0%
서울특별시 392
 
5.1%
240
 
3.1%
금천구 138
 
1.8%
1필지 125
 
1.6%
가산동 118
 
1.5%
시흥시 118
 
1.5%
성동구 89
 
1.2%
인천광역시 81
 
1.1%
부산광역시 68
 
0.9%
Other values (2481) 5596
73.2%
2023-12-12T16:17:10.519758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6167
 
16.8%
1711
 
4.7%
1649
 
4.5%
1 1467
 
4.0%
1430
 
3.9%
- 1241
 
3.4%
1128
 
3.1%
2 1038
 
2.8%
984
 
2.7%
927
 
2.5%
Other values (386) 18937
51.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21660
59.1%
Decimal Number 7104
 
19.4%
Space Separator 6167
 
16.8%
Dash Punctuation 1241
 
3.4%
Uppercase Letter 210
 
0.6%
Close Punctuation 98
 
0.3%
Open Punctuation 97
 
0.3%
Other Punctuation 76
 
0.2%
Lowercase Letter 19
 
0.1%
Letter Number 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1711
 
7.9%
1649
 
7.6%
1430
 
6.6%
1128
 
5.2%
984
 
4.5%
927
 
4.3%
761
 
3.5%
719
 
3.3%
518
 
2.4%
435
 
2.0%
Other values (333) 11398
52.6%
Uppercase Letter
ValueCountFrequency (%)
B 51
24.3%
L 47
22.4%
I 18
 
8.6%
T 15
 
7.1%
K 9
 
4.3%
S 8
 
3.8%
A 8
 
3.8%
E 7
 
3.3%
C 6
 
2.9%
N 5
 
2.4%
Other values (14) 36
17.1%
Decimal Number
ValueCountFrequency (%)
1 1467
20.7%
2 1038
14.6%
3 767
10.8%
4 665
9.4%
5 637
9.0%
6 599
8.4%
7 534
 
7.5%
0 507
 
7.1%
9 447
 
6.3%
8 443
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
e 7
36.8%
t 3
15.8%
r 3
15.8%
n 3
15.8%
c 3
15.8%
Other Punctuation
ValueCountFrequency (%)
, 66
86.8%
/ 5
 
6.6%
. 4
 
5.3%
& 1
 
1.3%
Letter Number
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%
Close Punctuation
ValueCountFrequency (%)
) 93
94.9%
] 5
 
5.1%
Open Punctuation
ValueCountFrequency (%)
( 92
94.8%
[ 5
 
5.2%
Space Separator
ValueCountFrequency (%)
6167
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1241
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21660
59.1%
Common 14784
40.3%
Latin 235
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1711
 
7.9%
1649
 
7.6%
1430
 
6.6%
1128
 
5.2%
984
 
4.5%
927
 
4.3%
761
 
3.5%
719
 
3.3%
518
 
2.4%
435
 
2.0%
Other values (333) 11398
52.6%
Latin
ValueCountFrequency (%)
B 51
21.7%
L 47
20.0%
I 18
 
7.7%
T 15
 
6.4%
K 9
 
3.8%
S 8
 
3.4%
A 8
 
3.4%
e 7
 
3.0%
E 7
 
3.0%
C 6
 
2.6%
Other values (22) 59
25.1%
Common
ValueCountFrequency (%)
6167
41.7%
1 1467
 
9.9%
- 1241
 
8.4%
2 1038
 
7.0%
3 767
 
5.2%
4 665
 
4.5%
5 637
 
4.3%
6 599
 
4.1%
7 534
 
3.6%
0 507
 
3.4%
Other values (11) 1162
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21660
59.1%
ASCII 15013
40.9%
Number Forms 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6167
41.1%
1 1467
 
9.8%
- 1241
 
8.3%
2 1038
 
6.9%
3 767
 
5.1%
4 665
 
4.4%
5 637
 
4.2%
6 599
 
4.0%
7 534
 
3.6%
0 507
 
3.4%
Other values (40) 1391
 
9.3%
Hangul
ValueCountFrequency (%)
1711
 
7.9%
1649
 
7.6%
1430
 
6.6%
1128
 
5.2%
984
 
4.5%
927
 
4.3%
761
 
3.5%
719
 
3.3%
518
 
2.4%
435
 
2.0%
Other values (333) 11398
52.6%
Number Forms
ValueCountFrequency (%)
3
50.0%
2
33.3%
1
 
16.7%

분양형태
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
분양
644 
<NA>
529 
임대
169 
분양/임대
165 

Length

Max length5
Median length2
Mean length3.0305242
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row임대
3rd row임대
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
분양 644
42.7%
<NA> 529
35.1%
임대 169
 
11.2%
분양/임대 165
 
10.9%

Length

2023-12-12T16:17:10.684123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:17:10.807588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
분양 644
42.7%
na 529
35.1%
임대 169
 
11.2%
분양/임대 165
 
10.9%

건축상태
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
건축완료
767 
미착공
327 
<NA>
323 
건축중
90 

Length

Max length4
Median length4
Mean length3.7232913
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건축완료
2nd row건축완료
3rd row건축완료
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
건축완료 767
50.9%
미착공 327
21.7%
<NA> 323
21.4%
건축중 90
 
6.0%

Length

2023-12-12T16:17:10.944810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:17:11.088704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건축완료 767
50.9%
미착공 327
21.7%
na 323
21.4%
건축중 90
 
6.0%

용도지역1
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
도시지역
1463 
<NA>
 
31
관리지역
 
13

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row도시지역
2nd row도시지역
3rd row도시지역
4th row관리지역
5th row도시지역

Common Values

ValueCountFrequency (%)
도시지역 1463
97.1%
<NA> 31
 
2.1%
관리지역 13
 
0.9%

Length

2023-12-12T16:17:11.218765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:17:11.316982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도시지역 1463
97.1%
na 31
 
2.1%
관리지역 13
 
0.9%

용도지역2
Categorical

Distinct7
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
공업지역
938 
주거지역
432 
상업지역
 
77
<NA>
 
33
녹지지역
 
15
Other values (2)
 
12

Length

Max length6
Median length4
Mean length4.0159257
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공업지역
2nd row공업지역
3rd row공업지역
4th row계획관리지역
5th row주거지역

Common Values

ValueCountFrequency (%)
공업지역 938
62.2%
주거지역 432
28.7%
상업지역 77
 
5.1%
<NA> 33
 
2.2%
녹지지역 15
 
1.0%
계획관리지역 10
 
0.7%
관리지역기타 2
 
0.1%

Length

2023-12-12T16:17:11.450233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:17:11.573341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공업지역 938
62.2%
주거지역 432
28.7%
상업지역 77
 
5.1%
na 33
 
2.2%
녹지지역 15
 
1.0%
계획관리지역 10
 
0.7%
관리지역기타 2
 
0.1%

설치자
Categorical

IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
민간
1393 
공공
 
113
<NA>
 
1

Length

Max length4
Median length2
Mean length2.0013271
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row공공
2nd row공공
3rd row공공
4th row민간
5th row민간

Common Values

ValueCountFrequency (%)
민간 1393
92.4%
공공 113
 
7.5%
<NA> 1
 
0.1%

Length

2023-12-12T16:17:11.690892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:17:11.793192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
민간 1393
92.4%
공공 113
 
7.5%
na 1
 
0.1%

Sample

시도시군구지식산업센터명입지구분회사명등록구분단지명관할기관산단구분상태지목용지면적건축면적제조면적부대면적공장대표주소(도로명)공장대표주소(지번)분양형태건축상태용도지역1용도지역2설치자
0강원특별자치도동해시표준공장1동지식산업센터(국 산업통상자원부)승인동해자유무역지역자료없음자유무역지역[신설]승인공장용지2272.64273.833101.581172.25강원특별자치도 동해시 공단1로 177 (구호동)강원특별자치도 동해시 구호동 247번지<NA>건축완료도시지역공업지역공공
1강원특별자치도동해시표준공장2동지식산업센터(국 산업통상자원부)승인동해자유무역지역자료없음자유무역지역[신설]승인공장용지3389.657046.544913.972132.57강원특별자치도 동해시 공단1로 177 (구호동)강원특별자치도 동해시 구호동 247번지임대건축완료도시지역공업지역공공
2강원특별자치도동해시표준공장3동지식산업센터(국)산업통상자원부등록동해자유무역지역자료없음자유무역지역[완료신고]공장용지9468.012207.5412207.540.0강원특별자치도 동해시 공단1로 177 (구호동)강원특별자치도 동해시 구호동 247번지임대건축완료도시지역공업지역공공
3강원특별자치도양양군양양군 현남면 북분리 425-7, 460-2지식산업센터스마트산업 주식회사 외 1승인<NA>강원특별자치도 양양군개별[신설]승인대, 대9684.021498.097618.4713879.62<NA>강원특별자치도 양양군 현남면 북분리 425-7 외 1필지<NA><NA>관리지역계획관리지역민간
4강원특별자치도양양군양양읍 연창리 180-106, 180-14지식산업센터(주)스마트라이프 외 1승인<NA>강원특별자치도 양양군개별[신설]승인잡종지, 잡종지2384.022078.927732.4414346.48강원특별자치도 양양군 양양읍 동해대로 2662 외 1필지강원특별자치도 양양군 양양읍 연창리 180-106 외 1필지<NA><NA>도시지역주거지역민간
5강원특별자치도원주시B&I지식산업센터원주지식산업센터한국투자부동산신탁 주식회사승인<NA>강원특별자치도 원주시개별[신설]승인10480.058196.745545.9912650.71<NA>강원특별자치도 원주시 지정면 가곡리 1335<NA>미착공도시지역공업지역민간
6강원특별자치도원주시H타워지식산업센터(주)해피룸등록<NA>강원특별자치도 원주시개별[변경완료신고]3212.119190.45413507.35683.154강원특별자치도 원주시 혁신로 19강원특별자치도 원주시 반곡동 1860-5번지분양/임대건축완료도시지역주거지역민간
7강원특별자치도원주시강원혁신지식산업센터지식산업센터강원도청등록<NA>강원특별자치도 원주시개별[완료신고]4843.813804.1911096.542707.65강원특별자치도 원주시 세계로 9강원특별자치도 원주시 반곡동 1913-12번지임대건축완료도시지역주거지역공공
8강원특별자치도원주시미정지식산업센터이케이홀딩스 주식회사승인<NA>강원특별자치도 원주시개별[신설]승인공장용지10092.565782.87156908.9458873.926<NA>강원특별자치도 원주시 지정면 신평리 1106-2<NA>미착공도시지역공업지역민간
9강원특별자치도원주시선진지식산업센터지식산업센터코리아신탁 주식회사승인<NA>강원특별자치도 원주시개별[신설]승인13344.299998.3489814.5310183.81<NA>강원특별자치도 원주시 반곡동 1858-4<NA>미착공도시지역주거지역민간
시도시군구지식산업센터명입지구분회사명등록구분단지명관할기관산단구분상태지목용지면적건축면적제조면적부대면적공장대표주소(도로명)공장대표주소(지번)분양형태건축상태용도지역1용도지역2설치자
1497충청북도청주시 청원구주식회사 더청림지식산업센터주식회사 더청림승인<NA>충청북도 청주시개별[신설]승인대, 대1350.116550.479145.627404.85<NA>충청북도 청주시 청원구 오창읍 양청리 809-3 외 1필지분양미착공도시지역상업지역민간
1498충청북도청주시 청원구에코바로개발지식산업센터에코바로개발(주)승인청주오창과학일반산업단지오창과학산업단지관리공단지방산업단지[신설]승인공장용지7457.643679.3615467.2828212.08충청북도 청주시 청원구 오창읍 양청송대길 18충청북도 청주시 청원구 오창읍 양청리 810-7<NA><NA>도시지역공업지역민간
1499충청북도청주시 청원구청주 청원구 오창읍 각리 644-5 지식산업센터지식산업센터(주)명정보기술승인청주오창과학일반산업단지오창과학산업단지관리공단지방산업단지[신설]승인공장용지9848.288745.0339957.86648787.164충청북도 청주시 청원구 오창읍 과학산업3로 168충청북도 청주시 청원구 오창읍 각리 644-5분양<NA>도시지역공업지역민간
1500충청북도청주시 청원구청주미래누리터지식산업센터청주시등록청주오창과학일반산업단지오창과학산업단지관리공단지방산업단지[완료신고]8342.458397.492557.325840.17충청북도 청주시 청원구 오창읍 양청송대길 10충청북도 청주시 청원구 오창읍 양청리 810-13번지임대건축완료도시지역주거지역공공
1501충청북도청주시 흥덕구HS비즈타워지식산업센터한세이프(주)등록청주일반산업단지청주산업단지관리공단지방산업단지[완료신고]공장용지1659.09982.2862902.587079.706충청북도 청주시 흥덕구 직지대로 442 (송정동)충청북도 청주시 흥덕구 송정동 70-60번지분양건축완료도시지역공업지역민간
1502충청북도청주시 흥덕구세중테크노밸리지식산업센터(주)세중등록청주일반산업단지청주산업단지관리공단지방산업단지[변경완료신고]공장용지, 공장용지6992.039896.5913758.5826138.01충청북도 청주시 흥덕구 공단로 134 (송정동, 세중테크노밸리) 외 1필지충청북도 청주시흥덕구 송정동 279-5 번지 외 1필지분양/임대건축완료도시지역공업지역민간
1503충청북도청주시 흥덕구직지스타지식산업센터(주)하나자산신탁등록청주일반산업단지청주산업단지관리공단지방산업단지[변경완료신고]공장용지, 공장용지, 공장용지10655.062534.0113229.5149304.5충청북도 청주시 흥덕구 직지대로436번길 76 (송정동) 외 2필지충청북도 청주시 흥덕구 송정동 70-73 외 2필지분양/임대건축완료도시지역공업지역민간
1504충청북도청주시 흥덕구청주테크노S타워지식산업센터도시개발(주)등록청주일반산업단지청주산업단지관리공단지방산업단지[변경완료신고]공장용지9037.026129.389271.60916857.771충청북도 청주시 흥덕구 직지대로 530충청북도 청주시 흥덕구 송정동 407번지분양건축완료도시지역공업지역민간
1505충청북도청주시 흥덕구티원타워지식산업센터(주)삼다산업개발등록청주일반산업단지청주산업단지관리공단지방산업단지[변경완료신고]공장용지, 공장용지, 공장용지7316.040763.3329437.7231325.612충청북도 청주시 흥덕구 봉명로 31 (복대동) 외 2필지충청북도 청주시 흥덕구 복대동 100-5번지 외 2필지분양/임대건축완료도시지역공업지역민간
1506충청북도충주시충주 지식산업센터지식산업센터충주시청승인<NA>충청북도 충주시개별[신설]승인<NA>7328.19654.74388.425266.28<NA>충청북도 충주시 주덕읍 화곡리 1256-1임대미착공도시지역공업지역공공