Overview

Dataset statistics

Number of variables13
Number of observations2677
Missing cells1559
Missing cells (%)4.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory277.2 KiB
Average record size in memory106.0 B

Variable types

Numeric2
Categorical2
Text9

Dataset

Description남양주시 제조업을 가지고 있는 공장 등록에 대한 데이터로 회사명, 전화번호, 팩스번호, 생산품, 주소(도로명,지번) 등의 항목을 제공합니다.
Author경기도 남양주시
URLhttps://www.data.go.kr/data/15005268/fileData.do

Alerts

단지명 is highly overall correlated with 설립구분High correlation
설립구분 is highly overall correlated with 단지명High correlation
단지명 is highly imbalanced (83.8%)Imbalance
설립구분 is highly imbalanced (51.3%)Imbalance
전화번호 has 356 (13.3%) missing valuesMissing
팩스번호 has 465 (17.4%) missing valuesMissing
주원자재 has 722 (27.0%) missing valuesMissing
순번 has unique valuesUnique

Reproduction

Analysis started2024-03-14 11:07:29.369067
Analysis finished2024-03-14 11:07:33.944106
Duration4.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct2677
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1339
Minimum1
Maximum2677
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.7 KiB
2024-03-14T20:07:34.165105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile134.8
Q1670
median1339
Q32008
95-th percentile2543.2
Maximum2677
Range2676
Interquartile range (IQR)1338

Descriptive statistics

Standard deviation772.92766
Coefficient of variation (CV)0.57724246
Kurtosis-1.2
Mean1339
Median Absolute Deviation (MAD)669
Skewness0
Sum3584503
Variance597417.17
MonotonicityStrictly increasing
2024-03-14T20:07:34.605125image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1780 1
 
< 0.1%
1782 1
 
< 0.1%
1783 1
 
< 0.1%
1784 1
 
< 0.1%
1785 1
 
< 0.1%
1786 1
 
< 0.1%
1787 1
 
< 0.1%
1788 1
 
< 0.1%
1789 1
 
< 0.1%
Other values (2667) 2667
99.6%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2677 1
< 0.1%
2676 1
< 0.1%
2675 1
< 0.1%
2674 1
< 0.1%
2673 1
< 0.1%
2672 1
< 0.1%
2671 1
< 0.1%
2670 1
< 0.1%
2669 1
< 0.1%
2668 1
< 0.1%

단지명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
<NA>
2560 
남양주진관일반산업단지
 
61
남양주금곡일반산업단지
 
28
남양주광릉테크노밸리일반산업단지
 
28

Length

Max length16
Median length4
Mean length4.3582368
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 2560
95.6%
남양주진관일반산업단지 61
 
2.3%
남양주금곡일반산업단지 28
 
1.0%
남양주광릉테크노밸리일반산업단지 28
 
1.0%

Length

2024-03-14T20:07:35.035448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:07:35.369389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 2560
95.6%
남양주진관일반산업단지 61
 
2.3%
남양주금곡일반산업단지 28
 
1.0%
남양주광릉테크노밸리일반산업단지 28
 
1.0%
Distinct2602
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
2024-03-14T20:07:36.407922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length24
Mean length7.0579006
Min length2

Characters and Unicode

Total characters18894
Distinct characters655
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2534 ?
Unique (%)94.7%

Sample

1st row㈜티에스메탈
2nd row교문chemical
3rd row(사)보성 장애인엘이디사업단
4th row(사)아름다움나눔협회사업단
5th row(사)장애인녹색일자리사랑회 디지털사업단
ValueCountFrequency (%)
주식회사 472
 
14.4%
제2공장 11
 
0.3%
농업회사법인 7
 
0.2%
디자인 5
 
0.2%
제1공장 5
 
0.2%
제3공장 4
 
0.1%
유한회사 4
 
0.1%
주)해광 4
 
0.1%
펌프킨 3
 
0.1%
선메딕스 3
 
0.1%
Other values (2657) 2765
84.2%
2024-03-14T20:07:37.648912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1456
 
7.7%
( 956
 
5.1%
) 956
 
5.1%
657
 
3.5%
610
 
3.2%
562
 
3.0%
551
 
2.9%
520
 
2.8%
473
 
2.5%
362
 
1.9%
Other values (645) 11791
62.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15836
83.8%
Open Punctuation 958
 
5.1%
Close Punctuation 958
 
5.1%
Space Separator 610
 
3.2%
Uppercase Letter 345
 
1.8%
Lowercase Letter 69
 
0.4%
Decimal Number 54
 
0.3%
Other Punctuation 39
 
0.2%
Other Symbol 19
 
0.1%
Dash Punctuation 3
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1456
 
9.2%
657
 
4.1%
562
 
3.5%
551
 
3.5%
520
 
3.3%
473
 
3.0%
362
 
2.3%
293
 
1.9%
269
 
1.7%
243
 
1.5%
Other values (578) 10450
66.0%
Uppercase Letter
ValueCountFrequency (%)
E 39
 
11.3%
N 29
 
8.4%
S 27
 
7.8%
C 26
 
7.5%
A 24
 
7.0%
O 21
 
6.1%
G 20
 
5.8%
T 19
 
5.5%
K 15
 
4.3%
R 15
 
4.3%
Other values (14) 110
31.9%
Lowercase Letter
ValueCountFrequency (%)
a 8
11.6%
n 7
 
10.1%
e 6
 
8.7%
m 6
 
8.7%
o 5
 
7.2%
i 5
 
7.2%
t 4
 
5.8%
l 4
 
5.8%
r 3
 
4.3%
y 3
 
4.3%
Other values (11) 18
26.1%
Decimal Number
ValueCountFrequency (%)
2 29
53.7%
1 14
25.9%
3 5
 
9.3%
0 2
 
3.7%
4 2
 
3.7%
5 1
 
1.9%
6 1
 
1.9%
Other Punctuation
ValueCountFrequency (%)
. 22
56.4%
& 12
30.8%
, 2
 
5.1%
; 2
 
5.1%
/ 1
 
2.6%
Open Punctuation
ValueCountFrequency (%)
( 956
99.8%
[ 2
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 956
99.8%
] 2
 
0.2%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
610
100.0%
Other Symbol
ValueCountFrequency (%)
19
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15855
83.9%
Common 2625
 
13.9%
Latin 414
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1456
 
9.2%
657
 
4.1%
562
 
3.5%
551
 
3.5%
520
 
3.3%
473
 
3.0%
362
 
2.3%
293
 
1.8%
269
 
1.7%
243
 
1.5%
Other values (579) 10469
66.0%
Latin
ValueCountFrequency (%)
E 39
 
9.4%
N 29
 
7.0%
S 27
 
6.5%
C 26
 
6.3%
A 24
 
5.8%
O 21
 
5.1%
G 20
 
4.8%
T 19
 
4.6%
K 15
 
3.6%
R 15
 
3.6%
Other values (35) 179
43.2%
Common
ValueCountFrequency (%)
( 956
36.4%
) 956
36.4%
610
23.2%
2 29
 
1.1%
. 22
 
0.8%
1 14
 
0.5%
& 12
 
0.5%
3 5
 
0.2%
- 3
 
0.1%
0 2
 
0.1%
Other values (11) 16
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15836
83.8%
ASCII 3039
 
16.1%
None 19
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1456
 
9.2%
657
 
4.1%
562
 
3.5%
551
 
3.5%
520
 
3.3%
473
 
3.0%
362
 
2.3%
293
 
1.9%
269
 
1.7%
243
 
1.5%
Other values (578) 10450
66.0%
ASCII
ValueCountFrequency (%)
( 956
31.5%
) 956
31.5%
610
20.1%
E 39
 
1.3%
N 29
 
1.0%
2 29
 
1.0%
S 27
 
0.9%
C 26
 
0.9%
A 24
 
0.8%
. 22
 
0.7%
Other values (56) 321
 
10.6%
None
ValueCountFrequency (%)
19
100.0%

설립구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
일반
2247 
창업
313 
일반산업단지
 
117

Length

Max length6
Median length2
Mean length2.1748226
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 2247
83.9%
창업 313
 
11.7%
일반산업단지 117
 
4.4%

Length

2024-03-14T20:07:37.907110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T20:07:38.094927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 2247
83.9%
창업 313
 
11.7%
일반산업단지 117
 
4.4%

전화번호
Text

MISSING 

Distinct2186
Distinct (%)94.2%
Missing356
Missing (%)13.3%
Memory size21.0 KiB
2024-03-14T20:07:38.906400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.977596
Min length9

Characters and Unicode

Total characters27800
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2066 ?
Unique (%)89.0%

Sample

1st row031-566-5121
2nd row031-511-1612
3rd row070-7765-7571
4th row031-575-6292
5th row031-575-6292
ValueCountFrequency (%)
031-592-4161 5
 
0.2%
031-544-0078 3
 
0.1%
031-593-5611 3
 
0.1%
031-572-4197 3
 
0.1%
031-593-6020 3
 
0.1%
031-593-2926 3
 
0.1%
031-595-9866 3
 
0.1%
031-574-7417 3
 
0.1%
02-458-3411 3
 
0.1%
031-592-6960 3
 
0.1%
Other values (2176) 2289
98.6%
2024-03-14T20:07:40.171551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 4628
16.6%
0 3728
13.4%
1 3584
12.9%
5 3362
12.1%
3 3266
11.7%
7 2019
7.3%
2 1945
7.0%
9 1661
 
6.0%
4 1329
 
4.8%
8 1154
 
4.2%
Other values (4) 1124
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 23163
83.3%
Dash Punctuation 4628
 
16.6%
Uppercase Letter 9
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3728
16.1%
1 3584
15.5%
5 3362
14.5%
3 3266
14.1%
7 2019
8.7%
2 1945
8.4%
9 1661
7.2%
4 1329
 
5.7%
8 1154
 
5.0%
6 1115
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
A 3
33.3%
R 3
33.3%
S 3
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 4628
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 27791
> 99.9%
Latin 9
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 4628
16.7%
0 3728
13.4%
1 3584
12.9%
5 3362
12.1%
3 3266
11.8%
7 2019
7.3%
2 1945
7.0%
9 1661
 
6.0%
4 1329
 
4.8%
8 1154
 
4.2%
Latin
ValueCountFrequency (%)
A 3
33.3%
R 3
33.3%
S 3
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 27800
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 4628
16.6%
0 3728
13.4%
1 3584
12.9%
5 3362
12.1%
3 3266
11.7%
7 2019
7.3%
2 1945
7.0%
9 1661
 
6.0%
4 1329
 
4.8%
8 1154
 
4.2%
Other values (4) 1124
 
4.0%

팩스번호
Text

MISSING 

Distinct2066
Distinct (%)93.4%
Missing465
Missing (%)17.4%
Memory size21.0 KiB
2024-03-14T20:07:41.093438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.983273
Min length11

Characters and Unicode

Total characters26507
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1937 ?
Unique (%)87.6%

Sample

1st row031-593-6803
2nd row031-595-5128
3rd row031-511-2832
4th row031-574-5464
5th row031-595-0244
ValueCountFrequency (%)
031-592-4169 5
 
0.2%
031-573-7624 4
 
0.2%
031-595-9877 3
 
0.1%
031-574-1599 3
 
0.1%
031-528-4199 3
 
0.1%
031-527-6582 3
 
0.1%
031-593-5651 3
 
0.1%
031-575-6275 3
 
0.1%
031-573-5535 3
 
0.1%
031-592-0953 3
 
0.1%
Other values (2056) 2179
98.5%
2024-03-14T20:07:42.236109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 4424
16.7%
1 3317
12.5%
0 3304
12.5%
5 3216
12.1%
3 3184
12.0%
2 1898
7.2%
7 1831
6.9%
9 1730
 
6.5%
4 1335
 
5.0%
8 1148
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 22083
83.3%
Dash Punctuation 4424
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 3317
15.0%
0 3304
15.0%
5 3216
14.6%
3 3184
14.4%
2 1898
8.6%
7 1831
8.3%
9 1730
7.8%
4 1335
6.0%
8 1148
 
5.2%
6 1120
 
5.1%
Dash Punctuation
ValueCountFrequency (%)
- 4424
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 26507
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 4424
16.7%
1 3317
12.5%
0 3304
12.5%
5 3216
12.1%
3 3184
12.0%
2 1898
7.2%
7 1831
6.9%
9 1730
 
6.5%
4 1335
 
5.0%
8 1148
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 26507
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 4424
16.7%
1 3317
12.5%
0 3304
12.5%
5 3216
12.1%
3 3184
12.0%
2 1898
7.2%
7 1831
6.9%
9 1730
 
6.5%
4 1335
 
5.0%
8 1148
 
4.3%
Distinct2430
Distinct (%)90.9%
Missing4
Missing (%)0.1%
Memory size21.0 KiB
2024-03-14T20:07:43.363388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length134
Median length59
Mean length11.023569
Min length1

Characters and Unicode

Total characters29466
Distinct characters749
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks7 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2301 ?
Unique (%)86.1%

Sample

1st row각종철강조립품
2nd row빨래건조대 부품
3rd rowLED조명기구
4th row가로등주, 가로등기구, LED등기구
5th rowLED조명, LED전광판, 광고안내판
ValueCountFrequency (%)
184
 
3.4%
132
 
2.4%
44
 
0.8%
가구 34
 
0.6%
플라스틱 31
 
0.6%
배전반 31
 
0.6%
간판 31
 
0.6%
마스크 31
 
0.6%
의류 21
 
0.4%
케이스 20
 
0.4%
Other values (3347) 4830
89.6%
2024-03-14T20:07:44.785947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2736
 
9.3%
, 2145
 
7.3%
1104
 
3.7%
590
 
2.0%
525
 
1.8%
461
 
1.6%
445
 
1.5%
436
 
1.5%
423
 
1.4%
405
 
1.4%
Other values (739) 20196
68.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 23106
78.4%
Space Separator 2736
 
9.3%
Other Punctuation 2237
 
7.6%
Uppercase Letter 722
 
2.5%
Open Punctuation 236
 
0.8%
Close Punctuation 233
 
0.8%
Lowercase Letter 147
 
0.5%
Decimal Number 36
 
0.1%
Dash Punctuation 9
 
< 0.1%
Letter Number 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1104
 
4.8%
590
 
2.6%
525
 
2.3%
461
 
2.0%
445
 
1.9%
436
 
1.9%
423
 
1.8%
405
 
1.8%
395
 
1.7%
387
 
1.7%
Other values (672) 17935
77.6%
Lowercase Letter
ValueCountFrequency (%)
t 15
 
10.2%
e 15
 
10.2%
l 12
 
8.2%
c 12
 
8.2%
a 11
 
7.5%
s 9
 
6.1%
o 9
 
6.1%
n 8
 
5.4%
i 8
 
5.4%
p 7
 
4.8%
Other values (13) 41
27.9%
Uppercase Letter
ValueCountFrequency (%)
C 97
13.4%
E 87
12.0%
L 85
11.8%
P 84
11.6%
D 80
11.1%
V 51
7.1%
T 42
 
5.8%
A 33
 
4.6%
S 29
 
4.0%
R 24
 
3.3%
Other values (12) 110
15.2%
Decimal Number
ValueCountFrequency (%)
0 6
16.7%
3 5
13.9%
1 5
13.9%
2 4
11.1%
6 4
11.1%
5 4
11.1%
7 3
8.3%
4 2
 
5.6%
9 2
 
5.6%
8 1
 
2.8%
Other Punctuation
ValueCountFrequency (%)
, 2145
95.9%
. 79
 
3.5%
/ 10
 
0.4%
· 3
 
0.1%
Letter Number
ValueCountFrequency (%)
1
50.0%
1
50.0%
Space Separator
ValueCountFrequency (%)
2736
100.0%
Open Punctuation
ValueCountFrequency (%)
( 236
100.0%
Close Punctuation
ValueCountFrequency (%)
) 233
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Modifier Symbol
ValueCountFrequency (%)
˙ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 23105
78.4%
Common 5489
 
18.6%
Latin 871
 
3.0%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1104
 
4.8%
590
 
2.6%
525
 
2.3%
461
 
2.0%
445
 
1.9%
436
 
1.9%
423
 
1.8%
405
 
1.8%
395
 
1.7%
387
 
1.7%
Other values (671) 17934
77.6%
Latin
ValueCountFrequency (%)
C 97
 
11.1%
E 87
 
10.0%
L 85
 
9.8%
P 84
 
9.6%
D 80
 
9.2%
V 51
 
5.9%
T 42
 
4.8%
A 33
 
3.8%
S 29
 
3.3%
R 24
 
2.8%
Other values (37) 259
29.7%
Common
ValueCountFrequency (%)
2736
49.8%
, 2145
39.1%
( 236
 
4.3%
) 233
 
4.2%
. 79
 
1.4%
/ 10
 
0.2%
- 9
 
0.2%
0 6
 
0.1%
3 5
 
0.1%
1 5
 
0.1%
Other values (10) 25
 
0.5%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 23103
78.4%
ASCII 6354
 
21.6%
None 3
 
< 0.1%
Number Forms 2
 
< 0.1%
Compat Jamo 2
 
< 0.1%
CJK 1
 
< 0.1%
Modifier Letters 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2736
43.1%
, 2145
33.8%
( 236
 
3.7%
) 233
 
3.7%
C 97
 
1.5%
E 87
 
1.4%
L 85
 
1.3%
P 84
 
1.3%
D 80
 
1.3%
. 79
 
1.2%
Other values (53) 492
 
7.7%
Hangul
ValueCountFrequency (%)
1104
 
4.8%
590
 
2.6%
525
 
2.3%
461
 
2.0%
445
 
1.9%
436
 
1.9%
423
 
1.8%
405
 
1.8%
395
 
1.7%
387
 
1.7%
Other values (669) 17932
77.6%
None
ValueCountFrequency (%)
· 3
100.0%
Number Forms
ValueCountFrequency (%)
1
50.0%
1
50.0%
CJK
ValueCountFrequency (%)
1
100.0%
Modifier Letters
ValueCountFrequency (%)
˙ 1
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%
Distinct2533
Distinct (%)95.0%
Missing12
Missing (%)0.4%
Memory size21.0 KiB
2024-03-14T20:07:45.926072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length54
Mean length28.77561
Min length16

Characters and Unicode

Total characters76687
Distinct characters389
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2425 ?
Unique (%)91.0%

Sample

1st row경기도 남양주시 화도읍 비룡로 244-48
2nd row경기도 남양주시 화도읍 녹촌로135번길 11-37
3rd row경기도 남양주시 화도읍 수레로 1027-1, 3층
4th row경기도 남양주시 오남읍 진건오남로708번길 39-21, 가,나동 1층 (오남읍)
5th row경기도 남양주시 별내면 청학로중앙길 41, 1층
ValueCountFrequency (%)
경기도 2665
 
16.6%
남양주시 2665
 
16.6%
진접읍 777
 
4.9%
화도읍 638
 
4.0%
수동면 346
 
2.2%
진건읍 324
 
2.0%
320
 
2.0%
오남읍 263
 
1.6%
1필지 191
 
1.2%
와부읍 147
 
0.9%
Other values (2336) 7676
47.9%
2024-03-14T20:07:47.634657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13347
 
17.4%
3313
 
4.3%
1 3296
 
4.3%
3140
 
4.1%
3086
 
4.0%
2965
 
3.9%
2900
 
3.8%
2805
 
3.7%
2675
 
3.5%
2 2522
 
3.3%
Other values (379) 36638
47.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 44965
58.6%
Decimal Number 14603
 
19.0%
Space Separator 13347
 
17.4%
Dash Punctuation 1338
 
1.7%
Other Punctuation 963
 
1.3%
Open Punctuation 647
 
0.8%
Close Punctuation 647
 
0.8%
Uppercase Letter 158
 
0.2%
Math Symbol 8
 
< 0.1%
Lowercase Letter 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3313
 
7.4%
3140
 
7.0%
3086
 
6.9%
2965
 
6.6%
2900
 
6.4%
2805
 
6.2%
2675
 
5.9%
2502
 
5.6%
2151
 
4.8%
1649
 
3.7%
Other values (332) 17779
39.5%
Uppercase Letter
ValueCountFrequency (%)
A 35
22.2%
B 33
20.9%
F 16
10.1%
S 13
 
8.2%
M 10
 
6.3%
N 8
 
5.1%
C 7
 
4.4%
J 6
 
3.8%
O 6
 
3.8%
E 5
 
3.2%
Other values (8) 19
12.0%
Decimal Number
ValueCountFrequency (%)
1 3296
22.6%
2 2522
17.3%
3 1449
9.9%
4 1339
9.2%
5 1053
 
7.2%
7 1035
 
7.1%
9 1033
 
7.1%
0 1028
 
7.0%
6 989
 
6.8%
8 859
 
5.9%
Lowercase Letter
ValueCountFrequency (%)
a 2
25.0%
f 1
12.5%
m 1
12.5%
p 1
12.5%
b 1
12.5%
s 1
12.5%
j 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 953
99.0%
/ 4
 
0.4%
. 2
 
0.2%
& 2
 
0.2%
\ 1
 
0.1%
; 1
 
0.1%
Space Separator
ValueCountFrequency (%)
13347
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1338
100.0%
Open Punctuation
ValueCountFrequency (%)
( 647
100.0%
Close Punctuation
ValueCountFrequency (%)
) 647
100.0%
Math Symbol
ValueCountFrequency (%)
~ 8
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 44968
58.6%
Common 31553
41.1%
Latin 166
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3313
 
7.4%
3140
 
7.0%
3086
 
6.9%
2965
 
6.6%
2900
 
6.4%
2805
 
6.2%
2675
 
5.9%
2502
 
5.6%
2151
 
4.8%
1649
 
3.7%
Other values (333) 17782
39.5%
Latin
ValueCountFrequency (%)
A 35
21.1%
B 33
19.9%
F 16
9.6%
S 13
 
7.8%
M 10
 
6.0%
N 8
 
4.8%
C 7
 
4.2%
J 6
 
3.6%
O 6
 
3.6%
E 5
 
3.0%
Other values (15) 27
16.3%
Common
ValueCountFrequency (%)
13347
42.3%
1 3296
 
10.4%
2 2522
 
8.0%
3 1449
 
4.6%
4 1339
 
4.2%
- 1338
 
4.2%
5 1053
 
3.3%
7 1035
 
3.3%
9 1033
 
3.3%
0 1028
 
3.3%
Other values (11) 4113
 
13.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 44965
58.6%
ASCII 31719
41.4%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13347
42.1%
1 3296
 
10.4%
2 2522
 
8.0%
3 1449
 
4.6%
4 1339
 
4.2%
- 1338
 
4.2%
5 1053
 
3.3%
7 1035
 
3.3%
9 1033
 
3.3%
0 1028
 
3.2%
Other values (36) 4279
 
13.5%
Hangul
ValueCountFrequency (%)
3313
 
7.4%
3140
 
7.0%
3086
 
6.9%
2965
 
6.6%
2900
 
6.4%
2805
 
6.2%
2675
 
5.9%
2502
 
5.6%
2151
 
4.8%
1649
 
3.7%
Other values (332) 17779
39.5%
None
ValueCountFrequency (%)
3
100.0%
Distinct2511
Distinct (%)93.8%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
2024-03-14T20:07:48.861626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length52
Mean length25.183041
Min length17

Characters and Unicode

Total characters67415
Distinct characters268
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2389 ?
Unique (%)89.2%

Sample

1st row경기도 남양주시 화도읍 가곡리 191-8번지
2nd row경기도 남양주시 화도읍 녹촌리 150-1번지
3rd row경기도 남양주시 화도읍 차산리 114-4번지 3층
4th row경기도 남양주시 오남읍 오남리 108-21번지 가,나동 1층
5th row경기도 남양주시 별내면 청학리 460-2번지 1층
ValueCountFrequency (%)
남양주시 2678
18.3%
경기도 2677
18.3%
진접읍 774
 
5.3%
화도읍 635
 
4.3%
수동면 343
 
2.3%
진건읍 318
 
2.2%
오남읍 263
 
1.8%
진벌리 195
 
1.3%
송천리 184
 
1.3%
차산리 168
 
1.2%
Other values (2552) 6364
43.6%
2024-03-14T20:07:50.348921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12020
 
17.8%
3365
 
5.0%
3100
 
4.6%
2979
 
4.4%
2824
 
4.2%
2695
 
4.0%
2688
 
4.0%
2682
 
4.0%
2519
 
3.7%
1 2437
 
3.6%
Other values (258) 30106
44.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 41312
61.3%
Space Separator 12020
 
17.8%
Decimal Number 11553
 
17.1%
Dash Punctuation 2136
 
3.2%
Other Punctuation 136
 
0.2%
Uppercase Letter 97
 
0.1%
Close Punctuation 73
 
0.1%
Open Punctuation 73
 
0.1%
Lowercase Letter 8
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3365
 
8.1%
3100
 
7.5%
2979
 
7.2%
2824
 
6.8%
2695
 
6.5%
2688
 
6.5%
2682
 
6.5%
2519
 
6.1%
2290
 
5.5%
2138
 
5.2%
Other values (216) 14032
34.0%
Uppercase Letter
ValueCountFrequency (%)
A 23
23.7%
B 23
23.7%
M 11
11.3%
C 7
 
7.2%
F 7
 
7.2%
S 7
 
7.2%
J 6
 
6.2%
N 5
 
5.2%
D 3
 
3.1%
E 2
 
2.1%
Other values (3) 3
 
3.1%
Decimal Number
ValueCountFrequency (%)
1 2437
21.1%
2 1530
13.2%
3 1238
10.7%
4 1211
10.5%
5 1128
9.8%
8 870
 
7.5%
6 865
 
7.5%
0 778
 
6.7%
7 773
 
6.7%
9 723
 
6.3%
Lowercase Letter
ValueCountFrequency (%)
a 2
25.0%
s 1
12.5%
j 1
12.5%
p 1
12.5%
b 1
12.5%
m 1
12.5%
f 1
12.5%
Other Punctuation
ValueCountFrequency (%)
, 127
93.4%
/ 4
 
2.9%
. 2
 
1.5%
\ 1
 
0.7%
; 1
 
0.7%
& 1
 
0.7%
Space Separator
ValueCountFrequency (%)
12020
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2136
100.0%
Close Punctuation
ValueCountFrequency (%)
) 73
100.0%
Open Punctuation
ValueCountFrequency (%)
( 73
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 41314
61.3%
Common 25996
38.6%
Latin 105
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3365
 
8.1%
3100
 
7.5%
2979
 
7.2%
2824
 
6.8%
2695
 
6.5%
2688
 
6.5%
2682
 
6.5%
2519
 
6.1%
2290
 
5.5%
2138
 
5.2%
Other values (217) 14034
34.0%
Common
ValueCountFrequency (%)
12020
46.2%
1 2437
 
9.4%
- 2136
 
8.2%
2 1530
 
5.9%
3 1238
 
4.8%
4 1211
 
4.7%
5 1128
 
4.3%
8 870
 
3.3%
6 865
 
3.3%
0 778
 
3.0%
Other values (11) 1783
 
6.9%
Latin
ValueCountFrequency (%)
A 23
21.9%
B 23
21.9%
M 11
10.5%
C 7
 
6.7%
F 7
 
6.7%
S 7
 
6.7%
J 6
 
5.7%
N 5
 
4.8%
D 3
 
2.9%
E 2
 
1.9%
Other values (10) 11
10.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 41312
61.3%
ASCII 26101
38.7%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12020
46.1%
1 2437
 
9.3%
- 2136
 
8.2%
2 1530
 
5.9%
3 1238
 
4.7%
4 1211
 
4.6%
5 1128
 
4.3%
8 870
 
3.3%
6 865
 
3.3%
0 778
 
3.0%
Other values (31) 1888
 
7.2%
Hangul
ValueCountFrequency (%)
3365
 
8.1%
3100
 
7.5%
2979
 
7.2%
2824
 
6.8%
2695
 
6.5%
2688
 
6.5%
2682
 
6.5%
2519
 
6.1%
2290
 
5.5%
2138
 
5.2%
Other values (216) 14032
34.0%
None
ValueCountFrequency (%)
2
100.0%

대표업종번호
Real number (ℝ)

Distinct334
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean24331.678
Minimum10121
Maximum38321
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size23.7 KiB
2024-03-14T20:07:50.665196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10121
5-th percentile10749
Q120499
median25932
Q329172
95-th percentile33303
Maximum38321
Range28200
Interquartile range (IQR)8673

Descriptive statistics

Standard deviation6765.1873
Coefficient of variation (CV)0.27804031
Kurtosis-0.55416678
Mean24331.678
Median Absolute Deviation (MAD)3633
Skewness-0.67481356
Sum65135903
Variance45767759
MonotonicityNot monotonic
2024-03-14T20:07:51.086551image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
32029 159
 
5.9%
25932 96
 
3.6%
22299 76
 
2.8%
25112 64
 
2.4%
28123 61
 
2.3%
25111 57
 
2.1%
28422 49
 
1.8%
32091 49
 
1.8%
25113 46
 
1.7%
33910 46
 
1.7%
Other values (324) 1974
73.7%
ValueCountFrequency (%)
10121 10
0.4%
10122 5
 
0.2%
10129 14
0.5%
10211 2
 
0.1%
10219 4
 
0.1%
10220 4
 
0.1%
10301 7
0.3%
10302 1
 
< 0.1%
10309 9
0.3%
10402 1
 
< 0.1%
ValueCountFrequency (%)
38321 5
 
0.2%
36129 1
 
< 0.1%
34019 1
 
< 0.1%
33999 24
0.9%
33993 9
 
0.3%
33992 2
 
0.1%
33991 10
 
0.4%
33932 14
 
0.5%
33920 2
 
0.1%
33910 46
1.7%
Distinct1044
Distinct (%)39.0%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
2024-03-14T20:07:52.637586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length250
Median length5
Mean length13.615988
Min length5

Characters and Unicode

Total characters36450
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique724 ?
Unique (%)27.0%

Sample

1st row25112
2nd row22299
3rd row28410, 28422
4th row25112, 25113, 25114, 25119, 28422, 28423, 28429, 28903
5th row25994, 26421, 26429, 28112, 28113, 28123, 28410, 28422, 28423, 28429, 28903, 31202, 33910, 33932
ValueCountFrequency (%)
32029 201
 
3.4%
25932 141
 
2.4%
25112 140
 
2.3%
25113 122
 
2.0%
25999 113
 
1.9%
22299 111
 
1.9%
32091 103
 
1.7%
28123 103
 
1.7%
28422 92
 
1.5%
25114 84
 
1.4%
Other values (398) 4762
79.7%
2024-03-14T20:07:54.592517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 8564
23.5%
1 6220
17.1%
9 4840
13.3%
, 3295
 
9.0%
3295
 
9.0%
3 2999
 
8.2%
0 1914
 
5.3%
5 1717
 
4.7%
4 1608
 
4.4%
8 793
 
2.2%
Other values (2) 1205
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 29860
81.9%
Other Punctuation 3295
 
9.0%
Space Separator 3295
 
9.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 8564
28.7%
1 6220
20.8%
9 4840
16.2%
3 2999
 
10.0%
0 1914
 
6.4%
5 1717
 
5.8%
4 1608
 
5.4%
8 793
 
2.7%
7 633
 
2.1%
6 572
 
1.9%
Other Punctuation
ValueCountFrequency (%)
, 3295
100.0%
Space Separator
ValueCountFrequency (%)
3295
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 36450
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 8564
23.5%
1 6220
17.1%
9 4840
13.3%
, 3295
 
9.0%
3295
 
9.0%
3 2999
 
8.2%
0 1914
 
5.3%
5 1717
 
4.7%
4 1608
 
4.4%
8 793
 
2.2%
Other values (2) 1205
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 36450
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 8564
23.5%
1 6220
17.1%
9 4840
13.3%
, 3295
 
9.0%
3295
 
9.0%
3 2999
 
8.2%
0 1914
 
5.3%
5 1717
 
4.7%
4 1608
 
4.4%
8 793
 
2.2%
Other values (2) 1205
 
3.3%
Distinct799
Distinct (%)29.8%
Missing0
Missing (%)0.0%
Memory size21.0 KiB
2024-03-14T20:07:55.809583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length26
Mean length17.16399
Min length3

Characters and Unicode

Total characters45948
Distinct characters332
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique421 ?
Unique (%)15.7%

Sample

1st row구조용 금속 판제품 및 공작물 제조업
2nd row그 외 기타 플라스틱 제품 제조업
3rd row일반용 전기 조명장치 제조업 외 1 종
4th row구조용 금속 판제품 및 공작물 제조업 외 7 종
5th row전시 및 광고용 조명장치 제조업 외 13 종
ValueCountFrequency (%)
제조업 2509
 
16.7%
1435
 
9.6%
1105
 
7.4%
1090
 
7.3%
기타 909
 
6.1%
1 486
 
3.2%
금속 351
 
2.3%
345
 
2.3%
2 233
 
1.6%
플라스틱 189
 
1.3%
Other values (596) 6339
42.3%
2024-03-14T20:07:57.219022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12319
26.8%
3328
 
7.2%
2898
 
6.3%
2763
 
6.0%
1872
 
4.1%
1459
 
3.2%
1139
 
2.5%
1105
 
2.4%
924
 
2.0%
841
 
1.8%
Other values (322) 17300
37.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32113
69.9%
Space Separator 12319
 
26.8%
Decimal Number 1156
 
2.5%
Other Punctuation 322
 
0.7%
Close Punctuation 19
 
< 0.1%
Open Punctuation 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3328
 
10.4%
2898
 
9.0%
2763
 
8.6%
1872
 
5.8%
1459
 
4.5%
1139
 
3.5%
1105
 
3.4%
924
 
2.9%
841
 
2.6%
809
 
2.5%
Other values (307) 14975
46.6%
Decimal Number
ValueCountFrequency (%)
1 545
47.1%
2 253
21.9%
3 152
 
13.1%
4 69
 
6.0%
5 37
 
3.2%
6 31
 
2.7%
7 29
 
2.5%
9 16
 
1.4%
0 13
 
1.1%
8 11
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 317
98.4%
. 5
 
1.6%
Space Separator
ValueCountFrequency (%)
12319
100.0%
Close Punctuation
ValueCountFrequency (%)
) 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32113
69.9%
Common 13835
30.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3328
 
10.4%
2898
 
9.0%
2763
 
8.6%
1872
 
5.8%
1459
 
4.5%
1139
 
3.5%
1105
 
3.4%
924
 
2.9%
841
 
2.6%
809
 
2.5%
Other values (307) 14975
46.6%
Common
ValueCountFrequency (%)
12319
89.0%
1 545
 
3.9%
, 317
 
2.3%
2 253
 
1.8%
3 152
 
1.1%
4 69
 
0.5%
5 37
 
0.3%
6 31
 
0.2%
7 29
 
0.2%
) 19
 
0.1%
Other values (5) 64
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32095
69.9%
ASCII 13835
30.1%
Compat Jamo 18
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12319
89.0%
1 545
 
3.9%
, 317
 
2.3%
2 253
 
1.8%
3 152
 
1.1%
4 69
 
0.5%
5 37
 
0.3%
6 31
 
0.2%
7 29
 
0.2%
) 19
 
0.1%
Other values (5) 64
 
0.5%
Hangul
ValueCountFrequency (%)
3328
 
10.4%
2898
 
9.0%
2763
 
8.6%
1872
 
5.8%
1459
 
4.5%
1139
 
3.5%
1105
 
3.4%
924
 
2.9%
841
 
2.6%
809
 
2.5%
Other values (306) 14957
46.6%
Compat Jamo
ValueCountFrequency (%)
18
100.0%

주원자재
Text

MISSING 

Distinct1459
Distinct (%)74.6%
Missing722
Missing (%)27.0%
Memory size21.0 KiB
2024-03-14T20:07:58.913620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length80
Median length46
Mean length9.2378517
Min length1

Characters and Unicode

Total characters18060
Distinct characters579
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1347 ?
Unique (%)68.9%

Sample

1st rowH-형강,L-형강
2nd rowPP,PE
3rd row공대, 컨버터, 커버, PCB모듈
4th row철재배관파이프, 스탠배관파이프, 등기구부속자재
5th rowLED램프, SMPS파워
ValueCountFrequency (%)
182
 
4.5%
목재 139
 
3.4%
철판 115
 
2.9%
알루미늄 101
 
2.5%
71
 
1.8%
원단 68
 
1.7%
플라스틱 67
 
1.7%
스테인레스 50
 
1.2%
금속 48
 
1.2%
40
 
1.0%
Other values (1688) 3149
78.1%
2024-03-14T20:08:01.241565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2096
 
11.6%
, 1889
 
10.5%
629
 
3.5%
423
 
2.3%
378
 
2.1%
P 330
 
1.8%
330
 
1.8%
236
 
1.3%
221
 
1.2%
212
 
1.2%
Other values (569) 11316
62.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11662
64.6%
Space Separator 2096
 
11.6%
Other Punctuation 1987
 
11.0%
Uppercase Letter 1556
 
8.6%
Lowercase Letter 447
 
2.5%
Close Punctuation 102
 
0.6%
Open Punctuation 102
 
0.6%
Decimal Number 88
 
0.5%
Dash Punctuation 20
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
629
 
5.4%
423
 
3.6%
378
 
3.2%
330
 
2.8%
236
 
2.0%
221
 
1.9%
212
 
1.8%
208
 
1.8%
201
 
1.7%
185
 
1.6%
Other values (504) 8639
74.1%
Uppercase Letter
ValueCountFrequency (%)
P 330
21.2%
S 152
9.8%
C 145
9.3%
E 145
9.3%
B 123
 
7.9%
D 103
 
6.6%
L 98
 
6.3%
A 83
 
5.3%
M 71
 
4.6%
T 46
 
3.0%
Other values (13) 260
16.7%
Lowercase Letter
ValueCountFrequency (%)
s 66
14.8%
e 56
12.5%
p 54
12.1%
l 33
 
7.4%
t 31
 
6.9%
c 29
 
6.5%
a 27
 
6.0%
i 26
 
5.8%
u 21
 
4.7%
o 20
 
4.5%
Other values (12) 84
18.8%
Decimal Number
ValueCountFrequency (%)
0 24
27.3%
4 15
17.0%
1 14
15.9%
3 13
14.8%
2 7
 
8.0%
6 5
 
5.7%
5 4
 
4.5%
8 2
 
2.3%
9 2
 
2.3%
7 2
 
2.3%
Other Punctuation
ValueCountFrequency (%)
, 1889
95.1%
. 81
 
4.1%
/ 16
 
0.8%
% 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 101
99.0%
] 1
 
1.0%
Open Punctuation
ValueCountFrequency (%)
( 101
99.0%
[ 1
 
1.0%
Space Separator
ValueCountFrequency (%)
2096
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11662
64.6%
Common 4395
 
24.3%
Latin 2003
 
11.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
629
 
5.4%
423
 
3.6%
378
 
3.2%
330
 
2.8%
236
 
2.0%
221
 
1.9%
212
 
1.8%
208
 
1.8%
201
 
1.7%
185
 
1.6%
Other values (504) 8639
74.1%
Latin
ValueCountFrequency (%)
P 330
16.5%
S 152
 
7.6%
C 145
 
7.2%
E 145
 
7.2%
B 123
 
6.1%
D 103
 
5.1%
L 98
 
4.9%
A 83
 
4.1%
M 71
 
3.5%
s 66
 
3.3%
Other values (35) 687
34.3%
Common
ValueCountFrequency (%)
2096
47.7%
, 1889
43.0%
) 101
 
2.3%
( 101
 
2.3%
. 81
 
1.8%
0 24
 
0.5%
- 20
 
0.5%
/ 16
 
0.4%
4 15
 
0.3%
1 14
 
0.3%
Other values (10) 38
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11662
64.6%
ASCII 6398
35.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2096
32.8%
, 1889
29.5%
P 330
 
5.2%
S 152
 
2.4%
C 145
 
2.3%
E 145
 
2.3%
B 123
 
1.9%
D 103
 
1.6%
) 101
 
1.6%
( 101
 
1.6%
Other values (55) 1213
19.0%
Hangul
ValueCountFrequency (%)
629
 
5.4%
423
 
3.6%
378
 
3.2%
330
 
2.8%
236
 
2.0%
221
 
1.9%
212
 
1.8%
208
 
1.8%
201
 
1.7%
185
 
1.6%
Other values (504) 8639
74.1%

Interactions

2024-03-14T20:07:32.390446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:07:31.878926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:07:32.649287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T20:07:32.132198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T20:08:01.503568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번단지명설립구분대표업종번호
순번1.0000.1290.1850.113
단지명0.1291.000NaN0.647
설립구분0.185NaN1.0000.114
대표업종번호0.1130.6470.1141.000
2024-03-14T20:08:01.760144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단지명설립구분
단지명1.0001.000
설립구분1.0001.000
2024-03-14T20:08:02.001637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번대표업종번호단지명설립구분
순번1.000-0.0060.0700.112
대표업종번호-0.0061.0000.3710.069
단지명0.0700.3711.0001.000
설립구분0.1120.0691.0001.000

Missing values

2024-03-14T20:07:33.032179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T20:07:33.478835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T20:07:33.729472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번단지명회사명설립구분전화번호팩스번호생산품공장대표주소(도로명)공장대표주소(지번)대표업종번호업종번호업종명주원자재
01<NA>㈜티에스메탈일반<NA>031-593-6803각종철강조립품경기도 남양주시 화도읍 비룡로 244-48경기도 남양주시 화도읍 가곡리 191-8번지2511225112구조용 금속 판제품 및 공작물 제조업H-형강,L-형강
12<NA>교문chemical일반031-566-5121031-595-5128빨래건조대 부품경기도 남양주시 화도읍 녹촌로135번길 11-37경기도 남양주시 화도읍 녹촌리 150-1번지2229922299그 외 기타 플라스틱 제품 제조업PP,PE
23<NA>(사)보성 장애인엘이디사업단일반031-511-1612031-511-2832LED조명기구경기도 남양주시 화도읍 수레로 1027-1, 3층경기도 남양주시 화도읍 차산리 114-4번지 3층2842228410, 28422일반용 전기 조명장치 제조업 외 1 종공대, 컨버터, 커버, PCB모듈
34<NA>(사)아름다움나눔협회사업단일반070-7765-7571<NA>가로등주, 가로등기구, LED등기구경기도 남양주시 오남읍 진건오남로708번길 39-21, 가,나동 1층 (오남읍)경기도 남양주시 오남읍 오남리 108-21번지 가,나동 1층2511225112, 25113, 25114, 25119, 28422, 28423, 28429, 28903구조용 금속 판제품 및 공작물 제조업 외 7 종철재배관파이프, 스탠배관파이프, 등기구부속자재
45<NA>(사)장애인녹색일자리사랑회 디지털사업단일반031-575-6292031-574-5464LED조명, LED전광판, 광고안내판경기도 남양주시 별내면 청학로중앙길 41, 1층경기도 남양주시 별내면 청학리 460-2번지 1층2842325994, 26421, 26429, 28112, 28113, 28123, 28410, 28422, 28423, 28429, 28903, 31202, 33910, 33932전시 및 광고용 조명장치 제조업 외 13 종LED램프, SMPS파워
56<NA>(사)장애인녹색일자리사랑회디지털사업단일반031-575-6292<NA>방송장비 cctv경기도 남양주시 두물로27번길 38-10, 102호 (별내동)경기도 남양주시 별내동 2031-3번지 102호2642126410, 26421, 26429, 28123방송장비 제조업 외 3 종<NA>
67<NA>(사)장애인생산품판매지원협회 아름다운 사람들창업031-994-7461031-595-0244실물모형, 조형물<NA>경기도 남양주시 화도읍 금남리 606-10 ,남양주시 화도읍 폭포로477번길 2, 주1동 일부3393233910, 33932전시용 모형 제조업 외 1 종플라스틱
78<NA>(사)한국장애인문화진흥협회일반031-591-5907031-594-5908사무용가구경기도 남양주시 수동면 남가로 1402-16경기도 남양주시 수동면 운수리 251-33번지3202932029, 32091기타 목재가구 제조업 외 1 종합판 철물 엣지
89<NA>(유)보문특수칼라일반031-527-7616031-527-7656옵셋인쇄(BOX 특수인쇄)경기도 남양주시 진건읍 독정로273번길 31-3경기도 남양주시 진건읍 송능리 60-10번지1811918113, 18119기타 인쇄업 외 1 종종이, 잉크
910<NA>(유)보문특수칼라일반031-527-7616031-527-7656종이포장지경기도 남양주시 진건읍 독정로273번길 19-1경기도 남양주시 진건읍 송능리 58-2번지1811918113, 18119기타 인쇄업 외 1 종종이
순번단지명회사명설립구분전화번호팩스번호생산품공장대표주소(도로명)공장대표주소(지번)대표업종번호업종번호업종명주원자재
26672668<NA>효성전자일반031-593-8045031-593-8046스피커내진동판및코일경기도 남양주시 수동면 비룡로586번길 63, 1층경기도 남양주시 수동면 송천리 344-2번지2629926295, 26299그 외 기타 전자부품 제조업 외 1 종코일,원단수지
26682669<NA>효자원식품일반031-592-4940031-592-7577기타식품경기도 남양주시 의안로260번길 46-18 (평내동)경기도 남양주시 평내동 35-1번지1061210612곡물 제분업<NA>
26692670<NA>훈비네식품일반031-594-3772031-594-3788조미김, 김자반경기도 남양주시 화도읍 소래비로85번길 24 (총 3 필지) 외 2필지경기도 남양주시 화도읍 마석우리 20-9번지1022010220수산식물 가공 및 저장 처리업<NA>
26702671<NA>휴먼바이브일반031-575-0521031-575-0529진동운동기, 전기도어록, 온풍기, 전자키경기도 남양주시 진건읍 진건오남로384번길 19경기도 남양주시 진건읍 송능리 140-2번지3330128123, 28512, 33301체조, 육상 및 체력단련용 장비 제조업 외 2 종철재
26712672<NA>휴인테크창업031-574-4605031-574-4609eco 내진,면진시스템, 냉방온방조립식패널시스템경기도 남양주시 진접읍 부마로80번길 71-15 (일호산업)경기도 남양주시 진접읍 부평리 17-4번지2511325112, 25113, 25114육상 금속 골조 구조재 제조업 외 2 종금속판재,철강재,비철금속파이프
26722673<NA>흥안종합상사일반<NA><NA>일반철물자재,쑥뜸기경기도 남양주시 화도읍 수레로964번길 7경기도 남양주시 화도읍 차산리 735번지2593225932일반철물 제조업철판, 동판
26732674<NA>흥일프라콤일반031-591-1850031-591-1851진공성형식품용기,전자 및 산업용트레이경기도 남양주시 화도읍 폭포로 362경기도 남양주시 화도읍 창현리 281-7번지2223122231플라스틱 포대, 봉투 및 유사제품 제조업<NA>
26742675<NA>흥진산업일반<NA><NA>가구경기도 남양주시 와부읍 수레로 683 (애니페이퍼)경기도 남양주시 와부읍 월문리 115-2번지3202932029기타 목재가구 제조업<NA>
26752676<NA>희성정밀기계일반031-593-0043031-593-0044볼트제작기계경기도 남양주시 화도읍 재재기로 101경기도 남양주시 화도읍 차산리 298-1번지2922929229기타 가공 공작기계 제조업<NA>
26762677남양주진관일반산업단지희진상사일반산업단지031-527-5170031-527-5172레이져 가공품, 수지제판경기도 남양주시 진건읍 진관산단로59번길 35 1층경기도 남양주시 진건읍 진관리 981-02592925929그 외 기타 금속가공업철판