Overview

Dataset statistics

Number of variables12
Number of observations264
Missing cells2
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory25.4 KiB
Average record size in memory98.5 B

Variable types

Text6
Categorical3
Numeric2
DateTime1

Dataset

Description오산시 관내 기업체 현황으로 회사명, 단지명, 대표자명, 보유구분, 종업원수, 생산품, 규모, 대표주소, 업종명, 등에 정보를 제공합니다.
Author경기도 오산시
URLhttps://www.data.go.kr/data/15033599/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
종업원수 is highly overall correlated with 규모High correlation
우편번호 is highly overall correlated with 단지명High correlation
단지명 is highly overall correlated with 우편번호High correlation
규모 is highly overall correlated with 종업원수High correlation
종업원수 has 4 (1.5%) zerosZeros

Reproduction

Analysis started2023-12-12 08:11:15.224849
Analysis finished2023-12-12 08:11:16.584191
Duration1.36 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct262
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T17:11:16.764382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length8.0492424
Min length2

Characters and Unicode

Total characters2125
Distinct characters268
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique260 ?
Unique (%)98.5%

Sample

1st row(사)한국지체장애인협회영상음향사업소
2nd row(주)가나ENC
3rd row(주)경성
4th row(주)그린웍스
5th row(주)나라기술
ValueCountFrequency (%)
제2공장 9
 
3.0%
주식회사 5
 
1.7%
오산공장 5
 
1.7%
제이씨앤엠(주 3
 
1.0%
주)아모레퍼시픽 3
 
1.0%
주)티로보틱스 3
 
1.0%
농업회사법인 2
 
0.7%
주)원우정밀 2
 
0.7%
주)와이솔 2
 
0.7%
주)에스지티 2
 
0.7%
Other values (253) 265
88.0%
2023-12-12T17:11:17.142001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
218
 
10.3%
( 213
 
10.0%
) 213
 
10.0%
84
 
4.0%
68
 
3.2%
57
 
2.7%
38
 
1.8%
37
 
1.7%
35
 
1.6%
29
 
1.4%
Other values (258) 1133
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1625
76.5%
Open Punctuation 213
 
10.0%
Close Punctuation 213
 
10.0%
Space Separator 37
 
1.7%
Decimal Number 17
 
0.8%
Other Symbol 11
 
0.5%
Uppercase Letter 7
 
0.3%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
218
 
13.4%
84
 
5.2%
68
 
4.2%
57
 
3.5%
38
 
2.3%
35
 
2.2%
29
 
1.8%
29
 
1.8%
26
 
1.6%
25
 
1.5%
Other values (242) 1016
62.5%
Uppercase Letter
ValueCountFrequency (%)
E 1
14.3%
W 1
14.3%
K 1
14.3%
N 1
14.3%
C 1
14.3%
T 1
14.3%
B 1
14.3%
Decimal Number
ValueCountFrequency (%)
2 11
64.7%
3 3
 
17.6%
4 2
 
11.8%
1 1
 
5.9%
Open Punctuation
ValueCountFrequency (%)
( 213
100.0%
Close Punctuation
ValueCountFrequency (%)
) 213
100.0%
Space Separator
ValueCountFrequency (%)
37
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1636
77.0%
Common 482
 
22.7%
Latin 7
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
218
 
13.3%
84
 
5.1%
68
 
4.2%
57
 
3.5%
38
 
2.3%
35
 
2.1%
29
 
1.8%
29
 
1.8%
26
 
1.6%
25
 
1.5%
Other values (243) 1027
62.8%
Common
ValueCountFrequency (%)
( 213
44.2%
) 213
44.2%
37
 
7.7%
2 11
 
2.3%
3 3
 
0.6%
4 2
 
0.4%
. 2
 
0.4%
1 1
 
0.2%
Latin
ValueCountFrequency (%)
E 1
14.3%
W 1
14.3%
K 1
14.3%
N 1
14.3%
C 1
14.3%
T 1
14.3%
B 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1625
76.5%
ASCII 489
 
23.0%
None 11
 
0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
218
 
13.4%
84
 
5.2%
68
 
4.2%
57
 
3.5%
38
 
2.3%
35
 
2.2%
29
 
1.8%
29
 
1.8%
26
 
1.6%
25
 
1.5%
Other values (242) 1016
62.5%
ASCII
ValueCountFrequency (%)
( 213
43.6%
) 213
43.6%
37
 
7.6%
2 11
 
2.2%
3 3
 
0.6%
4 2
 
0.4%
. 2
 
0.4%
E 1
 
0.2%
W 1
 
0.2%
K 1
 
0.2%
Other values (5) 5
 
1.0%
None
ValueCountFrequency (%)
11
100.0%

단지명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
<NA>
174 
오산가장2일반산업단지
51 
오산가장지방산업단지
33 
오산세마일반산업단지
 
6

Length

Max length11
Median length4
Mean length6.2386364
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row오산가장2일반산업단지

Common Values

ValueCountFrequency (%)
<NA> 174
65.9%
오산가장2일반산업단지 51
 
19.3%
오산가장지방산업단지 33
 
12.5%
오산세마일반산업단지 6
 
2.3%

Length

2023-12-12T17:11:17.337367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:11:17.456107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 174
65.9%
오산가장2일반산업단지 51
 
19.3%
오산가장지방산업단지 33
 
12.5%
오산세마일반산업단지 6
 
2.3%
Distinct244
Distinct (%)92.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T17:11:17.921276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length3
Mean length3.4204545
Min length3

Characters and Unicode

Total characters903
Distinct characters181
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique227 ?
Unique (%)86.0%

Sample

1st row김광환
2nd row한상권
3rd row이헌방
4th row전영실
5th row유옥상
ValueCountFrequency (%)
안승욱 4
 
1.4%
서경배 3
 
1.1%
조용진 3
 
1.1%
김용담 2
 
0.7%
윤진해 2
 
0.7%
이윤재 2
 
0.7%
강규정 2
 
0.7%
성수경 2
 
0.7%
황부광 2
 
0.7%
이동건 2
 
0.7%
Other values (252) 259
91.5%
2023-12-12T17:11:18.535005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
57
 
6.3%
37
 
4.1%
25
 
2.8%
25
 
2.8%
23
 
2.5%
22
 
2.4%
20
 
2.2%
18
 
2.0%
18
 
2.0%
18
 
2.0%
Other values (171) 640
70.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 856
94.8%
Space Separator 20
 
2.2%
Other Punctuation 15
 
1.7%
Uppercase Letter 9
 
1.0%
Decimal Number 1
 
0.1%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
6.7%
37
 
4.3%
25
 
2.9%
25
 
2.9%
23
 
2.7%
22
 
2.6%
18
 
2.1%
18
 
2.1%
18
 
2.1%
17
 
2.0%
Other values (160) 596
69.6%
Uppercase Letter
ValueCountFrequency (%)
N 2
22.2%
C 2
22.2%
H 2
22.2%
U 1
11.1%
G 1
11.1%
E 1
11.1%
Space Separator
ValueCountFrequency (%)
20
100.0%
Other Punctuation
ValueCountFrequency (%)
, 15
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 856
94.8%
Common 38
 
4.2%
Latin 9
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
6.7%
37
 
4.3%
25
 
2.9%
25
 
2.9%
23
 
2.7%
22
 
2.6%
18
 
2.1%
18
 
2.1%
18
 
2.1%
17
 
2.0%
Other values (160) 596
69.6%
Latin
ValueCountFrequency (%)
N 2
22.2%
C 2
22.2%
H 2
22.2%
U 1
11.1%
G 1
11.1%
E 1
11.1%
Common
ValueCountFrequency (%)
20
52.6%
, 15
39.5%
1 1
 
2.6%
) 1
 
2.6%
( 1
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 856
94.8%
ASCII 47
 
5.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
57
 
6.7%
37
 
4.3%
25
 
2.9%
25
 
2.9%
23
 
2.7%
22
 
2.6%
18
 
2.1%
18
 
2.1%
18
 
2.1%
17
 
2.0%
Other values (160) 596
69.6%
ASCII
ValueCountFrequency (%)
20
42.6%
, 15
31.9%
N 2
 
4.3%
C 2
 
4.3%
H 2
 
4.3%
1 1
 
2.1%
U 1
 
2.1%
) 1
 
2.1%
G 1
 
2.1%
( 1
 
2.1%

보유구분
Categorical

Distinct2
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
자가
157 
임대
107 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row임대
2nd row자가
3rd row자가
4th row임대
5th row자가

Common Values

ValueCountFrequency (%)
자가 157
59.5%
임대 107
40.5%

Length

2023-12-12T17:11:18.726013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:11:18.850785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
자가 157
59.5%
임대 107
40.5%

종업원수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct86
Distinct (%)32.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46.037879
Minimum0
Maximum1514
Zeros4
Zeros (%)1.5%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2023-12-12T17:11:18.991235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q16
median14
Q338.5
95-th percentile174.65
Maximum1514
Range1514
Interquartile range (IQR)32.5

Descriptive statistics

Standard deviation120.24398
Coefficient of variation (CV)2.6118488
Kurtosis87.406313
Mean46.037879
Median Absolute Deviation (MAD)10.5
Skewness8.08704
Sum12154
Variance14458.615
MonotonicityNot monotonic
2023-12-12T17:11:19.216408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3 15
 
5.7%
4 13
 
4.9%
5 13
 
4.9%
7 12
 
4.5%
2 11
 
4.2%
10 11
 
4.2%
11 11
 
4.2%
8 10
 
3.8%
1 8
 
3.0%
40 7
 
2.7%
Other values (76) 153
58.0%
ValueCountFrequency (%)
0 4
 
1.5%
1 8
3.0%
2 11
4.2%
3 15
5.7%
4 13
4.9%
5 13
4.9%
6 5
 
1.9%
7 12
4.5%
8 10
3.8%
9 6
 
2.3%
ValueCountFrequency (%)
1514 1
0.4%
600 1
0.4%
500 1
0.4%
420 1
0.4%
400 1
0.4%
395 1
0.4%
380 1
0.4%
329 1
0.4%
311 1
0.4%
271 1
0.4%
Distinct251
Distinct (%)95.4%
Missing1
Missing (%)0.4%
Memory size2.2 KiB
2023-12-12T17:11:19.517930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length36
Mean length12.577947
Min length2

Characters and Unicode

Total characters3308
Distinct characters410
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique246 ?
Unique (%)93.5%

Sample

1st rowCCTV, 방송장비, 교통 신호장치
2nd row닥트
3rd row알루미늄 표면처리 및 도장
4th row조류충돌방지제품(스티커, 필름)
5th row화장품 제조용 교반기
ValueCountFrequency (%)
반도체 26
 
3.8%
21
 
3.1%
화장품 15
 
2.2%
부품 14
 
2.1%
12
 
1.8%
제조용 9
 
1.3%
장비 8
 
1.2%
디스플레이 6
 
0.9%
산업용 6
 
0.9%
기계 5
 
0.7%
Other values (475) 555
82.0%
2023-12-12T17:11:20.078607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
417
 
12.6%
, 154
 
4.7%
98
 
3.0%
83
 
2.5%
72
 
2.2%
68
 
2.1%
66
 
2.0%
50
 
1.5%
49
 
1.5%
45
 
1.4%
Other values (400) 2206
66.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2302
69.6%
Space Separator 417
 
12.6%
Uppercase Letter 279
 
8.4%
Other Punctuation 157
 
4.7%
Lowercase Letter 94
 
2.8%
Open Punctuation 26
 
0.8%
Close Punctuation 26
 
0.8%
Decimal Number 5
 
0.2%
Dash Punctuation 1
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
98
 
4.3%
83
 
3.6%
72
 
3.1%
68
 
3.0%
66
 
2.9%
50
 
2.2%
49
 
2.1%
45
 
2.0%
45
 
2.0%
40
 
1.7%
Other values (346) 1686
73.2%
Uppercase Letter
ValueCountFrequency (%)
C 42
15.1%
L 31
11.1%
E 30
10.8%
D 30
10.8%
P 18
 
6.5%
A 16
 
5.7%
S 14
 
5.0%
R 14
 
5.0%
T 12
 
4.3%
B 12
 
4.3%
Other values (12) 60
21.5%
Lowercase Letter
ValueCountFrequency (%)
e 19
20.2%
a 13
13.8%
r 11
11.7%
t 10
10.6%
l 5
 
5.3%
n 5
 
5.3%
s 5
 
5.3%
c 4
 
4.3%
i 3
 
3.2%
o 3
 
3.2%
Other values (11) 16
17.0%
Other Punctuation
ValueCountFrequency (%)
, 154
98.1%
/ 2
 
1.3%
& 1
 
0.6%
Decimal Number
ValueCountFrequency (%)
3 2
40.0%
2 2
40.0%
4 1
20.0%
Space Separator
ValueCountFrequency (%)
417
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2301
69.6%
Common 633
 
19.1%
Latin 373
 
11.3%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
98
 
4.3%
83
 
3.6%
72
 
3.1%
68
 
3.0%
66
 
2.9%
50
 
2.2%
49
 
2.1%
45
 
2.0%
45
 
2.0%
40
 
1.7%
Other values (345) 1685
73.2%
Latin
ValueCountFrequency (%)
C 42
 
11.3%
L 31
 
8.3%
E 30
 
8.0%
D 30
 
8.0%
e 19
 
5.1%
P 18
 
4.8%
A 16
 
4.3%
S 14
 
3.8%
R 14
 
3.8%
a 13
 
3.5%
Other values (33) 146
39.1%
Common
ValueCountFrequency (%)
417
65.9%
, 154
 
24.3%
( 26
 
4.1%
) 26
 
4.1%
3 2
 
0.3%
/ 2
 
0.3%
2 2
 
0.3%
4 1
 
0.2%
- 1
 
0.2%
& 1
 
0.2%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2301
69.6%
ASCII 1006
30.4%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
417
41.5%
, 154
 
15.3%
C 42
 
4.2%
L 31
 
3.1%
E 30
 
3.0%
D 30
 
3.0%
( 26
 
2.6%
) 26
 
2.6%
e 19
 
1.9%
P 18
 
1.8%
Other values (44) 213
21.2%
Hangul
ValueCountFrequency (%)
98
 
4.3%
83
 
3.6%
72
 
3.1%
68
 
3.0%
66
 
2.9%
50
 
2.2%
49
 
2.1%
45
 
2.0%
45
 
2.0%
40
 
1.7%
Other values (345) 1685
73.2%
CJK
ValueCountFrequency (%)
1
100.0%

규모
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
소기업
191 
중기업
55 
대기업
 
18

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row소기업
2nd row소기업
3rd row소기업
4th row소기업
5th row소기업

Common Values

ValueCountFrequency (%)
소기업 191
72.3%
중기업 55
 
20.8%
대기업 18
 
6.8%

Length

2023-12-12T17:11:20.268288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T17:11:20.411839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소기업 191
72.3%
중기업 55
 
20.8%
대기업 18
 
6.8%

우편번호
Real number (ℝ)

HIGH CORRELATION 

Distinct22
Distinct (%)8.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18113.716
Minimum18100
Maximum18151
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.4 KiB
2023-12-12T17:11:20.545148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18100
5-th percentile18102
Q118103
median18104
Q318126
95-th percentile18145
Maximum18151
Range51
Interquartile range (IQR)23

Descriptive statistics

Standard deviation15.626546
Coefficient of variation (CV)0.00086269136
Kurtosis-0.015925853
Mean18113.716
Median Absolute Deviation (MAD)2
Skewness1.1823656
Sum4782021
Variance244.18895
MonotonicityNot monotonic
2023-12-12T17:11:20.709228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
18103 73
27.7%
18102 36
13.6%
18126 28
 
10.6%
18105 21
 
8.0%
18112 15
 
5.7%
18104 15
 
5.7%
18145 14
 
5.3%
18111 11
 
4.2%
18144 9
 
3.4%
18101 7
 
2.7%
Other values (12) 35
13.3%
ValueCountFrequency (%)
18100 4
 
1.5%
18101 7
 
2.7%
18102 36
13.6%
18103 73
27.7%
18104 15
 
5.7%
18105 21
 
8.0%
18111 11
 
4.2%
18112 15
 
5.7%
18118 1
 
0.4%
18119 4
 
1.5%
ValueCountFrequency (%)
18151 4
 
1.5%
18150 6
 
2.3%
18149 1
 
0.4%
18148 1
 
0.4%
18145 14
5.3%
18144 9
 
3.4%
18137 2
 
0.8%
18136 2
 
0.8%
18128 6
 
2.3%
18126 28
10.6%
Distinct258
Distinct (%)98.1%
Missing1
Missing (%)0.4%
Memory size2.2 KiB
2023-12-12T17:11:21.094901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length56
Median length44
Mean length29.190114
Min length17

Characters and Unicode

Total characters7677
Distinct characters200
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique254 ?
Unique (%)96.6%

Sample

1st row경기도 오산시 역광장로 77, 광산빌딩 2층(201~203호) (오산동)
2nd row경기도 오산시 세남로14번길 13-30 (세교동)
3rd row경기도 오산시 황새로149번길 9 (누읍동, (주)경성)
4th row경기도 오산시 독산성로 425, 307호 (세교동) 307호
5th row경기도 오산시 가장산업서로 56-8 (가장동)
ValueCountFrequency (%)
경기도 263
 
17.4%
오산시 263
 
17.4%
가장동 60
 
4.0%
가장산업서북로 34
 
2.2%
31
 
2.1%
가장산업동로 26
 
1.7%
누읍동 22
 
1.5%
1필지 22
 
1.5%
가장산업서로 18
 
1.2%
독산성로 18
 
1.2%
Other values (423) 755
49.9%
2023-12-12T17:11:21.751846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1249
 
16.3%
399
 
5.2%
327
 
4.3%
( 299
 
3.9%
) 299
 
3.9%
281
 
3.7%
280
 
3.6%
275
 
3.6%
269
 
3.5%
264
 
3.4%
Other values (190) 3735
48.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4296
56.0%
Space Separator 1249
 
16.3%
Decimal Number 1247
 
16.2%
Open Punctuation 299
 
3.9%
Close Punctuation 299
 
3.9%
Other Punctuation 138
 
1.8%
Dash Punctuation 137
 
1.8%
Uppercase Letter 11
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
399
 
9.3%
327
 
7.6%
281
 
6.5%
280
 
6.5%
275
 
6.4%
269
 
6.3%
264
 
6.1%
245
 
5.7%
183
 
4.3%
172
 
4.0%
Other values (170) 1601
37.3%
Decimal Number
ValueCountFrequency (%)
1 248
19.9%
2 171
13.7%
4 148
11.9%
3 130
10.4%
5 105
8.4%
8 104
8.3%
6 95
 
7.6%
0 91
 
7.3%
7 85
 
6.8%
9 70
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
A 4
36.4%
F 3
27.3%
B 3
27.3%
T 1
 
9.1%
Space Separator
ValueCountFrequency (%)
1249
100.0%
Open Punctuation
ValueCountFrequency (%)
( 299
100.0%
Close Punctuation
ValueCountFrequency (%)
) 299
100.0%
Other Punctuation
ValueCountFrequency (%)
, 138
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 137
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4296
56.0%
Common 3370
43.9%
Latin 11
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
399
 
9.3%
327
 
7.6%
281
 
6.5%
280
 
6.5%
275
 
6.4%
269
 
6.3%
264
 
6.1%
245
 
5.7%
183
 
4.3%
172
 
4.0%
Other values (170) 1601
37.3%
Common
ValueCountFrequency (%)
1249
37.1%
( 299
 
8.9%
) 299
 
8.9%
1 248
 
7.4%
2 171
 
5.1%
4 148
 
4.4%
, 138
 
4.1%
- 137
 
4.1%
3 130
 
3.9%
5 105
 
3.1%
Other values (6) 446
 
13.2%
Latin
ValueCountFrequency (%)
A 4
36.4%
F 3
27.3%
B 3
27.3%
T 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4296
56.0%
ASCII 3381
44.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1249
36.9%
( 299
 
8.8%
) 299
 
8.8%
1 248
 
7.3%
2 171
 
5.1%
4 148
 
4.4%
, 138
 
4.1%
- 137
 
4.1%
3 130
 
3.8%
5 105
 
3.1%
Other values (10) 457
 
13.5%
Hangul
ValueCountFrequency (%)
399
 
9.3%
327
 
7.6%
281
 
6.5%
280
 
6.5%
275
 
6.4%
269
 
6.3%
264
 
6.1%
245
 
5.7%
183
 
4.3%
172
 
4.0%
Other values (170) 1601
37.3%
Distinct260
Distinct (%)98.5%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T17:11:22.123158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length42
Mean length21.340909
Min length14

Characters and Unicode

Total characters5634
Distinct characters137
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique256 ?
Unique (%)97.0%

Sample

1st row경기도 오산시 오산동 603-9번지 광산빌딩 2층(201~203호)
2nd row경기도 오산시 세교동 74번지
3rd row경기도 오산시 누읍동 80-3번지
4th row경기도 오산시 세교동 595-1 307호 307호
5th row경기도 오산시 가장동 392-8번지
ValueCountFrequency (%)
경기도 264
20.7%
오산시 264
20.7%
가장동 79
 
6.2%
32
 
2.5%
세교동 30
 
2.4%
지곶동 27
 
2.1%
누읍동 26
 
2.0%
1필지 23
 
1.8%
원동 20
 
1.6%
내삼미동 14
 
1.1%
Other values (346) 495
38.9%
2023-12-12T17:11:22.670066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1032
18.3%
287
 
5.1%
280
 
5.0%
273
 
4.8%
267
 
4.7%
264
 
4.7%
264
 
4.7%
264
 
4.7%
229
 
4.1%
- 214
 
3.8%
Other values (127) 2260
40.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3125
55.5%
Decimal Number 1203
 
21.4%
Space Separator 1032
 
18.3%
Dash Punctuation 214
 
3.8%
Close Punctuation 16
 
0.3%
Other Punctuation 16
 
0.3%
Open Punctuation 16
 
0.3%
Uppercase Letter 11
 
0.2%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
287
 
9.2%
280
 
9.0%
273
 
8.7%
267
 
8.5%
264
 
8.4%
264
 
8.4%
264
 
8.4%
229
 
7.3%
161
 
5.2%
83
 
2.7%
Other values (107) 753
24.1%
Decimal Number
ValueCountFrequency (%)
1 213
17.7%
3 177
14.7%
2 135
11.2%
5 133
11.1%
4 132
11.0%
7 100
8.3%
9 96
8.0%
8 81
 
6.7%
6 69
 
5.7%
0 67
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
A 4
36.4%
B 3
27.3%
F 3
27.3%
L 1
 
9.1%
Space Separator
ValueCountFrequency (%)
1032
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 214
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Other Punctuation
ValueCountFrequency (%)
, 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3125
55.5%
Common 2498
44.3%
Latin 11
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
287
 
9.2%
280
 
9.0%
273
 
8.7%
267
 
8.5%
264
 
8.4%
264
 
8.4%
264
 
8.4%
229
 
7.3%
161
 
5.2%
83
 
2.7%
Other values (107) 753
24.1%
Common
ValueCountFrequency (%)
1032
41.3%
- 214
 
8.6%
1 213
 
8.5%
3 177
 
7.1%
2 135
 
5.4%
5 133
 
5.3%
4 132
 
5.3%
7 100
 
4.0%
9 96
 
3.8%
8 81
 
3.2%
Other values (6) 185
 
7.4%
Latin
ValueCountFrequency (%)
A 4
36.4%
B 3
27.3%
F 3
27.3%
L 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3125
55.5%
ASCII 2509
44.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1032
41.1%
- 214
 
8.5%
1 213
 
8.5%
3 177
 
7.1%
2 135
 
5.4%
5 133
 
5.3%
4 132
 
5.3%
7 100
 
4.0%
9 96
 
3.8%
8 81
 
3.2%
Other values (10) 196
 
7.8%
Hangul
ValueCountFrequency (%)
287
 
9.2%
280
 
9.0%
273
 
8.7%
267
 
8.5%
264
 
8.4%
264
 
8.4%
264
 
8.4%
229
 
7.3%
161
 
5.2%
83
 
2.7%
Other values (107) 753
24.1%
Distinct155
Distinct (%)58.7%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-12T17:11:23.018052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length34
Median length25
Mean length16.234848
Min length3

Characters and Unicode

Total characters4286
Distinct characters215
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique115 ?
Unique (%)43.6%

Sample

1st row방송장비 제조업 외 2 종
2nd row구조용 금속 판제품 및 공작물 제조업
3rd row도장 및 기타 피막처리업 외 2 종
4th row그 외 기타 플라스틱 제품 제조업 외 2 종
5th row그 외 기타 특수목적용 기계 제조업
ValueCountFrequency (%)
제조업 250
18.1%
134
 
9.7%
97
 
7.0%
84
 
6.1%
기타 59
 
4.3%
1 53
 
3.8%
기계 53
 
3.8%
제조용 40
 
2.9%
37
 
2.7%
반도체 33
 
2.4%
Other values (226) 542
39.2%
2023-12-12T17:11:23.952390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1118
26.1%
344
 
8.0%
308
 
7.2%
275
 
6.4%
187
 
4.4%
135
 
3.1%
100
 
2.3%
99
 
2.3%
84
 
2.0%
69
 
1.6%
Other values (205) 1567
36.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3042
71.0%
Space Separator 1118
 
26.1%
Decimal Number 98
 
2.3%
Other Punctuation 26
 
0.6%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
344
 
11.3%
308
 
10.1%
275
 
9.0%
187
 
6.1%
135
 
4.4%
100
 
3.3%
99
 
3.3%
84
 
2.8%
69
 
2.3%
67
 
2.2%
Other values (193) 1374
45.2%
Decimal Number
ValueCountFrequency (%)
1 54
55.1%
2 26
26.5%
3 8
 
8.2%
4 6
 
6.1%
7 2
 
2.0%
6 1
 
1.0%
5 1
 
1.0%
Other Punctuation
ValueCountFrequency (%)
, 25
96.2%
. 1
 
3.8%
Space Separator
ValueCountFrequency (%)
1118
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3042
71.0%
Common 1244
29.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
344
 
11.3%
308
 
10.1%
275
 
9.0%
187
 
6.1%
135
 
4.4%
100
 
3.3%
99
 
3.3%
84
 
2.8%
69
 
2.3%
67
 
2.2%
Other values (193) 1374
45.2%
Common
ValueCountFrequency (%)
1118
89.9%
1 54
 
4.3%
2 26
 
2.1%
, 25
 
2.0%
3 8
 
0.6%
4 6
 
0.5%
7 2
 
0.2%
. 1
 
0.1%
( 1
 
0.1%
) 1
 
0.1%
Other values (2) 2
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3041
71.0%
ASCII 1244
29.0%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1118
89.9%
1 54
 
4.3%
2 26
 
2.1%
, 25
 
2.0%
3 8
 
0.6%
4 6
 
0.5%
7 2
 
0.2%
. 1
 
0.1%
( 1
 
0.1%
) 1
 
0.1%
Other values (2) 2
 
0.2%
Hangul
ValueCountFrequency (%)
344
 
11.3%
308
 
10.1%
275
 
9.0%
187
 
6.1%
135
 
4.4%
100
 
3.3%
99
 
3.3%
84
 
2.8%
69
 
2.3%
67
 
2.2%
Other values (192) 1373
45.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
Minimum2023-10-06 00:00:00
Maximum2023-10-06 00:00:00
2023-12-12T17:11:24.116508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:24.235068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T17:11:16.110093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:15.959481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:16.187722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:11:16.034579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:11:24.338325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
단지명보유구분종업원수규모우편번호
단지명1.0000.0730.1470.612NaN
보유구분0.0731.0000.1010.1270.280
종업원수0.1470.1011.0000.5810.053
규모0.6120.1270.5811.0000.310
우편번호NaN0.2800.0530.3101.000
2023-12-12T17:11:24.519349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
규모단지명보유구분
규모1.0000.2800.209
단지명0.2801.0000.119
보유구분0.2090.1191.000
2023-12-12T17:11:24.625714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종업원수우편번호단지명보유구분규모
종업원수1.000-0.1980.1370.1230.528
우편번호-0.1981.0001.0000.2090.203
단지명0.1371.0001.0000.1190.280
보유구분0.1230.2090.1191.0000.209
규모0.5280.2030.2800.2091.000

Missing values

2023-12-12T17:11:16.291716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:11:16.440006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T17:11:16.535691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

회사명단지명대표자명보유구분종업원수생산품규모우편번호대표주소공장대표주소(지번)업종명데이터기준일자
0(사)한국지체장애인협회영상음향사업소<NA>김광환임대18CCTV, 방송장비, 교통 신호장치소기업18137경기도 오산시 역광장로 77, 광산빌딩 2층(201~203호) (오산동)경기도 오산시 오산동 603-9번지 광산빌딩 2층(201~203호)방송장비 제조업 외 2 종2023-10-06
1(주)가나ENC<NA>한상권자가4닥트소기업18105경기도 오산시 세남로14번길 13-30 (세교동)경기도 오산시 세교동 74번지구조용 금속 판제품 및 공작물 제조업2023-10-06
2(주)경성<NA>이헌방자가19알루미늄 표면처리 및 도장소기업18126경기도 오산시 황새로149번길 9 (누읍동, (주)경성)경기도 오산시 누읍동 80-3번지도장 및 기타 피막처리업 외 2 종2023-10-06
3(주)그린웍스<NA>전영실임대3조류충돌방지제품(스티커, 필름)소기업18105경기도 오산시 독산성로 425, 307호 (세교동) 307호경기도 오산시 세교동 595-1 307호 307호그 외 기타 플라스틱 제품 제조업 외 2 종2023-10-06
4(주)나라기술오산가장2일반산업단지유옥상자가16화장품 제조용 교반기소기업18103경기도 오산시 가장산업서로 56-8 (가장동)경기도 오산시 가장동 392-8번지그 외 기타 특수목적용 기계 제조업2023-10-06
5(주)넥썸<NA>김남원자가14반도체 제조용 칠러, 반도체 제오용 센서소기업18145경기도 오산시 남부대로362번길 40(갈곶동)경기도 오산시 갈곶동 319반도체 제조용 기계 제조업2023-10-06
6(주)뉴젠텍오산가장지방산업단지강동원자가11반도체 설계 조립 및 검사용 부품소기업18103경기도 오산시 가장산업동로 28-73 (가장동)경기도 오산시 가장동 376-5번지반도체 제조용 기계 제조업2023-10-06
7(주)다올이엔지오산가장2일반산업단지권오익자가26자동화 설비, 지그중기업18102경기도 오산시 가장산업서북로 75(지곶동)경기도 오산시 지곶동 562-4그 외 기타 전기장비 제조업2023-10-06
8(주)다인디앤씨<NA>전우철자가16목재용 도료소기업18126경기도 오산시 수목원로 46-31(누읍동)경기도 오산시 누읍동 141-6일반용 도료 및 관련제품 제조업2023-10-06
9(주)대림제지<NA>류창승자가167골판지원지대기업18126경기도 오산시 황새로 169, 총18필지(누읍동 7외 17필지) (누읍동, 대림제지(주)) 외 2필지경기도 오산시 누읍동 382-1번지 대림제지(주) 총18필지(누읍동 7외 17필지) 외 2필지기타 종이 및 판지 제조업2023-10-06
회사명단지명대표자명보유구분종업원수생산품규모우편번호대표주소공장대표주소(지번)업종명데이터기준일자
254페스코코리아<NA>정철흠임대3살균제소기업18136경기도 오산시 시장길 11, 1층 일부 (오산동)경기도 오산시 오산동 859-30번지 1층 일부화학 살균.살충제 및 농업용 약제 제조업2023-10-06
255포트로닉(주)<NA>정환국자가13CCTV CAMERA소기업18144경기도 오산시 밀머리로 58-6 (원동, (주)한국빅타)경기도 오산시 원동 543-1번지기타 전기 변환장치 제조업 외 2 종2023-10-06
256하나테크<NA>정성민자가2코터(Coater) 노광기(Stepper)소기업18145경기도 오산시 남부대로362번길 34(갈곶동)경기도 오산시 갈곶동 322-1반도체 제조용 기계 제조업2023-10-06
257하나테크윈<NA>이석봉자가3전기자동제어반소기업18151경기도 오산시 남부대로 486-51 (청호동)경기도 오산시 청호동 139번지배전반 및 전기 자동제어반 제조업2023-10-06
258하이리움산업㈜오산가장2일반산업단지김서영자가32차량용 시트 공기조화장치, 차량용 헤드레스트 공기조화장치, 차량용 냉온장 컵홀더 외소기업18103경기도 오산시 가장산업서로 61(가장동)경기도 오산시 가장동 393-3공기 조화장치 제조업2023-10-06
259한국수출포장공업(주)<NA>허용삼자가55골판지원지대기업18126경기도 오산시 황새로149번길 11 (누읍동, 한국수출포장공업(주)) 외 1필지경기도 오산시 누읍동 80-1번지 외 1필지골판지 제조업 외 4 종2023-10-06
260한들(주)<NA>전상엽자가22김치소기업18101경기도 오산시 양산로 182 (양산동, 참들김치)경기도 오산시 양산동 540-2번지김치류 제조업 외 1 종2023-10-06
261해피이엔씨(주)<NA>문헌균임대7가로등 자동점멸기, LED가로등기구, LED램프용안정기소기업18112경기도 오산시 문시로 110-20, 1층 (외삼미동)경기도 오산시 외삼미동 330-22번지 1층일반용 전기 조명장치 제조업 외 2 종2023-10-06
262효원산업(주)<NA>김성일임대8플라스틱 창틀소기업18111경기도 오산시 경기대로 868-33 (세교동)경기도 오산시 세교동 124-2번지금속 문, 창, 셔터 및 관련제품 제조업 외 1 종2023-10-06
263흥인<NA>이인수자가3인조피혁 표면코팅제소기업18148경기도 오산시 동부대로568번길 102 (부산동)경기도 오산시 부산동 515번지일반용 도료 및 관련제품 제조업2023-10-06