Overview

Dataset statistics

Number of variables: 5
Number of observations: 252
Missing cells: 84
Missing cells (%): 6.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.2 KiB
Average record size in memory: 41.5 B
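
As a quick consistency check, the missing-cell percentage can be recomputed from the other figures in this block. The following is a minimal sketch, assuming the percentage is taken over all cells (observations times variables); the variable names are illustrative only.

```python
# Figures copied from the dataset statistics of this report.
n_variables = 5
n_observations = 252
missing_cells = 84

total_cells = n_variables * n_observations        # every row has one cell per variable
missing_pct = 100 * missing_cells / total_cells   # share of empty cells, in percent

print(f"{total_cells} cells, {missing_pct:.1f}% missing")
```

Because all 84 missing cells belong to a single column (전화번호), the same count corresponds to 84 / 252 = 33.3% of that column.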

Variable types

Numeric: 1
Text: 3
Categorical: 1

Dataset

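Description: Data on electronic component manufacturers located in Namdong-gu, Incheon Metropolitan City. It provides the fields 연번 (serial number), 업체명 (company name), 소재지 (address), 전화번호 (phone number), and 데이터기준일자 (data reference date).
Author: Namdong-gu, Incheon Metropolitan City
URL: https://www.data.go.kr/data/15091422/fileData.do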

Alerts

데이터기준일자 has constant value "2023-09-11" (Constant)
전화번호 has 84 (33.3%) missing values (Missing)
연번 has unique values (Unique)
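
The three alert types listed here can be illustrated with a small check per column. This is a minimal sketch of the idea and not the profiling library's own implementation; the function name `column_alerts` and the four-row excerpt are hypothetical, with `None` standing in for a missing cell.

```python
def column_alerts(values):
    """Return simple alert labels for one column; None marks a missing cell."""
    present = [v for v in values if v is not None]
    alerts = []
    if len(present) < len(values):
        alerts.append("Missing")      # at least one empty cell
    if present and len(set(present)) == 1:
        alerts.append("Constant")     # a single value repeated in every row
    if len(present) > 1 and len(set(present)) == len(present):
        alerts.append("Unique")       # no value occurs twice
    return alerts

# Hypothetical four-row excerpt shaped like the profiled dataset.
columns = {
    "연번": [1, 2, 3, 4],
    "전화번호": ["032-812-9914", "032-812-9914", None, "032-815-7083"],
    "데이터기준일자": ["2023-09-11"] * 4,
}
for name, values in columns.items():
    print(name, column_alerts(values))
```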

Reproduction

Analysis started: 2023-12-12 04:55:13.151208
Analysis finished: 2023-12-12 04:55:13.642301
Duration: 0.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 252
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 126.5
Minimum: 1
Maximum: 252
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 13.55
Q1: 63.75
Median: 126.5
Q3: 189.25
95-th percentile: 239.45
Maximum: 252
Range: 251
Interquartile range (IQR): 125.5

Descriptive statistics

Standard deviation: 72.890329
Coefficient of variation (CV): 0.57620813
Kurtosis: -1.2
Mean: 126.5
Median Absolute Deviation (MAD): 63
Skewness: 0
Sum: 31878
Variance: 5313
Monotonicity: Strictly increasing
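
Since 연번 is unique, strictly increasing, and spans 1 to 252 over 252 rows, most of the quantile and descriptive statistics reported for it can be recomputed from the integers 1..252. The sketch below does this with the Python standard library, assuming linearly interpolated quantiles and the sample (n - 1) variance; the helper `quantile` is illustrative and not taken from the profiling tool. Kurtosis and skewness are not recomputed here.

```python
import statistics

values = list(range(1, 253))  # assumption: 연번 is exactly 1, 2, ..., 252


def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])


total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)   # sample variance, n - 1 in the denominator
std = statistics.stdev(values)
cv = std / mean                          # coefficient of variation
mad = statistics.median(abs(v - median) for v in values)
p05, q1, q3, p95 = (quantile(values, q) for q in (0.05, 0.25, 0.75, 0.95))
iqr = q3 - q1

print(total, mean, median, variance)
print(round(std, 6), round(cv, 8), mad)
print(round(p05, 2), q1, q3, round(p95, 2), iqr)
```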
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.4%
175 | 1 | 0.4%
162 | 1 | 0.4%
163 | 1 | 0.4%
164 | 1 | 0.4%
165 | 1 | 0.4%
166 | 1 | 0.4%
167 | 1 | 0.4%
168 | 1 | 0.4%
169 | 1 | 0.4%
Other values (242) | 242 | 96.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.4%
2 | 1 | 0.4%
3 | 1 | 0.4%
4 | 1 | 0.4%
5 | 1 | 0.4%
6 | 1 | 0.4%
7 | 1 | 0.4%
8 | 1 | 0.4%
9 | 1 | 0.4%
10 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
252 | 1 | 0.4%
251 | 1 | 0.4%
250 | 1 | 0.4%
249 | 1 | 0.4%
248 | 1 | 0.4%
247 | 1 | 0.4%
246 | 1 | 0.4%
245 | 1 | 0.4%
244 | 1 | 0.4%
243 | 1 | 0.4%

업체명
Text

Distinct: 243
Distinct (%): 96.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 14
Median length: 12
Mean length: 6.7063492
Min length: 2

Characters and Unicode

Total characters: 1690
Distinct characters: 200
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
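
As an illustration of these character properties, the general category of each code point can be looked up with Python's standard `unicodedata` module. The sketch below counts categories for the five sample values listed for this variable; it only illustrates the lookup and is not the report's own counting code. In the two-letter codes, "Lo" is Other Letter, "Ps" is Open Punctuation, and "Pe" is Close Punctuation.

```python
import unicodedata
from collections import Counter

# The five sample values shown for this variable in the report.
names = ["(주)강민전자", "(주)거산테크", "(주)고려에스크", "(주)그린하이테크", "(주)나노스트림"]

# Count the Unicode general category of every character across all names.
categories = Counter(unicodedata.category(ch) for name in names for ch in name)

for code, count in categories.most_common():
    print(code, count)
```

Hangul syllables fall under Other Letter, which is consistent with that category dominating the character counts for this column.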

Unique

Unique: 234
Unique (%): 92.9%
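
The report distinguishes "Distinct" (the number of different values) from "Unique" (the number of values that occur exactly once). A minimal sketch of that distinction follows; the function name and the four-item list are hypothetical. The last lines apply the same reasoning to the figures reported for this column (252 rows, 243 distinct, 234 unique).

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Hypothetical mini-column: "beta" is repeated, so it is distinct but not unique.
print(distinct_and_unique(["alpha", "beta", "beta", "gamma"]))

# Figures reported for this column.
n_rows, n_distinct, n_unique = 252, 243, 234
repeated_values = n_distinct - n_unique   # values that occur more than once
rows_with_repeats = n_rows - n_unique     # rows holding those repeated values
print(repeated_values, rows_with_repeats, rows_with_repeats / repeated_values)
```

With no missing cells, 9 repeated values spread over 18 rows means each repeated name occurs exactly twice.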

Sample

1st row: (주)강민전자
2nd row: (주)거산테크
3rd row: (주)고려에스크
4th row: (주)그린하이테크
5th row: (주)나노스트림

Value | Count | Frequency (%)
주식회사 | 5 | 1.9%
tech | 3 | 1.1%
주)동진티아이 | 2 | 0.8%
신양전선(주 | 2 | 0.8%
주)나노앤텍 | 2 | 0.8%
주)에스비전자 | 2 | 0.8%
아이에스테크놀로지(주 | 2 | 0.8%
주)엠엔에이치일렉트로닉스 | 2 | 0.8%
케이에스전자(주 | 2 | 0.8%
한국단자공업(주 | 2 | 0.8%
Other values (238) | 240 | 90.9%

Most occurring characters

ValueCountFrequency (%)
140
 
8.3%
) 137
 
8.1%
( 137
 
8.1%
107
 
6.3%
66
 
3.9%
62
 
3.7%
50
 
3.0%
45
 
2.7%
39
 
2.3%
38
 
2.2%
Other values (190) 869
51.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1341 | 79.3%
Close Punctuation | 137 | 8.1%
Open Punctuation | 137 | 8.1%
Uppercase Letter | 51 | 3.0%
Space Separator | 14 | 0.8%
Lowercase Letter | 6 | 0.4%
Decimal Number | 2 | 0.1%
Dash Punctuation | 1 | 0.1%
Other Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
140
 
10.4%
107
 
8.0%
66
 
4.9%
62
 
4.6%
50
 
3.7%
45
 
3.4%
39
 
2.9%
38
 
2.8%
27
 
2.0%
26
 
1.9%
Other values (166) 741
55.3%
Uppercase Letter
ValueCountFrequency (%)
T 9
17.6%
E 7
13.7%
C 7
13.7%
H 7
13.7%
M 4
7.8%
S 4
7.8%
J 3
 
5.9%
K 3
 
5.9%
B 2
 
3.9%
Y 1
 
2.0%
Other values (4) 4
7.8%
Lowercase Letter
ValueCountFrequency (%)
e 2
33.3%
c 2
33.3%
h 2
33.3%
Decimal Number
ValueCountFrequency (%)
2 1
50.0%
3 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 137
100.0%
Open Punctuation
ValueCountFrequency (%)
( 137
100.0%
Space Separator
ValueCountFrequency (%)
14
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1341
79.3%
Common 292
 
17.3%
Latin 57
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
140
 
10.4%
107
 
8.0%
66
 
4.9%
62
 
4.6%
50
 
3.7%
45
 
3.4%
39
 
2.9%
38
 
2.8%
27
 
2.0%
26
 
1.9%
Other values (166) 741
55.3%
Latin
ValueCountFrequency (%)
T 9
15.8%
E 7
12.3%
C 7
12.3%
H 7
12.3%
M 4
7.0%
S 4
7.0%
J 3
 
5.3%
K 3
 
5.3%
e 2
 
3.5%
c 2
 
3.5%
Other values (7) 9
15.8%
Common
ValueCountFrequency (%)
) 137
46.9%
( 137
46.9%
14
 
4.8%
- 1
 
0.3%
2 1
 
0.3%
3 1
 
0.3%
. 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1341
79.3%
ASCII 349
 
20.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
140
 
10.4%
107
 
8.0%
66
 
4.9%
62
 
4.6%
50
 
3.7%
45
 
3.4%
39
 
2.9%
38
 
2.8%
27
 
2.0%
26
 
1.9%
Other values (166) 741
55.3%
ASCII
ValueCountFrequency (%)
) 137
39.3%
( 137
39.3%
14
 
4.0%
T 9
 
2.6%
E 7
 
2.0%
C 7
 
2.0%
H 7
 
2.0%
M 4
 
1.1%
S 4
 
1.1%
J 3
 
0.9%
Other values (14) 20
 
5.7%

소재지
Text

Distinct: 221
Distinct (%): 87.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Length

Max length: 70
Median length: 51
Mean length: 34.746032
Min length: 19

Characters and Unicode

Total characters: 8756
Distinct characters: 127
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 200
Unique (%): 79.4%

Sample

1st row: 인천광역시 남동구 청능대로 250, 씨동 101호 (고잔동)
2nd row: 인천광역시 남동구 남동대로49번길 104, 697-6 [128-7블록] (고잔동)
3rd row: 인천광역시 남동구 남동동로33번길 28-12 (고잔동)
4th row: 인천광역시 남동구 은봉로 52, 1112호(논현동, NIC지식산업센터) 1112호
5th row: 인천광역시 남동구 호구포로 189, 9층 902호 (고잔동, 남동테크노타워)
Value | Count | Frequency (%)
인천광역시 | 252 | 15.4%
남동구 | 252 | 15.4%
고잔동 | 144 | 8.8%
논현동 | 48 | 2.9%
남촌동 | 27 | 1.7%
남동서로 | 24 | 1.5%
호구포로 | 21 | 1.3%
함박뫼로 | 19 | 1.2%
남동동로 | 16 | 1.0%
은봉로 | 12 | 0.7%
Other values (415) | 819 | 50.1%
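
A word-frequency table like the one above can be approximated by splitting each address on whitespace and stripping punctuation from the ends of each token. The sketch below applies this to the five sample addresses of this variable. The tokenisation rule is an assumption made for illustration and may differ from what the profiling tool actually does.

```python
import string
from collections import Counter

# The five sample addresses shown for this variable in the report.
addresses = [
    "인천광역시 남동구 청능대로 250, 씨동 101호 (고잔동)",
    "인천광역시 남동구 남동대로49번길 104, 697-6 [128-7블록] (고잔동)",
    "인천광역시 남동구 남동동로33번길 28-12 (고잔동)",
    "인천광역시 남동구 은봉로 52, 1112호(논현동, NIC지식산업센터) 1112호",
    "인천광역시 남동구 호구포로 189, 9층 902호 (고잔동, 남동테크노타워)",
]


def words(text):
    """Split on whitespace and strip ASCII punctuation from both ends of each token."""
    for token in text.split():
        token = token.strip(string.punctuation)
        if token:
            yield token


counts = Counter(w for address in addresses for w in words(address))
print(counts.most_common(3))
```

Punctuation inside a token is kept under this rule, so "1112호(논현동" stays a single word. The same effect would explain entries such as "신양전선(주" in the company-name word table.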

Most occurring characters

ValueCountFrequency (%)
1387
 
15.8%
678
 
7.7%
1 404
 
4.6%
397
 
4.5%
364
 
4.2%
281
 
3.2%
( 262
 
3.0%
) 262
 
3.0%
257
 
2.9%
253
 
2.9%
Other values (117) 4211
48.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4834 | 55.2%
Decimal Number | 1752 | 20.0%
Space Separator | 1387 | 15.8%
Open Punctuation | 263 | 3.0%
Close Punctuation | 263 | 3.0%
Other Punctuation | 196 | 2.2%
Dash Punctuation | 33 | 0.4%
Uppercase Letter | 27 | 0.3%
Lowercase Letter | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
678
14.0%
397
 
8.2%
364
 
7.5%
281
 
5.8%
257
 
5.3%
253
 
5.2%
253
 
5.2%
252
 
5.2%
252
 
5.2%
179
 
3.7%
Other values (92) 1668
34.5%
Decimal Number
ValueCountFrequency (%)
1 404
23.1%
3 204
11.6%
2 174
9.9%
5 158
 
9.0%
4 155
 
8.8%
0 149
 
8.5%
6 136
 
7.8%
9 133
 
7.6%
8 120
 
6.8%
7 119
 
6.8%
Uppercase Letter
ValueCountFrequency (%)
B 8
29.6%
C 7
25.9%
N 4
14.8%
I 4
14.8%
A 2
 
7.4%
L 1
 
3.7%
D 1
 
3.7%
Open Punctuation
ValueCountFrequency (%)
( 262
99.6%
[ 1
 
0.4%
Close Punctuation
ValueCountFrequency (%)
) 262
99.6%
] 1
 
0.4%
Space Separator
ValueCountFrequency (%)
1387
100.0%
Other Punctuation
ValueCountFrequency (%)
, 196
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%
Lowercase Letter
ValueCountFrequency (%)
b 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4834
55.2%
Common 3894
44.5%
Latin 28
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
678
14.0%
397
 
8.2%
364
 
7.5%
281
 
5.8%
257
 
5.3%
253
 
5.2%
253
 
5.2%
252
 
5.2%
252
 
5.2%
179
 
3.7%
Other values (92) 1668
34.5%
Common
ValueCountFrequency (%)
1387
35.6%
1 404
 
10.4%
( 262
 
6.7%
) 262
 
6.7%
3 204
 
5.2%
, 196
 
5.0%
2 174
 
4.5%
5 158
 
4.1%
4 155
 
4.0%
0 149
 
3.8%
Other values (7) 543
 
13.9%
Latin
ValueCountFrequency (%)
B 8
28.6%
C 7
25.0%
N 4
14.3%
I 4
14.3%
A 2
 
7.1%
L 1
 
3.6%
D 1
 
3.6%
b 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4834
55.2%
ASCII 3922
44.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1387
35.4%
1 404
 
10.3%
( 262
 
6.7%
) 262
 
6.7%
3 204
 
5.2%
, 196
 
5.0%
2 174
 
4.4%
5 158
 
4.0%
4 155
 
4.0%
0 149
 
3.8%
Other values (15) 571
14.6%
Hangul
ValueCountFrequency (%)
678
14.0%
397
 
8.2%
364
 
7.5%
281
 
5.8%
257
 
5.3%
253
 
5.2%
253
 
5.2%
252
 
5.2%
252
 
5.2%
179
 
3.7%
Other values (92) 1668
34.5%

전화번호
Text

MISSING 

Distinct: 156
Distinct (%): 92.9%
Missing: 84
Missing (%): 33.3%
Memory size: 2.1 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.059524
Min length: 12

Characters and Unicode

Total characters: 2026
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
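
The length and character figures for 전화번호 are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that every non-missing number is either 12 or 13 characters long (as the reported minimum and maximum lengths imply) and contains exactly two hyphens; the variable names are illustrative.

```python
# Figures copied from this report.
n_rows, n_missing = 252, 84
n_distinct = 156
total_chars = 2026

n_present = n_rows - n_missing            # rows that actually hold a phone number
n_len13 = total_chars - 12 * n_present    # each 13-character number adds one extra character
n_len12 = n_present - n_len13
mean_length = total_chars / n_present
dashes = 2 * n_present                    # two hyphens per number
distinct_pct = 100 * n_distinct / n_present

print(n_present, n_len12, n_len13)
print(round(mean_length, 6), dashes, round(distinct_pct, 1))
```

Under these assumptions, 158 numbers follow the 12-character pattern (such as 032-815-7083) and 10 follow the 13-character pattern (such as 070-4125-3309). The percentages for distinct values are evidently computed over the 168 non-missing rows rather than all 252.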

Unique

Unique: 145
Unique (%): 86.3%

Sample

1st row: 032-815-7083
2nd row: 032-822-7296
3rd row: 070-4125-3309
4th row: 032-710-8335
5th row: 032-819-8440

Value | Count | Frequency (%)
032-812-9914 | 3 | 1.8%
032-815-1411 | 2 | 1.2%
032-850-2600 | 2 | 1.2%
032-813-7431 | 2 | 1.2%
032-814-0173 | 2 | 1.2%
032-462-5553 | 2 | 1.2%
032-500-1713 | 2 | 1.2%
032-811-7736 | 2 | 1.2%
032-428-1469 | 2 | 1.2%
032-814-9981 | 2 | 1.2%
Other values (146) | 147 | 87.5%

Most occurring characters

ValueCountFrequency (%)
- 336
16.6%
0 270
13.3%
2 266
13.1%
3 260
12.8%
1 232
11.5%
8 201
9.9%
7 108
 
5.3%
5 93
 
4.6%
4 92
 
4.5%
9 84
 
4.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1690 | 83.4%
Dash Punctuation | 336 | 16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 270
16.0%
2 266
15.7%
3 260
15.4%
1 232
13.7%
8 201
11.9%
7 108
 
6.4%
5 93
 
5.5%
4 92
 
5.4%
9 84
 
5.0%
6 84
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 336
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2026
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 336
16.6%
0 270
13.3%
2 266
13.1%
3 260
12.8%
1 232
11.5%
8 201
9.9%
7 108
 
5.3%
5 93
 
4.6%
4 92
 
4.5%
9 84
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2026
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 336
16.6%
0 270
13.3%
2 266
13.1%
3 260
12.8%
1 232
11.5%
8 201
9.9%
7 108
 
5.3%
5 93
 
4.6%
4 92
 
4.5%
9 84
 
4.1%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB

Value | Count
2023-09-11 | 252

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-09-11
2nd row: 2023-09-11
3rd row: 2023-09-11
4th row: 2023-09-11
5th row: 2023-09-11

Common Values

Value | Count | Frequency (%)
2023-09-11 | 252 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-09-11 | 252 | 100.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

index | 연번 | 업체명 | 소재지 | 전화번호 | 데이터기준일자
0 | 1 | (주)강민전자 | 인천광역시 남동구 청능대로 250, 씨동 101호 (고잔동) | 032-815-7083 | 2023-09-11
1 | 2 | (주)거산테크 | 인천광역시 남동구 남동대로49번길 104, 697-6 [128-7블록] (고잔동) | 032-822-7296 | 2023-09-11
2 | 3 | (주)고려에스크 | 인천광역시 남동구 남동동로33번길 28-12 (고잔동) | 070-4125-3309 | 2023-09-11
3 | 4 | (주)그린하이테크 | 인천광역시 남동구 은봉로 52, 1112호(논현동, NIC지식산업센터) 1112호 | 032-710-8335 | 2023-09-11
4 | 5 | (주)나노스트림 | 인천광역시 남동구 호구포로 189, 9층 902호 (고잔동, 남동테크노타워) | 032-819-8440 | 2023-09-11
5 | 6 | (주)나노앤텍 | 인천광역시 남동구 남동서로 351 (남촌동) | 032-812-9914 | 2023-09-11
6 | 7 | (주)나노앤텍 제3공장 | 인천광역시 남동구 능허대로 552 (고잔동) | 032-812-9914 | 2023-09-11
7 | 8 | (주)뉴두리테크 | 인천광역시 남동구 함박뫼로 340, 4층 (논현동) | 032-815-0725 | 2023-09-11
8 | 9 | (주)다산 | 인천광역시 남동구 남동동로77번길 44, 142블럭 13로트 (고잔동) | 032-811-6700 | 2023-09-11
9 | 10 | (주)다인테크놀로지 | 인천광역시 남동구 호구포로14번길 21-15, 166블록 6로트 (고잔동) | 032-432-0779 | 2023-09-11

Last rows

index | 연번 | 업체명 | 소재지 | 전화번호 | 데이터기준일자
242 | 243 | 필자동화 | 인천광역시 남동구 남동서로269번길 31, 20블록2로트 (논현동) | <NA> | 2023-09-11
243 | 244 | 하도에스에이 | 인천광역시 남동구 함박뫼로318번길 20, 17블록 4로트 301호 (논현동) | <NA> | 2023-09-11
244 | 245 | 하도전자(주) | 인천광역시 남동구 함박뫼로318번길 20, 17블록4로트(203호) (논현동) | 032-814-9525 | 2023-09-11
245 | 246 | 한국단자공업(주) | 인천광역시 남동구 남동대로155번길 70, 85블럭 16로트 (고잔동) | 032-814-9981 | 2023-09-11
246 | 247 | 한국단자공업(주) | 인천광역시 남동구 은봉로 123, 49블럭 3로트 (논현동) | 032-814-9981 | 2023-09-11
247 | 248 | 한성텍 | 인천광역시 남동구 은청로 88, 나동 2층 (고잔동) | 032-816-0654 | 2023-09-11
248 | 249 | 한피스전자 | 인천광역시 남동구 남동대로79번길 71, 129블럭 2로트(고잔동) | <NA> | 2023-09-11
249 | 250 | 한호전자 | 인천광역시 남동구 경인로 744 (간석동, 한호빌딩) | 032-514-0813 | 2023-09-11
250 | 251 | 현성ASB | 인천광역시 남동구 남동서로 96, 119블럭 1로트(690번지) (고잔동) | <NA> | 2023-09-11
251 | 252 | 화신전자 | 인천광역시 남동구 은봉로 52 (논현동) NIC지식산업센터 1동 6층 608호 | 032-811-7736 | 2023-09-11