Overview

Dataset statistics

Number of variables: 5
Number of observations: 199
Missing cells: 118
Missing cells (%): 11.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.1 KiB
Average record size in memory: 41.7 B
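
The missing-cell figures can be cross-checked against the per-column missing counts listed under Alerts. The following is a minimal sketch, assuming that 대표자 (26 missing) and 전화번호 (92 missing) are the only columns with missing values, as this report states; the variable names are illustrative and not part of the profiling tool.

```python
# Counts copied from this report; the variable names are illustrative.
n_variables = 5
n_observations = 199
missing_per_column = {"대표자": 26, "전화번호": 92}

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_cells = sum(missing_per_column.values())
missing_pct = 100 * missing_cells / total_cells

print(f"Missing cells: {missing_cells} of {total_cells} ({missing_pct:.1f}%)")

# Per-column share of missing values, relative to the number of observations.
for column, count in missing_per_column.items():
    print(f"{column}: {100 * count / n_observations:.1f}% missing")
```

Note that the dataset-level percentage is taken over all cells, while the per-column percentages are taken over the 199 observations.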

Variable types

Numeric: 1
Text: 4

Dataset

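Description: Data on the status of trading businesses in Namdong-gu, Incheon Metropolitan City. It provides the serial number (연번), company name (기업명), representative (대표자), telephone number (전화번호), and type of trade (종류).
Author: Namdong-gu, Incheon Metropolitan City (인천광역시 남동구)
URL: https://data.incheon.go.kr/findData/publicDataDetail?dataId=15102747&srcSe=7661IVAWM27C61E190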

Alerts

대표자 has 26 (13.1%) missing values [Missing]
전화번호 has 92 (46.2%) missing values [Missing]
연번 has unique values [Unique]
기업명 has unique values [Unique]

Reproduction

Analysis started: 2024-03-18 04:57:23.186824
Analysis finished: 2024-03-18 04:57:24.462597
Duration: 1.28 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 199
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100
Minimum: 1
Maximum: 199
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 10.9
Q1: 50.5
Median: 100
Q3: 149.5
95-th percentile: 189.1
Maximum: 199
Range: 198
Interquartile range (IQR): 99
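
The fractional percentiles (10.9, 50.5, 149.5, 189.1) are what linear interpolation between neighbouring ranks gives for a run of consecutive integers. The sketch below assumes that 연번 is exactly the sequence 1 to 199, which is consistent with the reported minimum, maximum, distinct count and sum but is not shown in full here. The `quantile` helper is illustrative; it follows the linear-interpolation rule that pandas and NumPy use by default.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two closest ranks."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# Assumption: the serial number column is the sequence 1, 2, ..., 199.
serial = list(range(1, 200))

p5, q1, median, q3, p95 = (quantile(serial, q) for q in (0.05, 0.25, 0.50, 0.75, 0.95))

print("5th percentile:", round(p5, 1))
print("Q1:", q1, "Median:", median, "Q3:", q3)
print("95th percentile:", round(p95, 1))
print("Range:", serial[-1] - serial[0], "IQR:", q3 - q1)
```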

Descriptive statistics

Standard deviation: 57.590508
Coefficient of variation (CV): 0.57590508
Kurtosis: -1.2
Mean: 100
Median Absolute Deviation (MAD): 50
Skewness: 0
Sum: 19900
Variance: 3316.6667
Monotonicity: Strictly increasing
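
These descriptive statistics can be reproduced with the Python standard library, again assuming that 연번 is the sequence 1 to 199. The variance and standard deviation match the sample (n - 1) definitions. The skewness and kurtosis below use plain population moments, which is an assumption: the report does not say which estimator it uses, although for this column the result rounds to the same values.

```python
import statistics

# Assumption: the serial number column is the sequence 1, 2, ..., 199.
serial = [float(i) for i in range(1, 200)]
n = len(serial)

mean = statistics.fmean(serial)
variance = statistics.variance(serial)   # sample variance, divides by n - 1
std = statistics.stdev(serial)           # sample standard deviation
cv = std / mean                          # coefficient of variation
median = statistics.median(serial)
mad = statistics.median(abs(x - median) for x in serial)

# Population central moments, used here for skewness and excess kurtosis.
m2 = sum((x - mean) ** 2 for x in serial) / n
m3 = sum((x - mean) ** 3 for x in serial) / n
m4 = sum((x - mean) ** 4 for x in serial) / n
skewness = m3 / m2 ** 1.5
kurtosis = m4 / m2 ** 2 - 3              # excess kurtosis

print(f"sum={sum(serial):.0f} mean={mean:.0f} variance={variance:.4f} std={std:.6f}")
print(f"cv={cv:.8f} mad={mad:.0f} skewness={skewness:.1f} kurtosis={kurtosis:.1f}")
```

An excess kurtosis near -1.2 with zero skewness is what a uniform distribution gives, which fits a column that simply numbers the rows.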
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 0.5%
138 | 1 | 0.5%
128 | 1 | 0.5%
129 | 1 | 0.5%
130 | 1 | 0.5%
131 | 1 | 0.5%
132 | 1 | 0.5%
133 | 1 | 0.5%
134 | 1 | 0.5%
135 | 1 | 0.5%
Other values (189) | 189 | 95.0%

Value | Count | Frequency (%)
1 | 1 | 0.5%
2 | 1 | 0.5%
3 | 1 | 0.5%
4 | 1 | 0.5%
5 | 1 | 0.5%
6 | 1 | 0.5%
7 | 1 | 0.5%
8 | 1 | 0.5%
9 | 1 | 0.5%
10 | 1 | 0.5%

Value | Count | Frequency (%)
199 | 1 | 0.5%
198 | 1 | 0.5%
197 | 1 | 0.5%
196 | 1 | 0.5%
195 | 1 | 0.5%
194 | 1 | 0.5%
193 | 1 | 0.5%
192 | 1 | 0.5%
191 | 1 | 0.5%
190 | 1 | 0.5%

기업명
Text

UNIQUE 

Distinct: 199
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 22
Median length: 12
Mean length: 6.8894472
Min length: 2

Characters and Unicode

Total characters: 1371
Distinct characters: 263
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
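
As an illustration of these character properties, the general category of each character can be looked up with Python's standard `unicodedata` module. This is a minimal sketch rather than the profiling tool's own implementation: the `category_counts` helper and the `CATEGORY_NAMES` mapping are illustrative, and the standard library exposes only the general category, not the script or block.

```python
import unicodedata
from collections import Counter

# Readable labels for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(text):
    """Count the characters of `text` by Unicode general category."""
    codes = Counter(unicodedata.category(ch) for ch in text)
    return {CATEGORY_NAMES.get(code, code): count for code, count in codes.items()}


# Sample values taken from this report (a company name and a phone number).
print(category_counts("(주)엔티에스"))
print(category_counts("032-814-2340"))
```

Hangul syllables fall under Other Letter, which is why that category dominates the Korean text columns.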

Unique

Unique: 199
Unique (%): 100.0%

Sample

1st row: (주)엔티에스
2nd row: (주)예그리나
3rd row: (주)트리샤
4th row: (주)레인보우
5th row: 한국캐스팅(주)
Value | Count | Frequency (%)
주)엔티에스 | 1 | 0.5%
도담푸드 | 1 | 0.5%
태일엠투 | 1 | 0.5%
통인 | 1 | 0.5%
모토라인 | 1 | 0.5%
피더스 | 1 | 0.5%
신성테크 | 1 | 0.5%
신일코퍼레이션 | 1 | 0.5%
이피이 | 1 | 0.5%
진일인더스 | 1 | 0.5%
Other values (194) | 194 | 95.1%

Most occurring characters

ValueCountFrequency (%)
( 121
 
8.8%
) 121
 
8.8%
119
 
8.7%
62
 
4.5%
59
 
4.3%
24
 
1.8%
23
 
1.7%
22
 
1.6%
19
 
1.4%
17
 
1.2%
Other values (253) 784
57.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1112 | 81.1%
Open Punctuation | 121 | 8.8%
Close Punctuation | 121 | 8.8%
Lowercase Letter | 6 | 0.4%
Space Separator | 5 | 0.4%
Uppercase Letter | 5 | 0.4%
Other Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
119
 
10.7%
62
 
5.6%
59
 
5.3%
24
 
2.2%
23
 
2.1%
22
 
2.0%
19
 
1.7%
17
 
1.5%
17
 
1.5%
14
 
1.3%
Other values (238) 736
66.2%
Lowercase Letter
Value | Count | Frequency (%)
n | 1 | 16.7%
i | 1 | 16.7%
g | 1 | 16.7%
d | 1 | 16.7%
a | 1 | 16.7%
r | 1 | 16.7%
Uppercase Letter
Value | Count | Frequency (%)
T | 1 | 20.0%
M | 1 | 20.0%
S | 1 | 20.0%
U | 1 | 20.0%
C | 1 | 20.0%
Open Punctuation
Value | Count | Frequency (%)
( | 121 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 121 | 100.0%
Space Separator
Value | Count | Frequency (%)
(space) | 5 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1112 | 81.1%
Common | 248 | 18.1%
Latin | 11 | 0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
119
 
10.7%
62
 
5.6%
59
 
5.3%
24
 
2.2%
23
 
2.1%
22
 
2.0%
19
 
1.7%
17
 
1.5%
17
 
1.5%
14
 
1.3%
Other values (238) 736
66.2%
Latin
Value | Count | Frequency (%)
n | 1 | 9.1%
i | 1 | 9.1%
g | 1 | 9.1%
d | 1 | 9.1%
T | 1 | 9.1%
a | 1 | 9.1%
r | 1 | 9.1%
M | 1 | 9.1%
S | 1 | 9.1%
U | 1 | 9.1%
Common
Value | Count | Frequency (%)
( | 121 | 48.8%
) | 121 | 48.8%
(space) | 5 | 2.0%
. | 1 | 0.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1112 | 81.1%
ASCII | 259 | 18.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
( | 121 | 46.7%
) | 121 | 46.7%
(space) | 5 | 1.9%
n | 1 | 0.4%
i | 1 | 0.4%
g | 1 | 0.4%
d | 1 | 0.4%
T | 1 | 0.4%
a | 1 | 0.4%
r | 1 | 0.4%
Other values (5) | 5 | 1.9%
Hangul
ValueCountFrequency (%)
119
 
10.7%
62
 
5.6%
59
 
5.3%
24
 
2.2%
23
 
2.1%
22
 
2.0%
19
 
1.7%
17
 
1.5%
17
 
1.5%
14
 
1.3%
Other values (238) 736
66.2%

대표자
Text

MISSING 

Distinct: 172
Distinct (%): 99.4%
Missing: 26
Missing (%): 13.1%
Memory size: 1.7 KiB

Length

Max length: 17
Median length: 3
Mean length: 3.3641618
Min length: 2

Characters and Unicode

Total characters: 582
Distinct characters: 161
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 171
Unique (%): 98.8%

Sample

1st row: 선충석
2nd row: 한성수
3rd row: 황인자
4th row: 하윤석
5th row: 김종국
Value | Count | Frequency (%)
이현진 | 2 | 1.1%
황적인 | 1 | 0.6%
선충석 | 1 | 0.6%
신일수 | 1 | 0.6%
이상헌 | 1 | 0.6%
니나알렉산드라닉스 | 1 | 0.6%
최경애 | 1 | 0.6%
황수미 | 1 | 0.6%
엄동진 | 1 | 0.6%
김일자 | 1 | 0.6%
Other values (165) | 165 | 93.8%

Most occurring characters

ValueCountFrequency (%)
31
 
5.3%
24
 
4.1%
18
 
3.1%
17
 
2.9%
16
 
2.7%
14
 
2.4%
13
 
2.2%
11
 
1.9%
11
 
1.9%
10
 
1.7%
Other values (151) 417
71.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 529 | 90.9%
Uppercase Letter | 45 | 7.7%
Other Punctuation | 5 | 0.9%
Space Separator | 3 | 0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
31
 
5.9%
24
 
4.5%
18
 
3.4%
17
 
3.2%
16
 
3.0%
14
 
2.6%
13
 
2.5%
11
 
2.1%
11
 
2.1%
10
 
1.9%
Other values (129) 364
68.8%
Uppercase Letter
Value | Count | Frequency (%)
A | 7 | 15.6%
I | 4 | 8.9%
S | 4 | 8.9%
D | 4 | 8.9%
N | 3 | 6.7%
U | 3 | 6.7%
K | 2 | 4.4%
O | 2 | 4.4%
R | 2 | 4.4%
H | 2 | 4.4%
Other values (9) | 12 | 26.7%
Other Punctuation
Value | Count | Frequency (%)
/ | 4 | 80.0%
. | 1 | 20.0%
Space Separator
Value | Count | Frequency (%)
(space) | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 529 | 90.9%
Latin | 45 | 7.7%
Common | 8 | 1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
31
 
5.9%
24
 
4.5%
18
 
3.4%
17
 
3.2%
16
 
3.0%
14
 
2.6%
13
 
2.5%
11
 
2.1%
11
 
2.1%
10
 
1.9%
Other values (129) 364
68.8%
Latin
Value | Count | Frequency (%)
A | 7 | 15.6%
I | 4 | 8.9%
S | 4 | 8.9%
D | 4 | 8.9%
N | 3 | 6.7%
U | 3 | 6.7%
K | 2 | 4.4%
O | 2 | 4.4%
R | 2 | 4.4%
H | 2 | 4.4%
Other values (9) | 12 | 26.7%
Common
Value | Count | Frequency (%)
/ | 4 | 50.0%
(space) | 3 | 37.5%
. | 1 | 12.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 529 | 90.9%
ASCII | 53 | 9.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
31
 
5.9%
24
 
4.5%
18
 
3.4%
17
 
3.2%
16
 
3.0%
14
 
2.6%
13
 
2.5%
11
 
2.1%
11
 
2.1%
10
 
1.9%
Other values (129) 364
68.8%
ASCII
Value | Count | Frequency (%)
A | 7 | 13.2%
I | 4 | 7.5%
/ | 4 | 7.5%
S | 4 | 7.5%
D | 4 | 7.5%
(space) | 3 | 5.7%
N | 3 | 5.7%
U | 3 | 5.7%
K | 2 | 3.8%
O | 2 | 3.8%
Other values (12) | 17 | 32.1%

전화번호
Text

MISSING 

Distinct: 107
Distinct (%): 100.0%
Missing: 92
Missing (%): 46.2%
Memory size: 1.7 KiB

Length

Max length: 13
Median length: 12
Mean length: 12.084112
Min length: 11

Characters and Unicode

Total characters: 1293
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 107
Unique (%): 100.0%

Sample

1st row: 032-814-2340
2nd row: 070-5015-2390
3rd row: 032-821-2212
4th row: 032-813-3364
5th row: 032-818-3270
Value | Count | Frequency (%)
032-818-3907 | 1 | 0.9%
032-822-7404 | 1 | 0.9%
070-4322-2015 | 1 | 0.9%
032-812-1242 | 1 | 0.9%
032-822-7998 | 1 | 0.9%
070-4066-3360 | 1 | 0.9%
032-442-5834 | 1 | 0.9%
032-811-8715 | 1 | 0.9%
032-811-7491 | 1 | 0.9%
032-472-8031 | 1 | 0.9%
Other values (97) | 97 | 90.7%

Most occurring characters

Value | Count | Frequency (%)
- | 214 | 16.6%
2 | 195 | 15.1%
0 | 189 | 14.6%
3 | 160 | 12.4%
1 | 112 | 8.7%
8 | 105 | 8.1%
4 | 82 | 6.3%
7 | 78 | 6.0%
6 | 56 | 4.3%
5 | 56 | 4.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1079 | 83.4%
Dash Punctuation | 214 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 195 | 18.1%
0 | 189 | 17.5%
3 | 160 | 14.8%
1 | 112 | 10.4%
8 | 105 | 9.7%
4 | 82 | 7.6%
7 | 78 | 7.2%
6 | 56 | 5.2%
5 | 56 | 5.2%
9 | 46 | 4.3%
Dash Punctuation
Value | Count | Frequency (%)
- | 214 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1293 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 214 | 16.6%
2 | 195 | 15.1%
0 | 189 | 14.6%
3 | 160 | 12.4%
1 | 112 | 8.7%
8 | 105 | 8.1%
4 | 82 | 6.3%
7 | 78 | 6.0%
6 | 56 | 4.3%
5 | 56 | 4.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1293 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 214 | 16.6%
2 | 195 | 15.1%
0 | 189 | 14.6%
3 | 160 | 12.4%
1 | 112 | 8.7%
8 | 105 | 8.1%
4 | 82 | 6.3%
7 | 78 | 6.0%
6 | 56 | 4.3%
5 | 56 | 4.3%

종류
Text

Distinct: 176
Distinct (%): 88.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.7 KiB

Length

Max length: 84
Median length: 55
Mean length: 26.693467
Min length: 2

Characters and Unicode

Total characters: 5312
Distinct characters: 306
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 170
Unique (%): 85.4%

Sample

1st row: 반도체제조용정밀그라인더(Grinder)장비제조, 초정밀반도체장비, 기기용자동측정및제어장치제도, 휴대폰액정커버글라스
2nd row: 화장품, 의약부외품, 마스크제조, 도소매, 무역(화장품, 의약부외품)
3rd row: 메니큐어, 화장품제조, 메니큐어, 화장품, 마스크, 메니큐어, 화장품미용재료
4th row: 화장품(마스크팩)제조, 화장품, 의약외품, 펄프, 종이제품, 체외진단의료기기
5th row: 자동차부품, 전자부품, 알루미늄주물(전자부품, 자동차변속기케이스, 재봉기부품, 건축자재)주조, 금형제조
Value | Count | Frequency (%)
무역업 | 53 | 6.6%
무역 | 25 | 3.1%
기타무역업 | 24 | 3.0%
도매및상품중개업 | 18 | 2.3%
도매업 | 16 | 2.0%
도소매업 | 15 | 1.9%
도소매 | 15 | 1.9%
전자상거래업 | 13 | 1.6%
화장품 | 11 | 1.4%
전자상거래 | 11 | 1.4%
Other values (479) | 596 | 74.8%

Most occurring characters

ValueCountFrequency (%)
, 598
 
11.3%
598
 
11.3%
301
 
5.7%
186
 
3.5%
170
 
3.2%
160
 
3.0%
155
 
2.9%
141
 
2.7%
136
 
2.6%
121
 
2.3%
Other values (296) 2746
51.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 4012 | 75.5%
Other Punctuation | 600 | 11.3%
Space Separator | 598 | 11.3%
Close Punctuation | 40 | 0.8%
Open Punctuation | 40 | 0.8%
Uppercase Letter | 14 | 0.3%
Lowercase Letter | 6 | 0.1%
Decimal Number | 2 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
301
 
7.5%
186
 
4.6%
170
 
4.2%
160
 
4.0%
155
 
3.9%
141
 
3.5%
136
 
3.4%
121
 
3.0%
103
 
2.6%
98
 
2.4%
Other values (273) 2441
60.8%
Uppercase Letter
Value | Count | Frequency (%)
A | 2 | 14.3%
P | 2 | 14.3%
E | 2 | 14.3%
G | 1 | 7.1%
H | 1 | 7.1%
V | 1 | 7.1%
S | 1 | 7.1%
C | 1 | 7.1%
F | 1 | 7.1%
L | 1 | 7.1%
Lowercase Letter
Value | Count | Frequency (%)
r | 2 | 33.3%
n | 1 | 16.7%
i | 1 | 16.7%
d | 1 | 16.7%
e | 1 | 16.7%
Other Punctuation
Value | Count | Frequency (%)
, | 598 | 99.7%
. | 2 | 0.3%
Decimal Number
Value | Count | Frequency (%)
2 | 1 | 50.0%
3 | 1 | 50.0%
Space Separator
Value | Count | Frequency (%)
(space) | 598 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 40 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 40 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 4012 | 75.5%
Common | 1280 | 24.1%
Latin | 20 | 0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
301
 
7.5%
186
 
4.6%
170
 
4.2%
160
 
4.0%
155
 
3.9%
141
 
3.5%
136
 
3.4%
121
 
3.0%
103
 
2.6%
98
 
2.4%
Other values (273) 2441
60.8%
Latin
Value | Count | Frequency (%)
A | 2 | 10.0%
r | 2 | 10.0%
P | 2 | 10.0%
E | 2 | 10.0%
n | 1 | 5.0%
G | 1 | 5.0%
i | 1 | 5.0%
d | 1 | 5.0%
e | 1 | 5.0%
H | 1 | 5.0%
Other values (6) | 6 | 30.0%
Common
Value | Count | Frequency (%)
, | 598 | 46.7%
(space) | 598 | 46.7%
) | 40 | 3.1%
( | 40 | 3.1%
. | 2 | 0.2%
2 | 1 | 0.1%
3 | 1 | 0.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 4012 | 75.5%
ASCII | 1300 | 24.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
, | 598 | 46.0%
(space) | 598 | 46.0%
) | 40 | 3.1%
( | 40 | 3.1%
A | 2 | 0.2%
r | 2 | 0.2%
P | 2 | 0.2%
E | 2 | 0.2%
. | 2 | 0.2%
2 | 1 | 0.1%
Other values (13) | 13 | 1.0%
Hangul
ValueCountFrequency (%)
301
 
7.5%
186
 
4.6%
170
 
4.2%
160
 
4.0%
155
 
3.9%
141
 
3.5%
136
 
3.4%
121
 
3.0%
103
 
2.6%
98
 
2.4%
Other values (273) 2441
60.8%

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
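
A nullity correlation of this kind is commonly computed as the Pearson correlation between per-column missing-value indicators. The sketch below applies that idea to only the ten rows with 연번 190 to 199 shown in the Sample section, so its result does not represent the full dataset or the value plotted in the heatmap. The `pearson` helper and the variable names are illustrative.

```python
from math import sqrt

# Missing-value flags (1 = missing) for the rows with 연번 190..199 in the sample.
representative_missing = [0, 0, 0, 1, 1, 0, 0, 0, 0, 0]  # 대표자
phone_missing = [1, 1, 0, 0, 1, 0, 1, 1, 1, 1]           # 전화번호


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


nullity_corr = pearson(representative_missing, phone_missing)
print(f"Nullity correlation on the ten sample rows: {nullity_corr:.3f}")
```

A value near 1 would mean the two columns tend to be missing together, a value near -1 that one tends to be present when the other is missing, and a value near 0 that the gaps are unrelated.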

Sample

(index) | 연번 | 기업명 | 대표자 | 전화번호 | 종류
0 | 1 | (주)엔티에스 | 선충석 | 032-814-2340 | 반도체제조용정밀그라인더(Grinder)장비제조, 초정밀반도체장비, 기기용자동측정및제어장치제도, 휴대폰액정커버글라스
1 | 2 | (주)예그리나 | 한성수 | 070-5015-2390 | 화장품, 의약부외품, 마스크제조, 도소매, 무역(화장품, 의약부외품)
2 | 3 | (주)트리샤 | 황인자 | 032-821-2212 | 메니큐어, 화장품제조, 메니큐어, 화장품, 마스크, 메니큐어, 화장품미용재료
3 | 4 | (주)레인보우 | 하윤석 | 032-813-3364 | 화장품(마스크팩)제조, 화장품, 의약외품, 펄프, 종이제품, 체외진단의료기기
4 | 5 | 한국캐스팅(주) | 김종국 | 032-818-3270 | 자동차부품, 전자부품, 알루미늄주물(전자부품, 자동차변속기케이스, 재봉기부품, 건축자재)주조, 금형제조
5 | 6 | 한국가와사키로보틱스(주) | 하다신이치 | 032-821-6941 | 산업용기계기구(산업용로보트기계)도소매, 무역, 기술용역
6 | 7 | (주)시스템산업 | 임웅택 | 032-811-8091 | 산업기계(크레인)부품제조, 기계설비공사
7 | 8 | 디에스솔텍(주) | 이대성 | 032-817-1693 | 초고압접속자재제조, 초고압접속자재, 도금
8 | 9 | 오성화학공업(주) | 권영후 | 032-547-3321 | 계면활성제, 합성유기염료, 특수윤활유, 금속가공유제제조, 도매, 무역, 부동산
9 | 10 | (주)아주화장품 | 황인석 | 032-433-1260 | 화장품, 화장품원료제조, 도소매, 무역, 전자상거래

(index) | 연번 | 기업명 | 대표자 | 전화번호 | 종류
189 | 190 | 싸이언스앤피플 | 송용석 | <NA> | 무역업, 수출대행, 소프트웨어개발및공급업, 수출용역컨설팅수출대행
190 | 191 | 재운상사 | 이인준 | <NA> | 블라인드, 마대, 도배, 실내장식및내장목공사업, 화장품도매업, 무역업, 의료기기, 의료용품, 전자상거래업
191 | 192 | 한별산업 | 김봉래 | 032-446-0015 | 재생재료, 금속분말, 무역업(수출입업, 수출입주선), 기타공학연구개발업
192 | 193 | 해성무역 | <NA> | 070-4218-6635 | 도소매
193 | 194 | 러블리캔디 | <NA> | <NA> | 기타무역업, 도소매
194 | 195 | 실버렉스 | 최영철 | 032-821-6607 | 도금, 은폼, 엔지니어링, 무역, 통신판매
195 | 196 | 닥터카 | 남병현 | <NA> | 전자상거래, 도소매업
196 | 197 | 에이치제이스포츠 | 심현정 | <NA> | 도매업
197 | 198 | 메타씽킹 | 박종석 | <NA> | 무역업
198 | 199 | 오션드롭 | 이창호 | <NA> | 도매업