Overview

Dataset statistics

Number of variables6
Number of observations1219
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory58.5 KiB
Average record size in memory49.1 B

Variable types

Numeric1
Categorical4
Text1

Dataset

Description물량집적을 통한 물류애로 해소 및 마케팅을 지원하는 온라인수출 공동물류 사업의 지원기업 별 지역, 업종, 업력, 품목에 관한 데이터.본 자료로 온라인수출 지원사업 참여를 통한 수출 활성화에 기여하기를 소망함.
Author중소벤처기업진흥공단
URLhttps://www.data.go.kr/data/15091631/fileData.do

Alerts

업종2 is highly overall correlated with 업종1High correlation
업종1 is highly overall correlated with 업종2High correlation
연번 is highly overall correlated with 지역High correlation
지역 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 18:58:04.714002
Analysis finished2023-12-12 18:58:05.543642
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1219
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean610
Minimum1
Maximum1219
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size10.8 KiB
2023-12-13T03:58:05.645831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile61.9
Q1305.5
median610
Q3914.5
95-th percentile1158.1
Maximum1219
Range1218
Interquartile range (IQR)609

Descriptive statistics

Standard deviation352.0393
Coefficient of variation (CV)0.5771136
Kurtosis-1.2
Mean610
Median Absolute Deviation (MAD)305
Skewness0
Sum743590
Variance123931.67
MonotonicityStrictly increasing
2023-12-13T03:58:05.841930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
812 1
 
0.1%
819 1
 
0.1%
818 1
 
0.1%
817 1
 
0.1%
816 1
 
0.1%
815 1
 
0.1%
814 1
 
0.1%
813 1
 
0.1%
811 1
 
0.1%
Other values (1209) 1209
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1219 1
0.1%
1218 1
0.1%
1217 1
0.1%
1216 1
0.1%
1215 1
0.1%
1214 1
0.1%
1213 1
0.1%
1212 1
0.1%
1211 1
0.1%
1210 1
0.1%

지역
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
서울특별시
510 
경기도
411 
인천광역시
66 
부산광역시
65 
대구광역시
 
36
Other values (12)
131 

Length

Max length7
Median length5
Mean length4.2543068
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원도
2nd row강원도
3rd row강원도
4th row강원도
5th row강원도

Common Values

ValueCountFrequency (%)
서울특별시 510
41.8%
경기도 411
33.7%
인천광역시 66
 
5.4%
부산광역시 65
 
5.3%
대구광역시 36
 
3.0%
경상남도 22
 
1.8%
대전광역시 20
 
1.6%
충청북도 17
 
1.4%
전라북도 15
 
1.2%
충청남도 14
 
1.1%
Other values (7) 43
 
3.5%

Length

2023-12-13T03:58:06.039787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 510
41.8%
경기도 411
33.7%
인천광역시 66
 
5.4%
부산광역시 65
 
5.3%
대구광역시 36
 
3.0%
경상남도 22
 
1.8%
대전광역시 20
 
1.6%
충청북도 17
 
1.4%
전라북도 15
 
1.2%
충청남도 14
 
1.1%
Other values (7) 43
 
3.5%

업종1
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
서비스업
667 
제조업
552 

Length

Max length4
Median length4
Mean length3.5471698
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서비스업
2nd row제조업
3rd row제조업
4th row서비스업
5th row제조업

Common Values

ValueCountFrequency (%)
서비스업 667
54.7%
제조업 552
45.3%

Length

2023-12-13T03:58:06.198057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:58:06.341247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서비스업 667
54.7%
제조업 552
45.3%

업종2
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
유통
608 
기타
348 
잡화
110 
섬유
 
49
전자
 
29
Other values (5)
75 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유통
2nd row기계
3rd row화공
4th row유통
5th row식료

Common Values

ValueCountFrequency (%)
유통 608
49.9%
기타 348
28.5%
잡화 110
 
9.0%
섬유 49
 
4.0%
전자 29
 
2.4%
화공 26
 
2.1%
식료 24
 
2.0%
기계 11
 
0.9%
전기 8
 
0.7%
금속 6
 
0.5%

Length

2023-12-13T03:58:06.466565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:58:06.627252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유통 608
49.9%
기타 348
28.5%
잡화 110
 
9.0%
섬유 49
 
4.0%
전자 29
 
2.4%
화공 26
 
2.1%
식료 24
 
2.0%
기계 11
 
0.9%
전기 8
 
0.7%
금속 6
 
0.5%

업력
Categorical

Distinct5
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
5년미만
322 
10년이상
268 
7년미만
245 
10년미만
232 
3년미만
152 

Length

Max length5
Median length4
Mean length4.4101723
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3년미만
2nd row10년이상
3rd row10년미만
4th row3년미만
5th row10년이상

Common Values

ValueCountFrequency (%)
5년미만 322
26.4%
10년이상 268
22.0%
7년미만 245
20.1%
10년미만 232
19.0%
3년미만 152
12.5%

Length

2023-12-13T03:58:06.967207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:58:07.129555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5년미만 322
26.4%
10년이상 268
22.0%
7년미만 245
20.1%
10년미만 232
19.0%
3년미만 152
12.5%
Distinct285
Distinct (%)23.4%
Missing0
Missing (%)0.0%
Memory size9.7 KiB
2023-12-13T03:58:07.577419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length18
Mean length5.6431501
Min length1

Characters and Unicode

Total characters6879
Distinct characters299
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique158 ?
Unique (%)13.0%

Sample

1st row범용 부품
2nd row자동차 부품
3rd row스킨 케어
4th row스킨 케어
5th row스낵 식품
ValueCountFrequency (%)
화장품 229
 
9.4%
기타 228
 
9.4%
171
 
7.0%
케어 127
 
5.2%
스킨 119
 
4.9%
용품 68
 
2.8%
제품 66
 
2.7%
액세서리 64
 
2.6%
식품 34
 
1.4%
스포츠 29
 
1.2%
Other values (345) 1292
53.2%
2023-12-13T03:58:08.717808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1208
 
17.6%
429
 
6.2%
288
 
4.2%
270
 
3.9%
259
 
3.8%
258
 
3.8%
231
 
3.4%
172
 
2.5%
160
 
2.3%
156
 
2.3%
Other values (289) 3448
50.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5596
81.3%
Space Separator 1208
 
17.6%
Other Punctuation 28
 
0.4%
Uppercase Letter 25
 
0.4%
Decimal Number 15
 
0.2%
Dash Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
429
 
7.7%
288
 
5.1%
270
 
4.8%
259
 
4.6%
258
 
4.6%
231
 
4.1%
172
 
3.1%
160
 
2.9%
156
 
2.8%
133
 
2.4%
Other values (273) 3240
57.9%
Uppercase Letter
ValueCountFrequency (%)
T 8
32.0%
D 4
16.0%
L 3
 
12.0%
E 3
 
12.0%
C 3
 
12.0%
V 1
 
4.0%
U 1
 
4.0%
S 1
 
4.0%
B 1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
& 14
50.0%
, 9
32.1%
% 5
 
17.9%
Decimal Number
ValueCountFrequency (%)
0 10
66.7%
1 5
33.3%
Space Separator
ValueCountFrequency (%)
1208
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5596
81.3%
Common 1258
 
18.3%
Latin 25
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
429
 
7.7%
288
 
5.1%
270
 
4.8%
259
 
4.6%
258
 
4.6%
231
 
4.1%
172
 
3.1%
160
 
2.9%
156
 
2.8%
133
 
2.4%
Other values (273) 3240
57.9%
Latin
ValueCountFrequency (%)
T 8
32.0%
D 4
16.0%
L 3
 
12.0%
E 3
 
12.0%
C 3
 
12.0%
V 1
 
4.0%
U 1
 
4.0%
S 1
 
4.0%
B 1
 
4.0%
Common
ValueCountFrequency (%)
1208
96.0%
& 14
 
1.1%
0 10
 
0.8%
, 9
 
0.7%
- 7
 
0.6%
% 5
 
0.4%
1 5
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5596
81.3%
ASCII 1283
 
18.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1208
94.2%
& 14
 
1.1%
0 10
 
0.8%
, 9
 
0.7%
T 8
 
0.6%
- 7
 
0.5%
% 5
 
0.4%
1 5
 
0.4%
D 4
 
0.3%
L 3
 
0.2%
Other values (6) 10
 
0.8%
Hangul
ValueCountFrequency (%)
429
 
7.7%
288
 
5.1%
270
 
4.8%
259
 
4.6%
258
 
4.6%
231
 
4.1%
172
 
3.1%
160
 
2.9%
156
 
2.8%
133
 
2.4%
Other values (273) 3240
57.9%

Interactions

2023-12-13T03:58:05.164121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:58:08.871293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역업종1업종2업력
연번1.0000.8730.1080.1760.040
지역0.8731.0000.0980.3090.000
업종10.1080.0981.0000.9910.216
업종20.1760.3090.9911.0000.301
업력0.0400.0000.2160.3011.000
2023-12-13T03:58:09.055398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업력지역업종2업종1
업력1.0000.0000.1290.263
지역0.0001.0000.1250.087
업종20.1290.1251.0000.912
업종10.2630.0870.9121.000
2023-12-13T03:58:09.199420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역업종1업종2업력
연번1.0000.5860.0830.0550.017
지역0.5861.0000.0870.1250.000
업종10.0830.0871.0000.9120.263
업종20.0550.1250.9121.0000.129
업력0.0170.0000.2630.1291.000

Missing values

2023-12-13T03:58:05.330106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:58:05.489281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번지역업종1업종2업력수출품목명
01강원도서비스업유통3년미만범용 부품
12강원도제조업기계10년이상자동차 부품
23강원도제조업화공10년미만스킨 케어
34강원도서비스업유통3년미만스킨 케어
45강원도제조업식료10년이상스낵 식품
56강원도제조업기계5년미만야채 제품
67강원도제조업기타7년미만기타 스포츠 및 엔터테인먼트 제품
78강원도제조업기타7년미만악기세트
89강원도서비스업유통7년미만여성복
910강원도서비스업유통5년미만주방 나이프 및 액세서리
연번지역업종1업종2업력수출품목명
12091210충청북도제조업기타7년미만여성복
12101211충청북도제조업기타7년미만스킨 케어
12111212충청북도제조업식료10년이상곡물 제품
12121213충청북도서비스업유통5년미만스킨 케어
12131214충청북도제조업기계10년미만기타 기계 및 산업 장비
12141215충청북도제조업기계10년이상멘즈 슈즈
12151216충청북도제조업기타10년이상기타
12161217충청북도서비스업유통5년미만기타 완구 및 취미
12171218충청북도서비스업유통10년미만기타 주방 용품
12181219충청북도제조업기타5년미만방역 마스크