Overview

Dataset statistics

Number of variables4
Number of observations114
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.8 KiB
Average record size in memory34.2 B

Variable types

Categorical3
Numeric1

Dataset

Description수출화물 목재포장재 열처리 지역별, 품목별 처리수량
Author농림축산검역본부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220214000000001866

Alerts

품목명 is highly overall correlated with 단위High correlation
단위 is highly overall correlated with 품목명High correlation

Reproduction

Analysis started2023-12-11 03:17:50.867020
Analysis finished2023-12-11 03:17:51.659797
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시/도
Categorical

Distinct15
Distinct (%)13.2%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
경기도
11 
충청북도
11 
부산광역시
인천광역시
경상북도
Other values (10)
65 

Length

Max length7
Median length5
Mean length4.4473684
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
경기도 11
9.6%
충청북도 11
9.6%
부산광역시 9
 
7.9%
인천광역시 9
 
7.9%
경상북도 9
 
7.9%
경상남도 9
 
7.9%
울산광역시 8
 
7.0%
충청남도 8
 
7.0%
전라북도 8
 
7.0%
전라남도 8
 
7.0%
Other values (5) 24
21.1%

Length

2023-12-11T12:17:51.742366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 11
9.6%
충청북도 11
9.6%
부산광역시 9
 
7.9%
인천광역시 9
 
7.9%
경상북도 9
 
7.9%
경상남도 9
 
7.9%
울산광역시 8
 
7.0%
충청남도 8
 
7.0%
전라북도 8
 
7.0%
전라남도 8
 
7.0%
Other values (5) 24
21.1%

품목명
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)9.6%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
파렛트
15 
나무상자
14 
각재
14 
판재
13 
지지목(버팀목)
12 
Other values (6)
46 

Length

Max length8
Median length4
Mean length3.6578947
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row파렛트
2nd row나무상자
3rd row파렛트
4th row받침목(짐깔개)
5th row드럼

Common Values

ValueCountFrequency (%)
파렛트 15
13.2%
나무상자 14
12.3%
각재 14
12.3%
판재 13
11.4%
지지목(버팀목) 12
10.5%
스키드 11
9.6%
받침목(짐깔개) 10
8.8%
드럼 8
7.0%
단판 8
7.0%
기타 6
 
5.3%

Length

2023-12-11T12:17:51.895970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
파렛트 15
13.2%
나무상자 14
12.3%
각재 14
12.3%
판재 13
11.4%
지지목(버팀목 12
10.5%
스키드 11
9.6%
받침목(짐깔개 10
8.8%
드럼 8
7.0%
단판 8
7.0%
기타 6
 
5.3%

수량
Real number (ℝ)

Distinct110
Distinct (%)96.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6154.7638
Minimum0.55
Maximum69654
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-11T12:17:52.066059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.55
5-th percentile19.65
Q1169.063
median800
Q33122.0747
95-th percentile34045.95
Maximum69654
Range69653.45
Interquartile range (IQR)2953.0117

Descriptive statistics

Standard deviation13111.169
Coefficient of variation (CV)2.1302474
Kurtosis8.0355628
Mean6154.7638
Median Absolute Deviation (MAD)748.4
Skewness2.829157
Sum701643.07
Variance1.7190276 × 108
MonotonicityNot monotonic
2023-12-11T12:17:52.216439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
150.0 3
 
2.6%
800.0 2
 
1.8%
20.0 2
 
1.8%
16780.0 1
 
0.9%
245.3632 1
 
0.9%
3324.0 1
 
0.9%
220.0 1
 
0.9%
2055.0 1
 
0.9%
37905.0 1
 
0.9%
36.0 1
 
0.9%
Other values (100) 100
87.7%
ValueCountFrequency (%)
0.55 1
0.9%
2.0 1
0.9%
5.0 1
0.9%
13.5756 1
0.9%
16.0 1
0.9%
19.0 1
0.9%
20.0 2
1.8%
20.921 1
0.9%
21.0 1
0.9%
24.45 1
0.9%
ValueCountFrequency (%)
69654.0 1
0.9%
57965.0 1
0.9%
50834.0 1
0.9%
49379.0 1
0.9%
38265.0 1
0.9%
37905.0 1
0.9%
31968.0 1
0.9%
30908.0 1
0.9%
30611.0 1
0.9%
28932.0 1
0.9%

단위
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size1.0 KiB
79 
35 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
79
69.3%
35
30.7%

Length

2023-12-11T12:17:52.400496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:17:52.499455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
79
69.3%
35
30.7%

Interactions

2023-12-11T12:17:51.354119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T12:17:52.556749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시/도품목명수량단위
시/도1.0000.0000.0000.000
품목명0.0001.0000.3691.000
수량0.0000.3691.0000.161
단위0.0001.0000.1611.000
2023-12-11T12:17:52.648579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
품목명시/도단위
품목명1.0000.0000.959
시/도0.0001.0000.000
단위0.9590.0001.000
2023-12-11T12:17:52.744123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수량시/도품목명단위
수량1.0000.0000.1720.154
시/도0.0001.0000.0000.000
품목명0.1720.0001.0000.959
단위0.1540.0000.9591.000

Missing values

2023-12-11T12:17:51.502662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:17:51.625030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시/도품목명수량단위
0서울특별시파렛트2540.0
1서울특별시나무상자21.0
2부산광역시파렛트15834.0
3부산광역시받침목(짐깔개)30908.0
4부산광역시드럼775.0
5부산광역시나무상자942.0
6부산광역시각재1377.78244
7부산광역시판재224.82
8부산광역시지지목(버팀목)31968.0
9부산광역시스키드3182.0
시/도품목명수량단위
104경상북도기타958.0
105경상남도단판176.824
106경상남도파렛트20217.0
107경상남도받침목(짐깔개)2500.0
108경상남도드럼150.0
109경상남도나무상자1631.0
110경상남도각재5044.637
111경상남도판재775.253
112경상남도스키드1054.0
113경상남도기타25.0