Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 170 |
Missing cells | 232 |
Missing cells (%) | 22.7% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 8.9 KiB |
Average record size in memory | 53.8 B |
Variable types
Text | 1 |
---|---|
Numeric | 5 |
Dataset
Description | 국내 석유제품의 제주 지역 소비량에 관한 자료로 산업별(농림수산업,광업,식품.담배업,섬유제품업,목재업,제지.인쇄업,화학제품업,요업,철강업,비철금속산업,기계조립업,수송장비업,기타제조업,건설업,기타에너지,발전,석유정제,개스제조,철도,도로,해운,항공,상업,가정,공공,기타), 제품별로 작성 단위 : 물량(KL) |
---|---|
URL | https://www.data.go.kr/data/15121148/fileData.do |
2018 is highly overall correlated with 2019 and 3 other fields | High correlation |
2019 is highly overall correlated with 2018 and 3 other fields | High correlation |
2020 is highly overall correlated with 2018 and 3 other fields | High correlation |
2021 is highly overall correlated with 2018 and 3 other fields | High correlation |
2022 is highly overall correlated with 2018 and 3 other fields | High correlation |
2018 has 36 (21.2%) missing values | Missing |
2019 has 38 (22.4%) missing values | Missing |
2020 has 50 (29.4%) missing values | Missing |
2021 has 54 (31.8%) missing values | Missing |
2022 has 54 (31.8%) missing values | Missing |
시군구_산업_제품 has unique values | Unique |
2022 has 2 (1.2%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-12 03:14:11.247281 |
---|---|
Analysis finished | 2023-12-12 03:14:14.825006 |
Duration | 3.58 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
시군구_산업_제품
Text
UNIQUE
 
Distinct | 170 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.5 KiB |
Length
Max length | 23 |
---|---|
Median length | 19 |
Mean length | 17.994118 |
Min length | 11 |
Characters and Unicode
Total characters | 3059 |
---|---|
Distinct characters | 94 |
Distinct categories | 8 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 170 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 제주제주시_농림수산업_무연보통휘발유 |
---|---|
2nd row | 제주제주시_농림수산업_실내등유 |
3rd row | 제주제주시_농림수산업_경유(0.05%) |
4th row | 제주제주시_농림수산업_경유(0.001%) |
5th row | 제주제주시_농림수산업_경질중유(2.0%) |
Value | Count | Frequency (%) |
제주제주시_농림수산업_무연보통휘발유 | 1 | 0.6% |
제주서귀포시_요업_경유(0.001 | 1 | 0.6% |
제주서귀포시_식품.담배업_중유(0.3 | 1 | 0.6% |
제주서귀포시_건설업_실내등유 | 1 | 0.6% |
제주서귀포시_식품.담배업_프로판 | 1 | 0.6% |
제주서귀포시_제지.인쇄업_중유(0.3 | 1 | 0.6% |
제주서귀포시_화학제품업_경유(0.001 | 1 | 0.6% |
제주서귀포시_화학제품업_중유(0.3 | 1 | 0.6% |
제주서귀포시_화학제품업_부생연료유(중유형 | 1 | 0.6% |
제주서귀포시_요업_경유(0.05 | 1 | 0.6% |
Other values (160) | 160 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 340 | 11.1% |
제 | 295 | 9.6% |
주 | 268 | 8.8% |
시 | 170 | 5.6% |
유 | 162 | 5.3% |
0 | 150 | 4.9% |
) | 106 | 3.5% |
( | 106 | 3.5% |
업 | 99 | 3.2% |
. | 98 | 3.2% |
Other values (84) | 1265 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2063 | |
Connector Punctuation | 340 | 11.1% |
Decimal Number | 240 | 7.8% |
Other Punctuation | 185 | 6.0% |
Close Punctuation | 106 | 3.5% |
Open Punctuation | 106 | 3.5% |
Uppercase Letter | 16 | 0.5% |
Dash Punctuation | 3 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
제 | 295 | 14.3% |
주 | 268 | 13.0% |
시 | 170 | 8.2% |
유 | 162 | 7.9% |
업 | 99 | 4.8% |
서 | 72 | 3.5% |
귀 | 72 | 3.5% |
포 | 72 | 3.5% |
중 | 54 | 2.6% |
경 | 51 | 2.5% |
Other values (67) | 748 |
Decimal Number
Value | Count | Frequency (%) |
0 | 150 | |
3 | 34 | 14.2% |
1 | 30 | 12.5% |
5 | 18 | 7.5% |
2 | 6 | 2.5% |
4 | 2 | 0.8% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 4 | |
J | 3 | |
E | 3 | |
T | 3 | |
A | 3 |
Other Punctuation
Value | Count | Frequency (%) |
. | 98 | |
% | 87 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 340 |
Close Punctuation
Value | Count | Frequency (%) |
) | 106 |
Open Punctuation
Value | Count | Frequency (%) |
( | 106 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 2063 | |
Common | 980 | |
Latin | 16 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
제 | 295 | 14.3% |
주 | 268 | 13.0% |
시 | 170 | 8.2% |
유 | 162 | 7.9% |
업 | 99 | 4.8% |
서 | 72 | 3.5% |
귀 | 72 | 3.5% |
포 | 72 | 3.5% |
중 | 54 | 2.6% |
경 | 51 | 2.5% |
Other values (67) | 748 |
Common
Value | Count | Frequency (%) |
_ | 340 | |
0 | 150 | |
) | 106 | 10.8% |
( | 106 | 10.8% |
. | 98 | 10.0% |
% | 87 | 8.9% |
3 | 34 | 3.5% |
1 | 30 | 3.1% |
5 | 18 | 1.8% |
2 | 6 | 0.6% |
Other values (2) | 5 | 0.5% |
Latin
Value | Count | Frequency (%) |
C | 4 | |
J | 3 | |
E | 3 | |
T | 3 | |
A | 3 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 2063 | |
ASCII | 996 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 340 | |
0 | 150 | |
) | 106 | 10.6% |
( | 106 | 10.6% |
. | 98 | 9.8% |
% | 87 | 8.7% |
3 | 34 | 3.4% |
1 | 30 | 3.0% |
5 | 18 | 1.8% |
2 | 6 | 0.6% |
Other values (7) | 21 | 2.1% |
Hangul
Value | Count | Frequency (%) |
제 | 295 | 14.3% |
주 | 268 | 13.0% |
시 | 170 | 8.2% |
유 | 162 | 7.9% |
업 | 99 | 4.8% |
서 | 72 | 3.5% |
귀 | 72 | 3.5% |
포 | 72 | 3.5% |
중 | 54 | 2.6% |
경 | 51 | 2.5% |
Other values (67) | 748 |
2018
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 123 |
---|---|
Distinct (%) | 91.8% |
Missing | 36 |
Missing (%) | 21.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11633.552 |
Minimum | 1 |
---|---|
Maximum | 234970 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 5 |
Q1 | 64.25 |
median | 231 |
Q3 | 2207 |
95-th percentile | 71270.2 |
Maximum | 234970 |
Range | 234969 |
Interquartile range (IQR) | 2142.75 |
Descriptive statistics
Standard deviation | 36069.906 |
---|---|
Coefficient of variation (CV) | 3.1005066 |
Kurtosis | 20.596257 |
Mean | 11633.552 |
Median Absolute Deviation (MAD) | 223 |
Skewness | 4.320929 |
Sum | 1558896 |
Variance | 1.3010381 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 3 | 1.8% |
1 | 3 | 1.8% |
4 | 2 | 1.2% |
24 | 2 | 1.2% |
8 | 2 | 1.2% |
103 | 2 | 1.2% |
31 | 2 | 1.2% |
73 | 2 | 1.2% |
11 | 2 | 1.2% |
1966 | 1 | 0.6% |
Other values (113) | 113 | |
(Missing) | 36 | 21.2% |
Value | Count | Frequency (%) |
1 | 3 | |
2 | 1 | 0.6% |
4 | 2 | |
5 | 3 | |
6 | 1 | 0.6% |
7 | 1 | 0.6% |
8 | 2 | |
11 | 2 | |
15 | 1 | 0.6% |
19 | 1 | 0.6% |
Value | Count | Frequency (%) |
234970 | 1 | |
220221 | 1 | |
145062 | 1 | |
125013 | 1 | |
113003 | 1 | |
104688 | 1 | |
85526 | 1 | |
63594 | 1 | |
51607 | 1 | |
51530 | 1 |
2019
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 126 |
---|---|
Distinct (%) | 95.5% |
Missing | 38 |
Missing (%) | 22.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11699.242 |
Minimum | 1 |
---|---|
Maximum | 237208 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.6 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 9.55 |
Q1 | 77.25 |
median | 256.5 |
Q3 | 1230 |
95-th percentile | 74975.1 |
Maximum | 237208 |
Range | 237207 |
Interquartile range (IQR) | 1152.75 |
Descriptive statistics
Standard deviation | 36514.158 |
---|---|
Coefficient of variation (CV) | 3.1210703 |
Kurtosis | 22.254391 |
Mean | 11699.242 |
Median Absolute Deviation (MAD) | 243 |
Skewness | 4.4411155 |
Sum | 1544300 |
Variance | 1.3332838 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
15 | 3 | 1.8% |
841 | 2 | 1.2% |
180 | 2 | 1.2% |
5 | 2 | 1.2% |
7 | 2 | 1.2% |
38 | 1 | 0.6% |
1055 | 1 | 0.6% |
1 | 1 | 0.6% |
2339 | 1 | 0.6% |
120 | 1 | 0.6% |
Other values (116) | 116 | |
(Missing) | 38 | 22.4% |
Value | Count | Frequency (%) |
1 | 1 | 0.6% |
3 | 1 | 0.6% |
5 | 2 | |
7 | 2 | |
9 | 1 | 0.6% |
10 | 1 | 0.6% |
11 | 1 | 0.6% |
13 | 1 | 0.6% |
14 | 1 | 0.6% |
15 | 3 |
Value | Count | Frequency (%) |
237208 | 1 | |
235463 | 1 | |
130506 | 1 | |
128276 | 1 | |
104456 | 1 | |
84218 | 1 | |
82762 | 1 | |
68604 | 1 | |
62434 | 1 | |
54394 | 1 |
2020
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 112 |
---|---|
Distinct (%) | 93.3% |
Missing | 50 |
Missing (%) | 29.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9723.25 |
Minimum | 0 |
---|---|
Maximum | 206460 |
Zeros | 1 |
Zeros (%) | 0.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 4 |
Q1 | 70.5 |
median | 359 |
Q3 | 1338.5 |
95-th percentile | 54330.9 |
Maximum | 206460 |
Range | 206460 |
Interquartile range (IQR) | 1268 |
Descriptive statistics
Standard deviation | 29337.836 |
---|---|
Coefficient of variation (CV) | 3.0172871 |
Kurtosis | 22.746301 |
Mean | 9723.25 |
Median Absolute Deviation (MAD) | 346.5 |
Skewness | 4.4687861 |
Sum | 1166790 |
Variance | 8.6070865 × 108 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 3 | 1.8% |
9 | 2 | 1.2% |
47 | 2 | 1.2% |
4 | 2 | 1.2% |
39 | 2 | 1.2% |
26 | 2 | 1.2% |
20 | 2 | 1.2% |
453 | 1 | 0.6% |
2022 | 1 | 0.6% |
17 | 1 | 0.6% |
Other values (102) | 102 | |
(Missing) | 50 |
Value | Count | Frequency (%) |
0 | 1 | 0.6% |
2 | 1 | 0.6% |
3 | 3 | |
4 | 2 | |
6 | 1 | 0.6% |
8 | 1 | 0.6% |
9 | 2 | |
11 | 1 | 0.6% |
12 | 1 | 0.6% |
13 | 1 | 0.6% |
Value | Count | Frequency (%) |
206460 | 1 | |
150996 | 1 | |
124014 | 1 | |
99795 | 1 | |
73296 | 1 | |
55773 | 1 | |
54255 | 1 | |
50013 | 1 | |
46111 | 1 | |
42473 | 1 |
2021
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 109 |
---|---|
Distinct (%) | 94.0% |
Missing | 54 |
Missing (%) | 31.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 10706.853 |
Minimum | 0 |
---|---|
Maximum | 210893 |
Zeros | 1 |
Zeros (%) | 0.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 4 |
Q1 | 55.75 |
median | 354 |
Q3 | 1707.5 |
95-th percentile | 62610.25 |
Maximum | 210893 |
Range | 210893 |
Interquartile range (IQR) | 1651.75 |
Descriptive statistics
Standard deviation | 32691.817 |
---|---|
Coefficient of variation (CV) | 3.0533543 |
Kurtosis | 21.296367 |
Mean | 10706.853 |
Median Absolute Deviation (MAD) | 335 |
Skewness | 4.3939928 |
Sum | 1241995 |
Variance | 1.0687549 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
31 | 3 | 1.8% |
2 | 3 | 1.8% |
46 | 2 | 1.2% |
341 | 2 | 1.2% |
4 | 2 | 1.2% |
0 | 1 | 0.6% |
93 | 1 | 0.6% |
127 | 1 | 0.6% |
27 | 1 | 0.6% |
280 | 1 | 0.6% |
Other values (99) | 99 | |
(Missing) | 54 |
Value | Count | Frequency (%) |
0 | 1 | 0.6% |
2 | 3 | |
3 | 1 | 0.6% |
4 | 2 | |
7 | 1 | 0.6% |
8 | 1 | 0.6% |
12 | 1 | 0.6% |
14 | 1 | 0.6% |
15 | 1 | 0.6% |
17 | 1 | 0.6% |
Value | Count | Frequency (%) |
210893 | 1 | |
191807 | 1 | |
136477 | 1 | |
102054 | 1 | |
70785 | 1 | |
63178 | 1 | |
62421 | 1 | |
55750 | 1 | |
47443 | 1 | |
45679 | 1 |
2022
Real number (ℝ)
HIGH CORRELATION
  MISSING
  ZEROS
 
Distinct | 108 |
---|---|
Distinct (%) | 93.1% |
Missing | 54 |
Missing (%) | 31.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11012.052 |
Minimum | 0 |
---|---|
Maximum | 211065 |
Zeros | 2 |
Zeros (%) | 1.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.6 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 2.75 |
Q1 | 53.25 |
median | 368 |
Q3 | 1715.5 |
95-th percentile | 59414 |
Maximum | 211065 |
Range | 211065 |
Interquartile range (IQR) | 1662.25 |
Descriptive statistics
Standard deviation | 32818.091 |
---|---|
Coefficient of variation (CV) | 2.9801977 |
Kurtosis | 20.353258 |
Mean | 11012.052 |
Median Absolute Deviation (MAD) | 346.5 |
Skewness | 4.2868297 |
Sum | 1277398 |
Variance | 1.0770271 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 3 | 1.8% |
0 | 2 | 1.2% |
61 | 2 | 1.2% |
13 | 2 | 1.2% |
7 | 2 | 1.2% |
45 | 2 | 1.2% |
520 | 2 | 1.2% |
475 | 1 | 0.6% |
21 | 1 | 0.6% |
146 | 1 | 0.6% |
Other values (98) | 98 | |
(Missing) | 54 |
Value | Count | Frequency (%) |
0 | 2 | |
1 | 1 | 0.6% |
2 | 3 | |
3 | 1 | 0.6% |
7 | 2 | |
10 | 1 | 0.6% |
12 | 1 | 0.6% |
13 | 2 | |
15 | 1 | 0.6% |
21 | 1 | 0.6% |
Value | Count | Frequency (%) |
211065 | 1 | |
186782 | 1 | |
144475 | 1 | |
97085 | 1 | |
70155 | 1 | |
63398 | 1 | |
58086 | 1 | |
54977 | 1 | |
53382 | 1 | |
49471 | 1 |
2018 | 2019 | 2020 | 2021 | 2022 | |
---|---|---|---|---|---|
2018 | 1.000 | 0.934 | 0.936 | 0.987 | 0.893 |
2019 | 0.934 | 1.000 | 0.917 | 0.977 | 0.889 |
2020 | 0.936 | 0.917 | 1.000 | 0.958 | 0.992 |
2021 | 0.987 | 0.977 | 0.958 | 1.000 | 0.955 |
2022 | 0.893 | 0.889 | 0.992 | 0.955 | 1.000 |
2018 | 2019 | 2020 | 2021 | 2022 | |
---|---|---|---|---|---|
2018 | 1.000 | 0.960 | 0.870 | 0.792 | 0.812 |
2019 | 0.960 | 1.000 | 0.885 | 0.797 | 0.857 |
2020 | 0.870 | 0.885 | 1.000 | 0.896 | 0.912 |
2021 | 0.792 | 0.797 | 0.896 | 1.000 | 0.902 |
2022 | 0.812 | 0.857 | 0.912 | 0.902 | 1.000 |
시군구_산업_제품 | 2018 | 2019 | 2020 | 2021 | 2022 | |
---|---|---|---|---|---|---|
0 | 제주제주시_농림수산업_무연보통휘발유 | 5488 | 1107 | 312 | 313 | 345 |
1 | 제주제주시_농림수산업_실내등유 | 6615 | 1105 | 518 | 546 | 525 |
2 | 제주제주시_농림수산업_경유(0.05%) | 2072 | 2829 | 4377 | 46 | <NA> |
3 | 제주제주시_농림수산업_경유(0.001%) | 2556 | 1309 | 2270 | 1149 | 1080 |
4 | 제주제주시_농림수산업_경질중유(2.0%) | 108 | 7 | <NA> | <NA> | <NA> |
5 | 제주제주시_농림수산업_경질중유(0.3%) | 5 | 3 | <NA> | <NA> | <NA> |
6 | 제주제주시_농림수산업_중유(1.0%) | 7 | <NA> | <NA> | <NA> | <NA> |
7 | 제주제주시_농림수산업_중유(0.5%) | 116 | <NA> | <NA> | <NA> | <NA> |
8 | 제주제주시_농림수산업_중유(0.3%) | 734 | 664 | 396 | 225 | 112 |
9 | 제주제주시_농림수산업_부생연료유(등유형) | <NA> | <NA> | <NA> | 2741 | 7114 |
시군구_산업_제품 | 2018 | 2019 | 2020 | 2021 | 2022 | |
---|---|---|---|---|---|---|
160 | 제주서귀포시_가정_용제원료 | 8 | 9 | 11 | 8 | 7 |
161 | 제주서귀포시_가정_프로판 | 16510 | 17419 | 17363 | 17825 | 18683 |
162 | 제주서귀포시_가정_부탄 | 220 | 221 | 188 | 177 | 181 |
163 | 제주서귀포시_공공_무연보통휘발유 | 180 | 99 | 80 | 37 | 59 |
164 | 제주서귀포시_공공_실내등유 | 316 | 276 | 270 | 257 | 147 |
165 | 제주서귀포시_공공_경유(0.05%) | 38066 | 23822 | 24222 | 13752 | 13738 |
166 | 제주서귀포시_공공_경유(0.001%) | 1108 | 841 | 828 | 341 | 378 |
167 | 제주서귀포시_공공_경질중유(0.3%) | 242 | 287 | 765 | <NA> | <NA> |
168 | 제주서귀포시_공공_중유(0.3%) | 43 | 57 | 9 | 117 | 12 |
169 | 제주서귀포시_공공_프로판 | 150 | 162 | 160 | 164 | 177 |