Overview

Dataset statistics

Number of variables: 7
Number of observations: 120
Missing cells: 1
Missing cells (%): 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.2 KiB
Average record size in memory: 61.1 B

Variable types

Categorical: 3
Numeric: 3
Text: 1

Dataset

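Description: The Korea Mine Rehabilitation and Mineral Resources Corporation (한국광해광업공단) provides the production rankings of the top 10 major producers for each commodity (copper, zinc, iron ore, nickel, coal, uranium), together with their production volumes for 2020 and 2021.
URL: https://www.data.go.kr/data/3070252/fileData.do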

Alerts

생산량단위 is highly overall correlated with 생산량 and 1 other field (High correlation)
광종 is highly overall correlated with 년도 and 1 other field (High correlation)
순위 is highly overall correlated with 점유율(퍼센트) (High correlation)
생산량 is highly overall correlated with 점유율(퍼센트) and 1 other field (High correlation)
점유율(퍼센트) is highly overall correlated with 순위 and 1 other field (High correlation)
년도 is highly overall correlated with 광종 (High correlation)

Reproduction

Analysis started: 2023-12-12 07:55:15.714674
Analysis finished: 2023-12-12 07:55:17.379470
Duration: 1.66 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
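A report of this kind can be regenerated along the following lines. This is a minimal sketch, not the configuration actually used here: the function name `profile_csv`, the default output path, and the assumption that the data.go.kr file has been downloaded as a local CSV are all illustrative, and the file's encoding is not stated in this report.

```python
from pathlib import Path


def profile_csv(csv_path, out_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write the HTML report; returns the report path."""
    # Heavy third-party imports are deferred until the function is called.
    import pandas as pd
    from ydata_profiling import ProfileReport

    frame = pd.read_csv(csv_path, encoding=encoding)
    # The report title is taken from the file name (an illustrative choice).
    report = ProfileReport(frame, title=Path(csv_path).stem)
    report.to_file(out_path)
    return Path(out_path)


# Usage (hypothetical local file name; adjust the encoding if the export is not UTF-8):
# profile_csv("major_producers.csv", out_path="major_producers.html")
```

The timestamps and duration above will differ on every run; the statistics should not, provided the same file and library version are used.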

Variables

년도 (year)
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 1.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2020
3rd row: 2020
4th row: 2020
5th row: 2020

Common Values

Value  Count  Frequency (%)
2020   60     50.0%
2021   60     50.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value  Count  Frequency (%)
2020   60     50.0%
2021   60     50.0%

광종 (commodity)
Categorical

HIGH CORRELATION

Distinct: 8
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 15
Median length: 8
Mean length: 3.6666667
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 동
2nd row: 동
3rd row: 동
4th row: 동
5th row: 동

Common Values

Value               Count  Frequency (%)
동                  20     16.7%
아연                20     16.7%
철광석              20     16.7%
니켈                20     16.7%
석탄                10     8.3%
우라늄              10     8.3%
석탄(2017)          10     8.3%
우라늄(U3O8, 2017)  10     8.3%

English glosses: 동 = copper, 아연 = zinc, 철광석 = iron ore, 니켈 = nickel, 석탄 = coal, 우라늄 = uranium.
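Coal and uranium each appear under two labels, one plain and one carrying a parenthesised annotation. Before grouping by commodity it may help to separate the base name from the annotation. The sketch below is one way to do that; the helper name `split_label` is hypothetical, and whether the annotated rows should actually be merged with the plain ones is a decision the report does not settle, since the annotation may mark a different reference basis.

```python
import re

# Matches a label that ends with one parenthesised annotation, e.g. "석탄(2017)".
_ANNOTATED = re.compile(r"^(?P<name>[^()]+?)\s*\((?P<note>[^()]*)\)\s*$")


def split_label(label):
    """Return (base_name, annotation); annotation is None when absent."""
    match = _ANNOTATED.match(label)
    if match is None:
        return label.strip(), None
    return match.group("name").strip(), match.group("note").strip()


for raw in ["석탄", "석탄(2017)", "우라늄", "우라늄(U3O8, 2017)"]:
    print(raw, "->", split_label(raw))
```

Keeping the annotation as a separate field preserves the original distinction while still allowing rows to be grouped by base commodity.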

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value        Count  Frequency (%)
동           20     15.4%
아연         20     15.4%
철광석       20     15.4%
니켈         20     15.4%
석탄         10     7.7%
우라늄       10     7.7%
석탄(2017    10     7.7%
우라늄(u3o8  10     7.7%
2017         10     7.7%

순위 (rank)
Real number (ℝ)

HIGH CORRELATION

Distinct: 10
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.5
Minimum: 1
Maximum: 10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
median: 5.5
Q3: 8
95-th percentile: 10
Maximum: 10
Range: 9
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.8843245
Coefficient of variation (CV): 0.52442263
Kurtosis: -1.2251099
Mean: 5.5
Median Absolute Deviation (MAD): 2.5
Skewness: 0
Sum: 660
Variance: 8.3193277
Monotonicity: Not monotonic
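Because every rank from 1 to 10 occurs exactly 12 times, the figures above can be checked by hand. The sketch below rebuilds the column from that frequency table using only the standard library. It assumes the sample (n - 1) variance and linearly interpolated quartiles, which is what the reported values are consistent with; it is not taken from the profiler's source.

```python
import statistics

# Rebuild the 120 rank values: each of 1..10 appears 12 times.
ranks = [rank for rank in range(1, 11) for _ in range(12)]

mean = statistics.mean(ranks)
variance = statistics.variance(ranks)  # sample variance, divides by n - 1
std = statistics.stdev(ranks)
cv = std / mean                        # coefficient of variation

# Quartiles with linear interpolation over the closed range of the data.
q1, median, q3 = statistics.quantiles(ranks, n=4, method="inclusive")

# Median absolute deviation around the median.
mad = statistics.median(abs(rank - median) for rank in ranks)

print(mean, round(variance, 7), round(std, 7), round(cv, 8))
print(q1, median, q3, q3 - q1, mad)
```

The sum of squared deviations is 12 × 82.5 = 990, so the variance is 990 / 119, which matches the reported 8.3193277.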

[Figure: Histogram with fixed size bins (bins=10)]

Value  Count  Frequency (%)
1      12     10.0%
2      12     10.0%
3      12     10.0%
4      12     10.0%
5      12     10.0%
6      12     10.0%
7      12     10.0%
8      12     10.0%
9      12     10.0%
10     12     10.0%

기업명 (company name)
Text

Distinct: 87
Distinct (%): 73.1%
Missing: 1
Missing (%): 0.8%
Memory size: 1.1 KiB

Length

Max length: 30
Median length: 21
Mean length: 16.05042
Min length: 3

Characters and Unicode

Total characters: 1910
Distinct characters: 59
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 69
Unique (%): 58.0%

Sample

1st row: Codelco
2nd row: Glencore
3rd row: BHP Group
4th row: Freeport-McMoRan
5th row: Southern Copper (ex SPCC)

Words

Value               Count  Frequency (%)
ltd                 20     6.5%
group               16     5.2%
co                  14     4.5%
plc                 11     3.6%
bhp                 10     3.2%
glencore            8      2.6%
corp                7      2.3%
energy              7      2.3%
mining              7      2.3%
vale                6      1.9%
Other values (112)  202    65.6%
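The word counts for 기업명 are lowercase tokens with punctuation removed ("Ltd." is counted as "ltd"). A rough way to reproduce counts of this kind is sketched below on the 20 company names visible in the Sample section of this report. The `tokenize` helper is a hypothetical stand-in and is not claimed to match the profiler's own tokenizer.

```python
import string
from collections import Counter

# Company names copied from the first and last sample rows of this report.
names = [
    "Codelco", "Glencore", "BHP Group", "Freeport-McMoRan",
    "Southern Copper (ex SPCC)", "First Quantum Minerals",
    "KGHM Polska Miedz", "Rio Tinto", "Antofagasta plc",
    "Anglo American plc", "JSC Natl Atomic Co Kazatomprom",
    "Cameco Corp.", "Orano SA", "Uranium One Inc.",
    "Navoi Mining & Metallurgical", "JSC Atomredmetzoloto", "BHP Group",
    "Energy Asia (BVI) Ltd.", "Energy Rsrc Australia Ltd",
    "China National Nuclear Corp.",
]


def tokenize(name):
    """Split on whitespace, trim surrounding punctuation, lowercase."""
    tokens = (word.strip(string.punctuation).lower() for word in name.split())
    return [token for token in tokens if token]  # drops bare "&" and similar


counts = Counter(token for name in names for token in tokenize(name))
print(counts.most_common(6))
```

On the full column the most frequent tokens are corporate suffixes (ltd, group, co, plc), so a suffix stop-list would be a natural next step before matching company names across years.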

Most occurring characters

Value              Count  Frequency (%)
(space)            190    9.9%
o                  136    7.1%
n                  134    7.0%
e                  124    6.5%
i                  102    5.3%
a                  100    5.2%
r                  100    5.2%
t                  93     4.9%
l                  75     3.9%
c                  72     3.8%
Other values (49)  784    41.0%

Most occurring categories

Value              Count  Frequency (%)
Lowercase Letter   1262   66.1%
Uppercase Letter   386    20.2%
Space Separator    190    9.9%
Other Punctuation  60     3.1%
Decimal Number     4      0.2%
Open Punctuation   3      0.2%
Close Punctuation  3      0.2%
Dash Punctuation   2      0.1%
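The category names in this table are Unicode general categories. Python's standard `unicodedata` module reports them as two-letter codes (Ll = Lowercase Letter, Lu = Uppercase Letter, Zs = Space Separator, Po = Other Punctuation, Nd = Decimal Number, Ps = Open Punctuation, Pe = Close Punctuation, Pd = Dash Punctuation). As a small illustration, applied to a single sample value rather than the whole column:

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(char) for char in text)


counts = category_counts("Southern Copper (ex SPCC)")
for code, count in counts.most_common():
    print(code, count)
```

Summing such counters over every non-missing 기업명 value would give totals comparable to the table above.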

Most frequent character per category

Lowercase Letter
Value              Count  Frequency (%)
o                  136    10.8%
n                  134    10.6%
e                  124    9.8%
i                  102    8.1%
a                  100    7.9%
r                  100    7.9%
t                  93     7.4%
l                  75     5.9%
c                  72     5.7%
u                  50     4.0%
Other values (15)  276    21.9%

Uppercase Letter
Value              Count  Frequency (%)
C                  49     12.7%
A                  36     9.3%
M                  34     8.8%
G                  30     7.8%
P                  29     7.5%
S                  26     6.7%
L                  26     6.7%
H                  18     4.7%
N                  17     4.4%
R                  16     4.1%
Other values (14)  105    27.2%

Other Punctuation
Value  Count  Frequency (%)
.      50     83.3%
&      5      8.3%
?      4      6.7%
/      1      1.7%

Decimal Number
Value  Count  Frequency (%)
3      2      50.0%
2      2      50.0%

Space Separator
Value    Count  Frequency (%)
(space)  190    100.0%

Open Punctuation
Value  Count  Frequency (%)
(      3      100.0%

Close Punctuation
Value  Count  Frequency (%)
)      3      100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      2      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Latin   1648   86.3%
Common  262    13.7%

Most frequent character per script

Latin
Value              Count  Frequency (%)
o                  136    8.3%
n                  134    8.1%
e                  124    7.5%
i                  102    6.2%
a                  100    6.1%
r                  100    6.1%
t                  93     5.6%
l                  75     4.6%
c                  72     4.4%
u                  50     3.0%
Other values (39)  662    40.2%

Common
Value    Count  Frequency (%)
(space)  190    72.5%
.        50     19.1%
&        5      1.9%
?        4      1.5%
(        3      1.1%
)        3      1.1%
-        2      0.8%
3        2      0.8%
2        2      0.8%
/        1      0.4%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  1910   100.0%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
(space)            190    9.9%
o                  136    7.1%
n                  134    7.0%
e                  124    6.5%
i                  102    5.3%
a                  100    5.2%
r                  100    5.2%
t                  93     4.9%
l                  75     3.9%
c                  72     3.8%
Other values (49)  784    41.0%

생산량 (production volume)
Real number (ℝ)

HIGH CORRELATION

Distinct: 101
Distinct (%): 84.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1060029.9
Minimum: 31
Maximum: 31560863
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 31
5-th percentile: 37.95
Q1: 81
median: 250.5
Q3: 1107.75
95-th percentile: 6247150.2
Maximum: 31560863
Range: 31560832
Interquartile range (IQR): 1026.75

Descriptive statistics

Standard deviation: 4388279.4
Coefficient of variation (CV): 4.1397695
Kurtosis: 28.360926
Mean: 1060029.9
Median Absolute Deviation (MAD): 207.5
Skewness: 5.1339943
Sum: 1.2720359 × 10^8
Variance: 1.9256996 × 10^13
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value              Count  Frequency (%)
245                2      1.7%
36                 2      1.7%
560                2      1.7%
419                2      1.7%
37                 2      1.7%
56                 2      1.7%
44                 2      1.7%
42                 2      1.7%
72                 2      1.7%
172                2      1.7%
Other values (91)  100    83.3%

10 smallest values

Value  Count  Frequency (%)
31     1      0.8%
33     1      0.8%
36     2      1.7%
37     2      1.7%
38     1      0.8%
39     1      0.8%
40     1      0.8%
41     1      0.8%
42     2      1.7%
44     2      1.7%

10 largest values

Value     Count  Frequency (%)
31560863  1      0.8%
23843200  1      0.8%
20905305  1      0.8%
13254370  1      0.8%
8839239   1      0.8%
7322117   1      0.8%
6190573   1      0.8%
6058960   1      0.8%
5057404   1      0.8%
4094936   1      0.8%

생산량단위 (production unit)
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 3
Median length: 2
Mean length: 2.4166667
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 천톤
2nd row: 천톤
3rd row: 천톤
4th row: 천톤
5th row: 천톤

Common Values

Value   Count  Frequency (%)
천톤    60     50.0%
백만톤  40     33.3%
톤U     10     8.3%
LBS     10     8.3%

English glosses: 천톤 = thousand tonnes, 백만톤 = million tonnes, 톤U = tonnes of uranium (tU), LBS = pounds.
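Since 생산량 mixes four units, its summary statistics (a mean of about 1.06 million against a median of 250.5) mostly reflect unit differences rather than production differences. A minimal sketch of normalising to tonnes follows. The mapping is an assumption based on the usual reading of the unit labels, and it converts mass only: 톤U rows count contained uranium, while the LBS rows carry a U3O8 annotation in 광종, so the two uranium series still measure different substances after conversion.

```python
# Tonnes per one reported unit. Assumed reading of the labels, not stated in the source.
TONNES_PER_UNIT = {
    "천톤": 1_000.0,             # thousand tonnes
    "백만톤": 1_000_000.0,       # million tonnes
    "톤U": 1.0,                  # tonnes of contained uranium
    "LBS": 0.45359237 / 1000.0,  # avoirdupois pounds to tonnes
}


def to_tonnes(quantity, unit):
    """Convert a reported quantity to tonnes; raises ValueError on unknown units."""
    try:
        factor = TONNES_PER_UNIT[unit]
    except KeyError:
        raise ValueError(f"unknown production unit: {unit!r}") from None
    return quantity * factor


print(to_tonnes(1727, "천톤"))     # first sample row
print(to_tonnes(31560863, "LBS"))  # largest value in the column
```

Even after conversion, values are only comparable within a commodity, so the statistics are better computed per 광종 group than over the whole column.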

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value   Count  Frequency (%)
천톤    60     50.0%
백만톤  40     33.3%
톤u     10     8.3%
lbs     10     8.3%

점유율(퍼센트) (market share, percent)
Real number (ℝ)

HIGH CORRELATION

Distinct: 64
Distinct (%): 53.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.5866667
Minimum: 0.8
Maximum: 22
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 0.8
5-th percentile: 1.1
Q1: 1.7
median: 3.15
Q3: 6.15
95-th percentile: 12.03
Maximum: 22
Range: 21.2
Interquartile range (IQR): 4.45

Descriptive statistics

Standard deviation: 3.9178097
Coefficient of variation (CV): 0.85417362
Kurtosis: 4.5317415
Mean: 4.5866667
Median Absolute Deviation (MAD): 1.75
Skewness: 1.8671997
Sum: 550.4
Variance: 15.349232
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value              Count  Frequency (%)
1.5                7      5.8%
1.4                4      3.3%
1.1                4      3.3%
2.3                4      3.3%
1.6                3      2.5%
1.3                3      2.5%
1.2                3      2.5%
1.8                3      2.5%
2.0                3      2.5%
1.7                3      2.5%
Other values (54)  83     69.2%

10 smallest values

Value  Count  Frequency (%)
0.8    2      1.7%
0.9    2      1.7%
1.1    4      3.3%
1.2    3      2.5%
1.3    3      2.5%
1.4    4      3.3%
1.5    7      5.8%
1.6    3      2.5%
1.7    3      2.5%
1.8    3      2.5%

10 largest values

Value  Count  Frequency (%)
22.0   1      0.8%
20.5   1      0.8%
15.5   1      0.8%
13.6   1      0.8%
13.3   1      0.8%
12.6   1      0.8%
12.0   1      0.8%
11.9   1      0.8%
10.4   1      0.8%
10.2   1      0.8%

Correlations

                년도    광종    순위    기업명  생산량  생산량단위  점유율(퍼센트)
년도            1.000   0.707   0.000   0.324   0.194   0.557       0.000
광종            0.707   1.000   0.000   0.000   0.556   1.000       0.625
순위            0.000   0.000   1.000   0.871   0.000   0.000       0.586
기업명          0.324   0.000   0.871   1.000   0.993   0.941       0.802
생산량          0.194   0.556   0.000   0.993   1.000   0.682       0.713
생산량단위      0.557   1.000   0.000   0.941   0.682   1.000       0.712
점유율(퍼센트)  0.000   0.625   0.586   0.802   0.713   0.712       1.000

            년도    생산량단위  광종
년도        1.000   0.378       0.526
생산량단위  0.378   1.000       0.983
광종        0.526   0.983       1.000

                순위     생산량   점유율(퍼센트)  년도    광종    생산량단위
순위            1.000    -0.398   -0.747          0.000   0.000   0.000
생산량          -0.398   1.000    0.711           0.202   0.337   0.539
점유율(퍼센트)  -0.747   0.711    1.000           0.000   0.251   0.378
년도            0.000    0.202    0.000           1.000   0.526   0.378
광종            0.000    0.337    0.251           0.526   1.000   0.983
생산량단위      0.000    0.539    0.378           0.378   0.983   1.000
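The report does not name the measure behind each matrix. The 0.526 between 년도 and 광종 in the three-variable categorical matrix is, however, what a bias-corrected Cramér's V gives under one plausible reading of the data. The sketch below makes that reading explicit: the contingency table is an assumption (copper, zinc, iron ore and nickel split 10/10 across the two years, with the plain coal and uranium labels in 2020 only and the annotated labels in 2021 only), and the function is a plain-Python illustration rather than the profiler's implementation.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as rows of counts."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    # Small-sample bias correction of phi-squared and of the table dimensions.
    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


# Assumed counts. Columns: 동, 아연, 철광석, 니켈, 석탄, 우라늄, 석탄(2017), 우라늄(U3O8, 2017)
year_by_commodity = [
    [10, 10, 10, 10, 10, 10, 0, 0],  # 2020
    [10, 10, 10, 10, 0, 0, 10, 10],  # 2021
]
print(round(cramers_v_corrected(year_by_commodity), 3))
```

Under these assumptions the chi-squared statistic is 40 and the uncorrected V would be sqrt(40 / 120) ≈ 0.577, so the match with 0.526 also suggests that the correction is applied. The high 년도 and 광종 association is then an artefact of the labelling: the annotated coal and uranium labels occur in one year only.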

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.]

Sample

(index)  년도  광종  순위  기업명                     생산량  생산량단위  점유율(퍼센트)
0        2020  동    1     Codelco                    1727    천톤        8.2
1        2020  동    2     Glencore                   1316    천톤        6.3
2        2020  동    3     BHP Group                  1221    천톤        5.8
3        2020  동    4     Freeport-McMoRan           1177    천톤        5.6
4        2020  동    5     Southern Copper (ex SPCC)  993     천톤        4.7
5        2020  동    6     First Quantum Minerals     731     천톤        3.5
6        2020  동    7     KGHM Polska Miedz          542     천톤        2.6
7        2020  동    8     Rio Tinto                  497     천톤        2.4
8        2020  동    9     Antofagasta plc            489     천톤        2.3
9        2020  동    10    Anglo American plc         462     천톤        2.2

(index)  년도  광종                순위  기업명                          생산량    생산량단위  점유율(퍼센트)
110      2021  우라늄(U3O8, 2017)  1     JSC Natl Atomic Co Kazatomprom  31560863  LBS         20.5
111      2021  우라늄(U3O8, 2017)  2     Cameco Corp.                    23843200  LBS         15.5
112      2021  우라늄(U3O8, 2017)  3     Orano SA                        20905305  LBS         13.6
113      2021  우라늄(U3O8, 2017)  4     Uranium One Inc.                13254370  LBS         8.6
114      2021  우라늄(U3O8, 2017)  5     Navoi Mining & Metallurgical    8839239   LBS         5.7
115      2021  우라늄(U3O8, 2017)  6     JSC Atomredmetzoloto            7322117   LBS         4.8
116      2021  우라늄(U3O8, 2017)  7     BHP Group                       6190573   LBS         4.0
117      2021  우라늄(U3O8, 2017)  8     Energy Asia (BVI) Ltd.          6058960   LBS         3.9
118      2021  우라늄(U3O8, 2017)  9     Energy Rsrc Australia Ltd       5057404   LBS         3.3
119      2021  우라늄(U3O8, 2017)  10    China National Nuclear Corp.    4094936   LBS         2.7
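Each block of ten sample rows is a complete top-10 list, so the 점유율(퍼센트) values can be summed to see how concentrated a market is. A small worked example on the two blocks shown, using only the shares printed above (the variable and function names are illustrative):

```python
# Shares (percent) copied from the sample rows: index 0-9 and index 110-119.
first_block_2020 = [8.2, 6.3, 5.8, 5.6, 4.7, 3.5, 2.6, 2.4, 2.3, 2.2]
uranium_block_2021 = [20.5, 15.5, 13.6, 8.6, 5.7, 4.8, 4.0, 3.9, 3.3, 2.7]


def combined_share(shares):
    """Total market share of a top-10 list, rounded to one decimal place."""
    return round(sum(shares), 1)


print(combined_share(first_block_2020))    # 43.6
print(combined_share(uranium_block_2021))  # 82.6
```

The ten producers in the first block hold 43.6% between them, against 82.6% for the ten uranium producers. This is in line with the negative association between 순위 and 점유율(퍼센트) in the correlation table: share falls steeply with rank, and more steeply in the more concentrated market.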