Overview

Dataset statistics

Number of variables5
Number of observations223
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.5 KiB
Average record size in memory43.6 B

Variable types

Text2
Numeric3

Dataset

Description세라믹산업 對 중국 현황 관련 자료입니다. (분류, 광물, 무역수지 / 단위:천달러 등 항목 제공) (기준연도: 2014년도)
Author한국세라믹기술원
URLhttps://www.data.go.kr/data/15051226/fileData.do

Alerts

무역수지(2012 / 천달러) is highly overall correlated with 무역수지(2013 / 천달러) and 1 other fieldsHigh correlation
무역수지(2013 / 천달러) is highly overall correlated with 무역수지(2012 / 천달러) and 1 other fieldsHigh correlation
무역수지(2014 / 천달러) is highly overall correlated with 무역수지(2012 / 천달러) and 1 other fieldsHigh correlation
분 류 has unique valuesUnique
무역수지(2012 / 천달러) has 56 (25.1%) zerosZeros
무역수지(2013 / 천달러) has 56 (25.1%) zerosZeros
무역수지(2014 / 천달러) has 55 (24.7%) zerosZeros

Reproduction

Analysis started2023-12-12 16:23:22.992962
Analysis finished2023-12-12 16:23:24.732710
Duration1.74 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분 류
Text

UNIQUE 

Distinct223
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-13T01:23:25.128810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.5067265
Min length1

Characters and Unicode

Total characters1005
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique223 ?
Unique (%)100.0%

Sample

1st rowA
2nd rowA01
3rd rowA0101
4th rowA0102
5th rowA02
ValueCountFrequency (%)
a 1
 
0.4%
d0104 1
 
0.4%
d0201 1
 
0.4%
d0202 1
 
0.4%
d0203 1
 
0.4%
d0204 1
 
0.4%
d03 1
 
0.4%
d0301 1
 
0.4%
d0302 1
 
0.4%
d0303 1
 
0.4%
Other values (213) 213
95.5%
2023-12-13T01:23:25.807502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 376
37.4%
1 90
 
9.0%
2 76
 
7.6%
3 61
 
6.1%
D 53
 
5.3%
4 52
 
5.2%
B 52
 
5.2%
A 50
 
5.0%
5 47
 
4.7%
E 35
 
3.5%
Other values (5) 113
 
11.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 782
77.8%
Uppercase Letter 223
 
22.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 376
48.1%
1 90
 
11.5%
2 76
 
9.7%
3 61
 
7.8%
4 52
 
6.6%
5 47
 
6.0%
6 30
 
3.8%
7 23
 
2.9%
8 17
 
2.2%
9 10
 
1.3%
Uppercase Letter
ValueCountFrequency (%)
D 53
23.8%
B 52
23.3%
A 50
22.4%
E 35
15.7%
C 33
14.8%

Most occurring scripts

ValueCountFrequency (%)
Common 782
77.8%
Latin 223
 
22.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 376
48.1%
1 90
 
11.5%
2 76
 
9.7%
3 61
 
7.8%
4 52
 
6.6%
5 47
 
6.0%
6 30
 
3.8%
7 23
 
2.9%
8 17
 
2.2%
9 10
 
1.3%
Latin
ValueCountFrequency (%)
D 53
23.8%
B 52
23.3%
A 50
22.4%
E 35
15.7%
C 33
14.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1005
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 376
37.4%
1 90
 
9.0%
2 76
 
7.6%
3 61
 
6.1%
D 53
 
5.3%
4 52
 
5.2%
B 52
 
5.2%
A 50
 
5.0%
5 47
 
4.7%
E 35
 
3.5%
Other values (5) 113
 
11.2%

광물
Text

Distinct212
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-13T01:23:26.104670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length19
Mean length6.3497758
Min length2

Characters and Unicode

Total characters1416
Distinct characters221
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique206 ?
Unique (%)92.4%

Sample

1st row광물
2nd row규산질 원료
3rd row규사
4th row규조토
5th row규산알루미늄 원료
ValueCountFrequency (%)
기타 26
 
7.1%
부품 22
 
6.0%
원료 15
 
4.1%
13
 
3.6%
세라믹 12
 
3.3%
제품 6
 
1.6%
반도체 4
 
1.1%
도자기 4
 
1.1%
4
 
1.1%
복합산화물 4
 
1.1%
Other values (226) 254
69.8%
2023-12-13T01:23:26.550431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
141
 
10.0%
50
 
3.5%
46
 
3.2%
45
 
3.2%
43
 
3.0%
35
 
2.5%
33
 
2.3%
32
 
2.3%
31
 
2.2%
28
 
2.0%
Other values (211) 932
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1237
87.4%
Space Separator 141
 
10.0%
Other Punctuation 10
 
0.7%
Uppercase Letter 10
 
0.7%
Close Punctuation 6
 
0.4%
Open Punctuation 6
 
0.4%
Lowercase Letter 5
 
0.4%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
4.0%
46
 
3.7%
45
 
3.6%
43
 
3.5%
35
 
2.8%
33
 
2.7%
32
 
2.6%
31
 
2.5%
28
 
2.3%
24
 
1.9%
Other values (196) 870
70.3%
Lowercase Letter
ValueCountFrequency (%)
e 1
20.0%
l 1
20.0%
u 1
20.0%
d 1
20.0%
o 1
20.0%
Uppercase Letter
ValueCountFrequency (%)
L 3
30.0%
E 3
30.0%
D 3
30.0%
M 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
/ 9
90.0%
· 1
 
10.0%
Space Separator
ValueCountFrequency (%)
141
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1237
87.4%
Common 164
 
11.6%
Latin 15
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
4.0%
46
 
3.7%
45
 
3.6%
43
 
3.5%
35
 
2.8%
33
 
2.7%
32
 
2.6%
31
 
2.5%
28
 
2.3%
24
 
1.9%
Other values (196) 870
70.3%
Latin
ValueCountFrequency (%)
L 3
20.0%
E 3
20.0%
D 3
20.0%
e 1
 
6.7%
l 1
 
6.7%
u 1
 
6.7%
d 1
 
6.7%
o 1
 
6.7%
M 1
 
6.7%
Common
ValueCountFrequency (%)
141
86.0%
/ 9
 
5.5%
) 6
 
3.7%
( 6
 
3.7%
· 1
 
0.6%
1 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1237
87.4%
ASCII 178
 
12.6%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
141
79.2%
/ 9
 
5.1%
) 6
 
3.4%
( 6
 
3.4%
L 3
 
1.7%
E 3
 
1.7%
D 3
 
1.7%
1 1
 
0.6%
e 1
 
0.6%
l 1
 
0.6%
Other values (4) 4
 
2.2%
Hangul
ValueCountFrequency (%)
50
 
4.0%
46
 
3.7%
45
 
3.6%
43
 
3.5%
35
 
2.8%
33
 
2.7%
32
 
2.6%
31
 
2.5%
28
 
2.3%
24
 
1.9%
Other values (196) 870
70.3%
None
ValueCountFrequency (%)
· 1
100.0%

무역수지(2012 / 천달러)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct159
Distinct (%)71.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-20108.269
Minimum-2159958
Maximum1042730
Zeros56
Zeros (%)25.1%
Negative103
Negative (%)46.2%
Memory size2.1 KiB
2023-12-13T01:23:26.705578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-2159958
5-th percentile-179310.4
Q1-7854
median0
Q3112
95-th percentile102397.5
Maximum1042730
Range3202688
Interquartile range (IQR)7966

Descriptive statistics

Standard deviation210263.02
Coefficient of variation (CV)-10.456545
Kurtosis53.941429
Mean-20108.269
Median Absolute Deviation (MAD)3921
Skewness-4.6102736
Sum-4484144
Variance4.4210536 × 1010
MonotonicityNot monotonic
2023-12-13T01:23:26.854766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 56
 
25.1%
10135 2
 
0.9%
-12 2
 
0.9%
-485 2
 
0.9%
54 2
 
0.9%
-30521 2
 
0.9%
6457 2
 
0.9%
-22963 2
 
0.9%
4610 2
 
0.9%
8472 2
 
0.9%
Other values (149) 149
66.8%
ValueCountFrequency (%)
-2159958 1
0.4%
-900110 1
0.4%
-754218 1
0.4%
-703667 1
0.4%
-596526 1
0.4%
-295086 1
0.4%
-256881 1
0.4%
-222318 1
0.4%
-221192 1
0.4%
-219794 1
0.4%
ValueCountFrequency (%)
1042730 1
0.4%
794276 1
0.4%
575322 1
0.4%
257439 1
0.4%
226365 1
0.4%
217663 1
0.4%
211336 1
0.4%
184346 1
0.4%
184150 1
0.4%
139423 1
0.4%

무역수지(2013 / 천달러)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct160
Distinct (%)71.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-27107.229
Minimum-2482213
Maximum883559
Zeros56
Zeros (%)25.1%
Negative101
Negative (%)45.3%
Memory size2.1 KiB
2023-12-13T01:23:26.993901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-2482213
5-th percentile-177859
Q1-8953
median0
Q3260.5
95-th percentile118964.4
Maximum883559
Range3365772
Interquartile range (IQR)9213.5

Descriptive statistics

Standard deviation226345.64
Coefficient of variation (CV)-8.3500105
Kurtosis66.662044
Mean-27107.229
Median Absolute Deviation (MAD)4623
Skewness-6.4208578
Sum-6044912
Variance5.123235 × 1010
MonotonicityNot monotonic
2023-12-13T01:23:27.129347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 56
 
25.1%
-23004 2
 
0.9%
7305 2
 
0.9%
161 2
 
0.9%
-26415 2
 
0.9%
827 2
 
0.9%
-446 2
 
0.9%
33884 2
 
0.9%
7539 2
 
0.9%
-1890 1
 
0.4%
Other values (150) 150
67.3%
ValueCountFrequency (%)
-2482213 1
0.4%
-1050495 1
0.4%
-889152 1
0.4%
-878156 1
0.4%
-655573 1
0.4%
-375302 1
0.4%
-306414 1
0.4%
-249063 1
0.4%
-215728 1
0.4%
-215256 1
0.4%
ValueCountFrequency (%)
883559 1
0.4%
558245 1
0.4%
286122 1
0.4%
271525 1
0.4%
241932 1
0.4%
222665 1
0.4%
214969 1
0.4%
210154 1
0.4%
152348 1
0.4%
149372 1
0.4%

무역수지(2014 / 천달러)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct160
Distinct (%)71.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-23785.52
Minimum-2337646
Maximum795795
Zeros55
Zeros (%)24.7%
Negative98
Negative (%)43.9%
Memory size2.1 KiB
2023-12-13T01:23:27.263984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-2337646
5-th percentile-221837.7
Q1-9620
median0
Q3506
95-th percentile139100.3
Maximum795795
Range3133441
Interquartile range (IQR)10126

Descriptive statistics

Standard deviation212691.78
Coefficient of variation (CV)-8.9420696
Kurtosis66.16478
Mean-23785.52
Median Absolute Deviation (MAD)3794
Skewness-6.2387175
Sum-5304171
Variance4.5237792 × 1010
MonotonicityNot monotonic
2023-12-13T01:23:27.385191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 55
 
24.7%
-20323 2
 
0.9%
1246 2
 
0.9%
-222 2
 
0.9%
94 2
 
0.9%
61 2
 
0.9%
-38230 2
 
0.9%
10136 2
 
0.9%
25601 2
 
0.9%
3266 2
 
0.9%
Other values (150) 150
67.3%
ValueCountFrequency (%)
-2337646 1
0.4%
-964500 1
0.4%
-709724 1
0.4%
-707530 1
0.4%
-500368 1
0.4%
-471839 1
0.4%
-287687 1
0.4%
-263889 1
0.4%
-251689 1
0.4%
-225356 1
0.4%
ValueCountFrequency (%)
795795 1
0.4%
509453 1
0.4%
317632 1
0.4%
306523 1
0.4%
305856 1
0.4%
291004 1
0.4%
273229 1
0.4%
236480 1
0.4%
236272 1
0.4%
225123 1
0.4%

Interactions

2023-12-13T01:23:24.187228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:23.240692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:23.549152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:24.286730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:23.340554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:23.654405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:24.437284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:23.453647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:23:23.759763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:23:27.464168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
무역수지(2012 / 천달러)무역수지(2013 / 천달러)무역수지(2014 / 천달러)
무역수지(2012 / 천달러)1.0000.8650.927
무역수지(2013 / 천달러)0.8651.0000.983
무역수지(2014 / 천달러)0.9270.9831.000
2023-12-13T01:23:27.543719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
무역수지(2012 / 천달러)무역수지(2013 / 천달러)무역수지(2014 / 천달러)
무역수지(2012 / 천달러)1.0000.9490.890
무역수지(2013 / 천달러)0.9491.0000.944
무역수지(2014 / 천달러)0.8900.9441.000

Missing values

2023-12-13T01:23:24.557037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:23:24.681871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분 류광물무역수지(2012 / 천달러)무역수지(2013 / 천달러)무역수지(2014 / 천달러)
0A광물-208671-167412-193830
1A01규산질 원료-10623-10755-10241
2A0101규사-4913-2589-1443
3A0102규조토-5709-8166-8798
4A02규산알루미늄 원료-37458-36080-42618
5A0201실리마나이트족 광물-30-9
6A0202카올린족 광물-10607-11016-13133
7A0203엽납석788549
8A0204점토-26927-25149-29524
9A03알루미나 원료6457730510136
분 류광물무역수지(2012 / 천달러)무역수지(2013 / 천달러)무역수지(2014 / 천달러)
213E0601필터456979752
214E0602촉매담체-122914545
215E0603기타000
216E07열적 세라믹 부품-1346-2195-2268
217E0701내열세라믹 부품000
218E0702발열용 부품-367-1244-1389
219E0703금속제조용 부품-979-951-879
220E08방탄 세라믹 부품000
221E0801방탄용 부품000
222E0802기타000