Overview

Dataset statistics

Number of variables5
Number of observations223
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.5 KiB
Average record size in memory43.6 B

Variable types

Text2
Numeric3

Dataset

Description세라믹산업 對 일본 무역수지 현황 자료(세라믹 분류, 무역수지 / 단위:천달러 등) 입니다. (기준연도 : 2014년도)
Author한국세라믹기술원
URLhttps://www.data.go.kr/data/15051229/fileData.do

Alerts

무역수지(2012 / 천달러) is highly overall correlated with 무역수지(2013 / 천달러) and 1 other fieldsHigh correlation
무역수지(2013 / 천달러) is highly overall correlated with 무역수지(2012 / 천달러) and 1 other fieldsHigh correlation
무역수지(2014 / 천달러) is highly overall correlated with 무역수지(2012 / 천달러) and 1 other fieldsHigh correlation
분 류 has unique valuesUnique
무역수지(2012 / 천달러) has 58 (26.0%) zerosZeros
무역수지(2013 / 천달러) has 58 (26.0%) zerosZeros
무역수지(2014 / 천달러) has 62 (27.8%) zerosZeros

Reproduction

Analysis started2023-12-12 08:04:05.461826
Analysis finished2023-12-12 08:04:06.958706
Duration1.5 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분 류
Text

UNIQUE 

Distinct223
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T17:04:07.305598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.5067265
Min length1

Characters and Unicode

Total characters1005
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique223 ?
Unique (%)100.0%

Sample

1st rowA
2nd rowA01
3rd rowA0101
4th rowA0102
5th rowA02
ValueCountFrequency (%)
a 1
 
0.4%
d0104 1
 
0.4%
d0201 1
 
0.4%
d0202 1
 
0.4%
d0203 1
 
0.4%
d0204 1
 
0.4%
d03 1
 
0.4%
d0301 1
 
0.4%
d0302 1
 
0.4%
d0303 1
 
0.4%
Other values (213) 213
95.5%
2023-12-12T17:04:07.951592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 376
37.4%
1 90
 
9.0%
2 76
 
7.6%
3 61
 
6.1%
D 53
 
5.3%
4 52
 
5.2%
B 52
 
5.2%
A 50
 
5.0%
5 47
 
4.7%
E 35
 
3.5%
Other values (5) 113
 
11.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 782
77.8%
Uppercase Letter 223
 
22.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 376
48.1%
1 90
 
11.5%
2 76
 
9.7%
3 61
 
7.8%
4 52
 
6.6%
5 47
 
6.0%
6 30
 
3.8%
7 23
 
2.9%
8 17
 
2.2%
9 10
 
1.3%
Uppercase Letter
ValueCountFrequency (%)
D 53
23.8%
B 52
23.3%
A 50
22.4%
E 35
15.7%
C 33
14.8%

Most occurring scripts

ValueCountFrequency (%)
Common 782
77.8%
Latin 223
 
22.2%

Most frequent character per script

Common
ValueCountFrequency (%)
0 376
48.1%
1 90
 
11.5%
2 76
 
9.7%
3 61
 
7.8%
4 52
 
6.6%
5 47
 
6.0%
6 30
 
3.8%
7 23
 
2.9%
8 17
 
2.2%
9 10
 
1.3%
Latin
ValueCountFrequency (%)
D 53
23.8%
B 52
23.3%
A 50
22.4%
E 35
15.7%
C 33
14.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1005
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 376
37.4%
1 90
 
9.0%
2 76
 
7.6%
3 61
 
6.1%
D 53
 
5.3%
4 52
 
5.2%
B 52
 
5.2%
A 50
 
5.0%
5 47
 
4.7%
E 35
 
3.5%
Other values (5) 113
 
11.2%

광물
Text

Distinct212
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-12T17:04:08.358341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length19
Mean length6.3497758
Min length2

Characters and Unicode

Total characters1416
Distinct characters221
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique206 ?
Unique (%)92.4%

Sample

1st row광물
2nd row규산질 원료
3rd row규사
4th row규조토
5th row규산알루미늄 원료
ValueCountFrequency (%)
기타 26
 
7.1%
부품 22
 
6.0%
원료 15
 
4.1%
13
 
3.6%
세라믹 12
 
3.3%
제품 6
 
1.6%
반도체 4
 
1.1%
도자기 4
 
1.1%
4
 
1.1%
복합산화물 4
 
1.1%
Other values (226) 254
69.8%
2023-12-12T17:04:08.839027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
141
 
10.0%
50
 
3.5%
46
 
3.2%
45
 
3.2%
43
 
3.0%
35
 
2.5%
33
 
2.3%
32
 
2.3%
31
 
2.2%
28
 
2.0%
Other values (211) 932
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1237
87.4%
Space Separator 141
 
10.0%
Other Punctuation 10
 
0.7%
Uppercase Letter 10
 
0.7%
Close Punctuation 6
 
0.4%
Open Punctuation 6
 
0.4%
Lowercase Letter 5
 
0.4%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
50
 
4.0%
46
 
3.7%
45
 
3.6%
43
 
3.5%
35
 
2.8%
33
 
2.7%
32
 
2.6%
31
 
2.5%
28
 
2.3%
24
 
1.9%
Other values (196) 870
70.3%
Lowercase Letter
ValueCountFrequency (%)
e 1
20.0%
l 1
20.0%
u 1
20.0%
d 1
20.0%
o 1
20.0%
Uppercase Letter
ValueCountFrequency (%)
L 3
30.0%
E 3
30.0%
D 3
30.0%
M 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
/ 9
90.0%
· 1
 
10.0%
Space Separator
ValueCountFrequency (%)
141
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1237
87.4%
Common 164
 
11.6%
Latin 15
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
50
 
4.0%
46
 
3.7%
45
 
3.6%
43
 
3.5%
35
 
2.8%
33
 
2.7%
32
 
2.6%
31
 
2.5%
28
 
2.3%
24
 
1.9%
Other values (196) 870
70.3%
Latin
ValueCountFrequency (%)
L 3
20.0%
E 3
20.0%
D 3
20.0%
e 1
 
6.7%
l 1
 
6.7%
u 1
 
6.7%
d 1
 
6.7%
o 1
 
6.7%
M 1
 
6.7%
Common
ValueCountFrequency (%)
141
86.0%
/ 9
 
5.5%
) 6
 
3.7%
( 6
 
3.7%
· 1
 
0.6%
1 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1237
87.4%
ASCII 178
 
12.6%
None 1
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
141
79.2%
/ 9
 
5.1%
) 6
 
3.4%
( 6
 
3.4%
L 3
 
1.7%
E 3
 
1.7%
D 3
 
1.7%
1 1
 
0.6%
e 1
 
0.6%
l 1
 
0.6%
Other values (4) 4
 
2.2%
Hangul
ValueCountFrequency (%)
50
 
4.0%
46
 
3.7%
45
 
3.6%
43
 
3.5%
35
 
2.8%
33
 
2.7%
32
 
2.6%
31
 
2.5%
28
 
2.3%
24
 
1.9%
Other values (196) 870
70.3%
None
ValueCountFrequency (%)
· 1
100.0%

무역수지(2012 / 천달러)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct154
Distinct (%)69.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-48994.009
Minimum-1845302
Maximum46467
Zeros58
Zeros (%)26.0%
Negative133
Negative (%)59.6%
Memory size2.1 KiB
2023-12-12T17:04:09.030101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-1845302
5-th percentile-175871.7
Q1-15187.5
median-476
Q30
95-th percentile1366.5
Maximum46467
Range1891769
Interquartile range (IQR)15187.5

Descriptive statistics

Standard deviation207684.86
Coefficient of variation (CV)-4.2389849
Kurtosis49.478764
Mean-48994.009
Median Absolute Deviation (MAD)1478
Skewness-6.8393629
Sum-10925664
Variance4.3133003 × 1010
MonotonicityNot monotonic
2023-12-12T17:04:09.210730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 58
 
26.0%
2809 2
 
0.9%
-276 2
 
0.9%
-42739 2
 
0.9%
-23796 2
 
0.9%
6 2
 
0.9%
-162 2
 
0.9%
86 2
 
0.9%
-294 2
 
0.9%
1397 2
 
0.9%
Other values (144) 147
65.9%
ValueCountFrequency (%)
-1845302 1
0.4%
-1622943 1
0.4%
-1280605 1
0.4%
-1271443 1
0.4%
-404272 1
0.4%
-319867 1
0.4%
-285342 1
0.4%
-266176 1
0.4%
-237250 1
0.4%
-196492 1
0.4%
ValueCountFrequency (%)
46467 1
0.4%
11790 1
0.4%
6803 1
0.4%
5699 1
0.4%
4085 1
0.4%
3064 1
0.4%
2809 2
0.9%
1921 1
0.4%
1554 1
0.4%
1397 2
0.9%

무역수지(2013 / 천달러)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct156
Distinct (%)70.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-37859.821
Minimum-1218830
Maximum39461
Zeros58
Zeros (%)26.0%
Negative134
Negative (%)60.1%
Memory size2.1 KiB
2023-12-12T17:04:09.393536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-1218830
5-th percentile-125474.9
Q1-12456.5
median-606
Q30
95-th percentile1669
Maximum39461
Range1258291
Interquartile range (IQR)12456.5

Descriptive statistics

Standard deviation145947.14
Coefficient of variation (CV)-3.8549348
Kurtosis43.387693
Mean-37859.821
Median Absolute Deviation (MAD)1822
Skewness-6.3584819
Sum-8442740
Variance2.1300567 × 1010
MonotonicityNot monotonic
2023-12-12T17:04:09.596452image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 58
 
26.0%
2130 2
 
0.9%
-13986 2
 
0.9%
-47 2
 
0.9%
-104 2
 
0.9%
93 2
 
0.9%
-15 2
 
0.9%
1565 2
 
0.9%
-163 2
 
0.9%
1669 2
 
0.9%
Other values (146) 147
65.9%
ValueCountFrequency (%)
-1218830 1
0.4%
-1134586 1
0.4%
-956428 1
0.4%
-844124 1
0.4%
-382220 1
0.4%
-293438 1
0.4%
-281065 1
0.4%
-233951 1
0.4%
-204374 1
0.4%
-186686 1
0.4%
ValueCountFrequency (%)
39461 1
0.4%
12160 1
0.4%
9376 1
0.4%
6635 1
0.4%
4183 1
0.4%
3521 1
0.4%
3195 1
0.4%
2130 2
0.9%
1807 1
0.4%
1798 1
0.4%

무역수지(2014 / 천달러)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct153
Distinct (%)68.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-29788.283
Minimum-1049735
Maximum59454
Zeros62
Zeros (%)27.8%
Negative128
Negative (%)57.4%
Memory size2.1 KiB
2023-12-12T17:04:09.804304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-1049735
5-th percentile-104177.5
Q1-10638
median-294
Q30
95-th percentile3556.8
Maximum59454
Range1109189
Interquartile range (IQR)10638

Descriptive statistics

Standard deviation109231.81
Coefficient of variation (CV)-3.6669387
Kurtosis47.412485
Mean-29788.283
Median Absolute Deviation (MAD)1604
Skewness-6.3822074
Sum-6642787
Variance1.1931588 × 1010
MonotonicityNot monotonic
2023-12-12T17:04:10.001688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 62
27.8%
3626 2
 
0.9%
-14275 2
 
0.9%
6368 2
 
0.9%
112 2
 
0.9%
-21 2
 
0.9%
3574 2
 
0.9%
-166 2
 
0.9%
1627 2
 
0.9%
-61687 2
 
0.9%
Other values (143) 143
64.1%
ValueCountFrequency (%)
-1049735 1
0.4%
-753561 1
0.4%
-608832 1
0.4%
-534761 1
0.4%
-340927 1
0.4%
-268329 1
0.4%
-250193 1
0.4%
-231275 1
0.4%
-193710 1
0.4%
-143263 1
0.4%
ValueCountFrequency (%)
59454 1
0.4%
29613 1
0.4%
15881 1
0.4%
6368 2
0.9%
5469 1
0.4%
4761 1
0.4%
4296 1
0.4%
3626 2
0.9%
3574 2
0.9%
3402 1
0.4%

Interactions

2023-12-12T17:04:06.359555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:05.683159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:06.047152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:06.537118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:05.810409image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:06.148132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:06.646708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:05.934473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T17:04:06.249601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T17:04:10.115545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
무역수지(2012 / 천달러)무역수지(2013 / 천달러)무역수지(2014 / 천달러)
무역수지(2012 / 천달러)1.0000.9810.948
무역수지(2013 / 천달러)0.9811.0000.953
무역수지(2014 / 천달러)0.9480.9531.000
2023-12-12T17:04:10.244042image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
무역수지(2012 / 천달러)무역수지(2013 / 천달러)무역수지(2014 / 천달러)
무역수지(2012 / 천달러)1.0000.9250.875
무역수지(2013 / 천달러)0.9251.0000.929
무역수지(2014 / 천달러)0.8750.9291.000

Missing values

2023-12-12T17:04:06.785697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T17:04:06.907101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분 류광물무역수지(2012 / 천달러)무역수지(2013 / 천달러)무역수지(2014 / 천달러)
0A광물-52319-40195-36428
1A01규산질 원료-256-675-2656
2A0101규사138-430-2388
3A0102규조토-394-244-268
4A02규산알루미늄 원료-2633-42721
5A0201실리마나이트족 광물000
6A0202카올린족 광물-955-606-495
7A0203엽납석408541834296
8A0204점토-5764-4004-3780
9A03알루미나 원료-18-15-21
분 류광물무역수지(2012 / 천달러)무역수지(2013 / 천달러)무역수지(2014 / 천달러)
213E0601필터-958-383-223
214E0602촉매담체-2506-2523-2584
215E0603기타000
216E07열적 세라믹 부품-3676-4377-4096
217E0701내열세라믹 부품000
218E0702발열용 부품-1889-3278-3766
219E0703금속제조용 부품-1787-1099-329
220E08방탄 세라믹 부품000
221E0801방탄용 부품000
222E0802기타000