Overview

Dataset statistics

Number of variables10
Number of observations1354
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory109.9 KiB
Average record size in memory83.1 B

Variable types

Numeric1
Categorical6
Text2
DateTime1

Dataset

Description부산광역시_식품방사능검사현황_20231231
Author부산광역시
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15083358

Alerts

요오드검출량 베크렐_킬로그램(Bq_kg) has constant value ""Constant
적부판정 has constant value ""Constant
수입국 is highly overall correlated with 원산지 and 1 other fieldsHigh correlation
원산지 is highly overall correlated with 수입국High correlation
세슘검출량 베크렐_킬로그램(Bq_kg) is highly overall correlated with 수입국High correlation
수입국 is highly imbalanced (55.4%)Imbalance
세슘검출량 베크렐_킬로그램(Bq_kg) is highly imbalanced (98.4%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-30 09:10:50.379076
Analysis finished2024-03-30 09:10:56.894178
Duration6.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1354
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean677.5
Minimum1
Maximum1354
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.0 KiB
2024-03-30T09:10:57.173075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile68.65
Q1339.25
median677.5
Q31015.75
95-th percentile1286.35
Maximum1354
Range1353
Interquartile range (IQR)676.5

Descriptive statistics

Standard deviation391.01044
Coefficient of variation (CV)0.57713719
Kurtosis-1.2
Mean677.5
Median Absolute Deviation (MAD)338.5
Skewness0
Sum917335
Variance152889.17
MonotonicityStrictly increasing
2024-03-30T09:10:57.833861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
901 1
 
0.1%
909 1
 
0.1%
908 1
 
0.1%
907 1
 
0.1%
906 1
 
0.1%
905 1
 
0.1%
904 1
 
0.1%
903 1
 
0.1%
902 1
 
0.1%
Other values (1344) 1344
99.3%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1354 1
0.1%
1353 1
0.1%
1352 1
0.1%
1351 1
0.1%
1350 1
0.1%
1349 1
0.1%
1348 1
0.1%
1347 1
0.1%
1346 1
0.1%
1345 1
0.1%

분류
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
수산물
888 
가공식품
283 
농산물
183 

Length

Max length4
Median length3
Mean length3.2090103
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row농산물
2nd row농산물
3rd row농산물
4th row가공식품
5th row가공식품

Common Values

ValueCountFrequency (%)
수산물 888
65.6%
가공식품 283
 
20.9%
농산물 183
 
13.5%

Length

2024-03-30T09:10:58.357353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T09:10:58.916106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수산물 888
65.6%
가공식품 283
 
20.9%
농산물 183
 
13.5%
Distinct723
Distinct (%)53.4%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
2024-03-30T09:10:59.617499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length59
Mean length6.3618907
Min length1

Characters and Unicode

Total characters8614
Distinct characters539
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique572 ?
Unique (%)42.2%

Sample

1st row로컬대파
2nd row김제 한입 고구마
3rd row대추 방울 토마토
4th rowKIKKOMAN SOY SAUCE
5th rowOIGATSUO TSUYU
ValueCountFrequency (%)
고등어 47
 
2.5%
오징어 32
 
1.7%
참돔 28
 
1.5%
농어 28
 
1.5%
광어 27
 
1.4%
가자미 27
 
1.4%
동태 25
 
1.3%
삼치 24
 
1.3%
돌돔 22
 
1.2%
sauce 17
 
0.9%
Other values (1009) 1634
85.5%
2024-03-30T09:11:00.885248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
557
 
6.5%
288
 
3.3%
A 227
 
2.6%
I 210
 
2.4%
O 196
 
2.3%
S 191
 
2.2%
E 178
 
2.1%
) 177
 
2.1%
( 177
 
2.1%
U 151
 
1.8%
Other values (529) 6262
72.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5247
60.9%
Uppercase Letter 2268
26.3%
Space Separator 557
 
6.5%
Close Punctuation 178
 
2.1%
Open Punctuation 178
 
2.1%
Decimal Number 118
 
1.4%
Other Punctuation 28
 
0.3%
Lowercase Letter 26
 
0.3%
Dash Punctuation 12
 
0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
288
 
5.5%
143
 
2.7%
125
 
2.4%
122
 
2.3%
97
 
1.8%
89
 
1.7%
86
 
1.6%
85
 
1.6%
85
 
1.6%
84
 
1.6%
Other values (463) 4043
77.1%
Uppercase Letter
ValueCountFrequency (%)
A 227
 
10.0%
I 210
 
9.3%
O 196
 
8.6%
S 191
 
8.4%
E 178
 
7.8%
U 151
 
6.7%
K 133
 
5.9%
N 123
 
5.4%
R 122
 
5.4%
M 104
 
4.6%
Other values (16) 633
27.9%
Lowercase Letter
ValueCountFrequency (%)
g 3
11.5%
i 3
11.5%
m 2
 
7.7%
s 2
 
7.7%
l 2
 
7.7%
e 2
 
7.7%
r 2
 
7.7%
k 1
 
3.8%
c 1
 
3.8%
w 1
 
3.8%
Other values (7) 7
26.9%
Decimal Number
ValueCountFrequency (%)
2 31
26.3%
0 31
26.3%
1 22
18.6%
3 10
 
8.5%
5 7
 
5.9%
7 5
 
4.2%
9 5
 
4.2%
8 3
 
2.5%
4 2
 
1.7%
6 2
 
1.7%
Other Punctuation
ValueCountFrequency (%)
. 9
32.1%
% 8
28.6%
& 5
17.9%
, 4
14.3%
/ 1
 
3.6%
! 1
 
3.6%
Close Punctuation
ValueCountFrequency (%)
) 177
99.4%
] 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 177
99.4%
[ 1
 
0.6%
Space Separator
ValueCountFrequency (%)
557
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5231
60.7%
Latin 2294
26.6%
Common 1073
 
12.5%
Han 16
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
288
 
5.5%
143
 
2.7%
125
 
2.4%
122
 
2.3%
97
 
1.9%
89
 
1.7%
86
 
1.6%
85
 
1.6%
85
 
1.6%
84
 
1.6%
Other values (457) 4027
77.0%
Latin
ValueCountFrequency (%)
A 227
 
9.9%
I 210
 
9.2%
O 196
 
8.5%
S 191
 
8.3%
E 178
 
7.8%
U 151
 
6.6%
K 133
 
5.8%
N 123
 
5.4%
R 122
 
5.3%
M 104
 
4.5%
Other values (33) 659
28.7%
Common
ValueCountFrequency (%)
557
51.9%
) 177
 
16.5%
( 177
 
16.5%
2 31
 
2.9%
0 31
 
2.9%
1 22
 
2.1%
- 12
 
1.1%
3 10
 
0.9%
. 9
 
0.8%
% 8
 
0.7%
Other values (13) 39
 
3.6%
Han
ValueCountFrequency (%)
5
31.2%
4
25.0%
4
25.0%
1
 
6.2%
1
 
6.2%
1
 
6.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5231
60.7%
ASCII 3367
39.1%
CJK 16
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
557
16.5%
A 227
 
6.7%
I 210
 
6.2%
O 196
 
5.8%
S 191
 
5.7%
E 178
 
5.3%
) 177
 
5.3%
( 177
 
5.3%
U 151
 
4.5%
K 133
 
4.0%
Other values (56) 1170
34.7%
Hangul
ValueCountFrequency (%)
288
 
5.5%
143
 
2.7%
125
 
2.4%
122
 
2.3%
97
 
1.9%
89
 
1.7%
86
 
1.6%
85
 
1.6%
85
 
1.6%
84
 
1.6%
Other values (457) 4027
77.0%
CJK
ValueCountFrequency (%)
5
31.2%
4
25.0%
4
25.0%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Distinct60
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
2024-03-30T09:11:01.554743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.2104874
Min length1

Characters and Unicode

Total characters4347
Distinct characters100
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique40 ?
Unique (%)3.0%

Sample

1st row식물성
2nd row식물성
3rd row식물성
4th row가공식품
5th row가공식품
ValueCountFrequency (%)
동물성 798
58.4%
가공식품 235
 
17.2%
식물성 202
 
14.8%
수산물 13
 
1.0%
기타 13
 
1.0%
수산물가공품 13
 
1.0%
어묵 10
 
0.7%
소스 8
 
0.6%
양념젓갈 6
 
0.4%
오징어 3
 
0.2%
Other values (51) 66
 
4.8%
2024-03-30T09:11:02.801276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1027
23.6%
1000
23.0%
800
18.4%
439
10.1%
250
 
5.8%
249
 
5.7%
249
 
5.7%
29
 
0.7%
27
 
0.6%
23
 
0.5%
Other values (90) 254
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4329
99.6%
Space Separator 13
 
0.3%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1027
23.7%
1000
23.1%
800
18.5%
439
10.1%
250
 
5.8%
249
 
5.8%
249
 
5.8%
29
 
0.7%
27
 
0.6%
23
 
0.5%
Other values (86) 236
 
5.5%
Space Separator
ValueCountFrequency (%)
13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4329
99.6%
Common 18
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1027
23.7%
1000
23.1%
800
18.5%
439
10.1%
250
 
5.8%
249
 
5.8%
249
 
5.8%
29
 
0.7%
27
 
0.6%
23
 
0.5%
Other values (86) 236
 
5.5%
Common
ValueCountFrequency (%)
13
72.2%
) 2
 
11.1%
( 2
 
11.1%
. 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4329
99.6%
ASCII 18
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1027
23.7%
1000
23.1%
800
18.5%
439
10.1%
250
 
5.8%
249
 
5.8%
249
 
5.8%
29
 
0.7%
27
 
0.6%
23
 
0.5%
Other values (86) 236
 
5.5%
ASCII
ValueCountFrequency (%)
13
72.2%
) 2
 
11.1%
( 2
 
11.1%
. 1
 
5.6%
Distinct108
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
Minimum2023-02-20 00:00:00
Maximum2023-12-14 00:00:00
2024-03-30T09:11:03.279710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-30T09:11:03.690350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

원산지
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
국내산
809 
수입산
545 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내산
2nd row국내산
3rd row국내산
4th row수입산
5th row수입산

Common Values

ValueCountFrequency (%)
국내산 809
59.7%
수입산 545
40.3%

Length

2024-03-30T09:11:04.048329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T09:11:04.371978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내산 809
59.7%
수입산 545
40.3%

수입국
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct44
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
대한민국
812 
일본
169 
중국
82 
미국
 
58
러시아
 
45
Other values (39)
188 

Length

Max length6
Median length4
Mean length3.3810931
Min length2

Unique

Unique15 ?
Unique (%)1.1%

Sample

1st row대한민국
2nd row대한민국
3rd row대한민국
4th row일본
5th row일본

Common Values

ValueCountFrequency (%)
대한민국 812
60.0%
일본 169
 
12.5%
중국 82
 
6.1%
미국 58
 
4.3%
러시아 45
 
3.3%
태국 28
 
2.1%
베트남 27
 
2.0%
노르웨이 18
 
1.3%
대만 12
 
0.9%
인도네시아 9
 
0.7%
Other values (34) 94
 
6.9%

Length

2024-03-30T09:11:04.877386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대한민국 812
60.0%
일본 169
 
12.5%
중국 82
 
6.1%
미국 58
 
4.3%
러시아 45
 
3.3%
태국 28
 
2.1%
베트남 27
 
2.0%
노르웨이 18
 
1.3%
대만 12
 
0.9%
인도네시아 9
 
0.7%
Other values (33) 94
 
6.9%

세슘검출량 베크렐_킬로그램(Bq_kg)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
0
1351 
1
 
2
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 1351
99.8%
1 2
 
0.1%
2 1
 
0.1%

Length

2024-03-30T09:11:05.312888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T09:11:05.655797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 1351
99.8%
1 2
 
0.1%
2 1
 
0.1%
Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
0
1354 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 1354
100.0%

Length

2024-03-30T09:11:06.106984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T09:11:06.462258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 1354
100.0%

적부판정
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.7 KiB
적합
1354 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row적합
2nd row적합
3rd row적합
4th row적합
5th row적합

Common Values

ValueCountFrequency (%)
적합 1354
100.0%

Length

2024-03-30T09:11:06.951635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-30T09:11:07.514623image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
적합 1354
100.0%

Interactions

2024-03-30T09:10:55.451451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-30T09:11:07.740297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분류품목(또는 식품유형)원산지수입국세슘검출량 베크렐_킬로그램(Bq_kg)
연번1.0000.4380.6140.1640.2970.000
분류0.4381.0000.9960.0730.6620.097
품목(또는 식품유형)0.6140.9961.0000.3120.7720.000
원산지0.1640.0730.3121.0001.0000.026
수입국0.2970.6620.7721.0001.0000.963
세슘검출량 베크렐_킬로그램(Bq_kg)0.0000.0970.0000.0260.9631.000
2024-03-30T09:11:08.052446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
세슘검출량 베크렐_킬로그램(Bq_kg)분류수입국원산지
세슘검출량 베크렐_킬로그램(Bq_kg)1.0000.0290.8500.043
분류0.0291.0000.4190.120
수입국0.8500.4191.0000.980
원산지0.0430.1200.9801.000
2024-03-30T09:11:08.402679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번분류원산지수입국세슘검출량 베크렐_킬로그램(Bq_kg)
연번1.0000.2930.1250.1050.000
분류0.2931.0000.1200.4190.029
원산지0.1250.1201.0000.9800.043
수입국0.1050.4190.9801.0000.850
세슘검출량 베크렐_킬로그램(Bq_kg)0.0000.0290.0430.8501.000

Missing values

2024-03-30T09:10:55.939842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-30T09:10:56.639862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번분류제품명품목(또는 식품유형)수거일원산지수입국세슘검출량 베크렐_킬로그램(Bq_kg)요오드검출량 베크렐_킬로그램(Bq_kg)적부판정
01농산물로컬대파식물성2023-02-21국내산대한민국00적합
12농산물김제 한입 고구마식물성2023-02-21국내산대한민국00적합
23농산물대추 방울 토마토식물성2023-02-21국내산대한민국00적합
34가공식품KIKKOMAN SOY SAUCE가공식품2023-02-21수입산일본00적합
45가공식품OIGATSUO TSUYU가공식품2023-02-21수입산일본00적합
56가공식품SINSUETI MISO(AWASE MISO)가공식품2023-02-21수입산일본00적합
67수산물달고기순살동물성2023-02-24국내산대한민국00적합
78수산물대구순살동물성2023-02-24국내산대한민국00적합
89수산물삼치순살동물성2023-02-24국내산대한민국00적합
910수산물명태절단동물성2023-02-24국내산대한민국00적합
연번분류제품명품목(또는 식품유형)수거일원산지수입국세슘검출량 베크렐_킬로그램(Bq_kg)요오드검출량 베크렐_킬로그램(Bq_kg)적부판정
13441345농산물가시오이식물성2023-11-28국내산대한민국00적합
13451346가공식품수라 양반 돼지고기김치찌개가공식품2023-11-28국내산대한민국00적합
13461347가공식품동원참치DHA가공식품2023-11-28국내산대한민국00적합
13471348수산물깍다구동물성2023-12-04국내산대한민국00적합
13481349수산물아귀동물성2023-12-04국내산대한민국00적합
13491350수산물칼치동물성2023-12-04국내산대한민국00적합
13501351수산물물메기동물성2023-12-04국내산대한민국00적합
13511352수산물대구동물성2023-12-04국내산대한민국00적합
13521353수산물고등어동물성2023-12-04국내산대한민국00적합
13531354수산물오징어동물성2023-12-04국내산대한민국00적합