Overview

Dataset statistics

Number of variables5
Number of observations562
Missing cells0
Missing cells (%)0.0%
Duplicate rows51
Duplicate rows (%)9.1%
Total size in memory22.6 KiB
Average record size in memory41.2 B

Variable types

DateTime1
Categorical2
Text1
Numeric1

Dataset

Description가축거래 상인에 대한 등록현황
Author농림축산식품부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220216000000001969

Alerts

최종수정일시 has constant value ""Constant
Dataset has 51 (9.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-11 03:46:08.268752
Analysis finished2023-12-11 03:46:08.827734
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

최종수정일시
Date

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
Minimum2022-06-01 00:00:00
Maximum2022-06-01 00:00:00
2023-12-11T12:46:08.890719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T12:46:09.017914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

시도명
Categorical

Distinct17
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
경기도
88 
전라남도
81 
경상북도
67 
충청남도
66 
경상남도
51 
Other values (12)
209 

Length

Max length7
Median length4
Mean length3.9483986
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경상남도
2nd row경상남도
3rd row경상남도
4th row경상남도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 88
15.7%
전라남도 81
14.4%
경상북도 67
11.9%
충청남도 66
11.7%
경상남도 51
9.1%
강원도 47
8.4%
전라북도 44
7.8%
충청북도 32
 
5.7%
광주광역시 16
 
2.8%
부산광역시 14
 
2.5%
Other values (7) 56
10.0%

Length

2023-12-11T12:46:09.167891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 88
15.7%
전라남도 81
14.4%
경상북도 67
11.9%
충청남도 66
11.7%
경상남도 51
9.1%
강원도 47
8.4%
전라북도 44
7.8%
충청북도 32
 
5.7%
광주광역시 16
 
2.8%
부산광역시 14
 
2.5%
Other values (7) 56
10.0%
Distinct173
Distinct (%)30.8%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
2023-12-11T12:46:09.528233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length8
Mean length7.8879004
Min length7

Characters and Unicode

Total characters4433
Distinct characters124
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)5.7%

Sample

1st row경상남도 통영시
2nd row경상남도 밀양시
3rd row경상남도 밀양시
4th row경상남도 의령군
5th row경기도 화성시
ValueCountFrequency (%)
경기도 88
 
7.9%
전라남도 81
 
7.2%
경상북도 67
 
6.0%
충청남도 66
 
5.9%
경상남도 51
 
4.5%
강원도 47
 
4.2%
전라북도 44
 
3.9%
충청북도 32
 
2.9%
광주광역시 16
 
1.4%
부산광역시 14
 
1.2%
Other values (169) 615
54.9%
2023-12-11T12:46:10.040511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
559
 
12.6%
490
 
11.1%
344
 
7.8%
233
 
5.3%
228
 
5.1%
217
 
4.9%
156
 
3.5%
138
 
3.1%
125
 
2.8%
122
 
2.8%
Other values (114) 1821
41.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3874
87.4%
Space Separator 559
 
12.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
490
 
12.6%
344
 
8.9%
233
 
6.0%
228
 
5.9%
217
 
5.6%
156
 
4.0%
138
 
3.6%
125
 
3.2%
122
 
3.1%
115
 
3.0%
Other values (113) 1706
44.0%
Space Separator
ValueCountFrequency (%)
559
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3874
87.4%
Common 559
 
12.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
490
 
12.6%
344
 
8.9%
233
 
6.0%
228
 
5.9%
217
 
5.6%
156
 
4.0%
138
 
3.6%
125
 
3.2%
122
 
3.1%
115
 
3.0%
Other values (113) 1706
44.0%
Common
ValueCountFrequency (%)
559
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3874
87.4%
ASCII 559
 
12.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
559
100.0%
Hangul
ValueCountFrequency (%)
490
 
12.6%
344
 
8.9%
233
 
6.0%
228
 
5.9%
217
 
5.6%
156
 
4.0%
138
 
3.6%
125
 
3.2%
122
 
3.1%
115
 
3.0%
Other values (113) 1706
44.0%

축종명
Categorical

Distinct5
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size4.5 KiB
229 
186 
오리
76 
돼지
44 
기타
27 

Length

Max length2
Median length1
Mean length1.2615658
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row돼지

Common Values

ValueCountFrequency (%)
229
40.7%
186
33.1%
오리 76
 
13.5%
돼지 44
 
7.8%
기타 27
 
4.8%

Length

2023-12-11T12:46:10.219549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T12:46:10.352387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
229
40.7%
186
33.1%
오리 76
 
13.5%
돼지 44
 
7.8%
기타 27
 
4.8%

농가수
Real number (ℝ)

Distinct22
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.6761566
Minimum1
Maximum58
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.1 KiB
2023-12-11T12:46:10.502528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile8
Maximum58
Range57
Interquartile range (IQR)2

Descriptive statistics

Standard deviation3.8075496
Coefficient of variation (CV)1.4227679
Kurtosis83.619182
Mean2.6761566
Median Absolute Deviation (MAD)0
Skewness7.0811102
Sum1504
Variance14.497434
MonotonicityNot monotonic
2023-12-11T12:46:10.650372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
1 303
53.9%
2 94
 
16.7%
3 53
 
9.4%
4 37
 
6.6%
5 16
 
2.8%
7 12
 
2.1%
6 12
 
2.1%
8 9
 
1.6%
9 6
 
1.1%
14 4
 
0.7%
Other values (12) 16
 
2.8%
ValueCountFrequency (%)
1 303
53.9%
2 94
 
16.7%
3 53
 
9.4%
4 37
 
6.6%
5 16
 
2.8%
6 12
 
2.1%
7 12
 
2.1%
8 9
 
1.6%
9 6
 
1.1%
10 3
 
0.5%
ValueCountFrequency (%)
58 1
 
0.2%
23 1
 
0.2%
21 1
 
0.2%
20 1
 
0.2%
19 1
 
0.2%
17 2
0.4%
16 2
0.4%
15 1
 
0.2%
14 4
0.7%
13 1
 
0.2%

Interactions

2023-12-11T12:46:08.507093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T12:46:10.733164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명축종명농가수
시도명1.0000.2480.000
축종명0.2481.0000.251
농가수0.0000.2511.000
2023-12-11T12:46:10.824421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축종명시도명
축종명1.0000.128
시도명0.1281.000
2023-12-11T12:46:10.912920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
농가수시도명축종명
농가수1.0000.0000.096
시도명0.0001.0000.128
축종명0.0960.1281.000

Missing values

2023-12-11T12:46:08.659549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:46:08.767455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

최종수정일시시도명시군구명축종명농가수
02022-06-01경상남도경상남도 통영시1
12022-06-01경상남도경상남도 밀양시2
22022-06-01경상남도경상남도 밀양시1
32022-06-01경상남도경상남도 의령군1
42022-06-01경기도경기도 화성시돼지1
52022-06-01경기도경기도 화성시오리1
62022-06-01경기도경기도 화성시기타1
72022-06-01경기도경기도 광주시오리1
82022-06-01경기도경기도 광주시돼지1
92022-06-01경기도경기도 양주시1
최종수정일시시도명시군구명축종명농가수
5522022-06-01전라남도전라남도 영암군6
5532022-06-01전라남도전라남도 무안군4
5542022-06-01경상북도경상북도 봉화군4
5552022-06-01경상북도경상북도 문경시기타1
5562022-06-01경상북도경상북도 경산시1
5572022-06-01경상북도경상북도 고령군오리2
5582022-06-01경상북도경상북도 성주군4
5592022-06-01경기도경기도 이천시1
5602022-06-01경기도경기도 이천시1
5612022-06-01경기도경기도 안성시6

Duplicate rows

Most frequently occurring

최종수정일시시도명시군구명축종명농가수# duplicates
382022-06-01전라남도전라남도 강진군기타17
22022-06-01강원도강원도 횡성군13
82022-06-01경기도경기도 이천시13
252022-06-01경상북도경상북도 청도군13
02022-06-01강원도강원도 양양군12
12022-06-01강원도강원도 춘천시12
32022-06-01경기도경기도 광주시돼지12
42022-06-01경기도경기도 광주시오리12
52022-06-01경기도경기도 안성시12
62022-06-01경기도경기도 양주시12