Overview

Dataset statistics

Number of variables: 5
Number of observations: 51
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.2 KiB
Average record size in memory: 44.6 B

Variable types

Numeric: 2
Text: 2
Categorical: 1

Dataset

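Description: Data on the status of poultry farms in Gangjin-gun, Jeollanam-do, including the business name (사업장명칭), livestock species (축종), number of animals raised (사육두수), and business location (사업장소재지).
URL: https://www.data.go.kr/data/15076983/fileData.do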

Alerts

연번 is highly overall correlated with 사육두수 and 1 other field (High correlation)
사육두수 is highly overall correlated with 연번 and 1 other field (High correlation)
축종 is highly overall correlated with 연번 and 1 other field (High correlation)
연번 has unique values (Unique)
사업장소재지 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 06:11:28.784085
Analysis finished: 2023-12-12 06:11:29.705981
Duration: 0.92 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 51
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 26
Minimum: 1
Maximum: 51
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 591.0 B

Quantile statistics

Minimum: 1
5th percentile: 3.5
Q1: 13.5
Median: 26
Q3: 38.5
95th percentile: 48.5
Maximum: 51
Range: 50
Interquartile range (IQR): 25

Descriptive statistics

Standard deviation: 14.866069
Coefficient of variation (CV): 0.57177187
Kurtosis: -1.2
Mean: 26
Median Absolute Deviation (MAD): 13
Skewness: 0
Sum: 1326
Variance: 221
Monotonicity: Strictly increasing
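
Because 연번 is a strictly increasing serial number running from 1 to 51, the figures above can be re-derived without the original file. The following is a minimal sketch using only the Python standard library. The `percentile` helper is a hypothetical name and assumes linear interpolation between order statistics, which reproduces the reported quantiles here but is not confirmed as the profiler's own method.

```python
import math
import statistics

# 연번 runs from 1 to 51 with no gaps (51 distinct values, strictly increasing).
values = list(range(1, 52))

def percentile(sorted_values, q):
    """Hypothetical helper: q-th quantile (0..1) by linear interpolation."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
p5 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)
p95 = percentile(values, 0.95)
iqr = q3 - q1

print(total, mean, variance, round(std_dev, 6), round(cv, 8))
print(p5, q1, median, q3, p95, iqr, mad)
```

The sample variance of the integers 1 to n equals n(n + 1)/12, which gives 51 × 52 / 12 = 221 and agrees with the reported value.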
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 2.0%
2 | 1 | 2.0%
29 | 1 | 2.0%
30 | 1 | 2.0%
31 | 1 | 2.0%
32 | 1 | 2.0%
33 | 1 | 2.0%
34 | 1 | 2.0%
35 | 1 | 2.0%
36 | 1 | 2.0%
Other values (41) | 41 | 80.4%

Value | Count | Frequency (%)
1 | 1 | 2.0%
2 | 1 | 2.0%
3 | 1 | 2.0%
4 | 1 | 2.0%
5 | 1 | 2.0%
6 | 1 | 2.0%
7 | 1 | 2.0%
8 | 1 | 2.0%
9 | 1 | 2.0%
10 | 1 | 2.0%

Value | Count | Frequency (%)
51 | 1 | 2.0%
50 | 1 | 2.0%
49 | 1 | 2.0%
48 | 1 | 2.0%
47 | 1 | 2.0%
46 | 1 | 2.0%
45 | 1 | 2.0%
44 | 1 | 2.0%
43 | 1 | 2.0%
42 | 1 | 2.0%

사업장명칭
Text

Distinct: 49
Distinct (%): 96.1%
Missing: 0
Missing (%): 0.0%
Memory size: 540.0 B

Length

Max length: 8
Median length: 4
Mean length: 4.2941176
Min length: 4

Characters and Unicode

Total characters: 219
Distinct characters: 83
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 47
Unique (%): 92.2%

Sample

1st row: 탐진농장
2nd row: 태시농장
3rd row: 도성농장
4th row: 치영농장
5th row: 에덴농장
Value | Count | Frequency (%)
부성축산 | 2 | 3.9%
구곡농장 | 2 | 3.9%
영풍농장 | 1 | 2.0%
행복농장 | 1 | 2.0%
모작골농장 | 1 | 2.0%
영동마을농장 | 1 | 2.0%
탐진농장 | 1 | 2.0%
황금닭농장 | 1 | 2.0%
초원농장 | 1 | 2.0%
정균농장 | 1 | 2.0%
Other values (39) | 39 | 76.5%

Most occurring characters

Value | Count | Frequency (%)
 | 49 | 22.4%
 | 48 | 21.9%
 | 7 | 3.2%
 | 6 | 2.7%
 | 5 | 2.3%
 | 4 | 1.8%
 | 4 | 1.8%
 | 4 | 1.8%
 | 3 | 1.4%
 | 2 | 0.9%
Other values (73) | 87 | 39.7%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 219 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 49 | 22.4%
 | 48 | 21.9%
 | 7 | 3.2%
 | 6 | 2.7%
 | 5 | 2.3%
 | 4 | 1.8%
 | 4 | 1.8%
 | 4 | 1.8%
 | 3 | 1.4%
 | 2 | 0.9%
Other values (73) | 87 | 39.7%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 219 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 49 | 22.4%
 | 48 | 21.9%
 | 7 | 3.2%
 | 6 | 2.7%
 | 5 | 2.3%
 | 4 | 1.8%
 | 4 | 1.8%
 | 4 | 1.8%
 | 3 | 1.4%
 | 2 | 0.9%
Other values (73) | 87 | 39.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 219 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 49 | 22.4%
 | 48 | 21.9%
 | 7 | 3.2%
 | 6 | 2.7%
 | 5 | 2.3%
 | 4 | 1.8%
 | 4 | 1.8%
 | 4 | 1.8%
 | 3 | 1.4%
 | 2 | 0.9%
Other values (73) | 87 | 39.7%

축종
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 3.9%
Missing: 0
Missing (%): 0.0%
Memory size: 540.0 B

Length

Max length: 2
Median length: 1
Mean length: 1.4705882
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 오리
2nd row: 오리
3rd row: 오리
4th row: 오리
5th row: 오리

Common Values

Value | Count | Frequency (%)
 | 27 | 52.9%
오리 | 24 | 47.1%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
 | 27 | 52.9%
오리 | 24 | 47.1%

사육두수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 31
Distinct (%): 60.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 44190.196
Minimum: 3000
Maximum: 160000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 591.0 B

Quantile statistics

Minimum: 3000
5th percentile: 9000
Q1: 17750
Median: 43000
Q3: 60000
95th percentile: 92500
Maximum: 160000
Range: 157000
Interquartile range (IQR): 42250

Descriptive statistics

Standard deviation: 31564.83
Coefficient of variation (CV): 0.71429487
Kurtosis: 2.014285
Mean: 44190.196
Median Absolute Deviation (MAD): 24500
Skewness: 1.0911757
Sum: 2253700
Variance: 9.963385 × 10^8
Monotonicity: Not monotonic
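
Several of the figures above are derived from others, so they can be cross-checked directly from the reported values. The sketch below uses only numbers printed in this report. The variable names are illustrative, and small differences are expected because the report rounds its output.

```python
import math

# Values copied from the 사육두수 statistics in this report.
n = 51
total = 2253700
minimum, maximum = 3000, 160000
q1, q3 = 17750, 60000
std_dev = 31564.83

mean = total / n                 # reported mean: 44190.196
value_range = maximum - minimum  # reported range: 157000
iqr = q3 - q1                    # reported IQR: 42250
cv = std_dev / mean              # reported CV: 0.71429487
variance = std_dev ** 2          # reported variance: about 9.963385e8

print(round(mean, 3), value_range, iqr, round(cv, 6), f"{variance:.6e}")
```

The mean lies slightly above the median of 43000, which is in line with the positive skewness reported for this variable.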
Histogram with fixed size bins (bins=31)
Value | Count | Frequency (%)
60000 | 8 | 15.7%
50000 | 4 | 7.8%
70000 | 4 | 7.8%
12000 | 3 | 5.9%
9000 | 2 | 3.9%
18500 | 2 | 3.9%
23000 | 2 | 3.9%
43000 | 2 | 3.9%
85000 | 2 | 3.9%
22000 | 1 | 2.0%
Other values (21) | 21 | 41.2%

Value | Count | Frequency (%)
3000 | 1 | 2.0%
8000 | 1 | 2.0%
9000 | 2 | 3.9%
10000 | 1 | 2.0%
11000 | 1 | 2.0%
12000 | 3 | 5.9%
13000 | 1 | 2.0%
15000 | 1 | 2.0%
16600 | 1 | 2.0%
17500 | 1 | 2.0%

Value | Count | Frequency (%)
160000 | 1 | 2.0%
100000 | 1 | 2.0%
95000 | 1 | 2.0%
90000 | 1 | 2.0%
85000 | 2 | 3.9%
80000 | 1 | 2.0%
70000 | 4 | 7.8%
60000 | 8 | 15.7%
55000 | 1 | 2.0%
50000 | 4 | 7.8%

사업장소재지
Text

UNIQUE 

Distinct: 51
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 540.0 B

Length

Max length: 23
Median length: 22
Mean length: 21.431373
Min length: 18

Characters and Unicode

Total characters: 1093
Distinct characters: 65
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 51
Unique (%): 100.0%

Sample

1st row: 전라남도 강진군 군동면 장산리 622
2nd row: 전라남도 강진군 군동면 용소리 75-2
3rd row: 전라남도 강진군 마량면 영동리 51-3
4th row: 전라남도 강진군 도암면 계라리 606-11
5th row: 전라남도 강진군 도암면 석문리 1267-1
Value | Count | Frequency (%)
전라남도 | 51 | 19.9%
강진군 | 51 | 19.9%
도암면 | 11 | 4.3%
칠량면 | 10 | 3.9%
신전면 | 9 | 3.5%
성전면 | 7 | 2.7%
석문리 | 6 | 2.3%
송천리 | 5 | 2.0%
작천면 | 5 | 2.0%
영동리 | 5 | 2.0%
Other values (77) | 96 | 37.5%
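
The word counts above are consistent with splitting each address on whitespace and tallying the resulting tokens. A minimal sketch of that idea follows, using only the five sample rows shown in this section. Whether the profiler tokenizes exactly this way is an assumption.

```python
from collections import Counter

# The five sample addresses listed for 사업장소재지 in this report.
addresses = [
    "전라남도 강진군 군동면 장산리 622",
    "전라남도 강진군 군동면 용소리 75-2",
    "전라남도 강진군 마량면 영동리 51-3",
    "전라남도 강진군 도암면 계라리 606-11",
    "전라남도 강진군 도암면 석문리 1267-1",
]

# Split every address on whitespace and count how often each token occurs.
word_counts = Counter(token for address in addresses for token in address.split())
total_tokens = sum(word_counts.values())

for word, count in word_counts.most_common(4):
    print(word, count, f"{count / total_tokens:.1%}")
```

In the full table the counts sum to 256 tokens, so 전라남도 and 강진군 each account for 51 / 256, or about 19.9%.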

Most occurring characters

Value | Count | Frequency (%)
 | 205 | 18.8%
 | 68 | 6.2%
 | 62 | 5.7%
 | 54 | 4.9%
 | 53 | 4.8%
 | 52 | 4.8%
 | 52 | 4.8%
 | 52 | 4.8%
 | 51 | 4.7%
 | 50 | 4.6%
Other values (55) | 394 | 36.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 664 | 60.8%
Space Separator | 205 | 18.8%
Decimal Number | 188 | 17.2%
Dash Punctuation | 36 | 3.3%
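
The category names above correspond to Unicode general categories. As an illustration, Python's standard `unicodedata` module reports them as two-letter codes (`Lo` for Other Letter, `Zs` for Space Separator, `Nd` for Decimal Number, `Pd` for Dash Punctuation). The sketch below classifies the characters of one sample address from this report. It is not necessarily how the profiler computes its counts.

```python
import unicodedata
from collections import Counter

# One sample address from the 사업장소재지 column.
address = "전라남도 강진군 군동면 용소리 75-2"

# Map each character to its Unicode general category and tally the results.
category_counts = Counter(unicodedata.category(ch) for ch in address)

for category, count in category_counts.most_common():
    print(category, count)
```

Applied to every row of the column, the same tally would give per-category totals of the kind listed in the table.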

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 68 | 10.2%
 | 62 | 9.3%
 | 54 | 8.1%
 | 53 | 8.0%
 | 52 | 7.8%
 | 52 | 7.8%
 | 52 | 7.8%
 | 51 | 7.7%
 | 50 | 7.5%
 | 12 | 1.8%
Other values (43) | 158 | 23.8%

Decimal Number
Value | Count | Frequency (%)
1 | 35 | 18.6%
6 | 26 | 13.8%
7 | 21 | 11.2%
2 | 21 | 11.2%
4 | 19 | 10.1%
3 | 18 | 9.6%
5 | 15 | 8.0%
8 | 12 | 6.4%
9 | 11 | 5.9%
0 | 10 | 5.3%

Space Separator
Value | Count | Frequency (%)
 | 205 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 36 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 664 | 60.8%
Common | 429 | 39.2%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 68 | 10.2%
 | 62 | 9.3%
 | 54 | 8.1%
 | 53 | 8.0%
 | 52 | 7.8%
 | 52 | 7.8%
 | 52 | 7.8%
 | 51 | 7.7%
 | 50 | 7.5%
 | 12 | 1.8%
Other values (43) | 158 | 23.8%

Common
Value | Count | Frequency (%)
 | 205 | 47.8%
- | 36 | 8.4%
1 | 35 | 8.2%
6 | 26 | 6.1%
7 | 21 | 4.9%
2 | 21 | 4.9%
4 | 19 | 4.4%
3 | 18 | 4.2%
5 | 15 | 3.5%
8 | 12 | 2.8%
Other values (2) | 21 | 4.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 664 | 60.8%
ASCII | 429 | 39.2%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
 | 205 | 47.8%
- | 36 | 8.4%
1 | 35 | 8.2%
6 | 26 | 6.1%
7 | 21 | 4.9%
2 | 21 | 4.9%
4 | 19 | 4.4%
3 | 18 | 4.2%
5 | 15 | 3.5%
8 | 12 | 2.8%
Other values (2) | 21 | 4.9%

Hangul
Value | Count | Frequency (%)
 | 68 | 10.2%
 | 62 | 9.3%
 | 54 | 8.1%
 | 53 | 8.0%
 | 52 | 7.8%
 | 52 | 7.8%
 | 52 | 7.8%
 | 51 | 7.7%
 | 50 | 7.5%
 | 12 | 1.8%
Other values (43) | 158 | 23.8%

Interactions


Correlations

Variable | 연번 | 사업장명칭 | 축종 | 사육두수 | 사업장소재지
연번 | 1.000 | 1.000 | 0.998 | 0.514 | 1.000
사업장명칭 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000
축종 | 0.998 | 1.000 | 1.000 | 0.943 | 1.000
사육두수 | 0.514 | 0.000 | 0.943 | 1.000 | 1.000
사업장소재지 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 연번 | 사육두수 | 축종
연번 | 1.000 | 0.691 | 0.860
사육두수 | 0.691 | 1.000 | 0.701
축종 | 0.860 | 0.701 | 1.000
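
The High correlation alerts near the top of this report can be related to the three-variable matrix above. The sketch below flags variable pairs whose coefficient exceeds a threshold. The value 0.5 is an assumption used for illustration (it is not stated in this report), and `high_pairs` is a hypothetical name.

```python
from itertools import combinations

# Coefficients copied from the three-variable correlation matrix in this report.
columns = ["연번", "사육두수", "축종"]
matrix = [
    [1.000, 0.691, 0.860],
    [0.691, 1.000, 0.701],
    [0.860, 0.701, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, chosen for illustration only

# Visit each unordered pair once (upper triangle) and keep the strong ones.
high_pairs = [
    (columns[i], columns[j], matrix[i][j])
    for i, j in combinations(range(len(columns)), 2)
    if abs(matrix[i][j]) > THRESHOLD
]

for left, right, value in high_pairs:
    print(f"{left} ~ {right}: {value:.3f}")
```

With this cut-off all three pairs are flagged, which agrees with each of the three variables being reported as highly correlated with two others.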

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 연번 | 사업장명칭 | 축종 | 사육두수 | 사업장소재지
0 | 1 | 탐진농장 | 오리 | 22000 | 전라남도 강진군 군동면 장산리 622
1 | 2 | 태시농장 | 오리 | 15000 | 전라남도 강진군 군동면 용소리 75-2
2 | 3 | 도성농장 | 오리 | 9000 | 전라남도 강진군 마량면 영동리 51-3
3 | 4 | 치영농장 | 오리 | 16600 | 전라남도 강진군 도암면 계라리 606-11
4 | 5 | 에덴농장 | 오리 | 18000 | 전라남도 강진군 도암면 석문리 1267-1
5 | 6 | 도암농장 | 오리 | 18500 | 전라남도 강진군 도암면 석문리 692-6
6 | 7 | 석문농장 | 오리 | 23000 | 전라남도 강진군 도암면 석문리 1265
7 | 8 | 호림농장 | 오리 | 43000 | 전라남도 강진군 도암면 석문리 692-7
8 | 9 | 대벌농장 | 오리 | 21000 | 전라남도 강진군 신전면 벌정리 1005-4
9 | 10 | 대덕농장 | 오리 | 10000 | 전라남도 강진군 신전면 벌정리 741

Index | 연번 | 사업장명칭 | 축종 | 사육두수 | 사업장소재지
41 | 42 | 인성농장 |  | 60000 | 전라남도 강진군 도암면 덕년리 74-3
42 | 43 | 원주농장 |  | 60000 | 전라남도 강진군 도암면 덕년리 237-28
43 | 44 | 한일농장 |  | 60000 | 전라남도 강진군 도암면 석문리 1257-1
44 | 45 | 삼성농장 |  | 60000 | 전라남도 강진군 도암면 석문리 765-6
45 | 46 | 송천농장 |  | 85000 | 전라남도 강진군 신전면 송천리 394
46 | 47 | 대진농장 |  | 70000 | 전라남도 강진군 성전면 금당리 산 41-7
47 | 48 | 영풍농장 |  | 60000 | 전라남도 강진군 성전면 영풍리 391-6
48 | 49 | 처인농장 |  | 55000 | 전라남도 강진군 성전면 월평리 695
49 | 50 | 모작골농장 |  | 80000 | 전라남도 강진군 작천면 갈동리 1300-1
50 | 51 | 종점농장 |  | 50000 | 전라남도 강진군 옴천면 기좌리 117-1