Overview

Dataset statistics

Number of variables7
Number of observations542
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory30.8 KiB
Average record size in memory58.2 B

Variable types

Numeric2
Text2
Categorical3

Dataset

Description경상남도 사천시 내 축산농가에 관한 현황정보를 나타낸 파일데이터로 사업장명, 주사육업종,사육두수 등을 포함하고 있습니다.
Author경상남도 사천시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15037765

Alerts

데이터기준일자 has constant value ""Constant
주사육업종 is highly imbalanced (70.9%)Imbalance
연번 has unique valuesUnique
사육두수 has 19 (3.5%) zerosZeros

Reproduction

Analysis started2023-12-11 00:20:01.831432
Analysis finished2023-12-11 00:20:02.785152
Duration0.95 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct542
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean271.5
Minimum1
Maximum542
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.9 KiB
2023-12-11T09:20:02.853604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile28.05
Q1136.25
median271.5
Q3406.75
95-th percentile514.95
Maximum542
Range541
Interquartile range (IQR)270.5

Descriptive statistics

Standard deviation156.60619
Coefficient of variation (CV)0.57681839
Kurtosis-1.2
Mean271.5
Median Absolute Deviation (MAD)135.5
Skewness0
Sum147153
Variance24525.5
MonotonicityStrictly increasing
2023-12-11T09:20:03.004020image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
358 1
 
0.2%
372 1
 
0.2%
371 1
 
0.2%
370 1
 
0.2%
369 1
 
0.2%
368 1
 
0.2%
367 1
 
0.2%
366 1
 
0.2%
365 1
 
0.2%
Other values (532) 532
98.2%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
542 1
0.2%
541 1
0.2%
540 1
0.2%
539 1
0.2%
538 1
0.2%
537 1
0.2%
536 1
0.2%
535 1
0.2%
534 1
0.2%
533 1
0.2%
Distinct533
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-11T09:20:03.293429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length5.1383764
Min length3

Characters and Unicode

Total characters2785
Distinct characters261
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique525 ?
Unique (%)96.9%

Sample

1st row상아농장
2nd row덕우목장
3rd row상현농장
4th row김영제농장
5th row박영자 농장
ValueCountFrequency (%)
농장 182
 
24.0%
목장 5
 
0.7%
목단농장 3
 
0.4%
한우 3
 
0.4%
2농장 3
 
0.4%
진수농장 2
 
0.3%
주식회사 2
 
0.3%
대신 2
 
0.3%
어류골 2
 
0.3%
성원농장 2
 
0.3%
Other values (545) 551
72.8%
2023-12-11T09:20:03.708228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
503
 
18.1%
443
 
15.9%
215
 
7.7%
65
 
2.3%
45
 
1.6%
44
 
1.6%
43
 
1.5%
36
 
1.3%
35
 
1.3%
35
 
1.3%
Other values (251) 1321
47.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2545
91.4%
Space Separator 215
 
7.7%
Decimal Number 19
 
0.7%
Open Punctuation 3
 
0.1%
Close Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
503
 
19.8%
443
 
17.4%
65
 
2.6%
45
 
1.8%
44
 
1.7%
43
 
1.7%
36
 
1.4%
35
 
1.4%
35
 
1.4%
30
 
1.2%
Other values (242) 1266
49.7%
Decimal Number
ValueCountFrequency (%)
2 13
68.4%
1 2
 
10.5%
7 1
 
5.3%
6 1
 
5.3%
5 1
 
5.3%
4 1
 
5.3%
Space Separator
ValueCountFrequency (%)
215
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2545
91.4%
Common 240
 
8.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
503
 
19.8%
443
 
17.4%
65
 
2.6%
45
 
1.8%
44
 
1.7%
43
 
1.7%
36
 
1.4%
35
 
1.4%
35
 
1.4%
30
 
1.2%
Other values (242) 1266
49.7%
Common
ValueCountFrequency (%)
215
89.6%
2 13
 
5.4%
( 3
 
1.2%
) 3
 
1.2%
1 2
 
0.8%
7 1
 
0.4%
6 1
 
0.4%
5 1
 
0.4%
4 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2545
91.4%
ASCII 240
 
8.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
503
 
19.8%
443
 
17.4%
65
 
2.6%
45
 
1.8%
44
 
1.7%
43
 
1.7%
36
 
1.4%
35
 
1.4%
35
 
1.4%
30
 
1.2%
Other values (242) 1266
49.7%
ASCII
ValueCountFrequency (%)
215
89.6%
2 13
 
5.4%
( 3
 
1.2%
) 3
 
1.2%
1 2
 
0.8%
7 1
 
0.4%
6 1
 
0.4%
5 1
 
0.4%
4 1
 
0.4%
Distinct527
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-12-11T09:20:03.967062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length46
Mean length25.416974
Min length4

Characters and Unicode

Total characters13776
Distinct characters126
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique519 ?
Unique (%)95.8%

Sample

1st row경상남도 사천시 곤양면 가화리 산 176번지
2nd row경상남도 사천시 곤양면 환덕리 산 26번지 7호
3rd row경상남도 사천시 사천읍 구암리 923번지
4th row경상남도 사천시 축동면 반용리 455번지 1호
5th row경상남도 사천시 사남면 우천리 285번지 2호
ValueCountFrequency (%)
경상남도 534
18.0%
사천시 534
18.0%
곤양면 155
 
5.2%
서포면 117
 
3.9%
1호 96
 
3.2%
2호 66
 
2.2%
사남면 52
 
1.7%
정동면 49
 
1.6%
곤명면 46
 
1.5%
3호 38
 
1.3%
Other values (558) 1286
43.3%
2023-12-11T09:20:04.355304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3518
25.5%
638
 
4.6%
588
 
4.3%
583
 
4.2%
543
 
3.9%
540
 
3.9%
534
 
3.9%
534
 
3.9%
534
 
3.9%
529
 
3.8%
Other values (116) 5235
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8196
59.5%
Space Separator 3518
25.5%
Decimal Number 2015
 
14.6%
Other Punctuation 21
 
0.2%
Dash Punctuation 16
 
0.1%
Close Punctuation 5
 
< 0.1%
Open Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
638
 
7.8%
588
 
7.2%
583
 
7.1%
543
 
6.6%
540
 
6.6%
534
 
6.5%
534
 
6.5%
534
 
6.5%
529
 
6.5%
486
 
5.9%
Other values (101) 2687
32.8%
Decimal Number
ValueCountFrequency (%)
1 363
18.0%
2 299
14.8%
5 218
10.8%
3 213
10.6%
4 187
9.3%
7 163
8.1%
8 152
7.5%
6 150
7.4%
0 136
 
6.7%
9 134
 
6.7%
Space Separator
ValueCountFrequency (%)
3518
100.0%
Other Punctuation
ValueCountFrequency (%)
, 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8196
59.5%
Common 5580
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
638
 
7.8%
588
 
7.2%
583
 
7.1%
543
 
6.6%
540
 
6.6%
534
 
6.5%
534
 
6.5%
534
 
6.5%
529
 
6.5%
486
 
5.9%
Other values (101) 2687
32.8%
Common
ValueCountFrequency (%)
3518
63.0%
1 363
 
6.5%
2 299
 
5.4%
5 218
 
3.9%
3 213
 
3.8%
4 187
 
3.4%
7 163
 
2.9%
8 152
 
2.7%
6 150
 
2.7%
0 136
 
2.4%
Other values (5) 181
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8196
59.5%
ASCII 5580
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3518
63.0%
1 363
 
6.5%
2 299
 
5.4%
5 218
 
3.9%
3 213
 
3.8%
4 187
 
3.4%
7 163
 
2.9%
8 152
 
2.7%
6 150
 
2.7%
0 136
 
2.4%
Other values (5) 181
 
3.2%
Hangul
ValueCountFrequency (%)
638
 
7.8%
588
 
7.2%
583
 
7.1%
543
 
6.6%
540
 
6.6%
534
 
6.5%
534
 
6.5%
534
 
6.5%
529
 
6.5%
486
 
5.9%
Other values (101) 2687
32.8%

주사육업종
Categorical

IMBALANCE 

Distinct10
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
한우
465 
젖소
 
29
돼지
 
15
산양
 
7
염소
 
7
Other values (5)
 
19

Length

Max length6
Median length2
Mean length2.0369004
Min length2

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row산양
2nd row한우
3rd row한우
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 465
85.8%
젖소 29
 
5.4%
돼지 15
 
2.8%
산양 7
 
1.3%
염소 7
 
1.3%
육우 5
 
0.9%
육계 5
 
0.9%
종계/산란계 5
 
0.9%
사슴 3
 
0.6%
오리 1
 
0.2%

Length

2023-12-11T09:20:04.514488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:20:04.631289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한우 465
85.8%
젖소 29
 
5.4%
돼지 15
 
2.8%
산양 7
 
1.3%
염소 7
 
1.3%
육우 5
 
0.9%
육계 5
 
0.9%
종계/산란계 5
 
0.9%
사슴 3
 
0.6%
오리 1
 
0.2%
Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
허가대상
424 
등록대상
118 

Length

Max length5
Median length4
Mean length4.2177122
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row등록대상
2nd row허가대상
3rd row허가대상
4th row허가대상
5th row허가대상

Common Values

ValueCountFrequency (%)
허가대상 424
78.2%
등록대상 118
 
21.8%

Length

2023-12-11T09:20:04.791534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:20:04.888190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
허가대상 424
78.2%
등록대상 118
 
21.8%

사육두수
Real number (ℝ)

ZEROS 

Distinct116
Distinct (%)21.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean419.08303
Minimum0
Maximum70000
Zeros19
Zeros (%)3.5%
Negative0
Negative (%)0.0%
Memory size4.9 KiB
2023-12-11T09:20:04.997718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q15
median11
Q338.75
95-th percentile284.9
Maximum70000
Range70000
Interquartile range (IQR)33.75

Descriptive statistics

Standard deviation4110.2535
Coefficient of variation (CV)9.8077309
Kurtosis203.46725
Mean419.08303
Median Absolute Deviation (MAD)9
Skewness13.695908
Sum227143
Variance16894184
MonotonicityNot monotonic
2023-12-11T09:20:05.133960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 44
 
8.1%
5 31
 
5.7%
10 27
 
5.0%
4 27
 
5.0%
3 25
 
4.6%
7 21
 
3.9%
6 21
 
3.9%
0 19
 
3.5%
1 18
 
3.3%
8 17
 
3.1%
Other values (106) 292
53.9%
ValueCountFrequency (%)
0 19
3.5%
1 18
3.3%
2 44
8.1%
3 25
4.6%
4 27
5.0%
5 31
5.7%
6 21
3.9%
7 21
3.9%
8 17
 
3.1%
9 13
 
2.4%
ValueCountFrequency (%)
70000 1
0.2%
52000 1
0.2%
30000 1
0.2%
21603 1
0.2%
15000 1
0.2%
3000 1
0.2%
2560 1
0.2%
2400 1
0.2%
1700 1
0.2%
1650 1
0.2%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2023-06-27
542 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-27
2nd row2023-06-27
3rd row2023-06-27
4th row2023-06-27
5th row2023-06-27

Common Values

ValueCountFrequency (%)
2023-06-27 542
100.0%

Length

2023-12-11T09:20:05.319294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:20:05.412843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-27 542
100.0%

Interactions

2023-12-11T09:20:02.269507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:20:02.121171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:20:02.342211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:20:02.191852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:20:05.476068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주사육업종등록 또는 허가여부사육두수
연번1.0000.3670.6160.011
주사육업종0.3671.0000.4630.591
등록 또는 허가여부0.6160.4631.0000.000
사육두수0.0110.5910.0001.000
2023-12-11T09:20:05.584765image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록 또는 허가여부주사육업종
등록 또는 허가여부1.0000.353
주사육업종0.3531.000
2023-12-11T09:20:05.686499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번사육두수주사육업종등록 또는 허가여부
연번1.000-0.3560.1200.474
사육두수-0.3561.0000.3610.000
주사육업종0.1200.3611.0000.353
등록 또는 허가여부0.4740.0000.3531.000

Missing values

2023-12-11T09:20:02.662155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:20:02.748612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업체명소재지주사육업종등록 또는 허가여부사육두수데이터기준일자
01상아농장경상남도 사천시 곤양면 가화리 산 176번지산양등록대상10002023-06-27
12덕우목장경상남도 사천시 곤양면 환덕리 산 26번지 7호한우허가대상482023-06-27
23상현농장경상남도 사천시 사천읍 구암리 923번지한우허가대상832023-06-27
34김영제농장경상남도 사천시 축동면 반용리 455번지 1호한우허가대상192023-06-27
45박영자 농장경상남도 사천시 사남면 우천리 285번지 2호한우허가대상132023-06-27
56정성권농장경상남도 사천시 곤양면 묵곡리 875번지 1호한우허가대상102023-06-27
67노청길농장경상남도 사천시 이홀동 30번지 1호한우허가대상922023-06-27
78황호성농장경상남도 사천시 곤양면 환덕리 1047번지 2호한우허가대상182023-06-27
89이영남농장경상남도 사천시 곤명면 송림리 259번지 1호산양등록대상502023-06-27
910동훈축산경상남도 사천시 정동면 학촌리 162번지한우허가대상1502023-06-27
연번업체명소재지주사육업종등록 또는 허가여부사육두수데이터기준일자
532533서일농장경상남도 사천시 서포면 금진리 205번지육계허가대상2502023-06-27
533534한탑농장경상남도 사천시 곤양면 환덕리 897번지 9호 외 1(산103-2)한우허가대상102023-06-27
534535남경농장경상남도 사천시 곤양면 환덕리 249번지 17호 , 249-18한우허가대상02023-06-27
535536두량축산경상남도 사천시 사천읍 두량리 204번지 2호한우허가대상02023-06-27
536537대신6농장경상남도 사천시 곤양면 대진리 294번지 ,297-2한우허가대상02023-06-27
537538대신7농장경상남도 사천시 곤양면 대진리 286번지 2호 ,288 ,289, 292, 293한우허가대상02023-06-27
538539수정농장경상남도 사천시 서포면 구평리 748번지한우허가대상22023-06-27
539540와룡농장경상남도 사천시 정동면 풍정리 32번지 1호염소등록대상2002023-06-27
540541염소농장경상남도 사천시 정동면 학촌리 65번지 2호염소등록대상162023-06-27
541542에코농장경상남도 사천시 곤양면 환덕리 1356번지 2호 ,1356-3, 1356-4한우허가대상02023-06-27