Overview

Dataset statistics

Number of variables: 13
Number of observations: 8175
Missing cells: 48586
Missing cells (%): 45.7%
Duplicate rows: 31
Duplicate rows (%): 0.4%
Total size in memory: 878.3 KiB
Average record size in memory: 110.0 B
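
The percentages and the average record size above follow directly from the raw counts. A minimal sketch that recomputes them, assuming the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB = 1024 B; the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" block of the report.
n_variables = 13
n_observations = 8175
missing_cells = 48586
duplicate_rows = 31
total_size_kib = 878.3

# Assumption: percentages are computed over all cells / all rows.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```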

Variable types

Categorical: 4
Text: 1
DateTime: 2
Numeric: 1
Unsupported: 5

Dataset

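Description: Status of livestock product and meat processing businesses (축산물 식육 가공업체 현황)
Author: Ministry of the Interior and Safety (행정안전부)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=VD324O00A775FJ9IZO76416755&infSeq=1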

Alerts

Dataset has 31 (0.4%) duplicate rows [Duplicates]
축산물가공업구분명 is highly overall correlated with 축산업무구분명 [High correlation]
축산업무구분명 is highly overall correlated with 축산물가공업구분명 [High correlation]
폐업일자 has 6051 (74.0%) missing values [Missing]
소재지면적(㎡) has 1660 (20.3%) missing values [Missing]
소재지도로명주소 has 8175 (100.0%) missing values [Missing]
소재지지번주소 has 8175 (100.0%) missing values [Missing]
소재지우편번호 has 8175 (100.0%) missing values [Missing]
WGS84위도 has 8175 (100.0%) missing values [Missing]
WGS84경도 has 8175 (100.0%) missing values [Missing]
소재지면적(㎡) is highly skewed (γ1 = 22.30306027) [Skewed]
소재지도로명주소 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
소재지지번주소 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
소재지우편번호 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
WGS84위도 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
WGS84경도 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
소재지면적(㎡) has 4554 (55.7%) zeros [Zeros]
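
The per-column missing counts in the alerts add up to the dataset-level "Missing cells" figure. A short sketch of that cross-check, using only the counts listed above (the dictionary and variable names are illustrative):

```python
# Missing-value counts copied from the "Missing" alerts.
N_ROWS = 8175
missing_by_column = {
    "폐업일자": 6051,
    "소재지면적(㎡)": 1660,
    "소재지도로명주소": N_ROWS,
    "소재지지번주소": N_ROWS,
    "소재지우편번호": N_ROWS,
    "WGS84위도": N_ROWS,
    "WGS84경도": N_ROWS,
}

# Sum of per-column counts; should equal the "Missing cells" statistic.
total_missing = sum(missing_by_column.values())

# Columns with no values at all (the ones flagged as Unsupported).
fully_missing = [col for col, n in missing_by_column.items() if n == N_ROWS]

print(total_missing)
print(fully_missing)
```

The five fully empty columns account for most of the missing cells, so dropping them before analysis would change the overall missing rate considerably.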

Reproduction

Analysis started: 2023-12-10 21:34:37.639304
Analysis finished: 2023-12-10 21:34:39.469914
Duration: 1.83 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시군명 (city/county name)
Categorical

Distinct: 33
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 KiB

Value | Count
화성시 | 880
안성시 | 856
파주시 | 666
포천시 | 564
양주시 | 364
Other values (28) | 4845

Length

Max length: 4
Median length: 3
Mean length: 3.06263
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가평군
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value | Count | Frequency (%)
화성시 | 880 | 10.8%
안성시 | 856 | 10.5%
파주시 | 666 | 8.1%
포천시 | 564 | 6.9%
양주시 | 364 | 4.5%
용인시 | 354 | 4.3%
여주시 | 349 | 4.3%
남양주시 | 344 | 4.2%
이천시 | 341 | 4.2%
고양시 | 333 | 4.1%
Other values (23) | 3124 | 38.2%

Length

Figure: Histogram of lengths of the category

사업장명 (business name)
Text

Distinct: 7071
Distinct (%): 86.5%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 KiB

Length

Max length: 31
Median length: 29
Mean length: 5.7663609
Min length: 1

Characters and Unicode

Total characters: 47140
Distinct characters: 869
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
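
As an illustration of those character properties, the sketch below counts Unicode general categories for the five sample business names shown in this report, using Python's standard `unicodedata` module. The two-letter codes it prints map to the category names used in the tables (`Lo` = Other Letter, `Zs` = Space Separator, `Ps`/`Pe` = Open/Close Punctuation, `Po` = Other Punctuation, `Ll`/`Lu` = Lowercase/Uppercase Letter). The helper name `category_counts` is hypothetical, and this is not necessarily how ydata-profiling computes its figures.

```python
import unicodedata
from collections import Counter

# The five sample values of 사업장명 listed in the report.
sample_names = [
    "재종농장",
    "위곡농장",
    "쩡's푸드",
    "가평축산업협동조합경제사업본부",
    "(주)산과들 농업회사법인",
]


def category_counts(values):
    """Count Unicode general categories over every character in values."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


counts = category_counts(sample_names)
for code, n in counts.most_common():
    print(code, n)
```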

Unique

Unique: 6336
Unique (%): 77.5%

Sample

1st row: 재종농장
2nd row: 위곡농장
3rd row: 쩡's푸드
4th row: 가평축산업협동조합경제사업본부
5th row: (주)산과들 농업회사법인

Value | Count | Frequency (%)
주식회사 | 655 | 6.8%
농업회사법인 | 152 | 1.6%
농장 | 50 | 0.5%
(not rendered) | 42 | 0.4%
목장 | 32 | 0.3%
(not rendered) | 20 | 0.2%
우리농장 | 15 | 0.2%
유한회사 | 13 | 0.1%
제2공장 | 12 | 0.1%
대성농장 | 12 | 0.1%
Other values (7302) | 8586 | 89.5%

Most occurring characters

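Value | Count | Frequency (%)
(not rendered) | 4288 | 9.1%
(not rendered) | 2791 | 5.9%
(not rendered) | 1930 | 4.1%
(not rendered) | 1750 | 3.7%
(space) | 1417 | 3.0%
) | 1222 | 2.6%
( | 1221 | 2.6%
(not rendered) | 1104 | 2.3%
(not rendered) | 1084 | 2.3%
(not rendered) | 933 | 2.0%
Other values (859) | 29400 | 62.4%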

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 41901 | 88.9%
Space Separator | 1418 | 3.0%
Close Punctuation | 1222 | 2.6%
Open Punctuation | 1221 | 2.6%
Uppercase Letter | 607 | 1.3%
Lowercase Letter | 365 | 0.8%
Decimal Number | 272 | 0.6%
Other Punctuation | 77 | 0.2%
Dash Punctuation | 49 | 0.1%
Other Symbol | 7 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not rendered) | 4288 | 10.2%
(not rendered) | 2791 | 6.7%
(not rendered) | 1930 | 4.6%
(not rendered) | 1750 | 4.2%
(not rendered) | 1104 | 2.6%
(not rendered) | 1084 | 2.6%
(not rendered) | 933 | 2.2%
(not rendered) | 821 | 2.0%
(not rendered) | 800 | 1.9%
(not rendered) | 687 | 1.6%
Other values (780) | 25713 | 61.4%

Uppercase Letter
Value | Count | Frequency (%)
F | 66 | 10.9%
S | 54 | 8.9%
O | 52 | 8.6%
D | 47 | 7.7%
C | 40 | 6.6%
E | 40 | 6.6%
B | 33 | 5.4%
T | 28 | 4.6%
K | 27 | 4.4%
P | 26 | 4.3%
Other values (15) | 194 | 32.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 47 | 12.9%
o | 41 | 11.2%
a | 31 | 8.5%
t | 26 | 7.1%
n | 24 | 6.6%
m | 18 | 4.9%
i | 18 | 4.9%
d | 16 | 4.4%
r | 16 | 4.4%
g | 15 | 4.1%
Other values (15) | 113 | 31.0%

Other Punctuation
Value | Count | Frequency (%)
& | 36 | 46.8%
. | 21 | 27.3%
, | 6 | 7.8%
! | 3 | 3.9%
/ | 2 | 2.6%
· | 2 | 2.6%
* | 2 | 2.6%
; | 1 | 1.3%
(not rendered) | 1 | 1.3%
(not rendered) | 1 | 1.3%
Other values (2) | 2 | 2.6%

Decimal Number
Value | Count | Frequency (%)
2 | 160 | 58.8%
1 | 55 | 20.2%
3 | 23 | 8.5%
4 | 10 | 3.7%
0 | 6 | 2.2%
9 | 5 | 1.8%
6 | 4 | 1.5%
5 | 4 | 1.5%
8 | 3 | 1.1%
7 | 2 | 0.7%

Space Separator
Value | Count | Frequency (%)
(space) | 1417 | 99.9%
(other space character) | 1 | 0.1%

Close Punctuation
Value | Count | Frequency (%)
) | 1222 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1221 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 49 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not rendered) | 7 | 100.0%

Letter Number
Value | Count | Frequency (%)
(not rendered) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 41900 | 88.9%
Common | 4259 | 9.0%
Latin | 973 | 2.1%
Han | 8 | < 0.1%

Most frequent character per script

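Hangul
Value | Count | Frequency (%)
(not rendered) | 4288 | 10.2%
(not rendered) | 2791 | 6.7%
(not rendered) | 1930 | 4.6%
(not rendered) | 1750 | 4.2%
(not rendered) | 1104 | 2.6%
(not rendered) | 1084 | 2.6%
(not rendered) | 933 | 2.2%
(not rendered) | 821 | 2.0%
(not rendered) | 800 | 1.9%
(not rendered) | 687 | 1.6%
Other values (774) | 25712 | 61.4%

Latin
Value | Count | Frequency (%)
F | 66 | 6.8%
S | 54 | 5.5%
O | 52 | 5.3%
D | 47 | 4.8%
e | 47 | 4.8%
o | 41 | 4.2%
C | 40 | 4.1%
E | 40 | 4.1%
B | 33 | 3.4%
a | 31 | 3.2%
Other values (41) | 522 | 53.6%

Common
Value | Count | Frequency (%)
(space) | 1417 | 33.3%
) | 1222 | 28.7%
( | 1221 | 28.7%
2 | 160 | 3.8%
1 | 55 | 1.3%
- | 49 | 1.2%
& | 36 | 0.8%
3 | 23 | 0.5%
. | 21 | 0.5%
4 | 10 | 0.2%
Other values (17) | 45 | 1.1%

Han
Value | Count | Frequency (%)
(not rendered) | 2 | 25.0%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%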

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 41893 | 88.9%
ASCII | 5226 | 11.1%
None | 12 | < 0.1%
CJK | 8 | < 0.1%
Number Forms | 1 | < 0.1%

Most frequent character per block

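Hangul
Value | Count | Frequency (%)
(not rendered) | 4288 | 10.2%
(not rendered) | 2791 | 6.7%
(not rendered) | 1930 | 4.6%
(not rendered) | 1750 | 4.2%
(not rendered) | 1104 | 2.6%
(not rendered) | 1084 | 2.6%
(not rendered) | 933 | 2.2%
(not rendered) | 821 | 2.0%
(not rendered) | 800 | 1.9%
(not rendered) | 687 | 1.6%
Other values (773) | 25705 | 61.4%

ASCII
Value | Count | Frequency (%)
(space) | 1417 | 27.1%
) | 1222 | 23.4%
( | 1221 | 23.4%
2 | 160 | 3.1%
F | 66 | 1.3%
1 | 55 | 1.1%
S | 54 | 1.0%
O | 52 | 1.0%
- | 49 | 0.9%
D | 47 | 0.9%
Other values (63) | 883 | 16.9%

None
Value | Count | Frequency (%)
(not rendered) | 7 | 58.3%
· | 2 | 16.7%
(not rendered) | 1 | 8.3%
(not rendered) | 1 | 8.3%
(other space character) | 1 | 8.3%

CJK
Value | Count | Frequency (%)
(not rendered) | 2 | 25.0%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%
(not rendered) | 1 | 12.5%

Number Forms
Value | Count | Frequency (%)
(not rendered) | 1 | 100.0%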

인허가일자 (license/permit date)
Date

Distinct: 3525
Distinct (%): 43.1%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 KiB
Minimum: 1972-05-11 00:00:00
Maximum: 2023-12-05 00:00:00
Figure: Histogram with fixed size bins (bins=50)

영업상태명 (business status)
Categorical

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 KiB

Value | Count
정상 | 4189
폐업 | 1308
운영중 | 1246
폐업 등 | 824
말소 | 309
Other values (2) | 299

Length

Max length: 4
Median length: 2
Mean length: 2.363792
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 말소
2nd row: 말소
3rd row: 운영중
4th row: 운영중
5th row: 운영중

Common Values

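Value | Count | Frequency (%)
정상 (normal) | 4189 | 51.2%
폐업 (closed) | 1308 | 16.0%
운영중 (in operation) | 1246 | 15.2%
폐업 등 (closed, etc.) | 824 | 10.1%
말소 (deregistered) | 309 | 3.8%
휴업 (suspended) | 259 | 3.2%
휴업 등 (suspended, etc.) | 40 | 0.5%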

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: common values plot

Value | Count | Frequency (%)
정상 | 4189 | 46.3%
폐업 | 2132 | 23.6%
운영중 | 1246 | 13.8%
등 | 864 | 9.6%
말소 | 309 | 3.4%
휴업 | 299 | 3.3%
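
The word-level counts above differ from the Common Values table because multi-word statuses are split into tokens: 폐업 (2132) combines 폐업 (1308) and 폐업 등 (824), and 휴업 (299) combines 휴업 (259) and 휴업 등 (40). A minimal sketch that reproduces these figures, assuming plain whitespace tokenization (the report does not state which tokenizer it uses):

```python
from collections import Counter

# Value counts copied from the Common Values table of 영업상태명.
status_counts = {
    "정상": 4189,
    "폐업": 1308,
    "운영중": 1246,
    "폐업 등": 824,
    "말소": 309,
    "휴업": 259,
    "휴업 등": 40,
}

# Assumption: each value is split on whitespace and every token
# inherits the count of the value it came from.
word_counts = Counter()
for value, n in status_counts.items():
    for token in value.split():
        word_counts[token] += n

# Frequencies are relative to the number of tokens, not rows.
total_tokens = sum(word_counts.values())
for token, n in word_counts.most_common():
    print(token, n, f"{100 * n / total_tokens:.1f}%")
```

Because the denominator is the token total (9039) rather than the row count (8175), 정상 appears as 46.3% here and as 51.2% in the Common Values table.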

폐업일자 (closure date)
Date

MISSING

Distinct: 1081
Distinct (%): 50.9%
Missing: 6051
Missing (%): 74.0%
Memory size: 64.0 KiB
Minimum: 2003-07-03 00:00:00
Maximum: 2023-12-20 00:00:00
Figure: Histogram with fixed size bins (bins=50)

소재지면적(㎡) (site area in ㎡)
Real number (ℝ)

MISSING  SKEWED  ZEROS

Distinct: 1757
Distinct (%): 27.0%
Missing: 1660
Missing (%): 20.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 191.94866
Minimum: 0
Maximum: 48187.7
Zeros: 4554
Zeros (%): 55.7%
Negative: 0
Negative (%): 0.0%
Memory size: 72.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 105.66
95-th percentile: 687.322
Maximum: 48187.7
Range: 48187.7
Interquartile range (IQR): 105.66

Descriptive statistics

Standard deviation: 1086.3653
Coefficient of variation (CV): 5.6596659
Kurtosis: 755.46013
Mean: 191.94866
Median Absolute Deviation (MAD): 0
Skewness: 22.30306
Sum: 1250545.5
Variance: 1180189.6
Monotonicity: Not monotonic
Figure: Histogram with fixed size bins (bins=50)
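
Several of the statistics for 소재지면적(㎡) are derived from one another, which gives a quick consistency check. The sketch below recomputes them from the reported values only. It assumes that the mean, sum, and distinct percentage are taken over non-missing rows, while the zeros percentage is taken over all rows; the variable names are illustrative.

```python
# Values copied from the 소재지면적(㎡) statistics in the report.
n_rows = 8175
n_missing = 1660
n_zeros = 4554
n_distinct = 1757
mean = 191.94866
std = 1086.3653
total = 1250545.5
q1, q3 = 0.0, 105.66

n_valid = n_rows - n_missing              # rows that actually have an area
cv = std / mean                           # coefficient of variation
variance = std ** 2
iqr = q3 - q1
implied_n = total / mean                  # should match n_valid
zeros_pct = 100 * n_zeros / n_rows        # assumption: over all rows
distinct_pct = 100 * n_distinct / n_valid # assumption: over non-missing rows

print(f"valid rows: {n_valid}, implied by sum/mean: {implied_n:.1f}")
print(f"CV: {cv:.4f}, variance: {variance:.1f}, IQR: {iqr}")
print(f"zeros: {zeros_pct:.1f}%, distinct: {distinct_pct:.1f}%")
```

With more than half of all rows equal to zero, the median and Q1 are both 0, so the mean of about 192 ㎡ is driven by a small number of very large sites.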

Value | Count | Frequency (%)
0.0 | 4554 | 55.7%
198.0 | 32 | 0.4%
330.0 | 6 | 0.1%
152.0 | 5 | 0.1%
99.0 | 5 | 0.1%
130.5 | 5 | 0.1%
396.0 | 4 | < 0.1%
180.0 | 4 | < 0.1%
199.5 | 4 | < 0.1%
160.0 | 4 | < 0.1%
Other values (1747) | 1892 | 23.1%
(Missing) | 1660 | 20.3%

Minimum 10 values

Value | Count | Frequency (%)
0.0 | 4554 | 55.7%
1.0 | 1 | < 0.1%
3.4 | 1 | < 0.1%
9.0 | 1 | < 0.1%
15.0 | 1 | < 0.1%
16.0 | 1 | < 0.1%
17.69 | 1 | < 0.1%
18.0 | 2 | < 0.1%
19.9 | 1 | < 0.1%
19.95 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
48187.7 | 1 | < 0.1%
30267.0 | 1 | < 0.1%
22298.84 | 1 | < 0.1%
20305.0 | 1 | < 0.1%
17948.0 | 1 | < 0.1%
16923.0 | 1 | < 0.1%
14544.3 | 1 | < 0.1%
11286.0 | 1 | < 0.1%
11161.17 | 1 | < 0.1%
9998.25 | 1 | < 0.1%

축산물가공업구분명 (livestock product processing business category)
Categorical

HIGH CORRELATION

Distinct: 9
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 KiB

Value | Count
<NA> | 4802
식육가공업 | 2110
단미 | 720
수입 | 361
보조 | 124
Other values (4) | 58

Length

Max length: 5
Median length: 4
Mean length: 3.955841
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 식육가공업
4th row: 식육가공업
5th row: 식육가공업

Common Values

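Value | Count | Frequency (%)
<NA> | 4802 | 58.7%
식육가공업 (meat processing) | 2110 | 25.8%
단미 (single-ingredient feed) | 720 | 8.8%
수입 (import) | 361 | 4.4%
보조 (supplementary feed) | 124 | 1.5%
종계업 (breeder chicken business) | 40 | 0.5%
종돈업 (breeding pig business) | 9 | 0.1%
배합 (compound feed) | 6 | 0.1%
종오리업 (breeder duck business) | 3 | < 0.1%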

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: common values plot

Value | Count | Frequency (%)
na | 4802 | 58.7%
식육가공업 | 2110 | 25.8%
단미 | 720 | 8.8%
수입 | 361 | 4.4%
보조 | 124 | 1.5%
종계업 | 40 | 0.5%
종돈업 | 9 | 0.1%
배합 | 6 | 0.1%
종오리업 | 3 | < 0.1%

축산업무구분명 (livestock business category)
Categorical

HIGH CORRELATION

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 KiB

Value | Count
가축사육업 | 4590
축산물가공업 | 2110
사료제조업 | 1352
종축업 | 52
가축인공수정소 | 43
Other values (2) | 28

Length

Max length: 7
Median length: 5
Mean length: 5.249052
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가축사육업
2nd row: 가축사육업
3rd row: 축산물가공업
4th row: 축산물가공업
5th row: 축산물가공업

Common Values

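Value | Count | Frequency (%)
가축사육업 (livestock raising) | 4590 | 56.1%
축산물가공업 (livestock product processing) | 2110 | 25.8%
사료제조업 (feed manufacturing) | 1352 | 16.5%
종축업 (breeding stock business) | 52 | 0.6%
가축인공수정소 (livestock artificial insemination center) | 43 | 0.5%
도축업 (slaughtering) | 14 | 0.2%
부화업 (hatchery) | 14 | 0.2%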

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: common values plot

소재지도로명주소 (road-name address)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 8175
Missing (%): 100.0%
Memory size: 72.0 KiB

소재지지번주소 (lot-number address)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 8175
Missing (%): 100.0%
Memory size: 72.0 KiB

소재지우편번호 (postal code)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 8175
Missing (%): 100.0%
Memory size: 72.0 KiB

WGS84위도 (WGS84 latitude)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 8175
Missing (%): 100.0%
Memory size: 72.0 KiB

WGS84경도 (WGS84 longitude)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 8175
Missing (%): 100.0%
Memory size: 72.0 KiB

Interactions

Figure: interaction plot

Correlations

Figure: correlation heatmap

Variable | 시군명 | 영업상태명 | 소재지면적(㎡) | 축산물가공업구분명 | 축산업무구분명
시군명 | 1.000 | 0.502 | 0.066 | 0.399 | 0.521
영업상태명 | 0.502 | 1.000 | 0.028 | 0.641 | 0.814
소재지면적(㎡) | 0.066 | 0.028 | 1.000 | 0.055 | 0.436
축산물가공업구분명 | 0.399 | 0.641 | 0.055 | 1.000 | 1.000
축산업무구분명 | 0.521 | 0.814 | 0.436 | 1.000 | 1.000

Figure: correlation heatmap

Variable | 영업상태명 | 축산물가공업구분명 | 시군명 | 축산업무구분명
영업상태명 | 1.000 | 0.413 | 0.234 | 0.412
축산물가공업구분명 | 0.413 | 1.000 | 0.152 | 0.999
시군명 | 0.234 | 0.152 | 1.000 | 0.245
축산업무구분명 | 0.412 | 0.999 | 0.245 | 1.000

Figure: correlation heatmap

Variable | 소재지면적(㎡) | 시군명 | 영업상태명 | 축산물가공업구분명 | 축산업무구분명
소재지면적(㎡) | 1.000 | 0.027 | 0.010 | 0.029 | 0.164
시군명 | 0.027 | 1.000 | 0.234 | 0.152 | 0.245
영업상태명 | 0.010 | 0.234 | 1.000 | 0.413 | 0.412
축산물가공업구분명 | 0.029 | 0.152 | 0.413 | 1.000 | 0.999
축산업무구분명 | 0.164 | 0.245 | 0.412 | 0.999 | 1.000
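
The matrices above are not labeled with the measure they use. The one covering only the four categorical variables is consistent with a categorical association measure such as Cramér's V, but that is an assumption, not something the report states. As a rough illustration of how such a value close to 1 arises between two categorical columns, the sketch below computes the plain (not bias-corrected) Cramér's V for two small, hypothetical contingency tables; the function name and the counts are made up for the example.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    # Pearson chi-squared statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    # Normalize by n * (min(rows, cols) - 1) so the result lies in [0, 1].
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return 0.0
    return math.sqrt(chi2 / (n * k))


# Hypothetical counts: each row category maps to exactly one column category.
perfectly_linked = [[30, 0], [0, 20]]
# Hypothetical counts: column shares are identical in every row.
independent = [[10, 10], [20, 20]]

print(cramers_v(perfectly_linked))
print(cramers_v(independent))
```

A value near 1 for 축산물가공업구분명 and 축산업무구분명 would mean that knowing one column almost determines the other, which matches the identical count of 2110 for 식육가공업 and 축산물가공업 in their value tables.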

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Figure: The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

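First rows

Index | 시군명 | 사업장명 | 인허가일자 | 영업상태명 | 폐업일자 | 소재지면적(㎡) | 축산물가공업구분명 | 축산업무구분명 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도
0 | 가평군 | 재종농장 | 20070510 | 말소 | <NA> | 0.0 | <NA> | 가축사육업 | <NA> | <NA> | <NA> | <NA> | <NA>
1 | 가평군 | 위곡농장 | 20050503 | 말소 | <NA> | 0.0 | <NA> | 가축사육업 | <NA> | <NA> | <NA> | <NA> | <NA>
2 | 가평군 | 쩡's푸드 | 20150119 | 운영중 | <NA> | 82.29 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
3 | 가평군 | 가평축산업협동조합경제사업본부 | 20160728 | 운영중 | <NA> | 193.1 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
4 | 가평군 | (주)산과들 농업회사법인 | 20180618 | 운영중 | <NA> | 235.45 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
5 | 가평군 | 자연유통 | 20150205 | 운영중 | <NA> | 233.5 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
6 | 가평군 | 용광덕목장 | 20100128 | 정상 | <NA> | 0.0 | <NA> | 가축사육업 | <NA> | <NA> | <NA> | <NA> | <NA>
7 | 가평군 | 참터목장 | 20080708 | 정상 | <NA> | 0.0 | <NA> | 가축사육업 | <NA> | <NA> | <NA> | <NA> | <NA>
8 | 가평군 | 신황보목장 | 20050722 | 정상 | <NA> | 0.0 | <NA> | 가축사육업 | <NA> | <NA> | <NA> | <NA> | <NA>
9 | 가평군 | 장원목장 | 20050722 | 정상 | <NA> | 0.0 | <NA> | 가축사육업 | <NA> | <NA> | <NA> | <NA> | <NA>

Last rows

Index | 시군명 | 사업장명 | 인허가일자 | 영업상태명 | 폐업일자 | 소재지면적(㎡) | 축산물가공업구분명 | 축산업무구분명 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도
8165 | <NA> | 엠에스에프 | 20010214 | 폐업 등 | 20170316 | 0.0 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
8166 | <NA> | (주)사세통상 | 20020907 | 폐업 등 | 20160526 | 814.0 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
8167 | <NA> | (주)동규엔터프라이즈 | 20021226 | 폐업 등 | 20060315 | 3670.0 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
8168 | <NA> | (주)보람식품 | 20010908 | 폐업 등 | 20120828 | 280.81 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
8169 | <NA> | 교하식품 | 20011024 | 폐업 등 | 20060323 | 82.7 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
8170 | <NA> | 한솔유통 | 20041022 | 폐업 등 | 20141124 | 664.0 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
8171 | <NA> | 훔메유통(주) | 20050430 | 폐업 등 | 20090511 | 2178.0 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
8172 | <NA> | 익수제약(주) | 20060523 | 휴업 등 | <NA> | 0.0 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
8173 | <NA> | 주식회사 더콥 | 20090609 | 휴업 등 | <NA> | 115.67 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>
8174 | <NA> | 영송혜물산 | 19950126 | 휴업 등 | <NA> | 122.69 | 식육가공업 | 축산물가공업 | <NA> | <NA> | <NA> | <NA> | <NA>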

Duplicate rows

Most frequently occurring

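Index | 시군명 | 사업장명 | 인허가일자 | 영업상태명 | 폐업일자 | 소재지면적(㎡) | 축산물가공업구분명 | 축산업무구분명 | # duplicates
11 | 동두천시 | 소요산생물보호센터 | 20130607 | 폐업 | 20221223 | <NA> | <NA> | 가축사육업 | 4
15 | 여주시 | 여주자영농업고등학교 | 20051115 | 정상 | <NA> | 0.0 | <NA> | 가축사육업 | 4
2 | 고양시 | 더 DOG립만세 | 20211001 | 폐업 | 20221229 | 0.0 | 단미 | 사료제조업 | 3
20 | 파주시 | 깅스키친 | 20210430 | 정상 | <NA> | 0.0 | 단미 | 사료제조업 | 3
21 | 파주시 | 더DOG립만세 | 20210416 | 폐업 | 20221117 | 0.0 | 단미 | 사료제조업 | 3
25 | 파주시 | 주식회사 봄봄 | 2023-08-17 | 정상 | <NA> | 0.0 | 단미 | 사료제조업 | 3
0 | 고양시 | (주)베스트칩 | 20211029 | 정상 | <NA> | 0.0 | 단미 | 사료제조업 | 2
1 | 고양시 | 6DECO | 20210802 | 정상 | <NA> | 0.0 | 단미 | 사료제조업 | 2
3 | 고양시 | 멍멍스푼 | 20220401 | 정상 | <NA> | 0.0 | 단미 | 사료제조업 | 2
4 | 고양시 | 멍뭉식탁 | 20211203 | 정상 | <NA> | 0.0 | 단미 | 사료제조업 | 2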
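
The rows above show 인허가일자 in two layouts: compact `YYYYMMDD` (for example 20210430) and hyphenated `YYYY-MM-DD` (2023-08-17). A minimal sketch of a parser that accepts both before any date comparison or deduplication; the function name `parse_license_date` is hypothetical, and treating every other string (including `<NA>`) as missing is an assumption:

```python
from datetime import date, datetime

# The two date layouts observed in the duplicate-rows table.
FORMATS = ("%Y%m%d", "%Y-%m-%d")


def parse_license_date(text):
    """Return a date for either layout, or None if the text matches neither."""
    for fmt in FORMATS:
        try:
            return datetime.strptime(text, fmt).date()
        except ValueError:
            continue
    return None


for raw in ["20210430", "2023-08-17", "<NA>"]:
    print(raw, "->", parse_license_date(raw))
```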