Overview

Dataset statistics

Number of variables6
Number of observations226
Missing cells85
Missing cells (%)6.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.9 KiB
Average record size in memory49.6 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description대구광역시 소재 축산물가공업(식육가공업, 유가공업, 알가공업)에 대한 현황 파일입니다. 본 정보에는 업소명과 영업장소재지 및 유선전화번호가 포함됩니다.
URLhttps://www.data.go.kr/data/15042998/fileData.do

Alerts

영업상태 has constant value ""Constant
가공업구분 is highly imbalanced (87.3%)Imbalance
소재지전화번호 has 85 (37.6%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:40:13.659200
Analysis finished2023-12-12 19:40:14.364859
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct226
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean113.5
Minimum1
Maximum226
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.1 KiB
2023-12-13T04:40:14.450895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.25
Q157.25
median113.5
Q3169.75
95-th percentile214.75
Maximum226
Range225
Interquartile range (IQR)112.5

Descriptive statistics

Standard deviation65.384759
Coefficient of variation (CV)0.57607717
Kurtosis-1.2
Mean113.5
Median Absolute Deviation (MAD)56.5
Skewness0
Sum25651
Variance4275.1667
MonotonicityStrictly increasing
2023-12-13T04:40:14.623022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
171 1
 
0.4%
145 1
 
0.4%
146 1
 
0.4%
147 1
 
0.4%
148 1
 
0.4%
149 1
 
0.4%
150 1
 
0.4%
151 1
 
0.4%
152 1
 
0.4%
Other values (216) 216
95.6%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
226 1
0.4%
225 1
0.4%
224 1
0.4%
223 1
0.4%
222 1
0.4%
221 1
0.4%
220 1
0.4%
219 1
0.4%
218 1
0.4%
217 1
0.4%
Distinct225
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-13T04:40:14.940115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length6.4424779
Min length2

Characters and Unicode

Total characters1456
Distinct characters259
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique224 ?
Unique (%)99.1%

Sample

1st row제일푸드(JEIL FOOD)
2nd row두민푸드
3rd row글로벌식품
4th row(주)대원미트
5th row(주)유림씨엔에프
ValueCountFrequency (%)
주식회사 22
 
8.0%
농업회사법인 8
 
2.9%
파인식품 2
 
0.7%
주)진우식품 2
 
0.7%
세이브 2
 
0.7%
동원식품 1
 
0.4%
태성미트 1
 
0.4%
창대푸드 1
 
0.4%
에이원 1
 
0.4%
태원미트 1
 
0.4%
Other values (233) 233
85.0%
2023-12-13T04:40:15.793312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
81
 
5.6%
73
 
5.0%
72
 
4.9%
70
 
4.8%
( 60
 
4.1%
) 60
 
4.1%
50
 
3.4%
48
 
3.3%
38
 
2.6%
35
 
2.4%
Other values (249) 869
59.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1242
85.3%
Open Punctuation 60
 
4.1%
Close Punctuation 60
 
4.1%
Space Separator 48
 
3.3%
Uppercase Letter 41
 
2.8%
Other Punctuation 3
 
0.2%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
81
 
6.5%
73
 
5.9%
72
 
5.8%
70
 
5.6%
50
 
4.0%
38
 
3.1%
35
 
2.8%
21
 
1.7%
20
 
1.6%
19
 
1.5%
Other values (225) 763
61.4%
Uppercase Letter
ValueCountFrequency (%)
F 6
14.6%
D 5
12.2%
S 5
12.2%
B 3
 
7.3%
C 3
 
7.3%
J 3
 
7.3%
H 2
 
4.9%
I 2
 
4.9%
O 2
 
4.9%
G 2
 
4.9%
Other values (8) 8
19.5%
Lowercase Letter
ValueCountFrequency (%)
c 1
50.0%
f 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Space Separator
ValueCountFrequency (%)
48
100.0%
Other Punctuation
ValueCountFrequency (%)
& 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1242
85.3%
Common 171
 
11.7%
Latin 43
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
81
 
6.5%
73
 
5.9%
72
 
5.8%
70
 
5.6%
50
 
4.0%
38
 
3.1%
35
 
2.8%
21
 
1.7%
20
 
1.6%
19
 
1.5%
Other values (225) 763
61.4%
Latin
ValueCountFrequency (%)
F 6
14.0%
D 5
11.6%
S 5
11.6%
B 3
 
7.0%
C 3
 
7.0%
J 3
 
7.0%
H 2
 
4.7%
I 2
 
4.7%
O 2
 
4.7%
G 2
 
4.7%
Other values (10) 10
23.3%
Common
ValueCountFrequency (%)
( 60
35.1%
) 60
35.1%
48
28.1%
& 3
 
1.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1242
85.3%
ASCII 214
 
14.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
81
 
6.5%
73
 
5.9%
72
 
5.8%
70
 
5.6%
50
 
4.0%
38
 
3.1%
35
 
2.8%
21
 
1.7%
20
 
1.6%
19
 
1.5%
Other values (225) 763
61.4%
ASCII
ValueCountFrequency (%)
( 60
28.0%
) 60
28.0%
48
22.4%
F 6
 
2.8%
D 5
 
2.3%
S 5
 
2.3%
B 3
 
1.4%
C 3
 
1.4%
& 3
 
1.4%
J 3
 
1.4%
Other values (14) 18
 
8.4%
Distinct225
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
2023-12-13T04:40:16.246044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length36
Mean length25.141593
Min length19

Characters and Unicode

Total characters5682
Distinct characters169
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique224 ?
Unique (%)99.1%

Sample

1st row대구광역시 동구 반야월북로 6-4 (율암동)
2nd row대구광역시 동구 도평로43길 11 (도동)
3rd row대구광역시 동구 방천로1길 105 (지저동)
4th row대구광역시 동구 신덕로6길 26 (신평동)
5th row대구광역시 서구 국채보상로 38-20 (중리동)
ValueCountFrequency (%)
대구광역시 226
 
19.8%
북구 68
 
6.0%
달성군 54
 
4.7%
동구 37
 
3.2%
서구 31
 
2.7%
달서구 22
 
1.9%
논공읍 18
 
1.6%
다사읍 13
 
1.1%
옥포읍 12
 
1.1%
노원동3가 11
 
1.0%
Other values (456) 650
56.9%
2023-12-13T04:40:16.765578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
917
 
16.1%
413
 
7.3%
245
 
4.3%
244
 
4.3%
230
 
4.0%
228
 
4.0%
226
 
4.0%
1 225
 
4.0%
198
 
3.5%
( 173
 
3.0%
Other values (159) 2583
45.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3365
59.2%
Space Separator 917
 
16.1%
Decimal Number 917
 
16.1%
Open Punctuation 173
 
3.0%
Close Punctuation 173
 
3.0%
Dash Punctuation 98
 
1.7%
Other Punctuation 36
 
0.6%
Uppercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
413
 
12.3%
245
 
7.3%
244
 
7.3%
230
 
6.8%
228
 
6.8%
226
 
6.7%
198
 
5.9%
157
 
4.7%
94
 
2.8%
85
 
2.5%
Other values (141) 1245
37.0%
Decimal Number
ValueCountFrequency (%)
1 225
24.5%
2 127
13.8%
3 110
12.0%
4 89
 
9.7%
5 79
 
8.6%
6 69
 
7.5%
7 64
 
7.0%
0 54
 
5.9%
9 51
 
5.6%
8 49
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 34
94.4%
. 2
 
5.6%
Uppercase Letter
ValueCountFrequency (%)
B 2
66.7%
A 1
33.3%
Space Separator
ValueCountFrequency (%)
917
100.0%
Open Punctuation
ValueCountFrequency (%)
( 173
100.0%
Close Punctuation
ValueCountFrequency (%)
) 173
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 98
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3365
59.2%
Common 2314
40.7%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
413
 
12.3%
245
 
7.3%
244
 
7.3%
230
 
6.8%
228
 
6.8%
226
 
6.7%
198
 
5.9%
157
 
4.7%
94
 
2.8%
85
 
2.5%
Other values (141) 1245
37.0%
Common
ValueCountFrequency (%)
917
39.6%
1 225
 
9.7%
( 173
 
7.5%
) 173
 
7.5%
2 127
 
5.5%
3 110
 
4.8%
- 98
 
4.2%
4 89
 
3.8%
5 79
 
3.4%
6 69
 
3.0%
Other values (6) 254
 
11.0%
Latin
ValueCountFrequency (%)
B 2
66.7%
A 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3365
59.2%
ASCII 2317
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
917
39.6%
1 225
 
9.7%
( 173
 
7.5%
) 173
 
7.5%
2 127
 
5.5%
3 110
 
4.7%
- 98
 
4.2%
4 89
 
3.8%
5 79
 
3.4%
6 69
 
3.0%
Other values (8) 257
 
11.1%
Hangul
ValueCountFrequency (%)
413
 
12.3%
245
 
7.3%
244
 
7.3%
230
 
6.8%
228
 
6.8%
226
 
6.7%
198
 
5.9%
157
 
4.7%
94
 
2.8%
85
 
2.5%
Other values (141) 1245
37.0%

소재지전화번호
Text

MISSING 

Distinct137
Distinct (%)97.2%
Missing85
Missing (%)37.6%
Memory size1.9 KiB
2023-12-13T04:40:17.021383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.035461
Min length12

Characters and Unicode

Total characters1697
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique133 ?
Unique (%)94.3%

Sample

1st row053-951-2868
2nd row053-986-2387
3rd row053-984-8866
4th row070-8201-3210
5th row053-573-9981
ValueCountFrequency (%)
053-746-0092 2
 
1.4%
053-592-1313 2
 
1.4%
053-638-8988 2
 
1.4%
053-572-1543 2
 
1.4%
053-623-3500 1
 
0.7%
053-312-1400 1
 
0.7%
053-354-8292 1
 
0.7%
053-585-0768 1
 
0.7%
053-652-6569 1
 
0.7%
053-555-4763 1
 
0.7%
Other values (127) 127
90.1%
2023-12-13T04:40:17.419954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 283
16.7%
- 282
16.6%
3 268
15.8%
0 224
13.2%
2 118
7.0%
6 107
 
6.3%
1 106
 
6.2%
8 95
 
5.6%
9 80
 
4.7%
7 67
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1415
83.4%
Dash Punctuation 282
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 283
20.0%
3 268
18.9%
0 224
15.8%
2 118
8.3%
6 107
 
7.6%
1 106
 
7.5%
8 95
 
6.7%
9 80
 
5.7%
7 67
 
4.7%
4 67
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 282
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1697
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 283
16.7%
- 282
16.6%
3 268
15.8%
0 224
13.2%
2 118
7.0%
6 107
 
6.3%
1 106
 
6.2%
8 95
 
5.6%
9 80
 
4.7%
7 67
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1697
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 283
16.7%
- 282
16.6%
3 268
15.8%
0 224
13.2%
2 118
7.0%
6 107
 
6.3%
1 106
 
6.2%
8 95
 
5.6%
9 80
 
4.7%
7 67
 
3.9%

가공업구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
식육가공업
220 
알가공업
 
4
유가공업
 
2

Length

Max length5
Median length5
Mean length4.9734513
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식육가공업
2nd row식육가공업
3rd row식육가공업
4th row식육가공업
5th row식육가공업

Common Values

ValueCountFrequency (%)
식육가공업 220
97.3%
알가공업 4
 
1.8%
유가공업 2
 
0.9%

Length

2023-12-13T04:40:17.570399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:40:17.723917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식육가공업 220
97.3%
알가공업 4
 
1.8%
유가공업 2
 
0.9%

영업상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size1.9 KiB
정상
226 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정상
2nd row정상
3rd row정상
4th row정상
5th row정상

Common Values

ValueCountFrequency (%)
정상 226
100.0%

Length

2023-12-13T04:40:17.844466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:40:17.943691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상 226
100.0%

Interactions

2023-12-13T04:40:13.994963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:40:18.014269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호가공업구분
번호1.0000.204
가공업구분0.2041.000
2023-12-13T04:40:18.104757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호가공업구분
번호1.0000.115
가공업구분0.1151.000

Missing values

2023-12-13T04:40:14.131896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:40:14.307034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호사업장명소재지소재지전화번호가공업구분영업상태
01제일푸드(JEIL FOOD)대구광역시 동구 반야월북로 6-4 (율암동)053-951-2868식육가공업정상
12두민푸드대구광역시 동구 도평로43길 11 (도동)053-986-2387식육가공업정상
23글로벌식품대구광역시 동구 방천로1길 105 (지저동)053-984-8866식육가공업정상
34(주)대원미트대구광역시 동구 신덕로6길 26 (신평동)070-8201-3210식육가공업정상
45(주)유림씨엔에프대구광역시 서구 국채보상로 38-20 (중리동)053-573-9981식육가공업정상
56(주)대홍 농업회사법인대구광역시 서구 와룡로73길 11 (중리동)053-526-9998식육가공업정상
67종국이푸드대구광역시 서구 평리로35길 13-18 (중리동)053-565-9998식육가공업정상
78농업회사법인(주)대풍디앤에프대구광역시 서구 평리로35길 90-5 (중리동)053-525-4447식육가공업정상
89(주)세원피앤피대구광역시 서구 와룡로 458-6 (이현동)053-562-0099식육가공업정상
910늘푸른식품대구광역시 남구 앞산순환로 732-5 (봉덕동)053-471-6022식육가공업정상
번호사업장명소재지소재지전화번호가공업구분영업상태
216217주식회사 맛잽이식품대구광역시 동구 화랑로108길 6-1, 10-2동 (용계동)053-551-0631식육가공업정상
217218(주)자모대구광역시 달성군 현풍읍 현풍서로 106<NA>알가공업정상
218219(주)융화식품대구광역시 동구 입석로 97-20 (검사동)053-985-3957식육가공업정상
219220광진막창유통대구광역시 서구 가르뱅이로20길 20-1 (상리동)053-526-1194식육가공업정상
220221참마음식품대구광역시 서구 북비산로17길 15 (이현동)053-353-3665식육가공업정상
221222삼촌식품대구광역시 북구 학정로 95-20 (태전동)<NA>식육가공업정상
222223거봉종합식품대구광역시 북구 침산남로9길 179-9 (침산동)053-354-3129식육가공업정상
223224(주)한스미트대구광역시 달성군 다사읍 서재로12길 39053-593-2330식육가공업정상
224225행복하계 협동조합대구광역시 서구 팔달로2길 25, 1층 (비산동)<NA>식육가공업정상
225226옛날막창대구광역시 북구 관음중앙로22길 6-15 (관음동)053-321-6386식육가공업정상