Overview

Dataset statistics

Number of variables5
Number of observations269
Missing cells80
Missing cells (%)5.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.9 KiB
Average record size in memory41.5 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description대구광역시_축산가공업체_20210318
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15042998&dataSetDetailId=150429981c47919d79222&provdMethod=FILE

Alerts

업종 is highly imbalanced (89.0%)Imbalance
업소전화번호 has 80 (29.7%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 18:30:44.960169
Analysis finished2023-12-10 18:30:46.060275
Duration1.1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct269
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean135
Minimum1
Maximum269
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.5 KiB
2023-12-11T03:30:46.214470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile14.4
Q168
median135
Q3202
95-th percentile255.6
Maximum269
Range268
Interquartile range (IQR)134

Descriptive statistics

Standard deviation77.797815
Coefficient of variation (CV)0.57628011
Kurtosis-1.2
Mean135
Median Absolute Deviation (MAD)67
Skewness0
Sum36315
Variance6052.5
MonotonicityStrictly increasing
2023-12-11T03:30:46.472631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
186 1
 
0.4%
172 1
 
0.4%
173 1
 
0.4%
174 1
 
0.4%
175 1
 
0.4%
176 1
 
0.4%
177 1
 
0.4%
178 1
 
0.4%
179 1
 
0.4%
Other values (259) 259
96.3%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
269 1
0.4%
268 1
0.4%
267 1
0.4%
266 1
0.4%
265 1
0.4%
264 1
0.4%
263 1
0.4%
262 1
0.4%
261 1
0.4%
260 1
0.4%
Distinct267
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-11T03:30:46.888212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length6.0297398
Min length2

Characters and Unicode

Total characters1622
Distinct characters281
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique265 ?
Unique (%)98.5%

Sample

1st row(주)가온푸드
2nd row(주)거인축산
3rd row(주)국보푸드시스템
4th row(주)달구지푸드
5th row(주)대영냉장
ValueCountFrequency (%)
주식회사 14
 
4.6%
농업회사법인 6
 
2.0%
에이스식품 2
 
0.7%
풀토래(주 2
 
0.7%
주)진우식품 2
 
0.7%
food 2
 
0.7%
주)라자트푸드 2
 
0.7%
옛날막창 1
 
0.3%
영진유통 1
 
0.3%
영화푸드 1
 
0.3%
Other values (274) 274
89.3%
2023-12-11T03:30:47.480486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
88
 
5.4%
85
 
5.2%
81
 
5.0%
78
 
4.8%
( 67
 
4.1%
) 67
 
4.1%
66
 
4.1%
38
 
2.3%
29
 
1.8%
25
 
1.5%
Other values (271) 998
61.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1390
85.7%
Open Punctuation 67
 
4.1%
Close Punctuation 67
 
4.1%
Uppercase Letter 53
 
3.3%
Space Separator 38
 
2.3%
Other Punctuation 3
 
0.2%
Decimal Number 2
 
0.1%
Lowercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
88
 
6.3%
85
 
6.1%
81
 
5.8%
78
 
5.6%
66
 
4.7%
29
 
2.1%
25
 
1.8%
25
 
1.8%
24
 
1.7%
23
 
1.7%
Other values (245) 866
62.3%
Uppercase Letter
ValueCountFrequency (%)
S 9
17.0%
D 7
13.2%
F 6
11.3%
O 4
 
7.5%
J 4
 
7.5%
C 3
 
5.7%
G 3
 
5.7%
E 2
 
3.8%
I 2
 
3.8%
B 2
 
3.8%
Other values (8) 11
20.8%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
8 1
50.0%
Lowercase Letter
ValueCountFrequency (%)
f 1
50.0%
c 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 67
100.0%
Close Punctuation
ValueCountFrequency (%)
) 67
100.0%
Space Separator
ValueCountFrequency (%)
38
100.0%
Other Punctuation
ValueCountFrequency (%)
& 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1390
85.7%
Common 177
 
10.9%
Latin 55
 
3.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
88
 
6.3%
85
 
6.1%
81
 
5.8%
78
 
5.6%
66
 
4.7%
29
 
2.1%
25
 
1.8%
25
 
1.8%
24
 
1.7%
23
 
1.7%
Other values (245) 866
62.3%
Latin
ValueCountFrequency (%)
S 9
16.4%
D 7
12.7%
F 6
10.9%
O 4
 
7.3%
J 4
 
7.3%
C 3
 
5.5%
G 3
 
5.5%
E 2
 
3.6%
I 2
 
3.6%
B 2
 
3.6%
Other values (10) 13
23.6%
Common
ValueCountFrequency (%)
( 67
37.9%
) 67
37.9%
38
21.5%
& 3
 
1.7%
1 1
 
0.6%
8 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1390
85.7%
ASCII 232
 
14.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
88
 
6.3%
85
 
6.1%
81
 
5.8%
78
 
5.6%
66
 
4.7%
29
 
2.1%
25
 
1.8%
25
 
1.8%
24
 
1.7%
23
 
1.7%
Other values (245) 866
62.3%
ASCII
ValueCountFrequency (%)
( 67
28.9%
) 67
28.9%
38
16.4%
S 9
 
3.9%
D 7
 
3.0%
F 6
 
2.6%
O 4
 
1.7%
J 4
 
1.7%
& 3
 
1.3%
C 3
 
1.3%
Other values (16) 24
 
10.3%

업종
Categorical

IMBALANCE 

Distinct3
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
축산물가공업-식육가공업
263 
축산물가공업-알가공업
 
4
축산물가공업-유가공업
 
2

Length

Max length12
Median length12
Mean length11.977695
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row축산물가공업-식육가공업
2nd row축산물가공업-식육가공업
3rd row축산물가공업-식육가공업
4th row축산물가공업-식육가공업
5th row축산물가공업-식육가공업

Common Values

ValueCountFrequency (%)
축산물가공업-식육가공업 263
97.8%
축산물가공업-알가공업 4
 
1.5%
축산물가공업-유가공업 2
 
0.7%

Length

2023-12-11T03:30:47.661140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T03:30:47.796520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
축산물가공업-식육가공업 263
97.8%
축산물가공업-알가공업 4
 
1.5%
축산물가공업-유가공업 2
 
0.7%
Distinct266
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size2.2 KiB
2023-12-11T03:30:48.310694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length35
Mean length25.018587
Min length19

Characters and Unicode

Total characters6730
Distinct characters173
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique263 ?
Unique (%)97.8%

Sample

1st row대구광역시 북구 노원로47길 17 (침산동)
2nd row대구광역시 서구 서대구로6길 6 (내당동)
3rd row대구광역시 북구 연암로42길 5 (산격동)
4th row대구광역시 달성군 논공읍 농공공단길 10
5th row대구광역시 달성군 논공읍 비슬로264길 46
ValueCountFrequency (%)
대구광역시 269
 
19.6%
북구 84
 
6.1%
달성군 59
 
4.3%
서구 37
 
2.7%
동구 36
 
2.6%
달서구 29
 
2.1%
다사읍 16
 
1.2%
노원동3가 15
 
1.1%
옥포읍 15
 
1.1%
논공읍 13
 
0.9%
Other values (500) 802
58.3%
2023-12-11T03:30:49.150381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1106
 
16.4%
492
 
7.3%
297
 
4.4%
275
 
4.1%
273
 
4.1%
273
 
4.1%
269
 
4.0%
1 261
 
3.9%
237
 
3.5%
) 208
 
3.1%
Other values (163) 3039
45.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3981
59.2%
Space Separator 1106
 
16.4%
Decimal Number 1081
 
16.1%
Close Punctuation 208
 
3.1%
Open Punctuation 208
 
3.1%
Dash Punctuation 114
 
1.7%
Other Punctuation 29
 
0.4%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
492
 
12.4%
297
 
7.5%
275
 
6.9%
273
 
6.9%
273
 
6.9%
269
 
6.8%
237
 
6.0%
192
 
4.8%
109
 
2.7%
104
 
2.6%
Other values (146) 1460
36.7%
Decimal Number
ValueCountFrequency (%)
1 261
24.1%
2 145
13.4%
3 132
12.2%
4 111
10.3%
5 94
 
8.7%
6 87
 
8.0%
7 81
 
7.5%
9 61
 
5.6%
0 59
 
5.5%
8 50
 
4.6%
Uppercase Letter
ValueCountFrequency (%)
A 2
66.7%
B 1
33.3%
Space Separator
ValueCountFrequency (%)
1106
100.0%
Close Punctuation
ValueCountFrequency (%)
) 208
100.0%
Open Punctuation
ValueCountFrequency (%)
( 208
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 114
100.0%
Other Punctuation
ValueCountFrequency (%)
, 29
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3981
59.2%
Common 2746
40.8%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
492
 
12.4%
297
 
7.5%
275
 
6.9%
273
 
6.9%
273
 
6.9%
269
 
6.8%
237
 
6.0%
192
 
4.8%
109
 
2.7%
104
 
2.6%
Other values (146) 1460
36.7%
Common
ValueCountFrequency (%)
1106
40.3%
1 261
 
9.5%
) 208
 
7.6%
( 208
 
7.6%
2 145
 
5.3%
3 132
 
4.8%
- 114
 
4.2%
4 111
 
4.0%
5 94
 
3.4%
6 87
 
3.2%
Other values (5) 280
 
10.2%
Latin
ValueCountFrequency (%)
A 2
66.7%
B 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3981
59.2%
ASCII 2749
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1106
40.2%
1 261
 
9.5%
) 208
 
7.6%
( 208
 
7.6%
2 145
 
5.3%
3 132
 
4.8%
- 114
 
4.1%
4 111
 
4.0%
5 94
 
3.4%
6 87
 
3.2%
Other values (7) 283
 
10.3%
Hangul
ValueCountFrequency (%)
492
 
12.4%
297
 
7.5%
275
 
6.9%
273
 
6.9%
273
 
6.9%
269
 
6.8%
237
 
6.0%
192
 
4.8%
109
 
2.7%
104
 
2.6%
Other values (146) 1460
36.7%

업소전화번호
Text

MISSING 

Distinct187
Distinct (%)98.9%
Missing80
Missing (%)29.7%
Memory size2.2 KiB
2023-12-11T03:30:49.570410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.058201
Min length12

Characters and Unicode

Total characters2279
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique185 ?
Unique (%)97.9%

Sample

1st row053-351-9200
2nd row053-527-4355
3rd row053-525-9326
4th row053-593-5910
5th row053-631-0805
ValueCountFrequency (%)
053-572-1543 2
 
1.1%
053-746-0092 2
 
1.1%
053-617-1097 1
 
0.5%
053-616-6191 1
 
0.5%
053-321-6386 1
 
0.5%
053-551-0631 1
 
0.5%
053-354-6286 1
 
0.5%
053-351-9200 1
 
0.5%
053-1577-6805 1
 
0.5%
053-423-4041 1
 
0.5%
Other values (177) 177
93.7%
2023-12-11T03:30:50.222178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 388
17.0%
- 378
16.6%
3 349
15.3%
0 318
14.0%
1 141
 
6.2%
6 139
 
6.1%
2 138
 
6.1%
8 133
 
5.8%
9 112
 
4.9%
7 98
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1901
83.4%
Dash Punctuation 378
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 388
20.4%
3 349
18.4%
0 318
16.7%
1 141
 
7.4%
6 139
 
7.3%
2 138
 
7.3%
8 133
 
7.0%
9 112
 
5.9%
7 98
 
5.2%
4 85
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 378
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2279
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 388
17.0%
- 378
16.6%
3 349
15.3%
0 318
14.0%
1 141
 
6.2%
6 139
 
6.1%
2 138
 
6.1%
8 133
 
5.8%
9 112
 
4.9%
7 98
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2279
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 388
17.0%
- 378
16.6%
3 349
15.3%
0 318
14.0%
1 141
 
6.2%
6 139
 
6.1%
2 138
 
6.1%
8 133
 
5.8%
9 112
 
4.9%
7 98
 
4.3%

Interactions

2023-12-11T03:30:45.493101image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T03:30:50.388535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.405
업종0.4051.000
2023-12-11T03:30:50.536700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.263
업종0.2631.000

Missing values

2023-12-11T03:30:45.705435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T03:30:45.920923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업소명업종업소주소업소전화번호
01(주)가온푸드축산물가공업-식육가공업대구광역시 북구 노원로47길 17 (침산동)053-351-9200
12(주)거인축산축산물가공업-식육가공업대구광역시 서구 서대구로6길 6 (내당동)053-527-4355
23(주)국보푸드시스템축산물가공업-식육가공업대구광역시 북구 연암로42길 5 (산격동)053-525-9326
34(주)달구지푸드축산물가공업-식육가공업대구광역시 달성군 논공읍 농공공단길 10053-593-5910
45(주)대영냉장축산물가공업-식육가공업대구광역시 달성군 논공읍 비슬로264길 46053-631-0805
56(주)대원미트축산물가공업-식육가공업대구광역시 동구 신덕로6길 26 (신평동)070-8201-3210
67(주)대홍 농업회사법인축산물가공업-식육가공업대구광역시 서구 와룡로73길 11 (중리동)053-526-9998
78(주)더푸드축산물가공업-식육가공업대구광역시 북구 노원로10길 74, 1층 (노원동2가)<NA>
89(주)도야지식품축산물가공업-식육가공업대구광역시 달성군 유가읍 테크노중앙대로 20, (주)도야지식품053-558-2345
910(주)라자트푸드축산물가공업-식육가공업대구광역시 중구 국채보상로 736-10 (동인동4가)<NA>
연번업소명업종업소주소업소전화번호
259260행복하계(동구)축산물가공업-식육가공업대구광역시 동구 반야월로 271-36 (각산동)053-986-0826
260261행복한상상홍이에프앤비축산물가공업-식육가공업대구광역시 달서구 와룡로45길 72-10053-1800-8192
261262현미트축산물가공업-식육가공업대구광역시 북구 연암로42길 37 (산격동)<NA>
262263호로록 맛집축산물가공업-식육가공업대구광역시 남구 명덕로14길 54 (대명동)<NA>
263264(주)비락 대구공장축산물가공업-유가공업대구광역시 달성군 논공읍 논공로 465053-615-0720
264265(주)푸르밀축산물가공업-유가공업대구광역시 달성군 논공읍 논공중앙로 350053-614-8488
265266(주)자모축산물가공업-알가공업대구광역시 달성군 현풍읍 현풍서로 106053-617-1097
266267십리골양계축산물가공업-알가공업대구광역시 북구 복현로36길 25 (복현동)053-382-6309
267268오복유통축산물가공업-알가공업대구광역시 동구 평화로 73-1 (신암동)053-958-5820
268269파인식품축산물가공업-알가공업대구광역시 달성군 화원읍 성화로4길 3053-638-8988