Overview

Dataset statistics

Number of variables: 4
Number of observations: 2165
Missing cells: 1
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 69.9 KiB
Average record size in memory: 33.1 B
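
The percentage and per-record figures above follow from the raw counts. The following is a minimal sketch of that arithmetic, assuming the missing-cell share is taken over all cells (rows × columns) and that 1 KiB = 1024 bytes; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 4
n_observations = 2165
missing_cells = 1
total_size_kib = 69.9

# Assumption: the missing-cell share is computed over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share is far below the 0.1% display threshold, and the per-record size rounds to the reported value.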

Variable types

Numeric: 1
Categorical: 1
Text: 2

Dataset

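Description: Data from the city/province administrative system (livestock), released in response to an information disclosure request, listing the business names and locations of meat packaging and processing businesses (식육포장처리업), instant meat sales and processing businesses (식육즉석판매가공업), and meat retailers (식육판매업) within Daejeon Metropolitan City.
Author: Daejeon Metropolitan City (대전광역시)
URL: https://www.data.go.kr/data/15098551/fileData.do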

Alerts

연번 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 16:11:20.845757
Analysis finished: 2023-12-12 16:11:21.741974
Duration: 0.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
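
A report of this kind can be regenerated from the source file with ydata-profiling. The sketch below is one possible way to do it, not the exact procedure used here: the file path, output name, report title, description text, and the CP949 encoding are all assumptions, and the imports are deferred so the function can be defined even where pandas and ydata-profiling are not installed.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (illustrative sketch)."""
    # Imported lazily so that defining the function does not require the packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is CP949-encoded, which is common for Korean public data.
    df = pd.read_csv(csv_path, encoding="cp949")

    profile = ProfileReport(
        df,
        title="Daejeon meat business listing",  # hypothetical title
        dataset={
            # Hypothetical short description; the report carries the full text.
            "description": "Meat business names and locations in Daejeon",
            "author": "대전광역시",
            "url": "https://www.data.go.kr/data/15098551/fileData.do",
        },
    )
    profile.to_file(out_path)
```

Calling `build_report("meat_businesses.csv")` with a local copy of the data (the file name is hypothetical) would write `report.html` to the working directory.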

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 2165
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1083
Minimum: 1
Maximum: 2165
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 19.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 109.2
Q1: 542
Median: 1083
Q3: 1624
95-th percentile: 2056.8
Maximum: 2165
Range: 2164
Interquartile range (IQR): 1082

Descriptive statistics

Standard deviation: 625.12599
Coefficient of variation (CV): 0.57721698
Kurtosis: -1.2
Mean: 1083
Median Absolute Deviation (MAD): 541
Skewness: 0
Sum: 2344695
Variance: 390782.5
Monotonicity: Strictly increasing
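
These descriptive statistics match what one would expect from a plain running index. As a hedged check, the sketch below recomputes them for the integers 1 to N with N = 2165, assuming the variance uses the sample (N − 1) denominator; for that sequence the sum is N(N+1)/2 and the sample variance is N(N+1)/12.

```python
import math
import statistics

# Assumption: 연번 is the sequence 1, 2, ..., 2165.
values = range(1, 2166)
n = len(values)

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, (n - 1) denominator
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median(abs(v - median) for v in values)

# Strictly increasing means every value is smaller than its successor.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(total, mean, variance, round(std, 5), round(cv, 8), mad, strictly_increasing)
```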
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | < 0.1%
1440 | 1 | < 0.1%
1454 | 1 | < 0.1%
1453 | 1 | < 0.1%
1452 | 1 | < 0.1%
1451 | 1 | < 0.1%
1450 | 1 | < 0.1%
1449 | 1 | < 0.1%
1448 | 1 | < 0.1%
1447 | 1 | < 0.1%
Other values (2155) | 2155 | 99.5%

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Value | Count | Frequency (%)
2165 | 1 | < 0.1%
2164 | 1 | < 0.1%
2163 | 1 | < 0.1%
2162 | 1 | < 0.1%
2161 | 1 | < 0.1%
2160 | 1 | < 0.1%
2159 | 1 | < 0.1%
2158 | 1 | < 0.1%
2157 | 1 | < 0.1%
2156 | 1 | < 0.1%

구분
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 17.0 KiB
식육판매업: 1032
식육즉석판매가공업: 667
식육포장처리업: 319
축산물유통전문판매업: 147

Length

Max length: 10
Median length: 9
Mean length: 6.8665127
Min length: 5
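
The length statistics can be cross-checked against the four category labels and their counts. The sketch below is illustrative and assumes length is measured in Unicode code points, which is what Python's `len` returns. The minimum, maximum, and mean agree with the figures above, while a median computed directly from the counts comes out as 7 rather than the reported 9, so the tool may derive that figure differently.

```python
import statistics

# Category counts copied from the report.
counts = {
    "식육판매업": 1032,
    "식육즉석판매가공업": 667,
    "식육포장처리업": 319,
    "축산물유통전문판매업": 147,
}

total_rows = sum(counts.values())

# Assumption: length is the number of code points in each label.
min_length = min(len(label) for label in counts)
max_length = max(len(label) for label in counts)
mean_length = sum(len(label) * c for label, c in counts.items()) / total_rows

# Expand to one length per row to take a row-level median.
row_lengths = [len(label) for label, c in counts.items() for _ in range(c)]
median_length = statistics.median(row_lengths)

print(total_rows, min_length, max_length, round(mean_length, 7), median_length)
```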

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 식육포장처리업
2nd row: 식육포장처리업
3rd row: 식육포장처리업
4th row: 식육포장처리업
5th row: 식육포장처리업

Common Values

Value | Count | Frequency (%)
식육판매업 | 1032 | 47.7%
식육즉석판매가공업 | 667 | 30.8%
식육포장처리업 | 319 | 14.7%
축산물유통전문판매업 | 147 | 6.8%

Length

Histogram of lengths of the category

Common Values (Plot)


사업장명
Text

Distinct: 1833
Distinct (%): 84.7%
Missing: 0
Missing (%): 0.0%
Memory size: 17.0 KiB

Length

Max length: 22
Median length: 20
Mean length: 6.6549654
Min length: 2

Characters and Unicode

Total characters: 14408
Distinct characters: 538
Distinct categories: 10
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
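
To illustrate how such character properties are read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using the five sample values of this variable as input. The `category_counts` helper and the mapping from two-letter category codes to long names are illustrative and not taken from the profiling tool.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
    "So": "Other Symbol",
}


def category_counts(texts):
    """Count characters per Unicode general category across all texts."""
    counter = Counter()
    for text in texts:
        for ch in text:
            code = unicodedata.category(ch)
            counter[CATEGORY_NAMES.get(code, code)] += 1
    return counter


sample = ["경우식품", "평화푸드", "(주)엠피대산", "대산포크 직영판매장", "주식회사 모소육가공"]
counts = category_counts(sample)

for name, count in counts.most_common():
    print(name, count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates both text variables in this report.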

Unique

Unique: 1573
Unique (%): 72.7%
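
"Distinct" and "Unique" measure different things: the former counts how many different values occur, the latter how many values occur exactly once. That reading is an assumption about the tool's definitions, but it is consistent with the figures here. The sketch below shows the distinction on a hypothetical toy column and recomputes the reported percentages from the counts.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique): different values, and values seen exactly once."""
    freq = Counter(values)
    distinct = len(freq)
    unique = sum(1 for count in freq.values() if count == 1)
    return distinct, unique


def as_percent(count, total):
    """Share of `count` in `total`, rounded to one decimal place."""
    return round(100 * count / total, 1)


# Hypothetical toy column: "A" and "D" occur once, "B" and "C" repeat.
names = ["A", "B", "B", "C", "C", "C", "D"]
distinct, unique = distinct_and_unique(names)

# Percentages recomputed from the counts reported for this variable.
distinct_pct = as_percent(1833, 2165)
unique_pct = as_percent(1573, 2165)

print(distinct, unique, distinct_pct, unique_pct)
```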

Sample

1st row: 경우식품
2nd row: 평화푸드
3rd row: (주)엠피대산
4th row: 대산포크 직영판매장
5th row: 주식회사 모소육가공
Value | Count | Frequency (%)
주식회사 | 114 | 4.1%
정육점 | 63 | 2.3%
농업회사법인 | 47 | 1.7%
정육 | 34 | 1.2%
축산 | 16 | 0.6%
푸드 | 15 | 0.5%
미트 | 11 | 0.4%
육가공 | 9 | 0.3%
정육코너 | 9 | 0.3%
하나로마트 | 8 | 0.3%
Other values (1933) | 2442 | 88.2%

Most occurring characters

ValueCountFrequency (%)
603
 
4.2%
588
 
4.1%
562
 
3.9%
554
 
3.8%
480
 
3.3%
476
 
3.3%
473
 
3.3%
348
 
2.4%
) 333
 
2.3%
( 333
 
2.3%
Other values (528) 9658
67.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 12964 | 90.0%
Space Separator | 603 | 4.2%
Close Punctuation | 333 | 2.3%
Open Punctuation | 333 | 2.3%
Uppercase Letter | 101 | 0.7%
Decimal Number | 38 | 0.3%
Lowercase Letter | 25 | 0.2%
Other Punctuation | 9 | 0.1%
Math Symbol | 1 | < 0.1%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
588
 
4.5%
562
 
4.3%
554
 
4.3%
480
 
3.7%
476
 
3.7%
473
 
3.6%
348
 
2.7%
269
 
2.1%
255
 
2.0%
250
 
1.9%
Other values (477) 8709
67.2%
Uppercase Letter
Value | Count | Frequency (%)
S | 13 | 12.9%
K | 12 | 11.9%
O | 12 | 11.9%
F | 10 | 9.9%
C | 8 | 7.9%
T | 6 | 5.9%
G | 5 | 5.0%
D | 5 | 5.0%
A | 4 | 4.0%
H | 4 | 4.0%
Other values (10) | 22 | 21.8%

Lowercase Letter
Value | Count | Frequency (%)
e | 4 | 16.0%
t | 4 | 16.0%
n | 3 | 12.0%
h | 2 | 8.0%
i | 2 | 8.0%
a | 2 | 8.0%
m | 1 | 4.0%
r | 1 | 4.0%
s | 1 | 4.0%
f | 1 | 4.0%
Other values (4) | 4 | 16.0%

Decimal Number
Value | Count | Frequency (%)
2 | 11 | 28.9%
1 | 7 | 18.4%
3 | 5 | 13.2%
5 | 4 | 10.5%
6 | 4 | 10.5%
0 | 3 | 7.9%
8 | 2 | 5.3%
4 | 1 | 2.6%
9 | 1 | 2.6%

Other Punctuation
Value | Count | Frequency (%)
& | 7 | 77.8%
· | 1 | 11.1%
. | 1 | 11.1%

Space Separator
Value | Count | Frequency (%)
(space) | 603 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 333 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 333 | 100.0%

Math Symbol
Value | Count | Frequency (%)
+ | 1 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 12961 | 90.0%
Common | 1318 | 9.1%
Latin | 126 | 0.9%
Han | 3 | < 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
588
 
4.5%
562
 
4.3%
554
 
4.3%
480
 
3.7%
476
 
3.7%
473
 
3.6%
348
 
2.7%
269
 
2.1%
255
 
2.0%
250
 
1.9%
Other values (474) 8706
67.2%
Latin
Value | Count | Frequency (%)
S | 13 | 10.3%
K | 12 | 9.5%
O | 12 | 9.5%
F | 10 | 7.9%
C | 8 | 6.3%
T | 6 | 4.8%
G | 5 | 4.0%
D | 5 | 4.0%
e | 4 | 3.2%
t | 4 | 3.2%
Other values (24) | 47 | 37.3%

Common
Value | Count | Frequency (%)
(space) | 603 | 45.8%
) | 333 | 25.3%
( | 333 | 25.3%
2 | 11 | 0.8%
& | 7 | 0.5%
1 | 7 | 0.5%
3 | 5 | 0.4%
5 | 4 | 0.3%
6 | 4 | 0.3%
0 | 3 | 0.2%
Other values (7) | 8 | 0.6%

Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 12961 | 90.0%
ASCII | 1443 | 10.0%
CJK | 3 | < 0.1%
None | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 603 | 41.8%
) | 333 | 23.1%
( | 333 | 23.1%
S | 13 | 0.9%
K | 12 | 0.8%
O | 12 | 0.8%
2 | 11 | 0.8%
F | 10 | 0.7%
C | 8 | 0.6%
& | 7 | 0.5%
Other values (40) | 101 | 7.0%

Hangul
ValueCountFrequency (%)
588
 
4.5%
562
 
4.3%
554
 
4.3%
480
 
3.7%
476
 
3.7%
473
 
3.6%
348
 
2.7%
269
 
2.1%
255
 
2.0%
250
 
1.9%
Other values (474) 8706
67.2%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
None
Value | Count | Frequency (%)
· | 1 | 100.0%

소재지
Text

Distinct: 2011
Distinct (%): 92.9%
Missing: 1
Missing (%): < 0.1%
Memory size: 17.0 KiB

Length

Max length: 48
Median length: 45
Mean length: 23.759704
Min length: 4

Characters and Unicode

Total characters: 51416
Distinct characters: 324
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
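
The character totals tie back to the mean lengths of the two text variables. The reported means agree with dividing total characters by the number of non-missing rows, which is an inference from the numbers rather than a documented rule; the sketch below recomputes them from figures copied out of this report.

```python
# Figures copied from the report for the two text variables.
n_rows = 2165
text_stats = {
    "사업장명": {"total_chars": 14408, "missing": 0},
    "소재지": {"total_chars": 51416, "missing": 1},
}

# Assumption: missing rows are excluded from the denominator.
mean_lengths = {
    name: stats["total_chars"] / (n_rows - stats["missing"])
    for name, stats in text_stats.items()
}

for name, mean_length in mean_lengths.items():
    print(name, round(mean_length, 6))
```

Dividing the 소재지 total by all 2165 rows instead would give a slightly lower mean than the reported value, which is why the single missing row appears to be left out.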

Unique

Unique: 1898
Unique (%): 87.7%

Sample

1st row: 대전광역시 동구 가오동 403번지 10호
2nd row: 대전광역시 동구 인동 78번지 18호
3rd row: 대전광역시 동구 용운동 432번지 1호
4th row: 대전광역시 동구 용운동 432번지 6호 101호
5th row: 대전광역시 동구 용운동 496번지 7호
Value | Count | Frequency (%)
대전광역시 | 2153 | 20.0%
서구 | 540 | 5.0%
중구 | 426 | 4.0%
대덕구 | 406 | 3.8%
유성구 | 404 | 3.8%
동구 | 379 | 3.5%
1호 | 250 | 2.3%
오정동 | 155 | 1.4%
2호 | 129 | 1.2%
3호 | 122 | 1.1%
Other values (1316) | 5791 | 53.8%

Most occurring characters

ValueCountFrequency (%)
12370
24.1%
2709
 
5.3%
2610
 
5.1%
2236
 
4.3%
2197
 
4.3%
2178
 
4.2%
2166
 
4.2%
2159
 
4.2%
2154
 
4.2%
2142
 
4.2%
Other values (314) 18495
36.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 29946 | 58.2%
Space Separator | 12370 | 24.1%
Decimal Number | 9009 | 17.5%
Uppercase Letter | 34 | 0.1%
Other Punctuation | 26 | 0.1%
Dash Punctuation | 10 | < 0.1%
Close Punctuation | 9 | < 0.1%
Open Punctuation | 9 | < 0.1%
Math Symbol | 2 | < 0.1%
Other Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2709
 
9.0%
2610
 
8.7%
2236
 
7.5%
2197
 
7.3%
2178
 
7.3%
2166
 
7.2%
2159
 
7.2%
2154
 
7.2%
2142
 
7.2%
1793
 
6.0%
Other values (283) 7602
25.4%
Uppercase Letter
Value | Count | Frequency (%)
B | 12 | 35.3%
C | 7 | 20.6%
A | 4 | 11.8%
K | 2 | 5.9%
L | 2 | 5.9%
H | 2 | 5.9%
J | 1 | 2.9%
M | 1 | 2.9%
P | 1 | 2.9%
S | 1 | 2.9%

Decimal Number
Value | Count | Frequency (%)
1 | 1890 | 21.0%
2 | 1051 | 11.7%
3 | 1038 | 11.5%
4 | 870 | 9.7%
5 | 789 | 8.8%
6 | 764 | 8.5%
0 | 748 | 8.3%
8 | 654 | 7.3%
7 | 630 | 7.0%
9 | 575 | 6.4%

Other Punctuation
Value | Count | Frequency (%)
@ | 13 | 50.0%
, | 8 | 30.8%
· | 4 | 15.4%
. | 1 | 3.8%

Space Separator
Value | Count | Frequency (%)
(space) | 12370 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 10 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 9 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 9 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 2 | 100.0%

Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 29947 | 58.2%
Common | 21435 | 41.7%
Latin | 34 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2709
 
9.0%
2610
 
8.7%
2236
 
7.5%
2197
 
7.3%
2178
 
7.3%
2166
 
7.2%
2159
 
7.2%
2154
 
7.2%
2142
 
7.2%
1793
 
6.0%
Other values (284) 7603
25.4%
Common
Value | Count | Frequency (%)
(space) | 12370 | 57.7%
1 | 1890 | 8.8%
2 | 1051 | 4.9%
3 | 1038 | 4.8%
4 | 870 | 4.1%
5 | 789 | 3.7%
6 | 764 | 3.6%
0 | 748 | 3.5%
8 | 654 | 3.1%
7 | 630 | 2.9%
Other values (9) | 631 | 2.9%

Latin
Value | Count | Frequency (%)
B | 12 | 35.3%
C | 7 | 20.6%
A | 4 | 11.8%
K | 2 | 5.9%
L | 2 | 5.9%
H | 2 | 5.9%
J | 1 | 2.9%
M | 1 | 2.9%
P | 1 | 2.9%
S | 1 | 2.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 29946 | 58.2%
ASCII | 21465 | 41.7%
None | 5 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 12370 | 57.6%
1 | 1890 | 8.8%
2 | 1051 | 4.9%
3 | 1038 | 4.8%
4 | 870 | 4.1%
5 | 789 | 3.7%
6 | 764 | 3.6%
0 | 748 | 3.5%
8 | 654 | 3.0%
7 | 630 | 2.9%
Other values (19) | 661 | 3.1%

Hangul
ValueCountFrequency (%)
2709
 
9.0%
2610
 
8.7%
2236
 
7.5%
2197
 
7.3%
2178
 
7.3%
2166
 
7.2%
2159
 
7.2%
2154
 
7.2%
2142
 
7.2%
1793
 
6.0%
Other values (283) 7602
25.4%
None
ValueCountFrequency (%)
· 4
80.0%
1
 
20.0%

Interactions


Correlations

Variable | 연번 | 구분
연번 | 1.000 | 0.941
구분 | 0.941 | 1.000

Variable | 연번 | 구분
연번 | 1.000 | 0.856
구분 | 0.856 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 구분 | 사업장명 | 소재지
0 | 1 | 식육포장처리업 | 경우식품 | 대전광역시 동구 가오동 403번지 10호
1 | 2 | 식육포장처리업 | 평화푸드 | 대전광역시 동구 인동 78번지 18호
2 | 3 | 식육포장처리업 | (주)엠피대산 | 대전광역시 동구 용운동 432번지 1호
3 | 4 | 식육포장처리업 | 대산포크 직영판매장 | 대전광역시 동구 용운동 432번지 6호 101호
4 | 5 | 식육포장처리업 | 주식회사 모소육가공 | 대전광역시 동구 용운동 496번지 7호
5 | 6 | 식육포장처리업 | 하나로축산 | 대전광역시 동구 가양동 153번지 9호
6 | 7 | 식육포장처리업 | 늘품축산도매센터 | 대전광역시 동구 가양동 32번지 18호
7 | 8 | 식육포장처리업 | 푸드텍농업회사법인(주) | 대전광역시 동구 가양동 403번지 1호 지하1층
8 | 9 | 식육포장처리업 | 라온 육가공 | 대전광역시 동구 가양동 149번지 46호 1층
9 | 10 | 식육포장처리업 | (주)에프앤씨 | 대전광역시 동구 가양동 418번지 11호

(index) | 연번 | 구분 | 사업장명 | 소재지
2155 | 2156 | 식육즉석판매가공업 | 신선마트정육 | 대전광역시 대덕구 덕암동 65번지 15호
2156 | 2157 | 식육즉석판매가공업 | 넘버원 축산 | 대전광역시 대덕구 목상동 150번지 4호
2157 | 2158 | 식육즉석판매가공업 | 한마당마트 정육 | 대전광역시 대덕구 목상동 185번지 2호
2158 | 2159 | 식육즉석판매가공업 | 목상축산물유통정육점 | 대전광역시 대덕구 목상동 185번지 6호
2159 | 2160 | 식육즉석판매가공업 | 신신마트 | 대전광역시 대덕구 목상동 197번지
2160 | 2161 | 식육즉석판매가공업 | 진영푸줏간 | 대전광역시 대덕구 목상동 858번지 7호
2161 | 2162 | 식육즉석판매가공업 | 조은축산 11번가 | 대전광역시 대덕구 읍내동 54번지 현대아파트
2162 | 2163 | 식육즉석판매가공업 | 플러스 축산 | 대전광역시 대덕구 대화동 37번지 11호
2163 | 2164 | 식육즉석판매가공업 | 상인축산 | 대전광역시 대덕구 송촌동 496번지 2호
2164 | 2165 | 식육즉석판매가공업 | 한우리축산 | 대전광역시 대덕구 석봉동 770번지 금강엑슬루타워