Overview

Dataset statistics

Number of variables5
Number of observations1030
Missing cells0
Missing cells (%)0.0%
Duplicate rows150
Duplicate rows (%)14.6%
Total size in memory41.4 KiB
Average record size in memory41.1 B

Variable types

Text3
Categorical1
Numeric1

Dataset

Description창원시 관내의 식품제조가공업소에 대한 데이터로 회사명, 도로명주소, 식품유형 등 식품제조가공회사에 대한 현황을 공개함
Author경상남도 창원시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15021956

Alerts

업종 has constant value ""Constant
Dataset has 150 (14.6%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-11 00:46:14.997604
Analysis finished2023-12-11 00:46:15.639948
Duration0.64 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct262
Distinct (%)25.4%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
2023-12-11T09:46:15.912922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length21
Mean length7.4058252
Min length2

Characters and Unicode

Total characters7628
Distinct characters332
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique62 ?
Unique (%)6.0%

Sample

1st row(주)가야식품
2nd row(주)가야식품
3rd row(주)경남씨푸드빌
4th row(주)경남씨푸드빌
5th row(주)곰내마을
ValueCountFrequency (%)
주식회사 21
 
1.7%
농업회사법인(주)가고파힐링푸드 20
 
1.6%
동서유지(주 19
 
1.6%
분부골 18
 
1.5%
순금이네 18
 
1.5%
국화된장 18
 
1.5%
장원식품 17
 
1.4%
울엄마된장 17
 
1.4%
농업회사법인 16
 
1.3%
합천식품 15
 
1.2%
Other values (278) 1038
85.3%
2023-12-11T09:46:16.386989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
358
 
4.7%
349
 
4.6%
270
 
3.5%
) 267
 
3.5%
261
 
3.4%
( 261
 
3.4%
222
 
2.9%
208
 
2.7%
205
 
2.7%
187
 
2.5%
Other values (322) 5040
66.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6721
88.1%
Close Punctuation 267
 
3.5%
Open Punctuation 261
 
3.4%
Space Separator 187
 
2.5%
Uppercase Letter 113
 
1.5%
Lowercase Letter 31
 
0.4%
Decimal Number 30
 
0.4%
Other Punctuation 18
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
358
 
5.3%
349
 
5.2%
270
 
4.0%
261
 
3.9%
222
 
3.3%
208
 
3.1%
205
 
3.1%
178
 
2.6%
170
 
2.5%
150
 
2.2%
Other values (288) 4350
64.7%
Uppercase Letter
ValueCountFrequency (%)
F 28
24.8%
B 15
13.3%
R 8
 
7.1%
S 8
 
7.1%
O 7
 
6.2%
D 7
 
6.2%
C 7
 
6.2%
E 6
 
5.3%
N 5
 
4.4%
T 4
 
3.5%
Other values (8) 18
15.9%
Lowercase Letter
ValueCountFrequency (%)
e 8
25.8%
c 7
22.6%
n 7
22.6%
f 3
 
9.7%
a 2
 
6.5%
m 1
 
3.2%
t 1
 
3.2%
o 1
 
3.2%
v 1
 
3.2%
Decimal Number
ValueCountFrequency (%)
2 24
80.0%
1 6
 
20.0%
Other Punctuation
ValueCountFrequency (%)
& 16
88.9%
' 2
 
11.1%
Close Punctuation
ValueCountFrequency (%)
) 267
100.0%
Open Punctuation
ValueCountFrequency (%)
( 261
100.0%
Space Separator
ValueCountFrequency (%)
187
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6721
88.1%
Common 763
 
10.0%
Latin 144
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
358
 
5.3%
349
 
5.2%
270
 
4.0%
261
 
3.9%
222
 
3.3%
208
 
3.1%
205
 
3.1%
178
 
2.6%
170
 
2.5%
150
 
2.2%
Other values (288) 4350
64.7%
Latin
ValueCountFrequency (%)
F 28
19.4%
B 15
 
10.4%
R 8
 
5.6%
e 8
 
5.6%
S 8
 
5.6%
c 7
 
4.9%
O 7
 
4.9%
D 7
 
4.9%
C 7
 
4.9%
n 7
 
4.9%
Other values (17) 42
29.2%
Common
ValueCountFrequency (%)
) 267
35.0%
( 261
34.2%
187
24.5%
2 24
 
3.1%
& 16
 
2.1%
1 6
 
0.8%
' 2
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6721
88.1%
ASCII 907
 
11.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
358
 
5.3%
349
 
5.2%
270
 
4.0%
261
 
3.9%
222
 
3.3%
208
 
3.1%
205
 
3.1%
178
 
2.6%
170
 
2.5%
150
 
2.2%
Other values (288) 4350
64.7%
ASCII
ValueCountFrequency (%)
) 267
29.4%
( 261
28.8%
187
20.6%
F 28
 
3.1%
2 24
 
2.6%
& 16
 
1.8%
B 15
 
1.7%
R 8
 
0.9%
e 8
 
0.9%
S 8
 
0.9%
Other values (24) 85
 
9.4%
Distinct261
Distinct (%)25.3%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
2023-12-11T09:46:16.789877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length45
Mean length31.882524
Min length22

Characters and Unicode

Total characters32839
Distinct characters242
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique62 ?
Unique (%)6.0%

Sample

1st row경상남도 창원시 의창구 대산면 대산북로 362(주건축물제1동 1층)
2nd row경상남도 창원시 의창구 대산면 대산북로 362(주건축물제1동 1층)
3rd row경상남도 창원시 진해구 웅천동로49번길 100(남문동)
4th row경상남도 창원시 진해구 웅천동로49번길 100(남문동)
5th row경상남도 창원시 진해구 웅천중로65번길 4(1층 성내동)
ValueCountFrequency (%)
경상남도 1030
 
16.6%
창원시 1030
 
16.6%
마산합포구 382
 
6.2%
의창구 313
 
5.0%
마산회원구 152
 
2.5%
진해구 137
 
2.2%
진북면 90
 
1.5%
대산면 72
 
1.2%
동읍 63
 
1.0%
진동면 61
 
1.0%
Other values (588) 2870
46.3%
2023-12-11T09:46:17.356985image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5172
 
15.7%
1 1429
 
4.4%
1376
 
4.2%
1252
 
3.8%
1222
 
3.7%
1216
 
3.7%
1117
 
3.4%
1091
 
3.3%
1076
 
3.3%
1042
 
3.2%
Other values (232) 16846
51.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20633
62.8%
Space Separator 5172
 
15.7%
Decimal Number 4918
 
15.0%
Open Punctuation 761
 
2.3%
Close Punctuation 761
 
2.3%
Dash Punctuation 278
 
0.8%
Other Punctuation 253
 
0.8%
Uppercase Letter 40
 
0.1%
Math Symbol 17
 
0.1%
Lowercase Letter 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1376
 
6.7%
1252
 
6.1%
1222
 
5.9%
1216
 
5.9%
1117
 
5.4%
1091
 
5.3%
1076
 
5.2%
1042
 
5.1%
880
 
4.3%
859
 
4.2%
Other values (202) 9502
46.1%
Decimal Number
ValueCountFrequency (%)
1 1429
29.1%
2 703
14.3%
3 503
 
10.2%
0 393
 
8.0%
7 351
 
7.1%
4 342
 
7.0%
6 315
 
6.4%
8 311
 
6.3%
5 299
 
6.1%
9 272
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
B 16
40.0%
F 7
17.5%
J 7
17.5%
A 4
 
10.0%
H 2
 
5.0%
C 2
 
5.0%
E 1
 
2.5%
M 1
 
2.5%
Other Punctuation
ValueCountFrequency (%)
, 237
93.7%
. 8
 
3.2%
& 5
 
2.0%
· 3
 
1.2%
Lowercase Letter
ValueCountFrequency (%)
h 2
33.3%
e 2
33.3%
t 2
33.3%
Space Separator
ValueCountFrequency (%)
5172
100.0%
Open Punctuation
ValueCountFrequency (%)
( 761
100.0%
Close Punctuation
ValueCountFrequency (%)
) 761
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 278
100.0%
Math Symbol
ValueCountFrequency (%)
~ 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20633
62.8%
Common 12160
37.0%
Latin 46
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1376
 
6.7%
1252
 
6.1%
1222
 
5.9%
1216
 
5.9%
1117
 
5.4%
1091
 
5.3%
1076
 
5.2%
1042
 
5.1%
880
 
4.3%
859
 
4.2%
Other values (202) 9502
46.1%
Common
ValueCountFrequency (%)
5172
42.5%
1 1429
 
11.8%
( 761
 
6.3%
) 761
 
6.3%
2 703
 
5.8%
3 503
 
4.1%
0 393
 
3.2%
7 351
 
2.9%
4 342
 
2.8%
6 315
 
2.6%
Other values (9) 1430
 
11.8%
Latin
ValueCountFrequency (%)
B 16
34.8%
F 7
15.2%
J 7
15.2%
A 4
 
8.7%
H 2
 
4.3%
C 2
 
4.3%
h 2
 
4.3%
e 2
 
4.3%
t 2
 
4.3%
E 1
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20633
62.8%
ASCII 12203
37.2%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5172
42.4%
1 1429
 
11.7%
( 761
 
6.2%
) 761
 
6.2%
2 703
 
5.8%
3 503
 
4.1%
0 393
 
3.2%
7 351
 
2.9%
4 342
 
2.8%
6 315
 
2.6%
Other values (19) 1473
 
12.1%
Hangul
ValueCountFrequency (%)
1376
 
6.7%
1252
 
6.1%
1222
 
5.9%
1216
 
5.9%
1117
 
5.4%
1091
 
5.3%
1076
 
5.2%
1042
 
5.1%
880
 
4.3%
859
 
4.2%
Other values (202) 9502
46.1%
None
ValueCountFrequency (%)
· 3
100.0%

업종
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
식품제조가공업
1030 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식품제조가공업
2nd row식품제조가공업
3rd row식품제조가공업
4th row식품제조가공업
5th row식품제조가공업

Common Values

ValueCountFrequency (%)
식품제조가공업 1030
100.0%

Length

2023-12-11T09:46:17.523718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:46:17.633783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식품제조가공업 1030
100.0%
Distinct152
Distinct (%)14.8%
Missing0
Missing (%)0.0%
Memory size8.2 KiB
2023-12-11T09:46:17.880221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length12
Mean length3.9970874
Min length1

Characters and Unicode

Total characters4117
Distinct characters179
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)4.8%

Sample

1st row곡류가공품
2nd row두류가공품
3rd row기타 수산물가공품
4th row조미건어포
5th row
ValueCountFrequency (%)
수산물가공품 70
 
6.5%
커피 68
 
6.3%
기타 52
 
4.8%
젓갈 41
 
3.8%
기타가공품 41
 
3.8%
절임식품 35
 
3.2%
액상차 31
 
2.9%
즉석조리식품 29
 
2.7%
액젓 26
 
2.4%
소스 26
 
2.4%
Other values (145) 666
61.4%
2023-12-11T09:46:18.297460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
304
 
7.4%
226
 
5.5%
206
 
5.0%
162
 
3.9%
150
 
3.6%
133
 
3.2%
120
 
2.9%
106
 
2.6%
98
 
2.4%
98
 
2.4%
Other values (169) 2514
61.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4013
97.5%
Space Separator 55
 
1.3%
Other Punctuation 35
 
0.9%
Close Punctuation 6
 
0.1%
Open Punctuation 6
 
0.1%
Decimal Number 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
304
 
7.6%
226
 
5.6%
206
 
5.1%
162
 
4.0%
150
 
3.7%
133
 
3.3%
120
 
3.0%
106
 
2.6%
98
 
2.4%
98
 
2.4%
Other values (164) 2410
60.1%
Space Separator
ValueCountFrequency (%)
55
100.0%
Other Punctuation
ValueCountFrequency (%)
. 35
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Decimal Number
ValueCountFrequency (%)
1 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4013
97.5%
Common 104
 
2.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
304
 
7.6%
226
 
5.6%
206
 
5.1%
162
 
4.0%
150
 
3.7%
133
 
3.3%
120
 
3.0%
106
 
2.6%
98
 
2.4%
98
 
2.4%
Other values (164) 2410
60.1%
Common
ValueCountFrequency (%)
55
52.9%
. 35
33.7%
) 6
 
5.8%
( 6
 
5.8%
1 2
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4013
97.5%
ASCII 104
 
2.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
304
 
7.6%
226
 
5.6%
206
 
5.1%
162
 
4.0%
150
 
3.7%
133
 
3.3%
120
 
3.0%
106
 
2.6%
98
 
2.4%
98
 
2.4%
Other values (164) 2410
60.1%
ASCII
ValueCountFrequency (%)
55
52.9%
. 35
33.7%
) 6
 
5.8%
( 6
 
5.8%
1 2
 
1.9%

운영품목수
Real number (ℝ)

Distinct56
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.1932039
Minimum1
Maximum219
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.2 KiB
2023-12-11T09:46:18.445459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median2
Q35
95-th percentile24.55
Maximum219
Range218
Interquartile range (IQR)4

Descriptive statistics

Standard deviation14.551692
Coefficient of variation (CV)2.3496227
Kurtosis72.402992
Mean6.1932039
Median Absolute Deviation (MAD)1
Skewness7.2078735
Sum6379
Variance211.75175
MonotonicityNot monotonic
2023-12-11T09:46:18.566037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 444
43.1%
2 166
 
16.1%
3 74
 
7.2%
4 60
 
5.8%
5 50
 
4.9%
6 36
 
3.5%
7 25
 
2.4%
9 15
 
1.5%
16 15
 
1.5%
8 14
 
1.4%
Other values (46) 131
 
12.7%
ValueCountFrequency (%)
1 444
43.1%
2 166
 
16.1%
3 74
 
7.2%
4 60
 
5.8%
5 50
 
4.9%
6 36
 
3.5%
7 25
 
2.4%
8 14
 
1.4%
9 15
 
1.5%
10 13
 
1.3%
ValueCountFrequency (%)
219 1
0.1%
148 1
0.1%
145 1
0.1%
115 1
0.1%
110 1
0.1%
105 1
0.1%
98 1
0.1%
91 1
0.1%
85 1
0.1%
69 1
0.1%

Interactions

2023-12-11T09:46:15.407186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T09:46:15.519753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:46:15.604192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명영업소재지업종품목유형운영품목수
0(주)가야식품경상남도 창원시 의창구 대산면 대산북로 362(주건축물제1동 1층)식품제조가공업곡류가공품5
1(주)가야식품경상남도 창원시 의창구 대산면 대산북로 362(주건축물제1동 1층)식품제조가공업두류가공품2
2(주)경남씨푸드빌경상남도 창원시 진해구 웅천동로49번길 100(남문동)식품제조가공업기타 수산물가공품16
3(주)경남씨푸드빌경상남도 창원시 진해구 웅천동로49번길 100(남문동)식품제조가공업조미건어포1
4(주)곰내마을경상남도 창원시 진해구 웅천중로65번길 4(1층 성내동)식품제조가공업1
5(주)그린밀푸드경상남도 창원시 의창구 북면 천주로577번길 10-15(2동 1층)식품제조가공업떡류85
6(주)나눔푸드경상남도 창원시 의창구 동읍 신방로39번길 57(1층)식품제조가공업기타 수산물가공품4
7(주)나눔푸드경상남도 창원시 의창구 동읍 신방로39번길 57(1층)식품제조가공업김치1
8(주)나눔푸드경상남도 창원시 의창구 동읍 신방로39번길 57(1층)식품제조가공업당절임2
9(주)나눔푸드경상남도 창원시 의창구 동읍 신방로39번길 57(1층)식품제조가공업소스10
업체명영업소재지업종품목유형운영품목수
1020해진식품경상남도 창원시 진해구 충장로603번길 7(1층 풍호동)식품제조가공업양념젓갈2
1021해진식품경상남도 창원시 진해구 충장로603번길 7(1층 풍호동)식품제조가공업양념젓갈6
1022해진식품경상남도 창원시 진해구 충장로603번길 7(1층 풍호동)식품제조가공업천연향신료1
1023해진식품경상남도 창원시 진해구 충장로603번길 7(1층 풍호동)식품제조가공업천연향신료1
1024햇살푸드경상남도 창원시 마산합포구 진동면 교동3길 170(1층)식품제조가공업고춧가루8
1025햇살푸드경상남도 창원시 마산합포구 진동면 교동3길 170(1층)식품제조가공업향신료조제품1
1026호기로스터스경상남도 창원시 의창구 사림로158번길 20(1층 102-1호 사림동)식품제조가공업커피1
1027호끼린커피로스터스경상남도 창원시 진해구 청안로 251(3층 안골동)식품제조가공업커피2
1028희망이룸경상남도 창원시 의창구 사림로 53(한우프라자 지하1층 사림동)식품제조가공업액상차2
1029희망이룸경상남도 창원시 의창구 사림로 53(한우프라자 지하1층 사림동)식품제조가공업커피1

Duplicate rows

Most frequently occurring

업체명영업소재지업종품목유형운영품목수# duplicates
0(주)대영수산식품경상남도 창원시 마산합포구 구산면 안녕로 255(외4필지,지상1층)식품제조가공업젓갈12
1(주)마산푸드경상남도 창원시 마산합포구 수산2길 67(1~3층 신포동1가)식품제조가공업즉석조리식품12
2(주)시민푸드경상남도 창원시 진해구 석동로59번길 24-1(1층 석동)식품제조가공업절임식품12
3(주)아이언크로스커피경상남도 창원시 마산합포구 진동면 요장1길 33(상가동 1층 101-103호)식품제조가공업커피12
4(주)제이스경상남도 창원시 의창구 동읍 자여로42번길 11식품제조가공업즉석섭취식품102
5(주)제일냉장경상남도 창원시 마산합포구 수산1길 238(오동동)식품제조가공업식용얼음22
6(주)제일냉장경상남도 창원시 마산합포구 수산1길 238(오동동)식품제조가공업어업용얼음12
7(주)한울푸드경상남도 창원시 마산합포구 진북면 진북산업로 352(1,2층)식품제조가공업조미김72
8가야식품경상남도 창원시 마산회원구 내서읍 신평안길 11-1(지상1층)식품제조가공업어육반제품12
9감천골블루베리농원경상남도 창원시 마산회원구 내서읍 광려로 421-133(지상1층)식품제조가공업발효식초12