Overview

Dataset statistics

Number of variables6
Number of observations465
Missing cells106
Missing cells (%)3.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory22.4 KiB
Average record size in memory49.3 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description성남시내 식품제조가공업(식품제조가공업, 식품첨가물제조업) 현황에 대한 자료이며, 업종명, 업소명,소재지,전화번호 항목으로 구성되어 있습니다.
URLhttps://www.data.go.kr/data/15054245/fileData.do

Alerts

데이터 기준일자 has constant value ""Constant
연번 is highly overall correlated with 업종High correlation
업종 is highly overall correlated with 연번High correlation
소재지전화번호 has 106 (22.8%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 22:06:55.461304
Analysis finished2023-12-12 22:06:56.181483
Duration0.72 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct465
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean233
Minimum1
Maximum465
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.2 KiB
2023-12-13T07:06:56.256067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile24.2
Q1117
median233
Q3349
95-th percentile441.8
Maximum465
Range464
Interquartile range (IQR)232

Descriptive statistics

Standard deviation134.3782
Coefficient of variation (CV)0.57673046
Kurtosis-1.2
Mean233
Median Absolute Deviation (MAD)116
Skewness0
Sum108345
Variance18057.5
MonotonicityStrictly increasing
2023-12-13T07:06:56.410910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
307 1
 
0.2%
319 1
 
0.2%
318 1
 
0.2%
317 1
 
0.2%
316 1
 
0.2%
315 1
 
0.2%
314 1
 
0.2%
313 1
 
0.2%
312 1
 
0.2%
Other values (455) 455
97.8%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
465 1
0.2%
464 1
0.2%
463 1
0.2%
462 1
0.2%
461 1
0.2%
460 1
0.2%
459 1
0.2%
458 1
0.2%
457 1
0.2%
456 1
0.2%

업종
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
식품제조가공업
405 
식품첨가물제조업
60 

Length

Max length8
Median length7
Mean length7.1290323
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식품제조가공업
2nd row식품제조가공업
3rd row식품제조가공업
4th row식품제조가공업
5th row식품제조가공업

Common Values

ValueCountFrequency (%)
식품제조가공업 405
87.1%
식품첨가물제조업 60
 
12.9%

Length

2023-12-13T07:06:56.588455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:06:56.728214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식품제조가공업 405
87.1%
식품첨가물제조업 60
 
12.9%
Distinct442
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-13T07:06:56.982741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length7.9096774
Min length2

Characters and Unicode

Total characters3678
Distinct characters419
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique419 ?
Unique (%)90.1%

Sample

1st row(주)샤니
2nd row(주)개미식품
3rd row명성제과
4th row(주)맘모스제과
5th row(주)평화식품
ValueCountFrequency (%)
주식회사 75
 
12.0%
성남공장 6
 
1.0%
2공장 5
 
0.8%
제2공장 5
 
0.8%
에스제이디 4
 
0.6%
센트럴키친 4
 
0.6%
농업회사법인 4
 
0.6%
커피 4
 
0.6%
주)비전바이오켐 4
 
0.6%
제이비케이랩 3
 
0.5%
Other values (473) 511
81.8%
2023-12-13T07:06:57.514927image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
269
 
7.3%
( 200
 
5.4%
) 200
 
5.4%
160
 
4.4%
135
 
3.7%
113
 
3.1%
109
 
3.0%
94
 
2.6%
85
 
2.3%
69
 
1.9%
Other values (409) 2244
61.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2951
80.2%
Open Punctuation 200
 
5.4%
Close Punctuation 200
 
5.4%
Space Separator 160
 
4.4%
Uppercase Letter 81
 
2.2%
Decimal Number 33
 
0.9%
Lowercase Letter 33
 
0.9%
Other Punctuation 18
 
0.5%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
269
 
9.1%
135
 
4.6%
113
 
3.8%
109
 
3.7%
94
 
3.2%
85
 
2.9%
69
 
2.3%
66
 
2.2%
63
 
2.1%
49
 
1.7%
Other values (357) 1899
64.4%
Uppercase Letter
ValueCountFrequency (%)
F 11
13.6%
B 10
12.3%
C 8
9.9%
E 7
 
8.6%
L 6
 
7.4%
A 5
 
6.2%
S 5
 
6.2%
K 4
 
4.9%
J 4
 
4.9%
D 3
 
3.7%
Other values (11) 18
22.2%
Lowercase Letter
ValueCountFrequency (%)
e 5
15.2%
o 4
12.1%
n 4
12.1%
r 3
9.1%
a 3
9.1%
y 2
 
6.1%
f 2
 
6.1%
p 2
 
6.1%
u 2
 
6.1%
m 1
 
3.0%
Other values (5) 5
15.2%
Decimal Number
ValueCountFrequency (%)
2 19
57.6%
0 5
 
15.2%
3 3
 
9.1%
1 3
 
9.1%
4 1
 
3.0%
9 1
 
3.0%
8 1
 
3.0%
Other Punctuation
ValueCountFrequency (%)
& 15
83.3%
' 1
 
5.6%
1
 
5.6%
, 1
 
5.6%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 200
100.0%
Close Punctuation
ValueCountFrequency (%)
) 200
100.0%
Space Separator
ValueCountFrequency (%)
160
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2951
80.2%
Common 613
 
16.7%
Latin 114
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
269
 
9.1%
135
 
4.6%
113
 
3.8%
109
 
3.7%
94
 
3.2%
85
 
2.9%
69
 
2.3%
66
 
2.2%
63
 
2.1%
49
 
1.7%
Other values (357) 1899
64.4%
Latin
ValueCountFrequency (%)
F 11
 
9.6%
B 10
 
8.8%
C 8
 
7.0%
E 7
 
6.1%
L 6
 
5.3%
A 5
 
4.4%
e 5
 
4.4%
S 5
 
4.4%
o 4
 
3.5%
n 4
 
3.5%
Other values (26) 49
43.0%
Common
ValueCountFrequency (%)
( 200
32.6%
) 200
32.6%
160
26.1%
2 19
 
3.1%
& 15
 
2.4%
0 5
 
0.8%
3 3
 
0.5%
1 3
 
0.5%
> 1
 
0.2%
< 1
 
0.2%
Other values (6) 6
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2951
80.2%
ASCII 726
 
19.7%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
269
 
9.1%
135
 
4.6%
113
 
3.8%
109
 
3.7%
94
 
3.2%
85
 
2.9%
69
 
2.3%
66
 
2.2%
63
 
2.1%
49
 
1.7%
Other values (357) 1899
64.4%
ASCII
ValueCountFrequency (%)
( 200
27.5%
) 200
27.5%
160
22.0%
2 19
 
2.6%
& 15
 
2.1%
F 11
 
1.5%
B 10
 
1.4%
C 8
 
1.1%
E 7
 
1.0%
L 6
 
0.8%
Other values (41) 90
12.4%
None
ValueCountFrequency (%)
1
100.0%

소재지전화번호
Text

MISSING 

Distinct323
Distinct (%)90.0%
Missing106
Missing (%)22.8%
Memory size3.8 KiB
2023-12-13T07:06:57.830525image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.036212
Min length9

Characters and Unicode

Total characters4321
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique292 ?
Unique (%)81.3%

Sample

1st row031-739-2113
2nd row031-741-2202
3rd row031-751-5664
4th row031-741-2000
5th row031-731-4405
ValueCountFrequency (%)
031-737-9570 4
 
1.1%
031-737-5289 3
 
0.8%
031-737-6223 3
 
0.8%
031-707-0223 3
 
0.8%
031-639-6655 2
 
0.6%
031-745-0080 2
 
0.6%
070-4171-0389 2
 
0.6%
070-7727-6764 2
 
0.6%
031-712-1217 2
 
0.6%
031-736-5255 2
 
0.6%
Other values (313) 334
93.0%
2023-12-13T07:06:58.271036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 737
17.1%
- 709
16.4%
7 529
12.2%
3 528
12.2%
1 489
11.3%
2 285
 
6.6%
5 257
 
5.9%
8 212
 
4.9%
4 212
 
4.9%
6 206
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3612
83.6%
Dash Punctuation 709
 
16.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 737
20.4%
7 529
14.6%
3 528
14.6%
1 489
13.5%
2 285
 
7.9%
5 257
 
7.1%
8 212
 
5.9%
4 212
 
5.9%
6 206
 
5.7%
9 157
 
4.3%
Dash Punctuation
ValueCountFrequency (%)
- 709
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4321
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 737
17.1%
- 709
16.4%
7 529
12.2%
3 528
12.2%
1 489
11.3%
2 285
 
6.6%
5 257
 
5.9%
8 212
 
4.9%
4 212
 
4.9%
6 206
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4321
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 737
17.1%
- 709
16.4%
7 529
12.2%
3 528
12.2%
1 489
11.3%
2 285
 
6.6%
5 257
 
5.9%
8 212
 
4.9%
4 212
 
4.9%
6 206
 
4.8%
Distinct452
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-12-13T07:06:58.540153image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length62
Mean length46.670968
Min length24

Characters and Unicode

Total characters21702
Distinct characters234
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique439 ?
Unique (%)94.4%

Sample

1st row경기도 성남시 중원구 둔촌대로457번길 13 (상대원동)
2nd row경기도 성남시 중원구 둔촌대로 477, 1~4층, 5층 일부 (상대원동)
3rd row경기도 성남시 수정구 탄리로126번길 3-10, 지층,1층 (태평동)
4th row경기도 성남시 중원구 둔촌대로526번길 8 (상대원동, (주)맘모스제과 1동 3층, 2동)
5th row경기도 성남시 중원구 둔촌대로541번길 51 (상대원동, 1.2.3층)
ValueCountFrequency (%)
경기도 465
 
11.6%
성남시 465
 
11.6%
중원구 390
 
9.8%
상대원동 384
 
9.6%
사기막골로 78
 
2.0%
갈마치로 72
 
1.8%
둔촌대로 62
 
1.6%
일부 58
 
1.5%
1층 45
 
1.1%
분당구 43
 
1.1%
Other values (716) 1935
48.4%
2023-12-13T07:06:58.961879image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3535
 
16.3%
1 904
 
4.2%
780
 
3.6%
, 669
 
3.1%
629
 
2.9%
598
 
2.8%
0 566
 
2.6%
546
 
2.5%
545
 
2.5%
2 513
 
2.4%
Other values (224) 12417
57.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12177
56.1%
Decimal Number 3793
 
17.5%
Space Separator 3535
 
16.3%
Other Punctuation 675
 
3.1%
Open Punctuation 501
 
2.3%
Close Punctuation 501
 
2.3%
Uppercase Letter 389
 
1.8%
Lowercase Letter 55
 
0.3%
Dash Punctuation 52
 
0.2%
Math Symbol 23
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
780
 
6.4%
629
 
5.2%
598
 
4.9%
546
 
4.5%
545
 
4.5%
507
 
4.2%
478
 
3.9%
467
 
3.8%
466
 
3.8%
465
 
3.8%
Other values (186) 6696
55.0%
Uppercase Letter
ValueCountFrequency (%)
B 134
34.4%
S 64
16.5%
K 59
15.2%
A 40
 
10.3%
T 19
 
4.9%
V 14
 
3.6%
I 14
 
3.6%
F 11
 
2.8%
O 10
 
2.6%
D 10
 
2.6%
Other values (4) 14
 
3.6%
Decimal Number
ValueCountFrequency (%)
1 904
23.8%
0 566
14.9%
2 513
13.5%
4 398
10.5%
3 366
9.6%
5 363
9.6%
7 192
 
5.1%
8 191
 
5.0%
6 183
 
4.8%
9 117
 
3.1%
Lowercase Letter
ValueCountFrequency (%)
n 30
54.5%
w 5
 
9.1%
o 5
 
9.1%
t 5
 
9.1%
r 5
 
9.1%
e 5
 
9.1%
Other Punctuation
ValueCountFrequency (%)
, 669
99.1%
. 6
 
0.9%
Space Separator
ValueCountFrequency (%)
3535
100.0%
Open Punctuation
ValueCountFrequency (%)
( 501
100.0%
Close Punctuation
ValueCountFrequency (%)
) 501
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%
Math Symbol
ValueCountFrequency (%)
~ 23
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12177
56.1%
Common 9081
41.8%
Latin 444
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
780
 
6.4%
629
 
5.2%
598
 
4.9%
546
 
4.5%
545
 
4.5%
507
 
4.2%
478
 
3.9%
467
 
3.8%
466
 
3.8%
465
 
3.8%
Other values (186) 6696
55.0%
Latin
ValueCountFrequency (%)
B 134
30.2%
S 64
14.4%
K 59
13.3%
A 40
 
9.0%
n 30
 
6.8%
T 19
 
4.3%
V 14
 
3.2%
I 14
 
3.2%
F 11
 
2.5%
O 10
 
2.3%
Other values (10) 49
 
11.0%
Common
ValueCountFrequency (%)
3535
38.9%
1 904
 
10.0%
, 669
 
7.4%
0 566
 
6.2%
2 513
 
5.6%
( 501
 
5.5%
) 501
 
5.5%
4 398
 
4.4%
3 366
 
4.0%
5 363
 
4.0%
Other values (8) 765
 
8.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12177
56.1%
ASCII 9524
43.9%
Enclosed Alphanum 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3535
37.1%
1 904
 
9.5%
, 669
 
7.0%
0 566
 
5.9%
2 513
 
5.4%
( 501
 
5.3%
) 501
 
5.3%
4 398
 
4.2%
3 366
 
3.8%
5 363
 
3.8%
Other values (27) 1208
 
12.7%
Hangul
ValueCountFrequency (%)
780
 
6.4%
629
 
5.2%
598
 
4.9%
546
 
4.5%
545
 
4.5%
507
 
4.2%
478
 
3.9%
467
 
3.8%
466
 
3.8%
465
 
3.8%
Other values (186) 6696
55.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%

데이터 기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
2023-06-01
465 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-01
2nd row2023-06-01
3rd row2023-06-01
4th row2023-06-01
5th row2023-06-01

Common Values

ValueCountFrequency (%)
2023-06-01 465
100.0%

Length

2023-12-13T07:06:59.090906image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:06:59.175295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-01 465
100.0%

Interactions

2023-12-13T07:06:55.877511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:06:59.224732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.988
업종0.9881.000
2023-12-13T07:06:59.319937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.897
업종0.8971.000

Missing values

2023-12-13T07:06:56.019536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:06:56.137189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종업소명소재지전화번호소재지(도로명)데이터 기준일자
01식품제조가공업(주)샤니031-739-2113경기도 성남시 중원구 둔촌대로457번길 13 (상대원동)2023-06-01
12식품제조가공업(주)개미식품031-741-2202경기도 성남시 중원구 둔촌대로 477, 1~4층, 5층 일부 (상대원동)2023-06-01
23식품제조가공업명성제과031-751-5664경기도 성남시 수정구 탄리로126번길 3-10, 지층,1층 (태평동)2023-06-01
34식품제조가공업(주)맘모스제과031-741-2000경기도 성남시 중원구 둔촌대로526번길 8 (상대원동, (주)맘모스제과 1동 3층, 2동)2023-06-01
45식품제조가공업(주)평화식품031-731-4405경기도 성남시 중원구 둔촌대로541번길 51 (상대원동, 1.2.3층)2023-06-01
56식품제조가공업씨제이 씨푸드 주식회사031-730-9180경기도 성남시 중원구 둔촌대로388번길 32 (상대원동)2023-06-01
67식품제조가공업(주)파리크라상031-740-5537경기도 성남시 중원구 사기막골로31번길 18 (상대원동)2023-06-01
78식품제조가공업(주)동원F&B031-744-9601경기도 성남시 중원구 둔촌대로388번길 14 (상대원동)2023-06-01
89식품제조가공업강동식품02-3401-1691경기도 성남시 중원구 갈마치로244번길 31, 현대아이밸리 414호 (상대원동)2023-06-01
910식품제조가공업(주)서울식연031-704-4927경기도 성남시 분당구 판교로 700, D동 309호 (야탑동, 분당테크노파크)2023-06-01
연번업종업소명소재지전화번호소재지(도로명)데이터 기준일자
455456식품첨가물제조업오트로 패밀리아02-6404-4473경기도 성남시 중원구 사기막골로45번길 14, 성남 우림 라이온스밸리2차 B동 B302호 일부호 (상대원동)2023-06-01
456457식품첨가물제조업주식회사 조이스파이스<NA>경기도 성남시 중원구 갈마치로244번길 31, 현대아이밸리 403호 (상대원동)2023-06-01
457458식품첨가물제조업주식회사 황초원<NA>경기도 성남시 중원구 갈마치로 314, 성남 센트럴비즈타워 1 922호 일부호 (상대원동)2023-06-01
458459식품첨가물제조업플랜에이031-776-0540경기도 성남시 중원구 사기막골로 124, SKn테크노파크 테크동 1201호 (상대원동)2023-06-01
459460식품첨가물제조업(주)비전바이오켐 2공장031-737-9570경기도 성남시 중원구 갈마치로244번길 31, 현대아이밸리 205호 일부 (상대원동)2023-06-01
460461식품첨가물제조업가비스코퍼레이션031-639-6655경기도 성남시 중원구 갈마치로 314, 816호 일부, 817호 일부 (상대원동)2023-06-01
461462식품첨가물제조업(주)백광시엔에스031-704-1851경기도 성남시 중원구 사기막골로 99, 908호 일부 (상대원동)2023-06-01
462463식품첨가물제조업그득그득컴퍼니<NA>경기도 성남시 중원구 둔촌대로388번길 24, 성남 우림라이온스밸리3차 1305호 (상대원동)2023-06-01
463464식품첨가물제조업주식회사 더헬스랩발효031-732-8236경기도 성남시 중원구 둔촌대로 550, 1층 일부 (상대원동)2023-06-01
464465식품첨가물제조업태호향료<NA>경기도 성남시 중원구 갈마치로288번길 24, 노벨테크노타워 7층 702호 일부 (상대원동)2023-06-01