Overview

Dataset statistics

Number of variables9
Number of observations41
Missing cells15
Missing cells (%)4.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.0 KiB
Average record size in memory75.2 B

Variable types

Categorical1
DateTime3
Text5

Dataset

Description경기도 여주시 관내 식품위생업 중 옹기류 및 용기포장지 제조업현황 정보(업종명, 허가일자, 업소명, 소재지, 영업장면적, 전화번호, 영업자시작일자, 소재지시작일)를 제공합니다
Author경기도 여주시
URLhttps://www.data.go.kr/data/15038677/fileData.do

Alerts

소재지전화 has 15 (36.6%) missing valuesMissing
소재지 has unique valuesUnique

Reproduction

Analysis started2023-12-12 01:15:08.517101
Analysis finished2023-12-12 01:15:09.401234
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

Distinct2
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size460.0 B
용기.포장지제조업
28 
옹기류제조업
13 

Length

Max length9
Median length9
Mean length8.0487805
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row용기.포장지제조업
2nd row용기.포장지제조업
3rd row용기.포장지제조업
4th row용기.포장지제조업
5th row용기.포장지제조업

Common Values

ValueCountFrequency (%)
용기.포장지제조업 28
68.3%
옹기류제조업 13
31.7%

Length

2023-12-12T10:15:09.484827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T10:15:09.607602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
용기.포장지제조업 28
68.3%
옹기류제조업 13
31.7%
Distinct39
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size460.0 B
Minimum1982-01-19 00:00:00
Maximum2020-09-10 00:00:00
2023-12-12T10:15:09.781675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:15:10.000320image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
Distinct40
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T10:15:10.273443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length9
Mean length5.2439024
Min length2

Characters and Unicode

Total characters215
Distinct characters90
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)95.1%

Sample

1st row풍원산업사
2nd row태림기업
3rd row삼일프라스틱
4th row씨엔테크(주)
5th row한솔세라믹
ValueCountFrequency (%)
주식회사 3
 
6.7%
나의손 2
 
4.4%
이도 2
 
4.4%
오부자옹기 1
 
2.2%
화림경산업 1
 
2.2%
태림기업 1
 
2.2%
지앤세라믹 1
 
2.2%
한국알카리수 1
 
2.2%
알파세라믹 1
 
2.2%
청운 1
 
2.2%
Other values (31) 31
68.9%
2023-12-12T10:15:10.704730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
5.1%
11
 
5.1%
8
 
3.7%
7
 
3.3%
) 7
 
3.3%
( 6
 
2.8%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (80) 144
67.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 198
92.1%
Close Punctuation 7
 
3.3%
Open Punctuation 6
 
2.8%
Space Separator 4
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
5.6%
11
 
5.6%
8
 
4.0%
7
 
3.5%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
Other values (77) 132
66.7%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 198
92.1%
Common 17
 
7.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
5.6%
11
 
5.6%
8
 
4.0%
7
 
3.5%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
Other values (77) 132
66.7%
Common
ValueCountFrequency (%)
) 7
41.2%
( 6
35.3%
4
23.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 198
92.1%
ASCII 17
 
7.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
5.6%
11
 
5.6%
8
 
4.0%
7
 
3.5%
6
 
3.0%
5
 
2.5%
5
 
2.5%
5
 
2.5%
4
 
2.0%
4
 
2.0%
Other values (77) 132
66.7%
ASCII
ValueCountFrequency (%)
) 7
41.2%
( 6
35.3%
4
23.5%

소재지
Text

UNIQUE 

Distinct41
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T10:15:11.095283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length26
Mean length23.219512
Min length15

Characters and Unicode

Total characters952
Distinct characters88
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique41 ?
Unique (%)100.0%

Sample

1st row경기도 여주시 가남읍 양화로 97
2nd row경기도 여주시 흥천면 흥천로 23-13
3rd row경기도 여주시 점동면 처리2길 59-14
4th row경기도 여주시 강천면 곱새기로 24-38
5th row경기도 여주시 대신면 여양로 685, 1동 1층
ValueCountFrequency (%)
경기도 41
18.2%
여주시 41
18.2%
1층 12
 
5.3%
북내면 7
 
3.1%
강천면 5
 
2.2%
가남읍 5
 
2.2%
여양로 5
 
2.2%
흥천면 4
 
1.8%
천송동 3
 
1.3%
어영실로 3
 
1.3%
Other values (84) 99
44.0%
2023-12-12T10:15:11.656810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
186
19.5%
49
 
5.1%
1 48
 
5.0%
45
 
4.7%
43
 
4.5%
42
 
4.4%
42
 
4.4%
41
 
4.3%
26
 
2.7%
25
 
2.6%
Other values (78) 405
42.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 533
56.0%
Space Separator 186
 
19.5%
Decimal Number 167
 
17.5%
Dash Punctuation 21
 
2.2%
Other Punctuation 19
 
2.0%
Open Punctuation 10
 
1.1%
Close Punctuation 10
 
1.1%
Math Symbol 3
 
0.3%
Uppercase Letter 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
9.2%
45
 
8.4%
43
 
8.1%
42
 
7.9%
42
 
7.9%
41
 
7.7%
26
 
4.9%
25
 
4.7%
19
 
3.6%
14
 
2.6%
Other values (59) 187
35.1%
Decimal Number
ValueCountFrequency (%)
1 48
28.7%
3 22
13.2%
2 19
 
11.4%
6 15
 
9.0%
7 15
 
9.0%
5 12
 
7.2%
8 10
 
6.0%
4 10
 
6.0%
0 8
 
4.8%
9 8
 
4.8%
Uppercase Letter
ValueCountFrequency (%)
C 1
33.3%
D 1
33.3%
A 1
33.3%
Space Separator
ValueCountFrequency (%)
186
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Other Punctuation
ValueCountFrequency (%)
, 19
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 533
56.0%
Common 416
43.7%
Latin 3
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
9.2%
45
 
8.4%
43
 
8.1%
42
 
7.9%
42
 
7.9%
41
 
7.7%
26
 
4.9%
25
 
4.7%
19
 
3.6%
14
 
2.6%
Other values (59) 187
35.1%
Common
ValueCountFrequency (%)
186
44.7%
1 48
 
11.5%
3 22
 
5.3%
- 21
 
5.0%
2 19
 
4.6%
, 19
 
4.6%
6 15
 
3.6%
7 15
 
3.6%
5 12
 
2.9%
( 10
 
2.4%
Other values (6) 49
 
11.8%
Latin
ValueCountFrequency (%)
C 1
33.3%
D 1
33.3%
A 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 533
56.0%
ASCII 419
44.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
186
44.4%
1 48
 
11.5%
3 22
 
5.3%
- 21
 
5.0%
2 19
 
4.5%
, 19
 
4.5%
6 15
 
3.6%
7 15
 
3.6%
5 12
 
2.9%
( 10
 
2.4%
Other values (9) 52
 
12.4%
Hangul
ValueCountFrequency (%)
49
 
9.2%
45
 
8.4%
43
 
8.1%
42
 
7.9%
42
 
7.9%
41
 
7.7%
26
 
4.9%
25
 
4.7%
19
 
3.6%
14
 
2.6%
Other values (59) 187
35.1%
Distinct40
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T10:15:11.894329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length4.9268293
Min length1

Characters and Unicode

Total characters202
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)95.1%

Sample

1st row859.58
2nd row395.8
3rd row280.03
4th row3,525.32
5th row351.96
ValueCountFrequency (%)
0 2
 
4.9%
859.58 1
 
2.4%
198 1
 
2.4%
480 1
 
2.4%
1,073.73 1
 
2.4%
337.44 1
 
2.4%
1,979.39 1
 
2.4%
1,138.65 1
 
2.4%
364.1 1
 
2.4%
420 1
 
2.4%
Other values (30) 30
73.2%
2023-12-12T10:15:12.436291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 24
11.9%
3 23
11.4%
5 20
9.9%
1 19
9.4%
6 19
9.4%
0 18
8.9%
9 17
8.4%
2 16
7.9%
4 16
7.9%
8 11
5.4%
Other values (2) 19
9.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 169
83.7%
Other Punctuation 33
 
16.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 23
13.6%
5 20
11.8%
1 19
11.2%
6 19
11.2%
0 18
10.7%
9 17
10.1%
2 16
9.5%
4 16
9.5%
8 11
6.5%
7 10
5.9%
Other Punctuation
ValueCountFrequency (%)
. 24
72.7%
, 9
 
27.3%

Most occurring scripts

ValueCountFrequency (%)
Common 202
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
. 24
11.9%
3 23
11.4%
5 20
9.9%
1 19
9.4%
6 19
9.4%
0 18
8.9%
9 17
8.4%
2 16
7.9%
4 16
7.9%
8 11
5.4%
Other values (2) 19
9.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 202
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 24
11.9%
3 23
11.4%
5 20
9.9%
1 19
9.4%
6 19
9.4%
0 18
8.9%
9 17
8.4%
2 16
7.9%
4 16
7.9%
8 11
5.4%
Other values (2) 19
9.4%

소재지전화
Text

MISSING 

Distinct25
Distinct (%)96.2%
Missing15
Missing (%)36.6%
Memory size460.0 B
2023-12-12T10:15:12.659289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.076923
Min length11

Characters and Unicode

Total characters314
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique24 ?
Unique (%)92.3%

Sample

1st row031-882-6585
2nd row031-0881-1206
3rd row031-882-2363
4th row031-884-2646
5th row031-769-5222
ValueCountFrequency (%)
031-886-1444 2
 
7.7%
031-886-7101 1
 
3.8%
031-882-6585 1
 
3.8%
031-632-1069 1
 
3.8%
031-882-9334 1
 
3.8%
070-7770-5035 1
 
3.8%
031-797-4573 1
 
3.8%
031-408-1497 1
 
3.8%
031-885-9866 1
 
3.8%
031-884-6023 1
 
3.8%
Other values (15) 15
57.7%
2023-12-12T10:15:13.113154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 52
16.6%
8 45
14.3%
1 41
13.1%
0 39
12.4%
3 39
12.4%
4 20
 
6.4%
6 19
 
6.1%
7 18
 
5.7%
2 16
 
5.1%
5 13
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 262
83.4%
Dash Punctuation 52
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
8 45
17.2%
1 41
15.6%
0 39
14.9%
3 39
14.9%
4 20
7.6%
6 19
7.3%
7 18
 
6.9%
2 16
 
6.1%
5 13
 
5.0%
9 12
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 52
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 314
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 52
16.6%
8 45
14.3%
1 41
13.1%
0 39
12.4%
3 39
12.4%
4 20
 
6.4%
6 19
 
6.1%
7 18
 
5.7%
2 16
 
5.1%
5 13
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 314
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 52
16.6%
8 45
14.3%
1 41
13.1%
0 39
12.4%
3 39
12.4%
4 20
 
6.4%
6 19
 
6.1%
7 18
 
5.7%
2 16
 
5.1%
5 13
 
4.1%
Distinct39
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size460.0 B
Minimum1982-01-19 00:00:00
Maximum2021-05-13 00:00:00
2023-12-12T10:15:13.300558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:15:13.478136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)
Distinct22
Distinct (%)53.7%
Missing0
Missing (%)0.0%
Memory size460.0 B
2023-12-12T10:15:13.664310image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.097561
Min length5

Characters and Unicode

Total characters209
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)34.1%

Sample

1st row12661
2nd row12607
3rd row12666
4th row12616
5th row12611
ValueCountFrequency (%)
12615 8
19.5%
12636 5
12.2%
12661 3
 
7.3%
12635 3
 
7.3%
12607 2
 
4.9%
12637 2
 
4.9%
12603 2
 
4.9%
12611 2
 
4.9%
12665 1
 
2.4%
12663 1
 
2.4%
Other values (12) 12
29.3%
2023-12-12T10:15:14.010412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 57
27.3%
6 57
27.3%
2 40
19.1%
5 16
 
7.7%
3 16
 
7.7%
0 7
 
3.3%
7 5
 
2.4%
4 5
 
2.4%
9 2
 
1.0%
- 2
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 207
99.0%
Dash Punctuation 2
 
1.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 57
27.5%
6 57
27.5%
2 40
19.3%
5 16
 
7.7%
3 16
 
7.7%
0 7
 
3.4%
7 5
 
2.4%
4 5
 
2.4%
9 2
 
1.0%
8 2
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 209
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 57
27.3%
6 57
27.3%
2 40
19.1%
5 16
 
7.7%
3 16
 
7.7%
0 7
 
3.3%
7 5
 
2.4%
4 5
 
2.4%
9 2
 
1.0%
- 2
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 209
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 57
27.3%
6 57
27.3%
2 40
19.1%
5 16
 
7.7%
3 16
 
7.7%
0 7
 
3.3%
7 5
 
2.4%
4 5
 
2.4%
9 2
 
1.0%
- 2
 
1.0%
Distinct39
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Memory size460.0 B
Minimum1982-01-19 00:00:00
Maximum2020-09-10 00:00:00
2023-12-12T10:15:14.173825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T10:15:14.379279image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=39)

Correlations

2023-12-12T10:15:14.505977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종명인허가일자업소명소재지영업장면적소재지전화영업자시작일우편번호소재지시작일
업종명1.0000.5670.0001.0001.0000.0000.5670.0000.567
인허가일자0.5671.0001.0001.0000.9841.0001.0000.9971.000
업소명0.0001.0001.0001.0000.9951.0001.0001.0001.000
소재지1.0001.0001.0001.0001.0001.0001.0001.0001.000
영업장면적1.0000.9840.9951.0001.0001.0000.9840.5360.984
소재지전화0.0001.0001.0001.0001.0001.0001.0001.0001.000
영업자시작일0.5671.0001.0001.0000.9841.0001.0000.9971.000
우편번호0.0000.9971.0001.0000.5361.0000.9971.0000.997
소재지시작일0.5671.0001.0001.0000.9841.0001.0000.9971.000

Missing values

2023-12-12T10:15:09.098855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:15:09.274016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명인허가일자업소명소재지영업장면적소재지전화영업자시작일우편번호소재지시작일
0용기.포장지제조업1991-08-10풍원산업사경기도 여주시 가남읍 양화로 97859.58031-882-65852019-06-03126611991-08-10
1용기.포장지제조업2001-12-14태림기업경기도 여주시 흥천면 흥천로 23-13395.8031-0881-12062001-12-14126072003-07-23
2용기.포장지제조업2001-08-25삼일프라스틱경기도 여주시 점동면 처리2길 59-14280.03031-882-23632001-08-25126662001-08-25
3용기.포장지제조업2002-02-14씨엔테크(주)경기도 여주시 강천면 곱새기로 24-383,525.32031-884-26462008-07-18126162013-02-27
4용기.포장지제조업2002-02-21한솔세라믹경기도 여주시 대신면 여양로 685, 1동 1층351.96031-769-52222002-02-21126112018-03-08
5용기.포장지제조업2002-04-22모리아경기도 여주시 흥천면 남산로 3120<NA>2002-04-22126062002-04-22
6용기.포장지제조업2002-07-31삼진화학공업사경기도 여주시 능서면 월평로 562-36650031-884-71772005-04-04126472002-07-31
7용기.포장지제조업1988-12-23테트라팩여주(유)경기도 여주시 가남읍 경충대로 1037002-799-23292007-03-23126631988-12-23
8용기.포장지제조업2003-06-07코리아페이퍼컵경기도 여주시 금사면 도곡리 96-1900031-881-18952006-02-21469-8312003-06-07
9용기.포장지제조업2005-06-24주)영천씰테크경기도 여주시 가남읍 가남로 911,978.36031-884-44162011-04-11126612005-06-24
업종명인허가일자업소명소재지영업장면적소재지전화영업자시작일우편번호소재지시작일
31옹기류제조업2014-06-11천화경기도 여주시 여양로 588-2, 1층 (오금동)480<NA>2014-06-11126352014-06-11
32옹기류제조업2014-06-16태화도예경기도 여주시 북내면 가정긴골길 11276<NA>2014-06-16126152014-06-16
33옹기류제조업2014-06-24영도예경기도 여주시 북내면 도예촌길 17-16357.36<NA>2014-06-24126152014-06-24
34옹기류제조업2014-06-24여강도예경기도 여주시 어영실로 76-13 (천송동)425.75<NA>2014-06-24126362014-06-24
35옹기류제조업2014-07-01선욱요업경기도 여주시 현암3길 38, 지하1~지상1층 (현암동)210<NA>2014-07-01126372014-07-01
36옹기류제조업2014-07-14송천요업경기도 여주시 오금1길 52, 1층 (오금동)340.13<NA>2014-07-14126352014-07-14
37옹기류제조업2014-08-11옹기공방경기도 여주시 북내면 도예촌길 17-14, 1층356.4<NA>2014-08-11126152014-08-11
38옹기류제조업2014-09-04수곡요업경기도 여주시 대신면 여양로 677493.46<NA>2014-09-04126112014-09-04
39옹기류제조업2014-10-20태양상사경기도 여주시 흥천면 흥천로 308-52, 1층194.5<NA>2014-10-20126072014-10-20
40옹기류제조업2014-12-29나의손경기도 여주시 강천면 강문로 169-6, 에이동 1층238031-886-14442014-12-29126152014-12-29