Overview

Dataset statistics

Number of variables6
Number of observations82
Missing cells37
Missing cells (%)7.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory50.6 B

Variable types

Categorical1
Text4
Numeric1

Dataset

Description대전광역시 동구 식품소분업 현황으로 2023.7.24일 기준 업종명, 업소명, 주소, 전화번호, 우편번호를 제공하고 있습니다.
URLhttps://www.data.go.kr/data/15067245/fileData.do

Alerts

업종명 has constant value ""Constant
소재지전화 has 37 (45.1%) missing valuesMissing

Reproduction

Analysis started2023-12-12 18:55:01.946103
Analysis finished2023-12-12 18:55:03.433109
Duration1.49 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size788.0 B
식품소분업
82 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식품소분업
2nd row식품소분업
3rd row식품소분업
4th row식품소분업
5th row식품소분업

Common Values

ValueCountFrequency (%)
식품소분업 82
100.0%

Length

2023-12-13T03:55:03.608078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T03:55:03.800056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식품소분업 82
100.0%
Distinct81
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size788.0 B
2023-12-13T03:55:04.167935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length12
Mean length5.9390244
Min length3

Characters and Unicode

Total characters487
Distinct characters183
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)97.6%

Sample

1st row우리땅콩사
2nd row대미농수산
3rd row중부식품
4th row중앙상회
5th row진흥제과
ValueCountFrequency (%)
주식회사 4
 
4.3%
중앙상회 2
 
2.2%
주)우리들마트 1
 
1.1%
화인푸드셰프애찬 1
 
1.1%
서해수산 1
 
1.1%
푸드매니저 1
 
1.1%
낙랑식품 1
 
1.1%
온통자연 1
 
1.1%
주)찬장에프에스 1
 
1.1%
주)아미셀 1
 
1.1%
Other values (79) 79
84.9%
2023-12-13T03:55:04.862617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16
 
3.3%
15
 
3.1%
14
 
2.9%
13
 
2.7%
12
 
2.5%
12
 
2.5%
11
 
2.3%
) 11
 
2.3%
11
 
2.3%
10
 
2.1%
Other values (173) 362
74.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 447
91.8%
Close Punctuation 11
 
2.3%
Space Separator 11
 
2.3%
Open Punctuation 10
 
2.1%
Uppercase Letter 6
 
1.2%
Other Punctuation 1
 
0.2%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16
 
3.6%
15
 
3.4%
14
 
3.1%
13
 
2.9%
12
 
2.7%
12
 
2.7%
11
 
2.5%
10
 
2.2%
10
 
2.2%
8
 
1.8%
Other values (163) 326
72.9%
Uppercase Letter
ValueCountFrequency (%)
K 2
33.3%
A 1
16.7%
S 1
16.7%
N 1
16.7%
H 1
16.7%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 447
91.8%
Common 34
 
7.0%
Latin 6
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16
 
3.6%
15
 
3.4%
14
 
3.1%
13
 
2.9%
12
 
2.7%
12
 
2.7%
11
 
2.5%
10
 
2.2%
10
 
2.2%
8
 
1.8%
Other values (163) 326
72.9%
Common
ValueCountFrequency (%)
) 11
32.4%
11
32.4%
( 10
29.4%
& 1
 
2.9%
3 1
 
2.9%
Latin
ValueCountFrequency (%)
K 2
33.3%
A 1
16.7%
S 1
16.7%
N 1
16.7%
H 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 447
91.8%
ASCII 40
 
8.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
16
 
3.6%
15
 
3.4%
14
 
3.1%
13
 
2.9%
12
 
2.7%
12
 
2.7%
11
 
2.5%
10
 
2.2%
10
 
2.2%
8
 
1.8%
Other values (163) 326
72.9%
ASCII
ValueCountFrequency (%)
) 11
27.5%
11
27.5%
( 10
25.0%
K 2
 
5.0%
& 1
 
2.5%
A 1
 
2.5%
S 1
 
2.5%
3 1
 
2.5%
N 1
 
2.5%
H 1
 
2.5%
Distinct79
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size788.0 B
2023-12-13T03:55:05.334783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length33
Mean length27.256098
Min length20

Characters and Unicode

Total characters2235
Distinct characters104
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)93.9%

Sample

1st row대전광역시 동구 우암로 159-23 (성남동)
2nd row대전광역시 동구 송촌남로11번길 47 (용전동)
3rd row대전광역시 동구 대전로813번길 54 (중동)
4th row대전광역시 동구 대전로813번길 54 (중동)
5th row대전광역시 동구 우암로 293 (가양동)
ValueCountFrequency (%)
대전광역시 82
 
17.5%
동구 82
 
17.5%
1층 37
 
7.9%
중동 10
 
2.1%
낭월동 8
 
1.7%
2층 7
 
1.5%
원동 7
 
1.5%
가양동 7
 
1.5%
대전로813번길 6
 
1.3%
삼성동 6
 
1.3%
Other values (153) 216
46.2%
2023-12-13T03:55:06.037465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
386
17.3%
176
 
7.9%
115
 
5.1%
1 112
 
5.0%
109
 
4.9%
85
 
3.8%
85
 
3.8%
85
 
3.8%
84
 
3.8%
( 82
 
3.7%
Other values (94) 916
41.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1225
54.8%
Decimal Number 388
 
17.4%
Space Separator 386
 
17.3%
Open Punctuation 82
 
3.7%
Close Punctuation 82
 
3.7%
Other Punctuation 51
 
2.3%
Dash Punctuation 21
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
176
14.4%
115
 
9.4%
109
 
8.9%
85
 
6.9%
85
 
6.9%
85
 
6.9%
84
 
6.9%
72
 
5.9%
47
 
3.8%
45
 
3.7%
Other values (79) 322
26.3%
Decimal Number
ValueCountFrequency (%)
1 112
28.9%
3 53
13.7%
5 40
 
10.3%
2 38
 
9.8%
4 32
 
8.2%
0 27
 
7.0%
8 23
 
5.9%
9 22
 
5.7%
6 21
 
5.4%
7 20
 
5.2%
Space Separator
ValueCountFrequency (%)
386
100.0%
Open Punctuation
ValueCountFrequency (%)
( 82
100.0%
Close Punctuation
ValueCountFrequency (%)
) 82
100.0%
Other Punctuation
ValueCountFrequency (%)
, 51
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1225
54.8%
Common 1010
45.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
176
14.4%
115
 
9.4%
109
 
8.9%
85
 
6.9%
85
 
6.9%
85
 
6.9%
84
 
6.9%
72
 
5.9%
47
 
3.8%
45
 
3.7%
Other values (79) 322
26.3%
Common
ValueCountFrequency (%)
386
38.2%
1 112
 
11.1%
( 82
 
8.1%
) 82
 
8.1%
3 53
 
5.2%
, 51
 
5.0%
5 40
 
4.0%
2 38
 
3.8%
4 32
 
3.2%
0 27
 
2.7%
Other values (5) 107
 
10.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1225
54.8%
ASCII 1010
45.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
386
38.2%
1 112
 
11.1%
( 82
 
8.1%
) 82
 
8.1%
3 53
 
5.2%
, 51
 
5.0%
5 40
 
4.0%
2 38
 
3.8%
4 32
 
3.2%
0 27
 
2.7%
Other values (5) 107
 
10.6%
Hangul
ValueCountFrequency (%)
176
14.4%
115
 
9.4%
109
 
8.9%
85
 
6.9%
85
 
6.9%
85
 
6.9%
84
 
6.9%
72
 
5.9%
47
 
3.8%
45
 
3.7%
Other values (79) 322
26.3%
Distinct79
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size788.0 B
2023-12-13T03:55:06.531559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length26
Mean length19.597561
Min length16

Characters and Unicode

Total characters1607
Distinct characters75
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)92.7%

Sample

1st row대전광역시 동구 성남동 158-3
2nd row대전광역시 동구 용전동 21-2
3rd row대전광역시 동구 중동 30-3
4th row대전광역시 동구 중동 30-3, 4, 5
5th row대전광역시 동구 가양동 392-4
ValueCountFrequency (%)
대전광역시 82
23.1%
동구 82
23.1%
중동 10
 
2.8%
낭월동 8
 
2.3%
원동 7
 
2.0%
가양동 7
 
2.0%
삼성동 6
 
1.7%
용운동 5
 
1.4%
30-3 5
 
1.4%
용전동 5
 
1.4%
Other values (105) 138
38.9%
2023-12-13T03:55:07.240112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
355
22.1%
164
 
10.2%
91
 
5.7%
88
 
5.5%
83
 
5.2%
82
 
5.1%
82
 
5.1%
82
 
5.1%
- 64
 
4.0%
1 63
 
3.9%
Other values (65) 453
28.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 851
53.0%
Space Separator 355
22.1%
Decimal Number 323
 
20.1%
Dash Punctuation 64
 
4.0%
Other Punctuation 10
 
0.6%
Close Punctuation 2
 
0.1%
Open Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
164
19.3%
91
10.7%
88
10.3%
83
9.8%
82
9.6%
82
9.6%
82
9.6%
15
 
1.8%
10
 
1.2%
10
 
1.2%
Other values (50) 144
16.9%
Decimal Number
ValueCountFrequency (%)
1 63
19.5%
3 53
16.4%
2 48
14.9%
5 28
8.7%
4 28
8.7%
0 28
8.7%
8 27
8.4%
6 18
 
5.6%
9 17
 
5.3%
7 13
 
4.0%
Space Separator
ValueCountFrequency (%)
355
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 851
53.0%
Common 756
47.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
164
19.3%
91
10.7%
88
10.3%
83
9.8%
82
9.6%
82
9.6%
82
9.6%
15
 
1.8%
10
 
1.2%
10
 
1.2%
Other values (50) 144
16.9%
Common
ValueCountFrequency (%)
355
47.0%
- 64
 
8.5%
1 63
 
8.3%
3 53
 
7.0%
2 48
 
6.3%
5 28
 
3.7%
4 28
 
3.7%
0 28
 
3.7%
8 27
 
3.6%
6 18
 
2.4%
Other values (5) 44
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 851
53.0%
ASCII 756
47.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
355
47.0%
- 64
 
8.5%
1 63
 
8.3%
3 53
 
7.0%
2 48
 
6.3%
5 28
 
3.7%
4 28
 
3.7%
0 28
 
3.7%
8 27
 
3.6%
6 18
 
2.4%
Other values (5) 44
 
5.8%
Hangul
ValueCountFrequency (%)
164
19.3%
91
10.7%
88
10.3%
83
9.8%
82
9.6%
82
9.6%
82
9.6%
15
 
1.8%
10
 
1.2%
10
 
1.2%
Other values (50) 144
16.9%

소재지전화
Text

MISSING 

Distinct45
Distinct (%)100.0%
Missing37
Missing (%)45.1%
Memory size788.0 B
2023-12-13T03:55:07.619129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.022222
Min length12

Characters and Unicode

Total characters541
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row042-672-7097
2nd row042-626-0431
3rd row042-257-8250
4th row042-222-5385
5th row042-621-4913
ValueCountFrequency (%)
042-271-7044 1
 
2.2%
042-633-1501 1
 
2.2%
042-257-7290 1
 
2.2%
042-256-5385 1
 
2.2%
042-284-9113 1
 
2.2%
042-285-0084 1
 
2.2%
042-484-2170 1
 
2.2%
042-853-1144 1
 
2.2%
042-252-4944 1
 
2.2%
042-273-7766 1
 
2.2%
Other values (35) 35
77.8%
2023-12-13T03:55:08.235766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 107
19.8%
- 90
16.6%
4 71
13.1%
0 67
12.4%
5 35
 
6.5%
3 34
 
6.3%
1 32
 
5.9%
8 32
 
5.9%
6 32
 
5.9%
7 25
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 451
83.4%
Dash Punctuation 90
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 107
23.7%
4 71
15.7%
0 67
14.9%
5 35
 
7.8%
3 34
 
7.5%
1 32
 
7.1%
8 32
 
7.1%
6 32
 
7.1%
7 25
 
5.5%
9 16
 
3.5%
Dash Punctuation
ValueCountFrequency (%)
- 90
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 541
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 107
19.8%
- 90
16.6%
4 71
13.1%
0 67
12.4%
5 35
 
6.5%
3 34
 
6.3%
1 32
 
5.9%
8 32
 
5.9%
6 32
 
5.9%
7 25
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 541
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 107
19.8%
- 90
16.6%
4 71
13.1%
0 67
12.4%
5 35
 
6.5%
3 34
 
6.3%
1 32
 
5.9%
8 32
 
5.9%
6 32
 
5.9%
7 25
 
4.6%

우편번호
Real number (ℝ)

Distinct51
Distinct (%)62.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34619.232
Minimum34513
Maximum34712
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size870.0 B
2023-12-13T03:55:08.487902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum34513
5-th percentile34521.5
Q134568
median34627
Q334666
95-th percentile34706
Maximum34712
Range199
Interquartile range (IQR)98

Descriptive statistics

Standard deviation59.064998
Coefficient of variation (CV)0.0017061326
Kurtosis-1.0363111
Mean34619.232
Median Absolute Deviation (MAD)51
Skewness-0.021912863
Sum2838777
Variance3488.674
MonotonicityNot monotonic
2023-12-13T03:55:08.731815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
34627 9
 
11.0%
34628 6
 
7.3%
34629 4
 
4.9%
34568 4
 
4.9%
34706 3
 
3.7%
34591 2
 
2.4%
34610 2
 
2.4%
34521 2
 
2.4%
34558 2
 
2.4%
34540 2
 
2.4%
Other values (41) 46
56.1%
ValueCountFrequency (%)
34513 1
1.2%
34516 1
1.2%
34517 1
1.2%
34521 2
2.4%
34531 1
1.2%
34532 1
1.2%
34535 1
1.2%
34540 2
2.4%
34543 1
1.2%
34544 1
1.2%
ValueCountFrequency (%)
34712 1
 
1.2%
34711 2
2.4%
34706 3
3.7%
34705 2
2.4%
34704 1
 
1.2%
34703 2
2.4%
34701 1
 
1.2%
34700 1
 
1.2%
34699 2
2.4%
34692 1
 
1.2%

Interactions

2023-12-13T03:55:02.911663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T03:55:08.913565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업소명소재지(도로명)소재지(지번)소재지전화우편번호
업소명1.0000.9980.9961.0001.000
소재지(도로명)0.9981.0001.0001.0001.000
소재지(지번)0.9961.0001.0001.0001.000
소재지전화1.0001.0001.0001.0001.000
우편번호1.0001.0001.0001.0001.000

Missing values

2023-12-13T03:55:03.139825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T03:55:03.342816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종명업소명소재지(도로명)소재지(지번)소재지전화우편번호
0식품소분업우리땅콩사대전광역시 동구 우암로 159-23 (성남동)대전광역시 동구 성남동 158-3042-672-709734591
1식품소분업대미농수산대전광역시 동구 송촌남로11번길 47 (용전동)대전광역시 동구 용전동 21-2042-626-043134540
2식품소분업중부식품대전광역시 동구 대전로813번길 54 (중동)대전광역시 동구 중동 30-3042-257-825034627
3식품소분업중앙상회대전광역시 동구 대전로813번길 54 (중동)대전광역시 동구 중동 30-3, 4, 5042-222-538534627
4식품소분업진흥제과대전광역시 동구 우암로 293 (가양동)대전광역시 동구 가양동 392-4042-621-491334535
5식품소분업후생사3매장대전광역시 동구 비래서로42번길 105 (가양동)대전광역시 동구 가양동 38-6 외3필지042-623-252534532
6식품소분업청원유통대전광역시 동구 대전로825번길 53 (중동)대전광역시 동구 중동 30-1 외2필지042-257-135234627
7식품소분업롯데프레시용운점대전광역시 동구 동부로85번길 19-8 (용운동)대전광역시 동구 용운동 322, 323-1042-282-210134663
8식품소분업광천젓갈대전광역시 동구 계족로382번길 27 (성남동)대전광역시 동구 성남동 501-34042-672-466434579
9식품소분업한성상회대전광역시 동구 대전로813번길 54 (중동)대전광역시 동구 중동 30-3042-254-428934627
업종명업소명소재지(도로명)소재지(지번)소재지전화우편번호
72식품소분업(주)오웬푸드셰프애찬대전광역시 동구 계족로 205 (신안동)대전광역시 동구 신안동 215-1042-621-950034604
73식품소분업케이앤케이(K&K)푸드대전광역시 동구 계족로475번길 34, 지하1층 (용전동)대전광역시 동구 용전동 158-15<NA>34544
74식품소분업주식회사탑마트홀딩스대전광역시 동구 동부로 167, 지하1층 (용운동)대전광역시 동구 용운동 274-2<NA>34521
75식품소분업맵지만대전광역시 동구 대전로906번길 47-23, 1층 (삼성동)대전광역시 동구 삼성동 297-23<NA>34568
76식품소분업카페브라운대전광역시 동구 대전로 436-12, 에스코아 1층 103호 (가오동)대전광역시 동구 가오동 622 에스코아<NA>34692
77식품소분업유니콘 바이오대전광역시 동구 동대전로283번길 23, 1층 (가양동)대전광역시 동구 가양동 413-6<NA>34586
78식품소분업팝스이엔티대전광역시 동구 성동로37번길 35, 2층 (자양동)대전광역시 동구 자양동 91-23<NA>34517
79식품소분업유정식품대전광역시 동구 가양로68번길 7, 1층 (가양동)대전광역시 동구 가양동 293-6<NA>34594
80식품소분업탑마트대전광역시 동구 동대전로46번길 11, 1층 (대동)대전광역시 동구 대동 543<NA>34637
81식품소분업송가네건어물대전광역시 동구 대전로779번길 45-1, 1층 (원동)대전광역시 동구 원동 62-5<NA>34628