Overview

Dataset statistics

Number of variables: 4
Number of observations: 84
Missing cells: 25
Missing cells (%): 7.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.8 KiB
Average record size in memory: 33.6 B
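
The missing-cell percentage can be checked by hand from the other figures in this block. The sketch below assumes the percentage is simply missing cells divided by rows times columns; the function name `missing_cell_share` is hypothetical and not part of any profiling library.

```python
def missing_cell_share(n_missing: int, n_rows: int, n_cols: int) -> float:
    """Return missing cells as a percentage of all cells in the table."""
    total_cells = n_rows * n_cols
    if total_cells == 0:
        raise ValueError("table has no cells")
    return 100.0 * n_missing / total_cells


# Figures from the statistics above: 25 missing cells, 84 rows, 4 columns.
share = missing_cell_share(25, 84, 4)
print(f"{share:.1f}%")
```

With 84 × 4 = 336 cells, 25 missing cells round to the reported 7.4%.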

Variable types

Categorical: 1
Text: 3

Dataset

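Description: Describes the status of instant-sale manufacturing and processing businesses (즉석판매제조가공업소) in Jangsu-gun, Jeonbuk Special Self-Governing Province (전북특별자치도 장수군), covering business type (업종명), business name (업소명), road-name address (소재지(도로명)), and telephone number (전화번호).
Author: Jangsu-gun, Jeonbuk Special Self-Governing Province (전북특별자치도 장수군)
URL: https://www.data.go.kr/data/3077637/fileData.do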

Alerts

업종명 has constant value "즉석판매제조가공업" (Constant)
소재지전화 has 25 (29.8%) missing values (Missing)
업소명 has unique values (Unique)

Reproduction

Analysis started: 2024-03-30 08:45:12.315141
Analysis finished: 2024-03-30 08:45:14.107271
Duration: 1.79 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
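
The reported duration follows from the two timestamps. A minimal sketch of that arithmetic, assuming the timestamps are naive local times in ISO format:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2024-03-30 08:45:12.315141")
finished = datetime.fromisoformat("2024-03-30 08:45:14.107271")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```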

Variables

업종명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 804.0 B
즉석판매제조가공업: 84

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 즉석판매제조가공업
2nd row: 즉석판매제조가공업
3rd row: 즉석판매제조가공업
4th row: 즉석판매제조가공업
5th row: 즉석판매제조가공업

Common Values

Value | Count | Frequency (%)
즉석판매제조가공업 | 84 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Plot of common values]

업소명
Text

UNIQUE 

Distinct: 84
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 804.0 B

Length

Max length: 20
Median length: 15
Mean length: 5.9166667
Min length: 2

Characters and Unicode

Total characters: 497
Distinct characters: 179
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
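
As a rough illustration of how such character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The two names are taken from this report's sample rows. The helper `category_counts` and the `CATEGORY_NAMES` mapping are hypothetical, and this is not necessarily how the profiling tool computes its figures.

```python
import unicodedata
from collections import Counter

# Hand-written mapping from two-letter category codes to the long names
# used in this report (only the codes that occur in the example).
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Po": "Other Punctuation",
    "Pe": "Close Punctuation",
}


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Two business names from the sample rows of this report.
names = ["양선떡방아간", "덕순네 쿠킹 앤(&) 베이킹 스튜디오"]
counts = category_counts(names)
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Hangul syllables fall under Other Letter, which is why that category dominates this column.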

Unique

Unique: 84
Unique (%): 100.0%

Sample

1st row: 양선떡방아간
2nd row: 노곡떡방아간
3rd row: 풍년떡방아간
4th row: 호남떡방아간
5th row: 장계떡집

Value | Count | Frequency (%)
장수 | 2 | 1.8%
덕순네 | 2 | 1.8%
영농조합법인 | 2 | 1.8%
건강원 | 2 | 1.8%
보리밥집 | 1 | 0.9%
굿팜 | 1 | 0.9%
베리 | 1 | 0.9%
요리체험장 | 1 | 0.9%
이츠레드 | 1 | 0.9%
생즙 | 1 | 0.9%
Other values (99) | 99 | 87.6%

Most occurring characters

ValueCountFrequency (%)
29
 
5.8%
23
 
4.6%
20
 
4.0%
20
 
4.0%
19
 
3.8%
19
 
3.8%
19
 
3.8%
19
 
3.8%
15
 
3.0%
13
 
2.6%
Other values (169) 301
60.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 465 | 93.6%
Space Separator | 29 | 5.8%
Open Punctuation | 1 | 0.2%
Other Punctuation | 1 | 0.2%
Close Punctuation | 1 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
4.9%
20
 
4.3%
20
 
4.3%
19
 
4.1%
19
 
4.1%
19
 
4.1%
19
 
4.1%
15
 
3.2%
13
 
2.8%
8
 
1.7%
Other values (165) 290
62.4%
Space Separator
Value | Count | Frequency (%)
(space) | 29 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%
Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 465 | 93.6%
Common | 32 | 6.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
4.9%
20
 
4.3%
20
 
4.3%
19
 
4.1%
19
 
4.1%
19
 
4.1%
19
 
4.1%
15
 
3.2%
13
 
2.8%
8
 
1.7%
Other values (165) 290
62.4%
Common
Value | Count | Frequency (%)
(space) | 29 | 90.6%
( | 1 | 3.1%
& | 1 | 3.1%
) | 1 | 3.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 465 | 93.6%
ASCII | 32 | 6.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 29 | 90.6%
( | 1 | 3.1%
& | 1 | 3.1%
) | 1 | 3.1%
Hangul
ValueCountFrequency (%)
23
 
4.9%
20
 
4.3%
20
 
4.3%
19
 
4.1%
19
 
4.1%
19
 
4.1%
19
 
4.1%
15
 
3.2%
13
 
2.8%
8
 
1.7%
Other values (165) 290
62.4%

소재지(도로명)
Text

Distinct: 80
Distinct (%): 95.2%
Missing: 0
Missing (%): 0.0%
Memory size: 804.0 B

Length

Max length: 45
Median length: 41
Mean length: 24.928571
Min length: 21

Characters and Unicode

Total characters: 2094
Distinct characters: 120
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 76
Unique (%): 90.5%

Sample

1st row: 전북특별자치도 장수군 장수읍 양선길 7
2nd row: 전북특별자치도 장수군 장수읍 개실길 121
3rd row: 전북특별자치도 장수군 산서면 비행로 29-4
4th row: 전북특별자치도 장수군 번암면 원노단길 12-2
5th row: 전북특별자치도 장수군 장계면 장무로 206

Value | Count | Frequency (%)
전북특별자치도 | 84 | 19.1%
장수군 | 84 | 19.1%
장수읍 | 32 | 7.3%
장계면 | 26 | 5.9%
한들로 | 10 | 2.3%
산서면 | 9 | 2.0%
장무로 | 8 | 1.8%
번암면 | 5 | 1.1%
보산로 | 4 | 0.9%
시장통길 | 4 | 0.9%
Other values (132) | 174 | 39.5%
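
The token counts above are consistent with splitting each address on whitespace: the province and county tokens each occur once per row, 84 times in total. The sketch below applies that assumption to the five sample rows. The helper `token_frequencies` is hypothetical, and the profiling tool's actual tokenisation may differ.

```python
from collections import Counter


def token_frequencies(values):
    """Count whitespace-separated tokens across all strings in `values`."""
    counts = Counter()
    for value in values:
        counts.update(value.split())
    return counts


# The five sample addresses shown in this report.
addresses = [
    "전북특별자치도 장수군 장수읍 양선길 7",
    "전북특별자치도 장수군 장수읍 개실길 121",
    "전북특별자치도 장수군 산서면 비행로 29-4",
    "전북특별자치도 장수군 번암면 원노단길 12-2",
    "전북특별자치도 장수군 장계면 장무로 206",
]
freq = token_frequencies(addresses)
total = sum(freq.values())
for token, n in freq.most_common(3):
    print(token, n, f"{100.0 * n / total:.1f}%")
```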

Most occurring characters

ValueCountFrequency (%)
369
17.6%
176
 
8.4%
119
 
5.7%
91
 
4.3%
85
 
4.1%
85
 
4.1%
85
 
4.1%
84
 
4.0%
84
 
4.0%
84
 
4.0%
Other values (110) 832
39.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter | 1448 | 69.1%
Space Separator | 369 | 17.6%
Decimal Number | 249 | 11.9%
Dash Punctuation | 25 | 1.2%
Uppercase Letter | 3 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
176
 
12.2%
119
 
8.2%
91
 
6.3%
85
 
5.9%
85
 
5.9%
85
 
5.9%
84
 
5.8%
84
 
5.8%
84
 
5.8%
84
 
5.8%
Other values (97) 471
32.5%
Decimal Number
Value | Count | Frequency (%)
1 | 71 | 28.5%
2 | 33 | 13.3%
4 | 21 | 8.4%
9 | 20 | 8.0%
3 | 20 | 8.0%
5 | 19 | 7.6%
8 | 18 | 7.2%
0 | 18 | 7.2%
6 | 17 | 6.8%
7 | 12 | 4.8%
Space Separator
Value | Count | Frequency (%)
(space) | 369 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 25 | 100.0%
Uppercase Letter
Value | Count | Frequency (%)
A | 3 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1448 | 69.1%
Common | 643 | 30.7%
Latin | 3 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
176
 
12.2%
119
 
8.2%
91
 
6.3%
85
 
5.9%
85
 
5.9%
85
 
5.9%
84
 
5.8%
84
 
5.8%
84
 
5.8%
84
 
5.8%
Other values (97) 471
32.5%
Common
Value | Count | Frequency (%)
(space) | 369 | 57.4%
1 | 71 | 11.0%
2 | 33 | 5.1%
- | 25 | 3.9%
4 | 21 | 3.3%
9 | 20 | 3.1%
3 | 20 | 3.1%
5 | 19 | 3.0%
8 | 18 | 2.8%
0 | 18 | 2.8%
Other values (2) | 29 | 4.5%
Latin
Value | Count | Frequency (%)
A | 3 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1448 | 69.1%
ASCII | 646 | 30.9%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 369 | 57.1%
1 | 71 | 11.0%
2 | 33 | 5.1%
- | 25 | 3.9%
4 | 21 | 3.3%
9 | 20 | 3.1%
3 | 20 | 3.1%
5 | 19 | 2.9%
8 | 18 | 2.8%
0 | 18 | 2.8%
Other values (3) | 32 | 5.0%
Hangul
ValueCountFrequency (%)
176
 
12.2%
119
 
8.2%
91
 
6.3%
85
 
5.9%
85
 
5.9%
85
 
5.9%
84
 
5.8%
84
 
5.8%
84
 
5.8%
84
 
5.8%
Other values (97) 471
32.5%

소재지전화
Text

MISSING 

Distinct: 57
Distinct (%): 96.6%
Missing: 25
Missing (%): 29.8%
Memory size: 804.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.016949
Min length: 12

Characters and Unicode

Total characters: 709
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 55
Unique (%): 93.2%
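
"Distinct" counts how many different values occur, while "Unique" counts values that occur exactly once. For this column the percentages appear to be taken over the 59 non-missing entries (84 rows minus 25 missing): 57/59 ≈ 96.6% and 55/59 ≈ 93.2%. The sketch below illustrates the distinction on a small made-up list. The helper `distinct_and_unique` is hypothetical, and although the phone numbers are borrowed from this report, their arrangement is invented for the example.

```python
from collections import Counter


def distinct_and_unique(values):
    """Summarise distinct and unique counts, ignoring missing (None) entries."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    n_present = len(present)
    distinct = len(counts)                              # different values seen
    unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
    scale = 100.0 / n_present if n_present else 0.0
    return {
        "present": n_present,
        "distinct": distinct,
        "distinct_pct": distinct * scale,
        "unique": unique,
        "unique_pct": unique * scale,
    }


# Illustrative data only: one repeated number and one missing entry.
phones = ["063-351-2359", "063-352-3335", "063-352-3335", None, "070-4833-3309"]
result = distinct_and_unique(phones)
print(result)
```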

Sample

1st row: 063-351-2359
2nd row: 063-351-1914
3rd row: 063-351-3990
4th row: 063-353-4444
5th row: 063-351-0173

Value | Count | Frequency (%)
063-352-3335 | 2 | 3.4%
063-351-0292 | 2 | 3.4%
070-4833-3309 | 1 | 1.7%
063-351-0319 | 1 | 1.7%
063-351-9530 | 1 | 1.7%
063-353-9191 | 1 | 1.7%
063-351-2359 | 1 | 1.7%
063-353-0757 | 1 | 1.7%
063-351-2006 | 1 | 1.7%
063-351-6776 | 1 | 1.7%
Other values (47) | 47 | 79.7%

Most occurring characters

Value | Count | Frequency (%)
3 | 163 | 23.0%
- | 118 | 16.6%
0 | 101 | 14.2%
5 | 75 | 10.6%
6 | 71 | 10.0%
1 | 56 | 7.9%
2 | 52 | 7.3%
9 | 21 | 3.0%
7 | 20 | 2.8%
4 | 18 | 2.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 591 | 83.4%
Dash Punctuation | 118 | 16.6%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 163 | 27.6%
0 | 101 | 17.1%
5 | 75 | 12.7%
6 | 71 | 12.0%
1 | 56 | 9.5%
2 | 52 | 8.8%
9 | 21 | 3.6%
7 | 20 | 3.4%
4 | 18 | 3.0%
8 | 14 | 2.4%
Dash Punctuation
Value | Count | Frequency (%)
- | 118 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 709 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 163 | 23.0%
- | 118 | 16.6%
0 | 101 | 14.2%
5 | 75 | 10.6%
6 | 71 | 10.0%
1 | 56 | 7.9%
2 | 52 | 7.3%
9 | 21 | 3.0%
7 | 20 | 2.8%
4 | 18 | 2.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 709 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 163 | 23.0%
- | 118 | 16.6%
0 | 101 | 14.2%
5 | 75 | 10.6%
6 | 71 | 10.0%
1 | 56 | 7.9%
2 | 52 | 7.3%
9 | 21 | 3.0%
7 | 20 | 2.8%
4 | 18 | 2.5%

Correlations

[Figure: correlation heatmap]
Variable | 업소명 | 소재지(도로명) | 소재지전화
업소명 | 1.000 | 1.000 | 1.000
소재지(도로명) | 1.000 | 1.000 | 0.996
소재지전화 | 1.000 | 0.996 | 1.000
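
The report does not state which association measure produced this matrix. One common choice for pairs of categorical or text columns is Cramér's V, and the sketch below assumes that measure purely for illustration, in its plain (not bias-corrected) form. The function `cramers_v` is a hypothetical stdlib-only helper, not a library API.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equal-length sequences of labels."""
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    row = Counter(xs)            # marginal counts of the first variable
    col = Counter(ys)            # marginal counts of the second variable
    joint = Counter(zip(xs, ys))
    k = min(len(row), len(col)) - 1
    if n == 0 or k == 0:
        return 0.0               # V is undefined here; 0.0 is a convention of this sketch
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return sqrt(chi2 / (n * k))


# Each label on the left maps to exactly one label on the right.
perfect = cramers_v(["a", "b", "c", "a", "b", "c"], ["x", "y", "z", "x", "y", "z"])
# Every combination occurs equally often, so there is no association.
independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
print(round(perfect, 3), round(independent, 3))
```

Under this measure, a column whose values are all different (such as 업소명) determines every other column exactly, which would be consistent with the 1.000 entries in its row.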

Missing values

[Figure: nullity count by column]
A simple visualization of nullity by column.
[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
0 | 즉석판매제조가공업 | 양선떡방아간 | 전북특별자치도 장수군 장수읍 양선길 7 | 063-351-2359
1 | 즉석판매제조가공업 | 노곡떡방아간 | 전북특별자치도 장수군 장수읍 개실길 121 | 063-351-1914
2 | 즉석판매제조가공업 | 풍년떡방아간 | 전북특별자치도 장수군 산서면 비행로 29-4 | 063-351-3990
3 | 즉석판매제조가공업 | 호남떡방아간 | 전북특별자치도 장수군 번암면 원노단길 12-2 | 063-353-4444
4 | 즉석판매제조가공업 | 장계떡집 | 전북특별자치도 장수군 장계면 장무로 206 | 063-351-0173
5 | 즉석판매제조가공업 | 북동떡방앗간 | 전북특별자치도 장수군 장계면 장무로 258 | 063-352-2225
6 | 즉석판매제조가공업 | 춘송떡방아간 | 전북특별자치도 장수군 천천면 송탄로 41 | 063-352-2703
7 | 즉석판매제조가공업 | 원촌떡방아간 | 전북특별자치도 장수군 계북면 장무로 1345 | 063-352-2889
8 | 즉석판매제조가공업 | 금잔디건강원 | 전북특별자치도 장수군 장계면 장계5길 3 | 063-352-0227
9 | 즉석판매제조가공업 | 천천건강원 | 전북특별자치도 장수군 천천면 송탄4길 8-16 | 063-353-1361

Last rows

Index | 업종명 | 업소명 | 소재지(도로명) | 소재지전화
74 | 즉석판매제조가공업 | 용방앗간 | 전북특별자치도 장수군 계남면 한거2길 15 | <NA>
75 | 즉석판매제조가공업 | 꽃다온 | 전북특별자치도 장수군 산서면 보산로 1864-14 | <NA>
76 | 즉석판매제조가공업 | 수민식품 | 전북특별자치도 장수군 천천면 용신길 1-42 | 063-352-7035
77 | 즉석판매제조가공업 | 덕순네 쿠킹 앤(&) 베이킹 스튜디오 | 전북특별자치도 장수군 장계면 장무로 237-2 | <NA>
78 | 즉석판매제조가공업 | 꼬모 베어 | 전북특별자치도 장수군 장수읍 시장로 8-17 | 070-4833-3309
79 | 즉석판매제조가공업 | 커피홀릭 | 전북특별자치도 장수군 장계면 한들로 88 2층 | 063-351-0652
80 | 즉석판매제조가공업 | 엄마의 부엌 | 전북특별자치도 장수군 장수읍 관두길 18 장수북동주공아파트 102호 | <NA>
81 | 즉석판매제조가공업 | 지현숙 약선발효공방 | 전북특별자치도 장수군 장계면 원금곡길 31-7 | <NA>
82 | 즉석판매제조가공업 | 신명 | 전북특별자치도 장수군 산서면 창터길 40-44 | 063-351-9414
83 | 즉석판매제조가공업 | 덕순네 반찬 | 전북특별자치도 장수군 장계면 한들로 114 장계농업협동조합 | <NA>