Overview

Dataset statistics

Number of variables: 5
Number of observations: 25
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.1 KiB
Average record size in memory: 46.3 B

Variable types

Numeric: 1
Categorical: 2
Text: 2

Dataset

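Description: Provides 2020 Excellent Franchise information (grade, brand name, company name, period) from the program run by the Small Enterprise and Market Service (소상공인시장진흥공단).
Author: 소상공인시장진흥공단 (Small Enterprise and Market Service)
URL: https://www.data.go.kr/data/15077940/fileData.do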

Alerts

연번 is highly overall correlated with 등급 [High correlation]
등급 is highly overall correlated with 연번 [High correlation]
기간 is highly imbalanced (75.8%) [Imbalance]
연번 has unique values [Unique]
브랜드명 has unique values [Unique]
업체명 has unique values [Unique]
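
The 75.8% in the imbalance alert can be reproduced from the category counts of 기간 (24 and 1). The following is a minimal sketch, assuming the score is one minus the Shannon entropy divided by its maximum for the number of classes; the function name `imbalance_score` is illustrative and does not come from the report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max for category counts (0 = balanced, near 1 = one class dominates)."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Assumption: a single-category column is reported as balanced.
        return 0.0
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    # log2(k) is the entropy of a perfectly uniform split over k classes.
    return 1.0 - entropy / math.log2(k)

print(f"{imbalance_score([24, 1]):.1%}")   # counts of 기간
print(f"{imbalance_score([14, 11]):.1%}")  # counts of 등급
```

Under this assumption the 24-to-1 split of 기간 scores about 75.8%, while the 14-to-11 split of 등급 scores close to zero, which is consistent with only 기간 being flagged.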

Reproduction

Analysis started: 2023-12-12 11:50:57.586260
Analysis finished: 2023-12-12 11:50:58.432457
Duration: 0.85 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
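
A report like this one can be generated with a few lines of ydata-profiling. The sketch below is not the exact invocation used here: the CSV path, the CP949 encoding, the report title, and the helper name `build_report` are all assumptions for illustration. The `dataset` dictionary carries the metadata shown in the Dataset section.

```python
# Dataset metadata as shown in the "Dataset" section of the report.
DATASET_INFO = {
    "description": "2020 Excellent Franchise information "
                   "(grade, brand name, company name, period)",
    "author": "소상공인시장진흥공단",
    "url": "https://www.data.go.kr/data/15077940/fileData.do",
}

def build_report(csv_path, output_path="report.html"):
    """Profile a CSV file and write the HTML report to output_path."""
    # Imported lazily so this module loads even where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the download is CP949-encoded; adjust if the file is UTF-8.
    df = pd.read_csv(csv_path, encoding="cp949")
    profile = ProfileReport(
        df,
        title="2020 Excellent Franchises",  # hypothetical title
        dataset=DATASET_INFO,
    )
    profile.to_file(output_path)
    return profile
```

Calling `build_report("franchise_2020.csv")` (a hypothetical file name) would write `report.html` next to the script.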

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 25
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13
Minimum: 1
Maximum: 25
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 357.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.2
Q1: 7
Median: 13
Q3: 19
95-th percentile: 23.8
Maximum: 25
Range: 24
Interquartile range (IQR): 12
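
The quantile values above are what linear interpolation between closest ranks gives for the integers 1 to 25. The sketch below illustrates that calculation with the standard library only. The helper `percentile` is written for this example and is an assumption about the method, not the report's own code.

```python
import math

def percentile(values, p):
    """Percentile by linear interpolation between closest ranks (p in [0, 1])."""
    ordered = sorted(values)
    pos = p * (len(ordered) - 1)            # fractional index into the sorted data
    lower = math.floor(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])

serial = list(range(1, 26))                 # 연번 runs from 1 to 25
q1 = percentile(serial, 0.25)
q3 = percentile(serial, 0.75)
print("5th:", round(percentile(serial, 0.05), 1))
print("Q1:", q1, "median:", percentile(serial, 0.5), "Q3:", q3)
print("95th:", round(percentile(serial, 0.95), 1))
print("IQR:", q3 - q1)
```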

Descriptive statistics

Standard deviation: 7.3598007
Coefficient of variation (CV): 0.56613852
Kurtosis: -1.2
Mean: 13
Median Absolute Deviation (MAD): 6
Skewness: 0
Sum: 325
Variance: 54.166667
Monotonicity: Strictly increasing
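
Because 연번 is simply the sequence 1 to 25, the descriptive statistics can be checked by hand. The sketch below recomputes them with Python's `statistics` module, assuming the sample (n - 1) variance and a MAD defined as the median of absolute deviations from the median. Those definitions reproduce the figures above, but they are inferred rather than stated in the report.

```python
import statistics

serial = list(range(1, 26))                     # 연번 values 1..25

mean = statistics.mean(serial)
variance = statistics.variance(serial)          # sample variance, n - 1 denominator
std = statistics.stdev(serial)
cv = std / mean                                 # coefficient of variation
median = statistics.median(serial)
mad = statistics.median(abs(x - median) for x in serial)
# Strictly increasing: every value is larger than the one before it.
strictly_increasing = all(a < b for a, b in zip(serial, serial[1:]))

print(f"sum={sum(serial)} mean={mean} variance={variance:.6f}")
print(f"std={std:.7f} cv={cv:.8f} mad={mad}")
print(f"strictly increasing: {strictly_increasing}")
```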
Histogram with fixed size bins (bins=25)
Common values

Value              Count  Frequency (%)
1                  1      4.0%
2                  1      4.0%
25                 1      4.0%
24                 1      4.0%
23                 1      4.0%
22                 1      4.0%
21                 1      4.0%
20                 1      4.0%
19                 1      4.0%
18                 1      4.0%
Other values (15)  15     60.0%

Minimum 10 values

Value  Count  Frequency (%)
1      1      4.0%
2      1      4.0%
3      1      4.0%
4      1      4.0%
5      1      4.0%
6      1      4.0%
7      1      4.0%
8      1      4.0%
9      1      4.0%
10     1      4.0%

Maximum 10 values

Value  Count  Frequency (%)
25     1      4.0%
24     1      4.0%
23     1      4.0%
22     1      4.0%
21     1      4.0%
20     1      4.0%
19     1      4.0%
18     1      4.0%
17     1      4.0%
16     1      4.0%

등급 (grade)
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B
14 
11 

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
14
56.0%
11
44.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
14
56.0%
11
44.0%

브랜드명 (brand name)
Text

UNIQUE 

Distinct: 25
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B

Length

Max length: 11
Median length: 10
Mean length: 5.48
Min length: 2

Characters and Unicode

Total characters: 137
Distinct characters: 103
Distinct categories: 4
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
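
As an illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for two of the brand names listed in this report. The helper name `category_counts` is made up for the example, and it covers only general categories, not scripts or blocks.

```python
import unicodedata
from collections import Counter

def category_counts(texts):
    """Count Unicode general categories over every character in the given strings."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)

samples = ["메가MGC커피", "푸라닭치킨"]   # two brand names from the report
counts = category_counts(samples)

# Lo = Other Letter (Hangul syllables), Lu = Uppercase Letter
print(counts["Lo"], counts["Lu"])
# The corporate mark used in company names is a symbol, not a letter.
print(unicodedata.category("㈜"))       # So = Other Symbol
```

Two-letter codes map to the category names used in the tables: `Lo` is Other Letter, `Lu` is Uppercase Letter, `Nd` is Decimal Number, `Zs` is Space Separator, and `So` is Other Symbol.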

Unique

Unique: 25
Unique (%): 100.0%

Sample

1st row: 메가MGC커피
2nd row: 푸라닭치킨
3rd row: 깐깐한족발
4th row: 커피베이
5th row: 유가네닭갈비
Value              Count  Frequency (%)
메가mgc커피         1      3.6%
푸라닭치킨          1      3.6%
얌샘김밥            1      3.6%
피자알볼로          1      3.6%
군산오징어          1      3.6%
79대포             1      3.6%
두찜               1      3.6%
한끼맛있다          1      3.6%
티바두마리치킨       1      3.6%
수유리우동집        1      3.6%
Other values (18)  18     64.3%

Most occurring characters

ValueCountFrequency (%)
4
 
2.9%
4
 
2.9%
4
 
2.9%
3
 
2.2%
3
 
2.2%
3
 
2.2%
3
 
2.2%
3
 
2.2%
3 2
 
1.5%
2
 
1.5%
Other values (93) 106
77.4%

Most occurring categories

Value             Count  Frequency (%)
Other Letter      123    89.8%
Decimal Number    8      5.8%
Space Separator   3      2.2%
Uppercase Letter  3      2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (83) 93
75.6%
Decimal Number
ValueCountFrequency (%)
3 2
25.0%
9 2
25.0%
7 1
12.5%
2 1
12.5%
8 1
12.5%
1 1
12.5%
Uppercase Letter
ValueCountFrequency (%)
C 1
33.3%
G 1
33.3%
M 1
33.3%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  123    89.8%
Common  11     8.0%
Latin   3      2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (83) 93
75.6%
Common
ValueCountFrequency (%)
3
27.3%
3 2
18.2%
9 2
18.2%
7 1
 
9.1%
2 1
 
9.1%
8 1
 
9.1%
1 1
 
9.1%
Latin
ValueCountFrequency (%)
C 1
33.3%
G 1
33.3%
M 1
33.3%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  123    89.8%
ASCII   14     10.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
 
3.3%
4
 
3.3%
4
 
3.3%
3
 
2.4%
3
 
2.4%
3
 
2.4%
3
 
2.4%
2
 
1.6%
2
 
1.6%
2
 
1.6%
Other values (83) 93
75.6%
ASCII
ValueCountFrequency (%)
3
21.4%
3 2
14.3%
9 2
14.3%
7 1
 
7.1%
C 1
 
7.1%
G 1
 
7.1%
M 1
 
7.1%
2 1
 
7.1%
8 1
 
7.1%
1 1
 
7.1%

업체명 (company name)
Text

UNIQUE 

Distinct: 25
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B

Length

Max length: 13
Median length: 10
Mean length: 7
Min length: 3

Characters and Unicode

Total characters: 175
Distinct characters: 82
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 25
Unique (%): 100.0%

Sample

1st row: 주식회사앤하우스
2nd row: ㈜아이더스코리아
3rd row: ㈜깐깐한패밀리
4th row: ㈜커피베이
5th row: ㈜바이올푸드글로벌
Value              Count  Frequency (%)
주식회사앤하우스     1      3.8%
㈜아이더스코리아     1      3.8%
㈜얌샘             1      3.8%
㈜알볼로에프앤씨     1      3.8%
㈜산무리           1      3.8%
더벗               1      3.8%
주식회사           1      3.8%
㈜기영에프앤비      1      3.8%
㈜쉐프마인드        1      3.8%
㈜신라외식개발      1      3.8%
Other values (16)  16     61.5%

Most occurring characters

ValueCountFrequency (%)
19
 
10.9%
9
 
5.1%
8
 
4.6%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.9%
5
 
2.9%
4
 
2.3%
4
 
2.3%
Other values (72) 101
57.7%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       150    85.7%
Other Symbol       19     10.9%
Uppercase Letter   2      1.1%
Other Punctuation  1      0.6%
Space Separator    1      0.6%
Open Punctuation   1      0.6%
Close Punctuation  1      0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
 
6.0%
8
 
5.3%
7
 
4.7%
7
 
4.7%
6
 
4.0%
5
 
3.3%
5
 
3.3%
4
 
2.7%
4
 
2.7%
4
 
2.7%
Other values (65) 91
60.7%
Uppercase Letter
ValueCountFrequency (%)
F 1
50.0%
S 1
50.0%
Other Symbol
ValueCountFrequency (%)
19
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  169    96.6%
Common  4      2.3%
Latin   2      1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
11.2%
9
 
5.3%
8
 
4.7%
7
 
4.1%
7
 
4.1%
6
 
3.6%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
Other values (66) 95
56.2%
Common
ValueCountFrequency (%)
& 1
25.0%
1
25.0%
( 1
25.0%
) 1
25.0%
Latin
ValueCountFrequency (%)
F 1
50.0%
S 1
50.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  150    85.7%
None    19     10.9%
ASCII   6      3.4%

Most frequent character per block

None
ValueCountFrequency (%)
19
100.0%
Hangul
ValueCountFrequency (%)
9
 
6.0%
8
 
5.3%
7
 
4.7%
7
 
4.7%
6
 
4.0%
5
 
3.3%
5
 
3.3%
4
 
2.7%
4
 
2.7%
4
 
2.7%
Other values (65) 91
60.7%
ASCII
ValueCountFrequency (%)
F 1
16.7%
& 1
16.7%
S 1
16.7%
1
16.7%
( 1
16.7%
) 1
16.7%

기간 (period)
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 8.0%
Missing: 0
Missing (%): 0.0%
Memory size: 332.0 B
21. 1. 5 ~ 22. 1. 4: 24
21. 2. 1 ~ 22. 1. 31: 1

Length

Max length: 20
Median length: 19
Mean length: 19.04
Min length: 19

Unique

Unique: 1
Unique (%): 4.0%

Sample

1st row: 21. 1. 5 ~ 22. 1. 4
2nd row: 21. 1. 5 ~ 22. 1. 4
3rd row: 21. 1. 5 ~ 22. 1. 4
4th row: 21. 1. 5 ~ 22. 1. 4
5th row: 21. 1. 5 ~ 22. 1. 4

Common Values

Value                 Count  Frequency (%)
21. 1. 5 ~ 22. 1. 4   24     96.0%
21. 2. 1 ~ 22. 1. 31  1      4.0%
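
The 기간 values are stored as free text, which is why the profiler treats the column as categorical. If date arithmetic is needed, the strings can be parsed into date pairs. The sketch below assumes the pattern is `yy. m. d ~ yy. m. d` with two-digit years in the 2000s; the helper name `parse_period` is hypothetical.

```python
from datetime import date

def parse_period(text):
    """Parse 'yy. m. d ~ yy. m. d' into a (start, end) pair of dates."""
    def to_date(part):
        # int() tolerates the spaces left around each token after splitting.
        yy, month, day = (int(token) for token in part.split("."))
        # Assumption: two-digit years belong to the 2000s.
        return date(2000 + yy, month, day)

    start_text, end_text = text.split("~")
    return to_date(start_text), to_date(end_text)

for raw in ("21. 1. 5 ~ 22. 1. 4", "21. 2. 1 ~ 22. 1. 31"):
    start, end = parse_period(raw)
    # Inclusive length of the period in days.
    print(start.isoformat(), end.isoformat(), (end - start).days + 1)
```

Under these assumptions both distinct values describe a one-year window, differing only in the start date.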

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
1      50     28.6%
21     25     14.3%
       25     14.3%
22     25     14.3%
5      24     13.7%
4      24     13.7%
2      1      0.6%
31     1      0.6%

Interactions


Correlations

          연번     등급     브랜드명  업체명   기간
연번       1.000  0.995  1.000    1.000  0.000
등급       0.995  1.000  1.000    1.000  0.000
브랜드명    1.000  1.000  1.000    1.000  1.000
업체명     1.000  1.000  1.000    1.000  1.000
기간       0.000  0.000  1.000    1.000  1.000

      기간     등급
기간   1.000  0.000
등급   0.000  1.000

      연번     등급     기간
연번   1.000  0.734  0.000
등급   0.734  1.000  0.000
기간   0.000  0.000  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

    연번  등급  브랜드명             업체명                       기간
0   1         메가MGC커피           주식회사앤하우스               21. 1. 5 ~ 22. 1. 4
1   2         푸라닭치킨            ㈜아이더스코리아               21. 1. 5 ~ 22. 1. 4
2   3         깐깐한족발            ㈜깐깐한패밀리                21. 1. 5 ~ 22. 1. 4
3   4         커피베이              ㈜커피베이                   21. 1. 5 ~ 22. 1. 4
4   5         유가네닭갈비           ㈜바이올푸드글로벌             21. 1. 5 ~ 22. 1. 4
5   6         크린토피아            ㈜크린토피아                  21. 1. 5 ~ 22. 1. 4
6   7         에듀플렉스            넥스큐브코퍼레이션주식회사        21. 1. 5 ~ 22. 1. 4
7   8         아소비               ㈜아소비교육                  21. 1. 5 ~ 22. 1. 4
8   9         반딧불이              ㈜이지코퍼레이션               21. 1. 5 ~ 22. 1. 4
9   10        역전할머니맥주1982     주식회사역전에프앤씨            21. 1. 5 ~ 22. 1. 4

Last rows

    연번  등급  브랜드명             업체명                       기간
15  16        33떡볶이             성백F&S                     21. 1. 5 ~ 22. 1. 4
16  17        수유리우동집           ㈜물과소금                   21. 1. 5 ~ 22. 1. 4
17  18        티바두마리치킨         ㈜신라외식개발                21. 1. 5 ~ 22. 1. 4
18  19        한끼맛있다            ㈜쉐프마인드                  21. 1. 5 ~ 22. 1. 4
19  20        두찜                 ㈜기영에프앤비                21. 1. 5 ~ 22. 1. 4
20  21        79대포               주식회사 더벗                 21. 1. 5 ~ 22. 1. 4
21  22        군산오징어            ㈜산무리                     21. 1. 5 ~ 22. 1. 4
22  23        피자알볼로            ㈜알볼로에프앤씨               21. 1. 5 ~ 22. 1. 4
23  24        얌샘김밥              ㈜얌샘                      21. 1. 5 ~ 22. 1. 4
24  25        라라코스트            ㈜라라에프앤비                21. 2. 1 ~ 22. 1. 31