Overview

Dataset statistics

Number of variables6
Number of observations1995
Missing cells320
Missing cells (%)2.7%
Duplicate rows4
Duplicate rows (%)0.2%
Total size in memory95.6 KiB
Average record size in memory49.1 B

Variable types

Numeric1
Text3
Categorical1
DateTime1

Dataset

Description제주특별자치도 제주시 관내 여성농업인 행복바우처 가맹점 관련 현황 데이터를 제공합니다.
Author제주특별자치도 제주시
URLhttps://www.data.go.kr/data/15064846/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 4 (0.2%) duplicate rowsDuplicates
업종코드 is highly overall correlated with 업종명High correlation
업종명 is highly overall correlated with 업종코드High correlation
연락처 has 320 (16.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 12:06:42.963953
Analysis finished2023-12-12 12:06:43.884136
Duration0.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종코드
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1968.8541
Minimum1204
Maximum2099
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.7 KiB
2023-12-12T21:06:43.949682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1204
5-th percentile2001
Q12001
median2002
Q32003
95-th percentile2004
Maximum2099
Range895
Interquartile range (IQR)2

Descriptive statistics

Standard deviation170.31695
Coefficient of variation (CV)0.08650562
Kurtosis16.03753
Mean1968.8541
Median Absolute Deviation (MAD)1
Skewness-4.2022882
Sum3927864
Variance29007.863
MonotonicityIncreasing
2023-12-12T21:06:44.060843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2001 851
42.7%
2003 509
25.5%
2002 412
20.7%
1204 93
 
4.7%
2099 86
 
4.3%
2004 44
 
2.2%
ValueCountFrequency (%)
1204 93
 
4.7%
2001 851
42.7%
2002 412
20.7%
2003 509
25.5%
2004 44
 
2.2%
2099 86
 
4.3%
ValueCountFrequency (%)
2099 86
 
4.3%
2004 44
 
2.2%
2003 509
25.5%
2002 412
20.7%
2001 851
42.7%
1204 93
 
4.7%
Distinct1978
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size15.7 KiB
2023-12-12T21:06:44.302234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length23
Mean length6.477193
Min length1

Characters and Unicode

Total characters12922
Distinct characters761
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1961 ?
Unique (%)98.3%

Sample

1st row펠롱광학
2nd row아이데코안경티타늄안경테49000원렌즈타운노형점
3rd row탐라안경원
4th row글라스스토리&렌즈스토리 중앙로점
5th row렌즈미 제주시청점
ValueCountFrequency (%)
주식회사 42
 
1.8%
제주점 10
 
0.4%
신제주점 6
 
0.3%
투썸플레이스 6
 
0.3%
파리바게뜨 6
 
0.3%
안경 6
 
0.3%
제주노형점 5
 
0.2%
뚜레쥬르 5
 
0.2%
한국맥도날드 5
 
0.2%
연동점 4
 
0.2%
Other values (2113) 2184
95.8%
2023-12-12T21:06:44.752814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
481
 
3.7%
443
 
3.4%
385
 
3.0%
318
 
2.5%
304
 
2.4%
285
 
2.2%
260
 
2.0%
209
 
1.6%
193
 
1.5%
179
 
1.4%
Other values (751) 9865
76.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12049
93.2%
Space Separator 285
 
2.2%
Decimal Number 143
 
1.1%
Lowercase Letter 123
 
1.0%
Uppercase Letter 118
 
0.9%
Open Punctuation 88
 
0.7%
Close Punctuation 87
 
0.7%
Other Punctuation 25
 
0.2%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
481
 
4.0%
443
 
3.7%
385
 
3.2%
318
 
2.6%
304
 
2.5%
260
 
2.2%
209
 
1.7%
193
 
1.6%
179
 
1.5%
178
 
1.5%
Other values (689) 9099
75.5%
Uppercase Letter
ValueCountFrequency (%)
E 12
 
10.2%
T 10
 
8.5%
A 9
 
7.6%
L 9
 
7.6%
G 7
 
5.9%
I 7
 
5.9%
O 7
 
5.9%
N 6
 
5.1%
R 5
 
4.2%
D 5
 
4.2%
Other values (14) 41
34.7%
Lowercase Letter
ValueCountFrequency (%)
e 15
12.2%
o 13
10.6%
a 12
9.8%
i 10
 
8.1%
f 8
 
6.5%
t 8
 
6.5%
r 8
 
6.5%
c 7
 
5.7%
s 7
 
5.7%
l 6
 
4.9%
Other values (11) 29
23.6%
Decimal Number
ValueCountFrequency (%)
1 28
19.6%
2 23
16.1%
0 17
11.9%
3 17
11.9%
7 16
11.2%
9 10
 
7.0%
8 10
 
7.0%
4 9
 
6.3%
5 8
 
5.6%
6 5
 
3.5%
Other Punctuation
ValueCountFrequency (%)
. 11
44.0%
/ 10
40.0%
& 4
 
16.0%
Space Separator
ValueCountFrequency (%)
285
100.0%
Open Punctuation
ValueCountFrequency (%)
( 88
100.0%
Close Punctuation
ValueCountFrequency (%)
) 87
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12049
93.2%
Common 632
 
4.9%
Latin 241
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
481
 
4.0%
443
 
3.7%
385
 
3.2%
318
 
2.6%
304
 
2.5%
260
 
2.2%
209
 
1.7%
193
 
1.6%
179
 
1.5%
178
 
1.5%
Other values (689) 9099
75.5%
Latin
ValueCountFrequency (%)
e 15
 
6.2%
o 13
 
5.4%
a 12
 
5.0%
E 12
 
5.0%
T 10
 
4.1%
i 10
 
4.1%
A 9
 
3.7%
L 9
 
3.7%
f 8
 
3.3%
t 8
 
3.3%
Other values (35) 135
56.0%
Common
ValueCountFrequency (%)
285
45.1%
( 88
 
13.9%
) 87
 
13.8%
1 28
 
4.4%
2 23
 
3.6%
0 17
 
2.7%
3 17
 
2.7%
7 16
 
2.5%
. 11
 
1.7%
9 10
 
1.6%
Other values (7) 50
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12049
93.2%
ASCII 873
 
6.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
481
 
4.0%
443
 
3.7%
385
 
3.2%
318
 
2.6%
304
 
2.5%
260
 
2.2%
209
 
1.7%
193
 
1.6%
179
 
1.5%
178
 
1.5%
Other values (689) 9099
75.5%
ASCII
ValueCountFrequency (%)
285
32.6%
( 88
 
10.1%
) 87
 
10.0%
1 28
 
3.2%
2 23
 
2.6%
0 17
 
1.9%
3 17
 
1.9%
7 16
 
1.8%
e 15
 
1.7%
o 13
 
1.5%
Other values (52) 284
32.5%

연락처
Text

MISSING 

Distinct1613
Distinct (%)96.3%
Missing320
Missing (%)16.0%
Memory size15.7 KiB
2023-12-12T21:06:45.058211image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.155224
Min length12

Characters and Unicode

Total characters20360
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1571 ?
Unique (%)93.8%

Sample

1st row064-702-3352
2nd row064-576-7652
3rd row064-702-0749
4th row064-702-1001
5th row064-702-2221
ValueCountFrequency (%)
064-901-6366 8
 
0.5%
064-764-4207 7
 
0.4%
064-721-7978 4
 
0.2%
064-756-7674 4
 
0.2%
064-600-5252 4
 
0.2%
064-758-6280 3
 
0.2%
064-711-1907 3
 
0.2%
064-747-8360 3
 
0.2%
064-712-1970 2
 
0.1%
064-805-0006 2
 
0.1%
Other values (1603) 1635
97.6%
2023-12-12T21:06:45.555701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 3350
16.5%
0 3133
15.4%
4 2573
12.6%
7 2564
12.6%
6 2218
10.9%
2 1378
6.8%
5 1268
 
6.2%
8 1048
 
5.1%
1 1017
 
5.0%
9 942
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 17010
83.5%
Dash Punctuation 3350
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3133
18.4%
4 2573
15.1%
7 2564
15.1%
6 2218
13.0%
2 1378
8.1%
5 1268
7.5%
8 1048
 
6.2%
1 1017
 
6.0%
9 942
 
5.5%
3 869
 
5.1%
Dash Punctuation
ValueCountFrequency (%)
- 3350
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 20360
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 3350
16.5%
0 3133
15.4%
4 2573
12.6%
7 2564
12.6%
6 2218
10.9%
2 1378
6.8%
5 1268
 
6.2%
8 1048
 
5.1%
1 1017
 
5.0%
9 942
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 20360
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3350
16.5%
0 3133
15.4%
4 2573
12.6%
7 2564
12.6%
6 2218
10.9%
2 1378
6.8%
5 1268
 
6.2%
8 1048
 
5.1%
1 1017
 
5.0%
9 942
 
4.6%

주소
Text

Distinct1865
Distinct (%)93.5%
Missing0
Missing (%)0.0%
Memory size15.7 KiB
2023-12-12T21:06:46.094529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length27
Mean length21.269674
Min length17

Characters and Unicode

Total characters42433
Distinct characters198
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1757 ?
Unique (%)88.1%

Sample

1st row제주특별자치도 제주시 구산서길 29
2nd row제주특별자치도 제주시 노형14길 14
3rd row제주특별자치도 제주시 삼도일동 534-16
4th row제주특별자치도 제주시 중앙로 63
5th row제주특별자치도 제주시 신성로13길 3
ValueCountFrequency (%)
제주특별자치도 1995
23.3%
제주시 1995
23.3%
애월읍 148
 
1.7%
구좌읍 134
 
1.6%
조천읍 119
 
1.4%
한림읍 86
 
1.0%
중앙로 69
 
0.8%
한경면 45
 
0.5%
1 40
 
0.5%
2 34
 
0.4%
Other values (1534) 3902
45.5%
2023-12-12T21:06:46.888495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6972
16.4%
4056
 
9.6%
4005
 
9.4%
2176
 
5.1%
2011
 
4.7%
1995
 
4.7%
1995
 
4.7%
1995
 
4.7%
1995
 
4.7%
1 1415
 
3.3%
Other values (188) 13818
32.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28662
67.5%
Space Separator 6972
 
16.4%
Decimal Number 6313
 
14.9%
Dash Punctuation 486
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4056
14.2%
4005
14.0%
2176
 
7.6%
2011
 
7.0%
1995
 
7.0%
1995
 
7.0%
1995
 
7.0%
1995
 
7.0%
1253
 
4.4%
798
 
2.8%
Other values (176) 6383
22.3%
Decimal Number
ValueCountFrequency (%)
1 1415
22.4%
2 895
14.2%
3 706
11.2%
4 617
9.8%
5 562
 
8.9%
6 509
 
8.1%
7 414
 
6.6%
9 404
 
6.4%
0 396
 
6.3%
8 395
 
6.3%
Space Separator
ValueCountFrequency (%)
6972
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 486
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28662
67.5%
Common 13771
32.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4056
14.2%
4005
14.0%
2176
 
7.6%
2011
 
7.0%
1995
 
7.0%
1995
 
7.0%
1995
 
7.0%
1995
 
7.0%
1253
 
4.4%
798
 
2.8%
Other values (176) 6383
22.3%
Common
ValueCountFrequency (%)
6972
50.6%
1 1415
 
10.3%
2 895
 
6.5%
3 706
 
5.1%
4 617
 
4.5%
5 562
 
4.1%
6 509
 
3.7%
- 486
 
3.5%
7 414
 
3.0%
9 404
 
2.9%
Other values (2) 791
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28662
67.5%
ASCII 13771
32.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6972
50.6%
1 1415
 
10.3%
2 895
 
6.5%
3 706
 
5.1%
4 617
 
4.5%
5 562
 
4.1%
6 509
 
3.7%
- 486
 
3.5%
7 414
 
3.0%
9 404
 
2.9%
Other values (2) 791
 
5.7%
Hangul
ValueCountFrequency (%)
4056
14.2%
4005
14.0%
2176
 
7.6%
2011
 
7.0%
1995
 
7.0%
1995
 
7.0%
1995
 
7.0%
1995
 
7.0%
1253
 
4.4%
798
 
2.8%
Other values (176) 6383
22.3%

업종명
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size15.7 KiB
휴게음식점
851 
커피전문점
509 
제과점/아이스크림점
412 
안경점
93 
기타휴게음식점
86 

Length

Max length11
Median length6
Mean length6.9824561
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안경점
2nd row안경점
3rd row안경점
4th row안경점
5th row안경점

Common Values

ValueCountFrequency (%)
휴게음식점 851
42.7%
커피전문점 509
25.5%
제과점/아이스크림점 412
20.7%
안경점 93
 
4.7%
기타휴게음식점 86
 
4.3%
패스트푸드점 44
 
2.2%

Length

2023-12-12T21:06:47.123676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:06:47.301179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
휴게음식점 851
42.7%
커피전문점 509
25.5%
제과점/아이스크림점 412
20.7%
안경점 93
 
4.7%
기타휴게음식점 86
 
4.3%
패스트푸드점 44
 
2.2%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size15.7 KiB
Minimum2020-08-18 00:00:00
Maximum2020-08-18 00:00:00
2023-12-12T21:06:47.418474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:06:47.532953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T21:06:43.573228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:06:47.625882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종코드업종명
업종코드1.0001.000
업종명1.0001.000
2023-12-12T21:06:47.730284image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종코드업종명
업종코드1.0000.999
업종명0.9991.000

Missing values

2023-12-12T21:06:43.706015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:06:43.834887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종코드업체명연락처주소업종명데이터기준일자
01204펠롱광학064-702-3352제주특별자치도 제주시 구산서길 29안경점2020-08-18
11204아이데코안경티타늄안경테49000원렌즈타운노형점064-576-7652제주특별자치도 제주시 노형14길 14안경점2020-08-18
21204탐라안경원064-702-0749제주특별자치도 제주시 삼도일동 534-16안경점2020-08-18
31204글라스스토리&렌즈스토리 중앙로점064-702-1001제주특별자치도 제주시 중앙로 63안경점2020-08-18
41204렌즈미 제주시청점064-702-2221제주특별자치도 제주시 신성로13길 3안경점2020-08-18
51204올리브 안경갤러리 By 아이피아064-702-3008제주특별자치도 제주시 서광로 215안경점2020-08-18
61204글라스 스토리 제주점064-711-2476제주특별자치도 제주시 연동 노연로 118-1안경점2020-08-18
71204E-월드안경원064-711-4510제주특별자치도 제주시 외도일동 548-3안경점2020-08-18
81204렌즈유064-711-5709제주특별자치도 제주시 신광로 30안경점2020-08-18
91204늘푸른 안경064-711-9195제주특별자치도 제주시 도령로 1안경점2020-08-18
업종코드업체명연락처주소업종명데이터기준일자
19852099예향064-799-4675제주특별자치도 제주시 애월읍 고내리 502-1기타휴게음식점2020-08-18
19862099달리070-3859-5627제주특별자치도 제주시 구좌읍 월정1길 96기타휴게음식점2020-08-18
19872099무늬070-4789-2381제주특별자치도 제주시 구좌읍 월정5길 56기타휴게음식점2020-08-18
19882099스테이솔티070-5121-6771제주특별자치도 제주시 구좌읍 해맞이해안로 480-1기타휴게음식점2020-08-18
19892099젤리빈슬라임하우스070-7626-4242제주특별자치도 제주시 과원로 18-1기타휴게음식점2020-08-18
19902099하우스레서피당근케이크070-7760-9440제주특별자치도 제주시 한림읍 귀덕리 1236-9기타휴게음식점2020-08-18
19912099토끼썸070-8098-1841제주특별자치도 제주시 구좌읍 해맞이해안로 1860기타휴게음식점2020-08-18
19922099꽃길070-8261-7815제주특별자치도 제주시 서광로25길 60기타휴게음식점2020-08-18
19932099모리노코에070-8830-4537제주특별자치도 제주시 이도이동 2042-1기타휴게음식점2020-08-18
19942099모니카디저트070-8900-8020제주특별자치도 제주시 노형1길 33기타휴게음식점2020-08-18

Duplicate rows

Most frequently occurring

업종코드업체명연락처주소업종명데이터기준일자# duplicates
01204착한안경064-757-7959제주특별자치도 제주시 중앙로 283안경점2020-08-182
12001롯데리아제주삼화지구점064-753-8873제주특별자치도 제주시 건주로4길 6-4휴게음식점2020-08-182
22002대원오메기떡064-757-4244제주특별자치도 제주시 동광로5길 3제과점/아이스크림점2020-08-182
32003덕인당소락<NA>제주특별자치도 제주시 중앙로 451커피전문점2020-08-182