Overview

Dataset statistics

Number of variables: 21
Number of observations: 1094
Missing cells: 882
Missing cells (%): 3.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 182.8 KiB
Average record size in memory: 171.1 B
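
The derived figures in this table follow from the raw counts. As a minimal sketch in plain Python (the variable names are illustrative and not part of the report), the missing-cell percentage is the missing count divided by variables times observations, and the average record size is the total memory divided by the number of observations. The per-column missing counts are copied from the Alerts section of this report.

```python
# Figures copied from the report; variable names are illustrative.
n_variables = 21
n_observations = 1094
missing_cells = 882
total_memory_kib = 182.8

# Per-column missing counts as listed under Alerts.
missing_by_column = {
    "점주명": 72, "주요취급품목": 24, "창업년도": 63, "주소": 176,
    "면적": 129, "영업시간": 137, "월평균영업일수": 132, "고용자수": 149,
}

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_memory_kib * 1024 / n_observations

print(sum(missing_by_column.values()))  # 882, matches "Missing cells"
print(f"{missing_pct:.1f}%")            # 3.8%
print(f"{avg_record_bytes:.1f} B")      # 171.1 B
```

The per-column counts adding up to 882 indicates that only these eight columns contribute to the missing-cell total.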

Variable types

Categorical: 5
Text: 6
Numeric: 3
Boolean: 7

Dataset

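Description: Shop information by market for Buk-gu, Daegu Metropolitan City (대구광역시 북구), with fields such as market name, affiliated merchants' association, shop name, main items handled, address, and business hours.
Author: 대구광역시 북구 (Buk-gu, Daegu Metropolitan City)
URL: https://www.data.go.kr/data/15095900/fileData.do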

Alerts

시장명 has constant value "칠성종합시장" [Constant]
스마트결제여부 is highly imbalanced (55.3%) [Imbalance]
제품교환여부 is highly imbalanced (50.4%) [Imbalance]
점주명 has 72 (6.6%) missing values [Missing]
주요취급품목 has 24 (2.2%) missing values [Missing]
창업년도 has 63 (5.8%) missing values [Missing]
주소 has 176 (16.1%) missing values [Missing]
면적 has 129 (11.8%) missing values [Missing]
영업시간 has 137 (12.5%) missing values [Missing]
월평균영업일수 has 132 (12.1%) missing values [Missing]
고용자수 has 149 (13.6%) missing values [Missing]
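
The two imbalance percentages can be reproduced from the True/False counts reported for each variable later in this report. The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalized by the maximum entropy for that number of classes. That formula is an assumption that happens to match both reported figures; the report itself does not state it, and the function name is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy (0 = balanced, near 1 = one dominant class)."""
    if len(counts) < 2:
        raise ValueError("need at least two classes")
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(counts))

# Counts copied from the 스마트결제여부 and 제품교환여부 sections.
smart_pay = imbalance_score([992, 102])
exchange = imbalance_score([975, 119])
print(f"{smart_pay:.1%}", f"{exchange:.1%}")  # 55.3% 50.4%
```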

Reproduction

Analysis started: 2023-12-12 11:53:32.089651
Analysis finished: 2023-12-12 11:53:33.116945
Duration: 1.03 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
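
The reported duration is simply the difference between the two timestamps. A minimal check with the standard library, assuming both timestamps share the same clock and time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2023-12-12 11:53:32.089651", FMT)
finished = datetime.strptime("2023-12-12 11:53:33.116945", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # 1.03 seconds
```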

Variables

시장명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
칠성종합시장
1094 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row칠성종합시장
2nd row칠성종합시장
3rd row칠성종합시장
4th row칠성종합시장
5th row칠성종합시장

Common Values

ValueCountFrequency (%)
칠성종합시장 1094
100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

소속상인회
Categorical

Distinct14
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
칠성시장
185 
대구능금시장
157 
칠성시장풍물거리(강변시장)
95 
칠성진·경명시장
85 
주방용품골목
70 
Other values (9)
502 

Length

Max length14
Median length8
Mean length6.2714808
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row완구골목
2nd row대구청과시장
3rd row대구청과시장
4th row대구청과시장
5th row주방용품골목

Common Values

Value | Count | Frequency (%)
칠성시장 | 185 | 16.9%
대구능금시장 | 157 | 14.4%
칠성시장풍물거리(강변시장) | 95 | 8.7%
칠성진·경명시장 | 85 | 7.8%
주방용품골목 | 70 | 6.4%
칠성전자주방시장 | 68 | 6.2%
칠성본시장 | 64 | 5.9%
<NA> | 62 | 5.7%
칠성원시장 | 61 | 5.6%
별별상상 야시장 | 59 | 5.4%
Other values (4) | 188 | 17.2%
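
The table lists `<NA>` as an ordinary value (62 rows) while the variable reports 0 missing, which suggests the placeholder was stored as a literal string. A minimal sketch, assuming that reading is right, of mapping such placeholders to a real missing marker before counting. The function name and the set of placeholder spellings are hypothetical choices.

```python
from collections import Counter

SENTINELS = {"<NA>", "NA", "na", ""}  # assumed placeholder spellings

def normalize_missing(values):
    """Replace placeholder strings with None and leave other values unchanged."""
    return [None if v is None or v.strip() in SENTINELS else v for v in values]

rows = ["칠성시장", "<NA>", "대구능금시장", "<NA>", "칠성시장"]
cleaned = normalize_missing(rows)
missing = sum(v is None for v in cleaned)
counts = Counter(v for v in cleaned if v is not None)
print(missing, counts.most_common(1))  # 2 [('칠성시장', 2)]
```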

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

상가명
Text

Distinct1044
Distinct (%)95.4%
Missing0
Missing (%)0.0%
Memory size8.7 KiB

Length

Max length16
Median length4
Mean length5.0758684
Min length2

Characters and Unicode

Total characters5553
Distinct characters461
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1009 ?
Unique (%)92.2%

Sample

1st row1000wells
2nd row1번상회
3rd row22번상회
4th row2번상회
5th row363국시마을

Value | Count | Frequency (%)
대구상회 | 6 | 0.5%
노점1 | 5 | 0.4%
노점3 | 4 | 0.4%
노점2 | 4 | 0.4%
대성상회 | 3 | 0.3%
삼성상회 | 3 | 0.3%
노점4 | 3 | 0.3%
대광상회 | 3 | 0.3%
대원상회 | 2 | 0.2%
부산해물 | 2 | 0.2%
Other values (1073) | 1100 | 96.9%

Most occurring characters

ValueCountFrequency (%)
382
 
6.9%
358
 
6.4%
151
 
2.7%
130
 
2.3%
124
 
2.2%
108
 
1.9%
108
 
1.9%
) 101
 
1.8%
( 101
 
1.8%
80
 
1.4%
Other values (451) 3910
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4982
89.7%
Decimal Number 266
 
4.8%
Close Punctuation 101
 
1.8%
Open Punctuation 101
 
1.8%
Space Separator 41
 
0.7%
Dash Punctuation 24
 
0.4%
Lowercase Letter 18
 
0.3%
Uppercase Letter 13
 
0.2%
Other Punctuation 7
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
382
 
7.7%
358
 
7.2%
151
 
3.0%
130
 
2.6%
124
 
2.5%
108
 
2.2%
108
 
2.2%
80
 
1.6%
73
 
1.5%
65
 
1.3%
Other values (411) 3403
68.3%
Lowercase Letter
ValueCountFrequency (%)
l 4
22.2%
c 2
11.1%
o 2
11.1%
e 2
11.1%
s 1
 
5.6%
r 1
 
5.6%
a 1
 
5.6%
b 1
 
5.6%
w 1
 
5.6%
n 1
 
5.6%
Other values (2) 2
11.1%
Uppercase Letter
ValueCountFrequency (%)
C 2
15.4%
V 1
7.7%
T 1
7.7%
O 1
7.7%
E 1
7.7%
M 1
7.7%
J 1
7.7%
A 1
7.7%
L 1
7.7%
B 1
7.7%
Other values (2) 2
15.4%
Decimal Number
ValueCountFrequency (%)
1 72
27.1%
2 71
26.7%
3 36
13.5%
4 17
 
6.4%
5 15
 
5.6%
6 14
 
5.3%
0 13
 
4.9%
7 10
 
3.8%
8 10
 
3.8%
9 8
 
3.0%
Other Punctuation
ValueCountFrequency (%)
, 5
71.4%
& 2
 
28.6%
Close Punctuation
ValueCountFrequency (%)
) 101
100.0%
Open Punctuation
ValueCountFrequency (%)
( 101
100.0%
Space Separator
ValueCountFrequency (%)
41
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4982
89.7%
Common 540
 
9.7%
Latin 31
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
382
 
7.7%
358
 
7.2%
151
 
3.0%
130
 
2.6%
124
 
2.5%
108
 
2.2%
108
 
2.2%
80
 
1.6%
73
 
1.5%
65
 
1.3%
Other values (411) 3403
68.3%
Latin
ValueCountFrequency (%)
l 4
 
12.9%
c 2
 
6.5%
o 2
 
6.5%
C 2
 
6.5%
e 2
 
6.5%
V 1
 
3.2%
T 1
 
3.2%
s 1
 
3.2%
r 1
 
3.2%
a 1
 
3.2%
Other values (14) 14
45.2%
Common
ValueCountFrequency (%)
) 101
18.7%
( 101
18.7%
1 72
13.3%
2 71
13.1%
41
7.6%
3 36
 
6.7%
- 24
 
4.4%
4 17
 
3.1%
5 15
 
2.8%
6 14
 
2.6%
Other values (6) 48
8.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4982
89.7%
ASCII 571
 
10.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
382
 
7.7%
358
 
7.2%
151
 
3.0%
130
 
2.6%
124
 
2.5%
108
 
2.2%
108
 
2.2%
80
 
1.6%
73
 
1.5%
65
 
1.3%
Other values (411) 3403
68.3%
ASCII
ValueCountFrequency (%)
) 101
17.7%
( 101
17.7%
1 72
12.6%
2 71
12.4%
41
7.2%
3 36
 
6.3%
- 24
 
4.2%
4 17
 
3.0%
5 15
 
2.6%
6 14
 
2.5%
Other values (30) 79
13.8%

점주명
Text

MISSING 

Distinct70
Distinct (%)6.8%
Missing72
Missing (%)6.6%
Memory size8.7 KiB

Length

Max length4
Median length3
Mean length3.0009785
Min length3

Characters and Unicode

Total characters3067
Distinct characters72
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)1.7%

Sample

1st row박**
2nd row황**
3rd row김**
4th row정**
5th row이**
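
Owner names are masked to a leading character followed by asterisks, so the only analysable signal is that leading part. A small sketch, using the five sample rows above as input, of counting masked names by their unmasked prefix. The helper name is hypothetical.

```python
from collections import Counter

def leading_part_counts(names):
    """Count masked names such as '박**' by the part before the asterisks."""
    prefixes = (n.rstrip("*") for n in names if n)
    return Counter(p for p in prefixes if p)

sample = ["박**", "황**", "김**", "정**", "이**", None]
counts = leading_part_counts(sample)
print(counts["김"], sum(counts.values()))  # 1 5
```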
ValueCountFrequency (%)
206
20.2%
159
15.6%
94
 
9.2%
49
 
4.8%
42
 
4.1%
27
 
2.6%
26
 
2.5%
21
 
2.1%
21
 
2.1%
20
 
2.0%
Other values (60) 357
34.9%

Most occurring characters

ValueCountFrequency (%)
* 2044
66.6%
206
 
6.7%
159
 
5.2%
94
 
3.1%
49
 
1.6%
42
 
1.4%
27
 
0.9%
26
 
0.8%
21
 
0.7%
21
 
0.7%
Other values (62) 378
 
12.3%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 2044
66.6%
Other Letter 1021
33.3%
Uppercase Letter 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
206
20.2%
159
15.6%
94
 
9.2%
49
 
4.8%
42
 
4.1%
27
 
2.6%
26
 
2.5%
21
 
2.1%
21
 
2.1%
20
 
2.0%
Other values (59) 356
34.9%
Uppercase Letter
ValueCountFrequency (%)
I 1
50.0%
L 1
50.0%
Other Punctuation
ValueCountFrequency (%)
* 2044
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2044
66.6%
Hangul 1021
33.3%
Latin 2
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
206
20.2%
159
15.6%
94
 
9.2%
49
 
4.8%
42
 
4.1%
27
 
2.6%
26
 
2.5%
21
 
2.1%
21
 
2.1%
20
 
2.0%
Other values (59) 356
34.9%
Latin
ValueCountFrequency (%)
I 1
50.0%
L 1
50.0%
Common
ValueCountFrequency (%)
* 2044
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2046
66.7%
Hangul 1021
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 2044
99.9%
I 1
 
< 0.1%
L 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
206
20.2%
159
15.6%
94
 
9.2%
49
 
4.8%
42
 
4.1%
27
 
2.6%
26
 
2.5%
21
 
2.1%
21
 
2.1%
20
 
2.0%
Other values (59) 356
34.9%

업종
Categorical

Distinct13
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
농산물판매
387 
음식점업
193 
기타소매업
125 
수산물판매
113 
가공식품판매
87 
Other values (8)
189 

Length

Max length8
Median length5
Mean length4.9744059
Min length2

Unique

Unique4 ?
Unique (%)0.4%

Sample

1st row의류/신발판매
2nd row농산물판매
3rd row농산물판매
4th row농산물판매
5th row음식점업

Common Values

Value | Count | Frequency (%)
농산물판매 | 387 | 35.4%
음식점업 | 193 | 17.6%
기타소매업 | 125 | 11.4%
수산물판매 | 113 | 10.3%
가공식품판매 | 87 | 8.0%
전자용품판매 | 71 | 6.5%
축산물판매 | 47 | 4.3%
의류/신발판매 | 41 | 3.7%
기타 | 26 | 2.4%
온누리상품권 | 1 | 0.1%
Other values (3) | 3 | 0.3%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

주요취급품목
Text

MISSING 

Distinct696
Distinct (%)65.0%
Missing24
Missing (%)2.2%
Memory size8.7 KiB

Length

Max length37
Median length23
Mean length5.588785
Min length1

Characters and Unicode

Total characters5980
Distinct characters380
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique565 ?
Unique (%)52.8%

Sample

1st row스텐프, 도장
2nd row과일류
3rd row과일
4th row과일류
5th row국수
ValueCountFrequency (%)
판매 59
 
4.0%
과일 53
 
3.6%
사과 27
 
1.9%
수리 21
 
1.4%
17
 
1.2%
포도 16
 
1.1%
배추 14
 
1.0%
전자용품 14
 
1.0%
야채 14
 
1.0%
수박 12
 
0.8%
Other values (699) 1211
83.1%

Most occurring characters

ValueCountFrequency (%)
, 498
 
8.3%
388
 
6.5%
152
 
2.5%
148
 
2.5%
146
 
2.4%
144
 
2.4%
127
 
2.1%
116
 
1.9%
112
 
1.9%
103
 
1.7%
Other values (370) 4046
67.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4917
82.2%
Other Punctuation 516
 
8.6%
Space Separator 388
 
6.5%
Open Punctuation 68
 
1.1%
Close Punctuation 67
 
1.1%
Dash Punctuation 13
 
0.2%
Uppercase Letter 10
 
0.2%
Decimal Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
152
 
3.1%
148
 
3.0%
146
 
3.0%
144
 
2.9%
127
 
2.6%
116
 
2.4%
112
 
2.3%
103
 
2.1%
102
 
2.1%
87
 
1.8%
Other values (360) 3680
74.8%
Other Punctuation
ValueCountFrequency (%)
, 498
96.5%
. 17
 
3.3%
/ 1
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
T 5
50.0%
V 5
50.0%
Space Separator
ValueCountFrequency (%)
388
100.0%
Open Punctuation
ValueCountFrequency (%)
( 68
100.0%
Close Punctuation
ValueCountFrequency (%)
) 67
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 13
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4917
82.2%
Common 1053
 
17.6%
Latin 10
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
152
 
3.1%
148
 
3.0%
146
 
3.0%
144
 
2.9%
127
 
2.6%
116
 
2.4%
112
 
2.3%
103
 
2.1%
102
 
2.1%
87
 
1.8%
Other values (360) 3680
74.8%
Common
ValueCountFrequency (%)
, 498
47.3%
388
36.8%
( 68
 
6.5%
) 67
 
6.4%
. 17
 
1.6%
- 13
 
1.2%
/ 1
 
0.1%
1 1
 
0.1%
Latin
ValueCountFrequency (%)
T 5
50.0%
V 5
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4917
82.2%
ASCII 1063
 
17.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 498
46.8%
388
36.5%
( 68
 
6.4%
) 67
 
6.3%
. 17
 
1.6%
- 13
 
1.2%
T 5
 
0.5%
V 5
 
0.5%
/ 1
 
0.1%
1 1
 
0.1%
Hangul
ValueCountFrequency (%)
152
 
3.1%
148
 
3.0%
146
 
3.0%
144
 
2.9%
127
 
2.6%
116
 
2.4%
112
 
2.3%
103
 
2.1%
102
 
2.1%
87
 
1.8%
Other values (360) 3680
74.8%

창업년도
Text

MISSING 

Distinct61
Distinct (%)5.9%
Missing63
Missing (%)5.8%
Memory size8.7 KiB

Length

Max length4
Median length4
Mean length3.9970902
Min length1

Characters and Unicode

Total characters4121
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)0.8%

Sample

1st row2007
2nd row2015
3rd row1979
4th row2014
5th row2016
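
창업년도 is profiled as text although the samples look like four-digit years. The character statistics in this section (4120 digits plus one space across 1031 non-missing rows, minimum length 1) are consistent with a single value that is just a space. A minimal sketch of a tolerant conversion to integers, where anything that is not exactly four digits becomes `None`. The function name is hypothetical.

```python
import re

YEAR = re.compile(r"\d{4}")

def parse_year(text):
    """Return the year as int, or None for missing or non-year text."""
    if text is None:
        return None
    match = YEAR.fullmatch(text.strip())
    return int(match.group()) if match else None

print([parse_year(v) for v in ["2007", "1979", " ", None]])
# [2007, 1979, None, None]
```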

Value | Count | Frequency (%)
1989 | 141 | 13.7%
1999 | 125 | 12.1%
2019 | 96 | 9.3%
2009 | 81 | 7.9%
1979 | 58 | 5.6%
2014 | 47 | 4.6%
2004 | 40 | 3.9%
1969 | 25 | 2.4%
2017 | 23 | 2.2%
2018 | 22 | 2.1%
Other values (50) | 372 | 36.1%

Most occurring characters

ValueCountFrequency (%)
9 1254
30.4%
1 847
20.6%
0 777
18.9%
2 531
12.9%
8 254
 
6.2%
7 140
 
3.4%
4 118
 
2.9%
6 89
 
2.2%
5 59
 
1.4%
3 51
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4120
> 99.9%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 1254
30.4%
1 847
20.6%
0 777
18.9%
2 531
12.9%
8 254
 
6.2%
7 140
 
3.4%
4 118
 
2.9%
6 89
 
2.2%
5 59
 
1.4%
3 51
 
1.2%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4121
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
9 1254
30.4%
1 847
20.6%
0 777
18.9%
2 531
12.9%
8 254
 
6.2%
7 140
 
3.4%
4 118
 
2.9%
6 89
 
2.2%
5 59
 
1.4%
3 51
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4121
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 1254
30.4%
1 847
20.6%
0 777
18.9%
2 531
12.9%
8 254
 
6.2%
7 140
 
3.4%
4 118
 
2.9%
6 89
 
2.2%
5 59
 
1.4%
3 51
 
1.2%

주소
Text

MISSING 

Distinct411
Distinct (%)44.8%
Missing176
Missing (%)16.1%
Memory size8.7 KiB

Length

Max length22
Median length19
Mean length15.563181
Min length11

Characters and Unicode

Total characters14287
Distinct characters53
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique292 ?
Unique (%)31.8%

Sample

1st row대구 북구 칠성시장로7길 39-2
2nd row대구 북구 칠성시장로 34
3rd row대구 북구 칠성남로 지하 222
4th row대구 북구 칠성남로 지하 222
5th row대구 북구 칠성시장로 16-5
ValueCountFrequency (%)
대구 918
23.7%
북구 918
23.7%
칠성남로 299
 
7.7%
지하 237
 
6.1%
222 237
 
6.1%
칠성시장로 135
 
3.5%
칠성동1가 82
 
2.1%
칠성시장로7길 54
 
1.4%
85 41
 
1.1%
칠성시장로3길 41
 
1.1%
Other values (322) 911
23.5%

Most occurring characters

ValueCountFrequency (%)
2955
20.7%
1836
12.9%
2 1006
 
7.0%
919
 
6.4%
918
 
6.4%
846
 
5.9%
845
 
5.9%
770
 
5.4%
1 594
 
4.2%
365
 
2.6%
Other values (43) 3233
22.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8262
57.8%
Space Separator 2955
 
20.7%
Decimal Number 2854
 
20.0%
Dash Punctuation 212
 
1.5%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1836
22.2%
919
11.1%
918
11.1%
846
10.2%
845
10.2%
770
9.3%
365
 
4.4%
335
 
4.1%
335
 
4.1%
263
 
3.2%
Other values (29) 830
10.0%
Decimal Number
ValueCountFrequency (%)
2 1006
35.2%
1 594
20.8%
3 252
 
8.8%
4 198
 
6.9%
7 173
 
6.1%
5 161
 
5.6%
8 132
 
4.6%
0 124
 
4.3%
9 118
 
4.1%
6 96
 
3.4%
Space Separator
ValueCountFrequency (%)
2955
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 212
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8262
57.8%
Common 6025
42.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1836
22.2%
919
11.1%
918
11.1%
846
10.2%
845
10.2%
770
9.3%
365
 
4.4%
335
 
4.1%
335
 
4.1%
263
 
3.2%
Other values (29) 830
10.0%
Common
ValueCountFrequency (%)
2955
49.0%
2 1006
 
16.7%
1 594
 
9.9%
3 252
 
4.2%
- 212
 
3.5%
4 198
 
3.3%
7 173
 
2.9%
5 161
 
2.7%
8 132
 
2.2%
0 124
 
2.1%
Other values (4) 218
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8262
57.8%
ASCII 6025
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2955
49.0%
2 1006
 
16.7%
1 594
 
9.9%
3 252
 
4.2%
- 212
 
3.5%
4 198
 
3.3%
7 173
 
2.9%
5 161
 
2.7%
8 132
 
2.2%
0 124
 
2.1%
Other values (4) 218
 
3.6%
Hangul
ValueCountFrequency (%)
1836
22.2%
919
11.1%
918
11.1%
846
10.2%
845
10.2%
770
9.3%
365
 
4.4%
335
 
4.1%
335
 
4.1%
263
 
3.2%
Other values (29) 830
10.0%

점포형태
Categorical

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
상가건물형
587 
상가형
264 
노점형
224 
상가주택복합형
 
17
<NA>
 
2

Length

Max length7
Median length5
Mean length4.1371115
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row상가건물형
2nd row상가건물형
3rd row노점형
4th row상가건물형
5th row상가건물형

Common Values

Value | Count | Frequency (%)
상가건물형 | 587 | 53.7%
상가형 | 264 | 24.1%
노점형 | 224 | 20.5%
상가주택복합형 | 17 | 1.6%
<NA> | 2 | 0.2%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

면적
Real number (ℝ)

MISSING 

Distinct66
Distinct (%)6.8%
Missing129
Missing (%)11.8%
Infinite0
Infinite (%)0.0%
Mean34.872394
Minimum0.09
Maximum511.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.7 KiB

Quantile statistics

Minimum0.09
5-th percentile3.06
Q16.6
median19.8
Q339.6
95-th percentile121.44
Maximum511.5
Range511.41
Interquartile range (IQR)33

Descriptive statistics

Standard deviation45.807072
Coefficient of variation (CV)1.3135626
Kurtosis19.908534
Mean34.872394
Median Absolute Deviation (MAD)13.2
Skewness3.5267204
Sum33651.86
Variance2098.2879
MonotonicityNot monotonic
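
Several of these statistics are derived from others in the same section, which makes them easy to sanity-check. The sketch below recomputes range, IQR, variance and coefficient of variation for 면적 from the reported minimum, maximum, quartiles, mean and standard deviation. The variable names are illustrative.

```python
# Values copied from the quantile and descriptive statistics for 면적.
minimum, maximum = 0.09, 511.5
q1, q3 = 6.6, 39.6
mean, std = 34.872394, 45.807072

value_range = maximum - minimum  # reported: 511.41
iqr = q3 - q1                    # reported: 33
variance = std ** 2              # reported: 2098.2879
cv = std / mean                  # reported: 1.3135626

print(round(value_range, 2), round(iqr, 2), round(variance, 2), round(cv, 4))
```

A coefficient of variation above 1 together with a skewness of about 3.5 points to a long right tail: most shops are small and a few are very large.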
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
3.3 | 111 | 10.1%
33.0 | 102 | 9.3%
9.9 | 87 | 8.0%
6.6 | 66 | 6.0%
16.5 | 62 | 5.7%
66.0 | 56 | 5.1%
19.8 | 49 | 4.5%
23.1 | 43 | 3.9%
26.4 | 41 | 3.7%
1.65 | 36 | 3.3%
Other values (56) | 312 | 28.5%
(Missing) | 129 | 11.8%

Value | Count | Frequency (%)
0.09 | 1 | 0.1%
0.12 | 1 | 0.1%
0.15 | 1 | 0.1%
0.66 | 1 | 0.1%
0.82 | 1 | 0.1%
0.99 | 5 | 0.5%
1.65 | 36 | 3.3%
2.97 | 1 | 0.1%
3.0 | 2 | 0.2%
3.3 | 111 | 10.1%

Value | Count | Frequency (%)
511.5 | 1 | 0.1%
330.0 | 1 | 0.1%
297.0 | 2 | 0.2%
264.0 | 4 | 0.4%
250.0 | 1 | 0.1%
231.0 | 3 | 0.3%
198.0 | 3 | 0.3%
191.4 | 2 | 0.2%
174.9 | 1 | 0.1%
171.6 | 1 | 0.1%

소유여부
Boolean

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
False
957 
True
137 
ValueCountFrequency (%)
False 957
87.5%
True 137
 
12.5%

입간판여부
Boolean

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
True
913 
False
181 
ValueCountFrequency (%)
True 913
83.5%
False 181
 
16.5%

영업시간
Text

MISSING 

Distinct590
Distinct (%)61.7%
Missing137
Missing (%)12.5%
Memory size8.7 KiB

Length

Max length26
Median length25
Mean length19.07628
Min length11

Characters and Unicode

Total characters18256
Distinct characters41
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique440 ?
Unique (%)46.0%

Sample

1st row09:00-18:00(휴일:8)
2nd row06:00-18:00(휴일:1,3)
3rd row06:00-18:00(휴일:1,3)
4th row17:00-17:00(휴일:5)
5th row06:00-19:00(휴일:X)
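
The business-hours text is not uniform: besides the compact `09:00-18:00(휴일:8)` form shown here, the Sample rows at the end of this report also contain spaced variants such as `05:00 - 18:00 (휴일 1,3)` and a tilde form, `18:00~24:00`. A minimal sketch of splitting such strings into opening time, closing time and the free-text holiday note. The pattern and function name are hypothetical and cover only these three shapes.

```python
import re

HOURS = re.compile(
    r"^\s*(\d{1,2}:\d{2})\s*[-~]\s*(\d{1,2}:\d{2})\s*(?:\((.*)\))?\s*$"
)

def parse_hours(text):
    """Return (open, close, note) or None if the text does not match."""
    match = HOURS.match(text or "")
    return match.groups() if match else None

for raw in ["09:00-18:00(휴일:8)", "05:00 - 18:00 (휴일 1,3)", "18:00~24:00"]:
    print(parse_hours(raw))
```

The note is kept as raw text because its meaning varies between rows (day numbers, `X`, or phrases such as `휴무 없음`), so interpreting it would need rules the report does not provide.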
ValueCountFrequency (%)
502
18.2%
18:00 202
 
7.3%
일요일 153
 
5.6%
05:00 140
 
5.1%
휴무 133
 
4.8%
없음 117
 
4.2%
19:00 110
 
4.0%
06:00 102
 
3.7%
17:00 83
 
3.0%
4 71
 
2.6%
Other values (381) 1142
41.5%

Most occurring characters

ValueCountFrequency (%)
0 4566
25.0%
: 2355
12.9%
1807
 
9.9%
1 1221
 
6.7%
( 930
 
5.1%
) 930
 
5.1%
847
 
4.6%
- 711
 
3.9%
614
 
3.4%
8 519
 
2.8%
Other values (31) 3756
20.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8369
45.8%
Other Punctuation 2564
 
14.0%
Other Letter 2538
 
13.9%
Space Separator 1807
 
9.9%
Open Punctuation 930
 
5.1%
Close Punctuation 930
 
5.1%
Dash Punctuation 711
 
3.9%
Math Symbol 249
 
1.4%
Uppercase Letter 158
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
847
33.4%
614
24.2%
209
 
8.2%
193
 
7.6%
171
 
6.7%
123
 
4.8%
119
 
4.7%
118
 
4.6%
100
 
3.9%
13
 
0.5%
Other values (13) 31
 
1.2%
Decimal Number
ValueCountFrequency (%)
0 4566
54.6%
1 1221
 
14.6%
8 519
 
6.2%
3 447
 
5.3%
5 342
 
4.1%
9 310
 
3.7%
7 271
 
3.2%
6 255
 
3.0%
4 246
 
2.9%
2 192
 
2.3%
Other Punctuation
ValueCountFrequency (%)
: 2355
91.8%
, 209
 
8.2%
Space Separator
ValueCountFrequency (%)
1807
100.0%
Open Punctuation
ValueCountFrequency (%)
( 930
100.0%
Close Punctuation
ValueCountFrequency (%)
) 930
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 711
100.0%
Math Symbol
ValueCountFrequency (%)
~ 249
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 158
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15560
85.2%
Hangul 2538
 
13.9%
Latin 158
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
847
33.4%
614
24.2%
209
 
8.2%
193
 
7.6%
171
 
6.7%
123
 
4.8%
119
 
4.7%
118
 
4.6%
100
 
3.9%
13
 
0.5%
Other values (13) 31
 
1.2%
Common
ValueCountFrequency (%)
0 4566
29.3%
: 2355
15.1%
1807
 
11.6%
1 1221
 
7.8%
( 930
 
6.0%
) 930
 
6.0%
- 711
 
4.6%
8 519
 
3.3%
3 447
 
2.9%
5 342
 
2.2%
Other values (7) 1732
 
11.1%
Latin
ValueCountFrequency (%)
X 158
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15718
86.1%
Hangul 2538
 
13.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 4566
29.0%
: 2355
15.0%
1807
 
11.5%
1 1221
 
7.8%
( 930
 
5.9%
) 930
 
5.9%
- 711
 
4.5%
8 519
 
3.3%
3 447
 
2.8%
5 342
 
2.2%
Other values (8) 1890
12.0%
Hangul
ValueCountFrequency (%)
847
33.4%
614
24.2%
209
 
8.2%
193
 
7.6%
171
 
6.7%
123
 
4.8%
119
 
4.7%
118
 
4.6%
100
 
3.9%
13
 
0.5%
Other values (13) 31
 
1.2%

월평균영업일수
Real number (ℝ)

MISSING 

Distinct10
Distinct (%)1.0%
Missing132
Missing (%)12.1%
Infinite0
Infinite (%)0.0%
Mean27.951143
Minimum6
Maximum30
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.7 KiB

Quantile statistics

Minimum6
5-th percentile25
Q126
median28
Q330
95-th percentile30
Maximum30
Range24
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.8965722
Coefficient of variation (CV)0.067853118
Kurtosis18.542342
Mean27.951143
Median Absolute Deviation (MAD)2
Skewness-2.1252966
Sum26889
Variance3.5969862
MonotonicityNot monotonic
Histogram with fixed size bins (bins=10)

Value | Count | Frequency (%)
28 | 310 | 28.3%
30 | 285 | 26.1%
26 | 233 | 21.3%
29 | 73 | 6.7%
25 | 45 | 4.1%
20 | 5 | 0.5%
27 | 5 | 0.5%
24 | 4 | 0.4%
22 | 1 | 0.1%
6 | 1 | 0.1%
(Missing) | 132 | 12.1%
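
Because this variable has only ten distinct values, the frequency table above is enough to recompute the reported Sum, Mean and Variance. The sketch below does that in plain Python. Dividing by n - 1 (sample variance) is an assumption that matches the reported figure, and the variable names are illustrative.

```python
# value -> count, copied from the frequency table (missing rows excluded)
freq = {28: 310, 30: 285, 26: 233, 29: 73, 25: 45,
        20: 5, 27: 5, 24: 4, 22: 1, 6: 1}

n = sum(freq.values())                       # non-missing rows
total = sum(v * c for v, c in freq.items())  # reported Sum: 26889
mean = total / n                             # reported Mean: 27.951143
variance = sum(c * (v - mean) ** 2 for v, c in freq.items()) / (n - 1)

print(n, total, round(mean, 4), round(variance, 4))
```

The count of 962 also equals 1094 observations minus the 132 missing values.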

Value | Count | Frequency (%)
6 | 1 | 0.1%
20 | 5 | 0.5%
22 | 1 | 0.1%
24 | 4 | 0.4%
25 | 45 | 4.1%
26 | 233 | 21.3%
27 | 5 | 0.5%
28 | 310 | 28.3%
29 | 73 | 6.7%
30 | 285 | 26.1%

Value | Count | Frequency (%)
30 | 285 | 26.1%
29 | 73 | 6.7%
28 | 310 | 28.3%
27 | 5 | 0.5%
26 | 233 | 21.3%
25 | 45 | 4.1%
24 | 4 | 0.4%
22 | 1 | 0.1%
20 | 5 | 0.5%
6 | 1 | 0.1%

고용자수
Real number (ℝ)

MISSING 

Distinct8
Distinct (%)0.8%
Missing149
Missing (%)13.6%
Infinite0
Infinite (%)0.0%
Mean1.4899471
Minimum0
Maximum8
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size9.7 KiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median1
Q32
95-th percentile3
Maximum8
Range8
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.76340735
Coefficient of variation (CV)0.51237212
Kurtosis13.721046
Mean1.4899471
Median Absolute Deviation (MAD)0
Skewness2.7611771
Sum1408
Variance0.58279078
MonotonicityNot monotonic
Histogram with fixed size bins (bins=8)

Value | Count | Frequency (%)
1 | 570 | 52.1%
2 | 321 | 29.3%
3 | 33 | 3.0%
4 | 10 | 0.9%
5 | 7 | 0.6%
7 | 2 | 0.2%
8 | 1 | 0.1%
0 | 1 | 0.1%
(Missing) | 149 | 13.6%

Value | Count | Frequency (%)
0 | 1 | 0.1%
1 | 570 | 52.1%
2 | 321 | 29.3%
3 | 33 | 3.0%
4 | 10 | 0.9%
5 | 7 | 0.6%
7 | 2 | 0.2%
8 | 1 | 0.1%

Value | Count | Frequency (%)
8 | 1 | 0.1%
7 | 2 | 0.2%
5 | 7 | 0.6%
4 | 10 | 0.9%
3 | 33 | 3.0%
2 | 321 | 29.3%
1 | 570 | 52.1%
0 | 1 | 0.1%

주고객연령층
Categorical

Distinct5
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
중년(40-50대)
478 
장/노년(60대 이상)
407 
<NA>
104 
청년(20-30대)
101 
청소년(10대)
 
4

Length

Max length12
Median length10
Mean length10.166362
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row청년(20-30대)
2nd row중년(40-50대)
3rd row중년(40-50대)
4th row중년(40-50대)
5th row중년(40-50대)

Common Values

Value | Count | Frequency (%)
중년(40-50대) | 478 | 43.7%
장/노년(60대 이상) | 407 | 37.2%
<NA> | 104 | 9.5%
청년(20-30대) | 101 | 9.2%
청소년(10대) | 4 | 0.4%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

신용카드여부
Boolean

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
True
584 
False
510 
ValueCountFrequency (%)
True 584
53.4%
False 510
46.6%

스마트결제여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
False
992 
True
102 
ValueCountFrequency (%)
False 992
90.7%
True 102
 
9.3%

온누리상품권여부
Boolean

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
True
659 
False
435 
ValueCountFrequency (%)
True 659
60.2%
False 435
39.8%

제품교환여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
True
975 
False
119 
ValueCountFrequency (%)
True 975
89.1%
False 119
 
10.9%

택배서비스여부
Boolean

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
True
560 
False
534 
ValueCountFrequency (%)
True 560
51.2%
False 534
48.8%

Sample

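First rows

index | 시장명 | 소속상인회 | 상가명 | 점주명 | 업종 | 주요취급품목 | 창업년도 | 주소 | 점포형태 | 면적 | 소유여부 | 입간판여부 | 영업시간 | 월평균영업일수 | 고용자수 | 주고객연령층 | 신용카드여부 | 스마트결제여부 | 온누리상품권여부 | 제품교환여부 | 택배서비스여부
0 | 칠성종합시장 | 완구골목 | 1000wells | 박** | 의류/신발판매 | 스텐프, 도장 | 2007 | 대구 북구 칠성시장로7길 39-2 | 상가건물형 | 165.0 | Y | Y | 09:00-18:00(휴일:8) | 22 | 3 | 청년(20-30대) | Y | Y | Y | Y | Y
1 | 칠성종합시장 | 대구청과시장 | 1번상회 | 황** | 농산물판매 | 과일류 | 2015 | 대구 북구 칠성시장로 34 | 상가건물형 | 39.6 | N | Y | 06:00-18:00(휴일:1,3) | 28 | 2 | 중년(40-50대) | Y | N | Y | Y | Y
2 | 칠성종합시장 | 대구청과시장 | 22번상회 | 김** | 농산물판매 | 과일 | 1979 | 대구 북구 칠성남로 지하 222 | 노점형 | 16.5 | N | Y | 06:00-18:00(휴일:1,3) | 28 | 2 | 중년(40-50대) | Y | N | Y | Y | Y
3 | 칠성종합시장 | 대구청과시장 | 2번상회 | 정** | 농산물판매 | 과일류 | 2014 | 대구 북구 칠성남로 지하 222 | 상가건물형 | 6.6 | N | Y | 17:00-17:00(휴일:5) | 25 | 1 | 중년(40-50대) | N | N | N | Y | Y
4 | 칠성종합시장 | 주방용품골목 | 363국시마을 | 이** | 음식점업 | 국수 | 2016 | 대구 북구 칠성시장로 16-5 | 상가건물형 | 33.0 | N | Y | 06:00-19:00(휴일:X) | 30 | 1 | 중년(40-50대) | Y | N | N | Y | N
5 | 칠성종합시장 | 대구청과시장 | 3번상회 | 김** | 농산물판매 | 과일류 | 1970 | 대구 북구 칠성(34) 276번지 | 상가건물형 | 9.9 | N | Y | 05:30-19:00(휴일:2) | 28 | 2 | 장/노년(60대 이상) | Y | N | Y | Y | Y
6 | 칠성종합시장 | 별별상상 야시장 | 88막창 | 이** | 음식점업 | 막창 | 2019 | <NA> | 노점형 | <NA> | N | N | <NA> | <NA> | 1 | 청년(20-30대) | Y | Y | Y | Y | N
7 | 칠성종합시장 | 칠성원시장 | BYC | 윤** | 의류/신발판매 | 속옷,모자,잡화 | 1982 | 대구 북구 칠성로 81 | 상가건물형 | 19.8 | Y | Y | 09:00-19:00(휴일:1,3) | 26 | 1 | 중년(40-50대) | Y | N | Y | Y | N
8 | 칠성종합시장 | 별별상상 야시장 | JM 스테이크 | 황** | 음식점업 | 스테이크 | 2019 | <NA> | 노점형 | <NA> | N | N | 18:00~24:00 | <NA> | <NA> | <NA> | Y | Y | Y | Y | N
9 | 칠성종합시장 | 칠성진·경명시장 | LA타운식육도매센타 | 노** | 축산물판매 | 쇠고기,돼지고기 판매 | 2001 | 대구 북구 칠성시장로 21 | 상가건물형 | 99.0 | N | Y | 05:00-19:00(휴일:X) | 30 | 4 | 장/노년(60대 이상) | Y | N | Y | Y | Y

Last rows

index | 시장명 | 소속상인회 | 상가명 | 점주명 | 업종 | 주요취급품목 | 창업년도 | 주소 | 점포형태 | 면적 | 소유여부 | 입간판여부 | 영업시간 | 월평균영업일수 | 고용자수 | 주고객연령층 | 신용카드여부 | 스마트결제여부 | 온누리상품권여부 | 제품교환여부 | 택배서비스여부
1084 | 칠성종합시장 | 완구골목 | 환일상회 | 박** | 기타소매업 | 잡화 | 1959 | 대구 북구 칠성시장로5길 30-3 | 상가형 | 39.6 | Y | Y | 05:00 - 18:00 (휴일 1,3) | 28 | 1 | 중년(40-50대) | Y | N | Y | Y | Y
1085 | 칠성종합시장 | 칠성원시장 | 황금지업사 | 김** | 기타소매업 | 지물,장판류 | 1980 | 대구 북구 칠성시장로5길 14 | 상가건물형 | 23.1 | Y | Y | 05:30 - 18:00 (1,3) | 28 | 2 | 장/노년(60대 이상) | Y | N | Y | Y | Y
1086 | 칠성종합시장 | 칠성원시장 | 황도식당 | 김** | 축산물판매 | 삶은 돼지고기 | 2012 | 대구 북구 칠성동1가 85 | 상가건물형 | 9.9 | N | Y | 06:00 - 19:00 (둘째 수요일) | 29 | 2 | 장/노년(60대 이상) | Y | N | Y | Y | Y
1087 | 칠성종합시장 | 완구골목 | 황보문구 | 황** | 기타소매업 | 문구 | 2012 | 대구 북구 칠성시장로7길 31 | 상가건물형 | 132.0 | N | Y | 09:00 - 19:00 (휴무 없음) | 30 | 4 | 중년(40-50대) | Y | Y | Y | Y | Y
1088 | 칠성종합시장 | 대구능금시장 | 효성농산 | 이** | 농산물판매 | 감귤,메론,방울토마토 | 1999 | 대구 북구 칠성남로41길 20 | 상가건물형 | 49.5 | N | Y | 05:00 - 16:00 (1) | 30 | 2 | 중년(40-50대) | Y | N | Y | Y | Y
1089 | 칠성종합시장 | 칠성시장풍물거리(강변시장) | 흥복상회 | 이** | 농산물판매 | 채소 | 1989 | 대구 북구 풍물 3-17 | 상가형 | 3.3 | N | Y | 06:00 - 19:00 (1,3) | 28 | <NA> | 중년(40-50대) | N | N | N | Y | N
1090 | 칠성종합시장 | 칠성원시장 | 흥일상회 (아래와 동일) | 정** | 기타소매업 | 식품잡화 | 1990 | 대구 북구 칠성로 81 | 상가건물형 | 11.55 | N | Y | 06:30 - 19:00 (휴무 없음) | 30 | 1 | 중년(40-50대) | Y | N | Y | Y | Y
1091 | 칠성종합시장 | 칠성원시장 | 흥일상회 | 정** | 가공식품판매 | 식품, 잡화, 잡채,식용유, 진간장 | 1989 | 대구 북구 칠성시장로5길 22 | 상가형 | 9.9 | N | Y | 06:00 - 18:30 (2) | 28 | 1 | 중년(40-50대) | Y | N | Y | Y | Y
1092 | 칠성종합시장 | 칠성원시장 | 희우상회 | 배** | 기타소매업 | 고무장갑 외 잡화 | 1999 | 대구 북구 칠성시장로5길 22 | 상가형 | 26.4 | N | Y | 05:00 - 17:30 (일요일 휴일) | 26 | 1 | 중년(40-50대) | Y | N | Y | Y | Y
1093 | 칠성종합시장 | 별별상상 야시장 | 히츠지카트 | 민** | 음식점업 | 양고기 | 2019 | <NA> | 노점형 | <NA> | N | N | <NA> | <NA> | 2 | 청년(20-30대) | Y | Y | Y | Y | N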