Overview

Dataset statistics

Number of variables12
Number of observations237
Missing cells237
Missing cells (%)8.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.0 KiB
Average record size in memory99.6 B

Variable types

Numeric2
DateTime1
Categorical3
Text5
Unsupported1

Dataset

Description대구 지역 착한가격업소 현황(2014년 9월말 기준)
Author대구광역시
URLhttp://data.daegu.go.kr/open/data/dataView.do?dataSetId=15055835&dataSetDetailId=150558351d999caa612ec_201705310942&provdMethod=FILE

Alerts

연번 is highly overall correlated with 구군 and 1 other fieldsHigh correlation
업종 is highly overall correlated with 비고High correlation
구군 is highly overall correlated with 연번High correlation
비고 is highly overall correlated with 연번 and 1 other fieldsHigh correlation
비고 is highly imbalanced (85.4%)Imbalance
Unnamed: 11 has 237 (100.0%) missing valuesMissing
연번 has unique valuesUnique
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 18:28:48.579361
Analysis finished2023-12-10 18:28:51.335202
Duration2.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct237
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean119
Minimum1
Maximum237
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T03:28:51.446655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile12.8
Q160
median119
Q3178
95-th percentile225.2
Maximum237
Range236
Interquartile range (IQR)118

Descriptive statistics

Standard deviation68.560193
Coefficient of variation (CV)0.57613607
Kurtosis-1.2
Mean119
Median Absolute Deviation (MAD)59
Skewness0
Sum28203
Variance4700.5
MonotonicityStrictly increasing
2023-12-11T03:28:51.649390image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
164 1
 
0.4%
152 1
 
0.4%
153 1
 
0.4%
154 1
 
0.4%
155 1
 
0.4%
156 1
 
0.4%
157 1
 
0.4%
158 1
 
0.4%
159 1
 
0.4%
Other values (227) 227
95.8%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
237 1
0.4%
236 1
0.4%
235 1
0.4%
234 1
0.4%
233 1
0.4%
232 1
0.4%
231 1
0.4%
230 1
0.4%
229 1
0.4%
228 1
0.4%
Distinct6
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Minimum2011-11-01 00:00:00
Maximum2014-09-01 00:00:00
2023-12-11T03:28:51.773039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T03:28:51.935752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)

업종
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
한식_육류
56 
한식_일반
51 
중식
46 
한식_면류
34 
한식_찌개류
22 
Other values (5)
28 

Length

Max length6
Median length5
Mean length4.4135021
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타양식
2nd row기타양식
3rd row중식
4th row중식
5th row중식

Common Values

ValueCountFrequency (%)
한식_육류 56
23.6%
한식_일반 51
21.5%
중식 46
19.4%
한식_면류 34
14.3%
한식_찌개류 22
 
9.3%
한식_해산물 8
 
3.4%
한식_분식 7
 
3.0%
일식 5
 
2.1%
기타양식 4
 
1.7%
양식 4
 
1.7%

Length

2023-12-11T03:28:52.125719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T03:28:52.316243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한식_육류 56
23.6%
한식_일반 51
21.5%
중식 46
19.4%
한식_면류 34
14.3%
한식_찌개류 22
 
9.3%
한식_해산물 8
 
3.4%
한식_분식 7
 
3.0%
일식 5
 
2.1%
기타양식 4
 
1.7%
양식 4
 
1.7%
Distinct236
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T03:28:52.723389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length5.1476793
Min length2

Characters and Unicode

Total characters1220
Distinct characters273
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique235 ?
Unique (%)99.2%

Sample

1st row부림해물손수제비
2nd row명가
3rd row덕화반점
4th row만리향반점
5th row영빈반점
ValueCountFrequency (%)
시골밥상 2
 
0.8%
숯불촌 2
 
0.8%
미도식당 1
 
0.4%
자금성 1
 
0.4%
부림해물손수제비 1
 
0.4%
차돌이김밥 1
 
0.4%
옥경짬뽕 1
 
0.4%
동피랑충무김밥 1
 
0.4%
낙원식당 1
 
0.4%
부자돼지촌 1
 
0.4%
Other values (237) 237
95.2%
2023-12-11T03:28:53.387946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
47
 
3.9%
41
 
3.4%
36
 
3.0%
31
 
2.5%
27
 
2.2%
27
 
2.2%
25
 
2.0%
25
 
2.0%
24
 
2.0%
23
 
1.9%
Other values (263) 914
74.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1194
97.9%
Space Separator 12
 
1.0%
Decimal Number 8
 
0.7%
Close Punctuation 3
 
0.2%
Open Punctuation 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
47
 
3.9%
41
 
3.4%
36
 
3.0%
31
 
2.6%
27
 
2.3%
27
 
2.3%
25
 
2.1%
25
 
2.1%
24
 
2.0%
23
 
1.9%
Other values (255) 888
74.4%
Decimal Number
ValueCountFrequency (%)
3 3
37.5%
6 2
25.0%
5 1
 
12.5%
2 1
 
12.5%
4 1
 
12.5%
Space Separator
ValueCountFrequency (%)
12
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1194
97.9%
Common 26
 
2.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
47
 
3.9%
41
 
3.4%
36
 
3.0%
31
 
2.6%
27
 
2.3%
27
 
2.3%
25
 
2.1%
25
 
2.1%
24
 
2.0%
23
 
1.9%
Other values (255) 888
74.4%
Common
ValueCountFrequency (%)
12
46.2%
3 3
 
11.5%
) 3
 
11.5%
( 3
 
11.5%
6 2
 
7.7%
5 1
 
3.8%
2 1
 
3.8%
4 1
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1194
97.9%
ASCII 26
 
2.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
47
 
3.9%
41
 
3.4%
36
 
3.0%
31
 
2.6%
27
 
2.3%
27
 
2.3%
25
 
2.1%
25
 
2.1%
24
 
2.0%
23
 
1.9%
Other values (255) 888
74.4%
ASCII
ValueCountFrequency (%)
12
46.2%
3 3
 
11.5%
) 3
 
11.5%
( 3
 
11.5%
6 2
 
7.7%
5 1
 
3.8%
2 1
 
3.8%
4 1
 
3.8%
Distinct233
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T03:28:53.889129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length3
Mean length3.0168776
Min length3

Characters and Unicode

Total characters715
Distinct characters136
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique229 ?
Unique (%)96.6%

Sample

1st row석윤기
2nd row허종규
3rd row정명숙
4th row전은숙
5th row김두식
ValueCountFrequency (%)
김영숙 2
 
0.8%
이옥순 2
 
0.8%
김경숙 2
 
0.8%
이정순 2
 
0.8%
신현주 1
 
0.4%
김재순 1
 
0.4%
박순자 1
 
0.4%
석윤기 1
 
0.4%
김기태 1
 
0.4%
허영옥 1
 
0.4%
Other values (224) 224
94.1%
2023-12-11T03:28:54.621996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
56
 
7.8%
41
 
5.7%
34
 
4.8%
31
 
4.3%
30
 
4.2%
24
 
3.4%
24
 
3.4%
23
 
3.2%
20
 
2.8%
17
 
2.4%
Other values (126) 415
58.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 713
99.7%
Other Punctuation 1
 
0.1%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56
 
7.9%
41
 
5.8%
34
 
4.8%
31
 
4.3%
30
 
4.2%
24
 
3.4%
24
 
3.4%
23
 
3.2%
20
 
2.8%
17
 
2.4%
Other values (124) 413
57.9%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 713
99.7%
Common 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56
 
7.9%
41
 
5.8%
34
 
4.8%
31
 
4.3%
30
 
4.2%
24
 
3.4%
24
 
3.4%
23
 
3.2%
20
 
2.8%
17
 
2.4%
Other values (124) 413
57.9%
Common
ValueCountFrequency (%)
, 1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 713
99.7%
ASCII 2
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
56
 
7.9%
41
 
5.8%
34
 
4.8%
31
 
4.3%
30
 
4.2%
24
 
3.4%
24
 
3.4%
23
 
3.2%
20
 
2.8%
17
 
2.4%
Other values (124) 413
57.9%
ASCII
ValueCountFrequency (%)
, 1
50.0%
1
50.0%

구군
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
달서구
57 
동구
39 
중구
37 
남구
31 
수성구
28 
Other values (3)
45 

Length

Max length3
Median length2
Mean length2.4303797
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row중구
2nd row중구
3rd row중구
4th row중구
5th row중구

Common Values

ValueCountFrequency (%)
달서구 57
24.1%
동구 39
16.5%
중구 37
15.6%
남구 31
13.1%
수성구 28
11.8%
달성군 17
 
7.2%
서구 15
 
6.3%
북구 13
 
5.5%

Length

2023-12-11T03:28:54.832914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T03:28:55.002263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
달서구 57
24.1%
동구 39
16.5%
중구 37
15.6%
남구 31
13.1%
수성구 28
11.8%
달성군 17
 
7.2%
서구 15
 
6.3%
북구 13
 
5.5%

주소
Text

Distinct235
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T03:28:55.385758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length23
Mean length13.966245
Min length4

Characters and Unicode

Total characters3310
Distinct characters148
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique233 ?
Unique (%)98.3%

Sample

1st row?중구?국채보상로 139길8
2nd row?중구?달구벌대로 지하 2100 S106호
3rd row?중구?중앙대로 67길 31
4th row?중구?중앙대로 65길14
5th row?중구?경상감영길 117-7
ValueCountFrequency (%)
중구?국채보상로 8
 
1.6%
중구?달구벌대로 6
 
1.2%
동구?동촌로 5
 
1.0%
달서구?와룡로 5
 
1.0%
동구?아양로 5
 
1.0%
달성군?화원읍 5
 
1.0%
달서구?야외음악당로 5
 
1.0%
달서구?선원로 5
 
1.0%
중구?중앙대로 5
 
1.0%
중구?공평로 4
 
0.8%
Other values (351) 448
89.4%
2023-12-11T03:28:55.968183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
? 458
 
13.8%
265
 
8.0%
239
 
7.2%
223
 
6.7%
1 177
 
5.3%
2 127
 
3.8%
118
 
3.6%
93
 
2.8%
3 92
 
2.8%
4 88
 
2.7%
Other values (138) 1430
43.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1648
49.8%
Decimal Number 854
25.8%
Other Punctuation 462
 
14.0%
Space Separator 265
 
8.0%
Dash Punctuation 62
 
1.9%
Close Punctuation 8
 
0.2%
Open Punctuation 8
 
0.2%
Uppercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
239
 
14.5%
223
 
13.5%
118
 
7.2%
93
 
5.6%
81
 
4.9%
74
 
4.5%
60
 
3.6%
48
 
2.9%
38
 
2.3%
37
 
2.2%
Other values (119) 637
38.7%
Decimal Number
ValueCountFrequency (%)
1 177
20.7%
2 127
14.9%
3 92
10.8%
4 88
10.3%
5 75
8.8%
7 69
 
8.1%
0 66
 
7.7%
6 60
 
7.0%
9 53
 
6.2%
8 47
 
5.5%
Uppercase Letter
ValueCountFrequency (%)
B 1
33.3%
C 1
33.3%
S 1
33.3%
Other Punctuation
ValueCountFrequency (%)
? 458
99.1%
, 4
 
0.9%
Space Separator
ValueCountFrequency (%)
265
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 62
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1659
50.1%
Hangul 1648
49.8%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
239
 
14.5%
223
 
13.5%
118
 
7.2%
93
 
5.6%
81
 
4.9%
74
 
4.5%
60
 
3.6%
48
 
2.9%
38
 
2.3%
37
 
2.2%
Other values (119) 637
38.7%
Common
ValueCountFrequency (%)
? 458
27.6%
265
16.0%
1 177
 
10.7%
2 127
 
7.7%
3 92
 
5.5%
4 88
 
5.3%
5 75
 
4.5%
7 69
 
4.2%
0 66
 
4.0%
- 62
 
3.7%
Other values (6) 180
 
10.8%
Latin
ValueCountFrequency (%)
B 1
33.3%
C 1
33.3%
S 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1662
50.2%
Hangul 1648
49.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
? 458
27.6%
265
15.9%
1 177
 
10.6%
2 127
 
7.6%
3 92
 
5.5%
4 88
 
5.3%
5 75
 
4.5%
7 69
 
4.2%
0 66
 
4.0%
- 62
 
3.7%
Other values (9) 183
 
11.0%
Hangul
ValueCountFrequency (%)
239
 
14.5%
223
 
13.5%
118
 
7.2%
93
 
5.6%
81
 
4.9%
74
 
4.5%
60
 
3.6%
48
 
2.9%
38
 
2.3%
37
 
2.2%
Other values (119) 637
38.7%
Distinct234
Distinct (%)98.7%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T03:28:56.304824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.025316
Min length12

Characters and Unicode

Total characters2850
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique232 ?
Unique (%)97.9%

Sample

1st row053-292-3482
2nd row053-252-9743
3rd row053-254-0958
4th row053-255-9380
5th row053-255-7200
ValueCountFrequency (%)
010-xxxx-xxxx 3
 
1.3%
053-423-7008 2
 
0.8%
053-351-1140 1
 
0.4%
053-641-7388 1
 
0.4%
053-292-3482 1
 
0.4%
053-654-6474 1
 
0.4%
053-742-9285 1
 
0.4%
053-744-1479 1
 
0.4%
053-741-3447 1
 
0.4%
053-755-6508 1
 
0.4%
Other values (224) 224
94.5%
2023-12-11T03:28:56.790548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 474
16.6%
5 468
16.4%
3 387
13.6%
0 357
12.5%
6 205
7.2%
2 195
6.8%
4 171
 
6.0%
8 151
 
5.3%
7 142
 
5.0%
9 135
 
4.7%
Other values (2) 165
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2345
82.3%
Dash Punctuation 474
 
16.6%
Lowercase Letter 31
 
1.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 468
20.0%
3 387
16.5%
0 357
15.2%
6 205
8.7%
2 195
8.3%
4 171
 
7.3%
8 151
 
6.4%
7 142
 
6.1%
9 135
 
5.8%
1 134
 
5.7%
Dash Punctuation
ValueCountFrequency (%)
- 474
100.0%
Lowercase Letter
ValueCountFrequency (%)
x 31
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2819
98.9%
Latin 31
 
1.1%

Most frequent character per script

Common
ValueCountFrequency (%)
- 474
16.8%
5 468
16.6%
3 387
13.7%
0 357
12.7%
6 205
7.3%
2 195
6.9%
4 171
 
6.1%
8 151
 
5.4%
7 142
 
5.0%
9 135
 
4.8%
Latin
ValueCountFrequency (%)
x 31
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2850
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 474
16.6%
5 468
16.4%
3 387
13.6%
0 357
12.5%
6 205
7.2%
2 195
6.8%
4 171
 
6.0%
8 151
 
5.3%
7 142
 
5.0%
9 135
 
4.7%
Other values (2) 165
 
5.8%
Distinct101
Distinct (%)42.6%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-11T03:28:57.148986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length15
Mean length5.1350211
Min length2

Characters and Unicode

Total characters1217
Distinct characters148
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)31.6%

Sample

1st row해물칼국수
2nd row칼국수
3rd row짜장면
4th row짜장면
5th row짜장면
ValueCountFrequency (%)
칼국수 37
 
15.0%
자장면 29
 
11.8%
짜장면 16
 
6.5%
된장찌개 9
 
3.7%
비빔밥 7
 
2.8%
삼겹살(150g 6
 
2.4%
잔치국수 5
 
2.0%
김치찌개 5
 
2.0%
생선초밥 5
 
2.0%
삼겹살 4
 
1.6%
Other values (92) 123
50.0%
2023-12-11T03:28:57.675625image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
75
 
6.2%
0 71
 
5.8%
60
 
4.9%
51
 
4.2%
50
 
4.1%
g 49
 
4.0%
43
 
3.5%
1 39
 
3.2%
38
 
3.1%
( 36
 
3.0%
Other values (138) 705
57.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 919
75.5%
Decimal Number 157
 
12.9%
Lowercase Letter 49
 
4.0%
Open Punctuation 36
 
3.0%
Close Punctuation 36
 
3.0%
Space Separator 9
 
0.7%
Other Punctuation 7
 
0.6%
Uppercase Letter 2
 
0.2%
Math Symbol 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
75
 
8.2%
60
 
6.5%
51
 
5.5%
50
 
5.4%
43
 
4.7%
38
 
4.1%
34
 
3.7%
33
 
3.6%
32
 
3.5%
29
 
3.2%
Other values (124) 474
51.6%
Decimal Number
ValueCountFrequency (%)
0 71
45.2%
1 39
24.8%
2 23
 
14.6%
5 17
 
10.8%
8 5
 
3.2%
4 1
 
0.6%
3 1
 
0.6%
Lowercase Letter
ValueCountFrequency (%)
g 49
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Other Punctuation
ValueCountFrequency (%)
, 7
100.0%
Uppercase Letter
ValueCountFrequency (%)
G 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 919
75.5%
Common 247
 
20.3%
Latin 51
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
75
 
8.2%
60
 
6.5%
51
 
5.5%
50
 
5.4%
43
 
4.7%
38
 
4.1%
34
 
3.7%
33
 
3.6%
32
 
3.5%
29
 
3.2%
Other values (124) 474
51.6%
Common
ValueCountFrequency (%)
0 71
28.7%
1 39
15.8%
( 36
14.6%
) 36
14.6%
2 23
 
9.3%
5 17
 
6.9%
9
 
3.6%
, 7
 
2.8%
8 5
 
2.0%
+ 2
 
0.8%
Other values (2) 2
 
0.8%
Latin
ValueCountFrequency (%)
g 49
96.1%
G 2
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 919
75.5%
ASCII 298
 
24.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
75
 
8.2%
60
 
6.5%
51
 
5.5%
50
 
5.4%
43
 
4.7%
38
 
4.1%
34
 
3.7%
33
 
3.6%
32
 
3.5%
29
 
3.2%
Other values (124) 474
51.6%
ASCII
ValueCountFrequency (%)
0 71
23.8%
g 49
16.4%
1 39
13.1%
( 36
12.1%
) 36
12.1%
2 23
 
7.7%
5 17
 
5.7%
9
 
3.0%
, 7
 
2.3%
8 5
 
1.7%
Other values (4) 6
 
2.0%

가격1
Real number (ℝ)

Distinct35
Distinct (%)14.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4845.1899
Minimum700
Maximum19000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.2 KiB
2023-12-11T03:28:57.837466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum700
5-th percentile2500
Q13500
median4500
Q36000
95-th percentile9000
Maximum19000
Range18300
Interquartile range (IQR)2500

Descriptive statistics

Standard deviation2392.0819
Coefficient of variation (CV)0.4937024
Kurtosis9.2314905
Mean4845.1899
Median Absolute Deviation (MAD)1500
Skewness2.2765937
Sum1148310
Variance5722055.6
MonotonicityNot monotonic
2023-12-11T03:28:58.027524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
4000 42
17.7%
5000 28
11.8%
3000 24
10.1%
4500 23
9.7%
6000 22
9.3%
7000 21
8.9%
3500 18
7.6%
2500 16
 
6.8%
8000 6
 
2.5%
1500 5
 
2.1%
Other values (25) 32
13.5%
ValueCountFrequency (%)
700 1
 
0.4%
1000 1
 
0.4%
1200 1
 
0.4%
1500 5
 
2.1%
1800 1
 
0.4%
2000 2
 
0.8%
2500 16
6.8%
2700 1
 
0.4%
2800 1
 
0.4%
2900 2
 
0.8%
ValueCountFrequency (%)
19000 1
 
0.4%
18000 1
 
0.4%
14000 1
 
0.4%
13000 1
 
0.4%
12000 1
 
0.4%
11000 1
 
0.4%
10000 1
 
0.4%
9900 1
 
0.4%
9330 3
1.3%
9000 2
0.8%

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
<NA>
227 
신규
 
8
상호변경(전:풍기인삼가)
 
1
상호변경(전:고향맛식당)
 
1

Length

Max length13
Median length4
Mean length4.0084388
Min length2

Unique

Unique2 ?
Unique (%)0.8%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 227
95.8%
신규 8
 
3.4%
상호변경(전:풍기인삼가) 1
 
0.4%
상호변경(전:고향맛식당) 1
 
0.4%

Length

2023-12-11T03:28:58.198148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T03:28:58.339614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 227
95.8%
신규 8
 
3.4%
상호변경(전:풍기인삼가 1
 
0.4%
상호변경(전:고향맛식당 1
 
0.4%

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing237
Missing (%)100.0%
Memory size2.2 KiB

Interactions

2023-12-11T03:28:50.514812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T03:28:49.812524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T03:28:50.680568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T03:28:50.354475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T03:28:58.433439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지정연월업종구군가격1비고
연번1.0000.4780.7110.9410.2671.000
지정연월0.4781.0000.4310.4270.2471.000
업종0.7110.4311.0000.3340.5840.898
구군0.9410.4270.3341.0000.2400.000
가격10.2670.2470.5840.2401.0000.000
비고1.0001.0000.8980.0000.0001.000
2023-12-11T03:28:58.556481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구군비고업종
구군1.0000.0000.165
비고0.0001.0000.598
업종0.1650.5981.000
2023-12-11T03:28:58.671746image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번가격1업종구군비고
연번1.0000.0160.2880.8250.845
가격10.0161.0000.3150.1180.000
업종0.2880.3151.0000.1650.598
구군0.8250.1180.1651.0000.000
비고0.8450.0000.5980.0001.000

Missing values

2023-12-11T03:28:50.925190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T03:28:51.225483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번지정연월업종업소명업주명구군주소연락처메뉴1가격1비고Unnamed: 11
012012-05기타양식부림해물손수제비석윤기중구?중구?국채보상로 139길8053-292-3482해물칼국수4500<NA><NA>
122012-06기타양식명가허종규중구?중구?달구벌대로 지하 2100 S106호053-252-9743칼국수4500<NA><NA>
232011-11중식덕화반점정명숙중구?중구?중앙대로 67길 31053-254-0958짜장면2500<NA><NA>
342011-11중식만리향반점전은숙중구?중구?중앙대로 65길14053-255-9380짜장면4000<NA><NA>
452011-11중식영빈반점김두식중구?중구?경상감영길 117-7053-255-7200짜장면3500<NA><NA>
562011-11중식영화반점김원옥중구?중구?중앙대로 406-13053-423-7008자장면2500<NA><NA>
672012-05중식장성루장경숙중구?중구?국채보상로 139길10053-425-4303짜장면3000<NA><NA>
782012-06중식중해반점우려원중구?중구?명륜로 63053-255-8555짜장면3000<NA><NA>
892012-06중식행복반점박금숙중구?중구?달구벌대로 2125-5053-423-0908짜장면2500<NA><NA>
9102013-05중식만리궁성김일규중구?중구?남산로 21053-257-6166짜장면4000<NA><NA>
연번지정연월업종업소명업주명구군주소연락처메뉴1가격1비고Unnamed: 11
2272282013-12한식_육류이놈애양곱창최설봄달성군?달성군?다사읍 죽곡리 5-21, B동053-588-7419곱창(200g,국내산)11000<NA><NA>
2282292013-05한식_육류고구려나윤권달성군?달성군?화원읍 명천로 244053-639-6766삼겹살(200g,국내산)9330<NA><NA>
2292302013-05한식_육류대명숯불갈비한경순달성군?달성군?논공읍 금강로2길 19-2053-615-8668비빔밥5000<NA><NA>
2302312013-05한식_육류동문돌식당김정자달성군?달성군?옥포면 옥포로 282053-616-6632칼국수4000<NA><NA>
2312322013-05한식_육류별난집삼겹살김영보달성군?달성군?화원읍 명천로 250053-636-1960삼겹살(200g,국내산)7220<NA><NA>
2322332013-05한식_육류월촌숯불갈비임정숙달성군?달성군?현풍면 현풍동로 51053-614-3464삼겹살(200g,국내산)9330<NA><NA>
2332342013-05한식_육류한가네식당정문주달성군?달성군?현풍면 현풍동로 53053-615-9396삼겹살(200g,국내산)9330<NA><NA>
2342352011-11한식_일반한상한정식백선옥달성군?달성군?옥포면 비슬로 2205053-611-1033김치찌개4000<NA><NA>
2352362013-05한식_일반화선장박화자달성군?달성군?화원읍 비슬로 523길 3053-636-1508칼국수5000<NA><NA>
2362372012-06한식_찌개류정아국밥정곡자달성군?달성군?비슬로 512길 58-5010-xxxx-xxxx칼국수3500<NA><NA>