Overview

Dataset statistics

Number of variables12
Number of observations136
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.3 KiB
Average record size in memory100.0 B

Variable types

Numeric3
Text6
Categorical3

Dataset

Description대전광역시 서구 모범음식점 현황 데이터입니다.(순번, 업소명, 행정동, 행정동코드, 법정동, 법정동코드, 지번주소, 도로명주소, 상세주소, 음식의 유형, 주된음식, 전화번호)
URLhttps://www.data.go.kr/data/15104114/fileData.do

Alerts

법정동 is highly overall correlated with 행정동코드 and 2 other fieldsHigh correlation
행정동 is highly overall correlated with 행정동코드 and 2 other fieldsHigh correlation
행정동코드 is highly overall correlated with 법정동코드 and 2 other fieldsHigh correlation
법정동코드 is highly overall correlated with 행정동코드 and 2 other fieldsHigh correlation
음식의유형 is highly imbalanced (58.4%)Imbalance
순번 has unique valuesUnique
업소명 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:53:30.244524
Analysis finished2023-12-12 23:53:31.805743
Duration1.56 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct136
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.5
Minimum1
Maximum136
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T08:53:31.873775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.75
Q134.75
median68.5
Q3102.25
95-th percentile129.25
Maximum136
Range135
Interquartile range (IQR)67.5

Descriptive statistics

Standard deviation39.403892
Coefficient of variation (CV)0.57523929
Kurtosis-1.2
Mean68.5
Median Absolute Deviation (MAD)34
Skewness0
Sum9316
Variance1552.6667
MonotonicityStrictly increasing
2023-12-13T08:53:31.995779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
95 1
 
0.7%
89 1
 
0.7%
90 1
 
0.7%
91 1
 
0.7%
92 1
 
0.7%
93 1
 
0.7%
94 1
 
0.7%
96 1
 
0.7%
70 1
 
0.7%
Other values (126) 126
92.6%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
136 1
0.7%
135 1
0.7%
134 1
0.7%
133 1
0.7%
132 1
0.7%
131 1
0.7%
130 1
0.7%
129 1
0.7%
128 1
0.7%
127 1
0.7%

업소명
Text

UNIQUE 

Distinct136
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T08:53:32.237085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length14
Mean length5.875
Min length2

Characters and Unicode

Total characters799
Distinct characters263
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique136 ?
Unique (%)100.0%

Sample

1st row충무할매낙지볶음
2nd row정림가든
3rd row대성콩국수
4th row돈수원갈비
5th row한마음면옥식당
ValueCountFrequency (%)
토종칼국수 2
 
1.4%
충무할매낙지볶음 1
 
0.7%
크래버 1
 
0.7%
제주유기농쌈밥촌 1
 
0.7%
토담낙지한마당 1
 
0.7%
총체보리한우타운 1
 
0.7%
만년애한우바보곰탕 1
 
0.7%
정림가든 1
 
0.7%
대게나라 1
 
0.7%
백마강숯불민물장어 1
 
0.7%
Other values (130) 130
92.2%
2023-12-13T08:53:32.597085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
18
 
2.3%
14
 
1.8%
14
 
1.8%
14
 
1.8%
13
 
1.6%
13
 
1.6%
12
 
1.5%
12
 
1.5%
12
 
1.5%
11
 
1.4%
Other values (253) 666
83.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 768
96.1%
Decimal Number 11
 
1.4%
Space Separator 5
 
0.6%
Close Punctuation 5
 
0.6%
Open Punctuation 5
 
0.6%
Other Punctuation 3
 
0.4%
Uppercase Letter 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
18
 
2.3%
14
 
1.8%
14
 
1.8%
14
 
1.8%
13
 
1.7%
13
 
1.7%
12
 
1.6%
12
 
1.6%
12
 
1.6%
11
 
1.4%
Other values (244) 635
82.7%
Decimal Number
ValueCountFrequency (%)
5 5
45.5%
0 4
36.4%
2 2
 
18.2%
Uppercase Letter
ValueCountFrequency (%)
D 1
50.0%
K 1
50.0%
Space Separator
ValueCountFrequency (%)
5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 768
96.1%
Common 29
 
3.6%
Latin 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
18
 
2.3%
14
 
1.8%
14
 
1.8%
14
 
1.8%
13
 
1.7%
13
 
1.7%
12
 
1.6%
12
 
1.6%
12
 
1.6%
11
 
1.4%
Other values (244) 635
82.7%
Common
ValueCountFrequency (%)
5
17.2%
5 5
17.2%
) 5
17.2%
( 5
17.2%
0 4
13.8%
. 3
10.3%
2 2
 
6.9%
Latin
ValueCountFrequency (%)
D 1
50.0%
K 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 768
96.1%
ASCII 31
 
3.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
18
 
2.3%
14
 
1.8%
14
 
1.8%
14
 
1.8%
13
 
1.7%
13
 
1.7%
12
 
1.6%
12
 
1.6%
12
 
1.6%
11
 
1.4%
Other values (244) 635
82.7%
ASCII
ValueCountFrequency (%)
5
16.1%
5 5
16.1%
) 5
16.1%
( 5
16.1%
0 4
12.9%
. 3
9.7%
2 2
 
6.5%
D 1
 
3.2%
K 1
 
3.2%

행정동
Categorical

HIGH CORRELATION 

Distinct19
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
둔산2동
24 
만년동
18 
둔산1동
14 
도마2동
탄방동
Other values (14)
63 

Length

Max length4
Median length4
Mean length3.5220588
Min length2

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row용문동
2nd row정림동
3rd row도마1동
4th row도마2동
5th row도마2동

Common Values

ValueCountFrequency (%)
둔산2동 24
17.6%
만년동 18
13.2%
둔산1동 14
10.3%
도마2동 9
 
6.6%
탄방동 8
 
5.9%
갈마1동 8
 
5.9%
도마1동 7
 
5.1%
월평2동 6
 
4.4%
복수동 6
 
4.4%
괴정동 6
 
4.4%
Other values (9) 30
22.1%

Length

2023-12-13T08:53:32.709921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
둔산2동 24
17.6%
만년동 18
13.2%
둔산1동 14
10.3%
도마2동 9
 
6.6%
탄방동 8
 
5.9%
갈마1동 8
 
5.9%
도마1동 7
 
5.1%
복수동 6
 
4.4%
괴정동 6
 
4.4%
월평2동 6
 
4.4%
Other values (9) 30
22.1%

행정동코드
Real number (ℝ)

HIGH CORRELATION 

Distinct19
Distinct (%)14.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0170591 × 109
Minimum3.017051 × 109
Maximum3.017065 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T08:53:32.804371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.017051 × 109
5-th percentile3.017052 × 109
Q13.0170555 × 109
median3.0170587 × 109
Q33.017064 × 109
95-th percentile3.017065 × 109
Maximum3.017065 × 109
Range14000
Interquartile range (IQR)8500

Descriptive statistics

Standard deviation4683.1171
Coefficient of variation (CV)1.5522126 × 10-6
Kurtosis-1.3897033
Mean3.0170591 × 109
Median Absolute Deviation (MAD)4950
Skewness-0.17755998
Sum4.1032004 × 1011
Variance21931586
MonotonicityNot monotonic
2023-12-13T08:53:32.904580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
3017064000 24
17.6%
3017065000 18
13.2%
3017063000 14
10.3%
3017053000 9
 
6.6%
3017055500 8
 
5.9%
3017058100 8
 
5.9%
3017052000 7
 
5.1%
3017056000 6
 
4.4%
3017058700 6
 
4.4%
3017051000 6
 
4.4%
Other values (9) 30
22.1%
ValueCountFrequency (%)
3017051000 6
4.4%
3017052000 7
5.1%
3017053000 9
6.6%
3017053500 4
2.9%
3017054000 4
2.9%
3017055000 2
 
1.5%
3017055500 8
5.9%
3017056000 6
4.4%
3017057000 3
 
2.2%
3017057500 5
3.7%
ValueCountFrequency (%)
3017065000 18
13.2%
3017064000 24
17.6%
3017063000 14
10.3%
3017059700 4
 
2.9%
3017059000 4
 
2.9%
3017058700 6
 
4.4%
3017058600 1
 
0.7%
3017058200 3
 
2.2%
3017058100 8
 
5.9%
3017057500 5
 
3.7%

법정동
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)11.8%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
둔산동
38 
만년동
18 
도마동
16 
갈마동
11 
탄방동
Other values (11)
45 

Length

Max length4
Median length3
Mean length2.9411765
Min length2

Unique

Unique1 ?
Unique (%)0.7%

Sample

1st row용문동
2nd row정림동
3rd row도마동
4th row도마동
5th row도마동

Common Values

ValueCountFrequency (%)
둔산동 38
27.9%
만년동 18
13.2%
도마동 16
11.8%
갈마동 11
 
8.1%
탄방동 8
 
5.9%
월평동 7
 
5.1%
복수동 6
 
4.4%
괴정동 6
 
4.4%
내동 5
 
3.7%
정림동 4
 
2.9%
Other values (6) 17
12.5%

Length

2023-12-13T08:53:33.020619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
둔산동 38
27.9%
만년동 18
13.2%
도마동 16
11.8%
갈마동 11
 
8.1%
탄방동 8
 
5.9%
월평동 7
 
5.1%
복수동 6
 
4.4%
괴정동 6
 
4.4%
내동 5
 
3.7%
정림동 4
 
2.9%
Other values (6) 17
12.5%

법정동코드
Real number (ℝ)

HIGH CORRELATION 

Distinct16
Distinct (%)11.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.0170111 × 109
Minimum3.0170101 × 109
Maximum3.0170128 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-13T08:53:33.129443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum3.0170101 × 109
5-th percentile3.0170102 × 109
Q13.0170106 × 109
median3.0170112 × 109
Q33.0170112 × 109
95-th percentile3.0170128 × 109
Maximum3.0170128 × 109
Range2700
Interquartile range (IQR)600

Descriptive statistics

Standard deviation764.27305
Coefficient of variation (CV)2.5332125 × 10-7
Kurtosis0.48111399
Mean3.0170111 × 109
Median Absolute Deviation (MAD)350
Skewness0.98846249
Sum4.1031352 × 1011
Variance584113.29
MonotonicityNot monotonic
2023-12-13T08:53:33.230978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%)
3017011200 38
27.9%
3017012800 18
13.2%
3017010300 16
11.8%
3017011100 11
 
8.1%
3017010600 8
 
5.9%
3017011300 7
 
5.1%
3017010100 6
 
4.4%
3017010800 6
 
4.4%
3017011000 5
 
3.7%
3017011600 4
 
2.9%
Other values (6) 17
12.5%
ValueCountFrequency (%)
3017010100 6
 
4.4%
3017010200 4
 
2.9%
3017010300 16
11.8%
3017010400 4
 
2.9%
3017010500 2
 
1.5%
3017010600 8
5.9%
3017010800 6
 
4.4%
3017010900 3
 
2.2%
3017011000 5
 
3.7%
3017011100 11
8.1%
ValueCountFrequency (%)
3017012800 18
13.2%
3017011600 4
 
2.9%
3017011500 3
 
2.2%
3017011400 1
 
0.7%
3017011300 7
 
5.1%
3017011200 38
27.9%
3017011100 11
 
8.1%
3017011000 5
 
3.7%
3017010900 3
 
2.2%
3017010800 6
 
4.4%
Distinct133
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T08:53:33.522058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length13.345588
Min length7

Characters and Unicode

Total characters1815
Distinct characters46
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique131 ?
Unique (%)96.3%

Sample

1st row대전광역시 서구 용문동 277-4
2nd row대전광역시 서구 정림동 8-4
3rd row대전광역시 서구 도마동 67-36
4th row대전광역시 서구 도마동 100-15
5th row대전광역시 서구 도마동 101-6
ValueCountFrequency (%)
대전광역시 81
18.7%
서구 81
18.7%
둔산동 38
 
8.8%
만년동 18
 
4.1%
도마동 15
 
3.5%
갈마동 11
 
2.5%
탄방동 8
 
1.8%
월평동 7
 
1.6%
괴정동 6
 
1.4%
복수동 6
 
1.4%
Other values (138) 163
37.6%
2023-12-13T08:53:33.947703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
298
16.4%
136
 
7.5%
1 86
 
4.7%
81
 
4.5%
81
 
4.5%
81
 
4.5%
81
 
4.5%
81
 
4.5%
81
 
4.5%
81
 
4.5%
Other values (36) 728
40.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 967
53.3%
Decimal Number 501
27.6%
Space Separator 298
 
16.4%
Dash Punctuation 49
 
2.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
136
14.1%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
38
 
3.9%
38
 
3.9%
Other values (24) 188
19.4%
Decimal Number
ValueCountFrequency (%)
1 86
17.2%
3 79
15.8%
2 68
13.6%
6 49
9.8%
4 48
9.6%
7 38
7.6%
5 37
7.4%
9 35
7.0%
8 34
 
6.8%
0 27
 
5.4%
Space Separator
ValueCountFrequency (%)
298
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 49
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 967
53.3%
Common 848
46.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
136
14.1%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
38
 
3.9%
38
 
3.9%
Other values (24) 188
19.4%
Common
ValueCountFrequency (%)
298
35.1%
1 86
 
10.1%
3 79
 
9.3%
2 68
 
8.0%
- 49
 
5.8%
6 49
 
5.8%
4 48
 
5.7%
7 38
 
4.5%
5 37
 
4.4%
9 35
 
4.1%
Other values (2) 61
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 967
53.3%
ASCII 848
46.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
298
35.1%
1 86
 
10.1%
3 79
 
9.3%
2 68
 
8.0%
- 49
 
5.8%
6 49
 
5.8%
4 48
 
5.7%
7 38
 
4.5%
5 37
 
4.4%
9 35
 
4.1%
Other values (2) 61
 
7.2%
Hangul
ValueCountFrequency (%)
136
14.1%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
81
8.4%
38
 
3.9%
38
 
3.9%
Other values (24) 188
19.4%
Distinct131
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T08:53:34.245433image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length27
Median length23
Mean length18.794118
Min length15

Characters and Unicode

Total characters2556
Distinct characters73
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique127 ?
Unique (%)93.4%

Sample

1st row대전광역시 서구 도산로 340
2nd row대전광역시 서구 혜천로 20
3rd row대전광역시 서구 도산로 141
4th row대전광역시 서구 도산로 103
5th row대전광역시 서구 사마1길 22
ValueCountFrequency (%)
서구 136
24.9%
대전광역시 134
24.5%
둔산대로117번길 9
 
1.6%
도산로 7
 
1.3%
15 7
 
1.3%
신갈마로 6
 
1.1%
17 5
 
0.9%
대덕대로 5
 
0.9%
29 5
 
0.9%
둔산중로134번길 5
 
0.9%
Other values (158) 227
41.6%
2023-12-13T08:53:34.636617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
463
18.1%
171
 
6.7%
143
 
5.6%
138
 
5.4%
136
 
5.3%
134
 
5.2%
134
 
5.2%
134
 
5.2%
132
 
5.2%
1 129
 
5.0%
Other values (63) 842
32.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1561
61.1%
Decimal Number 515
 
20.1%
Space Separator 463
 
18.1%
Dash Punctuation 15
 
0.6%
Close Punctuation 1
 
< 0.1%
Open Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
171
11.0%
143
9.2%
138
8.8%
136
8.7%
134
8.6%
134
8.6%
134
8.6%
132
8.5%
72
 
4.6%
70
 
4.5%
Other values (49) 297
19.0%
Decimal Number
ValueCountFrequency (%)
1 129
25.0%
3 63
12.2%
2 59
11.5%
5 51
 
9.9%
7 45
 
8.7%
6 45
 
8.7%
4 38
 
7.4%
8 33
 
6.4%
0 27
 
5.2%
9 25
 
4.9%
Space Separator
ValueCountFrequency (%)
463
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1561
61.1%
Common 995
38.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
171
11.0%
143
9.2%
138
8.8%
136
8.7%
134
8.6%
134
8.6%
134
8.6%
132
8.5%
72
 
4.6%
70
 
4.5%
Other values (49) 297
19.0%
Common
ValueCountFrequency (%)
463
46.5%
1 129
 
13.0%
3 63
 
6.3%
2 59
 
5.9%
5 51
 
5.1%
7 45
 
4.5%
6 45
 
4.5%
4 38
 
3.8%
8 33
 
3.3%
0 27
 
2.7%
Other values (4) 42
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1561
61.1%
ASCII 995
38.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
463
46.5%
1 129
 
13.0%
3 63
 
6.3%
2 59
 
5.9%
5 51
 
5.1%
7 45
 
4.5%
6 45
 
4.5%
4 38
 
3.8%
8 33
 
3.3%
0 27
 
2.7%
Other values (4) 42
 
4.2%
Hangul
ValueCountFrequency (%)
171
11.0%
143
9.2%
138
8.8%
136
8.7%
134
8.6%
134
8.6%
134
8.6%
132
8.5%
72
 
4.6%
70
 
4.5%
Other values (49) 297
19.0%
Distinct135
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T08:53:34.891249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length46
Median length39
Mean length28.639706
Min length21

Characters and Unicode

Total characters3895
Distinct characters104
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique134 ?
Unique (%)98.5%

Sample

1st row대전광역시 서구 도산로 340, 1층 (용문동)
2nd row대전광역시 서구 혜천로 20 (정림동)
3rd row대전광역시 서구 도산로 141, 1,2층 (도마동)
4th row대전광역시 서구 도산로 103 (도마동)
5th row대전광역시 서구 사마1길 22 (도마동)
ValueCountFrequency (%)
대전광역시 136
 
17.9%
서구 136
 
17.9%
1층 38
 
5.0%
둔산동 20
 
2.6%
도마동 13
 
1.7%
2층 11
 
1.5%
만년동 11
 
1.5%
갈마동 10
 
1.3%
둔산대로117번길 9
 
1.2%
도산로 7
 
0.9%
Other values (234) 367
48.4%
2023-12-13T08:53:35.239973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
622
 
16.0%
1 231
 
5.9%
171
 
4.4%
) 158
 
4.1%
( 158
 
4.1%
149
 
3.8%
143
 
3.7%
138
 
3.5%
136
 
3.5%
136
 
3.5%
Other values (94) 1853
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2128
54.6%
Decimal Number 687
 
17.6%
Space Separator 622
 
16.0%
Close Punctuation 158
 
4.1%
Open Punctuation 158
 
4.1%
Other Punctuation 123
 
3.2%
Dash Punctuation 14
 
0.4%
Uppercase Letter 4
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
171
 
8.0%
149
 
7.0%
143
 
6.7%
138
 
6.5%
136
 
6.4%
136
 
6.4%
136
 
6.4%
136
 
6.4%
133
 
6.2%
102
 
4.8%
Other values (75) 748
35.2%
Decimal Number
ValueCountFrequency (%)
1 231
33.6%
2 91
 
13.2%
3 72
 
10.5%
5 53
 
7.7%
0 48
 
7.0%
7 45
 
6.6%
6 44
 
6.4%
4 41
 
6.0%
8 35
 
5.1%
9 27
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
B 2
50.0%
A 1
25.0%
C 1
25.0%
Space Separator
ValueCountFrequency (%)
622
100.0%
Close Punctuation
ValueCountFrequency (%)
) 158
100.0%
Open Punctuation
ValueCountFrequency (%)
( 158
100.0%
Other Punctuation
ValueCountFrequency (%)
, 123
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2128
54.6%
Common 1763
45.3%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
171
 
8.0%
149
 
7.0%
143
 
6.7%
138
 
6.5%
136
 
6.4%
136
 
6.4%
136
 
6.4%
136
 
6.4%
133
 
6.2%
102
 
4.8%
Other values (75) 748
35.2%
Common
ValueCountFrequency (%)
622
35.3%
1 231
 
13.1%
) 158
 
9.0%
( 158
 
9.0%
, 123
 
7.0%
2 91
 
5.2%
3 72
 
4.1%
5 53
 
3.0%
0 48
 
2.7%
7 45
 
2.6%
Other values (6) 162
 
9.2%
Latin
ValueCountFrequency (%)
B 2
50.0%
A 1
25.0%
C 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2128
54.6%
ASCII 1767
45.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
622
35.2%
1 231
 
13.1%
) 158
 
8.9%
( 158
 
8.9%
, 123
 
7.0%
2 91
 
5.1%
3 72
 
4.1%
5 53
 
3.0%
0 48
 
2.7%
7 45
 
2.5%
Other values (9) 166
 
9.4%
Hangul
ValueCountFrequency (%)
171
 
8.0%
149
 
7.0%
143
 
6.7%
138
 
6.5%
136
 
6.4%
136
 
6.4%
136
 
6.4%
136
 
6.4%
133
 
6.2%
102
 
4.8%
Other values (75) 748
35.2%

음식의유형
Categorical

IMBALANCE 

Distinct7
Distinct (%)5.1%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
한식
108 
중식
11 
일식
 
8
분식
 
5
양식
 
2
Other values (2)
 
2

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique2 ?
Unique (%)1.5%

Sample

1st row한식
2nd row한식
3rd row한식
4th row한식
5th row분식

Common Values

ValueCountFrequency (%)
한식 108
79.4%
중식 11
 
8.1%
일식 8
 
5.9%
분식 5
 
3.7%
양식 2
 
1.5%
기타 1
 
0.7%
뷔페 1
 
0.7%

Length

2023-12-13T08:53:35.361399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:53:35.743776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한식 108
79.4%
중식 11
 
8.1%
일식 8
 
5.9%
분식 5
 
3.7%
양식 2
 
1.5%
기타 1
 
0.7%
뷔페 1
 
0.7%
Distinct85
Distinct (%)62.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T08:53:35.973376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length3.8235294
Min length2

Characters and Unicode

Total characters520
Distinct characters130
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)47.8%

Sample

1st row낙지볶음
2nd row오리전골, 장어
3rd row콩국수
4th row돼지갈비
5th row냉면
ValueCountFrequency (%)
삼겹살 11
 
7.5%
돼지갈비 6
 
4.1%
감자탕 6
 
4.1%
활어회 5
 
3.4%
한정식 5
 
3.4%
짜장면 4
 
2.7%
등심구이 4
 
2.7%
칼국수 4
 
2.7%
낙지볶음 4
 
2.7%
짬뽕 3
 
2.0%
Other values (78) 95
64.6%
2023-12-13T08:53:36.366395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23
 
4.4%
18
 
3.5%
17
 
3.3%
17
 
3.3%
17
 
3.3%
17
 
3.3%
16
 
3.1%
16
 
3.1%
14
 
2.7%
, 13
 
2.5%
Other values (120) 352
67.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 496
95.4%
Other Punctuation 13
 
2.5%
Space Separator 11
 
2.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
4.6%
18
 
3.6%
17
 
3.4%
17
 
3.4%
17
 
3.4%
17
 
3.4%
16
 
3.2%
16
 
3.2%
14
 
2.8%
11
 
2.2%
Other values (118) 330
66.5%
Other Punctuation
ValueCountFrequency (%)
, 13
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 496
95.4%
Common 24
 
4.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
4.6%
18
 
3.6%
17
 
3.4%
17
 
3.4%
17
 
3.4%
17
 
3.4%
16
 
3.2%
16
 
3.2%
14
 
2.8%
11
 
2.2%
Other values (118) 330
66.5%
Common
ValueCountFrequency (%)
, 13
54.2%
11
45.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 496
95.4%
ASCII 24
 
4.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
23
 
4.6%
18
 
3.6%
17
 
3.4%
17
 
3.4%
17
 
3.4%
17
 
3.4%
16
 
3.2%
16
 
3.2%
14
 
2.8%
11
 
2.2%
Other values (118) 330
66.5%
ASCII
ValueCountFrequency (%)
, 13
54.2%
11
45.8%

전화번호
Text

UNIQUE 

Distinct136
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-13T08:53:36.601598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length12.220588
Min length9

Characters and Unicode

Total characters1662
Distinct characters19
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique136 ?
Unique (%)100.0%

Sample

1st row042-531-9348
2nd row042-585-2992
3rd row042-533-4586
4th row042-525-9209
5th row042-536-0408
ValueCountFrequency (%)
042-531-9348 1
 
0.7%
042-477-1002 1
 
0.7%
042-537-3889 1
 
0.7%
042-583-7707 1
 
0.7%
042-489-5330 1
 
0.7%
042-482-0985 1
 
0.7%
042-488-4292 1
 
0.7%
042-585-2992 1
 
0.7%
042-527-8788 1
 
0.7%
0507-1438-2210 1
 
0.7%
Other values (127) 127
92.7%
2023-12-13T08:53:36.955505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 270
16.2%
4 232
14.0%
0 224
13.5%
2 224
13.5%
5 163
9.8%
8 129
7.8%
7 103
 
6.2%
3 98
 
5.9%
9 75
 
4.5%
1 75
 
4.5%
Other values (9) 69
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1383
83.2%
Dash Punctuation 270
 
16.2%
Other Letter 8
 
0.5%
Space Separator 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 232
16.8%
0 224
16.2%
2 224
16.2%
5 163
11.8%
8 129
9.3%
7 103
7.4%
3 98
7.1%
9 75
 
5.4%
1 75
 
5.4%
6 60
 
4.3%
Other Letter
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Dash Punctuation
ValueCountFrequency (%)
- 270
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1654
99.5%
Hangul 8
 
0.5%

Most frequent character per script

Common
ValueCountFrequency (%)
- 270
16.3%
4 232
14.0%
0 224
13.5%
2 224
13.5%
5 163
9.9%
8 129
7.8%
7 103
 
6.2%
3 98
 
5.9%
9 75
 
4.5%
1 75
 
4.5%
Other values (2) 61
 
3.7%
Hangul
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1654
99.5%
Hangul 8
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 270
16.3%
4 232
14.0%
0 224
13.5%
2 224
13.5%
5 163
9.9%
8 129
7.8%
7 103
 
6.2%
3 98
 
5.9%
9 75
 
4.5%
1 75
 
4.5%
Other values (2) 61
 
3.7%
Hangul
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%

Interactions

2023-12-13T08:53:31.311116image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:53:30.804528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:53:31.069535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:53:31.402830image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:53:30.883415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:53:31.154159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:53:31.492328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:53:30.980269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:53:31.227731image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:53:37.058102image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번행정동행정동코드법정동법정동코드음식의유형주된음식
순번1.0000.4240.3120.3890.2720.1440.000
행정동0.4241.0001.0000.9950.9960.0000.000
행정동코드0.3121.0001.0000.9730.8500.0000.000
법정동0.3890.9950.9731.0001.0000.0000.364
법정동코드0.2720.9960.8501.0001.0000.0000.000
음식의유형0.1440.0000.0000.0000.0001.0000.998
주된음식0.0000.0000.0000.3640.0000.9981.000
2023-12-13T08:53:37.156675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
음식의유형법정동행정동
음식의유형1.0000.0000.000
법정동0.0001.0000.949
행정동0.0000.9491.000
2023-12-13T08:53:37.236864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번행정동코드법정동코드행정동법정동음식의유형
순번1.0000.1200.1130.1740.1680.000
행정동코드0.1201.0000.9130.9600.8670.000
법정동코드0.1130.9131.0000.9410.9640.000
행정동0.1740.9600.9411.0000.9490.000
법정동0.1680.8670.9640.9491.0000.000
음식의유형0.0000.0000.0000.0000.0001.000

Missing values

2023-12-13T08:53:31.609627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:53:31.755110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번업소명행정동행정동코드법정동법정동코드지번주소도로명주소상세주소음식의유형주된음식전화번호
01충무할매낙지볶음용문동3017055000용문동3017010500대전광역시 서구 용문동 277-4대전광역시 서구 도산로 340대전광역시 서구 도산로 340, 1층 (용문동)한식낙지볶음042-531-9348
12정림가든정림동3017053500정림동3017010400대전광역시 서구 정림동 8-4대전광역시 서구 혜천로 20대전광역시 서구 혜천로 20 (정림동)한식오리전골, 장어042-585-2992
23대성콩국수도마1동3017052000도마동3017010300대전광역시 서구 도마동 67-36대전광역시 서구 도산로 141대전광역시 서구 도산로 141, 1,2층 (도마동)한식콩국수042-533-4586
34돈수원갈비도마2동3017053000도마동3017010300대전광역시 서구 도마동 100-15대전광역시 서구 도산로 103대전광역시 서구 도산로 103 (도마동)한식돼지갈비042-525-9209
45한마음면옥식당도마2동3017053000도마동3017010300대전광역시 서구 도마동 101-6대전광역시 서구 사마1길 22대전광역시 서구 사마1길 22 (도마동)분식냉면042-536-0408
56이모네감자탕도마1동3017052000도마동3017010300대전광역시 서구 도마동 34-44대전광역시 서구 도솔로 135대전광역시 서구 도솔로 135 (도마동)한식감자탕042-522-9781
67나인관저2동3017059700관저동3017011600대전광역시 서구 관저동 833대전광역시 서구 계백로776번길 134대전광역시 서구 계백로776번길 134 (관저동)한식멧돼지숯불구이042-545-3000
78그림같은집도마2동3017053000도마동3017010300대전광역시 서구 도마동 328-20대전광역시 서구 배재로91번길 51-2대전광역시 서구 배재로91번길 51-2 (도마동)한식백반042-524-7239
89도시애바다갈마1동3017058100갈마동3017011100대전광역시 서구 갈마동 337-14대전광역시 서구 신갈마로 171대전광역시 서구 신갈마로 171 (갈마동)한식옻닭042-534-3345
910정림일식정림동3017053500정림동3017010400대전광역시 서구 정림동 514대전광역시 서구 정림로 78-14대전광역시 서구 정림로 78-14, 1층 (정림동)일식활어회042-583-4545
순번업소명행정동행정동코드법정동법정동코드지번주소도로명주소상세주소음식의유형주된음식전화번호
126127진성아구찜가수원동3017059000도안동3017011500도안동 1664대전광역시 서구 도안동로31번길 20-16대전광역시 서구 도안동로31번길 20-16 (가수원동)한식아구찜042-541-3940
127128대대로감자탕둔산2동3017064000둔산동3017011200둔산동 923대전광역시 서구 둔산중로134번길 21대전광역시 서구 둔산중로134번길 21, 1층 110호 (둔산동)한식감자탕042-482-6767
128129참맛있는칼국수월평2동3017058700월평동3017011300월평동 229대전광역시 서구 청사서로54번길 17대전광역시 서구 청사서로54번길 17, 2층 (월평동, 2층일부)한식칼국수042-483-9259
129130추성순대족발둔산2동3017064000둔산동3017011200둔산동 925대전광역시 서구 둔산중로134번길 39대전광역시 서구 둔산중로134번길 39, 1층 108호 (둔산동)한식순대국밥042-482-4890
130131시온성관저2동3017059700관저동3017011600관저동 1613-1대전광역시 서구 구봉산북로280번길 12대전광역시 서구 구봉산북로280번길 12, 1층 (관저동)중식짜장면042-541-9595
131132토종칼국수 복수점복수동3017051000복수동3017010100복수동 729대전광역시 서구 복수남로 6대전광역시 서구 복수남로 6, 1층 (복수동, 1층일부)한식칼국수042-581-5662
132133까치돌구이탄방동3017055500탄방동3017010600탄방동 733대전광역시 서구 문정로89번길 32대전광역시 서구 문정로89번길 32, 1층 (탄방동, 1층일부)한식삼겹살042-472-0492
133134오가네대구왕뽈떼기갈마1동3017058100갈마동3017011100갈마동 338-3대전광역시 서구 갈마로 36대전광역시 서구 갈마로 36, 1층 (갈마동)한식대구뽈찜042-526-3007
134135한우전문점 눈꽃탄방동3017055500탄방동3017010600탄방동 724대전광역시 서구 둔산남로 100대전광역시 서구 둔산남로 100, 8층 (탄방동)한식등심구이0507-1478-7830
135136대관령양푼이동태찌개변동3017054000변동3017010200대전광역시 서구 변동 43-9대전광역시 서구 변동로 110대전광역시 서구 변동로 110, 1층 일부호 (변동)한식동태찌개042-536-7751