Overview

Dataset statistics

Number of variables15
Number of observations141
Missing cells18
Missing cells (%)0.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory17.5 KiB
Average record size in memory126.9 B

Variable types

Categorical6
Numeric3
Text6

Dataset

Description시군구코드,지정년도,지정번호,신청일자,지정일자,업소명,소재지도로명,소재지지번,허가(신고)번호,업태명,주된음식,영업장면적(㎡),행정동명,급수시설구분,소재지전화번호
Author성동구
URLhttps://data.seoul.go.kr/dataList/OA-10756/S/1/datasetView.do

Alerts

시군구코드 has constant value ""Constant
지정년도 has constant value ""Constant
지정일자 has constant value ""Constant
급수시설구분 is highly overall correlated with 지정번호 and 4 other fieldsHigh correlation
업태명 is highly overall correlated with 급수시설구분High correlation
행정동명 is highly overall correlated with 급수시설구분High correlation
지정번호 is highly overall correlated with 신청일자 and 1 other fieldsHigh correlation
신청일자 is highly overall correlated with 지정번호 and 1 other fieldsHigh correlation
영업장면적(㎡) is highly overall correlated with 급수시설구분High correlation
업태명 is highly imbalanced (55.8%)Imbalance
소재지전화번호 has 18 (12.8%) missing valuesMissing
업소명 has unique valuesUnique
소재지도로명 has unique valuesUnique
허가(신고)번호 has unique valuesUnique

Reproduction

Analysis started2024-05-04 00:15:54.681203
Analysis finished2024-05-04 00:16:00.132704
Duration5.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구코드
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
3030000
141 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3030000
2nd row3030000
3rd row3030000
4th row3030000
5th row3030000

Common Values

ValueCountFrequency (%)
3030000 141
100.0%

Length

2024-05-04T00:16:00.458840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T00:16:00.832735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3030000 141
100.0%

지정년도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024
141 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2024
2nd row2024
3rd row2024
4th row2024
5th row2024

Common Values

ValueCountFrequency (%)
2024 141
100.0%

Length

2024-05-04T00:16:01.159841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T00:16:01.455466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2024 141
100.0%

지정번호
Real number (ℝ)

HIGH CORRELATION 

Distinct76
Distinct (%)53.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean107.35461
Minimum1
Maximum423
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-05-04T00:16:01.794240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q111
median24
Q3139
95-th percentile418
Maximum423
Range422
Interquartile range (IQR)128

Descriptive statistics

Standard deviation147.80557
Coefficient of variation (CV)1.3767976
Kurtosis0.44643224
Mean107.35461
Median Absolute Deviation (MAD)20
Skewness1.4290224
Sum15137
Variance21846.488
MonotonicityNot monotonic
2024-05-04T00:16:02.309601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
13 5
 
3.5%
10 5
 
3.5%
9 5
 
3.5%
12 5
 
3.5%
20 4
 
2.8%
2 4
 
2.8%
22 4
 
2.8%
4 4
 
2.8%
1 4
 
2.8%
8 4
 
2.8%
Other values (66) 97
68.8%
ValueCountFrequency (%)
1 4
2.8%
2 4
2.8%
3 2
 
1.4%
4 4
2.8%
5 1
 
0.7%
6 2
 
1.4%
7 3
2.1%
8 4
2.8%
9 5
3.5%
10 5
3.5%
ValueCountFrequency (%)
423 1
0.7%
421 2
1.4%
420 2
1.4%
419 2
1.4%
418 2
1.4%
417 2
1.4%
416 2
1.4%
415 2
1.4%
414 2
1.4%
413 2
1.4%

신청일자
Real number (ℝ)

HIGH CORRELATION 

Distinct37
Distinct (%)26.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20169186
Minimum20040501
Maximum20231115
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-05-04T00:16:02.700611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20040501
5-th percentile20060930
Q120151015
median20181025
Q320201101
95-th percentile20231115
Maximum20231115
Range190614
Interquartile range (IQR)50086

Descriptive statistics

Standard deviation48464.518
Coefficient of variation (CV)0.0024028991
Kurtosis0.63177131
Mean20169186
Median Absolute Deviation (MAD)30009
Skewness-1.0338046
Sum2.8438552 × 109
Variance2.3488095 × 109
MonotonicityNot monotonic
2024-05-04T00:16:03.316046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
20191016 18
 
12.8%
20171020 14
 
9.9%
20201101 12
 
8.5%
20231115 11
 
7.8%
20211101 11
 
7.8%
20221116 8
 
5.7%
20040501 6
 
4.3%
20111116 6
 
4.3%
20141110 5
 
3.5%
20131028 5
 
3.5%
Other values (27) 45
31.9%
ValueCountFrequency (%)
20040501 6
4.3%
20060519 1
 
0.7%
20060930 1
 
0.7%
20061120 1
 
0.7%
20070522 1
 
0.7%
20071123 1
 
0.7%
20081127 1
 
0.7%
20100623 1
 
0.7%
20111116 6
4.3%
20121016 2
 
1.4%
ValueCountFrequency (%)
20231115 11
7.8%
20221116 8
5.7%
20221115 3
 
2.1%
20211101 11
7.8%
20201101 12
8.5%
20191016 18
12.8%
20181224 3
 
2.1%
20181029 1
 
0.7%
20181026 1
 
0.7%
20181025 3
 
2.1%

지정일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
20240118
141 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20240118
2nd row20240118
3rd row20240118
4th row20240118
5th row20240118

Common Values

ValueCountFrequency (%)
20240118 141
100.0%

Length

2024-05-04T00:16:04.059766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T00:16:04.506137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20240118 141
100.0%

업소명
Text

UNIQUE 

Distinct141
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-05-04T00:16:05.211359image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length5.7588652
Min length2

Characters and Unicode

Total characters812
Distinct characters249
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)100.0%

Sample

1st row귀한족발 왕십리점
2nd row쭈식이 상회
3rd row대촌정
4th row동보성
5th row이가갈비
ValueCountFrequency (%)
주식회사 2
 
1.1%
본점 2
 
1.1%
뚝섬점 1
 
0.6%
동해루 1
 
0.6%
전주본가영양돌솥밥 1
 
0.6%
푸른바다횟집 1
 
0.6%
육백더육백 1
 
0.6%
푸줏간생고기점 1
 
0.6%
이북집찹쌀순대 1
 
0.6%
서래 1
 
0.6%
Other values (163) 163
93.1%
2024-05-04T00:16:06.708782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
34
 
4.2%
20
 
2.5%
19
 
2.3%
16
 
2.0%
14
 
1.7%
14
 
1.7%
12
 
1.5%
12
 
1.5%
11
 
1.4%
11
 
1.4%
Other values (239) 649
79.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 759
93.5%
Space Separator 34
 
4.2%
Uppercase Letter 6
 
0.7%
Close Punctuation 5
 
0.6%
Open Punctuation 4
 
0.5%
Other Punctuation 2
 
0.2%
Dash Punctuation 1
 
0.1%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
20
 
2.6%
19
 
2.5%
16
 
2.1%
14
 
1.8%
14
 
1.8%
12
 
1.6%
12
 
1.6%
11
 
1.4%
11
 
1.4%
11
 
1.4%
Other values (228) 619
81.6%
Uppercase Letter
ValueCountFrequency (%)
E 2
33.3%
O 1
16.7%
N 1
16.7%
A 1
16.7%
S 1
16.7%
Space Separator
ValueCountFrequency (%)
34
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 759
93.5%
Common 47
 
5.8%
Latin 6
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
20
 
2.6%
19
 
2.5%
16
 
2.1%
14
 
1.8%
14
 
1.8%
12
 
1.6%
12
 
1.6%
11
 
1.4%
11
 
1.4%
11
 
1.4%
Other values (228) 619
81.6%
Common
ValueCountFrequency (%)
34
72.3%
) 5
 
10.6%
( 4
 
8.5%
. 2
 
4.3%
- 1
 
2.1%
2 1
 
2.1%
Latin
ValueCountFrequency (%)
E 2
33.3%
O 1
16.7%
N 1
16.7%
A 1
16.7%
S 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 759
93.5%
ASCII 53
 
6.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
34
64.2%
) 5
 
9.4%
( 4
 
7.5%
. 2
 
3.8%
E 2
 
3.8%
O 1
 
1.9%
N 1
 
1.9%
A 1
 
1.9%
- 1
 
1.9%
S 1
 
1.9%
Hangul
ValueCountFrequency (%)
20
 
2.6%
19
 
2.5%
16
 
2.1%
14
 
1.8%
14
 
1.8%
12
 
1.6%
12
 
1.6%
11
 
1.4%
11
 
1.4%
11
 
1.4%
Other values (228) 619
81.6%

소재지도로명
Text

UNIQUE 

Distinct141
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-05-04T00:16:07.618720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length48
Mean length32.751773
Min length23

Characters and Unicode

Total characters4618
Distinct characters124
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)100.0%

Sample

1st row서울특별시 성동구 무학로2길 43, 1층 (도선동)
2nd row서울특별시 성동구 마조로7길 9-1, 1층 (행당동)
3rd row서울특별시 성동구 마장로23길 4, (홍익동)
4th row서울특별시 성동구 한림말1길 15-2, (옥수동)
5th row서울특별시 성동구 마조로 25, (행당동)
ValueCountFrequency (%)
서울특별시 141
 
16.7%
성동구 141
 
16.7%
1층 38
 
4.5%
성수동2가 25
 
3.0%
성수동1가 21
 
2.5%
지상1층 14
 
1.7%
마장동 13
 
1.5%
행당동 13
 
1.5%
2층 12
 
1.4%
도선동 8
 
0.9%
Other values (273) 419
49.6%
2024-05-04T00:16:09.106428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
706
 
15.3%
300
 
6.5%
1 268
 
5.8%
225
 
4.9%
, 217
 
4.7%
) 155
 
3.4%
( 155
 
3.4%
154
 
3.3%
2 150
 
3.2%
145
 
3.1%
Other values (114) 2143
46.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2546
55.1%
Decimal Number 792
 
17.2%
Space Separator 706
 
15.3%
Other Punctuation 217
 
4.7%
Close Punctuation 155
 
3.4%
Open Punctuation 155
 
3.4%
Dash Punctuation 38
 
0.8%
Uppercase Letter 7
 
0.2%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
300
 
11.8%
225
 
8.8%
154
 
6.0%
145
 
5.7%
143
 
5.6%
141
 
5.5%
141
 
5.5%
141
 
5.5%
107
 
4.2%
96
 
3.8%
Other values (95) 953
37.4%
Decimal Number
ValueCountFrequency (%)
1 268
33.8%
2 150
18.9%
3 76
 
9.6%
0 60
 
7.6%
4 59
 
7.4%
7 42
 
5.3%
9 39
 
4.9%
6 35
 
4.4%
5 32
 
4.0%
8 31
 
3.9%
Uppercase Letter
ValueCountFrequency (%)
T 3
42.9%
I 3
42.9%
L 1
 
14.3%
Space Separator
ValueCountFrequency (%)
706
100.0%
Other Punctuation
ValueCountFrequency (%)
, 217
100.0%
Close Punctuation
ValueCountFrequency (%)
) 155
100.0%
Open Punctuation
ValueCountFrequency (%)
( 155
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 38
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2546
55.1%
Common 2065
44.7%
Latin 7
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
300
 
11.8%
225
 
8.8%
154
 
6.0%
145
 
5.7%
143
 
5.6%
141
 
5.5%
141
 
5.5%
141
 
5.5%
107
 
4.2%
96
 
3.8%
Other values (95) 953
37.4%
Common
ValueCountFrequency (%)
706
34.2%
1 268
 
13.0%
, 217
 
10.5%
) 155
 
7.5%
( 155
 
7.5%
2 150
 
7.3%
3 76
 
3.7%
0 60
 
2.9%
4 59
 
2.9%
7 42
 
2.0%
Other values (6) 177
 
8.6%
Latin
ValueCountFrequency (%)
T 3
42.9%
I 3
42.9%
L 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2546
55.1%
ASCII 2072
44.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
706
34.1%
1 268
 
12.9%
, 217
 
10.5%
) 155
 
7.5%
( 155
 
7.5%
2 150
 
7.2%
3 76
 
3.7%
0 60
 
2.9%
4 59
 
2.8%
7 42
 
2.0%
Other values (9) 184
 
8.9%
Hangul
ValueCountFrequency (%)
300
 
11.8%
225
 
8.8%
154
 
6.0%
145
 
5.7%
143
 
5.6%
141
 
5.5%
141
 
5.5%
141
 
5.5%
107
 
4.2%
96
 
3.8%
Other values (95) 953
37.4%
Distinct140
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-05-04T00:16:10.257549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length40
Mean length29.148936
Min length22

Characters and Unicode

Total characters4110
Distinct characters109
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique139 ?
Unique (%)98.6%

Sample

1st row서울특별시 성동구 도선동 144번지
2nd row서울특별시 성동구 행당동 3번지 10호
3rd row서울특별시 성동구 홍익동 112번지
4th row서울특별시 성동구 옥수동 298번지
5th row서울특별시 성동구 행당동 6번지 2호
ValueCountFrequency (%)
서울특별시 141
18.4%
성동구 141
18.4%
성수동2가 32
 
4.2%
성수동1가 24
 
3.1%
지상1층 24
 
3.1%
행당동 18
 
2.3%
1호 15
 
2.0%
마장동 14
 
1.8%
656번지 12
 
1.6%
도선동 11
 
1.4%
Other values (214) 334
43.6%
2024-05-04T00:16:11.895152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
985
24.0%
287
 
7.0%
204
 
5.0%
1 203
 
4.9%
177
 
4.3%
144
 
3.5%
144
 
3.5%
143
 
3.5%
141
 
3.4%
141
 
3.4%
Other values (99) 1541
37.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2271
55.3%
Space Separator 985
24.0%
Decimal Number 785
 
19.1%
Open Punctuation 21
 
0.5%
Close Punctuation 21
 
0.5%
Other Punctuation 13
 
0.3%
Dash Punctuation 7
 
0.2%
Uppercase Letter 6
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
287
12.6%
204
 
9.0%
177
 
7.8%
144
 
6.3%
144
 
6.3%
143
 
6.3%
141
 
6.2%
141
 
6.2%
141
 
6.2%
141
 
6.2%
Other values (80) 608
26.8%
Decimal Number
ValueCountFrequency (%)
1 203
25.9%
2 121
15.4%
3 74
 
9.4%
6 71
 
9.0%
7 62
 
7.9%
4 61
 
7.8%
5 51
 
6.5%
9 51
 
6.5%
8 46
 
5.9%
0 45
 
5.7%
Other Punctuation
ValueCountFrequency (%)
, 12
92.3%
@ 1
 
7.7%
Uppercase Letter
ValueCountFrequency (%)
I 3
50.0%
T 3
50.0%
Space Separator
ValueCountFrequency (%)
985
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2271
55.3%
Common 1833
44.6%
Latin 6
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
287
12.6%
204
 
9.0%
177
 
7.8%
144
 
6.3%
144
 
6.3%
143
 
6.3%
141
 
6.2%
141
 
6.2%
141
 
6.2%
141
 
6.2%
Other values (80) 608
26.8%
Common
ValueCountFrequency (%)
985
53.7%
1 203
 
11.1%
2 121
 
6.6%
3 74
 
4.0%
6 71
 
3.9%
7 62
 
3.4%
4 61
 
3.3%
5 51
 
2.8%
9 51
 
2.8%
8 46
 
2.5%
Other values (7) 108
 
5.9%
Latin
ValueCountFrequency (%)
I 3
50.0%
T 3
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2271
55.3%
ASCII 1839
44.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
985
53.6%
1 203
 
11.0%
2 121
 
6.6%
3 74
 
4.0%
6 71
 
3.9%
7 62
 
3.4%
4 61
 
3.3%
5 51
 
2.8%
9 51
 
2.8%
8 46
 
2.5%
Other values (9) 114
 
6.2%
Hangul
ValueCountFrequency (%)
287
12.6%
204
 
9.0%
177
 
7.8%
144
 
6.3%
144
 
6.3%
143
 
6.3%
141
 
6.2%
141
 
6.2%
141
 
6.2%
141
 
6.2%
Other values (80) 608
26.8%
Distinct141
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-05-04T00:16:12.694608image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length22
Mean length22
Min length22

Characters and Unicode

Total characters3102
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique141 ?
Unique (%)100.0%

Sample

1st row3030000-101-2014-00172
2nd row3030000-101-2014-00221
3rd row3030000-101-1985-04894
4th row3030000-101-1993-03737
5th row3030000-101-1988-06036
ValueCountFrequency (%)
3030000-101-2014-00172 1
 
0.7%
3030000-101-2002-00481 1
 
0.7%
3030000-101-2016-00137 1
 
0.7%
3030000-101-2014-00033 1
 
0.7%
3030000-101-2007-00212 1
 
0.7%
3030000-101-1996-04839 1
 
0.7%
3030000-101-2009-00089 1
 
0.7%
3030000-101-2008-00090 1
 
0.7%
3030000-101-1993-05381 1
 
0.7%
3030000-101-1988-03762 1
 
0.7%
Other values (131) 131
92.9%
2024-05-04T00:16:14.010461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1284
41.4%
1 453
 
14.6%
- 423
 
13.6%
3 349
 
11.3%
2 171
 
5.5%
9 126
 
4.1%
5 64
 
2.1%
8 64
 
2.1%
4 63
 
2.0%
6 53
 
1.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2679
86.4%
Dash Punctuation 423
 
13.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1284
47.9%
1 453
 
16.9%
3 349
 
13.0%
2 171
 
6.4%
9 126
 
4.7%
5 64
 
2.4%
8 64
 
2.4%
4 63
 
2.4%
6 53
 
2.0%
7 52
 
1.9%
Dash Punctuation
ValueCountFrequency (%)
- 423
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3102
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1284
41.4%
1 453
 
14.6%
- 423
 
13.6%
3 349
 
11.3%
2 171
 
5.5%
9 126
 
4.1%
5 64
 
2.1%
8 64
 
2.1%
4 63
 
2.0%
6 53
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3102
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1284
41.4%
1 453
 
14.6%
- 423
 
13.6%
3 349
 
11.3%
2 171
 
5.5%
9 126
 
4.1%
5 64
 
2.1%
8 64
 
2.1%
4 63
 
2.0%
6 53
 
1.7%

업태명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct12
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
한식
104 
중국식
11 
일식
 
9
경양식
 
4
기타
 
4
Other values (7)
 
9

Length

Max length10
Median length2
Mean length2.2765957
Min length2

Unique

Unique5 ?
Unique (%)3.5%

Sample

1st row한식
2nd row한식
3rd row한식
4th row한식
5th row한식

Common Values

ValueCountFrequency (%)
한식 104
73.8%
중국식 11
 
7.8%
일식 9
 
6.4%
경양식 4
 
2.8%
기타 4
 
2.8%
호프/통닭 2
 
1.4%
분식 2
 
1.4%
회집 1
 
0.7%
뷔페식 1
 
0.7%
정종/대포집/소주방 1
 
0.7%
Other values (2) 2
 
1.4%

Length

2024-05-04T00:16:14.720829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한식 104
73.8%
중국식 11
 
7.8%
일식 9
 
6.4%
경양식 4
 
2.8%
기타 4
 
2.8%
호프/통닭 2
 
1.4%
분식 2
 
1.4%
회집 1
 
0.7%
뷔페식 1
 
0.7%
정종/대포집/소주방 1
 
0.7%
Other values (2) 2
 
1.4%
Distinct103
Distinct (%)73.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2024-05-04T00:16:15.291063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length4.3262411
Min length2

Characters and Unicode

Total characters610
Distinct characters141
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)56.7%

Sample

1st row족발
2nd row쭈꾸미볶음
3rd row닭백숙
4th row짜장면
5th row돼지갈비
ValueCountFrequency (%)
삼겹살 9
 
5.4%
돼지고기 5
 
3.0%
등심 5
 
3.0%
백반 4
 
2.4%
탕수육 4
 
2.4%
설렁탕 4
 
2.4%
짜장면 4
 
2.4%
생선회 4
 
2.4%
감자탕 4
 
2.4%
소고기 4
 
2.4%
Other values (97) 121
72.0%
2024-05-04T00:16:16.453550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 30
 
4.9%
27
 
4.4%
24
 
3.9%
17
 
2.8%
17
 
2.8%
15
 
2.5%
15
 
2.5%
14
 
2.3%
14
 
2.3%
13
 
2.1%
Other values (131) 424
69.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 552
90.5%
Other Punctuation 31
 
5.1%
Space Separator 27
 
4.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
4.3%
17
 
3.1%
17
 
3.1%
15
 
2.7%
15
 
2.7%
14
 
2.5%
14
 
2.5%
13
 
2.4%
13
 
2.4%
13
 
2.4%
Other values (128) 397
71.9%
Other Punctuation
ValueCountFrequency (%)
, 30
96.8%
1
 
3.2%
Space Separator
ValueCountFrequency (%)
27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 552
90.5%
Common 58
 
9.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
4.3%
17
 
3.1%
17
 
3.1%
15
 
2.7%
15
 
2.7%
14
 
2.5%
14
 
2.5%
13
 
2.4%
13
 
2.4%
13
 
2.4%
Other values (128) 397
71.9%
Common
ValueCountFrequency (%)
, 30
51.7%
27
46.6%
1
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 552
90.5%
ASCII 57
 
9.3%
None 1
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 30
52.6%
27
47.4%
Hangul
ValueCountFrequency (%)
24
 
4.3%
17
 
3.1%
17
 
3.1%
15
 
2.7%
15
 
2.7%
14
 
2.5%
14
 
2.5%
13
 
2.4%
13
 
2.4%
13
 
2.4%
Other values (128) 397
71.9%
None
ValueCountFrequency (%)
1
100.0%

영업장면적(㎡)
Real number (ℝ)

HIGH CORRELATION 

Distinct139
Distinct (%)98.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean114.00915
Minimum20
Maximum1063.55
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.4 KiB
2024-05-04T00:16:17.099857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20
5-th percentile31.66
Q159.46
median89.79
Q3137.7
95-th percentile243.74
Maximum1063.55
Range1043.55
Interquartile range (IQR)78.24

Descriptive statistics

Standard deviation105.08675
Coefficient of variation (CV)0.92173959
Kurtosis47.564621
Mean114.00915
Median Absolute Deviation (MAD)37.37
Skewness5.6634647
Sum16075.29
Variance11043.224
MonotonicityNot monotonic
2024-05-04T00:16:17.570781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
135.6 2
 
1.4%
89.79 2
 
1.4%
58.2 1
 
0.7%
94.95 1
 
0.7%
43.93 1
 
0.7%
224.4 1
 
0.7%
154.5 1
 
0.7%
83.6 1
 
0.7%
114.92 1
 
0.7%
93.75 1
 
0.7%
Other values (129) 129
91.5%
ValueCountFrequency (%)
20.0 1
0.7%
20.5 1
0.7%
22.44 1
0.7%
24.0 1
0.7%
27.0 1
0.7%
28.7 1
0.7%
28.8 1
0.7%
31.66 1
0.7%
33.0 1
0.7%
33.2 1
0.7%
ValueCountFrequency (%)
1063.55 1
0.7%
395.0 1
0.7%
361.21 1
0.7%
297.0 1
0.7%
270.45 1
0.7%
248.04 1
0.7%
247.0 1
0.7%
243.74 1
0.7%
242.74 1
0.7%
236.2 1
0.7%

행정동명
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)11.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
성수2가제1동
22 
왕십리도선동
18 
마장동
14 
성수1가제1동
13 
사근동
11 
Other values (11)
63 

Length

Max length7
Median length6
Mean length5.3971631
Min length3

Unique

Unique2 ?
Unique (%)1.4%

Sample

1st row왕십리도선동
2nd row사근동
3rd row왕십리도선동
4th row옥수동
5th row사근동

Common Values

ValueCountFrequency (%)
성수2가제1동 22
15.6%
왕십리도선동 18
12.8%
마장동 14
9.9%
성수1가제1동 13
9.2%
사근동 11
7.8%
성수1가제2동 11
7.8%
성수2가제3동 10
7.1%
행당제1동 9
6.4%
옥수동 7
 
5.0%
용답동 7
 
5.0%
Other values (6) 19
13.5%

Length

2024-05-04T00:16:17.970866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
성수2가제1동 22
15.6%
왕십리도선동 18
12.8%
마장동 14
9.9%
성수1가제1동 13
9.2%
사근동 11
7.8%
성수1가제2동 11
7.8%
성수2가제3동 10
7.1%
행당제1동 9
6.4%
옥수동 7
 
5.0%
용답동 7
 
5.0%
Other values (6) 19
13.5%

급수시설구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
상수도전용
73 
<NA>
68 

Length

Max length5
Median length5
Mean length4.5177305
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row상수도전용
4th row상수도전용
5th row상수도전용

Common Values

ValueCountFrequency (%)
상수도전용 73
51.8%
<NA> 68
48.2%

Length

2024-05-04T00:16:18.412297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T00:16:18.780249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
상수도전용 73
51.8%
na 68
48.2%

소재지전화번호
Text

MISSING 

Distinct123
Distinct (%)100.0%
Missing18
Missing (%)12.8%
Memory size1.2 KiB
2024-05-04T00:16:19.300729image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length10.300813
Min length7

Characters and Unicode

Total characters1267
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique123 ?
Unique (%)100.0%

Sample

1st row0222942329
2nd row0222813228
3rd row0222950421
4th row0222994520
5th row0222988878
ValueCountFrequency (%)
02 47
27.2%
0222998592 1
 
0.6%
4662756 1
 
0.6%
0222820220 1
 
0.6%
0222990224 1
 
0.6%
4662090 1
 
0.6%
4618860 1
 
0.6%
4686856 1
 
0.6%
000222977361 1
 
0.6%
0222330326 1
 
0.6%
Other values (117) 117
67.6%
2024-05-04T00:16:20.321316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 343
27.1%
0 206
16.3%
9 126
 
9.9%
4 106
 
8.4%
6 95
 
7.5%
8 91
 
7.2%
5 73
 
5.8%
61
 
4.8%
3 60
 
4.7%
1 59
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1206
95.2%
Space Separator 61
 
4.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 343
28.4%
0 206
17.1%
9 126
 
10.4%
4 106
 
8.8%
6 95
 
7.9%
8 91
 
7.5%
5 73
 
6.1%
3 60
 
5.0%
1 59
 
4.9%
7 47
 
3.9%
Space Separator
ValueCountFrequency (%)
61
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1267
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 343
27.1%
0 206
16.3%
9 126
 
9.9%
4 106
 
8.4%
6 95
 
7.5%
8 91
 
7.2%
5 73
 
5.8%
61
 
4.8%
3 60
 
4.7%
1 59
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1267
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 343
27.1%
0 206
16.3%
9 126
 
9.9%
4 106
 
8.4%
6 95
 
7.5%
8 91
 
7.2%
5 73
 
5.8%
61
 
4.8%
3 60
 
4.7%
1 59
 
4.7%

Interactions

2024-05-04T00:15:58.105782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:15:56.231061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:15:57.178776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:15:58.383278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:15:56.504870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:15:57.451374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:15:58.664240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:15:56.864634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-05-04T00:15:57.761365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-04T00:16:20.650815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지정번호신청일자업태명영업장면적(㎡)행정동명
지정번호1.0000.8540.1430.0670.000
신청일자0.8541.0000.0000.4580.000
업태명0.1430.0001.0000.0000.000
영업장면적(㎡)0.0670.4580.0001.0000.000
행정동명0.0000.0000.0000.0001.000
2024-05-04T00:16:20.993271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
급수시설구분업태명행정동명
급수시설구분1.0001.0001.000
업태명1.0001.0000.000
행정동명1.0000.0001.000
2024-05-04T00:16:21.285843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지정번호신청일자영업장면적(㎡)업태명행정동명급수시설구분
지정번호1.0000.582-0.2440.0730.0001.000
신청일자0.5821.000-0.4690.0000.0001.000
영업장면적(㎡)-0.244-0.4691.0000.0000.0001.000
업태명0.0730.0000.0001.0000.0001.000
행정동명0.0000.0000.0000.0001.0001.000
급수시설구분1.0001.0001.0001.0001.0001.000

Missing values

2024-05-04T00:15:59.057028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-04T00:15:59.823568image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군구코드지정년도지정번호신청일자지정일자업소명소재지도로명소재지지번허가(신고)번호업태명주된음식영업장면적(㎡)행정동명급수시설구분소재지전화번호
030300002024282017102020240118귀한족발 왕십리점서울특별시 성동구 무학로2길 43, 1층 (도선동)서울특별시 성동구 도선동 144번지3030000-101-2014-00172한식족발58.2왕십리도선동<NA>0222942329
130300002024242017102020240118쭈식이 상회서울특별시 성동구 마조로7길 9-1, 1층 (행당동)서울특별시 성동구 행당동 3번지 10호3030000-101-2014-00221한식쭈꾸미볶음80.0사근동<NA>0222813228
2303000020241442020110120240118대촌정서울특별시 성동구 마장로23길 4, (홍익동)서울특별시 성동구 홍익동 112번지3030000-101-1985-04894한식닭백숙35.42왕십리도선동상수도전용0222950421
33030000202482018101720240118동보성서울특별시 성동구 한림말1길 15-2, (옥수동)서울특별시 성동구 옥수동 298번지3030000-101-1993-03737한식짜장면133.19옥수동상수도전용0222994520
43030000202412012101620240118이가갈비서울특별시 성동구 마조로 25, (행당동)서울특별시 성동구 행당동 6번지 2호3030000-101-1988-06036한식돼지갈비161.0사근동상수도전용0222988878
5303000020241422020110120240118신신원서울특별시 성동구 광나루로 290, (성수동2가)서울특별시 성동구 성수동2가 280번지 41호3030000-101-1981-03691중국식굴짬뽕89.95성수2가제1동상수도전용02 4998940
6303000020241292021110120240118청년감자탕순대국(왕십리역점)서울특별시 성동구 마조로5길 5, (행당동)서울특별시 성동구 행당동 19번지 48호3030000-101-1989-05282한식감자탕79.56사근동상수도전용0222942020
730300002024382015101620240118본토고기구이서울특별시 성동구 동일로 185, (송정동)서울특별시 성동구 송정동 79번지 5호 ,68-473030000-101-1985-05444한식삼겹살, 돼지양념구이49.94송정동상수도전용02 4669253
830300002024202019101620240118전자방서울특별시 성동구 성수일로12길 23, (성수동2가)서울특별시 성동구 성수동2가 299번지 157호3030000-101-1989-05608한식탕수육46.0성수2가제1동상수도전용02 4613898
9303000020241502020110120240118손박사삼겹살서울특별시 성동구 상원길 40, 상가동 106호,107호 (성수동1가, 뚝섬현대아파트)서울특별시 성동구 성수동1가 13번지 157호3030000-101-1992-02574호프/통닭삼겹살50.24성수1가제1동상수도전용02 4661929
시군구코드지정년도지정번호신청일자지정일자업소명소재지도로명소재지지번허가(신고)번호업태명주된음식영업장면적(㎡)행정동명급수시설구분소재지전화번호
13130300002024922015102120240118왕십리 오리사냥서울특별시 성동구 행당로17길 7, (행당동,(지상1층))서울특별시 성동구 행당동 317번지 31호 (지상1층)3030000-101-2003-00130한식오리구이150.94행당제1동상수도전용0222822838
13230300002024202007112320240118정은회관서울특별시 성동구 금호로 45, (금호동4가,정은타워 1층)서울특별시 성동구 금호동4가 188번지 정은타워 1층3030000-101-2004-00085한식꽃등심225.68금호4가동<NA>0222991992
13330300002024162017102020240118산수골추어탕감자탕서울특별시 성동구 왕십리로31가길 31, (하왕십리동)서울특별시 성동구 하왕십리동 946번지 39호3030000-101-1989-05050한식추어탕, 감자탕118.0왕십리제2동상수도전용02 22985111
134303000020244132023111520240118부산밀면서울특별시 성동구 연무장5가길 7, 성수역 현대테라스타워 1층 103호 (성수동2가)서울특별시 성동구 성수동2가 314번지 13호 성수역 현대테라스타워3030000-101-2021-00256한식물,비빔밀면46.38성수2가제1동<NA><NA>
135303000020241412020110120240118소설옥서울특별시 성동구 독서당로 296-11, (금호동4가)서울특별시 성동구 금호동4가 565번지 1호3030000-101-2000-06491한식돼지고기38.94금호4가동상수도전용0222994196
136303000020244182023111520240118어랑회집서울특별시 성동구 성수이로3길 19, (성수동2가, 492-2 지상1층)서울특별시 성동구 성수동2가 492번지 2호 (1층)3030000-101-2005-00046일식생선회44.73성수2가제1동상수도전용<NA>
13730300002024442016102720240118수현회집서울특별시 성동구 무학봉28길 12, 지상1층 (행당동, 286-17)서울특별시 성동구 행당동 286번지 17호 지상1층3030000-101-2012-00210일식생선회, 매운탕122.0행당제1동<NA>02 22959524
1383030000202422017102020240118골목냉면서울특별시 성동구 독서당로 295-7, (금호동3가)서울특별시 성동구 금호동3가 405번지 1호3030000-101-1980-06037기타냉면, 떡만두87.04금호2.3가동상수도전용02 22352540
139303000020244122022111520240118먹깨비식당서울특별시 성동구 금호산길 49-1, (금호동2가,외1필지 지상1층)서울특별시 성동구 금호동2가 500번지 17호 외1필지 지상1층3030000-101-2006-00173한식갈비, 삼겹살120.63금호2.3가동<NA>0222349888
140303000020241482020110120240118목금서울특별시 성동구 왕십리로2길 34, 1층 (성수동1가)서울특별시 성동구 성수동1가 656번지 1661호3030000-101-2020-00090경양식함박스테이크36.23성수1가제1동<NA><NA>