Overview

Dataset statistics

Number of variables: 5
Number of observations: 119
Missing cells: 11
Missing cells (%): 1.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.9 KiB
Average record size in memory: 42.1 B
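
The missing-cell percentage follows from the raw counts. A minimal sketch of the arithmetic, assuming the percentage is taken over every cell in the table (variables × observations); the variable names are illustrative and not taken from the profiler.

```python
# Counts copied from the dataset statistics.
n_variables = 5
n_observations = 119
missing_cells = 11

# Assumption: the denominator is the total number of cells in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(total_cells)            # 595
print(f"{missing_pct:.1f}%")  # 1.8%
```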

Variable types

Numeric: 1
Text: 4

Dataset

Description: Designation status of exemplary restaurants and hygiene-grade establishments in Seosan-si, Chungcheongnam-do, 2022
Author: Chungcheongnam-do
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=450&beforeMenuCd=DOM_000000201001001000&publicdatapk=3068030

Alerts

전화번호 (phone number) has 11 (9.2%) missing values [Missing]
연번 (serial number) has unique values [Unique]
업소명 (business name) has unique values [Unique]

Reproduction

Analysis started: 2024-01-09 22:35:11.651261
Analysis finished: 2024-01-09 22:35:12.376880
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
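
A report of this kind can be regenerated along the following lines. This is a sketch only: the CSV path, the output path, and the report title are hypothetical, and the settings stored in config.json are not reproduced here.

```python
def build_report(csv_path: str, out_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (both paths are hypothetical)."""
    # Imported inside the function so the module still loads
    # when pandas or ydata-profiling is not installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # The title is an illustrative choice, not the one used for this report.
    report = ProfileReport(df, title="Seosan-si designated restaurants, 2022")
    report.to_file(out_path)
```

Calling `build_report("restaurants.csv")` would write `report.html` next to the script, assuming both libraries are installed and the file exists.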

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 119
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 60
Minimum: 1
Maximum: 119
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.9
Q1: 30.5
Median: 60
Q3: 89.5
95-th percentile: 113.1
Maximum: 119
Range: 118
Interquartile range (IQR): 59
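
The fractional percentiles are what linear interpolation between neighbouring ranks gives for the sequence 1 to 119. The sketch below assumes that interpolation rule (it is the pandas default); the `quantile` helper is illustrative and not part of the profiler.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = (len(sorted_values) - 1) * q
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

values = list(range(1, 120))  # 연번 runs from 1 to 119

q05, q1, q3, q95 = (quantile(values, q) for q in (0.05, 0.25, 0.75, 0.95))
iqr = q3 - q1

print(round(q05, 1), q1, q3, round(q95, 1), iqr)  # 6.9 30.5 89.5 113.1 59.0
```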

Descriptive statistics

Standard deviation: 34.496377
Coefficient of variation (CV): 0.57493961
Kurtosis: -1.2
Mean: 60
Median Absolute Deviation (MAD): 30
Skewness: 0
Sum: 7140
Variance: 1190
Monotonicity: Strictly increasing
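
Because 연번 is simply the sequence 1 to 119, most of these figures can be checked with the standard library. The sketch assumes the sample (n − 1) variance, which is what matches the reported value of 1190; the variable names are illustrative.

```python
import statistics

values = list(range(1, 120))  # 연번 runs from 1 to 119

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, variance, sum(values), mad)  # 60 1190 7140 30
```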
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.8%
2 | 1 | 0.8%
89 | 1 | 0.8%
88 | 1 | 0.8%
87 | 1 | 0.8%
86 | 1 | 0.8%
85 | 1 | 0.8%
84 | 1 | 0.8%
83 | 1 | 0.8%
82 | 1 | 0.8%
Other values (109) | 109 | 91.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.8%
2 | 1 | 0.8%
3 | 1 | 0.8%
4 | 1 | 0.8%
5 | 1 | 0.8%
6 | 1 | 0.8%
7 | 1 | 0.8%
8 | 1 | 0.8%
9 | 1 | 0.8%
10 | 1 | 0.8%

Maximum 10 values

Value | Count | Frequency (%)
119 | 1 | 0.8%
118 | 1 | 0.8%
117 | 1 | 0.8%
116 | 1 | 0.8%
115 | 1 | 0.8%
114 | 1 | 0.8%
113 | 1 | 0.8%
112 | 1 | 0.8%
111 | 1 | 0.8%
110 | 1 | 0.8%

업소명 (business name)
Text

UNIQUE 

Distinct: 119
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 25
Median length: 17
Mean length: 9.6218487
Min length: 1

Characters and Unicode

Total characters: 1145
Distinct characters: 248
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
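
As an illustration of those character properties, the sketch below counts the Unicode general categories in the first sample value of this column using Python's `unicodedata` module. The `CATEGORY_NAMES` mapping and the `category_counts` helper are illustrative names, and this is not necessarily how the profiler computes its figures.

```python
import unicodedata
from collections import Counter

# Readable names for the general categories reported for this column.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Po": "Other Punctuation",
}

def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category."""
    return Counter(
        CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
        for ch in text
    )

counts = category_counts("(주)태경비케이 서산(상)휴게소")
print(counts["Other Letter"], counts["Open Punctuation"],
      counts["Close Punctuation"], counts["Space Separator"])  # 12 2 2 1
```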

Unique

Unique: 119
Unique (%): 100.0%

Sample

1st row: (주)태경비케이 서산(상)휴게소
2nd row: (주)태경비케이 서산(하)휴게소
3rd row: 가야면옥
4th row: 갈비본가 두툼한숯불갈비
5th row: 강미루

Value | Count | Frequency (%)
주)태경비케이 | 11 | 5.8%
서산(하)휴게소 | 7 | 3.7%
서산테크노밸리점 | 6 | 3.2%
서산예천점 | 5 | 2.6%
파리바게뜨 | 5 | 2.6%
서산(상)휴게소 | 4 | 2.1%
배스킨라빈스 | 3 | 1.6%
서산호수공원점 | 3 | 1.6%
투썸플레이스 | 3 | 1.6%
본도시락 | 2 | 1.1%
Other values (127) | 141 | 74.2%

Most occurring characters

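Value | Count | Frequency (%)
(space) | 71 | 6.2%
(not captured) | 56 | 4.9%
(not captured) | 53 | 4.6%
(not captured) | 50 | 4.4%
( | 36 | 3.1%
) | 36 | 3.1%
(not captured) | 28 | 2.4%
(not captured) | 24 | 2.1%
(not captured) | 24 | 2.1%
(not captured) | 21 | 1.8%
Other values (238) | 746 | 65.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 972 | 84.9%
Space Separator | 71 | 6.2%
Open Punctuation | 36 | 3.1%
Close Punctuation | 36 | 3.1%
Uppercase Letter | 22 | 1.9%
Other Punctuation | 4 | 0.3%
Decimal Number | 4 | 0.3%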

Most frequent character per category

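Other Letter
Value | Count | Frequency (%)
(not captured) | 56 | 5.8%
(not captured) | 53 | 5.5%
(not captured) | 50 | 5.1%
(not captured) | 28 | 2.9%
(not captured) | 24 | 2.5%
(not captured) | 24 | 2.5%
(not captured) | 21 | 2.2%
(not captured) | 20 | 2.1%
(not captured) | 19 | 2.0%
(not captured) | 18 | 1.9%
Other values (220) | 659 | 67.8%

Uppercase Letter
Value | Count | Frequency (%)
C | 6 | 27.3%
H | 4 | 18.2%
B | 4 | 18.2%
N | 2 | 9.1%
D | 1 | 4.5%
U | 1 | 4.5%
S | 1 | 4.5%
G | 1 | 4.5%
I | 1 | 4.5%
K | 1 | 4.5%

Other Punctuation
Value | Count | Frequency (%)
. | 2 | 50.0%
· | 1 | 25.0%
& | 1 | 25.0%

Decimal Number
Value | Count | Frequency (%)
0 | 2 | 50.0%
6 | 2 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 71 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 36 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 36 | 100.0%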

Most occurring scripts

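Value | Count | Frequency (%)
Hangul | 972 | 84.9%
Common | 151 | 13.2%
Latin | 22 | 1.9%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not captured) | 56 | 5.8%
(not captured) | 53 | 5.5%
(not captured) | 50 | 5.1%
(not captured) | 28 | 2.9%
(not captured) | 24 | 2.5%
(not captured) | 24 | 2.5%
(not captured) | 21 | 2.2%
(not captured) | 20 | 2.1%
(not captured) | 19 | 2.0%
(not captured) | 18 | 1.9%
Other values (220) | 659 | 67.8%

Latin
Value | Count | Frequency (%)
C | 6 | 27.3%
H | 4 | 18.2%
B | 4 | 18.2%
N | 2 | 9.1%
D | 1 | 4.5%
U | 1 | 4.5%
S | 1 | 4.5%
G | 1 | 4.5%
I | 1 | 4.5%
K | 1 | 4.5%

Common
Value | Count | Frequency (%)
(space) | 71 | 47.0%
( | 36 | 23.8%
) | 36 | 23.8%
. | 2 | 1.3%
0 | 2 | 1.3%
6 | 2 | 1.3%
· | 1 | 0.7%
& | 1 | 0.7%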

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 972 | 84.9%
ASCII | 172 | 15.0%
None | 1 | 0.1%
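
A block is a contiguous range of code points, so a coarse lookup needs only range checks. The sketch below covers just the two ranges relevant to this column and labels everything else "None". That fallback is an assumption about what the report's "None" row means, and the `block_of` helper is illustrative rather than the profiler's implementation.

```python
from collections import Counter

def block_of(ch: str) -> str:
    """Coarse block lookup covering only the two ranges needed here."""
    cp = ord(ch)
    if cp < 0x80:
        return "ASCII"
    if 0xAC00 <= cp <= 0xD7A3:  # Hangul Syllables
        return "Hangul"
    return "None"  # assumption: any unmapped code point falls through to "None"

blocks = Counter(block_of(ch) for ch in "(주)태경비케이 서산(상)휴게소")
print(blocks["Hangul"], blocks["ASCII"])  # 12 5
print(block_of("\u00b7"))                 # None (a code point outside both ranges)
```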

Most frequent character per block

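ASCII
Value | Count | Frequency (%)
(space) | 71 | 41.3%
( | 36 | 20.9%
) | 36 | 20.9%
C | 6 | 3.5%
H | 4 | 2.3%
B | 4 | 2.3%
N | 2 | 1.2%
. | 2 | 1.2%
0 | 2 | 1.2%
6 | 2 | 1.2%
Other values (7) | 7 | 4.1%

Hangul
Value | Count | Frequency (%)
(not captured) | 56 | 5.8%
(not captured) | 53 | 5.5%
(not captured) | 50 | 5.1%
(not captured) | 28 | 2.9%
(not captured) | 24 | 2.5%
(not captured) | 24 | 2.5%
(not captured) | 21 | 2.2%
(not captured) | 20 | 2.1%
(not captured) | 19 | 2.0%
(not captured) | 18 | 1.9%
Other values (220) | 659 | 67.8%

None
Value | Count | Frequency (%)
· | 1 | 100.0%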

영업자성명 (business operator name)
Text

Distinct: 106
Distinct (%): 89.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 8
Median length: 3
Mean length: 3.7394958
Min length: 3

Characters and Unicode

Total characters: 445
Distinct characters: 111
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 102
Unique (%): 85.7%

Sample

1st row: 김민정 외 1명
2nd row: 김민정 외 1명
3rd row: 이영철
4th row: 안진구 외 1명
5th row: 이강민 외 1명

Value | Count | Frequency (%)
1명 | 16 | 10.6%
외 | 16 | 10.6%
김민정 | 11 | 7.3%
이화순 | 2 | 1.3%
유태준 | 2 | 1.3%
송데이비드호섭 | 2 | 1.3%
최영준 | 1 | 0.7%
박미숙 | 1 | 0.7%
김강필 | 1 | 0.7%
김진영 | 1 | 0.7%
Other values (98) | 98 | 64.9%

Most occurring characters

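Value | Count | Frequency (%)
(space) | 32 | 7.2%
(not captured) | 29 | 6.5%
(not captured) | 22 | 4.9%
(not captured) | 20 | 4.5%
(not captured) | 20 | 4.5%
(not captured) | 16 | 3.6%
1 | 16 | 3.6%
(not captured) | 15 | 3.4%
(not captured) | 12 | 2.7%
(not captured) | 10 | 2.2%
Other values (101) | 253 | 56.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 397 | 89.2%
Space Separator | 32 | 7.2%
Decimal Number | 16 | 3.6%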

Most frequent character per category

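Other Letter
Value | Count | Frequency (%)
(not captured) | 29 | 7.3%
(not captured) | 22 | 5.5%
(not captured) | 20 | 5.0%
(not captured) | 20 | 5.0%
(not captured) | 16 | 4.0%
(not captured) | 15 | 3.8%
(not captured) | 12 | 3.0%
(not captured) | 10 | 2.5%
(not captured) | 9 | 2.3%
(not captured) | 9 | 2.3%
Other values (99) | 235 | 59.2%

Space Separator
Value | Count | Frequency (%)
(space) | 32 | 100.0%

Decimal Number
Value | Count | Frequency (%)
1 | 16 | 100.0%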

Most occurring scripts

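Value | Count | Frequency (%)
Hangul | 397 | 89.2%
Common | 48 | 10.8%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not captured) | 29 | 7.3%
(not captured) | 22 | 5.5%
(not captured) | 20 | 5.0%
(not captured) | 20 | 5.0%
(not captured) | 16 | 4.0%
(not captured) | 15 | 3.8%
(not captured) | 12 | 3.0%
(not captured) | 10 | 2.5%
(not captured) | 9 | 2.3%
(not captured) | 9 | 2.3%
Other values (99) | 235 | 59.2%

Common
Value | Count | Frequency (%)
(space) | 32 | 66.7%
1 | 16 | 33.3%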

Most occurring blocks

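Value | Count | Frequency (%)
Hangul | 397 | 89.2%
ASCII | 48 | 10.8%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 32 | 66.7%
1 | 16 | 33.3%

Hangul
Value | Count | Frequency (%)
(not captured) | 29 | 7.3%
(not captured) | 22 | 5.5%
(not captured) | 20 | 5.0%
(not captured) | 20 | 5.0%
(not captured) | 16 | 4.0%
(not captured) | 15 | 3.8%
(not captured) | 12 | 3.0%
(not captured) | 10 | 2.5%
(not captured) | 9 | 2.3%
(not captured) | 9 | 2.3%
Other values (99) | 235 | 59.2%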

소재지(도로명) (location, road-name address)
Text

Distinct: 113
Distinct (%): 95.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 42
Median length: 28
Mean length: 18.571429
Min length: 11

Characters and Unicode

Total characters: 2210
Distinct characters: 133
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 110
Unique (%): 92.4%

Sample

1st row: 해미면 서해안고속도로 242
2nd row: 해미면 서해안고속도로 241
3rd row: 명륜1로 80 (읍내동,(2,3층))
4th row: 율지19길 74 (동문동)
5th row: 양유정1로 3 (읍내동,,124-38,124-39(1,2층))

Value | Count | Frequency (%)
1층 | 55 | 12.3%
동문동 | 26 | 5.8%
해미면 | 17 | 3.8%
예천동 | 14 | 3.1%
서해안고속도로 | 11 | 2.5%
성연면 | 11 | 2.5%
대산읍 | 11 | 2.5%
읍내동 | 10 | 2.2%
1~2층 | 8 | 1.8%
241 | 7 | 1.6%
Other values (185) | 277 | 62.0%

Most occurring characters

Value | Count | Frequency (%)
(space) | 447 | 20.2%
1 | 173 | 7.8%
(not captured) | 112 | 5.1%
(not captured) | 110 | 5.0%
) | 90 | 4.1%
( | 90 | 4.1%
, | 85 | 3.8%
(not captured) | 79 | 3.6%
2 | 76 | 3.4%
3 | 51 | 2.3%
Other values (123) | 897 | 40.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 965 | 43.7%
Decimal Number | 504 | 22.8%
Space Separator | 447 | 20.2%
Close Punctuation | 90 | 4.1%
Open Punctuation | 90 | 4.1%
Other Punctuation | 85 | 3.8%
Dash Punctuation | 18 | 0.8%
Math Symbol | 9 | 0.4%
Uppercase Letter | 2 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not captured) | 112 | 11.6%
(not captured) | 110 | 11.4%
(not captured) | 79 | 8.2%
(not captured) | 37 | 3.8%
(not captured) | 36 | 3.7%
(not captured) | 29 | 3.0%
(not captured) | 28 | 2.9%
(not captured) | 24 | 2.5%
(not captured) | 24 | 2.5%
(not captured) | 23 | 2.4%
Other values (105) | 463 | 48.0%

Decimal Number
Value | Count | Frequency (%)
1 | 173 | 34.3%
2 | 76 | 15.1%
3 | 51 | 10.1%
4 | 49 | 9.7%
7 | 33 | 6.5%
5 | 29 | 5.8%
6 | 29 | 5.8%
0 | 26 | 5.2%
9 | 22 | 4.4%
8 | 16 | 3.2%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 50.0%
B | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 447 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 90 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 90 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 85 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 18 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 9 | 100.0%

Most occurring scripts

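Value | Count | Frequency (%)
Common | 1243 | 56.2%
Hangul | 965 | 43.7%
Latin | 2 | 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not captured) | 112 | 11.6%
(not captured) | 110 | 11.4%
(not captured) | 79 | 8.2%
(not captured) | 37 | 3.8%
(not captured) | 36 | 3.7%
(not captured) | 29 | 3.0%
(not captured) | 28 | 2.9%
(not captured) | 24 | 2.5%
(not captured) | 24 | 2.5%
(not captured) | 23 | 2.4%
Other values (105) | 463 | 48.0%

Common
Value | Count | Frequency (%)
(space) | 447 | 36.0%
1 | 173 | 13.9%
) | 90 | 7.2%
( | 90 | 7.2%
, | 85 | 6.8%
2 | 76 | 6.1%
3 | 51 | 4.1%
4 | 49 | 3.9%
7 | 33 | 2.7%
5 | 29 | 2.3%
Other values (6) | 120 | 9.7%

Latin
Value | Count | Frequency (%)
A | 1 | 50.0%
B | 1 | 50.0%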

Most occurring blocks

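Value | Count | Frequency (%)
ASCII | 1245 | 56.3%
Hangul | 965 | 43.7%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 447 | 35.9%
1 | 173 | 13.9%
) | 90 | 7.2%
( | 90 | 7.2%
, | 85 | 6.8%
2 | 76 | 6.1%
3 | 51 | 4.1%
4 | 49 | 3.9%
7 | 33 | 2.7%
5 | 29 | 2.3%
Other values (8) | 122 | 9.8%

Hangul
Value | Count | Frequency (%)
(not captured) | 112 | 11.6%
(not captured) | 110 | 11.4%
(not captured) | 79 | 8.2%
(not captured) | 37 | 3.8%
(not captured) | 36 | 3.7%
(not captured) | 29 | 3.0%
(not captured) | 28 | 2.9%
(not captured) | 24 | 2.5%
(not captured) | 24 | 2.5%
(not captured) | 23 | 2.4%
Other values (105) | 463 | 48.0%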

전화번호 (phone number)
Text

MISSING 

Distinct: 103
Distinct (%): 95.4%
Missing: 11
Missing (%): 9.2%
Memory size: 1.1 KiB
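
The two percentages use different denominators. The sketch below assumes that the missing share is taken over all rows and the distinct share over non-missing rows only, which is the reading that reproduces both reported figures; the variable names are illustrative.

```python
n_observations = 119
n_missing = 11
n_distinct = 103

n_present = n_observations - n_missing          # 108 non-missing phone numbers
missing_pct = 100 * n_missing / n_observations  # share of all rows
distinct_pct = 100 * n_distinct / n_present     # share of non-missing rows only

print(f"{missing_pct:.1f}% {distinct_pct:.1f}%")  # 9.2% 95.4%
```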

Length

Max length: 14
Median length: 14
Mean length: 13.694444
Min length: 13

Characters and Unicode

Total characters: 1479
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 101
Unique (%): 93.5%

Sample

1st row: 041- 688-7714
2nd row: 041- 688-8814
3rd row: 041 -668 -8123
4th row: 041 -668 -4420
5th row: 041 -669 -4938
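
The sample rows use two spacing styles, which accounts for the 13- and 14-character lengths reported for this column. A minimal sketch of one way to normalise them before comparing values; the `normalize_phone` helper is hypothetical and not part of the report or the profiler.

```python
import re

def normalize_phone(raw: str) -> str:
    """Remove all whitespace so that both spacing styles compare equal."""
    return re.sub(r"\s+", "", raw)

samples = ["041- 688-7714", "041 -668 -8123"]
normalized = [normalize_phone(s) for s in samples]

print([len(s) for s in samples])  # [13, 14]
print(normalized)                 # ['041-688-7714', '041-668-8123']
```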

Value | Count | Frequency (%)
041 | 99 | 34.5%
668 | 10 | 3.5%
688 | 10 | 3.5%
665 | 9 | 3.1%
664 | 8 | 2.8%
666 | 7 | 2.4%
669 | 7 | 2.4%
681 | 6 | 2.1%
667 | 6 | 2.1%
8814 | 5 | 1.7%
Other values (107) | 120 | 41.8%

Most occurring characters

Value | Count | Frequency (%)
6 | 227 | 15.3%
- | 216 | 14.6%
(space) | 181 | 12.2%
0 | 169 | 11.4%
1 | 153 | 10.3%
4 | 151 | 10.2%
8 | 125 | 8.5%
2 | 57 | 3.9%
5 | 53 | 3.6%
3 | 50 | 3.4%
Other values (2) | 97 | 6.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1082 | 73.2%
Dash Punctuation | 216 | 14.6%
Space Separator | 181 | 12.2%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
6 | 227 | 21.0%
0 | 169 | 15.6%
1 | 153 | 14.1%
4 | 151 | 14.0%
8 | 125 | 11.6%
2 | 57 | 5.3%
5 | 53 | 4.9%
3 | 50 | 4.6%
7 | 49 | 4.5%
9 | 48 | 4.4%

Dash Punctuation
Value | Count | Frequency (%)
- | 216 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 181 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1479 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
6 | 227 | 15.3%
- | 216 | 14.6%
(space) | 181 | 12.2%
0 | 169 | 11.4%
1 | 153 | 10.3%
4 | 151 | 10.2%
8 | 125 | 8.5%
2 | 57 | 3.9%
5 | 53 | 3.6%
3 | 50 | 3.4%
Other values (2) | 97 | 6.6%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1479 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
6 | 227 | 15.3%
- | 216 | 14.6%
(space) | 181 | 12.2%
0 | 169 | 11.4%
1 | 153 | 10.3%
4 | 151 | 10.2%
8 | 125 | 8.5%
2 | 57 | 3.9%
5 | 53 | 3.6%
3 | 50 | 3.4%
Other values (2) | 97 | 6.6%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
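
Both views can be approximated in plain text. The sketch below takes four rows from the end of the dataset (연번 110 to 113), reduced to two columns, with `None` standing in for `<NA>`; the text rendering is an illustrative stand-in for the plots, not how the profiler draws them.

```python
# Four rows from the end of the dataset; None marks a missing value (<NA>).
rows = [
    {"연번": 110, "전화번호": None},
    {"연번": 111, "전화번호": "041- 663-3827"},
    {"연번": 112, "전화번호": None},
    {"연번": 113, "전화번호": "041- 688-2369"},
]
columns = ["연번", "전화번호"]

# Nullity by column: number of present values in each column.
present = {col: sum(row[col] is not None for row in rows) for col in columns}

# Nullity matrix: one line per row, '#' for a present value, '.' for a missing one.
matrix = [
    "".join("#" if row[col] is not None else "." for col in columns)
    for row in rows
]

print(present)  # {'연번': 4, '전화번호': 2}
print(matrix)   # ['#.', '##', '#.', '##']
```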

Sample

First rows

Index | 연번 | 업소명 | 영업자성명 | 소재지(도로명) | 전화번호
0 | 1 | (주)태경비케이 서산(상)휴게소 | 김민정 외 1명 | 해미면 서해안고속도로 242 | 041- 688-7714
1 | 2 | (주)태경비케이 서산(하)휴게소 | 김민정 외 1명 | 해미면 서해안고속도로 241 | 041- 688-8814
2 | 3 | 가야면옥 | 이영철 | 명륜1로 80 (읍내동,(2,3층)) | 041 -668 -8123
3 | 4 | 갈비본가 두툼한숯불갈비 | 안진구 외 1명 | 율지19길 74 (동문동) | 041 -668 -4420
4 | 5 | 강미루 | 이강민 외 1명 | 양유정1로 3 (읍내동,,124-38,124-39(1,2층)) | 041 -669 -4938
5 | 6 | 거북이 횟집 | 유병일 | 율지6로 45 (동문동,1층) | 041 -668 -6116
6 | 7 | 고삐 | 김영철 | 쌍연남2로 14 (동문동) | 041 -664 -9253
7 | 8 | 구도횟집 | 서경자 | 팔봉면 팔봉1로 748 | 041- 662-6117
8 | 9 | (not captured) | 김연수 | 대지제5길 7 (석남동) | 041 -668 -0205
9 | 10 | 늘푸른쌈밥 | 이인숙 | 문화로 61 (읍내동,(1층)) | 041 -667 -9289

Last rows

Index | 연번 | 업소명 | 영업자성명 | 소재지(도로명) | 전화번호
109 | 110 | 던킨도너츠 서산동문점 | 구향미 | 고운로 251, 1층 (동문동) | <NA>
110 | 111 | 이디야 서산테크노밸리점 | 전유희 | 성연면 성연1로 21, 1층 103호 | 041- 663-3827
111 | 112 | (주)지에스리테일 GS더프레시 서산석남점 | 허연수 | 서해로 3359 (석남동) | <NA>
112 | 113 | 빠리바게트 | 이미정 | 해미면 남문4로 4 | 041- 688-2369
113 | 114 | 파리바게뜨 서평점 | 이형규 | 석림3로 2, 1층 (석림동) | 041 -663 -8233
114 | 115 | 파리바게뜨 부춘점 | 이영미 | 부춘1로 44 (읍내동,(1층)) | 041 -681 -8240
115 | 116 | 파리바게뜨 예천점 | 이복남 외 1명 | 호수공원10로 3, 1층 (예천동) | 041- 666-8240
116 | 117 | 파리바게뜨 동문점 | 배길남 | 서령로 100, 1층 (동문동) | 041- 664-0440
117 | 118 | 뚜레쥬르(코아루점) | 고영례 | 고운로 275-6, 1층 (동문동) | 041 -669 -3535
118 | 119 | 뚜레쥬르테크노밸리점 | 박흥조 | 성연면 성연3로 35, 1층 | 041 -664 -0046