Overview

Dataset statistics

Number of variables: 4
Number of observations: 468
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 15.2 KiB
Average record size in memory: 33.3 B

Variable types

Numeric: 1
Text: 3

Dataset

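Description: Information on general restaurants that can be visited in Cheongyang-gun, Chungcheongnam-do (business name, location, and location phone number), provided in Korean alphabetical (ganada) order.
URL: https://www.data.go.kr/data/15118805/fileData.do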

Alerts

연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 22:03:12.452611
Analysis finished: 2023-12-12 22:03:12.912348
Duration: 0.46 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 468
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 234.5
Minimum: 1
Maximum: 468
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 24.35
Q1: 117.75
median: 234.5
Q3: 351.25
95-th percentile: 444.65
Maximum: 468
Range: 467
Interquartile range (IQR): 233.5
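
The quantile figures above are consistent with linear interpolation between order statistics, applied to the integers 1 through 468. The following is a minimal sketch of that calculation in plain Python. The helper name `percentile` is illustrative, and the interpolation rule is inferred from the reported numbers rather than stated in the report.

```python
import math


def percentile(sorted_values, q):
    """Return the q-quantile (0 <= q <= 1) using linear interpolation."""
    position = q * (len(sorted_values) - 1)  # fractional index into the data
    lower = math.floor(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# Assumption: 연번 holds exactly the integers 1..468.
values = list(range(1, 469))

q1 = percentile(values, 0.25)
median = percentile(values, 0.50)
q3 = percentile(values, 0.75)
iqr = q3 - q1

print(q1, median, q3, iqr)  # 117.75 234.5 351.25 233.5
```

The same helper gives 24.35 and 444.65 for the 5-th and 95-th percentiles, up to floating-point rounding.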

Descriptive statistics

Standard deviation: 135.24422
Coefficient of variation (CV): 0.57673443
Kurtosis: -1.2
Mean: 234.5
Median Absolute Deviation (MAD): 117
Skewness: 0
Sum: 109746
Variance: 18291
Monotonicity: Strictly increasing
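
These figures can be cross-checked by hand, assuming 연번 holds exactly the integers 1 through 468 (which agrees with the reported minimum, maximum, distinct count, and sum). The sketch below uses the sample variance (dividing by n - 1); that choice is an inference from the reported value of 18291, not something the report states.

```python
import math

# Assumption: 연번 holds exactly the integers 1..468.
values = range(1, 469)
n = len(values)

total = sum(values)
mean = total / n
# Sample variance: squared deviations divided by n - 1.
variance = sum((v - mean) ** 2 for v in values) / (n - 1)
std_dev = math.sqrt(variance)
cv = std_dev / mean  # coefficient of variation

print(total, mean, variance)  # 109746 234.5 18291.0
```

The standard deviation and coefficient of variation computed this way agree with the reported 135.24422 and 0.57673443 to the printed precision.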
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
1                   1      0.2%
310                 1      0.2%
322                 1      0.2%
321                 1      0.2%
320                 1      0.2%
319                 1      0.2%
318                 1      0.2%
317                 1      0.2%
316                 1      0.2%
315                 1      0.2%
Other values (458)  458    97.9%

Value  Count  Frequency (%)
1      1      0.2%
2      1      0.2%
3      1      0.2%
4      1      0.2%
5      1      0.2%
6      1      0.2%
7      1      0.2%
8      1      0.2%
9      1      0.2%
10     1      0.2%

Value  Count  Frequency (%)
468    1      0.2%
467    1      0.2%
466    1      0.2%
465    1      0.2%
464    1      0.2%
463    1      0.2%
462    1      0.2%
461    1      0.2%
460    1      0.2%
459    1      0.2%

업소명
Text

Distinct: 467
Distinct (%): 99.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 27
Median length: 20
Mean length: 5.5576923
Min length: 2

Characters and Unicode

Total characters: 2601
Distinct characters: 464
Distinct categories: 8
Distinct scripts: 4
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
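
As a rough illustration of how such per-category counts can be derived, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module, using the five sample rows shown for this variable. The function name `category_counts` is illustrative, and this is not necessarily how the profiling tool computes its figures. The two-letter codes map to the names used in this report: `Lo` is Other Letter, `Lu` Uppercase Letter, `Ll` Lowercase Letter, `Ps` Open Punctuation, `Pe` Close Punctuation, and `Po` Other Punctuation.

```python
import unicodedata
from collections import Counter


def category_counts(strings):
    """Count characters by Unicode general category (e.g. 'Lo', 'Lu')."""
    counts = Counter()
    for text in strings:
        for char in text:
            counts[unicodedata.category(char)] += 1
    return counts


# The five sample rows listed for this variable.
sample = ["(주)칠갑산관광농원", "BHC", "bhc치킨청양정산점", "JJ식당", "LA김밥&카페"]

for category, count in category_counts(sample).most_common():
    print(category, count)
```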

Unique

Unique: 466
Unique (%): 99.6%
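
With 468 rows, 467 distinct values and 466 unique values are what one would see if exactly one name appears twice. The sketch below shows the two counts side by side, assuming that "Distinct" means the number of different values and "Unique" means the number of values that occur exactly once. The list is toy data for illustration, not an excerpt from the dataset.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for occurrences in counts.values() if occurrences == 1)
    return distinct, unique


# Toy data for illustration only: "가" appears twice, the rest once.
names = ["가", "나", "가", "다"]
print(distinct_and_unique(names))  # (3, 2)
```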

Sample

1st row: (주)칠갑산관광농원
2nd row: BHC
3rd row: bhc치킨청양정산점
4th row: JJ식당
5th row: LA김밥&카페

Value               Count  Frequency (%)
청양점              6      1.1%
식당                5      0.9%
칠갑산              5      0.9%
칼국수              3      0.5%
구기자              3      0.5%
밥상                3      0.5%
까치내              2      0.4%
구내식당            2      0.4%
치킨                2      0.4%
호프                2      0.4%
Other values (523)  524    94.1%

Most occurring characters

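Value               Count  Frequency (%)
(not shown)         101    3.9%
(not shown)         91     3.5%
(space)             89     3.4%
(not shown)         51     2.0%
(not shown)         44     1.7%
(not shown)         43     1.7%
(not shown)         39     1.5%
(not shown)         36     1.4%
(not shown)         35     1.3%
(not shown)         32     1.2%
Other values (454)  2040   78.4%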

Most occurring categories

Value              Count  Frequency (%)
Other Letter       2358   90.7%
Space Separator    89     3.4%
Lowercase Letter   65     2.5%
Uppercase Letter   36     1.4%
Close Punctuation  19     0.7%
Open Punctuation   19     0.7%
Other Punctuation  10     0.4%
Decimal Number     5      0.2%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
(not shown)         101    4.3%
(not shown)         91     3.9%
(not shown)         51     2.2%
(not shown)         44     1.9%
(not shown)         43     1.8%
(not shown)         39     1.7%
(not shown)         36     1.5%
(not shown)         35     1.5%
(not shown)         32     1.4%
(not shown)         30     1.3%
Other values (406)  1856   78.7%

Lowercase Letter
Value              Count  Frequency (%)
a                  12     18.5%
n                  10     15.4%
o                  6      9.2%
e                  5      7.7%
i                  4      6.2%
f                  4      6.2%
h                  4      6.2%
l                  3      4.6%
z                  3      4.6%
w                  2      3.1%
Other values (10)  12     18.5%

Uppercase Letter
Value             Count  Frequency (%)
J                 5      13.9%
C                 5      13.9%
B                 3      8.3%
H                 3      8.3%
A                 3      8.3%
E                 2      5.6%
L                 2      5.6%
P                 2      5.6%
T                 1      2.8%
S                 1      2.8%
Other values (9)  9      25.0%

Decimal Number
Value  Count  Frequency (%)
5      2      40.0%
1      1      20.0%
4      1      20.0%
2      1      20.0%

Other Punctuation
Value  Count  Frequency (%)
&      8      80.0%
·      2      20.0%

Space Separator
Value    Count  Frequency (%)
(space)  89     100.0%

Close Punctuation
Value  Count  Frequency (%)
)      19     100.0%

Open Punctuation
Value  Count  Frequency (%)
(      19     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  2354   90.5%
Common  142    5.5%
Latin   101    3.9%
Han     4      0.2%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
(not shown)         101    4.3%
(not shown)         91     3.9%
(not shown)         51     2.2%
(not shown)         44     1.9%
(not shown)         43     1.8%
(not shown)         39     1.7%
(not shown)         36     1.5%
(not shown)         35     1.5%
(not shown)         32     1.4%
(not shown)         30     1.3%
Other values (402)  1852   78.7%

Latin
Value              Count  Frequency (%)
a                  12     11.9%
n                  10     9.9%
o                  6      5.9%
J                  5      5.0%
C                  5      5.0%
e                  5      5.0%
i                  4      4.0%
f                  4      4.0%
h                  4      4.0%
B                  3      3.0%
Other values (29)  43     42.6%

Common
Value    Count  Frequency (%)
(space)  89     62.7%
)        19     13.4%
(        19     13.4%
&        8      5.6%
5        2      1.4%
·        2      1.4%
1        1      0.7%
4        1      0.7%
2        1      0.7%

Han
Value        Count  Frequency (%)
(not shown)  1      25.0%
(not shown)  1      25.0%
(not shown)  1      25.0%
(not shown)  1      25.0%
25.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  2354   90.5%
ASCII   241    9.3%
CJK     4      0.2%
None    2      0.1%

Most frequent character per block

Hangul
Value               Count  Frequency (%)
(not shown)         101    4.3%
(not shown)         91     3.9%
(not shown)         51     2.2%
(not shown)         44     1.9%
(not shown)         43     1.8%
(not shown)         39     1.7%
(not shown)         36     1.5%
(not shown)         35     1.5%
(not shown)         32     1.4%
(not shown)         30     1.3%
Other values (402)  1852   78.7%

ASCII
Value              Count  Frequency (%)
(space)            89     36.9%
)                  19     7.9%
(                  19     7.9%
a                  12     5.0%
n                  10     4.1%
&                  8      3.3%
o                  6      2.5%
J                  5      2.1%
C                  5      2.1%
e                  5      2.1%
Other values (37)  63     26.1%

None
Value  Count  Frequency (%)
·      2      100.0%

CJK
Value        Count  Frequency (%)
(not shown)  1      25.0%
(not shown)  1      25.0%
(not shown)  1      25.0%
(not shown)  1      25.0%

소재지(도로명)
Text

Distinct: 404
Distinct (%): 86.3%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 44
Median length: 42
Mean length: 22.942308
Min length: 18

Characters and Unicode

Total characters: 10737
Distinct characters: 182
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 361
Unique (%): 77.1%

Sample

1st row: 충청남도 청양군 화성면 구숫골길 105-19
2nd row: 충청남도 청양군 청양읍 중앙로 84
3rd row: 충청남도 청양군 정산면 효자길 33
4th row: 충청남도 청양군 정산면 정현길 54, B동 16호
5th row: 충청남도 청양군 정산면 서정2길 6

Value               Count  Frequency (%)
충청남도            468    18.7%
청양군              468    18.7%
청양읍              250    10.0%
정산면              83     3.3%
1층                 52     2.1%
중앙로              45     1.8%
칠갑산로            44     1.8%
정현길              32     1.3%
대치면              29     1.2%
칠갑산로4길         27     1.1%
Other values (426)  1002   40.1%

Most occurring characters

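Value               Count  Frequency (%)
(space)             2032   18.9%
(not shown)         1224   11.4%
(not shown)         750    7.0%
(not shown)         496    4.6%
(not shown)         495    4.6%
(not shown)         471    4.4%
(not shown)         469    4.4%
1                   458    4.3%
(not shown)         330    3.1%
(not shown)         286    2.7%
Other values (172)  3726   34.7%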

Most occurring categories

Value              Count  Frequency (%)
Other Letter       6806   63.4%
Space Separator    2032   18.9%
Decimal Number     1617   15.1%
Dash Punctuation   153    1.4%
Other Punctuation  99     0.9%
Uppercase Letter   13     0.1%
Open Punctuation   8      0.1%
Close Punctuation  8      0.1%
Lowercase Letter   1      < 0.1%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
(not shown)         1224   18.0%
(not shown)         750    11.0%
(not shown)         496    7.3%
(not shown)         495    7.3%
(not shown)         471    6.9%
(not shown)         469    6.9%
(not shown)         330    4.8%
(not shown)         286    4.2%
(not shown)         250    3.7%
(not shown)         218    3.2%
Other values (149)  1817   26.7%

Decimal Number
Value  Count  Frequency (%)
1      458    28.3%
2      197    12.2%
3      172    10.6%
4      139    8.6%
5      129    8.0%
7      123    7.6%
6      122    7.5%
0      105    6.5%
9      88     5.4%
8      84     5.2%

Uppercase Letter
Value  Count  Frequency (%)
B      5      38.5%
A      2      15.4%
D      2      15.4%
C      2      15.4%
E      2      15.4%

Other Punctuation
Value  Count  Frequency (%)
,      95     96.0%
&      3      3.0%
.      1      1.0%

Space Separator
Value    Count  Frequency (%)
(space)  2032   100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      153    100.0%

Open Punctuation
Value  Count  Frequency (%)
(      8      100.0%

Close Punctuation
Value  Count  Frequency (%)
)      8      100.0%

Lowercase Letter
Value  Count  Frequency (%)
c      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  6806   63.4%
Common  3917   36.5%
Latin   14     0.1%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
(not shown)         1224   18.0%
(not shown)         750    11.0%
(not shown)         496    7.3%
(not shown)         495    7.3%
(not shown)         471    6.9%
(not shown)         469    6.9%
(not shown)         330    4.8%
(not shown)         286    4.2%
(not shown)         250    3.7%
(not shown)         218    3.2%
Other values (149)  1817   26.7%

Common
Value             Count  Frequency (%)
(space)           2032   51.9%
1                 458    11.7%
2                 197    5.0%
3                 172    4.4%
-                 153    3.9%
4                 139    3.5%
5                 129    3.3%
7                 123    3.1%
6                 122    3.1%
0                 105    2.7%
Other values (7)  287    7.3%

Latin
Value  Count  Frequency (%)
B      5      35.7%
A      2      14.3%
D      2      14.3%
C      2      14.3%
E      2      14.3%
c      1      7.1%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  6806   63.4%
ASCII   3931   36.6%

Most frequent character per block

ASCII
Value              Count  Frequency (%)
(space)            2032   51.7%
1                  458    11.7%
2                  197    5.0%
3                  172    4.4%
-                  153    3.9%
4                  139    3.5%
5                  129    3.3%
7                  123    3.1%
6                  122    3.1%
0                  105    2.7%
Other values (13)  301    7.7%

Hangul
Value               Count  Frequency (%)
(not shown)         1224   18.0%
(not shown)         750    11.0%
(not shown)         496    7.3%
(not shown)         495    7.3%
(not shown)         471    6.9%
(not shown)         469    6.9%
(not shown)         330    4.8%
(not shown)         286    4.2%
(not shown)         250    3.7%
(not shown)         218    3.2%
Other values (149)  1817   26.7%

소재지전화
Text

Distinct: 423
Distinct (%): 90.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.8 KiB

Length

Max length: 13
Median length: 12
Mean length: 10.991453
Min length: 1
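
A minimum length of 1 alongside zero missing values suggests that some rows hold a placeholder rather than a phone number; the character statistics for this variable report 43 space characters, and two rows in the sample at the end of the report show no number. The sketch below counts such whitespace-only entries. The helper name `count_blank` and the list are illustrative, not an excerpt from the dataset, and treating these entries as missing is an assumption about intent.

```python
def count_blank(values):
    """Count entries that are empty or contain only whitespace."""
    return sum(1 for value in values if value.strip() == "")


# Illustrative values: two phone numbers and one single-space placeholder.
phones = ["041-943-8110", " ", "041-943-7000"]
print(count_blank(phones))  # 1
```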

Characters and Unicode

Total characters: 5144
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 419
Unique (%): 89.5%

Sample

1st row: 041-943-8110
2nd row: 041-943-7000
3rd row: 041-943-7058
4th row: 041-943-6648
5th row: 041-943-8316

Value               Count  Frequency (%)
041-942-0987        2      0.5%
041-942-7874        2      0.5%
041-943-2926        2      0.5%
041-943-9300        1      0.2%
041-943-1238        1      0.2%
041-944-0972        1      0.2%
041-942-6214        1      0.2%
041-942-1197        1      0.2%
041-942-4586        1      0.2%
041-942-5500        1      0.2%
Other values (412)  412    96.9%

Most occurring characters

Value             Count  Frequency (%)
4                 1017   19.8%
-                 850    16.5%
0                 689    13.4%
9                 619    12.0%
1                 577    11.2%
3                 364    7.1%
2                 355    6.9%
8                 168    3.3%
7                 158    3.1%
5                 154    3.0%
Other values (2)  193    3.8%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    4251   82.6%
Dash Punctuation  850    16.5%
Space Separator   43     0.8%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
4      1017   23.9%
0      689    16.2%
9      619    14.6%
1      577    13.6%
3      364    8.6%
2      355    8.4%
8      168    4.0%
7      158    3.7%
5      154    3.6%
6      150    3.5%

Dash Punctuation
Value  Count  Frequency (%)
-      850    100.0%

Space Separator
Value    Count  Frequency (%)
(space)  43     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  5144   100.0%

Most frequent character per script

Common
Value             Count  Frequency (%)
4                 1017   19.8%
-                 850    16.5%
0                 689    13.4%
9                 619    12.0%
1                 577    11.2%
3                 364    7.1%
2                 355    6.9%
8                 168    3.3%
7                 158    3.1%
5                 154    3.0%
Other values (2)  193    3.8%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  5144   100.0%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
4                 1017   19.8%
-                 850    16.5%
0                 689    13.4%
9                 619    12.0%
1                 577    11.2%
3                 364    7.1%
2                 355    6.9%
8                 168    3.3%
7                 158    3.1%
5                 154    3.0%
Other values (2)  193    3.8%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

     연번  업소명  소재지(도로명)  소재지전화
0    1    (주)칠갑산관광농원  충청남도 청양군 화성면 구숫골길 105-19  041-943-8110
1    2    BHC  충청남도 청양군 청양읍 중앙로 84  041-943-7000
2    3    bhc치킨청양정산점  충청남도 청양군 정산면 효자길 33  041-943-7058
3    4    JJ식당  충청남도 청양군 정산면 정현길 54, B동 16호  041-943-6648
4    5    LA김밥&카페  충청남도 청양군 정산면 서정2길 6  041-943-8316
5    6    가마솥국밥  충청남도 청양군 청양읍 칠갑산로6길 15  041-943-5585
6    7    가마솥회관  충청남도 청양군 청양읍 중앙로11길 19  041-942-9829
7    8    가마치통닭 청양읍내점  충청남도 청양군 청양읍 중앙로 130-1, 삼성의원 1층  041-943-9734
8    9    가장맛있는족발 청양점  충청남도 청양군 청양읍 칠갑산로8길 9  041-944-0057
9    10   갈비삼겹살전문점  충청남도 청양군 청양읍 칠갑산로4길 36  041-943-6022

     연번  업소명  소재지(도로명)  소재지전화
458  459  황소집  충청남도 청양군 대치면 장곡길 83  041-943-7728
459  460  황토오리철판구이  충청남도 청양군 청양읍 칠갑산로12길 17, 나동 1층  041-942-3390
460  461  황해원  충청남도 청양군 청양읍 칠갑산로 12-4  041-943-2555
461  462  회랑참치랑  충청남도 청양군 청양읍 중앙로11길 13  041-943-7942
462  463  후덕한 밥상  충청남도 청양군 운곡면 청신로 567-26  041-943-8586
463  464  훠라라양꼬치  충청남도 청양군 청양읍 중앙로열길 4
464  465  휘영청이  충청남도 청양군 대치면 장곡길 143-13
465  466  휴식  충청남도 청양군 청양읍 월촌길 56-2, 1층  041-942-5773
466  467  흙사랑  충청남도 청양군 화성면 무한로 209-7  041-943-4496
467  468  힐링낭만콘서트  충청남도 청양군 청양읍 중앙로9길 7  041-942-8773