Overview

Dataset statistics

Number of variables6
Number of observations3560
Missing cells1583
Missing cells (%)7.4%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory170.5 KiB
Average record size in memory49.0 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description충청남도 보령시 보령사랑상품권 가맹점 현황자료의 연번, 상호, 소재지, 사업장 전화번호, 행정동, 업종 현황자료 공개데이터
Author충청남도 보령시
URLhttps://www.data.go.kr/data/15089537/fileData.do

Alerts

Dataset has 1 (< 0.1%) duplicate rowsDuplicates
연번 has 549 (15.4%) missing valuesMissing
사업장전화번호 has 1034 (29.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 22:50:17.982123
Analysis finished2023-12-12 22:50:19.007808
Duration1.03 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

MISSING 

Distinct3011
Distinct (%)100.0%
Missing549
Missing (%)15.4%
Infinite0
Infinite (%)0.0%
Mean1506.1069
Minimum1
Maximum3013
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size31.4 KiB
2023-12-13T07:50:19.069415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile151.5
Q1753.5
median1506
Q32258.5
95-th percentile2862.5
Maximum3013
Range3012
Interquartile range (IQR)1505

Descriptive statistics

Standard deviation869.5206
Coefficient of variation (CV)0.57732992
Kurtosis-1.1992733
Mean1506.1069
Median Absolute Deviation (MAD)753
Skewness0.00062516103
Sum4534888
Variance756066.08
MonotonicityStrictly increasing
2023-12-13T07:50:19.187160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2013 1
 
< 0.1%
2004 1
 
< 0.1%
2005 1
 
< 0.1%
2006 1
 
< 0.1%
2007 1
 
< 0.1%
2008 1
 
< 0.1%
2009 1
 
< 0.1%
2010 1
 
< 0.1%
2011 1
 
< 0.1%
2012 1
 
< 0.1%
Other values (3001) 3001
84.3%
(Missing) 549
 
15.4%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
3013 1
< 0.1%
3012 1
< 0.1%
3011 1
< 0.1%
3010 1
< 0.1%
3009 1
< 0.1%
3008 1
< 0.1%
3007 1
< 0.1%
3006 1
< 0.1%
3005 1
< 0.1%
3004 1
< 0.1%

상호
Text

Distinct3486
Distinct (%)97.9%
Missing0
Missing (%)0.0%
Memory size27.9 KiB
2023-12-13T07:50:19.507450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length6.0233146
Min length1

Characters and Unicode

Total characters21443
Distinct characters812
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3417 ?
Unique (%)96.0%

Sample

1st row바이태닝 스튜디오
2nd row쁘띠헤어
3rd row동그루밍
4th row엘리트교복(보령점)
5th row한내해물칼국수
ValueCountFrequency (%)
보령점 41
 
1.0%
cu 29
 
0.7%
gs25 28
 
0.7%
이마트24 18
 
0.4%
세븐일레븐 17
 
0.4%
주식회사 14
 
0.3%
대천점 14
 
0.3%
동대점 7
 
0.2%
아모레 6
 
0.1%
보령동대점 5
 
0.1%
Other values (3646) 3859
95.6%
2023-12-13T07:50:20.309713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
493
 
2.3%
478
 
2.2%
460
 
2.1%
450
 
2.1%
386
 
1.8%
373
 
1.7%
348
 
1.6%
334
 
1.6%
301
 
1.4%
286
 
1.3%
Other values (802) 17534
81.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20107
93.8%
Space Separator 478
 
2.2%
Decimal Number 317
 
1.5%
Uppercase Letter 278
 
1.3%
Close Punctuation 76
 
0.4%
Open Punctuation 76
 
0.4%
Lowercase Letter 61
 
0.3%
Other Punctuation 37
 
0.2%
Other Symbol 13
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
493
 
2.5%
460
 
2.3%
450
 
2.2%
386
 
1.9%
373
 
1.9%
348
 
1.7%
334
 
1.7%
301
 
1.5%
286
 
1.4%
266
 
1.3%
Other values (734) 16410
81.6%
Uppercase Letter
ValueCountFrequency (%)
C 52
18.7%
S 45
16.2%
G 39
14.0%
U 36
12.9%
L 14
 
5.0%
P 12
 
4.3%
B 10
 
3.6%
H 8
 
2.9%
I 6
 
2.2%
A 6
 
2.2%
Other values (14) 50
18.0%
Lowercase Letter
ValueCountFrequency (%)
e 9
14.8%
a 7
11.5%
r 6
9.8%
n 6
9.8%
o 5
 
8.2%
g 4
 
6.6%
s 3
 
4.9%
m 3
 
4.9%
t 3
 
4.9%
i 2
 
3.3%
Other values (11) 13
21.3%
Decimal Number
ValueCountFrequency (%)
2 117
36.9%
5 58
18.3%
4 50
15.8%
1 26
 
8.2%
0 19
 
6.0%
8 16
 
5.0%
9 12
 
3.8%
3 10
 
3.2%
7 5
 
1.6%
6 4
 
1.3%
Other Punctuation
ValueCountFrequency (%)
, 14
37.8%
& 12
32.4%
. 5
 
13.5%
; 2
 
5.4%
# 2
 
5.4%
/ 1
 
2.7%
' 1
 
2.7%
Close Punctuation
ValueCountFrequency (%)
) 69
90.8%
] 7
 
9.2%
Open Punctuation
ValueCountFrequency (%)
( 69
90.8%
[ 7
 
9.2%
Space Separator
ValueCountFrequency (%)
478
100.0%
Other Symbol
ValueCountFrequency (%)
13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20120
93.8%
Common 984
 
4.6%
Latin 339
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
493
 
2.5%
460
 
2.3%
450
 
2.2%
386
 
1.9%
373
 
1.9%
348
 
1.7%
334
 
1.7%
301
 
1.5%
286
 
1.4%
266
 
1.3%
Other values (735) 16423
81.6%
Latin
ValueCountFrequency (%)
C 52
15.3%
S 45
13.3%
G 39
 
11.5%
U 36
 
10.6%
L 14
 
4.1%
P 12
 
3.5%
B 10
 
2.9%
e 9
 
2.7%
H 8
 
2.4%
a 7
 
2.1%
Other values (35) 107
31.6%
Common
ValueCountFrequency (%)
478
48.6%
2 117
 
11.9%
) 69
 
7.0%
( 69
 
7.0%
5 58
 
5.9%
4 50
 
5.1%
1 26
 
2.6%
0 19
 
1.9%
8 16
 
1.6%
, 14
 
1.4%
Other values (12) 68
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20107
93.8%
ASCII 1323
 
6.2%
None 13
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
493
 
2.5%
460
 
2.3%
450
 
2.2%
386
 
1.9%
373
 
1.9%
348
 
1.7%
334
 
1.7%
301
 
1.5%
286
 
1.4%
266
 
1.3%
Other values (734) 16410
81.6%
ASCII
ValueCountFrequency (%)
478
36.1%
2 117
 
8.8%
) 69
 
5.2%
( 69
 
5.2%
5 58
 
4.4%
C 52
 
3.9%
4 50
 
3.8%
S 45
 
3.4%
G 39
 
2.9%
U 36
 
2.7%
Other values (57) 310
23.4%
None
ValueCountFrequency (%)
13
100.0%
Distinct2672
Distinct (%)75.1%
Missing0
Missing (%)0.0%
Memory size27.9 KiB
2023-12-13T07:50:20.597855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length38
Mean length20.798876
Min length9

Characters and Unicode

Total characters74044
Distinct characters258
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2160 ?
Unique (%)60.7%

Sample

1st row충청남도 보령시 작은오랏7길 35(동대동)
2nd row충청남도 보령시 한내로터리길 79(동대동)
3rd row충청남도 보령시 작은오랏3길 82-1(동대동)
4th row충청남도 보령시 원동1길 69(대천동, 시티타워)
5th row충청남도 보령시 정화로길 16(대천동)
ValueCountFrequency (%)
보령시 3553
23.0%
충청남도 3032
19.6%
대천동 352
 
2.3%
웅천읍 245
 
1.6%
중앙로 191
 
1.2%
대흥로 149
 
1.0%
동대동 135
 
0.9%
신흑동 127
 
0.8%
대천항로 113
 
0.7%
주공로 113
 
0.7%
Other values (1698) 7459
48.2%
2023-12-13T07:50:21.044958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11910
 
16.1%
3782
 
5.1%
3765
 
5.1%
3712
 
5.0%
3436
 
4.6%
3245
 
4.4%
3237
 
4.4%
3209
 
4.3%
3053
 
4.1%
( 2417
 
3.3%
Other values (248) 32278
43.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46864
63.3%
Space Separator 11910
 
16.1%
Decimal Number 9712
 
13.1%
Open Punctuation 2417
 
3.3%
Close Punctuation 2417
 
3.3%
Dash Punctuation 569
 
0.8%
Other Punctuation 149
 
0.2%
Lowercase Letter 4
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3782
 
8.1%
3765
 
8.0%
3712
 
7.9%
3436
 
7.3%
3245
 
6.9%
3237
 
6.9%
3209
 
6.8%
3053
 
6.5%
2339
 
5.0%
2026
 
4.3%
Other values (230) 15060
32.1%
Decimal Number
ValueCountFrequency (%)
1 2056
21.2%
2 1290
13.3%
3 1231
12.7%
4 938
9.7%
6 804
 
8.3%
5 786
 
8.1%
8 717
 
7.4%
7 664
 
6.8%
9 615
 
6.3%
0 611
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
H 1
50.0%
L 1
50.0%
Space Separator
ValueCountFrequency (%)
11910
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2417
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2417
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 569
100.0%
Other Punctuation
ValueCountFrequency (%)
, 149
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46864
63.3%
Common 27174
36.7%
Latin 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3782
 
8.1%
3765
 
8.0%
3712
 
7.9%
3436
 
7.3%
3245
 
6.9%
3237
 
6.9%
3209
 
6.8%
3053
 
6.5%
2339
 
5.0%
2026
 
4.3%
Other values (230) 15060
32.1%
Common
ValueCountFrequency (%)
11910
43.8%
( 2417
 
8.9%
) 2417
 
8.9%
1 2056
 
7.6%
2 1290
 
4.7%
3 1231
 
4.5%
4 938
 
3.5%
6 804
 
3.0%
5 786
 
2.9%
8 717
 
2.6%
Other values (5) 2608
 
9.6%
Latin
ValueCountFrequency (%)
e 4
66.7%
H 1
 
16.7%
L 1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46864
63.3%
ASCII 27180
36.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11910
43.8%
( 2417
 
8.9%
) 2417
 
8.9%
1 2056
 
7.6%
2 1290
 
4.7%
3 1231
 
4.5%
4 938
 
3.5%
6 804
 
3.0%
5 786
 
2.9%
8 717
 
2.6%
Other values (8) 2614
 
9.6%
Hangul
ValueCountFrequency (%)
3782
 
8.1%
3765
 
8.0%
3712
 
7.9%
3436
 
7.3%
3245
 
6.9%
3237
 
6.9%
3209
 
6.8%
3053
 
6.5%
2339
 
5.0%
2026
 
4.3%
Other values (230) 15060
32.1%

사업장전화번호
Text

MISSING 

Distinct2438
Distinct (%)96.5%
Missing1034
Missing (%)29.0%
Memory size27.9 KiB
2023-12-13T07:50:21.269464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.010293
Min length12

Characters and Unicode

Total characters30338
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2353 ?
Unique (%)93.2%

Sample

1st row041-935-7070
2nd row041-932-9955
3rd row041-935-3234
4th row041-931-1880
5th row041-934-8709
ValueCountFrequency (%)
041-933-0009 4
 
0.2%
041-936-1283 3
 
0.1%
041-935-5448 2
 
0.1%
041-936-4554 2
 
0.1%
041-932-0036 2
 
0.1%
041-936-4488 2
 
0.1%
041-933-6600 2
 
0.1%
041-933-5015 2
 
0.1%
041-641-5411 2
 
0.1%
041-935-5447 2
 
0.1%
Other values (2428) 2503
99.1%
2023-12-13T07:50:21.643323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 5052
16.7%
1 3956
13.0%
3 3829
12.6%
0 3804
12.5%
4 3792
12.5%
9 3388
11.2%
5 1613
 
5.3%
2 1516
 
5.0%
6 1267
 
4.2%
8 1100
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 25286
83.3%
Dash Punctuation 5052
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 3956
15.6%
3 3829
15.1%
0 3804
15.0%
4 3792
15.0%
9 3388
13.4%
5 1613
6.4%
2 1516
 
6.0%
6 1267
 
5.0%
8 1100
 
4.4%
7 1021
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 5052
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 30338
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 5052
16.7%
1 3956
13.0%
3 3829
12.6%
0 3804
12.5%
4 3792
12.5%
9 3388
11.2%
5 1613
 
5.3%
2 1516
 
5.0%
6 1267
 
4.2%
8 1100
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30338
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 5052
16.7%
1 3956
13.0%
3 3829
12.6%
0 3804
12.5%
4 3792
12.5%
9 3388
11.2%
5 1613
 
5.3%
2 1516
 
5.0%
6 1267
 
4.2%
8 1100
 
3.6%

행정동
Categorical

Distinct21
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size27.9 KiB
대천동
1097 
동대동
772 
신흑동
368 
명천동
255 
웅천읍
245 
Other values (16)
823 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동대동
2nd row동대동
3rd row동대동
4th row대천동
5th row대천동

Common Values

ValueCountFrequency (%)
대천동 1097
30.8%
동대동 772
21.7%
신흑동 368
 
10.3%
명천동 255
 
7.2%
웅천읍 245
 
6.9%
죽정동 142
 
4.0%
남포면 88
 
2.5%
궁촌동 83
 
2.3%
천북면 74
 
2.1%
성주면 63
 
1.8%
Other values (11) 373
 
10.5%

Length

2023-12-13T07:50:21.842254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
대천동 1097
30.8%
동대동 772
21.7%
신흑동 368
 
10.3%
명천동 255
 
7.2%
웅천읍 245
 
6.9%
죽정동 142
 
4.0%
남포면 88
 
2.5%
궁촌동 83
 
2.3%
천북면 74
 
2.1%
성주면 63
 
1.8%
Other values (11) 373
 
10.5%

업종
Categorical

Distinct9
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size27.9 KiB
소매업
1565 
음식점업
1205 
개인서비스업
323 
보건업
 
115
기타
 
111
Other values (4)
241 

Length

Max length11
Median length3
Mean length3.7230337
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row소매업
2nd row개인서비스업
3rd row개인서비스업
4th row소매업
5th row음식점업

Common Values

ValueCountFrequency (%)
소매업 1565
44.0%
음식점업 1205
33.8%
개인서비스업 323
 
9.1%
보건업 115
 
3.2%
기타 111
 
3.1%
교육서비스업 101
 
2.8%
제조업 83
 
2.3%
숙박업 31
 
0.9%
스포츠여가관련서비스업 26
 
0.7%

Length

2023-12-13T07:50:21.972159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:50:22.107062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소매업 1565
44.0%
음식점업 1205
33.8%
개인서비스업 323
 
9.1%
보건업 115
 
3.2%
기타 111
 
3.1%
교육서비스업 101
 
2.8%
제조업 83
 
2.3%
숙박업 31
 
0.9%
스포츠여가관련서비스업 26
 
0.7%

Interactions

2023-12-13T07:50:18.655185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T07:50:22.223589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정동업종
연번1.0000.3400.222
행정동0.3401.0000.371
업종0.2220.3711.000
2023-12-13T07:50:22.345755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종행정동
업종1.0000.152
행정동0.1521.000
2023-12-13T07:50:22.430434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정동업종
연번1.0000.1320.102
행정동0.1321.0000.152
업종0.1020.1521.000

Missing values

2023-12-13T07:50:18.763695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:50:18.864296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:50:18.961371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

연번상호소재지사업장전화번호행정동업종
01바이태닝 스튜디오충청남도 보령시 작은오랏7길 35(동대동)<NA>동대동소매업
12쁘띠헤어충청남도 보령시 한내로터리길 79(동대동)<NA>동대동개인서비스업
23동그루밍충청남도 보령시 작은오랏3길 82-1(동대동)<NA>동대동개인서비스업
34엘리트교복(보령점)충청남도 보령시 원동1길 69(대천동, 시티타워)041-935-7070대천동소매업
45한내해물칼국수충청남도 보령시 정화로길 16(대천동)<NA>대천동음식점업
56오복정육점충청남도 보령시 중앙시장1길 4(대천동)<NA>대천동소매업
67웰컴드론충청남도 보령시 보령북로 81(대천동)<NA>대천동소매업
78티파니귀금속충청남도 보령시 대흥로 38(대천동)041-932-9955대천동소매업
89점프윙스줄넘기클럽보령점충청남도 보령시 신평1길 3(대천동)<NA>대천동교육서비스업
910다미네일충청남도 보령시 남대천로 67(대천동)041-935-3234대천동개인서비스업
연번상호소재지사업장전화번호행정동업종
3550<NA>비타민약국충청남도 보령시 대천로 17 (대천동)041-933-7087대천동보건업
3551<NA>보령종로약국충청남도 보령시 대천로 20 (대천동)041-933-0114대천동보건업
3552<NA>써스데이 아일랜드충청남도 보령시 원동1길 30-13 (대천동)041-936-9695대천동소매업
3553<NA>조이너스충청남도 보령시 중앙로 76 (대천동)041-936-1614대천동소매업
3554<NA>아리따움충남보령점충청남도 보령시 중앙로 63 (대천동)041-931-5884대천동소매업
3555<NA>크레타충청남도 보령시 중앙로 67 (대천동)041-932-1730대천동소매업
3556<NA>보령슈퍼충청남도 보령시 중앙로 61 (대천동)041-935-3828대천동소매업
3557<NA>만능스포츠충청남도 보령시 중앙로 45 (대천동)041-935-0729대천동소매업
3558<NA>대천자전차충청남도 보령시 대흥로 2 (대천동)041-933-1330대천동소매업
3559<NA>금성세탁소충청남도 보령시 중앙로 52 (대천동)041-935-6675대천동기타

Duplicate rows

Most frequently occurring

연번상호소재지사업장전화번호행정동업종# duplicates
0<NA>거북수산충청남도 보령시 대천항로 334 (신흑동)041-931-2225신흑동소매업2