Overview

Dataset statistics

Number of variables: 3
Number of observations: 4709
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 7
Duplicate rows (%): 0.1%
Total size in memory: 115.1 KiB
Average record size in memory: 25.0 B
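The percentage and per-record figures in the dataset statistics follow from the raw counts. The sketch below recomputes them under two assumptions that the report does not state explicitly: the cell count is variables × observations, and 1 KiB is 1024 bytes. The variable names are illustrative.

```python
# Figures copied from the dataset statistics in this report.
n_variables = 3
n_observations = 4709
missing_cells = 0
duplicate_rows = 7
total_size_kib = 115.1

# Assumption: total cells = variables x observations.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The rounded results agree with the reported 0.0%, 0.1%, and 25.0 B.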

Variable types

Text: 2
Numeric: 1

Dataset

Description: A list of local-currency merchants in Nonsan-si, Chungcheongnam-do. The merchant name, address, and postal code are provided as file data on the Public Data Portal (공공데이터 포털).
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=102&beforeMenuCd=DOM_000000201001001000&publicdatapk=15102182

Alerts

Dataset has 7 (0.1%) duplicate rows (alert type: Duplicates)
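A duplicate-row alert of this kind can be cross-checked against the raw table. The following is a minimal sketch, not ydata-profiling's own implementation: it counts every extra copy of a row once. The helper name `count_duplicate_rows` is hypothetical, and the three rows are a small illustrative subset built from records listed in this report, so their order is an assumption.

```python
from collections import Counter

def count_duplicate_rows(rows):
    """Count rows that repeat an earlier row (each extra copy counts once)."""
    counts = Counter(rows)
    return sum(c - 1 for c in counts.values() if c > 1)

# Illustrative subset: (가맹점명, 주소, 우편번호) tuples taken from this report.
rows = [
    ("오거리뒷방고기", "충청남도 논산시 중앙로410번길 7-7(취암동)", 32974),
    ("샘골가든", "충청남도 논산시 광석면 논산평야로 856-6", 32921),
    ("오거리뒷방고기", "충청남도 논산시 중앙로410번길 7-7(취암동)", 32974),
]

print(count_duplicate_rows(rows))
```

In this report every duplicated record occurs exactly twice, so counting extra copies and counting duplicated groups both give 7.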

Reproduction

Analysis started: 2024-01-09 22:23:34.550434
Analysis finished: 2024-01-09 22:23:35.271143
Duration: 0.72 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

가맹점명 (merchant name)
Text

Distinct: 4643
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 36.9 KiB

Length

Max length: 31
Median length: 24
Mean length: 6.2047144
Min length: 1
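Length statistics like these can be recomputed directly from the column. The sketch below is a plain-Python approximation that measures length in Unicode code points with `len()`; the helper name `length_stats` is hypothetical, and the five strings are the sample values shown for this variable. Recomputing from the full column is a reasonable cross-check here, because a median of 24 cannot coexist with a mean of about 6.2 and a minimum of 1: if half the values had 24 or more characters, the mean would be at least 12.5.

```python
from statistics import mean, median

def length_stats(values):
    """Summarize string lengths, counted in Unicode code points."""
    lengths = [len(v) for v in values]
    return {
        "max": max(lengths),
        "median": median(lengths),
        "mean": mean(lengths),
        "min": min(lengths),
    }

# The five sample values reported for 가맹점명.
sample = ["논산파구스", "오거리뒷방고기", "샘골가든", "꾸밈", "세븐일레븐 논산연무삼거리점"]
stats = length_stats(sample)
print(stats)

# The reported mean is total characters divided by the number of rows.
reported_mean = 29218 / 4709
print(round(reported_mean, 7))
```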

Characters and Unicode

Total characters: 29218
Distinct characters: 901
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
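As an illustration of how such character properties can be read, the sketch below tallies the general category of every character using Python's standard `unicodedata` module. It is not the profiler's implementation; the mapping from two-letter category codes to the long names used in this report covers only the categories that appear here, and the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Zs": "Space Separator",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation",
    "Po": "Other Punctuation", "Pd": "Dash Punctuation",
    "So": "Other Symbol", "Sm": "Math Symbol", "Sk": "Modifier Symbol",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in the mapping.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

counts = category_counts(["충청남도 논산시 해월로 175(반월동)"])
print(counts.most_common())
```

For the sample address, the 13 Hangul syllables fall under Other Letter, with 3 spaces, 3 digits, and one opening and one closing parenthesis.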

Unique

Unique: 4582
Unique (%): 97.3%
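The report gives both a Distinct and a Unique count. A reading consistent with the figures (4643 distinct, 4582 unique) is that Distinct counts different values and Unique counts values that occur exactly once. The sketch below computes both under that assumption; the helper name `distinct_and_unique` is hypothetical and the input is a small illustrative list of merchant names from this report.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique): different values, and values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Illustrative input: one name appears twice.
names = ["논산파구스", "오거리뒷방고기", "샘골가든", "오거리뒷방고기"]
distinct, unique = distinct_and_unique(names)
print(distinct, unique)

# Reported percentages relative to the 4709 observations.
distinct_pct = round(100 * 4643 / 4709, 1)
unique_pct = round(100 * 4582 / 4709, 1)
print(distinct_pct, unique_pct)
```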

Sample

1st row: 논산파구스
2nd row: 오거리뒷방고기
3rd row: 샘골가든
4th row: 꾸밈
5th row: 세븐일레븐 논산연무삼거리점
| Value | Count | Frequency (%) |
| 논산점 | 109 | 2.0% |
| 씨유 | 38 | 0.7% |
| 주식회사 | 30 | 0.6% |
| 지에스25 | 25 | 0.5% |
| 논산내동점 | 25 | 0.5% |
| 세븐일레븐 | 20 | 0.4% |
| 연무점 | 18 | 0.3% |
| 농업회사법인 | 11 | 0.2% |
| 강경점 | 11 | 0.2% |
| 내동점 | 10 | 0.2% |
| Other values (4793) | 5106 | 94.5% |

Most occurring characters

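| Value | Count | Frequency (%) |
|  | 789 | 2.7% |
| (space) | 698 | 2.4% |
|  | 644 | 2.2% |
|  | 554 | 1.9% |
|  | 446 | 1.5% |
|  | 445 | 1.5% |
|  | 380 | 1.3% |
|  | 376 | 1.3% |
|  | 311 | 1.1% |
|  | 308 | 1.1% |
| Other values (891) | 24267 | 83.1% |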

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 27290 | 93.4% |
| Space Separator | 698 | 2.4% |
| Decimal Number | 416 | 1.4% |
| Uppercase Letter | 279 | 1.0% |
| Open Punctuation | 173 | 0.6% |
| Close Punctuation | 173 | 0.6% |
| Lowercase Letter | 115 | 0.4% |
| Other Punctuation | 54 | 0.2% |
| Other Symbol | 14 | < 0.1% |
| Dash Punctuation | 5 | < 0.1% |

Most frequent character per category

Other Letter
| Value | Count | Frequency (%) |
|  | 789 | 2.9% |
|  | 644 | 2.4% |
|  | 554 | 2.0% |
|  | 446 | 1.6% |
|  | 445 | 1.6% |
|  | 380 | 1.4% |
|  | 376 | 1.4% |
|  | 311 | 1.1% |
|  | 308 | 1.1% |
|  | 283 | 1.0% |
| Other values (823) | 22754 | 83.4% |
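Uppercase Letter
| Value | Count | Frequency (%) |
| C | 26 | 9.3% |
| P | 23 | 8.2% |
| B | 20 | 7.2% |
| S | 18 | 6.5% |
| O | 18 | 6.5% |
| A | 17 | 6.1% |
| T | 15 | 5.4% |
| I | 14 | 5.0% |
| N | 14 | 5.0% |
| G | 13 | 4.7% |
| Other values (15) | 101 | 36.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 16 | 13.9% |
| o | 14 | 12.2% |
| a | 12 | 10.4% |
| m | 10 | 8.7% |
| p | 8 | 7.0% |
| t | 7 | 6.1% |
| l | 7 | 6.1% |
| n | 6 | 5.2% |
| c | 5 | 4.3% |
| u | 5 | 4.3% |
| Other values (9) | 25 | 21.7% |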
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 78 | 18.8% |
| 2 | 70 | 16.8% |
| 1 | 63 | 15.1% |
| 8 | 43 | 10.3% |
| 0 | 42 | 10.1% |
| 6 | 37 | 8.9% |
| 4 | 25 | 6.0% |
| 3 | 23 | 5.5% |
| 9 | 20 | 4.8% |
| 7 | 15 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 15 | 27.8% |
| , | 14 | 25.9% |
| . | 13 | 24.1% |
| ; | 7 | 13.0% |
| / | 2 | 3.7% |
| # | 2 | 3.7% |
| ! | 1 | 1.9% |
Other Symbol
| Value | Count | Frequency (%) |
|  | 13 | 92.9% |
|  | 1 | 7.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 698 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 173 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 173 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 | 100.0% |

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 27298 | 93.4% |
| Common | 1521 | 5.2% |
| Latin | 394 | 1.3% |
| Han | 5 | < 0.1% |

Most frequent character per script

Hangul
| Value | Count | Frequency (%) |
|  | 789 | 2.9% |
|  | 644 | 2.4% |
|  | 554 | 2.0% |
|  | 446 | 1.6% |
|  | 445 | 1.6% |
|  | 380 | 1.4% |
|  | 376 | 1.4% |
|  | 311 | 1.1% |
|  | 308 | 1.1% |
|  | 283 | 1.0% |
| Other values (819) | 22762 | 83.4% |
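Latin
| Value | Count | Frequency (%) |
| C | 26 | 6.6% |
| P | 23 | 5.8% |
| B | 20 | 5.1% |
| S | 18 | 4.6% |
| O | 18 | 4.6% |
| A | 17 | 4.3% |
| e | 16 | 4.1% |
| T | 15 | 3.8% |
| o | 14 | 3.6% |
| I | 14 | 3.6% |
| Other values (34) | 213 | 54.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 698 | 45.9% |
| ( | 173 | 11.4% |
| ) | 173 | 11.4% |
| 5 | 78 | 5.1% |
| 2 | 70 | 4.6% |
| 1 | 63 | 4.1% |
| 8 | 43 | 2.8% |
| 0 | 42 | 2.8% |
| 6 | 37 | 2.4% |
| 4 | 25 | 1.6% |
| Other values (13) | 119 | 7.8% |
Han
| Value | Count | Frequency (%) |
|  | 1 | 20.0% |
|  | 1 | 20.0% |
|  | 1 | 20.0% |
|  | 1 | 20.0% |
|  | 1 | 20.0% |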

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 27285 | 93.4% |
| ASCII | 1914 | 6.6% |
| None | 13 | < 0.1% |
| CJK | 5 | < 0.1% |
| Letterlike Symbols | 1 | < 0.1% |

Most frequent character per block

Hangul
| Value | Count | Frequency (%) |
|  | 789 | 2.9% |
|  | 644 | 2.4% |
|  | 554 | 2.0% |
|  | 446 | 1.6% |
|  | 445 | 1.6% |
|  | 380 | 1.4% |
|  | 376 | 1.4% |
|  | 311 | 1.1% |
|  | 308 | 1.1% |
|  | 283 | 1.0% |
| Other values (818) | 22749 | 83.4% |
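ASCII
| Value | Count | Frequency (%) |
| (space) | 698 | 36.5% |
| ( | 173 | 9.0% |
| ) | 173 | 9.0% |
| 5 | 78 | 4.1% |
| 2 | 70 | 3.7% |
| 1 | 63 | 3.3% |
| 8 | 43 | 2.2% |
| 0 | 42 | 2.2% |
| 6 | 37 | 1.9% |
| C | 26 | 1.4% |
| Other values (56) | 511 | 26.7% |
None
| Value | Count | Frequency (%) |
|  | 13 | 100.0% |
CJK
| Value | Count | Frequency (%) |
|  | 1 | 20.0% |
|  | 1 | 20.0% |
|  | 1 | 20.0% |
|  | 1 | 20.0% |
|  | 1 | 20.0% |
Letterlike Symbols
| Value | Count | Frequency (%) |
|  | 1 | 100.0% |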

주소 (address)
Text

Distinct: 4019
Distinct (%): 85.3%
Missing: 0
Missing (%): 0.0%
Memory size: 36.9 KiB

Length

Max length: 68
Median length: 53
Mean length: 24.488639
Min length: 12

Characters and Unicode

Total characters: 115317
Distinct characters: 465
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3514
Unique (%): 74.6%

Sample

1st row: 충청남도 논산시 해월로 175(반월동)
2nd row: 충청남도 논산시 중앙로410번길 7-7(취암동)
3rd row: 충청남도 논산시 광석면 논산평야로 856-6
4th row: 충청남도 논산시 시민로307번길 6(취암동) 정면 좌측 상가1층
5th row: 충청남도 논산시 연무읍 안심로 9 1층
| Value | Count | Frequency (%) |
| 논산시 | 4708 | 21.1% |
| 충청남도 | 3979 | 17.8% |
| 충남 | 730 | 3.3% |
| 연무읍 | 469 | 2.1% |
| 강경읍 | 440 | 2.0% |
| 계백로 | 422 | 1.9% |
| 중앙로 | 382 | 1.7% |
| 1층 | 260 | 1.2% |
| 연산면 | 258 | 1.2% |
| 시민로 | 228 | 1.0% |
| Other values (3301) | 10457 | 46.8% |
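A word-frequency table of this kind can be approximated by splitting each address on whitespace and counting the tokens. The sketch below does that for the five sample addresses; it is an assumption that the profiler tokenizes on whitespace, and the helper name `token_counts` is hypothetical. The last lines check an observation from the table: the counts for 충청남도 (3979) and 충남 (730) add up to the 4709 observations, which suggests the province name is written in two forms.

```python
from collections import Counter

def token_counts(values):
    """Count whitespace-separated tokens across all strings."""
    counts = Counter()
    for value in values:
        counts.update(value.split())
    return counts

# The five sample values reported for 주소.
addresses = [
    "충청남도 논산시 해월로 175(반월동)",
    "충청남도 논산시 중앙로410번길 7-7(취암동)",
    "충청남도 논산시 광석면 논산평야로 856-6",
    "충청남도 논산시 시민로307번길 6(취암동) 정면 좌측 상가1층",
    "충청남도 논산시 연무읍 안심로 9 1층",
]
counts = token_counts(addresses)
print(counts.most_common(2))

# Province prefix counts from the table, compared with the row count.
province_total = 3979 + 730
print(province_total == 4709)
```

If the two spellings are to be treated as one value, normalizing 충남 to 충청남도 before counting would be one option.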

Most occurring characters

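| Value | Count | Frequency (%) |
| (space) | 21332 | 18.5% |
|  | 5542 | 4.8% |
|  | 5452 | 4.7% |
|  | 4892 | 4.2% |
| 1 | 4766 | 4.1% |
|  | 4729 | 4.1% |
|  | 4723 | 4.1% |
|  | 4438 | 3.8% |
|  | 4024 | 3.5% |
|  | 3989 | 3.5% |
| Other values (455) | 51430 | 44.6% |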

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 66627 | 57.8% |
| Space Separator | 21332 | 18.5% |
| Decimal Number | 20603 | 17.9% |
| Close Punctuation | 2459 | 2.1% |
| Open Punctuation | 2459 | 2.1% |
| Dash Punctuation | 1582 | 1.4% |
| Other Punctuation | 184 | 0.2% |
| Uppercase Letter | 57 | < 0.1% |
| Lowercase Letter | 10 | < 0.1% |
| Math Symbol | 2 | < 0.1% |

Most frequent character per category

Other Letter
| Value | Count | Frequency (%) |
|  | 5542 | 8.3% |
|  | 5452 | 8.2% |
|  | 4892 | 7.3% |
|  | 4729 | 7.1% |
|  | 4723 | 7.1% |
|  | 4438 | 6.7% |
|  | 4024 | 6.0% |
|  | 3989 | 6.0% |
|  | 2738 | 4.1% |
|  | 2262 | 3.4% |
| Other values (412) | 23838 | 35.8% |
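Uppercase Letter
| Value | Count | Frequency (%) |
| L | 11 | 19.3% |
| H | 8 | 14.0% |
| T | 8 | 14.0% |
| A | 5 | 8.8% |
| P | 3 | 5.3% |
| E | 3 | 5.3% |
| K | 3 | 5.3% |
| B | 3 | 5.3% |
| N | 2 | 3.5% |
| M | 2 | 3.5% |
| Other values (6) | 9 | 15.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4766 | 23.1% |
| 2 | 2875 | 14.0% |
| 3 | 2069 | 10.0% |
| 0 | 2022 | 9.8% |
| 4 | 1937 | 9.4% |
| 5 | 1523 | 7.4% |
| 8 | 1477 | 7.2% |
| 9 | 1364 | 6.6% |
| 7 | 1351 | 6.6% |
| 6 | 1219 | 5.9% |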
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2 | 20.0% |
| k | 2 | 20.0% |
| t | 2 | 20.0% |
| c | 1 | 10.0% |
| a | 1 | 10.0% |
| o | 1 | 10.0% |
| u | 1 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 179 | 97.3% |
| . | 3 | 1.6% |
| @ | 1 | 0.5% |
| # | 1 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 21332 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2459 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2459 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1582 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 2 | 100.0% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 | 100.0% |

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 66627 | 57.8% |
| Common | 48623 | 42.2% |
| Latin | 67 | 0.1% |

Most frequent character per script

Hangul
| Value | Count | Frequency (%) |
|  | 5542 | 8.3% |
|  | 5452 | 8.2% |
|  | 4892 | 7.3% |
|  | 4729 | 7.1% |
|  | 4723 | 7.1% |
|  | 4438 | 6.7% |
|  | 4024 | 6.0% |
|  | 3989 | 6.0% |
|  | 2738 | 4.1% |
|  | 2262 | 3.4% |
| Other values (412) | 23838 | 35.8% |
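Latin
| Value | Count | Frequency (%) |
| L | 11 | 16.4% |
| H | 8 | 11.9% |
| T | 8 | 11.9% |
| A | 5 | 7.5% |
| P | 3 | 4.5% |
| E | 3 | 4.5% |
| K | 3 | 4.5% |
| B | 3 | 4.5% |
| N | 2 | 3.0% |
| M | 2 | 3.0% |
| Other values (13) | 19 | 28.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 21332 | 43.9% |
| 1 | 4766 | 9.8% |
| 2 | 2875 | 5.9% |
| ) | 2459 | 5.1% |
| ( | 2459 | 5.1% |
| 3 | 2069 | 4.3% |
| 0 | 2022 | 4.2% |
| 4 | 1937 | 4.0% |
| - | 1582 | 3.3% |
| 5 | 1523 | 3.1% |
| Other values (10) | 5599 | 11.5% |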

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 66627 | 57.8% |
| ASCII | 48690 | 42.2% |

Most frequent character per block

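ASCII
| Value | Count | Frequency (%) |
| (space) | 21332 | 43.8% |
| 1 | 4766 | 9.8% |
| 2 | 2875 | 5.9% |
| ) | 2459 | 5.1% |
| ( | 2459 | 5.1% |
| 3 | 2069 | 4.2% |
| 0 | 2022 | 4.2% |
| 4 | 1937 | 4.0% |
| - | 1582 | 3.2% |
| 5 | 1523 | 3.1% |
| Other values (33) | 5666 | 11.6% |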
Hangul
| Value | Count | Frequency (%) |
|  | 5542 | 8.3% |
|  | 5452 | 8.2% |
|  | 4892 | 7.3% |
|  | 4729 | 7.1% |
|  | 4723 | 7.1% |
|  | 4438 | 6.7% |
|  | 4024 | 6.0% |
|  | 3989 | 6.0% |
|  | 2738 | 4.1% |
|  | 2262 | 3.4% |
| Other values (412) | 23838 | 35.8% |

우편번호 (postal code)
Real number (ℝ)

Distinct: 126
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 32968.304
Minimum: 32900
Maximum: 33028
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 41.5 KiB

Quantile statistics

Minimum: 32900
5-th percentile: 32911
Q1: 32950
Median: 32974
Q3: 32989
95-th percentile: 33015
Maximum: 33028
Range: 128
Interquartile range (IQR): 39

Descriptive statistics

Standard deviation: 30.567774
Coefficient of variation (CV): 0.00092718673
Kurtosis: -0.50975737
Mean: 32968.304
Median Absolute Deviation (MAD): 18
Skewness: -0.37282531
Sum: 1.5524774 × 10^8
Variance: 934.38881
Monotonicity: Not monotonic
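Several of these statistics are derived from others, so they can be checked for internal consistency. The sketch below recomputes the coefficient of variation (standard deviation divided by mean), the variance (standard deviation squared), the interquartile range, the range, and the sum (mean times the number of observations) from the reported figures. The variable names are illustrative, and the inputs are rounded values, so small differences in the last digits are expected.

```python
# Figures copied from the 우편번호 statistics in this report.
n_observations = 4709
mean_value = 32968.304
std_dev = 30.567774
q1, q3 = 32950, 32989
minimum, maximum = 32900, 33028

cv = std_dev / mean_value          # coefficient of variation
variance = std_dev ** 2            # variance is the squared standard deviation
iqr = q3 - q1                      # interquartile range
value_range = maximum - minimum    # range
total = mean_value * n_observations  # sum of all values

print(f"CV: {cv:.11f}")
print(f"Variance: {variance:.5f}")
print(f"IQR: {iqr}, Range: {value_range}")
print(f"Sum: {total:.7e}")
```

The recomputed values agree with the reported CV, variance, IQR of 39, range of 128, and sum of about 1.5524774 × 10^8.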
Histogram with fixed size bins (bins=50)
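Common values
| Value | Count | Frequency (%) |
| 32974 | 387 | 8.2% |
| 32967 | 284 | 6.0% |
| 33008 | 178 | 3.8% |
| 33007 | 169 | 3.6% |
| 32976 | 149 | 3.2% |
| 32989 | 145 | 3.1% |
| 32991 | 145 | 3.1% |
| 32983 | 135 | 2.9% |
| 32968 | 131 | 2.8% |
| 32954 | 125 | 2.7% |
| Other values (116) | 2861 | 60.8% |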
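Minimum 10 values
| Value | Count | Frequency (%) |
| 32900 | 1 | < 0.1% |
| 32901 | 11 | 0.2% |
| 32902 | 3 | 0.1% |
| 32903 | 47 | 1.0% |
| 32904 | 31 | 0.7% |
| 32905 | 11 | 0.2% |
| 32906 | 10 | 0.2% |
| 32907 | 10 | 0.2% |
| 32908 | 6 | 0.1% |
| 32909 | 8 | 0.2% |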
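Maximum 10 values
| Value | Count | Frequency (%) |
| 33028 | 12 | 0.3% |
| 33027 | 6 | 0.1% |
| 33026 | 24 | 0.5% |
| 33025 | 4 | 0.1% |
| 33024 | 1 | < 0.1% |
| 33023 | 19 | 0.4% |
| 33022 | 47 | 1.0% |
| 33021 | 37 | 0.8% |
| 33020 | 32 | 0.7% |
| 33019 | 3 | 0.1% |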

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
| Index | 가맹점명 | 주소 | 우편번호 |
| 0 | 논산파구스 | 충청남도 논산시 해월로 175(반월동) | 32968 |
| 1 | 오거리뒷방고기 | 충청남도 논산시 중앙로410번길 7-7(취암동) | 32974 |
| 2 | 샘골가든 | 충청남도 논산시 광석면 논산평야로 856-6 | 32921 |
| 3 | 꾸밈 | 충청남도 논산시 시민로307번길 6(취암동) 정면 좌측 상가1층 | 32977 |
| 4 | 세븐일레븐 논산연무삼거리점 | 충청남도 논산시 연무읍 안심로 9 1층 | 33007 |
| 5 | 호김밥 | 충청남도 논산시 시민로308번길 28(취암동) | 32980 |
| 6 | 미쿡미트마켓 논산취암점 | 충청남도 논산시 시민로307번길 12-1(취암동) 1층 | 32977 |
| 7 | 황제농장 | 충청남도 논산시 광석면 장마루로 141-46 | 32924 |
| 8 | 대패집 | 충청남도 논산시 강경읍 여강로 1325 | 32940 |
| 9 | 단테 | 충청남도 논산시 시민로210번길 26(내동) 1층 | 32989 |
Last rows
| Index | 가맹점명 | 주소 | 우편번호 |
| 4699 | 강남동태찜 | 충남 논산시 중앙로 346 | 32979 |
| 4700 | 이브자리 | 충청남도 논산시 중앙로 394(취암동) | 32974 |
| 4701 | 시골삼겹살 | 충남 논산시 시민로194번길 18 | 32989 |
| 4702 | 문경휘트니스 | 충남 논산시 중앙로410번길 6 | 32974 |
| 4703 | 임실치즈피자 | 충청남도 논산시 시민로 287(내동) | 32981 |
| 4704 | 5앤5 | 충청남도 논산시 체육로 31-5(내동) | 32983 |
| 4705 | 올리비아로렌 | 충남 논산시 해월로167번길 9-1 | 32968 |
| 4706 | 나래꼬마김밥 | 충청남도 논산시 중앙로 235(내동) 105호 | 32986 |
| 4707 | 사탕수수족욕카페 | 충청남도 논산시 은진면 방축길 16-10 | 33001 |
| 4708 | 용천자연유리총판대리점 | 충청남도 논산시 은진면 방축3길 22 | 33002 |

Duplicate rows

Most frequently occurring

| Index | 가맹점명 | 주소 | 우편번호 | # duplicates |
| 0 | 논산곤드레밥 송림한우 | 충청남도 논산시 부창로72번길 7-17(부창동) | 32976 | 2 |
| 1 | 로트라인 | 충청남도 논산시 중앙로384번길 46-1(취암동) 1층 | 32974 | 2 |
| 2 | 백씨네가든 | 충청남도 논산시 성동면 금백로 667-5 | 32928 | 2 |
| 3 | 새우김밥 | 충청남도 논산시 강경읍 대흥로 32-1 | 32936 | 2 |
| 4 | 오거리뒷방고기 | 충청남도 논산시 중앙로410번길 7-7(취암동) | 32974 | 2 |
| 5 | 장수고을 | 충청남도 논산시 강산길 86(강산동) | 32959 | 2 |
| 6 | 한성건설기계 | 충청남도 논산시 시민로210번길 4(내동) | 32989 | 2 |