Overview

Dataset statistics

Number of variables7
Number of observations2109
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory117.5 KiB
Average record size in memory57.1 B

Variable types

Numeric1
Categorical2
Text3
DateTime1

Dataset

Description이 데이터는 금산군에서 발행하는 금산사랑상품권의 지류 가맹점에 대하여 연번, 행정동, 가맹점명, 업태분류, 우편번호, 기본주소, 데이터기준일자 정보를 제공합니다.
Author충청남도 금산군
URLhttps://www.data.go.kr/data/15088818/fileData.do

Alerts

데이터기준일 has constant value ""Constant
행정동 is highly imbalanced (56.4%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:21:39.616658
Analysis finished2023-12-12 06:21:40.706678
Duration1.09 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct2109
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1055
Minimum1
Maximum2109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size18.7 KiB
2023-12-12T15:21:40.792575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile106.4
Q1528
median1055
Q31582
95-th percentile2003.6
Maximum2109
Range2108
Interquartile range (IQR)1054

Descriptive statistics

Standard deviation608.96018
Coefficient of variation (CV)0.57721344
Kurtosis-1.2
Mean1055
Median Absolute Deviation (MAD)527
Skewness0
Sum2224995
Variance370832.5
MonotonicityStrictly increasing
2023-12-12T15:21:40.945345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
1418 1
 
< 0.1%
1416 1
 
< 0.1%
1415 1
 
< 0.1%
1414 1
 
< 0.1%
1413 1
 
< 0.1%
1412 1
 
< 0.1%
1411 1
 
< 0.1%
1410 1
 
< 0.1%
1409 1
 
< 0.1%
Other values (2099) 2099
99.5%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
2109 1
< 0.1%
2108 1
< 0.1%
2107 1
< 0.1%
2106 1
< 0.1%
2105 1
< 0.1%
2104 1
< 0.1%
2103 1
< 0.1%
2102 1
< 0.1%
2101 1
< 0.1%
2100 1
< 0.1%

행정동
Categorical

IMBALANCE 

Distinct10
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size16.6 KiB
금산읍
1595 
추부면
204 
복수면
 
71
진산면
 
67
제원면
 
60
Other values (5)
 
112

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row추부면
2nd row금산읍
3rd row추부면
4th row추부면
5th row금산읍

Common Values

ValueCountFrequency (%)
금산읍 1595
75.6%
추부면 204
 
9.7%
복수면 71
 
3.4%
진산면 67
 
3.2%
제원면 60
 
2.8%
부리면 26
 
1.2%
군북면 24
 
1.1%
금성면 23
 
1.1%
남일면 21
 
1.0%
남이면 18
 
0.9%

Length

2023-12-12T15:21:41.091809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:21:41.248434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
금산읍 1595
75.6%
추부면 204
 
9.7%
복수면 71
 
3.4%
진산면 67
 
3.2%
제원면 60
 
2.8%
부리면 26
 
1.2%
군북면 24
 
1.1%
금성면 23
 
1.1%
남일면 21
 
1.0%
남이면 18
 
0.9%
Distinct1963
Distinct (%)93.1%
Missing0
Missing (%)0.0%
Memory size16.6 KiB
2023-12-12T15:21:41.511793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length21
Mean length6.257468
Min length2

Characters and Unicode

Total characters13197
Distinct characters691
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1903 ?
Unique (%)90.2%

Sample

1st row코리엔탈깻잎두마리치킨
2nd row봄날은온다
3rd row착한감자탕해장국
4th row내가그린정원
5th row드루와
ValueCountFrequency (%)
개인택시 66
 
2.6%
금산수삼센타 46
 
1.8%
금산점 40
 
1.5%
금산수삼센터 35
 
1.4%
금산수삼시장 21
 
0.8%
금산 17
 
0.7%
수삼센터 13
 
0.5%
소매 12
 
0.5%
씨유 11
 
0.4%
주식회사 11
 
0.4%
Other values (2131) 2309
89.5%
2023-12-12T15:21:42.013565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
500
 
3.8%
473
 
3.6%
440
 
3.3%
350
 
2.7%
252
 
1.9%
237
 
1.8%
217
 
1.6%
199
 
1.5%
175
 
1.3%
173
 
1.3%
Other values (681) 10181
77.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12027
91.1%
Space Separator 473
 
3.6%
Decimal Number 384
 
2.9%
Uppercase Letter 106
 
0.8%
Close Punctuation 69
 
0.5%
Open Punctuation 69
 
0.5%
Lowercase Letter 34
 
0.3%
Other Punctuation 17
 
0.1%
Other Symbol 15
 
0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
500
 
4.2%
440
 
3.7%
350
 
2.9%
252
 
2.1%
237
 
2.0%
217
 
1.8%
199
 
1.7%
175
 
1.5%
173
 
1.4%
171
 
1.4%
Other values (626) 9313
77.4%
Uppercase Letter
ValueCountFrequency (%)
G 12
 
11.3%
S 10
 
9.4%
A 9
 
8.5%
C 8
 
7.5%
B 7
 
6.6%
H 6
 
5.7%
P 5
 
4.7%
N 5
 
4.7%
O 5
 
4.7%
I 5
 
4.7%
Other values (13) 34
32.1%
Lowercase Letter
ValueCountFrequency (%)
e 8
23.5%
c 6
17.6%
o 4
11.8%
h 3
 
8.8%
p 3
 
8.8%
k 2
 
5.9%
a 2
 
5.9%
y 1
 
2.9%
r 1
 
2.9%
t 1
 
2.9%
Other values (3) 3
 
8.8%
Decimal Number
ValueCountFrequency (%)
1 68
17.7%
2 55
14.3%
0 39
10.2%
5 39
10.2%
7 39
10.2%
3 35
9.1%
8 33
8.6%
6 27
 
7.0%
9 25
 
6.5%
4 24
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 9
52.9%
& 7
41.2%
; 1
 
5.9%
Space Separator
ValueCountFrequency (%)
473
100.0%
Close Punctuation
ValueCountFrequency (%)
) 69
100.0%
Open Punctuation
ValueCountFrequency (%)
( 69
100.0%
Other Symbol
ValueCountFrequency (%)
15
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12040
91.2%
Common 1015
 
7.7%
Latin 140
 
1.1%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
500
 
4.2%
440
 
3.7%
350
 
2.9%
252
 
2.1%
237
 
2.0%
217
 
1.8%
199
 
1.7%
175
 
1.5%
173
 
1.4%
171
 
1.4%
Other values (625) 9326
77.5%
Latin
ValueCountFrequency (%)
G 12
 
8.6%
S 10
 
7.1%
A 9
 
6.4%
e 8
 
5.7%
C 8
 
5.7%
B 7
 
5.0%
c 6
 
4.3%
H 6
 
4.3%
P 5
 
3.6%
N 5
 
3.6%
Other values (26) 64
45.7%
Common
ValueCountFrequency (%)
473
46.6%
) 69
 
6.8%
( 69
 
6.8%
1 68
 
6.7%
2 55
 
5.4%
0 39
 
3.8%
5 39
 
3.8%
7 39
 
3.8%
3 35
 
3.4%
8 33
 
3.3%
Other values (8) 96
 
9.5%
Han
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12025
91.1%
ASCII 1154
 
8.7%
None 15
 
0.1%
CJK 2
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
500
 
4.2%
440
 
3.7%
350
 
2.9%
252
 
2.1%
237
 
2.0%
217
 
1.8%
199
 
1.7%
175
 
1.5%
173
 
1.4%
171
 
1.4%
Other values (624) 9311
77.4%
ASCII
ValueCountFrequency (%)
473
41.0%
) 69
 
6.0%
( 69
 
6.0%
1 68
 
5.9%
2 55
 
4.8%
0 39
 
3.4%
5 39
 
3.4%
7 39
 
3.4%
3 35
 
3.0%
8 33
 
2.9%
Other values (43) 235
20.4%
None
ValueCountFrequency (%)
15
100.0%
CJK
ValueCountFrequency (%)
1
50.0%
1
50.0%
Punctuation
ValueCountFrequency (%)
1
100.0%

업태분류
Categorical

Distinct9
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size16.6 KiB
소매업
955 
음식점업
678 
개인서비스업
274 
제조업
 
60
보건업
 
57
Other values (4)
 
85

Length

Max length11
Median length3
Mean length3.8003793
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row음식점업
2nd row음식점업
3rd row음식점업
4th row음식점업
5th row음식점업

Common Values

ValueCountFrequency (%)
소매업 955
45.3%
음식점업 678
32.1%
개인서비스업 274
 
13.0%
제조업 60
 
2.8%
보건업 57
 
2.7%
교육서비스업 39
 
1.8%
기타 25
 
1.2%
스포츠여가관련서비스업 12
 
0.6%
숙박업 9
 
0.4%

Length

2023-12-12T15:21:42.217163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:21:42.401844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
소매업 955
45.3%
음식점업 678
32.1%
개인서비스업 274
 
13.0%
제조업 60
 
2.8%
보건업 57
 
2.7%
교육서비스업 39
 
1.8%
기타 25
 
1.2%
스포츠여가관련서비스업 12
 
0.6%
숙박업 9
 
0.4%
Distinct63
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size16.6 KiB
2023-12-12T15:21:42.675233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.9905168
Min length1

Characters and Unicode

Total characters10525
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row32713
2nd row32735
3rd row32714
4th row32715
5th row32734
ValueCountFrequency (%)
32739 388
18.4%
32738 194
 
9.2%
32740 132
 
6.3%
32714 119
 
5.6%
32747 118
 
5.6%
32735 97
 
4.6%
32737 84
 
4.0%
32732 76
 
3.6%
32733 76
 
3.6%
32736 54
 
2.6%
Other values (53) 771
36.6%
2023-12-12T15:21:43.173434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 3326
31.6%
2 2497
23.7%
7 2334
22.2%
4 595
 
5.7%
9 461
 
4.4%
0 362
 
3.4%
1 357
 
3.4%
8 241
 
2.3%
5 221
 
2.1%
6 126
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10520
> 99.9%
Dash Punctuation 5
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 3326
31.6%
2 2497
23.7%
7 2334
22.2%
4 595
 
5.7%
9 461
 
4.4%
0 362
 
3.4%
1 357
 
3.4%
8 241
 
2.3%
5 221
 
2.1%
6 126
 
1.2%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10525
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 3326
31.6%
2 2497
23.7%
7 2334
22.2%
4 595
 
5.7%
9 461
 
4.4%
0 362
 
3.4%
1 357
 
3.4%
8 241
 
2.3%
5 221
 
2.1%
6 126
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 10525
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 3326
31.6%
2 2497
23.7%
7 2334
22.2%
4 595
 
5.7%
9 461
 
4.4%
0 362
 
3.4%
1 357
 
3.4%
8 241
 
2.3%
5 221
 
2.1%
6 126
 
1.2%
Distinct1197
Distinct (%)56.8%
Missing0
Missing (%)0.0%
Memory size16.6 KiB
2023-12-12T15:21:43.566136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length34
Mean length20.71266
Min length18

Characters and Unicode

Total characters43683
Distinct characters199
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique887 ?
Unique (%)42.1%

Sample

1st row충청남도 금산군 추부면 대학로 191
2nd row충청남도 금산군 금산읍 비호로 92
3rd row충청남도 금산군 추부면 하마전로 53
4th row충청남도 금산군 추부면 서대산로 18
5th row충청남도 금산군 금산읍 비호로 97
ValueCountFrequency (%)
충청남도 2109
19.8%
금산군 2109
19.8%
금산읍 1595
15.0%
인삼약초로 297
 
2.8%
비단로 224
 
2.1%
추부면 204
 
1.9%
금산로 158
 
1.5%
인삼로 142
 
1.3%
24 140
 
1.3%
비호로 102
 
1.0%
Other values (833) 3547
33.4%
2023-12-12T15:21:44.114503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8518
19.5%
4157
 
9.5%
4028
 
9.2%
2184
 
5.0%
2157
 
4.9%
2145
 
4.9%
2110
 
4.8%
2109
 
4.8%
1610
 
3.7%
1570
 
3.6%
Other values (189) 13095
30.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29222
66.9%
Space Separator 8518
 
19.5%
Decimal Number 5520
 
12.6%
Dash Punctuation 257
 
0.6%
Open Punctuation 79
 
0.2%
Close Punctuation 79
 
0.2%
Uppercase Letter 8
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4157
14.2%
4028
13.8%
2184
 
7.5%
2157
 
7.4%
2145
 
7.3%
2110
 
7.2%
2109
 
7.2%
1610
 
5.5%
1570
 
5.4%
580
 
2.0%
Other values (174) 6572
22.5%
Decimal Number
ValueCountFrequency (%)
1 1067
19.3%
4 798
14.5%
2 764
13.8%
3 628
11.4%
5 499
9.0%
8 426
 
7.7%
7 394
 
7.1%
6 366
 
6.6%
9 301
 
5.5%
0 277
 
5.0%
Space Separator
ValueCountFrequency (%)
8518
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 257
100.0%
Open Punctuation
ValueCountFrequency (%)
( 79
100.0%
Close Punctuation
ValueCountFrequency (%)
) 79
100.0%
Uppercase Letter
ValueCountFrequency (%)
I 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29222
66.9%
Common 14453
33.1%
Latin 8
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4157
14.2%
4028
13.8%
2184
 
7.5%
2157
 
7.4%
2145
 
7.3%
2110
 
7.2%
2109
 
7.2%
1610
 
5.5%
1570
 
5.4%
580
 
2.0%
Other values (174) 6572
22.5%
Common
ValueCountFrequency (%)
8518
58.9%
1 1067
 
7.4%
4 798
 
5.5%
2 764
 
5.3%
3 628
 
4.3%
5 499
 
3.5%
8 426
 
2.9%
7 394
 
2.7%
6 366
 
2.5%
9 301
 
2.1%
Other values (4) 692
 
4.8%
Latin
ValueCountFrequency (%)
I 8
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29222
66.9%
ASCII 14461
33.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8518
58.9%
1 1067
 
7.4%
4 798
 
5.5%
2 764
 
5.3%
3 628
 
4.3%
5 499
 
3.5%
8 426
 
2.9%
7 394
 
2.7%
6 366
 
2.5%
9 301
 
2.1%
Other values (5) 700
 
4.8%
Hangul
ValueCountFrequency (%)
4157
14.2%
4028
13.8%
2184
 
7.5%
2157
 
7.4%
2145
 
7.3%
2110
 
7.2%
2109
 
7.2%
1610
 
5.5%
1570
 
5.4%
580
 
2.0%
Other values (174) 6572
22.5%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size16.6 KiB
Minimum2021-09-15 00:00:00
Maximum2021-09-15 00:00:00
2023-12-12T15:21:44.320475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:21:44.455633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-12T15:21:40.372303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:21:44.558338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정동업태분류우편번호
연번1.0000.2500.3120.483
행정동0.2501.0000.3040.999
업태분류0.3120.3041.0000.645
우편번호0.4830.9990.6451.000
2023-12-12T15:21:44.693905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
행정동업태분류
행정동1.0000.143
업태분류0.1431.000
2023-12-12T15:21:44.798431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번행정동업태분류
연번1.0000.0790.147
행정동0.0791.0000.143
업태분류0.1470.1431.000

Missing values

2023-12-12T15:21:40.521438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:21:40.656837image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번행정동가맹점명업태분류우편번호기본주소데이터기준일
01추부면코리엔탈깻잎두마리치킨음식점업32713충청남도 금산군 추부면 대학로 1912021-09-15
12금산읍봄날은온다음식점업32735충청남도 금산군 금산읍 비호로 922021-09-15
23추부면착한감자탕해장국음식점업32714충청남도 금산군 추부면 하마전로 532021-09-15
34추부면내가그린정원음식점업32715충청남도 금산군 추부면 서대산로 182021-09-15
45금산읍드루와음식점업32734충청남도 금산군 금산읍 비호로 972021-09-15
56추부면미미루루커피음식점업32714충청남도 금산군 추부면 서대산로 37-62021-09-15
67금산읍엄마네식탁음식점업32747충청남도 금산군 금산읍 금산천길 982021-09-15
78금산읍현호프음식점업32734충청남도 금산군 금산읍 비호로 832021-09-15
89금산읍덕광상회소매업32747충청남도 금산군 금산읍 금산천길 982021-09-15
910금산읍마이산흑염소음식점업32722충청남도 금산군 금산읍 인삼로 2522021-09-15
연번행정동가맹점명업태분류우편번호기본주소데이터기준일
20992100금산읍금산스크린골프개인서비스업32726충청남도 금산군 금산읍 비호산로 42021-09-15
21002101금산읍과일카페소매업32731충청남도 금산군 금산읍 비단로 326-142021-09-15
21012102금산읍성화당한약방소매업32738충청남도 금산군 금산읍 뒷담말길 132021-09-15
21022103금산읍전원농약사, 동물약품소매업32737충청남도 금산군 금산읍 오리정1길 502021-09-15
21032104금산읍대성유통소매업32747충청남도 금산군 금산읍 인삼약초로 132021-09-15
21042105금산읍정래약국소매업32738충청남도 금산군 금산읍 인삼로 792021-09-15
21052106추부면금동정육점소매업32714충청남도 금산군 추부면 하마전로 182021-09-15
21062107추부면청춘회관음식점업32713충청남도 금산군 추부면 대학로 1582021-09-15
21072108금산읍개인택시개인서비스업32730충청남도 금산군 금산읍 비단로 338 (금산상리주공2단지아파트)2021-09-15
21082109금산읍개인택시개인서비스업32729충청남도 금산군 금산읍 사직로 150 (대원칸타빌)2021-09-15