Overview

Dataset statistics

Number of variables5
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory478.5 KiB
Average record size in memory49.0 B

Variable types

Numeric1
Categorical2
Text2

Dataset

Description경상북도 150,846개의 상품권(지역화폐)를 사용하는 소상공인 사업체 정보(순번, 상품권(지역화폐) 종류, 상호, 시군명, 주소) 데이터 셋 (CSV 파일)
Author경상북도
URLhttps://www.data.go.kr/data/15096095/fileData.do

Alerts

순번 is highly overall correlated with 시군High correlation
시군 is highly overall correlated with 순번High correlation
순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 21:35:35.975306
Analysis finished2023-12-12 21:35:37.251736
Duration1.28 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48664.259
Minimum2
Maximum98070
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T06:35:37.333329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile4483.9
Q124353.5
median48612.5
Q372833
95-th percentile93166.65
Maximum98070
Range98068
Interquartile range (IQR)48479.5

Descriptive statistics

Standard deviation28313.475
Coefficient of variation (CV)0.58181251
Kurtosis-1.1893367
Mean48664.259
Median Absolute Deviation (MAD)24231
Skewness0.012601089
Sum4.8664259 × 108
Variance8.0165285 × 108
MonotonicityNot monotonic
2023-12-13T06:35:37.503750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
43699 1
 
< 0.1%
57460 1
 
< 0.1%
81798 1
 
< 0.1%
51111 1
 
< 0.1%
76934 1
 
< 0.1%
55818 1
 
< 0.1%
61702 1
 
< 0.1%
4855 1
 
< 0.1%
50752 1
 
< 0.1%
45681 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2 1
< 0.1%
4 1
< 0.1%
9 1
< 0.1%
11 1
< 0.1%
12 1
< 0.1%
13 1
< 0.1%
48 1
< 0.1%
56 1
< 0.1%
79 1
< 0.1%
85 1
< 0.1%
ValueCountFrequency (%)
98070 1
< 0.1%
98053 1
< 0.1%
98030 1
< 0.1%
98011 1
< 0.1%
98009 1
< 0.1%
98008 1
< 0.1%
98003 1
< 0.1%
97998 1
< 0.1%
97995 1
< 0.1%
97980 1
< 0.1%
Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
카드형
4888 
지류형
4292 
모바일형
820 

Length

Max length4
Median length3
Mean length3.082
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지류형
2nd row모바일형
3rd row지류형
4th row카드형
5th row모바일형

Common Values

ValueCountFrequency (%)
카드형 4888
48.9%
지류형 4292
42.9%
모바일형 820
 
8.2%

Length

2023-12-13T06:35:37.642828image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:35:37.734519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
카드형 4888
48.9%
지류형 4292
42.9%
모바일형 820
 
8.2%
Distinct9282
Distinct (%)92.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:35:37.940590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length27
Mean length6.6144
Min length1

Characters and Unicode

Total characters66144
Distinct characters1058
Distinct categories13 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8645 ?
Unique (%)86.5%

Sample

1st row와촌식육식당
2nd row동아문구사
3rd row마루늘보
4th row1001안경원
5th row굿디자인
ValueCountFrequency (%)
경북15바 124
 
1.0%
세븐일레븐 49
 
0.4%
씨유 43
 
0.3%
43
 
0.3%
개인택시 42
 
0.3%
주식회사 38
 
0.3%
gs25 31
 
0.2%
영주점 29
 
0.2%
안동점 27
 
0.2%
문경점 21
 
0.2%
Other values (10094) 12028
96.4%
2023-12-13T06:35:38.401959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2475
 
3.7%
1815
 
2.7%
1087
 
1.6%
965
 
1.5%
839
 
1.3%
823
 
1.2%
801
 
1.2%
761
 
1.2%
759
 
1.1%
744
 
1.1%
Other values (1048) 55075
83.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 59166
89.5%
Space Separator 2475
 
3.7%
Decimal Number 1739
 
2.6%
Uppercase Letter 1076
 
1.6%
Lowercase Letter 551
 
0.8%
Open Punctuation 462
 
0.7%
Close Punctuation 462
 
0.7%
Other Punctuation 160
 
0.2%
Other Symbol 34
 
0.1%
Dash Punctuation 14
 
< 0.1%
Other values (3) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1815
 
3.1%
1087
 
1.8%
965
 
1.6%
839
 
1.4%
823
 
1.4%
801
 
1.4%
761
 
1.3%
759
 
1.3%
744
 
1.3%
677
 
1.1%
Other values (967) 49895
84.3%
Uppercase Letter
ValueCountFrequency (%)
S 138
 
12.8%
C 114
 
10.6%
G 98
 
9.1%
E 61
 
5.7%
A 55
 
5.1%
O 53
 
4.9%
P 51
 
4.7%
M 49
 
4.6%
B 45
 
4.2%
T 45
 
4.2%
Other values (16) 367
34.1%
Lowercase Letter
ValueCountFrequency (%)
e 85
15.4%
n 51
 
9.3%
a 49
 
8.9%
i 47
 
8.5%
o 41
 
7.4%
s 31
 
5.6%
r 28
 
5.1%
t 25
 
4.5%
d 25
 
4.5%
y 25
 
4.5%
Other values (15) 144
26.1%
Decimal Number
ValueCountFrequency (%)
1 445
25.6%
2 333
19.1%
5 296
17.0%
0 130
 
7.5%
6 123
 
7.1%
4 94
 
5.4%
3 85
 
4.9%
9 81
 
4.7%
7 77
 
4.4%
8 75
 
4.3%
Other Punctuation
ValueCountFrequency (%)
, 54
33.8%
. 41
25.6%
& 39
24.4%
# 7
 
4.4%
' 6
 
3.8%
/ 5
 
3.1%
: 4
 
2.5%
· 2
 
1.2%
; 2
 
1.2%
Open Punctuation
ValueCountFrequency (%)
( 456
98.7%
[ 6
 
1.3%
Close Punctuation
ValueCountFrequency (%)
) 456
98.7%
] 6
 
1.3%
Math Symbol
ValueCountFrequency (%)
< 1
50.0%
> 1
50.0%
Space Separator
ValueCountFrequency (%)
2475
100.0%
Other Symbol
ValueCountFrequency (%)
34
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 59193
89.5%
Common 5317
 
8.0%
Latin 1627
 
2.5%
Han 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1815
 
3.1%
1087
 
1.8%
965
 
1.6%
839
 
1.4%
823
 
1.4%
801
 
1.4%
761
 
1.3%
759
 
1.3%
744
 
1.3%
677
 
1.1%
Other values (962) 49922
84.3%
Latin
ValueCountFrequency (%)
S 138
 
8.5%
C 114
 
7.0%
G 98
 
6.0%
e 85
 
5.2%
E 61
 
3.7%
A 55
 
3.4%
O 53
 
3.3%
P 51
 
3.1%
n 51
 
3.1%
M 49
 
3.0%
Other values (41) 872
53.6%
Common
ValueCountFrequency (%)
2475
46.5%
( 456
 
8.6%
) 456
 
8.6%
1 445
 
8.4%
2 333
 
6.3%
5 296
 
5.6%
0 130
 
2.4%
6 123
 
2.3%
4 94
 
1.8%
3 85
 
1.6%
Other values (19) 424
 
8.0%
Han
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 59159
89.4%
ASCII 6942
 
10.5%
None 36
 
0.1%
CJK 5
 
< 0.1%
CJK Compat Ideographs 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2475
35.7%
( 456
 
6.6%
) 456
 
6.6%
1 445
 
6.4%
2 333
 
4.8%
5 296
 
4.3%
S 138
 
2.0%
0 130
 
1.9%
6 123
 
1.8%
C 114
 
1.6%
Other values (69) 1976
28.5%
Hangul
ValueCountFrequency (%)
1815
 
3.1%
1087
 
1.8%
965
 
1.6%
839
 
1.4%
823
 
1.4%
801
 
1.4%
761
 
1.3%
759
 
1.3%
744
 
1.3%
677
 
1.1%
Other values (961) 49888
84.3%
None
ValueCountFrequency (%)
34
94.4%
· 2
 
5.6%
CJK Compat Ideographs
ValueCountFrequency (%)
2
100.0%
CJK
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

시군
Categorical

HIGH CORRELATION 

Distinct16
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
구미시
2810 
안동시
1301 
영주시
1162 
상주시
962 
경산시
723 
Other values (11)
3042 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row김천시
2nd row문경시
3rd row구미시
4th row영주시
5th row영주시

Common Values

ValueCountFrequency (%)
구미시 2810
28.1%
안동시 1301
13.0%
영주시 1162
11.6%
상주시 962
 
9.6%
경산시 723
 
7.2%
김천시 658
 
6.6%
문경시 582
 
5.8%
경주시 473
 
4.7%
영덕군 284
 
2.8%
예천군 276
 
2.8%
Other values (6) 769
 
7.7%

Length

2023-12-13T06:35:38.542816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
구미시 2810
28.1%
안동시 1301
13.0%
영주시 1162
11.6%
상주시 962
 
9.6%
경산시 723
 
7.2%
김천시 658
 
6.6%
문경시 582
 
5.8%
경주시 473
 
4.7%
영덕군 284
 
2.8%
예천군 276
 
2.8%
Other values (6) 769
 
7.7%
Distinct8118
Distinct (%)81.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-13T06:35:38.890512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length25
Mean length17.7446
Min length10

Characters and Unicode

Total characters177446
Distinct characters334
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6916 ?
Unique (%)69.2%

Sample

1st row경상북도 김천시 신기길 114
2nd row경상북도 문경시 호서로 26
3rd row경상북도 구미시 선산읍 단계동길 24
4th row경상북도 영주시 중앙로 71
5th row경상북도 영주시 영주로 273
ValueCountFrequency (%)
경상북도 8640
 
20.5%
구미시 2809
 
6.7%
안동시 1301
 
3.1%
영주시 1162
 
2.8%
상주시 962
 
2.3%
경산시 723
 
1.7%
김천시 658
 
1.6%
문경시 582
 
1.4%
경북 474
 
1.1%
경주시 473
 
1.1%
Other values (4511) 24423
57.9%
2023-12-13T06:35:39.386416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32207
18.2%
11497
 
6.5%
10156
 
5.7%
9411
 
5.3%
9191
 
5.2%
8729
 
4.9%
1 7321
 
4.1%
7210
 
4.1%
4703
 
2.7%
2 4701
 
2.6%
Other values (324) 72320
40.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 111290
62.7%
Space Separator 32207
 
18.2%
Decimal Number 31548
 
17.8%
Dash Punctuation 2396
 
1.4%
Other Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11497
 
10.3%
10156
 
9.1%
9411
 
8.5%
9191
 
8.3%
8729
 
7.8%
7210
 
6.5%
4703
 
4.2%
3308
 
3.0%
3301
 
3.0%
3008
 
2.7%
Other values (311) 40776
36.6%
Decimal Number
ValueCountFrequency (%)
1 7321
23.2%
2 4701
14.9%
3 3706
11.7%
4 2831
 
9.0%
5 2520
 
8.0%
6 2266
 
7.2%
7 2169
 
6.9%
0 2088
 
6.6%
8 2065
 
6.5%
9 1881
 
6.0%
Space Separator
ValueCountFrequency (%)
32207
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2396
100.0%
Other Punctuation
ValueCountFrequency (%)
. 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 111290
62.7%
Common 66156
37.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11497
 
10.3%
10156
 
9.1%
9411
 
8.5%
9191
 
8.3%
8729
 
7.8%
7210
 
6.5%
4703
 
4.2%
3308
 
3.0%
3301
 
3.0%
3008
 
2.7%
Other values (311) 40776
36.6%
Common
ValueCountFrequency (%)
32207
48.7%
1 7321
 
11.1%
2 4701
 
7.1%
3 3706
 
5.6%
4 2831
 
4.3%
5 2520
 
3.8%
- 2396
 
3.6%
6 2266
 
3.4%
7 2169
 
3.3%
0 2088
 
3.2%
Other values (3) 3951
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 111290
62.7%
ASCII 66156
37.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
32207
48.7%
1 7321
 
11.1%
2 4701
 
7.1%
3 3706
 
5.6%
4 2831
 
4.3%
5 2520
 
3.8%
- 2396
 
3.6%
6 2266
 
3.4%
7 2169
 
3.3%
0 2088
 
3.2%
Other values (3) 3951
 
6.0%
Hangul
ValueCountFrequency (%)
11497
 
10.3%
10156
 
9.1%
9411
 
8.5%
9191
 
8.3%
8729
 
7.8%
7210
 
6.5%
4703
 
4.2%
3308
 
3.0%
3301
 
3.0%
3008
 
2.7%
Other values (311) 40776
36.6%

Interactions

2023-12-13T06:35:36.977475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T06:35:39.754689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번지역화폐 분류시군
순번1.0000.6120.942
지역화폐 분류0.6121.0000.593
시군0.9420.5931.000
2023-12-13T06:35:39.844583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역화폐 분류시군
지역화폐 분류1.0000.399
시군0.3991.000
2023-12-13T06:35:39.930326image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번지역화폐 분류시군
순번1.0000.4570.760
지역화폐 분류0.4571.0000.399
시군0.7600.3991.000

Missing values

2023-12-13T06:35:37.103786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:35:37.207696image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번지역화폐 분류상호명시군도로명 주소
4369843699지류형와촌식육식당김천시경상북도 김천시 신기길 114
4928449285모바일형동아문구사문경시경상북도 문경시 호서로 26
1585115852지류형마루늘보구미시경상북도 구미시 선산읍 단계동길 24
9441994420카드형1001안경원영주시경상북도 영주시 중앙로 71
8557585576모바일형굿디자인영주시경상북도 영주시 영주로 273
3466134662카드형경북15바 1939구미시경상북도 구미시 송선로15길 27
3829138292카드형티아라헤어구미시경상북도 구미시 인동26길 22
9410994110카드형바이크 패밀리영주시경상북도 영주시 원당로 200
4180341804카드형육일식당군위군경상북도 군위군 효령면 중구2길 6
2578325784지류형돗소리인동점구미시경상북도 구미시 인동중앙로11길 19
순번지역화폐 분류상호명시군도로명 주소
8142081421지류형미인만들기 속눈썹영덕군경상북도 영덕군 영덕읍 중앙길 132-14
1911319114지류형치사랑벌초구미시구미시 산호대로39길 25
8560285603모바일형경화상회영주시경상북도 영주시 영주로192번길 17
2676526766지류형스마트통신구미시구미시 형곡로 196-1
4668346684카드형예스크린(부곡점드라이119세탁전문점)김천시경상북도 김천시 송설로 120
8559685597모바일형킹모텔영주시경상북도 영주시 영주로191번길 12
21912192카드형대성카워시경산시경상북도 경산시 대학로 206
6145961460카드형짬뽕시대상주시경상북도 상주시 복룡3길 45
9543495435모바일형동궁찜닭예천군경상북도 예천군 호명면 새움3로 52
5830658307지류형서울순대상주시경상북도 상주시 서성로 42