Overview

Dataset statistics

Number of variables6
Number of observations29
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory53.6 B

Variable types

Categorical4
Text1
Numeric1

Dataset

Description샘플 데이터
Author경기도경제과학진흥원
URLhttps://www.bigdata-region.kr/#/dataset/0790eb9a-08b4-4bca-b28a-2c9259f550b6

Alerts

시도명 has constant value ""Constant
년월 is highly overall correlated with 시군구명 and 1 other fieldsHigh correlation
시군구명 is highly overall correlated with 년월 and 1 other fieldsHigh correlation
결제상품명 is highly overall correlated with 년월 and 1 other fieldsHigh correlation
년월 is highly imbalanced (78.4%)Imbalance

Reproduction

Analysis started2023-12-10 13:48:04.778456
Analysis finished2023-12-10 13:48:05.665946
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

년월
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)6.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
2019-04
28 
2019-03
 
1

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique1 ?
Unique (%)3.4%

Sample

1st row2019-03
2nd row2019-04
3rd row2019-04
4th row2019-04
5th row2019-04

Common Values

ValueCountFrequency (%)
2019-04 28
96.6%
2019-03 1
 
3.4%

Length

2023-12-10T22:48:05.808367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:48:05.984064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-04 28
96.6%
2019-03 1
 
3.4%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size364.0 B
경기도
29 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 29
100.0%

Length

2023-12-10T22:48:06.150574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:48:06.300864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 29
100.0%

시군구명
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)13.8%
Missing0
Missing (%)0.0%
Memory size364.0 B
고양시 덕양구
19 
가평군
양주시
 
1
고양시 일산동구
 
1

Length

Max length8
Median length7
Mean length5.7931034
Min length3

Unique

Unique2 ?
Unique (%)6.9%

Sample

1st row양주시
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
고양시 덕양구 19
65.5%
가평군 8
27.6%
양주시 1
 
3.4%
고양시 일산동구 1
 
3.4%

Length

2023-12-10T22:48:06.475334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:48:06.668036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고양시 20
40.8%
덕양구 19
38.8%
가평군 8
 
16.3%
양주시 1
 
2.0%
일산동구 1
 
2.0%

동명
Text

Distinct22
Distinct (%)75.9%
Missing0
Missing (%)0.0%
Memory size364.0 B
2023-12-10T22:48:06.953324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length2.862069
Min length2

Characters and Unicode

Total characters83
Distinct characters37
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)69.0%

Sample

1st row장흥면
2nd row가평읍
3rd row북면
4th row북면
5th row북면
ValueCountFrequency (%)
성사동 6
20.7%
북면 3
 
10.3%
신원동 1
 
3.4%
화정동 1
 
3.4%
행신동 1
 
3.4%
토당동 1
 
3.4%
주교동 1
 
3.4%
원흥동 1
 
3.4%
원당동 1
 
3.4%
용두동 1
 
3.4%
Other values (12) 12
41.4%
2023-12-10T22:48:07.429591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
21
25.3%
8
 
9.6%
6
 
7.2%
6
 
7.2%
3
 
3.6%
3
 
3.6%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
Other values (27) 28
33.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
21
25.3%
8
 
9.6%
6
 
7.2%
6
 
7.2%
3
 
3.6%
3
 
3.6%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
Other values (27) 28
33.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 83
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
21
25.3%
8
 
9.6%
6
 
7.2%
6
 
7.2%
3
 
3.6%
3
 
3.6%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
Other values (27) 28
33.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 83
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
21
25.3%
8
 
9.6%
6
 
7.2%
6
 
7.2%
3
 
3.6%
3
 
3.6%
2
 
2.4%
2
 
2.4%
2
 
2.4%
2
 
2.4%
Other values (27) 28
33.7%

결제상품명
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)31.0%
Missing0
Missing (%)0.0%
Memory size364.0 B
고양페이카드
16 
가평사랑상품권
수원페이
양주사랑카드
 
1
오산화폐 오색전
 
1
Other values (4)

Length

Max length10
Median length6
Mean length6.3793103
Min length4

Unique

Unique6 ?
Unique (%)20.7%

Sample

1st row양주사랑카드
2nd row가평사랑상품권
3rd row가평사랑상품권
4th row수원페이
5th row오산화폐 오색전

Common Values

ValueCountFrequency (%)
고양페이카드 16
55.2%
가평사랑상품권 5
 
17.2%
수원페이 2
 
6.9%
양주사랑카드 1
 
3.4%
오산화폐 오색전 1
 
3.4%
용인와이페이 1
 
3.4%
안산사랑상품권 다온 1
 
3.4%
과천화폐 과천토리 1
 
3.4%
의정부사랑카드 1
 
3.4%

Length

2023-12-10T22:48:07.655679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:48:07.917183image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고양페이카드 16
50.0%
가평사랑상품권 5
 
15.6%
수원페이 2
 
6.2%
양주사랑카드 1
 
3.1%
오산화폐 1
 
3.1%
오색전 1
 
3.1%
용인와이페이 1
 
3.1%
안산사랑상품권 1
 
3.1%
다온 1
 
3.1%
과천화폐 1
 
3.1%
Other values (2) 2
 
6.2%

사용빈도
Real number (ℝ)

Distinct13
Distinct (%)44.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.689655
Minimum1
Maximum123
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size393.0 B
2023-12-10T22:48:08.136815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median3
Q39
95-th percentile51.8
Maximum123
Range122
Interquartile range (IQR)8

Descriptive statistics

Standard deviation25.257394
Coefficient of variation (CV)1.9903925
Kurtosis13.378896
Mean12.689655
Median Absolute Deviation (MAD)2
Skewness3.4566429
Sum368
Variance637.93596
MonotonicityNot monotonic
2023-12-10T22:48:08.333651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
1 10
34.5%
3 4
 
13.8%
6 3
 
10.3%
17 2
 
6.9%
2 2
 
6.9%
123 1
 
3.4%
5 1
 
3.4%
9 1
 
3.4%
7 1
 
3.4%
35 1
 
3.4%
Other values (3) 3
 
10.3%
ValueCountFrequency (%)
1 10
34.5%
2 2
 
6.9%
3 4
 
13.8%
5 1
 
3.4%
6 3
 
10.3%
7 1
 
3.4%
9 1
 
3.4%
11 1
 
3.4%
17 2
 
6.9%
35 1
 
3.4%
ValueCountFrequency (%)
123 1
 
3.4%
59 1
 
3.4%
41 1
 
3.4%
35 1
 
3.4%
17 2
6.9%
11 1
 
3.4%
9 1
 
3.4%
7 1
 
3.4%
6 3
10.3%
5 1
 
3.4%

Interactions

2023-12-10T22:48:05.167376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:48:08.467871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년월시군구명동명결제상품명사용빈도
년월1.0001.0001.0001.0000.000
시군구명1.0001.0001.0000.7970.000
동명1.0001.0001.0000.0000.952
결제상품명1.0000.7970.0001.0000.000
사용빈도0.0000.0000.9520.0001.000
2023-12-10T22:48:09.024930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
년월결제상품명시군구명
년월1.0000.8610.962
결제상품명0.8611.0000.582
시군구명0.9620.5821.000
2023-12-10T22:48:09.217236image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사용빈도년월시군구명결제상품명
사용빈도1.0000.0000.0000.000
년월0.0001.0000.9620.861
시군구명0.0000.9621.0000.582
결제상품명0.0000.8610.5821.000

Missing values

2023-12-10T22:48:05.389655image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:48:05.580213image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

년월시도명시군구명동명결제상품명사용빈도
02019-03경기도양주시장흥면양주사랑카드3
12019-04경기도가평군가평읍가평사랑상품권123
22019-04경기도가평군북면가평사랑상품권6
32019-04경기도가평군북면수원페이1
42019-04경기도가평군북면오산화폐 오색전1
52019-04경기도가평군상면가평사랑상품권1
62019-04경기도가평군설악면고양페이카드1
72019-04경기도가평군조종면가평사랑상품권3
82019-04경기도가평군청평면가평사랑상품권17
92019-04경기도고양시 덕양구고양동고양페이카드5
년월시도명시군구명동명결제상품명사용빈도
192019-04경기도고양시 덕양구신원동고양페이카드7
202019-04경기도고양시 덕양구오금동고양페이카드1
212019-04경기도고양시 덕양구용두동고양페이카드1
222019-04경기도고양시 덕양구원당동고양페이카드3
232019-04경기도고양시 덕양구원흥동고양페이카드1
242019-04경기도고양시 덕양구주교동고양페이카드35
252019-04경기도고양시 덕양구토당동고양페이카드3
262019-04경기도고양시 덕양구행신동고양페이카드41
272019-04경기도고양시 덕양구화정동고양페이카드59
282019-04경기도고양시 일산동구마두동고양페이카드11