Overview

Dataset statistics

Number of variables: 6
Number of observations: 29
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.5 KiB
Average record size in memory: 53.6 B

Variable types

Categorical: 4
Text: 1
Numeric: 1

Dataset

Description: Sample data
Author: 경기도경제과학진흥원
URL: https://www.bigdata-region.kr/#/dataset/b126fea2-7ff1-40c5-a2bc-da4481f885b0

Alerts

Constant: 시도명 has the constant value "경기도"
High correlation: 시군구명 is highly correlated overall with 년월
High correlation: 년월 is highly correlated overall with 시군구명
Imbalance: 년월 is highly imbalanced (63.8%)
Imbalance: 시군구명 is highly imbalanced (63.7%)
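
The two imbalance percentages are consistent with one minus the normalized Shannon entropy of the category counts. The report does not state the formula, so the sketch below is an assumption that happens to reproduce the alert values; the function name `imbalance_score` is hypothetical, and the counts are taken from the Common Values tables for 년월 and 시군구명.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the class shares.

    The score is 0 for a perfectly even split and approaches 1 as a single
    class dominates. Assumed formula, not taken from the report itself.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no split to measure
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# 년월: 2019-04 appears 27 times, 2019-03 appears 2 times.
print(f"년월: {imbalance_score([27, 2]):.1%}")
# 시군구명: 가평군 26, 양주시 2, 고양시 덕양구 1.
print(f"시군구명: {imbalance_score([26, 2, 1]):.1%}")
```

Normalizing by log2(k) keeps the score comparable between variables with different numbers of categories, which is why the two-class and three-class variables land on nearly the same value.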

Reproduction

Analysis started: 2023-12-10 14:20:18.048120
Analysis finished: 2023-12-10 14:20:18.489919
Duration: 0.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

년월 (year-month)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): 6.9%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2019-03
2nd row: 2019-03
3rd row: 2019-04
4th row: 2019-04
5th row: 2019-04

Common Values

Value    Count  Frequency (%)
2019-04  27     93.1%
2019-03  2      6.9%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Words

Value    Count  Frequency (%)
2019-04  27     93.1%
2019-03  2      6.9%

시도명 (province name)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경기도
2nd row: 경기도
3rd row: 경기도
4th row: 경기도
5th row: 경기도

Common Values

Value   Count  Frequency (%)
경기도  29     100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Words

Value   Count  Frequency (%)
경기도  29     100.0%

시군구명 (city/county/district name)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 3
Distinct (%): 10.3%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 7
Median length: 3
Mean length: 3.137931
Min length: 3

Unique

Unique: 1
Unique (%): 3.4%
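
Distinct and Unique measure different things: Distinct counts how many different values occur, while Unique counts values that occur exactly once. A minimal sketch, assuming the column can be rebuilt from the counts in the Common Values table for 시군구명 (the variable names are illustrative, not part of the report):

```python
from collections import Counter

# Column rebuilt from the reported counts: 가평군 26, 양주시 2, 고양시 덕양구 1.
values = ["가평군"] * 26 + ["양주시"] * 2 + ["고양시 덕양구"]

counts = Counter(values)
n = len(values)

distinct = len(counts)                              # different values present
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
mean_length = sum(len(v) for v in values) / n       # characters per value

print(f"Distinct: {distinct} ({distinct / n:.1%})")
print(f"Unique: {unique} ({unique / n:.1%})")
print(f"Mean length: {mean_length:.6f}")
```

Only 고양시 덕양구 occurs once, so it is the single unique value, and its 7 characters (including the space) are what lift the mean length above 3.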

Sample

1st row: 양주시
2nd row: 양주시
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value          Count  Frequency (%)
가평군         26     89.7%
양주시         2      6.9%
고양시 덕양구  1      3.4%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Words

Value   Count  Frequency (%)
가평군  26     86.7%
양주시  2      6.7%
고양시  1      3.3%
덕양구  1      3.3%
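
The word-level percentages differ from the Common Values percentages because the denominator changes: 고양시 덕양구 counts as two words, so there are 30 words across 29 rows. A minimal sketch, assuming plain whitespace tokenization (an inference from the numbers, not something the report states):

```python
from collections import Counter

# Column rebuilt from the reported counts for 시군구명.
values = ["가평군"] * 26 + ["양주시"] * 2 + ["고양시 덕양구"]

# Split each value on whitespace and count the resulting words.
words = Counter(word for value in values for word in value.split())
total_words = sum(words.values())

for word, count in words.most_common():
    print(f"{word}\t{count}\t{count / total_words:.1%}")
```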

성별코드 (gender code)
Categorical

Distinct: 2
Distinct (%): 6.9%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F
2nd row: F
3rd row: F
4th row: F
5th row: F

Common Values

Value  Count  Frequency (%)
F      15     51.7%
M      14     48.3%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Words

Value  Count  Frequency (%)
f      15     51.7%
m      14     48.3%

가맹점업종명 (merchant business category name)
Text

Distinct: 15
Distinct (%): 51.7%
Missing: 0
Missing (%): 0.0%
Memory size: 364.0 B

Length

Max length: 8
Median length: 7
Mean length: 4.6896552
Min length: 2

Characters and Unicode

Total characters: 136
Distinct characters: 52
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
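
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below runs on a handful of 가맹점업종명 values that appear in this report, not on the full column, so its counts are smaller than the reported totals. The `CATEGORY_NAMES` mapping is a helper written for this example; scripts and blocks are not available from the standard library and are left out.

```python
import unicodedata
from collections import Counter

# A few values shown in this report; a subset, not the full column.
samples = ["유통업 영리", "일반휴게음식", "회원제형태", "기타", "신변잡화", "문화.취미"]

# Long names for the three general categories the report lists.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
}

# Tally the general category of every character in the samples.
categories = Counter(
    unicodedata.category(ch) for text in samples for ch in text
)

for code, count in categories.most_common():
    print(f"{CATEGORY_NAMES.get(code, code)}: {count}")
```

Hangul syllables fall under Other Letter, the space under Space Separator, and the full stop in 문화.취미 under Other Punctuation, which matches the three categories listed for this variable.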

Unique

Unique: 4
Unique (%): 13.8%

Sample

1st row: 유통업 영리
2nd row: 일반휴게음식
3rd row: 회원제형태
4th row: 기타
5th row: 신변잡화

Words

Value             Count  Frequency (%)
유통업            5      14.3%
영리              3      8.6%
일반휴게음식      3      8.6%
신변잡화          3      8.6%
비영리            2      5.7%
연료판매점        2      5.7%
보건위생          2      5.7%
문화.취미         2      5.7%
레저업소          2      5.7%
의원              2      5.7%
Other values (7)  9      25.7%

Most occurring characters

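Value              Count  Frequency (%)
(not shown)        7      5.1%
(not shown)        6      4.4%
(not shown)        6      4.4%
(not shown)        5      3.7%
(not shown)        5      3.7%
(not shown)        5      3.7%
(not shown)        5      3.7%
(not shown)        5      3.7%
(not shown)        5      3.7%
(not shown)        4      2.9%
Other values (42)  83     61.0%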

Most occurring categories

Value              Count  Frequency (%)
Other Letter       128    94.1%
Space Separator    6      4.4%
Other Punctuation  2      1.5%

Most frequent character per category

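Other Letter

Value              Count  Frequency (%)
(not shown)        7      5.5%
(not shown)        6      4.7%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        4      3.1%
(not shown)        4      3.1%
Other values (40)  77     60.2%

Space Separator

Value    Count  Frequency (%)
(space)  6      100.0%

Other Punctuation

Value  Count  Frequency (%)
.      2      100.0%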

Most occurring scripts

Value   Count  Frequency (%)
Hangul  128    94.1%
Common  8      5.9%

Most frequent character per script

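Hangul

Value              Count  Frequency (%)
(not shown)        7      5.5%
(not shown)        6      4.7%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        4      3.1%
(not shown)        4      3.1%
Other values (40)  77     60.2%

Common

Value    Count  Frequency (%)
(space)  6      75.0%
.        2      25.0%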

Most occurring blocks

Value   Count  Frequency (%)
Hangul  128    94.1%
ASCII   8      5.9%

Most frequent character per block

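Hangul

Value              Count  Frequency (%)
(not shown)        7      5.5%
(not shown)        6      4.7%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        5      3.9%
(not shown)        4      3.1%
(not shown)        4      3.1%
Other values (40)  77     60.2%

ASCII

Value    Count  Frequency (%)
(space)  6      75.0%
.        2      25.0%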

총결제금액 (total payment amount)
Real number (ℝ)

Distinct: 28
Distinct (%): 96.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 155736.83
Minimum: 1004
Maximum: 1365900
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 393.0 B

Quantile statistics

Minimum: 1004
5-th percentile: 2270
Q1: 10450
Median: 50000
Q3: 141600
95-th percentile: 698205.6
Maximum: 1365900
Range: 1364896
Interquartile range (IQR): 131150
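
These quantiles can be reproduced with linear interpolation between order statistics. The 29 values below are a reconstruction pieced together from the minimum and maximum value tables and the sample rows of this report; their total equals the reported Sum (4516368), but the list is not printed anywhere in the report as such. The helper name `quantile` and the interpolation rule are assumptions that happen to match the reported figures.

```python
# 총결제금액 values, sorted ascending (reconstructed from this report).
values = [
    1004, 1450, 3500, 7600, 8300, 10000, 10000, 10450, 12200, 18600,
    23700, 26200, 45500, 49100, 50000, 66000, 70000, 79000, 83390, 100000,
    111100, 141600, 167540, 174600, 292590, 303530, 359514, 924000, 1365900,
]

def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_values) - 1)   # fractional index into the sorted data
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (pos - lo) * (sorted_values[hi] - sorted_values[lo])

q1, q3 = quantile(values, 0.25), quantile(values, 0.75)
print("5-th percentile:", round(quantile(values, 0.05), 1))
print("Q1:", q1, "Median:", quantile(values, 0.5), "Q3:", q3)
print("95-th percentile:", round(quantile(values, 0.95), 1))
print("IQR:", q3 - q1, "Range:", values[-1] - values[0])
```

With 29 observations, Q1, the median, and Q3 fall exactly on the 8th, 15th, and 22nd sorted values, while the 5th and 95th percentiles fall between two observations and are interpolated.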

Descriptive statistics

Standard deviation: 296145.38
Coefficient of variation (CV): 1.9015758
Kurtosis: 11.044956
Mean: 155736.83
Median Absolute Deviation (MAD): 41700
Skewness: 3.2450869
Sum: 4516368
Variance: 8.7702084 × 10^10
Monotonicity: Not monotonic
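
Several of these statistics follow directly from the data with Python's `statistics` module. As an assumption, the sketch uses the 29 values reconstructed from the extreme value tables and sample rows of this report (their total matches the reported Sum), and it treats the standard deviation as the sample estimate with an n - 1 denominator, which is what reproduces the reported figure. Skewness and kurtosis are left out.

```python
import statistics

# 총결제금액 values reconstructed from this report (not printed as one list there).
values = [
    1004, 1450, 3500, 7600, 8300, 10000, 10000, 10450, 12200, 18600,
    23700, 26200, 45500, 49100, 50000, 66000, 70000, 79000, 83390, 100000,
    111100, 141600, 167540, 174600, 292590, 303530, 359514, 924000, 1365900,
]

mean = statistics.mean(values)
std = statistics.stdev(values)    # sample standard deviation (n - 1 denominator)
cv = std / mean                   # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median distance of each value from the median.
mad = statistics.median(abs(v - median) for v in values)

print(f"Sum: {sum(values)}")
print(f"Mean: {mean:.2f}")
print(f"Standard deviation: {std:.2f}")
print(f"CV: {cv:.7f}")
print(f"MAD: {mad}")
```

A CV close to 1.9 means the standard deviation is almost twice the mean, which fits the strong right skew: the two largest payments (924000 and 1365900) together make up about half of the total.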

[Figure: histogram with fixed size bins (bins=28)]

Common values

Value              Count  Frequency (%)
10000              2      6.9%
7600               1      3.4%
3500               1      3.4%
1450               1      3.4%
1365900            1      3.4%
359514             1      3.4%
292590             1      3.4%
111100             1      3.4%
100000             1      3.4%
83390              1      3.4%
Other values (18)  18     62.1%

Minimum 10 values

Value  Count  Frequency (%)
1004   1      3.4%
1450   1      3.4%
3500   1      3.4%
7600   1      3.4%
8300   1      3.4%
10000  2      6.9%
10450  1      3.4%
12200  1      3.4%
18600  1      3.4%
23700  1      3.4%

Maximum 10 values

Value    Count  Frequency (%)
1365900  1      3.4%
924000   1      3.4%
359514   1      3.4%
303530   1      3.4%
292590   1      3.4%
174600   1      3.4%
167540   1      3.4%
141600   1      3.4%
111100   1      3.4%
100000   1      3.4%

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]

Variable      년월   시군구명  성별코드  가맹점업종명  총결제금액
년월          1.000  1.000     0.000     0.000         0.000
시군구명      1.000  1.000     0.123     0.000         0.000
성별코드      0.000  0.123     1.000     0.000         0.184
가맹점업종명  0.000  0.000     0.000     1.000         0.000
총결제금액    0.000  0.000     0.184     0.000         1.000

[Figure: correlation heatmap]

Variable  성별코드  시군구명  년월
성별코드  1.000     0.194     0.000
시군구명  0.194     1.000     0.981
년월      0.000     0.981     1.000

[Figure: correlation heatmap]

Variable    총결제금액  년월   시군구명  성별코드
총결제금액  1.000       0.000  0.000     0.208
년월        0.000       1.000  0.981     0.000
시군구명    0.000       0.981  1.000     0.194
성별코드    0.208       0.000  0.194     1.000
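
The 0.981 value for 년월 and 시군구명 is consistent with a bias-corrected Cramér's V. The report text does not name the measure behind each matrix, so this is an inference rather than a documented fact. The sketch below assumes the cross-tabulation implied by the sample rows and counts (both 2019-03 rows are 양주시; the 27 rows for 2019-04 are 26 가평군 and 1 고양시 덕양구); the function name `cramers_v_corrected` is hypothetical.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    r, k = len(row_totals), len(col_totals)
    # Small-sample corrections applied to phi-squared and to the table dimensions.
    phi2 = max(0.0, chi2 / n - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2 / min(k_corr - 1, r_corr - 1))

# Rows: 2019-03, 2019-04. Columns: 가평군, 양주시, 고양시 덕양구.
table = [[0, 2, 0],
         [26, 0, 1]]
print(round(cramers_v_corrected(table), 3))
```

Without the correction the statistic would be exactly 1, because every 시군구명 value maps to a single 년월 value. With only 29 rows the correction pulls the estimate slightly below 1, and the strong association largely reflects that 26 of 29 rows share the same pair of values.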

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

Index  년월     시도명  시군구명  성별코드  가맹점업종명  총결제금액
0      2019-03  경기도  양주시    F         유통업 영리   7600
1      2019-03  경기도  양주시    F         일반휴게음식  26200
2      2019-04  경기도  가평군    F         회원제형태    10000
3      2019-04  경기도  가평군    F         기타          10450
4      2019-04  경기도  가평군    F         신변잡화      45500
5      2019-04  경기도  가평군    F         음료식품      49100
6      2019-04  경기도  가평군    F         의원          50000
7      2019-04  경기도  가평군    F         레저업소      66000
8      2019-04  경기도  가평군    F         문화.취미     79000
9      2019-04  경기도  가평군    F         보건위생      141600

Last rows

Index  년월     시도명  시군구명       성별코드  가맹점업종명     총결제금액
19     2019-04  경기도  가평군         M         약국             18600
20     2019-04  경기도  가평군         M         레저업소         23700
21     2019-04  경기도  가평군         M         보건위생         70000
22     2019-04  경기도  가평군         M         유통업 비영리    83390
23     2019-04  경기도  가평군         M         자동차정비 유지  100000
24     2019-04  경기도  가평군         M         음료식품         111100
25     2019-04  경기도  가평군         M         유통업 영리      292590
26     2019-04  경기도  가평군         M         연료판매점       359514
27     2019-04  경기도  가평군         M         일반휴게음식     1365900
28     2019-04  경기도  고양시 덕양구  F         신변잡화         1450