Overview

Dataset statistics

Number of variables5
Number of observations754
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory31.8 KiB
Average record size in memory43.2 B

Variable types

Numeric3
Categorical2

Alerts

기준년도 is highly overall correlated with 소비자물가지수High correlation
지출목적코드 is highly overall correlated with 지출목적명High correlation
소비자물가지수 is highly overall correlated with 기준년도High correlation
지출목적명 is highly overall correlated with 지출목적코드High correlation
지출목적코드 has 58 (7.7%) zerosZeros

Reproduction

Analysis started2024-03-12 23:08:59.932555
Analysis finished2024-03-12 23:09:00.894535
Duration0.96 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2020.569
Minimum2018
Maximum2023
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.8 KiB
2024-03-13T08:09:00.933647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2018
5-th percentile2018
Q12019
median2021
Q32022
95-th percentile2023
Maximum2023
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.6940305
Coefficient of variation (CV)0.00083839283
Kurtosis-1.2388053
Mean2020.569
Median Absolute Deviation (MAD)1
Skewness-0.054871545
Sum1523509
Variance2.8697394
MonotonicityDecreasing
2024-03-13T08:09:01.032272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2023 130
17.2%
2022 130
17.2%
2021 130
17.2%
2020 130
17.2%
2019 117
15.5%
2018 117
15.5%
ValueCountFrequency (%)
2018 117
15.5%
2019 117
15.5%
2020 130
17.2%
2021 130
17.2%
2022 130
17.2%
2023 130
17.2%
ValueCountFrequency (%)
2023 130
17.2%
2022 130
17.2%
2021 130
17.2%
2020 130
17.2%
2019 117
15.5%
2018 117
15.5%

도시명
Categorical

Distinct10
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size6.0 KiB
안양시
78 
부천시
78 
고양시
78 
안산시
78 
용인시
78 
Other values (5)
364 

Length

Max length4
Median length3
Mean length3.1034483
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row화성시
2nd row화성시
3rd row안양시
4th row부천시
5th row부천시

Common Values

ValueCountFrequency (%)
안양시 78
10.3%
부천시 78
10.3%
고양시 78
10.3%
안산시 78
10.3%
용인시 78
10.3%
경기도 78
10.3%
수원시 78
10.3%
성남시 78
10.3%
의정부시 78
10.3%
화성시 52
6.9%

Length

2024-03-13T08:09:01.143257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T08:09:01.237507image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안양시 78
10.3%
부천시 78
10.3%
고양시 78
10.3%
안산시 78
10.3%
용인시 78
10.3%
경기도 78
10.3%
수원시 78
10.3%
성남시 78
10.3%
의정부시 78
10.3%
화성시 52
6.9%

지출목적코드
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct13
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6
Minimum0
Maximum12
Zeros58
Zeros (%)7.7%
Negative0
Negative (%)0.0%
Memory size6.8 KiB
2024-03-13T08:09:01.339002image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q13
median6
Q39
95-th percentile12
Maximum12
Range12
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.7441411
Coefficient of variation (CV)0.62402351
Kurtosis-1.2143767
Mean6
Median Absolute Deviation (MAD)3
Skewness0
Sum4524
Variance14.018592
MonotonicityNot monotonic
2024-03-13T08:09:01.425683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=13)
ValueCountFrequency (%)
11 58
 
7.7%
12 58
 
7.7%
0 58
 
7.7%
1 58
 
7.7%
2 58
 
7.7%
3 58
 
7.7%
4 58
 
7.7%
5 58
 
7.7%
6 58
 
7.7%
7 58
 
7.7%
Other values (3) 174
23.1%
ValueCountFrequency (%)
0 58
7.7%
1 58
7.7%
2 58
7.7%
3 58
7.7%
4 58
7.7%
5 58
7.7%
6 58
7.7%
7 58
7.7%
8 58
7.7%
9 58
7.7%
ValueCountFrequency (%)
12 58
7.7%
11 58
7.7%
10 58
7.7%
9 58
7.7%
8 58
7.7%
7 58
7.7%
6 58
7.7%
5 58
7.7%
4 58
7.7%
3 58
7.7%

지출목적명
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Memory size6.0 KiB
음식 및 숙박
58 
기타 상품 및 서비스
58 
총지수
58 
식료품 및 비주류음료
58 
주류 및 담배
58 
Other values (8)
464 

Length

Max length15
Median length13
Mean length6.8461538
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row음식 및 숙박
2nd row기타 상품 및 서비스
3rd row기타 상품 및 서비스
4th row총지수
5th row식료품 및 비주류음료

Common Values

ValueCountFrequency (%)
음식 및 숙박 58
 
7.7%
기타 상품 및 서비스 58
 
7.7%
총지수 58
 
7.7%
식료품 및 비주류음료 58
 
7.7%
주류 및 담배 58
 
7.7%
의류 및 신발 58
 
7.7%
주택, 수도, 전기 및 연료 58
 
7.7%
가정용품 및 가사 서비스 58
 
7.7%
보건 58
 
7.7%
교통 58
 
7.7%
Other values (3) 174
23.1%

Length

2024-03-13T08:09:01.527553image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
464
24.2%
서비스 116
 
6.1%
음식 58
 
3.0%
수도 58
 
3.0%
문화 58
 
3.0%
오락 58
 
3.0%
통신 58
 
3.0%
교통 58
 
3.0%
보건 58
 
3.0%
가사 58
 
3.0%
Other values (15) 870
45.5%

소비자물가지수
Real number (ℝ)

HIGH CORRELATION 

Distinct526
Distinct (%)69.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean102.97966
Minimum94.215
Maximum120.36
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size6.8 KiB
2024-03-13T08:09:01.625264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum94.215
5-th percentile97.6821
Q1100
median100.945
Q3104.47675
95-th percentile114.6795
Maximum120.36
Range26.145
Interquartile range (IQR)4.47675

Descriptive statistics

Standard deviation5.2379713
Coefficient of variation (CV)0.050864133
Kurtosis1.070426
Mean102.97966
Median Absolute Deviation (MAD)1.72
Skewness1.3263489
Sum77646.665
Variance27.436343
MonotonicityNot monotonic
2024-03-13T08:09:01.748636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100.0 131
 
17.4%
100.06 10
 
1.3%
101.14 10
 
1.3%
99.16 10
 
1.3%
102.093 6
 
0.8%
104.461 6
 
0.8%
100.52 4
 
0.5%
102.59 3
 
0.4%
103.5 3
 
0.4%
104.467 3
 
0.4%
Other values (516) 568
75.3%
ValueCountFrequency (%)
94.215 1
0.1%
94.254 1
0.1%
94.93 1
0.1%
95.044 1
0.1%
95.07 1
0.1%
95.094 1
0.1%
95.184 1
0.1%
95.418 1
0.1%
95.479 1
0.1%
95.521 1
0.1%
ValueCountFrequency (%)
120.36 1
0.1%
119.6 1
0.1%
119.44 1
0.1%
119.33 1
0.1%
119.04 1
0.1%
118.86 1
0.1%
118.72 1
0.1%
118.53 1
0.1%
118.49 1
0.1%
118.46 1
0.1%

Interactions

2024-03-13T08:09:00.528601image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T08:09:00.101143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T08:09:00.316336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T08:09:00.597587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T08:09:00.168660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T08:09:00.386234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T08:09:00.682607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T08:09:00.240784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T08:09:00.456221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T08:09:01.839485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도도시명지출목적코드지출목적명소비자물가지수
기준년도1.0000.0000.0000.0000.862
도시명0.0001.0000.0000.0000.000
지출목적코드0.0000.0001.0001.0000.562
지출목적명0.0000.0001.0001.0000.585
소비자물가지수0.8620.0000.5620.5851.000
2024-03-13T08:09:01.918988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
도시명지출목적명
도시명1.0000.000
지출목적명0.0001.000
2024-03-13T08:09:02.213848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도지출목적코드소비자물가지수도시명지출목적명
기준년도1.0000.0000.7670.0000.000
지출목적코드0.0001.0000.0470.0000.998
소비자물가지수0.7670.0471.0000.0000.288
도시명0.0000.0000.0001.0000.000
지출목적명0.0000.9980.2880.0001.000

Missing values

2024-03-13T08:09:00.770282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T08:09:00.860990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기준년도도시명지출목적코드지출목적명소비자물가지수
02023화성시11음식 및 숙박119.04
12023화성시12기타 상품 및 서비스113.94
22023안양시12기타 상품 및 서비스114.62
32023부천시0총지수111.32
42023부천시1식료품 및 비주류음료118.16
52023부천시2주류 및 담배104.32
62023부천시3의류 및 신발110.63
72023부천시4주택, 수도, 전기 및 연료110.78
82023부천시5가정용품 및 가사 서비스112.41
92023부천시6보건102.01
기준년도도시명지출목적코드지출목적명소비자물가지수
7442018안양시2주류 및 담배98.258
7452018안양시3의류 및 신발98.137
7462018안양시4주택, 수도, 전기 및 연료97.379
7472018안양시5가정용품 및 가사 서비스97.467
7482018안양시6보건98.187
7492018안양시7교통103.037
7502018안양시8통신104.467
7512018안양시9오락 및 문화101.454
7522018안양시10교육102.492
7532018안양시11음식 및 숙박96.899