Overview

Dataset statistics

Number of variables: 6
Number of observations: 524
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 27.2 KiB
Average record size in memory: 53.3 B

Variable types

Numeric: 4
Categorical: 2

Dataset

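Description: A service for querying the nationwide land price fluctuation rate survey statistics provided by the Korea Real Estate Board (formerly the Korea Appraisal Board). For Chungnam, it provides land price index information by year and by region for the given period and area.
Author: Chungcheongnam-do (충청남도)
URL: https://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2535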

Alerts

지역명 is highly overall correlated with 번호 and 2 other fields (High correlation)
지역구분 레벨 is highly overall correlated with 번호 and 2 other fields (High correlation)
번호 is highly overall correlated with 지역코드 and 2 other fields (High correlation)
지역코드 is highly overall correlated with 번호 and 2 other fields (High correlation)
조사년도 is highly overall correlated with 지수_평균 (High correlation)
지수_평균 is highly overall correlated with 조사년도 (High correlation)
지역구분 레벨 is highly imbalanced (59.0%) (Imbalance)
번호 has unique values (Unique)
지수_평균 has unique values (Unique)

Reproduction

Analysis started: 2024-01-09 20:18:28.655928
Analysis finished: 2024-01-09 20:18:30.853662
Duration: 2.2 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 524
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 262.5
Minimum: 1
Maximum: 524
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 27.15
Q1: 131.75
Median: 262.5
Q3: 393.25
95-th percentile: 497.85
Maximum: 524
Range: 523
Interquartile range (IQR): 261.5

Descriptive statistics

Standard deviation: 151.41004
Coefficient of variation (CV): 0.57680015
Kurtosis: -1.2
Mean: 262.5
Median Absolute Deviation (MAD): 131
Skewness: 0
Sum: 137550
Variance: 22925
Monotonicity: Strictly increasing
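
Because 번호 is strictly increasing with 524 distinct values between 1 and 524, its summary statistics can be checked by hand. The sketch below assumes the column is exactly the integers 1 through 524 (consistent with the reported minimum, maximum, and sum) and that percentiles use linear interpolation between order statistics. The `percentile` helper is written for this illustration and is not taken from the report or from ydata-profiling.

```python
import statistics

# Assumption: 번호 is the row counter 1..524 (matches min, max, and sum in the report).
values = [float(i) for i in range(1, 525)]


def percentile(sorted_values, q):
    """Linear-interpolation percentile for q in [0, 1] (illustrative helper)."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean  # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation
iqr = percentile(values, 0.75) - percentile(values, 0.25)

print(mean, variance, round(std_dev, 5), round(cv, 8), mad, iqr)
```

Under these assumptions the values agree with the reported mean (262.5), variance (22925), standard deviation (151.41004), CV (0.57680015), MAD (131), and IQR (261.5).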
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.2%
362 | 1 | 0.2%
360 | 1 | 0.2%
359 | 1 | 0.2%
358 | 1 | 0.2%
357 | 1 | 0.2%
356 | 1 | 0.2%
355 | 1 | 0.2%
354 | 1 | 0.2%
353 | 1 | 0.2%
Other values (514) | 514 | 98.1%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
524 | 1 | 0.2%
523 | 1 | 0.2%
522 | 1 | 0.2%
521 | 1 | 0.2%
520 | 1 | 0.2%
519 | 1 | 0.2%
518 | 1 | 0.2%
517 | 1 | 0.2%
516 | 1 | 0.2%
515 | 1 | 0.2%

지역코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 18
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 44449.725
Minimum: 44000
Maximum: 44825
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.7 KiB

Quantile statistics

Minimum: 44000
5-th percentile: 44000
Q1: 44150
Median: 44250
Q3: 44790
95-th percentile: 44825
Maximum: 44825
Range: 825
Interquartile range (IQR): 640

Descriptive statistics

Standard deviation: 314.68218
Coefficient of variation (CV): 0.007079508
Kurtosis: -1.8431891
Mean: 44449.725
Median Absolute Deviation (MAD): 250
Skewness: 0.061941873
Sum: 23291656
Variance: 99024.877
Monotonicity: Increasing
Histogram with fixed size bins (bins=18)
Common values
Value | Count | Frequency (%)
44000 | 35 | 6.7%
44790 | 35 | 6.7%
44810 | 35 | 6.7%
44150 | 35 | 6.7%
44130 | 35 | 6.7%
44800 | 35 | 6.7%
44710 | 35 | 6.7%
44760 | 35 | 6.7%
44770 | 35 | 6.7%
44825 | 34 | 6.5%
Other values (8) | 175 | 33.4%

Minimum 10 values
Value | Count | Frequency (%)
44000 | 35 | 6.7%
44130 | 35 | 6.7%
44131 | 14 | 2.7%
44133 | 14 | 2.7%
44150 | 35 | 6.7%
44180 | 28 | 5.3%
44200 | 28 | 5.3%
44210 | 34 | 6.5%
44230 | 27 | 5.2%
44250 | 19 | 3.6%

Maximum 10 values
Value | Count | Frequency (%)
44825 | 34 | 6.5%
44810 | 35 | 6.7%
44800 | 35 | 6.7%
44790 | 35 | 6.7%
44770 | 35 | 6.7%
44760 | 35 | 6.7%
44710 | 35 | 6.7%
44270 | 11 | 2.1%
44250 | 19 | 3.6%
44230 | 27 | 5.2%

지역명
Categorical

HIGH CORRELATION 

Distinct: 18
Distinct (%): 3.4%
Missing: 0
Missing (%): 0.0%
Memory size: 4.2 KiB

Value | Count
충남 | 35
예산군 | 35
홍성군 | 35
공주시 | 35
청양군 | 35
Other values (13) | 349

Length

Max length: 3
Median length: 3
Mean length: 2.9332061
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충남
2nd row: 충남
3rd row: 충남
4th row: 충남
5th row: 충남

Common Values

Value | Count | Frequency (%)
충남 | 35 | 6.7%
예산군 | 35 | 6.7%
홍성군 | 35 | 6.7%
공주시 | 35 | 6.7%
청양군 | 35 | 6.7%
서천군 | 35 | 6.7%
부여군 | 35 | 6.7%
금산군 | 35 | 6.7%
천안시 | 35 | 6.7%
서산시 | 34 | 6.5%
Other values (8) | 175 | 33.4%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
충남 | 35 | 6.7%
홍성군 | 35 | 6.7%
공주시 | 35 | 6.7%
청양군 | 35 | 6.7%
서천군 | 35 | 6.7%
부여군 | 35 | 6.7%
금산군 | 35 | 6.7%
천안시 | 35 | 6.7%
예산군 | 35 | 6.7%
태안군 | 34 | 6.5%
Other values (8) | 175 | 33.4%

조사년도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 35
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2005.7481
Minimum: 1987
Maximum: 2021
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.7 KiB

Quantile statistics

Minimum: 1987
5-th percentile: 1989
Q1: 1998
Median: 2007
Q3: 2014
95-th percentile: 2020
Maximum: 2021
Range: 34
Interquartile range (IQR): 16

Descriptive statistics

Standard deviation: 9.8286221
Coefficient of variation (CV): 0.0049002276
Kurtosis: -1.1153338
Mean: 2005.7481
Median Absolute Deviation (MAD): 8
Skewness: -0.20751924
Sum: 1051012
Variance: 96.601813
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=35)
Common values
Value | Count | Frequency (%)
2013 | 18 | 3.4%
2017 | 18 | 3.4%
2011 | 18 | 3.4%
2021 | 18 | 3.4%
2014 | 18 | 3.4%
2019 | 18 | 3.4%
2012 | 18 | 3.4%
2020 | 18 | 3.4%
2015 | 18 | 3.4%
2016 | 18 | 3.4%
Other values (25) | 344 | 65.6%

Minimum 10 values
Value | Count | Frequency (%)
1987 | 9 | 1.7%
1988 | 11 | 2.1%
1989 | 11 | 2.1%
1990 | 11 | 2.1%
1991 | 11 | 2.1%
1992 | 11 | 2.1%
1993 | 11 | 2.1%
1994 | 13 | 2.5%
1995 | 14 | 2.7%
1996 | 14 | 2.7%

Maximum 10 values
Value | Count | Frequency (%)
2021 | 18 | 3.4%
2020 | 18 | 3.4%
2019 | 18 | 3.4%
2018 | 18 | 3.4%
2017 | 18 | 3.4%
2016 | 18 | 3.4%
2015 | 18 | 3.4%
2014 | 18 | 3.4%
2013 | 18 | 3.4%
2012 | 18 | 3.4%

지수_평균
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 524
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 79.843635
Minimum: 28.721607
Maximum: 105.133
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.7 KiB

Quantile statistics

Minimum: 28.721607
5-th percentile: 53.457998
Q1: 68.392538
Median: 84.626832
Q3: 90.820678
95-th percentile: 100.85475
Maximum: 105.133
Range: 76.411393
Interquartile range (IQR): 22.42814

Descriptive statistics

Standard deviation: 15.625515
Coefficient of variation (CV): 0.19570144
Kurtosis: -0.16176002
Mean: 79.843635
Median Absolute Deviation (MAD): 10.357288
Skewness: -0.67141059
Sum: 41838.065
Variance: 244.15671
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
88.0197052766814 | 1 | 0.2%
92.5770844344467 | 1 | 0.2%
90.6443540752805 | 1 | 0.2%
99.2688699585086 | 1 | 0.2%
91.3896120615897 | 1 | 0.2%
100.381 | 1 | 0.2%
90.0949260321667 | 1 | 0.2%
91.0084684636974 | 1 | 0.2%
82.9234569538015 | 1 | 0.2%
82.4584137314394 | 1 | 0.2%
Other values (514) | 514 | 98.1%

Minimum 10 values
Value | Count | Frequency (%)
28.7216071873067 | 1 | 0.2%
31.6427935274335 | 1 | 0.2%
33.4387950752708 | 1 | 0.2%
33.6980916936068 | 1 | 0.2%
34.9484865666039 | 1 | 0.2%
35.6664978756995 | 1 | 0.2%
35.7908979090712 | 1 | 0.2%
40.526875944447 | 1 | 0.2%
40.9384605738658 | 1 | 0.2%
41.6377779096038 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
105.133 | 1 | 0.2%
104.84 | 1 | 0.2%
104.797 | 1 | 0.2%
104.746 | 1 | 0.2%
104.621 | 1 | 0.2%
103.716 | 1 | 0.2%
103.7 | 1 | 0.2%
103.524 | 1 | 0.2%
103.487 | 1 | 0.2%
103.285 | 1 | 0.2%

지역구분 레벨
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 4.2 KiB

Value | Count
1 | 461
0 | 35
2 | 28

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
1 | 461 | 88.0%
0 | 35 | 6.7%
2 | 28 | 5.3%
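
The Imbalance alert reports 59.0% for 지역구분 레벨. The report does not spell out the formula, so the sketch below assumes the score is one minus the Shannon entropy of the category frequencies, normalized by the maximum entropy for that number of categories. With the counts 461, 35, and 28 listed above, this assumed definition reproduces the reported figure. The function name `imbalance_score` is chosen for this illustration.

```python
import math

# Category counts for 지역구분 레벨, taken from the Common Values table.
counts = {"1": 461, "0": 35, "2": 28}


def imbalance_score(category_counts):
    """One minus Shannon entropy normalized by its maximum (assumed definition)."""
    total = sum(category_counts.values())
    probabilities = [c / total for c in category_counts.values() if c > 0]
    if len(probabilities) < 2:
        # Illustrative choice: a single category is treated as not imbalanced.
        return 0.0
    entropy = -sum(p * math.log2(p) for p in probabilities)
    return 1.0 - entropy / math.log2(len(probabilities))


score = imbalance_score(counts)
print(f"{score:.1%}")  # prints 59.0%
```

A score of 0 corresponds to perfectly balanced categories and a score near 1 to a column dominated by a single category. Here level 1 holds 88.0% of the rows.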

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
1 | 461 | 88.0%
0 | 35 | 6.7%
2 | 28 | 5.3%

Interactions

[Figures: 16 pairwise interaction plots, not reproduced here]

Correlations

Variable | 번호 | 지역코드 | 지역명 | 조사년도 | 지수_평균 | 지역구분 레벨
번호 | 1.000 | 0.981 | 0.977 | 0.000 | 0.475 | 0.840
지역코드 | 0.981 | 1.000 | 1.000 | 0.096 | 0.460 | 0.399
지역명 | 0.977 | 1.000 | 1.000 | 0.000 | 0.515 | 1.000
조사년도 | 0.000 | 0.096 | 0.000 | 1.000 | 0.898 | 0.196
지수_평균 | 0.475 | 0.460 | 0.515 | 0.898 | 1.000 | 0.292
지역구분 레벨 | 0.840 | 0.399 | 1.000 | 0.196 | 0.292 | 1.000

Variable | 지역명 | 지역구분 레벨
지역명 | 1.000 | 0.985
지역구분 레벨 | 0.985 | 1.000

Variable | 번호 | 지역코드 | 조사년도 | 지수_평균 | 지역명 | 지역구분 레벨
번호 | 1.000 | 0.998 | -0.097 | 0.034 | 0.878 | 0.747
지역코드 | 0.998 | 1.000 | -0.089 | 0.039 | 0.988 | 0.784
조사년도 | -0.097 | -0.089 | 1.000 | 0.912 | 0.000 | 0.122
지수_평균 | 0.034 | 0.039 | 0.912 | 1.000 | 0.223 | 0.181
지역명 | 0.878 | 0.988 | 0.000 | 0.223 | 1.000 | 0.985
지역구분 레벨 | 0.747 | 0.784 | 0.122 | 0.181 | 0.985 | 1.000
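
The matrices above do not state which correlation coefficient each one uses. To illustrate the strong association between 조사년도 and 지수_평균, the sketch below computes Spearman's rank correlation on the first ten rows shown in the Sample section (all for 충남). Spearman's rho is a swapped-in choice for illustration only. The result comes from ten rows of a single region, so it is not expected to equal the full-dataset values in the matrices. The helpers `ranks` and `spearman` are written for this example.

```python
# First ten rows of the Sample section: (조사년도, 지수_평균) for 충남.
rows = [
    (2013, 88.019705), (2012, 87.34135), (2006, 83.135513), (1991, 64.828967),
    (1989, 53.153433), (2001, 60.635041), (2018, 97.224552), (1993, 61.392754),
    (1988, 41.637778), (2017, 95.061159),
]


def ranks(values):
    """Rank values from 1..n; assumes no ties, which holds for these rows."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman(xs, ys):
    """Spearman's rho via the no-ties formula 1 - 6*sum(d^2) / (n*(n^2 - 1))."""
    n = len(xs)
    d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks(xs), ranks(ys)))
    return 1 - 6 * d_squared / (n * (n * n - 1))


years = [year for year, _ in rows]
index_values = [value for _, value in rows]
rho = spearman(years, index_values)
print(round(rho, 3))  # prints 0.952
```

Only three of the ten rows (1991, 1993, 2001) fall out of rank order, which is why rho is close to 1 for this subset.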

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

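First rows
Index | 번호 | 지역코드 | 지역명 | 조사년도 | 지수_평균 | 지역구분 레벨
0 | 1 | 44000 | 충남 | 2013 | 88.019705 | 0
1 | 2 | 44000 | 충남 | 2012 | 87.34135 | 0
2 | 3 | 44000 | 충남 | 2006 | 83.135513 | 0
3 | 4 | 44000 | 충남 | 1991 | 64.828967 | 0
4 | 5 | 44000 | 충남 | 1989 | 53.153433 | 0
5 | 6 | 44000 | 충남 | 2001 | 60.635041 | 0
6 | 7 | 44000 | 충남 | 2018 | 97.224552 | 0
7 | 8 | 44000 | 충남 | 1993 | 61.392754 | 0
8 | 9 | 44000 | 충남 | 1988 | 41.637778 | 0
9 | 10 | 44000 | 충남 | 2017 | 95.061159 | 0

Last rows
Index | 번호 | 지역코드 | 지역명 | 조사년도 | 지수_평균 | 지역구분 레벨
514 | 515 | 44825 | 태안군 | 2021 | 102.829 | 1
515 | 516 | 44825 | 태안군 | 2008 | 86.01022 | 1
516 | 517 | 44825 | 태안군 | 2016 | 92.33238 | 1
517 | 518 | 44825 | 태안군 | 2017 | 94.978327 | 1
518 | 519 | 44825 | 태안군 | 1991 | 75.444989 | 1
519 | 520 | 44825 | 태안군 | 1989 | 68.8348 | 1
520 | 521 | 44825 | 태안군 | 2019 | 98.91174 | 1
521 | 522 | 44825 | 태안군 | 1990 | 77.74856 | 1
522 | 523 | 44825 | 태안군 | 1998 | 63.359583 | 1
523 | 524 | 44825 | 태안군 | 1996 | 67.699865 | 1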