Overview

Dataset statistics

Number of variables: 6
Number of observations: 4519
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 234.0 KiB
Average record size in memory: 53.0 B
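
As a quick consistency check, the average record size should equal the total in-memory size divided by the number of observations. The sketch below redoes that arithmetic from the figures above, assuming 1 KiB means 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 234.0
n_observations = 4519

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```

The result rounds to the 53.0 B reported above.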

Variable types

Numeric: 4
Categorical: 2

Dataset

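Description: A service for querying the nationwide land price change rate survey statistics provided by the Korea Real Estate Board (formerly the Korea Appraisal Board). It provides monthly, region-level land price change rates, land price indices, and auxiliary indices for Chungnam.
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/bigdata/collect/view.chungnam?menuCd=DOM_000000201001001000&apiIdx=2534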

Alerts

지역명 is highly overall correlated with 번호 and 2 other fields [High correlation]
지역구분 레벨 is highly overall correlated with 번호 and 2 other fields [High correlation]
번호 is highly overall correlated with 지역코드 and 2 other fields [High correlation]
지역코드 is highly overall correlated with 번호 and 2 other fields [High correlation]
조사일자 is highly overall correlated with 지수_평균 [High correlation]
지수_평균 is highly overall correlated with 조사일자 [High correlation]
지역구분 레벨 is highly imbalanced (55.0%) [Imbalance]
번호 has unique values [Unique]

Reproduction

Analysis started: 2024-01-09 20:24:23.605064
Analysis finished: 2024-01-09 20:24:25.280733
Duration: 1.68 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
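
A report of this kind is typically produced with a few lines of Python. The following is a minimal sketch, not the configuration actually used here (that is what config.json records): the CSV file name, the report title, and the helper name `build_report` are assumptions, and both pandas and ydata-profiling must be installed for the function to run.

```python
def build_report(csv_path: str, html_path: str = "report.html") -> None:
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so the function can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # csv_path is assumed to be a local export of the dataset.
    df = pd.read_csv(csv_path)
    # The title is illustrative; all other settings are left at their defaults.
    profile = ProfileReport(df, title="Chungnam land price index")
    profile.to_file(html_path)


# Example call (hypothetical file name):
# build_report("chungnam_land_price.csv")
```

Default settings may not match the original run, so timings and some sections can differ.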

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 4519
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2260
Minimum: 1
Maximum: 4519
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 39.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 226.9
Q1: 1130.5
Median: 2260
Q3: 3389.5
95-th percentile: 4293.1
Maximum: 4519
Range: 4518
Interquartile range (IQR): 2259

Descriptive statistics

Standard deviation: 1304.6673
Coefficient of variation (CV): 0.5772864
Kurtosis: -1.2
Mean: 2260
Median Absolute Deviation (MAD): 1130
Skewness: 0
Sum: 10212940
Variance: 1702156.7
Monotonicity: Strictly increasing

[Figure: histogram with fixed size bins (bins=50)]
Common values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
3011 | 1 | < 0.1%
3017 | 1 | < 0.1%
3016 | 1 | < 0.1%
3015 | 1 | < 0.1%
3014 | 1 | < 0.1%
3013 | 1 | < 0.1%
3012 | 1 | < 0.1%
3010 | 1 | < 0.1%
3019 | 1 | < 0.1%
Other values (4509) | 4509 | 99.8%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%
5 | 1 | < 0.1%
6 | 1 | < 0.1%
7 | 1 | < 0.1%
8 | 1 | < 0.1%
9 | 1 | < 0.1%
10 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
4519 | 1 | < 0.1%
4518 | 1 | < 0.1%
4517 | 1 | < 0.1%
4516 | 1 | < 0.1%
4515 | 1 | < 0.1%
4514 | 1 | < 0.1%
4513 | 1 | < 0.1%
4512 | 1 | < 0.1%
4511 | 1 | < 0.1%
4510 | 1 | < 0.1%
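
The statistics reported for 번호 are what one would expect if the column is simply the consecutive integers 1 through 4519, that is, a row counter rather than a measurement. The sketch below checks this under that assumption, using only the standard library; the closed forms in the comments are the usual ones for the integers 1 to n.

```python
import statistics

n = 4519
values = list(range(1, n + 1))  # assumption: 번호 is exactly 1, 2, ..., n

total = sum(values)                     # closed form: n * (n + 1) / 2
mean = statistics.mean(values)          # closed form: (n + 1) / 2
variance = statistics.variance(values)  # sample variance; closed form: n * (n + 1) / 12
std_dev = variance ** 0.5

print(total, mean, round(variance, 1), round(std_dev, 4))
```

The sum, mean, variance, and standard deviation agree with the reported values, which also explains the unique-values alert and the strictly increasing monotonicity.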

지역코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 18
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 44433.156
Minimum: 44000
Maximum: 44825
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 39.8 KiB

Quantile statistics

Minimum: 44000
5-th percentile: 44000
Q1: 44150
Median: 44250
Q3: 44770
95-th percentile: 44825
Maximum: 44825
Range: 825
Interquartile range (IQR): 620

Descriptive statistics

Standard deviation: 310.73132
Coefficient of variation (CV): 0.0069932309
Kurtosis: -1.8022321
Mean: 44433.156
Median Absolute Deviation (MAD): 120
Skewness: 0.17940113
Sum: 2.0079343 × 10^8
Variance: 96553.953
Monotonicity: Increasing

[Figure: histogram with fixed size bins (bins=18)]
Common values
Value | Count | Frequency (%)
44000 | 282 | 6.2%
44790 | 282 | 6.2%
44810 | 282 | 6.2%
44150 | 282 | 6.2%
44130 | 282 | 6.2%
44800 | 282 | 6.2%
44710 | 282 | 6.2%
44760 | 282 | 6.2%
44770 | 282 | 6.2%
44825 | 275 | 6.1%
Other values (8) | 1706 | 37.8%

Minimum 10 values
Value | Count | Frequency (%)
44000 | 282 | 6.2%
44130 | 282 | 6.2%
44131 | 169 | 3.7%
44133 | 169 | 3.7%
44150 | 282 | 6.2%
44180 | 251 | 5.6%
44200 | 251 | 5.6%
44210 | 275 | 6.1%
44230 | 247 | 5.5%
44250 | 216 | 4.8%

Maximum 10 values
Value | Count | Frequency (%)
44825 | 275 | 6.1%
44810 | 282 | 6.2%
44800 | 282 | 6.2%
44790 | 282 | 6.2%
44770 | 282 | 6.2%
44760 | 282 | 6.2%
44710 | 282 | 6.2%
44270 | 128 | 2.8%
44250 | 216 | 4.8%
44230 | 247 | 5.5%

지역명
Categorical

HIGH CORRELATION 

Distinct: 18
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 35.4 KiB

Value | Count
충남 | 282
예산군 | 282
홍성군 | 282
공주시 | 282
청양군 | 282
Other values (13) | 3109

Length

Max length: 3
Median length: 3
Mean length: 2.9375968
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충남
2nd row: 충남
3rd row: 충남
4th row: 충남
5th row: 충남

Common Values

Value | Count | Frequency (%)
충남 | 282 | 6.2%
예산군 | 282 | 6.2%
홍성군 | 282 | 6.2%
공주시 | 282 | 6.2%
청양군 | 282 | 6.2%
서천군 | 282 | 6.2%
부여군 | 282 | 6.2%
금산군 | 282 | 6.2%
천안시 | 282 | 6.2%
서산시 | 275 | 6.1%
Other values (8) | 1706 | 37.8%

Length

[Figure: histogram of lengths of the category]

Value | Count | Frequency (%)
충남 | 282 | 6.2%
홍성군 | 282 | 6.2%
공주시 | 282 | 6.2%
청양군 | 282 | 6.2%
서천군 | 282 | 6.2%
부여군 | 282 | 6.2%
금산군 | 282 | 6.2%
천안시 | 282 | 6.2%
예산군 | 282 | 6.2%
태안군 | 275 | 6.1%
Other values (8) | 1706 | 37.8%

조사일자
Real number (ℝ)

HIGH CORRELATION 

Distinct: 282
Distinct (%): 6.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 201013.1
Minimum: 198701
Maximum: 202206
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 39.8 KiB

Quantile statistics

Minimum: 198701
5-th percentile: 199202
Q1: 200604
Median: 201201
Q3: 201704
95-th percentile: 202106
Maximum: 202206
Range: 3505
Interquartile range (IQR): 1100

Descriptive statistics

Standard deviation: 855.5698
Coefficient of variation (CV): 0.0042562888
Kurtosis: 0.041560833
Mean: 201013.1
Median Absolute Deviation (MAD): 510
Skewness: -0.83219355
Sum: 9.0837819 × 10^8
Variance: 731999.68
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
Common values
Value | Count | Frequency (%)
201609 | 18 | 0.4%
201805 | 18 | 0.4%
201710 | 18 | 0.4%
202012 | 18 | 0.4%
201603 | 18 | 0.4%
201601 | 18 | 0.4%
201605 | 18 | 0.4%
201702 | 18 | 0.4%
201804 | 18 | 0.4%
202104 | 18 | 0.4%
Other values (272) | 4339 | 96.0%

Minimum 10 values
Value | Count | Frequency (%)
198701 | 9 | 0.2%
198702 | 9 | 0.2%
198703 | 9 | 0.2%
198704 | 9 | 0.2%
198801 | 9 | 0.2%
198802 | 9 | 0.2%
198803 | 9 | 0.2%
198804 | 11 | 0.2%
198901 | 11 | 0.2%
198902 | 11 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
202206 | 18 | 0.4%
202205 | 18 | 0.4%
202204 | 18 | 0.4%
202203 | 18 | 0.4%
202202 | 18 | 0.4%
202201 | 18 | 0.4%
202112 | 18 | 0.4%
202111 | 18 | 0.4%
202110 | 18 | 0.4%
202109 | 18 | 0.4%
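
조사일자 is profiled as a real number, but its values look like six-digit period codes (a four-digit year followed by a two-digit period), so statistics such as the mean or standard deviation are hard to interpret directly. A minimal sketch of splitting a code into its parts follows. Treating the last two digits as a period number is an assumption based on the values listed above, whether a given period denotes a month or a quarter is not stated in the report, and the helper name is hypothetical.

```python
def split_period_code(code: int) -> tuple[int, int]:
    """Split a YYYYPP code into (year, period). Hypothetical helper."""
    year, period = divmod(code, 100)
    # Assumption: a valid period number lies between 1 and 12.
    if not 1 <= period <= 12:
        raise ValueError(f"unexpected period in code {code}")
    return year, period


print(split_period_code(198701))  # (1987, 1)
print(split_period_code(202206))  # (2022, 6)
```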

지수_평균
Real number (ℝ)

HIGH CORRELATION 

Distinct: 4461
Distinct (%): 98.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 85.686128
Minimum: 26.191447
Maximum: 107.012
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 39.8 KiB

Quantile statistics

Minimum: 26.191447
5-th percentile: 57.87588
Q1: 82.608839
Median: 88.373257
Q3: 93.803439
95-th percentile: 101.939
Maximum: 107.012
Range: 80.820553
Interquartile range (IQR): 11.1946

Descriptive statistics

Standard deviation: 13.18672
Coefficient of variation (CV): 0.15389562
Kurtosis: 2.1069678
Mean: 85.686128
Median Absolute Deviation (MAD): 5.598333
Skewness: -1.3603215
Sum: 387215.61
Variance: 173.88958
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
Common values
Value | Count | Frequency (%)
100.0 | 18 | 0.4%
81.6433542520612 | 4 | 0.1%
91.3100601474253 | 4 | 0.1%
76.3386870074569 | 3 | 0.1%
60.9335992680419 | 3 | 0.1%
66.8324230096737 | 3 | 0.1%
101.733 | 2 | < 0.1%
86.2044339293005 | 2 | < 0.1%
77.5697244327301 | 2 | < 0.1%
62.1605096307226 | 2 | < 0.1%
Other values (4451) | 4476 | 99.0%

Minimum 10 values
Value | Count | Frequency (%)
26.1914474939269 | 1 | < 0.1%
26.4690768373626 | 1 | < 0.1%
27.4087290650889 | 1 | < 0.1%
28.7216071873067 | 1 | < 0.1%
29.7277277234335 | 1 | < 0.1%
30.1082426382935 | 1 | < 0.1%
30.2639574932651 | 1 | < 0.1%
30.6200827631445 | 1 | < 0.1%
31.4412254397531 | 1 | < 0.1%
31.6427935274335 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
107.012 | 1 | < 0.1%
106.772 | 1 | < 0.1%
106.63 | 1 | < 0.1%
106.432 | 1 | < 0.1%
106.397 | 1 | < 0.1%
106.39 | 1 | < 0.1%
106.351 | 1 | < 0.1%
106.139 | 1 | < 0.1%
106.101 | 1 | < 0.1%
106.099 | 1 | < 0.1%

지역구분 레벨
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 35.4 KiB

Value | Count
1 | 3899
2 | 338
0 | 282

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
1 | 3899 | 86.3%
2 | 338 | 7.5%
0 | 282 | 6.2%
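
The 55.0% figure in the imbalance alert for this variable can be reproduced from the counts above if the score is taken to be one minus the normalized Shannon entropy of the class frequencies. That definition is an assumption here: it reproduces the reported value, but the report itself does not state the formula, so the sketch is a check rather than the library's own code. The function name is illustrative.

```python
import math


def imbalance_score(counts):
    """Return 1 - H(p) / log(k): 0 for a uniform split, 1 if one class holds everything."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Shannon entropy of the class frequencies (natural log; the base cancels out).
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


score = imbalance_score([3899, 338, 282])  # counts for levels 1, 2, 0
print(f"{score:.1%}")
```

A perfectly balanced three-level variable would score 0%, so a value near 55% reflects level 1 holding 86.3% of the rows.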

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

Value | Count | Frequency (%)
1 | 3899 | 86.3%
2 | 338 | 7.5%
0 | 282 | 6.2%

Interactions

[Figures: 16 pairwise interaction plots; images not preserved]

Correlations

[Figure: correlation heatmap]

Variable | 번호 | 지역코드 | 지역명 | 조사일자 | 지수_평균 | 지역구분 레벨
번호 | 1.000 | 0.980 | 0.976 | 0.228 | 0.405 | 0.877
지역코드 | 0.980 | 1.000 | 1.000 | 0.249 | 0.366 | 0.465
지역명 | 0.976 | 1.000 | 1.000 | 0.228 | 0.440 | 1.000
조사일자 | 0.228 | 0.249 | 0.228 | 1.000 | 0.913 | 0.224
지수_평균 | 0.405 | 0.366 | 0.440 | 0.913 | 1.000 | 0.258
지역구분 레벨 | 0.877 | 0.465 | 1.000 | 0.224 | 0.258 | 1.000

[Figure: correlation heatmap]

Variable | 지역명 | 지역구분 레벨
지역명 | 1.000 | 0.998
지역구분 레벨 | 0.998 | 1.000

[Figure: correlation heatmap]

Variable | 번호 | 지역코드 | 조사일자 | 지수_평균 | 지역명 | 지역구분 레벨
번호 | 1.000 | 0.998 | -0.065 | 0.050 | 0.880 | 0.812
지역코드 | 0.998 | 1.000 | -0.067 | 0.048 | 0.999 | 0.812
조사일자 | -0.065 | -0.067 | 1.000 | 0.940 | 0.090 | 0.137
지수_평균 | 0.050 | 0.048 | 0.940 | 1.000 | 0.184 | 0.159
지역명 | 0.880 | 0.999 | 0.090 | 0.184 | 1.000 | 0.998
지역구분 레벨 | 0.812 | 0.812 | 0.137 | 0.159 | 0.998 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

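First rows

(index) | 번호 | 지역코드 | 지역명 | 조사일자 | 지수_평균 | 지역구분 레벨
0 | 1 | 44000 | 충남 | 200809 | 86.990411 | 0
1 | 2 | 44000 | 충남 | 200603 | 80.952157 | 0
2 | 3 | 44000 | 충남 | 200609 | 82.581809 | 0
3 | 4 | 44000 | 충남 | 199401 | 61.233133 | 0
4 | 5 | 44000 | 충남 | 199501 | 60.988489 | 0
5 | 6 | 44000 | 충남 | 199604 | 61.594277 | 0
6 | 7 | 44000 | 충남 | 199402 | 61.122913 | 0
7 | 8 | 44000 | 충남 | 199703 | 62.224645 | 0
8 | 9 | 44000 | 충남 | 199704 | 62.293092 | 0
9 | 10 | 44000 | 충남 | 201410 | 88.985795 | 0

Last rows

(index) | 번호 | 지역코드 | 지역명 | 조사일자 | 지수_평균 | 지역구분 레벨
4509 | 4510 | 44825 | 태안군 | 201110 | 87.728343 | 1
4510 | 4511 | 44825 | 태안군 | 201001 | 86.016272 | 1
4511 | 4512 | 44825 | 태안군 | 200912 | 85.81632 | 1
4512 | 4513 | 44825 | 태안군 | 201604 | 91.207085 | 1
4513 | 4514 | 44825 | 태안군 | 201512 | 90.805522 | 1
4514 | 4515 | 44825 | 태안군 | 201608 | 91.796825 | 1
4515 | 4516 | 44825 | 태안군 | 201909 | 98.484737 | 1
4516 | 4517 | 44825 | 태안군 | 201612 | 92.33238 | 1
4517 | 4518 | 44825 | 태안군 | 201804 | 95.635894 | 1
4518 | 4519 | 44825 | 태안군 | 201807 | 96.359864 | 1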