Overview

Dataset statistics

Number of variables12
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.1 KiB
Average record size in memory103.3 B

Variable types

Numeric3
Categorical8
Boolean1

Alerts

기준월 has constant value ""Constant
지점 has constant value ""Constant
법정동명 has constant value ""Constant
표준지여부 has constant value ""Constant
특수지구분코드 has constant value ""Constant
특수지구분명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
기본키 is highly overall correlated with 지번 and 1 other fieldsHigh correlation
기준년도 is highly overall correlated with 개별공시지가(원) and 1 other fieldsHigh correlation
개별공시지가(원) is highly overall correlated with 기준년도 and 1 other fieldsHigh correlation
지번 is highly overall correlated with 기본키High correlation
공시일자 is highly overall correlated with 기본키 and 2 other fieldsHigh correlation
기본키 has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:16:07.001377
Analysis finished2023-12-10 10:16:09.511802
Duration2.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기본키
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:16:09.717059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T19:16:10.434617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

기준년도
Real number (ℝ)

HIGH CORRELATION 

Distinct31
Distinct (%)31.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2002.72
Minimum1990
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:16:10.724464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1990
5-th percentile1991
Q11996
median2002
Q32008
95-th percentile2018
Maximum2020
Range30
Interquartile range (IQR)12

Descriptive statistics

Standard deviation8.3775554
Coefficient of variation (CV)0.0041830887
Kurtosis-0.84262584
Mean2002.72
Median Absolute Deviation (MAD)6
Skewness0.35029979
Sum200272
Variance70.183434
MonotonicityNot monotonic
2023-12-10T19:16:10.990651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
1990 4
 
4.0%
2000 4
 
4.0%
2008 4
 
4.0%
2007 4
 
4.0%
1991 4
 
4.0%
2005 4
 
4.0%
2004 4
 
4.0%
2003 4
 
4.0%
2002 4
 
4.0%
2001 4
 
4.0%
Other values (21) 60
60.0%
ValueCountFrequency (%)
1990 4
4.0%
1991 4
4.0%
1992 4
4.0%
1993 4
4.0%
1994 4
4.0%
1995 4
4.0%
1996 4
4.0%
1997 4
4.0%
1998 4
4.0%
1999 4
4.0%
ValueCountFrequency (%)
2020 2
2.0%
2019 2
2.0%
2018 2
2.0%
2017 2
2.0%
2016 2
2.0%
2015 2
2.0%
2014 2
2.0%
2013 2
2.0%
2012 2
2.0%
2011 2
2.0%

기준월
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2023-12-10T19:16:11.202107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:11.345699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

지점
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
A-1000-0239S-10
100 

Length

Max length15
Median length15
Mean length15
Min length15

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA-1000-0239S-10
2nd rowA-1000-0239S-10
3rd rowA-1000-0239S-10
4th rowA-1000-0239S-10
5th rowA-1000-0239S-10

Common Values

ValueCountFrequency (%)
A-1000-0239S-10 100
100.0%

Length

2023-12-10T19:16:11.495283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:11.667615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
a-1000-0239s-10 100
100.0%

법정동명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울 강동구 상일동
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울 강동구 상일동
2nd row서울 강동구 상일동
3rd row서울 강동구 상일동
4th row서울 강동구 상일동
5th row서울 강동구 상일동

Common Values

ValueCountFrequency (%)
서울 강동구 상일동 100
100.0%

Length

2023-12-10T19:16:11.836943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:11.989486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울 100
33.3%
강동구 100
33.3%
상일동 100
33.3%

지번
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
62 
2
38 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 62
62.0%
2 38
38.0%

Length

2023-12-10T19:16:12.159742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:12.315972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 62
62.0%
2 38
38.0%

개별공시지가(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct40
Distinct (%)40.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean632220
Minimum150000
Maximum1881000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:16:12.493033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum150000
5-th percentile153000
Q1260000
median360000
Q31160000
95-th percentile1615000
Maximum1881000
Range1731000
Interquartile range (IQR)900000

Descriptive statistics

Standard deviation536164.32
Coefficient of variation (CV)0.84806605
Kurtosis-0.60291505
Mean632220
Median Absolute Deviation (MAD)160000
Skewness1.0004575
Sum63222000
Variance2.8747217 × 1011
MonotonicityNot monotonic
2023-12-10T19:16:12.754308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
310000 6
 
6.0%
200000 4
 
4.0%
153000 4
 
4.0%
260000 4
 
4.0%
270000 4
 
4.0%
360000 4
 
4.0%
330000 4
 
4.0%
174000 4
 
4.0%
398000 4
 
4.0%
1615000 2
 
2.0%
Other values (30) 60
60.0%
ValueCountFrequency (%)
150000 2
2.0%
153000 4
4.0%
163000 2
2.0%
168000 2
2.0%
174000 4
4.0%
186000 2
2.0%
200000 4
4.0%
230000 2
2.0%
240000 2
2.0%
260000 4
4.0%
ValueCountFrequency (%)
1881000 2
2.0%
1738000 2
2.0%
1615000 2
2.0%
1601000 2
2.0%
1566000 2
2.0%
1536000 2
2.0%
1485000 2
2.0%
1420000 2
2.0%
1370000 2
2.0%
1270000 2
2.0%

표준지여부
Boolean

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
False
100 
ValueCountFrequency (%)
False 100
100.0%
2023-12-10T19:16:12.983121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

특수지구분코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2023-12-10T19:16:13.167537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:13.345301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

특수지구분명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
일반
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 100
100.0%

Length

2023-12-10T19:16:13.494737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:13.626465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 100
100.0%

공시일자
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)31.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1990-08-30
 
4
2005-05-31
 
4
1998-06-30
 
4
1992-06-01
 
4
1993-05-22
 
4
Other values (26)
80 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1990-08-30
2nd row1990-08-30
3rd row1991-06-29
4th row1991-06-29
5th row1992-06-01

Common Values

ValueCountFrequency (%)
1990-08-30 4
 
4.0%
2005-05-31 4
 
4.0%
1998-06-30 4
 
4.0%
1992-06-01 4
 
4.0%
1993-05-22 4
 
4.0%
2000-06-30 4
 
4.0%
1995-06-30 4
 
4.0%
1996-06-29 4
 
4.0%
1997-06-30 4
 
4.0%
1999-06-30 4
 
4.0%
Other values (21) 60
60.0%

Length

2023-12-10T19:16:13.802685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1990-08-30 4
 
4.0%
1994-06-30 4
 
4.0%
2002-06-29 4
 
4.0%
2008-05-31 4
 
4.0%
2007-05-31 4
 
4.0%
2005-05-31 4
 
4.0%
1991-06-29 4
 
4.0%
2004-06-30 4
 
4.0%
2003-06-30 4
 
4.0%
2001-06-30 4
 
4.0%
Other values (21) 60
60.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2020-09-26
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-09-26
2nd row2020-09-26
3rd row2020-09-26
4th row2020-09-26
5th row2020-09-26

Common Values

ValueCountFrequency (%)
2020-09-26 100
100.0%

Length

2023-12-10T19:16:13.964690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:14.091411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020-09-26 100
100.0%

Interactions

2023-12-10T19:16:08.502834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:07.579616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:08.048572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:08.655912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:07.741328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:08.191909image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:08.820514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:07.910301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:08.340280image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:16:14.160417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기본키기준년도지번개별공시지가(원)공시일자
기본키1.0000.9530.9980.8230.914
기준년도0.9531.0000.4450.8701.000
지번0.9980.4451.0000.5200.000
개별공시지가(원)0.8230.8700.5201.0000.956
공시일자0.9141.0000.0000.9561.000
2023-12-10T19:16:14.330457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공시일자지번
공시일자1.0000.000
지번0.0001.000
2023-12-10T19:16:14.455375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기본키기준년도개별공시지가(원)지번공시일자
기본키1.0000.2330.2010.9220.550
기준년도0.2331.0000.8260.3220.876
개별공시지가(원)0.2010.8261.0000.4920.680
지번0.9220.3220.4921.0000.000
공시일자0.5500.8760.6800.0001.000

Missing values

2023-12-10T19:16:09.057756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:16:09.381012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기본키기준년도기준월지점법정동명지번개별공시지가(원)표준지여부특수지구분코드특수지구분명공시일자데이터기준일자
0119901A-1000-0239S-10서울 강동구 상일동1200000N1일반1990-08-302020-09-26
1219901A-1000-0239S-10서울 강동구 상일동1200000N1일반1990-08-302020-09-26
2319911A-1000-0239S-10서울 강동구 상일동1150000N1일반1991-06-292020-09-26
3419911A-1000-0239S-10서울 강동구 상일동1150000N1일반1991-06-292020-09-26
4519921A-1000-0239S-10서울 강동구 상일동1168000N1일반1992-06-012020-09-26
5619921A-1000-0239S-10서울 강동구 상일동1168000N1일반1992-06-012020-09-26
6719931A-1000-0239S-10서울 강동구 상일동1163000N1일반1993-05-222020-09-26
7819931A-1000-0239S-10서울 강동구 상일동1163000N1일반1993-05-222020-09-26
8919941A-1000-0239S-10서울 강동구 상일동1153000N1일반1994-06-302020-09-26
91019941A-1000-0239S-10서울 강동구 상일동1153000N1일반1994-06-302020-09-26
기본키기준년도기준월지점법정동명지번개별공시지가(원)표준지여부특수지구분코드특수지구분명공시일자데이터기준일자
909120041A-1000-0239S-10서울 강동구 상일동2230000N1일반2004-06-302020-09-26
919220041A-1000-0239S-10서울 강동구 상일동2230000N1일반2004-06-302020-09-26
929320051A-1000-0239S-10서울 강동구 상일동2303000N1일반2005-05-312020-09-26
939420051A-1000-0239S-10서울 강동구 상일동2303000N1일반2005-05-312020-09-26
949520061A-1000-0239S-10서울 강동구 상일동2388000N1일반2006-05-312020-09-26
959620061A-1000-0239S-10서울 강동구 상일동2388000N1일반2006-05-312020-09-26
969720071A-1000-0239S-10서울 강동구 상일동2448000N1일반2007-05-312020-09-26
979820071A-1000-0239S-10서울 강동구 상일동2448000N1일반2007-05-312020-09-26
989920081A-1000-0239S-10서울 강동구 상일동2492000N1일반2008-05-312020-09-26
9910020081A-1000-0239S-10서울 강동구 상일동2492000N1일반2008-05-312020-09-26