Overview

Dataset statistics

Number of variables12
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.0 KiB
Average record size in memory102.3 B

Variable types

Numeric2
Categorical6
Text1
Boolean1
DateTime2

Alerts

기준년도 has constant value ""Constant
기준월 has constant value ""Constant
지점 has constant value ""Constant
법정동명 has constant value ""Constant
특수지구분코드 has constant value ""Constant
특수지구분명 has constant value ""Constant
공시일자 has constant value ""Constant
데이터기준일자 has constant value ""Constant
표준지여부 is highly imbalanced (75.8%)Imbalance
기본키 has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:15:59.717641
Analysis finished2023-12-10 10:16:01.488874
Duration1.77 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기본키
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:16:01.663082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T19:16:01.942665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

기준년도
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2021
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 100
100.0%

Length

2023-12-10T19:16:02.166399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:02.319535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 100
100.0%

기준월
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2023-12-10T19:16:02.502679image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:02.664154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

지점
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
A-1000-0239S-10
100 

Length

Max length15
Median length15
Mean length15
Min length15

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA-1000-0239S-10
2nd rowA-1000-0239S-10
3rd rowA-1000-0239S-10
4th rowA-1000-0239S-10
5th rowA-1000-0239S-10

Common Values

ValueCountFrequency (%)
A-1000-0239S-10 100
100.0%

Length

2023-12-10T19:16:02.823852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:02.981922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
a-1000-0239s-10 100
100.0%

법정동명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
서울 강동구 상일동
100 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울 강동구 상일동
2nd row서울 강동구 상일동
3rd row서울 강동구 상일동
4th row서울 강동구 상일동
5th row서울 강동구 상일동

Common Values

ValueCountFrequency (%)
서울 강동구 상일동 100
100.0%

Length

2023-12-10T19:16:03.149634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:03.346411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울 100
33.3%
강동구 100
33.3%
상일동 100
33.3%

지번
Text

Distinct50
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T19:16:03.678647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length4
Mean length3.5
Min length1

Characters and Unicode

Total characters350
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row2
4th row2
5th row2-1
ValueCountFrequency (%)
1 2
 
2.0%
12-12 2
 
2.0%
120 2
 
2.0%
12 2
 
2.0%
12-2 2
 
2.0%
12-3 2
 
2.0%
12-4 2
 
2.0%
12-6 2
 
2.0%
12-8 2
 
2.0%
12-9 2
 
2.0%
Other values (40) 80
80.0%
2023-12-10T19:16:04.244931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 86
24.6%
- 84
24.0%
2 66
18.9%
4 38
10.9%
3 20
 
5.7%
8 20
 
5.7%
6 12
 
3.4%
0 10
 
2.9%
7 6
 
1.7%
5 4
 
1.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 266
76.0%
Dash Punctuation 84
 
24.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 86
32.3%
2 66
24.8%
4 38
14.3%
3 20
 
7.5%
8 20
 
7.5%
6 12
 
4.5%
0 10
 
3.8%
7 6
 
2.3%
5 4
 
1.5%
9 4
 
1.5%
Dash Punctuation
ValueCountFrequency (%)
- 84
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 350
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 86
24.6%
- 84
24.0%
2 66
18.9%
4 38
10.9%
3 20
 
5.7%
8 20
 
5.7%
6 12
 
3.4%
0 10
 
2.9%
7 6
 
1.7%
5 4
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 350
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 86
24.6%
- 84
24.0%
2 66
18.9%
4 38
10.9%
3 20
 
5.7%
8 20
 
5.7%
6 12
 
3.4%
0 10
 
2.9%
7 6
 
1.7%
5 4
 
1.1%

개별공시지가(원)
Real number (ℝ)

Distinct20
Distinct (%)20.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean759214
Minimum230100
Maximum2235000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:16:04.437882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum230100
5-th percentile230100
Q1412500
median587500
Q3625000
95-th percentile2105000
Maximum2235000
Range2004900
Interquartile range (IQR)212500

Descriptive statistics

Standard deviation589542.53
Coefficient of variation (CV)0.77651694
Kurtosis1.1569006
Mean759214
Median Absolute Deviation (MAD)175000
Skewness1.6167187
Sum75921400
Variance3.475604 × 1011
MonotonicityNot monotonic
2023-12-10T19:16:04.587993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=20)
ValueCountFrequency (%)
412500 24
24.0%
587500 18
18.0%
230100 12
12.0%
618700 6
 
6.0%
596900 4
 
4.0%
558100 4
 
4.0%
1999000 4
 
4.0%
694600 4
 
4.0%
2102000 2
 
2.0%
534300 2
 
2.0%
Other values (10) 20
20.0%
ValueCountFrequency (%)
230100 12
12.0%
412500 24
24.0%
534300 2
 
2.0%
558100 4
 
4.0%
587500 18
18.0%
593700 2
 
2.0%
596900 4
 
4.0%
597300 2
 
2.0%
618700 6
 
6.0%
625000 2
 
2.0%
ValueCountFrequency (%)
2235000 2
2.0%
2126000 2
2.0%
2105000 2
2.0%
2102000 2
2.0%
2083000 2
2.0%
1999000 4
4.0%
1667000 2
2.0%
1080000 2
2.0%
1041000 2
2.0%
694600 4
4.0%

표준지여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size232.0 B
False
96 
True
 
4
ValueCountFrequency (%)
False 96
96.0%
True 4
 
4.0%
2023-12-10T19:16:04.718219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

특수지구분코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2023-12-10T19:16:04.850317image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:04.956751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

특수지구분명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
일반
100 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 100
100.0%

Length

2023-12-10T19:16:05.082916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:16:05.198067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 100
100.0%

공시일자
Date

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2021-05-31 00:00:00
Maximum2021-05-31 00:00:00
2023-12-10T19:16:05.287718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:05.399699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
Minimum2021-08-03 00:00:00
Maximum2021-08-03 00:00:00
2023-12-10T19:16:05.498971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:05.625803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-10T19:16:00.518509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:00.239029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:00.703285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T19:16:00.341660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:16:05.715811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기본키지번개별공시지가(원)표준지여부
기본키1.0001.0000.6940.378
지번1.0001.0001.0001.000
개별공시지가(원)0.6941.0001.0000.119
표준지여부0.3781.0000.1191.000
2023-12-10T19:16:05.850779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기본키개별공시지가(원)표준지여부
기본키1.000-0.4940.277
개별공시지가(원)-0.4941.0000.168
표준지여부0.2770.1681.000

Missing values

2023-12-10T19:16:00.962388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:16:01.357257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기본키기준년도기준월지점법정동명지번개별공시지가(원)표준지여부특수지구분코드특수지구분명공시일자데이터기준일자
0120211A-1000-0239S-10서울 강동구 상일동11999000N1일반2021-05-312021-08-03
1220211A-1000-0239S-10서울 강동구 상일동11999000N1일반2021-05-312021-08-03
2320211A-1000-0239S-10서울 강동구 상일동2558100N1일반2021-05-312021-08-03
3420211A-1000-0239S-10서울 강동구 상일동2558100N1일반2021-05-312021-08-03
4520211A-1000-0239S-10서울 강동구 상일동2-12126000N1일반2021-05-312021-08-03
5620211A-1000-0239S-10서울 강동구 상일동2-12126000N1일반2021-05-312021-08-03
6720211A-1000-0239S-10서울 강동구 상일동2-21999000N1일반2021-05-312021-08-03
7820211A-1000-0239S-10서울 강동구 상일동2-21999000N1일반2021-05-312021-08-03
8920211A-1000-0239S-10서울 강동구 상일동2-32105000Y1일반2021-05-312021-08-03
91020211A-1000-0239S-10서울 강동구 상일동2-32105000Y1일반2021-05-312021-08-03
기본키기준년도기준월지점법정동명지번개별공시지가(원)표준지여부특수지구분코드특수지구분명공시일자데이터기준일자
909120211A-1000-0239S-10서울 강동구 상일동76-4230100N1일반2021-05-312021-08-03
919220211A-1000-0239S-10서울 강동구 상일동76-4230100N1일반2021-05-312021-08-03
929320211A-1000-0239S-10서울 강동구 상일동82-2230100N1일반2021-05-312021-08-03
939420211A-1000-0239S-10서울 강동구 상일동82-2230100N1일반2021-05-312021-08-03
949520211A-1000-0239S-10서울 강동구 상일동112596900N1일반2021-05-312021-08-03
959620211A-1000-0239S-10서울 강동구 상일동112596900N1일반2021-05-312021-08-03
969720211A-1000-0239S-10서울 강동구 상일동120596900N1일반2021-05-312021-08-03
979820211A-1000-0239S-10서울 강동구 상일동120596900N1일반2021-05-312021-08-03
989920211A-1000-0239S-10서울 강동구 상일동1232235000N1일반2021-05-312021-08-03
9910020211A-1000-0239S-10서울 강동구 상일동1232235000N1일반2021-05-312021-08-03