Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows53
Duplicate rows (%)0.5%
Total size in memory732.4 KiB
Average record size in memory75.0 B

Variable types

Categorical4
Text2
Numeric1
DateTime1

Dataset

Description상기 데이터는 연도별 일반건축물에 대한 지방세 부과기준인 시가표준액을 제공하여 물건별 재산가액 확인이 가능하도록 함
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=348&beforeMenuCd=DOM_000000201001001000&publicdatapk=15079984

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
과세년도 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 53 (0.5%) duplicate rowsDuplicates
연면적(제곱미터) is highly skewed (γ1 = 25.60892965)Skewed

Reproduction

Analysis started2024-01-09 22:09:51.419661
Analysis finished2024-01-09 22:09:52.091258
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충청남도
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도
2nd row충청남도
3rd row충청남도
4th row충청남도
5th row충청남도

Common Values

ValueCountFrequency (%)
충청남도 10000
100.0%

Length

2024-01-10T07:09:52.140514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:09:52.206051image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청남도 10000
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
부여군
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부여군
2nd row부여군
3rd row부여군
4th row부여군
5th row부여군

Common Values

ValueCountFrequency (%)
부여군 10000
100.0%

Length

2024-01-10T07:09:52.279775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:09:52.351722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부여군 10000
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
44760
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row44760
2nd row44760
3rd row44760
4th row44760
5th row44760

Common Values

ValueCountFrequency (%)
44760 10000
100.0%

Length

2024-01-10T07:09:52.421957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:09:52.490484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
44760 10000
100.0%

과세년도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2021
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 10000
100.0%

Length

2024-01-10T07:09:52.559805image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:09:52.625246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 10000
100.0%
Distinct9155
Distinct (%)91.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T07:09:52.844778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length35
Mean length27.8202
Min length19

Characters and Unicode

Total characters278202
Distinct characters210
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8686 ?
Unique (%)86.9%

Sample

1st row충청남도 부여군 성흥로97번길 6 0000동 0102호
2nd row충청남도 부여군 남면 마정리 1165-7 1동 102호
3rd row충청남도 부여군 다근로 127-4 0000동 0101호
4th row충청남도 부여군 석성면 현내리 363-3 101호
5th row충청남도 부여군 회곡저실로 81-29 0000동 0103호
ValueCountFrequency (%)
충청남도 10000
 
16.0%
부여군 10000
 
16.0%
101호 3275
 
5.2%
0000동 2083
 
3.3%
1동 2025
 
3.2%
102호 1455
 
2.3%
부여읍 1219
 
2.0%
0101호 1175
 
1.9%
규암면 837
 
1.3%
103호 798
 
1.3%
Other values (4314) 29625
47.4%
2024-01-10T07:09:53.204785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52494
18.9%
0 24741
 
8.9%
1 24143
 
8.7%
11353
 
4.1%
11279
 
4.1%
10655
 
3.8%
10478
 
3.8%
10477
 
3.8%
10139
 
3.6%
10101
 
3.6%
Other values (200) 102342
36.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 138817
49.9%
Decimal Number 80882
29.1%
Space Separator 52494
 
18.9%
Dash Punctuation 6009
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11353
 
8.2%
11279
 
8.1%
10655
 
7.7%
10478
 
7.5%
10477
 
7.5%
10139
 
7.3%
10101
 
7.3%
10059
 
7.2%
7449
 
5.4%
6232
 
4.5%
Other values (188) 40595
29.2%
Decimal Number
ValueCountFrequency (%)
0 24741
30.6%
1 24143
29.8%
2 8195
 
10.1%
3 5420
 
6.7%
4 4229
 
5.2%
5 3375
 
4.2%
6 3195
 
4.0%
7 2699
 
3.3%
8 2525
 
3.1%
9 2360
 
2.9%
Space Separator
ValueCountFrequency (%)
52494
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6009
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 139385
50.1%
Hangul 138817
49.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11353
 
8.2%
11279
 
8.1%
10655
 
7.7%
10478
 
7.5%
10477
 
7.5%
10139
 
7.3%
10101
 
7.3%
10059
 
7.2%
7449
 
5.4%
6232
 
4.5%
Other values (188) 40595
29.2%
Common
ValueCountFrequency (%)
52494
37.7%
0 24741
17.8%
1 24143
17.3%
2 8195
 
5.9%
- 6009
 
4.3%
3 5420
 
3.9%
4 4229
 
3.0%
5 3375
 
2.4%
6 3195
 
2.3%
7 2699
 
1.9%
Other values (2) 4885
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 139385
50.1%
Hangul 138817
49.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
52494
37.7%
0 24741
17.8%
1 24143
17.3%
2 8195
 
5.9%
- 6009
 
4.3%
3 5420
 
3.9%
4 4229
 
3.0%
5 3375
 
2.4%
6 3195
 
2.3%
7 2699
 
1.9%
Other values (2) 4885
 
3.5%
Hangul
ValueCountFrequency (%)
11353
 
8.2%
11279
 
8.1%
10655
 
7.7%
10478
 
7.5%
10477
 
7.5%
10139
 
7.3%
10101
 
7.3%
10059
 
7.2%
7449
 
5.4%
6232
 
4.5%
Other values (188) 40595
29.2%
Distinct8148
Distinct (%)81.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T07:09:53.453109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length11.1206
Min length8

Characters and Unicode

Total characters111206
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7223 ?
Unique (%)72.2%

Sample

1st row 3,627,000
2nd row 37,950,000
3rd row 4,263,840
4th row 476,000
5th row 330,600
ValueCountFrequency (%)
5,232,600 39
 
0.4%
489,600 21
 
0.2%
432,000 20
 
0.2%
264,000 18
 
0.2%
360,000 16
 
0.2%
633,600 14
 
0.1%
66,000 14
 
0.1%
396,000 13
 
0.1%
561,600 13
 
0.1%
1,320,000 12
 
0.1%
Other values (8138) 9820
98.2%
2024-01-10T07:09:53.817649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 23352
21.0%
20000
18.0%
, 18055
16.2%
1 6984
 
6.3%
2 6942
 
6.2%
4 5998
 
5.4%
6 5542
 
5.0%
5 5388
 
4.8%
8 5219
 
4.7%
3 5170
 
4.6%
Other values (2) 8556
 
7.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 73151
65.8%
Space Separator 20000
 
18.0%
Other Punctuation 18055
 
16.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 23352
31.9%
1 6984
 
9.5%
2 6942
 
9.5%
4 5998
 
8.2%
6 5542
 
7.6%
5 5388
 
7.4%
8 5219
 
7.1%
3 5170
 
7.1%
7 4374
 
6.0%
9 4182
 
5.7%
Space Separator
ValueCountFrequency (%)
20000
100.0%
Other Punctuation
ValueCountFrequency (%)
, 18055
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 111206
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 23352
21.0%
20000
18.0%
, 18055
16.2%
1 6984
 
6.3%
2 6942
 
6.2%
4 5998
 
5.4%
6 5542
 
5.0%
5 5388
 
4.8%
8 5219
 
4.7%
3 5170
 
4.6%
Other values (2) 8556
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 111206
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 23352
21.0%
20000
18.0%
, 18055
16.2%
1 6984
 
6.3%
2 6942
 
6.2%
4 5998
 
5.4%
6 5542
 
5.0%
5 5388
 
4.8%
8 5219
 
4.7%
3 5170
 
4.6%
Other values (2) 8556
 
7.7%

연면적(제곱미터)
Real number (ℝ)

SKEWED 

Distinct5402
Distinct (%)54.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean206.91201
Minimum1.04
Maximum29473.94
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:09:53.933774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.04
5-th percentile12
Q136.3
median95.86
Q3198
95-th percentile756.34
Maximum29473.94
Range29472.9
Interquartile range (IQR)161.7

Descriptive statistics

Standard deviation555.20822
Coefficient of variation (CV)2.6833059
Kurtosis1115.266
Mean206.91201
Median Absolute Deviation (MAD)69.42
Skewness25.60893
Sum2069120.1
Variance308256.17
MonotonicityNot monotonic
2024-01-10T07:09:54.237290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18.0 298
 
3.0%
16.5 118
 
1.2%
50.0 49
 
0.5%
16.2 48
 
0.5%
27.0 41
 
0.4%
36.0 40
 
0.4%
40.0 39
 
0.4%
198.0 39
 
0.4%
10.0 38
 
0.4%
12.0 34
 
0.3%
Other values (5392) 9256
92.6%
ValueCountFrequency (%)
1.04 1
 
< 0.1%
1.1 1
 
< 0.1%
1.44 6
0.1%
1.56 1
 
< 0.1%
1.8 1
 
< 0.1%
1.87 1
 
< 0.1%
1.92 1
 
< 0.1%
2.0 2
 
< 0.1%
2.1 1
 
< 0.1%
2.12 1
 
< 0.1%
ValueCountFrequency (%)
29473.94 1
< 0.1%
23255.97 1
< 0.1%
10443.0 1
< 0.1%
10353.42 1
< 0.1%
9888.0 1
< 0.1%
8623.48 1
< 0.1%
6422.37 1
< 0.1%
6096.24 1
< 0.1%
5859.0 1
< 0.1%
5379.0 1
< 0.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-06-01 00:00:00
Maximum2021-06-01 00:00:00
2024-01-10T07:09:54.325800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:09:54.399603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-10T07:09:51.837735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-01-10T07:09:51.935123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:09:52.037449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도물건지 주소시가표준액(원)연면적(제곱미터)데이터기준일자
17931충청남도부여군447602021충청남도 부여군 성흥로97번길 6 0000동 0102호3,627,00080.62021-06-01
20192충청남도부여군447602021충청남도 부여군 남면 마정리 1165-7 1동 102호37,950,000230.02021-06-01
22917충청남도부여군447602021충청남도 부여군 다근로 127-4 0000동 0101호4,263,84022.682021-06-01
12358충청남도부여군447602021충청남도 부여군 석성면 현내리 363-3 101호476,00029.752021-06-01
15594충청남도부여군447602021충청남도 부여군 회곡저실로 81-29 0000동 0103호330,60033.062021-06-01
8317충청남도부여군447602021충청남도 부여군 성흥로75번길 27 0000동 0105호3,136,00098.02021-06-01
9625충청남도부여군447602021충청남도 부여군 홍산면 조현리 478-1 1동 103호480,26043.662021-06-01
14432충청남도부여군447602021충청남도 부여군 은산면 가중리 345-1 101호523,800174.62021-06-01
10477충청남도부여군447602021충청남도 부여군 석성면 증산리 120-5 108호1,400,000280.02021-06-01
7288충청남도부여군447602021충청남도 부여군 옥산면 가덕리 290-20 101호1,134,00021.02021-06-01
시도명시군구명자치단체코드과세년도물건지 주소시가표준액(원)연면적(제곱미터)데이터기준일자
18116충청남도부여군447602021충청남도 부여군 양화면 벽용리 199 1동 102호17,798,220539.342021-06-01
10377충청남도부여군447602021충청남도 부여군 장암면 하황리 546-4 102호5,577,000169.02021-06-01
12064충청남도부여군447602021충청남도 부여군 세도면 귀덕리 583-3 101호432,00018.02021-06-01
8535충청남도부여군447602021충청남도 부여군 임천면 만사리 96-2 105호10,890,000330.02021-06-01
21945충청남도부여군447602021충청남도 부여군 초촌면 신암리 147 1동 101호26,908,00086.82021-06-01
1330충청남도부여군447602021충청남도 부여군 규암면 합송리 216-31 101호860,00010.02021-06-01
2522충청남도부여군447602021충청남도 부여군 내산면 금지리 69-6 1동 101호74,496,000768.02021-06-01
17799충청남도부여군447602021충청남도 부여군 홍산면 조현리 127 101호335,22055.872021-06-01
8302충청남도부여군447602021충청남도 부여군 성흥로 91 0000동 0302호17,285,60052.72021-06-01
13466충청남도부여군447602021충청남도 부여군 임천면 탑산리 666 101호6,370,000140.02021-06-01

Duplicate rows

Most frequently occurring

시도명시군구명자치단체코드과세년도물건지 주소시가표준액(원)연면적(제곱미터)데이터기준일자# duplicates
45충청남도부여군447602021충청남도 부여군 초촌면 응평리 466 1동 101호13,027,200110.42021-06-017
23충청남도부여군447602021충청남도 부여군 세도면 사산리 595 1동 101호42,409,510296.572021-06-015
0충청남도부여군447602021충청남도 부여군 구룡면 구봉리 684-6 1동 101호124,650,1201222.062021-06-013
4충청남도부여군447602021충청남도 부여군 규암면 합정리 575 2동 101호54,741,96098.282021-06-013
6충청남도부여군447602021충청남도 부여군 남면 마정리 1146-9 1동 101호112,000,0001400.02021-06-013
8충청남도부여군447602021충청남도 부여군 남면 마정리 1146-9 1동 103호14,400,000100.02021-06-013
14충청남도부여군447602021충청남도 부여군 석성면 증산리 1079-3 1동 101호9,190,260113.462021-06-013
15충청남도부여군447602021충청남도 부여군 석성면 증산리 1320-214 1동 101호3,672,000108.02021-06-013
17충청남도부여군447602021충청남도 부여군 석성면 증산리 700-4 1동 101호16,992,000144.02021-06-013
25충청남도부여군447602021충청남도 부여군 세도면 청송리 40-1 1동 101호8,147,52066.242021-06-013