Overview

Dataset statistics

Number of variables: 8
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 14
Duplicate rows (%): 0.1%
Total size in memory: 742.2 KiB
Average record size in memory: 76.0 B
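
The derived figures in this table follow from the raw counts. The short check below is only a sketch: it assumes that "KiB" means 1024 bytes and that the report rounds to one decimal place. The variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 10_000
n_duplicate_rows = 14
total_size_kib = 742.2

# Share of duplicate rows, in percent.
duplicate_pct = n_duplicate_rows / n_rows * 100

# Average record size in bytes (assumption: 1 KiB = 1024 bytes).
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows: {duplicate_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results agree with the reported 0.1% and 76.0 B once rounded.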

Variable types

Categorical: 5
Text: 1
Numeric: 2

Dataset

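Description: Provides the standard market value (시가표준액), the assessment basis for local taxes, of general buildings in Boryeong-si, Chungcheongnam-do, for 2017 through 2021. *For reference when comparing property values by item.
Author: 충청남도 (Chungcheongnam-do)
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=348&beforeMenuCd=DOM_000000201001001000&publicdatapk=15079936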

Alerts

시도명 has constant value "충청남도" (Constant)
시군구명 has constant value "보령시" (Constant)
자치단체코드 has constant value "44180" (Constant)
Dataset has 14 (0.1%) duplicate rows (Duplicates)
과세년도 is highly overall correlated with 기준일자 (High correlation)
기준일자 is highly overall correlated with 과세년도 (High correlation)
시가표준액 is highly overall correlated with 연면적 (High correlation)
연면적 is highly overall correlated with 시가표준액 (High correlation)
시가표준액 is highly skewed (γ1 = 78.26586962) (Skewed)
연면적 is highly skewed (γ1 = 35.46792757) (Skewed)
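
The γ1 values in the skewness alerts measure how asymmetric a distribution is. The sketch below computes the population (Fisher-Pearson) form of γ1 on hypothetical data. The report does not state which estimator it uses, so treat this as an illustration of the quantity rather than a reproduction of the reported numbers. The function name and both sample lists are assumptions.

```python
def skewness(values):
    """Population skewness: third central moment divided by variance ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Hypothetical data, not taken from the dataset.
symmetric = [1, 2, 3, 4, 5]
right_tailed = [1, 2, 3, 4, 500]  # a single large value creates a long right tail

print(skewness(symmetric))
print(skewness(right_tailed))
```

A symmetric sample gives a value of zero, while a few very large observations push γ1 well above zero, which is the pattern the alerts flag for 시가표준액 and 연면적.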

Reproduction

Analysis started: 2024-01-09 20:55:40.414393
Analysis finished: 2024-01-09 20:55:41.482927
Duration: 1.07 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
충청남도  10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 충청남도
2nd row: 충청남도
3rd row: 충청남도
4th row: 충청남도
5th row: 충청남도

Common Values

Value  Count  Frequency (%)
충청남도  10000  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
충청남도  10000  100.0%

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
보령시  10000

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보령시
2nd row: 보령시
3rd row: 보령시
4th row: 보령시
5th row: 보령시

Common Values

Value  Count  Frequency (%)
보령시  10000  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
보령시  10000  100.0%

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
44180  10000

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 44180
2nd row: 44180
3rd row: 44180
4th row: 44180
5th row: 44180

Common Values

Value  Count  Frequency (%)
44180  10000  100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
44180  10000  100.0%

과세년도
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
2020  2755
2019  2587
2018  2564
2017  2094

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2019
3rd row: 2020
4th row: 2019
5th row: 2018

Common Values

Value  Count  Frequency (%)
2020  2755  27.6%
2019  2587  25.9%
2018  2564  25.6%
2017  2094  20.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
2020  2755  27.6%
2019  2587  25.9%
2018  2564  25.6%
2017  2094  20.9%

물건지
Text

Distinct: 8286
Distinct (%): 82.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 33
Median length: 30
Mean length: 25.3606
Min length: 19

Characters and Unicode

Total characters: 253606
Distinct characters: 270
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 6933
Unique (%): 69.3%

Sample

1st row: [ 대천항로 371 ] 0000동 0101호
2nd row: 충청남도 보령시 미산면 풍계리 439-2 101호
3rd row: 충청남도 보령시 주산면 황율리 141-3 101호
4th row: [ 옥마벚길 64 ] 0000동 0101호
5th row: [ 지장골길 109 ] 0000동 0101호
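
The sample rows show two address layouts: a bracketed road-name form and a lot-number form that starts with the province name. The sketch below separates them with a regular expression. It assumes that every road-name address begins with "[" as in the sample, which seems consistent with the character counts later in this section (3850 opening brackets and 6150 occurrences of 충청남도, summing to 10000 rows). The pattern and the function name are illustrative.

```python
import re

# Road-name addresses in the sample are wrapped in square brackets,
# e.g. "[ 대천항로 371 ] 0000동 0101호".
ROAD_NAME = re.compile(r"^\[\s*(?P<road>.+?)\s+(?P<number>[\d-]+)\s*\]")

def address_kind(address: str) -> str:
    """Return 'road' for bracketed road-name addresses, otherwise 'lot'."""
    return "road" if ROAD_NAME.match(address) else "lot"

# The five sample rows listed for this variable.
samples = [
    "[ 대천항로 371 ] 0000동 0101호",
    "충청남도 보령시 미산면 풍계리 439-2 101호",
    "충청남도 보령시 주산면 황율리 141-3 101호",
    "[ 옥마벚길 64 ] 0000동 0101호",
    "[ 지장골길 109 ] 0000동 0101호",
]
kinds = [address_kind(s) for s in samples]
print(kinds)
```

Splitting the column this way before any further parsing avoids applying lot-number rules to road-name rows, and the named groups give direct access to the road name and building number.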
ValueCountFrequency (%)
7700
 
12.9%
충청남도 6150
 
10.3%
보령시 6150
 
10.3%
0000동 3555
 
5.9%
101호 2659
 
4.4%
0101호 1478
 
2.5%
1동 976
 
1.6%
102호 921
 
1.5%
천북면 893
 
1.5%
대천동 633
 
1.1%
Other values (4443) 28659
47.9%

Most occurring characters

ValueCountFrequency (%)
49774
19.6%
0 30673
 
12.1%
1 22432
 
8.8%
10115
 
4.0%
2 8198
 
3.2%
7967
 
3.1%
6944
 
2.7%
6864
 
2.7%
6526
 
2.6%
6442
 
2.5%
Other values (260) 97671
38.5%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  104600  41.2%
Decimal Number  86340  34.0%
Space Separator  49774  19.6%
Dash Punctuation  5192  2.0%
Close Punctuation  3850  1.5%
Open Punctuation  3850  1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10115
 
9.7%
7967
 
7.6%
6944
 
6.6%
6864
 
6.6%
6526
 
6.2%
6442
 
6.2%
6343
 
6.1%
6308
 
6.0%
6251
 
6.0%
4316
 
4.1%
Other values (246) 36524
34.9%
Decimal Number
Value  Count  Frequency (%)
0  30673  35.5%
1  22432  26.0%
2  8198  9.5%
3  5014  5.8%
4  4513  5.2%
7  3344  3.9%
5  3331  3.9%
6  3296  3.8%
8  3195  3.7%
9  2344  2.7%
Space Separator
Value  Count  Frequency (%)
(space)  49774  100.0%
Dash Punctuation
Value  Count  Frequency (%)
-  5192  100.0%
Close Punctuation
Value  Count  Frequency (%)
]  3850  100.0%
Open Punctuation
Value  Count  Frequency (%)
[  3850  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Common  149006  58.8%
Hangul  104600  41.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10115
 
9.7%
7967
 
7.6%
6944
 
6.6%
6864
 
6.6%
6526
 
6.2%
6442
 
6.2%
6343
 
6.1%
6308
 
6.0%
6251
 
6.0%
4316
 
4.1%
Other values (246) 36524
34.9%
Common
ValueCountFrequency (%)
49774
33.4%
0 30673
20.6%
1 22432
15.1%
2 8198
 
5.5%
- 5192
 
3.5%
3 5014
 
3.4%
4 4513
 
3.0%
] 3850
 
2.6%
[ 3850
 
2.6%
7 3344
 
2.2%
Other values (4) 12166
 
8.2%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  149006  58.8%
Hangul  104600  41.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
49774
33.4%
0 30673
20.6%
1 22432
15.1%
2 8198
 
5.5%
- 5192
 
3.5%
3 5014
 
3.4%
4 4513
 
3.0%
] 3850
 
2.6%
[ 3850
 
2.6%
7 3344
 
2.2%
Other values (4) 12166
 
8.2%
Hangul
ValueCountFrequency (%)
10115
 
9.7%
7967
 
7.6%
6944
 
6.6%
6864
 
6.6%
6526
 
6.2%
6442
 
6.2%
6343
 
6.1%
6308
 
6.0%
6251
 
6.0%
4316
 
4.1%
Other values (246) 36524
34.9%

시가표준액
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 8968
Distinct (%): 89.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 70089709
Minimum: 12080
Maximum: 4.7260732 × 10^10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 12080
5-th percentile: 522000
Q1: 3238630
median: 18455870
Q3: 64977525
95-th percentile: 2.4403455 × 10^8
Maximum: 4.7260732 × 10^10
Range: 4.726072 × 10^10
Interquartile range (IQR): 61738895

Descriptive statistics

Standard deviation: 5.1446701 × 10^8
Coefficient of variation (CV): 7.3401219
Kurtosis: 7093.2239
Mean: 70089709
Median Absolute Deviation (MAD): 17225040
Skewness: 78.26587
Sum: 7.0089709 × 10^11
Variance: 2.646763 × 10^17
Monotonicity: Not monotonic
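
Several of the 시가표준액 statistics are functions of the others, which gives a quick way to confirm that the fused digits in this section have been read correctly. The sketch below recomputes them from the reported values. The inputs are rounded as printed in the report, so agreement is only expected to the displayed precision, and the variable names are illustrative.

```python
# Figures copied from the 시가표준액 statistics in this section.
n_rows = 10_000
mean = 70_089_709
std_dev = 5.1446701e8
q1, q3 = 3_238_630, 64_977_525
minimum = 12_080
maximum = 47_260_731_910  # exact value from the largest-values listing

cv = std_dev / mean              # coefficient of variation
variance = std_dev ** 2          # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
total = mean * n_rows            # sum of all values

print(f"CV = {cv:.7f}")
print(f"Variance = {variance:.6e}")
print(f"IQR = {iqr}")
print(f"Range = {value_range:.6e}")
print(f"Sum = {total:.7e}")
```

The recomputed interquartile range matches the reported 61738895 exactly, which supports reading the quartile lines as Q1 = 3238630 and Q3 = 64977525.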
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
50824800  21  0.2%
55770000  11  0.1%
97767090  10  0.1%
90000  10  0.1%
756000  10  0.1%
792000  9  0.1%
21674730  9  0.1%
16756330  9  0.1%
22284570  9  0.1%
34413440  8  0.1%
Other values (8958)  9894  98.9%

Value  Count  Frequency (%)
12080  1  < 0.1%
14440  1  < 0.1%
20080  1  < 0.1%
22780  1  < 0.1%
34000  1  < 0.1%
41600  1  < 0.1%
42000  1  < 0.1%
44000  1  < 0.1%
44800  1  < 0.1%
45000  2  < 0.1%

Value  Count  Frequency (%)
47260731910  1  < 0.1%
8567550000  1  < 0.1%
5747120320  1  < 0.1%
4708265100  1  < 0.1%
4218933600  1  < 0.1%
3874267550  1  < 0.1%
3725791200  1  < 0.1%
3669558660  1  < 0.1%
3521324410  1  < 0.1%
3485028750  1  < 0.1%

연면적
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 5816
Distinct (%): 58.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 228.38397
Minimum: 0.19
Maximum: 47500.61
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0.19
5-th percentile: 14.7035
Q1: 48.7425
median: 105.39
Q3: 205.94
95-th percentile: 783.3405
Maximum: 47500.61
Range: 47500.42
Interquartile range (IQR): 157.1975

Descriptive statistics

Standard deviation: 724.47461
Coefficient of variation (CV): 3.172178
Kurtosis: 1996.7577
Mean: 228.38397
Median Absolute Deviation (MAD): 69.6343
Skewness: 35.467928
Sum: 2283839.7
Variance: 524863.46
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
18.0  143  1.4%
198.0  41  0.4%
27.0  35  0.4%
36.0  31  0.3%
25.41  29  0.3%
72.0  27  0.3%
16.37  26  0.3%
192.0  25  0.2%
60.0  25  0.2%
12.0  25  0.2%
Other values (5806)  9593  95.9%

Value  Count  Frequency (%)
0.19  1  < 0.1%
1.0  2  < 0.1%
1.31  2  < 0.1%
1.64  1  < 0.1%
1.8  2  < 0.1%
1.92  1  < 0.1%
1.98  2  < 0.1%
2.0  4  < 0.1%
2.02  1  < 0.1%
2.2  3  < 0.1%

Value  Count  Frequency (%)
47500.61  1  < 0.1%
24435.62  1  < 0.1%
17775.0  1  < 0.1%
12885.92  1  < 0.1%
10349.42  1  < 0.1%
9293.41  1  < 0.1%
8370.9  1  < 0.1%
7964.19  1  < 0.1%
7256.5  1  < 0.1%
7247.39  1  < 0.1%

기준일자
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
2020-12-31  2755
2019-12-31  2587
2018-12-31  2564
2017-12-31  2094

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020-12-31
2nd row: 2019-12-31
3rd row: 2020-12-31
4th row: 2019-12-31
5th row: 2018-12-31

Common Values

Value  Count  Frequency (%)
2020-12-31  2755  27.6%
2019-12-31  2587  25.9%
2018-12-31  2564  25.6%
2017-12-31  2094  20.9%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
2020-12-31  2755  27.6%
2019-12-31  2587  25.9%
2018-12-31  2564  25.6%
2017-12-31  2094  20.9%
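
The value counts for 기준일자 are identical to those for 과세년도, and every sampled date is 31 December of the sampled year, which would explain the perfect correlation flagged in the alerts. The sketch below checks that relationship on the four distinct pairs. The pairing is an assumption based on the matching counts and sample rows, and the function name is illustrative.

```python
from datetime import date

# (과세년도, 기준일자) pairs, matched by their identical counts in this report.
pairs = [
    ("2020", "2020-12-31"),
    ("2019", "2019-12-31"),
    ("2018", "2018-12-31"),
    ("2017", "2017-12-31"),
]

def is_year_end(year: str, base_date: str) -> bool:
    """True when base_date is 31 December of the given tax year."""
    return date.fromisoformat(base_date) == date(int(year), 12, 31)

redundant = all(is_year_end(year, base_date) for year, base_date in pairs)
print(redundant)
```

If the same check holds on the full data, one of the two columns carries no extra information and can be dropped or derived from the other.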

Interactions


Correlations

과세년도  시가표준액  연면적  기준일자
과세년도  1.000  0.000  0.000  1.000
시가표준액  0.000  1.000  0.997  0.000
연면적  0.000  0.997  1.000  0.000
기준일자  1.000  0.000  0.000  1.000

과세년도  기준일자
과세년도  1.000  1.000
기준일자  1.000  1.000

시가표준액  연면적  과세년도  기준일자
시가표준액  1.000  0.594  0.000  0.000
연면적  0.594  1.000  0.000  0.000
과세년도  0.000  0.000  1.000  1.000
기준일자  0.000  0.000  1.000  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명  시군구명  자치단체코드  과세년도  물건지  시가표준액  연면적  기준일자
22501충청남도보령시441802020[ 대천항로 371 ] 0000동 0101호170184580163.52112020-12-31
39769충청남도보령시441802019충청남도 보령시 미산면 풍계리 439-2 101호443275037.252019-12-31
1955충청남도보령시441802020충청남도 보령시 주산면 황율리 141-3 101호1579560394.892020-12-31
41284충청남도보령시441802019[ 옥마벚길 64 ] 0000동 0101호6644411901020.6472019-12-31
72687충청남도보령시441802018[ 지장골길 109 ] 0000동 0101호3890264063.382018-12-31
77483충청남도보령시441802018충청남도 보령시 천북면 하만리 192-14 1동 104호2900800725.22018-12-31
22929충청남도보령시441802020충청남도 보령시 남곡동 842 101호125061120294.42020-12-31
81508충청남도보령시441802017[ 해안로 45 ] 0000동 0301호73919440106.822017-12-31
59100충청남도보령시441802018충청남도 보령시 내항동 928 101호165280082.642018-12-31
29171충청남도보령시441802019충청남도 보령시 천북면 낙동리 763-3 102호21000021.02019-12-31
시도명  시군구명  자치단체코드  과세년도  물건지  시가표준액  연면적  기준일자
87675충청남도보령시441802017[ 읍내냇둑길 96 ] 0000동 0101호521520098.42017-12-31
75145충청남도보령시441802018충청남도 보령시 오천면 소성리 66-4 101호450918019.272018-12-31
99808충청남도보령시441802017[ 열린바다로 271 ] 0000동 0202호110009640189.022017-12-31
25735충청남도보령시441802020[ 소쟁이길 9 ] 0000동 0102호47520052.82020-12-31
26071충청남도보령시441802020[ 구시1길 89 ] 0000동 0101호1372800031.22020-12-31
95522충청남도보령시441802017충청남도 보령시 신흑동 1945 1동 8103호1622412022.442017-12-31
26772충청남도보령시441802020[ 한내여중길 79 ] 0000동 0201호3931200100.82020-12-31
51232충청남도보령시441802019충청남도 보령시 청라면 장현리 545-7 1동 105호238800039.82019-12-31
23026충청남도보령시441802020충청남도 보령시 신흑동 1913 101호4360404078.342020-12-31
27608충청남도보령시441802019충청남도 보령시 오천면 삽시도리 산 48-1 34동 101호309816079.442019-12-31

Duplicate rows

Most frequently occurring

시도명  시군구명  자치단체코드  과세년도  물건지  시가표준액  연면적  기준일자  # duplicates
0충청남도보령시441802017충청남도 보령시 신흑동 800-1 104호211550042.312017-12-312
1충청남도보령시441802017충청남도 보령시 오천면 오포리 773 116호115406200309.42017-12-312
2충청남도보령시441802017충청남도 보령시 오천면 오포리 773 126호1808256065.282017-12-312
3충청남도보령시441802017충청남도 보령시 천북면 궁포리 381-2 1동 101호136080001512.02017-12-312
4충청남도보령시441802017충청남도 보령시 청라면 장현리 산 52-2 1동 101호1144260023.42017-12-312
5충청남도보령시441802018[ 홀뫼3길 39 ] 0000동 0101호1400000350.02018-12-312
6충청남도보령시441802018충청남도 보령시 웅천읍 구룡리 3-5 1동 101호150675840497.282018-12-312
7충청남도보령시441802018충청남도 보령시 천북면 하만리 391-1 1동 101호1320000330.02018-12-312
8충청남도보령시441802019충청남도 보령시 신흑동 800-1 74호260499036.692019-12-312
9충청남도보령시441802019충청남도 보령시 오천면 영보리 304-2 1동 101호12390000210.02019-12-312
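
Duplicate rows like those listed here can be found by counting identical records. The sketch below does this with the standard library on hypothetical rows. The tuples and their three fields are invented for illustration and are not taken from the dataset.

```python
from collections import Counter

# Hypothetical rows (과세년도, 물건지, 연면적); not taken from the dataset.
rows = [
    ("2017", "A동 101호", 42.31),
    ("2017", "A동 101호", 42.31),
    ("2018", "B동 102호", 350.0),
    ("2019", "C동 103호", 210.0),
    ("2018", "B동 102호", 350.0),
]

# Count how often each complete row occurs.
counts = Counter(rows)

# Keep only rows that occur more than once, most frequent first.
duplicates = [(row, n) for row, n in counts.most_common() if n > 1]

for row, n in duplicates:
    print(row, n)
```

Rows are compared as whole tuples, so two records count as duplicates only when every field matches, which is how the "# duplicates" column in the listing reads.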