Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows11
Duplicate rows (%)0.1%
Total size in memory742.2 KiB
Average record size in memory76.0 B

Variable types

Categorical4
Text1
Numeric2
DateTime1

Dataset

Description2017~2021년도 충청남도 보령시 일반건축물에 대한 지방세 부과기준인 시가표준액 항목을 제공합니다. *물건별 재산가액 비교에 참조
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=348&beforeMenuCd=DOM_000000201001001000&publicdatapk=15079936

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
Dataset has 11 (0.1%) duplicate rowsDuplicates
시가표준액 is highly overall correlated with 연면적High correlation
연면적 is highly overall correlated with 시가표준액High correlation
시가표준액 is highly skewed (γ1 = 72.86543482)Skewed
연면적 is highly skewed (γ1 = 32.52096364)Skewed

Reproduction

Analysis started2024-01-09 20:55:22.096324
Analysis finished2024-01-09 20:55:23.100788
Duration1 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충청남도
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도
2nd row충청남도
3rd row충청남도
4th row충청남도
5th row충청남도

Common Values

ValueCountFrequency (%)
충청남도 10000
100.0%

Length

2024-01-10T05:55:23.153565image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:55:23.235244image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청남도 10000
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
보령시
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보령시
2nd row보령시
3rd row보령시
4th row보령시
5th row보령시

Common Values

ValueCountFrequency (%)
보령시 10000
100.0%

Length

2024-01-10T05:55:23.321306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:55:23.402511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보령시 10000
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
44180
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row44180
2nd row44180
3rd row44180
4th row44180
5th row44180

Common Values

ValueCountFrequency (%)
44180 10000
100.0%

Length

2024-01-10T05:55:23.484238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:55:23.560614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
44180 10000
100.0%

과세년도
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2021
2856 
2019
2599 
2018
2475 
2017
2070 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2017
3rd row2017
4th row2021
5th row2019

Common Values

ValueCountFrequency (%)
2021 2856
28.6%
2019 2599
26.0%
2018 2475
24.8%
2017 2070
20.7%

Length

2024-01-10T05:55:23.657407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:55:23.761723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 2856
28.6%
2019 2599
26.0%
2018 2475
24.8%
2017 2070
20.7%
Distinct8340
Distinct (%)83.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T05:55:24.037325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length36
Median length33
Mean length25.3173
Min length19

Characters and Unicode

Total characters253173
Distinct characters280
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7014 ?
Unique (%)70.1%

Sample

1st row충청남도 보령시 청라면 장현리 산 52-2 1동 201호
2nd row[ 원동1길 69 ] 0001동 0109호
3rd row[ 간재길 109-10 ] 0001동 0101호
4th row충청남도 보령시 오천면 오포리 773 128호
5th row충청남도 보령시 대천동 500-2 2동 103호
ValueCountFrequency (%)
8006
 
13.4%
충청남도 5997
 
10.0%
보령시 5997
 
10.0%
0000동 3692
 
6.2%
101호 2525
 
4.2%
0101호 1541
 
2.6%
102호 969
 
1.6%
1동 920
 
1.5%
천북면 849
 
1.4%
대천동 633
 
1.1%
Other values (4445) 28624
47.9%
2024-01-10T05:55:24.436827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
49753
19.7%
0 31332
 
12.4%
1 22335
 
8.8%
10097
 
4.0%
2 8216
 
3.2%
8065
 
3.2%
6735
 
2.7%
6690
 
2.6%
6326
 
2.5%
6256
 
2.5%
Other values (270) 97368
38.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 103269
40.8%
Decimal Number 87042
34.4%
Space Separator 49753
19.7%
Dash Punctuation 5103
 
2.0%
Close Punctuation 4003
 
1.6%
Open Punctuation 4003
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10097
 
9.8%
8065
 
7.8%
6735
 
6.5%
6690
 
6.5%
6326
 
6.1%
6256
 
6.1%
6157
 
6.0%
6144
 
5.9%
6071
 
5.9%
4165
 
4.0%
Other values (256) 36563
35.4%
Decimal Number
ValueCountFrequency (%)
0 31332
36.0%
1 22335
25.7%
2 8216
 
9.4%
3 5052
 
5.8%
4 4434
 
5.1%
6 3515
 
4.0%
7 3349
 
3.8%
8 3177
 
3.6%
5 3163
 
3.6%
9 2469
 
2.8%
Space Separator
ValueCountFrequency (%)
49753
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5103
100.0%
Close Punctuation
ValueCountFrequency (%)
] 4003
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 4003
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 149904
59.2%
Hangul 103269
40.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10097
 
9.8%
8065
 
7.8%
6735
 
6.5%
6690
 
6.5%
6326
 
6.1%
6256
 
6.1%
6157
 
6.0%
6144
 
5.9%
6071
 
5.9%
4165
 
4.0%
Other values (256) 36563
35.4%
Common
ValueCountFrequency (%)
49753
33.2%
0 31332
20.9%
1 22335
14.9%
2 8216
 
5.5%
- 5103
 
3.4%
3 5052
 
3.4%
4 4434
 
3.0%
] 4003
 
2.7%
[ 4003
 
2.7%
6 3515
 
2.3%
Other values (4) 12158
 
8.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 149904
59.2%
Hangul 103269
40.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
49753
33.2%
0 31332
20.9%
1 22335
14.9%
2 8216
 
5.5%
- 5103
 
3.4%
3 5052
 
3.4%
4 4434
 
3.0%
] 4003
 
2.7%
[ 4003
 
2.7%
6 3515
 
2.3%
Other values (4) 12158
 
8.1%
Hangul
ValueCountFrequency (%)
10097
 
9.8%
8065
 
7.8%
6735
 
6.5%
6690
 
6.5%
6326
 
6.1%
6256
 
6.1%
6157
 
6.0%
6144
 
5.9%
6071
 
5.9%
4165
 
4.0%
Other values (256) 36563
35.4%

시가표준액
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct8979
Distinct (%)89.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean69380443
Minimum12000
Maximum4.2323044 × 1010
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T05:55:24.572481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum12000
5-th percentile585000
Q13517080
median19328925
Q366021288
95-th percentile2.3359069 × 108
Maximum4.2323044 × 1010
Range4.2323032 × 1010
Interquartile range (IQR)62504208

Descriptive statistics

Standard deviation4.7387972 × 108
Coefficient of variation (CV)6.8301628
Kurtosis6351.0848
Mean69380443
Median Absolute Deviation (MAD)17951020
Skewness72.865435
Sum6.9380443 × 1011
Variance2.2456199 × 1017
MonotonicityNot monotonic
2024-01-10T05:55:24.701888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
49695360 27
 
0.3%
98040570 11
 
0.1%
96946670 11
 
0.1%
97767090 10
 
0.1%
55852500 10
 
0.1%
22767360 9
 
0.1%
90000 8
 
0.1%
21674730 8
 
0.1%
720000 7
 
0.1%
252000 7
 
0.1%
Other values (8969) 9892
98.9%
ValueCountFrequency (%)
12000 1
< 0.1%
15100 1
< 0.1%
20000 1
< 0.1%
22000 1
< 0.1%
29260 1
< 0.1%
34000 1
< 0.1%
35000 1
< 0.1%
38880 1
< 0.1%
42000 1
< 0.1%
42240 1
< 0.1%
ValueCountFrequency (%)
42323043510 1
< 0.1%
8656425000 1
< 0.1%
7928967200 1
< 0.1%
7875587000 1
< 0.1%
4393227000 1
< 0.1%
4253809680 1
< 0.1%
3874267550 1
< 0.1%
3750891250 1
< 0.1%
3225357750 1
< 0.1%
2964851300 1
< 0.1%

연면적
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct5837
Distinct (%)58.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean229.70317
Minimum0.6
Maximum47500.61
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T05:55:24.825533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.6
5-th percentile15.12
Q148
median108.605
Q3207.4425
95-th percentile743.7625
Maximum47500.61
Range47500.01
Interquartile range (IQR)159.4425

Descriptive statistics

Standard deviation757.14525
Coefficient of variation (CV)3.2961898
Kurtosis1695.942
Mean229.70317
Median Absolute Deviation (MAD)71.65
Skewness32.520964
Sum2297031.7
Variance573268.93
MonotonicityNot monotonic
2024-01-10T05:55:24.948122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18.0 162
 
1.6%
198.0 41
 
0.4%
136.7372 32
 
0.3%
94.12 28
 
0.3%
27.0 28
 
0.3%
36.0 27
 
0.3%
25.41 26
 
0.3%
40.0 24
 
0.2%
72.0 24
 
0.2%
192.0 23
 
0.2%
Other values (5827) 9585
95.9%
ValueCountFrequency (%)
0.6 1
 
< 0.1%
0.72 1
 
< 0.1%
0.81 1
 
< 0.1%
1.0 2
< 0.1%
1.3 1
 
< 0.1%
1.44 1
 
< 0.1%
1.76 1
 
< 0.1%
1.8 3
< 0.1%
1.86 1
 
< 0.1%
1.98 1
 
< 0.1%
ValueCountFrequency (%)
47500.61 1
< 0.1%
24435.62 1
< 0.1%
17775.0 1
< 0.1%
14013.5 1
< 0.1%
12871.7 1
< 0.1%
12027.87 1
< 0.1%
11147.3 1
< 0.1%
9579.2 1
< 0.1%
9021.0 1
< 0.1%
8991.76 1
< 0.1%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2017-12-31 00:00:00
Maximum2021-12-31 00:00:00
2024-01-10T05:55:25.293813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:55:25.372664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=4)

Interactions

2024-01-10T05:55:22.762570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:55:22.601792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:55:22.838769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:55:22.681172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:55:25.440029image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
과세년도시가표준액연면적기준일자
과세년도1.0000.0190.0211.000
시가표준액0.0191.0000.9800.019
연면적0.0210.9801.0000.021
기준일자1.0000.0190.0211.000
2024-01-10T05:55:25.520437image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시가표준액연면적과세년도
시가표준액1.0000.5920.008
연면적0.5921.0000.014
과세년도0.0080.0141.000

Missing values

2024-01-10T05:55:22.937015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:55:23.046983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도물건지시가표준액연면적기준일자
50287충청남도보령시441802019충청남도 보령시 청라면 장현리 산 52-2 1동 201호97227820198.022019-12-31
99492충청남도보령시441802017[ 원동1길 69 ] 0001동 0109호110640450155.01292017-12-31
90297충청남도보령시441802017[ 간재길 109-10 ] 0001동 0101호1111680057.62017-12-31
1394충청남도보령시441802021충청남도 보령시 오천면 오포리 773 128호1763668065.082021-12-31
46767충청남도보령시441802019충청남도 보령시 대천동 500-2 2동 103호27135000180.92019-12-31
81228충청남도보령시441802017충청남도 보령시 요암동 430-6 101호754020251.342017-12-31
96735충청남도보령시441802017[ 지장골길 155 ] 0002동 0009호1499828039.162017-12-31
4306충청남도보령시441802021충청남도 보령시 청라면 향천리 204 1동 103호141561028.892021-12-31
80471충청남도보령시441802017충청남도 보령시 주교면 주교리 217-11 103호84000024.02017-12-31
57150충청남도보령시441802018충청남도 보령시 웅천읍 대창리 959-3 101호93633920254.442018-12-31
시도명시군구명자치단체코드과세년도물건지시가표준액연면적기준일자
43993충청남도보령시441802019충청남도 보령시 오천면 오포리 773 106동 43호338688015.122019-12-31
11761충청남도보령시441802021[ 토정로 612 ] 0000동 0101호43755087.512021-12-31
22091충청남도보령시441802021충청남도 보령시 명천동 524-5 9996동 101호217440018.02021-12-31
48221충청남도보령시441802019[ 대흥로 51 ] 0000동 0164호1780237016.372019-12-31
841충청남도보령시441802021충청남도 보령시 웅천읍 관당리 654-4 1동 101호55286550252.452021-12-31
33665충청남도보령시441802019충청남도 보령시 주교면 주교리 산 49-2 106호12000030.02019-12-31
42666충청남도보령시441802019충청남도 보령시 웅천읍 독산리 803-6 101호2398700072.252019-12-31
41754충청남도보령시441802019충청남도 보령시 웅천읍 두룡리 434-1 103호230100059.02019-12-31
95252충청남도보령시441802017충청남도 보령시 명천동 581-9 101호26208000168.02017-12-31
63138충청남도보령시441802018충청남도 보령시 주산면 유곡리 55 1동 102호1584000198.02018-12-31

Duplicate rows

Most frequently occurring

시도명시군구명자치단체코드과세년도물건지시가표준액연면적기준일자# duplicates
1충청남도보령시441802018[ 대창증산로 627 ] 0000동 0101호29502000198.02018-12-313
0충청남도보령시441802017충청남도 보령시 오천면 오포리 773 59호11736816006986.22017-12-312
2충청남도보령시441802018충청남도 보령시 오천면 오포리 773 95호279090000775.252018-12-312
3충청남도보령시441802018충청남도 보령시 주교면 신대리 841 5동 101호888000222.02018-12-312
4충청남도보령시441802019충청남도 보령시 주포면 봉당리 635-10 101호65395200157.22019-12-312
5충청남도보령시441802021[ 세편길 47-34 ] 0000동 0101호1522500304.52021-12-312
6충청남도보령시441802021충청남도 보령시 남포면 삼현리 82-1 101호1980000198.02021-12-312
7충청남도보령시441802021충청남도 보령시 오천면 오포리 773 123호3210519074.492021-12-312
8충청남도보령시441802021충청남도 보령시 웅천읍 노천리 563 101호61256250371.252021-12-312
9충청남도보령시441802021충청남도 보령시 주포면 관산리 290-1 1동 4호312745380793.772021-12-312