Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows8
Duplicate rows (%)0.1%
Total size in memory742.2 KiB
Average record size in memory76.0 B

Variable types

Categorical5
Text1
Numeric2

Dataset

Description제공범위 : 일반건축물에 대한 지방세 부과기준인 시가표준액을 제공. 관련 법령 : 지방세법. 소관기관 : 지방자치단체. 제공기관 : 시군구. 표준데이터 셋 제공시스템 : 표준지방세시스템. 자료기준일 : 매년 12월 31일.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=348&beforeMenuCd=DOM_000000201001001000&publicdatapk=15080111

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
Dataset has 8 (0.1%) duplicate rowsDuplicates
결정일자 is highly overall correlated with 납부년도High correlation
납부년도 is highly overall correlated with 결정일자High correlation

Reproduction

Analysis started2024-01-09 21:23:03.456365
Analysis finished2024-01-09 21:23:04.398630
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충청남도
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도
2nd row충청남도
3rd row충청남도
4th row충청남도
5th row충청남도

Common Values

ValueCountFrequency (%)
충청남도 10000
100.0%

Length

2024-01-10T06:23:04.444448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:23:04.514780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청남도 10000
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
홍성군
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row홍성군
2nd row홍성군
3rd row홍성군
4th row홍성군
5th row홍성군

Common Values

ValueCountFrequency (%)
홍성군 10000
100.0%

Length

2024-01-10T06:23:04.585400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:23:04.650120image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
홍성군 10000
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
44800
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row44800
2nd row44800
3rd row44800
4th row44800
5th row44800

Common Values

ValueCountFrequency (%)
44800 10000
100.0%

Length

2024-01-10T06:23:04.719947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:23:04.784685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
44800 10000
100.0%

납부년도
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2019
3114 
2018
2894 
2017
2797 
2020
1195 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2019 3114
31.1%
2018 2894
28.9%
2017 2797
28.0%
2020 1195
 
11.9%

Length

2024-01-10T06:23:04.852598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:23:04.926686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019 3114
31.1%
2018 2894
28.9%
2017 2797
28.0%
2020 1195
 
11.9%
Distinct8550
Distinct (%)85.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T06:23:05.165612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length33
Mean length26.3202
Min length20

Characters and Unicode

Total characters263202
Distinct characters165
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7365 ?
Unique (%)73.7%

Sample

1st row[ 조양로33번길 20-18 ] 0000동 0101호
2nd row충청남도 홍성군 홍북읍 내덕리 250 103호
3rd row충청남도 홍성군 홍성읍 학계리 192-3 201호
4th row[ 도청대로 163 ] 0000동 0102호
5th row충청남도 홍성군 서부면 판교리 53-1 118호
ValueCountFrequency (%)
충청남도 6761
 
11.1%
홍성군 6761
 
11.1%
6478
 
10.6%
0000동 2871
 
4.7%
101호 2824
 
4.6%
홍성읍 1339
 
2.2%
102호 1300
 
2.1%
0101호 1223
 
2.0%
광천읍 813
 
1.3%
1동 779
 
1.3%
Other values (4368) 30017
49.1%
2024-01-10T06:23:05.600633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
51166
19.4%
0 27819
 
10.6%
1 22213
 
8.4%
10205
 
3.9%
10120
 
3.8%
8798
 
3.3%
2 8125
 
3.1%
7616
 
2.9%
7224
 
2.7%
7093
 
2.7%
Other values (155) 102823
39.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 115573
43.9%
Decimal Number 84695
32.2%
Space Separator 51166
19.4%
Dash Punctuation 5253
 
2.0%
Open Punctuation 3239
 
1.2%
Close Punctuation 3239
 
1.2%
Uppercase Letter 36
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10205
 
8.8%
10120
 
8.8%
8798
 
7.6%
7616
 
6.6%
7224
 
6.3%
7093
 
6.1%
6857
 
5.9%
6761
 
5.8%
6761
 
5.8%
5287
 
4.6%
Other values (136) 38851
33.6%
Decimal Number
ValueCountFrequency (%)
0 27819
32.8%
1 22213
26.2%
2 8125
 
9.6%
3 6213
 
7.3%
4 4522
 
5.3%
5 4051
 
4.8%
6 3326
 
3.9%
8 2930
 
3.5%
7 2798
 
3.3%
9 2698
 
3.2%
Uppercase Letter
ValueCountFrequency (%)
B 16
44.4%
L 12
33.3%
A 7
19.4%
C 1
 
2.8%
Space Separator
ValueCountFrequency (%)
51166
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5253
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 3239
100.0%
Close Punctuation
ValueCountFrequency (%)
] 3239
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 147592
56.1%
Hangul 115573
43.9%
Latin 37
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10205
 
8.8%
10120
 
8.8%
8798
 
7.6%
7616
 
6.6%
7224
 
6.3%
7093
 
6.1%
6857
 
5.9%
6761
 
5.8%
6761
 
5.8%
5287
 
4.6%
Other values (136) 38851
33.6%
Common
ValueCountFrequency (%)
51166
34.7%
0 27819
18.8%
1 22213
15.1%
2 8125
 
5.5%
3 6213
 
4.2%
- 5253
 
3.6%
4 4522
 
3.1%
5 4051
 
2.7%
6 3326
 
2.3%
[ 3239
 
2.2%
Other values (4) 11665
 
7.9%
Latin
ValueCountFrequency (%)
B 16
43.2%
L 12
32.4%
A 7
18.9%
c 1
 
2.7%
C 1
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 147629
56.1%
Hangul 115573
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
51166
34.7%
0 27819
18.8%
1 22213
15.0%
2 8125
 
5.5%
3 6213
 
4.2%
- 5253
 
3.6%
4 4522
 
3.1%
5 4051
 
2.7%
6 3326
 
2.3%
[ 3239
 
2.2%
Other values (9) 11702
 
7.9%
Hangul
ValueCountFrequency (%)
10205
 
8.8%
10120
 
8.8%
8798
 
7.6%
7616
 
6.6%
7224
 
6.3%
7093
 
6.1%
6857
 
5.9%
6761
 
5.8%
6761
 
5.8%
5287
 
4.6%
Other values (136) 38851
33.6%

시가표준액
Real number (ℝ)

Distinct8530
Distinct (%)85.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean55718871
Minimum10800
Maximum4.6193974 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T06:23:05.718400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10800
5-th percentile372769.5
Q11417870
median10307600
Q353794505
95-th percentile2.2275656 × 108
Maximum4.6193974 × 109
Range4.6193866 × 109
Interquartile range (IQR)52376635

Descriptive statistics

Standard deviation1.5650427 × 108
Coefficient of variation (CV)2.8088198
Kurtosis218.21116
Mean55718871
Median Absolute Deviation (MAD)9812580
Skewness11.460236
Sum5.5718871 × 1011
Variance2.4493587 × 1016
MonotonicityNot monotonic
2024-01-10T06:23:06.024547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
792000 26
 
0.3%
1080000 16
 
0.2%
594000 15
 
0.1%
1320000 15
 
0.1%
1200000 14
 
0.1%
900000 14
 
0.1%
37195700 13
 
0.1%
40572640 13
 
0.1%
1584000 12
 
0.1%
35650170 12
 
0.1%
Other values (8520) 9850
98.5%
ValueCountFrequency (%)
10800 1
< 0.1%
15960 1
< 0.1%
17480 1
< 0.1%
22040 1
< 0.1%
25600 1
< 0.1%
27720 1
< 0.1%
28000 1
< 0.1%
33000 1
< 0.1%
33100 1
< 0.1%
35840 1
< 0.1%
ValueCountFrequency (%)
4619397420 1
< 0.1%
4477872420 1
< 0.1%
3321733270 1
< 0.1%
2922841620 1
< 0.1%
2530271950 1
< 0.1%
2395592280 1
< 0.1%
2352528100 1
< 0.1%
2299694390 1
< 0.1%
2230234020 1
< 0.1%
2155509810 1
< 0.1%

연면적
Real number (ℝ)

Distinct5686
Distinct (%)56.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean226.62961
Minimum0.538
Maximum13546.62
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T06:23:06.126823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.538
5-th percentile18
Q157.0991
median128.2
Q3244.53
95-th percentile769.7625
Maximum13546.62
Range13546.082
Interquartile range (IQR)187.4309

Descriptive statistics

Standard deviation403.5859
Coefficient of variation (CV)1.7808172
Kurtosis211.11503
Mean226.62961
Median Absolute Deviation (MAD)78.785
Skewness10.337532
Sum2266296.1
Variance162881.58
MonotonicityNot monotonic
2024-01-10T06:23:06.238939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18.0 161
 
1.6%
198.0 65
 
0.7%
66.0 38
 
0.4%
27.0 37
 
0.4%
51.5176 36
 
0.4%
56.1948 36
 
0.4%
36.0 34
 
0.3%
84.0 31
 
0.3%
192.0 31
 
0.3%
165.0 28
 
0.3%
Other values (5676) 9503
95.0%
ValueCountFrequency (%)
0.538 1
< 0.1%
0.593 1
< 0.1%
0.598 1
< 0.1%
0.9 1
< 0.1%
1.0 1
< 0.1%
1.26 1
< 0.1%
2.119 1
< 0.1%
2.16 1
< 0.1%
2.28 1
< 0.1%
2.34 1
< 0.1%
ValueCountFrequency (%)
13546.62 1
< 0.1%
9608.09 1
< 0.1%
9021.42 1
< 0.1%
6979.76 1
< 0.1%
6580.0 1
< 0.1%
6271.53 1
< 0.1%
5041.04 1
< 0.1%
4882.43 2
< 0.1%
4869.09 1
< 0.1%
4800.69 1
< 0.1%

결정일자
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2019-06-01
3114 
2018-06-01
2894 
2017-06-01
2797 
2020-06-01
1195 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2018-06-01
2nd row2019-06-01
3rd row2019-06-01
4th row2019-06-01
5th row2019-06-01

Common Values

ValueCountFrequency (%)
2019-06-01 3114
31.1%
2018-06-01 2894
28.9%
2017-06-01 2797
28.0%
2020-06-01 1195
 
11.9%

Length

2024-01-10T06:23:06.337254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T06:23:06.413305image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-06-01 3114
31.1%
2018-06-01 2894
28.9%
2017-06-01 2797
28.0%
2020-06-01 1195
 
11.9%

Interactions

2024-01-10T06:23:04.082349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:23:03.926612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:23:04.153791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T06:23:03.991823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T06:23:06.472983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
납부년도시가표준액연면적결정일자
납부년도1.0000.0000.0201.000
시가표준액0.0001.0000.8850.000
연면적0.0200.8851.0000.020
결정일자1.0000.0000.0201.000
2024-01-10T06:23:06.543160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
결정일자납부년도
결정일자1.0001.000
납부년도1.0001.000
2024-01-10T06:23:06.610640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시가표준액연면적납부년도결정일자
시가표준액1.0000.4060.0000.000
연면적0.4061.0000.0130.013
납부년도0.0000.0131.0001.000
결정일자0.0000.0131.0001.000

Missing values

2024-01-10T06:23:04.253486image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T06:23:04.351710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드납부년도물건지시가표준액연면적결정일자
42289충청남도홍성군448002018[ 조양로33번길 20-18 ] 0000동 0101호39535200115.62018-06-01
66686충청남도홍성군448002019충청남도 홍성군 홍북읍 내덕리 250 103호2136000534.02019-06-01
75659충청남도홍성군448002019충청남도 홍성군 홍성읍 학계리 192-3 201호18174870152.732019-06-01
68738충청남도홍성군448002019[ 도청대로 163 ] 0000동 0102호227175870436.082019-06-01
81546충청남도홍성군448002019충청남도 홍성군 서부면 판교리 53-1 118호257070045.12019-06-01
74660충청남도홍성군448002019충청남도 홍성군 금마면 봉서리 69-2 101호80896000128.02019-06-01
29739충청남도홍성군448002018충청남도 홍성군 금마면 화양리 300-3 101호37800018.02018-06-01
24029충청남도홍성군448002017충청남도 홍성군 홍동면 운월리 353-4 102호194400048.62017-06-01
35712충청남도홍성군448002018[ 충남대로 36 ] 0000동 0204호3693811051.51762018-06-01
44879충청남도홍성군448002018[ 광천로299번길 26 ] 0000동 0101호387780056.22018-06-01
시도명시군구명자치단체코드납부년도물건지시가표준액연면적결정일자
29837충청남도홍성군448002018충청남도 홍성군 금마면 장성리 220-4 101호16327740112.452018-06-01
10589충청남도홍성군448002017[ 충절로 1049 ] 0001동 0101호61198520105.772017-06-01
37020충청남도홍성군448002018충청남도 홍성군 홍성읍 고암리 1077 1동 104호5216381076.24062018-06-01
26753충청남도홍성군448002018충청남도 홍성군 장곡면 지정리 761-4 102호3500000875.02018-06-01
36617충청남도홍성군448002018충청남도 홍성군 홍성읍 월산리 360-2 103호1606216068.062018-06-01
83575충청남도홍성군448002019충청남도 홍성군 결성면 형산리 405 103호787200196.82019-06-01
8861충청남도홍성군448002017[ 의사로43번길 18 ] 0001동 1015호3197511050.732017-06-01
3207충청남도홍성군448002017충청남도 홍성군 홍북면 신경리 893 624호3888680056.19482017-06-01
91057충청남도홍성군448002020[ 조양로 177 ] 0000동 0101호3494817096.172020-06-01
90957충청남도홍성군448002020충청남도 홍성군 구항면 대정리 64-10 1동 101호2483362801585.82020-06-01

Duplicate rows

Most frequently occurring

시도명시군구명자치단체코드납부년도물건지시가표준액연면적결정일자# duplicates
0충청남도홍성군448002017충청남도 홍성군 은하면 대율리 266-1 201호21624008.482017-06-012
1충청남도홍성군448002017충청남도 홍성군 은하면 대판리 314-4 101호62244000195.02017-06-012
2충청남도홍성군448002018충청남도 홍성군 결성면 성곡리 245 101호29484032.762018-06-012
3충청남도홍성군448002018충청남도 홍성군 홍성읍 옥암리 305 127호3690700055.252018-06-012
4충청남도홍성군448002019충청남도 홍성군 은하면 대율리 266-1 101호467245019.552019-06-012
5충청남도홍성군448002019충청남도 홍성군 은하면 대율리 266-1 201호20267208.482019-06-012
6충청남도홍성군448002019충청남도 홍성군 장곡면 상송리 237-5 101호472500001350.02019-06-012
7충청남도홍성군448002019충청남도 홍성군 홍북읍 내덕리 161-13 101호364500001012.52019-06-012