Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows57
Duplicate rows (%)0.6%
Total size in memory742.2 KiB
Average record size in memory76.0 B

Variable types

Categorical4
Text1
Numeric2
DateTime1

Dataset

Description상기 데이터는 연도별 일반건축물에 대한 지방세 부과기준인 시가표준액을 제공하여 물건별 재산가액 확인이 가능하도록 함
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=348&beforeMenuCd=DOM_000000201001001000&publicdatapk=15079984

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
과세년도 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 57 (0.6%) duplicate rowsDuplicates
시가표준액(원) is highly overall correlated with 연면적(제곱미터)High correlation
연면적(제곱미터) is highly overall correlated with 시가표준액(원)High correlation
시가표준액(원) is highly skewed (γ1 = 20.63051434)Skewed
연면적(제곱미터) is highly skewed (γ1 = 24.64493355)Skewed

Reproduction

Analysis started2024-01-09 22:09:45.318613
Analysis finished2024-01-09 22:09:46.473784
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
충청남도
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row충청남도
2nd row충청남도
3rd row충청남도
4th row충청남도
5th row충청남도

Common Values

ValueCountFrequency (%)
충청남도 10000
100.0%

Length

2024-01-10T07:09:46.523666image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:09:46.598148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청남도 10000
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
부여군
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부여군
2nd row부여군
3rd row부여군
4th row부여군
5th row부여군

Common Values

ValueCountFrequency (%)
부여군 10000
100.0%

Length

2024-01-10T07:09:46.671803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:09:46.740785image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부여군 10000
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
44760
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row44760
2nd row44760
3rd row44760
4th row44760
5th row44760

Common Values

ValueCountFrequency (%)
44760 10000
100.0%

Length

2024-01-10T07:09:46.810475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:09:46.877749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
44760 10000
100.0%

과세년도
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2022
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 10000
100.0%

Length

2024-01-10T07:09:46.948933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T07:09:47.032970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 10000
100.0%
Distinct9195
Distinct (%)92.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-01-10T07:09:47.284366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length31
Mean length26.5106
Min length21

Characters and Unicode

Total characters265106
Distinct characters215
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8741 ?
Unique (%)87.4%

Sample

1st row[ 독정로147번길 12-6 ] 0000동 0301호
2nd row충청남도 부여군 석성면 증산리 1283-8 101호
3rd row[ 선사로29번길 19 ] 0000동 0101호
4th row충청남도 부여군 구룡면 구봉리 230-2 107호
5th row[ 태봉산길 38 ] 0000동 0301호
ValueCountFrequency (%)
충청남도 7255
 
11.6%
부여군 7255
 
11.6%
5490
 
8.8%
101호 3240
 
5.2%
0000동 2251
 
3.6%
1동 2041
 
3.3%
102호 1431
 
2.3%
0101호 1318
 
2.1%
부여읍 1209
 
1.9%
규암면 773
 
1.2%
Other values (4445) 30186
48.3%
2024-01-10T07:09:47.678070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52451
19.8%
0 25612
 
9.7%
1 24208
 
9.1%
10057
 
3.8%
8611
 
3.2%
8527
 
3.2%
2 8165
 
3.1%
7959
 
3.0%
7734
 
2.9%
7699
 
2.9%
Other values (205) 104083
39.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 119397
45.0%
Decimal Number 81786
30.9%
Space Separator 52451
19.8%
Dash Punctuation 5982
 
2.3%
Open Punctuation 2745
 
1.0%
Close Punctuation 2745
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10057
 
8.4%
8611
 
7.2%
8527
 
7.1%
7959
 
6.7%
7734
 
6.5%
7699
 
6.4%
7390
 
6.2%
7368
 
6.2%
7258
 
6.1%
6048
 
5.1%
Other values (191) 40746
34.1%
Decimal Number
ValueCountFrequency (%)
0 25612
31.3%
1 24208
29.6%
2 8165
 
10.0%
3 5356
 
6.5%
4 4044
 
4.9%
5 3410
 
4.2%
6 3220
 
3.9%
7 2781
 
3.4%
8 2640
 
3.2%
9 2350
 
2.9%
Space Separator
ValueCountFrequency (%)
52451
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5982
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 2745
100.0%
Close Punctuation
ValueCountFrequency (%)
] 2745
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 145709
55.0%
Hangul 119397
45.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10057
 
8.4%
8611
 
7.2%
8527
 
7.1%
7959
 
6.7%
7734
 
6.5%
7699
 
6.4%
7390
 
6.2%
7368
 
6.2%
7258
 
6.1%
6048
 
5.1%
Other values (191) 40746
34.1%
Common
ValueCountFrequency (%)
52451
36.0%
0 25612
17.6%
1 24208
16.6%
2 8165
 
5.6%
- 5982
 
4.1%
3 5356
 
3.7%
4 4044
 
2.8%
5 3410
 
2.3%
6 3220
 
2.2%
7 2781
 
1.9%
Other values (4) 10480
 
7.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 145709
55.0%
Hangul 119397
45.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
52451
36.0%
0 25612
17.6%
1 24208
16.6%
2 8165
 
5.6%
- 5982
 
4.1%
3 5356
 
3.7%
4 4044
 
2.8%
5 3410
 
2.3%
6 3220
 
2.2%
7 2781
 
1.9%
Other values (4) 10480
 
7.2%
Hangul
ValueCountFrequency (%)
10057
 
8.4%
8611
 
7.2%
8527
 
7.1%
7959
 
6.7%
7734
 
6.5%
7699
 
6.4%
7390
 
6.2%
7368
 
6.2%
7258
 
6.1%
6048
 
5.1%
Other values (191) 40746
34.1%

시가표준액(원)
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct8178
Distinct (%)81.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42787029
Minimum18000
Maximum8.2206155 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:09:47.816911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18000
5-th percentile302242.5
Q11596555
median6936235
Q332542020
95-th percentile1.7309733 × 108
Maximum8.2206155 × 109
Range8.2205975 × 109
Interquartile range (IQR)30945465

Descriptive statistics

Standard deviation1.6638345 × 108
Coefficient of variation (CV)3.8886424
Kurtosis728.59519
Mean42787029
Median Absolute Deviation (MAD)6336235
Skewness20.630514
Sum4.2787029 × 1011
Variance2.7683454 × 1016
MonotonicityNot monotonic
2024-01-10T07:09:47.954702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5427000 41
 
0.4%
374400 21
 
0.2%
604800 20
 
0.2%
446400 19
 
0.2%
1440000 15
 
0.1%
518400 14
 
0.1%
1980000 14
 
0.1%
2376000 13
 
0.1%
66000 13
 
0.1%
250000 12
 
0.1%
Other values (8168) 9818
98.2%
ValueCountFrequency (%)
18000 1
< 0.1%
24000 1
< 0.1%
24650 1
< 0.1%
27250 1
< 0.1%
39000 1
< 0.1%
40000 1
< 0.1%
41160 1
< 0.1%
41600 1
< 0.1%
42550 1
< 0.1%
45000 1
< 0.1%
ValueCountFrequency (%)
8220615480 1
< 0.1%
4340037900 1
< 0.1%
3432145040 1
< 0.1%
3330555220 1
< 0.1%
3001030800 1
< 0.1%
2987497600 1
< 0.1%
2936272000 1
< 0.1%
2595565860 1
< 0.1%
2570454000 1
< 0.1%
2513796060 1
< 0.1%

연면적(제곱미터)
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct5423
Distinct (%)54.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean206.80717
Minimum0.52
Maximum29473.94
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-01-10T07:09:48.089663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.52
5-th percentile12.7785
Q136.715
median96
Q3198
95-th percentile754.8585
Maximum29473.94
Range29473.42
Interquartile range (IQR)161.285

Descriptive statistics

Standard deviation560.08737
Coefficient of variation (CV)2.708259
Kurtosis1021.9006
Mean206.80717
Median Absolute Deviation (MAD)69.26
Skewness24.644934
Sum2068071.7
Variance313697.86
MonotonicityNot monotonic
2024-01-10T07:09:48.235025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18.0 311
 
3.1%
16.5 129
 
1.3%
16.2 50
 
0.5%
32.0 44
 
0.4%
198.0 44
 
0.4%
50.0 43
 
0.4%
10.0 38
 
0.4%
24.0 35
 
0.4%
99.22 34
 
0.3%
330.0 34
 
0.3%
Other values (5413) 9238
92.4%
ValueCountFrequency (%)
0.52 1
< 0.1%
0.7 1
< 0.1%
1.37 1
< 0.1%
1.44 1
< 0.1%
1.65 1
< 0.1%
1.8 1
< 0.1%
1.96 1
< 0.1%
2.08 1
< 0.1%
2.12 1
< 0.1%
2.16 2
< 0.1%
ValueCountFrequency (%)
29473.94 1
< 0.1%
19707.0 1
< 0.1%
17928.0 1
< 0.1%
10353.42 1
< 0.1%
8623.48 1
< 0.1%
7091.71 1
< 0.1%
6982.0 1
< 0.1%
6422.37 1
< 0.1%
5859.0 1
< 0.1%
5379.0 1
< 0.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-06-01 00:00:00
Maximum2022-06-01 00:00:00
2024-01-10T07:09:48.344556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:09:48.434633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-01-10T07:09:45.875670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:09:45.732407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:09:45.955597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T07:09:45.800681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T07:09:48.504083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시가표준액(원)연면적(제곱미터)
시가표준액(원)1.0000.808
연면적(제곱미터)0.8081.000
2024-01-10T07:09:48.587975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시가표준액(원)연면적(제곱미터)
시가표준액(원)1.0000.608
연면적(제곱미터)0.6081.000

Missing values

2024-01-10T07:09:46.072789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T07:09:46.416832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드과세년도물건지 주소시가표준액(원)연면적(제곱미터)데이터기준일자
8536충청남도부여군447602022[ 독정로147번길 12-6 ] 0000동 0301호739608063.21442022-06-01
7544충청남도부여군447602022충청남도 부여군 석성면 증산리 1283-8 101호2535356050.222022-06-01
6545충청남도부여군447602022[ 선사로29번길 19 ] 0000동 0101호207700067.02022-06-01
119충청남도부여군447602022충청남도 부여군 구룡면 구봉리 230-2 107호589500015.02022-06-01
14034충청남도부여군447602022[ 태봉산길 38 ] 0000동 0301호929136044.672022-06-01
23305충청남도부여군447602022충청남도 부여군 초촌면 추양리 20-1 103호1832922052.222022-06-01
17452충청남도부여군447602022충청남도 부여군 구룡면 죽교리 152 101호1332500266.52022-06-01
20128충청남도부여군447602022충청남도 부여군 장암면 상황리 328-14 201호549792066.242022-06-01
7275충청남도부여군447602022충청남도 부여군 장암면 원문리 173 1동 101호2356800471.362022-06-01
8695충청남도부여군447602022충청남도 부여군 규암면 신성리 60-5 103호4899200306.22022-06-01
시도명시군구명자치단체코드과세년도물건지 주소시가표준액(원)연면적(제곱미터)데이터기준일자
15838충청남도부여군447602022충청남도 부여군 은산면 나령리 산 74-5 106호35000070.02022-06-01
6101충청남도부여군447602022충청남도 부여군 세도면 수고리 997 101호1872000374.42022-06-01
18564충청남도부여군447602022충청남도 부여군 외산면 만수리 40-15 101호139680029.12022-06-01
7255충청남도부여군447602022충청남도 부여군 초촌면 신암리 452 101호19250017.52022-06-01
18230충청남도부여군447602022[ 금천동로26번길 22 ] 0000동 0102호4443180105.792022-06-01
17722충청남도부여군447602022충청남도 부여군 남면 마정리 1168-6 101호43443180391.382022-06-01
20823충청남도부여군447602022충청남도 부여군 양화면 수원리 156 103호2006880167.242022-06-01
13209충청남도부여군447602022충청남도 부여군 양화면 입포리 114-2 102호788500157.72022-06-01
13319충청남도부여군447602022충청남도 부여군 구룡면 용당리 416-74 103호79200072.02022-06-01
16530충청남도부여군447602022[ 홍연로26번길 3-18 ] 0000동 0101호3889590095.12022-06-01

Duplicate rows

Most frequently occurring

시도명시군구명자치단체코드과세년도물건지 주소시가표준액(원)연면적(제곱미터)데이터기준일자# duplicates
53충청남도부여군447602022충청남도 부여군 초촌면 응평리 466 1동 101호13137600110.42022-06-018
18충청남도부여군447602022충청남도 부여군 석성면 비당리 919-4 1동 101호14295960115.292022-06-014
22충청남도부여군447602022충청남도 부여군 석성면 증산리 1320-214 1동 101호3348000108.02022-06-014
30충청남도부여군447602022충청남도 부여군 세도면 사산리 595 1동 101호36281900250.222022-06-014
56충청남도부여군447602022충청남도 부여군 홍산면 토정리 391-6 1동 101호29597400783.02022-06-014
6충청남도부여군447602022충청남도 부여군 구룡면 금사리 183-2 1동 101호120709600199.522022-06-013
13충청남도부여군447602022충청남도 부여군 규암면 합정리 575 2동 101호5601960098.282022-06-013
19충청남도부여군447602022충청남도 부여군 석성면 석성리 370-1 1동 101호1027200096.02022-06-013
24충청남도부여군447602022충청남도 부여군 석성면 증산리 655-3 1동 101호12096000126.02022-06-013
25충청남도부여군447602022충청남도 부여군 석성면 증산리 934-18 1동 101호282720000480.02022-06-013