Overview

Dataset statistics

Number of variables9
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory830.1 KiB
Average record size in memory85.0 B

Variable types

Numeric4
Categorical3
Text1
DateTime1

Dataset

Description일반건축물에 대한 지방세 부과기준인 시가표준액을 제공(2017,2018,2019,2020,2021,2022)-항목 : 물건지, 시간표준액, 연면적, 결정일자
Author인천광역시 옹진군
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15079846&srcSe=7661IVAWM27C61E190

Alerts

시도명 has constant value ""Constant
시군구명 has constant value ""Constant
자치단체코드 has constant value ""Constant
데이터기준일 has constant value ""Constant
순번 is highly overall correlated with 과세년도High correlation
과세년도 is highly overall correlated with 순번High correlation
시가표준액 is highly overall correlated with 연면적High correlation
연면적 is highly overall correlated with 시가표준액High correlation
시가표준액 is highly skewed (γ1 = 23.77154947)Skewed
연면적 is highly skewed (γ1 = 27.45907313)Skewed
순번 has unique valuesUnique

Reproduction

Analysis started2024-04-17 10:14:22.862345
Analysis finished2024-04-17 10:14:24.881115
Duration2.02 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35627.171
Minimum11
Maximum71161
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T19:14:24.941903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11
5-th percentile3693.8
Q118009.75
median35589
Q353577.5
95-th percentile67487.45
Maximum71161
Range71150
Interquartile range (IQR)35567.75

Descriptive statistics

Standard deviation20390.043
Coefficient of variation (CV)0.57231721
Kurtosis-1.1904036
Mean35627.171
Median Absolute Deviation (MAD)17762
Skewness0.0021873884
Sum3.5627171 × 108
Variance4.1575384 × 108
MonotonicityNot monotonic
2024-04-17T19:14:25.258297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
66413 1
 
< 0.1%
18787 1
 
< 0.1%
68276 1
 
< 0.1%
65184 1
 
< 0.1%
64145 1
 
< 0.1%
18799 1
 
< 0.1%
25948 1
 
< 0.1%
65755 1
 
< 0.1%
58489 1
 
< 0.1%
43886 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
11 1
< 0.1%
20 1
< 0.1%
28 1
< 0.1%
31 1
< 0.1%
49 1
< 0.1%
61 1
< 0.1%
70 1
< 0.1%
104 1
< 0.1%
107 1
< 0.1%
108 1
< 0.1%
ValueCountFrequency (%)
71161 1
< 0.1%
71157 1
< 0.1%
71151 1
< 0.1%
71140 1
< 0.1%
71135 1
< 0.1%
71120 1
< 0.1%
71119 1
< 0.1%
71112 1
< 0.1%
71101 1
< 0.1%
71093 1
< 0.1%

시도명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
인천광역시
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row인천광역시
2nd row인천광역시
3rd row인천광역시
4th row인천광역시
5th row인천광역시

Common Values

ValueCountFrequency (%)
인천광역시 10000
100.0%

Length

2024-04-17T19:14:25.369921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:14:25.449951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인천광역시 10000
100.0%

시군구명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
옹진군
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row옹진군
2nd row옹진군
3rd row옹진군
4th row옹진군
5th row옹진군

Common Values

ValueCountFrequency (%)
옹진군 10000
100.0%

Length

2024-04-17T19:14:25.520532image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:14:25.589686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
옹진군 10000
100.0%

자치단체코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
28720
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row28720
2nd row28720
3rd row28720
4th row28720
5th row28720

Common Values

ValueCountFrequency (%)
28720 10000
100.0%

Length

2024-04-17T19:14:25.673328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:14:25.759021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
28720 10000
100.0%

과세년도
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2019.7189
Minimum2017
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T19:14:25.831570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2017
5-th percentile2017
Q12018
median2020
Q32021
95-th percentile2022
Maximum2022
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.6858728
Coefficient of variation (CV)0.00083470664
Kurtosis-1.2223512
Mean2019.7189
Median Absolute Deviation (MAD)1
Skewness-0.15484632
Sum20197189
Variance2.842167
MonotonicityNot monotonic
2024-04-17T19:14:25.914057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2022 1966
19.7%
2021 1847
18.5%
2020 1740
17.4%
2019 1615
16.2%
2018 1521
15.2%
2017 1311
13.1%
ValueCountFrequency (%)
2017 1311
13.1%
2018 1521
15.2%
2019 1615
16.2%
2020 1740
17.4%
2021 1847
18.5%
2022 1966
19.7%
ValueCountFrequency (%)
2022 1966
19.7%
2021 1847
18.5%
2020 1740
17.4%
2019 1615
16.2%
2018 1521
15.2%
2017 1311
13.1%
Distinct6383
Distinct (%)63.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-17T19:14:26.169658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length31
Mean length27.8075
Min length19

Characters and Unicode

Total characters278075
Distinct characters86
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4330 ?
Unique (%)43.3%

Sample

1st row인천광역시 옹진군 영흥면 내리 1329-257 1동 201호
2nd row인천광역시 옹진군 백령면 진촌리 795-1 1동 101호
3rd row인천광역시 옹진군 자월면 자월리 616-2 1동 101호
4th row[ 내동로131번길 86-2 ] 0000동 0101호
5th row인천광역시 옹진군 자월면 이작리 227
ValueCountFrequency (%)
7586
 
11.8%
인천광역시 6207
 
9.7%
옹진군 6207
 
9.7%
1동 3815
 
5.9%
101호 2870
 
4.5%
영흥면 2309
 
3.6%
0001동 2293
 
3.6%
0101호 1601
 
2.5%
백령면 1458
 
2.3%
0000동 1108
 
1.7%
Other values (3385) 28707
44.7%
2024-04-17T19:14:26.551295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
54161
19.5%
0 27753
 
10.0%
1 27068
 
9.7%
8910
 
3.2%
8888
 
3.2%
2 8644
 
3.1%
7036
 
2.5%
6426
 
2.3%
6207
 
2.2%
6207
 
2.2%
Other values (76) 116775
42.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 119382
42.9%
Decimal Number 91789
33.0%
Space Separator 54161
19.5%
Dash Punctuation 5156
 
1.9%
Open Punctuation 3793
 
1.4%
Close Punctuation 3793
 
1.4%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8910
 
7.5%
8888
 
7.4%
7036
 
5.9%
6426
 
5.4%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
Other values (61) 50880
42.6%
Decimal Number
ValueCountFrequency (%)
0 27753
30.2%
1 27068
29.5%
2 8644
 
9.4%
3 6120
 
6.7%
4 5535
 
6.0%
5 4019
 
4.4%
6 3553
 
3.9%
7 3460
 
3.8%
8 2901
 
3.2%
9 2736
 
3.0%
Space Separator
ValueCountFrequency (%)
54161
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5156
100.0%
Open Punctuation
ValueCountFrequency (%)
[ 3793
100.0%
Close Punctuation
ValueCountFrequency (%)
] 3793
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 158692
57.1%
Hangul 119382
42.9%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8910
 
7.5%
8888
 
7.4%
7036
 
5.9%
6426
 
5.4%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
Other values (61) 50880
42.6%
Common
ValueCountFrequency (%)
54161
34.1%
0 27753
17.5%
1 27068
17.1%
2 8644
 
5.4%
3 6120
 
3.9%
4 5535
 
3.5%
- 5156
 
3.2%
5 4019
 
2.5%
[ 3793
 
2.4%
] 3793
 
2.4%
Other values (4) 12650
 
8.0%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 158693
57.1%
Hangul 119382
42.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
54161
34.1%
0 27753
17.5%
1 27068
17.1%
2 8644
 
5.4%
3 6120
 
3.9%
4 5535
 
3.5%
- 5156
 
3.2%
5 4019
 
2.5%
[ 3793
 
2.4%
] 3793
 
2.4%
Other values (5) 12651
 
8.0%
Hangul
ValueCountFrequency (%)
8910
 
7.5%
8888
 
7.4%
7036
 
5.9%
6426
 
5.4%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
6207
 
5.2%
Other values (61) 50880
42.6%

시가표준액
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct7866
Distinct (%)78.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50702215
Minimum32250
Maximum8.464754 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T19:14:26.673048image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum32250
5-th percentile570240
Q12430000
median12690080
Q344931788
95-th percentile1.6610832 × 108
Maximum8.464754 × 109
Range8.4647217 × 109
Interquartile range (IQR)42501788

Descriptive statistics

Standard deviation2.3169085 × 108
Coefficient of variation (CV)4.5696395
Kurtosis707.62125
Mean50702215
Median Absolute Deviation (MAD)11656680
Skewness23.771549
Sum5.0702215 × 1011
Variance5.3680648 × 1016
MonotonicityNot monotonic
2024-04-17T19:14:26.788562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1497600 17
 
0.2%
792000 15
 
0.1%
30608350 15
 
0.1%
216000 15
 
0.1%
13052010 14
 
0.1%
1296000 13
 
0.1%
633600 13
 
0.1%
720000 13
 
0.1%
230400 13
 
0.1%
3834000 12
 
0.1%
Other values (7856) 9860
98.6%
ValueCountFrequency (%)
32250 2
 
< 0.1%
39600 1
 
< 0.1%
55440 1
 
< 0.1%
56000 1
 
< 0.1%
60000 1
 
< 0.1%
61920 1
 
< 0.1%
63360 6
0.1%
64000 1
 
< 0.1%
66000 1
 
< 0.1%
66240 1
 
< 0.1%
ValueCountFrequency (%)
8464753980 1
< 0.1%
8092222020 1
< 0.1%
8003373840 1
< 0.1%
6992375060 1
< 0.1%
6461993670 1
< 0.1%
5959078040 1
< 0.1%
4826001140 1
< 0.1%
4578428440 1
< 0.1%
4459942230 1
< 0.1%
4274358240 1
< 0.1%

연면적
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct3576
Distinct (%)35.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean119.95108
Minimum0.8
Maximum20696.22
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-17T19:14:26.899798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.8
5-th percentile9.9
Q118
median49.395
Q3109.3875
95-th percentile366.308
Maximum20696.22
Range20695.42
Interquartile range (IQR)91.3875

Descriptive statistics

Standard deviation493.23002
Coefficient of variation (CV)4.1119264
Kurtosis988.70417
Mean119.95108
Median Absolute Deviation (MAD)32.615
Skewness27.459073
Sum1199510.8
Variance243275.85
MonotonicityNot monotonic
2024-04-17T19:14:27.029518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18.0 898
 
9.0%
9.9 361
 
3.6%
13.2 265
 
2.6%
36.0 102
 
1.0%
9.0 97
 
1.0%
27.0 90
 
0.9%
10.0 81
 
0.8%
15.0 60
 
0.6%
24.0 58
 
0.6%
66.0 57
 
0.6%
Other values (3566) 7931
79.3%
ValueCountFrequency (%)
0.8 2
 
< 0.1%
1.44 2
 
< 0.1%
1.8 4
< 0.1%
1.9 1
 
< 0.1%
1.96 1
 
< 0.1%
2.0 5
0.1%
2.1 1
 
< 0.1%
2.15 4
< 0.1%
2.16 1
 
< 0.1%
2.6 1
 
< 0.1%
ValueCountFrequency (%)
20696.22 2
< 0.1%
20210.54 1
 
< 0.1%
10874.23 1
 
< 0.1%
10824.11 2
< 0.1%
8687.72 3
< 0.1%
7470.59 2
< 0.1%
6083.0 1
 
< 0.1%
4328.42 1
 
< 0.1%
3860.4 1
 
< 0.1%
3720.71 1
 
< 0.1%

데이터기준일
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-08-31 00:00:00
Maximum2022-08-31 00:00:00
2024-04-17T19:14:27.128429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:27.196083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-04-17T19:14:24.382202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:23.441665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:23.733867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:24.088882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:24.461806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:23.518673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:23.828898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:24.167559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:24.534749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:23.592681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:23.923540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:24.243238image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:24.601928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:23.661668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:24.012191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-17T19:14:24.313382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-17T19:14:27.251694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번과세년도시가표준액연면적
순번1.0000.9580.0110.004
과세년도0.9581.0000.0000.000
시가표준액0.0110.0001.0000.925
연면적0.0040.0000.9251.000
2024-04-17T19:14:27.325882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번과세년도시가표준액연면적
순번1.0000.9850.021-0.012
과세년도0.9851.000-0.002-0.025
시가표준액0.021-0.0021.0000.824
연면적-0.012-0.0250.8241.000

Missing values

2024-04-17T19:14:24.692912image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T19:14:24.817739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번시도명시군구명자치단체코드과세년도물건지시가표준액연면적데이터기준일
6641266413인천광역시옹진군287202022인천광역시 옹진군 영흥면 내리 1329-257 1동 201호1191547016.782022-08-31
5709257093인천광역시옹진군287202022인천광역시 옹진군 백령면 진촌리 795-1 1동 101호131743360168.042022-08-31
6788067881인천광역시옹진군287202022인천광역시 옹진군 자월면 자월리 616-2 1동 101호376360019.42022-08-31
1229812299인천광역시옹진군287202018[ 내동로131번길 86-2 ] 0000동 0101호48540609.092022-08-31
95169517인천광역시옹진군287202017인천광역시 옹진군 자월면 이작리 22765490011.12022-08-31
6250562506인천광역시옹진군287202022인천광역시 옹진군 영흥면 내리 8-202 1동 101호1944000067.52022-08-31
5382053821인천광역시옹진군287202021인천광역시 옹진군 자월면 자월리 1291 1동 101호139392013.22022-08-31
816817인천광역시옹진군287202017[ 영흥남로321번길 41-2 ] 0001동 0206호2259018030.612022-08-31
6180561806인천광역시옹진군287202022인천광역시 옹진군 백령면 남포리 1510 1동 2호1940520041.22022-08-31
2529325294인천광역시옹진군287202019인천광역시 옹진군 대청면 대청리 788-1 2동 101호15048009.92022-08-31
순번시도명시군구명자치단체코드과세년도물건지시가표준액연면적데이터기준일
1084210843인천광역시옹진군287202018인천광역시 옹진군 자월면 승봉리 784 121호2861040062.882022-08-31
5759657597인천광역시옹진군287202022인천광역시 옹진군 연평면 연평리 478-53 1동 101호38376021.322022-08-31
1599715998인천광역시옹진군287202018[ 영흥로176번길 46 ] 0001동 0102호365295024.852022-08-31
2667526676인천광역시옹진군287202019[ 영흥북로 11 ] 0003동 0101호30600009.02022-08-31
2596225963인천광역시옹진군287202019[ 백령로 475 ] 0001동 0001호231840018.02022-08-31
1456314564인천광역시옹진군287202018인천광역시 옹진군 대청면 대청리 1290 1동 101호163350016.52022-08-31
1409414095인천광역시옹진군287202018[ 대청로 201-1 ] 9999동 0202호84099780158.382022-08-31
87248725인천광역시옹진군287202017[ 대청로 30 ] 0000동 0002호4208098083.662022-08-31
4898048981인천광역시옹진군287202021인천광역시 옹진군 영흥면 내리 202-3 1동 103호51375240196.842022-08-31
2532125322인천광역시옹진군287202019인천광역시 옹진군 대청면 대청리 996 2동 101호2217609.92022-08-31