Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 6
Duplicate rows (%): 0.1%
Total size in memory: 654.3 KiB
Average record size in memory: 67.0 B
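
The derived figures in this block follow from the raw ones by simple arithmetic. The sketch below recomputes the average record size and the duplicate share from the numbers shown above; the function names are illustrative and not part of any profiling library.

```python
# Figures copied from the dataset statistics above.
n_observations = 10_000
duplicate_rows = 6
total_size_kib = 654.3


def average_record_size_bytes(total_kib, n_rows):
    """Total in-memory size (KiB) converted to bytes, divided by row count."""
    return total_kib * 1024 / n_rows


def duplicate_share_percent(n_duplicates, n_rows):
    """Duplicate rows as a percentage of all rows."""
    return 100 * n_duplicates / n_rows


avg_bytes = average_record_size_bytes(total_size_kib, n_observations)
dup_pct = duplicate_share_percent(duplicate_rows, n_observations)

print(f"{avg_bytes:.1f} B")  # 67.0 B
print(f"{dup_pct:.2f}%")     # 0.06%, which the report rounds to 0.1%
```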

Variable types

Categorical: 3
Text: 1
Numeric: 2
DateTime: 1

Dataset

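Description: Data on the standard market value of general buildings in Gimhae-si, Gyeongsangnam-do. It provides the province name (시도명), city/county/district name (시군구명), local government code (자치단체코드), property location (물건지), standard market value (시가표준액), total floor area (연면적), and reference date (기준일자).
Author: Gimhae-si, Gyeongsangnam-do (경상남도 김해시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15092800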

Alerts

시도명 has constant value "경상남도" [Constant]
시군구명 has constant value "김해시" [Constant]
자치단체코드 has constant value "48250" [Constant]
기준일자 has constant value "2021-06-01" [Constant]
Dataset has 6 (0.1%) duplicate rows [Duplicates]
시가표준액 is highly overall correlated with 연면적 [High correlation]
연면적 is highly overall correlated with 시가표준액 [High correlation]
시가표준액 is highly skewed (γ1 = 29.77772802) [Skewed]
연면적 is highly skewed (γ1 = 20.40282559) [Skewed]
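
The skewness alerts report γ1, the standardized third moment. As a rough illustration, the sketch below computes the plain moment-based skewness; the report's exact estimator (for example a bias-corrected variant) is not stated here, and the input lists are hypothetical, not taken from the dataset.

```python
def skewness(values):
    """Moment-based skewness: third central moment over variance ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical inputs for illustration only.
symmetric = [1.0, 2.0, 3.0, 4.0, 5.0]
right_tailed = [1.0, 1.0, 2.0, 2.0, 3.0, 50.0]

print(skewness(symmetric))     # zero for a symmetric sample
print(skewness(right_tailed))  # positive: one large value stretches the right tail
```

A large positive γ1, as reported for 시가표준액 and 연면적, indicates that a small number of very large values dominate the upper tail.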

Reproduction

Analysis started: 2023-12-10 23:44:11.510958
Analysis finished: 2023-12-10 23:44:12.619758
Duration: 1.11 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
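
The reported duration can be checked against the two timestamps. A minimal sketch using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the reproduction block above.
started = datetime.fromisoformat("2023-12-10 23:44:11.510958")
finished = datetime.fromisoformat("2023-12-10 23:44:12.619758")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")  # 1.11 seconds
```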

Variables

시도명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
경상남도: 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 경상남도
2nd row: 경상남도
3rd row: 경상남도
4th row: 경상남도
5th row: 경상남도

Common Values

Value | Count | Frequency (%)
경상남도 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
경상남도 | 10000 | 100.0%
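
A column is flagged CONSTANT when every row holds the same value. The sketch below shows one way to detect such columns in a list of row dictionaries; `constant_columns` is a hypothetical helper, and the three rows are abbreviated from the sample values shown in this report.

```python
def constant_columns(rows):
    """Return {column: value} for columns whose value is identical in every row."""
    if not rows:
        return {}
    first = rows[0]
    return {
        col: val
        for col, val in first.items()
        if all(row[col] == val for row in rows)
    }


# Abbreviated rows (three of the seven columns) for illustration.
rows = [
    {"시도명": "경상남도", "시군구명": "김해시", "물건지": "경상남도 김해시 삼방동 172-5 1003호"},
    {"시도명": "경상남도", "시군구명": "김해시", "물건지": "경상남도 김해시 외동 698-4 102호"},
    {"시도명": "경상남도", "시군구명": "김해시", "물건지": "경상남도 김해시 내동 179-9 100호"},
]

print(constant_columns(rows))
```

Constant columns carry no information for modelling or comparison within this file, so they are usually candidates for dropping after the check.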

시군구명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
김해시: 10000

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 김해시
2nd row: 김해시
3rd row: 김해시
4th row: 김해시
5th row: 김해시

Common Values

Value | Count | Frequency (%)
김해시 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
김해시 | 10000 | 100.0%

자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
48250: 10000

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 48250
2nd row: 48250
3rd row: 48250
4th row: 48250
5th row: 48250

Common Values

Value | Count | Frequency (%)
48250 | 10000 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
48250 | 10000 | 100.0%

물건지
Text

Distinct: 9590
Distinct (%): 95.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 38
Median length: 34
Mean length: 28.3545
Min length: 16

Characters and Unicode

Total characters: 283545
Distinct characters: 166
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
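
As a small illustration of that idea, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using the first sample address from this report. The two-letter codes map to the category names used in the tables: `Lo` is Other Letter, `Nd` is Decimal Number, `Zs` is Space Separator, `Pd` is Dash Punctuation, and `Lu` is Uppercase Letter.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count the Unicode general category of every character in the given strings."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)


samples = ["경상남도 김해시 삼방동 172-5 1003호"]
counts = category_counts(samples)

for category, count in counts.most_common():
    print(category, count)
```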

Unique

Unique: 9224
Unique (%): 92.2%

Sample

1st row: 경상남도 김해시 삼방동 172-5 1003호
2nd row: 경상남도 김해시 외동 698-4 102호
3rd row: 경상남도 김해시 번화1로 40 0000동 0903호
4th row: 경상남도 김해시 내동 179-9 100호
5th row: 경상남도 김해시 상동면 우계리 205-3 104호
Value | Count | Frequency (%)
경상남도 | 10000 | 17.0%
김해시 | 10000 | 17.0%
0000동 | 5376 | 9.2%
101호 | 1535 | 2.6%
0101호 | 1144 | 2.0%
한림면 | 609 | 1.0%
201호 | 501 | 0.9%
102호 | 500 | 0.9%
0201호 | 448 | 0.8%
주촌면 | 440 | 0.8%
Other values (5197) | 28099 | 47.9%
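
The counts in this table sum to 58652 tokens, which equals the 48652 space characters plus the 10000 rows, so the table is consistent with splitting each address on whitespace. The sketch below reproduces that kind of word count on the five sample addresses; it is an illustration under that assumption, not the report's own tokenizer.

```python
from collections import Counter


def word_frequencies(texts):
    """Split on whitespace and return (word, count, percent) sorted by count."""
    counts = Counter(token for text in texts for token in text.split())
    total = sum(counts.values())
    return [(word, n, 100 * n / total) for word, n in counts.most_common()]


addresses = [
    "경상남도 김해시 삼방동 172-5 1003호",
    "경상남도 김해시 외동 698-4 102호",
    "경상남도 김해시 번화1로 40 0000동 0903호",
    "경상남도 김해시 내동 179-9 100호",
    "경상남도 김해시 상동면 우계리 205-3 104호",
]

for word, n, pct in word_frequencies(addresses)[:3]:
    print(f"{word} | {n} | {pct:.1f}%")
```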

Most occurring characters

ValueCountFrequency (%)
48652
17.2%
0 40156
 
14.2%
1 20926
 
7.4%
10501
 
3.7%
10446
 
3.7%
10383
 
3.7%
10135
 
3.6%
10068
 
3.6%
10020
 
3.5%
10018
 
3.5%
Other values (156) 102240
36.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 130783 | 46.1%
Decimal Number | 99252 | 35.0%
Space Separator | 48652 | 17.2%
Dash Punctuation | 4647 | 1.6%
Uppercase Letter | 211 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10501
 
8.0%
10446
 
8.0%
10383
 
7.9%
10135
 
7.7%
10068
 
7.7%
10020
 
7.7%
10018
 
7.7%
10011
 
7.7%
9184
 
7.0%
5425
 
4.1%
Other values (138) 34592
26.4%
Decimal Number
Value | Count | Frequency (%)
0 | 40156 | 40.5%
1 | 20926 | 21.1%
2 | 9296 | 9.4%
3 | 6215 | 6.3%
4 | 4631 | 4.7%
5 | 4445 | 4.5%
6 | 3969 | 4.0%
7 | 3705 | 3.7%
8 | 3015 | 3.0%
9 | 2894 | 2.9%
Uppercase Letter
Value | Count | Frequency (%)
B | 87 | 41.2%
L | 74 | 35.1%
C | 16 | 7.6%
D | 15 | 7.1%
K | 12 | 5.7%
A | 7 | 3.3%
Space Separator
Value | Count | Frequency (%)
(space) | 48652 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 4647 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 152551 | 53.8%
Hangul | 130783 | 46.1%
Latin | 211 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10501
 
8.0%
10446
 
8.0%
10383
 
7.9%
10135
 
7.7%
10068
 
7.7%
10020
 
7.7%
10018
 
7.7%
10011
 
7.7%
9184
 
7.0%
5425
 
4.1%
Other values (138) 34592
26.4%
Common
Value | Count | Frequency (%)
(space) | 48652 | 31.9%
0 | 40156 | 26.3%
1 | 20926 | 13.7%
2 | 9296 | 6.1%
3 | 6215 | 4.1%
- | 4647 | 3.0%
4 | 4631 | 3.0%
5 | 4445 | 2.9%
6 | 3969 | 2.6%
7 | 3705 | 2.4%
Other values (2) | 5909 | 3.9%
Latin
Value | Count | Frequency (%)
B | 87 | 41.2%
L | 74 | 35.1%
C | 16 | 7.6%
D | 15 | 7.1%
K | 12 | 5.7%
A | 7 | 3.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 152762 | 53.9%
Hangul | 130783 | 46.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 48652 | 31.8%
0 | 40156 | 26.3%
1 | 20926 | 13.7%
2 | 9296 | 6.1%
3 | 6215 | 4.1%
- | 4647 | 3.0%
4 | 4631 | 3.0%
5 | 4445 | 2.9%
6 | 3969 | 2.6%
7 | 3705 | 2.4%
Other values (8) | 6120 | 4.0%
Hangul
ValueCountFrequency (%)
10501
 
8.0%
10446
 
8.0%
10383
 
7.9%
10135
 
7.7%
10068
 
7.7%
10020
 
7.7%
10018
 
7.7%
10011
 
7.7%
9184
 
7.0%
5425
 
4.1%
Other values (138) 34592
26.4%

시가표준액
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 9021
Distinct (%): 90.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 83598195
Minimum: 34000
Maximum: 1.6005741 × 10^10
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 34000
5-th percentile: 1199970
Q1: 9504000
median: 33328305
Q3: 86644228
95-th percentile: 2.7815591 × 10^8
Maximum: 1.6005741 × 10^10
Range: 1.6005707 × 10^10
Interquartile range (IQR): 77140228

Descriptive statistics

Standard deviation: 2.6177829 × 10^8
Coefficient of variation (CV): 3.1313869
Kurtosis: 1512.9051
Mean: 83598195
Median Absolute Deviation (MAD): 28357245
Skewness: 29.777728
Sum: 8.3598195 × 10^11
Variance: 6.8527873 × 10^16
Monotonicity: Not monotonic
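
Several of the 시가표준액 statistics are derived from others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below recomputes them from the reported figures as a consistency check; the standard deviation is the rounded value shown in the report, so results agree only to the printed precision.

```python
# Figures copied from the 시가표준액 statistics in this report.
minimum, maximum = 34_000, 16_005_741_110
q1, q3 = 9_504_000, 86_644_228
mean = 83_598_195
std = 2.6177829e8  # rounded as displayed in the report

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance

print(value_range)         # 16005707110
print(iqr)                 # 77140228
print(f"{cv:.4f}")         # 3.1314
print(f"{variance:.4e}")   # 6.8528e+16
```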
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
4769500 | 35 | 0.4%
1042220 | 29 | 0.3%
33393750 | 26 | 0.3%
4075560 | 19 | 0.2%
51526080 | 15 | 0.1%
15654810 | 14 | 0.1%
44716620 | 13 | 0.1%
1016160 | 13 | 0.1%
26340450 | 10 | 0.1%
26373960 | 9 | 0.1%
Other values (9011) | 9817 | 98.2%

Minimum 10 values
Value | Count | Frequency (%)
34000 | 1 | < 0.1%
50400 | 1 | < 0.1%
51000 | 1 | < 0.1%
53000 | 1 | < 0.1%
61200 | 1 | < 0.1%
68000 | 1 | < 0.1%
70000 | 2 | < 0.1%
71760 | 1 | < 0.1%
73600 | 1 | < 0.1%
74100 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
16005741110 | 1 | < 0.1%
7861408920 | 1 | < 0.1%
5889371400 | 1 | < 0.1%
5519509500 | 1 | < 0.1%
4041471000 | 1 | < 0.1%
4038116520 | 1 | < 0.1%
3501825100 | 1 | < 0.1%
3027381000 | 1 | < 0.1%
2850835050 | 1 | < 0.1%
2434969890 | 1 | < 0.1%

연면적
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct: 6826
Distinct (%): 68.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 184.92635
Minimum: 0.24
Maximum: 21705
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0.24
5-th percentile: 5.8695
Q1: 32.6236
median: 78.815
Q3: 179.155
95-th percentile: 621.75
Maximum: 21705
Range: 21704.76
Interquartile range (IQR): 146.5314

Descriptive statistics

Standard deviation: 501.96654
Coefficient of variation (CV): 2.7144133
Kurtosis: 722.85425
Mean: 184.92635
Median Absolute Deviation (MAD): 56.5967
Skewness: 20.402826
Sum: 1849263.5
Variance: 251970.41
Monotonicity: Not monotonic
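
The median absolute deviation (MAD) is the median of the absolute distances from the median, which makes it far less sensitive to extreme values than the standard deviation. A minimal sketch on hypothetical data (the values below are not from the dataset):

```python
from statistics import median, pstdev


def median_absolute_deviation(values):
    """Median of |x - median(x)| over all values."""
    center = median(values)
    return median([abs(v - center) for v in values])


# Hypothetical sample with one extreme value.
areas = [1.0, 2.0, 3.0, 4.0, 100.0]

print(median_absolute_deviation(areas))  # 1.0
print(pstdev(areas))                     # much larger, pulled up by the outlier
```

This is consistent with the pattern in the report, where the MAD of 연면적 (56.5967) is far smaller than its standard deviation (501.96654).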
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
18.0 | 55 | 0.5%
66.0 | 37 | 0.4%
8.5018 | 35 | 0.4%
60.0 | 31 | 0.3%
1.8578 | 29 | 0.3%
39.0114 | 26 | 0.3%
36.0 | 26 | 0.3%
40.0 | 25 | 0.2%
48.0 | 24 | 0.2%
24.0 | 24 | 0.2%
Other values (6816) | 9688 | 96.9%

Minimum 10 values
Value | Count | Frequency (%)
0.24 | 1 | < 0.1%
0.32 | 1 | < 0.1%
0.44 | 3 | < 0.1%
0.54 | 2 | < 0.1%
0.62 | 2 | < 0.1%
0.6655 | 7 | 0.1%
0.7218 | 6 | 0.1%
0.769 | 1 | < 0.1%
0.7761 | 1 | < 0.1%
0.8015 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
21705.0 | 1 | < 0.1%
21578.35 | 1 | < 0.1%
10070.26 | 1 | < 0.1%
8759.23 | 1 | < 0.1%
8175.98 | 1 | < 0.1%
6904.91 | 1 | < 0.1%
6886.075 | 1 | < 0.1%
6304.0 | 1 | < 0.1%
6059.4 | 1 | < 0.1%
5948.86 | 1 | < 0.1%

기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2021-06-01 00:00:00
Maximum: 2021-06-01 00:00:00
Histogram with fixed size bins (bins=1)

Interactions

[Figures: pairwise interaction plots for 시가표준액 and 연면적]

Correlations

[Figure: correlation heatmap]
 | 시가표준액 | 연면적
시가표준액 | 1.000 | 0.954
연면적 | 0.954 | 1.000
[Figure: correlation heatmap]
 | 시가표준액 | 연면적
시가표준액 | 1.000 | 0.852
연면적 | 0.852 | 1.000
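
This extract does not say which coefficient each matrix uses. As background only, the sketch below contrasts two common choices, Pearson (linear association) and Spearman (rank-based, monotonic association), to show why two coefficients can differ on the same pair of variables. The data are hypothetical, and nothing here asserts which coefficients the report computed.

```python
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation: covariance scaled by both standard deviations."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


# Hypothetical monotonic but non-linear relationship.
areas = [10.0, 20.0, 40.0, 80.0, 160.0]
prices = [a ** 2 for a in areas]

print(f"{pearson(areas, prices):.3f}")
print(f"{spearman(areas, prices):.3f}")
```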

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시도명시군구명자치단체코드물건지시가표준액연면적기준일자
18661경상남도김해시48250경상남도 김해시 삼방동 172-5 1003호28217407.862021-06-01
33993경상남도김해시48250경상남도 김해시 외동 698-4 102호686705018.152021-06-01
14191경상남도김해시48250경상남도 김해시 번화1로 40 0000동 0903호6137500088.69222021-06-01
38520경상남도김해시48250경상남도 김해시 내동 179-9 100호280933047.92021-06-01
70167경상남도김해시48250경상남도 김해시 상동면 우계리 205-3 104호80506490175.742021-06-01
33201경상남도김해시48250경상남도 김해시 내외중앙로 36 0000동 0908호2634045042.832021-06-01
58166경상남도김해시48250경상남도 김해시 생림면 마사리 19-10 101호115960000260.02021-06-01
49806경상남도김해시48250경상남도 김해시 분성로 345-21 0000동 0201호2731938034.982021-06-01
5638경상남도김해시48250경상남도 김해시 서부로1701번길 69-1 0000동 0101호230753880489.32021-06-01
9421경상남도김해시48250경상남도 김해시 계동로23번길 9 0000동 0313호2994926036.34622021-06-01
시도명시군구명자치단체코드물건지시가표준액연면적기준일자
23031경상남도김해시48250경상남도 김해시 진영읍 진영리 1614-6 101호49683709.612021-06-01
22882경상남도김해시48250경상남도 김해시 지내동 266-2 101호8398150210.482021-06-01
57168경상남도김해시48250경상남도 김해시 내외로 67 0000동 0301호278143060578.262021-06-01
41108경상남도김해시48250경상남도 김해시 경원로55번길 2 0000동 0707호2952471049.292021-06-01
35359경상남도김해시48250경상남도 김해시 경원로73번길 15 0000동 0502호916058015.90382021-06-01
91476경상남도김해시48250경상남도 김해시 진례면 담안리 1420-3 101호37719180140.222021-06-01
23120경상남도김해시48250경상남도 김해시 진영로 175 0000동 0001호1187148056.412021-06-01
55407경상남도김해시48250경상남도 김해시 김해대로 1713 0000동 0302호330706000758.52021-06-01
36928경상남도김해시48250경상남도 김해시 외동 1197-1 100호58632750113.32021-06-01
85694경상남도김해시48250경상남도 김해시 진례면 송현리 266-1 101호13716400129.42021-06-01

Duplicate rows

Most frequently occurring

시도명시군구명자치단체코드물건지시가표준액연면적기준일자# duplicates
5경상남도김해시48250경상남도 김해시 주촌면 농소리 629-6 201호2856546069.32021-06-013
0경상남도김해시48250경상남도 김해시 안동 256-15 1동 201호390220017.072021-06-012
1경상남도김해시48250경상남도 김해시 어방동 1095-4 500호94482040198.452021-06-012
2경상남도김해시48250경상남도 김해시 어방동 607 1동 8100호1145600040.02021-06-012
3경상남도김해시48250경상남도 김해시 어방동 986 101호4743500053.02021-06-012
4경상남도김해시48250경상남도 김해시 주촌면 농소리 629-6 101호343500000750.02021-06-012
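
The table above lists six distinct records that occur more than once, which matches the 6 duplicate rows reported in the overview. The sketch below shows one way to find such groups by counting identical rows; `duplicate_groups` is a hypothetical helper, and the rows are abbreviated to two fields (물건지, 기준일자) for illustration.

```python
from collections import Counter


def duplicate_groups(rows):
    """Return {row: occurrences} for every row that appears more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Abbreviated, illustrative rows: (물건지, 기준일자).
rows = [
    ("경상남도 김해시 주촌면 농소리 629-6 201호", "2021-06-01"),
    ("경상남도 김해시 어방동 986 101호", "2021-06-01"),
    ("경상남도 김해시 주촌면 농소리 629-6 201호", "2021-06-01"),
    ("경상남도 김해시 외동 698-4 102호", "2021-06-01"),
    ("경상남도 김해시 주촌면 농소리 629-6 201호", "2021-06-01"),
    ("경상남도 김해시 어방동 986 101호", "2021-06-01"),
]

for row, n in duplicate_groups(rows).items():
    print(n, row[0])
```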