Overview

Dataset statistics

Number of variables: 6
Number of observations: 3449
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 171.9 KiB
Average record size in memory: 51.0 B

Variable types

Numeric: 3
Categorical: 2
Text: 1

Dataset

Description: 부산광역시동구_공유재산현황_20221024 (Busan Dong-gu public property status, 2022-10-24)
Author: 부산광역시 동구 (Dong-gu, Busan Metropolitan City)
URL: http://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15092496

Alerts

담당부서명 has constant value "재무과" (Constant)
토지지목코드 is highly imbalanced (93.6%) (Imbalance)
면적 is highly skewed (γ1 = 31.46079441) (Skewed)
순번 has unique values (Unique)
소재지 has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 17:25:12.165668
Analysis finished: 2023-12-10 17:25:15.873197
Duration: 3.71 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
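
The reported duration can be checked against the two timestamps. The snippet below is a small sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-10 17:25:12.165668")
finished = datetime.fromisoformat("2023-12-10 17:25:15.873197")

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```

Rounded to two decimals, the difference agrees with the duration shown in the report.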

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct: 3449
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1725
Minimum: 1
Maximum: 3449
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 30.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 173.4
Q1: 863
Median: 1725
Q3: 2587
95-th percentile: 3276.6
Maximum: 3449
Range: 3448
Interquartile range (IQR): 1724
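
Because 순번 is simply the sequence 1 to 3449, these quantiles can be reproduced by hand. The sketch below assumes the report uses linear interpolation between order statistics (the usual NumPy/pandas default); that assumption and the helper name `quantile` are illustrative and not taken from the report.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)          # fractional index into the data
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

seq = list(range(1, 3450))                      # 순번 runs from 1 to 3449

p5 = quantile(seq, 0.05)
q1 = quantile(seq, 0.25)
median = quantile(seq, 0.50)
q3 = quantile(seq, 0.75)
p95 = quantile(seq, 0.95)
iqr = q3 - q1

print(p5, q1, median, q3, p95, iqr)
```

Under that interpolation assumption the values agree with the quantile statistics listed for 순번.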

Descriptive statistics

Standard deviation: 995.78487
Coefficient of variation (CV): 0.57726659
Kurtosis: -1.2
Mean: 1725
Median Absolute Deviation (MAD): 862
Skewness: 0
Sum: 5949525
Variance: 991587.5
Monotonicity: Strictly increasing
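
Several of these figures follow directly from 순번 being the integers 1 to 3449. The sketch below recomputes them, assuming the variance is the sample variance (divisor n - 1); that choice is an assumption that happens to reproduce the reported value, not something the report states.

```python
import math

n = 3449
values = range(1, n + 1)                        # 순번 = 1, 2, ..., 3449

total = sum(values)
mean = total / n

# Sample variance: squared deviations divided by n - 1 (assumed convention).
squared_dev = sum((x - mean) ** 2 for x in values)
variance = squared_dev / (n - 1)

std = math.sqrt(variance)
cv = std / mean                                 # coefficient of variation

print(total, mean, variance, round(std, 5), round(cv, 8))
```

For a run of consecutive integers the sample variance has the closed form n(n + 1)/12, which gives the same 991587.5.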
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
1  1  < 0.1%
2292  1  < 0.1%
2294  1  < 0.1%
2295  1  < 0.1%
2296  1  < 0.1%
2297  1  < 0.1%
2298  1  < 0.1%
2299  1  < 0.1%
2300  1  < 0.1%
2301  1  < 0.1%
Other values (3439)  3439  99.7%

Value  Count  Frequency (%)
1  1  < 0.1%
2  1  < 0.1%
3  1  < 0.1%
4  1  < 0.1%
5  1  < 0.1%
6  1  < 0.1%
7  1  < 0.1%
8  1  < 0.1%
9  1  < 0.1%
10  1  < 0.1%

Value  Count  Frequency (%)
3449  1  < 0.1%
3448  1  < 0.1%
3447  1  < 0.1%
3446  1  < 0.1%
3445  1  < 0.1%
3444  1  < 0.1%
3443  1  < 0.1%
3442  1  < 0.1%
3441  1  < 0.1%
3440  1  < 0.1%

담당부서명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 27.1 KiB
재무과: 3449

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 재무과
2nd row: 재무과
3rd row: 재무과
4th row: 재무과
5th row: 재무과

Common Values

Value  Count  Frequency (%)
재무과  3449  100.0%

Length

Histogram of lengths of the category


소재지
Text

UNIQUE 

Distinct: 3449
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 27.1 KiB

Length

Max length: 22
Median length: 21
Mean length: 20.203537
Min length: 17

Characters and Unicode

Total characters: 69682
Distinct characters: 27
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
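
As a small illustration of those character properties, the sketch below counts the Unicode general categories in the first sample address using Python's standard `unicodedata` module. The two-letter codes `Lo`, `Nd`, `Zs` and `Pd` are the standard abbreviations for Other Letter, Decimal Number, Space Separator and Dash Punctuation; how the report itself derives its counts is not shown here, so treat this only as an approximation of the idea.

```python
import unicodedata
from collections import Counter

# First sample row of 소재지, copied from the report.
address = "부산광역시 동구 초량동 103-5"

# General category of every character: Lo = Other Letter, Nd = Decimal Number,
# Zs = Space Separator, Pd = Dash Punctuation.
category_counts = Counter(unicodedata.category(ch) for ch in address)

for category, count in category_counts.most_common():
    print(category, count)
```

The same four categories appear in the report's category table for the whole column.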

Unique

Unique: 3449
Unique (%): 100.0%

Sample

1st row: 부산광역시 동구 초량동 103-5
2nd row: 부산광역시 동구 초량동 110-18
3rd row: 부산광역시 동구 초량동 110-40
4th row: 부산광역시 동구 초량동 122-38
5th row: 부산광역시 동구 초량동 122-40

Value  Count  Frequency (%)
부산광역시  3449  25.0%
동구  3449  25.0%
수정동  1370  9.9%
초량동  841  6.1%
범일동  684  5.0%
좌천동  554  4.0%
806-86  2  < 0.1%
978-2  2  < 0.1%
711-5  2  < 0.1%
973-26  2  < 0.1%
Other values (3403)  3442  24.9%
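
The word counts above are consistent with splitting each address on whitespace: every row contributes four tokens, so 부산광역시 and 동구 each account for 25.0% of all tokens. The sketch below applies that assumed tokenisation to the five sample rows only; it illustrates the counting and does not reproduce the report's internals.

```python
from collections import Counter

# The five sample rows of 소재지 shown in the report.
sample_addresses = [
    "부산광역시 동구 초량동 103-5",
    "부산광역시 동구 초량동 110-18",
    "부산광역시 동구 초량동 110-40",
    "부산광역시 동구 초량동 122-38",
    "부산광역시 동구 초량동 122-40",
]

# Assumed tokenisation: split on whitespace, then count every token.
token_counts = Counter(tok for addr in sample_addresses for tok in addr.split())
total_tokens = sum(token_counts.values())

for token, count in token_counts.most_common(3):
    print(token, count, f"{100 * count / total_tokens:.1f}%")
```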

Most occurring characters

Value  Count  Frequency (%)
(space)  13797  19.8%
(not recovered)  6898  9.9%
(not recovered)  3450  5.0%
(not recovered)  3449  4.9%
(not recovered)  3449  4.9%
(not recovered)  3449  4.9%
(not recovered)  3449  4.9%
(not recovered)  3449  4.9%
-  3425  4.9%
1  3092  4.4%
Other values (17)  21775  31.2%

Most occurring categories

Value  Count  Frequency (%)
Other Letter  34491  49.5%
Decimal Number  17969  25.8%
Space Separator  13797  19.8%
Dash Punctuation  3425  4.9%

Most frequent character per category

Other Letter

Value  Count  Frequency (%)
(not recovered)  6898  20.0%
(not recovered)  3450  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  1370  4.0%
(not recovered)  1370  4.0%
(not recovered)  841  2.4%
Other values (5)  3317  9.6%

Decimal Number

Value  Count  Frequency (%)
1  3092  17.2%
4  2187  12.2%
9  1914  10.7%
7  1676  9.3%
8  1626  9.0%
2  1609  9.0%
6  1563  8.7%
3  1530  8.5%
5  1478  8.2%
0  1294  7.2%

Space Separator

Value  Count  Frequency (%)
(space)  13797  100.0%

Dash Punctuation

Value  Count  Frequency (%)
-  3425  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Common  35191  50.5%
Hangul  34491  49.5%

Most frequent character per script

Hangul

Value  Count  Frequency (%)
(not recovered)  6898  20.0%
(not recovered)  3450  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  1370  4.0%
(not recovered)  1370  4.0%
(not recovered)  841  2.4%
Other values (5)  3317  9.6%

Common

Value  Count  Frequency (%)
(space)  13797  39.2%
-  3425  9.7%
1  3092  8.8%
4  2187  6.2%
9  1914  5.4%
7  1676  4.8%
8  1626  4.6%
2  1609  4.6%
6  1563  4.4%
3  1530  4.3%
Other values (2)  2772  7.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  35191  50.5%
Hangul  34491  49.5%

Most frequent character per block

ASCII

Value  Count  Frequency (%)
(space)  13797  39.2%
-  3425  9.7%
1  3092  8.8%
4  2187  6.2%
9  1914  5.4%
7  1676  4.8%
8  1626  4.6%
2  1609  4.6%
6  1563  4.4%
3  1530  4.3%
Other values (2)  2772  7.9%

Hangul

Value  Count  Frequency (%)
(not recovered)  6898  20.0%
(not recovered)  3450  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  3449  10.0%
(not recovered)  1370  4.0%
(not recovered)  1370  4.0%
(not recovered)  841  2.4%
Other values (5)  3317  9.6%

토지지목코드
Categorical

IMBALANCE 

Distinct: 9
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 27.1 KiB
08-대: 3372
05-임야: 42
01-전: 11
27-묘지: 7
14-도로: 7
Other values (4): 10

Length

Max length: 7
Median length: 4
Mean length: 4.0229052
Min length: 4

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 08-대
2nd row: 08-대
3rd row: 08-대
4th row: 08-대
5th row: 08-대

Common Values

Value  Count  Frequency (%)
08-대  3372  97.8%
05-임야  42  1.2%
01-전  11  0.3%
27-묘지  7  0.2%
14-도로  7  0.2%
10-학교용지  4  0.1%
25-종교용지  3  0.1%
02-답  2  0.1%
28-잡종지  1  < 0.1%
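
The 93.6% imbalance figure in the alert can be reproduced from these counts. The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum, log2 of the number of classes; that definition is an assumption that matches the reported value, and the function name `imbalance_score` and its handling of a single class are illustrative only.

```python
import math

# Class counts copied from the Common Values table for 토지지목코드.
counts = {
    "08-대": 3372, "05-임야": 42, "01-전": 11, "27-묘지": 7, "14-도로": 7,
    "10-학교용지": 4, "25-종교용지": 3, "02-답": 2, "28-잡종지": 1,
}

def imbalance_score(class_counts):
    """1 - normalised Shannon entropy: 0 for a uniform split, near 1 when one class dominates."""
    class_counts = [c for c in class_counts if c > 0]
    if len(class_counts) < 2:
        return 0.0  # illustrative convention for a single class
    total = sum(class_counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in class_counts)
    return 1 - entropy / math.log2(len(class_counts))

score = imbalance_score(counts.values())
print(f"{100 * score:.1f}%")
```

The score is high because 08-대 alone covers 3372 of the 3449 rows, leaving very little entropy across the other eight classes.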

Length

Histogram of lengths of the category


면적
Real number (ℝ)

SKEWED 

Distinct: 330
Distinct (%): 9.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 31.450797
Minimum: 0.1
Maximum: 6662
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 30.4 KiB

Quantile statistics

Minimum: 0.1
5-th percentile: 1
Q1: 3
Median: 10
Q3: 27
95-th percentile: 93.54
Maximum: 6662
Range: 6661.9
Interquartile range (IQR): 24

Descriptive statistics

Standard deviation: 143.72468
Coefficient of variation (CV): 4.5698263
Kurtosis: 1345.4841
Mean: 31.450797
Median Absolute Deviation (MAD): 8
Skewness: 31.460794
Sum: 108473.8
Variance: 20656.784
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
1.0  317  9.2%
3.0  262  7.6%
2.0  257  7.5%
7.0  190  5.5%
4.0  136  3.9%
5.0  126  3.7%
10.0  120  3.5%
6.0  105  3.0%
13.0  97  2.8%
9.0  95  2.8%
Other values (320)  1744  50.6%

Value  Count  Frequency (%)
0.1  2  0.1%
0.2  1  < 0.1%
0.3  4  0.1%
0.4  2  0.1%
0.5  1  < 0.1%
0.7  2  0.1%
0.8  1  < 0.1%
0.9  1  < 0.1%
1.0  317  9.2%
1.4  2  0.1%

Value  Count  Frequency (%)
6662.0  1  < 0.1%
2118.0  1  < 0.1%
1786.1  1  < 0.1%
1682.0  1  < 0.1%
1622.8  1  < 0.1%
1212.0  1  < 0.1%
886.0  1  < 0.1%
831.4  1  < 0.1%
807.0  1  < 0.1%
791.1  1  < 0.1%

공시지가
Real number (ℝ)

Distinct: 1529
Distinct (%): 44.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 735121.37
Minimum: 27000
Maximum: 7230000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 30.4 KiB

Quantile statistics

Minimum: 27000
5-th percentile: 223700
Q1: 576600
Median: 689600
Q3: 857100
95-th percentile: 1347800
Maximum: 7230000
Range: 7203000
Interquartile range (IQR): 280500

Descriptive statistics

Standard deviation: 402326.49
Coefficient of variation (CV): 0.5472926
Kurtosis: 31.506653
Mean: 735121.37
Median Absolute Deviation (MAD): 134400
Skewness: 3.4922808
Sum: 2.5354336 × 10^9
Variance: 1.618666 × 10^11
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value  Count  Frequency (%)
638500  41  1.2%
682000  33  1.0%
250800  23  0.7%
635000  16  0.5%
570200  16  0.5%
536000  16  0.5%
671500  16  0.5%
555000  15  0.4%
651500  15  0.4%
597600  15  0.4%
Other values (1519)  3243  94.0%

Value  Count  Frequency (%)
27000  1  < 0.1%
28800  1  < 0.1%
54400  1  < 0.1%
116800  6  0.2%
156600  1  < 0.1%
165000  2  0.1%
174900  1  < 0.1%
176800  5  0.1%
180800  2  0.1%
182800  3  0.1%

Value  Count  Frequency (%)
7230000  1  < 0.1%
4288000  1  < 0.1%
4092000  1  < 0.1%
3786000  1  < 0.1%
3744000  1  < 0.1%
3648000  1  < 0.1%
3586000  1  < 0.1%
3535000  2  0.1%
3461000  1  < 0.1%
3357000  2  0.1%

Interactions


Correlations

              순번     토지지목코드  면적     공시지가
순번            1.000  0.104   0.047  0.227
토지지목코드    0.104  1.000   0.217  0.000
면적            0.047  0.217   1.000  0.197
공시지가        0.227  0.000   0.197  1.000

              순번      면적      공시지가  토지지목코드
순번            1.000   0.104   -0.243  0.047
면적            0.104   1.000   -0.056  0.127
공시지가        -0.243  -0.056  1.000   0.000
토지지목코드    0.047   0.127   0.000   1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index)  순번  담당부서명  소재지  토지지목코드  면적  공시지가
0  1  재무과  부산광역시 동구 초량동 103-5  08-대  723.7  3177000
1  2  재무과  부산광역시 동구 초량동 110-18  08-대  449.9  2866000
2  3  재무과  부산광역시 동구 초량동 110-40  08-대  115.0  861300
3  4  재무과  부산광역시 동구 초량동 122-38  08-대  5.0  1137000
4  5  재무과  부산광역시 동구 초량동 122-40  08-대  76.8  490000
5  6  재무과  부산광역시 동구 초량동 129-17  08-대  45.3  490000
6  7  재무과  부산광역시 동구 초량동 134-39  08-대  26.5  1425000
7  8  재무과  부산광역시 동구 초량동 148-5  08-대  0.9  1570000
8  9  재무과  부산광역시 동구 초량동 248-8  08-대  2.2  2768000
9  10  재무과  부산광역시 동구 초량동 287-25  08-대  2.5  1252000

(index)  순번  담당부서명  소재지  토지지목코드  면적  공시지가
3439  3440  재무과  부산광역시 동구 범일동 1622-10  08-대  10.0  867300
3440  3441  재무과  부산광역시 동구 범일동 1622-18  08-대  83.0  893800
3441  3442  재무과  부산광역시 동구 범일동 1622-28  08-대  39.0  1975000
3442  3443  재무과  부산광역시 동구 범일동 1622-31  08-대  19.0  1975000
3443  3444  재무과  부산광역시 동구 범일동 1622-33  08-대  24.0  2265000
3444  3445  재무과  부산광역시 동구 범일동 1622-35  08-대  3.0  1575000
3445  3446  재무과  부산광역시 동구 범일동 1622-36  08-대  7.0  1575000
3446  3447  재무과  부산광역시 동구 범일동 1622-41  08-대  11.0  1655000
3447  3448  재무과  부산광역시 동구 범일동 1623-22  08-대  8.0  296600
3448  3449  재무과  부산광역시 동구 범일동 1635-1  08-대  39.0  998500