Overview

Dataset statistics

Number of variables: 5
Number of observations: 767
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 31.6 KiB
Average record size in memory: 42.2 B

Variable types

Numeric: 2
Categorical: 2
Text: 1

Dataset

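Description: Provides varied and detailed data on the public property (general property) of Nam-gu, Busan Metropolitan City, including the responsible department name, location, and area.
URL: https://www.data.go.kr/data/3080532/fileData.do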

Alerts

순번 is highly overall correlated with 담당부서명 (High correlation)
담당부서명 is highly overall correlated with 순번 (High correlation)
토지지목코드 is highly imbalanced (64.5%) (Imbalance)
면적(제곱미터) is highly skewed (γ1 = 25.92780957) (Skewed)
순번 has unique values (Unique)
소재지 has unique values (Unique)

Reproduction

Analysis started: 2023-12-13 00:01:05.846969
Analysis finished: 2023-12-13 00:01:06.500280
Duration: 0.65 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 767
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 384
Minimum: 1
Maximum: 767
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 39.3
Q1: 192.5
Median: 384
Q3: 575.5
95-th percentile: 728.7
Maximum: 767
Range: 766
Interquartile range (IQR): 383
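
The quantiles above are consistent with linear interpolation between order statistics. The following is a minimal sketch of that calculation, assuming that 순번 is exactly the sequence 1, 2, ..., 767 (suggested by the minimum, maximum, distinct count, and strict monotonicity, but not stated outright). The helper name `percentile` is illustrative and is not taken from the profiling tool.

```python
def percentile(values, q):
    """Return the q-th quantile (0 <= q <= 1) using linear interpolation."""
    ordered = sorted(values)
    pos = q * (len(ordered) - 1)          # fractional index into the sorted data
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


# Assumption: 순번 is exactly 1, 2, ..., 767
serial = list(range(1, 768))

q05 = percentile(serial, 0.05)
q1 = percentile(serial, 0.25)
median = percentile(serial, 0.50)
q3 = percentile(serial, 0.75)
q95 = percentile(serial, 0.95)
iqr = q3 - q1

print(round(q05, 1), q1, median, q3, round(q95, 1), iqr)
```

Under that assumption the results agree with the reported 39.3, 192.5, 384, 575.5, 728.7, and an IQR of 383.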

Descriptive statistics

Standard deviation: 221.55812
Coefficient of variation (CV): 0.57697427
Kurtosis: -1.2
Mean: 384
Median Absolute Deviation (MAD): 192
Skewness: 0
Sum: 294528
Variance: 49088
Monotonicity: Strictly increasing
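
As a cross-check, the descriptive statistics above can be recomputed with the Python standard library. This sketch again assumes 순번 is exactly 1, 2, ..., 767. Skewness and kurtosis are computed here from plain central moments; the report's estimator may apply small-sample corrections, so agreement is only expected to the displayed precision.

```python
import math
import statistics

# Assumption: 순번 is exactly 1, 2, ..., 767
serial = list(range(1, 768))
n = len(serial)

total = sum(serial)
mean = statistics.mean(serial)
variance = statistics.variance(serial)    # sample variance (divides by n - 1)
std = math.sqrt(variance)
cv = std / mean                            # coefficient of variation

# Median absolute deviation: median of |x - median(x)|
med = statistics.median(serial)
mad = statistics.median(abs(x - med) for x in serial)

# Moment-based skewness and excess kurtosis (no bias correction)
m2 = sum((x - mean) ** 2 for x in serial) / n
m3 = sum((x - mean) ** 3 for x in serial) / n
m4 = sum((x - mean) ** 4 for x in serial) / n
skewness = m3 / m2 ** 1.5
excess_kurtosis = m4 / m2 ** 2 - 3

print(total, mean, variance, round(std, 5), round(cv, 8), mad)
print(round(skewness, 3), round(excess_kurtosis, 3))
```

A skewness of 0 and an excess kurtosis near -1.2 are what one expects from evenly spaced values, which is why these two figures say little about the data beyond confirming that 순번 is a row counter.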
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
1                   1      0.1%
2                   1      0.1%
507                 1      0.1%
508                 1      0.1%
509                 1      0.1%
510                 1      0.1%
511                 1      0.1%
512                 1      0.1%
513                 1      0.1%
514                 1      0.1%
Other values (757)  757    98.7%

Minimum 10 values

Value  Count  Frequency (%)
1      1      0.1%
2      1      0.1%
3      1      0.1%
4      1      0.1%
5      1      0.1%
6      1      0.1%
7      1      0.1%
8      1      0.1%
9      1      0.1%
10     1      0.1%

Maximum 10 values

Value  Count  Frequency (%)
767    1      0.1%
766    1      0.1%
765    1      0.1%
764    1      0.1%
763    1      0.1%
762    1      0.1%
761    1      0.1%
760    1      0.1%
759    1      0.1%
758    1      0.1%

담당부서명 (responsible department name)
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 6.1 KiB
재무과: 423
건축과: 344

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 건축과
2nd row: 건축과
3rd row: 건축과
4th row: 재무과
5th row: 재무과

Common Values

Value   Count  Frequency (%)
재무과  423    55.1%
건축과  344    44.9%

Length

Histogram of lengths of the category

Common Values (Plot)

[Bar chart: 재무과 423 (55.1%), 건축과 344 (44.9%)]

소재지 (location)
Text

UNIQUE 

Distinct: 767
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 6.1 KiB

Length

Max length: 22
Median length: 21
Mean length: 19.770535
Min length: 17

Characters and Unicode

Total characters: 15164
Distinct characters: 31
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
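
To illustrate how such character properties can be read, here is a small sketch using Python's standard `unicodedata` module on the first 소재지 value as it is displayed in this report. The mapping from two-letter codes to the category names used in this report is spelled out in `CATEGORY_NAMES`, a name introduced here only for illustration. Script and block lookups are not available in the standard library and are left out.

```python
import unicodedata
from collections import Counter

# First 소재지 sample value, as displayed in this report
address = "부산광역시 남구 대연동 219-36"

# Two-letter Unicode general category codes and their long names
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
}

category_counts = Counter(unicodedata.category(ch) for ch in address)

for code, count in category_counts.most_common():
    print(CATEGORY_NAMES.get(code, code), count)
```

For this one address the four categories that appear are the same four the report lists for the whole column.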

Unique

Unique: 767
Unique (%): 100.0%

Sample

1st row: 부산광역시 남구 대연동 219-36
2nd row: 부산광역시 남구 대연동 219-41
3rd row: 부산광역시 남구 대연동 219-45
4th row: 부산광역시 남구 대연동 225-3
5th row: 부산광역시 남구 대연동 235-1

Value               Count  Frequency (%)
부산광역시          767    25.0%
남구                767    25.0%
문현동              518    16.9%
감만동              119    3.9%
대연동              65     2.1%
우암동              55     1.8%
용당동              5      0.2%
용호동              5      0.2%
(not preserved)     5      0.2%
982-11              1      < 0.1%
Other values (766)  766    24.9%

Most occurring characters

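Value              Count  Frequency (%)
(space)            3071   20.3%
(not preserved)    772    5.1%
(not preserved)    767    5.1%
(not preserved)    767    5.1%
(not preserved)    767    5.1%
(not preserved)    767    5.1%
(not preserved)    767    5.1%
(not preserved)    767    5.1%
(not preserved)    767    5.1%
-                  759    5.0%
Other values (21)  5193   34.2%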

Most occurring categories

Value             Count  Frequency (%)
Other Letter      7675   50.6%
Decimal Number    3659   24.1%
Space Separator   3071   20.3%
Dash Punctuation  759    5.0%

Most frequent character per category

Other Letter
Value             Count  Frequency (%)
(not preserved)   772    10.1%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   518    6.7%
(not preserved)   518    6.7%
Other values (9)  498    6.5%

Decimal Number
Value  Count  Frequency (%)
1      720    19.7%
3      447    12.2%
5      423    11.6%
2      413    11.3%
6      330    9.0%
4      307    8.4%
8      303    8.3%
9      253    6.9%
7      240    6.6%
0      223    6.1%

Space Separator
Value    Count  Frequency (%)
(space)  3071   100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      759    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  7675   50.6%
Common  7489   49.4%

Most frequent character per script

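Hangul
Value             Count  Frequency (%)
(not preserved)   772    10.1%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   518    6.7%
(not preserved)   518    6.7%
Other values (9)  498    6.5%

Common
Value             Count  Frequency (%)
(space)           3071   41.0%
-                 759    10.1%
1                 720    9.6%
3                 447    6.0%
5                 423    5.6%
2                 413    5.5%
6                 330    4.4%
4                 307    4.1%
8                 303    4.0%
9                 253    3.4%
Other values (2)  463    6.2%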

Most occurring blocks

Value   Count  Frequency (%)
Hangul  7675   50.6%
ASCII   7489   49.4%

Most frequent character per block

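ASCII
Value             Count  Frequency (%)
(space)           3071   41.0%
-                 759    10.1%
1                 720    9.6%
3                 447    6.0%
5                 423    5.6%
2                 413    5.5%
6                 330    4.4%
4                 307    4.1%
8                 303    4.0%
9                 253    3.4%
Other values (2)  463    6.2%

Hangul
Value             Count  Frequency (%)
(not preserved)   772    10.1%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   767    10.0%
(not preserved)   518    6.7%
(not preserved)   518    6.7%
Other values (9)  498    6.5%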

토지지목코드 (land category code)
Categorical

IMBALANCE 

Distinct: 8
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 6.1 KiB
08-대: 632
01-전: 62
05-임야: 22
14-도로: 21
18-구거: 14
Other values (3): 16

Length

Max length: 7
Median length: 4
Mean length: 4.0964798
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 02-답
2nd row: 02-답
3rd row: 08-대
4th row: 02-답
5th row: 08-대

Common Values

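Value                              Count  Frequency (%)
08-대 (building site)              632    82.4%
01-전 (dry field)                  62     8.1%
05-임야 (forest land)              22     2.9%
14-도로 (road)                     21     2.7%
18-구거 (ditch)                    14     1.8%
02-답 (paddy field)                9      1.2%
28-잡종지 (miscellaneous land)     4      0.5%
10-학교용지 (school site)          3      0.4%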
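
The 64.5% imbalance figure in the Alerts can be reproduced from these counts as one minus the Shannon entropy normalized by its maximum. The sketch below shows that calculation; the function name `imbalance_score` is illustrative, and the fact that the numbers agree suggests, but does not confirm, that the report uses this definition.

```python
import math

# Counts copied from the Common Values table of 토지지목코드
counts = {
    "08-대": 632, "01-전": 62, "05-임야": 22, "14-도로": 21,
    "18-구거": 14, "02-답": 9, "28-잡종지": 4, "10-학교용지": 3,
}


def imbalance_score(class_counts):
    """Return 1 minus the Shannon entropy normalized by its maximum, log2(k)."""
    total = sum(class_counts)
    k = len(class_counts)
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in class_counts if c > 0
    )
    return 1 - entropy / math.log2(k)


score = imbalance_score(list(counts.values()))
print(f"{score:.1%}")
```

A score of 0 would mean all eight categories are equally frequent, and a score of 1 would mean a single category holds every row. Here the dominant 08-대 category pushes the score well above the midpoint.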

Length

Histogram of lengths of the category

Common Values (Plot)

[Bar chart: 08-대 632 (82.4%), 01-전 62 (8.1%), 05-임야 22 (2.9%), 14-도로 21 (2.7%), 18-구거 14 (1.8%), 02-답 9 (1.2%), 28-잡종지 4 (0.5%), 10-학교용지 3 (0.4%)]

면적(제곱미터) (area in square meters)
Real number (ℝ)

SKEWED 

Distinct: 149
Distinct (%): 19.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 45.301069
Minimum: 0.13
Maximum: 9017
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.9 KiB

Quantile statistics

Minimum: 0.13
5-th percentile: 1
Q1: 4
Median: 14
Q3: 36
95-th percentile: 122
Maximum: 9017
Range: 9016.87
Interquartile range (IQR): 32

Descriptive statistics

Standard deviation: 331.80319
Coefficient of variation (CV): 7.3244008
Kurtosis: 700.39673
Mean: 45.301069
Median Absolute Deviation (MAD): 12
Skewness: 25.92781
Sum: 34745.92
Variance: 110093.36
Monotonicity: Not monotonic
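
Several of these figures are tied together by simple identities (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean). The sketch below checks them using only numbers copied from this report; it does not touch the underlying data, and the tolerances are an assumption chosen to absorb the rounding of the displayed values.

```python
import math

# Values copied from the 면적(제곱미터) statistics in this report
n = 767
total = 34745.92
mean = 45.301069
median = 14
std = 331.80319
variance = 110093.36
cv = 7.3244008
minimum, maximum = 0.13, 9017
q1, q3 = 4, 36

checks = {
    "mean == sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance == std ** 2": math.isclose(std ** 2, variance, rel_tol=1e-6),
    "cv == std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "range == max - min": math.isclose(maximum - minimum, 9016.87, rel_tol=1e-9),
    "iqr == q3 - q1": q3 - q1 == 32,
}

for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'MISMATCH'}")

# The mean is more than three times the median, in line with the strong right skew
print(round(mean / median, 2))
```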
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
1.0                 80     10.4%
3.0                 60     7.8%
7.0                 45     5.9%
2.0                 40     5.2%
10.0                26     3.4%
20.0                22     2.9%
6.0                 21     2.7%
4.0                 20     2.6%
23.0                19     2.5%
17.0                19     2.5%
Other values (139)  415    54.1%

Minimum 10 values

Value  Count  Frequency (%)
0.13   1      0.1%
0.86   1      0.1%
1.0    80     10.4%
1.68   1      0.1%
1.9    1      0.1%
2.0    40     5.2%
2.1    1      0.1%
2.79   1      0.1%
3.0    60     7.8%
3.87   2      0.3%

Maximum 10 values

Value   Count  Frequency (%)
9017.0  1      0.1%
832.0   1      0.1%
676.0   1      0.1%
602.0   1      0.1%
538.0   1      0.1%
530.0   1      0.1%
507.0   1      0.1%
412.0   1      0.1%
380.8   1      0.1%
361.0   1      0.1%

Interactions

[Figures: four interaction plots for the numeric variables 순번 and 면적(제곱미터)]

Correlations

                순번    담당부서명  토지지목코드  면적(제곱미터)
순번            1.000   0.774       0.415         0.000
담당부서명      0.774   1.000       0.323         0.000
토지지목코드    0.415   0.323       1.000         0.251
면적(제곱미터)  0.000   0.000       0.251         1.000

                담당부서명  토지지목코드
담당부서명      1.000       0.241
토지지목코드    0.241       1.000

                순번    면적(제곱미터)  담당부서명  토지지목코드
순번            1.000   0.072           0.607       0.212
면적(제곱미터)  0.072   1.000           0.000       0.187
담당부서명      0.607   0.000           1.000       0.241
토지지목코드    0.212   0.187           0.241       1.000
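
This text does not say which correlation measure produced each matrix. The second matrix covers only the two categorical variables, and a measure commonly used for such pairs is Cramér's V. The following is a minimal sketch of the uncorrected form of that statistic, offered as an assumption about the kind of calculation involved rather than as the report's actual method. The contingency tables are hypothetical and are not taken from the dataset.

```python
import math


def cramers_v(table):
    """Cramér's V (no bias correction) for a contingency table of counts."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]

    # Pearson chi-squared statistic against the independence model
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k)) if k > 0 else 0.0


# Hypothetical 2x2 tables, not taken from the dataset
independent = [[20, 30], [40, 60]]   # rows are proportional: no association
dependent = [[50, 0], [0, 50]]       # each row maps to exactly one column

print(cramers_v(independent), cramers_v(dependent))
```

Values range from 0 (no association) to 1 (one variable fully determines the other), so a value such as 0.241 would indicate a weak association on this scale.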

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index)  순번  담당부서명  소재지                          토지지목코드  면적(제곱미터)
0        1     건축과      부산광역시 남구 대연동 219-36    02-답         21.0
1        2     건축과      부산광역시 남구 대연동 219-41    02-답         1.0
2        3     건축과      부산광역시 남구 대연동 219-45    08-대         6.0
3        4     재무과      부산광역시 남구 대연동 225-3     02-답         2.0
4        5     재무과      부산광역시 남구 대연동 235-1     08-대         50.0
5        6     재무과      부산광역시 남구 대연동 245-45    08-대         2.0
6        7     재무과      부산광역시 남구 대연동 245-98    08-대         1.0
7        8     재무과      부산광역시 남구 대연동 245-222   01-전         18.0
8        9     재무과      부산광역시 남구 대연동 282-4     08-대         28.6
9        10    재무과      부산광역시 남구 대연동 293-0     08-대         15.2

Last rows

(index)  순번  담당부서명  소재지                          토지지목코드  면적(제곱미터)
757      758   재무과      부산광역시 남구 감만동 73-201    08-대         2.0
758      759   재무과      부산광역시 남구 감만동 75-58     08-대         1.0
759      760   재무과      부산광역시 남구 감만동 128-14    14-도로       9.0
760      761   재무과      부산광역시 남구 감만동 141-1     08-대         9.14
761      762   재무과      부산광역시 남구 감만동 141-12    14-도로       0.86
762      763   재무과      부산광역시 남구 감만동 205-140   14-도로       2.0
763      764   건축과      부산광역시 남구 감만동 589-5     08-대         13.0
764      765   건축과      부산광역시 남구 감만동 589-14    08-대         14.0
765      766   건축과      부산광역시 남구 감만동 590-3     08-대         7.0
766      767   건축과      부산광역시 남구 감만동 590-11    08-대         10.0