Overview

Dataset statistics

Number of variables: 4
Number of observations: 10000
Missing cells: 375
Missing cells (%): 0.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 419.9 KiB
Average record size in memory: 43.0 B
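The two derived figures in this table can be checked by hand. The sketch below is only an arithmetic cross-check using the numbers reported above; the variable names are illustrative and not part of the report.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 4
n_observations = 10_000
missing_cells = 375
total_size_kib = 419.9

# Missing cells (%) is taken over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size is the total memory divided by the row count.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Both results round to the reported 0.9% and 43.0 B.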

Variable types

Numeric: 3
Text: 1

Dataset

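Description: Data on the building register (general) for buildings registered within Changwon-si, Gyeongsangnam-do, providing each building's address, building area, and number of parking spaces.
Author: Changwon-si, Gyeongsangnam-do (경상남도 창원시)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15064338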

Alerts

건축 면적 (building area) is highly overall correlated with 주차 대수 (number of parking spaces) [High correlation]
주차 대수 (number of parking spaces) is highly overall correlated with 건축 면적 (building area) [High correlation]
건축 면적 has 375 (3.8%) missing values [Missing]
건축 면적 is highly skewed (γ1 = 20.55724589) [Skewed]
주차 대수 is highly skewed (γ1 = 26.58065615) [Skewed]
순번 (serial number) has unique values [Unique]
건축 면적 has 2423 (24.2%) zeros [Zeros]
주차 대수 has 7521 (75.2%) zeros [Zeros]
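The skewness alerts report γ1 for the two zero-heavy columns. As a rough illustration of why such columns produce large positive values, the sketch below computes the uncorrected moment-based skewness g1 = m3 / m2^1.5 on made-up data. The function name and the sample values are hypothetical, and the report's exact estimator is not stated here; it may apply a bias correction, so the numbers are not expected to match the report.

```python
def moment_skewness(values):
    """Uncorrected moment-based skewness: g1 = m3 / m2 ** 1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5

# Made-up samples (not taken from the dataset).
symmetric = [1, 2, 3, 4, 5]
zero_inflated = [0] * 95 + [1, 2, 2, 4, 3000]  # mostly zeros, one huge value

print(moment_skewness(symmetric))      # symmetric data: skewness of zero
print(moment_skewness(zero_inflated))  # long right tail: large positive value
```

A column that is mostly zeros with a handful of very large values, like the two flagged here, behaves like the second sample.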

Reproduction

Analysis started: 2023-12-11 00:26:17.033236
Analysis finished: 2023-12-11 00:26:18.903976
Duration: 1.87 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

순번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 50053.087
Minimum: 12
Maximum: 100082
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 12
5-th percentile: 5311.75
Q1: 24964.5
Median: 50207.5
Q3: 74825.75
95-th percentile: 94888.8
Maximum: 100082
Range: 100070
Interquartile range (IQR): 49861.25

Descriptive statistics

Standard deviation: 28722.444
Coefficient of variation (CV): 0.5738396
Kurtosis: -1.1965444
Mean: 50053.087
Median Absolute Deviation (MAD): 24920.5
Skewness: 0.001855686
Sum: 5.0053087 × 10^8
Variance: 8.2497877 × 10^8
Monotonicity: Not monotonic
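Several of the figures reported for 순번 are simple functions of the others. The sketch below recomputes them from the reported values as a consistency check; the variable names are illustrative, and small differences in the last digits come from the rounding of the inputs.

```python
import math

# Values copied from the 순번 statistics.
n = 10_000
minimum, maximum = 12, 100_082
q1, q3 = 24_964.5, 74_825.75
mean, std = 50_053.087, 28_722.444

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance
total = mean * n                 # Sum

print(value_range, iqr)
print(f"{cv:.7f} {variance:.7e} {total:.7e}")
# Compare with the reported CV, within rounding of the inputs.
print(math.isclose(cv, 0.5738396, rel_tol=1e-6))
```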
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
43731 | 1 | < 0.1%
75821 | 1 | < 0.1%
71094 | 1 | < 0.1%
67594 | 1 | < 0.1%
67846 | 1 | < 0.1%
14954 | 1 | < 0.1%
56760 | 1 | < 0.1%
12725 | 1 | < 0.1%
71362 | 1 | < 0.1%
13383 | 1 | < 0.1%
Other values (9990) | 9990 | 99.9%

Minimum 10 values
Value | Count | Frequency (%)
12 | 1 | < 0.1%
51 | 1 | < 0.1%
86 | 1 | < 0.1%
108 | 1 | < 0.1%
117 | 1 | < 0.1%
145 | 1 | < 0.1%
153 | 1 | < 0.1%
154 | 1 | < 0.1%
158 | 1 | < 0.1%
165 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
100082 | 1 | < 0.1%
100066 | 1 | < 0.1%
100061 | 1 | < 0.1%
100053 | 1 | < 0.1%
100044 | 1 | < 0.1%
100043 | 1 | < 0.1%
100038 | 1 | < 0.1%
100027 | 1 | < 0.1%
100024 | 1 | < 0.1%
100001 | 1 | < 0.1%

건축 위치 (building location)
Text

Distinct: 9611
Distinct (%): 96.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 33
Median length: 31
Mean length: 26.8815
Min length: 21

Characters and Unicode

Total characters: 268815
Distinct characters: 157
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
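As a small illustration of those character properties, the sketch below uses Python's standard `unicodedata` module to count general categories in the first sample address from this report. The two-letter codes map to the category names used here: Lo = Other Letter, Zs = Space Separator, Nd = Decimal Number, Pd = Dash Punctuation. This shows the idea only and is not the profiling tool's own implementation.

```python
import unicodedata
from collections import Counter

# First sample row of the address column, normalized to composed form
# so that each Hangul syllable is a single code point.
sample = unicodedata.normalize("NFC", "경상남도 창원시 마산합포구 완월동 대지 249-7")

# Count the Unicode general category of every character.
category_counts = Counter(unicodedata.category(ch) for ch in sample)

for code, count in category_counts.most_common():
    print(code, count)
```

For this one address the counts are 17 letters, 5 spaces, 4 digits, and 1 dash, the same four categories the report finds across the whole column.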

Unique

Unique: 9391
Unique (%): 93.9%

Sample

1st row: 경상남도 창원시 마산합포구 완월동 대지 249-7
2nd row: 경상남도 창원시 성산구 중앙동 대지 45-6
3rd row: 경상남도 창원시 마산회원구 내서읍 신감리 대지 917-5
4th row: 경상남도 창원시 마산합포구 진북면 추곡리 대지 583-3
5th row: 경상남도 창원시 진해구 여좌동 대지 122-22
Value | Count | Frequency (%)
경상남도 | 10000 | 16.0%
창원시 | 10000 | 16.0%
대지 | 9935 | 15.9%
마산합포구 | 2674 | 4.3%
의창구 | 2484 | 4.0%
진해구 | 1881 | 3.0%
마산회원구 | 1771 | 2.8%
성산구 | 1190 | 1.9%
북면 | 420 | 0.7%
동읍 | 394 | 0.6%
Other values (7280) | 21868 | 34.9%

Most occurring characters

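Value | Count | Frequency (%)
(space) | 52617 | 19.6%
(not captured) | 12564 | 4.7%
(not captured) | 12251 | 4.6%
(not captured) | 10545 | 3.9%
(not captured) | 10488 | 3.9%
(not captured) | 10443 | 3.9%
(not captured) | 10313 | 3.8%
(not captured) | 10272 | 3.8%
(not captured) | 10102 | 3.8%
(not captured) | 10100 | 3.8%
Other values (147) | 119120 | 44.3%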

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 165561 | 61.6%
Space Separator | 52617 | 19.6%
Decimal Number | 40637 | 15.1%
Dash Punctuation | 10000 | 3.7%

Most frequent character per category

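Other Letter
Value | Count | Frequency (%)
(not captured) | 12564 | 7.6%
(not captured) | 12251 | 7.4%
(not captured) | 10545 | 6.4%
(not captured) | 10488 | 6.3%
(not captured) | 10443 | 6.3%
(not captured) | 10313 | 6.2%
(not captured) | 10272 | 6.2%
(not captured) | 10102 | 6.1%
(not captured) | 10100 | 6.1%
(not captured) | 10017 | 6.1%
Other values (135) | 58466 | 35.3%

Decimal Number
Value | Count | Frequency (%)
1 | 8150 | 20.1%
2 | 5217 | 12.8%
0 | 4355 | 10.7%
3 | 4336 | 10.7%
4 | 3896 | 9.6%
6 | 3506 | 8.6%
5 | 3427 | 8.4%
7 | 2835 | 7.0%
8 | 2518 | 6.2%
9 | 2397 | 5.9%

Space Separator
Value | Count | Frequency (%)
(space) | 52617 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 10000 | 100.0%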

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 165561 | 61.6%
Common | 103254 | 38.4%

Most frequent character per script

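Hangul
Value | Count | Frequency (%)
(not captured) | 12564 | 7.6%
(not captured) | 12251 | 7.4%
(not captured) | 10545 | 6.4%
(not captured) | 10488 | 6.3%
(not captured) | 10443 | 6.3%
(not captured) | 10313 | 6.2%
(not captured) | 10272 | 6.2%
(not captured) | 10102 | 6.1%
(not captured) | 10100 | 6.1%
(not captured) | 10017 | 6.1%
Other values (135) | 58466 | 35.3%

Common
Value | Count | Frequency (%)
(space) | 52617 | 51.0%
- | 10000 | 9.7%
1 | 8150 | 7.9%
2 | 5217 | 5.1%
0 | 4355 | 4.2%
3 | 4336 | 4.2%
4 | 3896 | 3.8%
6 | 3506 | 3.4%
5 | 3427 | 3.3%
7 | 2835 | 2.7%
Other values (2) | 4915 | 4.8%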

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 165561 | 61.6%
ASCII | 103254 | 38.4%

Most frequent character per block

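ASCII
Value | Count | Frequency (%)
(space) | 52617 | 51.0%
- | 10000 | 9.7%
1 | 8150 | 7.9%
2 | 5217 | 5.1%
0 | 4355 | 4.2%
3 | 4336 | 4.2%
4 | 3896 | 3.8%
6 | 3506 | 3.4%
5 | 3427 | 3.3%
7 | 2835 | 2.7%
Other values (2) | 4915 | 4.8%

Hangul
Value | Count | Frequency (%)
(not captured) | 12564 | 7.6%
(not captured) | 12251 | 7.4%
(not captured) | 10545 | 6.4%
(not captured) | 10488 | 6.3%
(not captured) | 10443 | 6.3%
(not captured) | 10313 | 6.2%
(not captured) | 10272 | 6.2%
(not captured) | 10102 | 6.1%
(not captured) | 10100 | 6.1%
(not captured) | 10017 | 6.1%
Other values (135) | 58466 | 35.3%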

건축 면적 (building area)
Real number (ℝ)

HIGH CORRELATION  MISSING  SKEWED  ZEROS 

Distinct: 5600
Distinct (%): 58.2%
Missing: 375
Missing (%): 3.8%
Infinite: 0
Infinite (%): 0.0%
Mean: 197.25784
Minimum: 0
Maximum: 37107.185
Zeros: 2423
Zeros (%): 24.2%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB
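The percentages for this column do not all share a denominator. The sketch below recomputes them from the reported counts. It is an arithmetic check only: the observation that Distinct (%) matches a non-missing denominator is inferred from the numbers, not stated by the report.

```python
# Counts copied from the 건축 면적 statistics.
n_rows = 10_000
missing = 375
distinct = 5_600
zeros = 2_423

non_missing = n_rows - missing  # 9625 rows actually carry a value

# Distinct (%) matches the report only when divided by non-missing rows.
distinct_pct = 100 * distinct / non_missing
distinct_pct_all_rows = 100 * distinct / n_rows

# Missing (%) and Zeros (%) are consistent with dividing by all rows.
missing_pct = 100 * missing / n_rows
zeros_pct = 100 * zeros / n_rows

print(f"{distinct_pct:.2f} vs {distinct_pct_all_rows:.2f}")
print(f"{missing_pct:.2f} {zeros_pct:.2f}")
```

Dividing 5600 by the 9625 non-missing rows gives about 58.2%, which agrees with the report, whereas dividing by all 10000 rows would give 56.0%.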

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 80.5
Q3: 127.62
95-th percentile: 493.4
Maximum: 37107.185
Range: 37107.185
Interquartile range (IQR): 127.62

Descriptive statistics

Standard deviation: 1033.5971
Coefficient of variation (CV): 5.2398274
Kurtosis: 562.10936
Mean: 197.25784
Median Absolute Deviation (MAD): 56.5
Skewness: 20.557246
Sum: 1898606.7
Variance: 1068322.9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
0.0 | 2423 | 24.2%
198.0 | 19 | 0.2%
19.83 | 14 | 0.1%
26.45 | 13 | 0.1%
66.0 | 13 | 0.1%
18.0 | 12 | 0.1%
49.59 | 11 | 0.1%
50.0 | 11 | 0.1%
29.75 | 11 | 0.1%
59.5 | 11 | 0.1%
Other values (5590) | 7087 | 70.9%
(Missing) | 375 | 3.8%

Minimum 10 values
Value | Count | Frequency (%)
0.0 | 2423 | 24.2%
1.0 | 5 | 0.1%
1.2 | 3 | < 0.1%
1.41 | 1 | < 0.1%
1.437 | 1 | < 0.1%
1.44 | 1 | < 0.1%
1.5 | 1 | < 0.1%
1.54 | 1 | < 0.1%
1.76 | 1 | < 0.1%
2.16 | 1 | < 0.1%

Maximum 10 values
Value | Count | Frequency (%)
37107.185 | 2 | < 0.1%
31128.79 | 1 | < 0.1%
26886.704 | 1 | < 0.1%
22551.143 | 1 | < 0.1%
20228.1525 | 1 | < 0.1%
18063.86 | 1 | < 0.1%
16869.92 | 1 | < 0.1%
16643.72 | 1 | < 0.1%
16004.17 | 1 | < 0.1%
15765.21 | 1 | < 0.1%

주차 대수 (number of parking spaces)
Real number (ℝ)

HIGH CORRELATION  SKEWED  ZEROS 

Distinct: 125
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.603
Minimum: 0
Maximum: 3068
Zeros: 7521
Zeros (%): 75.2%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 7
Maximum: 3068
Range: 3068
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 96.973843
Coefficient of variation (CV): 14.686331
Kurtosis: 786.98533
Mean: 6.603
Median Absolute Deviation (MAD): 0
Skewness: 26.580656
Sum: 66030
Variance: 9403.9262
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
0 | 7521 | 75.2%
2 | 639 | 6.4%
1 | 588 | 5.9%
4 | 307 | 3.1%
3 | 245 | 2.5%
5 | 116 | 1.2%
8 | 107 | 1.1%
7 | 80 | 0.8%
6 | 73 | 0.7%
10 | 34 | 0.3%
Other values (115) | 290 | 2.9%

Minimum 10 values
Value | Count | Frequency (%)
0 | 7521 | 75.2%
1 | 588 | 5.9%
2 | 639 | 6.4%
3 | 245 | 2.5%
4 | 307 | 3.1%
5 | 116 | 1.2%
6 | 73 | 0.7%
7 | 80 | 0.8%
8 | 107 | 1.1%
9 | 17 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
3068 | 6 | 0.1%
3055 | 1 | < 0.1%
2796 | 1 | < 0.1%
1788 | 1 | < 0.1%
1265 | 1 | < 0.1%
1127 | 4 | < 0.1%
1090 | 1 | < 0.1%
926 | 2 | < 0.1%
839 | 1 | < 0.1%
835 | 1 | < 0.1%

Interactions

[Figures: nine pairwise interaction plots; images not reproduced in this text]

Correlations

Variable | 순번 | 건축 면적 | 주차 대수
순번 | 1.000 | 0.068 | 0.058
건축 면적 | 0.068 | 1.000 | 0.116
주차 대수 | 0.058 | 0.116 | 1.000

Variable | 순번 | 건축 면적 | 주차 대수
순번 | 1.000 | -0.077 | 0.002
건축 면적 | -0.077 | 1.000 | 0.533
주차 대수 | 0.002 | 0.533 | 1.000
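The two matrices give quite different values for the same pair of columns (0.116 versus 0.533 for 건축 면적 and 주차 대수), and this text does not say which coefficient each matrix uses. One possible explanation is that they use different correlation measures. The sketch below, on made-up numbers and with hypothetical helper names, shows how a linear (Pearson) coefficient and a rank-based (Spearman) coefficient can disagree when one variable has an extreme outlier. It is an illustration only, not a reconstruction of the report's computation.

```python
import math

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def average_ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# Made-up values (not from the dataset): one extreme area dominates.
area = [0.0, 50.0, 80.0, 120.0, 200.0, 30000.0]
parking = [0, 0, 1, 2, 4, 5]

print(f"Pearson:  {pearson(area, parking):.3f}")
print(f"Spearman: {spearman(area, parking):.3f}")
```

Because ranks ignore how far the outlier lies from the rest, the rank-based value stays high while the linear value is pulled down.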

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번건축 위치건축 면적주차 대수
4373043731경상남도 창원시 마산합포구 완월동 대지 249-70.00
1171311714경상남도 창원시 성산구 중앙동 대지 45-6115.171
9178591876경상남도 창원시 마산회원구 내서읍 신감리 대지 917-560.00
6997569976경상남도 창원시 마산합포구 진북면 추곡리 대지 583-3171.40
7560175602경상남도 창원시 진해구 여좌동 대지 122-2243.040
64776478경상남도 창원시 의창구 명서동 대지 108-2115.980
4015840159경상남도 창원시 마산합포구 월영동 대지 400-120.00
4459644597경상남도 창원시 성산구 귀곡동 대지 555-0253.00
4774647747경상남도 창원시 마산회원구 구암동 대지 44-52650.824
2964329644경상남도 창원시 진해구 소사동 대지 177-499.190
순번건축 위치건축 면적주차 대수
1506915070경상남도 창원시 의창구 봉곡동 대지 128-10119.682
95759576경상남도 창원시 성산구 중앙동 대지 74-12140.110
8985589946경상남도 창원시 진해구 이동 대지 574-4196.01
7959179592경상남도 창원시 진해구 경화동 대지 954-1861.920
4378443785경상남도 창원시 마산합포구 현동 대지 830-1102.480
4786547866경상남도 창원시 마산회원구 구암동 대지 78-2167.610
4656746568경상남도 창원시 마산합포구 자산동 대지 316-280.00
6210762108경상남도 창원시 마산회원구 내서읍 삼계리 대지 233-2325.430
3514835149경상남도 창원시 마산합포구 대창동 대지 44-00.00
8212082211경상남도 창원시 진해구 경화동 대지 1156-2893.160