Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 585.9 KiB
Average record size in memory: 60.0 B
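
The average record size follows from the total size and the row count. A minimal sketch of that arithmetic, assuming the report divides total bytes by the number of observations (the variable names are illustrative):

```python
# Values copied from the "Dataset statistics" table.
total_kib = 585.9   # total size in memory, in KiB
n_rows = 10_000     # number of observations

# 1 KiB = 1024 bytes; dividing by the row count gives bytes per record.
bytes_per_record = total_kib * 1024 / n_rows
print(f"{bytes_per_record:.1f} B")
```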

Variable types

Text: 1
Categorical: 1
Numeric: 3
DateTime: 1

Dataset

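Description: Building register data from the Urban Planning Information System of Jinan-gun, Jeonbuk Special Self-Governing Province (전북특별자치도 진안군). It provides the building register unique number (건축물대장고유번호), city/county/district code (시군구), legal-dong code (법정동), building register main number (건축물대장 본번), building register sub-number (건축물대장 부번), and modification date (수정일).
Author: Jinan-gun, Jeonbuk Special Self-Governing Province (전북특별자치도 진안군)
URL: https://www.data.go.kr/data/15119148/fileData.do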

Alerts

시군구 has a constant value (45720) [Constant]
건축물대장 부번 is highly skewed (γ1 = 22.32764182) [Skewed]
건축물대장고유번호 has unique values [Unique]
건축물대장 부번 has 3588 (35.9%) zeros [Zeros]
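
The Constant, Unique, and Zeros alerts can be illustrated with a few simple column checks. The sketch below is only an illustration: the function name `simple_alerts` and its rules are assumptions, not the profiling tool's actual implementation or thresholds.

```python
def simple_alerts(values):
    """Return alert labels for one column (illustrative rules only)."""
    alerts = []
    distinct = set(values)
    if len(distinct) == 1:
        alerts.append("Constant")           # every row holds the same value
    elif len(distinct) == len(values):
        alerts.append("Unique")             # no value repeats
    zeros = sum(1 for v in values if v == 0)
    if zeros:
        alerts.append(f"Zeros ({zeros / len(values):.1%})")
    return alerts


print(simple_alerts(["45720"] * 4))                   # a constant code column
print(simple_alerts(["45720-16918", "45720-17597"]))  # distinct identifiers
print(simple_alerts([0, 1, 0, 3, 0, 0, 2, 5]))        # half of the values are zero
```

Applied to the figures in this report, 3588 zeros out of 10000 rows gives the 35.9% shown in the Zeros alert.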

Reproduction

Analysis started: 2024-03-14 19:10:58.986817
Analysis finished: 2024-03-14 19:11:02.478239
Duration: 3.49 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
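
The reported duration can be checked against the two timestamps. A small sketch using only the standard library (the format string is an assumption about how the timestamps are written):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
fmt = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-03-14 19:10:58.986817", fmt)
finished = datetime.strptime("2024-03-14 19:11:02.478239", fmt)

# Elapsed wall-clock time in seconds, rounded the way the report shows it.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")  # prints "3.49 seconds"
```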

Variables

건축물대장고유번호
Text

UNIQUE

Distinct: 10000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 15
Median length: 11
Mean length: 11.2798
Min length: 7

Characters and Unicode

Total characters: 112798
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
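
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one identifier taken from the sample rows; the helper name `category_counts` is hypothetical.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count Unicode general categories (e.g. 'Nd', 'Pd') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)


sample_id = "45720-16918"
counts = category_counts(sample_id)
print(counts)  # 'Nd' = Decimal Number, 'Pd' = Dash Punctuation

# All characters fall in the ASCII block (code points below 128).
print(all(ord(ch) < 128 for ch in sample_id))
```

For this identifier the two categories are the same two the report lists: Decimal Number and Dash Punctuation.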

Unique

Unique: 10000
Unique (%): 100.0%

Sample

1st row: 45720-16918
2nd row: 45720-17597
3rd row: 45720-9218
4th row: 45720-4041
5th row: 45720-15253

Value                Count  Frequency (%)
45720-16918              1  < 0.1%
45720-100178428          1  < 0.1%
45720-4155               1  < 0.1%
45720-9992               1  < 0.1%
45720-16960              1  < 0.1%
45720-6106               1  < 0.1%
45720-9352               1  < 0.1%
45720-17214              1  < 0.1%
45720-1208               1  < 0.1%
45720-11885              1  < 0.1%
Other values (9990)   9990  99.9%

Most occurring characters

Value  Count  Frequency (%)
0      17183  15.2%
7      14873  13.2%
2      14126  12.5%
4      14075  12.5%
5      14026  12.4%
1      11652  10.3%
-      10000  8.9%
8       5010  4.4%
3       4197  3.7%
6       4032  3.6%

Most occurring categories

Value             Count   Frequency (%)
Decimal Number    102798  91.1%
Dash Punctuation   10000  8.9%

Most frequent character per category

Decimal Number

Value  Count  Frequency (%)
0      17183  16.7%
7      14873  14.5%
2      14126  13.7%
4      14075  13.7%
5      14026  13.6%
1      11652  11.3%
8       5010  4.9%
3       4197  4.1%
6       4032  3.9%
9       3624  3.5%

Dash Punctuation

Value  Count  Frequency (%)
-      10000  100.0%

Most occurring scripts

Value   Count   Frequency (%)
Common  112798  100.0%

Most frequent character per script

Common

Value  Count  Frequency (%)
0      17183  15.2%
7      14873  13.2%
2      14126  12.5%
4      14075  12.5%
5      14026  12.4%
1      11652  10.3%
-      10000  8.9%
8       5010  4.4%
3       4197  3.7%
6       4032  3.6%

Most occurring blocks

Value  Count   Frequency (%)
ASCII  112798  100.0%

Most frequent character per block

ASCII

Value  Count  Frequency (%)
0      17183  15.2%
7      14873  13.2%
2      14126  12.5%
4      14075  12.5%
5      14026  12.4%
1      11652  10.3%
-      10000  8.9%
8       5010  4.4%
3       4197  3.7%
6       4032  3.6%

시군구
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value  Count
45720  10000

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 45720
2nd row: 45720
3rd row: 45720
4th row: 45720
5th row: 45720

Common Values

Value  Count  Frequency (%)
45720  10000  100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: common values plot]

법정동
Real number (ℝ)

Distinct: 77
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 32808.486
Minimum: 25021
Maximum: 40026
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 25021
5-th percentile: 25021
Q1: 25030
median: 35022
Q3: 38021
95-th percentile: 40023
Maximum: 40026
Range: 15005
Interquartile range (IQR): 12991
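
The last two rows are derived from the others: the range is maximum minus minimum, and the IQR is Q3 minus Q1. A short check with the values from this table (variable names are illustrative):

```python
# Quantile statistics for 법정동, copied from the table.
minimum, q1, q3, maximum = 25021, 25030, 38021, 40026

value_range = maximum - minimum   # spread of the whole column
iqr = q3 - q1                     # spread of the middle 50% of rows

print(value_range)  # 15005
print(iqr)          # 12991
```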

Descriptive statistics

Standard deviation: 5701.2497
Coefficient of variation (CV): 0.17377363
Kurtosis: -1.4413262
Mean: 32808.486
Median Absolute Deviation (MAD): 3997
Skewness: -0.36251472
Sum: 3.2808486 × 10^8
Variance: 32504249
Monotonicity: Not monotonic
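
Several of these figures are tied together by simple identities: CV is the standard deviation divided by the mean, variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below rechecks them from the rounded values printed in this report, so small differences in the last digits are expected.

```python
import math

# Values for 법정동, copied from this report (already rounded for display).
mean = 32808.486
std = 5701.2497
n = 10_000

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance as the square of the standard deviation
total = mean * n       # sum of the column

print(f"CV       = {cv:.8f}")
print(f"Variance = {variance:.0f}")
print(f"Sum      = {total:.7e}")
print(math.isclose(variance, 32504249, rel_tol=1e-6))
```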
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value              Count  Frequency (%)
25021               1040  10.4%
25032                341  3.4%
40022                337  3.4%
25022                328  3.3%
31025                271  2.7%
25023                267  2.7%
37024                251  2.5%
40021                249  2.5%
37021                246  2.5%
38021                221  2.2%
Other values (67)   6449  64.5%

Minimum 10 values

Value  Count  Frequency (%)
25021   1040  10.4%
25022    328  3.3%
25023    267  2.7%
25024    184  1.8%
25025    184  1.8%
25026     44  0.4%
25027    130  1.3%
25028    179  1.8%
25029    123  1.2%
25030     42  0.4%

Maximum 10 values

Value  Count  Frequency (%)
40026    139  1.4%
40025    182  1.8%
40024    140  1.4%
40023    169  1.7%
40022    337  3.4%
40021    249  2.5%
39025    103  1.0%
39024      4  < 0.1%
39023     23  0.2%
39022    104  1.0%

건축물대장 본번
Real number (ℝ)

Distinct: 1612
Distinct (%): 16.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 616.4202
Minimum: 0
Maximum: 2484
Zeros: 15
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 60
Q1: 281
median: 540
Q3: 875
95-th percentile: 1482.05
Maximum: 2484
Range: 2484
Interquartile range (IQR): 594

Descriptive statistics

Standard deviation: 438.67787
Coefficient of variation (CV): 0.71165395
Kurtosis: 0.63394323
Mean: 616.4202
Median Absolute Deviation (MAD): 296
Skewness: 0.91301207
Sum: 6164202
Variance: 192438.28
Monotonicity: Not monotonic
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value                Count  Frequency (%)
200                    169  1.7%
768                    127  1.3%
603                    101  1.0%
387                     90  0.9%
313                     66  0.7%
60                      53  0.5%
288                     43  0.4%
403                     36  0.4%
406                     33  0.3%
110                     31  0.3%
Other values (1602)   9251  92.5%

Minimum 10 values

Value  Count  Frequency (%)
0         15  0.1%
1         11  0.1%
2          9  0.1%
3          6  0.1%
4          7  0.1%
5          6  0.1%
6          7  0.1%
7          2  < 0.1%
8          2  < 0.1%
9          3  < 0.1%

Maximum 10 values

Value  Count  Frequency (%)
2484       1  < 0.1%
2453       2  < 0.1%
2431       6  0.1%
2421       2  < 0.1%
2417       1  < 0.1%
2323       4  < 0.1%
2198       4  < 0.1%
2129       2  < 0.1%
2117       1  < 0.1%
2112       5  0.1%

건축물대장 부번
Real number (ℝ)

SKEWED  ZEROS 

Distinct: 87
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.2114
Minimum: 0
Maximum: 464
Zeros: 3588
Zeros (%): 35.9%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 1
Q3: 3
95-th percentile: 12
Maximum: 464
Range: 464
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 10.037957
Coefficient of variation (CV): 3.1257262
Kurtosis: 905.11809
Mean: 3.2114
Median Absolute Deviation (MAD): 1
Skewness: 22.327642
Sum: 32114
Variance: 100.76059
Monotonicity: Not monotonic
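
A large positive skewness like this one indicates a long right tail: most values sit near zero while a few are far larger. The sketch below computes the plain moment-based skewness g1 = m3 / m2^1.5 on toy data. Which estimator the report uses (for example, a bias-corrected variant) is not stated here, so this is an illustration of the concept rather than a way to reproduce 22.327642; the function name and sample lists are made up.

```python
def moment_skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no small-sample correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n   # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n   # third central moment
    return m3 / m2 ** 1.5


symmetric = [1, 2, 3]                       # balanced around the mean
right_tailed = [0, 0, 0, 1, 1, 2, 3, 40]    # mostly small, one large value

print(moment_skewness(symmetric))      # 0.0 for a symmetric sample
print(moment_skewness(right_tailed))   # positive: the tail is on the right
```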
[Figure: histogram with fixed size bins (bins=50)]

Common values

Value              Count  Frequency (%)
0                   3588  35.9%
1                   2891  28.9%
2                    705  7.0%
3                    594  5.9%
5                    409  4.1%
4                    385  3.9%
6                    232  2.3%
7                    188  1.9%
8                    158  1.6%
9                    139  1.4%
Other values (77)    711  7.1%

Minimum 10 values

Value  Count  Frequency (%)
0       3588  35.9%
1       2891  28.9%
2        705  7.0%
3        594  5.9%
4        385  3.9%
5        409  4.1%
6        232  2.3%
7        188  1.9%
8        158  1.6%
9        139  1.4%

Maximum 10 values

Value  Count  Frequency (%)
464        1  < 0.1%
463        1  < 0.1%
124        1  < 0.1%
110        4  < 0.1%
106        2  < 0.1%
104        1  < 0.1%
102        1  < 0.1%
99         1  < 0.1%
98         1  < 0.1%
89         1  < 0.1%

수정일
DateTime

Distinct: 657
Distinct (%): 6.6%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2009-03-21 00:00:00
Maximum: 2014-03-25 00:00:00

[Figure: histogram with fixed size bins (bins=50)]

Interactions

[Figures: interaction plots, 9 panels]

Correlations

[Figure: correlation heatmap]

                  법정동    건축물대장 본번    건축물대장 부번
법정동             1.000    0.292             0.082
건축물대장 본번     0.292    1.000             0.071
건축물대장 부번     0.082    0.071             1.000

[Figure: correlation heatmap]

                  법정동    건축물대장 본번    건축물대장 부번
법정동             1.000    -0.008            -0.029
건축물대장 본번     -0.008   1.000             -0.017
건축물대장 부번     -0.029   -0.017            1.000

Missing values

[Figure: nullity count by column]
A simple visualization of nullity by column.

[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

Index  건축물대장고유번호   시군구  법정동  본번+부번 (unsplit)  수정일
19979  45720-16918       45720   25022   11022              2013-08-27
8570   45720-17597       45720   25021   2001               2012-03-08
12729  45720-9218        45720   25026   6410               2011-04-15
6483   45720-4041        45720   40025   5354               2009-03-21
11967  45720-15253       45720   32022   7530               2011-04-15
16128  45720-100182456   45720   38021   4500               2011-10-06
14000  45720-10618       45720   38022   1822               2011-11-22
9931   45720-7563        45720   35027   6580               2011-11-18
10493  45720-100186199   45720   38022   187                2012-12-07
16136  45720-100182494   45720   40022   17770              2012-12-21

Index  건축물대장고유번호   시군구  법정동  본번+부번 (unsplit)  수정일
4195   45720-7346        45720   25032   3823               2011-11-25
12803  45720-100179851   45720   35022   6550               2010-11-09
19643  45720-15818       45720   39025   120                2011-04-15
18033  45720-17846       45720   25022   3875               2010-07-22
21052  45720-11089       45720   35028   2491               2011-04-15
16942  45720-7853        45720   25032   17750              2009-03-21
17725  45720-13221       45720   25032   2333               2010-11-09
19996  45720-16937       45720   39021   5891               2013-06-01
3507   45720-1027        45720   25021   4270               2011-10-21
14075  45720-16562       45720   25022   1633               2012-11-22