Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory732.4 KiB
Average record size in memory75.0 B

Variable types

Categorical5
Numeric3

Dataset

Description경기도 파주시 전체 필지에 대한 개별공시지가 데이터로서 시군구, 읍면동, 리, 구분(일반, 산), 본번, 부번, 지가에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15113708/fileData.do

Alerts

시군구 has constant value ""Constant
데이터기준일 has constant value ""Constant
is highly overall correlated with 읍면동High correlation
읍면동 is highly overall correlated with and 1 other fieldsHigh correlation
구분 is highly overall correlated with 읍면동High correlation
구분 is highly imbalanced (74.8%)Imbalance
부번 has 1503 (15.0%) zerosZeros

Reproduction

Analysis started2023-12-12 16:22:38.006827
Analysis finished2023-12-12 16:22:40.073567
Duration2.07 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군구
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
파주시
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row파주시
2nd row파주시
3rd row파주시
4th row파주시
5th row파주시

Common Values

ValueCountFrequency (%)
파주시 10000
100.0%

Length

2023-12-13T01:22:40.152914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:22:40.259451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
파주시 10000
100.0%

읍면동
Categorical

HIGH CORRELATION 

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
문산읍
3229 
금촌동
693 
상지석동
597 
야당동
562 
동패동
454 
Other values (19)
4465 

Length

Max length4
Median length3
Mean length3.1122
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row금촌동
2nd row동패동
3rd row문산읍
4th row상지석동
5th row문산읍

Common Values

ValueCountFrequency (%)
문산읍 3229
32.3%
금촌동 693
 
6.9%
상지석동 597
 
6.0%
야당동 562
 
5.6%
동패동 454
 
4.5%
검산동 331
 
3.3%
아동동 327
 
3.3%
연다산동 322
 
3.2%
맥금동 321
 
3.2%
산남동 315
 
3.1%
Other values (14) 2849
28.5%

Length

2023-12-13T01:22:40.381868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
문산읍 3229
32.3%
금촌동 693
 
6.9%
상지석동 597
 
6.0%
야당동 562
 
5.6%
동패동 454
 
4.5%
검산동 331
 
3.3%
아동동 327
 
3.3%
연다산동 322
 
3.2%
맥금동 321
 
3.2%
산남동 315
 
3.1%
Other values (14) 2849
28.5%


Categorical

HIGH CORRELATION 

Distinct12
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
6489 
선유리
 
569
내포리
 
505
마정리
 
460
당동리
 
301
Other values (7)
1676 

Length

Max length4
Median length4
Mean length3.6489
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row문산리
4th row<NA>
5th row이천리

Common Values

ValueCountFrequency (%)
<NA> 6489
64.9%
선유리 569
 
5.7%
내포리 505
 
5.1%
마정리 460
 
4.6%
당동리 301
 
3.0%
사목리 294
 
2.9%
문산리 283
 
2.8%
봉서리 282
 
2.8%
이천리 270
 
2.7%
운천리 245
 
2.5%
Other values (2) 302
 
3.0%

Length

2023-12-13T01:22:40.520031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 6489
64.9%
선유리 569
 
5.7%
내포리 505
 
5.1%
마정리 460
 
4.6%
당동리 301
 
3.0%
사목리 294
 
2.9%
문산리 283
 
2.8%
봉서리 282
 
2.8%
이천리 270
 
2.7%
운천리 245
 
2.5%
Other values (2) 302
 
3.0%

구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반
9359 
 
449
가지번
 
192

Length

Max length3
Median length2
Mean length1.9743
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 9359
93.6%
449
 
4.5%
가지번 192
 
1.9%

Length

2023-12-13T01:22:40.650922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:22:40.747363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 9359
93.6%
449
 
4.5%
가지번 192
 
1.9%

본번
Real number (ℝ)

Distinct1582
Distinct (%)15.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean481.4459
Minimum1
Maximum9800
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:22:40.875825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile23
Q1131
median344
Q3660
95-th percentile1386.1
Maximum9800
Range9799
Interquartile range (IQR)529

Descriptive statistics

Standard deviation543.35215
Coefficient of variation (CV)1.128584
Kurtosis75.469698
Mean481.4459
Median Absolute Deviation (MAD)245
Skewness5.8421751
Sum4814459
Variance295231.56
MonotonicityNot monotonic
2023-12-13T01:22:41.058644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
56 54
 
0.5%
17 52
 
0.5%
466 48
 
0.5%
554 46
 
0.5%
114 46
 
0.5%
551 42
 
0.4%
10 42
 
0.4%
421 40
 
0.4%
177 40
 
0.4%
81 39
 
0.4%
Other values (1572) 9551
95.5%
ValueCountFrequency (%)
1 27
0.3%
2 26
0.3%
3 19
0.2%
4 26
0.3%
5 35
0.4%
6 18
0.2%
7 16
 
0.2%
8 17
0.2%
9 21
0.2%
10 42
0.4%
ValueCountFrequency (%)
9800 5
0.1%
9007 1
 
< 0.1%
8400 3
< 0.1%
7500 1
 
< 0.1%
7200 1
 
< 0.1%
7100 1
 
< 0.1%
5020 1
 
< 0.1%
5019 1
 
< 0.1%
5016 1
 
< 0.1%
5014 1
 
< 0.1%

부번
Real number (ℝ)

ZEROS 

Distinct265
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.4021
Minimum0
Maximum591
Zeros1503
Zeros (%)15.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:22:41.200278image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median5
Q314
95-th percentile71
Maximum591
Range591
Interquartile range (IQR)13

Descriptive statistics

Standard deviation38.537365
Coefficient of variation (CV)2.3495385
Kurtosis48.940954
Mean16.4021
Median Absolute Deviation (MAD)4
Skewness5.924337
Sum164021
Variance1485.1285
MonotonicityNot monotonic
2023-12-13T01:22:41.348632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1503
15.0%
1 1223
 
12.2%
2 910
 
9.1%
3 700
 
7.0%
4 580
 
5.8%
5 491
 
4.9%
6 391
 
3.9%
7 347
 
3.5%
8 320
 
3.2%
9 239
 
2.4%
Other values (255) 3296
33.0%
ValueCountFrequency (%)
0 1503
15.0%
1 1223
12.2%
2 910
9.1%
3 700
7.0%
4 580
 
5.8%
5 491
 
4.9%
6 391
 
3.9%
7 347
 
3.5%
8 320
 
3.2%
9 239
 
2.4%
ValueCountFrequency (%)
591 1
< 0.1%
568 1
< 0.1%
538 1
< 0.1%
527 1
< 0.1%
514 1
< 0.1%
513 1
< 0.1%
472 1
< 0.1%
445 1
< 0.1%
429 1
< 0.1%
416 1
< 0.1%

공시지가
Real number (ℝ)

Distinct3495
Distinct (%)34.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean364912.83
Minimum6270
Maximum5206000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:22:41.506168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum6270
5-th percentile23800
Q164000
median171750
Q3521525
95-th percentile1231000
Maximum5206000
Range5199730
Interquartile range (IQR)457525

Descriptive statistics

Standard deviation471978.65
Coefficient of variation (CV)1.2934011
Kurtosis14.285322
Mean364912.83
Median Absolute Deviation (MAD)132650
Skewness2.9544751
Sum3.6491283 × 109
Variance2.2276385 × 1011
MonotonicityNot monotonic
2023-12-13T01:22:41.969376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
27200 162
 
1.6%
71000 76
 
0.8%
23400 68
 
0.7%
949300 62
 
0.6%
39100 58
 
0.6%
23800 55
 
0.5%
44900 53
 
0.5%
52500 53
 
0.5%
20900 49
 
0.5%
44300 45
 
0.4%
Other values (3485) 9319
93.2%
ValueCountFrequency (%)
6270 1
 
< 0.1%
7350 2
 
< 0.1%
7950 1
 
< 0.1%
8050 1
 
< 0.1%
8410 1
 
< 0.1%
8840 1
 
< 0.1%
9850 1
 
< 0.1%
11000 19
0.2%
11400 1
 
< 0.1%
11700 2
 
< 0.1%
ValueCountFrequency (%)
5206000 1
< 0.1%
4440000 2
< 0.1%
4429000 1
< 0.1%
4370000 2
< 0.1%
4235000 1
< 0.1%
4190000 1
< 0.1%
4189000 1
< 0.1%
4045000 1
< 0.1%
4018000 2
< 0.1%
3980000 1
< 0.1%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-01-01
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-01-01
2nd row2023-01-01
3rd row2023-01-01
4th row2023-01-01
5th row2023-01-01

Common Values

ValueCountFrequency (%)
2023-01-01 10000
100.0%

Length

2023-12-13T01:22:42.086595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:22:42.187676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-01-01 10000
100.0%

Interactions

2023-12-13T01:22:39.393667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:38.721282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:39.066847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:39.501333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:38.840710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:39.168607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:39.625308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:38.970185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:22:39.274878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:22:42.277214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
읍면동구분본번부번공시지가
읍면동1.0001.0000.8350.5860.2440.534
1.0001.0000.1640.4170.3350.447
구분0.8350.1641.0000.4110.0540.331
본번0.5860.4170.4111.0000.0000.211
부번0.2440.3350.0540.0001.0000.111
공시지가0.5340.4470.3310.2110.1111.000
2023-12-13T01:22:42.375709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분읍면동
1.0000.1570.999
구분0.1571.0000.585
읍면동0.9990.5851.000
2023-12-13T01:22:42.458354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
본번부번공시지가읍면동구분
본번1.000-0.2400.0320.2480.4000.287
부번-0.2401.0000.1880.0910.1490.032
공시지가0.0320.1881.0000.2260.2390.210
읍면동0.2480.0910.2261.0000.9990.585
0.4000.1490.2390.9991.0000.157
구분0.2870.0320.2100.5850.1571.000

Missing values

2023-12-13T01:22:39.809156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:22:39.986839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군구읍면동구분본번부번공시지가데이터기준일
2926파주시금촌동<NA>일반3736753002023-01-01
43343파주시동패동<NA>일반20941210390002023-01-01
63284파주시문산읍문산리일반56854507002023-01-01
32025파주시상지석동<NA>일반44735450002023-01-01
91317파주시문산읍이천리일반3180497002023-01-01
95197파주시파주읍봉서리일반7532228002023-01-01
78620파주시문산읍마정리일반101272002023-01-01
25145파주시야당동<NA>일반4231410500002023-01-01
19690파주시교하동<NA>일반5203712002023-01-01
39804파주시동패동<NA>일반4041289002023-01-01
시군구읍면동구분본번부번공시지가데이터기준일
29496파주시오도동<NA>일반10544437002023-01-01
19881파주시교하동<NA>일반56023184002023-01-01
27732파주시다율동<NA>가지번113749105002023-01-01
78031파주시문산읍사목리일반6611868002023-01-01
78148파주시문산읍사목리일반7101758002023-01-01
171파주시금촌동<NA>일반2469142002023-01-01
38807파주시산남동<NA>일반4500311002023-01-01
16309파주시맥금동<NA>일반557271063002023-01-01
8819파주시아동동<NA>일반30310601002023-01-01
7032파주시아동동<NA>일반8416944002023-01-01