Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory644.5 KiB
Average record size in memory66.0 B

Variable types

Numeric2
Categorical3
Text2

Dataset

Description의정부시 전체 개별공지가로 일련번호, 법정동, 구분, 본번, 부번, 결정지가(원), 공시기준일을 제공합니다. 자세한 개별공시지가는 부동산공시가격 알리미 사이트를 이용하시기 바랍니다.
Author공공데이터포털
URLhttps://www.data.go.kr/data/15113674/fileData.do

Alerts

공시기준일 has constant value ""Constant
일련번호 is highly overall correlated with 법정동High correlation
법정동 is highly overall correlated with 일련번호High correlation
구분 is highly imbalanced (80.6%)Imbalance
일련번호 has unique valuesUnique

Reproduction

Analysis started2024-04-21 16:29:55.201782
Analysis finished2024-04-21 16:29:57.477239
Duration2.28 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

일련번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26646.155
Minimum10
Maximum53201
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T01:29:57.879586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10
5-th percentile2674.35
Q113519.75
median26581.5
Q339968.5
95-th percentile50597.25
Maximum53201
Range53191
Interquartile range (IQR)26448.75

Descriptive statistics

Standard deviation15334.237
Coefficient of variation (CV)0.57547655
Kurtosis-1.1999325
Mean26646.155
Median Absolute Deviation (MAD)13214.5
Skewness0.0030955706
Sum2.6646155 × 108
Variance2.3513882 × 108
MonotonicityNot monotonic
2024-04-22T01:29:58.313220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
50509 1
 
< 0.1%
9263 1
 
< 0.1%
4096 1
 
< 0.1%
47496 1
 
< 0.1%
37008 1
 
< 0.1%
25751 1
 
< 0.1%
19648 1
 
< 0.1%
43405 1
 
< 0.1%
41057 1
 
< 0.1%
43247 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
10 1
< 0.1%
13 1
< 0.1%
16 1
< 0.1%
24 1
< 0.1%
31 1
< 0.1%
41 1
< 0.1%
42 1
< 0.1%
46 1
< 0.1%
62 1
< 0.1%
65 1
< 0.1%
ValueCountFrequency (%)
53201 1
< 0.1%
53200 1
< 0.1%
53199 1
< 0.1%
53197 1
< 0.1%
53195 1
< 0.1%
53192 1
< 0.1%
53180 1
< 0.1%
53176 1
< 0.1%
53171 1
< 0.1%
53168 1
< 0.1%

법정동
Categorical

HIGH CORRELATION 

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
가능동
1710 
의정부동
1555 
신곡동
1012 
호원동
1009 
금오동
795 
Other values (8)
3919 

Length

Max length4
Median length3
Mean length3.1555
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고산동
2nd row용현동
3rd row금오동
4th row호원동
5th row용현동

Common Values

ValueCountFrequency (%)
가능동 1710
17.1%
의정부동 1555
15.6%
신곡동 1012
10.1%
호원동 1009
10.1%
금오동 795
8.0%
녹양동 690
6.9%
용현동 660
 
6.6%
장암동 593
 
5.9%
고산동 536
 
5.4%
자일동 460
 
4.6%
Other values (3) 980
9.8%

Length

2024-04-22T01:29:58.722287image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
가능동 1710
17.1%
의정부동 1555
15.6%
신곡동 1012
10.1%
호원동 1009
10.1%
금오동 795
8.0%
녹양동 690
6.9%
용현동 660
 
6.6%
장암동 593
 
5.9%
고산동 536
 
5.4%
자일동 460
 
4.6%
Other values (3) 980
9.8%

구분
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반
9155 
 
796
블럭(지구)
 
37
블럭
 
8
 
4

Length

Max length6
Median length2
Mean length1.9348
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반
2nd row일반
3rd row일반
4th row일반
5th row일반

Common Values

ValueCountFrequency (%)
일반 9155
91.5%
796
 
8.0%
블럭(지구) 37
 
0.4%
블럭 8
 
0.1%
4
 
< 0.1%

Length

2024-04-22T01:29:59.101947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T01:29:59.422354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반 9155
91.5%
796
 
8.0%
블럭(지구 37
 
0.4%
블럭 8
 
0.1%
4
 
< 0.1%

본번
Text

Distinct919
Distinct (%)9.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-22T01:30:00.886674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.7689
Min length1

Characters and Unicode

Total characters27689
Distinct characters28
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique87 ?
Unique (%)0.9%

Sample

1st row761
2nd row522
3rd row386
4th row270
5th row186
ValueCountFrequency (%)
15 103
 
1.0%
581 71
 
0.7%
229 66
 
0.7%
53 63
 
0.6%
226 62
 
0.6%
119 49
 
0.5%
31 47
 
0.5%
7 45
 
0.4%
316 44
 
0.4%
230 41
 
0.4%
Other values (909) 9409
94.1%
2024-04-22T01:30:02.521604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 3795
13.7%
2 3550
12.8%
3 3510
12.7%
4 2882
10.4%
5 2786
10.1%
6 2679
9.7%
7 2402
8.7%
8 2128
7.7%
0 2008
7.3%
9 1900
6.9%
Other values (18) 49
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 27640
99.8%
Other Letter 42
 
0.2%
Uppercase Letter 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
23.8%
6
14.3%
5
11.9%
5
11.9%
4
 
9.5%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (6) 6
14.3%
Decimal Number
ValueCountFrequency (%)
1 3795
13.7%
2 3550
12.8%
3 3510
12.7%
4 2882
10.4%
5 2786
10.1%
6 2679
9.7%
7 2402
8.7%
8 2128
7.7%
0 2008
7.3%
9 1900
6.9%
Uppercase Letter
ValueCountFrequency (%)
B 6
85.7%
A 1
 
14.3%

Most occurring scripts

ValueCountFrequency (%)
Common 27640
99.8%
Hangul 42
 
0.2%
Latin 7
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
23.8%
6
14.3%
5
11.9%
5
11.9%
4
 
9.5%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (6) 6
14.3%
Common
ValueCountFrequency (%)
1 3795
13.7%
2 3550
12.8%
3 3510
12.7%
4 2882
10.4%
5 2786
10.1%
6 2679
9.7%
7 2402
8.7%
8 2128
7.7%
0 2008
7.3%
9 1900
6.9%
Latin
ValueCountFrequency (%)
B 6
85.7%
A 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 27647
99.8%
Hangul 42
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 3795
13.7%
2 3550
12.8%
3 3510
12.7%
4 2882
10.4%
5 2786
10.1%
6 2679
9.7%
7 2402
8.7%
8 2128
7.7%
0 2008
7.3%
9 1900
6.9%
Other values (2) 7
 
< 0.1%
Hangul
ValueCountFrequency (%)
10
23.8%
6
14.3%
5
11.9%
5
11.9%
4
 
9.5%
2
 
4.8%
1
 
2.4%
1
 
2.4%
1
 
2.4%
1
 
2.4%
Other values (6) 6
14.3%

부번
Text

Distinct341
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-04-22T01:30:03.623653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length1.5576
Min length1

Characters and Unicode

Total characters15576
Distinct characters11
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique116 ?
Unique (%)1.2%

Sample

1st row2
2nd row1
3rd row29
4th row2
5th row3
ValueCountFrequency (%)
0 783
 
7.8%
1 757
 
7.6%
2 659
 
6.6%
3 551
 
5.5%
4 477
 
4.8%
5 443
 
4.4%
6 346
 
3.5%
7 329
 
3.3%
8 294
 
2.9%
9 293
 
2.9%
Other values (331) 5068
50.7%
2024-04-22T01:30:04.949970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 3642
23.4%
2 2431
15.6%
3 1864
12.0%
0 1523
9.8%
4 1426
 
9.2%
5 1209
 
7.8%
6 987
 
6.3%
7 886
 
5.7%
8 815
 
5.2%
9 792
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 15575
> 99.9%
Other Letter 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 3642
23.4%
2 2431
15.6%
3 1864
12.0%
0 1523
9.8%
4 1426
 
9.2%
5 1209
 
7.8%
6 987
 
6.3%
7 886
 
5.7%
8 815
 
5.2%
9 792
 
5.1%
Other Letter
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15575
> 99.9%
Hangul 1
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 3642
23.4%
2 2431
15.6%
3 1864
12.0%
0 1523
9.8%
4 1426
 
9.2%
5 1209
 
7.8%
6 987
 
6.3%
7 886
 
5.7%
8 815
 
5.2%
9 792
 
5.1%
Hangul
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15575
> 99.9%
Hangul 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 3642
23.4%
2 2431
15.6%
3 1864
12.0%
0 1523
9.8%
4 1426
 
9.2%
5 1209
 
7.8%
6 987
 
6.3%
7 886
 
5.7%
8 815
 
5.2%
9 792
 
5.1%
Hangul
ValueCountFrequency (%)
1
100.0%

결정지가(원)
Real number (ℝ)

Distinct3817
Distinct (%)38.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1017639.1
Minimum1100
Maximum12260000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-22T01:30:05.198724image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1100
5-th percentile23400
Q1236450
median731600
Q31477000
95-th percentile2671000
Maximum12260000
Range12258900
Interquartile range (IQR)1240550

Descriptive statistics

Standard deviation1124592
Coefficient of variation (CV)1.105099
Kurtosis24.449346
Mean1017639.1
Median Absolute Deviation (MAD)593400
Skewness3.6471815
Sum1.0176391 × 1010
Variance1.2647072 × 1012
MonotonicityNot monotonic
2024-04-22T01:30:05.466285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1774000 123
 
1.2%
96600 44
 
0.4%
510500 41
 
0.4%
1411000 41
 
0.4%
264500 36
 
0.4%
393300 35
 
0.4%
1564000 35
 
0.4%
42100 33
 
0.3%
92200 32
 
0.3%
280500 31
 
0.3%
Other values (3807) 9549
95.5%
ValueCountFrequency (%)
1100 1
 
< 0.1%
1250 3
 
< 0.1%
1260 3
 
< 0.1%
1330 12
0.1%
1760 4
 
< 0.1%
1810 1
 
< 0.1%
2800 1
 
< 0.1%
2820 1
 
< 0.1%
2850 1
 
< 0.1%
2860 1
 
< 0.1%
ValueCountFrequency (%)
12260000 1
 
< 0.1%
11830000 2
< 0.1%
11750000 3
< 0.1%
11640000 1
 
< 0.1%
11540000 1
 
< 0.1%
11510000 2
< 0.1%
11430000 4
< 0.1%
11310000 3
< 0.1%
11050000 2
< 0.1%
11040000 1
 
< 0.1%

공시기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-01-01
10000 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-01-01
2nd row2023-01-01
3rd row2023-01-01
4th row2023-01-01
5th row2023-01-01

Common Values

ValueCountFrequency (%)
2023-01-01 10000
100.0%

Length

2024-04-22T01:30:05.705797image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-22T01:30:05.861603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-01-01 10000
100.0%

Interactions

2024-04-22T01:29:56.292727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:29:55.765115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:29:56.559242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-22T01:29:56.023395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-22T01:30:05.957197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호법정동구분결정지가(원)
일련번호1.0000.9470.3140.521
법정동0.9471.0000.2820.442
구분0.3140.2821.0000.248
결정지가(원)0.5210.4420.2481.000
2024-04-22T01:30:06.113526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
법정동구분
법정동1.0000.156
구분0.1561.000
2024-04-22T01:30:06.255798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
일련번호결정지가(원)법정동구분
일련번호1.000-0.3150.8000.135
결정지가(원)-0.3151.0000.2000.105
법정동0.8000.2001.0000.156
구분0.1350.1050.1561.000

Missing values

2024-04-22T01:29:56.920738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-22T01:29:57.311307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

일련번호법정동구분본번부번결정지가(원)공시기준일
5050850509고산동일반76122645002023-01-01
2525025251용현동일반52216766002023-01-01
3427234273금오동일반3862913970002023-01-01
1113111132호원동일반27023412002023-01-01
2347223473용현동일반18632218002023-01-01
2813528136낙양동일반34128133002023-01-01
84858486호원동일반41016030002023-01-01
1846718468신곡동일반3525512460002023-01-01
4819648197녹양동536239002023-01-01
3537935380금오동일반46914702002023-01-01
일련번호법정동구분본번부번결정지가(원)공시기준일
2328623287용현동일반168119106002023-01-01
1905319054신곡동일반41884851002023-01-01
3890538906가능동일반43545407002023-01-01
1851118512신곡동일반353193927002023-01-01
15691570의정부동일반1451927580002023-01-01
3255232553금오동일반131663366002023-01-01
1329613297호원동일반461016800002023-01-01
3997539976가능동일반591157588002023-01-01
5131051311고산동682463002023-01-01
1143511436호원동일반30033090002023-01-01