Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells152
Missing cells (%)0.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory712.9 KiB
Average record size in memory73.0 B

Variable types

Numeric1
Text5
Categorical2

Dataset

Description경기부동산포털_토지_집합건물대지권
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=6Q00870RW3QVDW108WL334228762&infSeq=1

Alerts

집합건물_실 is highly imbalanced (99.8%)Imbalance
대지권지분비율 has 145 (1.5%) missing valuesMissing

Reproduction

Analysis started2023-12-10 22:28:04.588609
Analysis finished2023-12-10 22:28:05.743946
Duration1.16 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

토지고유번호
Real number (ℝ)

Distinct578
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.1128727 × 1018
Minimum4.1111129 × 1018
Maximum4.1800253 × 1018
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T07:28:05.809807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum4.1111129 × 1018
5-th percentile4.111113 × 1018
Q14.111113 × 1018
median4.1111136 × 1018
Q34.1115138 × 1018
95-th percentile4.1210102 × 1018
Maximum4.1800253 × 1018
Range6.8912422 × 1016
Interquartile range (IQR)4.008007 × 1014

Descriptive statistics

Standard deviation4.6056754 × 1015
Coefficient of variation (CV)0.0011198196
Kurtosis54.4575
Mean4.1128727 × 1018
Median Absolute Deviation (MAD)6.0000499 × 1011
Skewness5.2882097
Sum-7.5124631 × 1018
Variance2.1212246 × 1031
MonotonicityNot monotonic
2023-12-11T07:28:05.961121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4111113300803330001 533
 
5.3%
4111113300803330000 509
 
5.1%
4111113000103950000 439
 
4.4%
4111113000104010000 439
 
4.4%
4111113000103950005 435
 
4.3%
4111113000103130001 430
 
4.3%
4121010200101050000 286
 
2.9%
4121010200102410000 279
 
2.8%
4111710500109700003 172
 
1.7%
4111313200104630008 156
 
1.6%
Other values (568) 6322
63.2%
ValueCountFrequency (%)
4111112900101990000 36
0.4%
4111112900102080014 4
 
< 0.1%
4111112900102120005 55
0.5%
4111112900102500007 5
 
0.1%
4111112900102590004 1
 
< 0.1%
4111112900103770000 1
 
< 0.1%
4111112900103770004 1
 
< 0.1%
4111112900103770005 2
 
< 0.1%
4111112900103850002 3
 
< 0.1%
4111112900103860001 8
 
0.1%
ValueCountFrequency (%)
4180025322105300046 2
< 0.1%
4180025322105300045 1
 
< 0.1%
4180025322105300035 3
< 0.1%
4180025322105300010 3
< 0.1%
4180025322105300009 2
< 0.1%
4150031021102210026 2
< 0.1%
4150031021102210024 2
< 0.1%
4150031021102210021 3
< 0.1%
4141010500108500000 2
< 0.1%
4141010500108490001 2
< 0.1%
Distinct578
Distinct (%)5.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:28:06.407031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length19
Mean length16.0234
Min length12

Characters and Unicode

Total characters160234
Distinct characters88
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique115 ?
Unique (%)1.1%

Sample

1st row수원시장안구 파장동 산 150
2nd row수원시장안구 정자동 395
3rd row수원시팔달구 화서동 250-4
4th row수원시장안구 정자동 395
5th row수원시팔달구 지동 138-3
ValueCountFrequency (%)
수원시장안구 5493
18.2%
정자동 2633
 
8.7%
수원시팔달구 1662
 
5.5%
광명시 1313
 
4.4%
화서동 1218
 
4.0%
철산동 1170
 
3.9%
수원시권선구 1116
 
3.7%
천천동 1052
 
3.5%
333-1 533
 
1.8%
율전동 523
 
1.7%
Other values (616) 13440
44.6%
2023-12-11T07:28:07.118263image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
30270
18.9%
9989
 
6.2%
9838
 
6.1%
8974
 
5.6%
8898
 
5.6%
8551
 
5.3%
3 8189
 
5.1%
1 7121
 
4.4%
- 6809
 
4.2%
5889
 
3.7%
Other values (78) 55706
34.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 85828
53.6%
Decimal Number 37327
23.3%
Space Separator 30270
 
18.9%
Dash Punctuation 6809
 
4.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9989
11.6%
9838
11.5%
8974
10.5%
8898
10.4%
8551
10.0%
5889
 
6.9%
5493
 
6.4%
2637
 
3.1%
2633
 
3.1%
2129
 
2.5%
Other values (66) 20797
24.2%
Decimal Number
ValueCountFrequency (%)
3 8189
21.9%
1 7121
19.1%
2 4026
10.8%
4 3708
9.9%
5 3639
9.7%
9 3121
 
8.4%
0 2418
 
6.5%
6 2077
 
5.6%
7 1913
 
5.1%
8 1115
 
3.0%
Space Separator
ValueCountFrequency (%)
30270
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6809
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 85828
53.6%
Common 74406
46.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
9989
11.6%
9838
11.5%
8974
10.5%
8898
10.4%
8551
10.0%
5889
 
6.9%
5493
 
6.4%
2637
 
3.1%
2633
 
3.1%
2129
 
2.5%
Other values (66) 20797
24.2%
Common
ValueCountFrequency (%)
30270
40.7%
3 8189
 
11.0%
1 7121
 
9.6%
- 6809
 
9.2%
2 4026
 
5.4%
4 3708
 
5.0%
5 3639
 
4.9%
9 3121
 
4.2%
0 2418
 
3.2%
6 2077
 
2.8%
Other values (2) 3028
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 85828
53.6%
ASCII 74406
46.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30270
40.7%
3 8189
 
11.0%
1 7121
 
9.6%
- 6809
 
9.2%
2 4026
 
5.4%
4 3708
 
5.0%
5 3639
 
4.9%
9 3121
 
4.2%
0 2418
 
3.2%
6 2077
 
2.8%
Other values (2) 3028
 
4.1%
Hangul
ValueCountFrequency (%)
9989
11.6%
9838
11.5%
8974
10.5%
8898
10.4%
8551
10.0%
5889
 
6.9%
5493
 
6.4%
2637
 
3.1%
2633
 
3.1%
2129
 
2.5%
Other values (66) 20797
24.2%
Distinct313
Distinct (%)3.1%
Missing7
Missing (%)0.1%
Memory size156.2 KiB
2023-12-11T07:28:07.404677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length5
Mean length5.9173421
Min length3

Characters and Unicode

Total characters59132
Distinct characters206
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)0.4%

Sample

1st row궁전아파트
2nd row동신아파트
3rd row화서아파트
4th row동신아파트
5th row진우아파트
ValueCountFrequency (%)
동신아파트 1858
18.6%
화서아파트 1109
 
11.1%
주공아파트 1100
 
11.0%
강남아파트 300
 
3.0%
주공아파트(10,11단지 286
 
2.9%
고층주공아파트(13단지 279
 
2.8%
고층주공아파트(12단지 254
 
2.5%
천록아파트 248
 
2.5%
벽산아파트 231
 
2.3%
벽적골9단지주공아파트 172
 
1.7%
Other values (301) 4163
41.6%
2023-12-11T07:28:07.854651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8494
14.4%
8492
14.4%
8464
14.3%
2642
 
4.5%
2398
 
4.1%
2064
 
3.5%
1915
 
3.2%
1 1524
 
2.6%
1311
 
2.2%
1297
 
2.2%
Other values (196) 20531
34.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 53653
90.7%
Decimal Number 2793
 
4.7%
Close Punctuation 1184
 
2.0%
Open Punctuation 1184
 
2.0%
Other Punctuation 306
 
0.5%
Space Separator 11
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8494
15.8%
8492
15.8%
8464
15.8%
2642
 
4.9%
2398
 
4.5%
2064
 
3.8%
1915
 
3.6%
1311
 
2.4%
1297
 
2.4%
1294
 
2.4%
Other values (182) 15282
28.5%
Decimal Number
ValueCountFrequency (%)
1 1524
54.6%
0 286
 
10.2%
3 279
 
10.0%
2 254
 
9.1%
9 184
 
6.6%
7 135
 
4.8%
4 131
 
4.7%
Other Punctuation
ValueCountFrequency (%)
, 286
93.5%
. 16
 
5.2%
4
 
1.3%
Close Punctuation
ValueCountFrequency (%)
) 1184
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1184
100.0%
Space Separator
ValueCountFrequency (%)
11
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 53653
90.7%
Common 5479
 
9.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8494
15.8%
8492
15.8%
8464
15.8%
2642
 
4.9%
2398
 
4.5%
2064
 
3.8%
1915
 
3.6%
1311
 
2.4%
1297
 
2.4%
1294
 
2.4%
Other values (182) 15282
28.5%
Common
ValueCountFrequency (%)
1 1524
27.8%
) 1184
21.6%
( 1184
21.6%
0 286
 
5.2%
, 286
 
5.2%
3 279
 
5.1%
2 254
 
4.6%
9 184
 
3.4%
7 135
 
2.5%
4 131
 
2.4%
Other values (4) 32
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 53653
90.7%
ASCII 5474
 
9.3%
None 4
 
< 0.1%
Box Drawing 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8494
15.8%
8492
15.8%
8464
15.8%
2642
 
4.9%
2398
 
4.5%
2064
 
3.8%
1915
 
3.6%
1311
 
2.4%
1297
 
2.4%
1294
 
2.4%
Other values (182) 15282
28.5%
ASCII
ValueCountFrequency (%)
1 1524
27.8%
) 1184
21.6%
( 1184
21.6%
0 286
 
5.2%
, 286
 
5.2%
3 279
 
5.1%
2 254
 
4.6%
9 184
 
3.4%
7 135
 
2.5%
4 131
 
2.4%
Other values (2) 27
 
0.5%
None
ValueCountFrequency (%)
4
100.0%
Box Drawing
ValueCountFrequency (%)
1
100.0%
Distinct282
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:28:08.272866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.5657
Min length1

Characters and Unicode

Total characters25657
Distinct characters26
Distinct categories4 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.1%

Sample

1st row1
2nd row107
3rd row26
4th row102
5th row1
ValueCountFrequency (%)
0000 1138
 
11.4%
1 440
 
4.4%
364
 
3.6%
101 340
 
3.4%
319
 
3.2%
102 309
 
3.1%
2 298
 
3.0%
103 244
 
2.4%
3 216
 
2.2%
105 204
 
2.0%
Other values (271) 6128
61.3%
2023-12-11T07:28:08.799698image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8141
31.7%
1 7000
27.3%
2 2642
 
10.3%
3 1422
 
5.5%
4 1116
 
4.3%
5 912
 
3.6%
7 788
 
3.1%
9 734
 
2.9%
6 672
 
2.6%
8 640
 
2.5%
Other values (16) 1590
 
6.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24067
93.8%
Other Letter 1582
 
6.2%
Lowercase Letter 6
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
537
33.9%
322
20.4%
166
 
10.5%
163
 
10.3%
104
 
6.6%
102
 
6.4%
102
 
6.4%
55
 
3.5%
14
 
0.9%
7
 
0.4%
Other values (2) 10
 
0.6%
Decimal Number
ValueCountFrequency (%)
0 8141
33.8%
1 7000
29.1%
2 2642
 
11.0%
3 1422
 
5.9%
4 1116
 
4.6%
5 912
 
3.8%
7 788
 
3.3%
9 734
 
3.0%
6 672
 
2.8%
8 640
 
2.7%
Lowercase Letter
ValueCountFrequency (%)
a 5
83.3%
b 1
 
16.7%
Other Punctuation
ValueCountFrequency (%)
. 1
50.0%
/ 1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24069
93.8%
Hangul 1582
 
6.2%
Latin 6
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8141
33.8%
1 7000
29.1%
2 2642
 
11.0%
3 1422
 
5.9%
4 1116
 
4.6%
5 912
 
3.8%
7 788
 
3.3%
9 734
 
3.0%
6 672
 
2.8%
8 640
 
2.7%
Other values (2) 2
 
< 0.1%
Hangul
ValueCountFrequency (%)
537
33.9%
322
20.4%
166
 
10.5%
163
 
10.3%
104
 
6.6%
102
 
6.4%
102
 
6.4%
55
 
3.5%
14
 
0.9%
7
 
0.4%
Other values (2) 10
 
0.6%
Latin
ValueCountFrequency (%)
a 5
83.3%
b 1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24075
93.8%
Hangul 1582
 
6.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8141
33.8%
1 7000
29.1%
2 2642
 
11.0%
3 1422
 
5.9%
4 1116
 
4.6%
5 912
 
3.8%
7 788
 
3.3%
9 734
 
3.0%
6 672
 
2.8%
8 640
 
2.7%
Other values (4) 8
 
< 0.1%
Hangul
ValueCountFrequency (%)
537
33.9%
322
20.4%
166
 
10.5%
163
 
10.3%
104
 
6.6%
102
 
6.4%
102
 
6.4%
55
 
3.5%
14
 
0.9%
7
 
0.4%
Other values (2) 10
 
0.6%

집합건물_층
Categorical

Distinct33
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1
1835 
2
1726 
3
1415 
4
1183 
5
1120 
Other values (28)
2721 

Length

Max length4
Median length1
Mean length1.1955
Min length1

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row1
2nd row12
3rd row4
4th row4
5th row11

Common Values

ValueCountFrequency (%)
1 1835
18.4%
2 1726
17.3%
3 1415
14.1%
4 1183
11.8%
5 1120
11.2%
6 309
 
3.1%
8 282
 
2.8%
7 274
 
2.7%
11 254
 
2.5%
10 249
 
2.5%
Other values (23) 1353
13.5%

Length

2023-12-11T07:28:08.968233image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
1 1835
18.4%
2 1726
17.3%
3 1415
14.1%
4 1183
11.8%
5 1120
11.2%
6 309
 
3.1%
8 282
 
2.8%
7 274
 
2.7%
11 254
 
2.5%
10 249
 
2.5%
Other values (23) 1353
13.5%
Distinct371
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-11T07:28:09.407445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length3
Mean length3.0776
Min length1

Characters and Unicode

Total characters30776
Distinct characters22
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)1.0%

Sample

1st row114
2nd row1205
3rd row408
4th row410
5th row1101
ValueCountFrequency (%)
101 314
 
3.1%
201 309
 
3.1%
102 298
 
3.0%
202 267
 
2.7%
302 241
 
2.4%
301 233
 
2.3%
203 208
 
2.1%
103 200
 
2.0%
104 192
 
1.9%
204 187
 
1.9%
Other values (361) 7551
75.5%
2023-12-11T07:28:09.942155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 9983
32.4%
1 5597
18.2%
2 3547
 
11.5%
3 2882
 
9.4%
4 2421
 
7.9%
5 2347
 
7.6%
6 1392
 
4.5%
7 977
 
3.2%
8 923
 
3.0%
9 597
 
1.9%
Other values (12) 110
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 30666
99.6%
Other Letter 92
 
0.3%
Dash Punctuation 18
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
31.5%
17
18.5%
16
17.4%
7
 
7.6%
7
 
7.6%
5
 
5.4%
4
 
4.3%
3
 
3.3%
2
 
2.2%
1
 
1.1%
Decimal Number
ValueCountFrequency (%)
0 9983
32.6%
1 5597
18.3%
2 3547
 
11.6%
3 2882
 
9.4%
4 2421
 
7.9%
5 2347
 
7.7%
6 1392
 
4.5%
7 977
 
3.2%
8 923
 
3.0%
9 597
 
1.9%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 30684
99.7%
Hangul 92
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 9983
32.5%
1 5597
18.2%
2 3547
 
11.6%
3 2882
 
9.4%
4 2421
 
7.9%
5 2347
 
7.6%
6 1392
 
4.5%
7 977
 
3.2%
8 923
 
3.0%
9 597
 
1.9%
Hangul
ValueCountFrequency (%)
29
31.5%
17
18.5%
16
17.4%
7
 
7.6%
7
 
7.6%
5
 
5.4%
4
 
4.3%
3
 
3.3%
2
 
2.2%
1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30684
99.7%
Hangul 92
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 9983
32.5%
1 5597
18.2%
2 3547
 
11.6%
3 2882
 
9.4%
4 2421
 
7.9%
5 2347
 
7.6%
6 1392
 
4.5%
7 977
 
3.2%
8 923
 
3.0%
9 597
 
1.9%
Hangul
ValueCountFrequency (%)
29
31.5%
17
18.5%
16
17.4%
7
 
7.6%
7
 
7.6%
5
 
5.4%
4
 
4.3%
3
 
3.3%
2
 
2.2%
1
 
1.1%

집합건물_실
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0000
9998 
지하실
 
1
소매점
 
1

Length

Max length4
Median length4
Mean length3.9998
Min length3

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row0000
2nd row0000
3rd row0000
4th row0000
5th row0000

Common Values

ValueCountFrequency (%)
0000 9998
> 99.9%
지하실 1
 
< 0.1%
소매점 1
 
< 0.1%

Length

2023-12-11T07:28:10.084288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:28:10.192702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0000 9998
> 99.9%
지하실 1
 
< 0.1%
소매점 1
 
< 0.1%

대지권지분비율
Text

MISSING 

Distinct981
Distinct (%)10.0%
Missing145
Missing (%)1.5%
Memory size156.2 KiB
2023-12-11T07:28:10.447334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length11.936276
Min length3

Characters and Unicode

Total characters117632
Distinct characters14
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique418 ?
Unique (%)4.2%

Sample

1st row40.95/5377
2nd row33.33/55803
3rd row44.2/51592
4th row33.33/55803
5th row27.1629/3901
ValueCountFrequency (%)
33.33/55803 778
 
7.9%
58.63/28393 483
 
4.9%
44.2/51592 384
 
3.9%
84.92/148202.9 282
 
2.9%
67.936/148202.9 238
 
2.4%
44.07/20019 215
 
2.2%
25.29/55803 193
 
2.0%
36.75/9074 188
 
1.9%
55.198/148202.9 161
 
1.6%
39.45/55803 149
 
1.5%
Other values (949) 6785
68.8%
2023-12-11T07:28:10.925975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 13689
11.6%
. 13482
11.5%
5 12070
10.3%
2 11618
9.9%
/ 9855
8.4%
9 9194
7.8%
4 9098
7.7%
1 7915
6.7%
8 7745
6.6%
0 7091
6.0%
Other values (4) 15875
13.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 90193
76.7%
Other Punctuation 23462
 
19.9%
Space Separator 3977
 
3.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 13689
15.2%
5 12070
13.4%
2 11618
12.9%
9 9194
10.2%
4 9098
10.1%
1 7915
8.8%
8 7745
8.6%
0 7091
7.9%
6 6879
7.6%
7 4894
 
5.4%
Other Punctuation
ValueCountFrequency (%)
. 13482
57.5%
/ 9855
42.0%
, 125
 
0.5%
Space Separator
ValueCountFrequency (%)
3977
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 117632
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 13689
11.6%
. 13482
11.5%
5 12070
10.3%
2 11618
9.9%
/ 9855
8.4%
9 9194
7.8%
4 9098
7.7%
1 7915
6.7%
8 7745
6.6%
0 7091
6.0%
Other values (4) 15875
13.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 117632
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 13689
11.6%
. 13482
11.5%
5 12070
10.3%
2 11618
9.9%
/ 9855
8.4%
9 9194
7.8%
4 9098
7.7%
1 7915
6.7%
8 7745
6.6%
0 7091
6.0%
Other values (4) 15875
13.5%

Interactions

2023-12-11T07:28:05.018013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:28:11.031605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
토지고유번호집합건물_층집합건물_실
토지고유번호1.0000.1330.000
집합건물_층0.1331.0000.091
집합건물_실0.0000.0911.000
2023-12-11T07:28:11.168519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
집합건물_실집합건물_층
집합건물_실1.0000.042
집합건물_층0.0421.000
2023-12-11T07:28:11.289302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
토지고유번호집합건물_층집합건물_실
토지고유번호1.0000.0580.000
집합건물_층0.0581.0000.042
집합건물_실0.0000.0421.000

Missing values

2023-12-11T07:28:05.486014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:28:05.603576image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:28:05.698319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

토지고유번호토지소재지집합건물명집합건물_동집합건물_층집합건물_호집합건물_실대지권지분비율
52794111112900201500000수원시장안구 파장동 산 150궁전아파트11114000040.95/5377
206914111113000103950000수원시장안구 정자동 395동신아파트107121205000033.33/55803
19604111513800802500004수원시팔달구 화서동 250-4화서아파트264408000044.2/51592
191574111113000103950000수원시장안구 정자동 395동신아파트1024410000033.33/55803
295644111513900101380003수원시팔달구 지동 138-3진우아파트1111101000027.1629/3901
205834111113000100440013수원시장안구 정자동 44-13정자연립2203000032.166/1158
28404111513800801220002수원시팔달구 화서동 122-2화서아파트484410000044.2/51592
222124111113000104010000수원시장안구 정자동 401동신아파트108131312000025.29/55803
75624111313200104630008수원시권선구 구운동 463-8강남아파트17706000045.106/24946.6
272994111312600104490010수원시권선구 세류동 449-10장안주택00001105000043.08/775.5
토지고유번호토지소재지집합건물명집합건물_동집합건물_층집합건물_호집합건물_실대지권지분비율
32744111513800100110011수원시팔달구 화서동 11-11화서아파트12202000044.07/20019
374554111312700101510000수원시권선구 평동 151동남아파트1067702000060.9031/23224
345534111312600810130005수원시권선구 세류동 1013-5태일아파트61104000054.5/6980
340324111113700101360007수원시장안구 연무동 136-7경성아파트에이3312000036.69/3863
366304111312700101510000수원시권선구 평동 151동남아파트1026603000041.4124/23224
78724111112900104220003수원시장안구 파장동 422-3영광연립00001104000054.69/669
204734111113000100720006수원시장안구 정자동 72-6수성아파트00003313000035.31/1271
148004111113200101090000수원시장안구 율전동 109장안아파트5503000029.16/4632
289204111113600106360014수원시장안구 조원동 636-14현대연립1103000075.25/1806
213904111113000103130001수원시장안구 정자동 313-1동신아파트205101009000031.59/52527