Overview

Dataset statistics

Number of variables7
Number of observations471
Missing cells8
Missing cells (%)0.2%
Duplicate rows1
Duplicate rows (%)0.2%
Total size in memory27.3 KiB
Average record size in memory59.3 B

Variable types

Numeric3
Categorical3
Text1

Dataset

Description전라남도 무안군 지적문서관리시스템 보유자료 중 구지적도정보(법정동코드, 토지임야구분, 축척, 구읍면, 동리, 파일수량)데이터입니다.
URLhttps://www.data.go.kr/data/15041568/fileData.do

Alerts

Dataset has 1 (0.2%) duplicate rowsDuplicates
순번 is highly overall correlated with 법정동코드 and 1 other fieldsHigh correlation
법정동코드 is highly overall correlated with 순번 and 1 other fieldsHigh correlation
파일수량 is highly overall correlated with 토지임야구분High correlation
토지임야구분 is highly overall correlated with 파일수량 and 1 other fieldsHigh correlation
축척 is highly overall correlated with 토지임야구분High correlation
구읍면 is highly overall correlated with 순번 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 00:04:00.240016
Analysis finished2023-12-12 00:04:01.928128
Duration1.69 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

HIGH CORRELATION 

Distinct469
Distinct (%)100.0%
Missing2
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean235
Minimum1
Maximum469
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2023-12-12T09:04:02.015610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile24.4
Q1118
median235
Q3352
95-th percentile445.6
Maximum469
Range468
Interquartile range (IQR)234

Descriptive statistics

Standard deviation135.5329
Coefficient of variation (CV)0.57673574
Kurtosis-1.2
Mean235
Median Absolute Deviation (MAD)117
Skewness0
Sum110215
Variance18369.167
MonotonicityStrictly increasing
2023-12-12T09:04:02.142848image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
310 1
 
0.2%
322 1
 
0.2%
321 1
 
0.2%
320 1
 
0.2%
319 1
 
0.2%
318 1
 
0.2%
317 1
 
0.2%
316 1
 
0.2%
315 1
 
0.2%
314 1
 
0.2%
Other values (459) 459
97.5%
(Missing) 2
 
0.4%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
469 1
0.2%
468 1
0.2%
467 1
0.2%
466 1
0.2%
465 1
0.2%
464 1
0.2%
463 1
0.2%
462 1
0.2%
461 1
0.2%
460 1
0.2%

법정동코드
Real number (ℝ)

HIGH CORRELATION 

Distinct103
Distinct (%)22.0%
Missing2
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean31911.322
Minimum25021
Maximum37025
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2023-12-12T09:04:02.319952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum25021
5-th percentile25026.4
Q131024
median33025
Q335021
95-th percentile37021
Maximum37025
Range12004
Interquartile range (IQR)3997

Descriptive statistics

Standard deviation3890.5141
Coefficient of variation (CV)0.12191642
Kurtosis-0.66758025
Mean31911.322
Median Absolute Deviation (MAD)1999
Skewness-0.77092347
Sum14966410
Variance15136100
MonotonicityIncreasing
2023-12-12T09:04:02.456512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
31027 6
 
1.3%
32023 6
 
1.3%
33025 5
 
1.1%
33033 5
 
1.1%
34024 5
 
1.1%
34023 5
 
1.1%
34022 5
 
1.1%
34021 5
 
1.1%
33036 5
 
1.1%
33035 5
 
1.1%
Other values (93) 417
88.5%
ValueCountFrequency (%)
25021 2
 
0.4%
25022 4
0.8%
25023 4
0.8%
25024 5
1.1%
25025 4
0.8%
25026 5
1.1%
25027 5
1.1%
25028 5
1.1%
25029 4
0.8%
25030 5
1.1%
ValueCountFrequency (%)
37025 5
1.1%
37024 5
1.1%
37023 5
1.1%
37022 5
1.1%
37021 5
1.1%
36036 4
0.8%
36035 5
1.1%
36034 5
1.1%
36033 5
1.1%
36032 5
1.1%

토지임야구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
토지
297 
임야
172 
<NA>
 
2

Length

Max length4
Median length2
Mean length2.0084926
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row토지
2nd row토지
3rd row토지
4th row토지
5th row임야

Common Values

ValueCountFrequency (%)
토지 297
63.1%
임야 172
36.5%
<NA> 2
 
0.4%

Length

2023-12-12T09:04:02.594509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:04:02.691522image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
토지 297
63.1%
임야 172
36.5%
na 2
 
0.4%

축척
Categorical

HIGH CORRELATION 

Distinct6
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
일람도
175 
1/1200
103 
1/3000
100 
1/1000
86 
1/500
 
5

Length

Max length6
Median length6
Mean length4.866242
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일람도
2nd row1/1200
3rd row일람도
4th row1/1200
5th row일람도

Common Values

ValueCountFrequency (%)
일람도 175
37.2%
1/1200 103
21.9%
1/3000 100
21.2%
1/1000 86
18.3%
1/500 5
 
1.1%
<NA> 2
 
0.4%

Length

2023-12-12T09:04:02.791932image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:04:02.892021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일람도 175
37.2%
1/1200 103
21.9%
1/3000 100
21.2%
1/1000 86
18.3%
1/500 5
 
1.1%
na 2
 
0.4%

구읍면
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size3.8 KiB
청계면
80 
몽탄면
75 
해제면
73 
일로읍
59 
현경면
57 
Other values (5)
127 

Length

Max length4
Median length3
Mean length3.0042463
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row무안읍
2nd row무안읍
3rd row무안읍
4th row무안읍
5th row무안읍

Common Values

ValueCountFrequency (%)
청계면 80
17.0%
몽탄면 75
15.9%
해제면 73
15.5%
일로읍 59
12.5%
현경면 57
12.1%
무안읍 43
9.1%
삼향읍 35
7.4%
운남면 25
 
5.3%
망운면 22
 
4.7%
<NA> 2
 
0.4%

Length

2023-12-12T09:04:03.002661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T09:04:03.109321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
청계면 80
17.0%
몽탄면 75
15.9%
해제면 73
15.5%
일로읍 59
12.5%
현경면 57
12.1%
무안읍 43
9.1%
삼향읍 35
7.4%
운남면 25
 
5.3%
망운면 22
 
4.7%
na 2
 
0.4%

동리
Text

Distinct99
Distinct (%)21.1%
Missing2
Missing (%)0.4%
Memory size3.8 KiB
2023-12-12T09:04:03.357023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length3
Mean length2.9808102
Min length2

Characters and Unicode

Total characters1398
Distinct characters88
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row성내리
2nd row성내리
3rd row성동리
4th row성동리
5th row성동리
ValueCountFrequency (%)
송현리 9
 
1.9%
복룡리 9
 
1.9%
내리 9
 
1.9%
성내리 7
 
1.5%
지산리 6
 
1.3%
양장리 6
 
1.3%
수양리 5
 
1.1%
다산리 5
 
1.1%
외반리 5
 
1.1%
월선리 5
 
1.1%
Other values (89) 403
85.9%
2023-12-12T09:04:03.765675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
469
33.5%
84
 
6.0%
29
 
2.1%
29
 
2.1%
28
 
2.0%
26
 
1.9%
26
 
1.9%
24
 
1.7%
24
 
1.7%
22
 
1.6%
Other values (78) 637
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1398
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
469
33.5%
84
 
6.0%
29
 
2.1%
29
 
2.1%
28
 
2.0%
26
 
1.9%
26
 
1.9%
24
 
1.7%
24
 
1.7%
22
 
1.6%
Other values (78) 637
45.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1398
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
469
33.5%
84
 
6.0%
29
 
2.1%
29
 
2.1%
28
 
2.0%
26
 
1.9%
26
 
1.9%
24
 
1.7%
24
 
1.7%
22
 
1.6%
Other values (78) 637
45.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1398
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
469
33.5%
84
 
6.0%
29
 
2.1%
29
 
2.1%
28
 
2.0%
26
 
1.9%
26
 
1.9%
24
 
1.7%
24
 
1.7%
22
 
1.6%
Other values (78) 637
45.6%

파일수량
Real number (ℝ)

HIGH CORRELATION 

Distinct56
Distinct (%)11.9%
Missing2
Missing (%)0.4%
Infinite0
Infinite (%)0.0%
Mean12.379531
Minimum1
Maximum95
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2023-12-12T09:04:03.925198image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q14
median6
Q319
95-th percentile39
Maximum95
Range94
Interquartile range (IQR)15

Descriptive statistics

Standard deviation13.721314
Coefficient of variation (CV)1.1083872
Kurtosis5.1709387
Mean12.379531
Median Absolute Deviation (MAD)4
Skewness2.0150172
Sum5806
Variance188.27445
MonotonicityNot monotonic
2023-12-12T09:04:04.059164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 60
 
12.7%
4 51
 
10.8%
5 50
 
10.6%
6 33
 
7.0%
2 27
 
5.7%
7 24
 
5.1%
3 21
 
4.5%
8 18
 
3.8%
9 12
 
2.5%
13 10
 
2.1%
Other values (46) 163
34.6%
ValueCountFrequency (%)
1 60
12.7%
2 27
5.7%
3 21
 
4.5%
4 51
10.8%
5 50
10.6%
6 33
7.0%
7 24
 
5.1%
8 18
 
3.8%
9 12
 
2.5%
10 4
 
0.8%
ValueCountFrequency (%)
95 1
0.2%
77 1
0.2%
63 2
0.4%
62 2
0.4%
59 1
0.2%
58 1
0.2%
57 2
0.4%
56 1
0.2%
53 1
0.2%
51 1
0.2%

Interactions

2023-12-12T09:04:01.205742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:04:00.635005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:04:00.918103image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:04:01.305300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:04:00.751155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:04:01.001995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:04:01.397407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:04:00.843142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T09:04:01.102230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T09:04:04.150847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번법정동코드토지임야구분축척구읍면동리파일수량
순번1.0000.9190.0000.0000.9490.9970.136
법정동코드0.9191.0000.0000.0000.9700.9990.073
토지임야구분0.0000.0001.0000.6500.0000.0000.514
축척0.0000.0000.6501.0000.0000.0000.662
구읍면0.9490.9700.0000.0001.0001.0000.289
동리0.9970.9990.0000.0001.0001.0000.000
파일수량0.1360.0730.5140.6620.2890.0001.000
2023-12-12T09:04:05.012067image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
토지임야구분축척구읍면
토지임야구분1.0000.7770.000
축척0.7771.0000.000
구읍면0.0000.0001.000
2023-12-12T09:04:05.106054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번법정동코드파일수량토지임야구분축척구읍면
순번1.0001.0000.0490.0000.0000.822
법정동코드1.0001.0000.0510.0000.0000.938
파일수량0.0490.0511.0000.5120.4580.096
토지임야구분0.0000.0000.5121.0000.7770.000
축척0.0000.0000.4580.7771.0000.000
구읍면0.8220.9380.0960.0000.0001.000

Missing values

2023-12-12T09:04:01.547781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T09:04:01.709058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T09:04:01.822415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

순번법정동코드토지임야구분축척구읍면동리파일수량
0125021토지일람도무안읍성내리3
1225021토지1/1200무안읍성내리8
2325022토지일람도무안읍성동리3
3425022토지1/1200무안읍성동리30
4525022임야일람도무안읍성동리1
5625022임야1/3000무안읍성동리7
6725023토지일람도무안읍성남리3
7825023토지1/1200무안읍성남리21
8925023임야일람도무안읍성남리1
91025023임야1/3000무안읍성남리5
순번법정동코드토지임야구분축척구읍면동리파일수량
46146237024토지1/1200운남면내리48
46246337024임야일람도운남면내리2
46346437024임야1/3000운남면내리9
46446537025토지일람도운남면성내리8
46546637025토지1/1000운남면성내리42
46646737025토지1/1200운남면성내리57
46746837025임야일람도운남면성내리2
46846937025임야1/3000운남면성내리7
469<NA><NA><NA><NA><NA><NA><NA>
470<NA><NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

순번법정동코드토지임야구분축척구읍면동리파일수량# duplicates
0<NA><NA><NA><NA><NA><NA><NA>2