Overview

Dataset statistics

Number of variables4
Number of observations217
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.3 KiB
Average record size in memory34.6 B

Variable types

Text2
Numeric2

Dataset

Description전라남도 여수시 도시계획정보시스템(UPIS) 취락지구 현황 데이터 자료로 취락지구명, 면적 등을 제공합니다.
URLhttps://www.data.go.kr/data/15119482/fileData.do

Alerts

면적_도형 is highly overall correlated with 길이_도형High correlation
길이_도형 is highly overall correlated with 면적_도형High correlation
현황도형 관리번호 has unique valuesUnique
면적_도형 has 34 (15.7%) zerosZeros
길이_도형 has 34 (15.7%) zerosZeros

Reproduction

Analysis started2023-12-12 20:27:52.566365
Analysis finished2023-12-12 20:27:53.356728
Duration0.79 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct217
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T05:27:53.514692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length24
Mean length24
Min length24

Characters and Unicode

Total characters5208
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique217 ?
Unique (%)100.0%

Sample

1st row46130UQ128PS199705260018
2nd row46130UQ128PS199705260019
3rd row46130UQ128PS199705260020
4th row46130UQ128PS199705260021
5th row46130UQ128PS200412310032
ValueCountFrequency (%)
46130uq128ps199705260018 1
 
0.5%
46130uq128ps200905040031 1
 
0.5%
46130uq128ps200905040044 1
 
0.5%
46130uq128ps201501060010 1
 
0.5%
46130uq128ps201501060012 1
 
0.5%
46130uq128ps200905040023 1
 
0.5%
46130uq128ps200905040024 1
 
0.5%
46130uq128ps200905040025 1
 
0.5%
46130uq128ps200905040026 1
 
0.5%
46130uq128ps200905040027 1
 
0.5%
Other values (207) 207
95.4%
2023-12-13T05:27:53.896664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1385
26.6%
1 678
13.0%
2 557
10.7%
4 387
 
7.4%
3 351
 
6.7%
8 291
 
5.6%
6 283
 
5.4%
U 217
 
4.2%
Q 217
 
4.2%
P 217
 
4.2%
Other values (4) 625
12.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4340
83.3%
Uppercase Letter 868
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1385
31.9%
1 678
15.6%
2 557
12.8%
4 387
 
8.9%
3 351
 
8.1%
8 291
 
6.7%
6 283
 
6.5%
9 195
 
4.5%
5 158
 
3.6%
7 55
 
1.3%
Uppercase Letter
ValueCountFrequency (%)
U 217
25.0%
Q 217
25.0%
P 217
25.0%
S 217
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4340
83.3%
Latin 868
 
16.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1385
31.9%
1 678
15.6%
2 557
12.8%
4 387
 
8.9%
3 351
 
8.1%
8 291
 
6.7%
6 283
 
6.5%
9 195
 
4.5%
5 158
 
3.6%
7 55
 
1.3%
Latin
ValueCountFrequency (%)
U 217
25.0%
Q 217
25.0%
P 217
25.0%
S 217
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5208
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1385
26.6%
1 678
13.0%
2 557
10.7%
4 387
 
7.4%
3 351
 
6.7%
8 291
 
5.6%
6 283
 
5.4%
U 217
 
4.2%
Q 217
 
4.2%
P 217
 
4.2%
Other values (4) 625
12.0%
Distinct182
Distinct (%)83.9%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2023-12-13T05:27:54.265665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.7695853
Min length4

Characters and Unicode

Total characters1035
Distinct characters115
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique179 ?
Unique (%)82.5%

Sample

1st row궁항지구
2nd row풍류지구
3rd row조산지구
4th row대곡지구
5th row산수동지구
ValueCountFrequency (%)
자연취락지구 34
 
15.5%
상여 3
 
1.4%
구족도지구 2
 
0.9%
남산지구 2
 
0.9%
굴전지구 1
 
0.5%
장수1지구 1
 
0.5%
평사1지구 1
 
0.5%
이천3지구 1
 
0.5%
만성지구 1
 
0.5%
하동지구 1
 
0.5%
Other values (173) 173
78.6%
2023-12-13T05:27:54.801741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
220
21.3%
219
21.2%
37
 
3.6%
36
 
3.5%
36
 
3.5%
34
 
3.3%
2 24
 
2.3%
1 23
 
2.2%
19
 
1.8%
17
 
1.6%
Other values (105) 370
35.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 952
92.0%
Decimal Number 80
 
7.7%
Space Separator 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
220
23.1%
219
23.0%
37
 
3.9%
36
 
3.8%
36
 
3.8%
34
 
3.6%
19
 
2.0%
17
 
1.8%
13
 
1.4%
13
 
1.4%
Other values (96) 308
32.4%
Decimal Number
ValueCountFrequency (%)
2 24
30.0%
1 23
28.7%
3 14
17.5%
4 8
 
10.0%
5 5
 
6.2%
6 3
 
3.8%
7 2
 
2.5%
8 1
 
1.2%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 952
92.0%
Common 83
 
8.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
220
23.1%
219
23.0%
37
 
3.9%
36
 
3.8%
36
 
3.8%
34
 
3.6%
19
 
2.0%
17
 
1.8%
13
 
1.4%
13
 
1.4%
Other values (96) 308
32.4%
Common
ValueCountFrequency (%)
2 24
28.9%
1 23
27.7%
3 14
16.9%
4 8
 
9.6%
5 5
 
6.0%
6 3
 
3.6%
3
 
3.6%
7 2
 
2.4%
8 1
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 952
92.0%
ASCII 83
 
8.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
220
23.1%
219
23.0%
37
 
3.9%
36
 
3.8%
36
 
3.8%
34
 
3.6%
19
 
2.0%
17
 
1.8%
13
 
1.4%
13
 
1.4%
Other values (96) 308
32.4%
ASCII
ValueCountFrequency (%)
2 24
28.9%
1 23
27.7%
3 14
16.9%
4 8
 
9.6%
5 5
 
6.0%
6 3
 
3.6%
3
 
3.6%
7 2
 
2.4%
8 1
 
1.2%

면적_도형
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct184
Distinct (%)84.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39529.912
Minimum0
Maximum310825.24
Zeros34
Zeros (%)15.7%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-13T05:27:54.966214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q112190.858
median25817.712
Q352026.219
95-th percentile127248.14
Maximum310825.24
Range310825.24
Interquartile range (IQR)39835.361

Descriptive statistics

Standard deviation45894.136
Coefficient of variation (CV)1.1609977
Kurtosis8.448534
Mean39529.912
Median Absolute Deviation (MAD)18746.401
Skewness2.5066159
Sum8577990.9
Variance2.1062717 × 109
MonotonicityNot monotonic
2023-12-13T05:27:55.132944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 34
 
15.7%
13776.45465 1
 
0.5%
23247.36168 1
 
0.5%
76295.01062 1
 
0.5%
17509.89665 1
 
0.5%
3868.668808 1
 
0.5%
2975.588216 1
 
0.5%
7071.310677 1
 
0.5%
9907.16311 1
 
0.5%
30512.86742 1
 
0.5%
Other values (174) 174
80.2%
ValueCountFrequency (%)
0.0 34
15.7%
2975.588216 1
 
0.5%
3536.046242 1
 
0.5%
3868.668808 1
 
0.5%
4939.137644 1
 
0.5%
5611.79672 1
 
0.5%
6049.019336 1
 
0.5%
6085.468029 1
 
0.5%
6371.812816 1
 
0.5%
6642.252284 1
 
0.5%
ValueCountFrequency (%)
310825.2396 1
0.5%
233174.4847 1
0.5%
224314.0263 1
0.5%
203494.0885 1
0.5%
176273.393 1
0.5%
172025.1736 1
0.5%
171005.8897 1
0.5%
166629.8406 1
0.5%
157803.6534 1
0.5%
136185.8875 1
0.5%

길이_도형
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct184
Distinct (%)84.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1219.2434
Minimum0
Maximum7404.094
Zeros34
Zeros (%)15.7%
Negative0
Negative (%)0.0%
Memory size2.0 KiB
2023-12-13T05:27:55.274171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1644.50151
median954.95973
Q31488.7965
95-th percentile3242.1864
Maximum7404.094
Range7404.094
Interquartile range (IQR)844.29504

Descriptive statistics

Standard deviation1124.8604
Coefficient of variation (CV)0.92258882
Kurtosis6.947134
Mean1219.2434
Median Absolute Deviation (MAD)435.52741
Skewness2.1406459
Sum264575.83
Variance1265310.9
MonotonicityNot monotonic
2023-12-13T05:27:55.439740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 34
 
15.7%
859.5591421 1
 
0.5%
1484.931027 1
 
0.5%
3037.815275 1
 
0.5%
749.5796211 1
 
0.5%
309.0746335 1
 
0.5%
260.6207118 1
 
0.5%
359.7530817 1
 
0.5%
449.4361878 1
 
0.5%
1725.43068 1
 
0.5%
Other values (174) 174
80.2%
ValueCountFrequency (%)
0.0 34
15.7%
260.6207118 1
 
0.5%
304.2138087 1
 
0.5%
304.7597638 1
 
0.5%
309.0746335 1
 
0.5%
359.7530817 1
 
0.5%
413.2296751 1
 
0.5%
433.6520572 1
 
0.5%
447.6825072 1
 
0.5%
449.4361878 1
 
0.5%
ValueCountFrequency (%)
7404.093971 1
0.5%
6243.521903 1
0.5%
5868.520596 1
0.5%
5140.307001 1
0.5%
4347.796653 1
0.5%
4223.167268 1
0.5%
4000.945265 1
0.5%
3752.205586 1
0.5%
3699.272854 1
0.5%
3571.840503 1
0.5%

Interactions

2023-12-13T05:27:52.960692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:27:52.758907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:27:53.062633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:27:52.860063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:27:55.536638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적_도형길이_도형
면적_도형1.0000.883
길이_도형0.8831.000
2023-12-13T05:27:55.640482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적_도형길이_도형
면적_도형1.0000.915
길이_도형0.9151.000

Missing values

2023-12-13T05:27:53.200288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:27:53.313478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

현황도형 관리번호라벨명면적_도형길이_도형
046130UQ128PS199705260018궁항지구37323.2795874.390546
146130UQ128PS199705260019풍류지구67462.115711026.799684
246130UQ128PS199705260020조산지구44871.62013993.24456
346130UQ128PS199705260021대곡지구69011.496461007.971106
446130UQ128PS200412310032산수동지구19116.82723775.952386
546130UQ128PS200412310033굴앞지구16223.95184778.972304
646130UQ128PS199705260022가사리지구54188.651021811.257507
746130UQ128PS200905040087상여 3지구30538.81991899.353616
846130UQ128PS200905040088월호3지구52361.47571802.569332
946130UQ128PS200905040089조발2지구17011.50078687.203201
현황도형 관리번호라벨명면적_도형길이_도형
20746130UQ128PS199705260010하취적지구52950.858331139.797693
20846130UQ128PS199705260011봉정지구76680.615531892.580237
20946130UQ128PS199705260012내동지구66787.931581477.008243
21046130UQ128PS200012110001산곡지구176273.3933016.809818
21146130UQ128PS199705260013두봉지구36942.842811139.818284
21246130UQ128PS199705260014내리지구32029.12702900.377598
21346130UQ128PS199705260015반월지구28071.08305712.434799
21446130UQ128PS200412310031대동지구22022.29911211.90064
21546130UQ128PS199705260016상복지구30557.47804720.505639
21646130UQ128PS199705260017당촌지구30237.68306775.081186