Overview

Dataset statistics

Number of variables8
Number of observations347
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory23.2 KiB
Average record size in memory68.4 B

Variable types

Numeric4
Text1
Categorical3

Dataset

Description양평군 내 산사태 취약지역의 지번주소, 취약지역 유형, 취약지역 면적, 취약지역 관리주체 등의 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15117076/fileData.do

Alerts

관리주체 has constant value ""Constant
데이터기준일 has constant value ""Constant
취약지역유형 is highly imbalanced (62.6%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:55:35.366041
Analysis finished2023-12-12 09:55:37.547665
Duration2.18 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct347
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean174
Minimum1
Maximum347
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T18:55:37.625197image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile18.3
Q187.5
median174
Q3260.5
95-th percentile329.7
Maximum347
Range346
Interquartile range (IQR)173

Descriptive statistics

Standard deviation100.31451
Coefficient of variation (CV)0.57652015
Kurtosis-1.2
Mean174
Median Absolute Deviation (MAD)87
Skewness0
Sum60378
Variance10063
MonotonicityStrictly increasing
2023-12-12T18:55:37.833886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
230 1
 
0.3%
238 1
 
0.3%
237 1
 
0.3%
236 1
 
0.3%
235 1
 
0.3%
234 1
 
0.3%
233 1
 
0.3%
232 1
 
0.3%
231 1
 
0.3%
Other values (337) 337
97.1%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
347 1
0.3%
346 1
0.3%
345 1
0.3%
344 1
0.3%
343 1
0.3%
342 1
0.3%
341 1
0.3%
340 1
0.3%
339 1
0.3%
338 1
0.3%
Distinct331
Distinct (%)95.4%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2023-12-12T18:55:38.134218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length19.991354
Min length18

Characters and Unicode

Total characters6937
Distinct characters108
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique317 ?
Unique (%)91.4%

Sample

1st row경기도 양평군 강하면 항금리 산176
2nd row경기도 양평군 청운면 삼성리 산1
3rd row경기도 양평군 지평면 일신리 산51
4th row경기도 양평군 양동면 매월리 산27-3
5th row경기도 양평군 단월면 산음리 1006
ValueCountFrequency (%)
경기도 347
20.0%
양평군 347
20.0%
양동면 53
 
3.1%
단월면 52
 
3.0%
청운면 48
 
2.8%
지평면 42
 
2.4%
용문면 39
 
2.2%
서종면 34
 
2.0%
옥천면 22
 
1.3%
양서면 20
 
1.2%
Other values (377) 731
42.1%
2023-12-12T18:55:38.589334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1392
20.1%
431
 
6.2%
403
 
5.8%
351
 
5.1%
351
 
5.1%
347
 
5.0%
347
 
5.0%
347
 
5.0%
339
 
4.9%
1 259
 
3.7%
Other values (98) 2370
34.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4360
62.9%
Space Separator 1392
 
20.1%
Decimal Number 1071
 
15.4%
Dash Punctuation 114
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
431
 
9.9%
403
 
9.2%
351
 
8.1%
351
 
8.1%
347
 
8.0%
347
 
8.0%
347
 
8.0%
339
 
7.8%
231
 
5.3%
67
 
1.5%
Other values (86) 1146
26.3%
Decimal Number
ValueCountFrequency (%)
1 259
24.2%
2 115
10.7%
4 102
 
9.5%
3 99
 
9.2%
6 94
 
8.8%
5 92
 
8.6%
7 87
 
8.1%
0 77
 
7.2%
9 75
 
7.0%
8 71
 
6.6%
Space Separator
ValueCountFrequency (%)
1392
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 114
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4360
62.9%
Common 2577
37.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
431
 
9.9%
403
 
9.2%
351
 
8.1%
351
 
8.1%
347
 
8.0%
347
 
8.0%
347
 
8.0%
339
 
7.8%
231
 
5.3%
67
 
1.5%
Other values (86) 1146
26.3%
Common
ValueCountFrequency (%)
1392
54.0%
1 259
 
10.1%
2 115
 
4.5%
- 114
 
4.4%
4 102
 
4.0%
3 99
 
3.8%
6 94
 
3.6%
5 92
 
3.6%
7 87
 
3.4%
0 77
 
3.0%
Other values (2) 146
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4360
62.9%
ASCII 2577
37.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1392
54.0%
1 259
 
10.1%
2 115
 
4.5%
- 114
 
4.4%
4 102
 
4.0%
3 99
 
3.8%
6 94
 
3.6%
5 92
 
3.6%
7 87
 
3.4%
0 77
 
3.0%
Other values (2) 146
 
5.7%
Hangul
ValueCountFrequency (%)
431
 
9.9%
403
 
9.2%
351
 
8.1%
351
 
8.1%
347
 
8.0%
347
 
8.0%
347
 
8.0%
339
 
7.8%
231
 
5.3%
67
 
1.5%
Other values (86) 1146
26.3%

위도
Real number (ℝ)

Distinct330
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.520157
Minimum37.38166
Maximum37.654733
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T18:55:38.758880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.38166
5-th percentile37.415999
Q137.467214
median37.528353
Q337.565061
95-th percentile37.613484
Maximum37.654733
Range0.27307299
Interquartile range (IQR)0.097847435

Descriptive statistics

Standard deviation0.063152447
Coefficient of variation (CV)0.0016831605
Kurtosis-0.8417224
Mean37.520157
Median Absolute Deviation (MAD)0.04853905
Skewness-0.11226304
Sum13019.495
Variance0.0039882315
MonotonicityNot monotonic
2023-12-12T18:55:38.920758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.55811602 3
 
0.9%
37.52276301 3
 
0.9%
37.58338726 2
 
0.6%
37.40656956 2
 
0.6%
37.65473314 2
 
0.6%
37.53465573 2
 
0.6%
37.44412655 2
 
0.6%
37.52167392 2
 
0.6%
37.62781235 2
 
0.6%
37.38166015 2
 
0.6%
Other values (320) 325
93.7%
ValueCountFrequency (%)
37.38166015 2
0.6%
37.38664778 1
0.3%
37.38934016 1
0.3%
37.38960532 1
0.3%
37.39548831 1
0.3%
37.399 1
0.3%
37.40321659 1
0.3%
37.40381404 1
0.3%
37.40594564 1
0.3%
37.40656956 2
0.6%
ValueCountFrequency (%)
37.65473314 2
0.6%
37.64965737 1
0.3%
37.64668598 1
0.3%
37.64139327 1
0.3%
37.63732363 1
0.3%
37.63577668 1
0.3%
37.63462775 1
0.3%
37.63251805 1
0.3%
37.63161224 1
0.3%
37.63079253 1
0.3%

경도
Real number (ℝ)

Distinct330
Distinct (%)95.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean127.59901
Minimum127.33615
Maximum127.79964
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T18:55:39.074154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum127.33615
5-th percentile127.37725
Q1127.48805
median127.62936
Q3127.7075
95-th percentile127.77115
Maximum127.79964
Range0.463485
Interquartile range (IQR)0.2194514

Descriptive statistics

Standard deviation0.12934626
Coefficient of variation (CV)0.0010136933
Kurtosis-1.0738957
Mean127.59901
Median Absolute Deviation (MAD)0.0934307
Skewness-0.42117393
Sum44276.856
Variance0.016730456
MonotonicityNot monotonic
2023-12-12T18:55:39.233570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.5056172 3
 
0.9%
127.750087 3
 
0.9%
127.6974748 2
 
0.6%
127.7229938 2
 
0.6%
127.4148607 2
 
0.6%
127.5352367 2
 
0.6%
127.6958534 2
 
0.6%
127.7001758 2
 
0.6%
127.4513158 2
 
0.6%
127.7579743 2
 
0.6%
Other values (320) 325
93.7%
ValueCountFrequency (%)
127.3361549 1
0.3%
127.3442179 1
0.3%
127.3490396 1
0.3%
127.3528613 1
0.3%
127.3586056 1
0.3%
127.3636517 1
0.3%
127.3657174 1
0.3%
127.365877 1
0.3%
127.3664 1
0.3%
127.3667717 1
0.3%
ValueCountFrequency (%)
127.7996399 1
0.3%
127.7988751 1
0.3%
127.7923714 1
0.3%
127.7899559 1
0.3%
127.7876882 1
0.3%
127.7866595 1
0.3%
127.7864002 1
0.3%
127.7852967 1
0.3%
127.7796631 1
0.3%
127.7779977 1
0.3%

취약지역유형
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
토석류
322 
산사태
 
25

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row토석류
2nd row토석류
3rd row토석류
4th row토석류
5th row토석류

Common Values

ValueCountFrequency (%)
토석류 322
92.8%
산사태 25
 
7.2%

Length

2023-12-12T18:55:39.363599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:55:39.452546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
토석류 322
92.8%
산사태 25
 
7.2%

면적(제곱미터)
Real number (ℝ)

Distinct304
Distinct (%)87.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2468.2622
Minimum105
Maximum11237
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2023-12-12T18:55:39.567965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum105
5-th percentile380.7
Q1902
median1842
Q33208
95-th percentile7728.2
Maximum11237
Range11132
Interquartile range (IQR)2306

Descriptive statistics

Standard deviation2190.7956
Coefficient of variation (CV)0.88758623
Kurtosis2.6808922
Mean2468.2622
Median Absolute Deviation (MAD)1102
Skewness1.6882408
Sum856487
Variance4799585.3
MonotonicityNot monotonic
2023-12-12T18:55:39.715821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1250 13
 
3.7%
702 9
 
2.6%
701 6
 
1.7%
703 4
 
1.2%
706 3
 
0.9%
700 3
 
0.9%
1959 2
 
0.6%
528 2
 
0.6%
2565 2
 
0.6%
1120 2
 
0.6%
Other values (294) 301
86.7%
ValueCountFrequency (%)
105 1
0.3%
152 1
0.3%
160 1
0.3%
161 1
0.3%
169 1
0.3%
190 1
0.3%
200 1
0.3%
239 1
0.3%
241 1
0.3%
249 1
0.3%
ValueCountFrequency (%)
11237 1
0.3%
10438 1
0.3%
10403 1
0.3%
10073 1
0.3%
9825 1
0.3%
9174 1
0.3%
9021 1
0.3%
8831 1
0.3%
8784 1
0.3%
8753 1
0.3%

관리주체
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
양평군
347 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양평군
2nd row양평군
3rd row양평군
4th row양평군
5th row양평군

Common Values

ValueCountFrequency (%)
양평군 347
100.0%

Length

2023-12-12T18:55:39.851787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:55:39.955653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
양평군 347
100.0%

데이터기준일
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.8 KiB
2023-08-28
347 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-28
2nd row2023-08-28
3rd row2023-08-28
4th row2023-08-28
5th row2023-08-28

Common Values

ValueCountFrequency (%)
2023-08-28 347
100.0%

Length

2023-12-12T18:55:40.058479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:55:40.152663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-28 347
100.0%

Interactions

2023-12-12T18:55:36.856787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:35.670484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:36.044987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:36.466562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:36.953650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:35.762396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:36.147753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:36.563499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:37.061075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:35.843943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:36.251381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:36.661481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:37.182142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:35.948185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:36.370528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:55:36.775707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:55:40.218476image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위도경도취약지역유형면적(제곱미터)
연번1.0000.0680.0000.2120.512
위도0.0681.0000.5820.0000.265
경도0.0000.5821.0000.0000.167
취약지역유형0.2120.0000.0001.0000.201
면적(제곱미터)0.5120.2650.1670.2011.000
2023-12-12T18:55:40.339253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위도경도면적(제곱미터)취약지역유형
연번1.0000.0210.0170.1860.161
위도0.0211.000-0.3240.0840.000
경도0.017-0.3241.0000.0450.000
면적(제곱미터)0.1860.0840.0451.0000.152
취약지역유형0.1610.0000.0000.1521.000

Missing values

2023-12-12T18:55:37.344202image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:55:37.485905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번취약지역지번주소위도경도취약지역유형면적(제곱미터)관리주체데이터기준일
01경기도 양평군 강하면 항금리 산17637.443377127.38565토석류2302양평군2023-08-28
12경기도 양평군 청운면 삼성리 산137.574687127.751526토석류1423양평군2023-08-28
23경기도 양평군 지평면 일신리 산5137.437445127.695032토석류975양평군2023-08-28
34경기도 양평군 양동면 매월리 산27-337.462518127.70842토석류1684양평군2023-08-28
45경기도 양평군 단월면 산음리 100637.612448127.614161토석류1250양평군2023-08-28
56경기도 양평군 지평면 무왕리 산6737.467064127.679461토석류1751양평군2023-08-28
67경기도 양평군 단월면 석산리 산11537.631612127.580466토석류1614양평군2023-08-28
78경기도 양평군 단월면 산음리 103737.606472127.603673토석류1830양평군2023-08-28
89경기도 양평군 청운면 용두리 산23-437.551211127.730463토석류3882양평군2023-08-28
910경기도 양평군 강하면 항금리 산2437.452702127.404044토석류2779양평군2023-08-28
연번취약지역지번주소위도경도취약지역유형면적(제곱미터)관리주체데이터기준일
337338경기도 양평군 용문면 화전리 824-437.456174127.606016토석류7086양평군2023-08-28
338339경기도 양평군 양동면 쌍학리 85637.415515127.738309토석류701양평군2023-08-28
339340경기도 양평군 서종면 수능리 산1437.588936127.395326토석류5641양평군2023-08-28
340341경기도 양평군 용문면 신점리 산10-137.531408127.599951토석류3420양평군2023-08-28
341342경기도 양평군 양동면 고송리 942-337.477875127.69783토석류2135양평군2023-08-28
342343경기도 양평군 단월면 향소리 982-337.581245127.635624토석류10438양평군2023-08-28
343344경기도 양평군 단월면 부안리 산35-237.583416127.673789토석류7389양평군2023-08-28
344345경기도 양평군 양동면 삼산리 134037.38166127.757974토석류702양평군2023-08-28
345346경기도 양평군 양동면 계정리 431-337.485307127.759854토석류701양평군2023-08-28
346347경기도 양평군 지평면 무왕리 산8237.470682127.675338토석류1226양평군2023-08-28