Overview

Dataset statistics

Number of variables5
Number of observations23
Missing cells22
Missing cells (%)19.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.0 KiB
Average record size in memory45.7 B

Variable types

Text2
Categorical2
DateTime1

Dataset

Description경기도 이천시의 급경사지 붕괴위험지역에 대한 데이터로, 급경사지 지구명, 위치, 관리기관, 재해위험평가등급, 지정일자에 대한 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15104141/fileData.do

Alerts

지정일자 has constant value ""Constant
지정일자 has 22 (95.7%) missing valuesMissing
지구명 has unique valuesUnique
위치 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:05:46.420361
Analysis finished2023-12-12 12:05:46.819008
Duration0.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지구명
Text

UNIQUE 

Distinct23
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
2023-12-12T21:05:46.955740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length13
Mean length9.6956522
Min length4

Characters and Unicode

Total characters223
Distinct characters48
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)100.0%

Sample

1st row이치지구
2nd row오천지구
3rd row경사지구
4th row가좌지구
5th row진암지구
ValueCountFrequency (%)
경기 10
16.4%
이천 10
16.4%
n1지구 7
 
11.5%
중부내륙선 4
 
6.6%
창전 3
 
4.9%
마장 2
 
3.3%
관고 2
 
3.3%
n2지구 2
 
3.3%
이치지구 1
 
1.6%
수남 1
 
1.6%
Other values (19) 19
31.1%
2023-12-12T21:05:47.360936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
17.0%
22
 
9.9%
18
 
8.1%
12
 
5.4%
11
 
4.9%
11
 
4.9%
N 10
 
4.5%
10
 
4.5%
1 10
 
4.5%
2 6
 
2.7%
Other values (38) 75
33.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 154
69.1%
Space Separator 38
 
17.0%
Decimal Number 18
 
8.1%
Uppercase Letter 10
 
4.5%
Dash Punctuation 3
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
14.3%
18
 
11.7%
12
 
7.8%
11
 
7.1%
11
 
7.1%
10
 
6.5%
5
 
3.2%
5
 
3.2%
4
 
2.6%
4
 
2.6%
Other values (32) 52
33.8%
Decimal Number
ValueCountFrequency (%)
1 10
55.6%
2 6
33.3%
3 2
 
11.1%
Space Separator
ValueCountFrequency (%)
38
100.0%
Uppercase Letter
ValueCountFrequency (%)
N 10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 154
69.1%
Common 59
 
26.5%
Latin 10
 
4.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
14.3%
18
 
11.7%
12
 
7.8%
11
 
7.1%
11
 
7.1%
10
 
6.5%
5
 
3.2%
5
 
3.2%
4
 
2.6%
4
 
2.6%
Other values (32) 52
33.8%
Common
ValueCountFrequency (%)
38
64.4%
1 10
 
16.9%
2 6
 
10.2%
- 3
 
5.1%
3 2
 
3.4%
Latin
ValueCountFrequency (%)
N 10
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 154
69.1%
ASCII 69
30.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
38
55.1%
N 10
 
14.5%
1 10
 
14.5%
2 6
 
8.7%
- 3
 
4.3%
3 2
 
2.9%
Hangul
ValueCountFrequency (%)
22
14.3%
18
 
11.7%
12
 
7.8%
11
 
7.1%
11
 
7.1%
10
 
6.5%
5
 
3.2%
5
 
3.2%
4
 
2.6%
4
 
2.6%
Other values (32) 52
33.8%

위치
Text

UNIQUE 

Distinct23
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size316.0 B
2023-12-12T21:05:47.643059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length20.043478
Min length15

Characters and Unicode

Total characters461
Distinct characters62
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)100.0%

Sample

1st row경기도 이천시 마장면 이치리 275-4
2nd row경기도 이천시 마장면 오천리 249-3
3rd row경기도 이천시 백사면 경사리 643-51
4th row경기도 이천시 부발읍 가좌리 산268-17
5th row경기도 이천시 장호원읍 진암리 산26-31
ValueCountFrequency (%)
경기도 23
21.3%
이천시 23
21.3%
마장면 5
 
4.6%
장호원읍 4
 
3.7%
창전동 3
 
2.8%
증일동 2
 
1.9%
장암리 2
 
1.9%
풍계리 2
 
1.9%
신둔면 2
 
1.9%
관고동 2
 
1.9%
Other values (40) 40
37.0%
2023-12-12T21:05:48.038336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
85
18.4%
24
 
5.2%
24
 
5.2%
24
 
5.2%
23
 
5.0%
23
 
5.0%
23
 
5.0%
- 21
 
4.6%
1 19
 
4.1%
2 16
 
3.5%
Other values (52) 179
38.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 268
58.1%
Decimal Number 87
 
18.9%
Space Separator 85
 
18.4%
Dash Punctuation 21
 
4.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
9.0%
24
 
9.0%
24
 
9.0%
23
 
8.6%
23
 
8.6%
23
 
8.6%
15
 
5.6%
12
 
4.5%
12
 
4.5%
10
 
3.7%
Other values (40) 78
29.1%
Decimal Number
ValueCountFrequency (%)
1 19
21.8%
2 16
18.4%
3 13
14.9%
6 7
 
8.0%
7 7
 
8.0%
5 6
 
6.9%
4 6
 
6.9%
0 6
 
6.9%
8 4
 
4.6%
9 3
 
3.4%
Space Separator
ValueCountFrequency (%)
85
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 268
58.1%
Common 193
41.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
9.0%
24
 
9.0%
24
 
9.0%
23
 
8.6%
23
 
8.6%
23
 
8.6%
15
 
5.6%
12
 
4.5%
12
 
4.5%
10
 
3.7%
Other values (40) 78
29.1%
Common
ValueCountFrequency (%)
85
44.0%
- 21
 
10.9%
1 19
 
9.8%
2 16
 
8.3%
3 13
 
6.7%
6 7
 
3.6%
7 7
 
3.6%
5 6
 
3.1%
4 6
 
3.1%
0 6
 
3.1%
Other values (2) 7
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 268
58.1%
ASCII 193
41.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
85
44.0%
- 21
 
10.9%
1 19
 
9.8%
2 16
 
8.3%
3 13
 
6.7%
6 7
 
3.6%
7 7
 
3.6%
5 6
 
3.1%
4 6
 
3.1%
0 6
 
3.1%
Other values (2) 7
 
3.6%
Hangul
ValueCountFrequency (%)
24
 
9.0%
24
 
9.0%
24
 
9.0%
23
 
8.6%
23
 
8.6%
23
 
8.6%
15
 
5.6%
12
 
4.5%
12
 
4.5%
10
 
3.7%
Other values (40) 78
29.1%

관리기관
Categorical

Distinct2
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size316.0 B
사유시설
14 
공공시설

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사유시설
2nd row사유시설
3rd row사유시설
4th row사유시설
5th row사유시설

Common Values

ValueCountFrequency (%)
사유시설 14
60.9%
공공시설 9
39.1%

Length

2023-12-12T21:05:48.215023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:05:48.659807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사유시설 14
60.9%
공공시설 9
39.1%
Distinct4
Distinct (%)17.4%
Missing0
Missing (%)0.0%
Memory size316.0 B
C
12 
B
D
 
1
A
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique2 ?
Unique (%)8.7%

Sample

1st rowB
2nd rowB
3rd rowD
4th rowB
5th rowC

Common Values

ValueCountFrequency (%)
C 12
52.2%
B 9
39.1%
D 1
 
4.3%
A 1
 
4.3%

Length

2023-12-12T21:05:48.773934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:05:48.902633image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
c 12
52.2%
b 9
39.1%
d 1
 
4.3%
a 1
 
4.3%

지정일자
Date

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing22
Missing (%)95.7%
Memory size316.0 B
Minimum2020-12-01 00:00:00
Maximum2020-12-01 00:00:00
2023-12-12T21:05:49.007385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:05:49.131091image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-12T21:05:49.240822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지구명위치관리기관재해위험평가등급
지구명1.0001.0001.0001.000
위치1.0001.0001.0001.000
관리기관1.0001.0001.0000.485
재해위험평가등급1.0001.0000.4851.000
2023-12-12T21:05:49.346768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리기관재해위험평가등급
관리기관1.0000.303
재해위험평가등급0.3031.000
2023-12-12T21:05:49.460990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
관리기관재해위험평가등급
관리기관1.0000.303
재해위험평가등급0.3031.000

Missing values

2023-12-12T21:05:46.667397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:05:46.777130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지구명위치관리기관재해위험평가등급지정일자
0이치지구경기도 이천시 마장면 이치리 275-4사유시설B<NA>
1오천지구경기도 이천시 마장면 오천리 249-3사유시설B<NA>
2경사지구경기도 이천시 백사면 경사리 643-51사유시설D2020-12-01
3가좌지구경기도 이천시 부발읍 가좌리 산268-17사유시설B<NA>
4진암지구경기도 이천시 장호원읍 진암리 산26-31사유시설C<NA>
5매곡지구경기도 이천시 호법면 매곡리 323-4사유시설A<NA>
6고척지구경기도 이천시 신둔면 고척리 363-13사유시설C<NA>
7장암지구경기도 이천시 마장면 장암리 산117-2공공시설B<NA>
8수도권본부 경강선1경기도 이천시 증일동 60-7공공시설C<NA>
9중부내륙선 1공구경기도 이천시 대월면 장평리 산30-8공공시설B<NA>
지구명위치관리기관재해위험평가등급지정일자
13경기 이천 창전 N1지구경기도 이천시 창전동 산8-10사유시설C<NA>
14경기 이천 창전 N2지구경기도 이천시 창전동 산9-1사유시설C<NA>
15경기 이천 창전 N3지구경기도 이천시 창전동 산11공공시설C<NA>
16경기 이천 관고 N1지구경기도 이천시 관고동 산20-7공공시설B<NA>
17경기 이천 관고 N2지구경기도 이천시 관고동 산21-1사유시설C<NA>
18경기 이천 마장 회억 N1지구경기도 이천시 마장면 회억리 223-22사유시설C<NA>
19경기 이천 마장 장암 N1지구경기도 이천시 마장면 장암리 산58-7공공시설C<NA>
20경기 이천 송정 N1지구경기도 이천시 송정동 산34사유시설C<NA>
21경기 이천 신둔 수남 N1지구경기도 이천시 신둔면 수남리 176-10사유시설C<NA>
22경기 이천 증일 N1지구경기도 이천시 증일동 325-4사유시설C<NA>