Overview

Dataset statistics

Number of variables5
Number of observations46
Missing cells0
Missing cells (%)0.0%
Duplicate rows1
Duplicate rows (%)2.2%
Total size in memory2.0 KiB
Average record size in memory43.9 B

Variable types

Categorical2
Numeric1
Text2

Dataset

Description한국광해광업공단 지반침하 응급조치사업 대상지에 대하여 지역별, 연도별, 광산별, 시설, 주소 등의 현황 정보 제공
URLhttps://www.data.go.kr/data/15042160/fileData.do

Alerts

Dataset has 1 (2.2%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 14:38:51.596227
Analysis finished2023-12-12 14:38:52.218120
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

Distinct6
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Memory size500.0 B
충청도
18 
영남지역
12 
강원도
부산지역
호남지역

Length

Max length4
Median length3
Mean length3.4347826
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row영남지역
2nd row강원도
3rd row충청도
4th row호남지역
5th row영남지역

Common Values

ValueCountFrequency (%)
충청도 18
39.1%
영남지역 12
26.1%
강원도 8
17.4%
부산지역 4
 
8.7%
호남지역 2
 
4.3%
경인지역 2
 
4.3%

Length

2023-12-12T23:38:52.282712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:38:52.380175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충청도 18
39.1%
영남지역 12
26.1%
강원도 8
17.4%
부산지역 4
 
8.7%
호남지역 2
 
4.3%
경인지역 2
 
4.3%

사업연도
Real number (ℝ)

Distinct14
Distinct (%)30.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2014.1957
Minimum2006
Maximum2022
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size546.0 B
2023-12-12T23:38:52.480181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2006
5-th percentile2008
Q12010
median2012.5
Q32017.75
95-th percentile2022
Maximum2022
Range16
Interquartile range (IQR)7.75

Descriptive statistics

Standard deviation4.928915
Coefficient of variation (CV)0.0024470885
Kurtosis-1.1839382
Mean2014.1957
Median Absolute Deviation (MAD)3.5
Skewness0.43245727
Sum92653
Variance24.294203
MonotonicityIncreasing
2023-12-12T23:38:52.597807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
2010 7
15.2%
2012 7
15.2%
2022 6
13.0%
2021 5
10.9%
2013 4
8.7%
2017 4
8.7%
2008 3
6.5%
2009 3
6.5%
2011 2
 
4.3%
2006 1
 
2.2%
Other values (4) 4
8.7%
ValueCountFrequency (%)
2006 1
 
2.2%
2008 3
6.5%
2009 3
6.5%
2010 7
15.2%
2011 2
 
4.3%
2012 7
15.2%
2013 4
8.7%
2014 1
 
2.2%
2015 1
 
2.2%
2016 1
 
2.2%
ValueCountFrequency (%)
2022 6
13.0%
2021 5
10.9%
2018 1
 
2.2%
2017 4
8.7%
2016 1
 
2.2%
2015 1
 
2.2%
2014 1
 
2.2%
2013 4
8.7%
2012 7
15.2%
2011 2
 
4.3%
Distinct40
Distinct (%)87.0%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-12T23:38:52.832993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length10
Mean length4.0869565
Min length2

Characters and Unicode

Total characters188
Distinct characters87
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)76.1%

Sample

1st row토현
2nd row나전탄광 옥갑구역
3rd row보은원정
4th row장흥
5th row선댁
ValueCountFrequency (%)
성주산 3
 
5.5%
석공신성 2
 
3.6%
득익 2
 
3.6%
광덕 2
 
3.6%
덕수 2
 
3.6%
오복 2
 
3.6%
응급 1
 
1.8%
칠성광업소 1
 
1.8%
효경 1
 
1.8%
덕수(예산군 1
 
1.8%
Other values (38) 38
69.1%
2023-12-12T23:38:53.218557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11
 
5.9%
10
 
5.3%
9
 
4.8%
8
 
4.3%
7
 
3.7%
6
 
3.2%
( 6
 
3.2%
5
 
2.7%
5
 
2.7%
) 5
 
2.7%
Other values (77) 116
61.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 165
87.8%
Space Separator 9
 
4.8%
Open Punctuation 6
 
3.2%
Close Punctuation 5
 
2.7%
Decimal Number 2
 
1.1%
Other Punctuation 1
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11
 
6.7%
10
 
6.1%
8
 
4.8%
7
 
4.2%
6
 
3.6%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (71) 101
61.2%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 165
87.8%
Common 23
 
12.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11
 
6.7%
10
 
6.1%
8
 
4.8%
7
 
4.2%
6
 
3.6%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (71) 101
61.2%
Common
ValueCountFrequency (%)
9
39.1%
( 6
26.1%
) 5
21.7%
, 1
 
4.3%
1 1
 
4.3%
2 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 165
87.8%
ASCII 23
 
12.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11
 
6.7%
10
 
6.1%
8
 
4.8%
7
 
4.2%
6
 
3.6%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
Other values (71) 101
61.2%
ASCII
ValueCountFrequency (%)
9
39.1%
( 6
26.1%
) 5
21.7%
, 1
 
4.3%
1 1
 
4.3%
2 1
 
4.3%

시설 개요
Categorical

Distinct6
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Memory size500.0 B
충전
17 
안전펜스
12 
갱구막이
11 
안전표지판
충전,안전표지판
 
1

Length

Max length8
Median length4
Mean length3.4347826
Min length2

Unique

Unique2 ?
Unique (%)4.3%

Sample

1st row안전펜스
2nd row안전표지판
3rd row갱구막이
4th row안전펜스
5th row안전펜스

Common Values

ValueCountFrequency (%)
충전 17
37.0%
안전펜스 12
26.1%
갱구막이 11
23.9%
안전표지판 4
 
8.7%
충전,안전표지판 1
 
2.2%
안전휀스 1
 
2.2%

Length

2023-12-12T23:38:53.371682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:38:53.495519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
충전 17
37.0%
안전펜스 12
26.1%
갱구막이 11
23.9%
안전표지판 4
 
8.7%
충전,안전표지판 1
 
2.2%
안전휀스 1
 
2.2%

주소
Text

Distinct42
Distinct (%)91.3%
Missing0
Missing (%)0.0%
Memory size500.0 B
2023-12-12T23:38:53.887416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length21
Mean length19.23913
Min length15

Characters and Unicode

Total characters885
Distinct characters120
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)84.8%

Sample

1st row경북 의성군 사곡면 토현리 산69
2nd row강원도 정선군 북면 장열리 산1-1
3rd row충북 보은군 마로면 소여리 산 84
4th row전남 장흥군 관산면 신동리 25-1임
5th row경북 안동시 길안면 백자리 신방골 산57
ValueCountFrequency (%)
충남 14
 
6.1%
경북 10
 
4.3%
성주면 10
 
4.3%
보령시 10
 
4.3%
성주리 9
 
3.9%
강원도 8
 
3.5%
정선군 4
 
1.7%
부산시 4
 
1.7%
충북 4
 
1.7%
의성군 3
 
1.3%
Other values (133) 154
67.0%
2023-12-12T23:38:54.416276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
185
20.9%
42
 
4.7%
41
 
4.6%
36
 
4.1%
1 34
 
3.8%
29
 
3.3%
28
 
3.2%
- 27
 
3.1%
23
 
2.6%
20
 
2.3%
Other values (110) 420
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 546
61.7%
Space Separator 185
 
20.9%
Decimal Number 127
 
14.4%
Dash Punctuation 27
 
3.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
 
7.7%
41
 
7.5%
36
 
6.6%
29
 
5.3%
28
 
5.1%
23
 
4.2%
20
 
3.7%
20
 
3.7%
19
 
3.5%
18
 
3.3%
Other values (98) 270
49.5%
Decimal Number
ValueCountFrequency (%)
1 34
26.8%
3 19
15.0%
4 18
14.2%
5 12
 
9.4%
6 10
 
7.9%
2 10
 
7.9%
8 9
 
7.1%
7 8
 
6.3%
9 5
 
3.9%
0 2
 
1.6%
Space Separator
ValueCountFrequency (%)
185
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 546
61.7%
Common 339
38.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
 
7.7%
41
 
7.5%
36
 
6.6%
29
 
5.3%
28
 
5.1%
23
 
4.2%
20
 
3.7%
20
 
3.7%
19
 
3.5%
18
 
3.3%
Other values (98) 270
49.5%
Common
ValueCountFrequency (%)
185
54.6%
1 34
 
10.0%
- 27
 
8.0%
3 19
 
5.6%
4 18
 
5.3%
5 12
 
3.5%
6 10
 
2.9%
2 10
 
2.9%
8 9
 
2.7%
7 8
 
2.4%
Other values (2) 7
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 546
61.7%
ASCII 339
38.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
185
54.6%
1 34
 
10.0%
- 27
 
8.0%
3 19
 
5.6%
4 18
 
5.3%
5 12
 
3.5%
6 10
 
2.9%
2 10
 
2.9%
8 9
 
2.7%
7 8
 
2.4%
Other values (2) 7
 
2.1%
Hangul
ValueCountFrequency (%)
42
 
7.7%
41
 
7.5%
36
 
6.6%
29
 
5.3%
28
 
5.1%
23
 
4.2%
20
 
3.7%
20
 
3.7%
19
 
3.5%
18
 
3.3%
Other values (98) 270
49.5%

Interactions

2023-12-12T23:38:51.935037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:38:54.771285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지역사업연도광산명시설 개요주소
지역1.0000.1661.0000.4931.000
사업연도0.1661.0000.0000.0860.000
광산명1.0000.0001.0000.9440.991
시설 개요0.4930.0860.9441.0000.985
주소1.0000.0000.9910.9851.000
2023-12-12T23:38:54.874992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시설 개요지역
시설 개요1.0000.190
지역0.1901.000
2023-12-12T23:38:54.950611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업연도지역시설 개요
사업연도1.0000.0000.000
지역0.0001.0000.190
시설 개요0.0000.1901.000

Missing values

2023-12-12T23:38:52.096308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:38:52.182744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

지역사업연도광산명시설 개요주소
0영남지역2006토현안전펜스경북 의성군 사곡면 토현리 산69
1강원도2008나전탄광 옥갑구역안전표지판강원도 정선군 북면 장열리 산1-1
2충청도2008보은원정갱구막이충북 보은군 마로면 소여리 산 84
3호남지역2008장흥안전펜스전남 장흥군 관산면 신동리 25-1임
4영남지역2009선댁안전펜스경북 안동시 길안면 백자리 신방골 산57
5영남지역2009광덕충전경북 의성군 다인면 덕미리 95
6경인지역2009양평안전표지판경기도 양평군 양동면 금왕리 45
7강원도2010어룡(대덕)안전펜스강원도 정선군 고한읍 고한리 산2-141
8충청도2010성주광업소안전펜스충남 보령시 성주면 개화리 산23-4
9충청도2010덕수안전펜스충남 보령시 성주면 성주리 산37-1
지역사업연도광산명시설 개요주소
36충청도2021석공신성갱구막이충남 보령시 성주면 성주리 산32
37충청도2021칠성광업소충전충남 청양군 대치면 주정리 58
38충청도2021(가)삼방충전충남 아산시 배방읍 중리 54
39호남지역2021덕신충전전남 화순군 동복면 안성리 산 87-2
40부산지역2022경창안전휀스부산시 사상구 모라동 산31
41부산지역2022기장갱구막이부산시 기장군 기장읍 서부리 산8
42부산지역2022동래납석갱구막이부산시 기장군 정관읍 두명리 산41
43부산지역2022부산철광갱구막이부산시 사하구 괴정동 산1-1
44강원도2022태영탄광충전강원도 태백시 화전동 168-1
45강원도2022삼원광산갱구막이강원도 삼척시 노곡면 둔달리 81

Duplicate rows

Most frequently occurring

지역사업연도광산명시설 개요주소# duplicates
0충청도2017성주산충전충남 보령시 성주면 성주리 산31-32