Overview

Dataset statistics

Number of variables7
Number of observations58
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.4 KiB
Average record size in memory59.3 B

Variable types

DateTime2
Text2
Numeric1
Categorical2

Dataset

Description전라남도 무안군 관내 토석채취허가 현황 및 채석신고 현황(시작일자, 종료일자, 소재지, 허가면적, 토석구분, 용도, 업체 등) 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/15060109/fileData.do

Alerts

토석구분 is highly imbalanced (53.2%)Imbalance

Reproduction

Analysis started2023-12-12 06:35:59.882170
Analysis finished2023-12-12 06:36:00.490418
Duration0.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct53
Distinct (%)91.4%
Missing0
Missing (%)0.0%
Memory size596.0 B
Minimum2001-10-08 00:00:00
Maximum2020-12-31 00:00:00
2023-12-12T15:36:00.546852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:36:00.668159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct42
Distinct (%)72.4%
Missing0
Missing (%)0.0%
Memory size596.0 B
Minimum2002-10-30 00:00:00
Maximum2029-09-07 00:00:00
2023-12-12T15:36:00.869468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:36:01.000674image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
Distinct55
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size596.0 B
2023-12-12T15:36:01.274442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length23
Mean length21.741379
Min length19

Characters and Unicode

Total characters1261
Distinct characters64
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)89.7%

Sample

1st row전라남도 무안군 삼향읍 임성리 산14-1
2nd row전라남도 무안군 청계면 송현리 산78-5
3rd row전라남도 무안군 일로읍 광암리 산34
4th row전라남도 무안군 삼향읍 왕산리 산 127-7
5th row전라남도 무안군 삼향읍 맥포리 222-17
ValueCountFrequency (%)
전라남도 58
19.3%
무안군 58
19.3%
삼향읍 21
 
7.0%
몽탄면 10
 
3.3%
10
 
3.3%
청계면 9
 
3.0%
일로읍 8
 
2.7%
임성리 7
 
2.3%
맥포리 6
 
2.0%
송현리 5
 
1.7%
Other values (80) 108
36.0%
2023-12-12T15:36:01.693582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
242
19.2%
61
 
4.8%
61
 
4.8%
61
 
4.8%
58
 
4.6%
58
 
4.6%
58
 
4.6%
58
 
4.6%
58
 
4.6%
57
 
4.5%
Other values (54) 489
38.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 802
63.6%
Space Separator 242
 
19.2%
Decimal Number 180
 
14.3%
Dash Punctuation 37
 
2.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
7.6%
61
 
7.6%
61
 
7.6%
58
 
7.2%
58
 
7.2%
58
 
7.2%
58
 
7.2%
58
 
7.2%
57
 
7.1%
32
 
4.0%
Other values (42) 240
29.9%
Decimal Number
ValueCountFrequency (%)
1 43
23.9%
2 31
17.2%
3 21
11.7%
8 16
 
8.9%
7 15
 
8.3%
4 14
 
7.8%
6 11
 
6.1%
5 11
 
6.1%
9 9
 
5.0%
0 9
 
5.0%
Space Separator
ValueCountFrequency (%)
242
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 802
63.6%
Common 459
36.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
61
 
7.6%
61
 
7.6%
61
 
7.6%
58
 
7.2%
58
 
7.2%
58
 
7.2%
58
 
7.2%
58
 
7.2%
57
 
7.1%
32
 
4.0%
Other values (42) 240
29.9%
Common
ValueCountFrequency (%)
242
52.7%
1 43
 
9.4%
- 37
 
8.1%
2 31
 
6.8%
3 21
 
4.6%
8 16
 
3.5%
7 15
 
3.3%
4 14
 
3.1%
6 11
 
2.4%
5 11
 
2.4%
Other values (2) 18
 
3.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 802
63.6%
ASCII 459
36.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
242
52.7%
1 43
 
9.4%
- 37
 
8.1%
2 31
 
6.8%
3 21
 
4.6%
8 16
 
3.5%
7 15
 
3.3%
4 14
 
3.1%
6 11
 
2.4%
5 11
 
2.4%
Other values (2) 18
 
3.9%
Hangul
ValueCountFrequency (%)
61
 
7.6%
61
 
7.6%
61
 
7.6%
58
 
7.2%
58
 
7.2%
58
 
7.2%
58
 
7.2%
58
 
7.2%
57
 
7.1%
32
 
4.0%
Other values (42) 240
29.9%
Distinct56
Distinct (%)96.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25547.534
Minimum574
Maximum79969
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size654.0 B
2023-12-12T15:36:01.891104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum574
5-th percentile2061.45
Q16976
median21699
Q340820.5
95-th percentile61412
Maximum79969
Range79395
Interquartile range (IQR)33844.5

Descriptive statistics

Standard deviation21243.8
Coefficient of variation (CV)0.83154014
Kurtosis-0.2472161
Mean25547.534
Median Absolute Deviation (MAD)15883
Skewness0.78230905
Sum1481757
Variance4.5129905 × 108
MonotonicityNot monotonic
2023-12-12T15:36:02.099122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1945 2
 
3.4%
24100 2
 
3.4%
79969 1
 
1.7%
3302 1
 
1.7%
41220 1
 
1.7%
15663 1
 
1.7%
28844 1
 
1.7%
43366 1
 
1.7%
33024 1
 
1.7%
2082 1
 
1.7%
Other values (46) 46
79.3%
ValueCountFrequency (%)
574 1
1.7%
1945 2
3.4%
2082 1
1.7%
2095 1
1.7%
2176 1
1.7%
2270 1
1.7%
3135 1
1.7%
3302 1
1.7%
3465 1
1.7%
3659 1
1.7%
ValueCountFrequency (%)
79969 1
1.7%
78709 1
1.7%
61582 1
1.7%
61382 1
1.7%
61204 1
1.7%
60252 1
1.7%
60090 1
1.7%
51088 1
1.7%
51039 1
1.7%
48104 1
1.7%

토석구분
Categorical

IMBALANCE 

Distinct3
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Memory size596.0 B
토사점토
49 
골재
석재
 
2

Length

Max length4
Median length4
Mean length3.6896552
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row토사점토
2nd row골재
3rd row토사점토
4th row토사점토
5th row토사점토

Common Values

ValueCountFrequency (%)
토사점토 49
84.5%
골재 7
 
12.1%
석재 2
 
3.4%

Length

2023-12-12T15:36:02.274502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:36:02.423831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
토사점토 49
84.5%
골재 7
 
12.1%
석재 2
 
3.4%

용도
Categorical

Distinct3
Distinct (%)5.2%
Missing0
Missing (%)0.0%
Memory size596.0 B
기타
38 
토목용
17 
쇄골재용
 
3

Length

Max length4
Median length2
Mean length2.3965517
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기타
2nd row쇄골재용
3rd row기타
4th row기타
5th row기타

Common Values

ValueCountFrequency (%)
기타 38
65.5%
토목용 17
29.3%
쇄골재용 3
 
5.2%

Length

2023-12-12T15:36:02.583678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:36:02.709270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 38
65.5%
토목용 17
29.3%
쇄골재용 3
 
5.2%

업체
Text

Distinct51
Distinct (%)87.9%
Missing0
Missing (%)0.0%
Memory size596.0 B
2023-12-12T15:36:02.999343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length6.8103448
Min length3

Characters and Unicode

Total characters395
Distinct characters95
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)79.3%

Sample

1st row(유)부국건업
2nd row경성산업주식회사
3rd row박춘재
4th row유한회사 강토건설
5th row김원만
ValueCountFrequency (%)
주)부국건설 4
 
5.6%
유한회사 4
 
5.6%
유)남악산업 3
 
4.2%
주식회사 3
 
4.2%
예원건설㈜ 2
 
2.8%
남화산업(주 2
 
2.8%
유)부국건업 2
 
2.8%
유)태원 2
 
2.8%
서변선 2
 
2.8%
대표 1
 
1.4%
Other values (47) 47
65.3%
2023-12-12T15:36:03.527285image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 31
 
7.8%
) 31
 
7.8%
27
 
6.8%
22
 
5.6%
18
 
4.6%
16
 
4.1%
16
 
4.1%
16
 
4.1%
14
 
3.5%
11
 
2.8%
Other values (85) 193
48.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 317
80.3%
Open Punctuation 31
 
7.8%
Close Punctuation 31
 
7.8%
Space Separator 14
 
3.5%
Other Symbol 2
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
8.5%
22
 
6.9%
18
 
5.7%
16
 
5.0%
16
 
5.0%
16
 
5.0%
11
 
3.5%
11
 
3.5%
9
 
2.8%
8
 
2.5%
Other values (81) 163
51.4%
Open Punctuation
ValueCountFrequency (%)
( 31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 31
100.0%
Space Separator
ValueCountFrequency (%)
14
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 319
80.8%
Common 76
 
19.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
8.5%
22
 
6.9%
18
 
5.6%
16
 
5.0%
16
 
5.0%
16
 
5.0%
11
 
3.4%
11
 
3.4%
9
 
2.8%
8
 
2.5%
Other values (82) 165
51.7%
Common
ValueCountFrequency (%)
( 31
40.8%
) 31
40.8%
14
18.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 317
80.3%
ASCII 76
 
19.2%
None 2
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 31
40.8%
) 31
40.8%
14
18.4%
Hangul
ValueCountFrequency (%)
27
 
8.5%
22
 
6.9%
18
 
5.7%
16
 
5.0%
16
 
5.0%
16
 
5.0%
11
 
3.5%
11
 
3.5%
9
 
2.8%
8
 
2.5%
Other values (81) 163
51.4%
None
ValueCountFrequency (%)
2
100.0%

Interactions

2023-12-12T15:36:00.257941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:36:03.642529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시작일자종료일자소재지허가면적(제곱미터)토석구분용도업체
시작일자1.0000.9660.9890.7101.0001.0000.982
종료일자0.9661.0000.9710.7770.4150.9570.935
소재지0.9890.9711.0000.7031.0000.0000.921
허가면적(제곱미터)0.7100.7770.7031.0000.0000.4800.815
토석구분1.0000.4151.0000.0001.0000.8051.000
용도1.0000.9570.0000.4800.8051.0000.950
업체0.9820.9350.9210.8151.0000.9501.000
2023-12-12T15:36:03.779758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
용도토석구분
용도1.0000.470
토석구분0.4701.000
2023-12-12T15:36:03.893915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
허가면적(제곱미터)토석구분용도
허가면적(제곱미터)1.0000.0000.227
토석구분0.0001.0000.470
용도0.2270.4701.000

Missing values

2023-12-12T15:36:00.359792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:36:00.451610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시작일자종료일자소재지허가면적(제곱미터)토석구분용도업체
02001-10-082008-06-28전라남도 무안군 삼향읍 임성리 산14-148104토사점토기타(유)부국건업
12001-10-312002-10-30전라남도 무안군 청계면 송현리 산78-524236골재쇄골재용경성산업주식회사
22002-09-242003-09-30전라남도 무안군 일로읍 광암리 산343465토사점토기타박춘재
32002-10-092005-04-21전라남도 무안군 삼향읍 왕산리 산 127-715493토사점토기타유한회사 강토건설
42002-10-102003-12-31전라남도 무안군 삼향읍 맥포리 222-1712822토사점토기타김원만
52002-12-232003-05-30전라남도 무안군 일로읍 청호리 283-16805토사점토기타박효장
62003-08-182003-12-31전라남도 무안군 해제면 천장리 산1473135토사점토토목용대진종합건설(주)
72003-11-112006-11-30전라남도 무안군 청계면 송현리 산78-544875골재쇄골재용(주)경성산업
82003-11-212005-06-30전라남도 무안군 삼향읍 유교리 산61-1660090토사점토기타(주)부국건설
92003-12-052006-09-30전라남도 무안군 몽탄면 양장리 산9021308토사점토토목용김영복
시작일자종료일자소재지허가면적(제곱미터)토석구분용도업체
482015-03-232017-06-30전라남도 무안군 삼향읍 맥포리 산78-744642토사점토기타(주)오룡산업
492015-03-242016-09-30전라남도 무안군 현경면 해운리 산123-423037토사점토기타농업회사법인(주) 서연
502015-04-012020-04-30전라남도 무안군 몽탄면 달산리 산 23224100토사점토기타예원건설㈜
512015-06-012025-06-30전라남도 무안군 일로읍 구정리 산4678709골재기타(유)한석산업
522015-10-222016-09-15전라남도 무안군 몽탄면 사창리 196-17890토사점토기타송원토건 주식회사
532017-01-092022-12-31전라남도 무안군 몽탄면 봉명리 산12860252토사점토기타유한회사 승광산업
542017-08-282020-04-30전라남도 무안군 몽탄면 달산리 산23224100토사점토기타예원건설㈜
552020-02-282021-02-27전라남도 무안군 무안읍 신학리 441-22095토사점토기타에이치토건 주식회사
562020-07-012029-09-07전라남도 무안군 일로읍 구정리 산4661582골재쇄골재용유한회사 무안골재
572020-12-312021-12-31전라남도 무안군 무안읍 신학리 산47-12176토사점토기타김구원