Overview

Dataset statistics

Number of variables: 8
Number of observations: 742
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 48.7 KiB
Average record size in memory: 67.2 B
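
The two memory figures are related by a simple division. The sketch below assumes that 1 KiB is 1,024 bytes and that the average record size is the total size divided by the number of observations; the variable names are illustrative and not taken from the profiling tool.

```python
# Figures copied from the dataset statistics above.
total_size_kib = 48.7
n_observations = 742

# Assumption: 1 KiB = 1024 bytes.
total_size_bytes = total_size_kib * 1024
avg_record_bytes = total_size_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```

Rounded to one decimal place, the result agrees with the reported average record size.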

Variable types

Numeric: 3
Categorical: 3
Text: 1
DateTime: 1

Dataset

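Description: This dataset lists the idle land parcels held by Korea Coal Corporation (대한석탄공사), broken down by mining office. The corporation seeks to sell the idle land each year, and the dataset is updated annually.
Author: Korea Coal Corporation (대한석탄공사)
URL: https://www.data.go.kr/data/15009054/fileData.do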

Alerts

사업소 is highly overall correlated with 시군 (High correlation)
시군 is highly overall correlated with 번호 and 1 other field (High correlation)
번호 is highly overall correlated with 시군 (High correlation)
면적(평방미터) is highly overall correlated with 2023년공시가 (High correlation)
2023년공시가 is highly overall correlated with 면적(평방미터) (High correlation)
사업소 is highly imbalanced (94.8%) (Imbalance)
번호 has unique values (Unique)
소재지지번 has unique values (Unique)
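
The 94.8% imbalance figure for 사업소 can be reproduced from the category counts reported for that variable (735, 6 and 1). The sketch below assumes the score is one minus the Shannon entropy of the class shares divided by the maximum possible entropy for that number of classes. That definition is an assumption that happens to match the reported value; the report itself does not state the formula, and `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # with a single class there is nothing to compare
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts of 장성, 호남 and 도계 in the 사업소 column.
score = imbalance_score([735, 6, 1])
print(f"{score:.1%}")  # 94.8%
```

A score of 0 would mean perfectly balanced classes, and a score near 1 means a single class dominates, as it does here.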

Reproduction

Analysis started: 2024-04-06 08:06:45.009826
Analysis finished: 2024-04-06 08:06:48.860011
Duration: 3.85 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
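
A report of this kind could be regenerated along the following lines. This is a minimal sketch, not the exact invocation used for this report: `build_report`, the file paths, the report title and the `cp949` encoding are all assumptions, since the report does not state the input file name, its encoding, or the configuration beyond `config.json`.

```python
def build_report(csv_path, output_html="report.html", encoding="cp949"):
    """Profile a CSV file with ydata-profiling and write the result as HTML.

    Assumptions: csv_path, output_html and the cp949 encoding are
    placeholders; adjust them to the actual download.
    """
    # Imported lazily so this module still loads where the libraries are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="Idle land profile")  # title is illustrative
    report.to_file(output_html)
    return report
```

Calling `build_report("idle_land.csv")` with a hypothetical local copy of the data would write `report.html` to the working directory.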

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 742
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 371.5
Minimum: 1
Maximum: 742
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 38.05
Q1: 186.25
Median: 371.5
Q3: 556.75
95-th percentile: 704.95
Maximum: 742
Range: 741
Interquartile range (IQR): 370.5
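
Because 번호 is simply the sequence 1 to 742, the quantiles can be checked by hand. The sketch below assumes quantiles are computed by linear interpolation between the two nearest ranks, which reproduces the figures listed here; `quantile` is a hypothetical helper, not a function of the profiling tool.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = q * (len(sorted_values) - 1)  # fractional zero-based index
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


ids = list(range(1, 743))  # 번호 runs from 1 to 742

p05 = quantile(ids, 0.05)
q1 = quantile(ids, 0.25)
q3 = quantile(ids, 0.75)
p95 = quantile(ids, 0.95)
iqr = q3 - q1

print(p05, q1, q3, p95, iqr)
```

Up to floating-point rounding, the values match the 5-th percentile, Q1, Q3, 95-th percentile and IQR in the list.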

Descriptive statistics

Standard deviation: 214.34124
Coefficient of variation (CV): 0.57696161
Kurtosis: -1.2
Mean: 371.5
Median Absolute Deviation (MAD): 185.5
Skewness: 0
Sum: 275653
Variance: 45942.167
Monotonicity: Strictly increasing
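
These descriptive statistics follow directly from the sequence 1 to 742 and can be recomputed with the Python standard library. The sketch assumes the reported variance and standard deviation are the sample versions (dividing by n - 1), which is the convention that matches the figures above.

```python
import statistics

ids = list(range(1, 743))  # 번호 runs from 1 to 742

total = sum(ids)
mean = statistics.mean(ids)
variance = statistics.variance(ids)  # sample variance, divides by n - 1
std_dev = statistics.stdev(ids)      # square root of the sample variance
cv = std_dev / mean                  # coefficient of variation
median = statistics.median(ids)
mad = statistics.median([abs(x - median) for x in ids])  # median absolute deviation

print(total, mean, round(variance, 3), round(std_dev, 5), round(cv, 8), mad)
```

For an evenly spaced sequence like this one, the skewness of 0 and the negative excess kurtosis are what a uniform distribution would give, so the column carries no information beyond row order.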
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.1%
500 | 1 | 0.1%
491 | 1 | 0.1%
492 | 1 | 0.1%
493 | 1 | 0.1%
494 | 1 | 0.1%
495 | 1 | 0.1%
496 | 1 | 0.1%
497 | 1 | 0.1%
498 | 1 | 0.1%
Other values (732) | 732 | 98.7%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
742 | 1 | 0.1%
741 | 1 | 0.1%
740 | 1 | 0.1%
739 | 1 | 0.1%
738 | 1 | 0.1%
737 | 1 | 0.1%
736 | 1 | 0.1%
735 | 1 | 0.1%
734 | 1 | 0.1%
733 | 1 | 0.1%

사업소
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 3
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Category | Count
장성 | 735
호남 | 6
도계 | 1

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): 0.1%

Sample

1st row: 장성
2nd row: 장성
3rd row: 장성
4th row: 장성
5th row: 장성

Common Values

Value | Count | Frequency (%)
장성 | 735 | 99.1%
호남 | 6 | 0.8%
도계 | 1 | 0.1%

Length

Histogram of lengths of the category


시군
Categorical

HIGH CORRELATION 

Distinct: 11
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Category | Count
영월군 | 260
평창군 | 161
정선군 | 114
영양군 | 73
봉화군 | 68
Other values (6) | 66

Length

Max length: 3
Median length: 3
Mean length: 2.990566
Min length: 2

Unique

Unique: 2
Unique (%): 0.3%

Sample

1st row: 영월군
2nd row: 영월군
3rd row: 영월군
4th row: 영월군
5th row: 영월군

Common Values

Value | Count | Frequency (%)
영월군 | 260 | 35.0%
평창군 | 161 | 21.7%
정선군 | 114 | 15.4%
영양군 | 73 | 9.8%
봉화군 | 68 | 9.2%
단양군 | 31 | 4.2%
문경시 | 24 | 3.2%
보령 | 6 | 0.8%
예천군 | 3 | 0.4%
청송군 | 1 | 0.1%

Length

Histogram of lengths of the category

소재지지번
Text

UNIQUE 

Distinct: 742
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Length

Max length: 18
Median length: 16
Mean length: 14.770889
Min length: 11

Characters and Unicode

Total characters: 10960
Distinct characters: 133
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
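
As an illustration of those character properties, the sketch below counts the Unicode general category of each character in one address, using the first sample row of 소재지지번. It relies only on the standard `unicodedata` module; it is not the profiling tool's own code.

```python
import unicodedata
from collections import Counter

address = "강원영월군김삿갓면주문리산90-1"  # first sample row of 소재지지번

# General category code per character: Lo = Other Letter,
# Nd = Decimal Number, Pd = Dash Punctuation.
categories = Counter(unicodedata.category(ch) for ch in address)

print(dict(categories))
```

The three category codes that appear are the same three categories the report lists for the whole column.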

Unique

Unique: 742
Unique (%): 100.0%

Sample

1st row: 강원영월군김삿갓면주문리산90-1
2nd row: 강원영월군김삿갓면주문리산90-2
3rd row: 강원영월군북면마차리산1-3
4th row: 강원영월군북면마차리산15-4
5th row: 강원영월군북면마차리산26

Value | Count | Frequency (%)
강원영월군김삿갓면주문리산90-1 | 1 | 0.1%
강원평창군미탄면율치리551 | 1 | 0.1%
강원평창군미탄면율치리527-5 | 1 | 0.1%
강원평창군미탄면율치리602 | 1 | 0.1%
강원평창군미탄면율치리546 | 1 | 0.1%
강원평창군미탄면율치리547-1 | 1 | 0.1%
강원평창군미탄면율치리547 | 1 | 0.1%
강원평창군미탄면율치리549 | 1 | 0.1%
강원평창군미탄면율치리550 | 1 | 0.1%
강원평창군미탄면율치리551-1 | 1 | 0.1%
Other values (732) | 732 | 98.7%

Most occurring characters

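Value | Count | Frequency (%)
(unrecovered) | 737 | 6.7%
(unrecovered) | 711 | 6.5%
(unrecovered) | 628 | 5.7%
(unrecovered) | 545 | 5.0%
(unrecovered) | 540 | 4.9%
1 | 518 | 4.7%
(unrecovered) | 464 | 4.2%
2 | 385 | 3.5%
- | 384 | 3.5%
(unrecovered) | 335 | 3.1%
Other values (123) | 5713 | 52.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 8096 | 73.9%
Decimal Number | 2480 | 22.6%
Dash Punctuation | 384 | 3.5%

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
(unrecovered) | 737 | 9.1%
(unrecovered) | 711 | 8.8%
(unrecovered) | 628 | 7.8%
(unrecovered) | 545 | 6.7%
(unrecovered) | 540 | 6.7%
(unrecovered) | 464 | 5.7%
(unrecovered) | 335 | 4.1%
(unrecovered) | 295 | 3.6%
(unrecovered) | 271 | 3.3%
(unrecovered) | 260 | 3.2%
Other values (112) | 3310 | 40.9%

Decimal Number

Value | Count | Frequency (%)
1 | 518 | 20.9%
2 | 385 | 15.5%
3 | 238 | 9.6%
5 | 233 | 9.4%
7 | 224 | 9.0%
6 | 220 | 8.9%
4 | 215 | 8.7%
9 | 161 | 6.5%
8 | 152 | 6.1%
0 | 134 | 5.4%

Dash Punctuation

Value | Count | Frequency (%)
- | 384 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 8096 | 73.9%
Common | 2864 | 26.1%

Most frequent character per script

Hangul

Value | Count | Frequency (%)
(unrecovered) | 737 | 9.1%
(unrecovered) | 711 | 8.8%
(unrecovered) | 628 | 7.8%
(unrecovered) | 545 | 6.7%
(unrecovered) | 540 | 6.7%
(unrecovered) | 464 | 5.7%
(unrecovered) | 335 | 4.1%
(unrecovered) | 295 | 3.6%
(unrecovered) | 271 | 3.3%
(unrecovered) | 260 | 3.2%
Other values (112) | 3310 | 40.9%

Common

Value | Count | Frequency (%)
1 | 518 | 18.1%
2 | 385 | 13.4%
- | 384 | 13.4%
3 | 238 | 8.3%
5 | 233 | 8.1%
7 | 224 | 7.8%
6 | 220 | 7.7%
4 | 215 | 7.5%
9 | 161 | 5.6%
8 | 152 | 5.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 8096 | 73.9%
ASCII | 2864 | 26.1%

Most frequent character per block

Hangul

Value | Count | Frequency (%)
(unrecovered) | 737 | 9.1%
(unrecovered) | 711 | 8.8%
(unrecovered) | 628 | 7.8%
(unrecovered) | 545 | 6.7%
(unrecovered) | 540 | 6.7%
(unrecovered) | 464 | 5.7%
(unrecovered) | 335 | 4.1%
(unrecovered) | 295 | 3.6%
(unrecovered) | 271 | 3.3%
(unrecovered) | 260 | 3.2%
Other values (112) | 3310 | 40.9%

ASCII

Value | Count | Frequency (%)
1 | 518 | 18.1%
2 | 385 | 13.4%
- | 384 | 13.4%
3 | 238 | 8.3%
5 | 233 | 8.1%
7 | 224 | 7.8%
6 | 220 | 7.7%
4 | 215 | 7.5%
9 | 161 | 5.6%
8 | 152 | 5.3%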

지목
Categorical

Distinct: 5
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB

Category | Count
임야 | 300
(unrecovered) | 266
(unrecovered) | 121
잡종지 | 45
(unrecovered) | 10

Length

Max length: 3
Median length: 2
Mean length: 1.6886792
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 임야
2nd row: 임야
3rd row: 임야
4th row: 임야
5th row: 임야

Common Values

Value | Count | Frequency (%)
임야 | 300 | 40.4%
(unrecovered) | 266 | 35.8%
(unrecovered) | 121 | 16.3%
잡종지 | 45 | 6.1%
(unrecovered) | 10 | 1.3%

Length

Histogram of lengths of the category

취득일자
DateTime

Distinct: 70
Distinct (%): 9.4%
Missing: 0
Missing (%): 0.0%
Memory size: 5.9 KiB
Minimum: 1954-06-28 00:00:00
Maximum: 2017-10-31 00:00:00
Histogram with fixed size bins (bins=50)

면적(평방미터)
Real number (ℝ)

HIGH CORRELATION 

Distinct: 556
Distinct (%): 74.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 25732.679
Minimum: 1
Maximum: 2294468
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.7 KiB

Quantile statistics

Minimum: 1
5-th percentile: 23
Q1: 212
Median: 886
Q3: 3176.25
95-th percentile: 104103.65
Maximum: 2294468
Range: 2294467
Interquartile range (IQR): 2964.25

Descriptive statistics

Standard deviation: 131506.09
Coefficient of variation (CV): 5.1104703
Kurtosis: 181.60985
Mean: 25732.679
Median Absolute Deviation (MAD): 787
Skewness: 12.340252
Sum: 19093648
Variance: 1.7293853 × 10^10
Monotonicity: Not monotonic
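
Several of the 면적(평방미터) figures are derived from one another, which gives a quick consistency check on the summary. The sketch below uses only the values reported above and assumes the usual definitions (mean = sum / n, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared); the variable names are illustrative.

```python
# Values copied from the 면적(평방미터) summary above.
n = 742
total = 19093648
std_dev = 131506.09
q1, q3 = 212, 3176.25

mean = total / n         # arithmetic mean
iqr = q3 - q1            # interquartile range
cv = std_dev / mean      # coefficient of variation
variance = std_dev ** 2  # variance from the rounded standard deviation

print(round(mean, 3), iqr, round(cv, 4), f"{variance:.4e}")
```

The mean is roughly 29 times the median of 886, which is consistent with the strong positive skewness: a handful of very large parcels dominate the total area.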
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
46 | 8 | 1.1%
3 | 8 | 1.1%
909 | 6 | 0.8%
13 | 6 | 0.8%
26 | 5 | 0.7%
109 | 5 | 0.7%
116 | 5 | 0.7%
198 | 5 | 0.7%
36 | 5 | 0.7%
7 | 5 | 0.7%
Other values (546) | 684 | 92.2%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.1%
3 | 8 | 1.1%
6 | 1 | 0.1%
7 | 5 | 0.7%
10 | 3 | 0.4%
11 | 1 | 0.1%
13 | 6 | 0.8%
16 | 4 | 0.5%
17 | 2 | 0.3%
19 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
2294468 | 1 | 0.1%
1836947 | 1 | 0.1%
1328056 | 1 | 0.1%
708099 | 1 | 0.1%
539929 | 1 | 0.1%
495500 | 1 | 0.1%
463338 | 1 | 0.1%
420893 | 1 | 0.1%
388796 | 1 | 0.1%
362975 | 1 | 0.1%

2023년공시가
Real number (ℝ)

HIGH CORRELATION 

Distinct: 738
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16931914
Minimum: 1260
Maximum: 1.1700173 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.7 KiB

Quantile statistics

Minimum: 1260
5-th percentile: 127230.8
Q1: 1009406.5
Median: 3804534
Q3: 12522519
95-th percentile: 52281867
Maximum: 1.1700173 × 10^9
Range: 1.1700161 × 10^9
Interquartile range (IQR): 11513112

Descriptive statistics

Standard deviation: 72257368
Coefficient of variation (CV): 4.2675251
Kurtosis: 196.00041
Mean: 16931914
Median Absolute Deviation (MAD): 3431684
Skewness: 13.203605
Sum: 1.256348 × 10^10
Variance: 5.2211272 × 10^15
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
5356820 | 2 | 0.3%
579200 | 2 | 0.3%
1033560 | 2 | 0.3%
102720 | 2 | 0.3%
177600 | 1 | 0.1%
3669750 | 1 | 0.1%
1668440 | 1 | 0.1%
593920 | 1 | 0.1%
5359200 | 1 | 0.1%
11919000 | 1 | 0.1%
Other values (728) | 728 | 98.1%

Minimum 10 values

Value | Count | Frequency (%)
1260 | 1 | 0.1%
1566 | 1 | 0.1%
3750 | 1 | 0.1%
11352 | 1 | 0.1%
13944 | 1 | 0.1%
20868 | 1 | 0.1%
21060 | 1 | 0.1%
26200 | 1 | 0.1%
31000 | 1 | 0.1%
33000 | 1 | 0.1%

Maximum 10 values

Value | Count | Frequency (%)
1170017336 | 1 | 0.1%
1161000808 | 1 | 0.1%
856017302 | 1 | 0.1%
235999172 | 1 | 0.1%
220291032 | 1 | 0.1%
218802591 | 1 | 0.1%
208247994 | 1 | 0.1%
203970900 | 1 | 0.1%
199235340 | 1 | 0.1%
166983500 | 1 | 0.1%

Interactions

[Nine interaction plots; images not included]

Correlations

Variable | 번호 | 사업소 | 시군 | 지목 | 취득일자 | 면적(평방미터) | 2023년공시가
번호 | 1.000 | 0.281 | 0.871 | 0.792 | 0.919 | 0.077 | 0.148
사업소 | 0.281 | 1.000 | 1.000 | 0.260 | 1.000 | 0.000 | 0.000
시군 | 0.871 | 1.000 | 1.000 | 0.699 | 0.993 | 0.674 | 0.332
지목 | 0.792 | 0.260 | 0.699 | 1.000 | 0.854 | 0.000 | 0.093
취득일자 | 0.919 | 1.000 | 0.993 | 0.854 | 1.000 | 0.933 | 0.944
면적(평방미터) | 0.077 | 0.000 | 0.674 | 0.000 | 0.933 | 1.000 | 0.897
2023년공시가 | 0.148 | 0.000 | 0.332 | 0.093 | 0.944 | 0.897 | 1.000

Variable | 사업소 | 지목 | 시군
사업소 | 1.000 | 0.203 | 0.995
지목 | 0.203 | 1.000 | 0.478
시군 | 0.995 | 0.478 | 1.000

Variable | 번호 | 면적(평방미터) | 2023년공시가 | 사업소 | 시군 | 지목
번호 | 1.000 | 0.383 | 0.264 | 0.174 | 0.615 | 0.447
면적(평방미터) | 0.383 | 1.000 | 0.743 | 0.000 | 0.415 | 0.000
2023년공시가 | 0.264 | 0.743 | 1.000 | 0.000 | 0.189 | 0.034
사업소 | 0.174 | 0.000 | 0.000 | 1.000 | 0.995 | 0.203
시군 | 0.615 | 0.415 | 0.189 | 0.995 | 1.000 | 0.478
지목 | 0.447 | 0.000 | 0.034 | 0.203 | 0.478 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

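First rows

Index | 번호 | 사업소 | 시군 | 소재지지번 | 지목 | 취득일자 | 면적(평방미터) | 2023년공시가
0 | 1 | 장성 | 영월군 | 강원영월군김삿갓면주문리산90-1 | 임야 | 2004-02-11 | 120 | 177600
1 | 2 | 장성 | 영월군 | 강원영월군김삿갓면주문리산90-2 | 임야 | 2004-02-11 | 1544 | 2732880
2 | 3 | 장성 | 영월군 | 강원영월군북면마차리산1-3 | 임야 | 1971-08-20 | 1811 | 1166284
3 | 4 | 장성 | 영월군 | 강원영월군북면마차리산15-4 | 임야 | 1971-08-20 | 1954 | 345858
4 | 5 | 장성 | 영월군 | 강원영월군북면마차리산26 | 임야 | 1972-09-28 | 40066 | 21555508
5 | 6 | 장성 | 영월군 | 강원영월군북면마차리산49 | 임야 | 1972-12-31 | 6942 | 2797626
6 | 7 | 장성 | 영월군 | 강원영월군북면마차리산50 | 임야 | 1972-12-31 | 238 | 97818
7 | 8 | 장성 | 영월군 | 강원영월군북면마차리산103-2 | 임야 | 1973-01-01 | 3 | 1260
8 | 9 | 장성 | 영월군 | 강원영월군북면마차리산127-1 | 임야 | 1973-09-25 | 198 | 140184
9 | 10 | 장성 | 영월군 | 강원영월군북면마차리산132-5 | 임야 | 1973-01-01 | 59686 | 26739328

Last rows

Index | 번호 | 사업소 | 시군 | 소재지지번 | 지목 | 취득일자 | 면적(평방미터) | 2023년공시가
732 | 733 | 장성 | 단양군 | 충북단양군대강면황정리산7-3 | 임야 | 1963-07-18 | 1235 | 1230060
733 | 734 | 장성 | 단양군 | 충북단양군적성면상리산38-5 | 임야 | 1963-07-18 | 394 | 437340
734 | 735 | 장성 | 단양군 | 충북단양군적성면애곡리산26-4 | 임야 | 1963-07-18 | 360 | 317880
735 | 736 | 도계 | 동해 | 강원특별자치도동해시발한동599-1 | (unrecovered) | 1990-12-31 | 221 | 43161300
736 | 737 | 호남 | 보령 | 충남보령시성주면성주리202-2 | 잡종지 | 1968-08-02 | 10 | 281000
737 | 738 | 호남 | 보령 | 충남보령시성주면성주리228 | 잡종지 | 1967-11-14 | 2711 | 12497710
738 | 739 | 호남 | 보령 | 충남보령시성주면성주리229 | 잡종지 | 1967-11-14 | 3613 | 5347240
739 | 740 | 호남 | 보령 | 충남보령시성주면성주리230 | 잡종지 | 1967-11-14 | 400 | 1476000
740 | 741 | 호남 | 보령 | 충남보령시성주면성주리265-340 | (unrecovered) | 2009-09-10 | 3597 | 22804980
741 | 742 | 호남 | 보령 | 충남보령시성주면성주리265-64 | 잡종지 | 1967-11-14 | 521 | 11253600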