Overview

Dataset statistics

Number of variables6
Number of observations31
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 KiB
Average record size in memory53.3 B

Variable types

Categorical3
Text1
Numeric1
Boolean1

Alerts

생성일 has constant value ""Constant
수정일 has constant value ""Constant
사용여부 has constant value ""Constant
지목면적(㎥) has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:16:41.700269
Analysis finished2023-12-10 13:16:42.521624
Duration0.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업코드
Categorical

Distinct6
Distinct (%)19.4%
Missing0
Missing (%)0.0%
Memory size380.0 B
DG2006C001
10 
DG2006B001
DG2005E013
DG2006A001
DG2005Q001
 
1

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique2 ?
Unique (%)6.5%

Sample

1st rowDG2005E013
2nd rowDG2005E013
3rd rowDG2005E013
4th rowDG2005E013
5th rowDG2005E013

Common Values

ValueCountFrequency (%)
DG2006C001 10
32.3%
DG2006B001 9
29.0%
DG2005E013 7
22.6%
DG2006A001 3
 
9.7%
DG2005Q001 1
 
3.2%
DG2006E009 1
 
3.2%

Length

2023-12-10T22:16:42.631660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:16:42.798282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
dg2006c001 10
32.3%
dg2006b001 9
29.0%
dg2005e013 7
22.6%
dg2006a001 3
 
9.7%
dg2005q001 1
 
3.2%
dg2006e009 1
 
3.2%
Distinct16
Distinct (%)51.6%
Missing0
Missing (%)0.0%
Memory size380.0 B
2023-12-10T22:16:43.059726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length2
Mean length2.0967742
Min length1

Characters and Unicode

Total characters65
Distinct characters29
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)29.0%

Sample

1st row
2nd row하천
3rd row기타
4th row제방
5th row도로
ValueCountFrequency (%)
하천 4
12.9%
4
12.9%
임야 4
12.9%
3
9.7%
도로 3
9.7%
기타 2
 
6.5%
구거 2
 
6.5%
제방 1
 
3.2%
구거지 1
 
3.2%
대지 1
 
3.2%
Other values (6) 6
19.4%
2023-12-10T22:16:43.552952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4
 
6.2%
4
 
6.2%
4
 
6.2%
4
 
6.2%
4
 
6.2%
4
 
6.2%
4
 
6.2%
3
 
4.6%
3
 
4.6%
3
 
4.6%
Other values (19) 28
43.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 63
96.9%
Open Punctuation 1
 
1.5%
Close Punctuation 1
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
3
 
4.8%
3
 
4.8%
3
 
4.8%
Other values (17) 26
41.3%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 63
96.9%
Common 2
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
3
 
4.8%
3
 
4.8%
3
 
4.8%
Other values (17) 26
41.3%
Common
ValueCountFrequency (%)
( 1
50.0%
) 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 63
96.9%
ASCII 2
 
3.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
4
 
6.3%
3
 
4.8%
3
 
4.8%
3
 
4.8%
Other values (17) 26
41.3%
ASCII
ValueCountFrequency (%)
( 1
50.0%
) 1
50.0%

지목면적(㎥)
Real number (ℝ)

UNIQUE 

Distinct31
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean67086.802
Minimum19.482
Maximum319985
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size411.0 B
2023-12-10T22:16:43.766425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19.482
5-th percentile121.1835
Q12207.5
median16128
Q352103.5
95-th percentile292115.5
Maximum319985
Range319965.52
Interquartile range (IQR)49896

Descriptive statistics

Standard deviation103983.04
Coefficient of variation (CV)1.5499776
Kurtosis0.91513163
Mean67086.802
Median Absolute Deviation (MAD)15910
Skewness1.5623603
Sum2079690.8
Variance1.0812473 × 1010
MonotonicityNot monotonic
2023-12-10T22:16:43.997512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
134165.0 1
 
3.2%
18980.0 1
 
3.2%
1135.0 1
 
3.2%
44956.0 1
 
3.2%
2312.0 1
 
3.2%
5004.0 1
 
3.2%
267490.0 1
 
3.2%
4368.0 1
 
3.2%
51797.0 1
 
3.2%
6223.0 1
 
3.2%
Other values (21) 21
67.7%
ValueCountFrequency (%)
19.482 1
3.2%
24.367 1
3.2%
218.0 1
3.2%
296.0 1
3.2%
650.0 1
3.2%
990.0 1
3.2%
1135.0 1
3.2%
2190.0 1
3.2%
2225.0 1
3.2%
2312.0 1
3.2%
ValueCountFrequency (%)
319985.0 1
3.2%
302402.0 1
3.2%
281829.0 1
3.2%
267490.0 1
3.2%
235869.0 1
3.2%
180900.0 1
3.2%
134165.0 1
3.2%
52410.0 1
3.2%
51797.0 1
3.2%
44956.0 1
3.2%

생성일
Categorical

CONSTANT 

Distinct1
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size380.0 B
2019-01-07
31 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-01-07
2nd row2019-01-07
3rd row2019-01-07
4th row2019-01-07
5th row2019-01-07

Common Values

ValueCountFrequency (%)
2019-01-07 31
100.0%

Length

2023-12-10T22:16:44.323224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:16:44.625342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-01-07 31
100.0%

수정일
Categorical

CONSTANT 

Distinct1
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size380.0 B
2019-12-03
31 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019-12-03
2nd row2019-12-03
3rd row2019-12-03
4th row2019-12-03
5th row2019-12-03

Common Values

ValueCountFrequency (%)
2019-12-03 31
100.0%

Length

2023-12-10T22:16:44.823388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:16:45.012059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2019-12-03 31
100.0%

사용여부
Boolean

CONSTANT 

Distinct1
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size163.0 B
True
31 
ValueCountFrequency (%)
True 31
100.0%
2023-12-10T22:16:45.174428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Interactions

2023-12-10T22:16:42.073986image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:16:45.270542image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업코드지목명지목면적(㎥)
사업코드1.0000.0000.549
지목명0.0001.0000.000
지목면적(㎥)0.5490.0001.000
2023-12-10T22:16:45.420983image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지목면적(㎥)사업코드
지목면적(㎥)1.0000.349
사업코드0.3491.000

Missing values

2023-12-10T22:16:42.278917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:16:42.446510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업코드지목명지목면적(㎥)생성일수정일사용여부
0DG2005E013134165.02019-01-072019-12-03Y
1DG2005E013하천18980.02019-01-072019-12-03Y
2DG2005E013기타8740.02019-01-072019-12-03Y
3DG2005E013제방52410.02019-01-072019-12-03Y
4DG2005E013도로38290.02019-01-072019-12-03Y
5DG2005E013650.02019-01-072019-12-03Y
6DG2005E013임야39160.02019-01-072019-12-03Y
7DG2005Q001하천180900.02019-01-072019-12-03Y
8DG2006A001임야319985.02019-01-072019-12-03Y
9DG2006A0012225.02019-01-072019-12-03Y
사업코드지목명지목면적(㎥)생성일수정일사용여부
21DG2006C001불부합296.02019-01-072019-12-03Y
22DG2006C001과수원990.02019-01-072019-12-03Y
23DG2006C001구거6223.02019-01-072019-12-03Y
24DG2006C00151797.02019-01-072019-12-03Y
25DG2006C001도로4368.02019-01-072019-12-03Y
26DG2006C001임야267490.02019-01-072019-12-03Y
27DG2006C001하천5004.02019-01-072019-12-03Y
28DG2006C001잡종지2312.02019-01-072019-12-03Y
29DG2006C00144956.02019-01-072019-12-03Y
30DG2006E009대(기협의)1135.02019-01-072019-12-03Y