Overview

Dataset statistics

Number of variables5
Number of observations3599
Missing cells14
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory144.2 KiB
Average record size in memory41.0 B

Variable types

Numeric1
Text2
Categorical2

Dataset

Description해양수산에 관련하여 시도별 격자에 대하여 전남격자 정보를 담고 있으며 공간정보일련번호, 격자아이디, 레이어명등과같은 데이터를 제공한다.
Author해양수산부
URLhttps://www.data.go.kr/data/15113909/fileData.do

Alerts

레이어분류내용(lyr_cl_cn) is highly overall correlated with 공간정보일련번호(gid) and 1 other fieldsHigh correlation
레이어명(lyr_nm) is highly overall correlated with 공간정보일련번호(gid) and 1 other fieldsHigh correlation
공간정보일련번호(gid) is highly overall correlated with 레이어명(lyr_nm) and 1 other fieldsHigh correlation
레이어명(lyr_nm) is highly imbalanced (96.3%)Imbalance
레이어분류내용(lyr_cl_cn) is highly imbalanced (96.3%)Imbalance
공간정보일련번호(gid) has unique valuesUnique
특성평가격자아이디(msp_id) has unique valuesUnique

Reproduction

Analysis started2024-05-18 08:44:50.760678
Analysis finished2024-05-18 08:44:52.274749
Duration1.51 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공간정보일련번호(gid)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct3599
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1800
Minimum1
Maximum3599
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size31.8 KiB
2024-05-18T17:44:52.600446image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile180.9
Q1900.5
median1800
Q32699.5
95-th percentile3419.1
Maximum3599
Range3598
Interquartile range (IQR)1799

Descriptive statistics

Standard deviation1039.0861
Coefficient of variation (CV)0.57727008
Kurtosis-1.2
Mean1800
Median Absolute Deviation (MAD)900
Skewness0
Sum6478200
Variance1079700
MonotonicityStrictly increasing
2024-05-18T17:44:53.135328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
< 0.1%
2364 1
 
< 0.1%
2394 1
 
< 0.1%
2395 1
 
< 0.1%
2396 1
 
< 0.1%
2397 1
 
< 0.1%
2398 1
 
< 0.1%
2399 1
 
< 0.1%
2400 1
 
< 0.1%
2401 1
 
< 0.1%
Other values (3589) 3589
99.7%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
3599 1
< 0.1%
3598 1
< 0.1%
3597 1
< 0.1%
3596 1
< 0.1%
3595 1
< 0.1%
3594 1
< 0.1%
3593 1
< 0.1%
3592 1
< 0.1%
3591 1
< 0.1%
3590 1
< 0.1%
Distinct3599
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size28.2 KiB
2024-05-18T17:44:53.864588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length16
Mean length15.77077
Min length15

Characters and Unicode

Total characters56759
Distinct characters30
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3599 ?
Unique (%)100.0%

Sample

1st rowMSP_GR4_F4C13_L1
2nd rowMSP_GR4_F4C13_L3
3rd rowMSP_GR4_F4C13_L4
4th rowMSP_GR4_F4C13_M3
5th rowMSP_GR4_F4C13_M4
ValueCountFrequency (%)
msp_gr4_f4c13_l1 1
 
< 0.1%
msp_gr4_f4g34_e1 1
 
< 0.1%
msp_gr4_f4g34_j2 1
 
< 0.1%
msp_gr4_f4g33_g1 1
 
< 0.1%
msp_gr4_f4g33_g2 1
 
< 0.1%
msp_gr4_f4g33_h1 1
 
< 0.1%
msp_gr4_f4g33_h2 1
 
< 0.1%
msp_gr4_f4g33_i1 1
 
< 0.1%
msp_gr4_f4g33_i2 1
 
< 0.1%
msp_gr4_f4g34_h1 1
 
< 0.1%
Other values (3589) 3589
99.7%
2024-05-18T17:44:55.380636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
_ 10797
19.0%
4 8887
15.7%
G 4697
8.3%
F 4535
8.0%
P 3744
 
6.6%
M 3742
 
6.6%
R 3742
 
6.6%
S 3734
 
6.6%
3 3572
 
6.3%
2 2375
 
4.2%
Other values (20) 6934
12.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 28792
50.7%
Decimal Number 17170
30.3%
Connector Punctuation 10797
 
19.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
G 4697
16.3%
F 4535
15.8%
P 3744
13.0%
M 3742
13.0%
R 3742
13.0%
S 3734
13.0%
H 1099
 
3.8%
C 501
 
1.7%
B 337
 
1.2%
J 249
 
0.9%
Other values (15) 2412
8.4%
Decimal Number
ValueCountFrequency (%)
4 8887
51.8%
3 3572
20.8%
2 2375
 
13.8%
1 2336
 
13.6%
Connector Punctuation
ValueCountFrequency (%)
_ 10797
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 28792
50.7%
Common 27967
49.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
G 4697
16.3%
F 4535
15.8%
P 3744
13.0%
M 3742
13.0%
R 3742
13.0%
S 3734
13.0%
H 1099
 
3.8%
C 501
 
1.7%
B 337
 
1.2%
J 249
 
0.9%
Other values (15) 2412
8.4%
Common
ValueCountFrequency (%)
_ 10797
38.6%
4 8887
31.8%
3 3572
 
12.8%
2 2375
 
8.5%
1 2336
 
8.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56759
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
_ 10797
19.0%
4 8887
15.7%
G 4697
8.3%
F 4535
8.0%
P 3744
 
6.6%
M 3742
 
6.6%
R 3742
 
6.6%
S 3734
 
6.6%
3 3572
 
6.3%
2 2375
 
4.2%
Other values (20) 6934
12.2%
Distinct3585
Distinct (%)100.0%
Missing14
Missing (%)0.4%
Memory size28.2 KiB
2024-05-18T17:44:56.231779image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.77378
Min length11

Characters and Unicode

Total characters42209
Distinct characters30
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3585 ?
Unique (%)100.0%

Sample

1st rowGR4_F4C13_L1
2nd rowGR4_F4C13_L3
3rd rowGR4_F4C13_L4
4th rowGR4_F4C13_M3
5th rowGR4_F4C13_M4
ValueCountFrequency (%)
gr4_f4g13_b4 1
 
< 0.1%
gr4_f4g42_n1 1
 
< 0.1%
gr4_f4g33_h1 1
 
< 0.1%
gr4_f4f44_g2 1
 
< 0.1%
gr4_f4f44_h1 1
 
< 0.1%
gr4_f4f44_h2 1
 
< 0.1%
gr4_f4f44_i1 1
 
< 0.1%
gr4_f4f44_i2 1
 
< 0.1%
gr4_f4f44_j1 1
 
< 0.1%
gr4_f4f44_j2 1
 
< 0.1%
Other values (3575) 3575
99.7%
2024-05-18T17:44:57.742265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 8873
21.0%
_ 7170
17.0%
G 4683
11.1%
F 4521
10.7%
R 3728
8.8%
3 3558
8.4%
2 2362
 
5.6%
1 2321
 
5.5%
H 1099
 
2.6%
C 501
 
1.2%
Other values (20) 3393
 
8.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 17925
42.5%
Decimal Number 17114
40.5%
Connector Punctuation 7170
 
17.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
G 4683
26.1%
F 4521
25.2%
R 3728
20.8%
H 1099
 
6.1%
C 501
 
2.8%
B 336
 
1.9%
J 240
 
1.3%
K 239
 
1.3%
E 222
 
1.2%
L 221
 
1.2%
Other values (15) 2135
11.9%
Decimal Number
ValueCountFrequency (%)
4 8873
51.8%
3 3558
20.8%
2 2362
 
13.8%
1 2321
 
13.6%
Connector Punctuation
ValueCountFrequency (%)
_ 7170
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24284
57.5%
Latin 17925
42.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
G 4683
26.1%
F 4521
25.2%
R 3728
20.8%
H 1099
 
6.1%
C 501
 
2.8%
B 336
 
1.9%
J 240
 
1.3%
K 239
 
1.3%
E 222
 
1.2%
L 221
 
1.2%
Other values (15) 2135
11.9%
Common
ValueCountFrequency (%)
4 8873
36.5%
_ 7170
29.5%
3 3558
14.7%
2 2362
 
9.7%
1 2321
 
9.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 42209
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 8873
21.0%
_ 7170
17.0%
G 4683
11.1%
F 4521
10.7%
R 3728
8.8%
3 3558
8.4%
2 2362
 
5.6%
1 2321
 
5.5%
H 1099
 
2.6%
C 501
 
1.2%
Other values (20) 3393
 
8.0%

레이어명(lyr_nm)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size28.2 KiB
전남격자
3585 
<NA>
 
14

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전남격자
2nd row전남격자
3rd row전남격자
4th row전남격자
5th row전남격자

Common Values

ValueCountFrequency (%)
전남격자 3585
99.6%
<NA> 14
 
0.4%

Length

2024-05-18T17:44:58.368955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T17:44:58.706097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전남격자 3585
99.6%
na 14
 
0.4%

레이어분류내용(lyr_cl_cn)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size28.2 KiB
특성평가격자
3585 
<NA>
 
14

Length

Max length6
Median length6
Mean length5.9922201
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row특성평가격자
2nd row특성평가격자
3rd row특성평가격자
4th row특성평가격자
5th row특성평가격자

Common Values

ValueCountFrequency (%)
특성평가격자 3585
99.6%
<NA> 14
 
0.4%

Length

2024-05-18T17:44:59.177155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-18T17:44:59.545955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
특성평가격자 3585
99.6%
na 14
 
0.4%

Interactions

2024-05-18T17:44:51.240350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-05-18T17:44:59.741234image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간정보일련번호(gid)
공간정보일련번호(gid)1.000
2024-05-18T17:45:00.021235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
레이어분류내용(lyr_cl_cn)레이어명(lyr_nm)
레이어분류내용(lyr_cl_cn)1.0001.000
레이어명(lyr_nm)1.0001.000
2024-05-18T17:45:00.320475image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간정보일련번호(gid)레이어명(lyr_nm)레이어분류내용(lyr_cl_cn)
공간정보일련번호(gid)1.0001.0001.000
레이어명(lyr_nm)1.0001.0001.000
레이어분류내용(lyr_cl_cn)1.0001.0001.000

Missing values

2024-05-18T17:44:51.629606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-05-18T17:44:52.078061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공간정보일련번호(gid)특성평가격자아이디(msp_id)격자아이디(og_id)레이어명(lyr_nm)레이어분류내용(lyr_cl_cn)
01MSP_GR4_F4C13_L1GR4_F4C13_L1전남격자특성평가격자
12MSP_GR4_F4C13_L3GR4_F4C13_L3전남격자특성평가격자
23MSP_GR4_F4C13_L4GR4_F4C13_L4전남격자특성평가격자
34MSP_GR4_F4C13_M3GR4_F4C13_M3전남격자특성평가격자
45MSP_GR4_F4C13_M4GR4_F4C13_M4전남격자특성평가격자
56MSP_GR4_F4C13_T3GR4_F4C13_T3전남격자특성평가격자
67MSP_GR4_F4C13_Y1GR4_F4C13_Y1전남격자특성평가격자
78MSP_GR4_F4C13_Y2GR4_F4C13_Y2전남격자특성평가격자
89MSP_GR4_F4C14_U1GR4_F4C14_U1전남격자특성평가격자
910MSP_GR4_F4C14_U2GR4_F4C14_U2전남격자특성평가격자
공간정보일련번호(gid)특성평가격자아이디(msp_id)격자아이디(og_id)레이어명(lyr_nm)레이어분류내용(lyr_cl_cn)
35893590MSP_GR3_F4I22_T<NA><NA><NA>
35903591MSP_GR3_F4J11_P<NA><NA><NA>
35913592MSP_GR3_F4J11_U<NA><NA><NA>
35923593MSP_GR3_F4J11_V<NA><NA><NA>
35933594MSP_GR3_F4J11_W<NA><NA><NA>
35943595MSP_GR3_F4J11_X<NA><NA><NA>
35953596MSP_GR3_F4J11_Y<NA><NA><NA>
35963597MSP_GR3_F4J12_U<NA><NA><NA>
35973598MSP_GR3_F4J12_V<NA><NA><NA>
35983599MSP_GR3_F4J12_W<NA><NA><NA>