Overview

Dataset statistics

Number of variables4
Number of observations38
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory37.5 B

Variable types

Numeric2
Text1
Categorical1

Dataset

Description해양수산 관련하여 해양용도구역 최초 고시인 부산용도구역에 대한 데이터로 항만항행구역 정보를 파일 형태로 사용자는 확인 할 수 있다.
URLhttps://www.data.go.kr/data/15113948/fileData.do

Alerts

용도구역면적(ua_ar) is highly overall correlated with 용도구역상세정보(ua_dt_dc)High correlation
용도구역상세정보(ua_dt_dc) is highly overall correlated with 용도구역면적(ua_ar)High correlation
용도구역상세정보(ua_dt_dc) is highly imbalanced (73.8%)Imbalance
공간정보일련번호(gid) has unique valuesUnique
용도구역면적(ua_ar) has 30 (78.9%) zerosZeros

Reproduction

Analysis started2023-12-12 12:39:00.242775
Analysis finished2023-12-12 12:39:01.066222
Duration0.82 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

공간정보일련번호(gid)
Real number (ℝ)

UNIQUE 

Distinct38
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.5
Minimum1
Maximum38
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size474.0 B
2023-12-12T21:39:01.147206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.85
Q110.25
median19.5
Q328.75
95-th percentile36.15
Maximum38
Range37
Interquartile range (IQR)18.5

Descriptive statistics

Standard deviation11.113055
Coefficient of variation (CV)0.56990028
Kurtosis-1.2
Mean19.5
Median Absolute Deviation (MAD)9.5
Skewness0
Sum741
Variance123.5
MonotonicityNot monotonic
2023-12-12T21:39:01.570943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=38)
ValueCountFrequency (%)
1 1
 
2.6%
30 1
 
2.6%
20 1
 
2.6%
21 1
 
2.6%
25 1
 
2.6%
26 1
 
2.6%
27 1
 
2.6%
28 1
 
2.6%
29 1
 
2.6%
31 1
 
2.6%
Other values (28) 28
73.7%
ValueCountFrequency (%)
1 1
2.6%
2 1
2.6%
3 1
2.6%
4 1
2.6%
5 1
2.6%
6 1
2.6%
7 1
2.6%
8 1
2.6%
9 1
2.6%
10 1
2.6%
ValueCountFrequency (%)
38 1
2.6%
37 1
2.6%
36 1
2.6%
35 1
2.6%
34 1
2.6%
33 1
2.6%
32 1
2.6%
31 1
2.6%
30 1
2.6%
29 1
2.6%
Distinct37
Distinct (%)97.4%
Missing0
Missing (%)0.0%
Memory size436.0 B
2023-12-12T21:39:01.784580image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length21
Mean length21
Min length21

Characters and Unicode

Total characters798
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)94.7%

Sample

1st rowSS350143N1290733E2020
2nd rowSS350249N1285831E2020
3rd rowSS351949N1292751E2020
4th rowSS350627N1284704E2020
5th rowSS350630N1290736E2020
ValueCountFrequency (%)
ss350348n1285006e2020 2
 
5.3%
ss350143n1290733e2020 1
 
2.6%
ss350354n1285006e2020 1
 
2.6%
ss350360n1285025e2020 1
 
2.6%
ss350358n1285025e2020 1
 
2.6%
ss350357n1285025e2020 1
 
2.6%
ss350356n1285002e2020 1
 
2.6%
ss350355n1285026e2020 1
 
2.6%
ss350353n1285027e2020 1
 
2.6%
ss350352n1285006e2020 1
 
2.6%
Other values (27) 27
71.1%
2023-12-12T21:39:02.085795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 172
21.6%
2 140
17.5%
5 92
11.5%
S 76
9.5%
3 72
9.0%
1 57
 
7.1%
8 42
 
5.3%
N 38
 
4.8%
E 38
 
4.8%
4 27
 
3.4%
Other values (3) 44
 
5.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 646
81.0%
Uppercase Letter 152
 
19.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 172
26.6%
2 140
21.7%
5 92
14.2%
3 72
11.1%
1 57
 
8.8%
8 42
 
6.5%
4 27
 
4.2%
7 18
 
2.8%
6 16
 
2.5%
9 10
 
1.5%
Uppercase Letter
ValueCountFrequency (%)
S 76
50.0%
N 38
25.0%
E 38
25.0%

Most occurring scripts

ValueCountFrequency (%)
Common 646
81.0%
Latin 152
 
19.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 172
26.6%
2 140
21.7%
5 92
14.2%
3 72
11.1%
1 57
 
8.8%
8 42
 
6.5%
4 27
 
4.2%
7 18
 
2.8%
6 16
 
2.5%
9 10
 
1.5%
Latin
ValueCountFrequency (%)
S 76
50.0%
N 38
25.0%
E 38
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 798
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 172
21.6%
2 140
17.5%
5 92
11.5%
S 76
9.5%
3 72
9.0%
1 57
 
7.1%
8 42
 
5.3%
N 38
 
4.8%
E 38
 
4.8%
4 27
 
3.4%
Other values (3) 44
 
5.5%

용도구역면적(ua_ar)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct8
Distinct (%)21.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.687368
Minimum0
Maximum386.74
Zeros30
Zeros (%)78.9%
Negative0
Negative (%)0.0%
Memory size474.0 B
2023-12-12T21:39:02.194822image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile9.699
Maximum386.74
Range386.74
Interquartile range (IQR)0

Descriptive statistics

Standard deviation63.102055
Coefficient of variation (CV)5.3991671
Kurtosis36.419178
Mean11.687368
Median Absolute Deviation (MAD)0
Skewness5.9923624
Sum444.12
Variance3981.8694
MonotonicityNot monotonic
2023-12-12T21:39:02.284133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0.0 30
78.9%
0.01 2
 
5.3%
386.74 1
 
2.6%
0.05 1
 
2.6%
54.29 1
 
2.6%
1.83 1
 
2.6%
0.11 1
 
2.6%
1.08 1
 
2.6%
ValueCountFrequency (%)
0.0 30
78.9%
0.01 2
 
5.3%
0.05 1
 
2.6%
0.11 1
 
2.6%
1.08 1
 
2.6%
1.83 1
 
2.6%
54.29 1
 
2.6%
386.74 1
 
2.6%
ValueCountFrequency (%)
386.74 1
 
2.6%
54.29 1
 
2.6%
1.83 1
 
2.6%
1.08 1
 
2.6%
0.11 1
 
2.6%
0.05 1
 
2.6%
0.01 2
 
5.3%
0.0 30
78.9%

용도구역상세정보(ua_dt_dc)
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Memory size436.0 B
무역항
35 
무역항 연안항 항로구기능구 교통안전특정해역 통항분리제도 양길항로부 묘박지 부산신항예정지
 
1
교통안전특정해역_울산
 
1
내용없음
 
1

Length

Max length55
Median length3
Mean length4.6052632
Min length3

Unique

Unique3 ?
Unique (%)7.9%

Sample

1st row무역항 연안항 항로구기능구 교통안전특정해역 통항분리제도 양길항로부 묘박지 부산신항예정지
2nd row무역항
3rd row교통안전특정해역_울산
4th row무역항
5th row무역항

Common Values

ValueCountFrequency (%)
무역항 35
92.1%
무역항 연안항 항로구기능구 교통안전특정해역 통항분리제도 양길항로부 묘박지 부산신항예정지 1
 
2.6%
교통안전특정해역_울산 1
 
2.6%
내용없음 1
 
2.6%

Length

2023-12-12T21:39:02.388865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:39:02.486578image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
무역항 36
80.0%
연안항 1
 
2.2%
항로구기능구 1
 
2.2%
교통안전특정해역 1
 
2.2%
통항분리제도 1
 
2.2%
양길항로부 1
 
2.2%
묘박지 1
 
2.2%
부산신항예정지 1
 
2.2%
교통안전특정해역_울산 1
 
2.2%
내용없음 1
 
2.2%

Interactions

2023-12-12T21:39:00.639491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:39:00.386308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:39:00.776904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T21:39:00.500632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:39:02.573756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간정보일련번호(gid)용도구역아이디(ua_id)용도구역면적(ua_ar)용도구역상세정보(ua_dt_dc)
공간정보일련번호(gid)1.0001.0000.0000.000
용도구역아이디(ua_id)1.0001.0001.0001.000
용도구역면적(ua_ar)0.0001.0001.0001.000
용도구역상세정보(ua_dt_dc)0.0001.0001.0001.000
2023-12-12T21:39:02.665991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
공간정보일련번호(gid)용도구역면적(ua_ar)용도구역상세정보(ua_dt_dc)
공간정보일련번호(gid)1.000-0.4170.000
용도구역면적(ua_ar)-0.4171.0000.986
용도구역상세정보(ua_dt_dc)0.0000.9861.000

Missing values

2023-12-12T21:39:00.923938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:39:01.028435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

공간정보일련번호(gid)용도구역아이디(ua_id)용도구역면적(ua_ar)용도구역상세정보(ua_dt_dc)
01SS350143N1290733E2020386.74무역항 연안항 항로구기능구 교통안전특정해역 통항분리제도 양길항로부 묘박지 부산신항예정지
12SS350249N1285831E20200.05무역항
23SS351949N1292751E202054.29교통안전특정해역_울산
34SS350627N1284704E20201.83무역항
45SS350630N1290736E20200.0무역항
56SS350601N1284753E20200.01무역항
613SS350402N1285020E20200.0무역항
722SS350357N1285003E20200.0무역항
824SS350356N1285006E20200.0무역항
97SS350550N1284804E20200.11무역항
공간정보일련번호(gid)용도구역아이디(ua_id)용도구역면적(ua_ar)용도구역상세정보(ua_dt_dc)
2829SS350352N1285006E20200.0무역항
2930SS350333N1285024E20201.08무역항
3031SS350351N1285007E20200.0무역항
3132SS350350N1285007E20200.0무역항
3233SS350348N1285006E20200.0무역항
3334SS350348N1285006E20200.0무역항
3435SS350347N1285007E20200.0무역항
3536SS350329N1290446E20200.0무역항
3637SS350257N1285737E20200.01무역항
3738SS350259N1285732E20200.0무역항