Overview

Dataset statistics

Number of variables5
Number of observations45
Missing cells41
Missing cells (%)18.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory42.9 B

Variable types

Categorical3
Text2

Dataset

Description울산항의 시설물에 대한 안전등급 정보입니다. 시설물별, 시설물명, 안전등급이 포함되어 있습니다. 비고의 빈칸은 특이사항이 없는 시설입니다.
Author울산항만공사
URLhttps://www.data.go.kr/data/15105783/fileData.do

Alerts

안전등급 has constant value ""Constant
시설물별 is highly imbalanced (84.6%)Imbalance
비고 has 41 (91.1%) missing valuesMissing

Reproduction

Analysis started2023-12-12 21:57:28.137870
Analysis finished2023-12-12 21:57:28.549237
Duration0.41 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

종 별
Categorical

Distinct3
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size492.0 B
기타
26 
2종
18 
1종
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique1 ?
Unique (%)2.2%

Sample

1st row1종
2nd row2종
3rd row2종
4th row2종
5th row2종

Common Values

ValueCountFrequency (%)
기타 26
57.8%
2종 18
40.0%
1종 1
 
2.2%

Length

2023-12-13T06:57:28.627577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:28.730582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기타 26
57.8%
2종 18
40.0%
1종 1
 
2.2%

시설물별
Categorical

IMBALANCE 

Distinct2
Distinct (%)4.4%
Missing0
Missing (%)0.0%
Memory size492.0 B
계류시설
44 
부잔교
 
1

Length

Max length4
Median length4
Mean length3.9777778
Min length3

Unique

Unique1 ?
Unique (%)2.2%

Sample

1st row계류시설
2nd row계류시설
3rd row계류시설
4th row계류시설
5th row계류시설

Common Values

ValueCountFrequency (%)
계류시설 44
97.8%
부잔교 1
 
2.2%

Length

2023-12-13T06:57:28.853249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:28.968860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
계류시설 44
97.8%
부잔교 1
 
2.2%
Distinct43
Distinct (%)95.6%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-13T06:57:29.132058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length6.7111111
Min length4

Characters and Unicode

Total characters302
Distinct characters65
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)93.3%

Sample

1st row양곡부두
2nd row석탄부두
3rd row울산항 2부두(1)
4th row울산항 2부두(2)
5th row울산항 3부두
ValueCountFrequency (%)
울산항 13
 
17.8%
온산항 5
 
6.8%
1부두 4
 
5.5%
장생포부두 3
 
4.1%
2부두 3
 
4.1%
용잠 2
 
2.7%
장생포 2
 
2.7%
일반부두 2
 
2.7%
sk 2
 
2.7%
4부두 2
 
2.7%
Other values (34) 35
47.9%
2023-12-13T06:57:29.543477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
42
13.9%
41
 
13.6%
28
 
9.3%
19
 
6.3%
19
 
6.3%
14
 
4.6%
2 11
 
3.6%
) 8
 
2.6%
( 8
 
2.6%
8
 
2.6%
Other values (55) 104
34.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 223
73.8%
Decimal Number 29
 
9.6%
Space Separator 28
 
9.3%
Close Punctuation 8
 
2.6%
Open Punctuation 8
 
2.6%
Uppercase Letter 6
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
42
18.8%
41
18.4%
19
 
8.5%
19
 
8.5%
14
 
6.3%
8
 
3.6%
8
 
3.6%
6
 
2.7%
5
 
2.2%
5
 
2.2%
Other values (40) 56
25.1%
Decimal Number
ValueCountFrequency (%)
2 11
37.9%
1 6
20.7%
4 3
 
10.3%
3 3
 
10.3%
6 2
 
6.9%
7 1
 
3.4%
9 1
 
3.4%
5 1
 
3.4%
8 1
 
3.4%
Uppercase Letter
ValueCountFrequency (%)
S 3
50.0%
K 2
33.3%
T 1
 
16.7%
Space Separator
ValueCountFrequency (%)
28
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 223
73.8%
Common 73
 
24.2%
Latin 6
 
2.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
42
18.8%
41
18.4%
19
 
8.5%
19
 
8.5%
14
 
6.3%
8
 
3.6%
8
 
3.6%
6
 
2.7%
5
 
2.2%
5
 
2.2%
Other values (40) 56
25.1%
Common
ValueCountFrequency (%)
28
38.4%
2 11
 
15.1%
) 8
 
11.0%
( 8
 
11.0%
1 6
 
8.2%
4 3
 
4.1%
3 3
 
4.1%
6 2
 
2.7%
7 1
 
1.4%
9 1
 
1.4%
Other values (2) 2
 
2.7%
Latin
ValueCountFrequency (%)
S 3
50.0%
K 2
33.3%
T 1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 223
73.8%
ASCII 79
 
26.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
42
18.8%
41
18.4%
19
 
8.5%
19
 
8.5%
14
 
6.3%
8
 
3.6%
8
 
3.6%
6
 
2.7%
5
 
2.2%
5
 
2.2%
Other values (40) 56
25.1%
ASCII
ValueCountFrequency (%)
28
35.4%
2 11
 
13.9%
) 8
 
10.1%
( 8
 
10.1%
1 6
 
7.6%
S 3
 
3.8%
4 3
 
3.8%
3 3
 
3.8%
6 2
 
2.5%
K 2
 
2.5%
Other values (5) 5
 
6.3%

비고
Text

MISSING 

Distinct4
Distinct (%)100.0%
Missing41
Missing (%)91.1%
Memory size492.0 B
2023-12-13T06:57:29.749556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length6
Mean length6.5
Min length3

Characters and Unicode

Total characters26
Distinct characters17
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)100.0%

Sample

1st row1,2번 선석
2nd row통선장
3rd row역무선부두
4th row역무선 및 어선물양장
ValueCountFrequency (%)
1,2번 1
14.3%
선석 1
14.3%
통선장 1
14.3%
역무선부두 1
14.3%
역무선 1
14.3%
1
14.3%
어선물양장 1
14.3%
2023-12-13T06:57:30.103383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5
19.2%
3
11.5%
2
 
7.7%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (7) 7
26.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20
76.9%
Space Separator 3
 
11.5%
Decimal Number 2
 
7.7%
Other Punctuation 1
 
3.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
25.0%
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (3) 3
15.0%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
2 1
50.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20
76.9%
Common 6
 
23.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
25.0%
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (3) 3
15.0%
Common
ValueCountFrequency (%)
3
50.0%
1 1
 
16.7%
, 1
 
16.7%
2 1
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20
76.9%
ASCII 6
 
23.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5
25.0%
2
 
10.0%
2
 
10.0%
2
 
10.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
1
 
5.0%
Other values (3) 3
15.0%
ASCII
ValueCountFrequency (%)
3
50.0%
1 1
 
16.7%
, 1
 
16.7%
2 1
 
16.7%

안전등급
Categorical

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
B(양호)
45 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowB(양호)
2nd rowB(양호)
3rd rowB(양호)
4th rowB(양호)
5th rowB(양호)

Common Values

ValueCountFrequency (%)
B(양호) 45
100.0%

Length

2023-12-13T06:57:30.270560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T06:57:30.398766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
b(양호 45
100.0%

Correlations

2023-12-13T06:57:30.481454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종 별시설물별시설물명비고
종 별1.0000.0001.0001.000
시설물별0.0001.0001.000NaN
시설물명1.0001.0001.0001.000
비고1.000NaN1.0001.000
2023-12-13T06:57:30.586863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종 별시설물별
종 별1.0000.000
시설물별0.0001.000
2023-12-13T06:57:30.684001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종 별시설물별
종 별1.0000.000
시설물별0.0001.000

Missing values

2023-12-13T06:57:28.397587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T06:57:28.502858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

종 별시설물별시설물명비고안전등급
01종계류시설양곡부두<NA>B(양호)
12종계류시설석탄부두<NA>B(양호)
22종계류시설울산항 2부두(1)<NA>B(양호)
32종계류시설울산항 2부두(2)<NA>B(양호)
42종계류시설울산항 3부두<NA>B(양호)
52종계류시설울산항 4부두<NA>B(양호)
62종계류시설울산항 5부두<NA>B(양호)
72종계류시설울산항 6부두<NA>B(양호)
82종계류시설울산항 6부두(2)<NA>B(양호)
92종계류시설울산항 7부두<NA>B(양호)
종 별시설물별시설물명비고안전등급
35기타계류시설매암부두<NA>B(양호)
36기타계류시설신항 예부선부두<NA>B(양호)
37기타계류시설장생포 소형선부두<NA>B(양호)
38기타계류시설온산항 잡종선부두<NA>B(양호)
39기타계류시설남화 예선부두<NA>B(양호)
40기타계류시설일반부두 물양장<NA>B(양호)
41기타계류시설한전물양장(1)<NA>B(양호)
42기타계류시설한전물양장(2)<NA>B(양호)
43기타계류시설미포조선안벽<NA>B(양호)
44기타부잔교함선(울산2호)<NA>B(양호)