Overview

Dataset statistics

Number of variables: 6
Number of observations: 208
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.1 KiB
Average record size in memory: 49.6 B

Variable types

Text: 4
Categorical: 1
Numeric: 1

Dataset

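Description: Status of land-based aquaculture farms installed in Wando-gun, Jeollanam-do (permit holder name, permit number, seafood item, tank area, facility size, farm location)
Author: Wando-gun, Jeollanam-do
URL: https://www.data.go.kr/data/15106760/fileData.do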

Alerts

Imbalance: 수산물품목 (seafood item) is highly imbalanced (50.0%)
Unique: 인허가번호 (permit number) has unique values
Unique: 수조면적 (tank area) has unique values

Reproduction

Analysis started: 2023-12-12 02:42:01.917779
Analysis finished: 2023-12-12 02:42:02.521617
Duration: 0.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
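The reported duration can be checked from the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of this report.
started = datetime.fromisoformat("2023-12-12 02:42:01.917779")
finished = datetime.fromisoformat("2023-12-12 02:42:02.521617")

# Elapsed wall-clock time in seconds between start and finish.
duration_seconds = (finished - started).total_seconds()
print(round(duration_seconds, 1))
```

Rounded to one decimal place, the result agrees with the 0.6 seconds shown in the report.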

Variables

양식장 허가자 성명 (fish farm permit holder name)
Text

Distinct: 184
Distinct (%): 88.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 17
Median length: 3
Mean length: 4.9375
Min length: 2

Characters and Unicode

Total characters: 1027
Distinct characters: 161
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
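As an illustration of how such character properties can be read, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It is not the profiler's own code; the helper name `category_counts` is hypothetical, and the two input strings are the first two sample values of this variable. The category code `Lo` corresponds to "Other Letter" and `Po` to "Other Punctuation" in the tables of this report.

```python
import unicodedata
from collections import Counter

# First two sample values of this variable, as listed in the Sample section.
values = ["섬전복영어조합법인", "이*숙"]

def category_counts(strings):
    """Count Unicode general categories (e.g. 'Lo', 'Po') over all characters."""
    return Counter(unicodedata.category(ch) for s in strings for ch in s)

counts = category_counts(values)
print(counts)
```

Hangul syllables fall into `Lo`, while the masking asterisk falls into `Po`, which is why "Other Punctuation" is the second largest category for this column.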

Unique

Unique: 167
Unique (%): 80.3%

Sample

1st row: 섬전복영어조합법인
2nd row: 이*숙
3rd row: 박*희
4th row: 정*양
5th row: 이*희

Value | Count | Frequency (%)
김*주 | 4 | 1.6%
김*현 | 3 | 1.2%
이*주 | 3 | 1.2%
정*호 | 3 | 1.2%
김*순 | 3 | 1.2%
영어조합법인 | 3 | 1.2%
어업회사법인 | 3 | 1.2%
김*진 | 3 | 1.2%
이*철 | 3 | 1.2%
이*희 | 3 | 1.2%
Other values (197) | 217 | 87.5%

Most occurring characters

ValueCountFrequency (%)
* 199
19.4%
62
 
6.0%
40
 
3.9%
39
 
3.8%
39
 
3.8%
38
 
3.7%
37
 
3.6%
34
 
3.3%
34
 
3.3%
29
 
2.8%
Other values (151) 476
46.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 764 | 74.4%
Other Punctuation | 201 | 19.6%
Space Separator | 40 | 3.9%
Decimal Number | 8 | 0.8%
Other Symbol | 6 | 0.6%
Close Punctuation | 4 | 0.4%
Open Punctuation | 4 | 0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
8.1%
39
 
5.1%
39
 
5.1%
38
 
5.0%
37
 
4.8%
34
 
4.5%
34
 
4.5%
29
 
3.8%
24
 
3.1%
21
 
2.7%
Other values (142) 407
53.3%
Decimal Number
ValueCountFrequency (%)
1 5
62.5%
2 2
 
25.0%
3 1
 
12.5%
Other Punctuation
ValueCountFrequency (%)
* 199
99.0%
, 2
 
1.0%
Space Separator
ValueCountFrequency (%)
40
100.0%
Other Symbol
ValueCountFrequency (%)
6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 770 | 75.0%
Common | 257 | 25.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
8.1%
39
 
5.1%
39
 
5.1%
38
 
4.9%
37
 
4.8%
34
 
4.4%
34
 
4.4%
29
 
3.8%
24
 
3.1%
21
 
2.7%
Other values (143) 413
53.6%
Common
ValueCountFrequency (%)
* 199
77.4%
40
 
15.6%
1 5
 
1.9%
) 4
 
1.6%
( 4
 
1.6%
, 2
 
0.8%
2 2
 
0.8%
3 1
 
0.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 764 | 74.4%
ASCII | 257 | 25.0%
None | 6 | 0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 199
77.4%
40
 
15.6%
1 5
 
1.9%
) 4
 
1.6%
( 4
 
1.6%
, 2
 
0.8%
2 2
 
0.8%
3 1
 
0.4%
Hangul
ValueCountFrequency (%)
62
 
8.1%
39
 
5.1%
39
 
5.1%
38
 
5.0%
37
 
4.8%
34
 
4.5%
34
 
4.5%
29
 
3.8%
24
 
3.1%
21
 
2.7%
Other values (142) 407
53.3%
None
ValueCountFrequency (%)
6
100.0%

인허가번호 (permit number)
Text

UNIQUE

Distinct: 208
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Characters and Unicode

Total characters: 1456
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 208
Unique (%): 100.0%

Sample

1st row: 2017-30
2nd row: 2017-32
3rd row: 2017-34
4th row: 2017-35
5th row: 2017-36

Value | Count | Frequency (%)
2017-30 | 1 | 0.5%
2017-32 | 1 | 0.5%
2021-18 | 1 | 0.5%
2021-05 | 1 | 0.5%
2021-06 | 1 | 0.5%
2021-07 | 1 | 0.5%
2021-08 | 1 | 0.5%
2021-10 | 1 | 0.5%
2021-11 | 1 | 0.5%
2021-12 | 1 | 0.5%
Other values (198) | 198 | 95.2%

Most occurring characters

Value | Count | Frequency (%)
2 | 456 | 31.3%
0 | 323 | 22.2%
- | 208 | 14.3%
1 | 179 | 12.3%
3 | 71 | 4.9%
8 | 54 | 3.7%
9 | 49 | 3.4%
4 | 36 | 2.5%
5 | 31 | 2.1%
7 | 28 | 1.9%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1248 | 85.7%
Dash Punctuation | 208 | 14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 456
36.5%
0 323
25.9%
1 179
 
14.3%
3 71
 
5.7%
8 54
 
4.3%
9 49
 
3.9%
4 36
 
2.9%
5 31
 
2.5%
7 28
 
2.2%
6 21
 
1.7%
Dash Punctuation
ValueCountFrequency (%)
- 208
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1456
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 456
31.3%
0 323
22.2%
- 208
14.3%
1 179
 
12.3%
3 71
 
4.9%
8 54
 
3.7%
9 49
 
3.4%
4 36
 
2.5%
5 31
 
2.1%
7 28
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1456
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 456
31.3%
0 323
22.2%
- 208
14.3%
1 179
 
12.3%
3 71
 
4.9%
8 54
 
3.7%
9 49
 
3.4%
4 36
 
2.5%
5 31
 
2.1%
7 28
 
1.9%

수산물품목 (seafood item)
Categorical

IMBALANCE

Distinct: 22
Distinct (%): 10.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

넙치: 131
어류: 17
전복: 12
새우: 11
어류(넙치 등): 11
Other values (17): 26

Length

Max length: 12
Median length: 2
Mean length: 2.7884615
Min length: 2

Unique

Unique: 14
Unique (%): 6.7%

Sample

1st row: 넙치
2nd row: 어류
3rd row: 어류
4th row: 어류
5th row: 어류

Common Values

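Value | Count | Frequency (%)
넙치 (olive flounder) | 131 | 63.0%
어류 (fish) | 17 | 8.2%
전복 (abalone) | 12 | 5.8%
새우 (shrimp) | 11 | 5.3%
어류(넙치 등) (fish: olive flounder, etc.) | 11 | 5.3%
어류(넙치) (fish: olive flounder) | 8 | 3.8%
조기 (croaker) | 2 | 1.0%
넙치,전복 (olive flounder, abalone) | 2 | 1.0%
전복,문어 (abalone, octopus) | 1 | 0.5%
어류(넙치 등), 해삼 (fish: olive flounder, etc.; sea cucumber) | 1 | 0.5%
Other values (12) | 12 | 5.8%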
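The 50.0% imbalance figure in the Alerts section can be approximated from these counts. The sketch below assumes the score is one minus the Shannon entropy of the class frequencies divided by its maximum, log(k) for k classes. That formula is an assumption about how the figure is derived, not something stated in this report, and the function name `imbalance_score` is hypothetical. The 14 singleton classes are taken from the "Unique 14" statistic above.

```python
import math

# Counts from the Common Values table; the 14 singletons come from "Unique 14".
counts = [131, 17, 12, 11, 11, 8, 2, 2] + [1] * 14

def imbalance_score(class_counts):
    """Return 1 - H / H_max, where H is Shannon entropy and H_max = log(k)."""
    total = sum(class_counts)
    k = len(class_counts)
    if k < 2:
        return 0.0  # a single class has no entropy to normalise
    entropy = -sum((c / total) * math.log(c / total) for c in class_counts if c > 0)
    return 1.0 - entropy / math.log(k)

score = imbalance_score(counts)
print(f"{score:.1%}")
```

Under this assumption a score of 0 means perfectly balanced classes and a score of 1 means a single class holds every row; a value near 0.5 reflects that 넙치 alone covers 63.0% of the 208 rows.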

Length

Histogram of lengths of the category
ValueCountFrequency (%)
넙치 131
59.3%
어류(넙치 20
 
9.0%
어류 17
 
7.7%
전복 12
 
5.4%
12
 
5.4%
새우 11
 
5.0%
조기 2
 
0.9%
넙치,전복 2
 
0.9%
넙치,조기 1
 
0.5%
넙치,도다리 1
 
0.5%
Other values (12) 12
 
5.4%

수조면적 (tank area)
Real number (ℝ)

UNIQUE

Distinct: 208
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4054.6092
Minimum: 100.21
Maximum: 27232
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.0 KiB

Quantile statistics

Minimum: 100.21
5th percentile: 563.677
Q1: 1931.01
Median: 3408
Q3: 5044.4425
95th percentile: 9165.5035
Maximum: 27232
Range: 27131.79
Interquartile range (IQR): 3113.4325

Descriptive statistics

Standard deviation: 3436.1415
Coefficient of variation (CV): 0.84746554
Kurtosis: 18.442509
Mean: 4054.6092
Median Absolute Deviation (MAD): 1539.01
Skewness: 3.2833072
Sum: 843358.71
Variance: 11807069
Monotonicity: Not monotonic
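Several of these figures are derived from the others, so they can be cross-checked with simple arithmetic. The sketch below recomputes the mean, range, IQR, coefficient of variation and variance from the values reported for 수조면적; the variable names are illustrative, and small differences in the last digits are expected because the inputs are already rounded.

```python
import math

# Values copied from the statistics reported for 수조면적 (tank area).
n = 208
total = 843358.71
minimum, maximum = 100.21, 27232.0
q1, q3 = 1931.01, 5044.4425
std = 3436.1415

mean = total / n                 # Sum / number of observations
value_range = maximum - minimum  # Maximum - Minimum
iqr = q3 - q1                    # Q3 - Q1
cv = std / mean                  # coefficient of variation = std / mean
variance = std ** 2              # variance = square of the standard deviation

print(mean, value_range, iqr, cv, variance)
```

A coefficient of variation close to 0.85, together with a skewness above 3, indicates a strongly right-skewed distribution: most tanks are a few thousand units in area, while two values above 27000 stretch the upper tail.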
Histogram with fixed size bins (bins=50)
Most frequent values

Value | Count | Frequency (%)
4290.22 | 1 | 0.5%
3000.0 | 1 | 0.5%
1900.0 | 1 | 0.5%
232.8 | 1 | 0.5%
1575.63 | 1 | 0.5%
4115.92 | 1 | 0.5%
1612.8 | 1 | 0.5%
2119.41 | 1 | 0.5%
6758.62 | 1 | 0.5%
4213.0 | 1 | 0.5%
Other values (198) | 198 | 95.2%

Smallest 10 values

Value | Count | Frequency (%)
100.21 | 1 | 0.5%
123.98 | 1 | 0.5%
144.0 | 1 | 0.5%
232.8 | 1 | 0.5%
273.0 | 1 | 0.5%
300.0 | 1 | 0.5%
351.78 | 1 | 0.5%
412.06 | 1 | 0.5%
425.36 | 1 | 0.5%
432.82 | 1 | 0.5%

Largest 10 values

Value | Count | Frequency (%)
27232.0 | 1 | 0.5%
27089.79 | 1 | 0.5%
13169.47 | 1 | 0.5%
12120.0 | 1 | 0.5%
11285.64 | 1 | 0.5%
10893.0 | 1 | 0.5%
10869.21 | 1 | 0.5%
9600.03 | 1 | 0.5%
9446.2 | 1 | 0.5%
9271.5 | 1 | 0.5%

시설규모 (facility size)
Text

Distinct: 205
Distinct (%): 98.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 8
Median length: 7
Mean length: 5.8125
Min length: 1

Characters and Unicode

Total characters: 1209
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 204
Unique (%): 98.1%

Sample

1st row: 4969.73
2nd row: 5349.86
3rd row: 5211.94
4th row: 1208
5th row: 4866.02

Value | Count | Frequency (%)
0 | 4 | 1.9%
9061 | 1 | 0.5%
2021 | 1 | 0.5%
404 | 1 | 0.5%
1798.84 | 1 | 0.5%
4460.86 | 1 | 0.5%
1893.17 | 1 | 0.5%
2445 | 1 | 0.5%
7273.44 | 1 | 0.5%
4692 | 1 | 0.5%
Other values (195) | 195 | 93.8%

Most occurring characters

Value | Count | Frequency (%)
. | 140 | 11.6%
3 | 135 | 11.2%
2 | 130 | 10.8%
1 | 125 | 10.3%
5 | 114 | 9.4%
4 | 106 | 8.8%
7 | 101 | 8.4%
6 | 97 | 8.0%
8 | 91 | 7.5%
0 | 85 | 7.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1069 | 88.4%
Other Punctuation | 140 | 11.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 135
12.6%
2 130
12.2%
1 125
11.7%
5 114
10.7%
4 106
9.9%
7 101
9.4%
6 97
9.1%
8 91
8.5%
0 85
8.0%
9 85
8.0%
Other Punctuation
ValueCountFrequency (%)
. 140
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1209
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
. 140
11.6%
3 135
11.2%
2 130
10.8%
1 125
10.3%
5 114
9.4%
4 106
8.8%
7 101
8.4%
6 97
8.0%
8 91
7.5%
0 85
7.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1209
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 140
11.6%
3 135
11.2%
2 130
10.8%
1 125
10.3%
5 114
9.4%
4 106
8.8%
7 101
8.4%
6 97
8.0%
8 91
7.5%
0 85
7.0%
양식장위치 (fish farm location)
Text

Distinct: 207
Distinct (%): 99.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.8 KiB

Length

Max length: 118
Median length: 60
Mean length: 24.418269
Min length: 9

Characters and Unicode

Total characters: 5079
Distinct characters: 87
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 206
Unique (%): 99.0%

Sample

1st row: 신지면 대곡리 5-12, 5-39
2nd row: 생일면 유서리 1034외 6필지
3rd row: 신지면 신상리 636-1
4th row: 고금면 덕동리 24-1
5th row: 신지면 대곡리 1260외 4필지

Value | Count | Frequency (%)
신지면 | 76 | 7.9%
고금면 | 40 | 4.2%
완도읍 | 40 | 4.2%
군외면 | 21 | 2.2%
동고리 | 21 | 2.2%
월양리 | 21 | 2.2%
약산면 | 20 | 2.1%
신상리 | 16 | 1.7%
가교리 | 13 | 1.3%
죽청리 | 12 | 1.2%
Other values (467) | 683 | 70.9%

Most occurring characters

ValueCountFrequency (%)
762
 
15.0%
1 418
 
8.2%
- 373
 
7.3%
, 265
 
5.2%
2 225
 
4.4%
3 213
 
4.2%
206
 
4.1%
203
 
4.0%
4 192
 
3.8%
165
 
3.2%
Other values (77) 2057
40.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1842 | 36.3%
Other Letter | 1659 | 32.7%
Space Separator | 762 | 15.0%
Dash Punctuation | 373 | 7.3%
Other Punctuation | 265 | 5.2%
Open Punctuation | 89 | 1.8%
Close Punctuation | 89 | 1.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
206
12.4%
203
 
12.2%
165
 
9.9%
156
 
9.4%
126
 
7.6%
97
 
5.8%
61
 
3.7%
44
 
2.7%
44
 
2.7%
44
 
2.7%
Other values (60) 513
30.9%
Decimal Number
ValueCountFrequency (%)
1 418
22.7%
2 225
12.2%
3 213
11.6%
4 192
10.4%
5 165
 
9.0%
6 143
 
7.8%
7 137
 
7.4%
0 119
 
6.5%
9 118
 
6.4%
8 112
 
6.1%
Open Punctuation
ValueCountFrequency (%)
( 88
98.9%
[ 1
 
1.1%
Close Punctuation
ValueCountFrequency (%)
) 88
98.9%
] 1
 
1.1%
Space Separator
ValueCountFrequency (%)
762
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 373
100.0%
Other Punctuation
ValueCountFrequency (%)
, 265
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 3420 | 67.3%
Hangul | 1659 | 32.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
206
12.4%
203
 
12.2%
165
 
9.9%
156
 
9.4%
126
 
7.6%
97
 
5.8%
61
 
3.7%
44
 
2.7%
44
 
2.7%
44
 
2.7%
Other values (60) 513
30.9%
Common
ValueCountFrequency (%)
762
22.3%
1 418
12.2%
- 373
10.9%
, 265
 
7.7%
2 225
 
6.6%
3 213
 
6.2%
4 192
 
5.6%
5 165
 
4.8%
6 143
 
4.2%
7 137
 
4.0%
Other values (7) 527
15.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 3420 | 67.3%
Hangul | 1659 | 32.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
762
22.3%
1 418
12.2%
- 373
10.9%
, 265
 
7.7%
2 225
 
6.6%
3 213
 
6.2%
4 192
 
5.6%
5 165
 
4.8%
6 143
 
4.2%
7 137
 
4.0%
Other values (7) 527
15.4%
Hangul
ValueCountFrequency (%)
206
12.4%
203
 
12.2%
165
 
9.9%
156
 
9.4%
126
 
7.6%
97
 
5.8%
61
 
3.7%
44
 
2.7%
44
 
2.7%
44
 
2.7%
Other values (60) 513
30.9%

Interactions


Correlations

 | 수산물품목 | 수조면적
수산물품목 | 1.000 | 0.000
수조면적 | 0.000 | 1.000

 | 수조면적 | 수산물품목
수조면적 | 1.000 | 0.000
수산물품목 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 양식장 허가자 성명 | 인허가번호 | 수산물품목 | 수조면적 | 시설규모 | 양식장위치
0 | 섬전복영어조합법인 | 2017-30 | 넙치 | 4290.22 | 4969.73 | 신지면 대곡리 5-12, 5-39
1 | 이*숙 | 2017-32 | 어류 | 4958.02 | 5349.86 | 생일면 유서리 1034외 6필지
2 | 박*희 | 2017-34 | 어류 | 4536.0 | 5211.94 | 신지면 신상리 636-1
3 | 정*양 | 2017-35 | 어류 | 985.6 | 1208 | 고금면 덕동리 24-1
4 | 이*희 | 2017-36 | 어류 | 3899.95 | 4866.02 | 신지면 대곡리 1260외 4필지
5 | 주*윤 | 2017-37 | 넙치 | 1467.78 | 1748 | 신지면 동고리 619, 619-8
6 | 차*상 | 2017-38 | 넙치 | 3481.0 | 5254 | 신지면 동고리 746-1 외3필지
7 | 최*옥 | 2017-39 | 넙치 | 4460.0 | 5320 | 신지면 월양리 139, 2017
8 | 안*희 | 2018-01 | 어류 | 6615.0 | 7580.58 | 신지면 신상리 1174-14외 3필지
9 | 청하영어조합법인 송*숙 | 2018-02 | 어류 | 4640.13 | 5414.58 | 군외면 영풍리 486-1외 7필지

Last rows

Index | 양식장 허가자 성명 | 인허가번호 | 수산물품목 | 수조면적 | 시설규모 | 양식장위치
198 | 봉황수산 영어조합법인 | 2022-32 | 어류(넙치 등) | 3960.0 | 4505.4 | 군외면 영풍리 761외 1필지(761-2)
199 | 해득수산㈜ 최*영 | 2022-33 | 넙치 | 3270.61 | 3733.1 | 노화읍 신양리 113외 2필지(108, 108-1)
200 | 조*아 | 2022-34 | 어류(넙치 등) | 3382.69 | 3674.28 | 신지면 월양리 1136-40
201 | 강*희 | 2022-35 | 어류(넙치 등) | 8898.0 | 9782.97 | 신지면 대곡리 5-11외 2필지(-12, -46)
202 | 김*순 | 2022-36 | 어류(넙치 등) | 2994.78 | 3441.55 | 군외면 영풍리 759
203 | 임*석 | 2022-37 | 어류(넙치 등) | 11285.64 | 12452.75 | 신지면 대곡리 5외 3필지(5-8, 5-14, 11-1)
204 | 김*우 | 2022-38 | 어류(넙치 등) | 2038.76 | 2239.38 | 신지면 월양리 1136-3외 4필지(-4, -46, -48, -59)
205 | 박*리 | 2022-39 | 어류(넙치 등) | 2722.4 | 2951.46 | 고금면 가교리 114-24외 2필지(-33, -39)
206 | 이*선 | 2022-40 | 어류(넙치 등) | 3756.18 | 4275.49 | 군외면 영풍리 7-3외 2필지(7-44, 7-45)
207 | 서*헌 | 2022-41 | 어류(넙치 등) | 100.21 | 102.72 | 군외면 영풍리 386-5외 2필지(288-1, 288-2)