Overview

Dataset statistics

Number of variables4
Number of observations45
Missing cells43
Missing cells (%)23.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory34.9 B

Variable types

Text2
Categorical1
DateTime1

Dataset

Description제주특별자치도 서귀포시 관내 과수종자업체 현황에 관한 데이터로 과수종자업체 상호, 취급작물, 연락처 정보를 제공합니다.
Author제주특별자치도 서귀포시
URLhttps://www.data.go.kr/data/15049887/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
취급 작물 is highly imbalanced (67.2%)Imbalance
연락처 has 43 (95.6%) missing valuesMissing
상호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 16:05:28.953367
Analysis finished2023-12-12 16:05:29.277422
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

UNIQUE 

Distinct45
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size492.0 B
2023-12-13T01:05:29.457081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length4
Mean length5.6
Min length3

Characters and Unicode

Total characters252
Distinct characters92
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row천도종묘
2nd row제일종묘
3rd row석파농산
4th row자원종묘
5th row제주우리농원
ValueCountFrequency (%)
천도종묘 1
 
2.2%
감귤나라 1
 
2.2%
장인종묘 1
 
2.2%
오누이망고 1
 
2.2%
한미종묘 1
 
2.2%
안성종묘 1
 
2.2%
제주그린팜 1
 
2.2%
제주감귤묘목영농조합법인 1
 
2.2%
큰솔종묘 1
 
2.2%
청정제주녹차영농조합법인 1
 
2.2%
Other values (36) 36
78.3%
2023-12-13T01:05:29.925774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
22
 
8.7%
21
 
8.3%
16
 
6.3%
10
 
4.0%
10
 
4.0%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
Other values (82) 143
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 246
97.6%
Open Punctuation 2
 
0.8%
Close Punctuation 2
 
0.8%
Space Separator 1
 
0.4%
Other Symbol 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
22
 
8.9%
21
 
8.5%
16
 
6.5%
10
 
4.1%
10
 
4.1%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
Other values (78) 137
55.7%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 246
97.6%
Common 5
 
2.0%
Han 1
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
22
 
8.9%
21
 
8.5%
16
 
6.5%
10
 
4.1%
10
 
4.1%
7
 
2.8%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
Other values (78) 137
55.7%
Common
ValueCountFrequency (%)
( 2
40.0%
) 2
40.0%
1
20.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 245
97.2%
ASCII 5
 
2.0%
CJK 1
 
0.4%
None 1
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
22
 
9.0%
21
 
8.6%
16
 
6.5%
10
 
4.1%
10
 
4.1%
7
 
2.9%
6
 
2.4%
6
 
2.4%
6
 
2.4%
5
 
2.0%
Other values (77) 136
55.5%
ASCII
ValueCountFrequency (%)
( 2
40.0%
) 2
40.0%
1
20.0%
CJK
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
1
100.0%

취급 작물
Categorical

IMBALANCE 

Distinct4
Distinct (%)8.9%
Missing0
Missing (%)0.0%
Memory size492.0 B
감귤
40 
망고
 
3
열대과수류(바나나 등)
 
1
참다래
 
1

Length

Max length12
Median length2
Mean length2.2444444
Min length2

Unique

Unique2 ?
Unique (%)4.4%

Sample

1st row감귤
2nd row감귤
3rd row감귤
4th row감귤
5th row감귤

Common Values

ValueCountFrequency (%)
감귤 40
88.9%
망고 3
 
6.7%
열대과수류(바나나 등) 1
 
2.2%
참다래 1
 
2.2%

Length

2023-12-13T01:05:30.091942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:05:30.218232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
감귤 40
87.0%
망고 3
 
6.5%
열대과수류(바나나 1
 
2.2%
1
 
2.2%
참다래 1
 
2.2%

연락처
Text

MISSING 

Distinct2
Distinct (%)100.0%
Missing43
Missing (%)95.6%
Memory size492.0 B
2023-12-13T01:05:30.383300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters24
Distinct characters9
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st row064-739-5401
2nd row064-760-1039
ValueCountFrequency (%)
064-739-5401 1
50.0%
064-760-1039 1
50.0%
2023-12-13T01:05:30.698265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 5
20.8%
- 4
16.7%
6 3
12.5%
4 3
12.5%
7 2
 
8.3%
3 2
 
8.3%
9 2
 
8.3%
1 2
 
8.3%
5 1
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 20
83.3%
Dash Punctuation 4
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 5
25.0%
6 3
15.0%
4 3
15.0%
7 2
 
10.0%
3 2
 
10.0%
9 2
 
10.0%
1 2
 
10.0%
5 1
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 5
20.8%
- 4
16.7%
6 3
12.5%
4 3
12.5%
7 2
 
8.3%
3 2
 
8.3%
9 2
 
8.3%
1 2
 
8.3%
5 1
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 5
20.8%
- 4
16.7%
6 3
12.5%
4 3
12.5%
7 2
 
8.3%
3 2
 
8.3%
9 2
 
8.3%
1 2
 
8.3%
5 1
 
4.2%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size492.0 B
Minimum2023-10-31 00:00:00
Maximum2023-10-31 00:00:00
2023-12-13T01:05:30.823620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:05:30.932574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2023-12-13T01:05:31.015130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
상호취급 작물연락처
상호1.0001.0000.000
취급 작물1.0001.000NaN
연락처0.000NaN1.000

Missing values

2023-12-13T01:05:29.134964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:05:29.236869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호취급 작물연락처데이터기준일자
0천도종묘감귤<NA>2023-10-31
1제일종묘감귤<NA>2023-10-31
2석파농산감귤<NA>2023-10-31
3자원종묘감귤<NA>2023-10-31
4제주우리농원감귤<NA>2023-10-31
5성림종묘감귤<NA>2023-10-31
6한라종묘감귤<NA>2023-10-31
7서귀종묘감귤<NA>2023-10-31
8영농조합법인황금낭(록산영농조합법인)감귤<NA>2023-10-31
9중문종묘감귤<NA>2023-10-31
상호취급 작물연락처데이터기준일자
35위미농업협동조합감귤064-760-10392023-10-31
36바른(正)수종묘목감귤<NA>2023-10-31
37청파원종묘감귤<NA>2023-10-31
38황금종묘감귤<NA>2023-10-31
39보해농원감귤<NA>2023-10-31
40제주나무미농수산감귤<NA>2023-10-31
41산방 농업회사법인㈜감귤<NA>2023-10-31
42강두희감귤<NA>2023-10-31
43어드래농장감귤<NA>2023-10-31
44강정호감귤<NA>2023-10-31