Overview

Dataset statistics

Number of variables3
Number of observations1101
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory25.9 KiB
Average record size in memory24.1 B

Variable types

Text2
Categorical1

Reproduction

Analysis started2024-01-09 20:36:35.445189
Analysis finished2024-01-09 20:36:35.780541
Duration0.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1048
Distinct (%)95.2%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
2024-01-10T05:36:35.962803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length4
Mean length4.5703906
Min length2

Characters and Unicode

Total characters5032
Distinct characters322
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1006 ?
Unique (%)91.4%

Sample

1st row오도농장
2nd row지환농장1
3rd row지환농장2
4th row옥수뿔농장
5th row해창농장
ValueCountFrequency (%)
농장 27
 
2.4%
대성농장 5
 
0.4%
서해농장 5
 
0.4%
목장 5
 
0.4%
당산농장 4
 
0.3%
행정농장 3
 
0.3%
농업회사법인 3
 
0.3%
운곡농장 3
 
0.3%
우리농장 3
 
0.3%
소농장 3
 
0.3%
Other values (1049) 1086
94.7%
2024-01-10T05:36:36.410636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1038
20.6%
878
 
17.4%
182
 
3.6%
91
 
1.8%
84
 
1.7%
57
 
1.1%
56
 
1.1%
56
 
1.1%
52
 
1.0%
51
 
1.0%
Other values (312) 2487
49.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4898
97.3%
Decimal Number 57
 
1.1%
Space Separator 46
 
0.9%
Uppercase Letter 12
 
0.2%
Open Punctuation 9
 
0.2%
Close Punctuation 9
 
0.2%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1038
21.2%
878
 
17.9%
182
 
3.7%
91
 
1.9%
84
 
1.7%
57
 
1.2%
56
 
1.1%
56
 
1.1%
52
 
1.1%
51
 
1.0%
Other values (295) 2353
48.0%
Uppercase Letter
ValueCountFrequency (%)
O 3
25.0%
E 2
16.7%
R 1
 
8.3%
T 1
 
8.3%
N 1
 
8.3%
M 1
 
8.3%
K 1
 
8.3%
F 1
 
8.3%
B 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
2 34
59.6%
1 21
36.8%
3 1
 
1.8%
4 1
 
1.8%
Space Separator
ValueCountFrequency (%)
46
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4898
97.3%
Common 122
 
2.4%
Latin 12
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1038
21.2%
878
 
17.9%
182
 
3.7%
91
 
1.9%
84
 
1.7%
57
 
1.2%
56
 
1.1%
56
 
1.1%
52
 
1.1%
51
 
1.0%
Other values (295) 2353
48.0%
Latin
ValueCountFrequency (%)
O 3
25.0%
E 2
16.7%
R 1
 
8.3%
T 1
 
8.3%
N 1
 
8.3%
M 1
 
8.3%
K 1
 
8.3%
F 1
 
8.3%
B 1
 
8.3%
Common
ValueCountFrequency (%)
46
37.7%
2 34
27.9%
1 21
17.2%
( 9
 
7.4%
) 9
 
7.4%
. 1
 
0.8%
3 1
 
0.8%
4 1
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4898
97.3%
ASCII 134
 
2.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1038
21.2%
878
 
17.9%
182
 
3.7%
91
 
1.9%
84
 
1.7%
57
 
1.2%
56
 
1.1%
56
 
1.1%
52
 
1.1%
51
 
1.0%
Other values (295) 2353
48.0%
ASCII
ValueCountFrequency (%)
46
34.3%
2 34
25.4%
1 21
15.7%
( 9
 
6.7%
) 9
 
6.7%
O 3
 
2.2%
E 2
 
1.5%
R 1
 
0.7%
T 1
 
0.7%
N 1
 
0.7%
Other values (7) 7
 
5.2%

축종
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
857 
돼지
132 
112 

Length

Max length2
Median length1
Mean length1.119891
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
857
77.8%
돼지 132
 
12.0%
112
 
10.2%

Length

2024-01-10T05:36:36.567416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:36:36.683855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
857
77.8%
돼지 132
 
12.0%
112
 
10.2%
Distinct1096
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size8.7 KiB
2024-01-10T05:36:36.955540image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length85
Median length83
Mean length21.812897
Min length14

Characters and Unicode

Total characters24016
Distinct characters137
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1092 ?
Unique (%)99.2%

Sample

1st row충청남도 당진시 고대면 금암로 367
2nd row충청남도 당진시 고대면 당진포리 1149-1
3rd row충청남도 당진시 고대면 당진포리 1150-1
4th row충청남도 당진시 고대면 당진포리 233-1
5th row충청남도 당진시 고대면 당진포리 236
ValueCountFrequency (%)
충청남도 1101
19.9%
당진시 1101
19.9%
고대면 167
 
3.0%
순성면 150
 
2.7%
합덕읍 144
 
2.6%
신평면 111
 
2.0%
송악읍 79
 
1.4%
면천면 77
 
1.4%
송산면 65
 
1.2%
석문면 63
 
1.1%
Other values (1233) 2487
44.9%
2024-01-10T05:36:37.388690image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4476
18.6%
1162
 
4.8%
1158
 
4.8%
1148
 
4.8%
1136
 
4.7%
1114
 
4.6%
1107
 
4.6%
1101
 
4.6%
1010
 
4.2%
866
 
3.6%
Other values (127) 9738
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14289
59.5%
Space Separator 4476
 
18.6%
Decimal Number 4364
 
18.2%
Dash Punctuation 813
 
3.4%
Other Punctuation 60
 
0.2%
Open Punctuation 7
 
< 0.1%
Close Punctuation 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1162
 
8.1%
1158
 
8.1%
1148
 
8.0%
1136
 
8.0%
1114
 
7.8%
1107
 
7.7%
1101
 
7.7%
1010
 
7.1%
866
 
6.1%
292
 
2.0%
Other values (112) 4195
29.4%
Decimal Number
ValueCountFrequency (%)
1 815
18.7%
2 590
13.5%
3 552
12.6%
4 437
10.0%
5 396
9.1%
7 334
7.7%
6 319
 
7.3%
8 315
 
7.2%
9 303
 
6.9%
0 303
 
6.9%
Space Separator
ValueCountFrequency (%)
4476
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 813
100.0%
Other Punctuation
ValueCountFrequency (%)
, 60
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14289
59.5%
Common 9727
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1162
 
8.1%
1158
 
8.1%
1148
 
8.0%
1136
 
8.0%
1114
 
7.8%
1107
 
7.7%
1101
 
7.7%
1010
 
7.1%
866
 
6.1%
292
 
2.0%
Other values (112) 4195
29.4%
Common
ValueCountFrequency (%)
4476
46.0%
1 815
 
8.4%
- 813
 
8.4%
2 590
 
6.1%
3 552
 
5.7%
4 437
 
4.5%
5 396
 
4.1%
7 334
 
3.4%
6 319
 
3.3%
8 315
 
3.2%
Other values (5) 680
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14289
59.5%
ASCII 9727
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4476
46.0%
1 815
 
8.4%
- 813
 
8.4%
2 590
 
6.1%
3 552
 
5.7%
4 437
 
4.5%
5 396
 
4.1%
7 334
 
3.4%
6 319
 
3.3%
8 315
 
3.2%
Other values (5) 680
 
7.0%
Hangul
ValueCountFrequency (%)
1162
 
8.1%
1158
 
8.1%
1148
 
8.0%
1136
 
8.0%
1114
 
7.8%
1107
 
7.7%
1101
 
7.7%
1010
 
7.1%
866
 
6.1%
292
 
2.0%
Other values (112) 4195
29.4%

Missing values

2024-01-10T05:36:35.687403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:36:35.753097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

농장명축종농장주소
0오도농장충청남도 당진시 고대면 금암로 367
1지환농장1충청남도 당진시 고대면 당진포리 1149-1
2지환농장2충청남도 당진시 고대면 당진포리 1150-1
3옥수뿔농장충청남도 당진시 고대면 당진포리 233-1
4해창농장충청남도 당진시 고대면 당진포리 236
5당나루농장충청남도 당진시 고대면 당진포리 286
6순광농장충청남도 당진시 고대면 당진포리 307
7고기석농장충청남도 당진시 고대면 당진포리 457-35
8김기용 농장충청남도 당진시 고대면 당진포리 462-4
9실조암농장충청남도 당진시 고대면 당진포리 500
농장명축종농장주소
1091농업회사법인(주)대통충청남도 당진시 대호지면 두산리 560번지 6호 , 560-11, 560-13, 560-14, 560-15, 장정리 503-25, 503-26
1092바른농장충청남도 당진시 고대면 당진포리 2049번지 2050, 2051, 2052, 2053, 2054, 2055
1093제경농장충청남도 당진시 대호지면 송전리 산 16번지
1094대명농장충청남도 당진시 대호지면 마중리 2번지 2호 양계장
1095호선농장충청남도 당진시 정미면 승산리 307번지 3호 ,307-10, 307-14, 307-16
1096일성농장충청남도 당진시 합덕읍 도곡리 378번지 2호 , 378-23, 378-92
1097기린 고대농장충청남도 당진시 고대면 당진포리 492번지 3호 , 492-7, 492-4, 492-8, 492-9, 492-10
1098효원농장충청남도 당진시 면천면 문봉리 185번지 19호 , 185-20, 185-17
1099수훈농장충청남도 당진시 신평면 부수리 303번지 2호 , 303-13, 303-14, 304-10, 304-12
1100원농장충청남도 당진시 합덕읍 성동리 산 58번지 2호