Overview

Dataset statistics

Number of variables4
Number of observations1958
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory63.2 KiB
Average record size in memory33.1 B

Variable types

Numeric1
Text1
Categorical2

Dataset

Description전라남도 나주시 축산업 허가 및 등록 현황으로 데이터 제공 신청자가 요구한 사업장 명칭, 주사육업종, 사업장 소재지에 대한 데이터. 사업장 명칭에는 농장주의 이름이 포함된 경우가 있고, 사업장 소재지는 농장주의 실제 거주지 소재지와 같은 경우도 존재하여 개인정보에 해당할 수 있기 때문에 그의 일부분만 제공.
URLhttps://www.data.go.kr/data/15117021/fileData.do

Alerts

주사육업종 is highly imbalanced (56.4%)Imbalance
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 13:36:36.044885
Analysis finished2023-12-12 13:36:36.627481
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct1958
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean979.5
Minimum1
Maximum1958
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size17.3 KiB
2023-12-12T22:36:36.713481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile98.85
Q1490.25
median979.5
Q31468.75
95-th percentile1860.15
Maximum1958
Range1957
Interquartile range (IQR)978.5

Descriptive statistics

Standard deviation565.37023
Coefficient of variation (CV)0.57720289
Kurtosis-1.2
Mean979.5
Median Absolute Deviation (MAD)489.5
Skewness0
Sum1917861
Variance319643.5
MonotonicityStrictly increasing
2023-12-12T22:36:36.901979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
1317 1
 
0.1%
1315 1
 
0.1%
1314 1
 
0.1%
1313 1
 
0.1%
1312 1
 
0.1%
1311 1
 
0.1%
1310 1
 
0.1%
1309 1
 
0.1%
1308 1
 
0.1%
Other values (1948) 1948
99.5%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1958 1
0.1%
1957 1
0.1%
1956 1
0.1%
1955 1
0.1%
1954 1
0.1%
1953 1
0.1%
1952 1
0.1%
1951 1
0.1%
1950 1
0.1%
1949 1
0.1%
Distinct153
Distinct (%)7.8%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
2023-12-12T22:36:37.129892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length4
Mean length4.2778345
Min length2

Characters and Unicode

Total characters8376
Distinct characters107
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique103 ?
Unique (%)5.3%

Sample

1st row○○축산
2nd row○○축산
3rd row○○농장
4th row○○농장
5th row○○축산
ValueCountFrequency (%)
○○농장 1217
61.0%
○○축산 221
 
11.1%
○○목장 122
 
6.1%
○○○농장 44
 
2.2%
○○○ 42
 
2.1%
○○2농장 26
 
1.3%
○○축사 17
 
0.9%
○○ 16
 
0.8%
○○농장2 12
 
0.6%
○○○축사 11
 
0.6%
Other values (141) 267
 
13.4%
2023-12-12T22:36:37.471351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4127
49.3%
1550
 
18.5%
1440
 
17.2%
276
 
3.3%
249
 
3.0%
144
 
1.7%
62
 
0.7%
2 55
 
0.7%
37
 
0.4%
24
 
0.3%
Other values (97) 412
 
4.9%

Most occurring categories

ValueCountFrequency (%)
Other Symbol 4131
49.3%
Other Letter 4118
49.2%
Decimal Number 71
 
0.8%
Space Separator 37
 
0.4%
Lowercase Letter 12
 
0.1%
Close Punctuation 2
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Other Punctuation 2
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1550
37.6%
1440
35.0%
276
 
6.7%
249
 
6.0%
144
 
3.5%
62
 
1.5%
24
 
0.6%
24
 
0.6%
22
 
0.5%
20
 
0.5%
Other values (79) 307
 
7.5%
Lowercase Letter
ValueCountFrequency (%)
r 3
25.0%
a 2
16.7%
m 2
16.7%
e 2
16.7%
n 1
 
8.3%
f 1
 
8.3%
g 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
2 55
77.5%
1 11
 
15.5%
3 4
 
5.6%
5 1
 
1.4%
Other Symbol
ValueCountFrequency (%)
4127
99.9%
4
 
0.1%
Space Separator
ValueCountFrequency (%)
37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
· 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
F 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4241
50.6%
Hangul 4122
49.2%
Latin 13
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1550
37.6%
1440
34.9%
276
 
6.7%
249
 
6.0%
144
 
3.5%
62
 
1.5%
24
 
0.6%
24
 
0.6%
22
 
0.5%
20
 
0.5%
Other values (80) 311
 
7.5%
Common
ValueCountFrequency (%)
4127
97.3%
2 55
 
1.3%
37
 
0.9%
1 11
 
0.3%
3 4
 
0.1%
) 2
 
< 0.1%
( 2
 
< 0.1%
· 2
 
< 0.1%
5 1
 
< 0.1%
Latin
ValueCountFrequency (%)
r 3
23.1%
a 2
15.4%
m 2
15.4%
e 2
15.4%
n 1
 
7.7%
f 1
 
7.7%
g 1
 
7.7%
F 1
 
7.7%

Most occurring blocks

ValueCountFrequency (%)
Geometric Shapes 4127
49.3%
Hangul 4118
49.2%
ASCII 125
 
1.5%
None 6
 
0.1%

Most frequent character per block

Geometric Shapes
ValueCountFrequency (%)
4127
100.0%
Hangul
ValueCountFrequency (%)
1550
37.6%
1440
35.0%
276
 
6.7%
249
 
6.0%
144
 
3.5%
62
 
1.5%
24
 
0.6%
24
 
0.6%
22
 
0.5%
20
 
0.5%
Other values (79) 307
 
7.5%
ASCII
ValueCountFrequency (%)
2 55
44.0%
37
29.6%
1 11
 
8.8%
3 4
 
3.2%
r 3
 
2.4%
) 2
 
1.6%
( 2
 
1.6%
a 2
 
1.6%
m 2
 
1.6%
e 2
 
1.6%
Other values (5) 5
 
4.0%
None
ValueCountFrequency (%)
4
66.7%
· 2
33.3%

주사육업종
Categorical

IMBALANCE 

Distinct11
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
한우
1467 
오리
 
119
육계
 
94
돼지
 
93
젖소
 
84
Other values (6)
 
101

Length

Max length6
Median length2
Mean length2.0638407
Min length2

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row한우
2nd row한우
3rd row돼지
4th row한우
5th row한우

Common Values

ValueCountFrequency (%)
한우 1467
74.9%
오리 119
 
6.1%
육계 94
 
4.8%
돼지 93
 
4.7%
젖소 84
 
4.3%
종계/산란계 31
 
1.6%
산양 28
 
1.4%
사슴 16
 
0.8%
염소 14
 
0.7%
육우 11
 
0.6%

Length

2023-12-12T22:36:37.640894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 1467
74.9%
오리 119
 
6.1%
육계 94
 
4.8%
돼지 93
 
4.7%
젖소 84
 
4.3%
종계/산란계 31
 
1.6%
산양 28
 
1.4%
사슴 16
 
0.8%
염소 14
 
0.7%
육우 11
 
0.6%
Distinct28
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size15.4 KiB
전라남도 나주시 왕곡면
234 
전라남도 나주시 공산면
234 
전라남도 나주시 동강면
228 
전라남도 나주시 봉황면
205 
전라남도 나주시 노안면
185 
Other values (23)
872 

Length

Max length12
Median length12
Mean length12
Min length12

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row전라남도 나주시 동강면
2nd row전라남도 나주시 동강면
3rd row전라남도 나주시 동강면
4th row전라남도 나주시 공산면
5th row전라남도 나주시 공산면

Common Values

ValueCountFrequency (%)
전라남도 나주시 왕곡면 234
12.0%
전라남도 나주시 공산면 234
12.0%
전라남도 나주시 동강면 228
11.6%
전라남도 나주시 봉황면 205
10.5%
전라남도 나주시 노안면 185
9.4%
전라남도 나주시 반남면 175
8.9%
전라남도 나주시 세지면 154
7.9%
전라남도 나주시 다시면 137
7.0%
전라남도 나주시 문평면 84
 
4.3%
전라남도 나주시 다도면 69
 
3.5%
Other values (18) 253
12.9%

Length

2023-12-12T22:36:37.784836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
전라남도 1958
33.3%
나주시 1958
33.3%
왕곡면 234
 
4.0%
공산면 234
 
4.0%
동강면 228
 
3.9%
봉황면 205
 
3.5%
노안면 185
 
3.1%
반남면 175
 
3.0%
세지면 154
 
2.6%
다시면 137
 
2.3%
Other values (20) 406
 
6.9%

Interactions

2023-12-12T22:36:36.283277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T22:36:37.858081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주사육업종사업장소재지(지번)
연번1.0000.4030.307
주사육업종0.4031.0000.431
사업장소재지(지번)0.3070.4311.000
2023-12-12T22:36:37.972417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업장소재지(지번)주사육업종
사업장소재지(지번)1.0000.162
주사육업종0.1621.000
2023-12-12T22:36:38.082994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번주사육업종사업장소재지(지번)
연번1.0000.1840.114
주사육업종0.1841.0000.162
사업장소재지(지번)0.1140.1621.000

Missing values

2023-12-12T22:36:36.463258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T22:36:36.586940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업장명칭주사육업종사업장소재지(지번)
01○○축산한우전라남도 나주시 동강면
12○○축산한우전라남도 나주시 동강면
23○○농장돼지전라남도 나주시 동강면
34○○농장한우전라남도 나주시 공산면
45○○축산한우전라남도 나주시 공산면
56○○축산한우전라남도 나주시 다시면
67○○종합농원한우전라남도 나주시 왕곡면
78○○축산한우전라남도 나주시 동강면
89○○축산한우전라남도 나주시 동강면
910○○목장한우전라남도 나주시 봉황면
연번사업장명칭주사육업종사업장소재지(지번)
19481949○○원한우전라남도 나주시 다시면
19491950○○농장한우전라남도 나주시 노안면
19501951○○축산한우전라남도 나주시 봉황면
19511952○○농장염소전라남도 나주시 노안면
19521953○○농장한우전라남도 나주시 세지면
19531954○○농장한우전라남도 나주시 봉황면
19541955○○농장한우전라남도 나주시 공산면
19551956○○농장한우전라남도 나주시 봉황면
19561957○○농장한우전라남도 나주시 노안면
19571958○○농장한우전라남도 나주시 왕곡면