Overview

Dataset statistics

Number of variables6
Number of observations69
Missing cells76
Missing cells (%)18.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.5 KiB
Average record size in memory51.9 B

Variable types

Numeric2
Text3
Categorical1

Dataset

Description전라남도 장흥군에 위치한 가금류 농장의 일련번호, 농장이름, 사육축종, 사육수, 농장상세주소 등의 정보를 제공함
Author전라남도 장흥군
URLhttps://www.data.go.kr/data/15076922/fileData.do

Alerts

농장명 has 8 (11.6%) missing valuesMissing
지번주소 has 6 (8.7%) missing valuesMissing
도로명주소 has 62 (89.9%) missing valuesMissing
번호 has unique valuesUnique

Reproduction

Analysis started2024-03-14 14:18:19.238651
Analysis finished2024-03-14 14:18:21.279953
Duration2.04 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct69
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35
Minimum1
Maximum69
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size749.0 B
2024-03-14T23:18:21.504393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4.4
Q118
median35
Q352
95-th percentile65.6
Maximum69
Range68
Interquartile range (IQR)34

Descriptive statistics

Standard deviation20.062403
Coefficient of variation (CV)0.5732115
Kurtosis-1.2
Mean35
Median Absolute Deviation (MAD)17
Skewness0
Sum2415
Variance402.5
MonotonicityStrictly increasing
2024-03-14T23:18:21.959291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.4%
45 1
 
1.4%
51 1
 
1.4%
50 1
 
1.4%
49 1
 
1.4%
48 1
 
1.4%
47 1
 
1.4%
46 1
 
1.4%
44 1
 
1.4%
53 1
 
1.4%
Other values (59) 59
85.5%
ValueCountFrequency (%)
1 1
1.4%
2 1
1.4%
3 1
1.4%
4 1
1.4%
5 1
1.4%
6 1
1.4%
7 1
1.4%
8 1
1.4%
9 1
1.4%
10 1
1.4%
ValueCountFrequency (%)
69 1
1.4%
68 1
1.4%
67 1
1.4%
66 1
1.4%
65 1
1.4%
64 1
1.4%
63 1
1.4%
62 1
1.4%
61 1
1.4%
60 1
1.4%

농장명
Text

MISSING 

Distinct56
Distinct (%)91.8%
Missing8
Missing (%)11.6%
Memory size680.0 B
2024-03-14T23:18:22.901358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length4
Mean length5.0163934
Min length3

Characters and Unicode

Total characters306
Distinct characters102
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)85.2%

Sample

1st row형제농장
2nd row풍길오리농장
3rd row풍길오리농장
4th row은비치농장
5th row덕암농장
ValueCountFrequency (%)
대성농장 3
 
4.7%
회진오리농장 2
 
3.1%
여의동농장 2
 
3.1%
풍길오리농장 2
 
3.1%
민주농장 1
 
1.6%
영섭농장 1
 
1.6%
형제농장 1
 
1.6%
믿음오리농장 1
 
1.6%
주천농장 1
 
1.6%
벧엘농장 1
 
1.6%
Other values (49) 49
76.6%
2024-03-14T23:18:24.292262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
56
18.3%
55
18.0%
19
 
6.2%
18
 
5.9%
5
 
1.6%
4
 
1.3%
4
 
1.3%
4
 
1.3%
4
 
1.3%
3
 
1.0%
Other values (92) 134
43.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 291
95.1%
Decimal Number 4
 
1.3%
Lowercase Letter 4
 
1.3%
Space Separator 3
 
1.0%
Open Punctuation 2
 
0.7%
Close Punctuation 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
56
19.2%
55
18.9%
19
 
6.5%
18
 
6.2%
5
 
1.7%
4
 
1.4%
4
 
1.4%
4
 
1.4%
4
 
1.4%
3
 
1.0%
Other values (83) 119
40.9%
Lowercase Letter
ValueCountFrequency (%)
f 1
25.0%
a 1
25.0%
r 1
25.0%
m 1
25.0%
Decimal Number
ValueCountFrequency (%)
1 2
50.0%
2 2
50.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 291
95.1%
Common 11
 
3.6%
Latin 4
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
56
19.2%
55
18.9%
19
 
6.5%
18
 
6.2%
5
 
1.7%
4
 
1.4%
4
 
1.4%
4
 
1.4%
4
 
1.4%
3
 
1.0%
Other values (83) 119
40.9%
Common
ValueCountFrequency (%)
3
27.3%
1 2
18.2%
( 2
18.2%
) 2
18.2%
2 2
18.2%
Latin
ValueCountFrequency (%)
f 1
25.0%
a 1
25.0%
r 1
25.0%
m 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 291
95.1%
ASCII 15
 
4.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
56
19.2%
55
18.9%
19
 
6.5%
18
 
6.2%
5
 
1.7%
4
 
1.4%
4
 
1.4%
4
 
1.4%
4
 
1.4%
3
 
1.0%
Other values (83) 119
40.9%
ASCII
ValueCountFrequency (%)
3
20.0%
1 2
13.3%
( 2
13.3%
) 2
13.3%
2 2
13.3%
f 1
 
6.7%
a 1
 
6.7%
r 1
 
6.7%
m 1
 
6.7%

축종
Categorical

Distinct8
Distinct (%)11.6%
Missing0
Missing (%)0.0%
Memory size680.0 B
육용오리
43 
육계
11 
삼계
종오리
 
2
산란계
 
2
Other values (3)
 
3

Length

Max length6
Median length4
Mean length3.3913043
Min length2

Unique

Unique3 ?
Unique (%)4.3%

Sample

1st row육용오리
2nd row육용오리
3rd row육용오리
4th row육용오리
5th row육용오리

Common Values

ValueCountFrequency (%)
육용오리 43
62.3%
육계 11
 
15.9%
삼계 8
 
11.6%
종오리 2
 
2.9%
산란계 2
 
2.9%
원종오리 1
 
1.4%
육용오리 1
 
1.4%
청계 1
 
1.4%

Length

2024-03-14T23:18:24.734158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T23:18:25.098749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
육용오리 44
63.8%
육계 11
 
15.9%
삼계 8
 
11.6%
종오리 2
 
2.9%
산란계 2
 
2.9%
원종오리 1
 
1.4%
청계 1
 
1.4%

사육수
Real number (ℝ)

Distinct43
Distinct (%)62.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28602.783
Minimum700
Maximum114000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size749.0 B
2024-03-14T23:18:25.497717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum700
5-th percentile6240
Q110000
median17592
Q342000
95-th percentile89800
Maximum114000
Range113300
Interquartile range (IQR)32000

Descriptive statistics

Standard deviation26757.724
Coefficient of variation (CV)0.93549375
Kurtosis1.8486229
Mean28602.783
Median Absolute Deviation (MAD)7592
Skewness1.5801615
Sum1973592
Variance7.1597582 × 108
MonotonicityNot monotonic
2024-03-14T23:18:26.117537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=43)
ValueCountFrequency (%)
10000 9
 
13.0%
15000 3
 
4.3%
9000 3
 
4.3%
13000 3
 
4.3%
25000 3
 
4.3%
18000 2
 
2.9%
12000 2
 
2.9%
17000 2
 
2.9%
7000 2
 
2.9%
21000 2
 
2.9%
Other values (33) 38
55.1%
ValueCountFrequency (%)
700 1
 
1.4%
900 1
 
1.4%
3500 1
 
1.4%
6000 1
 
1.4%
6600 1
 
1.4%
7000 2
 
2.9%
9000 3
 
4.3%
9500 1
 
1.4%
10000 9
13.0%
11500 1
 
1.4%
ValueCountFrequency (%)
114000 1
1.4%
107000 1
1.4%
95000 1
1.4%
93000 1
1.4%
85000 2
2.9%
65000 1
1.4%
60000 1
1.4%
59000 2
2.9%
57000 1
1.4%
55000 1
1.4%

지번주소
Text

MISSING 

Distinct60
Distinct (%)95.2%
Missing6
Missing (%)8.7%
Memory size680.0 B
2024-03-14T23:18:27.233040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length21
Min length16

Characters and Unicode

Total characters1323
Distinct characters70
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)90.5%

Sample

1st row전라남도 장흥군 용산면 풍길리 937-1
2nd row전라남도 장흥군 용산면 풍길리 7
3rd row전라남도 장흥군 용산면 산72
4th row전라남도 장흥군 안양면 사촌리 33-1
5th row전라남도 장흥군 용산면 덕암리 439-9
ValueCountFrequency (%)
전라남도 63
20.1%
장흥군 63
20.1%
기동리 13
 
4.1%
장평면 12
 
3.8%
부산면 10
 
3.2%
용산면 9
 
2.9%
회진면 7
 
2.2%
장평 6
 
1.9%
회진리 6
 
1.9%
장흥 5
 
1.6%
Other values (98) 120
38.2%
2024-03-14T23:18:28.761301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
252
19.0%
89
 
6.7%
69
 
5.2%
64
 
4.8%
63
 
4.8%
63
 
4.8%
63
 
4.8%
63
 
4.8%
62
 
4.7%
1 49
 
3.7%
Other values (60) 486
36.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 799
60.4%
Space Separator 252
 
19.0%
Decimal Number 231
 
17.5%
Dash Punctuation 41
 
3.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
11.1%
69
 
8.6%
64
 
8.0%
63
 
7.9%
63
 
7.9%
63
 
7.9%
63
 
7.9%
62
 
7.8%
42
 
5.3%
33
 
4.1%
Other values (48) 188
23.5%
Decimal Number
ValueCountFrequency (%)
1 49
21.2%
4 32
13.9%
3 31
13.4%
2 22
9.5%
5 21
9.1%
7 19
 
8.2%
9 19
 
8.2%
6 14
 
6.1%
8 13
 
5.6%
0 11
 
4.8%
Space Separator
ValueCountFrequency (%)
252
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 799
60.4%
Common 524
39.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
11.1%
69
 
8.6%
64
 
8.0%
63
 
7.9%
63
 
7.9%
63
 
7.9%
63
 
7.9%
62
 
7.8%
42
 
5.3%
33
 
4.1%
Other values (48) 188
23.5%
Common
ValueCountFrequency (%)
252
48.1%
1 49
 
9.4%
- 41
 
7.8%
4 32
 
6.1%
3 31
 
5.9%
2 22
 
4.2%
5 21
 
4.0%
7 19
 
3.6%
9 19
 
3.6%
6 14
 
2.7%
Other values (2) 24
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 799
60.4%
ASCII 524
39.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
252
48.1%
1 49
 
9.4%
- 41
 
7.8%
4 32
 
6.1%
3 31
 
5.9%
2 22
 
4.2%
5 21
 
4.0%
7 19
 
3.6%
9 19
 
3.6%
6 14
 
2.7%
Other values (2) 24
 
4.6%
Hangul
ValueCountFrequency (%)
89
11.1%
69
 
8.6%
64
 
8.0%
63
 
7.9%
63
 
7.9%
63
 
7.9%
63
 
7.9%
62
 
7.8%
42
 
5.3%
33
 
4.1%
Other values (48) 188
23.5%

도로명주소
Text

MISSING 

Distinct6
Distinct (%)85.7%
Missing62
Missing (%)89.9%
Memory size680.0 B
2024-03-14T23:18:29.379245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length24
Mean length23.142857
Min length21

Characters and Unicode

Total characters162
Distinct characters42
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)71.4%

Sample

1st row전라남도 장흥군 장흥읍 장흥대로 3027-145
2nd row전라남도 장흥군 용산면 덕암풍길로 90
3rd row전라남도 장흥군 장평면 곰치로 1032-15
4th row전라남도 장흥군 장평면 곰치로 1032-15
5th row전라남도 장흥군 용산 접정남포로248-4
ValueCountFrequency (%)
전라남도 7
20.6%
장흥군 7
20.6%
장평면 2
 
5.9%
곰치로 2
 
5.9%
1032-15 2
 
5.9%
접정남포로248-4 1
 
2.9%
홍성로 1
 
2.9%
장동 1
 
2.9%
11-61 1
 
2.9%
지천1길 1
 
2.9%
Other values (9) 9
26.5%
2024-03-14T23:18:30.352670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
16.7%
12
 
7.4%
1 11
 
6.8%
9
 
5.6%
8
 
4.9%
7
 
4.3%
7
 
4.3%
7
 
4.3%
7
 
4.3%
6
 
3.7%
Other values (32) 61
37.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 93
57.4%
Decimal Number 36
 
22.2%
Space Separator 27
 
16.7%
Dash Punctuation 6
 
3.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
12.9%
9
 
9.7%
8
 
8.6%
7
 
7.5%
7
 
7.5%
7
 
7.5%
7
 
7.5%
6
 
6.5%
3
 
3.2%
2
 
2.2%
Other values (20) 25
26.9%
Decimal Number
ValueCountFrequency (%)
1 11
30.6%
0 5
13.9%
2 5
13.9%
4 4
 
11.1%
3 4
 
11.1%
5 3
 
8.3%
6 1
 
2.8%
8 1
 
2.8%
7 1
 
2.8%
9 1
 
2.8%
Space Separator
ValueCountFrequency (%)
27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 93
57.4%
Common 69
42.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
12.9%
9
 
9.7%
8
 
8.6%
7
 
7.5%
7
 
7.5%
7
 
7.5%
7
 
7.5%
6
 
6.5%
3
 
3.2%
2
 
2.2%
Other values (20) 25
26.9%
Common
ValueCountFrequency (%)
27
39.1%
1 11
15.9%
- 6
 
8.7%
0 5
 
7.2%
2 5
 
7.2%
4 4
 
5.8%
3 4
 
5.8%
5 3
 
4.3%
6 1
 
1.4%
8 1
 
1.4%
Other values (2) 2
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 93
57.4%
ASCII 69
42.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27
39.1%
1 11
15.9%
- 6
 
8.7%
0 5
 
7.2%
2 5
 
7.2%
4 4
 
5.8%
3 4
 
5.8%
5 3
 
4.3%
6 1
 
1.4%
8 1
 
1.4%
Other values (2) 2
 
2.9%
Hangul
ValueCountFrequency (%)
12
12.9%
9
 
9.7%
8
 
8.6%
7
 
7.5%
7
 
7.5%
7
 
7.5%
7
 
7.5%
6
 
6.5%
3
 
3.2%
2
 
2.2%
Other values (20) 25
26.9%

Interactions

2024-03-14T23:18:20.225790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:18:19.771898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:18:20.378761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-14T23:18:20.027050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-14T23:18:30.509035image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호농장명축종사육수지번주소도로명주소
번호1.0000.9590.6390.5640.9291.000
농장명0.9591.0001.0000.3920.9971.000
축종0.6391.0001.0000.4161.0001.000
사육수0.5640.3920.4161.0000.0000.797
지번주소0.9290.9971.0000.0001.000NaN
도로명주소1.0001.0001.0000.797NaN1.000
2024-03-14T23:18:30.694404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호사육수축종
번호1.0000.3360.363
사육수0.3361.0000.207
축종0.3630.2071.000

Missing values

2024-03-14T23:18:20.566518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-14T23:18:20.847900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-14T23:18:21.139970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

번호농장명축종사육수지번주소도로명주소
01형제농장육용오리28000전라남도 장흥군 용산면 풍길리 937-1<NA>
12풍길오리농장육용오리13000전라남도 장흥군 용산면 풍길리 7<NA>
23풍길오리농장육용오리10000전라남도 장흥군 용산면 산72<NA>
34은비치농장육용오리45000전라남도 장흥군 안양면 사촌리 33-1<NA>
45덕암농장육용오리22000전라남도 장흥군 용산면 덕암리 439-9<NA>
56모산농장육용오리18000전라남도 장흥군 용산면 모산리 30-11<NA>
67청수농장육용오리12000전라남도 장흥군 용산면 접정리 304-8<NA>
78오리누리육용오리17000전라남도 장흥군 회진면 회진리 2166-1<NA>
89우리농장육용오리10000전라남도 장흥군 회진면 회진리 1355<NA>
910대성농장육용오리7000전라남도 장흥군 회진면 덕산리 2111-3<NA>
번호농장명축종사육수지번주소도로명주소
5960사안농장육계40000전라남도 장흥군 장흥 사안리 104<NA>
6061삼산농장육계55000전라남도 장흥군 장흥 삼산리 산92-5<NA>
6162삼화농장삼계85000전라남도 장흥군 회진 회진리 1443-2<NA>
6263여의동농장삼계95000전라남도 장흥군 장평 등촌리 297<NA>
6364<NA>삼계10000전라남도 장흥군 용산 월송리 546<NA>
6465모령농장육계42000<NA>전라남도 장흥군 안양 지천1길 11-61
6566<NA>삼계45000<NA>전라남도 장흥군 장동 홍성로 1230-41
6667길병문 농장산란계20000전라남도 장흥군 장평 축내리 694<NA>
6768행복을 farm청계900전라남도 장흥군 장평 축내리 718<NA>
6869인암농원산란계700전라남도 장흥군 유치 조양리 859-2<NA>