Overview

Dataset statistics

Number of variables: 8
Number of observations: 71
Missing cells: 8
Missing cells (%): 1.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.6 KiB
Average record size in memory: 66.9 B
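
The missing-cell percentage follows from the counts in this table. A minimal sketch of that arithmetic, assuming the percentage is missing cells divided by all cells (variables times observations); the variable names are illustrative only:

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 8
n_observations = 71
missing_cells = 8

# Assumption: every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

The result rounds to the 1.4% shown above, which suggests the report uses the same definition.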

Variable types

Categorical: 4
Text: 3
Numeric: 1

Dataset

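Description: Status of poultry farms (layers, broilers, ducks, etc.) in Icheon-si, including farm name, species, detailed classification, location, number of birds kept, road-name address, and lot-number address.
URL: https://www.data.go.kr/data/15060393/fileData.do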

Alerts

시군명 has constant value "이천시" (Constant)
데이터기준일자 has constant value "2023-06-02" (Constant)
상세구분 is highly overall correlated with 축종명 (High correlation)
축종명 is highly overall correlated with 상세구분 (High correlation)
소재지도로명주소 has 8 (11.3%) missing values (Missing)
소재지지번주소 has unique values (Unique)
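
The Constant, Missing, and Unique alerts can be read directly off each column's distinct and missing counts. The sketch below is an illustration under simple assumed rules (one distinct value, any missing value, as many distinct values as rows); `column_alerts` is a hypothetical helper, not the profiler's own logic, and the actual tool may apply configurable thresholds:

```python
def column_alerts(n_rows, distinct, missing):
    """Return simple alert labels for one column from its summary counts."""
    alerts = []
    if distinct == 1:
        alerts.append("Constant")
    if missing == 0 and distinct == n_rows:
        alerts.append("Unique")
    if missing > 0:
        alerts.append(f"Missing ({missing}, {100 * missing / n_rows:.1f}%)")
    return alerts


# (distinct, missing) per column, copied from the Variables section.
summary = {
    "시군명": (1, 0),
    "소재지도로명주소": (62, 8),
    "소재지지번주소": (71, 0),
    "데이터기준일자": (1, 0),
}

for column, (distinct, missing) in summary.items():
    print(column, column_alerts(71, distinct, missing))
```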

Reproduction

Analysis started: 2023-12-12 07:05:59.726161
Analysis finished: 2023-12-12 07:06:00.855524
Duration: 1.13 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
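
A report like this one can be regenerated along the following lines. This is a sketch under stated assumptions: `build_report`, the file paths, the title, and the default encoding are hypothetical, and the CSV must first be downloaded from the URL in the Dataset block. The first lines only re-derive the reported duration from the two timestamps.

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-12 07:05:59.726161")
finished = datetime.fromisoformat("2023-12-12 07:06:00.855524")
duration_s = (finished - started).total_seconds()
print(f"duration: {duration_s:.2f} s")


def build_report(csv_path, html_path, encoding="utf-8"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the timing check above runs without these packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file may need a different encoding, such as cp949.
    df = pd.read_csv(csv_path, encoding=encoding)
    ProfileReport(df, title="Poultry farms in Icheon-si").to_file(html_path)
```

Calling `build_report("farms.csv", "report.html")` with a locally saved copy of the data would write the HTML file; both file names are placeholders.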

Variables

시군명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 700.0 B
이천시: 71

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 이천시
2nd row: 이천시
3rd row: 이천시
4th row: 이천시
5th row: 이천시

Common Values

Value | Count | Frequency (%)
이천시 | 71 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

농장명
Text

Distinct: 68
Distinct (%): 95.8%
Missing: 0
Missing (%): 0.0%
Memory size: 700.0 B

Length

Max length: 22
Median length: 4
Mean length: 5.915493
Min length: 3

Characters and Unicode

Total characters: 420
Distinct characters: 107
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
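
As a small illustration of these character properties, the sketch below counts Unicode general categories for one farm name taken from this column's sample rows, using Python's standard `unicodedata` module. The two-letter codes correspond to the category names used in the report (Lo = Other Letter, Zs = Space Separator, Ps and Pe = Open and Close Punctuation, Nd = Decimal Number). Script and block lookups are not available in the standard library and are left out.

```python
import unicodedata
from collections import Counter

# One value from the column's sample rows (22 characters long).
name = "농업회사법인 한국양계티에스(주) 1종계장"

# Count the general category of every code point in the string.
categories = Counter(unicodedata.category(ch) for ch in name)

for code, count in categories.most_common():
    print(code, count)
```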

Unique

Unique: 65
Unique (%): 91.5%

Sample

1st row: 동원종계장
2nd row: 이동종계장
3rd row: 상봉농장
4th row: 농업회사법인 한국양계티에스(주) 1종계장
5th row: 농업회사법인 조인팜스주식회사 이천지점

Value | Count | Frequency (%)
농업회사법인 | 6 | 7.2%
경기농장 | 2 | 2.4%
조인팜스주식회사 | 2 | 2.4%
이천지점 | 2 | 2.4%
우미농장 | 2 | 2.4%
송이농장(2 | 1 | 1.2%
원두농장 | 1 | 1.2%
성재농장 | 1 | 1.2%
서경농장 | 1 | 1.2%
명성농장 | 1 | 1.2%
Other values (64) | 64 | 77.1%

Most occurring characters

ValueCountFrequency (%)
62
 
14.8%
54
 
12.9%
14
 
3.3%
14
 
3.3%
12
 
2.9%
12
 
2.9%
10
 
2.4%
9
 
2.1%
9
 
2.1%
9
 
2.1%
Other values (97) 215
51.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 390
92.9%
Space Separator 12
 
2.9%
Open Punctuation 7
 
1.7%
Close Punctuation 7
 
1.7%
Decimal Number 4
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
62
 
15.9%
54
 
13.8%
14
 
3.6%
14
 
3.6%
12
 
3.1%
10
 
2.6%
9
 
2.3%
9
 
2.3%
9
 
2.3%
8
 
2.1%
Other values (92) 189
48.5%
Decimal Number
ValueCountFrequency (%)
1 3
75.0%
2 1
 
25.0%
Space Separator
ValueCountFrequency (%)
12
100.0%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 390
92.9%
Common 30
 
7.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
62
 
15.9%
54
 
13.8%
14
 
3.6%
14
 
3.6%
12
 
3.1%
10
 
2.6%
9
 
2.3%
9
 
2.3%
9
 
2.3%
8
 
2.1%
Other values (92) 189
48.5%
Common
ValueCountFrequency (%)
12
40.0%
( 7
23.3%
) 7
23.3%
1 3
 
10.0%
2 1
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 390
92.9%
ASCII 30
 
7.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
62
 
15.9%
54
 
13.8%
14
 
3.6%
14
 
3.6%
12
 
3.1%
10
 
2.6%
9
 
2.3%
9
 
2.3%
9
 
2.3%
8
 
2.1%
Other values (92) 189
48.5%
ASCII
ValueCountFrequency (%)
12
40.0%
( 7
23.3%
) 7
23.3%
1 3
 
10.0%
2 1
 
3.3%

축종명
Categorical

HIGH CORRELATION 

Distinct: 6
Distinct (%): 8.5%
Missing: 0
Missing (%): 0.0%
Memory size: 700.0 B
육계: 34
종계/산란계: 20
종계업: 11
오리: 3
메추리: 2

Length

Max length: 6
Median length: 2
Mean length: 3.2957746
Min length: 1

Unique

Unique: 1
Unique (%): 1.4%

Sample

1st row: 종계업
2nd row: 종계업
3rd row: 종계업
4th row: 종계업
5th row: 종계업

Common Values

Value | Count | Frequency (%)
육계 | 34 | 47.9%
종계/산란계 | 20 | 28.2%
종계업 | 11 | 15.5%
오리 | 3 | 4.2%
메추리 | 2 | 2.8%
(no label shown) | 1 | 1.4%
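
The frequency column is each count divided by the 71 observations. A minimal sketch of that calculation, using the counts from the table; the key `"<unlabeled>"` is a placeholder for the one row whose label is not shown in the report:

```python
# Counts copied from the Common Values table for 축종명.
counts = {
    "육계": 34,
    "종계/산란계": 20,
    "종계업": 11,
    "오리": 3,
    "메추리": 2,
    "<unlabeled>": 1,  # placeholder: the report shows no label for this row
}

total = sum(counts.values())
frequency_pct = {value: round(100 * n / total, 1) for value, n in counts.items()}

for value, pct in frequency_pct.items():
    print(f"{value}: {pct}%")
```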

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

상세구분
Categorical

HIGH CORRELATION 

Distinct: 7
Distinct (%): 9.9%
Missing: 0
Missing (%): 0.0%
Memory size: 700.0 B
육계: 31
종계/산란계: 20
종계: 11
오리: 3
종계/산란계, 육계: 3
Other values (2): 3

Length

Max length: 10
Median length: 2
Mean length: 3.4788732
Min length: 1

Unique

Unique: 1
Unique (%): 1.4%

Sample

1st row: 종계
2nd row: 종계
3rd row: 종계
4th row: 종계
5th row: 종계

Common Values

Value | Count | Frequency (%)
육계 | 31 | 43.7%
종계/산란계 | 20 | 28.2%
종계 | 11 | 15.5%
오리 | 3 | 4.2%
종계/산란계, 육계 | 3 | 4.2%
메추리 | 2 | 2.8%
(no label shown) | 1 | 1.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
육계 | 34 | 45.9%
종계/산란계 | 23 | 31.1%
종계 | 11 | 14.9%
오리 | 3 | 4.1%
메추리 | 2 | 2.7%
(no label shown) | 1 | 1.4%

사육두수
Real number (ℝ)

Distinct: 50
Distinct (%): 70.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 62326.493
Minimum: 30
Maximum: 600000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 771.0 B

Quantile statistics

Minimum: 30
5th percentile: 1575
Q1: 20000
Median: 35000
Q3: 69000
95th percentile: 209652
Maximum: 600000
Range: 599970
Interquartile range (IQR): 49000

Descriptive statistics

Standard deviation: 94363.733
Coefficient of variation (CV): 1.5140228
Kurtosis: 19.304933
Mean: 62326.493
Median Absolute Deviation (MAD): 25000
Skewness: 4.0792938
Sum: 4425181
Variance: 8.9045142 × 10^9
Monotonicity: Not monotonic
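
Several of the quantile and descriptive statistics for 사육두수 are simple functions of the others. The sketch below re-derives them from the reported figures, assuming the usual definitions (range = max - min, IQR = Q3 - Q1, mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared). Because the inputs are rounded, the results agree with the report only up to rounding.

```python
# Figures copied from the statistics reported for 사육두수.
n = 71
minimum, q1, q3, maximum = 30, 20000, 69000, 600000
total = 4425181
std = 94363.733

value_range = maximum - minimum   # reported: 599970
iqr = q3 - q1                     # reported: 49000
mean = total / n                  # reported: 62326.493
cv = std / mean                   # reported: 1.5140228
variance = std ** 2               # reported: 8.9045142e9

print(value_range, iqr)
print(f"{mean:.3f} {cv:.7f} {variance:.7e}")
```

A CV above 1, together with a mean well above the median of 35000, is consistent with the strong right skew reported above.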
Histogram with fixed size bins (bins=50)

Value | Count | Frequency (%)
30000 | 5 | 7.0%
75000 | 4 | 5.6%
40000 | 4 | 5.6%
35000 | 3 | 4.2%
33000 | 2 | 2.8%
60000 | 2 | 2.8%
27000 | 2 | 2.8%
68000 | 2 | 2.8%
65000 | 2 | 2.8%
10000 | 2 | 2.8%
Other values (40) | 43 | 60.6%

Value | Count | Frequency (%)
30 | 1 | 1.4%
140 | 1 | 1.4%
150 | 2 | 2.8%
3000 | 1 | 1.4%
5000 | 2 | 2.8%
8000 | 1 | 1.4%
9000 | 1 | 1.4%
10000 | 2 | 2.8%
10300 | 1 | 1.4%
13800 | 1 | 1.4%

Value | Count | Frequency (%)
600000 | 1 | 1.4%
474560 | 1 | 1.4%
222500 | 1 | 1.4%
213804 | 1 | 1.4%
205500 | 1 | 1.4%
137651 | 1 | 1.4%
127000 | 1 | 1.4%
105000 | 1 | 1.4%
100890 | 1 | 1.4%
95000 | 1 | 1.4%

소재지도로명주소
Text

Distinct: 62
Distinct (%): 98.4%
Missing: 8
Missing (%): 11.3%
Memory size: 700.0 B

Length

Max length: 29
Median length: 27
Mean length: 24.761905
Min length: 18

Characters and Unicode

Total characters: 1560
Distinct characters: 78
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 61
Unique (%): 96.8%

Sample

1st row: 경기도 이천시 백사면 원적로532번길 199
2nd row: 경기도 이천시 설성면 설가로219번길 325-143
3rd row: 경기도 이천시 장호원읍 어석로 260
4th row: 경기도 이천시 대월면 양녕로51번길 33
5th row: 경기도 이천시 설성면 원설로112번길 36

Value | Count | Frequency (%)
경기도 | 63 | 20.1%
이천시 | 63 | 20.1%
설성면 | 12 | 3.8%
장호원읍 | 12 | 3.8%
백사면 | 9 | 2.9%
율면 | 7 | 2.2%
부발읍 | 6 | 1.9%
마장면 | 5 | 1.6%
대월면 | 4 | 1.3%
호법면 | 3 | 1.0%
Other values (114) | 130 | 41.4%

Most occurring characters

ValueCountFrequency (%)
251
 
16.1%
1 70
 
4.5%
68
 
4.4%
67
 
4.3%
66
 
4.2%
64
 
4.1%
63
 
4.0%
63
 
4.0%
63
 
4.0%
2 63
 
4.0%
Other values (68) 722
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 878
56.3%
Decimal Number 395
25.3%
Space Separator 251
 
16.1%
Dash Punctuation 36
 
2.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
68
 
7.7%
67
 
7.6%
66
 
7.5%
64
 
7.3%
63
 
7.2%
63
 
7.2%
63
 
7.2%
49
 
5.6%
49
 
5.6%
44
 
5.0%
Other values (56) 282
32.1%
Decimal Number
ValueCountFrequency (%)
1 70
17.7%
2 63
15.9%
5 41
10.4%
9 40
10.1%
3 38
9.6%
6 37
9.4%
8 32
8.1%
0 29
7.3%
4 27
 
6.8%
7 18
 
4.6%
Space Separator
ValueCountFrequency (%)
251
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 878
56.3%
Common 682
43.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
68
 
7.7%
67
 
7.6%
66
 
7.5%
64
 
7.3%
63
 
7.2%
63
 
7.2%
63
 
7.2%
49
 
5.6%
49
 
5.6%
44
 
5.0%
Other values (56) 282
32.1%
Common
ValueCountFrequency (%)
251
36.8%
1 70
 
10.3%
2 63
 
9.2%
5 41
 
6.0%
9 40
 
5.9%
3 38
 
5.6%
6 37
 
5.4%
- 36
 
5.3%
8 32
 
4.7%
0 29
 
4.3%
Other values (2) 45
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 878
56.3%
ASCII 682
43.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
251
36.8%
1 70
 
10.3%
2 63
 
9.2%
5 41
 
6.0%
9 40
 
5.9%
3 38
 
5.6%
6 37
 
5.4%
- 36
 
5.3%
8 32
 
4.7%
0 29
 
4.3%
Other values (2) 45
 
6.6%
Hangul
ValueCountFrequency (%)
68
 
7.7%
67
 
7.6%
66
 
7.5%
64
 
7.3%
63
 
7.2%
63
 
7.2%
63
 
7.2%
49
 
5.6%
49
 
5.6%
44
 
5.0%
Other values (56) 282
32.1%

소재지지번주소
Text

Distinct: 71
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 700.0 B

Length

Max length: 23
Median length: 22
Mean length: 20.211268
Min length: 14

Characters and Unicode

Total characters: 1435
Distinct characters: 86
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 71
Unique (%): 100.0%

Sample

1st row: 경기도 이천시 백사면 신대리 391-4
2nd row: 경기도 이천시 백사면 신대리 414-1
3rd row: 경기도 이천시 설성면 상봉리 481-1
4th row: 경기도 이천시 장호원읍 어석리 77-1
5th row: 경기도 이천시 대월면 군량리 620-6

Value | Count | Frequency (%)
경기도 | 71 | 20.1%
이천시 | 71 | 20.1%
설성면 | 13 | 3.7%
장호원읍 | 13 | 3.7%
백사면 | 10 | 2.8%
율면 | 9 | 2.5%
부발읍 | 7 | 2.0%
신대리 | 5 | 1.4%
마장면 | 5 | 1.4%
와현리 | 5 | 1.4%
Other values (117) | 145 | 41.0%

Most occurring characters

ValueCountFrequency (%)
283
19.7%
74
 
5.2%
73
 
5.1%
73
 
5.1%
72
 
5.0%
72
 
5.0%
71
 
4.9%
70
 
4.9%
50
 
3.5%
1 43
 
3.0%
Other values (76) 554
38.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 854
59.5%
Space Separator 283
 
19.7%
Decimal Number 256
 
17.8%
Dash Punctuation 42
 
2.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
74
 
8.7%
73
 
8.5%
73
 
8.5%
72
 
8.4%
72
 
8.4%
71
 
8.3%
70
 
8.2%
50
 
5.9%
24
 
2.8%
22
 
2.6%
Other values (64) 253
29.6%
Decimal Number
ValueCountFrequency (%)
1 43
16.8%
2 34
13.3%
4 33
12.9%
7 33
12.9%
3 28
10.9%
8 22
8.6%
6 21
8.2%
9 16
 
6.2%
5 15
 
5.9%
0 11
 
4.3%
Space Separator
ValueCountFrequency (%)
283
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 854
59.5%
Common 581
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
74
 
8.7%
73
 
8.5%
73
 
8.5%
72
 
8.4%
72
 
8.4%
71
 
8.3%
70
 
8.2%
50
 
5.9%
24
 
2.8%
22
 
2.6%
Other values (64) 253
29.6%
Common
ValueCountFrequency (%)
283
48.7%
1 43
 
7.4%
- 42
 
7.2%
2 34
 
5.9%
4 33
 
5.7%
7 33
 
5.7%
3 28
 
4.8%
8 22
 
3.8%
6 21
 
3.6%
9 16
 
2.8%
Other values (2) 26
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 854
59.5%
ASCII 581
40.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
283
48.7%
1 43
 
7.4%
- 42
 
7.2%
2 34
 
5.9%
4 33
 
5.7%
7 33
 
5.7%
3 28
 
4.8%
8 22
 
3.8%
6 21
 
3.6%
9 16
 
2.8%
Other values (2) 26
 
4.5%
Hangul
ValueCountFrequency (%)
74
 
8.7%
73
 
8.5%
73
 
8.5%
72
 
8.4%
72
 
8.4%
71
 
8.3%
70
 
8.2%
50
 
5.9%
24
 
2.8%
22
 
2.6%
Other values (64) 253
29.6%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Memory size: 700.0 B
2023-06-02: 71

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-06-02
2nd row: 2023-06-02
3rd row: 2023-06-02
4th row: 2023-06-02
5th row: 2023-06-02

Common Values

Value | Count | Frequency (%)
2023-06-02 | 71 | 100.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values listed above]

Interactions

[Figure: interactions plot]

Correlations

Variable | 농장명 | 축종명 | 상세구분 | 사육두수 | 소재지도로명주소 | 소재지지번주소
농장명 | 1.000 | 0.984 | 0.988 | 1.000 | 0.993 | 1.000
축종명 | 0.984 | 1.000 | 1.000 | 0.613 | 1.000 | 1.000
상세구분 | 0.988 | 1.000 | 1.000 | 0.484 | 1.000 | 1.000
사육두수 | 1.000 | 0.613 | 0.484 | 1.000 | 0.972 | 1.000
소재지도로명주소 | 0.993 | 1.000 | 1.000 | 0.972 | 1.000 | 1.000
소재지지번주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Variable | 상세구분 | 축종명
상세구분 | 1.000 | 0.992
축종명 | 0.992 | 1.000

Variable | 사육두수 | 축종명 | 상세구분
사육두수 | 1.000 | 0.259 | 0.309
축종명 | 0.259 | 1.000 | 0.992
상세구분 | 0.309 | 0.992 | 1.000
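
The report does not state which coefficient each matrix uses. As one common measure of association between two categorical columns, the sketch below computes a plain (not bias-corrected) Cramér's V, which is an assumption for illustration and may differ from the profiler's own calculation. It is applied only to the (축종명, 상세구분) pairs of the 20 rows listed in the Sample section, not to the full dataset; `cramers_v` is a hypothetical helper.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Plain (not bias-corrected) Cramér's V for two equal-length label lists."""
    n = len(xs)
    rows, cols = Counter(xs), Counter(ys)
    joint = Counter(zip(xs, ys))
    chi2 = 0.0
    for x, row_total in rows.items():
        for y, col_total in cols.items():
            expected = row_total * col_total / n
            chi2 += (joint[(x, y)] - expected) ** 2 / expected
    k = min(len(rows), len(cols)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# (축종명, 상세구분) pairs from the 20 rows listed in the Sample section.
pairs = (
    [("종계업", "종계")] * 10
    + [("육계", "육계")] * 5
    + [("종계/산란계", "종계/산란계")] * 5
)
species, detail = zip(*pairs)
v_sample = cramers_v(species, detail)
print(round(v_sample, 3))
```

In these sample rows each 축종명 value maps to exactly one 상세구분 value, so the association is perfect; the full data contains a few mixed rows, which is consistent with a reported value slightly below 1.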

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

Row | 시군명 | 농장명 | 축종명 | 상세구분 | 사육두수 | 소재지도로명주소 | 소재지지번주소 | 데이터기준일자
0 | 이천시 | 동원종계장 | 종계업 | 종계 | 25500 | 경기도 이천시 백사면 원적로532번길 199 | 경기도 이천시 백사면 신대리 391-4 | 2023-06-02
1 | 이천시 | 이동종계장 | 종계업 | 종계 | 16500 | <NA> | 경기도 이천시 백사면 신대리 414-1 | 2023-06-02
2 | 이천시 | 상봉농장 | 종계업 | 종계 | 16200 | 경기도 이천시 설성면 설가로219번길 325-143 | 경기도 이천시 설성면 상봉리 481-1 | 2023-06-02
3 | 이천시 | 농업회사법인 한국양계티에스(주) 1종계장 | 종계업 | 종계 | 222500 | 경기도 이천시 장호원읍 어석로 260 | 경기도 이천시 장호원읍 어석리 77-1 | 2023-06-02
4 | 이천시 | 농업회사법인 조인팜스주식회사 이천지점 | 종계업 | 종계 | 17050 | 경기도 이천시 대월면 양녕로51번길 33 | 경기도 이천시 대월면 군량리 620-6 | 2023-06-02
5 | 이천시 | 농업회사법인 조인팜스주식회사 이천지점 | 종계업 | 종계 | 13800 | 경기도 이천시 설성면 원설로112번길 36 | 경기도 이천시 설성면 대죽리 776 | 2023-06-02
6 | 이천시 | 태경축산 | 종계업 | 종계 | 22000 | 경기도 이천시 장호원읍 이풍로426번길 95-100 | 경기도 이천시 장호원읍 와현리 417-20 | 2023-06-02
7 | 이천시 | 호현농장 | 종계업 | 종계 | 22500 | 경기도 이천시 장호원읍 이풍로426번길 95-49 | 경기도 이천시 장호원읍 와현리 388-4 | 2023-06-02
8 | 이천시 | 강화농장 | 종계업 | 종계 | 24154 | 경기도 이천시 장호원읍 이풍로476번길 166 | 경기도 이천시 장호원읍 풍계리 862-15 | 2023-06-02
9 | 이천시 | 상진농장 | 종계업 | 종계 | 14000 | 경기도 이천시 장호원읍 경충대로307번길 423-99 | 경기도 이천시 장호원읍 선읍리 481-2 | 2023-06-02

Row | 시군명 | 농장명 | 축종명 | 상세구분 | 사육두수 | 소재지도로명주소 | 소재지지번주소 | 데이터기준일자
61 | 이천시 | 희망축산 | 육계 | 육계 | 150 | 경기도 이천시 백사면 원적로618번길 300-82 | 경기도 이천시 백사면 신대리 291 | 2023-06-02
62 | 이천시 | 수광농장 | 육계 | 육계 | 140 | 경기도 이천시 백사면 청백리로393번길 520-31 | 경기도 이천시 백사면 현방리 393-1 | 2023-06-02
63 | 이천시 | 광림농원 | 종계/산란계 | 종계/산란계 | 150 | <NA> | 경기도 이천시 장호원읍 선읍리 산131 | 2023-06-02
64 | 이천시 | 북두농장 | 육계 | 육계 | 64500 | 경기도 이천시 율면 주래본죽로612번길 113-20 | 경기도 이천시 율면 북두리 271-1 | 2023-06-02
65 | 이천시 | 농업회사법인 골드아이 유한회사 | 종계/산란계 | 종계/산란계 | 213804 | 경기도 이천시 설성면 진상미로264번길 251-61 | 경기도 이천시 설성면 행죽리 779-1 | 2023-06-02
66 | 이천시 | 천왕농원 | 종계/산란계 | 종계/산란계 | 38000 | 경기도 이천시 율면 금율로554번길 166 | 경기도 이천시 율면 산성리 474 | 2023-06-02
67 | 이천시 | 비에프팜 | 종계/산란계 | 종계/산란계 | 8000 | <NA> | 경기도 이천시 부발읍 가산리 425 | 2023-06-02
68 | 이천시 | 솔잎농장 | 육계 | 육계 | 205500 | 경기도 이천시 마장면 이장로 107-62 | 경기도 이천시 마장면 이치리 172-6 | 2023-06-02
69 | 이천시 | 복평농장 | 육계 | 육계 | 28000 | 경기도 이천시 설성면 진상미로924번길 160-12 | 경기도 이천시 설성면 수산리 224-11 | 2023-06-02
70 | 이천시 | 설성호두농원 | 종계/산란계 | 종계/산란계 | 30 | <NA> | 경기도 이천시 설성면 장능리 260-18 | 2023-06-02