Overview

Dataset statistics

Number of variables: 11
Number of observations: 6718
Missing cells: 17940
Missing cells (%): 24.3%
Duplicate rows: 30
Duplicate rows (%): 0.4%
Total size in memory: 597.1 KiB
Average record size in memory: 91.0 B
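
The percentage and per-record figures above follow from the raw counts. The short check below recomputes them; it is only a sketch, and the variable names are illustrative rather than taken from the profiling tool.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 11
n_observations = 6718
missing_cells = 17940
duplicate_rows = 30
total_size_kib = 597.1

# Every observation has one cell per variable.
total_cells = n_variables * n_observations

missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}")
print(f"Duplicate rows (%): {duplicate_pct:.1f}")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The results agree with the reported 24.3%, 0.4% and 91.0 B after rounding to one decimal place.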

Variable types

Categorical: 3
Text: 3
DateTime: 2
Numeric: 3

Dataset

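Description: Status of livestock product import and sales businesses (축산물 수입 판매업체 현황)
Author: Ministry of the Interior and Safety (행정안전부)
URL: https://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=DK84V9S1I7CB2GWI0V0J308050&infSeq=1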

Alerts

Dataset has 30 (0.4%) duplicate rows [Duplicates]
소재지우편번호 is highly overall correlated with WGS84위도 and 1 other field [High correlation]
WGS84위도 is highly overall correlated with 소재지우편번호 and 1 other field [High correlation]
WGS84경도 is highly overall correlated with 시군명 [High correlation]
시군명 is highly overall correlated with 소재지우편번호 and 2 other fields [High correlation]
축산업무구분명 is highly imbalanced (55.6%) [Imbalance]
폐업일자 has 5270 (78.4%) missing values [Missing]
소재지도로명주소 has 1380 (20.5%) missing values [Missing]
소재지우편번호 has 2477 (36.9%) missing values [Missing]
WGS84위도 has 4402 (65.5%) missing values [Missing]
WGS84경도 has 4402 (65.5%) missing values [Missing]
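
The report does not say how the 55.6% imbalance figure is derived. One definition that comes out close to it is one minus the Shannon entropy of the category counts, normalised by the maximum entropy for that number of categories. The sketch below assumes that definition; the function name is hypothetical and the counts are the seven 축산업무구분명 values listed in this report.

```python
import math

# Category counts for 축산업무구분명, copied from its "Common Values" table.
counts = [4729, 1358, 498, 56, 49, 15, 13]


def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform split, 1 for a single class.

    Assumed definition; H is the Shannon entropy in bits and k is the
    number of categories.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


score = imbalance_score(counts)
print(f"Imbalance: {score:.1%}")
```

Under this assumption, the score is driven mainly by 가축사육업 holding about 70% of the rows.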

Reproduction

Analysis started: 2023-12-10 21:23:33.331542
Analysis finished: 2023-12-10 21:23:36.185353
Duration: 2.85 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

시군명 (city/county name)
Categorical

HIGH CORRELATION

Distinct: 31
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 52.6 KiB

안성시: 833
화성시: 769
포천시: 582
파주시: 567
여주시: 326
Other values (26): 3641

Length

Max length: 4
Median length: 3
Mean length: 3.0462935
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가평군
2nd row: 가평군
3rd row: 가평군
4th row: 가평군
5th row: 가평군

Common Values

Value               Count   Frequency (%)
안성시              833     12.4%
화성시              769     11.4%
포천시              582     8.7%
파주시              567     8.4%
여주시              326     4.9%
용인시              322     4.8%
이천시              308     4.6%
고양시              291     4.3%
양주시              284     4.2%
연천군              276     4.1%
Other values (21)   2160    32.2%

Length

[Figure: histogram of lengths of the category]

사업장명 (business name)
Text

Distinct: 5643
Distinct (%): 84.0%
Missing: 0
Missing (%): 0.0%
Memory size: 52.6 KiB

Length

Max length: 31
Median length: 4
Mean length: 5.293242
Min length: 1

Characters and Unicode

Total characters: 35560
Distinct characters: 807
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
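
As an illustration of those character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below counts categories for one business name from the Sample section of this report. It is not the profiling tool's own implementation. The two-letter codes correspond to the category names used in this report: `Lo` is Other Letter, `Zs` is Space Separator, `Ps` is Open Punctuation and `Pe` is Close Punctuation.

```python
import unicodedata
from collections import Counter

# One business name taken from the Sample section of this report.
name = "민서네 꿩(약초)농장"

# unicodedata.category returns the two-letter general category per character.
categories = Counter(unicodedata.category(ch) for ch in name)

for code, count in categories.most_common():
    print(code, count)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category tables for the text variables.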

Unique

Unique: 4999
Unique (%): 74.4%

Sample

1st row: 위곡농장
2nd row: 세화수입정육점
3rd row: 금강닭수입소고기전문판매점
4th row: 백두농장
5th row: 오성농장
ValueCountFrequency (%)
주식회사 250
 
3.3%
102
 
1.4%
농업회사법인 74
 
1.0%
농장 56
 
0.7%
목장 35
 
0.5%
우리농장 14
 
0.2%
대성농장 13
 
0.2%
유한회사 12
 
0.2%
한우농장 10
 
0.1%
형제농장 10
 
0.1%
Other values (5850) 6937
92.3%

Most occurring characters

ValueCountFrequency (%)
4297
 
12.1%
2661
 
7.5%
1788
 
5.0%
1091
 
3.1%
796
 
2.2%
) 772
 
2.2%
( 769
 
2.2%
666
 
1.9%
552
 
1.6%
421
 
1.2%
Other values (797) 21747
61.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31980
89.9%
Space Separator 797
 
2.2%
Close Punctuation 772
 
2.2%
Open Punctuation 769
 
2.2%
Uppercase Letter 487
 
1.4%
Lowercase Letter 369
 
1.0%
Decimal Number 230
 
0.6%
Dash Punctuation 113
 
0.3%
Other Punctuation 33
 
0.1%
Other Symbol 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4297
 
13.4%
2661
 
8.3%
1788
 
5.6%
1091
 
3.4%
666
 
2.1%
552
 
1.7%
421
 
1.3%
396
 
1.2%
374
 
1.2%
349
 
1.1%
Other values (721) 19385
60.6%
Uppercase Letter
ValueCountFrequency (%)
O 46
 
9.4%
E 44
 
9.0%
D 39
 
8.0%
T 36
 
7.4%
A 33
 
6.8%
P 26
 
5.3%
C 25
 
5.1%
G 24
 
4.9%
N 23
 
4.7%
S 20
 
4.1%
Other values (15) 171
35.1%
Lowercase Letter
ValueCountFrequency (%)
e 47
12.7%
o 43
 
11.7%
a 34
 
9.2%
n 25
 
6.8%
t 22
 
6.0%
i 19
 
5.1%
r 18
 
4.9%
l 17
 
4.6%
d 17
 
4.6%
m 17
 
4.6%
Other values (14) 110
29.8%
Decimal Number
ValueCountFrequency (%)
2 145
63.0%
1 42
 
18.3%
3 17
 
7.4%
4 9
 
3.9%
0 4
 
1.7%
5 4
 
1.7%
9 4
 
1.7%
6 3
 
1.3%
7 1
 
0.4%
8 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
. 14
42.4%
& 6
18.2%
, 3
 
9.1%
! 3
 
9.1%
* 2
 
6.1%
· 2
 
6.1%
? 1
 
3.0%
/ 1
 
3.0%
; 1
 
3.0%
Space Separator
ValueCountFrequency (%)
796
99.9%
  1
 
0.1%
Letter Number
ValueCountFrequency (%)
2
66.7%
1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 772
100.0%
Open Punctuation
ValueCountFrequency (%)
( 769
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 113
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31984
89.9%
Common 2714
 
7.6%
Latin 859
 
2.4%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4297
 
13.4%
2661
 
8.3%
1788
 
5.6%
1091
 
3.4%
666
 
2.1%
552
 
1.7%
421
 
1.3%
396
 
1.2%
374
 
1.2%
349
 
1.1%
Other values (719) 19389
60.6%
Latin
ValueCountFrequency (%)
e 47
 
5.5%
O 46
 
5.4%
E 44
 
5.1%
o 43
 
5.0%
D 39
 
4.5%
T 36
 
4.2%
a 34
 
4.0%
A 33
 
3.8%
P 26
 
3.0%
C 25
 
2.9%
Other values (41) 486
56.6%
Common
ValueCountFrequency (%)
796
29.3%
) 772
28.4%
( 769
28.3%
2 145
 
5.3%
- 113
 
4.2%
1 42
 
1.5%
3 17
 
0.6%
. 14
 
0.5%
4 9
 
0.3%
& 6
 
0.2%
Other values (14) 31
 
1.1%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31977
89.9%
ASCII 3567
 
10.0%
None 10
 
< 0.1%
Number Forms 3
 
< 0.1%
CJK 3
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4297
 
13.4%
2661
 
8.3%
1788
 
5.6%
1091
 
3.4%
666
 
2.1%
552
 
1.7%
421
 
1.3%
396
 
1.2%
374
 
1.2%
349
 
1.1%
Other values (718) 19382
60.6%
ASCII
ValueCountFrequency (%)
796
22.3%
) 772
21.6%
( 769
21.6%
2 145
 
4.1%
- 113
 
3.2%
e 47
 
1.3%
O 46
 
1.3%
E 44
 
1.2%
o 43
 
1.2%
1 42
 
1.2%
Other values (61) 750
21.0%
None
ValueCountFrequency (%)
7
70.0%
· 2
 
20.0%
  1
 
10.0%
Number Forms
ValueCountFrequency (%)
2
66.7%
1
33.3%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

인허가일자 (license/permit date)
Date

Distinct: 2901
Distinct (%): 43.2%
Missing: 0
Missing (%): 0.0%
Memory size: 52.6 KiB
Minimum: 1977-03-26 00:00:00
Maximum: 2023-11-28 00:00:00

[Figure: histogram with fixed size bins (bins=50)]

영업상태명 (business status)
Categorical

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 52.6 KiB

정상: 4357
폐업: 1312
운영중: 352
말소: 307
휴업: 244
Other values (2): 146

Length

Max length: 4
Median length: 2
Mean length: 2.0958619
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 말소
2nd row: 운영중
3rd row: 운영중
4th row: 정상
5th row: 정상

Common Values

Value                        Count   Frequency (%)
정상 (normal)                4357    64.9%
폐업 (closed)                1312    19.5%
운영중 (operating)           352     5.2%
말소 (deregistered)          307     4.6%
휴업 (suspended)             244     3.6%
폐업 등 (closed, etc.)       143     2.1%
휴업 등 (suspended, etc.)    3       < 0.1%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of common values]

Words

Value    Count   Frequency (%)
정상     4357    63.5%
폐업     1455    21.2%
운영중   352     5.1%
말소     307     4.5%
휴업     247     3.6%
등       146     2.1%

폐업일자 (closure date)
Date

MISSING

Distinct: 627
Distinct (%): 43.3%
Missing: 5270
Missing (%): 78.4%
Memory size: 52.6 KiB
Minimum: 1999-10-01 00:00:00
Maximum: 2023-11-28 00:00:00

[Figure: histogram with fixed size bins (bins=50)]

축산업무구분명 (livestock business type)
Categorical

IMBALANCE

Distinct: 7
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 52.6 KiB

가축사육업: 4729
사료제조업: 1358
축산물판매업: 498
종축업: 56
가축인공수정소: 49
Other values (2): 28

Length

Max length: 7
Median length: 5
Mean length: 5.0637094
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가축사육업
2nd row: 축산물판매업
3rd row: 축산물판매업
4th row: 가축사육업
5th row: 가축사육업

Common Values

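Value                                             Count   Frequency (%)
가축사육업 (livestock farming)                    4729    70.4%
사료제조업 (feed manufacturing)                   1358    20.2%
축산물판매업 (livestock product sales)            498     7.4%
종축업 (breeding stock)                           56      0.8%
가축인공수정소 (artificial insemination center)   49      0.7%
부화업 (hatchery)                                 15      0.2%
도축업 (slaughtering)                             13      0.2%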

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: bar chart of common values]

소재지도로명주소 (road-name address)
Text

MISSING

Distinct: 4438
Distinct (%): 83.1%
Missing: 1380
Missing (%): 20.5%
Memory size: 52.6 KiB

Length

Max length: 69
Median length: 58
Mean length: 26.046272
Min length: 14

Characters and Unicode

Total characters: 139035
Distinct characters: 603
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3824
Unique (%): 71.6%

Sample

1st row: 경기도 가평군 설악면 한서로 ***-*
2nd row: 경기도 가평군 가평읍 가화로 88
3rd row: 경기도 가평군 가평읍 잎너비길 **-**
4th row: 경기도 가평군 상면 아랫벌길 **-**
5th row: 경기도 가평군 설악면 한서로***번길 ***
ValueCountFrequency (%)
경기도 5338
 
17.9%
3178
 
10.6%
안성시 713
 
2.4%
화성시 590
 
2.0%
포천시 504
 
1.7%
파주시 293
 
1.0%
용인시 284
 
1.0%
여주시 281
 
0.9%
이천시 273
 
0.9%
양주시 247
 
0.8%
Other values (5533) 18182
60.8%

Most occurring characters

ValueCountFrequency (%)
24552
 
17.7%
* 14947
 
10.8%
5589
 
4.0%
5519
 
4.0%
5436
 
3.9%
5105
 
3.7%
3681
 
2.6%
3384
 
2.4%
2806
 
2.0%
- 2672
 
1.9%
Other values (593) 65344
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 79965
57.5%
Space Separator 24552
 
17.7%
Other Punctuation 16355
 
11.8%
Decimal Number 11610
 
8.4%
Dash Punctuation 2672
 
1.9%
Open Punctuation 1744
 
1.3%
Close Punctuation 1744
 
1.3%
Uppercase Letter 330
 
0.2%
Lowercase Letter 53
 
< 0.1%
Letter Number 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5589
 
7.0%
5519
 
6.9%
5436
 
6.8%
5105
 
6.4%
3681
 
4.6%
3384
 
4.2%
2806
 
3.5%
2619
 
3.3%
1991
 
2.5%
1659
 
2.1%
Other values (528) 42176
52.7%
Uppercase Letter
ValueCountFrequency (%)
B 58
17.6%
A 58
17.6%
C 27
 
8.2%
I 27
 
8.2%
T 21
 
6.4%
E 18
 
5.5%
S 16
 
4.8%
K 14
 
4.2%
R 12
 
3.6%
G 8
 
2.4%
Other values (16) 71
21.5%
Lowercase Letter
ValueCountFrequency (%)
e 13
24.5%
m 6
11.3%
p 5
 
9.4%
r 5
 
9.4%
a 5
 
9.4%
n 5
 
9.4%
c 4
 
7.5%
t 4
 
7.5%
b 3
 
5.7%
o 2
 
3.8%
Decimal Number
ValueCountFrequency (%)
1 2635
22.7%
2 1653
14.2%
3 1268
10.9%
0 1102
9.5%
4 1049
 
9.0%
5 949
 
8.2%
6 791
 
6.8%
7 761
 
6.6%
8 759
 
6.5%
9 643
 
5.5%
Other Punctuation
ValueCountFrequency (%)
* 14947
91.4%
, 1381
 
8.4%
. 9
 
0.1%
& 6
 
< 0.1%
; 4
 
< 0.1%
: 3
 
< 0.1%
/ 3
 
< 0.1%
@ 2
 
< 0.1%
Letter Number
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
Open Punctuation
ValueCountFrequency (%)
( 1740
99.8%
[ 4
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 1740
99.8%
] 4
 
0.2%
Space Separator
ValueCountFrequency (%)
24552
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2672
100.0%
Math Symbol
ValueCountFrequency (%)
~ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 79965
57.5%
Common 58681
42.2%
Latin 389
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5589
 
7.0%
5519
 
6.9%
5436
 
6.8%
5105
 
6.4%
3681
 
4.6%
3384
 
4.2%
2806
 
3.5%
2619
 
3.3%
1991
 
2.5%
1659
 
2.1%
Other values (528) 42176
52.7%
Latin
ValueCountFrequency (%)
B 58
14.9%
A 58
14.9%
C 27
 
6.9%
I 27
 
6.9%
T 21
 
5.4%
E 18
 
4.6%
S 16
 
4.1%
K 14
 
3.6%
e 13
 
3.3%
R 12
 
3.1%
Other values (30) 125
32.1%
Common
ValueCountFrequency (%)
24552
41.8%
* 14947
25.5%
- 2672
 
4.6%
1 2635
 
4.5%
( 1740
 
3.0%
) 1740
 
3.0%
2 1653
 
2.8%
, 1381
 
2.4%
3 1268
 
2.2%
0 1102
 
1.9%
Other values (15) 4991
 
8.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 79965
57.5%
ASCII 59064
42.5%
Number Forms 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
24552
41.6%
* 14947
25.3%
- 2672
 
4.5%
1 2635
 
4.5%
( 1740
 
2.9%
) 1740
 
2.9%
2 1653
 
2.8%
, 1381
 
2.3%
3 1268
 
2.1%
0 1102
 
1.9%
Other values (52) 5374
 
9.1%
Hangul
ValueCountFrequency (%)
5589
 
7.0%
5519
 
6.9%
5436
 
6.8%
5105
 
6.4%
3681
 
4.6%
3384
 
4.2%
2806
 
3.5%
2619
 
3.3%
1991
 
2.5%
1659
 
2.1%
Other values (528) 42176
52.7%
Number Forms
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%

소재지지번주소 (lot-number address)
Text

Distinct: 5165
Distinct (%): 77.0%
Missing: 9
Missing (%): 0.1%
Memory size: 52.6 KiB

Length

Max length: 81
Median length: 77
Mean length: 24.270681
Min length: 1

Characters and Unicode

Total characters: 162832
Distinct characters: 516
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 4310
Unique (%): 64.2%

Sample

1st row: 경기도 가평군 설악면 위곡리 **-*
2nd row: 경기도 가평군 청평면 청평리 440-1번지
3rd row: 경기도 가평군 가평읍 읍내리 475-19번지
4th row: 경기도 가평군 가평읍 읍내리 ***-*
5th row: 경기도 가평군 상면 태봉리 **-*
ValueCountFrequency (%)
경기도 6701
 
18.3%
5390
 
14.7%
안성시 833
 
2.3%
화성시 769
 
2.1%
포천시 581
 
1.6%
파주시 567
 
1.5%
여주시 326
 
0.9%
용인시 319
 
0.9%
이천시 308
 
0.8%
고양시 291
 
0.8%
Other values (4625) 20539
56.1%

Most occurring characters

ValueCountFrequency (%)
35881
22.0%
* 20718
 
12.7%
7033
 
4.3%
6833
 
4.2%
6731
 
4.1%
6174
 
3.8%
- 5376
 
3.3%
5093
 
3.1%
3664
 
2.3%
2719
 
1.7%
Other values (506) 62610
38.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 87067
53.5%
Space Separator 35881
22.0%
Other Punctuation 22184
 
13.6%
Decimal Number 11520
 
7.1%
Dash Punctuation 5376
 
3.3%
Uppercase Letter 263
 
0.2%
Close Punctuation 243
 
0.1%
Open Punctuation 242
 
0.1%
Lowercase Letter 49
 
< 0.1%
Letter Number 6
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7033
 
8.1%
6833
 
7.8%
6731
 
7.7%
6174
 
7.1%
5093
 
5.8%
3664
 
4.2%
2719
 
3.1%
2266
 
2.6%
1864
 
2.1%
1768
 
2.0%
Other values (444) 42922
49.3%
Uppercase Letter
ValueCountFrequency (%)
B 42
16.0%
A 36
13.7%
I 25
9.5%
C 22
 
8.4%
T 20
 
7.6%
E 16
 
6.1%
S 13
 
4.9%
R 11
 
4.2%
K 10
 
3.8%
U 7
 
2.7%
Other values (16) 61
23.2%
Lowercase Letter
ValueCountFrequency (%)
e 13
26.5%
r 5
 
10.2%
n 5
 
10.2%
m 5
 
10.2%
p 4
 
8.2%
a 4
 
8.2%
c 4
 
8.2%
t 4
 
8.2%
o 2
 
4.1%
b 2
 
4.1%
Decimal Number
ValueCountFrequency (%)
1 2353
20.4%
2 1500
13.0%
3 1254
10.9%
0 1099
9.5%
4 1046
9.1%
5 1038
9.0%
6 937
 
8.1%
7 819
 
7.1%
8 774
 
6.7%
9 700
 
6.1%
Other Punctuation
ValueCountFrequency (%)
* 20718
93.4%
, 1440
 
6.5%
. 12
 
0.1%
& 5
 
< 0.1%
/ 4
 
< 0.1%
; 3
 
< 0.1%
@ 2
 
< 0.1%
Letter Number
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%
Space Separator
ValueCountFrequency (%)
35881
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5376
100.0%
Close Punctuation
ValueCountFrequency (%)
) 243
100.0%
Open Punctuation
ValueCountFrequency (%)
( 242
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 87067
53.5%
Common 75447
46.3%
Latin 318
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7033
 
8.1%
6833
 
7.8%
6731
 
7.7%
6174
 
7.1%
5093
 
5.8%
3664
 
4.2%
2719
 
3.1%
2266
 
2.6%
1864
 
2.1%
1768
 
2.0%
Other values (444) 42922
49.3%
Latin
ValueCountFrequency (%)
B 42
 
13.2%
A 36
 
11.3%
I 25
 
7.9%
C 22
 
6.9%
T 20
 
6.3%
E 16
 
5.0%
S 13
 
4.1%
e 13
 
4.1%
R 11
 
3.5%
K 10
 
3.1%
Other values (30) 110
34.6%
Common
ValueCountFrequency (%)
35881
47.6%
* 20718
27.5%
- 5376
 
7.1%
1 2353
 
3.1%
2 1500
 
2.0%
, 1440
 
1.9%
3 1254
 
1.7%
0 1099
 
1.5%
4 1046
 
1.4%
5 1038
 
1.4%
Other values (12) 3742
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 87067
53.5%
ASCII 75759
46.5%
Number Forms 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35881
47.4%
* 20718
27.3%
- 5376
 
7.1%
1 2353
 
3.1%
2 1500
 
2.0%
, 1440
 
1.9%
3 1254
 
1.7%
0 1099
 
1.5%
4 1046
 
1.4%
5 1038
 
1.4%
Other values (49) 4054
 
5.4%
Hangul
ValueCountFrequency (%)
7033
 
8.1%
6833
 
7.8%
6731
 
7.7%
6174
 
7.1%
5093
 
5.8%
3664
 
4.2%
2719
 
3.1%
2266
 
2.6%
1864
 
2.1%
1768
 
2.0%
Other values (444) 42922
49.3%
Number Forms
ValueCountFrequency (%)
4
66.7%
1
 
16.7%
1
 
16.7%

소재지우편번호 (postal code)
Real number (ℝ)

HIGH CORRELATION, MISSING

Distinct: 1436
Distinct (%): 33.9%
Missing: 2477
Missing (%): 36.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 14066.153
Minimum: 10004
Maximum: 18635
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 59.2 KiB

Quantile statistics

Minimum: 10004
5-th percentile: 10416
Q1: 11134
Median: 12798
Q3: 17414
95-th percentile: 18521
Maximum: 18635
Range: 8631
Interquartile range (IQR): 6280

Descriptive statistics

Standard deviation: 2931.226
Coefficient of variation (CV): 0.2083886
Kurtosis: -1.5783366
Mean: 14066.153
Median Absolute Deviation (MAD): 1992
Skewness: 0.25452138
Sum: 59654554
Variance: 8592085.6
Monotonicity: Not monotonic
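
Several of the statistics above are simple functions of the others. The sketch below recomputes range, IQR, coefficient of variation and variance from the reported minimum, maximum, quartiles, mean and standard deviation. The inputs are the rounded figures printed in this report, so the derived values agree only up to rounding.

```python
# Values copied from the 소재지우편번호 statistics in this report.
minimum, maximum = 10004, 18635
q1, q3 = 11134, 17414
mean = 14066.153
std = 2931.226

value_range = maximum - minimum   # reported as Range
iqr = q3 - q1                     # reported as Interquartile range (IQR)
cv = std / mean                   # reported as Coefficient of variation (CV)
variance = std ** 2               # reported as Variance

print(value_range, iqr)
print(f"CV: {cv:.7f}")
print(f"Variance: {variance:.1f}")
```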
[Figure: histogram with fixed size bins (bins=50)]
ValueCountFrequency (%)
10804 75
 
1.1%
10801 58
 
0.9%
11101 35
 
0.5%
17508 34
 
0.5%
11426 33
 
0.5%
11134 32
 
0.5%
18586 31
 
0.5%
17502 28
 
0.4%
11049 27
 
0.4%
17504 27
 
0.4%
Other values (1426) 3861
57.5%
(Missing) 2477
36.9%
ValueCountFrequency (%)
10004 3
< 0.1%
10005 7
0.1%
10009 3
< 0.1%
10011 1
 
< 0.1%
10012 4
0.1%
10013 2
 
< 0.1%
10014 1
 
< 0.1%
10015 2
 
< 0.1%
10016 1
 
< 0.1%
10019 1
 
< 0.1%
ValueCountFrequency (%)
18635 1
 
< 0.1%
18633 1
 
< 0.1%
18632 1
 
< 0.1%
18631 1
 
< 0.1%
18630 4
0.1%
18629 4
0.1%
18628 5
0.1%
18626 5
0.1%
18623 1
 
< 0.1%
18622 2
 
< 0.1%

WGS84위도 (WGS84 latitude)
Real number (ℝ)

HIGH CORRELATION, MISSING

Distinct: 1994
Distinct (%): 86.1%
Missing: 4402
Missing (%): 65.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.456177
Minimum: 36.923654
Maximum: 38.229436
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 59.2 KiB

Quantile statistics

Minimum: 36.923654
5-th percentile: 37.038112
Q1: 37.263589
Median: 37.41074
Q3: 37.67196
95-th percentile: 37.912843
Maximum: 38.229436
Range: 1.3057828
Interquartile range (IQR): 0.40837151

Descriptive statistics

Standard deviation: 0.26964819
Coefficient of variation (CV): 0.0071990313
Kurtosis: -0.69952341
Mean: 37.456177
Median Absolute Deviation (MAD): 0.20962529
Skewness: 0.26396243
Sum: 86748.506
Variance: 0.072710147
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
ValueCountFrequency (%)
37.2227729912 10
 
0.1%
37.5503668716 7
 
0.1%
37.4458457243 6
 
0.1%
37.4135068616 6
 
0.1%
37.0371270867 5
 
0.1%
37.3304076955 5
 
0.1%
37.2110217235 4
 
0.1%
37.746162424 4
 
0.1%
37.2632899256 4
 
0.1%
37.6346983641 4
 
0.1%
Other values (1984) 2261
33.7%
(Missing) 4402
65.5%
ValueCountFrequency (%)
36.9236536026 1
 
< 0.1%
36.9360150583 1
 
< 0.1%
36.9400315719 1
 
< 0.1%
36.9417359688 1
 
< 0.1%
36.9445487055 3
< 0.1%
36.947534208 1
 
< 0.1%
36.9503547712 1
 
< 0.1%
36.9507991841 1
 
< 0.1%
36.9523591039 1
 
< 0.1%
36.9532978239 1
 
< 0.1%
ValueCountFrequency (%)
38.2294364114 1
< 0.1%
38.2015048012 1
< 0.1%
38.1979997301 1
< 0.1%
38.1973911369 1
< 0.1%
38.1782390901 1
< 0.1%
38.1612174486 1
< 0.1%
38.1578623349 1
< 0.1%
38.1554543396 1
< 0.1%
38.1532983273 1
< 0.1%
38.1428771298 1
< 0.1%

WGS84경도 (WGS84 longitude)
Real number (ℝ)

HIGH CORRELATION, MISSING

Distinct: 1994
Distinct (%): 86.1%
Missing: 4402
Missing (%): 65.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 127.05517
Minimum: 126.54205
Maximum: 127.77371
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 59.2 KiB

Quantile statistics

Minimum: 126.54205
5-th percentile: 126.7325
Q1: 126.84912
Median: 127.06435
Q3: 127.19432
95-th percentile: 127.50614
Maximum: 127.77371
Range: 1.2316585
Interquartile range (IQR): 0.3452013

Descriptive statistics

Standard deviation: 0.24041466
Coefficient of variation (CV): 0.0018922069
Kurtosis: -0.1670316
Mean: 127.05517
Median Absolute Deviation (MAD): 0.17294166
Skewness: 0.44857609
Sum: 294259.77
Variance: 0.057799209
Monotonicity: Not monotonic

[Figure: histogram with fixed size bins (bins=50)]
ValueCountFrequency (%)
127.0912463499 10
 
0.1%
127.191138275 7
 
0.1%
126.8020023203 6
 
0.1%
127.1295714415 6
 
0.1%
127.4198383756 5
 
0.1%
127.1469907639 5
 
0.1%
127.0895177988 4
 
0.1%
126.6921097139 4
 
0.1%
127.1380225448 4
 
0.1%
127.1233192885 4
 
0.1%
Other values (1984) 2261
33.7%
(Missing) 4402
65.5%
ValueCountFrequency (%)
126.5420491273 1
< 0.1%
126.5485079113 1
< 0.1%
126.5514966635 1
< 0.1%
126.5567307169 1
< 0.1%
126.5580331619 1
< 0.1%
126.5605914897 1
< 0.1%
126.5613688834 1
< 0.1%
126.5655445852 1
< 0.1%
126.5746662632 2
< 0.1%
126.5746807386 1
< 0.1%
ValueCountFrequency (%)
127.7737076569 1
< 0.1%
127.7732974033 1
< 0.1%
127.7726398548 1
< 0.1%
127.7708902295 1
< 0.1%
127.7705634841 1
< 0.1%
127.7661458186 1
< 0.1%
127.7594016083 1
< 0.1%
127.7584501188 1
< 0.1%
127.7557862962 1
< 0.1%
127.7511876496 1
< 0.1%

Interactions

[Figure: nine pairwise interaction plots]

Correlations

[Figure: correlation heatmap]

                  시군명   영업상태명   축산업무구분명   소재지우편번호   WGS84위도   WGS84경도
시군명            1.000    0.521        0.562            0.993            0.945       0.921
영업상태명        0.521    1.000        0.814            0.339            0.339       0.210
축산업무구분명    0.562    0.814        1.000            0.384            0.345       0.295
소재지우편번호    0.993    0.339        0.384            1.000            0.904       0.825
WGS84위도         0.945    0.339        0.345            0.904            1.000       0.576
WGS84경도         0.921    0.210        0.295            0.825            0.576       1.000

[Figure: correlation heatmap]

                  영업상태명   축산업무구분명   시군명
영업상태명        1.000        0.412            0.246
축산업무구분명    0.412        1.000            0.272
시군명            0.246        0.272            1.000

[Figure: correlation heatmap]

                  소재지우편번호   WGS84위도   WGS84경도   시군명   영업상태명   축산업무구분명
소재지우편번호    1.000            -0.899      0.221       0.937    0.179        0.206
WGS84위도         -0.899           1.000       -0.235      0.714    0.179        0.182
WGS84경도         0.221            -0.235      1.000       0.645    0.107        0.154
시군명            0.937            0.714       0.645       1.000    0.246        0.272
영업상태명        0.179            0.179       0.107       0.246    1.000        0.412
축산업무구분명    0.206            0.182       0.154       0.272    0.412        1.000
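
The four "High correlation" alerts near the top of this report line up with the third matrix above if a pair is flagged whenever the absolute coefficient is at least 0.5. That cutoff is an assumption inferred from the numbers, not something the report states. The sketch below applies it; the function name is hypothetical.

```python
# Third correlation matrix from this report, row by row.
columns = ["소재지우편번호", "WGS84위도", "WGS84경도",
           "시군명", "영업상태명", "축산업무구분명"]
matrix = [
    [1.000, -0.899, 0.221, 0.937, 0.179, 0.206],
    [-0.899, 1.000, -0.235, 0.714, 0.179, 0.182],
    [0.221, -0.235, 1.000, 0.645, 0.107, 0.154],
    [0.937, 0.714, 0.645, 1.000, 0.246, 0.272],
    [0.179, 0.179, 0.107, 0.246, 1.000, 0.412],
    [0.206, 0.182, 0.154, 0.272, 0.412, 1.000],
]
THRESHOLD = 0.5  # assumed cutoff, chosen because it reproduces the alerts


def high_correlations(columns, matrix, threshold):
    """Map each column to the other columns with |coefficient| >= threshold."""
    flagged = {}
    for i, name in enumerate(columns):
        partners = [
            columns[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            flagged[name] = partners
    return flagged


alerts = high_correlations(columns, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(name, "->", ", ".join(partners))
```

With this cutoff, 영업상태명 and 축산업무구분명 (0.412) stay below the threshold, which matches the absence of an alert for that pair.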

Missing values

[Figure: count of values per column] A simple visualization of nullity by column.
[Figure: nullity matrix] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
[Figure: nullity correlation heatmap] The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
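
The per-column missing counts reported in the variable sections can be reconciled with the dataset-level total of 17940 missing cells. The sketch below does that and recomputes each percentage. Attributing the count of 9 to 소재지지번주소 is an inference from the column order, because that variable's heading did not survive in this copy of the report.

```python
n_observations = 6718

# Missing counts per column, copied from the variable sections of this report.
missing_by_column = {
    "폐업일자": 5270,
    "소재지도로명주소": 1380,
    "소재지지번주소": 9,  # inferred column name, see the note above
    "소재지우편번호": 2477,
    "WGS84위도": 4402,
    "WGS84경도": 4402,
}

total_missing = sum(missing_by_column.values())
missing_pct = {
    name: round(100 * count / n_observations, 1)
    for name, count in missing_by_column.items()
}

print("Total missing cells:", total_missing)
for name, pct in missing_pct.items():
    print(f"{name}: {pct}%")
```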

Sample

First rows

index | 시군명 | 사업장명 | 인허가일자 | 영업상태명 | 폐업일자 | 축산업무구분명 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도
0 | 가평군 | 위곡농장 | 20050503 | 말소 | <NA> | 가축사육업 | 경기도 가평군 설악면 한서로 ***-* | 경기도 가평군 설악면 위곡리 **-* | 12463 | <NA> | <NA>
1 | 가평군 | 세화수입정육점 | 19981021 | 운영중 | <NA> | 축산물판매업 | <NA> | 경기도 가평군 청평면 청평리 440-1번지 | 12452 | 37.736102 | 127.415913
2 | 가평군 | 금강닭수입소고기전문판매점 | 19981016 | 운영중 | <NA> | 축산물판매업 | 경기도 가평군 가평읍 가화로 88 | 경기도 가평군 가평읍 읍내리 475-19번지 | 12419 | 37.827598 | 127.514795
3 | 가평군 | 백두농장 | 20091010 | 정상 | <NA> | 가축사육업 | 경기도 가평군 가평읍 잎너비길 **-** | 경기도 가평군 가평읍 읍내리 ***-* | <NA> | <NA> | <NA>
4 | 가평군 | 오성농장 | 20041214 | 정상 | <NA> | 가축사육업 | 경기도 가평군 상면 아랫벌길 **-** | 경기도 가평군 상면 태봉리 **-* | <NA> | <NA> | <NA>
5 | 가평군 | 광신농장 | 20041213 | 정상 | <NA> | 가축사육업 | <NA> | 경기도 가평군 설악면 위곡리 ***-* | 12463 | <NA> | <NA>
6 | 가평군 | 창현농장 | 20041213 | 정상 | <NA> | 가축사육업 | 경기도 가평군 설악면 한서로***번길 *** | 경기도 가평군 설악면 위곡리 ***-* **통*반 | 12463 | <NA> | <NA>
7 | 가평군 | 칠악골목장 | 20210715 | 정상 | <NA> | 가축사육업 | <NA> | 경기도 가평군 가평읍 하색리 *** ***-*,***-*번지 | <NA> | <NA> | <NA>
8 | 가평군 | 일식목장 | 20131105 | 정상 | <NA> | 가축사육업 | 경기도 가평군 설악면 양방가루재길 **-** | 경기도 가평군 설악면 방일리 ** | 12472 | <NA> | <NA>
9 | 가평군 | 천유목장 | 20050718 | 정상 | <NA> | 가축사육업 | 경기도 가평군 청평면 수리재길 *** | 경기도 가평군 청평면 상천리 ***-* | 12449 | <NA> | <NA>

Last rows

index | 시군명 | 사업장명 | 인허가일자 | 영업상태명 | 폐업일자 | 축산업무구분명 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도
6708 | 화성시 | 알짜목장 | 20050401 | 휴업 | <NA> | 가축사육업 | 경기도 화성시 남양읍 남양로 ***-** | 경기도 화성시 남양읍 신남리 **** | <NA> | <NA> | <NA>
6709 | 화성시 | 민서네 꿩(약초)농장 | 2019-06-05 | 휴업 | <NA> | 가축사육업 | 경기도 화성시 마도면 마도서길 **-* | 경기도 화성시 마도면 고모리 ***-* ,***-* | 18544 | <NA> | <NA>
6710 | 화성시 | 예찬농장 | 20090811 | 휴업 | <NA> | 가축사육업 | 경기도 화성시 매송면 매송로***번길 ** | 경기도 화성시 매송면 야목리 *** | <NA> | <NA> | <NA>
6711 | 화성시 | 삼현농장 | 2005-03-03 | 휴업 | <NA> | 가축사육업 | <NA> | 경기도 화성시 우정읍 주곡리 ***-* | <NA> | <NA> | <NA>
6712 | 화성시 | 혜지농장 | 20090120 | 휴업 | <NA> | 가축사육업 | 경기도 화성시 매송면 매송로 *** | 경기도 화성시 매송면 야목리 *** | <NA> | <NA> | <NA>
6713 | 화성시 | 감우농장 | 20041212 | 휴업 | <NA> | 가축사육업 | 경기도 화성시 남양읍 현대연구소로 79-26 | 경기도 화성시 남양읍 장덕리 1077 | 18278 | 37.164738 | 126.819959
6714 | 화성시 | 백봉실크 오골계농장 | 2017-06-23 | 휴업 | <NA> | 가축사육업 | <NA> | 경기도 화성시 비봉면 남전리 ***-* | 18282 | <NA> | <NA>
6715 | 화성시 | 해창농장 | 2017-08-30 | 휴업 | <NA> | 가축사육업 | 경기도 화성시 팔탄면 *.*만세로 ***-** | 경기도 화성시 팔탄면 해창리 산 **-* | <NA> | <NA> | <NA>
6716 | 화성시 | 재건목장 | 2005-11-25 | 휴업 | <NA> | 가축사육업 | 경기도 화성시 매송면 송숙로**번길 ** | 경기도 화성시 매송면 송라리 *** | <NA> | <NA> | <NA>
6717 | 화성시 | 소망농원 | 2009-06-16 | 휴업 | <NA> | 가축사육업 | <NA> | 경기도 화성시 마도면 백곡리 ** | 18544 | <NA> | <NA>
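
The sample rows show 인허가일자 in two layouts, for example 20050503 and 2019-06-05. If the raw column is processed outside the profiler, the two layouts may need to be normalised first. The sketch below is one way to do that with the standard library. The function name is hypothetical, and treating `<NA>` as the missing-value marker is an assumption based on how missing cells are printed in this report.

```python
from datetime import datetime


def parse_license_date(raw):
    """Parse 'YYYYMMDD' or 'YYYY-MM-DD'; return None if missing or unparseable."""
    if raw is None or raw == "<NA>":
        return None
    for fmt in ("%Y%m%d", "%Y-%m-%d"):
        try:
            return datetime.strptime(raw, fmt).date()
        except ValueError:
            # Try the next known layout.
            continue
    return None


# Two date strings from the sample rows plus the missing-value marker.
samples = ["20050503", "2019-06-05", "<NA>"]
parsed = [parse_license_date(s) for s in samples]
print(parsed)
```

Returning `None` instead of raising keeps unparseable rows visible as missing values rather than stopping the whole pass.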

Duplicate rows

Most frequently occurring

index | 시군명 | 사업장명 | 인허가일자 | 영업상태명 | 폐업일자 | 축산업무구분명 | 소재지도로명주소 | 소재지지번주소 | 소재지우편번호 | WGS84위도 | WGS84경도 | # duplicates
3 | 고양시 | 더 DOG립만세 | 20211001 | 폐업 | 20221229 | 사료제조업 | 경기도 고양시 일산동구 백마로 195, 엠시티타워&엠시티오피스텔 2층 2192호 (장항동) | 경기도 고양시 일산동구 장항동 869 엠시티타워&엠시티오피스텔 | 10403 | 37.654908 | 126.771538 | 3
11 | 동두천시 | 소요산생물보호센터 | 20130607 | 폐업 | 20221223 | 가축사육업 | 경기도 동두천시 평화로 2896-15 (상봉암동) | 경기도 동두천시 상봉암동 142-2 | 11307 | 37.945493 | 127.061927 | 3
20 | 파주시 | 깅스키친 | 20210430 | 정상 | <NA> | 사료제조업 | 경기도 파주시 조리읍 봉천로 37-23 | 경기도 파주시 조리읍 봉일천리 155-6 | 10937 | 37.74334 | 126.806516 | 3
21 | 파주시 | 더DOG립만세 | 20210416 | 폐업 | 20221117 | 사료제조업 | 경기도 파주시 하우4길 26-22 (상지석동) | 경기도 파주시 상지석동 554-123 | 10910 | 37.717598 | 126.774998 | 3
24 | 파주시 | 에스와이앤썬즈(주)-도그펄슨 | 20210610 | 정상 | <NA> | 사료제조업 | 경기도 파주시 운정로 149 (상지석동) | 경기도 파주시 상지석동 531-13 | 10910 | 37.721113 | 126.782396 | 3
25 | 파주시 | 주식회사 봄봄 | 2023-08-17 | 정상 | <NA> | 사료제조업 | 경기도 파주시 소라지로 264 (송촌동) | 경기도 파주시 송촌동 556-36 | 10863 | 37.746162 | 126.69211 | 3
0 | 고양시 | (주)베스트칩 | 20211029 | 정상 | <NA> | 사료제조업 | 경기도 고양시 일산동구 견달산로194번길 42(식사동) | 경기도 고양시 일산동구 식사동 187-1 | 10316 | 37.684481 | 126.823022 | 2
1 | 고양시 | (주)오른푸드시스템 | 20220330 | 정상 | <NA> | 사료제조업 | 경기도 고양시 일산동구 성현로268번길 33-1, 나동 (성석동) | 경기도 고양시 일산동구 성석동 13-2 나동 | 10313 | 37.703483 | 126.814091 | 2
2 | 고양시 | 6DECO | 20210802 | 정상 | <NA> | 사료제조업 | 경기도 고양시 덕양구 고골길 54, A동 (관산동) | 경기도 고양시 덕양구 관산동 574-3 | 10265 | 37.711977 | 126.859588 | 2
4 | 고양시 | 멍뭉식탁 | 20211203 | 정상 | <NA> | 사료제조업 | 경기도 고양시 일산동구 호수로 340-28, 비잔티움2단지 1층 108-2호 (백석동) | 경기도 고양시 일산동구 백석동 1318-4 비잔티움2단지 108-2호 | 10449 | 37.638327 | 126.788373 | 2
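
A "# duplicates" column like the one above can be produced by counting how often each complete row occurs and keeping the rows seen more than once. The sketch below shows the idea on a few toy rows abbreviated to three fields. It illustrates the counting step only and does not claim to reproduce how the profiling tool defines its headline "Duplicate rows" figure; the function name is hypothetical.

```python
from collections import Counter


def count_duplicates(rows):
    """Return {row: occurrences} for every row that appears more than once."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}


# Toy rows (시군명, 사업장명, 인허가일자), abbreviated for illustration.
rows = [
    ("파주시", "깅스키친", "20210430"),
    ("파주시", "깅스키친", "20210430"),
    ("파주시", "깅스키친", "20210430"),
    ("가평군", "위곡농장", "20050503"),
]

duplicates = count_duplicates(rows)
for row, n in duplicates.items():
    print(row, n)
```

Rows are compared on all of their fields, so two businesses with the same name but different addresses would not count as duplicates.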