Overview

Dataset statistics

Number of variables12
Number of observations7766
Missing cells20381
Missing cells (%)21.9%
Duplicate rows27
Duplicate rows (%)0.3%
Total size in memory758.5 KiB
Average record size in memory100.0 B

Variable types

Categorical3
Text3
DateTime2
Numeric4

Dataset

Description축산물 유통전문 판매업체 현황
Author행정안전부
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=33776A7B31D37SKFP885296687&infSeq=1

Alerts

Dataset has 27 (0.3%) duplicate rowsDuplicates
소재지우편번호 is highly overall correlated with WGS84위도 and 1 other fieldsHigh correlation
WGS84위도 is highly overall correlated with 소재지우편번호 and 1 other fieldsHigh correlation
WGS84경도 is highly overall correlated with 시군명High correlation
시군명 is highly overall correlated with 소재지우편번호 and 2 other fieldsHigh correlation
축산업무구분명 is highly imbalanced (51.1%)Imbalance
폐업일자 has 6143 (79.1%) missing valuesMissing
소재지면적(㎡) has 1652 (21.3%) missing valuesMissing
소재지도로명주소 has 1335 (17.2%) missing valuesMissing
소재지우편번호 has 2464 (31.7%) missing valuesMissing
WGS84위도 has 4387 (56.5%) missing valuesMissing
WGS84경도 has 4387 (56.5%) missing valuesMissing
소재지면적(㎡) is highly skewed (γ1 = 71.85358876)Skewed
소재지면적(㎡) has 4785 (61.6%) zerosZeros

Reproduction

Analysis started2023-12-10 22:05:12.908384
Analysis finished2023-12-10 22:05:16.490817
Duration3.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size60.8 KiB
안성시
862 
화성시
820 
포천시
656 
파주시
586 
고양시
 
390
Other values (26)
4452 

Length

Max length4
Median length3
Mean length3.0540819
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row가평군
2nd row가평군
3rd row가평군
4th row가평군
5th row가평군

Common Values

ValueCountFrequency (%)
안성시 862
 
11.1%
화성시 820
 
10.6%
포천시 656
 
8.4%
파주시 586
 
7.5%
고양시 390
 
5.0%
용인시 388
 
5.0%
이천시 360
 
4.6%
여주시 341
 
4.4%
남양주시 317
 
4.1%
양주시 316
 
4.1%
Other values (21) 2730
35.2%

Length

2023-12-11T07:05:16.553249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
안성시 862
 
11.1%
화성시 820
 
10.6%
포천시 656
 
8.4%
파주시 586
 
7.5%
고양시 390
 
5.0%
용인시 388
 
5.0%
이천시 360
 
4.6%
여주시 341
 
4.4%
남양주시 317
 
4.1%
양주시 316
 
4.1%
Other values (21) 2730
35.2%
Distinct6655
Distinct (%)85.7%
Missing0
Missing (%)0.0%
Memory size60.8 KiB
2023-12-11T07:05:16.851605image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length29
Mean length5.7154262
Min length1

Characters and Unicode

Total characters44386
Distinct characters845
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5973 ?
Unique (%)76.9%

Sample

1st row재종농장
2nd row위곡농장
3rd row영기농장
4th row상색목장
5th row선화목장
ValueCountFrequency (%)
주식회사 438
 
4.9%
농업회사법인 140
 
1.6%
103
 
1.1%
농장 53
 
0.6%
목장 33
 
0.4%
24
 
0.3%
유한회사 14
 
0.2%
우리농장 14
 
0.2%
대성농장 13
 
0.1%
영농조합법인 12
 
0.1%
Other values (6916) 8140
90.6%
2023-12-11T07:05:17.302511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4328
 
9.8%
2838
 
6.4%
1795
 
4.0%
1733
 
3.9%
) 1229
 
2.8%
( 1222
 
2.8%
1219
 
2.7%
870
 
2.0%
791
 
1.8%
733
 
1.7%
Other values (835) 27628
62.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39323
88.6%
Close Punctuation 1229
 
2.8%
Open Punctuation 1222
 
2.8%
Space Separator 1220
 
2.7%
Uppercase Letter 552
 
1.2%
Lowercase Letter 406
 
0.9%
Decimal Number 244
 
0.5%
Dash Punctuation 112
 
0.3%
Other Punctuation 63
 
0.1%
Other Symbol 12
 
< 0.1%
Other values (2) 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4328
 
11.0%
2838
 
7.2%
1795
 
4.6%
1733
 
4.4%
870
 
2.2%
791
 
2.0%
733
 
1.9%
598
 
1.5%
527
 
1.3%
482
 
1.2%
Other values (757) 24628
62.6%
Lowercase Letter
ValueCountFrequency (%)
o 56
13.8%
e 49
12.1%
a 34
 
8.4%
n 29
 
7.1%
t 24
 
5.9%
d 22
 
5.4%
m 21
 
5.2%
i 19
 
4.7%
p 18
 
4.4%
c 17
 
4.2%
Other values (15) 117
28.8%
Uppercase Letter
ValueCountFrequency (%)
O 48
 
8.7%
F 47
 
8.5%
D 46
 
8.3%
E 42
 
7.6%
S 34
 
6.2%
T 33
 
6.0%
C 31
 
5.6%
P 31
 
5.6%
B 25
 
4.5%
A 24
 
4.3%
Other values (15) 191
34.6%
Decimal Number
ValueCountFrequency (%)
2 152
62.3%
1 43
 
17.6%
3 20
 
8.2%
4 10
 
4.1%
6 5
 
2.0%
0 4
 
1.6%
9 4
 
1.6%
5 4
 
1.6%
8 1
 
0.4%
7 1
 
0.4%
Other Punctuation
ValueCountFrequency (%)
& 26
41.3%
. 16
25.4%
, 4
 
6.3%
· 4
 
6.3%
; 4
 
6.3%
/ 3
 
4.8%
! 3
 
4.8%
* 2
 
3.2%
? 1
 
1.6%
Space Separator
ValueCountFrequency (%)
1219
99.9%
  1
 
0.1%
Math Symbol
ValueCountFrequency (%)
> 1
50.0%
< 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 1229
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1222
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 112
100.0%
Other Symbol
ValueCountFrequency (%)
12
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39332
88.6%
Common 4092
 
9.2%
Latin 959
 
2.2%
Han 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4328
 
11.0%
2838
 
7.2%
1795
 
4.6%
1733
 
4.4%
870
 
2.2%
791
 
2.0%
733
 
1.9%
598
 
1.5%
527
 
1.3%
482
 
1.2%
Other values (755) 24637
62.6%
Latin
ValueCountFrequency (%)
o 56
 
5.8%
e 49
 
5.1%
O 48
 
5.0%
F 47
 
4.9%
D 46
 
4.8%
E 42
 
4.4%
a 34
 
3.5%
S 34
 
3.5%
T 33
 
3.4%
C 31
 
3.2%
Other values (41) 539
56.2%
Common
ValueCountFrequency (%)
) 1229
30.0%
( 1222
29.9%
1219
29.8%
2 152
 
3.7%
- 112
 
2.7%
1 43
 
1.1%
& 26
 
0.6%
3 20
 
0.5%
. 16
 
0.4%
4 10
 
0.2%
Other values (16) 43
 
1.1%
Han
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39320
88.6%
ASCII 5045
 
11.4%
None 17
 
< 0.1%
CJK 3
 
< 0.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4328
 
11.0%
2838
 
7.2%
1795
 
4.6%
1733
 
4.4%
870
 
2.2%
791
 
2.0%
733
 
1.9%
598
 
1.5%
527
 
1.3%
482
 
1.2%
Other values (754) 24625
62.6%
ASCII
ValueCountFrequency (%)
) 1229
24.4%
( 1222
24.2%
1219
24.2%
2 152
 
3.0%
- 112
 
2.2%
o 56
 
1.1%
e 49
 
1.0%
O 48
 
1.0%
F 47
 
0.9%
D 46
 
0.9%
Other values (64) 865
17.1%
None
ValueCountFrequency (%)
12
70.6%
· 4
 
23.5%
  1
 
5.9%
Number Forms
ValueCountFrequency (%)
1
100.0%
CJK
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Distinct3322
Distinct (%)42.8%
Missing0
Missing (%)0.0%
Memory size60.8 KiB
Minimum1978-08-22 00:00:00
Maximum2023-11-28 00:00:00
2023-12-11T07:05:17.430329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:17.548201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

영업상태명
Categorical

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size60.8 KiB
정상
4368 
폐업
1297 
운영중
1201 
폐업 등
 
333
말소
 
311
Other values (2)
 
256

Length

Max length4
Median length2
Mean length2.2406644
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row말소
2nd row말소
3rd row정상
4th row정상
5th row정상

Common Values

ValueCountFrequency (%)
정상 4368
56.2%
폐업 1297
 
16.7%
운영중 1201
 
15.5%
폐업 등 333
 
4.3%
말소 311
 
4.0%
휴업 255
 
3.3%
휴업 등 1
 
< 0.1%

Length

2023-12-11T07:05:17.678546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:05:17.821193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정상 4368
53.9%
폐업 1630
 
20.1%
운영중 1201
 
14.8%
334
 
4.1%
말소 311
 
3.8%
휴업 256
 
3.2%

폐업일자
Date

MISSING 

Distinct789
Distinct (%)48.6%
Missing6143
Missing (%)79.1%
Memory size60.8 KiB
Minimum2005-06-23 00:00:00
Maximum2023-11-28 00:00:00
2023-12-11T07:05:17.941971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:18.096735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

소재지면적(㎡)
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct992
Distinct (%)16.2%
Missing1652
Missing (%)21.3%
Infinite0
Infinite (%)0.0%
Mean86.885615
Minimum0
Maximum173329
Zeros4785
Zeros (%)61.6%
Negative0
Negative (%)0.0%
Memory size68.4 KiB
2023-12-11T07:05:18.223614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile186.6165
Maximum173329
Range173329
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2282.6946
Coefficient of variation (CV)26.272411
Kurtosis5431.742
Mean86.885615
Median Absolute Deviation (MAD)0
Skewness71.853589
Sum531218.65
Variance5210694.5
MonotonicityNot monotonic
2023-12-11T07:05:18.342140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.0 4785
61.6%
33.0 21
 
0.3%
198.0 18
 
0.2%
20.0 15
 
0.2%
10.0 14
 
0.2%
30.0 13
 
0.2%
60.0 11
 
0.1%
66.0 10
 
0.1%
50.0 8
 
0.1%
15.0 8
 
0.1%
Other values (982) 1211
 
15.6%
(Missing) 1652
 
21.3%
ValueCountFrequency (%)
0.0 4785
61.6%
0.35 1
 
< 0.1%
1.11 1
 
< 0.1%
1.21 1
 
< 0.1%
1.56 1
 
< 0.1%
1.8 1
 
< 0.1%
1.98 1
 
< 0.1%
2.93 1
 
< 0.1%
3.0 3
 
< 0.1%
3.09 1
 
< 0.1%
ValueCountFrequency (%)
173329.0 1
< 0.1%
22298.84 1
< 0.1%
14544.3 1
< 0.1%
13400.0 1
< 0.1%
13018.9 1
< 0.1%
9656.0 1
< 0.1%
9000.0 1
< 0.1%
8957.72 1
< 0.1%
8459.0 1
< 0.1%
7123.54 1
< 0.1%

축산업무구분명
Categorical

IMBALANCE 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size60.8 KiB
가축사육업
4736 
축산물판매업
1534 
사료제조업
1366 
종축업
 
54
가축인공수정소
 
45
Other values (3)
 
31

Length

Max length7
Median length5
Mean length5.1873551
Min length3

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row가축사육업
2nd row가축사육업
3rd row가축사육업
4th row가축사육업
5th row가축사육업

Common Values

ValueCountFrequency (%)
가축사육업 4736
61.0%
축산물판매업 1534
 
19.8%
사료제조업 1366
 
17.6%
종축업 54
 
0.7%
가축인공수정소 45
 
0.6%
부화업 17
 
0.2%
도축업 13
 
0.2%
<NA> 1
 
< 0.1%

Length

2023-12-11T07:05:18.465099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T07:05:18.574448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
가축사육업 4736
61.0%
축산물판매업 1534
 
19.8%
사료제조업 1366
 
17.6%
종축업 54
 
0.7%
가축인공수정소 45
 
0.6%
부화업 17
 
0.2%
도축업 13
 
0.2%
na 1
 
< 0.1%
Distinct5460
Distinct (%)84.9%
Missing1335
Missing (%)17.2%
Memory size60.8 KiB
2023-12-11T07:05:18.813205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length58
Mean length26.315037
Min length14

Characters and Unicode

Total characters169232
Distinct characters622
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4771 ?
Unique (%)74.2%

Sample

1st row경기도 가평군 가평읍 아랫마장길 ***-***
2nd row경기도 가평군 설악면 한서로 ***-*
3rd row경기도 가평군 가평읍 각담말길 ***-**
4th row경기도 가평군 가평읍 경춘로 ****-**
5th row경기도 가평군 설악면 묵안로 ***-***
ValueCountFrequency (%)
경기도 6431
 
17.7%
3200
 
8.8%
안성시 743
 
2.0%
화성시 645
 
1.8%
포천시 583
 
1.6%
용인시 357
 
1.0%
고양시 347
 
1.0%
이천시 329
 
0.9%
파주시 321
 
0.9%
여주시 293
 
0.8%
Other values (6688) 23159
63.6%
2023-12-11T07:05:19.181750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29987
 
17.7%
* 15057
 
8.9%
6739
 
4.0%
6658
 
3.9%
6564
 
3.9%
6241
 
3.7%
4616
 
2.7%
4014
 
2.4%
1 3882
 
2.3%
3523
 
2.1%
Other values (612) 81951
48.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 96981
57.3%
Space Separator 29987
 
17.7%
Other Punctuation 16977
 
10.0%
Decimal Number 16808
 
9.9%
Dash Punctuation 3065
 
1.8%
Open Punctuation 2476
 
1.5%
Close Punctuation 2476
 
1.5%
Uppercase Letter 393
 
0.2%
Lowercase Letter 56
 
< 0.1%
Letter Number 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6739
 
6.9%
6658
 
6.9%
6564
 
6.8%
6241
 
6.4%
4616
 
4.8%
4014
 
4.1%
3523
 
3.6%
3030
 
3.1%
2168
 
2.2%
2106
 
2.2%
Other values (546) 51322
52.9%
Uppercase Letter
ValueCountFrequency (%)
B 80
20.4%
A 68
17.3%
I 32
 
8.1%
C 27
 
6.9%
T 25
 
6.4%
E 21
 
5.3%
S 16
 
4.1%
K 15
 
3.8%
R 15
 
3.8%
D 10
 
2.5%
Other values (16) 84
21.4%
Lowercase Letter
ValueCountFrequency (%)
e 15
26.8%
a 6
 
10.7%
m 6
 
10.7%
n 5
 
8.9%
p 4
 
7.1%
c 4
 
7.1%
r 4
 
7.1%
t 4
 
7.1%
b 3
 
5.4%
o 2
 
3.6%
Other values (3) 3
 
5.4%
Decimal Number
ValueCountFrequency (%)
1 3882
23.1%
2 2373
14.1%
3 1761
10.5%
0 1589
9.5%
4 1486
 
8.8%
5 1414
 
8.4%
6 1202
 
7.2%
7 1105
 
6.6%
8 1040
 
6.2%
9 956
 
5.7%
Other Punctuation
ValueCountFrequency (%)
* 15057
88.7%
, 1892
 
11.1%
. 10
 
0.1%
& 9
 
0.1%
; 4
 
< 0.1%
: 3
 
< 0.1%
/ 2
 
< 0.1%
Letter Number
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 2472
99.8%
[ 4
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 2472
99.8%
] 4
 
0.2%
Space Separator
ValueCountFrequency (%)
29987
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3065
100.0%
Math Symbol
ValueCountFrequency (%)
~ 6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 96980
57.3%
Common 71795
42.4%
Latin 456
 
0.3%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6739
 
6.9%
6658
 
6.9%
6564
 
6.8%
6241
 
6.4%
4616
 
4.8%
4014
 
4.1%
3523
 
3.6%
3030
 
3.1%
2168
 
2.2%
2106
 
2.2%
Other values (545) 51321
52.9%
Latin
ValueCountFrequency (%)
B 80
17.5%
A 68
14.9%
I 32
 
7.0%
C 27
 
5.9%
T 25
 
5.5%
E 21
 
4.6%
S 16
 
3.5%
K 15
 
3.3%
e 15
 
3.3%
R 15
 
3.3%
Other values (32) 142
31.1%
Common
ValueCountFrequency (%)
29987
41.8%
* 15057
21.0%
1 3882
 
5.4%
- 3065
 
4.3%
( 2472
 
3.4%
) 2472
 
3.4%
2 2373
 
3.3%
, 1892
 
2.6%
3 1761
 
2.5%
0 1589
 
2.2%
Other values (14) 7245
 
10.1%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 96980
57.3%
ASCII 72244
42.7%
Number Forms 7
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29987
41.5%
* 15057
20.8%
1 3882
 
5.4%
- 3065
 
4.2%
( 2472
 
3.4%
) 2472
 
3.4%
2 2373
 
3.3%
, 1892
 
2.6%
3 1761
 
2.4%
0 1589
 
2.2%
Other values (53) 7694
 
10.7%
Hangul
ValueCountFrequency (%)
6739
 
6.9%
6658
 
6.9%
6564
 
6.8%
6241
 
6.4%
4616
 
4.8%
4014
 
4.1%
3523
 
3.6%
3030
 
3.1%
2168
 
2.2%
2106
 
2.2%
Other values (545) 51321
52.9%
Number Forms
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct6164
Distinct (%)79.5%
Missing13
Missing (%)0.2%
Memory size60.8 KiB
2023-12-11T07:05:19.441737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length81
Median length75
Mean length24.101896
Min length1

Characters and Unicode

Total characters186862
Distinct characters528
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5250 ?
Unique (%)67.7%

Sample

1st row경기도 가평군 가평읍 마장리 ***
2nd row경기도 가평군 설악면 위곡리 **-*
3rd row경기도 가평군 가평읍 마장리 ***-*
4th row경기도 가평군 가평읍 상색리 ***-*
5th row경기도 가평군 설악면 엄소리 ***
ValueCountFrequency (%)
경기도 7745
 
18.5%
5404
 
12.9%
안성시 862
 
2.1%
화성시 820
 
2.0%
포천시 656
 
1.6%
파주시 586
 
1.4%
고양시 390
 
0.9%
용인시 384
 
0.9%
이천시 360
 
0.9%
여주시 341
 
0.8%
Other values (5619) 24257
58.0%
2023-12-11T07:05:19.843331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39978
21.4%
* 20842
 
11.2%
8122
 
4.3%
7911
 
4.2%
7781
 
4.2%
7248
 
3.9%
- 6229
 
3.3%
5573
 
3.0%
3872
 
2.1%
3450
 
1.8%
Other values (518) 75856
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 101380
54.3%
Space Separator 39978
 
21.4%
Other Punctuation 22363
 
12.0%
Decimal Number 16028
 
8.6%
Dash Punctuation 6229
 
3.3%
Uppercase Letter 300
 
0.2%
Close Punctuation 265
 
0.1%
Open Punctuation 264
 
0.1%
Lowercase Letter 46
 
< 0.1%
Letter Number 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8122
 
8.0%
7911
 
7.8%
7781
 
7.7%
7248
 
7.1%
5573
 
5.5%
3872
 
3.8%
3450
 
3.4%
2722
 
2.7%
2388
 
2.4%
2210
 
2.2%
Other values (457) 50103
49.4%
Uppercase Letter
ValueCountFrequency (%)
B 60
20.0%
A 39
13.0%
I 29
9.7%
T 21
 
7.0%
C 21
 
7.0%
E 17
 
5.7%
S 15
 
5.0%
R 12
 
4.0%
K 12
 
4.0%
U 8
 
2.7%
Other values (16) 66
22.0%
Decimal Number
ValueCountFrequency (%)
1 3287
20.5%
2 2172
13.6%
3 1703
10.6%
4 1484
9.3%
5 1430
8.9%
0 1414
8.8%
6 1340
8.4%
7 1105
 
6.9%
8 1097
 
6.8%
9 996
 
6.2%
Lowercase Letter
ValueCountFrequency (%)
e 13
28.3%
m 5
 
10.9%
n 5
 
10.9%
r 4
 
8.7%
t 4
 
8.7%
c 4
 
8.7%
a 4
 
8.7%
p 3
 
6.5%
o 2
 
4.3%
b 2
 
4.3%
Other Punctuation
ValueCountFrequency (%)
* 20842
93.2%
, 1492
 
6.7%
. 17
 
0.1%
& 6
 
< 0.1%
; 3
 
< 0.1%
/ 2
 
< 0.1%
@ 1
 
< 0.1%
Letter Number
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%
Space Separator
ValueCountFrequency (%)
39978
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6229
100.0%
Close Punctuation
ValueCountFrequency (%)
) 265
100.0%
Open Punctuation
ValueCountFrequency (%)
( 264
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 101379
54.3%
Common 85129
45.6%
Latin 353
 
0.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8122
 
8.0%
7911
 
7.8%
7781
 
7.7%
7248
 
7.1%
5573
 
5.5%
3872
 
3.8%
3450
 
3.4%
2722
 
2.7%
2388
 
2.4%
2210
 
2.2%
Other values (456) 50102
49.4%
Latin
ValueCountFrequency (%)
B 60
17.0%
A 39
 
11.0%
I 29
 
8.2%
T 21
 
5.9%
C 21
 
5.9%
E 17
 
4.8%
S 15
 
4.2%
e 13
 
3.7%
R 12
 
3.4%
K 12
 
3.4%
Other values (29) 114
32.3%
Common
ValueCountFrequency (%)
39978
47.0%
* 20842
24.5%
- 6229
 
7.3%
1 3287
 
3.9%
2 2172
 
2.6%
3 1703
 
2.0%
, 1492
 
1.8%
4 1484
 
1.7%
5 1430
 
1.7%
0 1414
 
1.7%
Other values (12) 5098
 
6.0%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 101379
54.3%
ASCII 85475
45.7%
Number Forms 7
 
< 0.1%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
39978
46.8%
* 20842
24.4%
- 6229
 
7.3%
1 3287
 
3.8%
2 2172
 
2.5%
3 1703
 
2.0%
, 1492
 
1.7%
4 1484
 
1.7%
5 1430
 
1.7%
0 1414
 
1.7%
Other values (48) 5444
 
6.4%
Hangul
ValueCountFrequency (%)
8122
 
8.0%
7911
 
7.8%
7781
 
7.7%
7248
 
7.1%
5573
 
5.5%
3872
 
3.8%
3450
 
3.4%
2722
 
2.7%
2388
 
2.4%
2210
 
2.2%
Other values (456) 50102
49.4%
Number Forms
ValueCountFrequency (%)
4
57.1%
2
28.6%
1
 
14.3%
CJK
ValueCountFrequency (%)
1
100.0%

소재지우편번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct1847
Distinct (%)34.8%
Missing2464
Missing (%)31.7%
Infinite0
Infinite (%)0.0%
Mean13997.707
Minimum10004
Maximum18635
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size68.4 KiB
2023-12-11T07:05:19.966600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum10004
5-th percentile10355.15
Q111149.25
median12805
Q317376
95-th percentile18472
Maximum18635
Range8631
Interquartile range (IQR)6226.75

Descriptive statistics

Standard deviation2878.362
Coefficient of variation (CV)0.20563097
Kurtosis-1.5126259
Mean13997.707
Median Absolute Deviation (MAD)1999
Skewness0.28202195
Sum74215841
Variance8284967.8
MonotonicityNot monotonic
2023-12-11T07:05:20.078296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10804 70
 
0.9%
10801 60
 
0.8%
11134 35
 
0.5%
17508 34
 
0.4%
11101 33
 
0.4%
18586 32
 
0.4%
11426 31
 
0.4%
17502 29
 
0.4%
11136 27
 
0.3%
11102 26
 
0.3%
Other values (1837) 4925
63.4%
(Missing) 2464
31.7%
ValueCountFrequency (%)
10004 3
< 0.1%
10005 7
0.1%
10008 1
 
< 0.1%
10009 2
 
< 0.1%
10010 2
 
< 0.1%
10011 2
 
< 0.1%
10012 4
0.1%
10013 4
0.1%
10014 1
 
< 0.1%
10015 1
 
< 0.1%
ValueCountFrequency (%)
18635 1
 
< 0.1%
18633 1
 
< 0.1%
18632 1
 
< 0.1%
18631 1
 
< 0.1%
18630 5
0.1%
18629 4
0.1%
18628 5
0.1%
18626 6
0.1%
18624 1
 
< 0.1%
18623 1
 
< 0.1%

WGS84위도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct2929
Distinct (%)86.7%
Missing4387
Missing (%)56.5%
Infinite0
Infinite (%)0.0%
Mean37.467384
Minimum36.923654
Maximum38.229436
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size68.4 KiB
2023-12-11T07:05:20.208688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.923654
5-th percentile37.056914
Q137.280158
median37.432112
Q337.669432
95-th percentile37.899139
Maximum38.229436
Range1.3057828
Interquartile range (IQR)0.38927431

Descriptive statistics

Standard deviation0.25863799
Coefficient of variation (CV)0.0069030169
Kurtosis-0.67876637
Mean37.467384
Median Absolute Deviation (MAD)0.20057399
Skewness0.18362504
Sum126602.29
Variance0.066893609
MonotonicityNot monotonic
2023-12-11T07:05:20.342165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.5503668716 7
 
0.1%
37.9279504525 7
 
0.1%
38.0240704012 7
 
0.1%
37.530228378 6
 
0.1%
37.4458457243 6
 
0.1%
37.3535503568 5
 
0.1%
37.2227729912 5
 
0.1%
37.0745455315 5
 
0.1%
37.0371270867 5
 
0.1%
37.5224210033 5
 
0.1%
Other values (2919) 3321
42.8%
(Missing) 4387
56.5%
ValueCountFrequency (%)
36.9236536026 1
 
< 0.1%
36.9360150583 1
 
< 0.1%
36.9400315719 1
 
< 0.1%
36.9417359688 1
 
< 0.1%
36.9443041006 1
 
< 0.1%
36.9445487055 3
< 0.1%
36.947534208 1
 
< 0.1%
36.949498937 1
 
< 0.1%
36.9503547712 1
 
< 0.1%
36.9507991841 1
 
< 0.1%
ValueCountFrequency (%)
38.2294364114 1
< 0.1%
38.2015048012 1
< 0.1%
38.1979997301 1
< 0.1%
38.1973911369 1
< 0.1%
38.1782390901 1
< 0.1%
38.1612174486 1
< 0.1%
38.1578623349 1
< 0.1%
38.1554543396 1
< 0.1%
38.1532983273 1
< 0.1%
38.1428771298 1
< 0.1%

WGS84경도
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct2929
Distinct (%)86.7%
Missing4387
Missing (%)56.5%
Infinite0
Infinite (%)0.0%
Mean127.05532
Minimum126.54205
Maximum127.77371
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size68.4 KiB
2023-12-11T07:05:20.465904image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.54205
5-th percentile126.73297
Q1126.84759
median127.07002
Q3127.20562
95-th percentile127.48276
Maximum127.77371
Range1.2316585
Interquartile range (IQR)0.35803647

Descriptive statistics

Standard deviation0.23426148
Coefficient of variation (CV)0.0018437755
Kurtosis-0.312231
Mean127.05532
Median Absolute Deviation (MAD)0.16901738
Skewness0.33810472
Sum429319.92
Variance0.05487844
MonotonicityNot monotonic
2023-12-11T07:05:20.642547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.191138275 7
 
0.1%
127.2198669931 7
 
0.1%
127.0505011108 7
 
0.1%
126.7616835706 6
 
0.1%
126.8020023203 6
 
0.1%
127.3273196181 5
 
0.1%
127.0912463499 5
 
0.1%
127.4943412952 5
 
0.1%
127.4198383756 5
 
0.1%
126.767192498 5
 
0.1%
Other values (2919) 3321
42.8%
(Missing) 4387
56.5%
ValueCountFrequency (%)
126.5420491273 1
< 0.1%
126.5485079113 1
< 0.1%
126.5514966635 1
< 0.1%
126.5521932886 1
< 0.1%
126.5528321631 1
< 0.1%
126.5567307169 1
< 0.1%
126.5580331619 1
< 0.1%
126.5598292251 1
< 0.1%
126.5605914897 1
< 0.1%
126.5613688834 1
< 0.1%
ValueCountFrequency (%)
127.7737076569 1
< 0.1%
127.7732974033 1
< 0.1%
127.7726398548 1
< 0.1%
127.7708902295 1
< 0.1%
127.7705634841 1
< 0.1%
127.7661458186 1
< 0.1%
127.7594016083 1
< 0.1%
127.7584501188 1
< 0.1%
127.7557862962 1
< 0.1%
127.7511876496 1
< 0.1%

Interactions

2023-12-11T07:05:15.482692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:14.411737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:14.783431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:15.107410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:15.584073image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:14.500861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:14.868701image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:15.203642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:15.664609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:14.585052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:14.941065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:15.283059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:15.748742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:14.687454image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:15.026394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T07:05:15.366323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T07:05:20.720520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군명영업상태명소재지면적(㎡)축산업무구분명소재지우편번호WGS84위도WGS84경도
시군명1.0000.5070.0000.5350.9930.9470.917
영업상태명0.5071.0000.0000.8140.2790.2880.177
소재지면적(㎡)0.0000.0001.0000.2780.0000.0000.000
축산업무구분명0.5350.8140.2781.0000.3210.2910.265
소재지우편번호0.9930.2790.0000.3211.0000.9050.831
WGS84위도0.9470.2880.0000.2910.9051.0000.561
WGS84경도0.9170.1770.0000.2650.8310.5611.000
2023-12-11T07:05:20.819477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
축산업무구분명영업상태명시군명
축산업무구분명1.0000.4120.255
영업상태명0.4121.0000.238
시군명0.2550.2381.000
2023-12-11T07:05:20.904697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소재지면적(㎡)소재지우편번호WGS84위도WGS84경도시군명영업상태명축산업무구분명
소재지면적(㎡)1.0000.030-0.0310.1080.0000.0000.194
소재지우편번호0.0301.000-0.9010.2310.9400.1450.168
WGS84위도-0.031-0.9011.000-0.2330.7230.1500.152
WGS84경도0.1080.231-0.2331.0000.6360.0900.137
시군명0.0000.9400.7230.6361.0000.2380.255
영업상태명0.0000.1450.1500.0900.2381.0000.412
축산업무구분명0.1940.1680.1520.1370.2550.4121.000

Missing values

2023-12-11T07:05:15.869638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T07:05:16.038517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T07:05:16.392524image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시군명사업장명인허가일자영업상태명폐업일자소재지면적(㎡)축산업무구분명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
0가평군재종농장20070510말소<NA>0.0가축사육업경기도 가평군 가평읍 아랫마장길 ***-***경기도 가평군 가평읍 마장리 ***<NA><NA><NA>
1가평군위곡농장20050503말소<NA>0.0가축사육업경기도 가평군 설악면 한서로 ***-*경기도 가평군 설악면 위곡리 **-*12463<NA><NA>
2가평군영기농장2006-10-02정상<NA>0.0가축사육업경기도 가평군 가평읍 각담말길 ***-**경기도 가평군 가평읍 마장리 ***-*<NA><NA><NA>
3가평군상색목장20050617정상<NA>0.0가축사육업경기도 가평군 가평읍 경춘로 ****-**경기도 가평군 가평읍 상색리 ***-*12426<NA><NA>
4가평군선화목장2013-11-21정상<NA><NA>가축사육업경기도 가평군 설악면 묵안로 ***-***경기도 가평군 설악면 엄소리 ***12470<NA><NA>
5가평군윤정목장20130823정상<NA><NA>가축사육업경기도 가평군 가평읍 달전천벚꽃길 ***-***경기도 가평군 가평읍 하색리 ***-*<NA><NA><NA>
6가평군용광덕목장20100128정상<NA>0.0가축사육업경기도 가평군 북면 오목골길 **-**경기도 가평군 북면 도대리 **-** 외*필지(**-**,**-**,**-**)<NA><NA><NA>
7가평군참터목장20080708정상<NA>0.0가축사육업경기도 가평군 상면 청군로 ***-**경기도 가평군 상면 항사리 ***-*12445<NA><NA>
8가평군정풍목장20150826정상<NA><NA>가축사육업경기도 가평군 가평읍 능모루길 **, 정풍목장경기도 가평군 가평읍 개곡리 ***-* 정풍목장12410<NA><NA>
9가평군수복목장20140317정상<NA><NA>가축사육업경기도 가평군 설악면 평촌길 **-*경기도 가평군 설악면 방일리 ***-*12472<NA><NA>
시군명사업장명인허가일자영업상태명폐업일자소재지면적(㎡)축산업무구분명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도
7756화성시행정목장20080201휴업<NA>0.0가축사육업<NA>경기도 화성시 향남읍 행정리 62-11859037.125643126.933654
7757화성시제이피에프20210421휴업<NA><NA>가축사육업경기도 화성시 향남읍 장안로 ***-***경기도 화성시 향남읍 구문천리 ***<NA><NA><NA>
7758화성시제기목장2007-03-23휴업<NA>0.0가축사육업경기도 화성시 정남면 제기길 ***-**경기도 화성시 정남면 제기리 ***-*18515<NA><NA>
7759화성시현진목장20051013휴업<NA>0.0가축사육업경기도 화성시 봉담읍 복만터길 **경기도 화성시 봉담읍 마하리 ***-*18335<NA><NA>
7760화성시우리목장2010-02-10휴업<NA>0.0가축사육업경기도 화성시 봉담읍 청궁안뜰*길 *경기도 화성시 봉담읍 내리 ***-*18294<NA><NA>
7761화성시상신목장2008-07-04휴업<NA>0.0가축사육업경기도 화성시 향남읍 마곡동길 ***-**경기도 화성시 향남읍 하길리 ***-*<NA><NA><NA>
7762화성시혜영농장2005-11-25휴업<NA>0.0가축사육업<NA>경기도 화성시 마도면 금당리 ***<NA><NA><NA>
7763화성시해창농장2017-08-30휴업<NA><NA>가축사육업경기도 화성시 팔탄면 *.*만세로 ***-**경기도 화성시 팔탄면 해창리 산 **-*<NA><NA><NA>
7764화성시더바람협동조합20200924휴업<NA>0.0사료제조업경기도 화성시 봉담읍 동화길 51, 프리미엄원희캐슬 6층 656호경기도 화성시 봉담읍 동화리 564 656호1830337.219036126.955459
7765화성시(주)명푸드20170207휴업 등<NA>203.0축산물판매업경기도 화성시 장안면 포승장안로 1140-12경기도 화성시 장안면 독정리 476-10번지1858337.068492126.853351

Duplicate rows

Most frequently occurring

시군명사업장명인허가일자영업상태명폐업일자소재지면적(㎡)축산업무구분명소재지도로명주소소재지지번주소소재지우편번호WGS84위도WGS84경도# duplicates
2고양시더 DOG립만세20211001폐업202212290.0사료제조업경기도 고양시 일산동구 백마로 195, 엠시티타워&amp;엠시티오피스텔 2층 2192호 (장항동)경기도 고양시 일산동구 장항동 869 엠시티타워&amp;엠시티오피스텔1040337.654908126.7715383
9동두천시소요산생물보호센터20130607폐업20221223<NA>가축사육업경기도 동두천시 평화로 2896-15 (상봉암동)경기도 동두천시 상봉암동 142-21130737.945493127.0619273
17파주시깅스키친20210430정상<NA>0.0사료제조업경기도 파주시 조리읍 봉천로 37-23경기도 파주시 조리읍 봉일천리 155-61093737.74334126.8065163
18파주시더DOG립만세20210416폐업202211170.0사료제조업경기도 파주시 하우4길 26-22 (상지석동)경기도 파주시 상지석동 554-1231091037.717598126.7749983
21파주시에스와이앤썬즈(주)-도그펄슨20210610정상<NA>0.0사료제조업경기도 파주시 운정로 149 (상지석동)경기도 파주시 상지석동 531-131091037.721113126.7823963
22파주시주식회사 봄봄2023-08-17정상<NA>0.0사료제조업경기도 파주시 소라지로 264 (송촌동)경기도 파주시 송촌동 556-361086337.746162126.692113
0고양시(주)베스트칩20211029정상<NA>0.0사료제조업경기도 고양시 일산동구 견달산로194번길 42(식사동)경기도 고양시 일산동구 식사동 187-11031637.684481126.8230222
1고양시6DECO20210802정상<NA>0.0사료제조업경기도 고양시 덕양구 고골길 54, A동 (관산동)경기도 고양시 덕양구 관산동 574-31026537.711977126.8595882
3고양시멍뭉식탁20211203정상<NA>0.0사료제조업경기도 고양시 일산동구 호수로 340-28, 비잔티움2단지 1층 108-2호 (백석동)경기도 고양시 일산동구 백석동 1318-4 비잔티움2단지 108-2호1044937.638327126.7883732
4고양시에이케이사이언스20210524정상<NA>0.0사료제조업경기도 고양시 일산동구 지영로 201 (지영동)경기도 고양시 일산동구 지영동 302-101025437.715264126.8283582