Overview

Dataset statistics

Number of variables8
Number of observations36
Missing cells12
Missing cells (%)4.2%
Duplicate rows1
Duplicate rows (%)2.8%
Total size in memory2.4 KiB
Average record size in memory67.7 B

Variable types

Categorical3
Text4
DateTime1

Dataset

Description경기도 고양시 농약판매업체현황 데이터로 상호명, 업체소재지, 대표자, 연락처,판매업 등록일에 대한 항목을 제공합니다.
Author경기도 고양시
URLhttps://www.data.go.kr/data/15055270/fileData.do

Alerts

Dataset has 1 (2.8%) duplicate rowsDuplicates
소속시도 is highly overall correlated with 농약업유형 and 1 other fieldsHigh correlation
농약업유형 is highly overall correlated with 소속시도 and 1 other fieldsHigh correlation
소속시군구 is highly overall correlated with 농약업유형 and 1 other fieldsHigh correlation
농약업유형 is highly imbalanced (69.0%)Imbalance
소속시도 is highly imbalanced (69.0%)Imbalance
소속시군구 is highly imbalanced (69.0%)Imbalance
업체명 has 2 (5.6%) missing valuesMissing
전화번호 has 4 (11.1%) missing valuesMissing
주소 has 2 (5.6%) missing valuesMissing
대표자 has 2 (5.6%) missing valuesMissing
등록일자 has 2 (5.6%) missing valuesMissing

Reproduction

Analysis started2023-12-12 05:56:41.148090
Analysis finished2023-12-12 05:56:41.859301
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

농약업유형
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size420.0 B
일반판매업
34 
<NA>
 
2

Length

Max length5
Median length5
Mean length4.9444444
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반판매업
2nd row일반판매업
3rd row일반판매업
4th row일반판매업
5th row일반판매업

Common Values

ValueCountFrequency (%)
일반판매업 34
94.4%
<NA> 2
 
5.6%

Length

2023-12-12T14:56:41.939564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:56:42.048620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반판매업 34
94.4%
na 2
 
5.6%

소속시도
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size420.0 B
경기도
34 
<NA>
 
2

Length

Max length4
Median length3
Mean length3.0555556
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도
2nd row경기도
3rd row경기도
4th row경기도
5th row경기도

Common Values

ValueCountFrequency (%)
경기도 34
94.4%
<NA> 2
 
5.6%

Length

2023-12-12T14:56:42.162212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:56:42.306862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 34
94.4%
na 2
 
5.6%

소속시군구
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)5.6%
Missing0
Missing (%)0.0%
Memory size420.0 B
고양시
34 
<NA>
 
2

Length

Max length4
Median length3
Mean length3.0555556
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고양시
2nd row고양시
3rd row고양시
4th row고양시
5th row고양시

Common Values

ValueCountFrequency (%)
고양시 34
94.4%
<NA> 2
 
5.6%

Length

2023-12-12T14:56:42.414081image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:56:42.535705image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고양시 34
94.4%
na 2
 
5.6%

업체명
Text

MISSING 

Distinct34
Distinct (%)100.0%
Missing2
Missing (%)5.6%
Memory size420.0 B
2023-12-12T14:56:42.740331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length6.8823529
Min length3

Characters and Unicode

Total characters234
Distinct characters85
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row가야조경건설 주식회사
2nd row경기아그로
3rd row경농사
4th row고양농자재상사
5th row그린솔루션
ValueCountFrequency (%)
바우농자재 1
 
2.8%
경농사 1
 
2.8%
자유로농약농자재 1
 
2.8%
아리유통 1
 
2.8%
영광종묘농약사 1
 
2.8%
우림종묘사 1
 
2.8%
원당농협자재센터 1
 
2.8%
원당상회 1
 
2.8%
일산농협영농지원센터 1
 
2.8%
신도농협화전지점 1
 
2.8%
Other values (26) 26
72.2%
2023-12-12T14:56:43.151459image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25
 
10.7%
14
 
6.0%
10
 
4.3%
9
 
3.8%
8
 
3.4%
8
 
3.4%
7
 
3.0%
6
 
2.6%
6
 
2.6%
6
 
2.6%
Other values (75) 135
57.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 226
96.6%
Open Punctuation 2
 
0.9%
Close Punctuation 2
 
0.9%
Space Separator 2
 
0.9%
Uppercase Letter 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
11.1%
14
 
6.2%
10
 
4.4%
9
 
4.0%
8
 
3.5%
8
 
3.5%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (70) 127
56.2%
Uppercase Letter
ValueCountFrequency (%)
H 1
50.0%
C 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 226
96.6%
Common 6
 
2.6%
Latin 2
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
11.1%
14
 
6.2%
10
 
4.4%
9
 
4.0%
8
 
3.5%
8
 
3.5%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (70) 127
56.2%
Common
ValueCountFrequency (%)
( 2
33.3%
) 2
33.3%
2
33.3%
Latin
ValueCountFrequency (%)
H 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 226
96.6%
ASCII 8
 
3.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
 
11.1%
14
 
6.2%
10
 
4.4%
9
 
4.0%
8
 
3.5%
8
 
3.5%
7
 
3.1%
6
 
2.7%
6
 
2.7%
6
 
2.7%
Other values (70) 127
56.2%
ASCII
ValueCountFrequency (%)
( 2
25.0%
) 2
25.0%
2
25.0%
H 1
12.5%
C 1
12.5%

전화번호
Text

MISSING 

Distinct32
Distinct (%)100.0%
Missing4
Missing (%)11.1%
Memory size420.0 B
2023-12-12T14:56:43.353144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length11.875
Min length11

Characters and Unicode

Total characters380
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique32 ?
Unique (%)100.0%

Sample

1st row031-918-5768
2nd row031-963-1665
3rd row031-971-0172
4th row031-968-3693
5th row02-3158-4177
ValueCountFrequency (%)
02-388-1438 1
 
3.1%
031-963-1665 1
 
3.1%
031-970-1324 1
 
3.1%
031-923-7243 1
 
3.1%
02-381-9440 1
 
3.1%
031-975-0420 1
 
3.1%
031-965-1110 1
 
3.1%
031-962-6483 1
 
3.1%
031-918-5768 1
 
3.1%
02-3158-3191 1
 
3.1%
Other values (22) 22
68.8%
2023-12-12T14:56:43.694988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 64
16.8%
1 53
13.9%
0 52
13.7%
3 51
13.4%
9 36
9.5%
2 24
 
6.3%
7 24
 
6.3%
8 21
 
5.5%
6 20
 
5.3%
4 18
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 316
83.2%
Dash Punctuation 64
 
16.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 53
16.8%
0 52
16.5%
3 51
16.1%
9 36
11.4%
2 24
7.6%
7 24
7.6%
8 21
 
6.6%
6 20
 
6.3%
4 18
 
5.7%
5 17
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 64
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 380
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 64
16.8%
1 53
13.9%
0 52
13.7%
3 51
13.4%
9 36
9.5%
2 24
 
6.3%
7 24
 
6.3%
8 21
 
5.5%
6 20
 
5.3%
4 18
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 380
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 64
16.8%
1 53
13.9%
0 52
13.7%
3 51
13.4%
9 36
9.5%
2 24
 
6.3%
7 24
 
6.3%
8 21
 
5.5%
6 20
 
5.3%
4 18
 
4.7%

주소
Text

MISSING 

Distinct34
Distinct (%)100.0%
Missing2
Missing (%)5.6%
Memory size420.0 B
2023-12-12T14:56:43.918074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length19
Mean length14.382353
Min length9

Characters and Unicode

Total characters489
Distinct characters60
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row일산동구 고봉로 616 나동 103호
2nd row일산서구 고양대로632번길 60 201호
3rd row덕양구 유산길 5
4th row덕양구 토당로 10
5th row경기 고양시 덕양구 서오릉로 855
ValueCountFrequency (%)
덕양구 20
 
17.2%
일산서구 8
 
6.9%
일산동구 6
 
5.2%
토당로 4
 
3.4%
경기 3
 
2.6%
고양시 3
 
2.6%
화랑로 3
 
2.6%
원당로 3
 
2.6%
장항로 2
 
1.7%
고봉로 2
 
1.7%
Other values (58) 62
53.4%
2023-12-12T14:56:44.277692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
82
16.8%
34
 
7.0%
33
 
6.7%
26
 
5.3%
1 21
 
4.3%
2 21
 
4.3%
20
 
4.1%
6 19
 
3.9%
18
 
3.7%
17
 
3.5%
Other values (50) 198
40.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 270
55.2%
Decimal Number 127
26.0%
Space Separator 82
 
16.8%
Dash Punctuation 10
 
2.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
34
 
12.6%
33
 
12.2%
26
 
9.6%
20
 
7.4%
18
 
6.7%
17
 
6.3%
11
 
4.1%
9
 
3.3%
8
 
3.0%
8
 
3.0%
Other values (38) 86
31.9%
Decimal Number
ValueCountFrequency (%)
1 21
16.5%
2 21
16.5%
6 19
15.0%
5 14
11.0%
9 13
10.2%
0 10
7.9%
3 9
7.1%
4 8
 
6.3%
7 7
 
5.5%
8 5
 
3.9%
Space Separator
ValueCountFrequency (%)
82
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 270
55.2%
Common 219
44.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
34
 
12.6%
33
 
12.2%
26
 
9.6%
20
 
7.4%
18
 
6.7%
17
 
6.3%
11
 
4.1%
9
 
3.3%
8
 
3.0%
8
 
3.0%
Other values (38) 86
31.9%
Common
ValueCountFrequency (%)
82
37.4%
1 21
 
9.6%
2 21
 
9.6%
6 19
 
8.7%
5 14
 
6.4%
9 13
 
5.9%
0 10
 
4.6%
- 10
 
4.6%
3 9
 
4.1%
4 8
 
3.7%
Other values (2) 12
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 270
55.2%
ASCII 219
44.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
82
37.4%
1 21
 
9.6%
2 21
 
9.6%
6 19
 
8.7%
5 14
 
6.4%
9 13
 
5.9%
0 10
 
4.6%
- 10
 
4.6%
3 9
 
4.1%
4 8
 
3.7%
Other values (2) 12
 
5.5%
Hangul
ValueCountFrequency (%)
34
 
12.6%
33
 
12.2%
26
 
9.6%
20
 
7.4%
18
 
6.7%
17
 
6.3%
11
 
4.1%
9
 
3.3%
8
 
3.0%
8
 
3.0%
Other values (38) 86
31.9%

대표자
Text

MISSING 

Distinct18
Distinct (%)52.9%
Missing2
Missing (%)5.6%
Memory size420.0 B
2023-12-12T14:56:44.452151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters102
Distinct characters19
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)29.4%

Sample

1st row김**
2nd row정**
3rd row김**
4th row김**
5th row송**
ValueCountFrequency (%)
6
17.6%
5
14.7%
3
 
8.8%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
1
 
2.9%
1
 
2.9%
Other values (8) 8
23.5%
2023-12-12T14:56:44.719156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
* 68
66.7%
6
 
5.9%
5
 
4.9%
3
 
2.9%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
2
 
2.0%
1
 
1.0%
Other values (9) 9
 
8.8%

Most occurring categories

ValueCountFrequency (%)
Other Punctuation 68
66.7%
Other Letter 34
33.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
17.6%
5
14.7%
3
 
8.8%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
1
 
2.9%
1
 
2.9%
Other values (8) 8
23.5%
Other Punctuation
ValueCountFrequency (%)
* 68
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 68
66.7%
Hangul 34
33.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
17.6%
5
14.7%
3
 
8.8%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
1
 
2.9%
1
 
2.9%
Other values (8) 8
23.5%
Common
ValueCountFrequency (%)
* 68
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 68
66.7%
Hangul 34
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
* 68
100.0%
Hangul
ValueCountFrequency (%)
6
17.6%
5
14.7%
3
 
8.8%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
2
 
5.9%
1
 
2.9%
1
 
2.9%
Other values (8) 8
23.5%

등록일자
Date

MISSING 

Distinct7
Distinct (%)20.6%
Missing2
Missing (%)5.6%
Memory size420.0 B
Minimum2018-01-01 00:00:00
Maximum2022-03-17 00:00:00
2023-12-12T14:56:44.825901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T14:56:44.929857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)

Correlations

2023-12-12T14:56:45.001590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명전화번호주소대표자등록일자
업체명1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.000
대표자1.0001.0001.0001.0000.857
등록일자1.0001.0001.0000.8571.000
2023-12-12T14:56:45.090600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
소속시도농약업유형소속시군구
소속시도1.0001.0001.000
농약업유형1.0001.0001.000
소속시군구1.0001.0001.000
2023-12-12T14:56:45.169309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
농약업유형소속시도소속시군구
농약업유형1.0001.0001.000
소속시도1.0001.0001.000
소속시군구1.0001.0001.000

Missing values

2023-12-12T14:56:41.512199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:56:41.629588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T14:56:41.769471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

농약업유형소속시도소속시군구업체명전화번호주소대표자등록일자
0일반판매업경기도고양시가야조경건설 주식회사031-918-5768일산동구 고봉로 616 나동 103호김**2021-05-24
1일반판매업경기도고양시경기아그로<NA>일산서구 고양대로632번길 60 201호정**2018-01-01
2일반판매업경기도고양시경농사031-963-1665덕양구 유산길 5김**2018-01-01
3일반판매업경기도고양시고양농자재상사031-971-0172덕양구 토당로 10김**2018-01-01
4일반판매업경기도고양시그린솔루션031-968-3693경기 고양시 덕양구 서오릉로 855송**2019-02-20
5일반판매업경기도고양시농민상회02-3158-4177덕양구 화랑로 47최**2018-01-01
6일반판매업경기도고양시농업회사법인 원우바이오031-923-5355일산서구 송산로 210 4동정**2020-02-04
7일반판매업경기도고양시모범농자재031-972-7070덕양구 토당로 66-29신**2018-01-01
8일반판매업경기도고양시미성종합자재(주)031-368-3699경기 고양시 덕양구 고양대로 1569-1정**2019-02-19
9일반판매업경기도고양시바우농자재031-965-7789경기 고양시 덕양구 원당로 459번길29-17손**2019-02-19
농약업유형소속시도소속시군구업체명전화번호주소대표자등록일자
26일반판매업경기도고양시일산종묘사031-975-2400일산서구 고양대로672번길 15-2조**2018-01-01
27일반판매업경기도고양시자유로농약농자재031-905-7171일산동구 장항로 329 가동 2호나**2018-01-01
28일반판매업경기도고양시중앙종묘사02-381-7932덕양구 삼송로 168조**2018-01-01
29일반판매업경기도고양시지도농업협동조합031-974-7591덕양구 토당로 75 지도농업협동조합장**2018-01-01
30일반판매업경기도고양시한아름종묘사031-976-6843일산서구 일청로 19-1여**2018-01-01
31일반판매업경기도고양시현대농약사031-908-2187일산동구 장항로 50신**2018-01-01
32일반판매업경기도고양시현대조경자재02-381-3031덕양구 삼송로 295지축동석**2018-01-01
33일반판매업경기도고양시화전농약농자재02-3158-3800덕양구 화랑로 31최**2018-01-01
34<NA><NA><NA><NA><NA><NA><NA><NA>
35<NA><NA><NA><NA><NA><NA><NA><NA>

Duplicate rows

Most frequently occurring

농약업유형소속시도소속시군구업체명전화번호주소대표자등록일자# duplicates
0<NA><NA><NA><NA><NA><NA><NA><NA>2