Overview

Dataset statistics

Number of variables: 5
Number of observations: 72
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.9 KiB
Average record size in memory: 41.8 B

Variable types

Text: 4
Categorical: 1

Dataset

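Description: Branch-office information for GAP certification bodies managed by the National Agricultural Products Quality Management Service (국립농산물품질관리원): designation number (지정번호), certification body name (인증기관명), head office/branch (본사_지사), location (소재지), and telephone number (전화번호).
Author: National Agricultural Products Quality Management Service (국립농산물품질관리원)
URL: https://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220614000000002103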

Alerts

본사_지사 is highly imbalanced (62.9%) [Imbalance]
소재지 has unique values [Unique]
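
The 62.9% imbalance figure can be reproduced from the value counts this report gives for 본사_지사 (15 distinct values over 72 rows, with 본사 occurring 58 times and the remaining 14 values once each). The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum, log2 of the number of classes. That formula is an assumption inferred from the numbers, not something the report states, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits) of the counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no meaningful imbalance
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# 본사_지사: one value (본사) with 58 rows and 14 values with one row each,
# as reported in the variable's statistics (assumed formula, see above).
counts = [58] + [1] * 14
score = imbalance_score(counts)
print(f"{score:.1%}")  # 62.9%
```

A perfectly balanced column scores 0 under this definition, and the score approaches 1 as a single value dominates.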

Reproduction

Analysis started: 2023-12-11 03:03:14.273536
Analysis finished: 2023-12-11 03:03:14.834826
Duration: 0.56 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

지정번호
Text

Distinct: 58
Distinct (%): 80.6%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Characters and Unicode

Total characters: 360
Distinct characters: 12
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
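
As a minimal sketch of how such character properties can be read, the snippet below uses Python's standard `unicodedata` module to tally the general category of every character in the five sample values of this variable. The helper name `category_counts` is hypothetical, and the profiling tool itself may use a different library. `Nd` is the code for Decimal Number and `Lo` for Other Letter.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters in the given strings."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)

# Sample rows of this variable, copied from the report.
sample = ["제001호", "제002호", "제004호", "제005호", "제006호"]
counts = category_counts(sample)
total = sum(counts.values())
for code, n in counts.most_common():
    print(code, n, f"{n / total:.1%}")
```

Each value has three digits and two Hangul syllables, so the sample splits 60% / 40% between the two categories, matching the proportions the report gives for the full column.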

Unique

Unique: 52
Unique (%): 72.2%
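
The difference between "Distinct" (58) and "Unique" (52) can be checked with a short sketch. It assumes "Distinct" counts different values and "Unique" counts values that occur exactly once, which is consistent with the frequency table for this variable: six values repeat, and the rest appear once. The singleton labels generated below are hypothetical placeholders, since the report does not list all of them.

```python
from collections import Counter

# Repeated values and their counts, as listed in the report's frequency table.
repeated = {"제066호": 6, "제073호": 4, "제055호": 3, "제051호": 3, "제012호": 2, "제069호": 2}

# The remaining 52 rows each hold a value seen once (placeholder labels).
values = [v for v, n in repeated.items() for _ in range(n)]
values += [f"single-{i}" for i in range(52)]

freq = Counter(values)
n_rows = len(values)
n_distinct = len(freq)                                # different values
n_unique = sum(1 for n in freq.values() if n == 1)    # values occurring exactly once

print(n_rows, n_distinct, n_unique)
print(f"{n_distinct / n_rows:.1%}", f"{n_unique / n_rows:.1%}")
```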

Sample

1st row: 제001호
2nd row: 제002호
3rd row: 제004호
4th row: 제005호
5th row: 제006호
Value | Count | Frequency (%)
제066호 | 6 | 8.3%
제073호 | 4 | 5.6%
제055호 | 3 | 4.2%
제051호 | 3 | 4.2%
제012호 | 2 | 2.8%
제069호 | 2 | 2.8%
제074호 | 1 | 1.4%
제078호 | 1 | 1.4%
제077호 | 1 | 1.4%
제076호 | 1 | 1.4%
Other values (48) | 48 | 66.7%

Most occurring characters

Value | Count | Frequency (%)
0 | 83 | 23.1%
제 | 72 | 20.0%
호 | 72 | 20.0%
6 | 27 | 7.5%
7 | 19 | 5.3%
5 | 19 | 5.3%
8 | 14 | 3.9%
3 | 13 | 3.6%
9 | 13 | 3.6%
2 | 10 | 2.8%
Other values (2) | 18 | 5.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 216 | 60.0%
Other Letter | 144 | 40.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 83 | 38.4%
6 | 27 | 12.5%
7 | 19 | 8.8%
5 | 19 | 8.8%
8 | 14 | 6.5%
3 | 13 | 6.0%
9 | 13 | 6.0%
2 | 10 | 4.6%
1 | 9 | 4.2%
4 | 9 | 4.2%

Other Letter
Value | Count | Frequency (%)
제 | 72 | 50.0%
호 | 72 | 50.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 216 | 60.0%
Hangul | 144 | 40.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 83 | 38.4%
6 | 27 | 12.5%
7 | 19 | 8.8%
5 | 19 | 8.8%
8 | 14 | 6.5%
3 | 13 | 6.0%
9 | 13 | 6.0%
2 | 10 | 4.6%
1 | 9 | 4.2%
4 | 9 | 4.2%

Hangul
Value | Count | Frequency (%)
제 | 72 | 50.0%
호 | 72 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 216 | 60.0%
Hangul | 144 | 40.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 83 | 38.4%
6 | 27 | 12.5%
7 | 19 | 8.8%
5 | 19 | 8.8%
8 | 14 | 6.5%
3 | 13 | 6.0%
9 | 13 | 6.0%
2 | 10 | 4.6%
1 | 9 | 4.2%
4 | 9 | 4.2%

Hangul
Value | Count | Frequency (%)
제 | 72 | 50.0%
호 | 72 | 50.0%

인증기관명
Text

Distinct: 58
Distinct (%): 80.6%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 18
Median length: 14
Mean length: 11.555556
Min length: 6

Characters and Unicode

Total characters: 832
Distinct characters: 135
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
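
The length and character statistics are linked by simple arithmetic: with no missing values, the mean length of a text column should equal its total character count divided by the 72 observations. The sketch below checks this against the totals reported for the four text columns; the dictionary simply restates figures from this report, and the variable names are illustrative.

```python
N_OBSERVATIONS = 72

# Total characters per text column, as reported in each "Characters and Unicode" block.
total_characters = {"지정번호": 360, "인증기관명": 832, "소재지": 2113, "전화번호": 864}

# Mean length = total characters / number of rows (valid because no cells are missing).
mean_length = {name: total / N_OBSERVATIONS for name, total in total_characters.items()}
for name, value in mean_length.items():
    print(name, round(value, 6))
```

The results agree with the reported mean lengths of 5, 11.555556, 29.347222 and 12.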

Unique

Unique: 52
Unique (%): 72.2%

Sample

1st row: 농협경제지주(주)식품R&D연구소
2nd row: ㈔한국생약협회
3rd row: 주식회사 온누리친환경
4th row: 주식회사 글로벌농식품인증원
5th row: (재)금산인삼약초산업진흥원
Value | Count | Frequency (%)
주식회사 | 25 | 23.4%
농식품인증관리원 | 6 | 5.6%
산학협력단 | 5 | 4.7%
녹색친환경 | 4 | 3.7%
㈜비씨에스코리아 | 3 | 2.8%
사)한솔농림수산식품인증센터 | 3 | 2.8%
강원대학교 | 2 | 1.9%
주)오에이티씨 | 2 | 1.9%
주)농산물품질인증평가원 | 1 | 0.9%
한국농식품분석연구소 | 1 | 0.9%
Other values (55) | 55 | 51.4%

Most occurring characters

ValueCountFrequency (%)
51
 
6.1%
39
 
4.7%
38
 
4.6%
35
 
4.2%
33
 
4.0%
33
 
4.0%
32
 
3.8%
) 30
 
3.6%
( 30
 
3.6%
29
 
3.5%
Other values (125) 482
57.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 730 | 87.7%
Space Separator | 35 | 4.2%
Close Punctuation | 30 | 3.6%
Open Punctuation | 30 | 3.6%
Other Symbol | 4 | 0.5%
Uppercase Letter | 2 | 0.2%
Other Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
51
 
7.0%
39
 
5.3%
38
 
5.2%
33
 
4.5%
33
 
4.5%
32
 
4.4%
29
 
4.0%
25
 
3.4%
16
 
2.2%
16
 
2.2%
Other values (117) 418
57.3%
Other Symbol
ValueCountFrequency (%)
3
75.0%
1
 
25.0%
Uppercase Letter
ValueCountFrequency (%)
R 1
50.0%
D 1
50.0%
Space Separator
ValueCountFrequency (%)
35
100.0%
Close Punctuation
ValueCountFrequency (%)
) 30
100.0%
Open Punctuation
ValueCountFrequency (%)
( 30
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 734 | 88.2%
Common | 96 | 11.5%
Latin | 2 | 0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
51
 
6.9%
39
 
5.3%
38
 
5.2%
33
 
4.5%
33
 
4.5%
32
 
4.4%
29
 
4.0%
25
 
3.4%
16
 
2.2%
16
 
2.2%
Other values (119) 422
57.5%
Common
ValueCountFrequency (%)
35
36.5%
) 30
31.2%
( 30
31.2%
& 1
 
1.0%
Latin
ValueCountFrequency (%)
R 1
50.0%
D 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 730 | 87.7%
ASCII | 98 | 11.8%
None | 4 | 0.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
51
 
7.0%
39
 
5.3%
38
 
5.2%
33
 
4.5%
33
 
4.5%
32
 
4.4%
29
 
4.0%
25
 
3.4%
16
 
2.2%
16
 
2.2%
Other values (117) 418
57.3%
ASCII
ValueCountFrequency (%)
35
35.7%
) 30
30.6%
( 30
30.6%
R 1
 
1.0%
& 1
 
1.0%
D 1
 
1.0%
None
ValueCountFrequency (%)
3
75.0%
1
 
25.0%

본사_지사
Categorical

IMBALANCE 

Distinct: 15
Distinct (%): 20.8%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B
Value | Count
본사 | 58
영동사무소 | 1
전남사무소 | 1
강원사무소 | 1
충청사무소 | 1
Other values (10) | 10

Length

Max length: 6
Median length: 2
Mean length: 2.5416667
Min length: 2

Unique

Unique: 14
Unique (%): 19.4%

Sample

1st row: 본사
2nd row: 본사
3rd row: 본사
4th row: 본사
5th row: 본사

Common Values

Value | Count | Frequency (%)
본사 | 58 | 80.6%
영동사무소 | 1 | 1.4%
전남사무소 | 1 | 1.4%
강원사무소 | 1 | 1.4%
충청사무소 | 1 | 1.4%
호남사무소 | 1 | 1.4%
호남지사 | 1 | 1.4%
경북지사 | 1 | 1.4%
강원지사 | 1 | 1.4%
경남지사 | 1 | 1.4%
Other values (5) | 5 | 6.9%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
본사 | 58 | 80.6%
전남사무소 | 2 | 2.8%
영동사무소 | 1 | 1.4%
강원사무소 | 1 | 1.4%
충청사무소 | 1 | 1.4%
호남사무소 | 1 | 1.4%
호남지사 | 1 | 1.4%
경북지사 | 1 | 1.4%
강원지사 | 1 | 1.4%
경남지사 | 1 | 1.4%
Other values (4) | 4 | 5.6%

소재지
Text

UNIQUE 

Distinct: 72
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 53
Median length: 36
Mean length: 29.347222
Min length: 18

Characters and Unicode

Total characters: 2113
Distinct characters: 219
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 72
Unique (%): 100.0%

Sample

1st row: 경기도 수원시 영통구 센트럴타운로 114-8 8층
2nd row: 서울특별시 동대문구 약령동길 88
3rd row: 충청남도 천안시 서북구 성정중7길 26 (성정동)
4th row: 대구광역시 북구 칠곡중앙대로136길 30 (동호동)
5th row: 충청남도 금산군 금산읍 인삼광장로 25
Value | Count | Frequency (%)
경기도 | 10 | 2.3%
전라남도 | 9 | 2.1%
경상남도 | 7 | 1.6%
전라북도 | 7 | 1.6%
광주광역시 | 6 | 1.4%
충청남도 | 6 | 1.4%
강원도 | 5 | 1.1%
충청북도 | 5 | 1.1%
서구 | 5 | 1.1%
2층 | 5 | 1.1%
Other values (324) | 373 | 85.2%

Most occurring characters

ValueCountFrequency (%)
373
 
17.7%
1 76
 
3.6%
64
 
3.0%
63
 
3.0%
61
 
2.9%
57
 
2.7%
2 54
 
2.6%
( 53
 
2.5%
) 53
 
2.5%
3 42
 
2.0%
Other values (209) 1217
57.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1277 | 60.4%
Space Separator | 373 | 17.7%
Decimal Number | 340 | 16.1%
Open Punctuation | 53 | 2.5%
Close Punctuation | 53 | 2.5%
Dash Punctuation | 16 | 0.8%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
64
 
5.0%
63
 
4.9%
61
 
4.8%
57
 
4.5%
35
 
2.7%
33
 
2.6%
30
 
2.3%
29
 
2.3%
28
 
2.2%
28
 
2.2%
Other values (194) 849
66.5%
Decimal Number
ValueCountFrequency (%)
1 76
22.4%
2 54
15.9%
3 42
12.4%
0 40
11.8%
4 31
9.1%
5 25
 
7.4%
8 22
 
6.5%
7 19
 
5.6%
6 18
 
5.3%
9 13
 
3.8%
Space Separator
ValueCountFrequency (%)
373
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%
Close Punctuation
ValueCountFrequency (%)
) 53
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 16
100.0%
Other Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1277 | 60.4%
Common | 836 | 39.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
64
 
5.0%
63
 
4.9%
61
 
4.8%
57
 
4.5%
35
 
2.7%
33
 
2.6%
30
 
2.3%
29
 
2.3%
28
 
2.2%
28
 
2.2%
Other values (194) 849
66.5%
Common
ValueCountFrequency (%)
373
44.6%
1 76
 
9.1%
2 54
 
6.5%
( 53
 
6.3%
) 53
 
6.3%
3 42
 
5.0%
0 40
 
4.8%
4 31
 
3.7%
5 25
 
3.0%
8 22
 
2.6%
Other values (5) 67
 
8.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1277 | 60.4%
ASCII | 835 | 39.5%
None | 1 | < 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
373
44.7%
1 76
 
9.1%
2 54
 
6.5%
( 53
 
6.3%
) 53
 
6.3%
3 42
 
5.0%
0 40
 
4.8%
4 31
 
3.7%
5 25
 
3.0%
8 22
 
2.6%
Other values (4) 66
 
7.9%
Hangul
ValueCountFrequency (%)
64
 
5.0%
63
 
4.9%
61
 
4.8%
57
 
4.5%
35
 
2.7%
33
 
2.6%
30
 
2.3%
29
 
2.3%
28
 
2.2%
28
 
2.2%
Other values (194) 849
66.5%
None
ValueCountFrequency (%)
1
100.0%

전화번호
Text

Distinct: 69
Distinct (%): 95.8%
Missing: 0
Missing (%): 0.0%
Memory size: 708.0 B

Length

Max length: 13
Median length: 12
Mean length: 12
Min length: 11

Characters and Unicode

Total characters: 864
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 68
Unique (%): 94.4%

Sample

1st row: 031-8021-7015
2nd row: 02-967-8133
3rd row: 041-555-1915
4th row: 053-326-9895
5th row: 041-750-1690
Value | Count | Frequency (%)
055-356-6279 | 4 | 5.6%
033-812-2010 | 1 | 1.4%
053-324-3232 | 1 | 1.4%
062-956-9320 | 1 | 1.4%
061-334-8500 | 1 | 1.4%
062-971-9002 | 1 | 1.4%
053-783-9393 | 1 | 1.4%
02-423-7748 | 1 | 1.4%
055-880-2880 | 1 | 1.4%
031-8021-7015 | 1 | 1.4%
Other values (59) | 59 | 81.9%

Most occurring characters

Value | Count | Frequency (%)
- | 144 | 16.7%
0 | 129 | 14.9%
3 | 99 | 11.5%
5 | 87 | 10.1%
2 | 78 | 9.0%
1 | 72 | 8.3%
6 | 69 | 8.0%
7 | 49 | 5.7%
4 | 49 | 5.7%
8 | 47 | 5.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 720 | 83.3%
Dash Punctuation | 144 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 129 | 17.9%
3 | 99 | 13.8%
5 | 87 | 12.1%
2 | 78 | 10.8%
1 | 72 | 10.0%
6 | 69 | 9.6%
7 | 49 | 6.8%
4 | 49 | 6.8%
8 | 47 | 6.5%
9 | 41 | 5.7%

Dash Punctuation
Value | Count | Frequency (%)
- | 144 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 864 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 144 | 16.7%
0 | 129 | 14.9%
3 | 99 | 11.5%
5 | 87 | 10.1%
2 | 78 | 9.0%
1 | 72 | 8.3%
6 | 69 | 8.0%
7 | 49 | 5.7%
4 | 49 | 5.7%
8 | 47 | 5.4%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 864 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 144 | 16.7%
0 | 129 | 14.9%
3 | 99 | 11.5%
5 | 87 | 10.1%
2 | 78 | 9.0%
1 | 72 | 8.3%
6 | 69 | 8.0%
7 | 49 | 5.7%
4 | 49 | 5.7%
8 | 47 | 5.4%

Correlations

Variable | 지정번호 | 인증기관명 | 본사_지사 | 소재지 | 전화번호
지정번호 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000
인증기관명 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000
본사_지사 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000
소재지 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
전화번호 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000
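
The report does not say which association measure produced this matrix. One measure that yields all-or-nothing values like these on small, high-cardinality columns is bias-corrected Cramér's V, so the sketch below implements that as an illustration only. The choice of measure, the helper name `cramers_v_corrected`, and the convention of returning 1.0 when the corrected denominator collapses to zero (as happens for an all-unique column such as 소재지) are all assumptions.

```python
import math
from collections import Counter

def cramers_v_corrected(xs, ys):
    """Bias-corrected Cramér's V between two equal-length sequences of labels."""
    n = len(xs)
    if n < 2:
        return 0.0
    joint = Counter(zip(xs, ys))
    row, col = Counter(xs), Counter(ys)
    r, k = len(row), len(col)
    # Pearson chi-squared summed over every cell of the contingency table.
    chi2 = 0.0
    for a in row:
        for b in col:
            expected = row[a] * col[b] / n
            chi2 += (joint[(a, b)] - expected) ** 2 / expected
    phi2 = chi2 / n
    # Bias correction: shrink phi^2 and the effective table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return 1.0  # assumed convention for degenerate (e.g. all-unique) columns
    return math.sqrt(phi2_corr / denom)

same = list("aabbcc")
v_identical = cramers_v_corrected(same, same)                     # identical columns
v_independent = cramers_v_corrected(list("aabb"), list("cdcd"))   # independent columns
print(v_identical, v_independent)
```

With only 72 rows, the correction term can cancel a genuine but weak association entirely, which is one plausible reading of the 0.000 entries against 본사_지사.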

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 지정번호 | 인증기관명 | 본사_지사 | 소재지 | 전화번호
0 | 제001호 | 농협경제지주(주)식품R&D연구소 | 본사 | 경기도 수원시 영통구 센트럴타운로 114-8 8층 | 031-8021-7015
1 | 제002호 | ㈔한국생약협회 | 본사 | 서울특별시 동대문구 약령동길 88 | 02-967-8133
2 | 제004호 | 주식회사 온누리친환경 | 본사 | 충청남도 천안시 서북구 성정중7길 26 (성정동) | 041-555-1915
3 | 제005호 | 주식회사 글로벌농식품인증원 | 본사 | 대구광역시 북구 칠곡중앙대로136길 30 (동호동) | 053-326-9895
4 | 제006호 | (재)금산인삼약초산업진흥원 | 본사 | 충청남도 금산군 금산읍 인삼광장로 25 | 041-750-1690
5 | 제012호 | 강원대학교 산학협력단 | 본사 | 강원도 춘천시 강원대학길 1 3동(효자동 강원대학교 태백관) | 033-250-7267
6 | 제012호 | 강원대학교 산학협력단 | 영동사무소 | 강원도 삼척시 중앙로 346 강원대학교 삼척캠퍼스 학생회관 3층 308호 | 033-570-6437
7 | 제023호 | 토지글로닉스 주식회사 | 본사 | 광주광역시 서구 매월2로 16 (매월동) | 062-655-0755
8 | 제024호 | (재)충북테크노파크 한방천연물센터 | 본사 | 충청북도 제천시 바이오밸리2로 41 (왕암동) | 043-270-2610
9 | 제029호 | 주식회사 이앤컴퍼니 | 본사 | 전라남도 화순군 능주면 죽수길 89 3층 | 061-371-2022

Last rows

Index | 지정번호 | 인증기관명 | 본사_지사 | 소재지 | 전화번호
62 | 제087호 | 주식회사 경기농업인증센터 | 본사 | 경기도 안성시 중앙로379번길 40 (대천동) | 031-676-6023
63 | 제088호 | (주)에코아임친환경기술연구원 | 본사 | 전라남도 강진군 성전면 강진산단로1길 1 창업보육실 124호 | 061-433-2675
64 | 제089호 | 농업회사법인(주)푸른솔 | 본사 | 경기도 수원시 영통구 월드컵로150번길 56 한경대학교 경기친환경농업연구센터 405호 (원천동) | 031-215-9833
65 | 제090호 | 카네기경영연구원 주식회사 | 본사 | 전라북도 익산시 중앙로 22-228 (중앙동2가) | 063-915-1133
66 | 제092호 | 주식회사 금강인증센터 | 본사 | 충청남도 공주시 봉황로 154 (교동) | 041-881-9775
67 | 제093호 | 순천대학교산학협력단 | 본사 | 전라남도 순천시 중앙로 255 (석현동) | 061-750-5471
68 | 제095호 | 유한회사 빛그린인증원 | 본사 | 전라남도 나주시 호수로 86 401호(빛가람동) | 061-335-3485
69 | 제096호 | (주)한국지에이피인증원 | 본사 | 강원도 원주시 소초면 간촌길 13-1 | 033-901-9010
70 | 제097호 | (주)티에스피인증관리원 | 본사 | 경기도 평택시 만세로 1738-17 (죽백동) 102호 | 031-647-0410
71 | 제098호 | 이풀약초협동조합 | 본사 | 서울특별시 은평구 통일로 684 미래청(1동) 409호 (녹번동) | 02-3674-5200