Overview

Dataset statistics

Number of variables5
Number of observations25
Missing cells25
Missing cells (%)20.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.1 KiB
Average record size in memory46.3 B

Variable types

Text4
Unsupported1

Dataset

Description전국에 수출입식물방제업 신고증을 받은 업체정보를 제공하는 서비스
Author농림축산검역본부
URLhttps://data.mafra.go.kr/opendata/data/indexOpenDataDetail.do?data_id=20220214000000001887

Alerts

Unnamed: 4 has 25 (100.0%) missing valuesMissing
업체명 has unique valuesUnique
주소 has unique valuesUnique
전화번호 has unique valuesUnique
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-11 03:11:27.738621
Analysis finished2023-12-11 03:11:28.517646
Duration0.78 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업체명
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-11T12:11:28.683453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length8.88
Min length5

Characters and Unicode

Total characters222
Distinct characters68
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row평택당진방역(주)
2nd row국제에프티엘(주)
3rd row대한티이씨(주)
4th row(주)영화기업사
5th row인천훈증주식회사
ValueCountFrequency (%)
주식회사 2
 
7.4%
평택당진방역(주 1
 
3.7%
글로벌방제주식회사(광양지점 1
 
3.7%
유)대신방역산업 1
 
3.7%
유)대진방역 1
 
3.7%
주)대명에스앤피 1
 
3.7%
주)해암산업 1
 
3.7%
한국방역산업주식회사 1
 
3.7%
주)명륜 1
 
3.7%
아주종합방제(주 1
 
3.7%
Other values (16) 16
59.3%
2023-12-11T12:11:29.154520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
24
 
10.8%
( 20
 
9.0%
) 20
 
9.0%
13
 
5.9%
8
 
3.6%
8
 
3.6%
7
 
3.2%
7
 
3.2%
6
 
2.7%
5
 
2.3%
Other values (58) 104
46.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 177
79.7%
Open Punctuation 20
 
9.0%
Close Punctuation 20
 
9.0%
Uppercase Letter 3
 
1.4%
Space Separator 2
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
13.6%
13
 
7.3%
8
 
4.5%
8
 
4.5%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.8%
5
 
2.8%
5
 
2.8%
Other values (52) 89
50.3%
Uppercase Letter
ValueCountFrequency (%)
P 1
33.3%
C 1
33.3%
S 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 20
100.0%
Close Punctuation
ValueCountFrequency (%)
) 20
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 177
79.7%
Common 42
 
18.9%
Latin 3
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
13.6%
13
 
7.3%
8
 
4.5%
8
 
4.5%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.8%
5
 
2.8%
5
 
2.8%
Other values (52) 89
50.3%
Common
ValueCountFrequency (%)
( 20
47.6%
) 20
47.6%
2
 
4.8%
Latin
ValueCountFrequency (%)
P 1
33.3%
C 1
33.3%
S 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 177
79.7%
ASCII 45
 
20.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
24
 
13.6%
13
 
7.3%
8
 
4.5%
8
 
4.5%
7
 
4.0%
7
 
4.0%
6
 
3.4%
5
 
2.8%
5
 
2.8%
5
 
2.8%
Other values (52) 89
50.3%
ASCII
ValueCountFrequency (%)
( 20
44.4%
) 20
44.4%
2
 
4.4%
P 1
 
2.2%
C 1
 
2.2%
S 1
 
2.2%

주소
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-11T12:11:29.491495image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length25
Mean length23.16
Min length17

Characters and Unicode

Total characters579
Distinct characters99
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row경기도 평택시 포승읍 신영리 951 (평택당진항만내)
2nd row부산광역시 동구 중앙대로236번길 10 금호빌딩 501호
3rd row부산광역시 사하구 동매로9번길 20
4th row인천광역시 중구 신포로15번길 73
5th row인천광역시 동구 제물량로 351
ValueCountFrequency (%)
부산광역시 10
 
8.5%
인천광역시 6
 
5.1%
중구 5
 
4.2%
강서구 4
 
3.4%
동구 3
 
2.5%
남구 3
 
2.5%
광양시 3
 
2.5%
전라남도 3
 
2.5%
중앙대로236번길 2
 
1.7%
10 2
 
1.7%
Other values (66) 77
65.3%
2023-12-11T12:11:29.972873image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
110
 
19.0%
25
 
4.3%
20
 
3.5%
1 20
 
3.5%
18
 
3.1%
18
 
3.1%
17
 
2.9%
2 16
 
2.8%
7 15
 
2.6%
15
 
2.6%
Other values (89) 305
52.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 341
58.9%
Space Separator 110
 
19.0%
Decimal Number 110
 
19.0%
Dash Punctuation 9
 
1.6%
Open Punctuation 4
 
0.7%
Close Punctuation 4
 
0.7%
Uppercase Letter 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
7.3%
20
 
5.9%
18
 
5.3%
18
 
5.3%
17
 
5.0%
15
 
4.4%
14
 
4.1%
10
 
2.9%
9
 
2.6%
9
 
2.6%
Other values (74) 186
54.5%
Decimal Number
ValueCountFrequency (%)
1 20
18.2%
2 16
14.5%
7 15
13.6%
0 12
10.9%
3 10
9.1%
5 10
9.1%
4 9
8.2%
6 8
 
7.3%
9 8
 
7.3%
8 2
 
1.8%
Space Separator
ValueCountFrequency (%)
110
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 341
58.9%
Common 237
40.9%
Latin 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
7.3%
20
 
5.9%
18
 
5.3%
18
 
5.3%
17
 
5.0%
15
 
4.4%
14
 
4.1%
10
 
2.9%
9
 
2.6%
9
 
2.6%
Other values (74) 186
54.5%
Common
ValueCountFrequency (%)
110
46.4%
1 20
 
8.4%
2 16
 
6.8%
7 15
 
6.3%
0 12
 
5.1%
3 10
 
4.2%
5 10
 
4.2%
4 9
 
3.8%
- 9
 
3.8%
6 8
 
3.4%
Other values (4) 18
 
7.6%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 341
58.9%
ASCII 238
41.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
110
46.2%
1 20
 
8.4%
2 16
 
6.7%
7 15
 
6.3%
0 12
 
5.0%
3 10
 
4.2%
5 10
 
4.2%
4 9
 
3.8%
- 9
 
3.8%
6 8
 
3.4%
Other values (5) 19
 
8.0%
Hangul
ValueCountFrequency (%)
25
 
7.3%
20
 
5.9%
18
 
5.3%
18
 
5.3%
17
 
5.0%
15
 
4.4%
14
 
4.1%
10
 
2.9%
9
 
2.6%
9
 
2.6%
Other values (74) 186
54.5%

전화번호
Text

UNIQUE 

Distinct25
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-11T12:11:30.247801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters300
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row031-681-7288
2nd row051-462-8777
3rd row051-203-5719
4th row032-589-4403
5th row032-584-3802
ValueCountFrequency (%)
031-681-7288 1
 
4.0%
061-792-7891 1
 
4.0%
063-467-6154 1
 
4.0%
061-792-7890 1
 
4.0%
051-637-4648 1
 
4.0%
054-274-7245 1
 
4.0%
051-326-8871 1
 
4.0%
032-765-7600 1
 
4.0%
051-465-6400 1
 
4.0%
051-668-9930 1
 
4.0%
Other values (15) 15
60.0%
2023-12-11T12:11:30.657978image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 50
16.7%
0 40
13.3%
1 31
10.3%
8 27
9.0%
4 27
9.0%
6 25
8.3%
7 25
8.3%
5 24
8.0%
2 22
7.3%
3 18
 
6.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 250
83.3%
Dash Punctuation 50
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 40
16.0%
1 31
12.4%
8 27
10.8%
4 27
10.8%
6 25
10.0%
7 25
10.0%
5 24
9.6%
2 22
8.8%
3 18
7.2%
9 11
 
4.4%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 300
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 50
16.7%
0 40
13.3%
1 31
10.3%
8 27
9.0%
4 27
9.0%
6 25
8.3%
7 25
8.3%
5 24
8.0%
2 22
7.3%
3 18
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 300
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 50
16.7%
0 40
13.3%
1 31
10.3%
8 27
9.0%
4 27
9.0%
6 25
8.3%
7 25
8.3%
5 24
8.0%
2 22
7.3%
3 18
 
6.0%
Distinct21
Distinct (%)84.0%
Missing0
Missing (%)0.0%
Memory size332.0 B
2023-12-11T12:11:30.908875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters75
Distinct characters45
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)72.0%

Sample

1st row홍재희
2nd row김호윤
3rd row임기택
4th row주창대
5th row홍재희
ValueCountFrequency (%)
이진우 3
 
12.0%
홍재희 2
 
8.0%
김호윤 2
 
8.0%
이돈홍 1
 
4.0%
이성운 1
 
4.0%
유인영 1
 
4.0%
김채현 1
 
4.0%
노환덕 1
 
4.0%
강인식 1
 
4.0%
박연진 1
 
4.0%
Other values (11) 11
44.0%
2023-12-11T12:11:31.311931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6
 
8.0%
5
 
6.7%
4
 
5.3%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
2
 
2.7%
2
 
2.7%
Other values (35) 41
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 75
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6
 
8.0%
5
 
6.7%
4
 
5.3%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
2
 
2.7%
2
 
2.7%
Other values (35) 41
54.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 75
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6
 
8.0%
5
 
6.7%
4
 
5.3%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
2
 
2.7%
2
 
2.7%
Other values (35) 41
54.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 75
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6
 
8.0%
5
 
6.7%
4
 
5.3%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
3
 
4.0%
2
 
2.7%
2
 
2.7%
Other values (35) 41
54.7%

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing25
Missing (%)100.0%
Memory size357.0 B

Correlations

2023-12-11T12:11:31.447457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명주소전화번호대표자명
업체명1.0001.0001.0001.000
주소1.0001.0001.0001.000
전화번호1.0001.0001.0001.000
대표자명1.0001.0001.0001.000

Missing values

2023-12-11T12:11:28.336847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T12:11:28.463687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명주소전화번호대표자명Unnamed: 4
0평택당진방역(주)경기도 평택시 포승읍 신영리 951 (평택당진항만내)031-681-7288홍재희<NA>
1국제에프티엘(주)부산광역시 동구 중앙대로236번길 10 금호빌딩 501호051-462-8777김호윤<NA>
2대한티이씨(주)부산광역시 사하구 동매로9번길 20051-203-5719임기택<NA>
3(주)영화기업사인천광역시 중구 신포로15번길 73032-589-4403주창대<NA>
4인천훈증주식회사인천광역시 동구 제물량로 351032-584-3802홍재희<NA>
5글로벌방제주식회사울산광역시 남구 용연로 295-109052-240-8370이진우<NA>
6세림방역(주)경기도 평택시 포승읍 포승공단순환로 49031-686-5548문봉종<NA>
7(주)금호방역부산광역시 강서구 유통단지1로 76 12-301051-441-7800함영숙<NA>
8PCS(주)전라남도 광양시 중마용소4길 20061-903-1788용근식<NA>
9한국종합방제(주)인천광역시 중구 서해대로417번길 27-7032-887-8111이웅기<NA>
업체명주소전화번호대표자명Unnamed: 4
15(주)신우티앤씨인천광역시 중구 인항로 6 씨팰리스 A-805032-886-4417이성운<NA>
16글로벌방제주식회사(부산지점)부산광역시 남구 수영로 74-5 2층 202호051-668-9930이진우<NA>
17아주종합방제(주)부산광역시 사하구 동매로 140-1051-465-6400안호제<NA>
18(주)명륜인천광역시 중구 신포로27번길 33 3층032-765-7600고도명<NA>
19한국방역산업주식회사부산광역시 강서구 화전산단2로 70-16051-326-8871박연진<NA>
20(주)해암산업경상북도 포항시 북구 삼호로74번길 7054-274-7245강인식<NA>
21(주)대명에스앤피부산광역시 남구 우암로 75 (감만동)051-637-4648노환덕<NA>
22(유)대진방역전라남도 광양시 용지길 67-1 (태인동)061-792-7890김채현<NA>
23(유)대신방역산업전라북도 군산시 외항안길 295063-467-6154유인영<NA>
24한국방역써비스(주)전라북도 군산시 풍전4길 42063-465-0251염대환<NA>