Overview

Dataset statistics

Number of variables5
Number of observations27
Missing cells21
Missing cells (%)15.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory44.9 B

Variable types

Text4
Categorical1

Dataset

Description전라북도 정읍시에 소재한 소독업체 현황중(업체명, 소재지도로명주소, 소재지지번주소, 전화번호)등의 정보를 제공합니다.
Author전라북도 정읍시
URLhttps://www.data.go.kr/data/3073953/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
소재지도로명주소 has 2 (7.4%) missing valuesMissing
소재지지번주소 has 3 (11.1%) missing valuesMissing
전화번호 has 16 (59.3%) missing valuesMissing

Reproduction

Analysis started2023-12-16 15:01:24.668845
Analysis finished2023-12-16 15:01:26.510547
Duration1.84 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct24
Distinct (%)88.9%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-16T15:01:26.898274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length7.3703704
Min length2

Characters and Unicode

Total characters199
Distinct characters90
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)77.8%

Sample

1st row(유)한국놀이시설안전관리원 정읍점
2nd row누리
3rd row농업회사법인유한회사바이오디에이치
4th row상동환경
5th row페스트제로
ValueCountFrequency (%)
유한회사전라환경 2
 
6.1%
하얀환경 2
 
6.1%
유한회사 2
 
6.1%
미래환경개발 2
 
6.1%
미래환경 1
 
3.0%
정읍점 1
 
3.0%
유)한국놀이시설안전관리원 1
 
3.0%
누리 1
 
3.0%
원자력경호경비시스템 1
 
3.0%
지구환경 1
 
3.0%
Other values (19) 19
57.6%
2023-12-16T15:01:28.405988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
14
 
7.0%
12
 
6.0%
9
 
4.5%
9
 
4.5%
8
 
4.0%
7
 
3.5%
6
 
3.0%
5
 
2.5%
4
 
2.0%
4
 
2.0%
Other values (80) 121
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 187
94.0%
Space Separator 6
 
3.0%
Open Punctuation 3
 
1.5%
Close Punctuation 3
 
1.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
7.5%
12
 
6.4%
9
 
4.8%
9
 
4.8%
8
 
4.3%
7
 
3.7%
5
 
2.7%
4
 
2.1%
4
 
2.1%
3
 
1.6%
Other values (77) 112
59.9%
Space Separator
ValueCountFrequency (%)
6
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 187
94.0%
Common 12
 
6.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
14
 
7.5%
12
 
6.4%
9
 
4.8%
9
 
4.8%
8
 
4.3%
7
 
3.7%
5
 
2.7%
4
 
2.1%
4
 
2.1%
3
 
1.6%
Other values (77) 112
59.9%
Common
ValueCountFrequency (%)
6
50.0%
( 3
25.0%
) 3
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 187
94.0%
ASCII 12
 
6.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
14
 
7.5%
12
 
6.4%
9
 
4.8%
9
 
4.8%
8
 
4.3%
7
 
3.7%
5
 
2.7%
4
 
2.1%
4
 
2.1%
3
 
1.6%
Other values (77) 112
59.9%
ASCII
ValueCountFrequency (%)
6
50.0%
( 3
25.0%
) 3
25.0%
Distinct25
Distinct (%)100.0%
Missing2
Missing (%)7.4%
Memory size348.0 B
2023-12-16T15:01:29.122857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length28
Mean length25.2
Min length13

Characters and Unicode

Total characters630
Distinct characters89
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique25 ?
Unique (%)100.0%

Sample

1st row전라북도 정읍시 초산로 3-1, 상가동 3층 G-12호 (시기동, 시기현대아파트)
2nd row전라북도 정읍시 소성면 소성로 432
3rd row전라북도 정읍시 신태인읍 신태인북길 585
4th row전라북도 정읍시 충정로 121, 2층 (상동)
5th row전라북도 정읍시 충정로 134, 1층 (상동)
ValueCountFrequency (%)
전라북도 25
 
17.7%
정읍시 23
 
16.3%
시기동 6
 
4.3%
2층 5
 
3.5%
연지동 4
 
2.8%
충정로 3
 
2.1%
초산로 3
 
2.1%
중앙로 3
 
2.1%
수성동 3
 
2.1%
상동 3
 
2.1%
Other values (60) 63
44.7%
2023-12-16T15:01:30.646343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
119
18.9%
30
 
4.8%
28
 
4.4%
27
 
4.3%
25
 
4.0%
25
 
4.0%
25
 
4.0%
25
 
4.0%
1 22
 
3.5%
21
 
3.3%
Other values (79) 283
44.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 364
57.8%
Space Separator 119
 
18.9%
Decimal Number 99
 
15.7%
Close Punctuation 14
 
2.2%
Open Punctuation 14
 
2.2%
Other Punctuation 13
 
2.1%
Dash Punctuation 6
 
1.0%
Uppercase Letter 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
30
 
8.2%
28
 
7.7%
27
 
7.4%
25
 
6.9%
25
 
6.9%
25
 
6.9%
25
 
6.9%
21
 
5.8%
16
 
4.4%
11
 
3.0%
Other values (63) 131
36.0%
Decimal Number
ValueCountFrequency (%)
1 22
22.2%
2 19
19.2%
3 13
13.1%
4 9
9.1%
5 9
9.1%
8 8
 
8.1%
6 5
 
5.1%
0 5
 
5.1%
7 5
 
5.1%
9 4
 
4.0%
Space Separator
ValueCountFrequency (%)
119
100.0%
Close Punctuation
ValueCountFrequency (%)
) 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Other Punctuation
ValueCountFrequency (%)
, 13
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 6
100.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 364
57.8%
Common 265
42.1%
Latin 1
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
30
 
8.2%
28
 
7.7%
27
 
7.4%
25
 
6.9%
25
 
6.9%
25
 
6.9%
25
 
6.9%
21
 
5.8%
16
 
4.4%
11
 
3.0%
Other values (63) 131
36.0%
Common
ValueCountFrequency (%)
119
44.9%
1 22
 
8.3%
2 19
 
7.2%
) 14
 
5.3%
( 14
 
5.3%
, 13
 
4.9%
3 13
 
4.9%
4 9
 
3.4%
5 9
 
3.4%
8 8
 
3.0%
Other values (5) 25
 
9.4%
Latin
ValueCountFrequency (%)
G 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 364
57.8%
ASCII 266
42.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
119
44.7%
1 22
 
8.3%
2 19
 
7.1%
) 14
 
5.3%
( 14
 
5.3%
, 13
 
4.9%
3 13
 
4.9%
4 9
 
3.4%
5 9
 
3.4%
8 8
 
3.0%
Other values (6) 26
 
9.8%
Hangul
ValueCountFrequency (%)
30
 
8.2%
28
 
7.7%
27
 
7.4%
25
 
6.9%
25
 
6.9%
25
 
6.9%
25
 
6.9%
21
 
5.8%
16
 
4.4%
11
 
3.0%
Other values (63) 131
36.0%

소재지지번주소
Text

MISSING 

Distinct23
Distinct (%)95.8%
Missing3
Missing (%)11.1%
Memory size348.0 B
2023-12-16T15:01:31.870819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length26
Mean length21.833333
Min length13

Characters and Unicode

Total characters524
Distinct characters78
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)91.7%

Sample

1st row전라북도 정읍시 시기동 506-14 시기현대아파트
2nd row전라북도 정읍시 소성면 주천리 381-1
3rd row전라북도 정읍시 신태인읍 연정리 581
4th row전라북도 정읍시 상동 370-15
5th row전라북도 정읍시 수성동 603-7
ValueCountFrequency (%)
전라북도 24
20.5%
정읍시 22
18.8%
시기동 5
 
4.3%
연지동 4
 
3.4%
상동 3
 
2.6%
수성동 3
 
2.6%
255번지 2
 
1.7%
31 2
 
1.7%
연지3길 2
 
1.7%
장명동 2
 
1.7%
Other values (46) 48
41.0%
2023-12-16T15:01:34.923362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
95
18.1%
29
 
5.5%
25
 
4.8%
25
 
4.8%
25
 
4.8%
24
 
4.6%
24
 
4.6%
24
 
4.6%
5 18
 
3.4%
18
 
3.4%
Other values (68) 217
41.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 323
61.6%
Space Separator 95
 
18.1%
Decimal Number 93
 
17.7%
Dash Punctuation 7
 
1.3%
Open Punctuation 3
 
0.6%
Close Punctuation 3
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
9.0%
25
 
7.7%
25
 
7.7%
25
 
7.7%
24
 
7.4%
24
 
7.4%
24
 
7.4%
18
 
5.6%
18
 
5.6%
12
 
3.7%
Other values (54) 99
30.7%
Decimal Number
ValueCountFrequency (%)
5 18
19.4%
1 13
14.0%
7 12
12.9%
3 11
11.8%
2 8
8.6%
6 8
8.6%
8 7
 
7.5%
0 7
 
7.5%
4 6
 
6.5%
9 3
 
3.2%
Space Separator
ValueCountFrequency (%)
95
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 323
61.6%
Common 201
38.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
9.0%
25
 
7.7%
25
 
7.7%
25
 
7.7%
24
 
7.4%
24
 
7.4%
24
 
7.4%
18
 
5.6%
18
 
5.6%
12
 
3.7%
Other values (54) 99
30.7%
Common
ValueCountFrequency (%)
95
47.3%
5 18
 
9.0%
1 13
 
6.5%
7 12
 
6.0%
3 11
 
5.5%
2 8
 
4.0%
6 8
 
4.0%
8 7
 
3.5%
0 7
 
3.5%
- 7
 
3.5%
Other values (4) 15
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 323
61.6%
ASCII 201
38.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
95
47.3%
5 18
 
9.0%
1 13
 
6.5%
7 12
 
6.0%
3 11
 
5.5%
2 8
 
4.0%
6 8
 
4.0%
8 7
 
3.5%
0 7
 
3.5%
- 7
 
3.5%
Other values (4) 15
 
7.5%
Hangul
ValueCountFrequency (%)
29
 
9.0%
25
 
7.7%
25
 
7.7%
25
 
7.7%
24
 
7.4%
24
 
7.4%
24
 
7.4%
18
 
5.6%
18
 
5.6%
12
 
3.7%
Other values (54) 99
30.7%

전화번호
Text

MISSING 

Distinct9
Distinct (%)81.8%
Missing16
Missing (%)59.3%
Memory size348.0 B
2023-12-16T15:01:35.587555image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters132
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)63.6%

Sample

1st row063-535-3112
2nd row063-537-5295
3rd row063-537-2101
4th row063-537-1155
5th row063-538-8252
ValueCountFrequency (%)
063-537-1238 2
18.2%
063-537-4341 2
18.2%
063-535-3112 1
9.1%
063-537-5295 1
9.1%
063-537-2101 1
9.1%
063-537-1155 1
9.1%
063-538-8252 1
9.1%
063-534-9996 1
9.1%
063-531-0472 1
9.1%
2023-12-16T15:01:36.917030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 27
20.5%
- 22
16.7%
5 17
12.9%
0 13
9.8%
6 12
9.1%
1 11
8.3%
7 8
 
6.1%
2 8
 
6.1%
4 6
 
4.5%
8 4
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 110
83.3%
Dash Punctuation 22
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 27
24.5%
5 17
15.5%
0 13
11.8%
6 12
10.9%
1 11
10.0%
7 8
 
7.3%
2 8
 
7.3%
4 6
 
5.5%
8 4
 
3.6%
9 4
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 132
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 27
20.5%
- 22
16.7%
5 17
12.9%
0 13
9.8%
6 12
9.1%
1 11
8.3%
7 8
 
6.1%
2 8
 
6.1%
4 6
 
4.5%
8 4
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 132
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 27
20.5%
- 22
16.7%
5 17
12.9%
0 13
9.8%
6 12
9.1%
1 11
8.3%
7 8
 
6.1%
2 8
 
6.1%
4 6
 
4.5%
8 4
 
3.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-12
27 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-12-12
2nd row2023-12-12
3rd row2023-12-12
4th row2023-12-12
5th row2023-12-12

Common Values

ValueCountFrequency (%)
2023-12-12 27
100.0%

Length

2023-12-16T15:01:37.558741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-16T15:01:37.984737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-12-12 27
100.0%

Correlations

2023-12-16T15:01:38.689344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명소재지도로명주소소재지지번주소전화번호
업체명1.0001.0000.9711.000
소재지도로명주소1.0001.0001.0001.000
소재지지번주소0.9711.0001.0001.000
전화번호1.0001.0001.0001.000

Missing values

2023-12-16T15:01:25.538958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-16T15:01:26.104585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-16T15:01:26.369989image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

업체명소재지도로명주소소재지지번주소전화번호데이터기준일자
0(유)한국놀이시설안전관리원 정읍점전라북도 정읍시 초산로 3-1, 상가동 3층 G-12호 (시기동, 시기현대아파트)전라북도 정읍시 시기동 506-14 시기현대아파트<NA>2023-12-12
1누리전라북도 정읍시 소성면 소성로 432전라북도 정읍시 소성면 주천리 381-1<NA>2023-12-12
2농업회사법인유한회사바이오디에이치전라북도 정읍시 신태인읍 신태인북길 585전라북도 정읍시 신태인읍 연정리 581<NA>2023-12-12
3상동환경전라북도 정읍시 충정로 121, 2층 (상동)전라북도 정읍시 상동 370-15<NA>2023-12-12
4페스트제로전라북도 정읍시 충정로 134, 1층 (상동)<NA>063-535-31122023-12-12
5농업회사법인주소율주식회사전라북도 정읍시 중앙1길 55, 2층 202호 (수성동)전라북도 정읍시 수성동 603-7<NA>2023-12-12
6윤테크 주식회사전라북도 정읍시 중앙로 189, 윤테크 주식회사 2층 (시기동)전라북도 정읍시 시기동 587 세탁나라<NA>2023-12-12
7미화산업전라북도 정읍시 충정로 201, 미화산업 (장명동)전라북도 정읍시 장명동 82번지 그린가스시공<NA>2023-12-12
8(유)정읍드론항공방제전라북도 정읍시 관통로 18, 2층 (장명동)전라북도 정읍시 장명동 175번지 장이랑쌈이랑 음식점<NA>2023-12-12
9버그클리너전라북도 정읍시 초산로 5-8, 버그클리너 1층 6호 (시기동)전라북도 정읍시 시기동 505번지 6호 상아슈퍼<NA>2023-12-12
업체명소재지도로명주소소재지지번주소전화번호데이터기준일자
17하늘그린전라북도 정읍시 벚꽃로 316 (시기동)전라북도 정읍시 시기동 530번지 87호063-534-99962023-12-12
18지구환경<NA>전라북도 정읍시 상동 197번지<NA>2023-12-12
19미래환경개발전라북도 중앙로 254전라북도 정읍시 상동 343번지 8호 207호063-537-12382023-12-12
20하얀환경전라북도 수성1로 64-9전라북도 정읍시 수성동 967번지 3호<NA>2023-12-12
21유한회사전라환경전라북도 정읍시 연지3길 31 (연지동)전라북도 정읍시 연지동 275번지 4호063-537-43412023-12-12
22미래환경개발전라북도 정읍시 상동 343번지 8호 207호전라북도 중앙로 254063-537-12382023-12-12
23하얀환경전라북도 정읍시 수성동 967번지 3호전라북도 수성1로 64-9063-531-04722023-12-12
24원자력경호경비시스템전라북도 정읍시 시기동 82번지 10호전라북도 정읍시 정읍사로 512-2 (시기동)<NA>2023-12-12
25유한회사 대한환경개발<NA>전라북도 정읍시 연지3길 31 (연지동)<NA>2023-12-12
26유한회사전라환경전라북도 정읍시 연지동 275번지 4호전라북도 정읍시 연지3길 31 (연지동)063-537-43412023-12-12