Overview

Dataset statistics

Number of variables: 8
Number of observations: 107
Missing cells: 10
Missing cells (%): 1.2%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.8 KiB
Average record size in memory: 65.2 B
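
The percentage and size figures above follow from the raw counts by simple arithmetic. The sketch below recomputes them as a sanity check; the variable names are illustrative, and the total size is back-computed from the rounded 65.2 B average, so it is only approximate.

```python
# Figures copied from the dataset statistics above.
n_variables = 8
n_observations = 107
missing_cells = 10
avg_record_bytes = 65.2  # rounded value as reported

# Missing cells (%) is the share of empty cells among all cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Total size is roughly the average record size times the row count.
total_kib = avg_record_bytes * n_observations / 1024

print(f"{total_cells} cells, {missing_pct:.1f}% missing, about {total_kib:.1f} KiB")
```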

Variable types

Text: 4
Categorical: 2
DateTime: 2

Dataset

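Description: Information from the list of business sites in Jeongeup-si, Jeollabuk-do that hold an air-pollutant emission notification or permit. Fields include 사업장명 (business name), 사업장전화번호 (business phone number), 도로명주소 (road-name address), 업종 (industry), 대기종별 (air emission class), 인허가 (notification or permit), and 신고허가일자 (notification/permit date).
Author: Jeongeup-si, Jeollabuk-do (전라북도 정읍시)
URL: https://www.data.go.kr/data/15077826/fileData.do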

Alerts

데이터기준일자 has constant value "2021-03-17" (Constant)
인허가 is highly imbalanced (55.2%) (Imbalance)
사업장전화번호 has 10 (9.3%) missing values (Missing)
사업장명 has unique values (Unique)
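
The 55.2% imbalance figure is consistent with an entropy-based score: one minus the Shannon entropy of the class frequencies divided by the maximum possible entropy. The sketch below assumes that definition and applies it to the 인허가 value counts given in this report (신고 = 97, 허가 = 10). `imbalance_score` is an illustrative name, not a function taken from the report, and returning 0.0 for a single class is a choice made here.

```python
import math

def imbalance_score(counts):
    """Return 1 - (Shannon entropy / maximum entropy) for a list of class counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    max_entropy = math.log2(len(counts))
    # With a single class the maximum entropy is 0; return 0.0 by convention here.
    return 1 - entropy / max_entropy if max_entropy > 0 else 0.0

# 인허가 value counts from this report: 신고 = 97, 허가 = 10.
score = imbalance_score([97, 10])
print(f"{100 * score:.1f}%")
```

A perfectly balanced variable scores 0 under this definition, and the score approaches 1 as a single class dominates.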

Reproduction

Analysis started: 2024-04-21 08:35:12.291802
Analysis finished: 2024-04-21 08:35:14.242191
Duration: 1.95 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

사업장명
Text

UNIQUE 

Distinct: 107
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 984.0 B

Length

Max length: 20
Median length: 17
Mean length: 9.1962617
Min length: 3

Characters and Unicode

Total characters: 984
Distinct characters: 198
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
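
As a rough illustration of how per-category character counts can be derived, the sketch below uses Python's standard `unicodedata` module on the five sample values of 사업장명 shown in this report. It covers only those five rows, so its counts are a small subset of the report's totals, and it is not necessarily how the report itself was computed.

```python
import unicodedata
from collections import Counter

# The five sample values of 사업장명 shown in this report.
names = [
    "농업회사법인 정읍유기질비료 주식회사",
    "(유)현영",
    "(주)카라",
    "㈜웅지피엔지(P&G)",
    "(주)삼호유황오리",
]

# Count characters by Unicode general category (two-letter codes such as
# "Lo" = Other Letter, "Ps" = Open Punctuation, "Zs" = Space Separator).
category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

total_chars = sum(len(name) for name in names)
print(total_chars, dict(category_counts))
```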

Unique

Unique: 107
Unique (%): 100.0%

Sample

1st row: 농업회사법인 정읍유기질비료 주식회사
2nd row: (유)현영
3rd row: (주)카라
4th row: ㈜웅지피엔지(P&G)
5th row: (주)삼호유황오리

Value | Count | Frequency (%)
농업회사법인 | 5 | 3.8%
주식회사 | 5 | 3.8%
유한회사 | 3 | 2.3%
한국가스공사 | 1 | 0.8%
전북지역본부 | 1 | 0.8%
정읍시청 | 1 | 0.8%
정읍시농협rpc공동사업장 | 1 | 0.8%
친환경양돈영농조합법인 | 1 | 0.8%
샘골농업협동조합 | 1 | 0.8%
조아라산업 | 1 | 0.8%
Other values (112) | 112 | 84.8%

Most occurring characters

ValueCountFrequency (%)
( 60
 
6.1%
) 60
 
6.1%
49
 
5.0%
40
 
4.1%
39
 
4.0%
27
 
2.7%
26
 
2.6%
25
 
2.5%
25
 
2.5%
24
 
2.4%
Other values (188) 609
61.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 819 | 83.2%
Open Punctuation | 60 | 6.1%
Close Punctuation | 60 | 6.1%
Space Separator | 25 | 2.5%
Uppercase Letter | 12 | 1.2%
Decimal Number | 4 | 0.4%
Other Symbol | 2 | 0.2%
Other Punctuation | 2 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
49
 
6.0%
40
 
4.9%
39
 
4.8%
27
 
3.3%
26
 
3.2%
25
 
3.1%
24
 
2.9%
23
 
2.8%
22
 
2.7%
18
 
2.2%
Other values (173) 526
64.2%
Uppercase Letter
ValueCountFrequency (%)
P 3
25.0%
R 2
16.7%
C 2
16.7%
K 1
 
8.3%
O 1
 
8.3%
G 1
 
8.3%
S 1
 
8.3%
F 1
 
8.3%
Decimal Number
ValueCountFrequency (%)
1 3
75.0%
2 1
 
25.0%
Open Punctuation
ValueCountFrequency (%)
( 60
100.0%
Close Punctuation
ValueCountFrequency (%)
) 60
100.0%
Space Separator
ValueCountFrequency (%)
25
100.0%
Other Symbol
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 821 | 83.4%
Common | 151 | 15.3%
Latin | 12 | 1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
49
 
6.0%
40
 
4.9%
39
 
4.8%
27
 
3.3%
26
 
3.2%
25
 
3.0%
24
 
2.9%
23
 
2.8%
22
 
2.7%
18
 
2.2%
Other values (174) 528
64.3%
Latin
ValueCountFrequency (%)
P 3
25.0%
R 2
16.7%
C 2
16.7%
K 1
 
8.3%
O 1
 
8.3%
G 1
 
8.3%
S 1
 
8.3%
F 1
 
8.3%
Common
ValueCountFrequency (%)
( 60
39.7%
) 60
39.7%
25
16.6%
1 3
 
2.0%
& 2
 
1.3%
2 1
 
0.7%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 819 | 83.2%
ASCII | 163 | 16.6%
None | 2 | 0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 60
36.8%
) 60
36.8%
25
15.3%
1 3
 
1.8%
P 3
 
1.8%
R 2
 
1.2%
C 2
 
1.2%
& 2
 
1.2%
K 1
 
0.6%
O 1
 
0.6%
Other values (4) 4
 
2.5%
Hangul
ValueCountFrequency (%)
49
 
6.0%
40
 
4.9%
39
 
4.8%
27
 
3.3%
26
 
3.2%
25
 
3.1%
24
 
2.9%
23
 
2.8%
22
 
2.7%
18
 
2.2%
Other values (173) 526
64.2%
None
ValueCountFrequency (%)
2
100.0%

사업장전화번호
Text

MISSING 

Distinct: 95
Distinct (%): 97.9%
Missing: 10
Missing (%): 9.3%
Memory size: 984.0 B

Length

Max length: 13
Median length: 12
Mean length: 12.010309
Min length: 12

Characters and Unicode

Total characters: 1165
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
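
The length and character figures for 사업장전화번호 are what a hyphen-separated numeric format would produce. The sketch below checks a few values listed in this report against a hypothetical pattern and recomputes the mean length from the totals. `PHONE_PATTERN` is an assumption made for illustration, not a rule stated by the data provider.

```python
import re

# Hypothetical pattern: area code, exchange, and line number joined by hyphens.
PHONE_PATTERN = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")

# A few values listed in this report.
phones = ["063-531-3665", "063-717-7711", "031-8085-3013"]

for phone in phones:
    ok = PHONE_PATTERN.fullmatch(phone) is not None
    print(phone, len(phone), ok)

# Mean length = total characters / non-missing values (107 rows - 10 missing).
mean_length = 1165 / (107 - 10)
print(round(mean_length, 6))
```

The single 13-character value (031-8085-3013) accounts for the maximum length of 13, while every other listed value is 12 characters long.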

Unique

Unique: 93
Unique (%): 95.9%

Sample

1st row: 063-531-3665
2nd row: 063-571-2215
3rd row: 063-536-3434
4th row: 063-536-1560
5th row: 063-532-8080

Value | Count | Frequency (%)
063-717-7711 | 2 | 2.1%
063-536-5541 | 2 | 2.1%
063-571-0591 | 1 | 1.0%
063-539-5335 | 1 | 1.0%
063-537-4776 | 1 | 1.0%
063-537-9079 | 1 | 1.0%
063-537-3900 | 1 | 1.0%
031-8085-3013 | 1 | 1.0%
063-535-1001 | 1 | 1.0%
063-532-8011 | 1 | 1.0%
Other values (85) | 85 | 87.6%

Most occurring characters

Value | Count | Frequency (%)
3 | 224 | 19.2%
- | 194 | 16.7%
0 | 161 | 13.8%
5 | 160 | 13.7%
6 | 147 | 12.6%
1 | 84 | 7.2%
7 | 68 | 5.8%
9 | 35 | 3.0%
8 | 31 | 2.7%
2 | 31 | 2.7%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 971 | 83.3%
Dash Punctuation | 194 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 224 | 23.1%
0 | 161 | 16.6%
5 | 160 | 16.5%
6 | 147 | 15.1%
1 | 84 | 8.7%
7 | 68 | 7.0%
9 | 35 | 3.6%
8 | 31 | 3.2%
2 | 31 | 3.2%
4 | 30 | 3.1%
Dash Punctuation
Value | Count | Frequency (%)
- | 194 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1165 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 224 | 19.2%
- | 194 | 16.7%
0 | 161 | 13.8%
5 | 160 | 13.7%
6 | 147 | 12.6%
1 | 84 | 7.2%
7 | 68 | 5.8%
9 | 35 | 3.0%
8 | 31 | 2.7%
2 | 31 | 2.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1165 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 224 | 19.2%
- | 194 | 16.7%
0 | 161 | 13.8%
5 | 160 | 13.7%
6 | 147 | 12.6%
1 | 84 | 7.2%
7 | 68 | 5.8%
9 | 35 | 3.0%
8 | 31 | 2.7%
2 | 31 | 2.7%

도로명주소
Text

Distinct: 105
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 984.0 B

Length

Max length: 39
Median length: 30
Mean length: 20.140187
Min length: 10

Characters and Unicode

Total characters: 2155
Distinct characters: 105
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 103
Unique (%): 96.3%

Sample

1st row: 가곡길 67-67(영파동)
2nd row: 감곡면 흥방길 45
3rd row: 고부면 고부농단길 11-5
4th row: 고부면 고부농단길 11-6
5th row: 고부면 고부농단길 12-30

Value | Count | Frequency (%)
정읍시 | 77 | 15.8%
전라북도 | 76 | 15.6%
북면 | 15 | 3.1%
신태인읍 | 14 | 2.9%
고부면 | 13 | 2.7%
태인면 | 9 | 1.9%
석지로 | 8 | 1.6%
고부농단길 | 7 | 1.4%
정신로 | 6 | 1.2%
정우면 | 6 | 1.2%
Other values (180) | 255 | 52.5%

Most occurring characters

ValueCountFrequency (%)
381
 
17.7%
106
 
4.9%
102
 
4.7%
99
 
4.6%
1 90
 
4.2%
80
 
3.7%
77
 
3.6%
77
 
3.6%
76
 
3.5%
76
 
3.5%
Other values (95) 991
46.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1266 | 58.7%
Decimal Number | 401 | 18.6%
Space Separator | 381 | 17.7%
Dash Punctuation | 50 | 2.3%
Open Punctuation | 26 | 1.2%
Close Punctuation | 26 | 1.2%
Other Punctuation | 5 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
 
8.4%
102
 
8.1%
99
 
7.8%
80
 
6.3%
77
 
6.1%
77
 
6.1%
76
 
6.0%
76
 
6.0%
56
 
4.4%
46
 
3.6%
Other values (80) 471
37.2%
Decimal Number
ValueCountFrequency (%)
1 90
22.4%
2 51
12.7%
4 44
11.0%
5 37
9.2%
6 36
 
9.0%
7 35
 
8.7%
3 34
 
8.5%
0 30
 
7.5%
8 22
 
5.5%
9 22
 
5.5%
Space Separator
ValueCountFrequency (%)
381
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 50
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Other Punctuation
ValueCountFrequency (%)
, 5
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1266 | 58.7%
Common | 889 | 41.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
 
8.4%
102
 
8.1%
99
 
7.8%
80
 
6.3%
77
 
6.1%
77
 
6.1%
76
 
6.0%
76
 
6.0%
56
 
4.4%
46
 
3.6%
Other values (80) 471
37.2%
Common
ValueCountFrequency (%)
381
42.9%
1 90
 
10.1%
2 51
 
5.7%
- 50
 
5.6%
4 44
 
4.9%
5 37
 
4.2%
6 36
 
4.0%
7 35
 
3.9%
3 34
 
3.8%
0 30
 
3.4%
Other values (5) 101
 
11.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1266 | 58.7%
ASCII | 889 | 41.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
381
42.9%
1 90
 
10.1%
2 51
 
5.7%
- 50
 
5.6%
4 44
 
4.9%
5 37
 
4.2%
6 36
 
4.0%
7 35
 
3.9%
3 34
 
3.8%
0 30
 
3.4%
Other values (5) 101
 
11.4%
Hangul
ValueCountFrequency (%)
106
 
8.4%
102
 
8.1%
99
 
7.8%
80
 
6.3%
77
 
6.1%
77
 
6.1%
76
 
6.0%
76
 
6.0%
56
 
4.4%
46
 
3.6%
Other values (80) 471
37.2%

업종
Text

Distinct: 83
Distinct (%): 77.6%
Missing: 0
Missing (%): 0.0%
Memory size: 984.0 B

Length

Max length: 30
Median length: 18
Mean length: 11.570093
Min length: 3

Characters and Unicode

Total characters: 1238
Distinct characters: 141
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 67
Unique (%): 62.6%

Sample

1st row: 기타 비료 및 질소화합물 제조업
2nd row: 건설폐기물중간처리업
3rd row: 자동차 부품 도장 및 피막처리업
4th row: 고무제품 및 플라스틱제품 제조업
5th row: 가금류 가공 및 저장 처리업
ValueCountFrequency (%)
제조업 37
 
12.8%
35
 
12.1%
기타 13
 
4.5%
자동차 7
 
2.4%
수리업 6
 
2.1%
플라스틱제품 5
 
1.7%
처리업 5
 
1.7%
사료 5
 
1.7%
가공 5
 
1.7%
질소화합물 5
 
1.7%
Other values (122) 167
57.6%

Most occurring characters

ValueCountFrequency (%)
185
 
14.9%
100
 
8.1%
74
 
6.0%
66
 
5.3%
47
 
3.8%
38
 
3.1%
33
 
2.7%
32
 
2.6%
27
 
2.2%
25
 
2.0%
Other values (131) 611
49.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1042 | 84.2%
Space Separator | 185 | 14.9%
Other Punctuation | 5 | 0.4%
Close Punctuation | 3 | 0.2%
Open Punctuation | 3 | 0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
100
 
9.6%
74
 
7.1%
66
 
6.3%
47
 
4.5%
38
 
3.6%
33
 
3.2%
32
 
3.1%
27
 
2.6%
25
 
2.4%
22
 
2.1%
Other values (126) 578
55.5%
Other Punctuation
ValueCountFrequency (%)
, 4
80.0%
/ 1
 
20.0%
Space Separator
ValueCountFrequency (%)
185
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1042 | 84.2%
Common | 196 | 15.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
100
 
9.6%
74
 
7.1%
66
 
6.3%
47
 
4.5%
38
 
3.6%
33
 
3.2%
32
 
3.1%
27
 
2.6%
25
 
2.4%
22
 
2.1%
Other values (126) 578
55.5%
Common
ValueCountFrequency (%)
185
94.4%
, 4
 
2.0%
) 3
 
1.5%
( 3
 
1.5%
/ 1
 
0.5%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1042 | 84.2%
ASCII | 196 | 15.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
185
94.4%
, 4
 
2.0%
) 3
 
1.5%
( 3
 
1.5%
/ 1
 
0.5%
Hangul
ValueCountFrequency (%)
100
 
9.6%
74
 
7.1%
66
 
6.3%
47
 
4.5%
38
 
3.6%
33
 
3.2%
32
 
3.1%
27
 
2.6%
25
 
2.4%
22
 
2.1%
Other values (126) 578
55.5%

대기종별
Categorical

Distinct: 4
Distinct (%): 3.7%
Missing: 0
Missing (%): 0.0%
Memory size: 984.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 5종
2nd row: 4종
3rd row: 3종
4th row: 3종
5th row: 4종

Common Values

Value | Count | Frequency (%)
5종 | 57 | 53.3%
4종 | 37 | 34.6%
2종 | 7 | 6.5%
3종 | 6 | 5.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
5종 | 57 | 53.3%
4종 | 37 | 34.6%
2종 | 7 | 6.5%
3종 | 6 | 5.6%

인허가
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 1.9%
Missing: 0
Missing (%): 0.0%
Memory size: 984.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신고
2nd row: 신고
3rd row: 신고
4th row: 신고
5th row: 신고

Common Values

Value | Count | Frequency (%)
신고 | 97 | 90.7%
허가 | 10 | 9.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
신고 | 97 | 90.7%
허가 | 10 | 9.3%

신고허가일자
Date

Distinct: 103
Distinct (%): 96.3%
Missing: 0
Missing (%): 0.0%
Memory size: 984.0 B
Minimum: 1989-07-22 00:00:00
Maximum: 2020-12-14 00:00:00
Histogram with fixed size bins (bins=50)

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 984.0 B
Minimum: 2021-03-17 00:00:00
Maximum: 2021-03-17 00:00:00
Histogram with fixed size bins (bins=1)

Correlations

Variable | 사업장전화번호 | 업종 | 대기종별 | 인허가
사업장전화번호 | 1.000 | 0.992 | 0.000 | 1.000
업종 | 0.992 | 1.000 | 0.966 | 0.954
대기종별 | 0.000 | 0.966 | 1.000 | 0.268
인허가 | 1.000 | 0.954 | 0.268 | 1.000

Variable | 대기종별 | 인허가
대기종별 | 1.000 | 0.176
인허가 | 0.176 | 1.000

Variable | 대기종별 | 인허가
대기종별 | 1.000 | 0.176
인허가 | 0.176 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
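
To make the two nullity views concrete, the sketch below computes per-column non-null counts and a 0/1 nullity grid for the last four sample rows of this report, reduced to two columns. `None` stands in for the `<NA>` marker, and `non_null_counts` is an illustrative helper, not part of the profiling tool.

```python
# Last four sample rows of this report, reduced to two columns.
# None stands in for the <NA> marker.
rows = [
    {"사업장명": "(유)영광레미콘", "사업장전화번호": None},
    {"사업장명": "(주)태인", "사업장전화번호": "063-537-0711"},
    {"사업장명": "효성케이디", "사업장전화번호": None},
    {"사업장명": "(주)디에스피드", "사업장전화번호": "063-537-0996"},
]

def non_null_counts(records):
    """Count the non-missing values in each column of a list of dicts."""
    counts = {}
    for record in records:
        for column, value in record.items():
            counts[column] = counts.get(column, 0) + (value is not None)
    return counts

# One 0/1 flag per cell: the row-by-column grid a nullity matrix draws.
nullity_matrix = [[int(v is not None) for v in r.values()] for r in rows]

print(non_null_counts(rows))
print(nullity_matrix)
```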

Sample

First rows

Row | 사업장명 | 사업장전화번호 | 도로명주소 | 업종 | 대기종별 | 인허가 | 신고허가일자 | 데이터기준일자
0 | 농업회사법인 정읍유기질비료 주식회사 | 063-531-3665 | 가곡길 67-67(영파동) | 기타 비료 및 질소화합물 제조업 | 5종 | 신고 | 2016-02-19 | 2021-03-17
1 | (유)현영 | 063-571-2215 | 감곡면 흥방길 45 | 건설폐기물중간처리업 | 4종 | 신고 | 2002-10-10 | 2021-03-17
2 | (주)카라 | 063-536-3434 | 고부면 고부농단길 11-5 | 자동차 부품 도장 및 피막처리업 | 3종 | 신고 | 2000-11-01 | 2021-03-17
3 | ㈜웅지피엔지(P&G) | 063-536-1560 | 고부면 고부농단길 11-6 | 고무제품 및 플라스틱제품 제조업 | 3종 | 신고 | 1999-11-16 | 2021-03-17
4 | (주)삼호유황오리 | 063-532-8080 | 고부면 고부농단길 12-30 | 가금류 가공 및 저장 처리업 | 4종 | 신고 | 2001-03-15 | 2021-03-17
5 | (유)신우S&F | 063-535-3799 | 고부면 고부농단길 30 | 기타비알콜성음료제조업 | 5종 | 신고 | 2015-12-29 | 2021-03-17
6 | (유)일경 | 063-536-1649 | 고부면 고신길 37-11 | 폐기물종헙재활용업 | 3종 | 허가 | 2017-03-22 | 2021-03-17
7 | (주)만수산업 | 063-535-1321 | 고부면 영원로 172-64 | 시멘트, 석회, 플라스터 및 그 제품 제조시설 | 4종 | 신고 | 1990-04-16 | 2021-03-17
8 | (유)큰바위식품 | 063-535-9333 | 고부면 입석6길 17 | 국수, 라면 및 유사식품 제조업 | 5종 | 신고 | 1997-04-02 | 2021-03-17
9 | 대우전자부품(주) | 063-530-8131 | 공단2길 3 (망제동) | 그외 기타 전자부품 제조업 | 5종 | 신고 | 1989-07-22 | 2021-03-17

Last rows

Row | 사업장명 | 사업장전화번호 | 도로명주소 | 업종 | 대기종별 | 인허가 | 신고허가일자 | 데이터기준일자
97 | 대풍년영농조합법인 | 063-538-2135 | 정읍시 북면 3산단4길 114-7 | 기타과실채소 가공 및 저장업 | 5종 | 신고 | 2013-12-10 | 2021-03-17
98 | 정읍정애영농조합법인 | 063-536-0300 | 전라북도 정읍시 이평면 궁동길 241-19 치악농장 | 폐기물최종재활용업 | 5종 | 신고 | 2018-11-01 | 2021-03-17
99 | (주)씨에스캠 입암지점 | 063-531-5557 | 전라북도 정읍시 입암면 정읍남로 391-3 | 일반용도료및관련제품제조업 | 5종 | 허가 | 2003-08-09 | 2021-03-17
100 | 주식회사 금강 | 063-538-2131 | 전라북도 정읍시 입암면 정읍남로 453 | 폐기물처리업(폐유 재생) | 5종 | 신고 | 1994-04-19 | 2021-03-17
101 | 노령산업(주) | 063-534-5902 | 전라북도 정읍시 입암면 정읍남로 453 (102-22) | 아스콘제조업 | 2종 | 신고 | 2007-06-08 | 2021-03-17
102 | 정읍아산병원 | 063-530-6650 | 충정로 606-22 (용계동) | 종합병원 | 5종 | 신고 | 2015-12-24 | 2021-03-17
103 | (유)영광레미콘 | <NA> | 태인면 오봉리 180-4, 180-11 | 레미콘 제조업 | 4종 | 신고 | 2013-11-21 | 2021-03-17
104 | (주)태인 | 063-537-0711 | 전라북도 정읍시 태인면 왕림길 20-1 | 아스콘제조업 | 2종 | 허가 | 2020-10-27 | 2021-03-17
105 | 효성케이디 | <NA> | 전라북도 정읍시 태인면 정읍북로 1404-4 (외 1필지) | 지정폐기물중간처리업 | 5종 | 신고 | 2008-05-28 | 2021-03-17
106 | (주)디에스피드 | 063-537-0996 | 태인면 태인공단2길 29 | 동물용 사료 및 조제식품 제조업 | 4종 | 신고 | 2012-12-24 | 2021-03-17