Overview

Dataset statistics

Number of variables5
Number of observations548
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory22.1 KiB
Average record size in memory41.2 B

Variable types

Text4
Categorical1

Dataset

Description대기환경보전법 제23조에 따른 대기배출시설 설치신고 대상 사업장에 대한 데이터로 대기배출사업장의 상호명, 주소지, 업종, 연락처 및 대기 종수 등에 대한 내용이 포함되어있습니다.
Author충청북도 진천군
URLhttps://www.data.go.kr/data/15084001/fileData.do

Reproduction

Analysis started2024-03-14 17:24:43.353948
Analysis finished2024-03-14 17:24:44.806870
Duration1.45 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct545
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2024-03-15T02:24:45.347763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length19
Mean length6.899635
Min length2

Characters and Unicode

Total characters3781
Distinct characters352
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique542 ?
Unique (%)98.9%

Sample

1st row농업회사법인(주)주원산오리
2nd row㈜팔도
3rd row동국제약(주) 회죽공장
4th row㈜신일
5th row알리코제약(주)
ValueCountFrequency (%)
진천공장 8
 
1.3%
제2공장 6
 
1.0%
2공장 5
 
0.8%
농업회사법인 4
 
0.7%
진천지점 4
 
0.7%
㈜대명피에스 3
 
0.5%
㈜현대에버다임 2
 
0.3%
㈜디와이푸드 2
 
0.3%
제1공장 2
 
0.3%
㈜에코비트프리텍 2
 
0.3%
Other values (552) 565
93.7%
2024-03-15T02:24:46.466733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
319
 
8.4%
) 136
 
3.6%
( 136
 
3.6%
129
 
3.4%
117
 
3.1%
99
 
2.6%
84
 
2.2%
80
 
2.1%
79
 
2.1%
66
 
1.7%
Other values (342) 2536
67.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3082
81.5%
Other Symbol 319
 
8.4%
Close Punctuation 136
 
3.6%
Open Punctuation 136
 
3.6%
Space Separator 56
 
1.5%
Decimal Number 32
 
0.8%
Uppercase Letter 19
 
0.5%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
129
 
4.2%
117
 
3.8%
99
 
3.2%
84
 
2.7%
80
 
2.6%
79
 
2.6%
66
 
2.1%
59
 
1.9%
56
 
1.8%
53
 
1.7%
Other values (322) 2260
73.3%
Uppercase Letter
ValueCountFrequency (%)
T 5
26.3%
S 2
 
10.5%
G 2
 
10.5%
E 2
 
10.5%
O 1
 
5.3%
Y 1
 
5.3%
K 1
 
5.3%
N 1
 
5.3%
A 1
 
5.3%
M 1
 
5.3%
Other values (2) 2
 
10.5%
Decimal Number
ValueCountFrequency (%)
2 22
68.8%
1 6
 
18.8%
3 4
 
12.5%
Other Symbol
ValueCountFrequency (%)
319
100.0%
Close Punctuation
ValueCountFrequency (%)
) 136
100.0%
Open Punctuation
ValueCountFrequency (%)
( 136
100.0%
Space Separator
ValueCountFrequency (%)
56
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3401
89.9%
Common 361
 
9.5%
Latin 19
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
319
 
9.4%
129
 
3.8%
117
 
3.4%
99
 
2.9%
84
 
2.5%
80
 
2.4%
79
 
2.3%
66
 
1.9%
59
 
1.7%
56
 
1.6%
Other values (323) 2313
68.0%
Latin
ValueCountFrequency (%)
T 5
26.3%
S 2
 
10.5%
G 2
 
10.5%
E 2
 
10.5%
O 1
 
5.3%
Y 1
 
5.3%
K 1
 
5.3%
N 1
 
5.3%
A 1
 
5.3%
M 1
 
5.3%
Other values (2) 2
 
10.5%
Common
ValueCountFrequency (%)
) 136
37.7%
( 136
37.7%
56
15.5%
2 22
 
6.1%
1 6
 
1.7%
3 4
 
1.1%
- 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3082
81.5%
ASCII 380
 
10.1%
None 319
 
8.4%

Most frequent character per block

None
ValueCountFrequency (%)
319
100.0%
ASCII
ValueCountFrequency (%)
) 136
35.8%
( 136
35.8%
56
14.7%
2 22
 
5.8%
1 6
 
1.6%
T 5
 
1.3%
3 4
 
1.1%
S 2
 
0.5%
G 2
 
0.5%
E 2
 
0.5%
Other values (9) 9
 
2.4%
Hangul
ValueCountFrequency (%)
129
 
4.2%
117
 
3.8%
99
 
3.2%
84
 
2.7%
80
 
2.6%
79
 
2.6%
66
 
2.1%
59
 
1.9%
56
 
1.8%
53
 
1.7%
Other values (322) 2260
73.3%
Distinct507
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2024-03-15T02:24:47.464776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length17
Mean length12.313869
Min length9

Characters and Unicode

Total characters6748
Distinct characters123
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique470 ?
Unique (%)85.8%

Sample

1st row광혜원면 진광로 941-5
2nd row광혜원면 광혜원산단길 88
3rd row광혜원면 진광로 1103
4th row광혜원면 용소2길 46
5th row광혜원면 용소2길 21
ValueCountFrequency (%)
이월면 178
 
10.8%
덕산읍 109
 
6.6%
진천읍 82
 
5.0%
문백면 79
 
4.8%
광혜원면 51
 
3.1%
초평면 42
 
2.6%
진광로 42
 
2.6%
문진로 24
 
1.5%
이덕로 20
 
1.2%
덕금로 18
 
1.1%
Other values (569) 1002
60.8%
2024-03-15T02:24:49.128973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1105
 
16.4%
1 401
 
5.9%
358
 
5.3%
2 298
 
4.4%
283
 
4.2%
261
 
3.9%
3 246
 
3.6%
- 219
 
3.2%
206
 
3.1%
198
 
2.9%
Other values (113) 3173
47.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3451
51.1%
Decimal Number 1971
29.2%
Space Separator 1105
 
16.4%
Dash Punctuation 219
 
3.2%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
358
 
10.4%
283
 
8.2%
261
 
7.6%
206
 
6.0%
198
 
5.7%
191
 
5.5%
167
 
4.8%
152
 
4.4%
139
 
4.0%
120
 
3.5%
Other values (100) 1376
39.9%
Decimal Number
ValueCountFrequency (%)
1 401
20.3%
2 298
15.1%
3 246
12.5%
7 177
9.0%
4 166
8.4%
6 161
8.2%
8 144
 
7.3%
5 137
 
7.0%
9 128
 
6.5%
0 113
 
5.7%
Space Separator
ValueCountFrequency (%)
1105
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 219
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3451
51.1%
Common 3297
48.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
358
 
10.4%
283
 
8.2%
261
 
7.6%
206
 
6.0%
198
 
5.7%
191
 
5.5%
167
 
4.8%
152
 
4.4%
139
 
4.0%
120
 
3.5%
Other values (100) 1376
39.9%
Common
ValueCountFrequency (%)
1105
33.5%
1 401
 
12.2%
2 298
 
9.0%
3 246
 
7.5%
- 219
 
6.6%
7 177
 
5.4%
4 166
 
5.0%
6 161
 
4.9%
8 144
 
4.4%
5 137
 
4.2%
Other values (3) 243
 
7.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3451
51.1%
ASCII 3297
48.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1105
33.5%
1 401
 
12.2%
2 298
 
9.0%
3 246
 
7.5%
- 219
 
6.6%
7 177
 
5.4%
4 166
 
5.0%
6 161
 
4.9%
8 144
 
4.4%
5 137
 
4.2%
Other values (3) 243
 
7.4%
Hangul
ValueCountFrequency (%)
358
 
10.4%
283
 
8.2%
261
 
7.6%
206
 
6.0%
198
 
5.7%
191
 
5.5%
167
 
4.8%
152
 
4.4%
139
 
4.0%
120
 
3.5%
Other values (100) 1376
39.9%
Distinct369
Distinct (%)67.3%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2024-03-15T02:24:49.929598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length176
Median length89
Mean length25.760949
Min length2

Characters and Unicode

Total characters14117
Distinct characters280
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique299 ?
Unique (%)54.6%

Sample

1st row가금류가공및저장처리업(10121)
2nd row액상시유 및 기타 낙농제품 제조업(10501)기타비알콜음료제조업(11209)
3rd row완전의약품제조업(21210)
4th row금속단조제품제조업(25912)
5th row완전의약품제조업(21210)건강기능식품제조업(10797)그외기타식료품제조업(10799)
ValueCountFrequency (%)
52
 
5.7%
기타 25
 
2.7%
플라스틱 13
 
1.4%
12
 
1.3%
기타비료및질소화합물제조업(20209 12
 
1.3%
가공및재생플라스틱원료생산업(20303 11
 
1.2%
자동차종합수리업(95211 9
 
1.0%
도장및기타피막처리업(25923 8
 
0.9%
곡물도정업(10611 8
 
0.9%
완전의약품제조업(21210 8
 
0.9%
Other values (518) 759
82.8%
2024-03-15T02:24:51.234928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 1121
 
7.9%
768
 
5.4%
745
 
5.3%
729
 
5.2%
) 720
 
5.1%
( 720
 
5.1%
1 718
 
5.1%
0 473
 
3.4%
3 459
 
3.3%
9 455
 
3.2%
Other values (270) 7209
51.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8396
59.5%
Decimal Number 3768
26.7%
Close Punctuation 726
 
5.1%
Open Punctuation 726
 
5.1%
Space Separator 378
 
2.7%
Other Punctuation 123
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
768
 
9.1%
745
 
8.9%
729
 
8.7%
401
 
4.8%
322
 
3.8%
241
 
2.9%
227
 
2.7%
162
 
1.9%
146
 
1.7%
135
 
1.6%
Other values (252) 4520
53.8%
Decimal Number
ValueCountFrequency (%)
2 1121
29.8%
1 718
19.1%
0 473
12.6%
3 459
12.2%
9 455
12.1%
4 153
 
4.1%
5 142
 
3.8%
7 95
 
2.5%
8 79
 
2.1%
6 73
 
1.9%
Other Punctuation
ValueCountFrequency (%)
, 121
98.4%
· 1
 
0.8%
. 1
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 720
99.2%
] 6
 
0.8%
Open Punctuation
ValueCountFrequency (%)
( 720
99.2%
[ 6
 
0.8%
Space Separator
ValueCountFrequency (%)
378
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8396
59.5%
Common 5721
40.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
768
 
9.1%
745
 
8.9%
729
 
8.7%
401
 
4.8%
322
 
3.8%
241
 
2.9%
227
 
2.7%
162
 
1.9%
146
 
1.7%
135
 
1.6%
Other values (252) 4520
53.8%
Common
ValueCountFrequency (%)
2 1121
19.6%
) 720
12.6%
( 720
12.6%
1 718
12.6%
0 473
8.3%
3 459
8.0%
9 455
8.0%
378
 
6.6%
4 153
 
2.7%
5 142
 
2.5%
Other values (8) 382
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8396
59.5%
ASCII 5720
40.5%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 1121
19.6%
) 720
12.6%
( 720
12.6%
1 718
12.6%
0 473
8.3%
3 459
8.0%
9 455
8.0%
378
 
6.6%
4 153
 
2.7%
5 142
 
2.5%
Other values (7) 381
 
6.7%
Hangul
ValueCountFrequency (%)
768
 
9.1%
745
 
8.9%
729
 
8.7%
401
 
4.8%
322
 
3.8%
241
 
2.9%
227
 
2.7%
162
 
1.9%
146
 
1.7%
135
 
1.6%
Other values (252) 4520
53.8%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct488
Distinct (%)89.1%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
2024-03-15T02:24:52.254897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.658759
Min length6

Characters and Unicode

Total characters6389
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique458 ?
Unique (%)83.6%

Sample

1st row043-530-6276
2nd row043-535-0300
3rd row043-530-0200
4th row043-535-2851
5th row043-535-8877
ValueCountFrequency (%)
연락처미공개 32
 
5.8%
031-719-6883 2
 
0.4%
043-533-8401 2
 
0.4%
043-530-3300 2
 
0.4%
043-536-6670 2
 
0.4%
043-537-0105 2
 
0.4%
043-532-5751 2
 
0.4%
043-536-5447 2
 
0.4%
043-536-6700 2
 
0.4%
043-536-0028 2
 
0.4%
Other values (478) 498
90.9%
2024-03-15T02:24:53.728813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 1243
19.5%
- 1032
16.2%
0 898
14.1%
4 710
11.1%
5 703
11.0%
7 325
 
5.1%
1 314
 
4.9%
8 284
 
4.4%
2 279
 
4.4%
6 249
 
3.9%
Other values (7) 352
 
5.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5165
80.8%
Dash Punctuation 1032
 
16.2%
Other Letter 192
 
3.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1243
24.1%
0 898
17.4%
4 710
13.7%
5 703
13.6%
7 325
 
6.3%
1 314
 
6.1%
8 284
 
5.5%
2 279
 
5.4%
6 249
 
4.8%
9 160
 
3.1%
Other Letter
ValueCountFrequency (%)
32
16.7%
32
16.7%
32
16.7%
32
16.7%
32
16.7%
32
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 1032
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6197
97.0%
Hangul 192
 
3.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 1243
20.1%
- 1032
16.7%
0 898
14.5%
4 710
11.5%
5 703
11.3%
7 325
 
5.2%
1 314
 
5.1%
8 284
 
4.6%
2 279
 
4.5%
6 249
 
4.0%
Hangul
ValueCountFrequency (%)
32
16.7%
32
16.7%
32
16.7%
32
16.7%
32
16.7%
32
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6197
97.0%
Hangul 192
 
3.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 1243
20.1%
- 1032
16.7%
0 898
14.5%
4 710
11.5%
5 703
11.3%
7 325
 
5.2%
1 314
 
5.1%
8 284
 
4.6%
2 279
 
4.5%
6 249
 
4.0%
Hangul
ValueCountFrequency (%)
32
16.7%
32
16.7%
32
16.7%
32
16.7%
32
16.7%
32
16.7%

종별(대기)
Categorical

Distinct5
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size4.4 KiB
5
330 
4
183 
3
 
19
2
 
15
1
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row4
2nd row4
3rd row2
4th row4
5th row3

Common Values

ValueCountFrequency (%)
5 330
60.2%
4 183
33.4%
3 19
 
3.5%
2 15
 
2.7%
1 1
 
0.2%

Length

2024-03-15T02:24:54.161464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T02:24:54.502645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 330
60.2%
4 183
33.4%
3 19
 
3.5%
2 15
 
2.7%
1 1
 
0.2%

Missing values

2024-03-15T02:24:44.131917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-15T02:24:44.564761image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업소명도로명업종(업종코드)연락처종별(대기)
0농업회사법인(주)주원산오리광혜원면 진광로 941-5가금류가공및저장처리업(10121)043-530-62764
1㈜팔도광혜원면 광혜원산단길 88액상시유 및 기타 낙농제품 제조업(10501)기타비알콜음료제조업(11209)043-535-03004
2동국제약(주) 회죽공장광혜원면 진광로 1103완전의약품제조업(21210)043-530-02002
3㈜신일광혜원면 용소2길 46금속단조제품제조업(25912)043-535-28514
4알리코제약(주)광혜원면 용소2길 21완전의약품제조업(21210)건강기능식품제조업(10797)그외기타식료품제조업(10799)043-535-88773
5(주)유영제약광혜원면 용소2길 33완전의약품제조업(21210)043-539-88004
6아그라나프루트코리아(주)광혜원면 진광로 1333기타과실·채소가공및저장처리업(10309)043-535-10014
7예림금속(주)광혜원면 진광로 1135-19강관제조업(24132)043-535-38003
8부영산업(주)광혜원면 진광로 1689-61레미콘제조업(23322)043-535-99934
9하니웰퍼포먼스머터리얼스앤테크놀로지스코리아(주)광혜원면 진광로 1689-35기타비철금속제련,정련및합금제조업(24219)전자접속카드제조업(26296)그외기타전자부품제조업(26299)전자기측정,시험및분석기구제조업(27212)물질검사,측정및분석기구제조업(27213)043-530-29005
업소명도로명업종(업종코드)연락처종별(대기)
538㈜편한초평면 초평로 726금속원료재생업(38301)043-533-18924
539극동전선(주)초평면 용정길 29-23절연금속선 및 케이블제조업외4종(28302,22191,22192,26410,26429)043-530-20212
540㈜아라미스초평면 초금로 160알루미늄제련,정련및합금제조업(24212)043-753-81915
541SY에너지㈜초평면 초평로 473표면가공목재 및 특정 목적용 제재목 제조업(16102)043-534-89041
542㈜리엔초평면 은진로 183그외기타분류안된 화학제품 제조업(20499)비금속광물 분쇄 물 생산업(23993)043-211-10085
543㈜케이사초평면 구정로 107플라스틱 필름 제조업(22221)043-836-55005
544㈜신영이앤피초평면 초평로 473표면가공목재 및 특정 목적용 제재목 제조업(16102)043-241-22115
545㈜엔씨프라코초평면 구정로 113포장용 플라스틱 성형용기 제조업 외 1종(22232, 31113)043-215-20455
546TEL디자인 연구소초평면 초평로 481기타목재가구제조업(32029)02-458-20825
547산영산업초평면 대구동길 35-2지정외폐기물처리업(38210)연락처미공개5