Overview

Dataset statistics

Number of variables4
Number of observations752
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory24.4 KiB
Average record size in memory33.2 B

Variable types

Text3
Categorical1

Dataset

Description충청북도에 소재한 대기오염물질 배출시설 현황에 대한 정보입니다. (상호, 소재지, 업종, 종별(1-5)) 컬럼 정보를 제공합니다.
URLhttps://www.data.go.kr/data/15121179/fileData.do

Reproduction

Analysis started2023-12-12 12:56:58.913203
Analysis finished2023-12-12 12:56:59.799561
Duration0.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

상호
Text

Distinct738
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size6.0 KiB
2023-12-12T21:56:59.955366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length18
Mean length7.5412234
Min length2

Characters and Unicode

Total characters5671
Distinct characters425
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique724 ?
Unique (%)96.3%

Sample

1st row㈜에이원알폼
2nd row㈜신성아그로
3rd row㈜엔케이
4th row세진종합기술㈜
5th row㈜시티오브테크
ValueCountFrequency (%)
주식회사 43
 
4.6%
2공장 10
 
1.1%
제2공장 8
 
0.9%
충주공장 8
 
0.9%
진천공장 7
 
0.8%
농업회사법인 7
 
0.8%
음성공장 6
 
0.6%
청주공장 5
 
0.5%
㈜심텍 4
 
0.4%
오송공장 4
 
0.4%
Other values (772) 830
89.1%
2023-12-12T21:57:00.367577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
638
 
11.3%
207
 
3.7%
197
 
3.5%
183
 
3.2%
147
 
2.6%
133
 
2.3%
132
 
2.3%
114
 
2.0%
86
 
1.5%
81
 
1.4%
Other values (415) 3753
66.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4649
82.0%
Other Symbol 638
 
11.3%
Space Separator 183
 
3.2%
Decimal Number 62
 
1.1%
Uppercase Letter 48
 
0.8%
Open Punctuation 42
 
0.7%
Close Punctuation 42
 
0.7%
Other Punctuation 4
 
0.1%
Lowercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
207
 
4.5%
197
 
4.2%
147
 
3.2%
133
 
2.9%
132
 
2.8%
114
 
2.5%
86
 
1.8%
81
 
1.7%
77
 
1.7%
74
 
1.6%
Other values (381) 3401
73.2%
Uppercase Letter
ValueCountFrequency (%)
G 5
 
10.4%
S 4
 
8.3%
T 4
 
8.3%
C 4
 
8.3%
E 3
 
6.2%
O 3
 
6.2%
L 3
 
6.2%
M 3
 
6.2%
J 2
 
4.2%
P 2
 
4.2%
Other values (10) 15
31.2%
Decimal Number
ValueCountFrequency (%)
2 42
67.7%
3 9
 
14.5%
1 8
 
12.9%
4 2
 
3.2%
5 1
 
1.6%
Lowercase Letter
ValueCountFrequency (%)
e 1
33.3%
n 1
33.3%
o 1
33.3%
Other Punctuation
ValueCountFrequency (%)
& 3
75.0%
* 1
 
25.0%
Other Symbol
ValueCountFrequency (%)
638
100.0%
Space Separator
ValueCountFrequency (%)
183
100.0%
Open Punctuation
ValueCountFrequency (%)
( 42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5287
93.2%
Common 333
 
5.9%
Latin 51
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
638
 
12.1%
207
 
3.9%
197
 
3.7%
147
 
2.8%
133
 
2.5%
132
 
2.5%
114
 
2.2%
86
 
1.6%
81
 
1.5%
77
 
1.5%
Other values (382) 3475
65.7%
Latin
ValueCountFrequency (%)
G 5
 
9.8%
S 4
 
7.8%
T 4
 
7.8%
C 4
 
7.8%
E 3
 
5.9%
O 3
 
5.9%
L 3
 
5.9%
M 3
 
5.9%
J 2
 
3.9%
P 2
 
3.9%
Other values (13) 18
35.3%
Common
ValueCountFrequency (%)
183
55.0%
2 42
 
12.6%
( 42
 
12.6%
) 42
 
12.6%
3 9
 
2.7%
1 8
 
2.4%
& 3
 
0.9%
4 2
 
0.6%
5 1
 
0.3%
* 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4649
82.0%
None 638
 
11.3%
ASCII 384
 
6.8%

Most frequent character per block

None
ValueCountFrequency (%)
638
100.0%
Hangul
ValueCountFrequency (%)
207
 
4.5%
197
 
4.2%
147
 
3.2%
133
 
2.9%
132
 
2.8%
114
 
2.5%
86
 
1.8%
81
 
1.7%
77
 
1.7%
74
 
1.6%
Other values (381) 3401
73.2%
ASCII
ValueCountFrequency (%)
183
47.7%
2 42
 
10.9%
( 42
 
10.9%
) 42
 
10.9%
3 9
 
2.3%
1 8
 
2.1%
G 5
 
1.3%
S 4
 
1.0%
T 4
 
1.0%
C 4
 
1.0%
Other values (23) 41
 
10.7%
Distinct696
Distinct (%)92.6%
Missing0
Missing (%)0.0%
Memory size6.0 KiB
2023-12-12T21:57:00.680582image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length32
Mean length24.349734
Min length6

Characters and Unicode

Total characters18311
Distinct characters164
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique648 ?
Unique (%)86.2%

Sample

1st row충청북도 괴산군 괴산읍 대제산단3길 46
2nd row충청북도 괴산군 괴산읍 대제산단1길 39-29
3rd row충청북도 괴산군 괴산읍 대제산단1길 39-9
4th row충청북도 괴산군 괴산읍 대제산단1길 39-17
5th row충청북도 괴산군 괴산읍 대제산단1길 39-13
ValueCountFrequency (%)
충청북도 748
 
18.6%
청주시 306
 
7.6%
흥덕구 212
 
5.3%
음성군 184
 
4.6%
충주시 108
 
2.7%
진천군 97
 
2.4%
오창읍 90
 
2.2%
청원구 90
 
2.2%
덕산읍 76
 
1.9%
금왕읍 65
 
1.6%
Other values (589) 2045
50.9%
2023-12-12T21:57:01.134791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3281
 
17.9%
1168
 
6.4%
883
 
4.8%
777
 
4.2%
751
 
4.1%
682
 
3.7%
1 589
 
3.2%
585
 
3.2%
468
 
2.6%
416
 
2.3%
Other values (154) 8711
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12070
65.9%
Space Separator 3281
 
17.9%
Decimal Number 2588
 
14.1%
Dash Punctuation 132
 
0.7%
Open Punctuation 115
 
0.6%
Close Punctuation 115
 
0.6%
Uppercase Letter 6
 
< 0.1%
Other Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1168
 
9.7%
883
 
7.3%
777
 
6.4%
751
 
6.2%
682
 
5.7%
585
 
4.8%
468
 
3.9%
416
 
3.4%
389
 
3.2%
347
 
2.9%
Other values (133) 5604
46.4%
Decimal Number
ValueCountFrequency (%)
1 589
22.8%
2 393
15.2%
3 296
11.4%
4 233
 
9.0%
5 217
 
8.4%
6 200
 
7.7%
7 188
 
7.3%
0 171
 
6.6%
8 164
 
6.3%
9 137
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
B 1
16.7%
G 1
16.7%
E 1
16.7%
C 1
16.7%
V 1
16.7%
I 1
16.7%
Space Separator
ValueCountFrequency (%)
3281
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 132
100.0%
Open Punctuation
ValueCountFrequency (%)
( 115
100.0%
Close Punctuation
ValueCountFrequency (%)
) 115
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12070
65.9%
Common 6235
34.1%
Latin 6
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1168
 
9.7%
883
 
7.3%
777
 
6.4%
751
 
6.2%
682
 
5.7%
585
 
4.8%
468
 
3.9%
416
 
3.4%
389
 
3.2%
347
 
2.9%
Other values (133) 5604
46.4%
Common
ValueCountFrequency (%)
3281
52.6%
1 589
 
9.4%
2 393
 
6.3%
3 296
 
4.7%
4 233
 
3.7%
5 217
 
3.5%
6 200
 
3.2%
7 188
 
3.0%
0 171
 
2.7%
8 164
 
2.6%
Other values (5) 503
 
8.1%
Latin
ValueCountFrequency (%)
B 1
16.7%
G 1
16.7%
E 1
16.7%
C 1
16.7%
V 1
16.7%
I 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12070
65.9%
ASCII 6241
34.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3281
52.6%
1 589
 
9.4%
2 393
 
6.3%
3 296
 
4.7%
4 233
 
3.7%
5 217
 
3.5%
6 200
 
3.2%
7 188
 
3.0%
0 171
 
2.7%
8 164
 
2.6%
Other values (11) 509
 
8.2%
Hangul
ValueCountFrequency (%)
1168
 
9.7%
883
 
7.3%
777
 
6.4%
751
 
6.2%
682
 
5.7%
585
 
4.8%
468
 
3.9%
416
 
3.4%
389
 
3.2%
347
 
2.9%
Other values (133) 5604
46.4%

업종
Text

Distinct440
Distinct (%)58.5%
Missing0
Missing (%)0.0%
Memory size6.0 KiB
2023-12-12T21:57:01.442281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length99
Median length64
Mean length24.646277
Min length3

Characters and Unicode

Total characters18534
Distinct characters269
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique334 ?
Unique (%)44.4%

Sample

1st row금속 조립구조재 제조업(25113)
2nd row복합비료 및 기타 화학비료 제조업(20312)
3rd row표면가공목재 및 특정 목적용 제재목 제조업(16102)
4th row그외기타분류안된비금속광물제품제조업(23999) 외1(23991)
5th row그외기타유리제품제조업(23199)외1(29299)
ValueCountFrequency (%)
292
 
9.1%
기타 232
 
7.2%
135
 
4.2%
92
 
2.9%
제조업 85
 
2.7%
플라스틱 58
 
1.8%
화학제품 55
 
1.7%
안된 37
 
1.2%
제조업(20499 37
 
1.2%
분류 34
 
1.1%
Other values (831) 2144
67.0%
2023-12-12T21:57:01.927991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2460
 
13.3%
2 1446
 
7.8%
1043
 
5.6%
1 934
 
5.0%
) 835
 
4.5%
( 835
 
4.5%
818
 
4.4%
807
 
4.4%
0 723
 
3.9%
9 627
 
3.4%
Other values (259) 8006
43.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9385
50.6%
Decimal Number 4815
26.0%
Space Separator 2460
 
13.3%
Close Punctuation 835
 
4.5%
Open Punctuation 835
 
4.5%
Other Punctuation 204
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1043
 
11.1%
818
 
8.7%
807
 
8.6%
463
 
4.9%
354
 
3.8%
302
 
3.2%
301
 
3.2%
232
 
2.5%
162
 
1.7%
147
 
1.6%
Other values (242) 4756
50.7%
Decimal Number
ValueCountFrequency (%)
2 1446
30.0%
1 934
19.4%
0 723
15.0%
9 627
13.0%
3 395
 
8.2%
4 291
 
6.0%
7 163
 
3.4%
6 97
 
2.0%
5 78
 
1.6%
8 61
 
1.3%
Other Punctuation
ValueCountFrequency (%)
, 192
94.1%
· 6
 
2.9%
? 5
 
2.5%
. 1
 
0.5%
Space Separator
ValueCountFrequency (%)
2460
100.0%
Close Punctuation
ValueCountFrequency (%)
) 835
100.0%
Open Punctuation
ValueCountFrequency (%)
( 835
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9385
50.6%
Common 9149
49.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1043
 
11.1%
818
 
8.7%
807
 
8.6%
463
 
4.9%
354
 
3.8%
302
 
3.2%
301
 
3.2%
232
 
2.5%
162
 
1.7%
147
 
1.6%
Other values (242) 4756
50.7%
Common
ValueCountFrequency (%)
2460
26.9%
2 1446
15.8%
1 934
 
10.2%
) 835
 
9.1%
( 835
 
9.1%
0 723
 
7.9%
9 627
 
6.9%
3 395
 
4.3%
4 291
 
3.2%
, 192
 
2.1%
Other values (7) 411
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9385
50.6%
ASCII 9143
49.3%
None 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2460
26.9%
2 1446
15.8%
1 934
 
10.2%
) 835
 
9.1%
( 835
 
9.1%
0 723
 
7.9%
9 627
 
6.9%
3 395
 
4.3%
4 291
 
3.2%
, 192
 
2.1%
Other values (6) 405
 
4.4%
Hangul
ValueCountFrequency (%)
1043
 
11.1%
818
 
8.7%
807
 
8.6%
463
 
4.9%
354
 
3.8%
302
 
3.2%
301
 
3.2%
232
 
2.5%
162
 
1.7%
147
 
1.6%
Other values (242) 4756
50.7%
None
ValueCountFrequency (%)
· 6
100.0%

종별
Categorical

Distinct5
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size6.0 KiB
5
399 
4
242 
2
45 
3
45 
1
 
21

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row5
3rd row4
4th row5
5th row5

Common Values

ValueCountFrequency (%)
5 399
53.1%
4 242
32.2%
2 45
 
6.0%
3 45
 
6.0%
1 21
 
2.8%

Length

2023-12-12T21:57:02.102499image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:57:02.240900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 399
53.1%
4 242
32.2%
2 45
 
6.0%
3 45
 
6.0%
1 21
 
2.8%

Missing values

2023-12-12T21:56:59.674324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:56:59.760609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호소재지업종종별
0㈜에이원알폼충청북도 괴산군 괴산읍 대제산단3길 46금속 조립구조재 제조업(25113)4
1㈜신성아그로충청북도 괴산군 괴산읍 대제산단1길 39-29복합비료 및 기타 화학비료 제조업(20312)5
2㈜엔케이충청북도 괴산군 괴산읍 대제산단1길 39-9표면가공목재 및 특정 목적용 제재목 제조업(16102)4
3세진종합기술㈜충청북도 괴산군 괴산읍 대제산단1길 39-17그외기타분류안된비금속광물제품제조업(23999) 외1(23991)5
4㈜시티오브테크충청북도 괴산군 괴산읍 대제산단1길 39-13그외기타유리제품제조업(23199)외1(29299)5
5명진화학㈜충청북도 괴산군 괴산읍 대제산단1길 92일반용도료 및 관련제품 제조업(20411,20202)5
6㈜제이피이충청북도 괴산군 괴산읍 대제산단3길 11도금,착색 및 기타표면 처리강재제조업(24191)절삭가공 및 유사처리업(25924)그외 기타 금속가공업(25929)5
7주식회사 와이즈메이커스충청북도 괴산군 괴산읍 대제산단2길 54육류 기타 가공 및 저장처리업 외 2(10129,10121,10742)5
8에프와이지㈜충청북도 괴산군 괴산읍 대제산단2길 64가금류 가공 및 저장 처리업 외 1(10121,10742)5
9성지피에스 주식회사충청북도 괴산군 괴산읍 대제산단2길 5기타 구조용 금속제품 제조업(25119)5
상호소재지업종종별
742㈜삼텍충청북도 충주시 충주산단3로 81합성수지 및 기타 플라스틱 물질 제조업(20302)5
743㈜가온테코충청북도 충주시 충주산단7로 5타일 및 유사 배내화 요업제품 제조업(26232)5
744씨테크충청북도 충주시 충주산단5로 19(용탄동)합성수지 및 기타 플라스틱물질 제조업(20202)5
745현대엘리베이터㈜충청북도 충주시 충주산단1로 128승강기 제조업(29162)2
746현대엘리베이터㈜(기숙사)충청북도 충주시 용탄동 1208번지승강기 제조업(29162)5
747정일산업㈜ 충주공장충청북도 충주시 충주산단2로 128승강기 제조업(29162)4
748한일식품㈜충청북도 충주시 용탄동 1229, 1230면류, 마카로니 및 유사식품 제조업(10730)4
749㈜태진정공 충주지점충청북도 충주시 충주산단3로 60자동차 차체용 신품 부품 제조업 외1종(30320,30400)4
750㈜케이지씨예본충청북도 충주시 가주농공2길 27 (가주동)건강기능식품제조업(10797)외 3종(10795, 10796, 10799)2
751㈜보원케미칼충청북도 충주시 산척면 동충주산업단지 E3-1 구역플라스틱 적층, 도포 및 기타 표면처리 제품 제조업(22292)4