Overview

Dataset statistics

Number of variables: 7
Number of observations: 400
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 22.4 KiB
Average record size in memory: 57.3 B

Variable types

Numeric: 1
Text: 2
Categorical: 3
DateTime: 1

Dataset

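Description: Status of air pollutant emission facilities in Sejong Special Self-Governing City (business name, location, class, report/permit, suspension of business, etc.). Note: can be used when starting an air quality measurement business.
URL: https://www.data.go.kr/data/15080720/fileData.do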

Alerts

데이터기준일자 has constant value "2023-08-24" (Constant)
비고 is highly imbalanced (90.3%) (Imbalance)
연번 has unique values (Unique)
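
The 90.3% in the imbalance alert is consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below recomputes it from the 비고 counts given in this report (395 blank values, 5 "휴업"). That ydata-profiling uses exactly this formula is an assumption, and `imbalance_score` is a hypothetical helper name, not part of any library.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the counts.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    (Assumed formula; hypothetical helper for illustration.)
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 1.0  # a single class is treated as maximally imbalanced
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(k)


# Counts for 비고 as reported in this document.
score = imbalance_score([395, 5])
print(f"{score:.1%}")
```

A balanced two-class column such as `[200, 200]` scores 0 under the same formula, which is why the 신고-허가 column (350 vs 50) would score much lower than 비고.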

Reproduction

Analysis started: 2023-12-12 16:19:23.580793
Analysis finished: 2023-12-12 16:19:24.297976
Duration: 0.72 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 400
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 200.5
Minimum: 1
Maximum: 400
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 20.95
Q1: 100.75
Median: 200.5
Q3: 300.25
95-th percentile: 380.05
Maximum: 400
Range: 399
Interquartile range (IQR): 199.5
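
These quantiles are what linear interpolation between order statistics gives for the sequence 1 to 400. The sketch below reproduces them with the standard library; it assumes 연번 holds exactly the integers 1 through 400, which the minimum, maximum, distinct count and sum in this report all support.

```python
from statistics import quantiles

values = list(range(1, 401))  # assumed contents of 연번: 1, 2, ..., 400

# method="inclusive" interpolates linearly between the min and max of the data.
q1, median, q3 = quantiles(values, n=4, method="inclusive")

# Splitting into 20 groups yields 19 cut points spaced 5% apart.
ventiles = quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

iqr = q3 - q1
print(q1, median, q3, iqr)
print(p5, p95)
```

For example, the first quartile sits at position 0.25 × 399 = 99.75 between the 100th and 101st values, giving 100 + 0.75 = 100.75.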

Descriptive statistics

Standard deviation: 115.6143
Coefficient of variation (CV): 0.57662993
Kurtosis: -1.2
Mean: 200.5
Median Absolute Deviation (MAD): 100
Skewness: 0
Sum: 80200
Variance: 13366.667
Monotonicity: Strictly increasing
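
Most of these figures can be checked by hand, again assuming 연번 is the sequence 1 to 400. The sketch below uses the sample (n - 1) variance, which is the one that matches the reported 13366.667; the population variance would be 13333.25.

```python
from statistics import mean, median, stdev, variance

values = list(range(1, 401))  # assumed contents of 연번: 1, 2, ..., 400

mu = mean(values)
var = variance(values)   # sample variance, n - 1 in the denominator
sd = stdev(values)       # square root of the sample variance
cv = sd / mu             # coefficient of variation

# Median absolute deviation: median distance from the median.
center = median(values)
mad = median(abs(v - center) for v in values)

total = sum(values)
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mu, round(var, 3), round(sd, 4), round(cv, 8), mad, total, strictly_increasing)
```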
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.2%
265 | 1 | 0.2%
275 | 1 | 0.2%
274 | 1 | 0.2%
273 | 1 | 0.2%
272 | 1 | 0.2%
271 | 1 | 0.2%
270 | 1 | 0.2%
269 | 1 | 0.2%
268 | 1 | 0.2%
Other values (390) | 390 | 97.5%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.2%
2 | 1 | 0.2%
3 | 1 | 0.2%
4 | 1 | 0.2%
5 | 1 | 0.2%
6 | 1 | 0.2%
7 | 1 | 0.2%
8 | 1 | 0.2%
9 | 1 | 0.2%
10 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
400 | 1 | 0.2%
399 | 1 | 0.2%
398 | 1 | 0.2%
397 | 1 | 0.2%
396 | 1 | 0.2%
395 | 1 | 0.2%
394 | 1 | 0.2%
393 | 1 | 0.2%
392 | 1 | 0.2%
391 | 1 | 0.2%

업체명
Text

Distinct: 391
Distinct (%): 97.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB

Length

Max length: 27
Median length: 20
Mean length: 8.675
Min length: 2

Characters and Unicode

Total characters: 3470
Distinct characters: 327
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
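
As a small illustration of the character properties mentioned above, the sketch below tallies Unicode general categories with Python's `unicodedata` module over the first five 업체명 values shown in this report. It covers only those five rows, so the counts are not the report's totals, and the mapping to long category names is limited to the codes that occur here.

```python
import unicodedata
from collections import Counter

# First five 업체명 values, copied from the sample rows of this report.
names = ["한림제지", "(주)동양식품", "부강목욕탕", "유진기업(주)세종", "새마을정미소"]

# Two-letter general category codes and the long names used in the report.
LONG_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

category_counts = Counter(
    unicodedata.category(ch) for name in names for ch in name
)

for code, count in category_counts.most_common():
    print(LONG_NAMES.get(code, code), count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates both text columns in this report.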

Unique

Unique: 382
Unique (%): 95.5%

Sample

1st row: 한림제지
2nd row: (주)동양식품
3rd row: 부강목욕탕
4th row: 유진기업(주)세종
5th row: 새마을정미소

Words

Value | Count | Frequency (%)
세종공장 | 10 | 2.2%
주식회사 | 6 | 1.3%
세종사업장 | 5 | 1.1%
세종특별자치시 | 4 | 0.9%
농업회사법인 | 3 | 0.6%
한국콜마(주 | 3 | 0.6%
주)포스코퓨처엠 | 3 | 0.6%
유진통신공업(주 | 2 | 0.4%
세종시농협쌀조합공동사업법인 | 2 | 0.4%
한국가스공사 | 2 | 0.4%
Other values (411) | 425 | 91.4%

Most occurring characters

Value | Count | Frequency (%)
(not shown) | 290 | 8.4%
( | 289 | 8.3%
) | 289 | 8.3%
(not shown) | 87 | 2.5%
(not shown) | 87 | 2.5%
(not shown) | 70 | 2.0%
(not shown) | 70 | 2.0%
(not shown) | 67 | 1.9%
(not shown) | 65 | 1.9%
(not shown) | 65 | 1.9%
Other values (317) | 2091 | 60.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2796 | 80.6%
Open Punctuation | 289 | 8.3%
Close Punctuation | 289 | 8.3%
Space Separator | 65 | 1.9%
Uppercase Letter | 15 | 0.4%
Decimal Number | 13 | 0.4%
Other Symbol | 2 | 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 290 | 10.4%
(not shown) | 87 | 3.1%
(not shown) | 87 | 3.1%
(not shown) | 70 | 2.5%
(not shown) | 70 | 2.5%
(not shown) | 67 | 2.4%
(not shown) | 65 | 2.3%
(not shown) | 58 | 2.1%
(not shown) | 56 | 2.0%
(not shown) | 43 | 1.5%
Other values (298) | 1903 | 68.1%

Uppercase Letter
Value | Count | Frequency (%)
A | 2 | 13.3%
K | 2 | 13.3%
S | 2 | 13.3%
P | 2 | 13.3%
E | 1 | 6.7%
M | 1 | 6.7%
H | 1 | 6.7%
D | 1 | 6.7%
V | 1 | 6.7%
O | 1 | 6.7%

Decimal Number
Value | Count | Frequency (%)
1 | 7 | 53.8%
2 | 4 | 30.8%
9 | 2 | 15.4%

Open Punctuation
Value | Count | Frequency (%)
( | 289 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 289 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 65 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not shown) | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%

Most occurring scripts

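Value | Count | Frequency (%)
Hangul | 2798 | 80.6%
Common | 657 | 18.9%
Latin | 15 | 0.4%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not shown) | 290 | 10.4%
(not shown) | 87 | 3.1%
(not shown) | 87 | 3.1%
(not shown) | 70 | 2.5%
(not shown) | 70 | 2.5%
(not shown) | 67 | 2.4%
(not shown) | 65 | 2.3%
(not shown) | 58 | 2.1%
(not shown) | 56 | 2.0%
(not shown) | 43 | 1.5%
Other values (299) | 1905 | 68.1%

Latin
Value | Count | Frequency (%)
A | 2 | 13.3%
K | 2 | 13.3%
S | 2 | 13.3%
P | 2 | 13.3%
E | 1 | 6.7%
M | 1 | 6.7%
H | 1 | 6.7%
D | 1 | 6.7%
V | 1 | 6.7%
O | 1 | 6.7%

Common
Value | Count | Frequency (%)
( | 289 | 44.0%
) | 289 | 44.0%
(space) | 65 | 9.9%
1 | 7 | 1.1%
2 | 4 | 0.6%
9 | 2 | 0.3%
& | 1 | 0.2%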

Most occurring blocks

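Value | Count | Frequency (%)
Hangul | 2796 | 80.6%
ASCII | 672 | 19.4%
None | 2 | 0.1%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not shown) | 290 | 10.4%
(not shown) | 87 | 3.1%
(not shown) | 87 | 3.1%
(not shown) | 70 | 2.5%
(not shown) | 70 | 2.5%
(not shown) | 67 | 2.4%
(not shown) | 65 | 2.3%
(not shown) | 58 | 2.1%
(not shown) | 56 | 2.0%
(not shown) | 43 | 1.5%
Other values (298) | 1903 | 68.1%

ASCII
Value | Count | Frequency (%)
( | 289 | 43.0%
) | 289 | 43.0%
(space) | 65 | 9.7%
1 | 7 | 1.0%
2 | 4 | 0.6%
A | 2 | 0.3%
9 | 2 | 0.3%
K | 2 | 0.3%
S | 2 | 0.3%
P | 2 | 0.3%
Other values (8) | 8 | 1.2%

None
Value | Count | Frequency (%)
(not shown) | 2 | 100.0%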

소재지
Text

Distinct: 375
Distinct (%): 93.8%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB

Length

Max length: 45
Median length: 38
Mean length: 21.845
Min length: 17

Characters and Unicode

Total characters: 8738
Distinct characters: 240
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 354
Unique (%): 88.5%

Sample

1st row: 세종특별자치시 조치원읍 새내6길 14
2nd row: 세종특별자치시 조치원읍 허만석로 40-2
3rd row: 세종특별자치시 부강면 부강5길 24
4th row: 세종특별자치시 금남면 안금로 351-43
5th row: 세종특별자치시 장군면 문성2길 16

Words

Value | Count | Frequency (%)
세종특별자치시 | 400 | 24.2%
부강면 | 92 | 5.6%
전의면 | 78 | 4.7%
전동면 | 46 | 2.8%
연서면 | 38 | 2.3%
조치원읍 | 37 | 2.2%
연동면 | 32 | 1.9%
산단길 | 30 | 1.8%
소정면 | 23 | 1.4%
장군면 | 17 | 1.0%
Other values (523) | 858 | 52.0%

Most occurring characters

Value | Count | Frequency (%)
(space) | 1306 | 14.9%
(not shown) | 439 | 5.0%
(not shown) | 417 | 4.8%
(not shown) | 416 | 4.8%
(not shown) | 414 | 4.7%
(not shown) | 401 | 4.6%
(not shown) | 401 | 4.6%
(not shown) | 400 | 4.6%
(not shown) | 349 | 4.0%
1 | 284 | 3.3%
Other values (230) | 3911 | 44.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5839 | 66.8%
Decimal Number | 1356 | 15.5%
Space Separator | 1306 | 14.9%
Dash Punctuation | 167 | 1.9%
Open Punctuation | 30 | 0.3%
Close Punctuation | 30 | 0.3%
Uppercase Letter | 9 | 0.1%
Other Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 439 | 7.5%
(not shown) | 417 | 7.1%
(not shown) | 416 | 7.1%
(not shown) | 414 | 7.1%
(not shown) | 401 | 6.9%
(not shown) | 401 | 6.9%
(not shown) | 400 | 6.9%
(not shown) | 349 | 6.0%
(not shown) | 203 | 3.5%
(not shown) | 170 | 2.9%
Other values (208) | 2229 | 38.2%

Decimal Number
Value | Count | Frequency (%)
1 | 284 | 20.9%
2 | 214 | 15.8%
3 | 145 | 10.7%
4 | 131 | 9.7%
7 | 118 | 8.7%
6 | 102 | 7.5%
5 | 101 | 7.4%
0 | 98 | 7.2%
8 | 84 | 6.2%
9 | 79 | 5.8%

Uppercase Letter
Value | Count | Frequency (%)
S | 2 | 22.2%
C | 2 | 22.2%
B | 1 | 11.1%
K | 1 | 11.1%
A | 1 | 11.1%
F | 1 | 11.1%
L | 1 | 11.1%

Space Separator
Value | Count | Frequency (%)
(space) | 1306 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 167 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 30 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 30 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not shown) | 1 | 100.0%

Most occurring scripts

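Value | Count | Frequency (%)
Hangul | 5840 | 66.8%
Common | 2889 | 33.1%
Latin | 9 | 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not shown) | 439 | 7.5%
(not shown) | 417 | 7.1%
(not shown) | 416 | 7.1%
(not shown) | 414 | 7.1%
(not shown) | 401 | 6.9%
(not shown) | 401 | 6.9%
(not shown) | 400 | 6.8%
(not shown) | 349 | 6.0%
(not shown) | 203 | 3.5%
(not shown) | 170 | 2.9%
Other values (209) | 2230 | 38.2%

Common
Value | Count | Frequency (%)
(space) | 1306 | 45.2%
1 | 284 | 9.8%
2 | 214 | 7.4%
- | 167 | 5.8%
3 | 145 | 5.0%
4 | 131 | 4.5%
7 | 118 | 4.1%
6 | 102 | 3.5%
5 | 101 | 3.5%
0 | 98 | 3.4%
Other values (4) | 223 | 7.7%

Latin
Value | Count | Frequency (%)
S | 2 | 22.2%
C | 2 | 22.2%
B | 1 | 11.1%
K | 1 | 11.1%
A | 1 | 11.1%
F | 1 | 11.1%
L | 1 | 11.1%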

Most occurring blocks

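Value | Count | Frequency (%)
Hangul | 5839 | 66.8%
ASCII | 2898 | 33.2%
None | 1 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1306 | 45.1%
1 | 284 | 9.8%
2 | 214 | 7.4%
- | 167 | 5.8%
3 | 145 | 5.0%
4 | 131 | 4.5%
7 | 118 | 4.1%
6 | 102 | 3.5%
5 | 101 | 3.5%
0 | 98 | 3.4%
Other values (11) | 232 | 8.0%

Hangul
Value | Count | Frequency (%)
(not shown) | 439 | 7.5%
(not shown) | 417 | 7.1%
(not shown) | 416 | 7.1%
(not shown) | 414 | 7.1%
(not shown) | 401 | 6.9%
(not shown) | 401 | 6.9%
(not shown) | 400 | 6.9%
(not shown) | 349 | 6.0%
(not shown) | 203 | 3.5%
(not shown) | 170 | 2.9%
Other values (208) | 2229 | 38.2%

None
Value | Count | Frequency (%)
(not shown) | 1 | 100.0%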

종별
Categorical

Distinct: 5
Distinct (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB

Value | Count
5종 | 250
4종 | 117
2종 | 14
3종 | 10
1종 | 9

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 4종
2nd row: 5종
3rd row: 5종
4th row: 4종
5th row: 5종

Common Values

Value | Count | Frequency (%)
5종 | 250 | 62.5%
4종 | 117 | 29.2%
2종 | 14 | 3.5%
3종 | 10 | 2.5%
1종 | 9 | 2.2%
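
Each frequency here is simply the class count divided by the 400 observations, and Distinct (%) is the number of classes over the same total. A minimal sketch using the 종별 counts from this report:

```python
from collections import Counter

# Class counts for 종별 as reported in this document.
counts = Counter({"5종": 250, "4종": 117, "2종": 14, "3종": 10, "1종": 9})
total = sum(counts.values())  # number of observations

# Frequency (%) per class, most common first.
frequencies = {value: 100 * count / total for value, count in counts.most_common()}
for value, pct in frequencies.items():
    print(f"{value}\t{counts[value]}\t{pct:.2f}%")

# Distinct (%) is the number of distinct classes over the number of observations.
distinct_pct = 100 * len(counts) / total
print(f"Distinct (%): {distinct_pct:.2f}%")
```

The report shows these values with one decimal place, so 29.25% and 1.25% appear as 29.2% and 1.2%.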

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

신고-허가
Categorical

Distinct: 2
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB

Value | Count
신고 | 350
허가 | 50

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신고
2nd row: 신고
3rd row: 신고
4th row: 신고
5th row: 신고

Common Values

Value | Count | Frequency (%)
신고 | 350 | 87.5%
허가 | 50 | 12.5%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

비고
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB

Value | Count
(blank) | 395
휴업 | 5

Length

Max length: 2
Median length: 1
Mean length: 1.0125
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: (blank)
2nd row: (blank)
3rd row: (blank)
4th row: (blank)
5th row: (blank)

Common Values

Value | Count | Frequency (%)
(blank) | 395 | 98.8%
휴업 | 5 | 1.2%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
휴업 | 5 | 100.0%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB
Minimum: 2023-08-24 00:00:00
Maximum: 2023-08-24 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

 | 연번 | 종별 | 신고-허가 | 비고
연번 | 1.000 | 0.274 | 0.331 | 0.189
종별 | 0.274 | 1.000 | 0.356 | 0.000
신고-허가 | 0.331 | 0.356 | 1.000 | 0.000
비고 | 0.189 | 0.000 | 0.000 | 1.000

 | 종별 | 신고-허가 | 비고
종별 | 1.000 | 0.433 | 0.000
신고-허가 | 0.433 | 1.000 | 0.000
비고 | 0.000 | 0.000 | 1.000

 | 연번 | 종별 | 신고-허가 | 비고
연번 | 1.000 | 0.116 | 0.252 | 0.143
종별 | 0.116 | 1.000 | 0.433 | 0.000
신고-허가 | 0.252 | 0.433 | 1.000 | 0.000
비고 | 0.143 | 0.000 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 연번 | 업체명 | 소재지 | 종별 | 신고-허가 | 비고 | 데이터기준일자
0 | 1 | 한림제지 | 세종특별자치시 조치원읍 새내6길 14 | 4종 | 신고 | | 2023-08-24
1 | 2 | (주)동양식품 | 세종특별자치시 조치원읍 허만석로 40-2 | 5종 | 신고 | | 2023-08-24
2 | 3 | 부강목욕탕 | 세종특별자치시 부강면 부강5길 24 | 5종 | 신고 | | 2023-08-24
3 | 4 | 유진기업(주)세종 | 세종특별자치시 금남면 안금로 351-43 | 4종 | 신고 | | 2023-08-24
4 | 5 | 새마을정미소 | 세종특별자치시 장군면 문성2길 16 | 5종 | 신고 | | 2023-08-24
5 | 6 | (주)일성안티몬 | 세종특별자치시 전의면 의당전의로 904 | 5종 | 신고 | | 2023-08-24
6 | 7 | (주)에스피씨삼립 세종공장 | 세종특별자치시 금남면 진동길 15 | 4종 | 신고 | | 2023-08-24
7 | 8 | (주)우석산업 | 세종특별자치시 부강면 연청로 1147-26 | 2종 | 신고 | | 2023-08-24
8 | 9 | (주)중부판지 | 세종특별자치시 부강면 금호리 501-1 | 4종 | 허가 | 휴업 | 2023-08-24
9 | 10 | 국곡제재소 | 세종특별자치시 금남면 국곡길 11 | 5종 | 신고 | | 2023-08-24

Last rows

Index | 연번 | 업체명 | 소재지 | 종별 | 신고-허가 | 비고 | 데이터기준일자
390 | 391 | (주)진켐 세종공장 | 세종특별자치시 소정면 소정산단3로 24 | 4종 | 신고 | | 2023-08-24
391 | 392 | (주)동현엔지니어링 | 세종특별자치시 부강면 노호등곡로 15 | 5종 | 신고 | | 2023-08-24
392 | 393 | (주)대국환경기업 | 세종특별자치시 금남면 안금로 395-36 외 2필지(두만리 315 325-2) | 5종 | 신고 | | 2023-08-24
393 | 394 | (주)마이크로이미지 세종사업장 | 세종특별자치시 집현중앙2로 15 (집현동) | 5종 | 신고 | | 2023-08-24
394 | 395 | 한화첨단소재(주) 세종사업장 | 세종특별자치시 부강면 금호안골길 79-20 | 2종 | 허가 | | 2023-08-24
395 | 396 | 에스케이머티리얼즈그룹포틴(주) | 세종특별자치시 연동면 명학산단로 130 2층 | 5종 | 신고 | | 2023-08-24
396 | 397 | (주)레이크머티리얼즈(세종미래지점) | 세종특별자치시 전의면 양곡리 592 (미래일반산업단지) | 5종 | 허가 | | 2023-08-24
397 | 398 | (주)신안피앤씨 | 세종특별자치시 소정면 고등리 704 (첨단일반산업단지) | 3종 | 신고 | | 2023-08-24
398 | 399 | (주)포스코퓨처엠 | 세종특별자치시 소정면 고등리 748 749(첨단일반산업단지) | 5종 | 허가 | | 2023-08-24
399 | 400 | (주)스위트바이오 | 세종특별자치시 전의면 양곡리 590 (미래일반산업단지 6-3BL) | 4종 | 신고 | | 2023-08-24