Overview

Dataset statistics

Number of variables: 6
Number of observations: 91
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.5 KiB
Average record size in memory: 50.5 B

Variable types

Numeric: 1
Text: 2
Categorical: 2
DateTime: 1

Dataset

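Description: Status of fugitive-dust-generating businesses in Namdong-gu. The dataset provides the columns 연번 (serial number), 사업장명 (business name), 현장소재지 (site location), 발생사업 (dust-generating business type), 대상사업 (regulated business type), and 데이터기준일자 (data reference date).
URL: https://www.data.go.kr/data/15087548/fileData.do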

Alerts

데이터기준일자 has constant value 2023-05-11 [Constant]
발생사업 is highly overall correlated with 대상사업 [High correlation]
대상사업 is highly overall correlated with 발생사업 [High correlation]
발생사업 is highly imbalanced (52.7%) [Imbalance]
연번 has unique values [Unique]
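
The report does not say how the 52.7% imbalance figure for 발생사업 is derived. A minimal sketch, assuming the score is one minus the Shannon entropy normalized by its maximum (log2 of the number of classes), comes out at roughly 52.7% when fed the class counts listed in the 발생사업 Common Values table. The function name `imbalance_score` is hypothetical, and the formula is an assumption that happens to agree with the reported value, not a confirmed description of the tool's internals.

```python
import math

# Class counts for 발생사업, copied from its Common Values table in this report.
counts = [68, 6, 5, 4, 3, 2, 1, 1, 1]


def imbalance_score(class_counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    This formula is an assumption, not taken from the report itself.
    """
    total = sum(class_counts)
    k = len(class_counts)
    if k < 2:
        return 0.0
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in class_counts if c > 0
    )
    return 1 - entropy / math.log2(k)


score = imbalance_score(counts)
print(f"{score:.1%}")
```

Under this assumed definition, the score is driven mostly by 건설업 holding 68 of the 91 rows.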

Reproduction

Analysis started: 2023-12-12 05:04:51.831222
Analysis finished: 2023-12-12 05:04:52.969416
Duration: 1.14 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 91
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 46
Minimum: 1
Maximum: 91
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 951.0 B

Quantile statistics

Minimum: 1
5th percentile: 5.5
Q1: 23.5
Median: 46
Q3: 68.5
95th percentile: 86.5
Maximum: 91
Range: 90
Interquartile range (IQR): 45

Descriptive statistics

Standard deviation: 26.41338
Coefficient of variation (CV): 0.57420392
Kurtosis: -1.2
Mean: 46
Median Absolute Deviation (MAD): 23
Skewness: 0
Sum: 4186
Variance: 697.66667
Monotonicity: Strictly increasing
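
Because 연번 is a gap-free serial number from 1 to 91, most of the figures above can be checked by hand. The sketch below recomputes them with Python's standard `statistics` module. It assumes the report uses the sample variance (n - 1 denominator) and linearly interpolated quantiles, which is what the reported values are consistent with.

```python
import statistics

# 연번 is strictly increasing from 1 to 91, so the column can be rebuilt directly.
serials = list(range(1, 92))

total = sum(serials)
mean = statistics.mean(serials)
variance = statistics.variance(serials)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(serials)
cv = std_dev / mean  # coefficient of variation

# "inclusive" treats the data as spanning the full range, i.e. linear
# interpolation between the minimum and maximum.
q1, median, q3 = statistics.quantiles(serials, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation around the median.
mad = statistics.median(abs(x - median) for x in serials)

print(total, mean, round(variance, 5), round(std_dev, 5), round(cv, 8))
print(q1, median, q3, iqr, mad)
```

For any run of consecutive integers 1..n the sample variance equals n(n + 1)/12, which for n = 91 gives 697.67 and explains the reported value.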
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 1.1%
59 | 1 | 1.1%
68 | 1 | 1.1%
67 | 1 | 1.1%
66 | 1 | 1.1%
65 | 1 | 1.1%
64 | 1 | 1.1%
63 | 1 | 1.1%
62 | 1 | 1.1%
61 | 1 | 1.1%
Other values (81) | 81 | 89.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.1%
2 | 1 | 1.1%
3 | 1 | 1.1%
4 | 1 | 1.1%
5 | 1 | 1.1%
6 | 1 | 1.1%
7 | 1 | 1.1%
8 | 1 | 1.1%
9 | 1 | 1.1%
10 | 1 | 1.1%

Maximum 10 values

Value | Count | Frequency (%)
91 | 1 | 1.1%
90 | 1 | 1.1%
89 | 1 | 1.1%
88 | 1 | 1.1%
87 | 1 | 1.1%
86 | 1 | 1.1%
85 | 1 | 1.1%
84 | 1 | 1.1%
83 | 1 | 1.1%
82 | 1 | 1.1%

사업장명
Text

Distinct: 90
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Memory size: 860.0 B

Length

Max length: 18
Median length: 15
Mean length: 7.956044
Min length: 3

Characters and Unicode

Total characters: 724
Distinct characters: 134
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 89
Unique (%): 97.8%

Sample

1st row: 주식회사 장원레미콘(인천공장지점)
2nd row: 대영건설산업(주)
3rd row: 세화골재
4th row: 영광골재
5th row: ㈜인조상사

Value | Count | Frequency (%)
주식회사 | 10 | 9.4%
금호건설(주 | 2 | 1.9%
청명건설주식회사 | 1 | 0.9%
주)엘엠종합건설 | 1 | 0.9%
현원건설(주 | 1 | 0.9%
주)준수디앤씨 | 1 | 0.9%
다온건설 | 1 | 0.9%
주)오남건설 | 1 | 0.9%
주)피엠건설 | 1 | 0.9%
위본건설(주 | 1 | 0.9%
Other values (86) | 86 | 81.1%

Most occurring characters

ValueCountFrequency (%)
78
 
10.8%
( 58
 
8.0%
) 58
 
8.0%
51
 
7.0%
46
 
6.4%
20
 
2.8%
18
 
2.5%
18
 
2.5%
16
 
2.2%
15
 
2.1%
Other values (124) 346
47.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 580 | 80.1%
Open Punctuation | 58 | 8.0%
Close Punctuation | 58 | 8.0%
Space Separator | 16 | 2.2%
Other Symbol | 11 | 1.5%
Decimal Number | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
78
 
13.4%
51
 
8.8%
46
 
7.9%
20
 
3.4%
18
 
3.1%
18
 
3.1%
15
 
2.6%
15
 
2.6%
10
 
1.7%
10
 
1.7%
Other values (119) 299
51.6%
Open Punctuation
ValueCountFrequency (%)
( 58
100.0%
Close Punctuation
ValueCountFrequency (%)
) 58
100.0%
Space Separator
ValueCountFrequency (%)
16
100.0%
Other Symbol
ValueCountFrequency (%)
11
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 591 | 81.6%
Common | 133 | 18.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
78
 
13.2%
51
 
8.6%
46
 
7.8%
20
 
3.4%
18
 
3.0%
18
 
3.0%
15
 
2.5%
15
 
2.5%
11
 
1.9%
10
 
1.7%
Other values (120) 309
52.3%
Common
ValueCountFrequency (%)
( 58
43.6%
) 58
43.6%
16
 
12.0%
1 1
 
0.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 580 | 80.1%
ASCII | 133 | 18.4%
None | 11 | 1.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
78
 
13.4%
51
 
8.8%
46
 
7.9%
20
 
3.4%
18
 
3.1%
18
 
3.1%
15
 
2.6%
15
 
2.6%
10
 
1.7%
10
 
1.7%
Other values (119) 299
51.6%
ASCII
ValueCountFrequency (%)
( 58
43.6%
) 58
43.6%
16
 
12.0%
1 1
 
0.8%
None
ValueCountFrequency (%)
11
100.0%

현장소재지
Text

Distinct: 90
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Memory size: 860.0 B

Length

Max length: 50
Median length: 37
Mean length: 21.681319
Min length: 16

Characters and Unicode

Total characters: 1973
Distinct characters: 64
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 89
Unique (%): 97.8%

Sample

1st row: 인천광역시 남동구 논현동 442-34
2nd row: 인천광역시 남동구 논현동 66-12
3rd row: 인천광역시 남동구 만수동 986-18
4th row: 인천광역시 남동구 간석동 406-2
5th row: 인천광역시 남동구 구월동 1128-3

Value | Count | Frequency (%)
인천광역시 | 91 | 22.4%
남동구 | 91 | 22.4%
간석동 | 23 | 5.7%
논현동 | 14 | 3.4%
구월동 | 13 | 3.2%
고잔동 | 12 | 2.9%
일원 | 11 | 2.7%
만수동 | 9 | 2.2%
(value not recovered) | 8 | 2.0%
서창동 | 5 | 1.2%
Other values (114) | 130 | 31.9%

Most occurring characters

ValueCountFrequency (%)
317
16.1%
182
 
9.2%
107
 
5.4%
1 101
 
5.1%
95
 
4.8%
91
 
4.6%
91
 
4.6%
91
 
4.6%
91
 
4.6%
91
 
4.6%
Other values (54) 716
36.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1130 | 57.3%
Decimal Number | 425 | 21.5%
Space Separator | 317 | 16.1%
Dash Punctuation | 67 | 3.4%
Other Punctuation | 19 | 1.0%
Close Punctuation | 5 | 0.3%
Open Punctuation | 5 | 0.3%
Math Symbol | 5 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
182
16.1%
107
9.5%
95
 
8.4%
91
 
8.1%
91
 
8.1%
91
 
8.1%
91
 
8.1%
91
 
8.1%
23
 
2.0%
23
 
2.0%
Other values (38) 245
21.7%
Decimal Number
ValueCountFrequency (%)
1 101
23.8%
6 54
12.7%
2 50
11.8%
3 39
 
9.2%
4 38
 
8.9%
7 38
 
8.9%
8 35
 
8.2%
0 26
 
6.1%
9 22
 
5.2%
5 22
 
5.2%
Space Separator
ValueCountFrequency (%)
317
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 67
100.0%
Other Punctuation
ValueCountFrequency (%)
, 19
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1130 | 57.3%
Common | 843 | 42.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
182
16.1%
107
9.5%
95
 
8.4%
91
 
8.1%
91
 
8.1%
91
 
8.1%
91
 
8.1%
91
 
8.1%
23
 
2.0%
23
 
2.0%
Other values (38) 245
21.7%
Common
ValueCountFrequency (%)
317
37.6%
1 101
 
12.0%
- 67
 
7.9%
6 54
 
6.4%
2 50
 
5.9%
3 39
 
4.6%
4 38
 
4.5%
7 38
 
4.5%
8 35
 
4.2%
0 26
 
3.1%
Other values (6) 78
 
9.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1130 | 57.3%
ASCII | 843 | 42.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
317
37.6%
1 101
 
12.0%
- 67
 
7.9%
6 54
 
6.4%
2 50
 
5.9%
3 39
 
4.6%
4 38
 
4.5%
7 38
 
4.5%
8 35
 
4.2%
0 26
 
3.1%
Other values (6) 78
 
9.3%
Hangul
ValueCountFrequency (%)
182
16.1%
107
9.5%
95
 
8.4%
91
 
8.1%
91
 
8.1%
91
 
8.1%
91
 
8.1%
91
 
8.1%
23
 
2.0%
23
 
2.0%
Other values (38) 245
21.7%

발생사업
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 9
Distinct (%): 9.9%
Missing: 0
Missing (%): 0.0%
Memory size: 860.0 B

Most frequent categories (count):
건설업: 68
비금속물질의 채취,제조,가공업: 6
시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업: 5
제1차금속제조업: 4
비금속물질의 채취 ,제조,가공업: 3
Other values (4): 5

Length

Max length: 60
Median length: 3
Mean length: 8.4395604
Min length: 3

Unique

Unique: 3
Unique (%): 3.3%

Sample

1st row: 시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업, 비금속물질의 채취 ,제조,가공업
2nd row: 시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업
3rd row: 비금속물질의 채취,제조,가공업
4th row: 비금속물질의 채취,제조,가공업
5th row: 비금속물질의 채취,제조,가공업

Common Values

Value | Count | Frequency (%)
건설업 | 68 | 74.7%
비금속물질의 채취,제조,가공업 | 6 | 6.6%
시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업 | 5 | 5.5%
제1차금속제조업 | 4 | 4.4%
비금속물질의 채취 ,제조,가공업 | 3 | 3.3%
시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업, 비금속물질의 채취 ,제조,가공업 | 2 | 2.2%
비료및사료제품제조업 | 1 | 1.1%
금속제품제조업 | 1 | 1.1%
시멘트, 석회, 플라스터 및 시멘트관련 제품의 제조 및 가공업 | 1 | 1.1%
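
Two of the nine categories above, "비금속물질의 채취,제조,가공업" and "비금속물질의 채취 ,제조,가공업", differ only by a space before a comma, so the distinct count is probably inflated by inconsistent data entry. The sketch below shows one possible clean-up under that assumption: strip each label and remove whitespace around commas, then re-aggregate the counts. The helper `normalize_label` is hypothetical and is not part of the report or of ydata-profiling. Note that it also tightens the ", " that joins the two business types in the combined category, and it does not touch the "(Plaster)" spelling difference.

```python
import re
from collections import Counter

# Counts copied from the Common Values table for 발생사업.
raw_counts = {
    "건설업": 68,
    "비금속물질의 채취,제조,가공업": 6,
    "시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업": 5,
    "제1차금속제조업": 4,
    "비금속물질의 채취 ,제조,가공업": 3,
    "시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업, 비금속물질의 채취 ,제조,가공업": 2,
    "비료및사료제품제조업": 1,
    "금속제품제조업": 1,
    "시멘트, 석회, 플라스터 및 시멘트관련 제품의 제조 및 가공업": 1,
}


def normalize_label(label: str) -> str:
    """Trim the label and drop any whitespace around commas."""
    return re.sub(r"\s*,\s*", ",", label.strip())


# Re-aggregate the counts under the normalized labels.
merged = Counter()
for label, count in raw_counts.items():
    merged[normalize_label(label)] += count

print(len(raw_counts), "->", len(merged))
print(merged["비금속물질의 채취,제조,가공업"])
```

With only whitespace normalized, the two spellings of 비금속물질의 채취,제조,가공업 merge into a single category; whether the remaining variants should also be merged is a judgment call for the data owner.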

Length

Histogram of lengths of the category

Common Values (Plot)


Value | Count | Frequency (%)
건설업 | 68 | 42.8%
(value not recovered) | 16 | 10.1%
비금속물질의 | 11 | 6.9%
제품의 | 8 | 5.0%
제조 | 8 | 5.0%
가공업 | 8 | 5.0%
시멘트관련 | 8 | 5.0%
시멘트,석회,플라스터(plaster | 7 | 4.4%
채취,제조,가공업 | 6 | 3.8%
채취 | 5 | 3.1%
Other values (7) | 14 | 8.8%

대상사업
Categorical

HIGH CORRELATION 

Distinct: 20
Distinct (%): 22.0%
Missing: 0
Missing (%): 0.0%
Memory size: 860.0 B

Most frequent categories (count):
건축물축조공사: 38
토목공사: 10
도장공사: 7
콘크리트제품제조업: 5
토사석광업(골재보관,판매업): 4
Other values (15): 27

Length

Max length: 24
Median length: 20
Mean length: 7.4725275
Min length: 4

Unique

Unique: 9
Unique (%): 9.9%

Sample

1st row: 콘크리트제품제조업, 비금속광물 분쇄물 생산업
2nd row: 콘크리트제품제조업
3rd row: 토사석광업(골재보관,판매업)
4th row: 토사석광업(골재보관,판매업)
5th row: 토사석광업(골재보관,판매업)

Common Values

Value | Count | Frequency (%)
건축물축조공사 | 38 | 41.8%
토목공사 | 10 | 11.0%
도장공사 | 7 | 7.7%
콘크리트제품제조업 | 5 | 5.5%
토사석광업(골재보관,판매업) | 4 | 4.4%
건설폐기물처리업 | 4 | 4.4%
금속주조업 | 4 | 4.4%
건축물해체공사 | 4 | 4.4%
건추물축조공사 | 2 | 2.2%
조경공사 | 2 | 2.2%
Other values (10) | 11 | 12.1%
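
The label "건추물축조공사" (2 rows) differs by a single character from the dominant "건축물축조공사" (38 rows) and looks like a data-entry typo. One way to surface such cases, sketched here with the standard library's `difflib`, is to compare each label against the more frequent labels and report close matches. The helper `likely_misspellings` and the 0.8 similarity cutoff are illustrative assumptions, not something the report prescribes.

```python
import difflib

# Counts copied from the Common Values table for 대상사업 (top 10 labels only).
label_counts = {
    "건축물축조공사": 38,
    "토목공사": 10,
    "도장공사": 7,
    "콘크리트제품제조업": 5,
    "토사석광업(골재보관,판매업)": 4,
    "건설폐기물처리업": 4,
    "금속주조업": 4,
    "건축물해체공사": 4,
    "건추물축조공사": 2,
    "조경공사": 2,
}


def likely_misspellings(counts, cutoff=0.8):
    """Map each label to a more frequent label that it closely resembles."""
    suggestions = {}
    for label, n in counts.items():
        # Only compare against labels that occur strictly more often.
        candidates = [other for other, m in counts.items() if m > n]
        match = difflib.get_close_matches(label, candidates, n=1, cutoff=cutoff)
        if match:
            suggestions[label] = match[0]
    return suggestions


suggestions = likely_misspellings(label_counts)
for rare, common in suggestions.items():
    print(f"{rare} -> {common}")
```

Any suggestion still needs a manual check before merging, since distinct business types such as 건축물해체공사 and 건축물축조공사 can also share most of their characters.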

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
건축물축조공사 | 41 | 41.4%
토목공사 | 13 | 13.1%
도장공사 | 7 | 7.1%
콘크리트제품제조업 | 6 | 6.1%
토사석광업(골재보관,판매업 | 4 | 4.0%
건설폐기물처리업 | 4 | 4.0%
금속주조업 | 4 | 4.0%
건축물해체공사 | 4 | 4.0%
조경공사 | 2 | 2.0%
콘크리트제품제조업(벽돌제조 | 2 | 2.0%
Other values (11) | 12 | 12.1%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 860.0 B
Minimum: 2023-05-11 00:00:00
Maximum: 2023-05-11 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

 | 연번 | 사업장명 | 현장소재지 | 발생사업 | 대상사업
연번 | 1.000 | 1.000 | 0.945 | 0.615 | 0.819
사업장명 | 1.000 | 1.000 | 0.999 | 1.000 | 1.000
현장소재지 | 0.945 | 0.999 | 1.000 | 1.000 | 1.000
발생사업 | 0.615 | 1.000 | 1.000 | 1.000 | 0.966
대상사업 | 0.819 | 1.000 | 1.000 | 0.966 | 1.000

 | 발생사업 | 대상사업
발생사업 | 1.000 | 0.785
대상사업 | 0.785 | 1.000

 | 연번 | 발생사업 | 대상사업
연번 | 1.000 | 0.301 | 0.360
발생사업 | 0.301 | 1.000 | 0.785
대상사업 | 0.360 | 0.785 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 사업장명 | 현장소재지 | 발생사업 | 대상사업 | 데이터기준일자
0 | 1 | 주식회사 장원레미콘(인천공장지점) | 인천광역시 남동구 논현동 442-34 | 시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업, 비금속물질의 채취 ,제조,가공업 | 콘크리트제품제조업, 비금속광물 분쇄물 생산업 | 2023-05-11
1 | 2 | 대영건설산업(주) | 인천광역시 남동구 논현동 66-12 | 시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업 | 콘크리트제품제조업 | 2023-05-11
2 | 3 | 세화골재 | 인천광역시 남동구 만수동 986-18 | 비금속물질의 채취,제조,가공업 | 토사석광업(골재보관,판매업) | 2023-05-11
3 | 4 | 영광골재 | 인천광역시 남동구 간석동 406-2 | 비금속물질의 채취,제조,가공업 | 토사석광업(골재보관,판매업) | 2023-05-11
4 | 5 | ㈜인조상사 | 인천광역시 남동구 구월동 1128-3 | 비금속물질의 채취,제조,가공업 | 토사석광업(골재보관,판매업) | 2023-05-11
5 | 6 | 성진레미콘㈜ | 인천광역시 남동구 고잔동 166-7 | 시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업, 비금속물질의 채취 ,제조,가공업 | 콘크리트제품제조업 | 2023-05-11
6 | 7 | 서울콘크리트 | 인천광역시 남동구 서창동 318,222,226 | 시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업 | 콘크리트제품제조업(벽돌제조) | 2023-05-11
7 | 8 | 삼화건재 | 인천광역시 남동구 만수동 1071,1071-7,1071-8 | 시멘트,석회,플라스터(Plaster) 및 시멘트관련 제품의 제조 및 가공업 | 콘크리트제품제조업(벽돌제조) | 2023-05-11
8 | 9 | ㈜대인골재 | 인천광역시 남동구 구월동 1139-16 | 비금속물질의 채취,제조,가공업 | 토사석광업(골재보관,판매업) | 2023-05-11
9 | 10 | (주)세종인바먼트 | 인천광역시 남동구 고잔동 512-2 | 비금속물질의 채취 ,제조,가공업 | 건설폐기물처리업 | 2023-05-11

(index) | 연번 | 사업장명 | 현장소재지 | 발생사업 | 대상사업 | 데이터기준일자
81 | 82 | 금성건설(주) | 인천광역시 남동구 논현동 767-16 | 건설업 | 건축물축조공사 | 2023-05-11
82 | 83 | 아이엔지건설(주) | 장도로(소1~4호선) ~ 포구로(대3~78호선) 일원 | 건설업 | 토목공사(굴정공사) | 2023-05-11
83 | 84 | 삼창엔지니어링(주) | 인천광역시 남동구 간석동 408-1 | 건설업 | 도장공사 | 2023-05-11
84 | 85 | (주)원나인건설 | 인천광역시 남동구 서창동 680번지, 논현동 111-47번지, 논현동 111-11번지 일원 | 건설업 | 토목공사 | 2023-05-11
85 | 86 | 주연건설주식회사 | 인천광역시 남동구 구월동 1513 | 건설업 | 도장공사 | 2023-05-11
86 | 87 | 에스비종합건설주식회사 | 인천광역시 남동구 수산동 470-6 | 건설업 | 건축물축조공사, 토목공사 | 2023-05-11
87 | 88 | 다인건설(주) | 인천광역시 남동구 구월동 1476 | 건설업 | 도장공사 | 2023-05-11
88 | 89 | 서원건설 주식회사 | 인천광역시 남동구 간석동 179-6 외 2필지 | 건설업 | 건축물축조공사 | 2023-05-11
89 | 90 | 성강종합건설(주) | 인천광역시 남동구 고잔동 372-11 | 건설업 | 건축물축조공사 | 2023-05-11
90 | 91 | (주)렉스건설 | 인천광역시 남동구 만수동 857 외 7필지 | 건설업 | 건축물축조공사 | 2023-05-11