Overview

Dataset statistics

Number of variables: 4
Number of observations: 360
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.7 KiB
Average record size in memory: 33.4 B

Variable types

Numeric: 1
Text: 2
Categorical: 1

Dataset

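Description: This dataset lists the facilities in Seo-gu, Daegu Metropolitan City that emit air pollutants. It includes each facility's business name, road-name address, and related fields.
Author: Seo-gu, Daegu Metropolitan City (대구광역시 서구)
URL: https://www.data.go.kr/data/15088617/fileData.do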

Alerts

데이터기준일자 has constant value "2024-03-01" (Constant)
번호 has unique values (Unique)
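
The two alerts above follow from distinct-value counts: a column with one distinct value is constant, and a column whose distinct count equals the row count is unique. The following is a minimal sketch of that rule, not the profiler's own implementation; the function name `column_alerts` is hypothetical and the rows are copied from the report's sample.

```python
from typing import Dict, List


def column_alerts(rows: List[Dict[str, str]]) -> Dict[str, str]:
    """Label each column 'Constant', 'Unique', or '' from its distinct-value count."""
    alerts = {}
    for column in rows[0]:
        values = [row[column] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            alerts[column] = "Constant"   # every row holds the same value
        elif distinct == len(values):
            alerts[column] = "Unique"     # no value repeats
        else:
            alerts[column] = ""
    return alerts


# Two columns from the first three sample rows of the report.
rows = [
    {"번호": "1", "데이터기준일자": "2024-03-01"},
    {"번호": "2", "데이터기준일자": "2024-03-01"},
    {"번호": "3", "데이터기준일자": "2024-03-01"},
]
print(column_alerts(rows))
```

A constant column such as 데이터기준일자 carries no per-row information, and a unique column such as 번호 behaves like a row identifier.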

Reproduction

Analysis started: 2024-03-16 06:37:32.182039
Analysis finished: 2024-03-16 06:37:34.129075
Duration: 1.95 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
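
The reported duration can be checked from the two timestamps above. This is a small sketch using only the Python standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2024-03-16 06:37:32.182039")
finished = datetime.fromisoformat("2024-03-16 06:37:34.129075")

# Elapsed wall-clock time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```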

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 360
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 180.5
Minimum: 1
Maximum: 360
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 18.95
Q1: 90.75
Median: 180.5
Q3: 270.25
95-th percentile: 342.05
Maximum: 360
Range: 359
Interquartile range (IQR): 179.5
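
Because 번호 runs from 1 to 360 with no gaps, the quantiles above can be reproduced by hand. The sketch below assumes linearly interpolated quantiles (an assumption about the profiler's method, although the reported values are consistent with it); the helper `quantile` is hypothetical.

```python
def quantile(sorted_values, q):
    """Linearly interpolated quantile of an ascending list, with q in [0, 1]."""
    position = q * (len(sorted_values) - 1)      # fractional index into the list
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


values = list(range(1, 361))  # 번호 takes every integer from 1 to 360

q05, q1, median, q3, q95 = (quantile(values, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95))
iqr = q3 - q1
value_range = values[-1] - values[0]
print(q05, q1, median, q3, q95, iqr, value_range)
```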

Descriptive statistics

Standard deviation: 104.06729
Coefficient of variation (CV): 0.57655006
Kurtosis: -1.2
Mean: 180.5
Median Absolute Deviation (MAD): 90
Skewness: 0
Sum: 64980
Variance: 10830
Monotonicity: Strictly increasing
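
Several of the descriptive statistics above can be recomputed with the standard library, since the column is simply the integers 1 to 360. This sketch assumes the reported variance is the sample variance (divisor n - 1), which matches the reported value of 10830.

```python
import statistics

values = list(range(1, 361))  # 번호 takes every integer from 1 to 360

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divisor n - 1
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

print(total, mean, variance, round(std, 5), round(cv, 8), mad)
```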
Histogram with fixed size bins (bins=50)
Common values

Value | Count | Frequency (%)
1 | 1 | 0.3%
249 | 1 | 0.3%
247 | 1 | 0.3%
246 | 1 | 0.3%
245 | 1 | 0.3%
244 | 1 | 0.3%
243 | 1 | 0.3%
242 | 1 | 0.3%
241 | 1 | 0.3%
240 | 1 | 0.3%
Other values (350) | 350 | 97.2%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.3%
2 | 1 | 0.3%
3 | 1 | 0.3%
4 | 1 | 0.3%
5 | 1 | 0.3%
6 | 1 | 0.3%
7 | 1 | 0.3%
8 | 1 | 0.3%
9 | 1 | 0.3%
10 | 1 | 0.3%

Maximum 10 values

Value | Count | Frequency (%)
360 | 1 | 0.3%
359 | 1 | 0.3%
358 | 1 | 0.3%
357 | 1 | 0.3%
356 | 1 | 0.3%
355 | 1 | 0.3%
354 | 1 | 0.3%
353 | 1 | 0.3%
352 | 1 | 0.3%
351 | 1 | 0.3%

사업장명
Text

Distinct: 356
Distinct (%): 98.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Length

Max length: 22
Median length: 20
Mean length: 7.1222222
Min length: 2

Characters and Unicode

Total characters: 2564
Distinct characters: 278
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
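
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's `unicodedata` module for three business names from the report's sample. The category codes map to the names used in this report: `Lo` is Other Letter, `Ps` is Open Punctuation, and `Pe` is Close Punctuation. The helper name `category_counts` is hypothetical, and this is not necessarily how the profiler computes its tables.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# First three sample values of 사업장명.
names = ["(주)미앤부티", "삼일테크", "보성종합정비(주)"]
counts = category_counts("".join(names))
print(counts)
```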

Unique

Unique: 352
Unique (%): 97.8%
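
The report distinguishes Distinct (number of different values) from Unique (number of values that occur exactly once), which is why the two counts differ here. A minimal sketch of the difference, using a made-up toy list and a hypothetical helper `distinct_and_unique`:

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique


# Toy list for illustration only: one name repeated, two names appearing once.
toy = ["A상사", "A상사", "B금속", "C테크"]
print(distinct_and_unique(toy))

# Percentages as reported for this column, out of 360 observations.
distinct_pct = round(356 / 360 * 100, 1)
unique_pct = round(352 / 360 * 100, 1)
print(distinct_pct, unique_pct)
```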

Sample

1st row: (주)미앤부티
2nd row: 삼일테크
3rd row: 보성종합정비(주)
4th row: (유)대안에이엔씨
5th row: (주)대경에프엔티
Value | Count | Frequency (%)
주식회사 | 6 | 1.5%
중앙모터스 | 3 | 0.8%
주)미앤부티 | 2 | 0.5%
주)동진상사 | 2 | 0.5%
모터스 | 2 | 0.5%
현대다이텍(주 | 2 | 0.5%
비산공장 | 2 | 0.5%
dyetec연구원 | 2 | 0.5%
이현공장 | 2 | 0.5%
화성금속 | 2 | 0.5%
Other values (366) | 366 | 93.6%

Most occurring characters

ValueCountFrequency (%)
( 165
 
6.4%
) 165
 
6.4%
165
 
6.4%
76
 
3.0%
75
 
2.9%
65
 
2.5%
57
 
2.2%
53
 
2.1%
52
 
2.0%
49
 
1.9%
Other values (268) 1642
64.0%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2138 | 83.4%
Open Punctuation | 165 | 6.4%
Close Punctuation | 165 | 6.4%
Uppercase Letter | 47 | 1.8%
Space Separator | 31 | 1.2%
Decimal Number | 12 | 0.5%
Other Symbol | 3 | 0.1%
Dash Punctuation | 2 | 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
165
 
7.7%
76
 
3.6%
75
 
3.5%
65
 
3.0%
57
 
2.7%
53
 
2.5%
52
 
2.4%
49
 
2.3%
40
 
1.9%
40
 
1.9%
Other values (239) 1466
68.6%
Uppercase Letter
ValueCountFrequency (%)
E 6
12.8%
C 5
10.6%
T 5
10.6%
D 4
 
8.5%
I 3
 
6.4%
Y 3
 
6.4%
F 3
 
6.4%
N 3
 
6.4%
P 2
 
4.3%
M 2
 
4.3%
Other values (10) 11
23.4%
Decimal Number
ValueCountFrequency (%)
1 8
66.7%
2 3
 
25.0%
3 1
 
8.3%
Open Punctuation
ValueCountFrequency (%)
( 165
100.0%
Close Punctuation
ValueCountFrequency (%)
) 165
100.0%
Space Separator
ValueCountFrequency (%)
31
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2141 | 83.5%
Common | 376 | 14.7%
Latin | 47 | 1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
165
 
7.7%
76
 
3.5%
75
 
3.5%
65
 
3.0%
57
 
2.7%
53
 
2.5%
52
 
2.4%
49
 
2.3%
40
 
1.9%
40
 
1.9%
Other values (240) 1469
68.6%
Latin
ValueCountFrequency (%)
E 6
12.8%
C 5
10.6%
T 5
10.6%
D 4
 
8.5%
I 3
 
6.4%
Y 3
 
6.4%
F 3
 
6.4%
N 3
 
6.4%
P 2
 
4.3%
M 2
 
4.3%
Other values (10) 11
23.4%
Common
ValueCountFrequency (%)
( 165
43.9%
) 165
43.9%
31
 
8.2%
1 8
 
2.1%
2 3
 
0.8%
- 2
 
0.5%
3 1
 
0.3%
& 1
 
0.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2138 | 83.4%
ASCII | 423 | 16.5%
None | 3 | 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 165
39.0%
) 165
39.0%
31
 
7.3%
1 8
 
1.9%
E 6
 
1.4%
C 5
 
1.2%
T 5
 
1.2%
D 4
 
0.9%
I 3
 
0.7%
Y 3
 
0.7%
Other values (18) 28
 
6.6%
Hangul
ValueCountFrequency (%)
165
 
7.7%
76
 
3.6%
75
 
3.5%
65
 
3.0%
57
 
2.7%
53
 
2.5%
52
 
2.4%
49
 
2.3%
40
 
1.9%
40
 
1.9%
Other values (239) 1466
68.6%
None
ValueCountFrequency (%)
3
100.0%

도로명소재지
Text

Distinct: 353
Distinct (%): 98.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Length

Max length: 49
Median length: 43
Mean length: 25.261111
Min length: 21

Characters and Unicode

Total characters: 9094
Distinct characters: 84
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 346
Unique (%): 96.1%

Sample

1st row: 대구광역시 서구 염색공단로 108 (비산동)
2nd row: 대구광역시 서구 염색공단천로1길 14 (비산동)
3rd row: 대구광역시 서구 북비산로 87 (이현동)
4th row: 대구광역시 서구 염색공단로21길 8 (비산동)
5th row: 대구광역시 서구 문화로4길 16 (이현동)
Value | Count | Frequency (%)
대구광역시 | 360 | 19.7%
서구 | 360 | 19.7%
비산동 | 152 | 8.3%
이현동 | 121 | 6.6%
중리동 | 64 | 3.5%
와룡로 | 22 | 1.2%
달서천로 | 19 | 1.0%
염색공단로 | 18 | 1.0%
평리동 | 17 | 0.9%
염색공단중앙로 | 15 | 0.8%
Other values (283) | 680 | 37.2%
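
The word counts above look like the result of splitting each address on whitespace and trimming punctuation from the ends of each token, so that "(비산동)" is counted as "비산동". The sketch below reproduces that behaviour on the five sample addresses. It is an approximation under that assumption, not the profiler's actual code, and the helper `word_counts` is hypothetical.

```python
import string
from collections import Counter


def word_counts(texts):
    """Count whitespace-separated tokens after trimming leading/trailing punctuation."""
    counts = Counter()
    for text in texts:
        for token in text.split():
            token = token.strip(string.punctuation)  # "(비산동)" -> "비산동"
            if token:
                counts[token] += 1
    return counts


# The five sample rows of 도로명소재지 shown in this report.
addresses = [
    "대구광역시 서구 염색공단로 108 (비산동)",
    "대구광역시 서구 염색공단천로1길 14 (비산동)",
    "대구광역시 서구 북비산로 87 (이현동)",
    "대구광역시 서구 염색공단로21길 8 (비산동)",
    "대구광역시 서구 문화로4길 16 (이현동)",
]
counts = word_counts(addresses)
print(counts.most_common(4))
```

Trimming only at the token edges also explains tokens such as "주)미앤부티" in the 사업장명 word table: the leading "(" is removed, but the inner ")" is kept.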

Most occurring characters

ValueCountFrequency (%)
1484
 
16.3%
731
 
8.0%
388
 
4.3%
372
 
4.1%
( 366
 
4.0%
) 365
 
4.0%
364
 
4.0%
360
 
4.0%
360
 
4.0%
360
 
4.0%
Other values (74) 3944
43.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 5543 | 61.0%
Space Separator | 1484 | 16.3%
Decimal Number | 1267 | 13.9%
Open Punctuation | 366 | 4.0%
Close Punctuation | 365 | 4.0%
Dash Punctuation | 69 | 0.8%
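
The category counts list 366 opening parentheses but only 365 closing ones, which suggests that at least one address has unbalanced parentheses. A simple check like the following could locate such rows. The function name `unbalanced_parentheses` is hypothetical, and the second address is a made-up malformed example, not a value taken from the dataset.

```python
def unbalanced_parentheses(addresses):
    """Return the addresses whose '(' and ')' counts differ."""
    return [a for a in addresses if a.count("(") != a.count(")")]


addresses = [
    "대구광역시 서구 염색공단로 108 (비산동)",   # from the report's sample rows
    "대구광역시 서구 예시로 1 ((비산동)",        # hypothetical malformed value
]
suspects = unbalanced_parentheses(addresses)
print(suspects)
```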

Most frequent character per category

Other Letter
ValueCountFrequency (%)
731
 
13.2%
388
 
7.0%
372
 
6.7%
364
 
6.6%
360
 
6.5%
360
 
6.5%
360
 
6.5%
360
 
6.5%
231
 
4.2%
186
 
3.4%
Other values (60) 1831
33.0%
Decimal Number
ValueCountFrequency (%)
1 288
22.7%
2 196
15.5%
3 162
12.8%
7 107
 
8.4%
6 104
 
8.2%
4 102
 
8.1%
9 85
 
6.7%
0 78
 
6.2%
8 77
 
6.1%
5 68
 
5.4%
Space Separator
ValueCountFrequency (%)
1484
100.0%
Open Punctuation
ValueCountFrequency (%)
( 366
100.0%
Close Punctuation
ValueCountFrequency (%)
) 365
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 69
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 5543 | 61.0%
Common | 3551 | 39.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
731
 
13.2%
388
 
7.0%
372
 
6.7%
364
 
6.6%
360
 
6.5%
360
 
6.5%
360
 
6.5%
360
 
6.5%
231
 
4.2%
186
 
3.4%
Other values (60) 1831
33.0%
Common
ValueCountFrequency (%)
1484
41.8%
( 366
 
10.3%
) 365
 
10.3%
1 288
 
8.1%
2 196
 
5.5%
3 162
 
4.6%
7 107
 
3.0%
6 104
 
2.9%
4 102
 
2.9%
9 85
 
2.4%
Other values (4) 292
 
8.2%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 5543 | 61.0%
ASCII | 3551 | 39.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1484
41.8%
( 366
 
10.3%
) 365
 
10.3%
1 288
 
8.1%
2 196
 
5.5%
3 162
 
4.6%
7 107
 
3.0%
6 104
 
2.9%
4 102
 
2.9%
9 85
 
2.4%
Other values (4) 292
 
8.2%
Hangul
ValueCountFrequency (%)
731
 
13.2%
388
 
7.0%
372
 
6.7%
364
 
6.6%
360
 
6.5%
360
 
6.5%
360
 
6.5%
360
 
6.5%
231
 
4.2%
186
 
3.4%
Other values (60) 1831
33.0%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.9 KiB

Value | Count
2024-03-01 | 360

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2024-03-01
2nd row: 2024-03-01
3rd row: 2024-03-01
4th row: 2024-03-01
5th row: 2024-03-01

Common Values

Value | Count | Frequency (%)
2024-03-01 | 360 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2024-03-01 | 360 | 100.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Index | 번호 | 사업장명 | 도로명소재지 | 데이터기준일자
0 | 1 | (주)미앤부티 | 대구광역시 서구 염색공단로 108 (비산동) | 2024-03-01
1 | 2 | 삼일테크 | 대구광역시 서구 염색공단천로1길 14 (비산동) | 2024-03-01
2 | 3 | 보성종합정비(주) | 대구광역시 서구 북비산로 87 (이현동) | 2024-03-01
3 | 4 | (유)대안에이엔씨 | 대구광역시 서구 염색공단로21길 8 (비산동) | 2024-03-01
4 | 5 | (주)대경에프엔티 | 대구광역시 서구 문화로4길 16 (이현동) | 2024-03-01
5 | 6 | 대구염색산업단지관리공단(열병합발전소) | 대구광역시 서구 염색공단중앙로 33 (평리동) | 2024-03-01
6 | 7 | 우진표면테크 | 대구광역시 서구 와룡로90길 8-6 (이현동) | 2024-03-01
7 | 8 | (주)태영산업 | 대구광역시 서구 염색공단천로12길 6 (비산동) | 2024-03-01
8 | 9 | (주)에스엔에스 | 대구광역시 서구 달서천로 168 (평리동) | 2024-03-01
9 | 10 | 신창염직(주) | 대구광역시 서구 염색공단로 104 (비산동) | 2024-03-01

Last rows

Index | 번호 | 사업장명 | 도로명소재지 | 데이터기준일자
350 | 351 | 휠플레이 | 대구광역시 서구 북비산로 113-8 (평리동) | 2024-03-01
351 | 352 | (주)대성도어몰딩(3공장) | 대구광역시 서구 문화로23길 26 (이현동) | 2024-03-01
352 | 353 | 대동산업 | 대구광역시 서구 국채보상로23길 13 (이현동) | 2024-03-01
353 | 354 | 경동정비특장 | 대구광역시 서구 염색공단천로14길 15-6 (비산동) | 2024-03-01
354 | 355 | 대호정비 | 대구광역시 서구 북비산로 77-6 (이현동) | 2024-03-01
355 | 356 | 대서양분체 | 대구광역시 서구 염색공단천로10길 18 (비산동) | 2024-03-01
356 | 357 | (주)오성우드스틸 | 대구광역시 서구 와룡로98길 50 (이현동) | 2024-03-01
357 | 358 | 아진TEX | 대구광역시 서구 달서천로 76 (평리동) | 2024-03-01
358 | 359 | 태산ENC | 대구광역시 서구 염색공단천로14길 16-11 외 1필지 (비산동) | 2024-03-01
359 | 360 | 가람모터스 | 대구광역시 서구 염색공단로5길 21 (비산동) | 2024-03-01