Overview

Dataset statistics

Number of variables: 5
Number of observations: 155
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.3 KiB
Average record size in memory: 41.9 B

Variable types

Numeric: 1
Text: 3
Categorical: 1

Dataset

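Description: Data on facilities that emit volatile organic compounds (VOCs) in Seo-gu, Incheon Metropolitan City, including the facility name, location address, and industry type.
Author: Seo-gu, Incheon Metropolitan City (인천광역시 서구)
URL: https://www.data.go.kr/data/15090913/fileData.do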

Alerts

데이터기준일자 has constant value "2022-08-31" (Constant)
연번 has unique values (Unique)
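
The two alerts correspond to simple per-column checks. The following is a minimal sketch in plain Python, not the profiler's own implementation; the helper names `is_constant` and `is_unique` and the three-row `rows` list are illustrative assumptions.

```python
def is_constant(values):
    """True when the column holds exactly one distinct value."""
    return len(set(values)) == 1


def is_unique(values):
    """True when no value occurs more than once."""
    values = list(values)
    return len(set(values)) == len(values)


# Three rows shaped like the dataset (hypothetical subset for illustration).
rows = [
    {"연번": 1, "데이터기준일자": "2022-08-31"},
    {"연번": 2, "데이터기준일자": "2022-08-31"},
    {"연번": 3, "데이터기준일자": "2022-08-31"},
]

columns = list(rows[0])
constant_columns = [c for c in columns if is_constant(r[c] for r in rows)]
unique_columns = [c for c in columns if is_unique(r[c] for r in rows)]
print("constant:", constant_columns)
print("unique:", unique_columns)
```

A constant column carries no information for analysis, and a fully unique column usually acts as a row identifier, which is why both are flagged.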

Reproduction

Analysis started: 2024-04-17 11:34:34.883925
Analysis finished: 2024-04-17 11:34:35.282500
Duration: 0.4 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 155
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 78
Minimum: 1
Maximum: 155
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.5 KiB

Quantile statistics

Minimum: 1
5-th percentile: 8.7
Q1: 39.5
Median: 78
Q3: 116.5
95-th percentile: 147.3
Maximum: 155
Range: 154
Interquartile range (IQR): 77

Descriptive statistics

Standard deviation: 44.888751
Coefficient of variation (CV): 0.57549681
Kurtosis: -1.2
Mean: 78
Median Absolute Deviation (MAD): 39
Skewness: 0
Sum: 12090
Variance: 2015
Monotonicity: Strictly increasing
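
Most of these figures can be cross-checked with the standard library. The sketch below again assumes 연번 is the integer sequence 1 to 155 and uses the sample (n - 1) variance, which matches the reported 2015; skewness and kurtosis are left out.

```python
import math
import statistics

serial = list(range(1, 156))  # assumption: 연번 runs 1..155 without gaps

total = sum(serial)
mean = statistics.mean(serial)
variance = statistics.variance(serial)  # sample variance, divides by n - 1
std = statistics.stdev(serial)          # square root of the sample variance
cv = std / mean                         # coefficient of variation
median = statistics.median(serial)
mad = statistics.median(abs(x - median) for x in serial)
strictly_increasing = all(a < b for a, b in zip(serial, serial[1:]))

print(total, mean, variance, round(std, 6), round(cv, 8), mad, strictly_increasing)
```

For the integers 1 to n the sample variance equals n(n + 1)/12, which gives 155 × 156 / 12 = 2015 and agrees with the table.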
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.6%
108 | 1 | 0.6%
101 | 1 | 0.6%
102 | 1 | 0.6%
103 | 1 | 0.6%
104 | 1 | 0.6%
105 | 1 | 0.6%
106 | 1 | 0.6%
107 | 1 | 0.6%
109 | 1 | 0.6%
Other values (145) | 145 | 93.5%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.6%
2 | 1 | 0.6%
3 | 1 | 0.6%
4 | 1 | 0.6%
5 | 1 | 0.6%
6 | 1 | 0.6%
7 | 1 | 0.6%
8 | 1 | 0.6%
9 | 1 | 0.6%
10 | 1 | 0.6%

Maximum 10 values

Value | Count | Frequency (%)
155 | 1 | 0.6%
154 | 1 | 0.6%
153 | 1 | 0.6%
152 | 1 | 0.6%
151 | 1 | 0.6%
150 | 1 | 0.6%
149 | 1 | 0.6%
148 | 1 | 0.6%
147 | 1 | 0.6%
146 | 1 | 0.6%

사업장명 (facility name)
Text

Distinct: 149
Distinct (%): 96.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 65
Median length: 47
Mean length: 9.7096774
Min length: 2

Characters and Unicode

Total characters: 1505
Distinct characters: 215
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
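
As a rough illustration of how such properties can be read per character, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is an assumption, and the sample string is one of this variable's sample values.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


sample = "세진산업(2022.03.22. 멸실확인)"
counts = category_counts(sample)

# Two-letter codes map to the names used in the report:
# Lo = Other Letter, Nd = Decimal Number, Po = Other Punctuation,
# Ps = Open Punctuation, Pe = Close Punctuation, Zs = Space Separator.
for code, n in counts.most_common():
    print(code, n)
```

Hangul syllables fall under Other Letter (`Lo`), while the enclosed symbol ㈜ is classified as Other Symbol (`So`), which is why company names contribute to both categories.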

Unique

Unique: 143
Unique (%): 92.3%

Sample

1st row: 대성산업㈜인천주유소
2nd row: 세진산업(2022.03.22. 멸실확인)
3rd row: ㈜혜성환경지점
4th row: 유창금속(휴업:2017.06.13~사업장재운영시까지)
5th row: 코오롱인더스트리(주)인천공장

Value | Count | Frequency (%)
㈜보문 | 3 | 1.5%
(not recovered) | 3 | 1.5%
기주산업㈜ | 2 | 1.0%
㈜자연에너지 | 2 | 1.0%
현대오일뱅크㈜직영 | 2 | 1.0%
인천플러스 | 2 | 1.0%
현대금속 | 2 | 1.0%
진흥산업 | 2 | 1.0%
한국금속 | 2 | 1.0%
㈜성진로지스 | 2 | 1.0%
Other values (177) | 177 | 88.9%

Most occurring characters

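Value | Count | Frequency (%)
(not recovered) | 77 | 5.1%
(not recovered) | 74 | 4.9%
(not recovered) | 69 | 4.6%
(not recovered) | 65 | 4.3%
2 | 48 | 3.2%
(space) | 46 | 3.1%
. | 40 | 2.7%
0 | 38 | 2.5%
) | 37 | 2.5%
( | 37 | 2.5%
Other values (205) | 974 | 64.7%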

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1072 | 71.2%
Decimal Number | 155 | 10.3%
Other Symbol | 69 | 4.6%
Other Punctuation | 54 | 3.6%
Space Separator | 46 | 3.1%
Close Punctuation | 38 | 2.5%
Open Punctuation | 38 | 2.5%
Uppercase Letter | 22 | 1.5%
Math Symbol | 7 | 0.5%
Dash Punctuation | 4 | 0.3%

Most frequent character per category

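Other Letter
Value | Count | Frequency (%)
(not recovered) | 77 | 7.2%
(not recovered) | 74 | 6.9%
(not recovered) | 65 | 6.1%
(not recovered) | 32 | 3.0%
(not recovered) | 24 | 2.2%
(not recovered) | 22 | 2.1%
(not recovered) | 22 | 2.1%
(not recovered) | 20 | 1.9%
(not recovered) | 18 | 1.7%
(not recovered) | 18 | 1.7%
Other values (179) | 700 | 65.3%

Decimal Number
Value | Count | Frequency (%)
2 | 48 | 31.0%
0 | 38 | 24.5%
1 | 27 | 17.4%
8 | 10 | 6.5%
3 | 10 | 6.5%
6 | 7 | 4.5%
7 | 5 | 3.2%
5 | 5 | 3.2%
9 | 4 | 2.6%
4 | 1 | 0.6%

Uppercase Letter
Value | Count | Frequency (%)
S | 7 | 31.8%
K | 7 | 31.8%
I | 3 | 13.6%
C | 3 | 13.6%
H | 2 | 9.1%

Other Punctuation
Value | Count | Frequency (%)
. | 40 | 74.1%
: | 9 | 16.7%
, | 5 | 9.3%

Close Punctuation
Value | Count | Frequency (%)
) | 37 | 97.4%
] | 1 | 2.6%

Open Punctuation
Value | Count | Frequency (%)
( | 37 | 97.4%
[ | 1 | 2.6%

Other Symbol
Value | Count | Frequency (%)
(not recovered) | 69 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 46 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 7 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 4 | 100.0%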

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1141 | 75.8%
Common | 342 | 22.7%
Latin | 22 | 1.5%

Most frequent character per script

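Hangul
Value | Count | Frequency (%)
(not recovered) | 77 | 6.7%
(not recovered) | 74 | 6.5%
(not recovered) | 69 | 6.0%
(not recovered) | 65 | 5.7%
(not recovered) | 32 | 2.8%
(not recovered) | 24 | 2.1%
(not recovered) | 22 | 1.9%
(not recovered) | 22 | 1.9%
(not recovered) | 20 | 1.8%
(not recovered) | 18 | 1.6%
Other values (180) | 718 | 62.9%

Common
Value | Count | Frequency (%)
2 | 48 | 14.0%
(space) | 46 | 13.5%
. | 40 | 11.7%
0 | 38 | 11.1%
) | 37 | 10.8%
( | 37 | 10.8%
1 | 27 | 7.9%
8 | 10 | 2.9%
3 | 10 | 2.9%
: | 9 | 2.6%
Other values (10) | 40 | 11.7%

Latin
Value | Count | Frequency (%)
S | 7 | 31.8%
K | 7 | 31.8%
I | 3 | 13.6%
C | 3 | 13.6%
H | 2 | 9.1%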

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1072 | 71.2%
ASCII | 364 | 24.2%
None | 69 | 4.6%

Most frequent character per block

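Hangul
Value | Count | Frequency (%)
(not recovered) | 77 | 7.2%
(not recovered) | 74 | 6.9%
(not recovered) | 65 | 6.1%
(not recovered) | 32 | 3.0%
(not recovered) | 24 | 2.2%
(not recovered) | 22 | 2.1%
(not recovered) | 22 | 2.1%
(not recovered) | 20 | 1.9%
(not recovered) | 18 | 1.7%
(not recovered) | 18 | 1.7%
Other values (179) | 700 | 65.3%

None
Value | Count | Frequency (%)
(not recovered) | 69 | 100.0%

ASCII
Value | Count | Frequency (%)
2 | 48 | 13.2%
(space) | 46 | 12.6%
. | 40 | 11.0%
0 | 38 | 10.4%
) | 37 | 10.2%
( | 37 | 10.2%
1 | 27 | 7.4%
8 | 10 | 2.7%
3 | 10 | 2.7%
: | 9 | 2.5%
Other values (15) | 62 | 17.0%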

소재지(지번) (lot-number address)
Text

Distinct: 149
Distinct (%): 96.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 95
Median length: 32
Mean length: 20.883871
Min length: 15

Characters and Unicode

Total characters: 3237
Distinct characters: 77
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 146
Unique (%): 94.2%

Sample

1st row: 인천광역시 서구 가좌동 111-4
2nd row: 인천광역시 서구 가좌동 173-99
3rd row: 인천광역시 서구 가좌동 602-36(A-301,302)
4th row: 인천광역시 서구 가좌동 178-35(1호)
5th row: 인천광역시 서구 가좌동 294

Value | Count | Frequency (%)
인천광역시 | 155 | 23.6%
서구 | 155 | 23.6%
가좌동 | 57 | 8.7%
석남동 | 30 | 4.6%
원창동 | 13 | 2.0%
금곡동 | 7 | 1.1%
경서동 | 6 | 0.9%
연희동 | 5 | 0.8%
당하동 | 5 | 0.8%
173-91 | 4 | 0.6%
Other values (192) | 221 | 33.6%
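
The counts in this table (155 each for 인천광역시 and 서구) suggest that it tallies whitespace-separated tokens rather than whole address strings. The sketch below shows that kind of tally on the five sample rows listed for this variable; treating plain whitespace as the only delimiter is an assumption about how the report tokenizes.

```python
from collections import Counter

# The five sample rows listed for this variable.
addresses = [
    "인천광역시 서구 가좌동 111-4",
    "인천광역시 서구 가좌동 173-99",
    "인천광역시 서구 가좌동 602-36(A-301,302)",
    "인천광역시 서구 가좌동 178-35(1호)",
    "인천광역시 서구 가좌동 294",
]

# Split each address on whitespace and count every token.
token_counts = Counter(token for address in addresses for token in address.split())
total_tokens = sum(token_counts.values())

for token, count in token_counts.most_common(3):
    print(token, count, f"{count / total_tokens:.1%}")
```

Because the city and district prefix is shared by every row, those two tokens dominate the table, and the neighborhood (dong) names are the first informative level.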

Most occurring characters

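Value | Count | Frequency (%)
(space) | 506 | 15.6%
(not recovered) | 163 | 5.0%
1 | 162 | 5.0%
(not recovered) | 157 | 4.9%
- | 157 | 4.9%
(not recovered) | 156 | 4.8%
(not recovered) | 155 | 4.8%
(not recovered) | 155 | 4.8%
(not recovered) | 155 | 4.8%
(not recovered) | 155 | 4.8%
Other values (67) | 1316 | 40.7%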

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1621 | 50.1%
Decimal Number | 870 | 26.9%
Space Separator | 506 | 15.6%
Dash Punctuation | 157 | 4.9%
Other Punctuation | 28 | 0.9%
Open Punctuation | 26 | 0.8%
Close Punctuation | 26 | 0.8%
Uppercase Letter | 3 | 0.1%

Most frequent character per category

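Other Letter
Value | Count | Frequency (%)
(not recovered) | 163 | 10.1%
(not recovered) | 157 | 9.7%
(not recovered) | 156 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 67 | 4.1%
(not recovered) | 59 | 3.6%
Other values (48) | 244 | 15.1%

Decimal Number
Value | Count | Frequency (%)
1 | 162 | 18.6%
3 | 121 | 13.9%
2 | 111 | 12.8%
5 | 86 | 9.9%
4 | 74 | 8.5%
7 | 74 | 8.5%
0 | 72 | 8.3%
6 | 68 | 7.8%
9 | 54 | 6.2%
8 | 48 | 5.5%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 33.3%
G | 1 | 33.3%
S | 1 | 33.3%

Other Punctuation
Value | Count | Frequency (%)
, | 25 | 89.3%
. | 3 | 10.7%

Space Separator
Value | Count | Frequency (%)
(space) | 506 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 157 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 26 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 26 | 100.0%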

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1621 | 50.1%
Common | 1613 | 49.8%
Latin | 3 | 0.1%

Most frequent character per script

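Hangul
Value | Count | Frequency (%)
(not recovered) | 163 | 10.1%
(not recovered) | 157 | 9.7%
(not recovered) | 156 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 67 | 4.1%
(not recovered) | 59 | 3.6%
Other values (48) | 244 | 15.1%

Common
Value | Count | Frequency (%)
(space) | 506 | 31.4%
1 | 162 | 10.0%
- | 157 | 9.7%
3 | 121 | 7.5%
2 | 111 | 6.9%
5 | 86 | 5.3%
4 | 74 | 4.6%
7 | 74 | 4.6%
0 | 72 | 4.5%
6 | 68 | 4.2%
Other values (6) | 182 | 11.3%

Latin
Value | Count | Frequency (%)
A | 1 | 33.3%
G | 1 | 33.3%
S | 1 | 33.3%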

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1621 | 50.1%
ASCII | 1616 | 49.9%

Most frequent character per block

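ASCII
Value | Count | Frequency (%)
(space) | 506 | 31.3%
1 | 162 | 10.0%
- | 157 | 9.7%
3 | 121 | 7.5%
2 | 111 | 6.9%
5 | 86 | 5.3%
4 | 74 | 4.6%
7 | 74 | 4.6%
0 | 72 | 4.5%
6 | 68 | 4.2%
Other values (9) | 185 | 11.4%

Hangul
Value | Count | Frequency (%)
(not recovered) | 163 | 10.1%
(not recovered) | 157 | 9.7%
(not recovered) | 156 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 155 | 9.6%
(not recovered) | 67 | 4.1%
(not recovered) | 59 | 3.6%
Other values (48) | 244 | 15.1%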

소재지(도로명) (road-name address)
Text

Distinct: 145
Distinct (%): 93.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Length

Max length: 37
Median length: 33
Mean length: 23.56129
Min length: 9

Characters and Unicode

Total characters: 3652
Distinct characters: 87
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 141
Unique (%): 91.0%

Sample

1st row: 인천광역시 서구 가남로 181(가좌동)
2nd row: 인천광역시 서구 건지로 122(가좌동)
3rd row: 인천광역시 서구 중봉대로198번길 33, A-301,302(가좌동)
4th row: 인천광역시 서구 백범로910번길 49, 1호(가좌동)
5th row: 인천광역시 서구 백범로 680(가좌동)

Value | Count | Frequency (%)
인천광역시 | 155 | 23.7%
서구 | 155 | 23.7%
건지로 | 19 | 2.9%
봉수대로 | 15 | 2.3%
24 | 12 | 1.8%
백범로 | 12 | 1.8%
중봉대로240번길 | 12 | 1.8%
서곶로 | 10 | 1.5%
112 | 9 | 1.4%
미부여 | 7 | 1.1%
Other values (199) | 249 | 38.0%

Most occurring characters

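Value | Count | Frequency (%)
(space) | 506 | 13.9%
(not recovered) | 174 | 4.8%
(not recovered) | 156 | 4.3%
(not recovered) | 155 | 4.2%
(not recovered) | 155 | 4.2%
(not recovered) | 155 | 4.2%
(not recovered) | 155 | 4.2%
(not recovered) | 155 | 4.2%
(not recovered) | 155 | 4.2%
(not recovered) | 148 | 4.1%
Other values (77) | 1738 | 47.6%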

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2195 | 60.1%
Decimal Number | 614 | 16.8%
Space Separator | 506 | 13.9%
Close Punctuation | 146 | 4.0%
Open Punctuation | 146 | 4.0%
Other Punctuation | 33 | 0.9%
Dash Punctuation | 11 | 0.3%
Uppercase Letter | 1 | < 0.1%

Most frequent character per category

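Other Letter
Value | Count | Frequency (%)
(not recovered) | 174 | 7.9%
(not recovered) | 156 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 148 | 6.7%
(not recovered) | 71 | 3.2%
Other values (61) | 716 | 32.6%

Decimal Number
Value | Count | Frequency (%)
1 | 119 | 19.4%
2 | 101 | 16.4%
4 | 88 | 14.3%
3 | 63 | 10.3%
0 | 62 | 10.1%
6 | 45 | 7.3%
5 | 42 | 6.8%
8 | 35 | 5.7%
9 | 33 | 5.4%
7 | 26 | 4.2%

Space Separator
Value | Count | Frequency (%)
(space) | 506 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 146 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 146 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 33 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 11 | 100.0%

Uppercase Letter
Value | Count | Frequency (%)
A | 1 | 100.0%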

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2195 | 60.1%
Common | 1456 | 39.9%
Latin | 1 | < 0.1%

Most frequent character per script

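Hangul
Value | Count | Frequency (%)
(not recovered) | 174 | 7.9%
(not recovered) | 156 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 148 | 6.7%
(not recovered) | 71 | 3.2%
Other values (61) | 716 | 32.6%

Common
Value | Count | Frequency (%)
(space) | 506 | 34.8%
) | 146 | 10.0%
( | 146 | 10.0%
1 | 119 | 8.2%
2 | 101 | 6.9%
4 | 88 | 6.0%
3 | 63 | 4.3%
0 | 62 | 4.3%
6 | 45 | 3.1%
5 | 42 | 2.9%
Other values (5) | 138 | 9.5%

Latin
Value | Count | Frequency (%)
A | 1 | 100.0%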

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2195 | 60.1%
ASCII | 1457 | 39.9%

Most frequent character per block

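ASCII
Value | Count | Frequency (%)
(space) | 506 | 34.7%
) | 146 | 10.0%
( | 146 | 10.0%
1 | 119 | 8.2%
2 | 101 | 6.9%
4 | 88 | 6.0%
3 | 63 | 4.3%
0 | 62 | 4.3%
6 | 45 | 3.1%
5 | 42 | 2.9%
Other values (6) | 139 | 9.5%

Hangul
Value | Count | Frequency (%)
(not recovered) | 174 | 7.9%
(not recovered) | 156 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 155 | 7.1%
(not recovered) | 148 | 6.7%
(not recovered) | 71 | 3.2%
Other values (61) | 716 | 32.6%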

데이터기준일자 (data reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB
2022-08-31: 155

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2022-08-31
2nd row: 2022-08-31
3rd row: 2022-08-31
4th row: 2022-08-31
5th row: 2022-08-31

Common Values

Value | Count | Frequency (%)
2022-08-31 | 155 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2022-08-31 | 155 | 100.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
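
Both plots summarize the same per-cell check. A minimal sketch of that check in plain Python follows, assuming rows are dicts and that only `None` or an empty string counts as missing; the helper names and the three-row subset (taken from the sample rows of this report) are illustrative.

```python
COLUMNS = ["연번", "사업장명", "소재지(지번)", "소재지(도로명)", "데이터기준일자"]


def is_missing(value):
    """Assumed rule: only None and the empty string count as missing."""
    return value is None or value == ""


def missing_counts(rows, columns):
    """Number of missing cells per column (the bar-chart view)."""
    return {c: sum(1 for r in rows if is_missing(r.get(c))) for c in columns}


def nullity_matrix(rows, columns):
    """One list of booleans per row, True where a value is present (the matrix view)."""
    return [[not is_missing(r.get(c)) for c in columns] for r in rows]


rows = [
    dict(zip(COLUMNS, [1, "대성산업㈜인천주유소", "인천광역시 서구 가좌동 111-4",
                       "인천광역시 서구 가남로 181(가좌동)", "2022-08-31"])),
    dict(zip(COLUMNS, [5, "코오롱인더스트리(주)인천공장", "인천광역시 서구 가좌동 294",
                       "인천광역시 서구 백범로 680(가좌동)", "2022-08-31"])),
    dict(zip(COLUMNS, [154, "서일석유㈜북항서일주유소", "인천광역시 서구 원창동 393-102",
                       "인천광역시 서구 미부여", "2022-08-31"])),
]

counts = missing_counts(rows, COLUMNS)
matrix = nullity_matrix(rows, COLUMNS)
print(counts)
```

Under this rule a placeholder such as 미부여 ("not assigned") in the road-name address counts as present, which is consistent with the report showing zero missing cells even though that token appears in the data.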

Sample

First rows

(index) | 연번 | 사업장명 | 소재지(지번) | 소재지(도로명) | 데이터기준일자
0 | 1 | 대성산업㈜인천주유소 | 인천광역시 서구 가좌동 111-4 | 인천광역시 서구 가남로 181(가좌동) | 2022-08-31
1 | 2 | 세진산업(2022.03.22. 멸실확인) | 인천광역시 서구 가좌동 173-99 | 인천광역시 서구 건지로 122(가좌동) | 2022-08-31
2 | 3 | ㈜혜성환경지점 | 인천광역시 서구 가좌동 602-36(A-301,302) | 인천광역시 서구 중봉대로198번길 33, A-301,302(가좌동) | 2022-08-31
3 | 4 | 유창금속(휴업:2017.06.13~사업장재운영시까지) | 인천광역시 서구 가좌동 178-35(1호) | 인천광역시 서구 백범로910번길 49, 1호(가좌동) | 2022-08-31
4 | 5 | 코오롱인더스트리(주)인천공장 | 인천광역시 서구 가좌동 294 | 인천광역시 서구 백범로 680(가좌동) | 2022-08-31
5 | 6 | 기주산업㈜ | 인천광역시 서구 석남동 650-55 | 인천광역시 서구 중봉대로 300-12(석남동) | 2022-08-31
6 | 7 | 부일기업 | 인천광역시 서구 석남동 223-310 | 인천광역시 서구 건지로153번길 40(석남동) | 2022-08-31
7 | 8 | 강한기업(휴업:2022.02.25 ~ 2023.02.21.) | 인천광역시 서구 가좌동 178-148 | 인천광역시 서구 백범로934번길 38-10(가좌동) | 2022-08-31
8 | 9 | ㈜디어포스 | 인천광역시 서구 가좌동 290 | 인천광역시 서구 가좌로83번길 52(가좌동) | 2022-08-31
9 | 10 | SK인천석유화학㈜ | 인천광역시 서구 원창동 100 | 인천광역시 서구 봉수대로 415(원창동)외 1 | 2022-08-31

Last rows

(index) | 연번 | 사업장명 | 소재지(지번) | 소재지(도로명) | 데이터기준일자
145 | 146 | 테라스크린 | 인천광역시 서구 원당동 848-5, 상가 13,104,105 | 인천광역시 서구 고산로40번길 11(원당동) | 2022-08-31
146 | 147 | (주)광역주유소 | 인천광역시 서구 백석동 72-3 | 인천광역시 서구 서곶로 723(백석동) | 2022-08-31
147 | 148 | 현대오일뱅크㈜직영 가정셀프주유소 | 인천광역시 서구 원창동 산 9-23 | 인천광역시 서구 봉수대로 541(원창동) | 2022-08-31
148 | 149 | 티에이치에너지㈜인천현대셀프 | 인천광역시 서구 가좌동 585-71번지 외1필지 | 인천광역시 서구 백범로 818 (가좌동) | 2022-08-31
149 | 150 | SK에너지㈜아라웨이주유소 | 인천광역시 서구 오류동 1545-5 | 인천광역시 서구 뱃길로 17 | 2022-08-31
150 | 151 | 아트샵가좌IC주유소 | 인천광역시 서구 가좌동 291-7 | 인천광역시 서구 백범로 708(가좌동) | 2022-08-31
151 | 152 | 우진목재공업(주)우진주유소(임대: 우진주유소, 김응수, 기간: 2023.08.20.) | 인천광역시 서구 가좌동 162-8 | 인천광역시 서구 가정로 63(가좌동) | 2022-08-31
152 | 153 | (유)중앙자동차학원(지점) | 인천광역시 서구 백석동 83 | 인천광역시 서구 드림로 285(백석동) | 2022-08-31
153 | 154 | 서일석유㈜북항서일주유소 | 인천광역시 서구 원창동 393-102 | 인천광역시 서구 미부여 | 2022-08-31
154 | 155 | 당하동주유소 | 인천광역시 서구 당하동 748-2 외 1필지 | 인천광역시 서구 미부여 | 2022-08-31