Overview

Dataset statistics

Number of variables4
Number of observations250
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.2 KiB
Average record size in memory33.5 B

Variable types

Numeric1
Text2
DateTime1

Dataset

Description인천광역시 남동구 환경오염물질 중 대기오염물질 배출업소 허가 현황에 대한 데이터로 사업장명, 업종 등을 제공합니다.
Author인천광역시 남동구
URLhttps://www.data.go.kr/data/15113402/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 has unique valuesUnique

Reproduction

Analysis started2024-04-21 02:20:14.563585
Analysis finished2024-04-21 02:20:15.708872
Duration1.15 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct250
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean125.5
Minimum1
Maximum250
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2024-04-21T11:20:15.866511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.45
Q163.25
median125.5
Q3187.75
95-th percentile237.55
Maximum250
Range249
Interquartile range (IQR)124.5

Descriptive statistics

Standard deviation72.312977
Coefficient of variation (CV)0.57619902
Kurtosis-1.2
Mean125.5
Median Absolute Deviation (MAD)62.5
Skewness0
Sum31375
Variance5229.1667
MonotonicityStrictly increasing
2024-04-21T11:20:16.190253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
173 1
 
0.4%
160 1
 
0.4%
161 1
 
0.4%
162 1
 
0.4%
163 1
 
0.4%
164 1
 
0.4%
165 1
 
0.4%
166 1
 
0.4%
167 1
 
0.4%
Other values (240) 240
96.0%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
250 1
0.4%
249 1
0.4%
248 1
0.4%
247 1
0.4%
246 1
0.4%
245 1
0.4%
244 1
0.4%
243 1
0.4%
242 1
0.4%
241 1
0.4%
Distinct249
Distinct (%)99.6%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-04-21T11:20:16.565392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length18
Mean length7.304
Min length2

Characters and Unicode

Total characters1826
Distinct characters270
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique248 ?
Unique (%)99.2%

Sample

1st row인천정비주식회사
2nd row㈜장원(인천공장지점)
3rd row남동서비스 기아오토큐㈜
4th row㈜세한코팅
5th row(주)라인퍼니처
ValueCountFrequency (%)
주식회사 11
 
3.7%
㈜한국금거래소에프티씨 3
 
1.0%
제일금속 2
 
0.7%
인천지역본부 2
 
0.7%
인천 2
 
0.7%
㈜에스쓰리알 2
 
0.7%
인천지점 2
 
0.7%
지점 2
 
0.7%
유한회사 2
 
0.7%
㈜제이엘이 2
 
0.7%
Other values (263) 264
89.8%
2024-04-21T11:20:16.977426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
102
 
5.6%
61
 
3.3%
51
 
2.8%
49
 
2.7%
48
 
2.6%
45
 
2.5%
33
 
1.8%
30
 
1.6%
30
 
1.6%
30
 
1.6%
Other values (260) 1347
73.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1605
87.9%
Other Symbol 102
 
5.6%
Space Separator 45
 
2.5%
Open Punctuation 26
 
1.4%
Close Punctuation 26
 
1.4%
Uppercase Letter 13
 
0.7%
Decimal Number 8
 
0.4%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
61
 
3.8%
51
 
3.2%
49
 
3.1%
48
 
3.0%
33
 
2.1%
30
 
1.9%
30
 
1.9%
30
 
1.9%
30
 
1.9%
29
 
1.8%
Other values (242) 1214
75.6%
Uppercase Letter
ValueCountFrequency (%)
A 2
15.4%
O 2
15.4%
R 2
15.4%
C 2
15.4%
M 1
7.7%
T 1
7.7%
S 1
7.7%
L 1
7.7%
B 1
7.7%
Decimal Number
ValueCountFrequency (%)
1 4
50.0%
3 2
25.0%
4 1
 
12.5%
2 1
 
12.5%
Other Symbol
ValueCountFrequency (%)
102
100.0%
Space Separator
ValueCountFrequency (%)
45
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1707
93.5%
Common 106
 
5.8%
Latin 13
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
102
 
6.0%
61
 
3.6%
51
 
3.0%
49
 
2.9%
48
 
2.8%
33
 
1.9%
30
 
1.8%
30
 
1.8%
30
 
1.8%
30
 
1.8%
Other values (243) 1243
72.8%
Latin
ValueCountFrequency (%)
A 2
15.4%
O 2
15.4%
R 2
15.4%
C 2
15.4%
M 1
7.7%
T 1
7.7%
S 1
7.7%
L 1
7.7%
B 1
7.7%
Common
ValueCountFrequency (%)
45
42.5%
( 26
24.5%
) 26
24.5%
1 4
 
3.8%
3 2
 
1.9%
4 1
 
0.9%
2 1
 
0.9%
. 1
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1605
87.9%
ASCII 119
 
6.5%
None 102
 
5.6%

Most frequent character per block

None
ValueCountFrequency (%)
102
100.0%
Hangul
ValueCountFrequency (%)
61
 
3.8%
51
 
3.2%
49
 
3.1%
48
 
3.0%
33
 
2.1%
30
 
1.9%
30
 
1.9%
30
 
1.9%
30
 
1.9%
29
 
1.8%
Other values (242) 1214
75.6%
ASCII
ValueCountFrequency (%)
45
37.8%
( 26
21.8%
) 26
21.8%
1 4
 
3.4%
A 2
 
1.7%
3 2
 
1.7%
O 2
 
1.7%
R 2
 
1.7%
C 2
 
1.7%
4 1
 
0.8%
Other values (7) 7
 
5.9%

업종
Text

Distinct171
Distinct (%)68.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2024-04-21T11:20:17.197929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length40
Mean length15.136
Min length3

Characters and Unicode

Total characters3784
Distinct characters180
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique136 ?
Unique (%)54.4%

Sample

1st row자동차정비업
2nd row시멘트,석회,프라스틱제품제조
3rd row운수장비수선및세차
4th row조립금속제품제조
5th row목재제품
ValueCountFrequency (%)
35
 
5.8%
기타 32
 
5.3%
지정외 24
 
4.0%
제조업 19
 
3.2%
18
 
3.0%
폐기물처리업(38210 15
 
2.5%
폐기물처리업 14
 
2.3%
금속원료재생업 12
 
2.0%
금속원료 11
 
1.8%
10
 
1.7%
Other values (231) 409
68.3%
2024-04-21T11:20:17.558594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
349
 
9.2%
276
 
7.3%
133
 
3.5%
2 129
 
3.4%
126
 
3.3%
( 110
 
2.9%
) 110
 
2.9%
105
 
2.8%
3 101
 
2.7%
97
 
2.6%
Other values (170) 2248
59.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2648
70.0%
Decimal Number 540
 
14.3%
Space Separator 349
 
9.2%
Open Punctuation 110
 
2.9%
Close Punctuation 110
 
2.9%
Other Punctuation 24
 
0.6%
Uppercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
276
 
10.4%
133
 
5.0%
126
 
4.8%
105
 
4.0%
97
 
3.7%
97
 
3.7%
93
 
3.5%
76
 
2.9%
69
 
2.6%
65
 
2.5%
Other values (153) 1511
57.1%
Decimal Number
ValueCountFrequency (%)
2 129
23.9%
3 101
18.7%
1 95
17.6%
8 61
11.3%
0 60
11.1%
9 51
 
9.4%
4 18
 
3.3%
5 15
 
2.8%
6 7
 
1.3%
7 3
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
P 1
33.3%
C 1
33.3%
B 1
33.3%
Space Separator
ValueCountFrequency (%)
349
100.0%
Open Punctuation
ValueCountFrequency (%)
( 110
100.0%
Close Punctuation
ValueCountFrequency (%)
) 110
100.0%
Other Punctuation
ValueCountFrequency (%)
, 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2648
70.0%
Common 1133
29.9%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
276
 
10.4%
133
 
5.0%
126
 
4.8%
105
 
4.0%
97
 
3.7%
97
 
3.7%
93
 
3.5%
76
 
2.9%
69
 
2.6%
65
 
2.5%
Other values (153) 1511
57.1%
Common
ValueCountFrequency (%)
349
30.8%
2 129
 
11.4%
( 110
 
9.7%
) 110
 
9.7%
3 101
 
8.9%
1 95
 
8.4%
8 61
 
5.4%
0 60
 
5.3%
9 51
 
4.5%
, 24
 
2.1%
Other values (4) 43
 
3.8%
Latin
ValueCountFrequency (%)
P 1
33.3%
C 1
33.3%
B 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2648
70.0%
ASCII 1136
30.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
349
30.7%
2 129
 
11.4%
( 110
 
9.7%
) 110
 
9.7%
3 101
 
8.9%
1 95
 
8.4%
8 61
 
5.4%
0 60
 
5.3%
9 51
 
4.5%
, 24
 
2.1%
Other values (7) 46
 
4.0%
Hangul
ValueCountFrequency (%)
276
 
10.4%
133
 
5.0%
126
 
4.8%
105
 
4.0%
97
 
3.7%
97
 
3.7%
93
 
3.5%
76
 
2.9%
69
 
2.6%
65
 
2.5%
Other values (153) 1511
57.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
Minimum2024-03-29 00:00:00
Maximum2024-03-29 00:00:00
2024-04-21T11:20:17.674784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T11:20:17.785891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-04-21T11:20:15.160786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2024-04-21T11:20:15.346702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T11:20:15.590673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번사업장명업종데이터기준일자
01인천정비주식회사자동차정비업2024-03-29
12㈜장원(인천공장지점)시멘트,석회,프라스틱제품제조2024-03-29
23남동서비스 기아오토큐㈜운수장비수선및세차2024-03-29
34㈜세한코팅조립금속제품제조2024-03-29
45(주)라인퍼니처목재제품2024-03-29
56대영건설산업주식회사레미콘제조업2024-03-29
67㈜서해엠에스티도장및기타피막처리업2024-03-29
78연수자동차서비스자동차정비업2024-03-29
89동화상사금속제품제조업(도장)2024-03-29
910성진레미콘 주식회사비금속광물제품제조업2024-03-29
연번사업장명업종데이터기준일자
240241㈜고려솔더기타비철금속압연압출및연신제품제조업(24229)2024-03-29
241242㈜하나리사이클링 제2공장지정외폐기물처리업(38210)2024-03-29
242243㈜제이엘이 기업부설연구소비철금속제조업(24290)2024-03-29
243244㈜한국금거래소에프티씨(3공장)비철금속제련정련및합금제조업(24219)2024-03-29
244245㈜두원필름플라스틱필름제조업2024-03-29
245246테슬라코리아 유한회사자동차전문수리업2024-03-29
246247㈜현대에코텍 고잔사업소지정외폐기물처리업(38210) 비금속원료재생업(38322)2024-03-29
247248국제케미칼지정외폐기물처리업(38210) 비금속원료재생업(38322)2024-03-29
248249㈜와이에이치물산지정외폐기물처리업(38210) 비금속원료재생업(38322)2024-03-29
249250메타일렉트로㈜금속류원료재생업(38312) 기타기초무기화합물제조업(20129) 분말야금제품제조업(25911)2024-03-29