Overview

Dataset statistics

Number of variables: 3
Number of observations: 100
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.6 KiB
Average record size in memory: 26.3 B
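The two memory figures above are related by simple arithmetic. The following is a minimal sketch, assuming the total is the average record size multiplied by the number of observations and that 1 KiB is 1024 B; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block of the report.
n_observations = 100
avg_record_bytes = 26.3  # reported average record size in memory (B)

# Multiply back up to a total, then convert bytes to KiB (1 KiB = 1024 B).
total_bytes = n_observations * avg_record_bytes
total_kib = total_bytes / 1024

print(f"{total_bytes:.0f} B is about {total_kib:.1f} KiB")
```

Rounded to one decimal place, the result agrees with the reported total size.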

Variable types

Numeric: 1
Text: 2

Dataset

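Description: Data on the permit status of noise-generating and noise-emitting business sites, one class of environmental pollutant sources, in Namdong-gu, Incheon Metropolitan City. It provides the business name (사업장명), the industry type (업종), and related fields.
Author: Namdong-gu, Incheon Metropolitan City (인천광역시 남동구)
URL: https://www.data.go.kr/data/15113430/fileData.do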

Alerts

연번 (serial number) has unique values: Unique

Reproduction

Analysis started: 2024-04-21 02:20:31.376196
Analysis finished: 2024-04-21 02:20:33.554963
Duration: 2.18 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
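
The reported duration can be checked against the two timestamps. This is a small sketch using only the Python standard library; it assumes both timestamps are in the same time zone.

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
started = datetime.fromisoformat("2024-04-21 02:20:31.376196")
finished = datetime.fromisoformat("2024-04-21 02:20:33.554963")

# Subtracting two datetimes yields a timedelta.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```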

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 100
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 50.5
Minimum: 1
Maximum: 100
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 5.95
Q1: 25.75
Median: 50.5
Q3: 75.25
95-th percentile: 95.05
Maximum: 100
Range: 99
Interquartile range (IQR): 49.5
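
The quantile figures above are consistent with linear interpolation over the integers 1 to 100. The sketch below is an illustration under that assumption: the raw column is not part of this report, so `values` is reconstructed from the summary (100 distinct values, minimum 1, maximum 100, sum 5050), and the standard library's `inclusive` method stands in for whatever the profiling tool uses internally.

```python
import statistics

# Assumption: 연번 holds the integers 1..100 (reconstructed, not read from the data file).
values = list(range(1, 101))

# method="inclusive" interpolates linearly between the minimum and maximum.
ventiles = statistics.quantiles(values, n=20, method="inclusive")  # 5% steps
quartiles = statistics.quantiles(values, n=4, method="inclusive")

p5, p95 = ventiles[0], ventiles[-1]
q1, median, q3 = quartiles
iqr = q3 - q1
value_range = max(values) - min(values)

print(p5, q1, median, q3, p95, iqr, value_range)
```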

Descriptive statistics

Standard deviation: 29.011492
Coefficient of variation (CV): 0.57448499
Kurtosis: -1.2
Mean: 50.5
Median Absolute Deviation (MAD): 25
Skewness: 0
Sum: 5050
Variance: 841.66667
Monotonicity: Strictly increasing
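
Several of the descriptive statistics can be recomputed with the Python standard library. The sketch below again assumes that 연번 is exactly the integers 1 to 100, and that the variance and standard deviation are the sample versions (divisor n - 1), which is what the reported figures match. Skewness and kurtosis are left out because the standard library has no direct function for them.

```python
import statistics

# Assumption: 연번 holds the integers 1..100 (reconstructed from the summary).
values = list(range(1, 101))

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divisor n - 1
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median(abs(v - median) for v in values)
# Strictly increasing: every value is larger than the one before it.
strictly_increasing = all(a < b for a, b in zip(values, values[1:]))

print(mean, variance, std, cv, mad, sum(values), strictly_increasing)
```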
Histogram with fixed size bins (bins=50)
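
A fixed-size binning of this kind can be sketched as follows. This assumes 50 equal-width bins spanning the minimum to the maximum and, as before, that 연번 is the integers 1 to 100; the binning rule is an illustration, not necessarily the one the plotting library applies.

```python
# Assumption: 연번 holds the integers 1..100 (reconstructed from the summary).
values = list(range(1, 101))
bins = 50
lo, hi = min(values), max(values)

counts = [0] * bins
for v in values:
    # Scale v into [0, bins]; the maximum lands on the right edge,
    # so clamp it into the last bin.
    index = min(int((v - lo) * bins / (hi - lo)), bins - 1)
    counts[index] += 1

print(counts)
```

Under these assumptions every bin holds two values, which corresponds to a flat histogram.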
Common values
Value              Count  Frequency (%)
1                  1      1.0%
65                 1      1.0%
75                 1      1.0%
74                 1      1.0%
73                 1      1.0%
72                 1      1.0%
71                 1      1.0%
70                 1      1.0%
69                 1      1.0%
68                 1      1.0%
Other values (90)  90     90.0%
Minimum 10 values
Value  Count  Frequency (%)
1      1      1.0%
2      1      1.0%
3      1      1.0%
4      1      1.0%
5      1      1.0%
6      1      1.0%
7      1      1.0%
8      1      1.0%
9      1      1.0%
10     1      1.0%
Maximum 10 values
Value  Count  Frequency (%)
100    1      1.0%
99     1      1.0%
98     1      1.0%
97     1      1.0%
96     1      1.0%
95     1      1.0%
94     1      1.0%
93     1      1.0%
92     1      1.0%
91     1      1.0%
사업장명 (business name)
Text

Distinct: 99
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 16
Median length: 11
Mean length: 5.54
Min length: 3

Characters and Unicode

Total characters: 554
Distinct characters: 162
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
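
As an illustration of these character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses only the five sample values listed for this variable, so the counts cover those five strings and not the whole column. The standard library exposes the general category but not the script or block, so those two breakdowns are not reproduced here.

```python
import unicodedata
from collections import Counter

# The five sample values shown for 사업장명 in this report.
sample_names = ["문방산업사", "우성산업", "㈜세한코팅", "대영건설산업주식회사", "㈜서해엠에스티"]

# Two-letter general categories, e.g. "Lo" = Other Letter, "So" = Other Symbol.
category_counts = Counter(
    unicodedata.category(ch) for name in sample_names for ch in name
)

for category, count in category_counts.most_common():
    print(category, count)
```

Hangul syllables fall under Other Letter (`Lo`), while the parenthesized corporate mark ㈜ is an Other Symbol (`So`).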

Unique

Unique: 98
Unique (%): 98.0%
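
The gap between 99 distinct and 98 unique values is what results when exactly one value occurs twice among 100 rows. The sketch below illustrates the two counts on hypothetical data; the `business_…` names and the duplicated entry are invented for the example and are not taken from the dataset.

```python
from collections import Counter

# Hypothetical data: 98 names that occur once, plus one name that occurs twice.
names = [f"business_{i}" for i in range(98)] + ["duplicated_business"] * 2

counts = Counter(names)
distinct = len(counts)                               # different values present
unique = sum(1 for c in counts.values() if c == 1)   # values occurring exactly once

print(len(names), distinct, unique)
```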

Sample

1st row: 문방산업사
2nd row: 우성산업
3rd row: ㈜세한코팅
4th row: 대영건설산업주식회사
5th row: ㈜서해엠에스티
Value              Count  Frequency (%)
세진정공            2      1.9%
주식회사            2      1.9%
피디에스테크         1      1.0%
오성산업            1      1.0%
대광정밀고무         1      1.0%
아주자동차공업사      1      1.0%
㈜창안              1      1.0%
㈜세인아이엔디       1      1.0%
선주실업            1      1.0%
성진쇼트            1      1.0%
Other values (92)  92     88.5%

Most occurring characters

Value               Count  Frequency (%)
                    36     6.5%
                    23     4.2%
                    19     3.4%
                    18     3.2%
                    14     2.5%
                    13     2.3%
                    12     2.2%
                    10     1.8%
                    10     1.8%
                    10     1.8%
Other values (152)  389    70.2%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       496    89.5%
Other Symbol       36     6.5%
Open Punctuation   6      1.1%
Close Punctuation  6      1.1%
Uppercase Letter   5      0.9%
Space Separator    4      0.7%
Decimal Number     1      0.2%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
                    23     4.6%
                    19     3.8%
                    18     3.6%
                    14     2.8%
                    13     2.6%
                    12     2.4%
                    10     2.0%
                    10     2.0%
                    10     2.0%
                    9      1.8%
Other values (143)  358    72.2%

Uppercase Letter
Value  Count  Frequency (%)
A      2      40.0%
B      1      20.0%
R      1      20.0%
L      1      20.0%

Other Symbol
Value  Count  Frequency (%)
       36     100.0%

Open Punctuation
Value  Count  Frequency (%)
(      6      100.0%

Close Punctuation
Value  Count  Frequency (%)
)      6      100.0%

Space Separator
Value    Count  Frequency (%)
(space)  4      100.0%

Decimal Number
Value  Count  Frequency (%)
3      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  532    96.0%
Common  17     3.1%
Latin   5      0.9%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
                    36     6.8%
                    23     4.3%
                    19     3.6%
                    18     3.4%
                    14     2.6%
                    13     2.4%
                    12     2.3%
                    10     1.9%
                    10     1.9%
                    10     1.9%
Other values (144)  367    69.0%

Common
Value    Count  Frequency (%)
(        6      35.3%
)        6      35.3%
(space)  4      23.5%
3        1      5.9%

Latin
Value  Count  Frequency (%)
A      2      40.0%
B      1      20.0%
R      1      20.0%
L      1      20.0%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  496    89.5%
None    36     6.5%
ASCII   22     4.0%

Most frequent character per block

None
Value  Count  Frequency (%)
       36     100.0%

Hangul
Value               Count  Frequency (%)
                    23     4.6%
                    19     3.8%
                    18     3.6%
                    14     2.8%
                    13     2.6%
                    12     2.4%
                    10     2.0%
                    10     2.0%
                    10     2.0%
                    9      1.8%
Other values (143)  358    72.2%

ASCII
Value    Count  Frequency (%)
(        6      27.3%
)        6      27.3%
(space)  4      18.2%
A        2      9.1%
B        1      4.5%
R        1      4.5%
L        1      4.5%
3        1      4.5%

업종 (industry type)
Text

Distinct: 81
Distinct (%): 81.0%
Missing: 0
Missing (%): 0.0%
Memory size: 932.0 B

Length

Max length: 45
Median length: 25
Mean length: 13.21
Min length: 4

Characters and Unicode

Total characters: 1321
Distinct characters: 141
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 69
Unique (%): 69.0%

Sample

1st row: 기타화학
2nd row: 금속제품
3rd row: 조립금속제품제조
4th row: 레미콘제조업
5th row: 도장및기타피막처리업
Value               Count  Frequency (%)
                    21     8.9%
기타                 16     6.8%
제조업               15     6.3%
그외                 6      2.5%
                    6      2.5%
도장                 6      2.5%
                    5      2.1%
도장및기타피막처리업    4      1.7%
금속제품              4      1.7%
금속제품제조업         4      1.7%
Other values (123)  150    63.3%

Most occurring characters

Value               Count  Frequency (%)
(space)             137    10.4%
                    105    7.9%
                    92     7.0%
                    76     5.8%
                    61     4.6%
                    42     3.2%
                    42     3.2%
                    36     2.7%
                    35     2.6%
2                   32     2.4%
Other values (131)  663    50.2%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       1033   78.2%
Space Separator    137    10.4%
Decimal Number     100    7.6%
Open Punctuation   21     1.6%
Close Punctuation  21     1.6%
Other Punctuation  9      0.7%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
                    105    10.2%
                    92     8.9%
                    76     7.4%
                    61     5.9%
                    42     4.1%
                    42     4.1%
                    36     3.5%
                    35     3.4%
                    32     3.1%
                    23     2.2%
Other values (118)  489    47.3%

Decimal Number
Value  Count  Frequency (%)
2      32     32.0%
9      22     22.0%
3      14     14.0%
1      10     10.0%
0      6      6.0%
8      6      6.0%
4      5      5.0%
5      5      5.0%

Other Punctuation
Value  Count  Frequency (%)
,      8      88.9%
.      1      11.1%

Space Separator
Value    Count  Frequency (%)
(space)  137    100.0%

Open Punctuation
Value  Count  Frequency (%)
(      21     100.0%

Close Punctuation
Value  Count  Frequency (%)
)      21     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  1033   78.2%
Common  288    21.8%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
                    105    10.2%
                    92     8.9%
                    76     7.4%
                    61     5.9%
                    42     4.1%
                    42     4.1%
                    36     3.5%
                    35     3.4%
                    32     3.1%
                    23     2.2%
Other values (118)  489    47.3%

Common
Value             Count  Frequency (%)
(space)           137    47.6%
2                 32     11.1%
9                 22     7.6%
(                 21     7.3%
)                 21     7.3%
3                 14     4.9%
1                 10     3.5%
,                 8      2.8%
0                 6      2.1%
8                 6      2.1%
Other values (3)  11     3.8%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  1033   78.2%
ASCII   288    21.8%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
(space)           137    47.6%
2                 32     11.1%
9                 22     7.6%
(                 21     7.3%
)                 21     7.3%
3                 14     4.9%
1                 10     3.5%
,                 8      2.8%
0                 6      2.1%
8                 6      2.1%
Other values (3)  11     3.8%

Hangul
Value               Count  Frequency (%)
                    105    10.2%
                    92     8.9%
                    76     7.4%
                    61     5.9%
                    42     4.1%
                    42     4.1%
                    36     3.5%
                    35     3.4%
                    32     3.1%
                    23     2.2%
Other values (118)  489    47.3%

Interactions


Correlations

           연번     사업장명   업종
연번        1.000   0.940     0.858
사업장명     0.940   1.000     0.997
업종        0.858   0.997     1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
    연번   사업장명                업종
0   1     문방산업사               기타화학
1   2     우성산업                 금속제품
2   3     ㈜세한코팅               조립금속제품제조
3   4     대영건설산업주식회사       레미콘제조업
4   5     ㈜서해엠에스티            도장및기타피막처리업
5   6     동화상사                 금속제품제조업(도장)
6   7     (주)엔에스               운동및경기용구제조업
7   8     명성산업                 도장및기타피막처리업
8   9     삼정기업                 금속제품
9   10    영진금속                 금속표면처리업

Last rows
    연번   사업장명                      업종
90  91    ㈜신원에스엔제이                목재 보존, 방부처리, 도장 및 유사처리업
91  92    ㈜엠알메탈로                   지정 폐기물처리업(38220)
92  93    천광특수강                     도금, 착색 및 기타표면처리 강재 제조업(24191)
93  94    ㈜서평코리아                   탭, 밸브 및 유사장치 제조업
94  95    웅지시스템                     절삭가공 및 유사처리업(25924)
95  96    ㈜우신                        주형 및 금형제조업(29294) 그 외 기타 특수 목적용 기계 제조업(29299)
96  97    정원씨앤씨                     그 외 기타 금속가공업(25929)
97  98    온세화학(빅본)                 산업용 그외 비경화고무제품제조업
98  99    ㈜고려솔더                     기타비철금속압연압출및연신제품제조업
99  100   ㈜한국금거래소에프티씨(3공장)    비철금속제련정련및합금제조업