Overview

Dataset statistics

Number of variables: 8
Number of observations: 31
Missing cells: 1
Missing cells (%): 0.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.1 KiB
Average record size in memory: 70.1 B
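
The percentage and per-record figures above follow from the raw counts. A minimal sketch of that arithmetic, assuming the missing-cell share is taken over all cells (variables × observations); the helper name `pct` is illustrative and not part of the report:

```python
# Counts copied from the dataset statistics above.
n_variables = 8
n_observations = 31
missing_cells = 1

def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)

# Assumption: the missing-cell share is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_cells_pct = pct(missing_cells, total_cells)

# Cross-check: average record size times row count should match the total size.
approx_total_kib = 70.1 * n_observations / 1024

print(f"Missing cells (%): {missing_cells_pct}%")
print(f"Approximate total size: {approx_total_kib:.1f} KiB")
```

Both results agree with the reported 0.4% and 2.1 KiB after rounding.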

Variable types

Numeric: 2
Categorical: 2
Text: 3
DateTime: 1

Dataset

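Description: Data on business-site waste generator registrations in Gyeyang-gu, Incheon Metropolitan City. It provides the category (구분), business name (상호명), waste type (폐기물 종류), contact number (연락처), business-site road-name address (사업장도로명주소), reporting base year (신고기준년도), and related fields.
Author: Gyeyang-gu, Incheon Metropolitan City (인천광역시 계양구)
URL: https://www.data.go.kr/data/15060334/fileData.do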

Alerts

구분 has constant value "사업장일반" (Constant)
데이터기준일자 has constant value "2024-02-29" (Constant)
연번 is highly overall correlated with 신고기준년도 (High correlation)
신고기준년도 is highly overall correlated with 연번 (High correlation)
연락처 has 1 (3.2%) missing value (Missing)
연번 has unique values (Unique)
상호명 has unique values (Unique)
사업장도로명주소 has unique values (Unique)

Reproduction

Analysis started: 2024-03-14 20:28:28.094856
Analysis finished: 2024-03-14 20:28:30.428773
Duration: 2.33 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
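
A report of this kind can be regenerated along the following lines. This is only a sketch under stated assumptions: the CSV path, the output path, the report title, and the CP949 encoding are placeholders rather than values recorded in this report, and the default profiling configuration is used instead of the original config.json.

```python
def build_report(csv_path: str, html_path: str, encoding: str = "cp949") -> None:
    """Profile the CSV at csv_path and write an HTML report to html_path."""
    # Imported lazily so that defining this function does not require the
    # optional third-party packages to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the file is CP949-encoded; pass encoding="utf-8" if it is not.
    df = pd.read_csv(csv_path, encoding=encoding)

    # Default configuration; the title is a placeholder chosen for this sketch.
    report = ProfileReport(df, title="Gyeyang-gu business-site waste generators")
    report.to_file(html_path)
```

Calling `build_report("waste_generators.csv", "report.html")` with a locally downloaded copy of the file (hypothetical file names) would write the HTML report next to it.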

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16
Minimum: 1
Maximum: 31
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 407.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.5
Q1: 8.5
Median: 16
Q3: 23.5
95-th percentile: 29.5
Maximum: 31
Range: 30
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 9.0921211
Coefficient of variation (CV): 0.56825757
Kurtosis: -1.2
Mean: 16
Median Absolute Deviation (MAD): 8
Skewness: 0
Sum: 496
Variance: 82.666667
Monotonicity: Strictly increasing
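
Because 연번 is a running index from 1 to 31, most of the statistics above can be recomputed by hand. The sketch below does so with Python's standard library, assuming sample (n - 1) variance and linearly interpolated quartiles; both assumptions reproduce the reported values.

```python
import statistics

# 연번 is a running index, so its 31 values are simply 1..31.
values = list(range(1, 32))

mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.variance(values)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)       # sample standard deviation
cv = std_dev / mean                      # coefficient of variation

# Quartiles with linear interpolation over the closed range [min, max].
q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation: the median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

print(mean, median, round(variance, 6), round(std_dev, 7), round(cv, 8))
print(q1, q3, iqr, mad, sum(values))
```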
Histogram with fixed size bins (bins=31)

Value | Count | Frequency (%)
1 | 1 | 3.2%
2 | 1 | 3.2%
31 | 1 | 3.2%
30 | 1 | 3.2%
29 | 1 | 3.2%
28 | 1 | 3.2%
27 | 1 | 3.2%
26 | 1 | 3.2%
25 | 1 | 3.2%
24 | 1 | 3.2%
Other values (21) | 21 | 67.7%

Value | Count | Frequency (%)
1 | 1 | 3.2%
2 | 1 | 3.2%
3 | 1 | 3.2%
4 | 1 | 3.2%
5 | 1 | 3.2%
6 | 1 | 3.2%
7 | 1 | 3.2%
8 | 1 | 3.2%
9 | 1 | 3.2%
10 | 1 | 3.2%

Value | Count | Frequency (%)
31 | 1 | 3.2%
30 | 1 | 3.2%
29 | 1 | 3.2%
28 | 1 | 3.2%
27 | 1 | 3.2%
26 | 1 | 3.2%
25 | 1 | 3.2%
24 | 1 | 3.2%
23 | 1 | 3.2%
22 | 1 | 3.2%

구분
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 376.0 B
사업장일반: 31

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사업장일반
2nd row: 사업장일반
3rd row: 사업장일반
4th row: 사업장일반
5th row: 사업장일반

Common Values

Value | Count | Frequency (%)
사업장일반 | 31 | 100.0%

Length

Histogram of lengths of the category

상호명
Text

UNIQUE 

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 376.0 B

Length

Max length: 14
Median length: 12
Mean length: 8.8387097
Min length: 4

Characters and Unicode

Total characters: 274
Distinct characters: 117
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 31
Unique (%): 100.0%

Sample

1st row: 현우산업(주) 서운공장
2nd row: 파트너환경㈜
3rd row: (주)세종파마텍
4th row: 계양구청(스마트도시재생과)
5th row: 주식회사 유일

Value | Count | Frequency (%)
주식회사 | 3 | 7.5%
서운공장 | 2 | 5.0%
홈플러스(주 | 2 | 5.0%
작전점 | 1 | 2.5%
수성자원개발(주 | 1 | 2.5%
앰코테크놀로지코리아(주 | 1 | 2.5%
계양구청(청소행정과 | 1 | 2.5%
한국도로공사 | 1 | 2.5%
인천지사 | 1 | 2.5%
주)오성자원 | 1 | 2.5%
Other values (26) | 26 | 65.0%

Most occurring characters

Value | Count | Frequency (%)
 | 18 | 6.6%
( | 17 | 6.2%
) | 17 | 6.2%
(space) | 9 | 3.3%
 | 6 | 2.2%
 | 6 | 2.2%
 | 6 | 2.2%
 | 5 | 1.8%
 | 5 | 1.8%
 | 4 | 1.5%
Other values (107) | 181 | 66.1%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 228 | 83.2%
Open Punctuation | 17 | 6.2%
Close Punctuation | 17 | 6.2%
Space Separator | 9 | 3.3%
Other Symbol | 2 | 0.7%
Dash Punctuation | 1 | 0.4%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 18 | 7.9%
 | 6 | 2.6%
 | 6 | 2.6%
 | 6 | 2.6%
 | 5 | 2.2%
 | 5 | 2.2%
 | 4 | 1.8%
 | 4 | 1.8%
 | 4 | 1.8%
 | 4 | 1.8%
Other values (102) | 166 | 72.8%

Open Punctuation
Value | Count | Frequency (%)
( | 17 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 17 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 9 | 100.0%

Other Symbol
Value | Count | Frequency (%)
㈜ | 2 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 230 | 83.9%
Common | 44 | 16.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 18 | 7.8%
 | 6 | 2.6%
 | 6 | 2.6%
 | 6 | 2.6%
 | 5 | 2.2%
 | 5 | 2.2%
 | 4 | 1.7%
 | 4 | 1.7%
 | 4 | 1.7%
 | 4 | 1.7%
Other values (103) | 168 | 73.0%

Common
Value | Count | Frequency (%)
( | 17 | 38.6%
) | 17 | 38.6%
(space) | 9 | 20.5%
- | 1 | 2.3%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 228 | 83.2%
ASCII | 44 | 16.1%
None | 2 | 0.7%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
 | 18 | 7.9%
 | 6 | 2.6%
 | 6 | 2.6%
 | 6 | 2.6%
 | 5 | 2.2%
 | 5 | 2.2%
 | 4 | 1.8%
 | 4 | 1.8%
 | 4 | 1.8%
 | 4 | 1.8%
Other values (102) | 166 | 72.8%

ASCII
Value | Count | Frequency (%)
( | 17 | 38.6%
) | 17 | 38.6%
(space) | 9 | 20.5%
- | 1 | 2.3%

None
Value | Count | Frequency (%)
㈜ | 2 | 100.0%

폐기물 종류
Categorical

Distinct: 7
Distinct (%): 22.6%
Missing: 0
Missing (%): 0.0%
Memory size: 376.0 B
폐합성수지류: 23
폐수처리오니: 2
수산물가공잔재물: 2
폐목재류: 1
축산물가공잔재물: 1
Other values (2): 2

Length

Max length: 11
Median length: 6
Mean length: 6.3548387
Min length: 4

Unique

Unique: 4
Unique (%): 12.9%

Sample

1st row: 폐목재류
2nd row: 폐합성수지류
3rd row: 폐합성수지류
4th row: 폐수처리오니
5th row: 축산물가공잔재물

Common Values

Value | Count | Frequency (%)
폐합성수지류 | 23 | 74.2%
폐수처리오니 | 2 | 6.5%
수산물가공잔재물 | 2 | 6.5%
폐목재류 | 1 | 3.2%
축산물가공잔재물 | 1 | 3.2%
석재+골재폐수처리오니 | 1 | 3.2%
화학점결폐주물사 | 1 | 3.2%
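
The frequency, distinct, unique, and length figures for 폐기물 종류 can all be derived from the seven counts in the table above. A minimal sketch, assuming that "unique" means values occurring exactly once and that length is measured in Unicode code points after NFC normalization:

```python
import unicodedata

# Counts copied from the Common Values table above.
counts = {
    "폐합성수지류": 23,
    "폐수처리오니": 2,
    "수산물가공잔재물": 2,
    "폐목재류": 1,
    "축산물가공잔재물": 1,
    "석재+골재폐수처리오니": 1,
    "화학점결폐주물사": 1,
}

total = sum(counts.values())

# Relative frequency of each category, in percent with one decimal place.
freq = {value: round(100 * n / total, 1) for value, n in counts.items()}

distinct = len(counts)                                # number of categories
unique = sum(1 for n in counts.values() if n == 1)    # categories seen once

# Length statistics, weighting each category by how often it occurs.
lengths = {value: len(unicodedata.normalize("NFC", value)) for value in counts}
mean_length = sum(lengths[value] * n for value, n in counts.items()) / total

print(freq)
print(distinct, unique, round(mean_length, 7))
```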

Length

Histogram of lengths of the category

연락처
Text

MISSING 

Distinct: 28
Distinct (%): 93.3%
Missing: 1
Missing (%): 3.2%
Memory size: 376.0 B

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 360
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26
Unique (%): 86.7%

Sample

1st row: 032-720-1390
2nd row: 032-543-9936
3rd row: 032-508-1284
4th row: 032-450-5713
5th row: 032-679-7100
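
Every sampled contact number is 12 characters long and follows the same shape. The sketch below checks that shape with a regular expression; the pattern (area code 032, then a three-digit and a four-digit group) is an assumption inferred from the sample rows, not a rule stated by the data provider, and `is_valid_contact` is a hypothetical helper name.

```python
import re

# Assumed pattern: "032-" followed by three digits, a hyphen, and four digits.
PHONE_RE = re.compile(r"032-[0-9]{3}-[0-9]{4}")

# The five sample rows shown above.
sample = [
    "032-720-1390",
    "032-543-9936",
    "032-508-1284",
    "032-450-5713",
    "032-679-7100",
]

def is_valid_contact(value: str) -> bool:
    """Return True when the whole string matches the assumed phone pattern."""
    return PHONE_RE.fullmatch(value) is not None

invalid = [value for value in sample if not is_valid_contact(value)]
print("Invalid contacts:", invalid)
```

The fixed length also explains the character total: 30 non-missing values × 12 characters = 360.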

Value | Count | Frequency (%)
032-545-5287 | 2 | 6.7%
032-240-8407 | 2 | 6.7%
032-720-1390 | 1 | 3.3%
032-540-3118 | 1 | 3.3%
032-675-2818 | 1 | 3.3%
032-541-2306 | 1 | 3.3%
032-547-0967 | 1 | 3.3%
032-540-1134 | 1 | 3.3%
032-451-2408 | 1 | 3.3%
032-717-1052 | 1 | 3.3%
Other values (18) | 18 | 60.0%

Most occurring characters

Value | Count | Frequency (%)
0 | 63 | 17.5%
- | 60 | 16.7%
2 | 47 | 13.1%
3 | 39 | 10.8%
5 | 38 | 10.6%
4 | 30 | 8.3%
1 | 25 | 6.9%
8 | 19 | 5.3%
7 | 19 | 5.3%
6 | 11 | 3.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 300 | 83.3%
Dash Punctuation | 60 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 63 | 21.0%
2 | 47 | 15.7%
3 | 39 | 13.0%
5 | 38 | 12.7%
4 | 30 | 10.0%
1 | 25 | 8.3%
8 | 19 | 6.3%
7 | 19 | 6.3%
6 | 11 | 3.7%
9 | 9 | 3.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 60 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 360 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 63 | 17.5%
- | 60 | 16.7%
2 | 47 | 13.1%
3 | 39 | 10.8%
5 | 38 | 10.6%
4 | 30 | 8.3%
1 | 25 | 6.9%
8 | 19 | 5.3%
7 | 19 | 5.3%
6 | 11 | 3.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 360 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 63 | 17.5%
- | 60 | 16.7%
2 | 47 | 13.1%
3 | 39 | 10.8%
5 | 38 | 10.6%
4 | 30 | 8.3%
1 | 25 | 6.9%
8 | 19 | 5.3%
7 | 19 | 5.3%
6 | 11 | 3.1%

사업장도로명주소
Text

UNIQUE 

Distinct: 31
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 376.0 B

Length

Max length: 35
Median length: 30
Mean length: 24.419355
Min length: 18

Characters and Unicode

Total characters: 757
Distinct characters: 61
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 31
Unique (%): 100.0%

Sample

1st row: 인천광역시 계양구 서운산단로4길 18, 서운일반산업단지(서운동)
2nd row: 인천광역시 계양구 아나지로 586
3rd row: 인천광역시 계양구 서운산단로 54(서운동)
4th row: 인천광역시 계양구 서운산단로2길 68 (서운동)
5th row: 인천광역시 계양구 아나지로 562 (서운동)

Value | Count | Frequency (%)
인천광역시 | 31 | 20.4%
계양구 | 31 | 20.4%
작전동 | 8 | 5.3%
서운동 | 7 | 4.6%
아나지로 | 7 | 4.6%
효성동 | 5 | 3.3%
계산동 | 5 | 3.3%
14 | 3 | 2.0%
장제로 | 3 | 2.0%
서운산단로1길 | 2 | 1.3%
Other values (50) | 50 | 32.9%

Most occurring characters

Value | Count | Frequency (%)
(space) | 121 | 16.0%
 | 42 | 5.5%
 | 35 | 4.6%
 | 31 | 4.1%
 | 31 | 4.1%
 | 31 | 4.1%
 | 31 | 4.1%
 | 31 | 4.1%
 | 31 | 4.1%
 | 30 | 4.0%
Other values (51) | 343 | 45.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 481 | 63.5%
Space Separator | 121 | 16.0%
Decimal Number | 95 | 12.5%
Open Punctuation | 29 | 3.8%
Close Punctuation | 29 | 3.8%
Dash Punctuation | 1 | 0.1%
Other Punctuation | 1 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 42 | 8.7%
 | 35 | 7.3%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 30 | 6.2%
 | 29 | 6.0%
Other values (37) | 159 | 33.1%

Decimal Number
Value | Count | Frequency (%)
1 | 19 | 20.0%
2 | 13 | 13.7%
8 | 12 | 12.6%
4 | 12 | 12.6%
6 | 11 | 11.6%
5 | 11 | 11.6%
7 | 8 | 8.4%
3 | 6 | 6.3%
0 | 3 | 3.2%

Space Separator
Value | Count | Frequency (%)
(space) | 121 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 29 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 29 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 481 | 63.5%
Common | 276 | 36.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 42 | 8.7%
 | 35 | 7.3%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 30 | 6.2%
 | 29 | 6.0%
Other values (37) | 159 | 33.1%

Common
Value | Count | Frequency (%)
(space) | 121 | 43.8%
( | 29 | 10.5%
) | 29 | 10.5%
1 | 19 | 6.9%
2 | 13 | 4.7%
8 | 12 | 4.3%
4 | 12 | 4.3%
6 | 11 | 4.0%
5 | 11 | 4.0%
7 | 8 | 2.9%
Other values (4) | 11 | 4.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 481 | 63.5%
ASCII | 276 | 36.5%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 121 | 43.8%
( | 29 | 10.5%
) | 29 | 10.5%
1 | 19 | 6.9%
2 | 13 | 4.7%
8 | 12 | 4.3%
4 | 12 | 4.3%
6 | 11 | 4.0%
5 | 11 | 4.0%
7 | 8 | 2.9%
Other values (4) | 11 | 4.0%

Hangul
Value | Count | Frequency (%)
 | 42 | 8.7%
 | 35 | 7.3%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 31 | 6.4%
 | 30 | 6.2%
 | 29 | 6.0%
Other values (37) | 159 | 33.1%

신고기준년도
Real number (ℝ)

HIGH CORRELATION 

Distinct: 18
Distinct (%): 58.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2012.2903
Minimum: 1999
Maximum: 2022
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 407.0 B

Quantile statistics

Minimum: 1999
5-th percentile: 2001
Q1: 2005
Median: 2013
Q3: 2019
95-th percentile: 2022
Maximum: 2022
Range: 23
Interquartile range (IQR): 14

Descriptive statistics

Standard deviation: 7.6689571
Coefficient of variation (CV): 0.003811059
Kurtosis: -1.3128535
Mean: 2012.2903
Median Absolute Deviation (MAD): 6
Skewness: -0.33926916
Sum: 62381
Variance: 58.812903
Monotonicity: Not monotonic
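
Several of the 신고기준년도 figures are tied together by simple identities, which gives a quick consistency check without access to the full column. The sketch below uses only values reported above and assumes the usual definitions (mean = sum / n, CV = standard deviation / mean, variance = standard deviation squared):

```python
# Values copied from the statistics reported above.
n = 31
total = 62381
std_dev = 7.6689571
minimum, maximum = 1999, 2022
q1, q3 = 2005, 2019

mean = total / n            # arithmetic mean
cv = std_dev / mean         # coefficient of variation
variance = std_dev ** 2     # variance from the standard deviation
value_range = maximum - minimum
iqr = q3 - q1               # interquartile range

print(round(mean, 4), round(cv, 9), round(variance, 4), value_range, iqr)
```

The very small CV reflects that calendar years sit far from zero relative to their spread, so it says little about the variability of this column.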
Histogram with fixed size bins (bins=18)

Value | Count | Frequency (%)
2002 | 5 | 16.1%
2022 | 4 | 12.9%
2019 | 3 | 9.7%
2012 | 3 | 9.7%
2018 | 2 | 6.5%
2017 | 2 | 6.5%
2015 | 1 | 3.2%
2014 | 1 | 3.2%
2013 | 1 | 3.2%
2021 | 1 | 3.2%
Other values (8) | 8 | 25.8%

Value | Count | Frequency (%)
1999 | 1 | 3.2%
2000 | 1 | 3.2%
2002 | 5 | 16.1%
2004 | 1 | 3.2%
2006 | 1 | 3.2%
2008 | 1 | 3.2%
2009 | 1 | 3.2%
2011 | 1 | 3.2%
2012 | 3 | 9.7%
2013 | 1 | 3.2%

Value | Count | Frequency (%)
2022 | 4 | 12.9%
2021 | 1 | 3.2%
2020 | 1 | 3.2%
2019 | 3 | 9.7%
2018 | 2 | 6.5%
2017 | 2 | 6.5%
2015 | 1 | 3.2%
2014 | 1 | 3.2%
2013 | 1 | 3.2%
2012 | 3 | 9.7%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 376.0 B
Minimum: 2024-02-29 00:00:00
Maximum: 2024-02-29 00:00:00
Histogram with fixed size bins (bins=1)

Correlations

Variable | 연번 | 상호명 | 폐기물 종류 | 연락처 | 사업장도로명주소 | 신고기준년도
연번 | 1.000 | 1.000 | 0.000 | 0.874 | 1.000 | 0.915
상호명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
폐기물 종류 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000
연락처 | 0.874 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
사업장도로명주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
신고기준년도 | 0.915 | 1.000 | 0.000 | 1.000 | 1.000 | 1.000

Variable | 연번 | 신고기준년도 | 폐기물 종류
연번 | 1.000 | -0.550 | 0.000
신고기준년도 | -0.550 | 1.000 | 0.000
폐기물 종류 | 0.000 | 0.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 연번 | 구분 | 상호명 | 폐기물 종류 | 연락처 | 사업장도로명주소 | 신고기준년도 | 데이터기준일자
0 | 1 | 사업장일반 | 현우산업(주) 서운공장 | 폐목재류 | 032-720-1390 | 인천광역시 계양구 서운산단로4길 18, 서운일반산업단지(서운동) | 2022 | 2024-02-29
1 | 2 | 사업장일반 | 파트너환경㈜ | 폐합성수지류 | 032-543-9936 | 인천광역시 계양구 아나지로 586 | 2022 | 2024-02-29
2 | 3 | 사업장일반 | (주)세종파마텍 | 폐합성수지류 | 032-508-1284 | 인천광역시 계양구 서운산단로 54(서운동) | 2021 | 2024-02-29
3 | 4 | 사업장일반 | 계양구청(스마트도시재생과) | 폐수처리오니 | 032-450-5713 | 인천광역시 계양구 서운산단로2길 68 (서운동) | 2020 | 2024-02-29
4 | 5 | 사업장일반 | 주식회사 유일 | 축산물가공잔재물 | 032-679-7100 | 인천광역시 계양구 아나지로 562 (서운동) | 2019 | 2024-02-29
5 | 6 | 사업장일반 | 키움산업 | 폐합성수지류 | 032-671-9888 | 인천광역시 계양구 안남로457번길 14 (효성동) | 2019 | 2024-02-29
6 | 7 | 사업장일반 | (주)에스에이치비피 | 폐합성수지류 | 032-290-5605 | 인천광역시 계양구 아나지로 412 (작전동) | 2018 | 2024-02-29
7 | 8 | 사업장일반 | (주)유진수산 | 수산물가공잔재물 | 032-545-5287 | 인천광역시 계양구 아나지로 466 (서운동) | 2018 | 2024-02-29
8 | 9 | 사업장일반 | 인천세종병원 | 폐합성수지류 | 032-240-8407 | 인천광역시 계양구 계양문화로 20 (작전동) | 2017 | 2024-02-29
9 | 10 | 사업장일반 | 선영고무공업(주) | 폐합성수지류 | 032-240-8407 | 인천광역시 계양구 서운산업로41번길 14 (서운동) | 2017 | 2024-02-29

Index | 연번 | 구분 | 상호명 | 폐기물 종류 | 연락처 | 사업장도로명주소 | 신고기준년도 | 데이터기준일자
21 | 22 | 사업장일반 | 홈플러스(주) 작전점 | 폐합성수지류 | 032-540-8124 | 인천광역시 계양구 계양대로 27 (작전동) | 2002 | 2024-02-29
22 | 23 | 사업장일반 | 홈플러스(주) | 폐합성수지류 | 032-551-2080 | 인천광역시 계양구 오조산공원로 14 (계산동) | 2002 | 2024-02-29
23 | 24 | 사업장일반 | (주)이마트계양점 | 폐합성수지류 | 032-717-1052 | 인천광역시 계양구 봉오대로 785 (작전동) | 2002 | 2024-02-29
24 | 25 | 사업장일반 | 인천교통공사 | 폐합성수지류 | 032-451-2408 | 인천광역시 계양구 만봉길 65 (귤현동) | 2002 | 2024-02-29
25 | 26 | 사업장일반 | 경인교육대학교 | 폐합성수지류 | 032-540-1134 | 인천광역시 계양구 계산로 62 (계산동) | 2002 | 2024-02-29
26 | 27 | 사업장일반 | 삼민화학공업(주) | 폐합성수지류 | 032-547-0967 | 인천광역시 계양구 안남로 461 (효성동) | 2000 | 2024-02-29
27 | 28 | 사업장일반 | 정보특수금속 | 화학점결폐주물사 | 032-541-2306 | 인천광역시 계양구 계양대로16번길 87 (작전동) | 1999 | 2024-02-29
28 | 29 | 사업장일반 | 동신부로아 주식회사 | 폐합성수지류 | 032-675-2818 | 인천광역시 계양구 서운산단로1길 35 (서운동) | 2022 | 2024-02-29
29 | 30 | 사업장일반 | ㈜와이지-원 서운공장 | 폐합성수지류 | 032-500-5698 | 인천광역시 계양구 서운산단로1길 11 (서운동) | 2022 | 2024-02-29
30 | 31 | 사업장일반 | (주)선도씨푸드 | 수산물가공잔재물 | 032-545-5287 | 인천광역시 계양구 아나지로 420 | 2019 | 2024-02-29
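
The 연락처 column has 28 distinct values among 30 non-missing entries, so two numbers must be shared. The sketch below groups the 20 sample rows shown above by contact number to find which businesses share one. It covers only those sample rows, not the full dataset.

```python
from collections import defaultdict

# (상호명, 연락처) pairs copied from the first and last sample rows above.
rows = [
    ("현우산업(주) 서운공장", "032-720-1390"),
    ("파트너환경㈜", "032-543-9936"),
    ("(주)세종파마텍", "032-508-1284"),
    ("계양구청(스마트도시재생과)", "032-450-5713"),
    ("주식회사 유일", "032-679-7100"),
    ("키움산업", "032-671-9888"),
    ("(주)에스에이치비피", "032-290-5605"),
    ("(주)유진수산", "032-545-5287"),
    ("인천세종병원", "032-240-8407"),
    ("선영고무공업(주)", "032-240-8407"),
    ("홈플러스(주) 작전점", "032-540-8124"),
    ("홈플러스(주)", "032-551-2080"),
    ("(주)이마트계양점", "032-717-1052"),
    ("인천교통공사", "032-451-2408"),
    ("경인교육대학교", "032-540-1134"),
    ("삼민화학공업(주)", "032-547-0967"),
    ("정보특수금속", "032-541-2306"),
    ("동신부로아 주식회사", "032-675-2818"),
    ("㈜와이지-원 서운공장", "032-500-5698"),
    ("(주)선도씨푸드", "032-545-5287"),
]

# Group business names by contact number, preserving row order.
names_by_contact = defaultdict(list)
for name, contact in rows:
    names_by_contact[contact].append(name)

# Keep only the numbers that appear on more than one row.
shared = {c: names for c, names in names_by_contact.items() if len(names) > 1}

for contact, names in shared.items():
    print(contact, "->", ", ".join(names))
```

Both shared numbers match the two values with a count of 2 in the 연락처 frequency table.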