Overview

Dataset statistics

Number of variables: 6
Number of observations: 221
Missing cells: 18
Missing cells (%): 1.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 10.7 KiB
Average record size in memory: 49.6 B
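
The derived percentages in this table can be checked from the raw counts. The sketch below is only an arithmetic check. The variable names are illustrative, and it assumes "Missing cells (%)" is missing cells divided by variables × observations.

```python
# Figures copied from the Dataset statistics table above.
n_variables = 6
n_observations = 221
missing_cells = 18
total_size_kib = 10.7

# Assumption: the percentage is taken over all cells (variables x observations).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size is total memory divided by row count.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results round to the values reported above (1.4% and 49.6 B).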

Variable types

Numeric: 1
Categorical: 2
Text: 3

Dataset

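Description: Data on the reporting status of industrial (business-site) waste generators in Geochang-gun, Gyeongsangnam-do. It provides the waste category (폐기물구분), business name (상호명), contact number (연락처), business road-name address (사업장도로명주소), and data reference date (데이터기준일자).
URL: https://www.data.go.kr/data/15060319/fileData.do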

Alerts

데이터기준일자 has constant value "2023-05-15" (Constant)
연번 is highly overall correlated with 폐기물구분 (High correlation)
폐기물구분 is highly overall correlated with 연번 (High correlation)
연락처 has 18 (8.1%) missing values (Missing)
연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 04:46:35.369682
Analysis finished: 2023-12-12 04:46:36.097472
Duration: 0.73 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 221
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 111
Minimum: 1
Maximum: 221
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.1 KiB

Quantile statistics

Minimum: 1
5-th percentile: 12
Q1: 56
Median: 111
Q3: 166
95-th percentile: 210
Maximum: 221
Range: 220
Interquartile range (IQR): 110

Descriptive statistics

Standard deviation: 63.941379
Coefficient of variation (CV): 0.57604846
Kurtosis: -1.2
Mean: 111
Median Absolute Deviation (MAD): 55
Skewness: 0
Sum: 24531
Variance: 4088.5
Monotonicity: Strictly increasing
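
Because 연번 is unique, strictly increasing, and runs from 1 to 221, most of these statistics can be reproduced with the standard library. The sketch below assumes the column is exactly the integers 1..221, and that the report uses the sample variance (n - 1 denominator) and linearly interpolated quartiles. Those choices are assumptions that happen to match the figures here.

```python
import statistics

# Assumption: 연번 is exactly the integers 1..221.
values = list(range(1, 222))

summary = {
    "sum": sum(values),
    "mean": statistics.mean(values),
    "median": statistics.median(values),
    "variance": statistics.variance(values),  # sample variance (n - 1)
    "std": statistics.stdev(values),          # sample standard deviation
}

# Quartiles with linear interpolation over the closed range of the data.
q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")

# Median absolute deviation around the median.
median = summary["median"]
mad = statistics.median(abs(v - median) for v in values)

# Coefficient of variation: standard deviation relative to the mean.
cv = summary["std"] / summary["mean"]

print(summary, (q1, q2, q3), mad, cv)
```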
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1 | 1 | 0.5%
153 | 1 | 0.5%
142 | 1 | 0.5%
143 | 1 | 0.5%
144 | 1 | 0.5%
145 | 1 | 0.5%
146 | 1 | 0.5%
147 | 1 | 0.5%
148 | 1 | 0.5%
149 | 1 | 0.5%
Other values (211) | 211 | 95.5%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.5%
2 | 1 | 0.5%
3 | 1 | 0.5%
4 | 1 | 0.5%
5 | 1 | 0.5%
6 | 1 | 0.5%
7 | 1 | 0.5%
8 | 1 | 0.5%
9 | 1 | 0.5%
10 | 1 | 0.5%

Maximum 10 values
Value | Count | Frequency (%)
221 | 1 | 0.5%
220 | 1 | 0.5%
219 | 1 | 0.5%
218 | 1 | 0.5%
217 | 1 | 0.5%
216 | 1 | 0.5%
215 | 1 | 0.5%
214 | 1 | 0.5%
213 | 1 | 0.5%
212 | 1 | 0.5%

폐기물구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
지정폐기물: 135
사업장폐기물: 86

Length

Max length: 6
Median length: 5
Mean length: 5.3891403
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사업장폐기물
2nd row: 사업장폐기물
3rd row: 사업장폐기물
4th row: 사업장폐기물
5th row: 사업장폐기물

Common Values

Value | Count | Frequency (%)
지정폐기물 | 135 | 61.1%
사업장폐기물 | 86 | 38.9%

Length

Histogram of lengths of the category


상호명
Text

Distinct: 216
Distinct (%): 97.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 27
Median length: 21
Mean length: 8.1131222
Min length: 2

Characters and Unicode

Total characters: 1793
Distinct characters: 252
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
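
As a rough illustration of how such category counts arise, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and the two input strings are sample values from this report. The report itself may compute these figures differently.

```python
import unicodedata
from collections import Counter


def category_counts(text: str) -> Counter:
    """Count characters by Unicode general category code (e.g. 'Lo', 'Ps')."""
    return Counter(unicodedata.category(ch) for ch in text)


# Two sample values from the 상호명 column.
names = ["㈜동양카본", "(주)삼덕개발"]
for name in names:
    print(name, dict(category_counts(name)))
```

Here `Lo` corresponds to "Other Letter", `Ps` and `Pe` to "Open Punctuation" and "Close Punctuation", and `So` to "Other Symbol". The single "Other Symbol" in the category table is consistent with the ㈜ character in the first sample row.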

Unique

Unique: 211
Unique (%): 95.5%

Sample

1st row: ㈜동양카본
2nd row: 거창군청(환경과)
3rd row: 삼성레미콘 주식회사
4th row: (주)삼덕개발
5th row: (주)하늘바이오 농업회사법인
Value | Count | Frequency (%)
주식회사 | 7 | 2.7%
농업회사법인 | 5 | 2.0%
거창지점 | 3 | 1.2%
주)거창자동차해체재활용산업 | 2 | 0.8%
주)성보산업 | 2 | 0.8%
거창적십자병원 | 2 | 0.8%
거창군 | 2 | 0.8%
동물병원 | 2 | 0.8%
차오름 | 2 | 0.8%
이도 | 2 | 0.8%
Other values (226) | 226 | 88.6%

Most occurring characters

ValueCountFrequency (%)
89
 
5.0%
69
 
3.8%
( 57
 
3.2%
) 57
 
3.2%
55
 
3.1%
44
 
2.5%
43
 
2.4%
42
 
2.3%
40
 
2.2%
37
 
2.1%
Other values (242) 1260
70.3%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1641 | 91.5%
Open Punctuation | 57 | 3.2%
Close Punctuation | 57 | 3.2%
Space Separator | 34 | 1.9%
Uppercase Letter | 2 | 0.1%
Other Symbol | 1 | 0.1%
Decimal Number | 1 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
89
 
5.4%
69
 
4.2%
55
 
3.4%
44
 
2.7%
43
 
2.6%
42
 
2.6%
40
 
2.4%
37
 
2.3%
35
 
2.1%
35
 
2.1%
Other values (235) 1152
70.2%
Uppercase Letter
ValueCountFrequency (%)
S 1
50.0%
C 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 57
100.0%
Close Punctuation
ValueCountFrequency (%)
) 57
100.0%
Space Separator
ValueCountFrequency (%)
34
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Decimal Number
ValueCountFrequency (%)
2 1
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1642 | 91.6%
Common | 149 | 8.3%
Latin | 2 | 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
89
 
5.4%
69
 
4.2%
55
 
3.3%
44
 
2.7%
43
 
2.6%
42
 
2.6%
40
 
2.4%
37
 
2.3%
35
 
2.1%
35
 
2.1%
Other values (236) 1153
70.2%
Common
ValueCountFrequency (%)
( 57
38.3%
) 57
38.3%
34
22.8%
2 1
 
0.7%
Latin
ValueCountFrequency (%)
S 1
50.0%
C 1
50.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1641 | 91.5%
ASCII | 151 | 8.4%
None | 1 | 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
89
 
5.4%
69
 
4.2%
55
 
3.4%
44
 
2.7%
43
 
2.6%
42
 
2.6%
40
 
2.4%
37
 
2.3%
35
 
2.1%
35
 
2.1%
Other values (235) 1152
70.2%
ASCII
ValueCountFrequency (%)
( 57
37.7%
) 57
37.7%
34
22.5%
S 1
 
0.7%
C 1
 
0.7%
2 1
 
0.7%
None
ValueCountFrequency (%)
1
100.0%

연락처
Text

MISSING 

Distinct: 146
Distinct (%): 71.9%
Missing: 18
Missing (%): 8.1%
Memory size: 1.9 KiB

Length

Max length: 12
Median length: 12
Mean length: 9.453202
Min length: 1

Characters and Unicode

Total characters: 1919
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 137
Unique (%): 67.5%

Sample

1st row: 055-942-9091
2nd row: 055-940-3948
3rd row: 055-943-0557
4th row: 055-391-6054
5th row: 055-943-3330
Value | Count | Frequency (%)
055-941-0592 | 5 | 3.2%
055-945-6511 | 2 | 1.3%
055-943-7705 | 2 | 1.3%
055-942-4391 | 2 | 1.3%
055-945-2222 | 2 | 1.3%
055-943-3330 | 2 | 1.3%
055-944-3849 | 2 | 1.3%
055-940-3948 | 2 | 1.3%
055-940-8602 | 1 | 0.6%
055-943-7901 | 1 | 0.6%
Other values (135) | 135 | 86.5%

Most occurring characters

Value | Count | Frequency (%)
5 | 426 | 22.2%
- | 312 | 16.3%
0 | 257 | 13.4%
9 | 214 | 11.2%
4 | 207 | 10.8%
2 | 106 | 5.5%
3 | 102 | 5.3%
1 | 75 | 3.9%
8 | 68 | 3.5%
7 | 65 | 3.4%
Other values (2) | 87 | 4.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1560 | 81.3%
Dash Punctuation | 312 | 16.3%
Space Separator | 47 | 2.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 426
27.3%
0 257
16.5%
9 214
13.7%
4 207
13.3%
2 106
 
6.8%
3 102
 
6.5%
1 75
 
4.8%
8 68
 
4.4%
7 65
 
4.2%
6 40
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 312
100.0%
Space Separator
ValueCountFrequency (%)
47
100.0%
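
The 47 space characters and the minimum length of 1 suggest that some non-missing 연락처 entries are blank placeholders rather than phone numbers. The totals are consistent with 156 twelve-character numbers plus 47 single-space entries (156 × 12 + 47 = 1919 characters), although the report does not state this directly. A minimal sketch for flagging such entries follows. The pattern and the helper name `is_well_formed` are assumptions based on the sample values, not something the dataset publisher specifies.

```python
import re

# Assumed pattern: area code, exchange, and line number separated by hyphens.
PHONE_RE = re.compile(r"\d{2,3}-\d{3,4}-\d{4}")


def is_well_formed(value):
    """Return True only for non-missing values that match the assumed pattern."""
    return value is not None and PHONE_RE.fullmatch(value.strip()) is not None


# Two values from the report, a single-space placeholder, and a missing value.
samples = ["055-942-9091", "054-534-0901", " ", None]
flags = [is_well_formed(s) for s in samples]
print(flags)
```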

Most occurring scripts

ValueCountFrequency (%)
Common 1919
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 426
22.2%
- 312
16.3%
0 257
13.4%
9 214
11.2%
4 207
10.8%
2 106
 
5.5%
3 102
 
5.3%
1 75
 
3.9%
8 68
 
3.5%
7 65
 
3.4%
Other values (2) 87
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1919
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 426
22.2%
- 312
16.3%
0 257
13.4%
9 214
11.2%
4 207
10.8%
2 106
 
5.5%
3 102
 
5.3%
1 75
 
3.9%
8 68
 
3.5%
7 65
 
3.4%
Other values (2) 87
 
4.5%

사업장도로명주소
Text

Distinct: 205
Distinct (%): 92.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB

Length

Max length: 45
Median length: 41
Mean length: 23.393665
Min length: 18

Characters and Unicode

Total characters: 5170
Distinct characters: 189
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 194
Unique (%): 87.8%

Sample

1st row: 경상남도 거창군 남상면 홍덕길 110
2nd row: 경상남도 거창군 거창읍 심소정길 139-14_ 거창군 공공재활용 선별시설
3rd row: 경상남도 거창군 남하면 무릉리 1300
4th row: 경상남도 거창군 가조면 석강3길 167
5th row: 경상남도 거창군 주상면 주곡로 1190-14
Value | Count | Frequency (%)
거창군 | 223 | 18.9%
경상남도 | 220 | 18.6%
거창읍 | 108 | 9.1%
위천면 | 30 | 2.5%
거창대로 | 23 | 1.9%
가조면 | 21 | 1.8%
남상면 | 16 | 1.4%
화리골길 | 15 | 1.3%
대동리 | 11 | 0.9%
중앙로 | 11 | 0.9%
Other values (327) | 503 | 42.6%
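
These counts are per word, not per row, which is why 거창군 (223) can exceed the 221 observations: some addresses name the county twice. The sketch below illustrates this on the five sample addresses using plain whitespace splitting. The report's actual tokenization is not documented here and may differ, for example in how it handles punctuation.

```python
from collections import Counter

# The five sample addresses listed for this variable.
addresses = [
    "경상남도 거창군 남상면 홍덕길 110",
    "경상남도 거창군 거창읍 심소정길 139-14_ 거창군 공공재활용 선별시설",
    "경상남도 거창군 남하면 무릉리 1300",
    "경상남도 거창군 가조면 석강3길 167",
    "경상남도 거창군 주상면 주곡로 1190-14",
]

# Assumption: tokens are whitespace-separated words.
tokens = Counter(tok for addr in addresses for tok in addr.split())

print(tokens.most_common(3))
```

In this five-row sample, 거창군 appears six times because the second address contains it twice.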

Most occurring characters

ValueCountFrequency (%)
963
18.6%
379
 
7.3%
360
 
7.0%
254
 
4.9%
253
 
4.9%
225
 
4.4%
224
 
4.3%
223
 
4.3%
1 192
 
3.7%
113
 
2.2%
Other values (179) 1984
38.4%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 3250 | 62.9%
Space Separator | 963 | 18.6%
Decimal Number | 801 | 15.5%
Dash Punctuation | 79 | 1.5%
Connector Punctuation | 47 | 0.9%
Open Punctuation | 15 | 0.3%
Close Punctuation | 15 | 0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
379
 
11.7%
360
 
11.1%
254
 
7.8%
253
 
7.8%
225
 
6.9%
224
 
6.9%
223
 
6.9%
113
 
3.5%
108
 
3.3%
105
 
3.2%
Other values (164) 1006
31.0%
Decimal Number
ValueCountFrequency (%)
1 192
24.0%
2 95
11.9%
3 92
11.5%
5 82
10.2%
7 66
 
8.2%
6 62
 
7.7%
0 61
 
7.6%
9 55
 
6.9%
4 49
 
6.1%
8 47
 
5.9%
Space Separator
ValueCountFrequency (%)
963
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 79
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 47
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3250
62.9%
Common 1920
37.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
379
 
11.7%
360
 
11.1%
254
 
7.8%
253
 
7.8%
225
 
6.9%
224
 
6.9%
223
 
6.9%
113
 
3.5%
108
 
3.3%
105
 
3.2%
Other values (164) 1006
31.0%
Common
ValueCountFrequency (%)
963
50.2%
1 192
 
10.0%
2 95
 
4.9%
3 92
 
4.8%
5 82
 
4.3%
- 79
 
4.1%
7 66
 
3.4%
6 62
 
3.2%
0 61
 
3.2%
9 55
 
2.9%
Other values (5) 173
 
9.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3250
62.9%
ASCII 1920
37.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
963
50.2%
1 192
 
10.0%
2 95
 
4.9%
3 92
 
4.8%
5 82
 
4.3%
- 79
 
4.1%
7 66
 
3.4%
6 62
 
3.2%
0 61
 
3.2%
9 55
 
2.9%
Other values (5) 173
 
9.0%
Hangul
ValueCountFrequency (%)
379
 
11.7%
360
 
11.1%
254
 
7.8%
253
 
7.8%
225
 
6.9%
224
 
6.9%
223
 
6.9%
113
 
3.5%
108
 
3.3%
105
 
3.2%
Other values (164) 1006
31.0%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.9 KiB
2023-05-15: 221

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-05-15
2nd row: 2023-05-15
3rd row: 2023-05-15
4th row: 2023-05-15
5th row: 2023-05-15

Common Values

Value | Count | Frequency (%)
2023-05-15 | 221 | 100.0%

Length

Histogram of lengths of the category


Interactions


Correlations

Variable | 연번 | 폐기물구분
연번 | 1.000 | 1.000
폐기물구분 | 1.000 | 1.000
Variable | 연번 | 폐기물구분
연번 | 1.000 | 0.956
폐기물구분 | 0.956 | 1.000
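
A likely reason for the strong association is row ordering: the first rows in the sample are all 사업장폐기물 and the last rows are all 지정폐기물, so the serial number alone nearly determines the category. The sketch below illustrates this with a plain Cramér's V over a binned serial number. It assumes the first 86 rows are 사업장폐기물 and the remaining 135 are 지정폐기물, which the report does not confirm. The bin count and the uncorrected formula are also assumptions, so the result is not expected to reproduce 0.956 exactly.

```python
import math
from collections import Counter

N_ROWS, N_FIRST, N_BINS = 221, 86, 10  # N_FIRST and N_BINS are assumptions

ids = range(1, N_ROWS + 1)
# Assumed ordering: all 사업장폐기물 rows first, then all 지정폐기물 rows.
categories = ["사업장폐기물" if i <= N_FIRST else "지정폐기물" for i in ids]
# Discretize the serial number into equal-width bins.
bins = [(i - 1) * N_BINS // N_ROWS for i in ids]


def cramers_v(xs, ys):
    """Uncorrected Cramér's V computed from two equal-length label sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals, col_totals = Counter(xs), Counter(ys)
    chi2 = 0.0
    for r, r_total in row_totals.items():
        for c, c_total in col_totals.items():
            expected = r_total * c_total / n
            chi2 += (joint[(r, c)] - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


v = cramers_v(bins, categories)
print(f"Cramér's V (binned 연번 vs 폐기물구분): {v:.3f}")
```

Under these assumptions only one bin mixes the two categories, so the statistic comes out close to 1. An association this strong reflects file ordering rather than any property of the businesses.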

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
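
The per-column counts behind such a display can be sketched as follows, using three columns from the first five sample rows of this report, with `None` standing in for `<NA>`. The helper name `missing_by_column` is illustrative only.

```python
from collections import Counter

# Three columns from the first five sample rows; None stands in for <NA>.
rows = [
    {"연번": 1, "상호명": "㈜동양카본", "연락처": "055-942-9091"},
    {"연번": 2, "상호명": "거창군청(환경과)", "연락처": "055-940-3948"},
    {"연번": 3, "상호명": "삼성레미콘 주식회사", "연락처": None},
    {"연번": 4, "상호명": "(주)삼덕개발", "연락처": "055-943-0557"},
    {"연번": 5, "상호명": "(주)하늘바이오 농업회사법인", "연락처": None},
]


def missing_by_column(records):
    """Count None values per column across a list of row dictionaries."""
    counts = Counter()
    for record in records:
        for column, value in record.items():
            counts[column] += value is None
    return dict(counts)


missing = missing_by_column(rows)
print(missing)
```

In the full dataset only 연락처 has missing values (18 of 221 rows).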

Sample

First rows
(index) | 연번 | 폐기물구분 | 상호명 | 연락처 | 사업장도로명주소 | 데이터기준일자
0 | 1 | 사업장폐기물 | ㈜동양카본 | 055-942-9091 | 경상남도 거창군 남상면 홍덕길 110 | 2023-05-15
1 | 2 | 사업장폐기물 | 거창군청(환경과) | 055-940-3948 | 경상남도 거창군 거창읍 심소정길 139-14_ 거창군 공공재활용 선별시설 | 2023-05-15
2 | 3 | 사업장폐기물 | 삼성레미콘 주식회사 | <NA> | 경상남도 거창군 남하면 무릉리 1300 | 2023-05-15
3 | 4 | 사업장폐기물 | (주)삼덕개발 | 055-943-0557 | 경상남도 거창군 가조면 석강3길 167 | 2023-05-15
4 | 5 | 사업장폐기물 | (주)하늘바이오 농업회사법인 | <NA> | 경상남도 거창군 주상면 주곡로 1190-14 | 2023-05-15
5 | 6 | 사업장폐기물 | 농업회사법인(주)얼음골식품 거창지점 | 055-391-6054 | 경상남도 거창군 가조면 석강3길 167 | 2023-05-15
6 | 7 | 사업장폐기물 | (주)성보산업 | 055-943-3330 | 경상남도 거창군 주상면 주곡로 1190-14 | 2023-05-15
7 | 8 | 사업장폐기물 | 거창군농협쌀조합공동사업법인 | 055-943-2378 | 경상남도 거창군 거창읍 밤티재로 1291-10 | 2023-05-15
8 | 9 | 사업장폐기물 | 주식회사 이도 거창지점 | 055-945-2222 | 경상남도 거창군 신원면 감악산로 436-58 | 2023-05-15
9 | 10 | 사업장폐기물 | 농업회사법인(주)세영푸드 | 055-945-5270 | 경상남도 거창군 가조면 석강3길 91 | 2023-05-15

Last rows
(index) | 연번 | 폐기물구분 | 상호명 | 연락처 | 사업장도로명주소 | 데이터기준일자
211 | 212 | 지정폐기물 | 광보한의원 | 055-943-9155 | 경상남도 거창군 거창읍 중앙로 157_ 2층 | 2023-05-15
212 | 213 | 지정폐기물 | 경희한의원 | 055-944-7879 | 경상남도 거창군 거창읍 강변로 151 | 2023-05-15
213 | 214 | 지정폐기물 | 일신한의원 | 055-944-3771 | 경상남도 거창군 거창읍 중앙리 180-1 | 2023-05-15
214 | 215 | 지정폐기물 | 동부건설(주) | 055-945-8221 | 경상남도 거창군 남상면 둔동리 656-1 | 2023-05-15
215 | 216 | 지정폐기물 | 주식회사 이도 거창지점 | 055-945-2222 | 경상남도 거창군 신원면 감악산로 436-58 | 2023-05-15
216 | 217 | 지정폐기물 | (주)지오콘 | 054-534-0901 | 경상북도 상주시 공성면 평천공단길 15 | 2023-05-15
217 | 218 | 지정폐기물 | 희봉위생공사 | 055-942-7755 | 경상남도 거창군 거창읍 심소정길 139-31_ 희봉위생공사 | 2023-05-15
218 | 219 | 지정폐기물 | (주)성보산업 | 055-943-3330 | 경상남도 거창군 주상면 내오리 산 105-2 외 4필지 | 2023-05-15
219 | 220 | 지정폐기물 | (주)거창자동차해체재활용산업 | 055-943-7705 | 경상남도 거창군 거창읍 정장리 105-50 | 2023-05-15
220 | 221 | 지정폐기물 | 거창군 생활폐기물소각처리시설(환경시설관리주식회사) | 055-941-0592 | 경상남도 거창군 거창읍 심소정길 139-7_ 거창군 생활폐기물 소각처리시설 | 2023-05-15