Overview

Dataset statistics

Number of variables: 12
Number of observations: 814
Missing cells: 865
Missing cells (%): 8.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 78.8 KiB
Average record size in memory: 99.2 B
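
The missing-cell percentage can be checked from the counts above. The following is a minimal sketch of that arithmetic, assuming the percentage is taken over all cells (variables × observations); the helper name `missing_cell_pct` is illustrative and not part of the report.

```python
def missing_cell_pct(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Share of missing cells among all cells, in percent."""
    total_cells = n_variables * n_observations  # 12 * 814 = 9768 cells
    return 100.0 * missing_cells / total_cells


pct = missing_cell_pct(865, 12, 814)
print(f"{pct:.1f}%")
```

The 865 missing cells also match the per-column alerts: 814 missing values in 전화번호 plus 51 in 사업자등록번호.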

Variable types

Numeric: 2
Categorical: 3
Text: 6
Unsupported: 1

Dataset

Description: Registration status of business-site waste generators in Geumsan-gun, Chungcheongnam-do (business name, location, road-name address, telephone number, waste type, etc.).
Author: Chungcheongnam-do
URL: https://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=382&beforeMenuCd=DOM_000000201001001000&publicdatapk=15060379

Alerts

폐기물구분(사업장일반폐기물지정폐기물) has constant value "사업장일반폐기물" [Constant]
데이터기준일자 has constant value "2021-10-13" [Constant]
연번 is highly overall correlated with 신고기준년도 [High correlation]
신고기준년도 is highly overall correlated with 연번 [High correlation]
사업자등록번호 has 51 (6.3%) missing values [Missing]
전화번호 has 814 (100.0%) missing values [Missing]
연번 has unique values [Unique]
전화번호 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
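
Alerts such as Constant, Unique and Missing can be derived from simple per-column counts. The sketch below shows one way to do that in plain Python. It is not the profiler's own implementation: the function name `column_alerts` and the exact rules are assumptions made for illustration.

```python
from typing import List, Optional, Sequence


def column_alerts(values: Sequence[Optional[str]]) -> List[str]:
    """Return alert labels for one column; None marks a missing cell."""
    alerts = []
    present = [v for v in values if v is not None]
    n_distinct = len(set(present))

    if len(present) < len(values):
        alerts.append("Missing")      # at least one missing cell
    if present and n_distinct == 1:
        alerts.append("Constant")     # a single distinct value
    if len(present) > 1 and n_distinct == len(present):
        alerts.append("Unique")       # every value occurs exactly once
    return alerts


print(column_alerts(["2021-10-13"] * 3))   # constant column
print(column_alerts(["1", "2", "3"]))      # unique column
print(column_alerts([None, None]))         # fully missing column
```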

Reproduction

Analysis started: 2024-01-09 20:41:52.593682
Analysis finished: 2024-01-09 20:41:53.845803
Duration: 1.25 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
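
The reported duration is the difference between the two timestamps. A short sketch of that subtraction, using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-01-09 20:41:52.593682", FMT)
finished = datetime.strptime("2024-01-09 20:41:53.845803", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```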

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION  UNIQUE

Distinct: 814
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 407.5
Minimum: 1
Maximum: 814
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 41.65
Q1: 204.25
Median: 407.5
Q3: 610.75
95-th percentile: 773.35
Maximum: 814
Range: 813
Interquartile range (IQR): 406.5

Descriptive statistics

Standard deviation: 235.12585
Coefficient of variation (CV): 0.57699596
Kurtosis: -1.2
Mean: 407.5
Median Absolute Deviation (MAD): 203.5
Skewness: 0
Sum: 331705
Variance: 55284.167
Monotonicity: Strictly increasing
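
These figures are what one would expect if 연번 is simply the sequence 1, 2, ..., 814. The sketch below recomputes them under two assumptions that the report does not state explicitly: the variance uses the sample (n - 1) denominator, and percentiles are linearly interpolated. The helper `percentile` is written here for illustration.

```python
import math
import statistics


def percentile(sorted_values, q):
    """Linearly interpolated percentile of pre-sorted data, q in [0, 1]."""
    position = q * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


values = list(range(1, 815))  # 1, 2, ..., 814 (already sorted)

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std = math.sqrt(variance)
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)
p5 = percentile(values, 0.05)
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)

print(total, mean, round(variance, 3), round(std, 5), mad, p5, q1, q3)
```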
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 0.1%
548 | 1 | 0.1%
538 | 1 | 0.1%
539 | 1 | 0.1%
540 | 1 | 0.1%
541 | 1 | 0.1%
542 | 1 | 0.1%
543 | 1 | 0.1%
544 | 1 | 0.1%
545 | 1 | 0.1%
Other values (804) | 804 | 98.8%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.1%
2 | 1 | 0.1%
3 | 1 | 0.1%
4 | 1 | 0.1%
5 | 1 | 0.1%
6 | 1 | 0.1%
7 | 1 | 0.1%
8 | 1 | 0.1%
9 | 1 | 0.1%
10 | 1 | 0.1%

Maximum 10 values
Value | Count | Frequency (%)
814 | 1 | 0.1%
813 | 1 | 0.1%
812 | 1 | 0.1%
811 | 1 | 0.1%
810 | 1 | 0.1%
809 | 1 | 0.1%
808 | 1 | 0.1%
807 | 1 | 0.1%
806 | 1 | 0.1%
805 | 1 | 0.1%

폐기물구분(사업장일반폐기물지정폐기물) (waste classification: general business-site waste / designated waste)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Value | Count
사업장일반폐기물 (general business-site waste) | 814

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사업장일반폐기물
2nd row: 사업장일반폐기물
3rd row: 사업장일반폐기물
4th row: 사업장일반폐기물
5th row: 사업장일반폐기물

Common Values

Value | Count | Frequency (%)
사업장일반폐기물 | 814 | 100.0%

Length

Histogram of lengths of the category

상호명 (business name)
Text

Distinct: 328
Distinct (%): 40.3%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Length

Max length: 20
Median length: 17
Mean length: 8.54914
Min length: 1

Characters and Unicode

Total characters: 6959
Distinct characters: 283
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
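
As an illustration of how such character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The function name `category_counts` and the label mapping are assumptions made for this example. The two input strings are taken from the sample rows of this variable.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Po": "Other Punctuation",
    "Pc": "Connector Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Count Unicode general categories over every character in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


counts = category_counts(["금산환경재생산업(주)", "대광무역 방치폐기물"])
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

Hangul syllables fall under "Other Letter", which is why that category dominates the tables for the Korean text columns.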

Unique

Unique: 183
Unique (%): 22.5%

Sample

1st row: 금산환경재생산업(주)
2nd row: 대광무역 방치폐기물
3rd row: 대광무역 방치폐기물
4th row: 대광무역 방치폐기물
5th row: 승원

Value | Count | Frequency (%)
주식회사 | 35 | 3.8%
금산공장 | 35 | 3.8%
한국타이어앤테크놀로지(주 | 26 | 2.8%
주)모던이앤알 | 23 | 2.5%
인선지에스(주 | 20 | 2.2%
인선기업(주 | 19 | 2.1%
주)시공아트 | 18 | 2.0%
두리화장품(주 | 16 | 1.7%
한국타이어(주)금산공장 | 14 | 1.5%
대신산업 | 14 | 1.5%
Other values (332) | 698 | 76.0%

Most occurring characters

ValueCountFrequency (%)
622
 
8.9%
( 594
 
8.5%
) 594
 
8.5%
276
 
4.0%
203
 
2.9%
163
 
2.3%
158
 
2.3%
156
 
2.2%
132
 
1.9%
118
 
1.7%
Other values (273) 3943
56.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5603
80.5%
Open Punctuation 594
 
8.5%
Close Punctuation 594
 
8.5%
Space Separator 118
 
1.7%
Decimal Number 32
 
0.5%
Uppercase Letter 14
 
0.2%
Other Punctuation 3
 
< 0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
622
 
11.1%
276
 
4.9%
203
 
3.6%
163
 
2.9%
158
 
2.8%
156
 
2.8%
132
 
2.4%
117
 
2.1%
107
 
1.9%
104
 
1.9%
Other values (256) 3565
63.6%
Decimal Number
ValueCountFrequency (%)
2 16
50.0%
0 4
 
12.5%
1 4
 
12.5%
3 4
 
12.5%
4 3
 
9.4%
7 1
 
3.1%
Uppercase Letter
ValueCountFrequency (%)
G 3
21.4%
E 3
21.4%
S 3
21.4%
P 3
21.4%
C 1
 
7.1%
J 1
 
7.1%
Open Punctuation
ValueCountFrequency (%)
( 594
100.0%
Close Punctuation
ValueCountFrequency (%)
) 594
100.0%
Space Separator
ValueCountFrequency (%)
118
100.0%
Other Punctuation
ValueCountFrequency (%)
& 3
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5603
80.5%
Common 1342
 
19.3%
Latin 14
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
622
 
11.1%
276
 
4.9%
203
 
3.6%
163
 
2.9%
158
 
2.8%
156
 
2.8%
132
 
2.4%
117
 
2.1%
107
 
1.9%
104
 
1.9%
Other values (256) 3565
63.6%
Common
ValueCountFrequency (%)
( 594
44.3%
) 594
44.3%
118
 
8.8%
2 16
 
1.2%
0 4
 
0.3%
1 4
 
0.3%
3 4
 
0.3%
4 3
 
0.2%
& 3
 
0.2%
7 1
 
0.1%
Latin
ValueCountFrequency (%)
G 3
21.4%
E 3
21.4%
S 3
21.4%
P 3
21.4%
C 1
 
7.1%
J 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5603
80.5%
ASCII 1356
 
19.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
622
 
11.1%
276
 
4.9%
203
 
3.6%
163
 
2.9%
158
 
2.8%
156
 
2.8%
132
 
2.4%
117
 
2.1%
107
 
1.9%
104
 
1.9%
Other values (256) 3565
63.6%
ASCII
ValueCountFrequency (%)
( 594
43.8%
) 594
43.8%
118
 
8.7%
2 16
 
1.2%
0 4
 
0.3%
1 4
 
0.3%
3 4
 
0.3%
4 3
 
0.2%
G 3
 
0.2%
E 3
 
0.2%
Other values (7) 13
 
1.0%

폐기물종류 (waste type)
Text

Distinct: 105
Distinct (%): 12.9%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Length

Max length: 84
Median length: 64
Mean length: 10.124079
Min length: 1

Characters and Unicode

Total characters: 8241
Distinct characters: 192
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 36
Unique (%): 4.4%

Sample

1st row: (blank)
2nd row: 폐유리
3rd row: 폐유리
4th row: 폐유리
5th row: 그 밖의 분진

Value | Count | Frequency (%)
제외한다 | 137 | 9.8%
폐합성수지류(폐염화비닐수지류는 | 122 | 8.7%
 | 100 | 7.1%
밖의 | 100 | 7.1%
폐합성수지류 | 69 | 4.9%
사업장폐기물 | 53 | 3.8%
폐수처리오니 | 52 | 3.7%
식물성잔재물 | 47 | 3.4%
폐합성고무류 | 28 | 2.0%
폐합성수지 | 24 | 1.7%
Other values (160) | 668 | 47.7%

Most occurring characters

ValueCountFrequency (%)
761
 
9.2%
714
 
8.7%
429
 
5.2%
418
 
5.1%
363
 
4.4%
323
 
3.9%
267
 
3.2%
245
 
3.0%
181
 
2.2%
169
 
2.1%
Other values (182) 4371
53.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7127
86.5%
Space Separator 714
 
8.7%
Open Punctuation 161
 
2.0%
Close Punctuation 161
 
2.0%
Connector Punctuation 63
 
0.8%
Decimal Number 15
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
761
 
10.7%
429
 
6.0%
418
 
5.9%
363
 
5.1%
323
 
4.5%
267
 
3.7%
245
 
3.4%
181
 
2.5%
169
 
2.4%
157
 
2.2%
Other values (173) 3814
53.5%
Decimal Number
ValueCountFrequency (%)
1 11
73.3%
2 3
 
20.0%
3 1
 
6.7%
Open Punctuation
ValueCountFrequency (%)
( 160
99.4%
1
 
0.6%
Close Punctuation
ValueCountFrequency (%)
) 160
99.4%
1
 
0.6%
Space Separator
ValueCountFrequency (%)
714
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7127
86.5%
Common 1114
 
13.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
761
 
10.7%
429
 
6.0%
418
 
5.9%
363
 
5.1%
323
 
4.5%
267
 
3.7%
245
 
3.4%
181
 
2.5%
169
 
2.4%
157
 
2.2%
Other values (173) 3814
53.5%
Common
ValueCountFrequency (%)
714
64.1%
( 160
 
14.4%
) 160
 
14.4%
_ 63
 
5.7%
1 11
 
1.0%
2 3
 
0.3%
3 1
 
0.1%
1
 
0.1%
1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7105
86.2%
ASCII 1112
 
13.5%
Compat Jamo 22
 
0.3%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
761
 
10.7%
429
 
6.0%
418
 
5.9%
363
 
5.1%
323
 
4.5%
267
 
3.8%
245
 
3.4%
181
 
2.5%
169
 
2.4%
157
 
2.2%
Other values (172) 3792
53.4%
ASCII
ValueCountFrequency (%)
714
64.2%
( 160
 
14.4%
) 160
 
14.4%
_ 63
 
5.7%
1 11
 
1.0%
2 3
 
0.3%
3 1
 
0.1%
Compat Jamo
ValueCountFrequency (%)
22
100.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%

사업자등록번호 (business registration number)
Text

MISSING

Distinct: 282
Distinct (%): 37.0%
Missing: 51
Missing (%): 6.3%
Memory size: 6.5 KiB

Length

Max length: 12
Median length: 12
Mean length: 12
Min length: 12

Characters and Unicode

Total characters: 9156
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 150
Unique (%): 19.7%

Sample

1st row: 305-26-84981
2nd row: 597-81-01972
3rd row: 305-81-82022
4th row: 305-81-82022
5th row: 305-81-82022

Value | Count | Frequency (%)
305-81-27257 | 39 | 5.1%
305-85-39523 | 39 | 5.1%
305-81-65417 | 23 | 3.0%
305-81-43859 | 20 | 2.6%
314-81-65181 | 14 | 1.8%
763-81-01152 | 13 | 1.7%
305-85-05251 | 13 | 1.7%
305-81-66639 | 13 | 1.7%
305-81-50383 | 11 | 1.4%
305-83-01284 | 10 | 1.3%
Other values (272) | 568 | 74.4%

Most occurring characters

Value | Count | Frequency (%)
- | 1526 | 16.7%
5 | 1170 | 12.8%
3 | 1121 | 12.2%
0 | 1069 | 11.7%
8 | 1016 | 11.1%
1 | 1011 | 11.0%
2 | 631 | 6.9%
7 | 438 | 4.8%
4 | 427 | 4.7%
6 | 391 | 4.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 7630 | 83.3%
Dash Punctuation | 1526 | 16.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 1170 | 15.3%
3 | 1121 | 14.7%
0 | 1069 | 14.0%
8 | 1016 | 13.3%
1 | 1011 | 13.3%
2 | 631 | 8.3%
7 | 438 | 5.7%
4 | 427 | 5.6%
6 | 391 | 5.1%
9 | 356 | 4.7%

Dash Punctuation
Value | Count | Frequency (%)
- | 1526 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 9156 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 1526 | 16.7%
5 | 1170 | 12.8%
3 | 1121 | 12.2%
0 | 1069 | 11.7%
8 | 1016 | 11.1%
1 | 1011 | 11.0%
2 | 631 | 6.9%
7 | 438 | 4.8%
4 | 427 | 4.7%
6 | 391 | 4.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 9156 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 1526 | 16.7%
5 | 1170 | 12.8%
3 | 1121 | 12.2%
0 | 1069 | 11.7%
8 | 1016 | 11.1%
1 | 1011 | 11.0%
2 | 631 | 6.9%
7 | 438 | 4.8%
4 | 427 | 4.7%
6 | 391 | 4.3%
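
The length and character statistics for 사업자등록번호 suggest a fixed `NNN-NN-NNNNN` layout: every value is 12 characters long, and only digits and dashes occur. The sketch below checks that layout with a regular expression and reproduces the character totals from the counts above. The pattern is inferred from the sample rows, and it does not validate any check digit.

```python
import re

# Three digits, two digits, five digits, separated by dashes (inferred layout).
BRN_PATTERN = re.compile(r"[0-9]{3}-[0-9]{2}-[0-9]{5}")


def is_well_formed(value: str) -> bool:
    """True if `value` matches the inferred NNN-NN-NNNNN layout."""
    return BRN_PATTERN.fullmatch(value) is not None


# Character totals implied by 814 rows with 51 missing values.
non_missing = 814 - 51
total_chars = non_missing * 12   # every value is 12 characters long
dashes = non_missing * 2         # two dashes per value
digits = total_chars - dashes

print(is_well_formed("305-26-84981"), total_chars, dashes, digits)
```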

전화번호 (telephone number)
Unsupported

MISSING  REJECTED  UNSUPPORTED

Missing: 814
Missing (%): 100.0%
Memory size: 7.3 KiB

운반자명 (transporter name)
Text

Distinct: 262
Distinct (%): 32.2%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Length

Max length: 70
Median length: 28
Mean length: 6.6695332
Min length: 1

Characters and Unicode

Total characters: 5429
Distinct characters: 229
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 153
Unique (%): 18.8%

Sample

1st row: (blank)
2nd row: (주)연호환경
3rd row: (주)연호환경충주지점
4th row: (주)이에스그린
5th row: 주식회사강우

Value | Count | Frequency (%)
자가처리 | 45 | 6.0%
유림환경(합 | 27 | 3.6%
유)세광 | 23 | 3.1%
자)대신환경 | 22 | 2.9%
주)동양환경 | 19 | 2.5%
주)제이앤텍 | 19 | 2.5%
삼성환경개발(주 | 17 | 2.3%
주)시공아트 | 16 | 2.1%
계룡우드(주 | 14 | 1.9%
주)제이엔텍 | 11 | 1.5%
Other values (253) | 541 | 71.8%

Most occurring characters

ValueCountFrequency (%)
( 573
 
10.6%
) 572
 
10.5%
497
 
9.2%
291
 
5.4%
285
 
5.2%
145
 
2.7%
130
 
2.4%
128
 
2.4%
126
 
2.3%
93
 
1.7%
Other values (219) 2589
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4161
76.6%
Open Punctuation 574
 
10.6%
Close Punctuation 573
 
10.6%
Space Separator 77
 
1.4%
Connector Punctuation 34
 
0.6%
Decimal Number 7
 
0.1%
Uppercase Letter 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
497
 
11.9%
291
 
7.0%
285
 
6.8%
145
 
3.5%
130
 
3.1%
128
 
3.1%
126
 
3.0%
93
 
2.2%
75
 
1.8%
74
 
1.8%
Other values (207) 2317
55.7%
Decimal Number
ValueCountFrequency (%)
7 4
57.1%
2 2
28.6%
1 1
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 573
99.8%
[ 1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 572
99.8%
] 1
 
0.2%
Uppercase Letter
ValueCountFrequency (%)
R 1
50.0%
C 1
50.0%
Space Separator
ValueCountFrequency (%)
77
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 34
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4161
76.6%
Common 1266
 
23.3%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
497
 
11.9%
291
 
7.0%
285
 
6.8%
145
 
3.5%
130
 
3.1%
128
 
3.1%
126
 
3.0%
93
 
2.2%
75
 
1.8%
74
 
1.8%
Other values (207) 2317
55.7%
Common
ValueCountFrequency (%)
( 573
45.3%
) 572
45.2%
77
 
6.1%
_ 34
 
2.7%
7 4
 
0.3%
2 2
 
0.2%
1 1
 
0.1%
- 1
 
0.1%
] 1
 
0.1%
[ 1
 
0.1%
Latin
ValueCountFrequency (%)
R 1
50.0%
C 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4160
76.6%
ASCII 1268
 
23.4%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 573
45.2%
) 572
45.1%
77
 
6.1%
_ 34
 
2.7%
7 4
 
0.3%
2 2
 
0.2%
1 1
 
0.1%
- 1
 
0.1%
] 1
 
0.1%
R 1
 
0.1%
Other values (2) 2
 
0.2%
Hangul
ValueCountFrequency (%)
497
 
11.9%
291
 
7.0%
285
 
6.9%
145
 
3.5%
130
 
3.1%
128
 
3.1%
126
 
3.0%
93
 
2.2%
75
 
1.8%
74
 
1.8%
Other values (206) 2316
55.7%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

처리업체명 (treatment company name)
Text

Distinct: 316
Distinct (%): 38.8%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Length

Max length: 81
Median length: 26
Mean length: 7.022113
Min length: 1

Characters and Unicode

Total characters: 5716
Distinct characters: 256
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 207
Unique (%): 25.4%

Sample

1st row: (blank)
2nd row: (주)이에스지청주
3rd row: (주)이에스지청주
4th row: (주)이에스지청주
5th row: (주)서진인바이러테크

Value | Count | Frequency (%)
자가 | 45 | 5.8%
주)동양환경 | 26 | 3.4%
주)케이엠그린구미지점 | 21 | 2.7%
주)케이엠그린 | 21 | 2.7%
유)대한환경 | 20 | 2.6%
계룡우드(주 | 19 | 2.5%
제이에이치개발(주 | 16 | 2.1%
두제에너지산업(주 | 13 | 1.7%
중부화훼 | 12 | 1.6%
동양환경(주 | 11 | 1.4%
Other values (311) | 566 | 73.5%

Most occurring characters

ValueCountFrequency (%)
557
 
9.7%
) 547
 
9.6%
( 546
 
9.6%
268
 
4.7%
158
 
2.8%
143
 
2.5%
130
 
2.3%
122
 
2.1%
116
 
2.0%
111
 
1.9%
Other values (246) 3018
52.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4477
78.3%
Close Punctuation 547
 
9.6%
Open Punctuation 546
 
9.6%
Space Separator 86
 
1.5%
Connector Punctuation 33
 
0.6%
Uppercase Letter 17
 
0.3%
Decimal Number 9
 
0.2%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
557
 
12.4%
268
 
6.0%
158
 
3.5%
143
 
3.2%
130
 
2.9%
122
 
2.7%
116
 
2.6%
111
 
2.5%
83
 
1.9%
82
 
1.8%
Other values (230) 2707
60.5%
Uppercase Letter
ValueCountFrequency (%)
P 4
23.5%
S 4
23.5%
M 2
11.8%
R 2
11.8%
C 2
11.8%
K 1
 
5.9%
B 1
 
5.9%
W 1
 
5.9%
Decimal Number
ValueCountFrequency (%)
2 4
44.4%
7 4
44.4%
1 1
 
11.1%
Close Punctuation
ValueCountFrequency (%)
) 547
100.0%
Open Punctuation
ValueCountFrequency (%)
( 546
100.0%
Space Separator
ValueCountFrequency (%)
86
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 33
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4477
78.3%
Common 1222
 
21.4%
Latin 17
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
557
 
12.4%
268
 
6.0%
158
 
3.5%
143
 
3.2%
130
 
2.9%
122
 
2.7%
116
 
2.6%
111
 
2.5%
83
 
1.9%
82
 
1.8%
Other values (230) 2707
60.5%
Common
ValueCountFrequency (%)
) 547
44.8%
( 546
44.7%
86
 
7.0%
_ 33
 
2.7%
2 4
 
0.3%
7 4
 
0.3%
1 1
 
0.1%
- 1
 
0.1%
Latin
ValueCountFrequency (%)
P 4
23.5%
S 4
23.5%
M 2
11.8%
R 2
11.8%
C 2
11.8%
K 1
 
5.9%
B 1
 
5.9%
W 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4473
78.3%
ASCII 1239
 
21.7%
Compat Jamo 4
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
557
 
12.5%
268
 
6.0%
158
 
3.5%
143
 
3.2%
130
 
2.9%
122
 
2.7%
116
 
2.6%
111
 
2.5%
83
 
1.9%
82
 
1.8%
Other values (227) 2703
60.4%
ASCII
ValueCountFrequency (%)
) 547
44.1%
( 546
44.1%
86
 
6.9%
_ 33
 
2.7%
2 4
 
0.3%
P 4
 
0.3%
S 4
 
0.3%
7 4
 
0.3%
M 2
 
0.2%
R 2
 
0.2%
Other values (6) 7
 
0.6%
Compat Jamo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%

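처리방법 (treatment method)
Categorical

Distinct: 38
Distinct (%): 4.7%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Value | Count
중간처분(일반소각) | 121
매립(민간관리형매립시설) | 97
재활용(기타) | 83
(blank) | 66
재활용(중간가공폐기물 제조) | 65
Other values (33) | 382

Length

Max length: 19
Median length: 15
Mean length: 10.090909
Min length: 1

Unique

Unique: 10
Unique (%): 1.2%

Sample

1st row: (blank)
2nd row: 매립(민간관리형매립시설)
3rd row: 매립(민간관리형매립시설)
4th row: 매립(민간관리형매립시설)
5th row: 재활용(원료 제조)

Common Values

Value | English | Count | Frequency (%)
중간처분(일반소각) | intermediate disposal (general incineration) | 121 | 14.9%
매립(민간관리형매립시설) | landfill (privately operated managed landfill facility) | 97 | 11.9%
재활용(기타) | recycling (other) | 83 | 10.2%
(blank) | | 66 | 8.1%
재활용(중간가공폐기물 제조) | recycling (production of intermediate processed waste) | 65 | 8.0%
재활용(파쇄_분쇄) | recycling (shredding, crushing) | 53 | 6.5%
중간처분(파쇄_분쇄) | intermediate disposal (shredding, crushing) | 50 | 6.1%
재활용(원료 제조) | recycling (raw material production) | 39 | 4.8%
재활용(연료·고형연료제품 제조) | recycling (production of fuel and solid fuel products) | 33 | 4.1%
재활용(원료가공) | recycling (raw material processing) | 24 | 2.9%
Other values (28) | | 183 | 22.5%

Length

Histogram of lengths of the category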
Value | Count | Frequency (%)
제조 | 137 | 14.2%
중간처분(일반소각 | 121 | 12.5%
매립(민간관리형매립시설 | 97 | 10.1%
재활용(기타 | 83 | 8.6%
재활용(중간가공폐기물 | 65 | 6.7%
재활용(파쇄_분쇄 | 53 | 5.5%
중간처분(파쇄_분쇄 | 50 | 5.2%
사용 | 46 | 4.8%
재활용(원료 | 39 | 4.0%
재활용(연료·고형연료제품 | 33 | 3.4%
Other values (32) | 241 | 25.0%

사업장 도로명주소 (business-site road-name address)
Text

Distinct: 226
Distinct (%): 27.8%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Length

Max length: 39
Median length: 33
Mean length: 18.81941
Min length: 1

Characters and Unicode

Total characters: 15319
Distinct characters: 183
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 102
Unique (%): 12.5%

Sample

1st row: (blank)
2nd row: 충청남도 금산군 추부면 분턱골길 15-6_ 대광
3rd row: 충청남도 금산군 추부면 분턱골길 15-6_ 대광
4th row: 충청남도 금산군 추부면 분턱골길 15-6_ 대광
5th row: 충청남도 금산군 복수면 용천로 867

Value | Count | Frequency (%)
충청남도 | 714 | 19.6%
금산군 | 702 | 19.3%
복수면 | 217 | 6.0%
추부면 | 177 | 4.9%
다복로 | 99 | 2.7%
군북면 | 71 | 2.0%
금성면 | 61 | 1.7%
제원면 | 59 | 1.6%
금산읍 | 54 | 1.5%
군북로 | 51 | 1.4%
Other values (317) | 1432 | 39.4%

Most occurring characters

ValueCountFrequency (%)
3032
19.8%
924
 
6.0%
859
 
5.6%
834
 
5.4%
729
 
4.8%
719
 
4.7%
717
 
4.7%
714
 
4.7%
649
 
4.2%
534
 
3.5%
Other values (173) 5608
36.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9924
64.8%
Space Separator 3032
 
19.8%
Decimal Number 2123
 
13.9%
Dash Punctuation 148
 
1.0%
Open Punctuation 35
 
0.2%
Close Punctuation 35
 
0.2%
Connector Punctuation 22
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
924
 
9.3%
859
 
8.7%
834
 
8.4%
729
 
7.3%
719
 
7.2%
717
 
7.2%
714
 
7.2%
649
 
6.5%
534
 
5.4%
382
 
3.8%
Other values (158) 2863
28.8%
Decimal Number
ValueCountFrequency (%)
1 378
17.8%
4 290
13.7%
2 273
12.9%
5 233
11.0%
6 223
10.5%
3 211
9.9%
7 165
7.8%
0 143
 
6.7%
8 109
 
5.1%
9 98
 
4.6%
Space Separator
ValueCountFrequency (%)
3032
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 148
100.0%
Open Punctuation
ValueCountFrequency (%)
( 35
100.0%
Close Punctuation
ValueCountFrequency (%)
) 35
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9924
64.8%
Common 5395
35.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
924
 
9.3%
859
 
8.7%
834
 
8.4%
729
 
7.3%
719
 
7.2%
717
 
7.2%
714
 
7.2%
649
 
6.5%
534
 
5.4%
382
 
3.8%
Other values (158) 2863
28.8%
Common
ValueCountFrequency (%)
3032
56.2%
1 378
 
7.0%
4 290
 
5.4%
2 273
 
5.1%
5 233
 
4.3%
6 223
 
4.1%
3 211
 
3.9%
7 165
 
3.1%
- 148
 
2.7%
0 143
 
2.7%
Other values (5) 299
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9924
64.8%
ASCII 5395
35.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3032
56.2%
1 378
 
7.0%
4 290
 
5.4%
2 273
 
5.1%
5 233
 
4.3%
6 223
 
4.1%
3 211
 
3.9%
7 165
 
3.1%
- 148
 
2.7%
0 143
 
2.7%
Other values (5) 299
 
5.5%
Hangul
ValueCountFrequency (%)
924
 
9.3%
859
 
8.7%
834
 
8.4%
729
 
7.3%
719
 
7.2%
717
 
7.2%
714
 
7.2%
649
 
6.5%
534
 
5.4%
382
 
3.8%
Other values (158) 2863
28.8%

신고기준년도 (reporting reference year)
Real number (ℝ)

HIGH CORRELATION

Distinct: 22
Distinct (%): 2.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2009.2101
Minimum: 2000
Maximum: 2021
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.3 KiB

Quantile statistics

Minimum: 2000
5-th percentile: 2001
Q1: 2006
Median: 2009.5
Q3: 2011.75
95-th percentile: 2019
Maximum: 2021
Range: 21
Interquartile range (IQR): 5.75

Descriptive statistics

Standard deviation: 4.931895
Coefficient of variation (CV): 0.0024546438
Kurtosis: -0.31664807
Mean: 2009.2101
Median Absolute Deviation (MAD): 2.5
Skewness: 0.23275794
Sum: 1635497
Variance: 24.323588
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=22)

Common values
Value | Count | Frequency (%)
2010 | 154 | 18.9%
2008 | 107 | 13.1%
2009 | 65 | 8.0%
2002 | 63 | 7.7%
2011 | 49 | 6.0%
2001 | 40 | 4.9%
2005 | 37 | 4.5%
2013 | 35 | 4.3%
2003 | 35 | 4.3%
2012 | 33 | 4.1%
Other values (12) | 196 | 24.1%

Minimum 10 values
Value | Count | Frequency (%)
2000 | 8 | 1.0%
2001 | 40 | 4.9%
2002 | 63 | 7.7%
2003 | 35 | 4.3%
2004 | 7 | 0.9%
2005 | 37 | 4.5%
2006 | 22 | 2.7%
2007 | 23 | 2.8%
2008 | 107 | 13.1%
2009 | 65 | 8.0%

Maximum 10 values
Value | Count | Frequency (%)
2021 | 3 | 0.4%
2020 | 17 | 2.1%
2019 | 24 | 2.9%
2018 | 32 | 3.9%
2017 | 9 | 1.1%
2016 | 18 | 2.2%
2015 | 11 | 1.4%
2014 | 22 | 2.7%
2013 | 35 | 4.3%
2012 | 33 | 4.1%

데이터기준일자 (data reference date)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Value | Count
2021-10-13 | 814

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-10-13
2nd row: 2021-10-13
3rd row: 2021-10-13
4th row: 2021-10-13
5th row: 2021-10-13

Common Values

Value | Count | Frequency (%)
2021-10-13 | 814 | 100.0%

Length

Histogram of lengths of the category

Interactions

[Interaction plots for the numeric variables 연번 and 신고기준년도]

Correlations

Variable | 연번 | 처리방법 | 신고기준년도
연번 | 1.000 | 0.742 | 0.980
처리방법 | 0.742 | 1.000 | 0.709
신고기준년도 | 0.980 | 0.709 | 1.000

Variable | 연번 | 신고기준년도 | 처리방법
연번 | 1.000 | -0.990 | 0.358
신고기준년도 | -0.990 | 1.000 | 0.331
처리방법 | 0.358 | 0.331 | 1.000
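
The report does not say which coefficient each matrix uses. As a rough illustration of why 연번 and 신고기준년도 can be strongly negatively related, the sketch below computes a Pearson coefficient on five (연번, 신고기준년도) pairs taken from the sample rows. Pearson is an assumption made here, and five rows cannot reproduce the reported values.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# (연번, 신고기준년도) pairs from the first and last sample rows.
serial = [2, 5, 10, 805, 814]
year = [2021, 2020, 2020, 2001, 2001]

r = pearson(serial, year)
print(round(r, 3))
```

In these rows, low serial numbers carry recent years and high serial numbers carry early years, which gives a coefficient close to -1.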

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index) | 연번 | 폐기물구분(사업장일반폐기물지정폐기물) | 상호명 | 폐기물종류 | 사업자등록번호 | 전화번호 | 운반자명 | 처리업체명 | 처리방법 | 사업장 도로명주소 | 신고기준년도 | 데이터기준일자
01사업장일반폐기물금산환경재생산업(주)<NA><NA>20052021-10-13
12사업장일반폐기물대광무역 방치폐기물폐유리<NA><NA>(주)연호환경(주)이에스지청주매립(민간관리형매립시설)충청남도 금산군 추부면 분턱골길 15-6_ 대광20212021-10-13
23사업장일반폐기물대광무역 방치폐기물폐유리<NA><NA>(주)연호환경충주지점(주)이에스지청주매립(민간관리형매립시설)충청남도 금산군 추부면 분턱골길 15-6_ 대광20212021-10-13
34사업장일반폐기물대광무역 방치폐기물폐유리<NA><NA>(주)이에스그린(주)이에스지청주매립(민간관리형매립시설)충청남도 금산군 추부면 분턱골길 15-6_ 대광20212021-10-13
45사업장일반폐기물승원그 밖의 분진305-26-84981<NA>주식회사강우(주)서진인바이러테크재활용(원료 제조)충청남도 금산군 복수면 용천로 86720202021-10-13
56사업장일반폐기물(주)한성엠에스폐합성수지류(폐염화비닐수지류는 제외한다)597-81-01972<NA>와이제이이두제에너지산업(주)재활용(연료·고형연료제품 제조)충청남도 금산군 진산면 진산로 64320202021-10-13
67사업장일반폐기물(주)휴온스네이처 제3공장그 밖의 식물성잔재물305-81-82022<NA>(주)박물(주)맞춤농업회사법인재활용(농업생산활동에 사용)충청남도 금산군 금산읍 인삼광장로 1920202021-10-13
78사업장일반폐기물(주)휴온스네이처 제3공장그 밖의 폐수처리오니305-81-82022<NA>연호환경(주)청암녹화재활용(토질개선에 사용)충청남도 금산군 금산읍 인삼광장로 1920202021-10-13
89사업장일반폐기물(주)휴온스네이처 제3공장폐합성수지류(폐염화비닐수지류는 제외한다)305-81-82022<NA>(주)성화환경(주)이에스지세종중간처분(일반소각)충청남도 금산군 금산읍 인삼광장로 1920202021-10-13
910사업장일반폐기물천일산업(주)건설폐토석305-81-25749<NA>(주)동양알디(주)동양알디중간처분(파쇄_분쇄)충청남도 금산군 복수면 매방길 520202021-10-13

Last rows

(index) | 연번 | 폐기물구분(사업장일반폐기물지정폐기물) | 상호명 | 폐기물종류 | 사업자등록번호 | 전화번호 | 운반자명 | 처리업체명 | 처리방법 | 사업장 도로명주소 | 신고기준년도 | 데이터기준일자
804805사업장일반폐기물유아이제오차유동전문 유한회사사업장폐기물<NA><NA>동양환경한중관리형매립충청남도 금산군 진산면 태고사로 27120012021-10-13
805806사업장일반폐기물유아이제오차유동전문 유한회사사업장폐기물<NA><NA>오성금속오성금속고온열분해충청남도 금산군 진산면 태고사로 27120012021-10-13
806807사업장일반폐기물유아이제오차유동전문 유한회사사업장폐기물<NA><NA>동양환경동양환경소각충청남도 금산군 진산면 태고사로 27120012021-10-13
807808사업장일반폐기물(주)에이에스에이금산폐합성수지류408-81-79369<NA>(자)대신환경(주)이에스세종중간처분(일반소각)충청남도 금산군 제원면 군북로 27420012021-10-13
808809사업장일반폐기물(주)에이에스에이금산폐내화물408-81-79369<NA>(자)대신환경(주)티와이이엔이매립(민간관리형매립시설)충청남도 금산군 제원면 군북로 27420012021-10-13
809810사업장일반폐기물(주)에이에스에이금산폐수처리오니408-81-79369<NA>(자)대신환경(주)티와이이엔이매립(민간관리형매립시설)충청남도 금산군 제원면 군북로 27420012021-10-13
810811사업장일반폐기물(주)에이에스에이금산폐흡착제408-81-79369<NA>한세이프(주)제이에이치개발(주)매립(민간관리형매립시설)충청남도 금산군 제원면 군북로 27420012021-10-13
811812사업장일반폐기물(주)에이에스에이금산분진(대기오염방지시설에서 포집된 것에 한정하되_ 소각시설에서 발생되는 것은 제외한다)408-81-79369<NA>(자)대신환경(주)티와이이엔이매립(민간관리형매립시설)충청남도 금산군 제원면 군북로 27420012021-10-13
812813사업장일반폐기물(주)에이에스에이금산폐목재류(원목의 용도 그대로 사용하는 나무뿌리ㆍ가지 등을 제거한 원줄기는 제외한다_)408-81-79369<NA>용문산업용문산업재활용(파쇄_분쇄)충청남도 금산군 제원면 군북로 27420012021-10-13
813814사업장일반폐기물(주)에이에스에이금산폐합성수지류408-81-79369<NA>대진공업사대진공업사재활용(기타)충청남도 금산군 제원면 군북로 27420012021-10-13