Overview

Dataset statistics

Number of variables8
Number of observations1506
Missing cells3
Missing cells (%)< 0.1%
Duplicate rows1
Duplicate rows (%)0.1%
Total size in memory95.7 KiB
Average record size in memory65.1 B

Variable types

Text6
Numeric1
Categorical1

Dataset

Description충청남도 서산시 폐기물배출신고현황 데이터입니다. 항목명은 상호명, 전화번호, 사업장도로명주소, 폐기물 종류, 배출량, 운반자, 처리업소명, 처리방법으로 구성되어 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=37&beforeMenuCd=DOM_000000201001001000&publicdatapk=15116837

Alerts

Dataset has 1 (0.1%) duplicate rowsDuplicates

Reproduction

Analysis started2024-01-09 20:37:59.999156
Analysis finished2024-01-09 20:38:00.899262
Duration0.9 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct239
Distinct (%)15.9%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2024-01-10T05:38:01.034650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length10.34595
Min length2

Characters and Unicode

Total characters15581
Distinct characters272
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique81 ?
Unique (%)5.4%

Sample

1st row(재)한국건설생활환경시험연구원
2nd row(주)가나스틸
3rd row(주)가나스틸
4th row(주)가야서산지점
5th row(주)가인
ValueCountFrequency (%)
주식회사 289
 
13.6%
한화토탈에너지스 139
 
6.5%
주)엘지화학대산공장 95
 
4.5%
에이치디현대오일뱅크(주 84
 
3.9%
대산공장 60
 
2.8%
롯데케미칼(주 45
 
2.1%
hd현대케미칼(주 42
 
2.0%
서산지점 41
 
1.9%
현대트랜시스(주 37
 
1.7%
지곡 25
 
1.2%
Other values (257) 1275
59.8%
2024-01-10T05:38:01.365130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1351
 
8.7%
) 1039
 
6.7%
( 1039
 
6.7%
671
 
4.3%
561
 
3.6%
558
 
3.6%
406
 
2.6%
377
 
2.4%
365
 
2.3%
350
 
2.2%
Other values (262) 8864
56.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12662
81.3%
Close Punctuation 1039
 
6.7%
Open Punctuation 1039
 
6.7%
Space Separator 671
 
4.3%
Uppercase Letter 91
 
0.6%
Decimal Number 70
 
0.4%
Other Punctuation 7
 
< 0.1%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1351
 
10.7%
561
 
4.4%
558
 
4.4%
406
 
3.2%
377
 
3.0%
365
 
2.9%
350
 
2.8%
338
 
2.7%
331
 
2.6%
319
 
2.5%
Other values (244) 7706
60.9%
Uppercase Letter
ValueCountFrequency (%)
H 42
46.2%
D 42
46.2%
N 2
 
2.2%
E 2
 
2.2%
G 2
 
2.2%
S 1
 
1.1%
Decimal Number
ValueCountFrequency (%)
2 26
37.1%
0 20
28.6%
1 15
21.4%
3 9
 
12.9%
Other Punctuation
ValueCountFrequency (%)
/ 3
42.9%
. 2
28.6%
· 2
28.6%
Lowercase Letter
ValueCountFrequency (%)
k 1
50.0%
t 1
50.0%
Close Punctuation
ValueCountFrequency (%)
) 1039
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1039
100.0%
Space Separator
ValueCountFrequency (%)
671
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12662
81.3%
Common 2826
 
18.1%
Latin 93
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1351
 
10.7%
561
 
4.4%
558
 
4.4%
406
 
3.2%
377
 
3.0%
365
 
2.9%
350
 
2.8%
338
 
2.7%
331
 
2.6%
319
 
2.5%
Other values (244) 7706
60.9%
Common
ValueCountFrequency (%)
) 1039
36.8%
( 1039
36.8%
671
23.7%
2 26
 
0.9%
0 20
 
0.7%
1 15
 
0.5%
3 9
 
0.3%
/ 3
 
0.1%
. 2
 
0.1%
· 2
 
0.1%
Latin
ValueCountFrequency (%)
H 42
45.2%
D 42
45.2%
N 2
 
2.2%
E 2
 
2.2%
G 2
 
2.2%
k 1
 
1.1%
t 1
 
1.1%
S 1
 
1.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12662
81.3%
ASCII 2917
 
18.7%
None 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1351
 
10.7%
561
 
4.4%
558
 
4.4%
406
 
3.2%
377
 
3.0%
365
 
2.9%
350
 
2.8%
338
 
2.7%
331
 
2.6%
319
 
2.5%
Other values (244) 7706
60.9%
ASCII
ValueCountFrequency (%)
) 1039
35.6%
( 1039
35.6%
671
23.0%
H 42
 
1.4%
D 42
 
1.4%
2 26
 
0.9%
0 20
 
0.7%
1 15
 
0.5%
3 9
 
0.3%
/ 3
 
0.1%
Other values (7) 11
 
0.4%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct211
Distinct (%)14.0%
Missing3
Missing (%)0.2%
Memory size11.9 KiB
2024-01-10T05:38:01.588803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11.721224
Min length1

Characters and Unicode

Total characters17617
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)3.8%

Sample

1st row
2nd row041-681-4292
3rd row041-681-4292
4th row041-664-5326
5th row041-662-1270
ValueCountFrequency (%)
041-660-6457 139
 
9.5%
041-661-2029 95
 
6.5%
041-660-5432 84
 
5.7%
041-689-5330 45
 
3.1%
041-924-1039 42
 
2.9%
041-664-7456 26
 
1.8%
041-665-9957 25
 
1.7%
041-661-9217 25
 
1.7%
041-660-8799 23
 
1.6%
041-663-0751 21
 
1.4%
Other values (200) 940
64.2%
2024-01-10T05:38:01.935334image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6 2997
17.0%
- 2926
16.6%
0 2671
15.2%
1 2348
13.3%
4 2224
12.6%
9 795
 
4.5%
5 779
 
4.4%
2 729
 
4.1%
7 728
 
4.1%
8 697
 
4.0%
Other values (2) 723
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 14653
83.2%
Dash Punctuation 2926
 
16.6%
Space Separator 38
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
6 2997
20.5%
0 2671
18.2%
1 2348
16.0%
4 2224
15.2%
9 795
 
5.4%
5 779
 
5.3%
2 729
 
5.0%
7 728
 
5.0%
8 697
 
4.8%
3 685
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 2926
100.0%
Space Separator
ValueCountFrequency (%)
38
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 17617
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
6 2997
17.0%
- 2926
16.6%
0 2671
15.2%
1 2348
13.3%
4 2224
12.6%
9 795
 
4.5%
5 779
 
4.4%
2 729
 
4.1%
7 728
 
4.1%
8 697
 
4.0%
Other values (2) 723
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17617
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6 2997
17.0%
- 2926
16.6%
0 2671
15.2%
1 2348
13.3%
4 2224
12.6%
9 795
 
4.5%
5 779
 
4.4%
2 729
 
4.1%
7 728
 
4.1%
8 697
 
4.0%
Other values (2) 723
 
4.1%
Distinct224
Distinct (%)14.9%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2024-01-10T05:38:02.211588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length44
Median length40
Mean length22.780212
Min length1

Characters and Unicode

Total characters34307
Distinct characters186
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique78 ?
Unique (%)5.2%

Sample

1st row충청남도 서산시 대산읍 평신1로 595-10
2nd row충청남도 서산시 성연면 성연3로 133-25
3rd row충청남도 서산시 성연면 성연3로 133-25
4th row충청남도 서산시 운산면 장생동로 575
5th row충청남도 서산시 음암면 탑곡리 126-12 1층
ValueCountFrequency (%)
충청남도 1475
19.2%
서산시 1475
19.2%
대산읍 789
 
10.3%
독곶1로 199
 
2.6%
독곶2로 160
 
2.1%
103 154
 
2.0%
평신2로 135
 
1.8%
지곡면 132
 
1.7%
54 118
 
1.5%
성연면 112
 
1.5%
Other values (370) 2920
38.1%
2024-01-10T05:38:02.616592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
6673
19.5%
2388
 
7.0%
1 1529
 
4.5%
1522
 
4.4%
1518
 
4.4%
1504
 
4.4%
1483
 
4.3%
1479
 
4.3%
1479
 
4.3%
1133
 
3.3%
Other values (176) 13599
39.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 20526
59.8%
Space Separator 6673
 
19.5%
Decimal Number 5934
 
17.3%
Dash Punctuation 585
 
1.7%
Close Punctuation 246
 
0.7%
Open Punctuation 246
 
0.7%
Connector Punctuation 94
 
0.3%
Uppercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2388
 
11.6%
1522
 
7.4%
1518
 
7.4%
1504
 
7.3%
1483
 
7.2%
1479
 
7.2%
1479
 
7.2%
1133
 
5.5%
940
 
4.6%
796
 
3.9%
Other values (159) 6284
30.6%
Decimal Number
ValueCountFrequency (%)
1 1529
25.8%
2 986
16.6%
5 587
 
9.9%
3 584
 
9.8%
4 544
 
9.2%
0 428
 
7.2%
8 406
 
6.8%
7 387
 
6.5%
6 277
 
4.7%
9 206
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
B 2
66.7%
L 1
33.3%
Space Separator
ValueCountFrequency (%)
6673
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 585
100.0%
Close Punctuation
ValueCountFrequency (%)
) 246
100.0%
Open Punctuation
ValueCountFrequency (%)
( 246
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 94
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 20526
59.8%
Common 13778
40.2%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2388
 
11.6%
1522
 
7.4%
1518
 
7.4%
1504
 
7.3%
1483
 
7.2%
1479
 
7.2%
1479
 
7.2%
1133
 
5.5%
940
 
4.6%
796
 
3.9%
Other values (159) 6284
30.6%
Common
ValueCountFrequency (%)
6673
48.4%
1 1529
 
11.1%
2 986
 
7.2%
5 587
 
4.3%
- 585
 
4.2%
3 584
 
4.2%
4 544
 
3.9%
0 428
 
3.1%
8 406
 
2.9%
7 387
 
2.8%
Other values (5) 1069
 
7.8%
Latin
ValueCountFrequency (%)
B 2
66.7%
L 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 20526
59.8%
ASCII 13781
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
6673
48.4%
1 1529
 
11.1%
2 986
 
7.2%
5 587
 
4.3%
- 585
 
4.2%
3 584
 
4.2%
4 544
 
3.9%
0 428
 
3.1%
8 406
 
2.9%
7 387
 
2.8%
Other values (7) 1072
 
7.8%
Hangul
ValueCountFrequency (%)
2388
 
11.6%
1522
 
7.4%
1518
 
7.4%
1504
 
7.3%
1483
 
7.2%
1479
 
7.2%
1479
 
7.2%
1133
 
5.5%
940
 
4.6%
796
 
3.9%
Other values (159) 6284
30.6%
Distinct102
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2024-01-10T05:38:02.896921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length88
Median length64
Mean length13.787517
Min length2

Characters and Unicode

Total characters20764
Distinct characters205
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)2.0%

Sample

1st row폐합성수지류(폐염화비닐수지류는 제외한다)
2nd row폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row폐합성수지류(폐염화비닐수지류는 제외한다)
4th row폐콘크리트
5th row자동차 폐타이어
ValueCountFrequency (%)
밖의 469
 
12.9%
469
 
12.9%
제외한다 375
 
10.3%
폐합성수지류(폐염화비닐수지류는 363
 
10.0%
폐수처리오니 123
 
3.4%
폐기물 82
 
2.3%
분진 63
 
1.7%
포함한다 58
 
1.6%
폐합성고분자화합물(합성수지류로 57
 
1.6%
피복된 57
 
1.6%
Other values (182) 1518
41.8%
2024-01-10T05:38:03.305777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2140
 
10.3%
1797
 
8.7%
1011
 
4.9%
947
 
4.6%
881
 
4.2%
639
 
3.1%
585
 
2.8%
521
 
2.5%
518
 
2.5%
( 488
 
2.4%
Other values (195) 11237
54.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 17485
84.2%
Space Separator 2140
 
10.3%
Open Punctuation 489
 
2.4%
Close Punctuation 489
 
2.4%
Connector Punctuation 154
 
0.7%
Decimal Number 5
 
< 0.1%
Other Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1797
 
10.3%
1011
 
5.8%
947
 
5.4%
881
 
5.0%
639
 
3.7%
585
 
3.3%
521
 
3.0%
518
 
3.0%
486
 
2.8%
473
 
2.7%
Other values (185) 9627
55.1%
Decimal Number
ValueCountFrequency (%)
2 2
40.0%
3 2
40.0%
1 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 488
99.8%
1
 
0.2%
Close Punctuation
ValueCountFrequency (%)
) 488
99.8%
1
 
0.2%
Space Separator
ValueCountFrequency (%)
2140
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 154
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 17485
84.2%
Common 3279
 
15.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1797
 
10.3%
1011
 
5.8%
947
 
5.4%
881
 
5.0%
639
 
3.7%
585
 
3.3%
521
 
3.0%
518
 
3.0%
486
 
2.8%
473
 
2.7%
Other values (185) 9627
55.1%
Common
ValueCountFrequency (%)
2140
65.3%
( 488
 
14.9%
) 488
 
14.9%
_ 154
 
4.7%
2 2
 
0.1%
3 2
 
0.1%
. 2
 
0.1%
1 1
 
< 0.1%
1
 
< 0.1%
1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 17463
84.1%
ASCII 3277
 
15.8%
Compat Jamo 22
 
0.1%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2140
65.3%
( 488
 
14.9%
) 488
 
14.9%
_ 154
 
4.7%
2 2
 
0.1%
3 2
 
0.1%
. 2
 
0.1%
1 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
1797
 
10.3%
1011
 
5.8%
947
 
5.4%
881
 
5.0%
639
 
3.7%
585
 
3.3%
521
 
3.0%
518
 
3.0%
486
 
2.8%
473
 
2.7%
Other values (184) 9605
55.0%
Compat Jamo
ValueCountFrequency (%)
22
100.0%
None
ValueCountFrequency (%)
1
50.0%
1
50.0%

배출량(톤)
Real number (ℝ)

Distinct182
Distinct (%)12.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean894.68824
Minimum0.7
Maximum120000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.4 KiB
2024-01-10T05:38:03.441854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.7
5-th percentile10
Q150
median150
Q3535
95-th percentile3375
Maximum120000
Range119999.3
Interquartile range (IQR)485

Descriptive statistics

Standard deviation4220.8778
Coefficient of variation (CV)4.7177079
Kurtosis467.22116
Mean894.68824
Median Absolute Deviation (MAD)126
Skewness18.956789
Sum1347400.5
Variance17815809
MonotonicityNot monotonic
2024-01-10T05:38:03.572159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100.0 120
 
8.0%
120.0 69
 
4.6%
60.0 68
 
4.5%
200.0 68
 
4.5%
50.0 60
 
4.0%
1200.0 51
 
3.4%
300.0 47
 
3.1%
240.0 46
 
3.1%
30.0 46
 
3.1%
1000.0 41
 
2.7%
Other values (172) 890
59.1%
ValueCountFrequency (%)
0.7 2
 
0.1%
0.75 1
 
0.1%
1.0 7
0.5%
1.2 1
 
0.1%
2.0 6
0.4%
2.4 2
 
0.1%
2.5 1
 
0.1%
3.0 2
 
0.1%
3.6 1
 
0.1%
4.8 2
 
0.1%
ValueCountFrequency (%)
120000.0 1
 
0.1%
60000.0 1
 
0.1%
54000.0 1
 
0.1%
36000.0 1
 
0.1%
20000.0 3
 
0.2%
19000.0 1
 
0.1%
15000.0 5
0.3%
12000.0 1
 
0.1%
10600.0 1
 
0.1%
10000.0 8
0.5%
Distinct370
Distinct (%)24.6%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2024-01-10T05:38:03.811980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length18
Mean length7.2954847
Min length1

Characters and Unicode

Total characters10987
Distinct characters267
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique217 ?
Unique (%)14.4%

Sample

1st row(주)대산종합환경
2nd row(주)드림개발
3rd row(주)드림개발
4th row대명종합환경산업(주)
5th row(주)해룡
ValueCountFrequency (%)
대진환경(주 197
 
12.9%
주)성화환경 134
 
8.8%
주)수목환경 119
 
7.8%
주)드림개발 66
 
4.3%
주)태건환경건설 47
 
3.1%
대명종합환경산업(주 35
 
2.3%
주)대진환경 35
 
2.3%
주)대산종합환경 28
 
1.8%
유림환경물류(주 27
 
1.8%
자가 26
 
1.7%
Other values (363) 811
53.2%
2024-01-10T05:38:04.195972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 1260
 
11.5%
) 1259
 
11.5%
1226
 
11.2%
888
 
8.1%
878
 
8.0%
410
 
3.7%
271
 
2.5%
226
 
2.1%
219
 
2.0%
171
 
1.6%
Other values (257) 4179
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8418
76.6%
Open Punctuation 1260
 
11.5%
Close Punctuation 1259
 
11.5%
Space Separator 31
 
0.3%
Connector Punctuation 11
 
0.1%
Decimal Number 4
 
< 0.1%
Uppercase Letter 3
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1226
 
14.6%
888
 
10.5%
878
 
10.4%
410
 
4.9%
271
 
3.2%
226
 
2.7%
219
 
2.6%
171
 
2.0%
162
 
1.9%
162
 
1.9%
Other values (246) 3805
45.2%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
1 1
25.0%
0 1
25.0%
Uppercase Letter
ValueCountFrequency (%)
K 1
33.3%
E 1
33.3%
T 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 1260
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1259
100.0%
Space Separator
ValueCountFrequency (%)
31
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 11
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8418
76.6%
Common 2566
 
23.4%
Latin 3
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1226
 
14.6%
888
 
10.5%
878
 
10.4%
410
 
4.9%
271
 
3.2%
226
 
2.7%
219
 
2.6%
171
 
2.0%
162
 
1.9%
162
 
1.9%
Other values (246) 3805
45.2%
Common
ValueCountFrequency (%)
( 1260
49.1%
) 1259
49.1%
31
 
1.2%
_ 11
 
0.4%
2 2
 
0.1%
1 1
 
< 0.1%
0 1
 
< 0.1%
- 1
 
< 0.1%
Latin
ValueCountFrequency (%)
K 1
33.3%
E 1
33.3%
T 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8418
76.6%
ASCII 2569
 
23.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 1260
49.0%
) 1259
49.0%
31
 
1.2%
_ 11
 
0.4%
2 2
 
0.1%
1 1
 
< 0.1%
0 1
 
< 0.1%
K 1
 
< 0.1%
E 1
 
< 0.1%
T 1
 
< 0.1%
Hangul
ValueCountFrequency (%)
1226
 
14.6%
888
 
10.5%
878
 
10.4%
410
 
4.9%
271
 
3.2%
226
 
2.7%
219
 
2.6%
171
 
2.0%
162
 
1.9%
162
 
1.9%
Other values (246) 3805
45.2%
Distinct520
Distinct (%)34.5%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
2024-01-10T05:38:04.436473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length18
Mean length8.0723772
Min length1

Characters and Unicode

Total characters12157
Distinct characters326
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique304 ?
Unique (%)20.2%

Sample

1st row(주)(주)서광하이테크
2nd row(주)서청
3rd row서해그린환경(주)
4th row대명종합환경산업(주)
5th row(주)해룡
ValueCountFrequency (%)
주)제이엔텍 74
 
4.7%
주)보림씨에스 72
 
4.6%
주)서광하이테크 58
 
3.7%
주)대성에코에너지센터 48
 
3.1%
대명종합환경산업(주 43
 
2.7%
서해그린환경(주 37
 
2.4%
일성페이퍼 35
 
2.2%
주)에코비트그린청주 32
 
2.0%
유)대한청정환경 19
 
1.2%
청천코리아 17
 
1.1%
Other values (524) 1133
72.3%
2024-01-10T05:38:04.811700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1246
 
10.2%
) 1177
 
9.7%
( 1176
 
9.7%
540
 
4.4%
438
 
3.6%
227
 
1.9%
220
 
1.8%
219
 
1.8%
217
 
1.8%
209
 
1.7%
Other values (316) 6488
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9581
78.8%
Close Punctuation 1177
 
9.7%
Open Punctuation 1176
 
9.7%
Space Separator 77
 
0.6%
Uppercase Letter 67
 
0.6%
Lowercase Letter 49
 
0.4%
Decimal Number 11
 
0.1%
Dash Punctuation 7
 
0.1%
Other Punctuation 7
 
0.1%
Connector Punctuation 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1246
 
13.0%
540
 
5.6%
438
 
4.6%
227
 
2.4%
220
 
2.3%
219
 
2.3%
217
 
2.3%
209
 
2.2%
203
 
2.1%
192
 
2.0%
Other values (271) 5870
61.3%
Uppercase Letter
ValueCountFrequency (%)
S 9
13.4%
C 6
 
9.0%
E 6
 
9.0%
R 5
 
7.5%
N 5
 
7.5%
I 4
 
6.0%
K 4
 
6.0%
A 4
 
6.0%
P 4
 
6.0%
L 4
 
6.0%
Other values (7) 16
23.9%
Lowercase Letter
ValueCountFrequency (%)
i 7
14.3%
o 6
12.2%
n 6
12.2%
e 4
 
8.2%
h 4
 
8.2%
k 3
 
6.1%
c 3
 
6.1%
t 2
 
4.1%
p 2
 
4.1%
l 2
 
4.1%
Other values (7) 10
20.4%
Decimal Number
ValueCountFrequency (%)
2 7
63.6%
1 3
27.3%
4 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
. 5
71.4%
/ 1
 
14.3%
& 1
 
14.3%
Close Punctuation
ValueCountFrequency (%)
) 1177
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1176
100.0%
Space Separator
ValueCountFrequency (%)
77
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9581
78.8%
Common 2460
 
20.2%
Latin 116
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1246
 
13.0%
540
 
5.6%
438
 
4.6%
227
 
2.4%
220
 
2.3%
219
 
2.3%
217
 
2.3%
209
 
2.2%
203
 
2.1%
192
 
2.0%
Other values (271) 5870
61.3%
Latin
ValueCountFrequency (%)
S 9
 
7.8%
i 7
 
6.0%
o 6
 
5.2%
n 6
 
5.2%
C 6
 
5.2%
E 6
 
5.2%
R 5
 
4.3%
N 5
 
4.3%
I 4
 
3.4%
K 4
 
3.4%
Other values (24) 58
50.0%
Common
ValueCountFrequency (%)
) 1177
47.8%
( 1176
47.8%
77
 
3.1%
2 7
 
0.3%
- 7
 
0.3%
_ 5
 
0.2%
. 5
 
0.2%
1 3
 
0.1%
4 1
 
< 0.1%
/ 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9581
78.8%
ASCII 2576
 
21.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1246
 
13.0%
540
 
5.6%
438
 
4.6%
227
 
2.4%
220
 
2.3%
219
 
2.3%
217
 
2.3%
209
 
2.2%
203
 
2.1%
192
 
2.0%
Other values (271) 5870
61.3%
ASCII
ValueCountFrequency (%)
) 1177
45.7%
( 1176
45.7%
77
 
3.0%
S 9
 
0.3%
2 7
 
0.3%
- 7
 
0.3%
i 7
 
0.3%
o 6
 
0.2%
n 6
 
0.2%
C 6
 
0.2%
Other values (35) 98
 
3.8%

처리방법
Categorical

Distinct30
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size11.9 KiB
매립(민간관리형매립시설)
282 
중간처분(일반소각)
255 
재활용(중간가공폐기물 제조)
251 
재활용(원료 제조)
174 
재활용(직접 제품제조)
139 
Other values (25)
405 

Length

Max length19
Median length17
Mean length12.223772
Min length1

Unique

Unique6 ?
Unique (%)0.4%

Sample

1st row중간처분(일반소각)
2nd row재활용(중간가공폐기물 제조)
3rd row중간처분(일반소각)
4th row중간처분(파쇄.분쇄)
5th row재활용(원형 재사용)

Common Values

ValueCountFrequency (%)
매립(민간관리형매립시설) 282
18.7%
중간처분(일반소각) 255
16.9%
재활용(중간가공폐기물 제조) 251
16.7%
재활용(원료 제조) 174
11.6%
재활용(직접 제품제조) 139
9.2%
중간처분(파쇄.분쇄) 70
 
4.6%
재활용(연료·고형연료제품 제조) 68
 
4.5%
재활용(농업생산활동에 사용) 46
 
3.1%
재활용(토질개선에 사용) 37
 
2.5%
재활용(기타) 31
 
2.1%
Other values (20) 153
10.2%

Length

2024-01-10T05:38:04.951645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
제조 493
21.4%
매립(민간관리형매립시설 282
12.2%
중간처분(일반소각 255
11.1%
재활용(중간가공폐기물 251
10.9%
재활용(원료 174
 
7.5%
재활용(직접 150
 
6.5%
제품제조 139
 
6.0%
사용 113
 
4.9%
중간처분(파쇄.분쇄 70
 
3.0%
재활용(연료·고형연료제품 68
 
2.9%
Other values (24) 312
13.5%

Interactions

2024-01-10T05:38:00.592962image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:38:05.039253image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
배출량(톤)처리방법
배출량(톤)1.0000.220
처리방법0.2201.000
2024-01-10T05:38:05.121381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
배출량(톤)처리방법
배출량(톤)1.0000.096
처리방법0.0961.000

Missing values

2024-01-10T05:38:00.724977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:38:00.844769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

상호명전화번호사업장도로명주소폐기물 종류배출량(톤)운반자처리업소명처리방법
0(재)한국건설생활환경시험연구원충청남도 서산시 대산읍 평신1로 595-10폐합성수지류(폐염화비닐수지류는 제외한다)20.0(주)대산종합환경(주)(주)서광하이테크중간처분(일반소각)
1(주)가나스틸041-681-4292충청남도 서산시 성연면 성연3로 133-25폐합성수지류(폐염화비닐수지류는 제외한다)30.0(주)드림개발(주)서청재활용(중간가공폐기물 제조)
2(주)가나스틸041-681-4292충청남도 서산시 성연면 성연3로 133-25폐합성수지류(폐염화비닐수지류는 제외한다)30.0(주)드림개발서해그린환경(주)중간처분(일반소각)
3(주)가야서산지점041-664-5326충청남도 서산시 운산면 장생동로 575폐콘크리트560.0대명종합환경산업(주)대명종합환경산업(주)중간처분(파쇄.분쇄)
4(주)가인041-662-1270충청남도 서산시 음암면 탑곡리 126-12 1층자동차 폐타이어20.0(주)해룡(주)해룡재활용(원형 재사용)
5(주)가인041-662-1270충청남도 서산시 음암면 탑곡리 126-12 1층폐합성수지류(폐염화비닐수지류는 제외한다)60.0주원범퍼산업주원범퍼산업재활용(기타)
6(주)거흥산업 서산지점041-663-1267충청남도 서산시 고북면 내포로 2207그 밖의 광재류108.0동진환경(주)(주)센트로매립(민간관리형매립시설)
7(주)거흥산업 서산지점041-663-1267충청남도 서산시 고북면 내포로 2207그 밖의 광재류108.0동진환경(주)(주)하이콘코리아재활용(직접 제품제조)
8(주)거흥산업 서산지점041-663-1267충청남도 서산시 고북면 내포로 2207폐합성수지류(폐염화비닐수지류는 제외한다)60.0구항산업새한환경(주)중간처분(일반소각)
9(주)거흥산업 서산지점041-663-1267충청남도 서산시 고북면 내포로 2207그 밖의 광재류100.0구항산업(주)세라에이치티재활용(성토재·복토재 등으로 사용)
상호명전화번호사업장도로명주소폐기물 종류배출량(톤)운반자처리업소명처리방법
1496환경시설관리주식회사041-665-9957충청남도 서산시 양대11로 55-32 (양대동)그 밖의 폐수처리오니200.0(주)태건환경건설소은농장재활용(토질개선에 사용)
1497환경시설관리주식회사041-665-9957충청남도 서산시 양대11로 55-32 (양대동)그 밖의 폐수처리오니200.0(주)태건환경건설제일농장재활용(토질개선에 사용)
1498환경시설관리주식회사041-665-9957충청남도 서산시 양대11로 55-32 (양대동)그 밖의 폐수처리오니200.0(주)태건환경건설용이농장재활용(토질개선에 사용)
1499환경시설관리주식회사041-665-9957충청남도 서산시 양대11로 55-32 (양대동)그 밖의 폐수처리오니200.0(주)태건환경건설군산한일농장재활용(토질개선에 사용)
1500환경시설관리주식회사041-665-9957충청남도 서산시 양대11로 55-32 (양대동)그 밖의 폐수처리오니200.0(주)태건환경건설태인농장재활용(토질개선에 사용)
1501환경시설관리주식회사041-665-9957충청남도 서산시 양대11로 55-32 (양대동)그 밖의 폐수처리오니200.0(주)태건환경건설산들농장재활용(토질개선에 사용)
1502환경시설관리주식회사041-665-9957충청남도 서산시 양대11로 55-32 (양대동)그 밖의 폐수처리오니200.0(주)태건환경건설대성팜재활용(토질개선에 사용)
1503효창산업041-688-3991충청남도 서산시 해미면 삼송리 816-23그 밖의 폐기물20.0(주)수목환경(주)서광하이테크중간처분(일반소각)
1504효창산업041-688-3991충청남도 서산시 해미면 삼송리 816-23폐합성수지류(폐염화비닐수지류는 제외한다)20.0(주)수목환경(주)성림개발중간처분(파쇄.분쇄)
1505효창산업041-688-3991충청남도 서산시 해미면 삼송리 816-23그 밖의 공정오니150.0(주)수목환경(주)이에스청원매립(민간관리형매립시설)

Duplicate rows

Most frequently occurring

상호명전화번호사업장도로명주소폐기물 종류배출량(톤)운반자처리업소명처리방법# duplicates
0우선산업(주)041-688-6388충청남도 서산시 해미면 산수로 137-24폐목재류 3등급200.0(유)케이앤케이환경(주)엔아이티중간처분(일반소각)2