Overview

Dataset statistics

Number of variables12
Number of observations280
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory26.9 KiB
Average record size in memory98.5 B

Variable types

Numeric1
Categorical2
Text8
DateTime1

Dataset

Description인천광역시 옹진군 사업장 폐기물 배출신고자 현황<br/>폐기물구분(사업장일반폐기물, 지정폐기물) 상호명 폐기물종류 사업자등록번호 연락처 운반자명 처리업소명 처리방법 사업장도로명주소 신고기준년도 데이터기준일자<br/>
Author인천광역시 옹진군
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15060259&srcSe=7661IVAWM27C61E190

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 신고기준년도High correlation
신고기준년도 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-18 04:31:16.620686
Analysis finished2024-03-18 04:31:19.512531
Duration2.89 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct280
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean140.5
Minimum1
Maximum280
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.6 KiB
2024-03-18T13:31:19.577574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile14.95
Q170.75
median140.5
Q3210.25
95-th percentile266.05
Maximum280
Range279
Interquartile range (IQR)139.5

Descriptive statistics

Standard deviation80.973247
Coefficient of variation (CV)0.57632204
Kurtosis-1.2
Mean140.5
Median Absolute Deviation (MAD)70
Skewness0
Sum39340
Variance6556.6667
MonotonicityStrictly increasing
2024-03-18T13:31:19.707328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
186 1
 
0.4%
192 1
 
0.4%
191 1
 
0.4%
190 1
 
0.4%
189 1
 
0.4%
188 1
 
0.4%
187 1
 
0.4%
185 1
 
0.4%
194 1
 
0.4%
Other values (270) 270
96.4%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
280 1
0.4%
279 1
0.4%
278 1
0.4%
277 1
0.4%
276 1
0.4%
275 1
0.4%
274 1
0.4%
273 1
0.4%
272 1
0.4%
271 1
0.4%

폐기물구분
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
사업장일반폐기물
150 
지정폐기물
130 

Length

Max length8
Median length8
Mean length6.6071429
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지정폐기물
2nd row지정폐기물
3rd row사업장일반폐기물
4th row사업장일반폐기물
5th row지정폐기물

Common Values

ValueCountFrequency (%)
사업장일반폐기물 150
53.6%
지정폐기물 130
46.4%

Length

2024-03-18T13:31:19.833468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T13:31:19.925683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장일반폐기물 150
53.6%
지정폐기물 130
46.4%
Distinct90
Distinct (%)32.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-18T13:31:20.104496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length15
Mean length8.4428571
Min length2

Characters and Unicode

Total characters2364
Distinct characters162
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)20.7%

Sample

1st row국방시설본부 경기남부시설단
2nd row해병대9196부대
3rd row한국어촌어항공단
4th row한국어촌어항공단
5th row인천광역시 남부교육지원청
ValueCountFrequency (%)
개인 37
 
9.4%
옹진군청 34
 
8.7%
한국남동발전㈜ 25
 
6.4%
영흥발전본부 25
 
6.4%
북도면사무소 16
 
4.1%
인천광역시 16
 
4.1%
15
 
3.8%
영흥면사무소 15
 
3.8%
경기남부시설단 14
 
3.6%
국방시설본부 13
 
3.3%
Other values (90) 183
46.6%
2024-03-18T13:31:20.417789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
114
 
4.8%
94
 
4.0%
71
 
3.0%
67
 
2.8%
62
 
2.6%
61
 
2.6%
60
 
2.5%
59
 
2.5%
57
 
2.4%
57
 
2.4%
Other values (152) 1662
70.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2041
86.3%
Space Separator 114
 
4.8%
Decimal Number 113
 
4.8%
Other Symbol 39
 
1.6%
Other Punctuation 18
 
0.8%
Close Punctuation 15
 
0.6%
Open Punctuation 15
 
0.6%
Uppercase Letter 7
 
0.3%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
94
 
4.6%
71
 
3.5%
67
 
3.3%
62
 
3.0%
61
 
3.0%
60
 
2.9%
59
 
2.9%
57
 
2.8%
57
 
2.8%
54
 
2.6%
Other values (133) 1399
68.5%
Decimal Number
ValueCountFrequency (%)
9 38
33.6%
1 26
23.0%
6 16
14.2%
5 14
 
12.4%
8 14
 
12.4%
3 4
 
3.5%
2 1
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
P 2
28.6%
C 2
28.6%
X 1
14.3%
T 1
14.3%
S 1
14.3%
Other Punctuation
ValueCountFrequency (%)
, 16
88.9%
: 2
 
11.1%
Space Separator
ValueCountFrequency (%)
114
100.0%
Other Symbol
ValueCountFrequency (%)
39
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2080
88.0%
Common 277
 
11.7%
Latin 7
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
94
 
4.5%
71
 
3.4%
67
 
3.2%
62
 
3.0%
61
 
2.9%
60
 
2.9%
59
 
2.8%
57
 
2.7%
57
 
2.7%
54
 
2.6%
Other values (134) 1438
69.1%
Common
ValueCountFrequency (%)
114
41.2%
9 38
 
13.7%
1 26
 
9.4%
, 16
 
5.8%
6 16
 
5.8%
) 15
 
5.4%
( 15
 
5.4%
5 14
 
5.1%
8 14
 
5.1%
3 4
 
1.4%
Other values (3) 5
 
1.8%
Latin
ValueCountFrequency (%)
P 2
28.6%
C 2
28.6%
X 1
14.3%
T 1
14.3%
S 1
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2041
86.3%
ASCII 284
 
12.0%
None 39
 
1.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
114
40.1%
9 38
 
13.4%
1 26
 
9.2%
, 16
 
5.6%
6 16
 
5.6%
) 15
 
5.3%
( 15
 
5.3%
5 14
 
4.9%
8 14
 
4.9%
3 4
 
1.4%
Other values (8) 12
 
4.2%
Hangul
ValueCountFrequency (%)
94
 
4.6%
71
 
3.5%
67
 
3.3%
62
 
3.0%
61
 
3.0%
60
 
2.9%
59
 
2.9%
57
 
2.8%
57
 
2.8%
54
 
2.6%
Other values (133) 1399
68.5%
None
ValueCountFrequency (%)
39
100.0%
Distinct91
Distinct (%)32.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-18T13:31:20.634395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length285
Median length277
Mean length30.803571
Min length3

Characters and Unicode

Total characters8625
Distinct characters130
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique65 ?
Unique (%)23.2%

Sample

1st row폐석면
2nd row격리의료폐기물(고상), 병리계폐기물(액상),병리계폐기물(고상), 손상성폐기물(액상), 일반의료폐기물(액상)
3rd row폐합성수지
4th row폐합성수지
5th row폐석면
ValueCountFrequency (%)
석탄재 112
 
9.0%
그밖의 112
 
9.0%
폐석회 84
 
6.7%
폐합성수지류 72
 
5.8%
흩날릴우려가없는폐석면 61
 
4.9%
폐합성수지 58
 
4.6%
석면의제거작업에사용된 52
 
4.2%
폐합성고무류 45
 
3.6%
폐석면 43
 
3.4%
금속성폐촉매 43
 
3.4%
Other values (111) 568
45.4%
2024-03-18T13:31:20.973675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1016
 
11.8%
770
 
8.9%
, 767
 
8.9%
398
 
4.6%
338
 
3.9%
265
 
3.1%
235
 
2.7%
229
 
2.7%
186
 
2.2%
182
 
2.1%
Other values (120) 4239
49.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6706
77.8%
Space Separator 1016
 
11.8%
Other Punctuation 767
 
8.9%
Close Punctuation 63
 
0.7%
Open Punctuation 63
 
0.7%
Decimal Number 10
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
770
 
11.5%
398
 
5.9%
338
 
5.0%
265
 
4.0%
235
 
3.5%
229
 
3.4%
186
 
2.8%
182
 
2.7%
182
 
2.7%
180
 
2.7%
Other values (114) 3741
55.8%
Decimal Number
ValueCountFrequency (%)
1 5
50.0%
2 5
50.0%
Space Separator
ValueCountFrequency (%)
1016
100.0%
Other Punctuation
ValueCountFrequency (%)
, 767
100.0%
Close Punctuation
ValueCountFrequency (%)
) 63
100.0%
Open Punctuation
ValueCountFrequency (%)
( 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6706
77.8%
Common 1919
 
22.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
770
 
11.5%
398
 
5.9%
338
 
5.0%
265
 
4.0%
235
 
3.5%
229
 
3.4%
186
 
2.8%
182
 
2.7%
182
 
2.7%
180
 
2.7%
Other values (114) 3741
55.8%
Common
ValueCountFrequency (%)
1016
52.9%
, 767
40.0%
) 63
 
3.3%
( 63
 
3.3%
1 5
 
0.3%
2 5
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6706
77.8%
ASCII 1919
 
22.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1016
52.9%
, 767
40.0%
) 63
 
3.3%
( 63
 
3.3%
1 5
 
0.3%
2 5
 
0.3%
Hangul
ValueCountFrequency (%)
770
 
11.5%
398
 
5.9%
338
 
5.0%
265
 
4.0%
235
 
3.5%
229
 
3.4%
186
 
2.8%
182
 
2.7%
182
 
2.7%
180
 
2.7%
Other values (114) 3741
55.8%
Distinct68
Distinct (%)24.3%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-18T13:31:21.184093image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length11.114286
Min length6

Characters and Unicode

Total characters3112
Distinct characters18
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique48 ?
Unique (%)17.1%

Sample

1st row130-83-04732
2nd row121-83-02856
3rd row220-82-00065
4th row220-82-00065
5th row121-83-03516
ValueCountFrequency (%)
121-83-00824 47
16.8%
데이터미수집 40
14.3%
121-85-16465 24
 
8.6%
130-83-04732 16
 
5.7%
121-83-00431 15
 
5.4%
121-83-00391 15
 
5.4%
121-83-04553 14
 
5.0%
121-83-02856 12
 
4.3%
220-82-00065 11
 
3.9%
121-83-03516 8
 
2.9%
Other values (58) 78
27.9%
2024-03-18T13:31:21.524240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 532
17.1%
- 473
15.2%
0 370
11.9%
2 347
11.2%
8 319
10.3%
3 313
10.1%
4 149
 
4.8%
5 136
 
4.4%
6 127
 
4.1%
7 53
 
1.7%
Other values (8) 293
9.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2397
77.0%
Dash Punctuation 473
 
15.2%
Other Letter 240
 
7.7%
Other Punctuation 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 532
22.2%
0 370
15.4%
2 347
14.5%
8 319
13.3%
3 313
13.1%
4 149
 
6.2%
5 136
 
5.7%
6 127
 
5.3%
7 53
 
2.2%
9 51
 
2.1%
Other Letter
ValueCountFrequency (%)
40
16.7%
40
16.7%
40
16.7%
40
16.7%
40
16.7%
40
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 473
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2872
92.3%
Hangul 240
 
7.7%

Most frequent character per script

Common
ValueCountFrequency (%)
1 532
18.5%
- 473
16.5%
0 370
12.9%
2 347
12.1%
8 319
11.1%
3 313
10.9%
4 149
 
5.2%
5 136
 
4.7%
6 127
 
4.4%
7 53
 
1.8%
Other values (2) 53
 
1.8%
Hangul
ValueCountFrequency (%)
40
16.7%
40
16.7%
40
16.7%
40
16.7%
40
16.7%
40
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2872
92.3%
Hangul 240
 
7.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 532
18.5%
- 473
16.5%
0 370
12.9%
2 347
12.1%
8 319
11.1%
3 313
10.9%
4 149
 
5.2%
5 136
 
4.7%
6 127
 
4.4%
7 53
 
1.8%
Other values (2) 53
 
1.8%
Hangul
ValueCountFrequency (%)
40
16.7%
40
16.7%
40
16.7%
40
16.7%
40
16.7%
40
16.7%
Distinct108
Distinct (%)38.6%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-18T13:31:21.838809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length10.342857
Min length1

Characters and Unicode

Total characters2896
Distinct characters18
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique74 ?
Unique (%)26.4%

Sample

1st row데이터미수집
2nd row032-837-3432
3rd row032-6098-0782
4th row02-6098-0799
5th row032-762-7361
ValueCountFrequency (%)
데이터미수집 78
28.1%
070-8898-3937 15
 
5.4%
032-899-2623 9
 
3.2%
032-899-3822 9
 
3.2%
032-899-2682 7
 
2.5%
032-899-3415 6
 
2.2%
032-830-3142 6
 
2.2%
02-6098-0851 5
 
1.8%
070-8898-3942 5
 
1.8%
032-899-3923 5
 
1.8%
Other values (97) 133
47.8%
2024-03-18T13:31:22.201342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 400
13.8%
3 363
12.5%
2 341
11.8%
0 307
10.6%
8 306
10.6%
9 276
9.5%
4 112
 
3.9%
7 100
 
3.5%
6 83
 
2.9%
1 79
 
2.7%
Other values (8) 529
18.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2026
70.0%
Other Letter 468
 
16.2%
Dash Punctuation 400
 
13.8%
Space Separator 2
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 363
17.9%
2 341
16.8%
0 307
15.2%
8 306
15.1%
9 276
13.6%
4 112
 
5.5%
7 100
 
4.9%
6 83
 
4.1%
1 79
 
3.9%
5 59
 
2.9%
Other Letter
ValueCountFrequency (%)
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 400
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2428
83.8%
Hangul 468
 
16.2%

Most frequent character per script

Common
ValueCountFrequency (%)
- 400
16.5%
3 363
15.0%
2 341
14.0%
0 307
12.6%
8 306
12.6%
9 276
11.4%
4 112
 
4.6%
7 100
 
4.1%
6 83
 
3.4%
1 79
 
3.3%
Other values (2) 61
 
2.5%
Hangul
ValueCountFrequency (%)
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2428
83.8%
Hangul 468
 
16.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 400
16.5%
3 363
15.0%
2 341
14.0%
0 307
12.6%
8 306
12.6%
9 276
11.4%
4 112
 
4.6%
7 100
 
4.1%
6 83
 
3.4%
1 79
 
3.3%
Other values (2) 61
 
2.5%
Hangul
ValueCountFrequency (%)
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%
78
16.7%
Distinct130
Distinct (%)46.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-18T13:31:22.394738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length259
Median length248
Mean length22.678571
Min length3

Characters and Unicode

Total characters6350
Distinct characters174
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)30.4%

Sample

1st row㈜네이처산업
2nd row㈜대산기업
3rd row㈜더엘
4th row㈜더엘
5th row진성산업개발㈜
ValueCountFrequency (%)
꿈에그린㈜ 266
27.7%
㈜강우 90
 
9.4%
㈜한강건설환경 25
 
2.6%
에스지개발㈜ 22
 
2.3%
주식회사 19
 
2.0%
송화환경㈜ 16
 
1.7%
승헌실업㈜ 16
 
1.7%
자가운반 16
 
1.7%
태경산업 16
 
1.7%
연구기관 16
 
1.7%
Other values (120) 458
47.7%
2024-03-18T13:31:22.727589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
792
 
12.5%
778
 
12.3%
+ 651
 
10.3%
353
 
5.6%
280
 
4.4%
279
 
4.4%
268
 
4.2%
177
 
2.8%
144
 
2.3%
120
 
1.9%
Other values (164) 2508
39.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4042
63.7%
Other Symbol 792
 
12.5%
Space Separator 778
 
12.3%
Math Symbol 651
 
10.3%
Decimal Number 64
 
1.0%
Close Punctuation 9
 
0.1%
Open Punctuation 9
 
0.1%
Uppercase Letter 4
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
353
 
8.7%
280
 
6.9%
279
 
6.9%
268
 
6.6%
177
 
4.4%
144
 
3.6%
120
 
3.0%
95
 
2.4%
91
 
2.3%
90
 
2.2%
Other values (150) 2145
53.1%
Decimal Number
ValueCountFrequency (%)
1 33
51.6%
3 15
23.4%
6 15
23.4%
4 1
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
E 1
25.0%
T 1
25.0%
S 1
25.0%
K 1
25.0%
Other Symbol
ValueCountFrequency (%)
792
100.0%
Space Separator
ValueCountFrequency (%)
778
100.0%
Math Symbol
ValueCountFrequency (%)
+ 651
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4834
76.1%
Common 1512
 
23.8%
Latin 4
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
792
 
16.4%
353
 
7.3%
280
 
5.8%
279
 
5.8%
268
 
5.5%
177
 
3.7%
144
 
3.0%
120
 
2.5%
95
 
2.0%
91
 
1.9%
Other values (151) 2235
46.2%
Common
ValueCountFrequency (%)
778
51.5%
+ 651
43.1%
1 33
 
2.2%
3 15
 
1.0%
6 15
 
1.0%
) 9
 
0.6%
( 9
 
0.6%
4 1
 
0.1%
. 1
 
0.1%
Latin
ValueCountFrequency (%)
E 1
25.0%
T 1
25.0%
S 1
25.0%
K 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4042
63.7%
ASCII 1516
 
23.9%
None 792
 
12.5%

Most frequent character per block

None
ValueCountFrequency (%)
792
100.0%
ASCII
ValueCountFrequency (%)
778
51.3%
+ 651
42.9%
1 33
 
2.2%
3 15
 
1.0%
6 15
 
1.0%
) 9
 
0.6%
( 9
 
0.6%
E 1
 
0.1%
4 1
 
0.1%
. 1
 
0.1%
Other values (3) 3
 
0.2%
Hangul
ValueCountFrequency (%)
353
 
8.7%
280
 
6.9%
279
 
6.9%
268
 
6.6%
177
 
4.4%
144
 
3.6%
120
 
3.0%
95
 
2.4%
91
 
2.3%
90
 
2.2%
Other values (150) 2145
53.1%
Distinct129
Distinct (%)46.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-18T13:31:22.929119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length363
Median length328
Mean length28.753571
Min length3

Characters and Unicode

Total characters8051
Distinct characters182
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique77 ?
Unique (%)27.5%

Sample

1st row㈜센트로+ (유)에스앤피+ ㈜센트로
2nd row㈜스테리싸이클코리아
3rd row㈜이한산업
4th row㈜이한산업
5th row에코시스템㈜포항+ (주)디와이솔루션+ 에코시스템㈜포항
ValueCountFrequency (%)
신대한정유산업㈜ 91
 
9.2%
보림씨에스㈜ 83
 
8.4%
㈜제이에이그린 78
 
7.9%
㈜씨엔에스 43
 
4.4%
㈜케이에스자원개발 32
 
3.2%
㈜천지환경개발 32
 
3.2%
㈜진흥중공업 26
 
2.6%
㈜디와이솔루션 22
 
2.2%
㈜센트로 21
 
2.1%
주식회사 19
 
1.9%
Other values (127) 538
54.6%
2024-03-18T13:31:23.236109image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
830
 
10.3%
729
 
9.1%
+ 675
 
8.4%
363
 
4.5%
356
 
4.4%
259
 
3.2%
220
 
2.7%
167
 
2.1%
147
 
1.8%
136
 
1.7%
Other values (172) 4169
51.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5672
70.5%
Space Separator 830
 
10.3%
Other Symbol 729
 
9.1%
Math Symbol 675
 
8.4%
Decimal Number 49
 
0.6%
Open Punctuation 44
 
0.5%
Close Punctuation 44
 
0.5%
Other Punctuation 4
 
< 0.1%
Uppercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
363
 
6.4%
356
 
6.3%
259
 
4.6%
220
 
3.9%
167
 
2.9%
147
 
2.6%
136
 
2.4%
134
 
2.4%
129
 
2.3%
126
 
2.2%
Other values (161) 3635
64.1%
Decimal Number
ValueCountFrequency (%)
1 32
65.3%
3 16
32.7%
2 1
 
2.0%
Uppercase Letter
ValueCountFrequency (%)
N 2
50.0%
C 2
50.0%
Space Separator
ValueCountFrequency (%)
830
100.0%
Other Symbol
ValueCountFrequency (%)
729
100.0%
Math Symbol
ValueCountFrequency (%)
+ 675
100.0%
Open Punctuation
ValueCountFrequency (%)
( 44
100.0%
Close Punctuation
ValueCountFrequency (%)
) 44
100.0%
Other Punctuation
ValueCountFrequency (%)
, 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6401
79.5%
Common 1646
 
20.4%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
729
 
11.4%
363
 
5.7%
356
 
5.6%
259
 
4.0%
220
 
3.4%
167
 
2.6%
147
 
2.3%
136
 
2.1%
134
 
2.1%
129
 
2.0%
Other values (162) 3761
58.8%
Common
ValueCountFrequency (%)
830
50.4%
+ 675
41.0%
( 44
 
2.7%
) 44
 
2.7%
1 32
 
1.9%
3 16
 
1.0%
, 4
 
0.2%
2 1
 
0.1%
Latin
ValueCountFrequency (%)
N 2
50.0%
C 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5672
70.5%
ASCII 1650
 
20.5%
None 729
 
9.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
830
50.3%
+ 675
40.9%
( 44
 
2.7%
) 44
 
2.7%
1 32
 
1.9%
3 16
 
1.0%
, 4
 
0.2%
N 2
 
0.1%
C 2
 
0.1%
2 1
 
0.1%
None
ValueCountFrequency (%)
729
100.0%
Hangul
ValueCountFrequency (%)
363
 
6.4%
356
 
6.3%
259
 
4.6%
220
 
3.9%
167
 
2.9%
147
 
2.6%
136
 
2.4%
134
 
2.4%
129
 
2.3%
126
 
2.2%
Other values (161) 3635
64.1%
Distinct90
Distinct (%)32.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-18T13:31:23.442697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length586
Median length555
Mean length42.603571
Min length2

Characters and Unicode

Total characters11929
Distinct characters90
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique58 ?
Unique (%)20.7%

Sample

1st row매립(민간관리형), 중간처리(고형화), 매립(민간관리형)
2nd row중간처분(일반소각)
3rd row재활용(중간가공폐기물제조)
4th row재활용(중간가공폐기물제조)
5th row매립(관리형), 고형화, 매립(관리형)
ValueCountFrequency (%)
매립(민간관리형매립시설 202
21.0%
재활용(중간가공폐기물제조 131
13.6%
중간처분(일반소각 118
12.3%
재활용(성토재복토재등사용 59
 
6.1%
재활용(직접제품제조 56
 
5.8%
민간관리형매립 45
 
4.7%
재활용(원료제조 29
 
3.0%
일반소각 27
 
2.8%
재활용(연료고형연료제품제조 25
 
2.6%
고형화 17
 
1.8%
Other values (73) 254
26.4%
2024-03-18T13:31:23.760438image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
( 820
 
6.9%
) 819
 
6.9%
737
 
6.2%
, 654
 
5.5%
591
 
5.0%
520
 
4.4%
489
 
4.1%
485
 
4.1%
485
 
4.1%
395
 
3.3%
Other values (80) 5934
49.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8795
73.7%
Open Punctuation 821
 
6.9%
Close Punctuation 820
 
6.9%
Space Separator 737
 
6.2%
Other Punctuation 656
 
5.5%
Decimal Number 100
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
591
 
6.7%
520
 
5.9%
489
 
5.6%
485
 
5.5%
485
 
5.5%
395
 
4.5%
391
 
4.4%
377
 
4.3%
329
 
3.7%
289
 
3.3%
Other values (66) 4444
50.5%
Decimal Number
ValueCountFrequency (%)
0 35
35.0%
1 27
27.0%
2 25
25.0%
3 9
 
9.0%
6 2
 
2.0%
4 1
 
1.0%
9 1
 
1.0%
Open Punctuation
ValueCountFrequency (%)
( 820
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 819
99.9%
] 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
, 654
99.7%
· 2
 
0.3%
Space Separator
ValueCountFrequency (%)
737
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8795
73.7%
Common 3134
 
26.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
591
 
6.7%
520
 
5.9%
489
 
5.6%
485
 
5.5%
485
 
5.5%
395
 
4.5%
391
 
4.4%
377
 
4.3%
329
 
3.7%
289
 
3.3%
Other values (66) 4444
50.5%
Common
ValueCountFrequency (%)
( 820
26.2%
) 819
26.1%
737
23.5%
, 654
20.9%
0 35
 
1.1%
1 27
 
0.9%
2 25
 
0.8%
3 9
 
0.3%
6 2
 
0.1%
· 2
 
0.1%
Other values (4) 4
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8795
73.7%
ASCII 3132
 
26.3%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
( 820
26.2%
) 819
26.1%
737
23.5%
, 654
20.9%
0 35
 
1.1%
1 27
 
0.9%
2 25
 
0.8%
3 9
 
0.3%
6 2
 
0.1%
4 1
 
< 0.1%
Other values (3) 3
 
0.1%
Hangul
ValueCountFrequency (%)
591
 
6.7%
520
 
5.9%
489
 
5.6%
485
 
5.5%
485
 
5.5%
395
 
4.5%
391
 
4.4%
377
 
4.3%
329
 
3.7%
289
 
3.3%
Other values (66) 4444
50.5%
None
ValueCountFrequency (%)
· 2
100.0%
Distinct202
Distinct (%)72.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2024-03-18T13:31:23.988271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length61
Median length38
Mean length23.489286
Min length5

Characters and Unicode

Total characters6577
Distinct characters134
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique170 ?
Unique (%)60.7%

Sample

1st row인천광역시 옹진군 백령면 북포리 225-9 외1필지
2nd row인천광역시 옹진군 북포리 1번지 사서함 603-30-8호
3rd row서울특별시 금천구 가산디지털2로 53
4th row서울특별시 금천구 가산디지털2로 53
5th row인천광역시 옹진군 영흥면 내리 237
ValueCountFrequency (%)
옹진군 227
 
16.1%
인천광역시 224
 
15.9%
영흥면 86
 
6.1%
백령면 52
 
3.7%
연평면 44
 
3.1%
일원 41
 
2.9%
북도면 28
 
2.0%
영흥남로 26
 
1.8%
293번길 24
 
1.7%
75 24
 
1.7%
Other values (298) 636
45.0%
2024-03-18T13:31:24.369992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1190
 
18.1%
255
 
3.9%
247
 
3.8%
242
 
3.7%
242
 
3.7%
242
 
3.7%
233
 
3.5%
230
 
3.5%
229
 
3.5%
224
 
3.4%
Other values (124) 3243
49.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4085
62.1%
Space Separator 1190
 
18.1%
Decimal Number 1075
 
16.3%
Dash Punctuation 116
 
1.8%
Other Punctuation 63
 
1.0%
Close Punctuation 24
 
0.4%
Open Punctuation 24
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
255
 
6.2%
247
 
6.0%
242
 
5.9%
242
 
5.9%
242
 
5.9%
233
 
5.7%
230
 
5.6%
229
 
5.6%
224
 
5.5%
150
 
3.7%
Other values (108) 1791
43.8%
Decimal Number
ValueCountFrequency (%)
1 199
18.5%
2 165
15.3%
7 137
12.7%
3 136
12.7%
5 91
8.5%
4 81
7.5%
9 74
 
6.9%
8 71
 
6.6%
6 64
 
6.0%
0 57
 
5.3%
Other Punctuation
ValueCountFrequency (%)
, 62
98.4%
. 1
 
1.6%
Space Separator
ValueCountFrequency (%)
1190
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 116
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4085
62.1%
Common 2492
37.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
255
 
6.2%
247
 
6.0%
242
 
5.9%
242
 
5.9%
242
 
5.9%
233
 
5.7%
230
 
5.6%
229
 
5.6%
224
 
5.5%
150
 
3.7%
Other values (108) 1791
43.8%
Common
ValueCountFrequency (%)
1190
47.8%
1 199
 
8.0%
2 165
 
6.6%
7 137
 
5.5%
3 136
 
5.5%
- 116
 
4.7%
5 91
 
3.7%
4 81
 
3.3%
9 74
 
3.0%
8 71
 
2.8%
Other values (6) 232
 
9.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4085
62.1%
ASCII 2492
37.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1190
47.8%
1 199
 
8.0%
2 165
 
6.6%
7 137
 
5.5%
3 136
 
5.5%
- 116
 
4.7%
5 91
 
3.7%
4 81
 
3.3%
9 74
 
3.0%
8 71
 
2.8%
Other values (6) 232
 
9.3%
Hangul
ValueCountFrequency (%)
255
 
6.2%
247
 
6.0%
242
 
5.9%
242
 
5.9%
242
 
5.9%
233
 
5.7%
230
 
5.6%
229
 
5.6%
224
 
5.5%
150
 
3.7%
Other values (108) 1791
43.8%

신고기준년도
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2020
95 
2019
79 
2022
42 
2021
36 
2023
28 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2019
2nd row2019
3rd row2019
4th row2019
5th row2019

Common Values

ValueCountFrequency (%)
2020 95
33.9%
2019 79
28.2%
2022 42
15.0%
2021 36
 
12.9%
2023 28
 
10.0%

Length

2024-03-18T13:31:24.485010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T13:31:24.581374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2020 95
33.9%
2019 79
28.2%
2022 42
15.0%
2021 36
 
12.9%
2023 28
 
10.0%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Minimum2023-08-30 00:00:00
Maximum2023-08-30 00:00:00
2024-03-18T13:31:24.667144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T13:31:24.741732image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-18T13:31:19.127032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T13:31:24.811766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번폐기물구분상호명폐기물종류사업자등록번호처리방법신고기준년도
연번1.0000.2840.7360.7570.6420.7850.996
폐기물구분0.2841.0000.9141.0000.8110.9400.081
상호명0.7360.9141.0000.9780.9990.9690.638
폐기물종류0.7571.0000.9781.0000.9790.9950.790
사업자등록번호0.6420.8110.9990.9791.0000.9370.396
처리방법0.7850.9400.9690.9950.9371.0000.789
신고기준년도0.9960.0810.6380.7900.3960.7891.000
2024-03-18T13:31:25.124775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
폐기물구분신고기준년도
폐기물구분1.0000.098
신고기준년도0.0981.000
2024-03-18T13:31:25.185018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번폐기물구분신고기준년도
연번1.0000.2140.906
폐기물구분0.2141.0000.098
신고기준년도0.9060.0981.000

Missing values

2024-03-18T13:31:19.270281image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T13:31:19.441258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번폐기물구분상호명폐기물종류사업자등록번호연락처운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자
01지정폐기물국방시설본부 경기남부시설단폐석면130-83-04732데이터미수집㈜네이처산업㈜센트로+ (유)에스앤피+ ㈜센트로매립(민간관리형), 중간처리(고형화), 매립(민간관리형)인천광역시 옹진군 백령면 북포리 225-9 외1필지20192023-08-30
12지정폐기물해병대9196부대격리의료폐기물(고상), 병리계폐기물(액상),병리계폐기물(고상), 손상성폐기물(액상), 일반의료폐기물(액상)121-83-02856032-837-3432㈜대산기업㈜스테리싸이클코리아중간처분(일반소각)인천광역시 옹진군 북포리 1번지 사서함 603-30-8호20192023-08-30
23사업장일반폐기물한국어촌어항공단폐합성수지220-82-00065032-6098-0782㈜더엘㈜이한산업재활용(중간가공폐기물제조)서울특별시 금천구 가산디지털2로 5320192023-08-30
34사업장일반폐기물한국어촌어항공단폐합성수지220-82-0006502-6098-0799㈜더엘㈜이한산업재활용(중간가공폐기물제조)서울특별시 금천구 가산디지털2로 5320192023-08-30
45지정폐기물인천광역시 남부교육지원청폐석면121-83-03516032-762-7361진성산업개발㈜에코시스템㈜포항+ (주)디와이솔루션+ 에코시스템㈜포항매립(관리형), 고형화, 매립(관리형)인천광역시 옹진군 영흥면 내리 23720192023-08-30
56지정폐기물개인폐석면데이터미수집데이터미수집㈜유니환경㈜디와이솔루션고형화인천광역시 옹진군 옹진군 백령면 백령로 292번길 5-2620192023-08-30
67사업장일반폐기물㈜선두종합건설임목폐목재131-81-64802032-472-0511주식회사 선용알씨대성연료재활용(직접 제품제조)인천광역시 남동구 소래로 630, 401호20192023-08-30
78사업장일반폐기물옹진군청폐합성수지121-83-00824032-899-2672인성환경㈜대일개발㈜일반소각(2101)인천광역시 미추홀구 매소홀로 12020192023-08-30
89사업장일반폐기물옹진군청폐합성수지121-83-00824032-899-2672인성환경㈜대일개발㈜일반소각(2101)인천광역시 미추홀구 매소홀로 12020192023-08-30
910사업장일반폐기물옹진군청폐합성수지121-83-00824032-899-2682㈜이알지서비스㈜이알지서비스일반소각(2101)인천광역시 미추홀구 매소홀로 12020192023-08-30
연번폐기물구분상호명폐기물종류사업자등록번호연락처운반자명처리업소명처리방법사업장도로명주소신고기준년도데이터기준일자
270271사업장일반폐기물옹진군청(관광문화과)임목폐목재121-83-00824032-899-2212한성환경대성연료재활용(직접 제품제조)영흥면 내리 산 123-420232023-08-30
271272사업장일반폐기물백령면사무소폐합성수지 , 폐합성섬유,폐타이어,폐목재류121-83-00367032-899-3524㈜한강건설환경㈜엔씨알중간가공폐기물제조진촌리 16725-7외 1필지20232023-08-30
272273사업장일반폐기물백령면사무소소각재(배출계)121-83-00367032-899-3524㈜한강건설환경㈜신진이앤씨중간가공폐기물제조진촌리 237620232023-08-30
273274사업장일반폐기물㈜남서해양개발폐합성수지류 폐합성수지류417-81-39885032-887-3444가로수환경㈜그린에코사이클화성2사업소 ,㈜신승에너지재활용(중간가공폐기물제조) 중간처분(일반소각)선재도 넛출항20232023-08-30
274275사업장일반폐기물영흥면사무소폐합성수지류121-83-00391032-899-3822㈜유원환경인홍상사㈜재활용(중간가공폐기물제조)영흥면 외리 산 234일원20232023-08-30
275276사업장일반폐기물금종건설㈜그밖의폐기물136-81-29703032-933-6215㈜한강건설환경경인환경에너지㈜일반소각자월면 이작리(해군 00부대 내)20232023-08-30
276277사업장일반폐기물북도면사무소폐합성수지121-83-00431032-899-3418㈜유원환경인홍상사㈜중간처리(중간가공폐기물제조)북도면 시도리 277-720232023-08-30
277278사업장일반폐기물옹진군청폐합성수지121-83-00824032-899-2672㈜옹진해운㈜케이비아이텍일반소각옹진군 일원20232023-08-30
278279사업장일반폐기물북도면사무소폐합성수지121-83-00431032-899-3418㈜유원환경인홍상사㈜중간처리(중간가공폐기물제조)북도면 장봉리 546-220232023-08-30
279280사업장일반폐기물㈜남서해양개발폐합성수지417-81-39885데이터미수집가로수환경㈜㈜신승에너지일반소각영흥면 내리 일원20232023-08-30