Overview

Dataset statistics

Number of variables9
Number of observations4657
Missing cells0
Missing cells (%)0.0%
Duplicate rows2
Duplicate rows (%)< 0.1%
Total size in memory332.1 KiB
Average record size in memory73.0 B

Variable types

Numeric1
Text5
Categorical3

Dataset

Description전라남도 목포시_사업장폐기물배출자 신고현황(상호, 신고일, 폐기물 종류, 처리방법, 사업장도로명주소, 사업장지번주소, 업무구분, 데이터기준일자)
URLhttps://www.data.go.kr/data/15060394/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 2 (< 0.1%) duplicate rowsDuplicates
번호 is highly overall correlated with 업무구분High correlation
처리방법 is highly overall correlated with 업무구분High correlation
업무구분 is highly overall correlated with 번호 and 1 other fieldsHigh correlation
업무구분 is highly imbalanced (64.9%)Imbalance

Reproduction

Analysis started2023-12-12 14:02:27.561585
Analysis finished2023-12-12 14:02:28.933580
Duration1.37 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION 

Distinct4475
Distinct (%)96.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2326.7576
Minimum1
Maximum4657
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size41.1 KiB
2023-12-12T23:02:29.017500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile233.8
Q11165
median2329
Q33493
95-th percentile4424.2
Maximum4657
Range4656
Interquartile range (IQR)2328

Descriptive statistics

Standard deviation1344.7164
Coefficient of variation (CV)0.57793576
Kurtosis-1.2010279
Mean2326.7576
Median Absolute Deviation (MAD)1164
Skewness0.0046693977
Sum10835710
Variance1808262.2
MonotonicityNot monotonic
2023-12-12T23:02:29.156262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2121 4
 
0.1%
2151 2
 
< 0.1%
2149 2
 
< 0.1%
212 2
 
< 0.1%
1321 2
 
< 0.1%
211 2
 
< 0.1%
1821 2
 
< 0.1%
210 2
 
< 0.1%
321 2
 
< 0.1%
2147 2
 
< 0.1%
Other values (4465) 4635
99.5%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
ValueCountFrequency (%)
4657 1
< 0.1%
4656 1
< 0.1%
4655 1
< 0.1%
4654 1
< 0.1%
4653 1
< 0.1%
4652 1
< 0.1%
4651 1
< 0.1%
4650 1
< 0.1%
4649 1
< 0.1%
4648 1
< 0.1%

상호
Text

Distinct1395
Distinct (%)30.0%
Missing0
Missing (%)0.0%
Memory size36.5 KiB
2023-12-12T23:02:29.470843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length7.4292463
Min length1

Characters and Unicode

Total characters34598
Distinct characters425
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique623 ?
Unique (%)13.4%

Sample

1st row의료법인 우성의료재단 목포효노인전문병원
2nd row(주)한국해운
3rd row(주)한국해운
4th row(주)한국해운
5th row(주)한국해운
ValueCountFrequency (%)
주)한흥 110
 
2.3%
유)전남건설 80
 
1.6%
목포지방해양항만청 68
 
1.4%
유)일신건설 63
 
1.3%
주식회사 54
 
1.1%
목포시청 52
 
1.1%
유)유진건설 50
 
1.0%
목포시(해양수산과 49
 
1.0%
주)건국종합건설 48
 
1.0%
목포해양대학교 47
 
1.0%
Other values (1383) 4233
87.2%
2023-12-12T23:02:30.013519image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 3183
 
9.2%
( 3052
 
8.8%
2178
 
6.3%
2034
 
5.9%
1964
 
5.7%
1218
 
3.5%
784
 
2.3%
768
 
2.2%
761
 
2.2%
722
 
2.1%
Other values (415) 17934
51.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 28060
81.1%
Close Punctuation 3183
 
9.2%
Open Punctuation 3052
 
8.8%
Space Separator 219
 
0.6%
Uppercase Letter 45
 
0.1%
Decimal Number 19
 
0.1%
Other Punctuation 8
 
< 0.1%
Dash Punctuation 5
 
< 0.1%
Lowercase Letter 4
 
< 0.1%
Connector Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2178
 
7.8%
2034
 
7.2%
1964
 
7.0%
1218
 
4.3%
784
 
2.8%
768
 
2.7%
761
 
2.7%
722
 
2.6%
515
 
1.8%
504
 
1.8%
Other values (387) 16612
59.2%
Uppercase Letter
ValueCountFrequency (%)
P 8
17.8%
R 6
13.3%
F 6
13.3%
G 6
13.3%
K 4
8.9%
L 3
 
6.7%
T 2
 
4.4%
N 2
 
4.4%
E 2
 
4.4%
H 2
 
4.4%
Other values (3) 4
8.9%
Decimal Number
ValueCountFrequency (%)
2 8
42.1%
1 5
26.3%
3 4
21.1%
5 1
 
5.3%
6 1
 
5.3%
Lowercase Letter
ValueCountFrequency (%)
k 2
50.0%
t 1
25.0%
s 1
25.0%
Other Punctuation
ValueCountFrequency (%)
. 5
62.5%
/ 3
37.5%
Close Punctuation
ValueCountFrequency (%)
) 3183
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3052
100.0%
Space Separator
ValueCountFrequency (%)
219
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 28060
81.1%
Common 6489
 
18.8%
Latin 49
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2178
 
7.8%
2034
 
7.2%
1964
 
7.0%
1218
 
4.3%
784
 
2.8%
768
 
2.7%
761
 
2.7%
722
 
2.6%
515
 
1.8%
504
 
1.8%
Other values (387) 16612
59.2%
Latin
ValueCountFrequency (%)
P 8
16.3%
R 6
12.2%
F 6
12.2%
G 6
12.2%
K 4
8.2%
L 3
 
6.1%
T 2
 
4.1%
N 2
 
4.1%
E 2
 
4.1%
k 2
 
4.1%
Other values (6) 8
16.3%
Common
ValueCountFrequency (%)
) 3183
49.1%
( 3052
47.0%
219
 
3.4%
2 8
 
0.1%
. 5
 
0.1%
1 5
 
0.1%
- 5
 
0.1%
3 4
 
0.1%
/ 3
 
< 0.1%
_ 3
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 28060
81.1%
ASCII 6538
 
18.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 3183
48.7%
( 3052
46.7%
219
 
3.3%
P 8
 
0.1%
2 8
 
0.1%
R 6
 
0.1%
F 6
 
0.1%
G 6
 
0.1%
. 5
 
0.1%
1 5
 
0.1%
Other values (18) 40
 
0.6%
Hangul
ValueCountFrequency (%)
2178
 
7.8%
2034
 
7.2%
1964
 
7.0%
1218
 
4.3%
784
 
2.8%
768
 
2.7%
761
 
2.7%
722
 
2.6%
515
 
1.8%
504
 
1.8%
Other values (387) 16612
59.2%
Distinct1214
Distinct (%)26.1%
Missing0
Missing (%)0.0%
Memory size36.5 KiB
2023-12-12T23:02:30.277699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters65198
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique483 ?
Unique (%)10.4%

Sample

1st row2023 년05 월16 일
2nd row2023 년03 월21 일
3rd row2023 년03 월21 일
4th row2023 년03 월21 일
5th row2023 년03 월21 일
ValueCountFrequency (%)
4657
25.0%
2003 1172
 
6.3%
2002 1153
 
6.2%
2004 1057
 
5.7%
년05 501
 
2.7%
년06 431
 
2.3%
년08 415
 
2.2%
년11 413
 
2.2%
년12 413
 
2.2%
년07 403
 
2.2%
Other values (58) 8013
43.0%
2023-12-12T23:02:30.718810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 14128
21.7%
13971
21.4%
2 8739
13.4%
1 5142
 
7.9%
4657
 
7.1%
4657
 
7.1%
4657
 
7.1%
3 2311
 
3.5%
4 2020
 
3.1%
9 1031
 
1.6%
Other values (4) 3885
 
6.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 37256
57.1%
Space Separator 13971
 
21.4%
Other Letter 13971
 
21.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 14128
37.9%
2 8739
23.5%
1 5142
 
13.8%
3 2311
 
6.2%
4 2020
 
5.4%
9 1031
 
2.8%
5 1011
 
2.7%
7 1004
 
2.7%
8 963
 
2.6%
6 907
 
2.4%
Other Letter
ValueCountFrequency (%)
4657
33.3%
4657
33.3%
4657
33.3%
Space Separator
ValueCountFrequency (%)
13971
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 51227
78.6%
Hangul 13971
 
21.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 14128
27.6%
13971
27.3%
2 8739
17.1%
1 5142
 
10.0%
3 2311
 
4.5%
4 2020
 
3.9%
9 1031
 
2.0%
5 1011
 
2.0%
7 1004
 
2.0%
8 963
 
1.9%
Hangul
ValueCountFrequency (%)
4657
33.3%
4657
33.3%
4657
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 51227
78.6%
Hangul 13971
 
21.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 14128
27.6%
13971
27.3%
2 8739
17.1%
1 5142
 
10.0%
3 2311
 
4.5%
4 2020
 
3.9%
9 1031
 
2.0%
5 1011
 
2.0%
7 1004
 
2.0%
8 963
 
1.9%
Hangul
ValueCountFrequency (%)
4657
33.3%
4657
33.3%
4657
33.3%
Distinct89
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size36.5 KiB
2023-12-12T23:02:31.068855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length88
Median length5
Mean length8.9407344
Min length1

Characters and Unicode

Total characters41637
Distinct characters186
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)0.4%

Sample

1st row그 밖의 폐섬유
2nd row폐합성수지류(폐염화비닐수지류는 제외한다)
3rd row폐합성수지류(폐염화비닐수지류는 제외한다)
4th row폐합성수지류(폐염화비닐수지류는 제외한다)
5th row폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(원목상태의 깨끗한 목재를 말한다)
ValueCountFrequency (%)
폐콘크리트 1374
18.9%
건설폐기물 577
 
7.9%
제외한다 473
 
6.5%
폐합성수지류(폐염화비닐수지류는 425
 
5.8%
건설폐재류 384
 
5.3%
폐아스팔트콘크리트 383
 
5.3%
토사 376
 
5.2%
폐합성수지류 221
 
3.0%
등을 199
 
2.7%
폐목재류 160
 
2.2%
Other values (158) 2714
37.2%
2023-12-12T23:02:31.580363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4793
 
11.5%
2672
 
6.4%
2232
 
5.4%
1975
 
4.7%
1802
 
4.3%
1761
 
4.2%
1761
 
4.2%
1468
 
3.5%
1220
 
2.9%
1091
 
2.6%
Other values (176) 20862
50.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 37090
89.1%
Space Separator 2672
 
6.4%
Open Punctuation 668
 
1.6%
Close Punctuation 668
 
1.6%
Connector Punctuation 450
 
1.1%
Decimal Number 47
 
0.1%
Other Punctuation 42
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4793
 
12.9%
2232
 
6.0%
1975
 
5.3%
1802
 
4.9%
1761
 
4.7%
1761
 
4.7%
1468
 
4.0%
1220
 
3.3%
1091
 
2.9%
1084
 
2.9%
Other values (165) 17903
48.3%
Decimal Number
ValueCountFrequency (%)
1 31
66.0%
2 11
 
23.4%
3 4
 
8.5%
8 1
 
2.1%
Open Punctuation
ValueCountFrequency (%)
( 666
99.7%
2
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 666
99.7%
2
 
0.3%
Space Separator
ValueCountFrequency (%)
2672
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 450
100.0%
Other Punctuation
ValueCountFrequency (%)
. 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 37090
89.1%
Common 4547
 
10.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4793
 
12.9%
2232
 
6.0%
1975
 
5.3%
1802
 
4.9%
1761
 
4.7%
1761
 
4.7%
1468
 
4.0%
1220
 
3.3%
1091
 
2.9%
1084
 
2.9%
Other values (165) 17903
48.3%
Common
ValueCountFrequency (%)
2672
58.8%
( 666
 
14.6%
) 666
 
14.6%
_ 450
 
9.9%
. 42
 
0.9%
1 31
 
0.7%
2 11
 
0.2%
3 4
 
0.1%
2
 
< 0.1%
2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 37017
88.9%
ASCII 4543
 
10.9%
Compat Jamo 73
 
0.2%
None 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4793
 
12.9%
2232
 
6.0%
1975
 
5.3%
1802
 
4.9%
1761
 
4.8%
1761
 
4.8%
1468
 
4.0%
1220
 
3.3%
1091
 
2.9%
1084
 
2.9%
Other values (164) 17830
48.2%
ASCII
ValueCountFrequency (%)
2672
58.8%
( 666
 
14.7%
) 666
 
14.7%
_ 450
 
9.9%
. 42
 
0.9%
1 31
 
0.7%
2 11
 
0.2%
3 4
 
0.1%
8 1
 
< 0.1%
Compat Jamo
ValueCountFrequency (%)
73
100.0%
None
ValueCountFrequency (%)
2
50.0%
2
50.0%

처리방법
Categorical

HIGH CORRELATION 

Distinct33
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size36.5 KiB
재활용(파쇄.분쇄)
1971 
중간처분(일반소각)
767 
중간처분(파쇄.분쇄)
711 
재활용(중간가공폐기물 제조)
405 
파쇄.절단
263 
Other values (28)
540 

Length

Max length19
Median length10
Mean length10.422375
Min length1

Unique

Unique5 ?
Unique (%)0.1%

Sample

1st row중간처분(일반소각)
2nd row재활용(중간가공폐기물 제조)
3rd row중간처분(일반소각)
4th row재활용(중간가공폐기물 제조)
5th row재활용(중간가공폐기물 제조)

Common Values

ValueCountFrequency (%)
재활용(파쇄.분쇄) 1971
42.3%
중간처분(일반소각) 767
 
16.5%
중간처분(파쇄.분쇄) 711
 
15.3%
재활용(중간가공폐기물 제조) 405
 
8.7%
파쇄.절단 263
 
5.6%
재활용(기타) 114
 
2.4%
재활용(직접 제품제조) 78
 
1.7%
재활용(연료·고형연료제품 제조) 58
 
1.2%
재활용(농업생산활동에 사용) 49
 
1.1%
매립(민간관리형매립시설) 39
 
0.8%
Other values (23) 202
 
4.3%

Length

2023-12-12T23:02:31.769877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
재활용(파쇄.분쇄 1971
37.2%
중간처분(일반소각 767
 
14.5%
중간처분(파쇄.분쇄 711
 
13.4%
제조 489
 
9.2%
재활용(중간가공폐기물 405
 
7.7%
파쇄.절단 263
 
5.0%
재활용(기타 114
 
2.2%
재활용(직접 80
 
1.5%
제품제조 78
 
1.5%
사용 73
 
1.4%
Other values (27) 342
 
6.5%
Distinct925
Distinct (%)19.9%
Missing0
Missing (%)0.0%
Memory size36.5 KiB
2023-12-12T23:02:32.119635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length51
Median length48
Mean length16.053038
Min length1

Characters and Unicode

Total characters74759
Distinct characters326
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique420 ?
Unique (%)9.0%

Sample

1st row전라남도 목포시 영산로 143 (호남동)
2nd row전라남도 여수시 오천3길 33 (오천동)
3rd row전라남도 여수시 오천3길 33 (오천동)
4th row전라남도 여수시 오천3길 33 (오천동)
5th row전라남도 여수시 오천3길 33 (오천동)
ValueCountFrequency (%)
전라남도 2648
 
17.5%
목포시 2353
 
15.6%
용당동 562
 
3.7%
상동 475
 
3.1%
양을로 344
 
2.3%
산정동 288
 
1.9%
203 242
 
1.6%
연산동 218
 
1.4%
옥암동 217
 
1.4%
석현동 129
 
0.9%
Other values (1352) 7646
50.6%
2023-12-12T23:02:32.599293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
15701
21.0%
2908
 
3.9%
2899
 
3.9%
2843
 
3.8%
2744
 
3.7%
( 2722
 
3.6%
) 2722
 
3.6%
2671
 
3.6%
2671
 
3.6%
2664
 
3.6%
Other values (316) 34214
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 42408
56.7%
Space Separator 15701
 
21.0%
Decimal Number 10082
 
13.5%
Open Punctuation 2722
 
3.6%
Close Punctuation 2722
 
3.6%
Dash Punctuation 568
 
0.8%
Connector Punctuation 515
 
0.7%
Uppercase Letter 32
 
< 0.1%
Other Punctuation 8
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2908
 
6.9%
2899
 
6.8%
2843
 
6.7%
2744
 
6.5%
2671
 
6.3%
2671
 
6.3%
2664
 
6.3%
2488
 
5.9%
2487
 
5.9%
1061
 
2.5%
Other values (286) 16972
40.0%
Decimal Number
ValueCountFrequency (%)
1 2275
22.6%
2 1600
15.9%
3 1547
15.3%
0 874
 
8.7%
4 729
 
7.2%
5 713
 
7.1%
8 656
 
6.5%
6 613
 
6.1%
7 561
 
5.6%
9 514
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
A 11
34.4%
L 4
 
12.5%
B 4
 
12.5%
H 3
 
9.4%
D 2
 
6.2%
C 2
 
6.2%
K 2
 
6.2%
T 2
 
6.2%
J 1
 
3.1%
S 1
 
3.1%
Other Punctuation
ValueCountFrequency (%)
. 3
37.5%
/ 2
25.0%
@ 2
25.0%
: 1
 
12.5%
Space Separator
ValueCountFrequency (%)
15701
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2722
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2722
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 568
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 515
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 42408
56.7%
Common 32319
43.2%
Latin 32
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2908
 
6.9%
2899
 
6.8%
2843
 
6.7%
2744
 
6.5%
2671
 
6.3%
2671
 
6.3%
2664
 
6.3%
2488
 
5.9%
2487
 
5.9%
1061
 
2.5%
Other values (286) 16972
40.0%
Common
ValueCountFrequency (%)
15701
48.6%
( 2722
 
8.4%
) 2722
 
8.4%
1 2275
 
7.0%
2 1600
 
5.0%
3 1547
 
4.8%
0 874
 
2.7%
4 729
 
2.3%
5 713
 
2.2%
8 656
 
2.0%
Other values (10) 2780
 
8.6%
Latin
ValueCountFrequency (%)
A 11
34.4%
L 4
 
12.5%
B 4
 
12.5%
H 3
 
9.4%
D 2
 
6.2%
C 2
 
6.2%
K 2
 
6.2%
T 2
 
6.2%
J 1
 
3.1%
S 1
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 42408
56.7%
ASCII 32351
43.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
15701
48.5%
( 2722
 
8.4%
) 2722
 
8.4%
1 2275
 
7.0%
2 1600
 
4.9%
3 1547
 
4.8%
0 874
 
2.7%
4 729
 
2.3%
5 713
 
2.2%
8 656
 
2.0%
Other values (20) 2812
 
8.7%
Hangul
ValueCountFrequency (%)
2908
 
6.9%
2899
 
6.8%
2843
 
6.7%
2744
 
6.5%
2671
 
6.3%
2671
 
6.3%
2664
 
6.3%
2488
 
5.9%
2487
 
5.9%
1061
 
2.5%
Other values (286) 16972
40.0%
Distinct1611
Distinct (%)34.6%
Missing0
Missing (%)0.0%
Memory size36.5 KiB
2023-12-12T23:02:32.898654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length43
Mean length20.499893
Min length1

Characters and Unicode

Total characters95468
Distinct characters356
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique676 ?
Unique (%)14.5%

Sample

1st row전라남도 목포시 호남동 446
2nd row전라남도 여수시 오천동 135-8
3rd row전라남도 여수시 오천동 135-8
4th row전라남도 여수시 오천동 135-8
5th row전라남도 여수시 오천동 135-8
ValueCountFrequency (%)
전라남도 4245
21.7%
목포시 3862
19.7%
경동1가 808
 
4.1%
용당동 636
 
3.3%
상동 570
 
2.9%
산정동 493
 
2.5%
옥암동 261
 
1.3%
1188-2 251
 
1.3%
연산동 234
 
1.2%
석현동 159
 
0.8%
Other values (2032) 8045
41.1%
2023-12-12T23:02:33.310376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20795
21.8%
1 5979
 
6.3%
4508
 
4.7%
4494
 
4.7%
4459
 
4.7%
4361
 
4.6%
4282
 
4.5%
4281
 
4.5%
4054
 
4.2%
4032
 
4.2%
Other values (346) 34223
35.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 51054
53.5%
Space Separator 20795
21.8%
Decimal Number 20058
 
21.0%
Dash Punctuation 3393
 
3.6%
Math Symbol 42
 
< 0.1%
Connector Punctuation 42
 
< 0.1%
Close Punctuation 25
 
< 0.1%
Uppercase Letter 23
 
< 0.1%
Open Punctuation 22
 
< 0.1%
Other Punctuation 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4508
 
8.8%
4494
 
8.8%
4459
 
8.7%
4361
 
8.5%
4282
 
8.4%
4281
 
8.4%
4054
 
7.9%
4032
 
7.9%
977
 
1.9%
927
 
1.8%
Other values (316) 14679
28.8%
Decimal Number
ValueCountFrequency (%)
1 5979
29.8%
2 2169
 
10.8%
8 1850
 
9.2%
0 1697
 
8.5%
3 1582
 
7.9%
7 1429
 
7.1%
5 1427
 
7.1%
9 1350
 
6.7%
4 1311
 
6.5%
6 1264
 
6.3%
Uppercase Letter
ValueCountFrequency (%)
L 4
17.4%
B 4
17.4%
A 3
13.0%
H 3
13.0%
C 2
8.7%
D 2
8.7%
K 2
8.7%
T 2
8.7%
S 1
 
4.3%
Other Punctuation
ValueCountFrequency (%)
@ 9
75.0%
/ 2
 
16.7%
. 1
 
8.3%
Lowercase Letter
ValueCountFrequency (%)
o 1
50.0%
k 1
50.0%
Space Separator
ValueCountFrequency (%)
20795
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3393
100.0%
Math Symbol
ValueCountFrequency (%)
~ 42
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 42
100.0%
Close Punctuation
ValueCountFrequency (%)
) 25
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 51054
53.5%
Common 44389
46.5%
Latin 25
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4508
 
8.8%
4494
 
8.8%
4459
 
8.7%
4361
 
8.5%
4282
 
8.4%
4281
 
8.4%
4054
 
7.9%
4032
 
7.9%
977
 
1.9%
927
 
1.8%
Other values (316) 14679
28.8%
Common
ValueCountFrequency (%)
20795
46.8%
1 5979
 
13.5%
- 3393
 
7.6%
2 2169
 
4.9%
8 1850
 
4.2%
0 1697
 
3.8%
3 1582
 
3.6%
7 1429
 
3.2%
5 1427
 
3.2%
9 1350
 
3.0%
Other values (9) 2718
 
6.1%
Latin
ValueCountFrequency (%)
L 4
16.0%
B 4
16.0%
A 3
12.0%
H 3
12.0%
C 2
8.0%
D 2
8.0%
K 2
8.0%
T 2
8.0%
S 1
 
4.0%
o 1
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 51054
53.5%
ASCII 44414
46.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20795
46.8%
1 5979
 
13.5%
- 3393
 
7.6%
2 2169
 
4.9%
8 1850
 
4.2%
0 1697
 
3.8%
3 1582
 
3.6%
7 1429
 
3.2%
5 1427
 
3.2%
9 1350
 
3.0%
Other values (20) 2743
 
6.2%
Hangul
ValueCountFrequency (%)
4508
 
8.8%
4494
 
8.8%
4459
 
8.7%
4361
 
8.5%
4282
 
8.4%
4281
 
8.4%
4054
 
7.9%
4032
 
7.9%
977
 
1.9%
927
 
1.8%
Other values (316) 14679
28.8%

업무구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size36.5 KiB
사업장폐기물배출자(2-2호)
4350 
사업장폐기물배출자(2호)
 
307

Length

Max length15
Median length15
Mean length14.868155
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사업장폐기물배출자(2호)
2nd row사업장폐기물배출자(2호)
3rd row사업장폐기물배출자(2호)
4th row사업장폐기물배출자(2호)
5th row사업장폐기물배출자(2호)

Common Values

ValueCountFrequency (%)
사업장폐기물배출자(2-2호) 4350
93.4%
사업장폐기물배출자(2호) 307
 
6.6%

Length

2023-12-12T23:02:33.445953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:02:33.539644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
사업장폐기물배출자(2-2호 4350
93.4%
사업장폐기물배출자(2호 307
 
6.6%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size36.5 KiB
2023-06-21
4657 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-06-21
2nd row2023-06-21
3rd row2023-06-21
4th row2023-06-21
5th row2023-06-21

Common Values

ValueCountFrequency (%)
2023-06-21 4657
100.0%

Length

2023-12-12T23:02:33.681300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:02:33.751991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-06-21 4657
100.0%

Interactions

2023-12-12T23:02:28.592259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:02:33.801024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호폐기물 종류처리방법업무구분
번호1.0000.8130.8380.942
폐기물 종류0.8131.0000.9590.826
처리방법0.8380.9591.0000.638
업무구분0.9420.8260.6381.000
2023-12-12T23:02:33.882613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업무구분처리방법
업무구분1.0000.547
처리방법0.5471.000
2023-12-12T23:02:33.958984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호처리방법업무구분
번호1.0000.4850.796
처리방법0.4851.0000.547
업무구분0.7960.5471.000

Missing values

2023-12-12T23:02:28.704708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:02:28.855092image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호상호신고일폐기물 종류처리방법사업장도로명주소사업장지번주소업무구분데이터기준일자
01의료법인 우성의료재단 목포효노인전문병원2023 년05 월16 일그 밖의 폐섬유중간처분(일반소각)전라남도 목포시 영산로 143 (호남동)전라남도 목포시 호남동 446사업장폐기물배출자(2호)2023-06-21
12(주)한국해운2023 년03 월21 일폐합성수지류(폐염화비닐수지류는 제외한다)재활용(중간가공폐기물 제조)전라남도 여수시 오천3길 33 (오천동)전라남도 여수시 오천동 135-8사업장폐기물배출자(2호)2023-06-21
23(주)한국해운2023 년03 월21 일폐합성수지류(폐염화비닐수지류는 제외한다)중간처분(일반소각)전라남도 여수시 오천3길 33 (오천동)전라남도 여수시 오천동 135-8사업장폐기물배출자(2호)2023-06-21
34(주)한국해운2023 년03 월21 일폐합성수지류(폐염화비닐수지류는 제외한다)재활용(중간가공폐기물 제조)전라남도 여수시 오천3길 33 (오천동)전라남도 여수시 오천동 135-8사업장폐기물배출자(2호)2023-06-21
45(주)한국해운2023 년03 월21 일폐가구류_ 폐도장목_ 폐목재포장재_ 폐전선드럼(원목상태의 깨끗한 목재를 말한다)재활용(중간가공폐기물 제조)전라남도 여수시 오천3길 33 (오천동)전라남도 여수시 오천동 135-8사업장폐기물배출자(2호)2023-06-21
56(주)한국해운2023 년03 월21 일음식물류폐기물재활용(농업생산활동에 사용)전라남도 여수시 오천3길 33 (오천동)전라남도 여수시 오천동 135-8사업장폐기물배출자(2호)2023-06-21
67(주)한진2023 년03 월21 일폐합성수지류(폐염화비닐수지류는 제외한다)재활용(중간가공폐기물 제조)서울특별시 중구 남대문로 63 (소공동)서울특별시 중구 소공동 32-7사업장폐기물배출자(2호)2023-06-21
78목포더좋은요양병원2023 년02 월14 일그 밖의 폐섬유중간처분(일반소각)전라남도 목포시 용당로 334 (용해동)전라남도 목포시 용해동 413사업장폐기물배출자(2호)2023-06-21
89목포더좋은요양병원2023 년02 월06 일폐합성수지류(폐염화비닐수지류는 제외한다)재활용(중간가공폐기물 제조)전라남도 목포시 용당로 334 (용해동)전라남도 목포시 용해동 413사업장폐기물배출자(2호)2023-06-21
910목포더좋은요양병원2023 년02 월06 일폐합성수지류(폐염화비닐수지류는 제외한다)재활용(중간가공폐기물 제조)전라남도 목포시 용당로 334 (용해동)전라남도 목포시 용해동 413사업장폐기물배출자(2호)2023-06-21
번호상호신고일폐기물 종류처리방법사업장도로명주소사업장지번주소업무구분데이터기준일자
46474648(유)금강환경2001 년01 월04 일건설폐재류파쇄.절단전라남도 영암군 삼호면 동호리 1035-10사업장폐기물배출자(2-2호)2023-06-21
46484649(유)금강환경2001 년01 월04 일폐콘크리트파쇄.절단전라남도 영암군 삼호면 동호리 1035-10사업장폐기물배출자(2-2호)2023-06-21
46494650우미건설(주)2002 년01 월21 일폐콘크리트파쇄.절단전라남도 담양군 금성면 금성산성길 260전라남도 담양군 금성면 대성리 40-2사업장폐기물배출자(2-2호)2023-06-21
46504651우미건설(주)2002 년01 월21 일건설폐기물파쇄.절단전라남도 담양군 금성면 금성산성길 260전라남도 담양군 금성면 대성리 40-2사업장폐기물배출자(2-2호)2023-06-21
46514652광남개발(주)2002 년01 월21 일폐목재류파쇄.절단광주광역시 동구 구성로 218 (대인동)광주광역시 동구 대인동 311-5사업장폐기물배출자(2-2호)2023-06-21
46524653광남개발(주)2002 년01 월21 일폐콘크리트파쇄.절단광주광역시 동구 구성로 218 (대인동)광주광역시 동구 대인동 311-5사업장폐기물배출자(2-2호)2023-06-21
46534654광남개발(주)2002 년01 월21 일건설폐기물파쇄.절단광주광역시 동구 구성로 218 (대인동)광주광역시 동구 대인동 311-5사업장폐기물배출자(2-2호)2023-06-21
46544655(주)우진건설2001 년12 월14 일건설폐기물파쇄.절단전라남도 목포시 석현동 815-8사업장폐기물배출자(2-2호)2023-06-21
46554656(주)우진건설2001 년12 월14 일건설폐재류파쇄.절단전라남도 목포시 석현동 815-8사업장폐기물배출자(2-2호)2023-06-21
46564657(주)우진건설2001 년12 월14 일폐콘크리트파쇄.절단전라남도 목포시 석현동 815-8사업장폐기물배출자(2-2호)2023-06-21

Duplicate rows

Most frequently occurring

번호상호신고일폐기물 종류처리방법사업장도로명주소사업장지번주소업무구분데이터기준일자# duplicates
0421목포시환경수도사업단2021 년05 월23 일임목폐목재(건설공사_ 산지개간 등의 과정에서 발생된 나무뿌리_ 가지_ 줄기 등을 말한다)재활용(중간가공폐기물 제조)전라남도 목포시 수문로 32 (남교동)전라남도 목포시 남교동 164사업장폐기물배출자(2-2호)2023-06-212
1721목포시(안전총괄과)2019 년03 월18 일폐합성수지류(폐염화비닐수지류는 제외한다)재활용(중간가공폐기물 제조)전라남도 목포시 양을로 203_ 목포시청 (용당동)전라남도 목포시 용당동 1188-2 목포시청사업장폐기물배출자(2-2호)2023-06-212