Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 390.6 KiB |
Average record size in memory | 40.0 B |
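The average record size appears to be the total memory size divided by the number of observations. A minimal check of that arithmetic, assuming the report's KiB means 1024 bytes (the variable names are illustrative and not part of the report):

```python
# Figures copied from the "Dataset statistics" table above.
total_size_kib = 390.6
n_rows = 10_000

# Assumption: 1 KiB = 1024 bytes.
total_bytes = total_size_kib * 1024
avg_record_bytes = total_bytes / n_rows

print(f"{avg_record_bytes:.1f} B")
```

Rounded to one decimal place, the result agrees with the 40.0 B shown in the table.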
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Dataset
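Description | Gimhae City AI-based bulky household waste training data, intended to lay the groundwork for policy decisions and work-process improvements through the use of big data |
---|---|
Author | Gimhae City, Gyeongsangnam-do (경상남도 김해시) |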
URL | https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15076741 |
파일명 has unique values | Unique |
Reproduction
Analysis started | 2024-03-11 03:33:14.232121 |
---|---|
Analysis finished | 2024-03-11 03:33:15.712207 |
Duration | 1.48 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
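The reported duration can be cross-checked against the two timestamps. A small sketch using only the standard library; the format string is an assumption about how the timestamps are laid out:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table above.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-03-11 03:33:14.232121", FMT)
finished = datetime.strptime("2024-03-11 03:33:15.712207", FMT)

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

The difference rounds to the 1.48 seconds listed under Duration.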
파일명
Text
UNIQUE
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 26 |
---|---|
Median length | 25 |
Mean length | 25.0054 |
Min length | 25 |
Characters and Unicode
Total characters | 250054 |
---|---|
Distinct characters | 21 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 10000 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 33980545_b4b664be76_3.jpg |
---|---|
2nd row | 33981417_8d3ceec2d9_1.jpg |
3rd row | 33987212_e3e8135854_2.jpg |
4th row | 33983923_62836e2906_1.jpg |
5th row | 33983829_b0f05d4615_3.jpg |
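The length and Unicode figures above can be reproduced in miniature on the five sample rows. The sketch below uses the standard `unicodedata` module; it only covers the sample, so its counts are not the report's totals, and the two-letter codes are the standard abbreviations for the category names used in the report (Nd = Decimal Number, Ll = Lowercase Letter, Pc = Connector Punctuation, Po = Other Punctuation).

```python
import unicodedata
from collections import Counter

# The five sample file names listed above.
sample = [
    "33980545_b4b664be76_3.jpg",
    "33981417_8d3ceec2d9_1.jpg",
    "33987212_e3e8135854_2.jpg",
    "33983923_62836e2906_1.jpg",
    "33983829_b0f05d4615_3.jpg",
]

# String length per file name.
lengths = [len(name) for name in sample]

# Character frequencies, then grouped by Unicode general category.
chars = Counter(ch for name in sample for ch in name)
categories = Counter()
for ch, count in chars.items():
    categories[unicodedata.category(ch)] += count

print("lengths:", lengths)
print("distinct characters:", len(chars))
print("categories:", dict(categories))
```

On these five rows every name has 25 characters and the same four categories appear as in the full column.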
Value | Count | Frequency (%) |
33980545_b4b664be76_3.jpg | 1 | < 0.1% |
35564736_75b63e21d8_2.jpg | 1 | < 0.1% |
33983241_114041329a_3.jpg | 1 | < 0.1% |
34002711_dd0d7d3b66_3.jpg | 1 | < 0.1% |
33990188_bb378f05cb_3.jpg | 1 | < 0.1% |
34013509_5e81b69731_1.jpg | 1 | < 0.1% |
33988095_d5eee9aaab_1.jpg | 1 | < 0.1% |
34014785_0b0f07c4a2_4.jpg | 1 | < 0.1% |
33976117_bd3c8e2bc9_2.jpg | 1 | < 0.1% |
33983002_e6e876ffa3_2.jpg | 1 | < 0.1% |
Other values (9990) | 9990 | 99.9% |
Most occurring characters
Value | Count | Frequency (%) |
3 | 28933 | 11.6% |
_ | 20000 | 8.0% |
5 | 18078 | 7.2% |
9 | 16901 | 6.8% |
1 | 13684 | 5.5% |
4 | 13564 | 5.4% |
2 | 12933 | 5.2% |
7 | 12770 | 5.1% |
0 | 12329 | 4.9% |
8 | 12072 | 4.8% |
Other values (11) | 88790 | 35.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 152617 | 61.0% |
Lowercase Letter | 67437 | 27.0% |
Connector Punctuation | 20000 | 8.0% |
Other Punctuation | 10000 | 4.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 28933 | 19.0% |
5 | 18078 | 11.8% |
9 | 16901 | 11.1% |
1 | 13684 | 9.0% |
4 | 13564 | 8.9% |
2 | 12933 | 8.5% |
7 | 12770 | 8.4% |
0 | 12329 | 8.1% |
8 | 12072 | 7.9% |
6 | 11353 | 7.4% |
Lowercase Letter
Value | Count | Frequency (%) |
j | 10000 | 14.8% |
p | 10000 | 14.8% |
g | 10000 | 14.8% |
a | 6401 | 9.5% |
c | 6280 | 9.3% |
f | 6256 | 9.3% |
b | 6238 | 9.3% |
e | 6207 | 9.2% |
d | 6055 | 9.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 20000 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 10000 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 182617 | 73.0% |
Latin | 67437 | 27.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
3 | 28933 | 15.8% |
_ | 20000 | 11.0% |
5 | 18078 | 9.9% |
9 | 16901 | 9.3% |
1 | 13684 | 7.5% |
4 | 13564 | 7.4% |
2 | 12933 | 7.1% |
7 | 12770 | 7.0% |
0 | 12329 | 6.8% |
8 | 12072 | 6.6% |
Other values (2) | 21353 | 11.7% |
Latin
Value | Count | Frequency (%) |
j | 10000 | 14.8% |
p | 10000 | 14.8% |
g | 10000 | 14.8% |
a | 6401 | 9.5% |
c | 6280 | 9.3% |
f | 6256 | 9.3% |
b | 6238 | 9.3% |
e | 6207 | 9.2% |
d | 6055 | 9.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 250054 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3 | 28933 | 11.6% |
_ | 20000 | 8.0% |
5 | 18078 | 7.2% |
9 | 16901 | 6.8% |
1 | 13684 | 5.5% |
4 | 13564 | 5.4% |
2 | 12933 | 5.2% |
7 | 12770 | 5.1% |
0 | 12329 | 4.9% |
8 | 12072 | 4.8% |
Other values (11) | 88790 | 35.5% |
대분류
Text
Distinct | 90 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
의자 | 2128 | 21.3% |
텔레비전 | 902 | 9.0% |
공기청정기및가습기 | 687 | 6.9% |
에어컨및온풍기 | 627 | 6.3% |
소파 | 523 | 5.2% |
청소기 | 484 | 4.8% |
상 | 392 | 3.9% |
실내조명등기구 | 365 | 3.6% |
시계 | 339 | 3.4% |
가방 | 285 | 2.9% |
Other values (80) | 3268 | 32.7% |
Most occurring characters
Value | Count | Frequency (%) |
기 | 4549 | 11.4% |
자 | 2412 | 6.0% |
의 | 2209 | 5.5% |
장 | 1876 | 4.7% |
전 | 1343 | 3.4% |
및 | 1316 | 3.3% |
청 | 1171 | 2.9% |
소 | 1127 | 2.8% |
가 | 1010 | 2.5% |
레 | 995 | 2.5% |
Other values (146) | 22061 | 55.1% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 38085 | 95.0% |
Close Punctuation | 597 | 1.5% |
Open Punctuation | 597 | 1.5% |
Other Punctuation | 510 | 1.3% |
Uppercase Letter | 280 | 0.7% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 4549 | 11.9% |
자 | 2412 | 6.3% |
의 | 2209 | 5.8% |
장 | 1876 | 4.9% |
전 | 1343 | 3.5% |
및 | 1316 | 3.5% |
청 | 1171 | 3.1% |
소 | 1127 | 3.0% |
가 | 1010 | 2.7% |
레 | 995 | 2.6% |
Other values (139) | 20077 | 52.7% |
Uppercase Letter
Value | Count | Frequency (%) |
V | 135 | 48.2% |
T | 125 | 44.6% |
P | 10 | 3.6% |
C | 10 | 3.6% |
Close Punctuation
Value | Count | Frequency (%) |
) | 597 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 597 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 510 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 38085 | 95.0% |
Common | 1704 | 4.3% |
Latin | 280 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 4549 | 11.9% |
자 | 2412 | 6.3% |
의 | 2209 | 5.8% |
장 | 1876 | 4.9% |
전 | 1343 | 3.5% |
및 | 1316 | 3.5% |
청 | 1171 | 3.1% |
소 | 1127 | 3.0% |
가 | 1010 | 2.7% |
레 | 995 | 2.6% |
Other values (139) | 20077 | 52.7% |
Latin
Value | Count | Frequency (%) |
V | 135 | 48.2% |
T | 125 | 44.6% |
P | 10 | 3.6% |
C | 10 | 3.6% |
Common
Value | Count | Frequency (%) |
) | 597 | 35.0% |
( | 597 | 35.0% |
, | 510 | 29.9% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 38085 | 95.0% |
ASCII | 1984 | 5.0% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
기 | 4549 | 11.9% |
자 | 2412 | 6.3% |
의 | 2209 | 5.8% |
장 | 1876 | 4.9% |
전 | 1343 | 3.5% |
및 | 1316 | 3.5% |
청 | 1171 | 3.1% |
소 | 1127 | 3.0% |
가 | 1010 | 2.7% |
레 | 995 | 2.6% |
Other values (139) | 20077 | 52.7% |
ASCII
Value | Count | Frequency (%) |
) | 597 | 30.1% |
( | 597 | 30.1% |
, | 510 | 25.7% |
V | 135 | 6.8% |
T | 125 | 6.3% |
P | 10 | 0.5% |
C | 10 | 0.5% |
소분류
Text
Distinct | 159 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 31 |
---|---|
Median length | 21 |
Mean length | 11.0185 |
Min length | 6 |
Characters and Unicode
Total characters | 110185 |
---|---|
Distinct characters | 235 |
Distinct categories | 10 |
Distinct scripts | 3 |
Distinct blocks | 5 |
Unique
Unique | 10 |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 청소기_가정용(모든규격) |
---|---|
2nd row | 냉장고_500ℓ이상 |
3rd row | 상_4인용미만 |
4th row | 청소기_가정용(모든규격) |
5th row | 공기청정기및가습기_높이1m미만 |
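In the sample rows, each 소분류 value looks like the 대분류 label followed by `_` and a size or type specification. The sketch below splits on the first underscore. This rests on an assumption read off the report (10000 underscores across 10000 rows in 소분류, and no underscore among the 대분류 characters), and `split_subcategory` is a hypothetical helper, not something the dataset provides.

```python
# 소분류 sample values from the rows above.
sample = [
    "청소기_가정용(모든규격)",
    "냉장고_500ℓ이상",
    "상_4인용미만",
    "공기청정기및가습기_높이1m미만",
]


def split_subcategory(value: str) -> tuple[str, str]:
    """Split a 소분류 value into (major category, specification)."""
    major, sep, spec = value.partition("_")
    if not sep:
        # Assumption violated: no underscore separator in this value.
        raise ValueError(f"no '_' separator in {value!r}")
    return major, spec


pairs = [split_subcategory(v) for v in sample]
for major, spec in pairs:
    print(major, "|", spec)
```

A check like this can flag rows whose 소분류 prefix disagrees with the 대분류 column before the labels are used for training.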
Value | Count | Frequency (%) |
의자_편의용(안락,흔들,식탁) | 843 | 8.4% |
의자_사무용 | 785 | 7.8% |
텔레비전_30인치이상 | 779 | 7.8% |
공기청정기및가습기_높이1m미만 | 548 | 5.5% |
의자_보조,간이 | 500 | 5.0% |
청소기_가정용(모든규격) | 407 | 4.1% |
시계_벽걸이용 | 337 | 3.4% |
에어컨및온풍기_1.0㎡이상 | 300 | 3.0% |
상_4인용미만 | 282 | 2.8% |
소파_3인용이상 | 272 | 2.7% |
Other values (149) | 4947 | 49.5% |
Most occurring characters
Value | Count | Frequency (%) |
_ | 10000 | 9.1% |
기 | 4751 | 4.3% |
이 | 4475 | 4.1% |
용 | 4062 | 3.7% |
의 | 3052 | 2.8% |
상 | 2989 | 2.7% |
, | 2785 | 2.5% |
0 | 2598 | 2.4% |
자 | 2466 | 2.2% |
) | 2457 | 2.2% |
Other values (225) | 70550 | 64.0% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 80829 | 73.4% |
Connector Punctuation | 10000 | 9.1% |
Decimal Number | 7805 | 7.1% |
Other Punctuation | 3557 | 3.2% |
Close Punctuation | 2457 | 2.2% |
Open Punctuation | 2457 | 2.2% |
Other Symbol | 1506 | 1.4% |
Lowercase Letter | 1263 | 1.1% |
Uppercase Letter | 280 | 0.3% |
Math Symbol | 31 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 4751 | 5.9% |
이 | 4475 | 5.5% |
용 | 4062 | 5.0% |
의 | 3052 | 3.8% |
상 | 2989 | 3.7% |
자 | 2466 | 3.1% |
인 | 2230 | 2.8% |
장 | 2212 | 2.7% |
가 | 2030 | 2.5% |
미 | 1919 | 2.4% |
Other values (196) | 50643 | 62.7% |
Decimal Number
Value | Count | Frequency (%) |
0 | 2598 | 33.3% |
1 | 1930 | 24.7% |
3 | 1478 | 18.9% |
4 | 593 | 7.6% |
9 | 403 | 5.2% |
5 | 337 | 4.3% |
2 | 275 | 3.5% |
8 | 115 | 1.5% |
7 | 76 | 1.0% |
Other Symbol
Value | Count | Frequency (%) |
㎡ | 761 | 50.5% |
㎝ | 498 | 33.1% |
㎏ | 235 | 15.6% |
㎜ | 10 | 0.7% |
㎥ | 2 | 0.1% |
Uppercase Letter
Value | Count | Frequency (%) |
V | 135 | 48.2% |
T | 125 | 44.6% |
C | 10 | 3.6% |
P | 10 | 3.6% |
Other Punctuation
Value | Count | Frequency (%) |
, | 2785 | 78.3% |
. | 762 | 21.4% |
· | 10 | 0.3% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 957 | 75.8% |
ℓ | 252 | 20.0% |
c | 54 | 4.3% |
Math Symbol
Value | Count | Frequency (%) |
+ | 21 | 67.7% |
× | 10 | 32.3% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 10000 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 2457 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 2457 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 80829 | 73.4% |
Common | 28065 | 25.5% |
Latin | 1291 | 1.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 4751 | 5.9% |
이 | 4475 | 5.5% |
용 | 4062 | 5.0% |
의 | 3052 | 3.8% |
상 | 2989 | 3.7% |
자 | 2466 | 3.1% |
인 | 2230 | 2.8% |
장 | 2212 | 2.7% |
가 | 2030 | 2.5% |
미 | 1919 | 2.4% |
Other values (196) | 50643 | 62.7% |
Common
Value | Count | Frequency (%) |
_ | 10000 | 35.6% |
, | 2785 | 9.9% |
0 | 2598 | 9.3% |
) | 2457 | 8.8% |
( | 2457 | 8.8% |
1 | 1930 | 6.9% |
3 | 1478 | 5.3% |
. | 762 | 2.7% |
㎡ | 761 | 2.7% |
4 | 593 | 2.1% |
Other values (13) | 2244 | 8.0% |
Latin
Value | Count | Frequency (%) |
m | 957 | 74.1% |
V | 135 | 10.5% |
T | 125 | 9.7% |
c | 54 | 4.2% |
C | 10 | 0.8% |
P | 10 | 0.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 80829 | 73.4% |
ASCII | 27578 | 25.0% |
CJK Compat | 1506 | 1.4% |
Letterlike Symbols | 252 | 0.2% |
None | 20 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 10000 | 36.3% |
, | 2785 | 10.1% |
0 | 2598 | 9.4% |
) | 2457 | 8.9% |
( | 2457 | 8.9% |
1 | 1930 | 7.0% |
3 | 1478 | 5.4% |
m | 957 | 3.5% |
. | 762 | 2.8% |
4 | 593 | 2.2% |
Other values (11) | 1561 | 5.7% |
Hangul
Value | Count | Frequency (%) |
기 | 4751 | 5.9% |
이 | 4475 | 5.5% |
용 | 4062 | 5.0% |
의 | 3052 | 3.8% |
상 | 2989 | 3.7% |
자 | 2466 | 3.1% |
인 | 2230 | 2.8% |
장 | 2212 | 2.7% |
가 | 2030 | 2.5% |
미 | 1919 | 2.4% |
Other values (196) | 50643 | 62.7% |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 761 | 50.5% |
㎝ | 498 | 33.1% |
㎏ | 235 | 15.6% |
㎜ | 10 | 0.7% |
㎥ | 2 | 0.1% |
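The 소분류 labels mix single-character unit symbols (the CJK Compat ㎝, ㎡, ㎏ and the Letterlike ℓ) with plain ASCII spellings such as the `cm` in 온장고_높이50cm미만. If the labels are to be matched or searched as text, one option is Unicode NFKC normalization, which folds compatibility characters into their plain equivalents. This is a sketch of that option, not something the report itself applies; the code points are written as escapes so the exact characters are unambiguous.

```python
import unicodedata

# Label strings as they appear in the report.
values = [
    "수족관_가로90\u339d이상",         # U+339D SQUARE CM
    "온장고_높이50cm미만",             # already ASCII "cm"
    "냉장고_500\u2113이상",            # U+2113 SCRIPT SMALL L
    "에어컨및온풍기_1.0\u33a1이상",    # U+33A1 SQUARE M SQUARED
]

# NFKC replaces compatibility characters with their plain equivalents
# and keeps precomposed Hangul syllables intact.
normalized = [unicodedata.normalize("NFKC", v) for v in values]

for before, after in zip(values, normalized):
    print(before, "->", after)
```

The folding loses information: ㎡ becomes `m2` without the superscript, and ℓ becomes a plain `l`. It therefore suits matching and deduplication better than display.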
Letterlike Symbols
Value | Count | Frequency (%) |
ℓ | 252 | 100.0% |
None
Value | Count | Frequency (%) |
· | 10 | 50.0% |
× | 10 | 50.0% |
등급구분
Categorical
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | C |
---|---|
2nd row | A |
3rd row | B |
4th row | C |
5th row | B |
Common Values
Value | Count | Frequency (%) |
A | 4026 | 40.3% |
B | 3008 | 30.1% |
C | 1971 | 19.7% |
D | 995 | 10.0% |
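Because this is training data, the grade counts above are worth reading as a class distribution. A minimal sketch that derives each grade's share and the ratio between the largest and smallest class from the listed counts (the imbalance ratio is an illustrative summary, not a figure from the report):

```python
# Counts copied from the "Common Values" table above.
counts = {"A": 4026, "B": 3008, "C": 1971, "D": 995}

total = sum(counts.values())
shares = {grade: count / total for grade, count in counts.items()}

# Ratio between the most and least frequent grade.
imbalance = max(counts.values()) / min(counts.values())

for grade, share in shares.items():
    print(grade, round(share, 4))
print("imbalance ratio:", round(imbalance, 2))
```

Grade A is roughly four times as frequent as grade D, which may matter when splitting the data or weighting classes.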
Correlations
Variable | 대분류 | 등급구분 |
---|---|---|
대분류 | 1.000 | 0.981 |
등급구분 | 0.981 | 1.000 |
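The report does not say which association measure produced the 0.981 between 대분류 and 등급구분. For two categorical columns a common choice is Cramér's V, so the sketch below assumes that measure and computes the plain, uncorrected version on the ten (대분류, 등급구분) pairs from the first sample table. `cramers_v` is an illustrative helper, and a tool may apply a bias correction that this sketch omits.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long label sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)

    # Pearson chi-squared statistic over the full contingency table.
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0


# (대분류, 등급구분) pairs from the first ten sample rows of the report.
rows = [
    ("청소기", "C"), ("냉장고", "A"), ("상", "B"), ("청소기", "C"),
    ("공기청정기및가습기", "B"), ("의자", "A"), ("청소기", "C"),
    ("자전거", "C"), ("전기밥솥", "D"), ("수족관", "C"),
]
majors = [major for major, _ in rows]
grades = [grade for _, grade in rows]

v = cramers_v(majors, grades)
print(round(v, 3))
```

In these ten rows each 대분류 maps to a single grade, so the sketch returns a value of essentially 1. The 0.981 for the full 10000 rows suggests the grade is almost, but not entirely, determined by the major category.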
First rows
Index | 파일명 | 대분류 | 소분류 | 등급구분 |
---|---|---|---|---|
21502 | 33980545_b4b664be76_3.jpg | 청소기 | 청소기_가정용(모든규격) | C |
3961 | 33981417_8d3ceec2d9_1.jpg | 냉장고 | 냉장고_500ℓ이상 | A |
12111 | 33987212_e3e8135854_2.jpg | 상 | 상_4인용미만 | B |
24900 | 33983923_62836e2906_1.jpg | 청소기 | 청소기_가정용(모든규격) | C |
14372 | 33983829_b0f05d4615_3.jpg | 공기청정기및가습기 | 공기청정기및가습기_높이1m미만 | B |
2325 | 34009577_57b153a361_2.jpg | 의자 | 의자_사무용 | A |
21627 | 33980211_043aee3dd9_3.jpg | 청소기 | 청소기_가정용(모든규격) | C |
26202 | 33979785_b304ead8c7_1.jpg | 자전거 | 자전거_성인용 | C |
28018 | 33971403_d3e55c1525_1.jpg | 전기밥솥 | 전기밥솥_모든규격 | D |
21408 | 33989366_c8a4015079_3.jpg | 수족관 | 수족관_가로90㎝이상 | C |
Last rows
Index | 파일명 | 대분류 | 소분류 | 등급구분 |
---|---|---|---|---|
11548 | 35567368_33497162c8_1.jpg | 냉장고 | 냉장고_500ℓ이상 | A |
21731 | 33989179_62ab54fa42_2.jpg | 청소기 | 청소기_가정용(모든규격) | C |
25414 | 34003792_50d99af3ea_3.jpg | 온장고 | 온장고_높이50cm미만 | C |
7305 | 35538970_6538b18daf_1.jpg | 의자 | 의자_편의용(안락,흔들,식탁) | A |
20503 | 33981682_64bb7b2ea4_3.jpg | 상 | 상_4인용미만 | B |
28437 | 33977102_9caa3283f1_5.jpg | 쌀통 | 쌀통_모든규격 | D |
15120 | 34002698_7875a8212c_1.jpg | 에어컨및온풍기 | 에어컨및온풍기_0.5㎡미만 | B |
8443 | 35546946_feb337d611_2.jpg | 텔레비전 | 텔레비전_30인치이상 | A |
21919 | 33990508_8a79efb1ed_3.jpg | 청소기 | 청소기_가정용(모든규격) | C |
7958 | 35553648_3f5aae1918_3.jpg | 진열장(장식장,책장,찬장) | 진열장(장식장,책장,찬장)_가로90㎝미만 | A |