Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 390.6 KiB |
Average record size in memory | 40.0 B |
Variable types
Text | 3 |
---|---|
Categorical | 1 |
Dataset
Description | 김해시 AI기반 대형생활폐기물 학습데이터를 통해 빅데이터를 활용하여 정책결정, 업무개선의 기반 마련 |
---|---|
Author | 경상남도 김해시 |
URL | https://www.data.go.kr/data/15076741/fileData.do |
파일명 has unique values | Unique |
Reproduction
Analysis started | 2024-03-11 03:35:06.954007 |
---|---|
Analysis finished | 2024-03-11 03:35:08.541655 |
Duration | 1.59 second |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
파일명
Text
UNIQUE
 
Distinct | 10000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 26 |
---|---|
Median length | 25 |
Mean length | 25.0035 |
Min length | 25 |
Characters and Unicode
Total characters | 250035 |
---|---|
Distinct characters | 21 |
Distinct categories | 4 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 10000 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 35554487_7592616ad1_1.jpg |
---|---|
2nd row | 35571127_75d0e94a97_1.jpg |
3rd row | 33984936_92d1335a26_6.jpg |
4th row | 33977908_79062b7092_3.jpg |
5th row | 34004978_1773280082_1.jpg |
Value | Count | Frequency (%) |
35554487_7592616ad1_1.jpg | 1 | < 0.1% |
35539984_0d2514035c_3.jpg | 1 | < 0.1% |
33978840_27c1c3df01_3.jpg | 1 | < 0.1% |
35539213_43dd4ea640_2.jpg | 1 | < 0.1% |
33999510_526fbcf087_5.jpg | 1 | < 0.1% |
35566839_e75edc18a4_3.jpg | 1 | < 0.1% |
35541448_aaa1bb097d_1.jpg | 1 | < 0.1% |
33999417_a198cfea2a_1.jpg | 1 | < 0.1% |
33977384_9c2737bf7b_2.jpg | 1 | < 0.1% |
34014339_547948cc06_3.jpg | 1 | < 0.1% |
Other values (9990) | 9990 |
Most occurring characters
Value | Count | Frequency (%) |
3 | 28791 | 11.5% |
_ | 20000 | 8.0% |
5 | 18151 | 7.3% |
9 | 16899 | 6.8% |
4 | 13811 | 5.5% |
1 | 13562 | 5.4% |
2 | 12869 | 5.1% |
7 | 12598 | 5.0% |
8 | 12159 | 4.9% |
0 | 12089 | 4.8% |
Other values (11) | 89106 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 152259 | |
Lowercase Letter | 67776 | |
Connector Punctuation | 20000 | 8.0% |
Other Punctuation | 10000 | 4.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 28791 | |
5 | 18151 | |
9 | 16899 | |
4 | 13811 | |
1 | 13562 | |
2 | 12869 | |
7 | 12598 | |
8 | 12159 | |
0 | 12089 | |
6 | 11330 | 7.4% |
Lowercase Letter
Value | Count | Frequency (%) |
j | 10000 | |
p | 10000 | |
g | 10000 | |
c | 6448 | |
f | 6348 | |
a | 6314 | |
b | 6304 | |
e | 6261 | |
d | 6101 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 20000 |
Other Punctuation
Value | Count | Frequency (%) |
. | 10000 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 182259 | |
Latin | 67776 | 27.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
3 | 28791 | |
_ | 20000 | |
5 | 18151 | |
9 | 16899 | |
4 | 13811 | |
1 | 13562 | |
2 | 12869 | |
7 | 12598 | |
8 | 12159 | |
0 | 12089 | |
Other values (2) | 21330 |
Latin
Value | Count | Frequency (%) |
j | 10000 | |
p | 10000 | |
g | 10000 | |
c | 6448 | |
f | 6348 | |
a | 6314 | |
b | 6304 | |
e | 6261 | |
d | 6101 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 250035 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3 | 28791 | 11.5% |
_ | 20000 | 8.0% |
5 | 18151 | 7.3% |
9 | 16899 | 6.8% |
4 | 13811 | 5.5% |
1 | 13562 | 5.4% |
2 | 12869 | 5.1% |
7 | 12598 | 5.0% |
8 | 12159 | 4.9% |
0 | 12089 | 4.8% |
Other values (11) | 89106 |
대분류
Text
Distinct | 90 |
---|---|
Distinct (%) | 0.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Value | Count | Frequency (%) |
의자 | 2161 | |
텔레비전 | 937 | 9.4% |
공기청정기및가습기 | 703 | 7.0% |
에어컨및온풍기 | 612 | 6.1% |
청소기 | 516 | 5.2% |
소파 | 500 | 5.0% |
상 | 426 | 4.3% |
실내조명등기구 | 385 | 3.9% |
시계 | 322 | 3.2% |
가방 | 298 | 3.0% |
Other values (80) | 3140 |
Most occurring characters
Value | Count | Frequency (%) |
기 | 4613 | 11.6% |
자 | 2415 | 6.1% |
의 | 2233 | 5.6% |
장 | 1753 | 4.4% |
전 | 1363 | 3.4% |
및 | 1318 | 3.3% |
청 | 1219 | 3.1% |
소 | 1144 | 2.9% |
가 | 1036 | 2.6% |
레 | 1024 | 2.6% |
Other values (145) | 21563 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 37870 | |
Open Punctuation | 536 | 1.4% |
Close Punctuation | 536 | 1.4% |
Other Punctuation | 468 | 1.2% |
Uppercase Letter | 271 | 0.7% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 4613 | 12.2% |
자 | 2415 | 6.4% |
의 | 2233 | 5.9% |
장 | 1753 | 4.6% |
전 | 1363 | 3.6% |
및 | 1318 | 3.5% |
청 | 1219 | 3.2% |
소 | 1144 | 3.0% |
가 | 1036 | 2.7% |
레 | 1024 | 2.7% |
Other values (138) | 19752 |
Uppercase Letter
Value | Count | Frequency (%) |
V | 132 | |
T | 125 | |
C | 7 | 2.6% |
P | 7 | 2.6% |
Open Punctuation
Value | Count | Frequency (%) |
( | 536 |
Close Punctuation
Value | Count | Frequency (%) |
) | 536 |
Other Punctuation
Value | Count | Frequency (%) |
, | 468 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 37870 | |
Common | 1540 | 3.9% |
Latin | 271 | 0.7% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 4613 | 12.2% |
자 | 2415 | 6.4% |
의 | 2233 | 5.9% |
장 | 1753 | 4.6% |
전 | 1363 | 3.6% |
및 | 1318 | 3.5% |
청 | 1219 | 3.2% |
소 | 1144 | 3.0% |
가 | 1036 | 2.7% |
레 | 1024 | 2.7% |
Other values (138) | 19752 |
Latin
Value | Count | Frequency (%) |
V | 132 | |
T | 125 | |
C | 7 | 2.6% |
P | 7 | 2.6% |
Common
Value | Count | Frequency (%) |
( | 536 | |
) | 536 | |
, | 468 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 37870 | |
ASCII | 1811 | 4.6% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
기 | 4613 | 12.2% |
자 | 2415 | 6.4% |
의 | 2233 | 5.9% |
장 | 1753 | 4.6% |
전 | 1363 | 3.6% |
및 | 1318 | 3.5% |
청 | 1219 | 3.2% |
소 | 1144 | 3.0% |
가 | 1036 | 2.7% |
레 | 1024 | 2.7% |
Other values (138) | 19752 |
ASCII
Value | Count | Frequency (%) |
( | 536 | |
) | 536 | |
, | 468 | |
V | 132 | 7.3% |
T | 125 | 6.9% |
C | 7 | 0.4% |
P | 7 | 0.4% |
소분류
Text
Distinct | 161 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 31 |
---|---|
Median length | 21 |
Mean length | 11.0357 |
Min length | 6 |
Characters and Unicode
Total characters | 110357 |
---|---|
Distinct characters | 235 |
Distinct categories | 10 ? |
Distinct scripts | 3 ? |
Distinct blocks | 5 ? |
Unique
Unique | 10 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | 텔레비전_30인치이상 |
---|---|
2nd row | 의자_사무용 |
3rd row | 항아리_7리터이상 |
4th row | 항아리_7리터이상 |
5th row | 자전거_성인용 |
Value | Count | Frequency (%) |
의자_편의용(안락,흔들,식탁 | 900 | 9.0% |
텔레비전_30인치이상 | 822 | 8.2% |
의자_사무용 | 779 | 7.8% |
공기청정기및가습기_높이1m미만 | 567 | 5.7% |
의자_보조,간이 | 482 | 4.8% |
청소기_가정용(모든규격 | 431 | 4.3% |
시계_벽걸이용 | 320 | 3.2% |
상_4인용미만 | 300 | 3.0% |
에어컨및온풍기_1.0㎡이상 | 278 | 2.8% |
소파_3인용이상 | 261 | 2.6% |
Other values (151) | 4860 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 10000 | 9.1% |
기 | 4820 | 4.4% |
이 | 4401 | 4.0% |
용 | 4140 | 3.8% |
의 | 3133 | 2.8% |
상 | 3023 | 2.7% |
, | 2839 | 2.6% |
0 | 2524 | 2.3% |
) | 2488 | 2.3% |
( | 2488 | 2.3% |
Other values (225) | 70501 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 81029 | |
Connector Punctuation | 10000 | 9.1% |
Decimal Number | 7743 | 7.0% |
Other Punctuation | 3610 | 3.3% |
Close Punctuation | 2488 | 2.3% |
Open Punctuation | 2488 | 2.3% |
Other Symbol | 1469 | 1.3% |
Lowercase Letter | 1224 | 1.1% |
Uppercase Letter | 271 | 0.2% |
Math Symbol | 35 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
기 | 4820 | 5.9% |
이 | 4401 | 5.4% |
용 | 4140 | 5.1% |
의 | 3133 | 3.9% |
상 | 3023 | 3.7% |
자 | 2467 | 3.0% |
인 | 2266 | 2.8% |
장 | 2096 | 2.6% |
가 | 2075 | 2.6% |
미 | 1908 | 2.4% |
Other values (196) | 50700 |
Decimal Number
Value | Count | Frequency (%) |
0 | 2524 | |
1 | 1908 | |
3 | 1504 | |
4 | 622 | 8.0% |
9 | 383 | 4.9% |
5 | 333 | 4.3% |
2 | 292 | 3.8% |
8 | 112 | 1.4% |
7 | 65 | 0.8% |
Other Symbol
Value | Count | Frequency (%) |
㎡ | 728 | |
㎝ | 491 | |
㎏ | 240 | 16.3% |
㎜ | 7 | 0.5% |
㎥ | 3 | 0.2% |
Uppercase Letter
Value | Count | Frequency (%) |
V | 132 | |
T | 125 | |
P | 7 | 2.6% |
C | 7 | 2.6% |
Other Punctuation
Value | Count | Frequency (%) |
, | 2839 | |
. | 762 | 21.1% |
· | 9 | 0.2% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 960 | |
ℓ | 212 | 17.3% |
c | 52 | 4.2% |
Math Symbol
Value | Count | Frequency (%) |
+ | 28 | |
× | 7 | 20.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 10000 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2488 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2488 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 81029 | |
Common | 28045 | 25.4% |
Latin | 1283 | 1.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
기 | 4820 | 5.9% |
이 | 4401 | 5.4% |
용 | 4140 | 5.1% |
의 | 3133 | 3.9% |
상 | 3023 | 3.7% |
자 | 2467 | 3.0% |
인 | 2266 | 2.8% |
장 | 2096 | 2.6% |
가 | 2075 | 2.6% |
미 | 1908 | 2.4% |
Other values (196) | 50700 |
Common
Value | Count | Frequency (%) |
_ | 10000 | |
, | 2839 | 10.1% |
0 | 2524 | 9.0% |
) | 2488 | 8.9% |
( | 2488 | 8.9% |
1 | 1908 | 6.8% |
3 | 1504 | 5.4% |
. | 762 | 2.7% |
㎡ | 728 | 2.6% |
4 | 622 | 2.2% |
Other values (13) | 2182 | 7.8% |
Latin
Value | Count | Frequency (%) |
m | 960 | |
V | 132 | 10.3% |
T | 125 | 9.7% |
c | 52 | 4.1% |
P | 7 | 0.5% |
C | 7 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 81029 | |
ASCII | 27631 | 25.0% |
CJK Compat | 1469 | 1.3% |
Letterlike Symbols | 212 | 0.2% |
None | 16 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
_ | 10000 | |
, | 2839 | 10.3% |
0 | 2524 | 9.1% |
) | 2488 | 9.0% |
( | 2488 | 9.0% |
1 | 1908 | 6.9% |
3 | 1504 | 5.4% |
m | 960 | 3.5% |
. | 762 | 2.8% |
4 | 622 | 2.3% |
Other values (11) | 1536 | 5.6% |
Hangul
Value | Count | Frequency (%) |
기 | 4820 | 5.9% |
이 | 4401 | 5.4% |
용 | 4140 | 5.1% |
의 | 3133 | 3.9% |
상 | 3023 | 3.7% |
자 | 2467 | 3.0% |
인 | 2266 | 2.8% |
장 | 2096 | 2.6% |
가 | 2075 | 2.6% |
미 | 1908 | 2.4% |
Other values (196) | 50700 |
CJK Compat
Value | Count | Frequency (%) |
㎡ | 728 | |
㎝ | 491 | |
㎏ | 240 | 16.3% |
㎜ | 7 | 0.5% |
㎥ | 3 | 0.2% |
Letterlike Symbols
Value | Count | Frequency (%) |
ℓ | 212 |
None
Value | Count | Frequency (%) |
· | 9 | |
× | 7 |
등급구분
Categorical
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
A | |
---|---|
B | |
C | |
D |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | A |
---|---|
2nd row | A |
3rd row | C |
4th row | C |
5th row | C |
Common Values
Value | Count | Frequency (%) |
A | 4046 | |
B | 3040 | |
C | 1960 | |
D | 954 | 9.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
a | 4046 | |
b | 3040 | |
c | 1960 | |
d | 954 | 9.5% |
대분류 | 등급구분 | |
---|---|---|
대분류 | 1.000 | 0.981 |
등급구분 | 0.981 | 1.000 |
파일명 | 대분류 | 소분류 | 등급구분 | |
---|---|---|---|---|
11739 | 35554487_7592616ad1_1.jpg | 텔레비전 | 텔레비전_30인치이상 | A |
7812 | 35571127_75d0e94a97_1.jpg | 의자 | 의자_사무용 | A |
26165 | 33984936_92d1335a26_6.jpg | 항아리 | 항아리_7리터이상 | C |
26417 | 33977908_79062b7092_3.jpg | 항아리 | 항아리_7리터이상 | C |
25820 | 34004978_1773280082_1.jpg | 자전거 | 자전거_성인용 | C |
26173 | 33976461_6ac59238cb_2.jpg | 청소기 | 청소기_가정용(모든규격) | C |
22722 | 33993808_e6e45f5f3a_6.jpg | 의료기 | 의료기_일반 | C |
2066 | 35563876_9bf6d4d7e5_5.jpg | 에어컨및온풍기 | 에어컨및온풍기_1.0㎡이상 | A |
10367 | 33975548_1c8ecc405e_3.jpg | 의자 | 의자_사무용 | A |
27910 | 33969017_cbbbb83434_2.jpg | 소화기 | 소화기_3.5㎏이하(약제기준) | D |
파일명 | 대분류 | 소분류 | 등급구분 | |
---|---|---|---|---|
659 | 35549193_27060a28e0_3.jpg | 침대 | 침대_2인용(일반) | A |
328 | 34005481_427f0650f3_1.jpg | 에어컨및온풍기 | 에어컨및온풍기_1.0㎡이상 | A |
1621 | 33994617_b5c3920d7d_1.jpg | 의자 | 의자_편의용(안락,흔들,식탁) | A |
11761 | 33970323_b97faac8eb_1.jpg | 의자 | 의자_사무용 | A |
3570 | 33986902_c2f03d733b_2.jpg | 의자 | 의자_편의용(안락,흔들,식탁) | A |
26972 | 33992801_ea05e4b3fb_6.jpg | 청소기 | 청소기_가정용(모든규격) | C |
25130 | 33970819_c45416e2fd_3.jpg | 시계 | 시계_벽걸이용 | C |
5066 | 33968831_431edcd41d_3.jpg | 의자 | 의자_편의용(안락,흔들,식탁) | A |
7100 | 35554734_f77853c7c2_5.jpg | 의자 | 의자_사무용 | A |
13422 | 33982911_88df1a021a_2.jpg | 피아노 | 피아노_어프라이트 | B |