Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 10000 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 140 |
Duplicate rows (%) | 1.4% |
Total size in memory | 312.5 KiB |
Average record size in memory | 32.0 B |
Variable types
Categorical | 1 |
---|---|
Text | 2 |
Dataset
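Description | The Gangnam-gu illegal dumping enforcement records provide, for each recorded case of illegal waste dumping, the type of waste involved (위반쓰레기), the location of the violation (위반장소), and the date and time of the violation (위반일시). |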
---|---|
Author | Gangnam-gu, Seoul (서울특별시 강남구) |
URL | https://www.data.go.kr/data/15127416/fileData.do |
Dataset has 140 (1.4%) duplicate rows | Duplicates |
Reproduction
Analysis started | 2024-04-21 11:27:29.031459 |
---|---|
Analysis finished | 2024-04-21 11:27:29.965013 |
Duration | 0.93 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
위반쓰레기 (type of waste)
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 담배꽁초 |
---|---|
2nd row | 담배꽁초 |
3rd row | 담배꽁초 |
4th row | 담배꽁초 |
5th row | 담배꽁초 |
Common Values
Value | Count | Frequency (%) |
담배꽁초 (cigarette butts) | 7520 | 75.2% |
혼합배출 (mixed disposal) | 2480 | 24.8% |
위반장소 (violation location)
Text
Distinct | 3694 |
---|---|
Distinct (%) | 36.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 26 |
---|---|
Median length | 23 |
Mean length | 12.6727 |
Min length | 3 |
Characters and Unicode
Total characters | 126727 |
---|---|
Distinct characters | 389 |
Distinct categories | 10 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 2650 |
---|---|
Unique (%) | 26.5% |
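The difference between the Distinct and Unique rows can be illustrated with a small sketch. It assumes that Distinct counts the different values present and Unique counts the values that occur exactly once, which is consistent with the figures above but is an interpretation, not something the report states. The helper name `distinct_and_unique` and the short sample list are hypothetical.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (distinct, unique) under the assumed definitions.

    distinct: number of different values
    unique:   number of values that occur exactly once
    """
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Hypothetical sample built from location strings that appear in the report.
sample = [
    "강남역 1번출구",
    "강남역 1번출구",
    "삼성동 하동관옆",
    "대치동 농협",
    "역삼동 강남대로406",
    "역삼동 강남대로406",
]

distinct, unique = distinct_and_unique(sample)
print(f"Distinct: {distinct}, Unique: {unique}")
```

Under these definitions a column can have many distinct values and still no unique ones, as with 위반쓰레기 (Distinct 2, Unique 0).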
Sample
1st row | 강남역 1번출구 |
---|---|
2nd row | 삼성동 하동관옆 |
3rd row | 대치동 농협 |
4th row | 역삼동 테헤란로1길40 |
5th row | 삼성동 삼성로96길20 |
Words
Value | Count | Frequency (%) |
역삼동 | 3958 | 16.7% |
강남역 | 1246 | 5.3% |
대치동 | 1210 | 5.1% |
논현동 | 1201 | 5.1% |
삼성동 | 983 | 4.2% |
11번출구 | 712 | 3.0% |
강남대로406 | 591 | 2.5% |
1번출구 | 308 | 1.3% |
신사동 | 283 | 1.2% |
강남대로408 | 199 | 0.8% |
Other values (3672) | 12971 | 54.8% |
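The table above appears to count whitespace-separated tokens and not whole cell values (for example, 역삼동 and 11번출구 are counted separately). A minimal sketch of that kind of tally follows, assuming a plain split on whitespace; the profiling tool may tokenize differently, and `word_counts` and the sample list are hypothetical.

```python
from collections import Counter


def word_counts(values):
    """Count whitespace-separated tokens across all values."""
    counts = Counter()
    for value in values:
        counts.update(value.split())
    return counts


# Hypothetical sample built from location strings that appear in the report.
sample = [
    "강남역 1번출구",
    "강남역 11번출구",
    "역삼동 강남대로406",
    "역삼동 테헤란로205",
    "대치동 농협",
]

counts = word_counts(sample)
total = sum(counts.values())
for word, n in counts.most_common(3):
    # Frequency is relative to the total number of tokens, not rows.
    print(f"{word} | {n} | {n / total:.1%}")
```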
Most occurring characters
Value | Count | Frequency (%) |
(space) | 13751 | 10.9% |
동 | 8829 | 7.0% |
1 | 8137 | 6.4% |
로 | 6241 | 4.9% |
역 | 6035 | 4.8% |
삼 | 5641 | 4.5% |
길 | 4454 | 3.5% |
2 | 4093 | 3.2% |
4 | 3663 | 2.9% |
3 | 3116 | 2.5% |
Other values (379) | 62767 | 49.5% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 70313 | 55.5% |
Decimal Number | 34306 | 27.1% |
Space Separator | 13751 | 10.9% |
Dash Punctuation | 2730 | 2.2% |
Open Punctuation | 2479 | 2.0% |
Close Punctuation | 2476 | 2.0% |
Uppercase Letter | 637 | 0.5% |
Other Punctuation | 33 | < 0.1% |
Math Symbol | 1 | < 0.1% |
Connector Punctuation | 1 | < 0.1% |
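The category names in this table correspond to Unicode general categories. As a rough sketch of how such counts can be reproduced, Python's standard `unicodedata.category` returns a two-letter code per character (`Lo` for Other Letter, `Nd` for Decimal Number, `Zs` for Space Separator, `Pd`, `Ps`, `Pe` for dash, open, and close punctuation). The helper name `category_counts` is hypothetical, and the input is one location string taken from the sample rows.

```python
import unicodedata
from collections import Counter


def category_counts(text):
    """Count characters in `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)


# One location value from the report's sample rows.
value = "테헤란로53길 60-8 (역삼동 693-11)"

cats = category_counts(value)
for code, n in cats.most_common():
    print(f"{code} | {n} | {n / len(value):.1%}")
```

Hangul syllables fall under `Lo` (Other Letter), which is why that category dominates a column of Korean addresses.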
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 8829 | 12.6% |
로 | 6241 | 8.9% |
역 | 6035 | 8.6% |
삼 | 5641 | 8.0% |
길 | 4454 | 6.3% |
대 | 3087 | 4.4% |
남 | 2761 | 3.9% |
강 | 2749 | 3.9% |
란 | 1886 | 2.7% |
헤 | 1886 | 2.7% |
Other values (337) | 26744 | 38.0% |
Uppercase Letter
Value | Count | Frequency (%) |
G | 162 | 25.4% |
C | 125 | 19.6% |
F | 92 | 14.4% |
T | 52 | 8.2% |
K | 51 | 8.0% |
S | 36 | 5.7% |
V | 25 | 3.9% |
A | 20 | 3.1% |
M | 11 | 1.7% |
U | 9 | 1.4% |
Other values (13) | 54 | 8.5% |
Decimal Number
Value | Count | Frequency (%) |
1 | 8137 | 23.7% |
2 | 4093 | 11.9% |
4 | 3663 | 10.7% |
3 | 3116 | 9.1% |
6 | 2980 | 8.7% |
0 | 2856 | 8.3% |
5 | 2606 | 7.6% |
8 | 2517 | 7.3% |
7 | 2361 | 6.9% |
9 | 1977 | 5.8% |
Other Punctuation
Value | Count | Frequency (%) |
& | 30 | 90.9% |
. | 2 | 6.1% |
: | 1 | 3.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 13751 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2730 | 100.0% |
Open Punctuation
Value | Count | Frequency (%) |
( | 2479 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 2476 | 100.0% |
Math Symbol
Value | Count | Frequency (%) |
+ | 1 | 100.0% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 70313 | 55.5% |
Common | 55777 | 44.0% |
Latin | 637 | 0.5% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 8829 | 12.6% |
로 | 6241 | 8.9% |
역 | 6035 | 8.6% |
삼 | 5641 | 8.0% |
길 | 4454 | 6.3% |
대 | 3087 | 4.4% |
남 | 2761 | 3.9% |
강 | 2749 | 3.9% |
란 | 1886 | 2.7% |
헤 | 1886 | 2.7% |
Other values (337) | 26744 | 38.0% |
Latin
Value | Count | Frequency (%) |
G | 162 | 25.4% |
C | 125 | 19.6% |
F | 92 | 14.4% |
T | 52 | 8.2% |
K | 51 | 8.0% |
S | 36 | 5.7% |
V | 25 | 3.9% |
A | 20 | 3.1% |
M | 11 | 1.7% |
U | 9 | 1.4% |
Other values (13) | 54 | 8.5% |
Common
Value | Count | Frequency (%) |
(space) | 13751 | 24.7% |
1 | 8137 | 14.6% |
2 | 4093 | 7.3% |
4 | 3663 | 6.6% |
3 | 3116 | 5.6% |
6 | 2980 | 5.3% |
0 | 2856 | 5.1% |
- | 2730 | 4.9% |
5 | 2606 | 4.7% |
8 | 2517 | 4.5% |
Other values (9) | 9328 | 16.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 70313 | 55.5% |
ASCII | 56414 | 44.5% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 13751 | 24.4% |
1 | 8137 | 14.4% |
2 | 4093 | 7.3% |
4 | 3663 | 6.5% |
3 | 3116 | 5.5% |
6 | 2980 | 5.3% |
0 | 2856 | 5.1% |
- | 2730 | 4.8% |
5 | 2606 | 4.6% |
8 | 2517 | 4.5% |
Other values (32) | 9965 | 17.7% |
Hangul
Value | Count | Frequency (%) |
동 | 8829 | 12.6% |
로 | 6241 | 8.9% |
역 | 6035 | 8.6% |
삼 | 5641 | 8.0% |
길 | 4454 | 6.3% |
대 | 3087 | 4.4% |
남 | 2761 | 3.9% |
강 | 2749 | 3.9% |
란 | 1886 | 2.7% |
헤 | 1886 | 2.7% |
Other values (337) | 26744 | 38.0% |
위반일시 (violation date and time)
Text
Distinct | 9058 |
---|---|
Distinct (%) | 90.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 156.2 KiB |
Length
Max length | 17 |
---|---|
Median length | 16 |
Mean length | 16.0019 |
Min length | 16 |
Characters and Unicode
Total characters | 160019 |
---|---|
Distinct characters | 16 |
Distinct categories | 6 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 8242 |
---|---|
Unique (%) | 82.4% |
Sample
1st row | 2022-04-21 11:06 |
---|---|
2nd row | 2022-06-27 09:13 |
3rd row | 2023-11-08 09:55 |
4th row | 2022-02-23 13:50 |
5th row | 2022-08-12 10:28 |
Words
Value | Count | Frequency (%) |
10:30 | 211 | 1.1% |
10:50 | 202 | 1.0% |
10:40 | 192 | 1.0% |
09:50 | 184 | 0.9% |
11:00 | 177 | 0.9% |
10:10 | 171 | 0.9% |
10:00 | 171 | 0.9% |
10:20 | 169 | 0.8% |
09:40 | 159 | 0.8% |
11:20 | 153 | 0.8% |
Other values (927) | 18210 | 91.1% |
Most occurring characters
Value | Count | Frequency (%) |
2 | 35455 | 22.2% |
0 | 32714 | 20.4% |
1 | 21277 | 13.3% |
- | 19998 | 12.5% |
: | 10000 | 6.2% |
(space) | 9999 | 6.2% |
3 | 9074 | 5.7% |
5 | 5377 | 3.4% |
4 | 4601 | 2.9% |
9 | 4211 | 2.6% |
Other values (6) | 7313 | 4.6% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 120019 | 75.0% |
Dash Punctuation | 19998 | 12.5% |
Other Punctuation | 10000 | 6.2% |
Space Separator | 9999 | 6.2% |
Uppercase Letter | 2 | < 0.1% |
Math Symbol | 1 | < 0.1% |
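The sample rows suggest a `YYYY-MM-DD HH:MM` pattern, yet the counts above include two uppercase letters and a `+`, and the maximum length is 17, not 16, so some values do not fit that pattern. A minimal sketch for separating conforming from non-conforming values follows, assuming the expected format is `%Y-%m-%d %H:%M`. The helper name `split_valid_invalid` is hypothetical, and the malformed strings are invented for illustration; they are not taken from the dataset.

```python
from datetime import datetime

# Assumed format, inferred from the sample rows in the report.
EXPECTED_FORMAT = "%Y-%m-%d %H:%M"


def split_valid_invalid(values, fmt=EXPECTED_FORMAT):
    """Parse each value with `fmt`; collect the ones that fail to parse."""
    valid, invalid = [], []
    for value in values:
        try:
            valid.append(datetime.strptime(value, fmt))
        except ValueError:
            invalid.append(value)
    return valid, invalid


sample = [
    "2022-04-21 11:06",   # from the report's sample rows
    "2022-06-27 09:13",   # from the report's sample rows
    "2022-04-21T11:06",   # invented: 'T' separator where a space is expected
    "2022-04-21 11:060",  # invented: 17 characters, one trailing digit too many
]

valid, invalid = split_valid_invalid(sample)
print(f"{len(valid)} parsed, {len(invalid)} rejected: {invalid}")
```

Listing the rejected values before converting the column to a datetime type makes it easier to decide whether to repair or drop them.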
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
2 | 35455 | 29.5% |
0 | 32714 | 27.3% |
1 | 21277 | 17.7% |
3 | 9074 | 7.6% |
5 | 5377 | 4.5% |
4 | 4601 | 3.8% |
9 | 4211 | 3.5% |
8 | 2589 | 2.2% |
7 | 2520 | 2.1% |
6 | 2201 | 1.8% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 1 | 50.0% |
X | 1 | 50.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 19998 | 100.0% |
Other Punctuation
Value | Count | Frequency (%) |
: | 10000 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 9999 | 100.0% |
Math Symbol
Value | Count | Frequency (%) |
+ | 1 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 160017 | > 99.9% |
Latin | 2 | < 0.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
2 | 35455 | 22.2% |
0 | 32714 | 20.4% |
1 | 21277 | 13.3% |
- | 19998 | 12.5% |
: | 10000 | 6.2% |
(space) | 9999 | 6.2% |
3 | 9074 | 5.7% |
5 | 5377 | 3.4% |
4 | 4601 | 2.9% |
9 | 4211 | 2.6% |
Other values (4) | 7311 | 4.6% |
Latin
Value | Count | Frequency (%) |
T | 1 | 50.0% |
X | 1 | 50.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 160019 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 35455 | 22.2% |
0 | 32714 | 20.4% |
1 | 21277 | 13.3% |
- | 19998 | 12.5% |
: | 10000 | 6.2% |
(space) | 9999 | 6.2% |
3 | 9074 | 5.7% |
5 | 5377 | 3.4% |
4 | 4601 | 2.9% |
9 | 4211 | 2.6% |
Other values (6) | 7313 | 4.6% |
Index | 위반쓰레기 | 위반장소 | 위반일시 |
---|---|---|---|
5051 | 담배꽁초 | 강남역 1번출구 | 2022-04-21 11:06 |
7829 | 담배꽁초 | 삼성동 하동관옆 | 2022-06-27 09:13 |
19815 | 담배꽁초 | 대치동 농협 | 2023-11-08 09:55 |
2455 | 담배꽁초 | 역삼동 테헤란로1길40 | 2022-02-23 13:50 |
9821 | 담배꽁초 | 삼성동 삼성로96길20 | 2022-08-12 10:28 |
7967 | 담배꽁초 | 강남역 1번출구 | 2022-07-01 13:10 |
3511 | 담배꽁초 | 역삼동 강남대로406 | 2022-03-22 10:15 |
11872 | 담배꽁초 | 역삼동 건강보험센터옆 | 2022-10-07 12:15 |
1839 | 담배꽁초 | 대치동 테헤란로78길8 | 2022-02-10 11:00 |
12732 | 담배꽁초 | 역삼동 테헤란로205 | 2022-10-27 10:43 |
Index | 위반쓰레기 | 위반장소 | 위반일시 |
---|---|---|---|
25647 | 혼합배출 | 삼성로71길 27-5 (대치동) | 2023-05-17 10:04 |
2087 | 담배꽁초 | 역삼동 테헤란로129 | 2022-02-15 10:40 |
2384 | 담배꽁초 | 강남역 1번출구 | 2022-02-21 12:44 |
17975 | 담배꽁초 | 삼성동 삼성역7번출구 | 2023-09-14 11:23 |
12955 | 담배꽁초 | 역삼동 테헤란로2길27 | 2022-11-01 13:18 |
21043 | 담배꽁초 | 논현동 선릉로129길5 | 2023-12-20 13:35 |
23521 | 혼합배출 | 테헤란로53길 60-8 (역삼동 693-11) | 2022-11-03 10:32 |
16486 | 담배꽁초 | 압구정동 CGV | 2023-08-03 10:40 |
15525 | 담배꽁초 | 역삼동 강남대로406 | 2023-01-06 12:10 |
8465 | 담배꽁초 | 역삼동 강남대로406 | 2022-07-13 12:12 |
Most frequently occurring
Index | 위반쓰레기 | 위반장소 | 위반일시 | # duplicates |
---|---|---|---|---|
16 | 담배꽁초 | 강남역 1번출구 | 2022-07-11 10:10 | 3 |
59 | 담배꽁초 | 수서역 | 2022-11-23 10:30 | 3 |
116 | 담배꽁초 | 역삼동 패스트파이브앞길 | 2022-08-31 11:10 | 3 |
0 | 담배꽁초 | 강남역 11번출구 | 2022-01-05 13:34 | 2 |
1 | 담배꽁초 | 강남역 11번출구 | 2022-01-05 13:50 | 2 |
2 | 담배꽁초 | 강남역 11번출구 | 2022-01-06 13:48 | 2 |
3 | 담배꽁초 | 강남역 11번출구 | 2022-01-13 13:20 | 2 |
4 | 담배꽁초 | 강남역 11번출구 | 2022-01-18 12:30 | 2 |
5 | 담배꽁초 | 강남역 11번출구 | 2022-02-03 13:05 | 2 |
6 | 담배꽁초 | 강남역 11번출구 | 2022-05-30 11:50 | 2 |
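A table like the one above can be approximated by counting identical rows and keeping those that occur more than once. The sketch below uses only the standard library; `duplicate_groups` is a hypothetical helper, and the row list is a small illustrative sample in which the first two groups mirror entries from the table and the final row is a non-duplicate from the sample rows. How the profiling tool itself arrives at the figure of 140 duplicate rows is not stated in the report, so this is not a description of its method.

```python
from collections import Counter


def duplicate_groups(rows):
    """Return (row, count) pairs for rows occurring more than once,
    ordered by descending count."""
    counts = Counter(rows)
    groups = [(row, n) for row, n in counts.items() if n > 1]
    groups.sort(key=lambda item: item[1], reverse=True)
    return groups


# Each row is (위반쓰레기, 위반장소, 위반일시).
rows = [
    ("담배꽁초", "강남역 1번출구", "2022-07-11 10:10"),
    ("담배꽁초", "강남역 11번출구", "2022-01-05 13:34"),
    ("담배꽁초", "강남역 1번출구", "2022-07-11 10:10"),
    ("담배꽁초", "강남역 11번출구", "2022-01-05 13:34"),
    ("담배꽁초", "강남역 1번출구", "2022-07-11 10:10"),
    ("혼합배출", "삼성로71길 27-5 (대치동)", "2023-05-17 10:04"),
]

groups = duplicate_groups(rows)
for row, n in groups:
    print(" | ".join(row), "|", n)
```

Identical type, location, and minute may reflect several separate citations issued at the same spot, so duplicates in this dataset are not necessarily data-entry errors and are worth checking before they are dropped.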