Overview

Dataset statistics

Number of variables: 5
Number of observations: 321
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 13
Duplicate rows (%): 4.0%
Total size in memory: 13.0 KiB
Average record size in memory: 41.4 B

Variable types

Text: 2
Categorical: 1
Numeric: 1
DateTime: 1

Dataset

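Description: This dataset covers businesses located in Dongjak-gu, Seoul that discharge large quantities of food waste. It includes the business name (사업장명), business type (업종명), road-name address (도로명주소), and daily discharge (일배출현황, in kg), among other fields.
Author: 서울특별시 동작구 (Dongjak-gu, Seoul)
URL: https://www.data.go.kr/data/15037275/fileData.do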

Alerts

데이터기준일자 has constant value "2021-07-09" (Constant)
Dataset has 13 (4.0%) duplicate rows (Duplicates)
일배출량 has 80 (24.9%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 06:54:30.750974
Analysis finished: 2023-12-12 06:54:31.544409
Duration: 0.79 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

사업장명
Text

Distinct: 223
Distinct (%): 69.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 25
Median length: 19
Mean length: 8.8193146
Min length: 2
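
The length figures can be recomputed from the raw strings. The following is a minimal sketch, assuming that "length" means the number of Unicode code points per value (what Python's `len` returns). The helper `length_summary` is hypothetical and not part of ydata-profiling; the five names are the sample rows shown for this variable.

```python
from statistics import mean, median

def length_summary(values):
    """Return min, max, mean and median string length (in code points)."""
    lengths = [len(v) for v in values]
    return {
        "min": min(lengths),
        "max": max(lengths),
        "mean": mean(lengths),
        "median": median(lengths),
    }

# The five sample rows shown for this variable.
sample = ["흑석어린이집", "한우천국", "성민촌 상도점", "(주)커피빈코리아", "주식회사 마이맘이"]
summary = length_summary(sample)

# Over the whole column, mean length = total characters / observations.
reported_mean = round(2831 / 321, 7)
```

On the full column the same relationship links the reported total of 2831 characters and 321 observations to the mean length of 8.8193146.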

Characters and Unicode

Total characters: 2831
Distinct characters: 330
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
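
As an illustration of these character properties, the sketch below counts the Unicode general category of each character in one of the sample values, using the standard-library `unicodedata` module. The function name `category_counts` is hypothetical; the two-letter codes correspond to the long names used in this report (`Lo` = Other Letter, `Ps` = Open Punctuation, `Pe` = Close Punctuation, `Zs` = Space Separator, `Nd` = Decimal Number).

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Count the characters of `text` by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

# One of the sample values of this variable.
counts = category_counts("(주)커피빈코리아")
```

Here the seven Hangul syllables fall under `Lo`, and the two parentheses under `Ps` and `Pe`.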

Unique

Unique: 152
Unique (%): 47.4%
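
The report lists both a "Distinct" and a "Unique" count. A minimal sketch of the difference, assuming "distinct" counts the different values present and "unique" counts the values that occur exactly once; `distinct_and_unique` and the toy column are hypothetical.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# Hypothetical toy column, not taken from the dataset.
toy = ["A", "B", "B", "C", "C", "C", "D"]
distinct, unique = distinct_and_unique(toy)

# Shares as reported for this variable: 223 distinct and 152 unique out of 321 rows.
distinct_pct = round(100 * 223 / 321, 1)
unique_pct = round(100 * 152 / 321, 1)
```

In the toy column there are four different values, of which only "A" and "D" occur once.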

Sample

1st row: 흑석어린이집
2nd row: 한우천국
3rd row: 성민촌 상도점
4th row: (주)커피빈코리아
5th row: 주식회사 마이맘이

Value | Count | Frequency (%)
스타벅스 | 9 | 2.0%
주식회사 | 8 | 1.8%
중앙대학교병원 | 7 | 1.6%
주)에스피씨 | 6 | 1.3%
주)스타벅스커피코리아 | 6 | 1.3%
지에프에스 | 6 | 1.3%
신대방식당 | 6 | 1.3%
대방중학교 | 5 | 1.1%
장승중학교 | 4 | 0.9%
삼성웰스토리(주)중앙대기숙사 | 4 | 0.9%
Other values (269) | 388 | 86.4%

Most occurring characters

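Value | Count | Frequency (%)
(space) | 128 | 4.5%
(not preserved) | 96 | 3.4%
(not preserved) | 89 | 3.1%
(not preserved) | 84 | 3.0%
) | 72 | 2.5%
( | 72 | 2.5%
(not preserved) | 70 | 2.5%
(not preserved) | 67 | 2.4%
(not preserved) | 62 | 2.2%
(not preserved) | 48 | 1.7%
Other values (320) | 2043 | 72.2%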

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2480 | 87.6%
Space Separator | 128 | 4.5%
Close Punctuation | 72 | 2.5%
Open Punctuation | 72 | 2.5%
Uppercase Letter | 29 | 1.0%
Lowercase Letter | 26 | 0.9%
Decimal Number | 23 | 0.8%
Dash Punctuation | 1 | < 0.1%

Most frequent character per category

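Other Letter
Value | Count | Frequency (%)
(not preserved) | 96 | 3.9%
(not preserved) | 89 | 3.6%
(not preserved) | 84 | 3.4%
(not preserved) | 70 | 2.8%
(not preserved) | 67 | 2.7%
(not preserved) | 62 | 2.5%
(not preserved) | 48 | 1.9%
(not preserved) | 48 | 1.9%
(not preserved) | 47 | 1.9%
(not preserved) | 41 | 1.7%
Other values (284) | 1828 | 73.7%

Lowercase Letter
Value | Count | Frequency (%)
e | 5 | 19.2%
t | 4 | 15.4%
h | 4 | 15.4%
i | 2 | 7.7%
k | 1 | 3.8%
f | 1 | 3.8%
c | 1 | 3.8%
y | 1 | 3.8%
u | 1 | 3.8%
b | 1 | 3.8%
Other values (5) | 5 | 19.2%

Uppercase Letter
Value | Count | Frequency (%)
S | 7 | 24.1%
K | 6 | 20.7%
I | 4 | 13.8%
T | 3 | 10.3%
D | 3 | 10.3%
R | 2 | 6.9%
C | 2 | 6.9%
X | 1 | 3.4%
U | 1 | 3.4%

Decimal Number
Value | Count | Frequency (%)
0 | 6 | 26.1%
9 | 5 | 21.7%
1 | 4 | 17.4%
3 | 3 | 13.0%
7 | 2 | 8.7%
5 | 1 | 4.3%
2 | 1 | 4.3%
6 | 1 | 4.3%

Space Separator
Value | Count | Frequency (%)
(space) | 128 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 72 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 72 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%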

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2480 | 87.6%
Common | 296 | 10.5%
Latin | 55 | 1.9%

Most frequent character per script

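Hangul
Value | Count | Frequency (%)
(not preserved) | 96 | 3.9%
(not preserved) | 89 | 3.6%
(not preserved) | 84 | 3.4%
(not preserved) | 70 | 2.8%
(not preserved) | 67 | 2.7%
(not preserved) | 62 | 2.5%
(not preserved) | 48 | 1.9%
(not preserved) | 48 | 1.9%
(not preserved) | 47 | 1.9%
(not preserved) | 41 | 1.7%
Other values (284) | 1828 | 73.7%

Latin
Value | Count | Frequency (%)
S | 7 | 12.7%
K | 6 | 10.9%
e | 5 | 9.1%
t | 4 | 7.3%
h | 4 | 7.3%
I | 4 | 7.3%
T | 3 | 5.5%
D | 3 | 5.5%
R | 2 | 3.6%
i | 2 | 3.6%
Other values (14) | 15 | 27.3%

Common
Value | Count | Frequency (%)
(space) | 128 | 43.2%
) | 72 | 24.3%
( | 72 | 24.3%
0 | 6 | 2.0%
9 | 5 | 1.7%
1 | 4 | 1.4%
3 | 3 | 1.0%
7 | 2 | 0.7%
5 | 1 | 0.3%
2 | 1 | 0.3%
Other values (2) | 2 | 0.7%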

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2480 | 87.6%
ASCII | 351 | 12.4%

Most frequent character per block

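ASCII
Value | Count | Frequency (%)
(space) | 128 | 36.5%
) | 72 | 20.5%
( | 72 | 20.5%
S | 7 | 2.0%
0 | 6 | 1.7%
K | 6 | 1.7%
e | 5 | 1.4%
9 | 5 | 1.4%
t | 4 | 1.1%
h | 4 | 1.1%
Other values (26) | 42 | 12.0%

Hangul
Value | Count | Frequency (%)
(not preserved) | 96 | 3.9%
(not preserved) | 89 | 3.6%
(not preserved) | 84 | 3.4%
(not preserved) | 70 | 2.8%
(not preserved) | 67 | 2.7%
(not preserved) | 62 | 2.5%
(not preserved) | 48 | 1.9%
(not preserved) | 48 | 1.9%
(not preserved) | 47 | 1.9%
(not preserved) | 41 | 1.7%
Other values (284) | 1828 | 73.7%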

업종명
Categorical

Distinct: 7
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

집단급식소: 158
일반음식점: 128
휴게음식점: 23
기타: 5
관광숙박시설: 3
Other values (2): 4

Length

Max length: 6
Median length: 5
Mean length: 4.9688474
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 집단급식소
2nd row: 일반음식점
3rd row: 일반음식점
4th row: 일반음식점
5th row: 일반음식점

Common Values

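Value | Count | Frequency (%)
집단급식소 (institutional food service) | 158 | 49.2%
일반음식점 (general restaurant) | 128 | 39.9%
휴게음식점 (snack bar or cafe) | 23 | 7.2%
기타 (other) | 5 | 1.6%
관광숙박시설 (tourist accommodation) | 3 | 0.9%
대규모점포 (large-scale store) | 2 | 0.6%
농수산물시장 (agricultural and fishery market) | 2 | 0.6%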
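
The frequency column is each count divided by the number of observations. A short sketch that recomputes the shares from the counts reported for 업종명; the variable names are illustrative only.

```python
from collections import Counter

# Counts as reported for 업종명.
counts = Counter({
    "집단급식소": 158,
    "일반음식점": 128,
    "휴게음식점": 23,
    "기타": 5,
    "관광숙박시설": 3,
    "대규모점포": 2,
    "농수산물시장": 2,
})

total = sum(counts.values())  # should equal the number of observations
shares = {value: round(100 * n / total, 1) for value, n in counts.items()}
```

The counts add up to all 321 observations, which is consistent with the variable having no missing values.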

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

도로명주소
Text

Distinct: 202
Distinct (%): 62.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB

Length

Max length: 48
Median length: 39
Mean length: 31.068536
Min length: 1

Characters and Unicode

Total characters: 9973
Distinct characters: 247
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 125
Unique (%): 38.9%

Sample

1st row: 서울특별시 동작구 서달로4길 46 (흑석동_ 흑석어린이집)
2nd row: 서울특별시 영등포구 여의대방로 115 (신길동)
3rd row: 서울특별시 동작구 매봉로 13 (상도1동_ 상도타운)
4th row: 서울특별시 동작구 보라매로5길 35 (신대방동_ 한국컴퓨터빌딩_보라매현대APT)
5th row: 서울특별시 동작구 신대방1가길 38_ 지하1층 (신대방동_ 동작상떼빌아파트)

Value | Count | Frequency (%)
서울특별시 | 311 | 16.8%
동작구 | 310 | 16.8%
사당동 | 71 | 3.8%
신대방동 | 56 | 3.0%
대방동 | 48 | 2.6%
상도동 | 43 | 2.3%
노량진동 | 40 | 2.2%
노량진로 | 31 | 1.7%
흑석동 | 23 | 1.2%
상도로 | 18 | 1.0%
Other values (347) | 896 | 48.5%

Most occurring characters

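Value | Count | Frequency (%)
(space) | 1551 | 15.6%
(not preserved) | 700 | 7.0%
(not preserved) | 375 | 3.8%
(not preserved) | 341 | 3.4%
(not preserved) | 331 | 3.3%
(not preserved) | 329 | 3.3%
( | 323 | 3.2%
) | 323 | 3.2%
(not preserved) | 317 | 3.2%
(not preserved) | 315 | 3.2%
Other values (237) | 5068 | 50.8%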

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 6361 | 63.8%
Space Separator | 1551 | 15.6%
Decimal Number | 1057 | 10.6%
Open Punctuation | 323 | 3.2%
Close Punctuation | 323 | 3.2%
Connector Punctuation | 280 | 2.8%
Uppercase Letter | 53 | 0.5%
Dash Punctuation | 12 | 0.1%
Lowercase Letter | 8 | 0.1%
Other Punctuation | 5 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 700 | 11.0%
(not preserved) | 375 | 5.9%
(not preserved) | 341 | 5.4%
(not preserved) | 331 | 5.2%
(not preserved) | 329 | 5.2%
(not preserved) | 317 | 5.0%
(not preserved) | 315 | 5.0%
(not preserved) | 315 | 5.0%
(not preserved) | 287 | 4.5%
(not preserved) | 254 | 4.0%
Other values (203) | 2797 | 44.0%

Decimal Number
Value | Count | Frequency (%)
1 | 267 | 25.3%
2 | 159 | 15.0%
4 | 113 | 10.7%
3 | 107 | 10.1%
5 | 100 | 9.5%
6 | 86 | 8.1%
0 | 83 | 7.9%
7 | 57 | 5.4%
9 | 43 | 4.1%
8 | 42 | 4.0%

Uppercase Letter
Value | Count | Frequency (%)
T | 13 | 24.5%
A | 11 | 20.8%
P | 8 | 15.1%
S | 6 | 11.3%
V | 6 | 11.3%
K | 3 | 5.7%
B | 3 | 5.7%
I | 2 | 3.8%
D | 1 | 1.9%

Lowercase Letter
Value | Count | Frequency (%)
k | 1 | 12.5%
n | 1 | 12.5%
t | 1 | 12.5%
a | 1 | 12.5%
j | 1 | 12.5%
g | 1 | 12.5%
o | 1 | 12.5%
s | 1 | 12.5%

Other Punctuation
Value | Count | Frequency (%)
/ | 4 | 80.0%
. | 1 | 20.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1551 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 323 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 323 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 280 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 12 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 6361 | 63.8%
Common | 3551 | 35.6%
Latin | 61 | 0.6%

Most frequent character per script

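Hangul
Value | Count | Frequency (%)
(not preserved) | 700 | 11.0%
(not preserved) | 375 | 5.9%
(not preserved) | 341 | 5.4%
(not preserved) | 331 | 5.2%
(not preserved) | 329 | 5.2%
(not preserved) | 317 | 5.0%
(not preserved) | 315 | 5.0%
(not preserved) | 315 | 5.0%
(not preserved) | 287 | 4.5%
(not preserved) | 254 | 4.0%
Other values (203) | 2797 | 44.0%

Common
Value | Count | Frequency (%)
(space) | 1551 | 43.7%
( | 323 | 9.1%
) | 323 | 9.1%
_ | 280 | 7.9%
1 | 267 | 7.5%
2 | 159 | 4.5%
4 | 113 | 3.2%
3 | 107 | 3.0%
5 | 100 | 2.8%
6 | 86 | 2.4%
Other values (7) | 242 | 6.8%

Latin
Value | Count | Frequency (%)
T | 13 | 21.3%
A | 11 | 18.0%
P | 8 | 13.1%
S | 6 | 9.8%
V | 6 | 9.8%
K | 3 | 4.9%
B | 3 | 4.9%
I | 2 | 3.3%
k | 1 | 1.6%
n | 1 | 1.6%
Other values (7) | 7 | 11.5%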

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 6361 | 63.8%
ASCII | 3612 | 36.2%

Most frequent character per block

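ASCII
Value | Count | Frequency (%)
(space) | 1551 | 42.9%
( | 323 | 8.9%
) | 323 | 8.9%
_ | 280 | 7.8%
1 | 267 | 7.4%
2 | 159 | 4.4%
4 | 113 | 3.1%
3 | 107 | 3.0%
5 | 100 | 2.8%
6 | 86 | 2.4%
Other values (24) | 303 | 8.4%

Hangul
Value | Count | Frequency (%)
(not preserved) | 700 | 11.0%
(not preserved) | 375 | 5.9%
(not preserved) | 341 | 5.4%
(not preserved) | 331 | 5.2%
(not preserved) | 329 | 5.2%
(not preserved) | 317 | 5.0%
(not preserved) | 315 | 5.0%
(not preserved) | 315 | 5.0%
(not preserved) | 287 | 4.5%
(not preserved) | 254 | 4.0%
Other values (203) | 2797 | 44.0%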

일배출량
Real number (ℝ)

ZEROS 

Distinct: 97
Distinct (%): 30.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2125.4961
Minimum: 0
Maximum: 117613
Zeros: 80
Zeros (%): 24.9%
Negative: 0
Negative (%): 0.0%
Memory size: 3.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
Median: 300
Q3: 1360
95-th percentile: 7920
Maximum: 117613
Range: 117613
Interquartile range (IQR): 1358

Descriptive statistics

Standard deviation: 9160.5521
Coefficient of variation (CV): 4.3098418
Kurtosis: 109.1343
Mean: 2125.4961
Median Absolute Deviation (MAD): 300
Skewness: 9.76145
Sum: 682284.25
Variance: 83915714
Monotonicity: Not monotonic
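
Several of these figures follow from one another, which gives a quick consistency check. The sketch below uses only numbers copied from this report; the dictionary name `checks` is illustrative. Because the inputs are rounded, the results agree with the report only to within rounding.

```python
# Figures copied from the report for 일배출량.
n = 321            # number of observations
total = 682284.25  # Sum
mean = 2125.4961   # Mean
std = 9160.5521    # Standard deviation
q1, q3 = 2, 1360   # quartiles
zeros = 80         # number of zero values

checks = {
    "mean": total / n,             # sum divided by the number of observations
    "cv": std / mean,              # coefficient of variation
    "variance": std ** 2,          # variance is the squared standard deviation
    "iqr": q3 - q1,                # interquartile range
    "zeros_pct": 100 * zeros / n,  # share of zero values, in percent
}
```

A coefficient of variation above 4, together with the large skewness and kurtosis, indicates that a small number of very large values dominate the mean and standard deviation.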
Histogram with fixed size bins (bins=50)
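
A fixed-size-bin histogram splits the value range into equal-width intervals. The following is a minimal sketch, assuming the bin edges run from the minimum to the maximum and that the last bin includes its right edge (the default convention of NumPy's `histogram`). The function `fixed_bin_counts` and the six example values are hypothetical; the report does not state how its plot was computed.

```python
def fixed_bin_counts(values, bins):
    """Count values in `bins` equal-width intervals spanning min..max."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        # The maximum belongs to the last bin, which is closed on the right.
        idx = min(int((v - lo) / width), bins - 1) if width else 0
        counts[idx] += 1
    return counts

# A few values from the common-value and maximum tables, for illustration.
counts = fixed_bin_counts([0, 0, 300, 600, 2000, 117613], bins=50)
bin_width = 117613 / 50
```

With a maximum of 117613 and 50 bins, each bin is roughly 2352 wide, so every value up to Q3 (1360) falls into the first bin.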

Common values
Value | Count | Frequency (%)
0.0 | 80 | 24.9%
300.0 | 21 | 6.5%
600.0 | 12 | 3.7%
2000.0 | 11 | 3.4%
60.0 | 10 | 3.1%
500.0 | 10 | 3.1%
400.0 | 8 | 2.5%
250.0 | 7 | 2.2%
1500.0 | 7 | 2.2%
1000.0 | 6 | 1.9%
Other values (87) | 149 | 46.4%

Minimum 10 values
Value | Count | Frequency (%)
0.0 | 80 | 24.9%
2.0 | 1 | 0.3%
3.0 | 1 | 0.3%
5.0 | 1 | 0.3%
10.0 | 6 | 1.9%
15.0 | 4 | 1.2%
16.0 | 1 | 0.3%
20.0 | 1 | 0.3%
25.0 | 1 | 0.3%
30.0 | 6 | 1.9%

Maximum 10 values
Value | Count | Frequency (%)
117613.0 | 1 | 0.3%
92771.0 | 1 | 0.3%
30000.0 | 4 | 1.2%
20000.0 | 1 | 0.3%
18000.0 | 1 | 0.3%
15000.0 | 2 | 0.6%
10000.0 | 1 | 0.3%
9150.0 | 1 | 0.3%
9000.0 | 1 | 0.3%
8500.0 | 1 | 0.3%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.6 KiB
Minimum: 2021-07-09 00:00:00
Maximum: 2021-07-09 00:00:00
[Figure: Histogram with fixed size bins (bins=1)]

Interactions

[Figure: interactions plot]

Correlations

[Figure: correlation heatmap]
Variable | 업종명 | 일배출량
업종명 | 1.000 | 0.484
일배출량 | 0.484 | 1.000

[Figure: correlation heatmap]
Variable | 일배출량 | 업종명
일배출량 | 1.000 | 0.335
업종명 | 0.335 | 1.000

Missing values

[Figure] A simple visualization of nullity by column.
[Figure] Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 사업장명 | 업종명 | 도로명주소 | 일배출량 | 데이터기준일자
0 | 흑석어린이집 | 집단급식소 | 서울특별시 동작구 서달로4길 46 (흑석동_ 흑석어린이집) | 300.0 | 2021-07-09
1 | 한우천국 | 일반음식점 | 서울특별시 영등포구 여의대방로 115 (신길동) | 2500.0 | 2021-07-09
2 | 성민촌 상도점 | 일반음식점 | 서울특별시 동작구 매봉로 13 (상도1동_ 상도타운) | 3000.0 | 2021-07-09
3 | (주)커피빈코리아 | 일반음식점 | 서울특별시 동작구 보라매로5길 35 (신대방동_ 한국컴퓨터빌딩_보라매현대APT) | 5.0 | 2021-07-09
4 | 주식회사 마이맘이 | 일반음식점 | 서울특별시 동작구 신대방1가길 38_ 지하1층 (신대방동_ 동작상떼빌아파트) | 60.0 | 2021-07-09
5 | 주식회사 마이맘이 | 일반음식점 | 서울특별시 동작구 신대방1가길 38_ 지하1층 (신대방동_ 동작상떼빌아파트) | 0.0 | 2021-07-09
6 | 구립 꿈나무 어린이집 | 집단급식소 | 서울특별시 동작구 여의대방로36길 11 (대방동_ 동작구육아종합지원센터) | 400.0 | 2021-07-09
7 | (주) 정캐터링 숭실대학교 제2생활관 구내식당 | 집단급식소 | 서울특별시 동작구 매봉로 43 (상도1동_ 예담어린이집) | 300.0 | 2021-07-09
8 | 맥도날드(유)숭실대점 | 일반음식점 | 서울특별시 동작구 사당로 22 (상도동_ 창주빌딩) | 600.0 | 2021-07-09
9 | 중앙대학교 사범대학 부속초등학교 | 집단급식소 | 서울특별시 동작구 서달로 135 (흑석동_ 중앙대부속초등학교) | 2000.0 | 2021-07-09

Last rows
Index | 사업장명 | 업종명 | 도로명주소 | 일배출량 | 데이터기준일자
311 | 숭의여자고등학교 | 집단급식소 | 서울특별시 동작구 여의대방로36길 79 (대방동_ 숭의여자중고등학교) | 0.0 | 2021-07-09
312 | (주)에스피씨 지에프에스 신대방식당 | 집단급식소 | 서울특별시 동작구 신대방16길 26_ 1층 (신대방동_ (주)성일화학) | 60.0 | 2021-07-09
313 | (주)에스피씨 지에프에스 신대방식당 | 집단급식소 | 서울특별시 동작구 신대방16길 26_ 1층 (신대방동_ (주)성일화학) | 0.0 | 2021-07-09
314 | (주)스타벅스커피코리아 신대방삼거리역점 | 휴게음식점 | 서울특별시 동작구 상도로 102_ 성대시장 (상도동) | 10.0 | 2021-07-09
315 | (주)스타벅스커피코리아 신대방삼거리역점 | 휴게음식점 | 서울특별시 동작구 상도로 102_ 성대시장 (상도동) | 0.0 | 2021-07-09
316 | 상현초등학교 | 집단급식소 | 서울특별시 동작구 상도로58길 21_ 서울상현초등학교 (상도동) | 113.0 | 2021-07-09
317 | 상현초등학교 | 집단급식소 | 서울특별시 동작구 상도로58길 21_ 서울상현초등학교 (상도동) | 0.0 | 2021-07-09
318 | 제이엘푸드 | 집단급식소 | 서울특별시 동작구 상도로 369_ 숭실대학교 레지던스홀 지하1층 식당 (상도동) | 15.0 | 2021-07-09
319 | (주)스타벅스커피코리아 보라매공원R점 | 휴게음식점 | 서울특별시 동작구 보라매로5길 35 (신대방동_ 파크스퀘어_보라매현대APT) | 10.0 | 2021-07-09
320 | (주)스타벅스커피코리아 보라매공원R점 | 휴게음식점 | 서울특별시 동작구 보라매로5길 35 (신대방동_ 파크스퀘어_보라매현대APT) | 0.0 | 2021-07-09

Duplicate rows

Most frequently occurring

Index | 사업장명 | 업종명 | 도로명주소 | 일배출량 | 데이터기준일자 | # duplicates
1 | (주)에스피씨 지에프에스 신대방식당 | 집단급식소 | 서울특별시 동작구 신대방16길 26_ 1층 (신대방동_ (주)성일화학) | 0.0 | 2021-07-09 | 3
2 | (주)에스피씨 지에프에스 신대방식당 | 집단급식소 | 서울특별시 동작구 신대방16길 26_ 1층 (신대방동_ (주)성일화학) | 60.0 | 2021-07-09 | 3
11 | 중앙대학교병원 | 집단급식소 | 서울특별시 동작구 흑석로 102 (흑석동_중앙대학교병원) | 30000.0 | 2021-07-09 | 3
0 | (재)화성시장학관 | 집단급식소 | 서울특별시 동작구 성대로11길 60_ 화성시장학관 (상도동) | 30.0 | 2021-07-09 | 2
3 | the KIDS 대방어린이집 | 기타 | 서울특별시 동작구 보라매로5가길 24_ 2층 (신대방동_ 보라매나산스위트) | 0.0 | 2021-07-09 | 2
4 | 골든볼9 | 일반음식점 | 서울특별시 동작구 노량진로14가길 11_ 나루아카데미 (노량진동) | 0.0 | 2021-07-09 | 2
5 | 대방중학교 | 집단급식소 | 서울특별시 동작구 여의대방로10길 24 (신대방동) | 0.0 | 2021-07-09 | 2
6 | 대방중학교 | 집단급식소 | 서울특별시 동작구 여의대방로10길 24 (신대방동) | 120.0 | 2021-07-09 | 2
7 | 맥도날드(유)숭실대점 | 일반음식점 | 서울특별시 동작구 사당로 22 (상도동_ 창주빌딩) | 600.0 | 2021-07-09 | 2
8 | 삼성웰스토리(주)중앙대기숙사 | 집단급식소 | 서울특별시 동작구 흑석로 84 (흑석동_ 중앙대학교) | 0.0 | 2021-07-09 | 2
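
A table like this can be produced by grouping identical rows and keeping the groups that occur more than once. The sketch below assumes the reported figure of 13 duplicate rows counts such groups rather than individual repeated records; that reading is an assumption, although it fits the row indices shown, which run up to 11. The helper `duplicate_groups` is hypothetical, and the rows are shortened to three fields for illustration.

```python
from collections import Counter

def duplicate_groups(rows):
    """Return {row: occurrences} for rows that occur more than once."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}

# Illustrative rows (name, type, daily discharge), shortened from the table.
rows = [
    ("대방중학교", "집단급식소", 0.0),
    ("대방중학교", "집단급식소", 0.0),
    ("대방중학교", "집단급식소", 120.0),
    ("대방중학교", "집단급식소", 120.0),
    ("상현초등학교", "집단급식소", 113.0),
]
groups = duplicate_groups(rows)

# Reported share: 13 duplicate rows out of 321 observations.
duplicate_pct = round(100 * 13 / 321, 1)
```

Rows must match in every field to be grouped, which is why the same business appears twice in the table when its 일배출량 values differ.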