Overview

Dataset statistics

Number of variables2
Number of observations148
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.4 KiB
Average record size in memory16.9 B

Variable types

Text2

Dataset

Description공공데이터 제공신청 접수에 따라 음식물류폐기물 다량배출사업장 데이터를 제공함음식물류 폐기물 다량배출사업장 현황에 대한 자료로 사업장명, 소재지 정보를 제공함.
Author서울특별시 관악구
URLhttps://www.data.go.kr/data/15074537/fileData.do

Alerts

사업장명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 20:31:35.466290
Analysis finished2023-12-12 20:31:35.717915
Duration0.25 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업장명
Text

UNIQUE 

Distinct148
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T05:31:35.860314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length19
Mean length7.6959459
Min length2

Characters and Unicode

Total characters1139
Distinct characters270
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique148 ?
Unique (%)100.0%

Sample

1st row롯데리아 서울대입구역점
2nd row관악농협하나로마트 보라매점
3rd row아웃백스테이크서울대점
4th row상무초밥 구로디지털점
5th row담양에초대
ValueCountFrequency (%)
서울대점 3
 
1.7%
kfc 3
 
1.7%
신림점 3
 
1.7%
신림역점 2
 
1.1%
고기싸롱 2
 
1.1%
농장사람들 2
 
1.1%
조마루감자탕 2
 
1.1%
서울대입구역점 2
 
1.1%
영략의료과학고등학교 1
 
0.6%
신림고등학교 1
 
0.6%
Other values (158) 158
88.3%
2023-12-13T05:31:36.284987image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
59
 
5.2%
56
 
4.9%
38
 
3.3%
31
 
2.7%
27
 
2.4%
26
 
2.3%
26
 
2.3%
25
 
2.2%
22
 
1.9%
22
 
1.9%
Other values (260) 807
70.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1049
92.1%
Space Separator 31
 
2.7%
Close Punctuation 16
 
1.4%
Open Punctuation 16
 
1.4%
Uppercase Letter 15
 
1.3%
Decimal Number 4
 
0.4%
Other Symbol 3
 
0.3%
Lowercase Letter 3
 
0.3%
Other Punctuation 2
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
59
 
5.6%
56
 
5.3%
38
 
3.6%
27
 
2.6%
26
 
2.5%
26
 
2.5%
25
 
2.4%
22
 
2.1%
22
 
2.1%
20
 
1.9%
Other values (241) 728
69.4%
Uppercase Letter
ValueCountFrequency (%)
C 4
26.7%
F 3
20.0%
K 3
20.0%
S 2
13.3%
P 1
 
6.7%
L 1
 
6.7%
W 1
 
6.7%
Decimal Number
ValueCountFrequency (%)
3 1
25.0%
4 1
25.0%
9 1
25.0%
1 1
25.0%
Lowercase Letter
ValueCountFrequency (%)
c 1
33.3%
f 1
33.3%
k 1
33.3%
Space Separator
ValueCountFrequency (%)
31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1052
92.4%
Common 69
 
6.1%
Latin 18
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
59
 
5.6%
56
 
5.3%
38
 
3.6%
27
 
2.6%
26
 
2.5%
26
 
2.5%
25
 
2.4%
22
 
2.1%
22
 
2.1%
20
 
1.9%
Other values (242) 731
69.5%
Latin
ValueCountFrequency (%)
C 4
22.2%
F 3
16.7%
K 3
16.7%
S 2
11.1%
c 1
 
5.6%
f 1
 
5.6%
k 1
 
5.6%
P 1
 
5.6%
L 1
 
5.6%
W 1
 
5.6%
Common
ValueCountFrequency (%)
31
44.9%
) 16
23.2%
( 16
23.2%
, 2
 
2.9%
3 1
 
1.4%
4 1
 
1.4%
9 1
 
1.4%
1 1
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1049
92.1%
ASCII 87
 
7.6%
None 3
 
0.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
59
 
5.6%
56
 
5.3%
38
 
3.6%
27
 
2.6%
26
 
2.5%
26
 
2.5%
25
 
2.4%
22
 
2.1%
22
 
2.1%
20
 
1.9%
Other values (241) 728
69.4%
ASCII
ValueCountFrequency (%)
31
35.6%
) 16
18.4%
( 16
18.4%
C 4
 
4.6%
F 3
 
3.4%
K 3
 
3.4%
S 2
 
2.3%
, 2
 
2.3%
c 1
 
1.1%
f 1
 
1.1%
Other values (8) 8
 
9.2%
None
ValueCountFrequency (%)
3
100.0%
Distinct139
Distinct (%)93.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
2023-12-13T05:31:36.741275image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length18.439189
Min length15

Characters and Unicode

Total characters2729
Distinct characters65
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique131 ?
Unique (%)88.5%

Sample

1st row서울특별시 관악구 관악로 165
2nd row서울특별시 관악구 보라매로 12
3rd row서울특별시 관악구 남부순환로 1840
4th row서울특별시 관악구 시흥대로 568
5th row서울특별시 관악구 남현1길 38-10
ValueCountFrequency (%)
서울특별시 148
24.1%
관악구 148
24.1%
남부순환로 42
 
6.8%
신림로 19
 
3.1%
관악로 15
 
2.4%
봉천로 9
 
1.5%
양녕로 6
 
1.0%
시흥대로 5
 
0.8%
은천로 4
 
0.7%
난곡로 4
 
0.7%
Other values (172) 215
35.0%
2023-12-13T05:31:37.366539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
467
17.1%
165
 
6.0%
165
 
6.0%
153
 
5.6%
150
 
5.5%
148
 
5.4%
148
 
5.4%
148
 
5.4%
148
 
5.4%
129
 
4.7%
Other values (55) 908
33.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1763
64.6%
Decimal Number 491
 
18.0%
Space Separator 467
 
17.1%
Dash Punctuation 8
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
165
9.4%
165
9.4%
153
8.7%
150
8.5%
148
8.4%
148
8.4%
148
8.4%
148
8.4%
129
 
7.3%
52
 
2.9%
Other values (43) 357
20.2%
Decimal Number
ValueCountFrequency (%)
1 111
22.6%
4 54
11.0%
5 50
10.2%
6 50
10.2%
2 50
10.2%
3 46
9.4%
0 39
 
7.9%
8 34
 
6.9%
7 32
 
6.5%
9 25
 
5.1%
Space Separator
ValueCountFrequency (%)
467
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1763
64.6%
Common 966
35.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
165
9.4%
165
9.4%
153
8.7%
150
8.5%
148
8.4%
148
8.4%
148
8.4%
148
8.4%
129
 
7.3%
52
 
2.9%
Other values (43) 357
20.2%
Common
ValueCountFrequency (%)
467
48.3%
1 111
 
11.5%
4 54
 
5.6%
5 50
 
5.2%
6 50
 
5.2%
2 50
 
5.2%
3 46
 
4.8%
0 39
 
4.0%
8 34
 
3.5%
7 32
 
3.3%
Other values (2) 33
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1763
64.6%
ASCII 966
35.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
467
48.3%
1 111
 
11.5%
4 54
 
5.6%
5 50
 
5.2%
6 50
 
5.2%
2 50
 
5.2%
3 46
 
4.8%
0 39
 
4.0%
8 34
 
3.5%
7 32
 
3.3%
Other values (2) 33
 
3.4%
Hangul
ValueCountFrequency (%)
165
9.4%
165
9.4%
153
8.7%
150
8.5%
148
8.4%
148
8.4%
148
8.4%
148
8.4%
129
 
7.3%
52
 
2.9%
Other values (43) 357
20.2%

Missing values

2023-12-13T05:31:35.640078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:31:35.694836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명소재지
0롯데리아 서울대입구역점서울특별시 관악구 관악로 165
1관악농협하나로마트 보라매점서울특별시 관악구 보라매로 12
2아웃백스테이크서울대점서울특별시 관악구 남부순환로 1840
3상무초밥 구로디지털점서울특별시 관악구 시흥대로 568
4담양에초대서울특별시 관악구 남현1길 38-10
5황제갈비서울특별시 관악구 남부순환로 1644
6우디가스트로서울특별시 관악구 관악로 116
7비케이알(버거킹) 신림역서울특별시 관악구 신림로 318
8들꺠향밀내음서울특별시 관악구 관악로 210
9명륜진사갈비서울특별시 관악구 양녕로 29
사업장명소재지
138광신고등학교서울특별시 관악구 광신길 141
139서울남부초등학교서울특별시 관악구 남부순환로 163길 14
140인헌초등학교서울특별시 관악구 낙성대길 23
141구암고등학교서울특별시 관악구 성현로 57
142서울난곡초등학교(병설유치원 포함)서울특별시 관악구 난곡로 35길
143봉원중학교서울특별시 관악구 관악로 24가길 15
144서울미술고등학교서울특별시 관악구 남부순환로 247길 26
145척편한병원서울특별시 관악구 신림로 318
146서울정문학교서울특별시 관악구 난향3길 31
147서울산업정보학교서울특별시 관악구 신림로 67