Overview

Dataset statistics

Number of variables4
Number of observations383
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory12.5 KiB
Average record size in memory33.3 B

Variable types

Numeric1
Text3

Dataset

Description부산광역시_사상구_쓰레기종량제봉투판매업소현황_20230404
Author부산광역시 사상구
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=3078747

Alerts

연번 has unique valuesUnique
관리번호 has unique valuesUnique

Reproduction

Analysis started2023-12-10 17:04:27.843054
Analysis finished2023-12-10 17:04:28.787029
Duration0.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct383
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean192
Minimum1
Maximum383
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.5 KiB
2023-12-11T02:04:28.963039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile20.1
Q196.5
median192
Q3287.5
95-th percentile363.9
Maximum383
Range382
Interquartile range (IQR)191

Descriptive statistics

Standard deviation110.70682
Coefficient of variation (CV)0.57659802
Kurtosis-1.2
Mean192
Median Absolute Deviation (MAD)96
Skewness0
Sum73536
Variance12256
MonotonicityStrictly increasing
2023-12-11T02:04:29.240271image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
2 1
 
0.3%
263 1
 
0.3%
262 1
 
0.3%
261 1
 
0.3%
260 1
 
0.3%
259 1
 
0.3%
258 1
 
0.3%
257 1
 
0.3%
256 1
 
0.3%
Other values (373) 373
97.4%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
383 1
0.3%
382 1
0.3%
381 1
0.3%
380 1
0.3%
379 1
0.3%
378 1
0.3%
377 1
0.3%
376 1
0.3%
375 1
0.3%
374 1
0.3%

관리번호
Text

UNIQUE 

Distinct383
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2023-12-11T02:04:29.851737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length6
Mean length5.7597911
Min length4

Characters and Unicode

Total characters2206
Distinct characters27
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique383 ?
Unique (%)100.0%

Sample

1st row삼락-8
2nd row삼락-10
3rd row삼락-17
4th row삼락-29
5th row삼락-38
ValueCountFrequency (%)
삼락-8 1
 
0.3%
주례2-57 1
 
0.3%
주례2-42 1
 
0.3%
주례2-40 1
 
0.3%
주례2-39 1
 
0.3%
주례2-32 1
 
0.3%
주례2-31 1
 
0.3%
주례2-30 1
 
0.3%
주례2-12 1
 
0.3%
주례2-9 1
 
0.3%
Other values (373) 373
97.4%
2023-12-11T02:04:30.667210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 384
17.4%
1 278
 
12.6%
2 160
 
7.3%
3 101
 
4.6%
7 93
 
4.2%
8 83
 
3.8%
80
 
3.6%
80
 
3.6%
9 79
 
3.6%
4 77
 
3.5%
Other values (17) 791
35.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1056
47.9%
Other Letter 766
34.7%
Dash Punctuation 384
 
17.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
10.4%
80
10.4%
66
 
8.6%
66
 
8.6%
61
 
8.0%
61
 
8.0%
50
 
6.5%
50
 
6.5%
42
 
5.5%
42
 
5.5%
Other values (6) 168
21.9%
Decimal Number
ValueCountFrequency (%)
1 278
26.3%
2 160
15.2%
3 101
 
9.6%
7 93
 
8.8%
8 83
 
7.9%
9 79
 
7.5%
4 77
 
7.3%
6 70
 
6.6%
5 66
 
6.2%
0 49
 
4.6%
Dash Punctuation
ValueCountFrequency (%)
- 384
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1440
65.3%
Hangul 766
34.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
10.4%
80
10.4%
66
 
8.6%
66
 
8.6%
61
 
8.0%
61
 
8.0%
50
 
6.5%
50
 
6.5%
42
 
5.5%
42
 
5.5%
Other values (6) 168
21.9%
Common
ValueCountFrequency (%)
- 384
26.7%
1 278
19.3%
2 160
11.1%
3 101
 
7.0%
7 93
 
6.5%
8 83
 
5.8%
9 79
 
5.5%
4 77
 
5.3%
6 70
 
4.9%
5 66
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1440
65.3%
Hangul 766
34.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 384
26.7%
1 278
19.3%
2 160
11.1%
3 101
 
7.0%
7 93
 
6.5%
8 83
 
5.8%
9 79
 
5.5%
4 77
 
5.3%
6 70
 
4.9%
5 66
 
4.6%
Hangul
ValueCountFrequency (%)
80
10.4%
80
10.4%
66
 
8.6%
66
 
8.6%
61
 
8.0%
61
 
8.0%
50
 
6.5%
50
 
6.5%
42
 
5.5%
42
 
5.5%
Other values (6) 168
21.9%

상호
Text

Distinct367
Distinct (%)95.8%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2023-12-11T02:04:31.089332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length17
Mean length8.2402089
Min length3

Characters and Unicode

Total characters3156
Distinct characters303
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique355 ?
Unique (%)92.7%

Sample

1st row대영상사
2nd row지에스(GS)25삼락공단점
3rd row빅세일마트
4th row한남슈퍼
5th row미니스톱사상삼락점
ValueCountFrequency (%)
씨유 45
 
8.3%
gs25 23
 
4.3%
이마트24 17
 
3.1%
세븐일레븐 16
 
3.0%
지에스(gs)25 10
 
1.8%
지에스25 7
 
1.3%
탑플러스마트 4
 
0.7%
럭키슈퍼 4
 
0.7%
사상점 3
 
0.6%
㈜코리아세븐 3
 
0.6%
Other values (385) 409
75.6%
2023-12-11T02:04:31.770621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
206
 
6.5%
161
 
5.1%
114
 
3.6%
113
 
3.6%
2 83
 
2.6%
73
 
2.3%
70
 
2.2%
64
 
2.0%
62
 
2.0%
59
 
1.9%
Other values (293) 2151
68.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2601
82.4%
Decimal Number 166
 
5.3%
Space Separator 161
 
5.1%
Uppercase Letter 121
 
3.8%
Open Punctuation 37
 
1.2%
Close Punctuation 37
 
1.2%
Other Symbol 18
 
0.6%
Lowercase Letter 14
 
0.4%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
206
 
7.9%
114
 
4.4%
113
 
4.3%
73
 
2.8%
70
 
2.7%
64
 
2.5%
62
 
2.4%
59
 
2.3%
56
 
2.2%
54
 
2.1%
Other values (260) 1730
66.5%
Uppercase Letter
ValueCountFrequency (%)
S 46
38.0%
G 44
36.4%
C 6
 
5.0%
H 4
 
3.3%
U 4
 
3.3%
M 3
 
2.5%
R 3
 
2.5%
T 2
 
1.7%
K 2
 
1.7%
E 2
 
1.7%
Other values (4) 5
 
4.1%
Lowercase Letter
ValueCountFrequency (%)
e 3
21.4%
a 2
14.3%
r 2
14.3%
m 2
14.3%
c 2
14.3%
f 1
 
7.1%
i 1
 
7.1%
t 1
 
7.1%
Decimal Number
ValueCountFrequency (%)
2 83
50.0%
5 56
33.7%
4 22
 
13.3%
1 2
 
1.2%
3 2
 
1.2%
6 1
 
0.6%
Space Separator
ValueCountFrequency (%)
161
100.0%
Open Punctuation
ValueCountFrequency (%)
( 37
100.0%
Close Punctuation
ValueCountFrequency (%)
) 37
100.0%
Other Symbol
ValueCountFrequency (%)
18
100.0%
Math Symbol
ValueCountFrequency (%)
> 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2619
83.0%
Common 402
 
12.7%
Latin 135
 
4.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
206
 
7.9%
114
 
4.4%
113
 
4.3%
73
 
2.8%
70
 
2.7%
64
 
2.4%
62
 
2.4%
59
 
2.3%
56
 
2.1%
54
 
2.1%
Other values (261) 1748
66.7%
Latin
ValueCountFrequency (%)
S 46
34.1%
G 44
32.6%
C 6
 
4.4%
H 4
 
3.0%
U 4
 
3.0%
M 3
 
2.2%
e 3
 
2.2%
R 3
 
2.2%
a 2
 
1.5%
r 2
 
1.5%
Other values (12) 18
 
13.3%
Common
ValueCountFrequency (%)
161
40.0%
2 83
20.6%
5 56
 
13.9%
( 37
 
9.2%
) 37
 
9.2%
4 22
 
5.5%
1 2
 
0.5%
3 2
 
0.5%
6 1
 
0.2%
> 1
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2601
82.4%
ASCII 537
 
17.0%
None 18
 
0.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
206
 
7.9%
114
 
4.4%
113
 
4.3%
73
 
2.8%
70
 
2.7%
64
 
2.5%
62
 
2.4%
59
 
2.3%
56
 
2.2%
54
 
2.1%
Other values (260) 1730
66.5%
ASCII
ValueCountFrequency (%)
161
30.0%
2 83
15.5%
5 56
 
10.4%
S 46
 
8.6%
G 44
 
8.2%
( 37
 
6.9%
) 37
 
6.9%
4 22
 
4.1%
C 6
 
1.1%
H 4
 
0.7%
Other values (22) 41
 
7.6%
None
ValueCountFrequency (%)
18
100.0%

주소
Text

Distinct381
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
2023-12-11T02:04:32.223756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length44
Mean length21.924282
Min length10

Characters and Unicode

Total characters8397
Distinct characters171
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique379 ?
Unique (%)99.0%

Sample

1st row사상구 낙동대로1510번길 34, B동 101호(삼락동, 대영빌라)
2nd row사상구 모덕로 32, 1층(삼락동)
3rd row사상구 사상로309번길 58(삼락동)
4th row사상구 낙동대로1548번길 28(삼락동)
5th row사상구 사상로277번길 33(삼락동)
ValueCountFrequency (%)
사상구 377
27.1%
백양대로 28
 
2.0%
사상로 20
 
1.4%
대동로 19
 
1.4%
1층 16
 
1.1%
학감대로 13
 
0.9%
상가동 12
 
0.9%
엄궁로 11
 
0.8%
광장로 10
 
0.7%
낙동대로 10
 
0.7%
Other values (627) 877
63.0%
2023-12-11T02:04:32.935888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1019
 
12.1%
503
 
6.0%
455
 
5.4%
454
 
5.4%
1 436
 
5.2%
383
 
4.6%
380
 
4.5%
) 362
 
4.3%
( 362
 
4.3%
2 260
 
3.1%
Other values (161) 3783
45.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4581
54.6%
Decimal Number 1842
21.9%
Space Separator 1019
 
12.1%
Close Punctuation 362
 
4.3%
Open Punctuation 362
 
4.3%
Other Punctuation 193
 
2.3%
Dash Punctuation 33
 
0.4%
Uppercase Letter 4
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
503
 
11.0%
455
 
9.9%
454
 
9.9%
383
 
8.4%
380
 
8.3%
203
 
4.4%
202
 
4.4%
152
 
3.3%
100
 
2.2%
94
 
2.1%
Other values (142) 1655
36.1%
Decimal Number
ValueCountFrequency (%)
1 436
23.7%
2 260
14.1%
0 209
11.3%
3 195
10.6%
4 165
 
9.0%
5 126
 
6.8%
6 121
 
6.6%
7 114
 
6.2%
9 112
 
6.1%
8 104
 
5.6%
Other Punctuation
ValueCountFrequency (%)
, 190
98.4%
@ 3
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
A 3
75.0%
B 1
 
25.0%
Space Separator
ValueCountFrequency (%)
1019
100.0%
Close Punctuation
ValueCountFrequency (%)
) 362
100.0%
Open Punctuation
ValueCountFrequency (%)
( 362
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4581
54.6%
Common 3812
45.4%
Latin 4
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
503
 
11.0%
455
 
9.9%
454
 
9.9%
383
 
8.4%
380
 
8.3%
203
 
4.4%
202
 
4.4%
152
 
3.3%
100
 
2.2%
94
 
2.1%
Other values (142) 1655
36.1%
Common
ValueCountFrequency (%)
1019
26.7%
1 436
11.4%
) 362
 
9.5%
( 362
 
9.5%
2 260
 
6.8%
0 209
 
5.5%
3 195
 
5.1%
, 190
 
5.0%
4 165
 
4.3%
5 126
 
3.3%
Other values (7) 488
12.8%
Latin
ValueCountFrequency (%)
A 3
75.0%
B 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4581
54.6%
ASCII 3816
45.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1019
26.7%
1 436
11.4%
) 362
 
9.5%
( 362
 
9.5%
2 260
 
6.8%
0 209
 
5.5%
3 195
 
5.1%
, 190
 
5.0%
4 165
 
4.3%
5 126
 
3.3%
Other values (9) 492
12.9%
Hangul
ValueCountFrequency (%)
503
 
11.0%
455
 
9.9%
454
 
9.9%
383
 
8.4%
380
 
8.3%
203
 
4.4%
202
 
4.4%
152
 
3.3%
100
 
2.2%
94
 
2.1%
Other values (142) 1655
36.1%

Interactions

2023-12-11T02:04:28.286920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-11T02:04:28.552776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T02:04:28.724659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번관리번호상호주소
01삼락-8대영상사사상구 낙동대로1510번길 34, B동 101호(삼락동, 대영빌라)
12삼락-10지에스(GS)25삼락공단점사상구 모덕로 32, 1층(삼락동)
23삼락-17빅세일마트사상구 사상로309번길 58(삼락동)
34삼락-29한남슈퍼사상구 낙동대로1548번길 28(삼락동)
45삼락-38미니스톱사상삼락점사상구 사상로277번길 33(삼락동)
56삼락-41DC플러스마트사상구 운산로 27(삼락동)
67삼락-42모닝마트사상구 삼덕로46번안길 33(삼락동)
78삼락-47일번지마트사상구 사상로277번길 61(삼락동)
89삼락-49현대상회사상구 사상로277번가길 21(삼락동)
910삼락-51씨유 삼락공단점사상구 사상로319번길 37(삼락동)
연번관리번호상호주소
373374엄궁-90씨유 사상엄궁점사상구 엄궁로 203(엄궁동)
374375엄궁-91씨유 엄궁공단점사상구 농산물시장로 22,1층(엄궁동)
375376엄궁-93세븐일레븐부산엄궁대박점사상구 낙동대로 720(엄궁동)
376377엄궁-94이마트24사상스타빌점사상구 대동로 25, 103호(엄궁동)
377378엄궁-95지에스(GS)25사상동궁점사상구 엄궁로 71(엄궁동)
378379엄궁-96지에스더프레시(GS THE FRESH)엄궁점사상구 엄궁북로 70(엄궁동)
379380엄궁-97푸드엔엄궁점사상구 농산물시장로42(엄궁동)
380381엄궁-98㈜노포축산사상구 낙동대로 702
381382엄궁-99세븐일레븐 부산엄궁초점사상구 엄궁로 179번길 7
382383엄궁-100씨유 엄궁플렉스점사상구 낙동대로 746-1, 1층