Overview

Dataset statistics

Number of variables7
Number of observations895
Missing cells413
Missing cells (%)6.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory49.9 KiB
Average record size in memory57.1 B

Variable types

Numeric1
Text3
Categorical3

Dataset

Description부산광역시 부산진구 쓰레기종량제봉투 판매업소의 상호명, 소재지 주소, 전화번호, 관리기관명 등이 포함된 데이터입니다.
Author부산광역시 부산진구
URLhttps://www.data.go.kr/data/15045191/fileData.do

Alerts

관리기관명 has constant value ""Constant
데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 지역명High correlation
지역명 is highly overall correlated with 연번High correlation
전화번호 has 413 (46.1%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:16:52.640605
Analysis finished2023-12-12 10:16:54.188973
Duration1.55 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct895
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean448.57989
Minimum1
Maximum896
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size8.0 KiB
2023-12-12T19:16:54.298399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile45.7
Q1224.5
median449
Q3672.5
95-th percentile851.3
Maximum896
Range895
Interquartile range (IQR)448

Descriptive statistics

Standard deviation258.93086
Coefficient of variation (CV)0.57722352
Kurtosis-1.2017174
Mean448.57989
Median Absolute Deviation (MAD)224
Skewness-0.00090406201
Sum401479
Variance67045.192
MonotonicityStrictly increasing
2023-12-12T19:16:54.537331image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
2 1
 
0.1%
592 1
 
0.1%
593 1
 
0.1%
594 1
 
0.1%
595 1
 
0.1%
596 1
 
0.1%
597 1
 
0.1%
598 1
 
0.1%
599 1
 
0.1%
Other values (885) 885
98.9%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
896 1
0.1%
895 1
0.1%
894 1
0.1%
893 1
0.1%
892 1
0.1%
891 1
0.1%
890 1
0.1%
889 1
0.1%
888 1
0.1%
887 1
0.1%

상호
Text

Distinct810
Distinct (%)90.5%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
2023-12-12T19:16:54.967754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length19
Mean length8.3307263
Min length2

Characters and Unicode

Total characters7456
Distinct characters400
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique739 ?
Unique (%)82.6%

Sample

1st row부산슈퍼
2nd row농심가슈퍼
3rd row성림상회
4th row대구연쇄점
5th rowLG25 서면새싹점
ValueCountFrequency (%)
cu 137
 
9.5%
gs25 104
 
7.2%
세븐일레븐 83
 
5.8%
이마트24 48
 
3.3%
㈜코리아세븐 22
 
1.5%
미니스톱 11
 
0.8%
주식회사 10
 
0.7%
서면점 9
 
0.6%
빅세일마트 8
 
0.6%
부전점 6
 
0.4%
Other values (815) 998
69.5%
2023-12-12T19:16:55.740644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
541
 
7.3%
503
 
6.7%
229
 
3.1%
228
 
3.1%
208
 
2.8%
194
 
2.6%
2 178
 
2.4%
174
 
2.3%
C 154
 
2.1%
U 144
 
1.9%
Other values (390) 4903
65.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5886
78.9%
Uppercase Letter 560
 
7.5%
Space Separator 541
 
7.3%
Decimal Number 362
 
4.9%
Other Symbol 62
 
0.8%
Close Punctuation 16
 
0.2%
Open Punctuation 16
 
0.2%
Lowercase Letter 9
 
0.1%
Other Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
503
 
8.5%
229
 
3.9%
228
 
3.9%
208
 
3.5%
194
 
3.3%
174
 
3.0%
133
 
2.3%
130
 
2.2%
125
 
2.1%
124
 
2.1%
Other values (358) 3838
65.2%
Uppercase Letter
ValueCountFrequency (%)
C 154
27.5%
U 144
25.7%
S 115
20.5%
G 112
20.0%
R 9
 
1.6%
K 5
 
0.9%
D 4
 
0.7%
L 4
 
0.7%
J 3
 
0.5%
H 2
 
0.4%
Other values (6) 8
 
1.4%
Decimal Number
ValueCountFrequency (%)
2 178
49.2%
5 125
34.5%
4 53
 
14.6%
3 3
 
0.8%
6 2
 
0.6%
1 1
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
e 5
55.6%
y 1
 
11.1%
i 1
 
11.1%
m 1
 
11.1%
n 1
 
11.1%
Space Separator
ValueCountFrequency (%)
541
100.0%
Other Symbol
ValueCountFrequency (%)
62
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5948
79.8%
Common 939
 
12.6%
Latin 569
 
7.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
503
 
8.5%
229
 
3.9%
228
 
3.8%
208
 
3.5%
194
 
3.3%
174
 
2.9%
133
 
2.2%
130
 
2.2%
125
 
2.1%
124
 
2.1%
Other values (359) 3900
65.6%
Latin
ValueCountFrequency (%)
C 154
27.1%
U 144
25.3%
S 115
20.2%
G 112
19.7%
R 9
 
1.6%
K 5
 
0.9%
e 5
 
0.9%
D 4
 
0.7%
L 4
 
0.7%
J 3
 
0.5%
Other values (11) 14
 
2.5%
Common
ValueCountFrequency (%)
541
57.6%
2 178
 
19.0%
5 125
 
13.3%
4 53
 
5.6%
) 16
 
1.7%
( 16
 
1.7%
. 4
 
0.4%
3 3
 
0.3%
6 2
 
0.2%
1 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5886
78.9%
ASCII 1508
 
20.2%
None 62
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
541
35.9%
2 178
 
11.8%
C 154
 
10.2%
U 144
 
9.5%
5 125
 
8.3%
S 115
 
7.6%
G 112
 
7.4%
4 53
 
3.5%
) 16
 
1.1%
( 16
 
1.1%
Other values (21) 54
 
3.6%
Hangul
ValueCountFrequency (%)
503
 
8.5%
229
 
3.9%
228
 
3.9%
208
 
3.5%
194
 
3.3%
174
 
3.0%
133
 
2.3%
130
 
2.2%
125
 
2.1%
124
 
2.1%
Other values (358) 3838
65.2%
None
ValueCountFrequency (%)
62
100.0%

지역명
Categorical

HIGH CORRELATION 

Distinct20
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
부전2동
82 
부전1동
73 
가야1동
72 
전포1동
63 
양정1동
59 
Other values (15)
546 

Length

Max length4
Median length4
Mean length3.9061453
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부전1동
2nd row부전1동
3rd row부전1동
4th row부전1동
5th row부전1동

Common Values

ValueCountFrequency (%)
부전2동 82
 
9.2%
부전1동 73
 
8.2%
가야1동 72
 
8.0%
전포1동 63
 
7.0%
양정1동 59
 
6.6%
양정2동 50
 
5.6%
초읍동 49
 
5.5%
범천1동 47
 
5.3%
개금1동 47
 
5.3%
전포2동 46
 
5.1%
Other values (10) 307
34.3%

Length

2023-12-12T19:16:55.984265image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
부전2동 82
 
9.2%
부전1동 73
 
8.2%
가야1동 72
 
8.0%
전포1동 63
 
7.0%
양정1동 59
 
6.6%
양정2동 50
 
5.6%
초읍동 49
 
5.5%
범천1동 47
 
5.3%
개금1동 47
 
5.3%
전포2동 46
 
5.1%
Other values (10) 307
34.3%
Distinct843
Distinct (%)94.2%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
2023-12-12T19:16:56.366535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length47
Mean length28.534078
Min length16

Characters and Unicode

Total characters25538
Distinct characters258
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique797 ?
Unique (%)89.1%

Sample

1st row부산광역시 부산진구 새싹로8번길 42(부전동)
2nd row부산광역시 부산진구 새싹로28번길 15(부전동)
3rd row부산광역시 부산진구 새싹로8번길 11(부전동)
4th row부산광역시 부산진구 중앙대로783번길 8(부전동)
5th row부산광역시 부산진구 새싹로14번길 13(부전동)
ValueCountFrequency (%)
부산진구 897
21.0%
부산광역시 895
21.0%
1층 82
 
1.9%
중앙대로 47
 
1.1%
가야대로 45
 
1.1%
엄광로 36
 
0.8%
101호 28
 
0.7%
동평로 25
 
0.6%
1층(부전동 25
 
0.6%
새싹로 24
 
0.6%
Other values (1228) 2165
50.7%
2023-12-12T19:16:57.002419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3374
 
13.2%
1985
 
7.8%
1816
 
7.1%
1 1150
 
4.5%
950
 
3.7%
936
 
3.7%
920
 
3.6%
901
 
3.5%
900
 
3.5%
891
 
3.5%
Other values (248) 11715
45.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 15866
62.1%
Decimal Number 4197
 
16.4%
Space Separator 3374
 
13.2%
Open Punctuation 704
 
2.8%
Close Punctuation 703
 
2.8%
Other Punctuation 535
 
2.1%
Dash Punctuation 128
 
0.5%
Uppercase Letter 25
 
0.1%
Math Symbol 5
 
< 0.1%
Lowercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1985
 
12.5%
1816
 
11.4%
950
 
6.0%
936
 
5.9%
920
 
5.8%
901
 
5.7%
900
 
5.7%
891
 
5.6%
878
 
5.5%
425
 
2.7%
Other values (220) 5264
33.2%
Decimal Number
ValueCountFrequency (%)
1 1150
27.4%
2 501
11.9%
0 423
 
10.1%
3 373
 
8.9%
5 325
 
7.7%
6 320
 
7.6%
4 318
 
7.6%
9 273
 
6.5%
7 259
 
6.2%
8 255
 
6.1%
Uppercase Letter
ValueCountFrequency (%)
B 10
40.0%
A 5
20.0%
L 2
 
8.0%
H 2
 
8.0%
S 1
 
4.0%
D 1
 
4.0%
E 1
 
4.0%
J 1
 
4.0%
C 1
 
4.0%
N 1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
, 533
99.6%
. 2
 
0.4%
Space Separator
ValueCountFrequency (%)
3374
100.0%
Open Punctuation
ValueCountFrequency (%)
( 704
100.0%
Close Punctuation
ValueCountFrequency (%)
) 703
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 128
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Lowercase Letter
ValueCountFrequency (%)
e 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 15866
62.1%
Common 9646
37.8%
Latin 26
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1985
 
12.5%
1816
 
11.4%
950
 
6.0%
936
 
5.9%
920
 
5.8%
901
 
5.7%
900
 
5.7%
891
 
5.6%
878
 
5.5%
425
 
2.7%
Other values (220) 5264
33.2%
Common
ValueCountFrequency (%)
3374
35.0%
1 1150
 
11.9%
( 704
 
7.3%
) 703
 
7.3%
, 533
 
5.5%
2 501
 
5.2%
0 423
 
4.4%
3 373
 
3.9%
5 325
 
3.4%
6 320
 
3.3%
Other values (7) 1240
 
12.9%
Latin
ValueCountFrequency (%)
B 10
38.5%
A 5
19.2%
L 2
 
7.7%
H 2
 
7.7%
S 1
 
3.8%
D 1
 
3.8%
e 1
 
3.8%
E 1
 
3.8%
J 1
 
3.8%
C 1
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 15866
62.1%
ASCII 9672
37.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3374
34.9%
1 1150
 
11.9%
( 704
 
7.3%
) 703
 
7.3%
, 533
 
5.5%
2 501
 
5.2%
0 423
 
4.4%
3 373
 
3.9%
5 325
 
3.4%
6 320
 
3.3%
Other values (18) 1266
 
13.1%
Hangul
ValueCountFrequency (%)
1985
 
12.5%
1816
 
11.4%
950
 
6.0%
936
 
5.9%
920
 
5.8%
901
 
5.7%
900
 
5.7%
891
 
5.6%
878
 
5.5%
425
 
2.7%
Other values (220) 5264
33.2%

전화번호
Text

MISSING 

Distinct465
Distinct (%)96.5%
Missing413
Missing (%)46.1%
Memory size7.1 KiB
2023-12-12T19:16:57.350232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.004149
Min length12

Characters and Unicode

Total characters5786
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique449 ?
Unique (%)93.2%

Sample

1st row051-803-2904
2nd row051-808-8879
3rd row051-808-4696
4th row051-807-1682
5th row051-804-1814
ValueCountFrequency (%)
051-816-9633 3
 
0.6%
051-893-3100 2
 
0.4%
051-710-5162 2
 
0.4%
051-852-8219 2
 
0.4%
051-803-2795 2
 
0.4%
051-806-8545 2
 
0.4%
051-794-7001 2
 
0.4%
051-608-2500 2
 
0.4%
051-817-8701 2
 
0.4%
051-898-5050 2
 
0.4%
Other values (455) 461
95.6%
2023-12-12T19:16:57.867333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 966
16.7%
0 901
15.6%
1 785
13.6%
5 761
13.2%
8 661
11.4%
9 370
 
6.4%
6 306
 
5.3%
3 290
 
5.0%
2 267
 
4.6%
7 251
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4820
83.3%
Dash Punctuation 966
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 901
18.7%
1 785
16.3%
5 761
15.8%
8 661
13.7%
9 370
7.7%
6 306
 
6.3%
3 290
 
6.0%
2 267
 
5.5%
7 251
 
5.2%
4 228
 
4.7%
Dash Punctuation
ValueCountFrequency (%)
- 966
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5786
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 966
16.7%
0 901
15.6%
1 785
13.6%
5 761
13.2%
8 661
11.4%
9 370
 
6.4%
6 306
 
5.3%
3 290
 
5.0%
2 267
 
4.6%
7 251
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5786
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 966
16.7%
0 901
15.6%
1 785
13.6%
5 761
13.2%
8 661
11.4%
9 370
 
6.4%
6 306
 
5.3%
3 290
 
5.0%
2 267
 
4.6%
7 251
 
4.3%

관리기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
부산광역시 부산진구청
895 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시 부산진구청
2nd row부산광역시 부산진구청
3rd row부산광역시 부산진구청
4th row부산광역시 부산진구청
5th row부산광역시 부산진구청

Common Values

ValueCountFrequency (%)
부산광역시 부산진구청 895
100.0%

Length

2023-12-12T19:16:58.043753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:16:58.187819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 895
50.0%
부산진구청 895
50.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size7.1 KiB
2023-10-10
895 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-10-10
2nd row2023-10-10
3rd row2023-10-10
4th row2023-10-10
5th row2023-10-10

Common Values

ValueCountFrequency (%)
2023-10-10 895
100.0%

Length

2023-12-12T19:16:58.331273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:16:58.482089image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-10-10 895
100.0%

Interactions

2023-12-12T19:16:53.370686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:16:58.583767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역명
연번1.0000.996
지역명0.9961.000
2023-12-12T19:16:58.706276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역명
연번1.0000.886
지역명0.8861.000

Missing values

2023-12-12T19:16:53.512226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:16:54.054503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호지역명소재지 주소전화번호관리기관명데이터기준일자
01부산슈퍼부전1동부산광역시 부산진구 새싹로8번길 42(부전동)051-803-2904부산광역시 부산진구청2023-10-10
12농심가슈퍼부전1동부산광역시 부산진구 새싹로28번길 15(부전동)051-808-8879부산광역시 부산진구청2023-10-10
23성림상회부전1동부산광역시 부산진구 새싹로8번길 11(부전동)051-808-4696부산광역시 부산진구청2023-10-10
34대구연쇄점부전1동부산광역시 부산진구 중앙대로783번길 8(부전동)051-807-1682부산광역시 부산진구청2023-10-10
45LG25 서면새싹점부전1동부산광역시 부산진구 새싹로14번길 13(부전동)051-804-1814부산광역시 부산진구청2023-10-10
56GS25 서면롯데점부전1동부산광역시 부산진구 가야대로 773-3(부전동)051-804-5311부산광역시 부산진구청2023-10-10
67봉투백화점부전1동부산광역시 부산진구 중앙대로756번길 22(부전동)051-809-2504부산광역시 부산진구청2023-10-10
78무궁화마트부전1동부산광역시 부산진구 새싹로8번길 36(부전동)051-802-2815부산광역시 부산진구청2023-10-10
89GS25 부전영광점부전1동부산광역시 부산진구 서면문화로 44-1(부전동)051-803-2007부산광역시 부산진구청2023-10-10
910세븐일레븐 부산 부전점부전1동부산광역시 부산진구 동천로 129(부전동)051-816-1577부산광역시 부산진구청2023-10-10
연번상호지역명소재지 주소전화번호관리기관명데이터기준일자
885887동양마트 신암점범천2동부산광역시 부산진구 엄광로 391051-631-6336부산광역시 부산진구청2023-10-10
886888탑할인마트범천2동부산광역시 부산진구 신암로 9(범천동)<NA>부산광역시 부산진구청2023-10-10
887889CU 뉴범천한라점범천2동부산광역시 부산진구 범천로22, 1층 105호(범천동, 한라비발디상가)<NA>부산광역시 부산진구청2023-10-10
888890세븐일레븐 부산신암점범천2동부산광역시 부산진구 신암로145, 1층(범천동)<NA>부산광역시 부산진구청2023-10-10
889891에스마트범천2동부산광역시 부산진구 신암로 9, 1층(범천동)051-639-8871부산광역시 부산진구청2023-10-10
890892세븐일레븐 부산범천이편한점범천2동부산광역시 부산진구 만리산로 110, 2층<NA>부산광역시 부산진구청2023-10-10
891893CU 범천센트럴점범천2동부산광역시 부산진구 신암로 133, 지하2층 1호(범천동, e편한세상서면더센트럴)<NA>부산광역시 부산진구청2023-10-10
892894이마트24 서면e편한점범천2동부산광역시 부산진구 가야대로 702, 102호(범천동, 서면E-편한세상상가)<NA>부산광역시 부산진구청2023-10-10
893895빅세일마트범천2동부산광역시 부산진구 범천로 14, 1층(동암빌딩)<NA>부산광역시 부산진구청2023-10-10
894896세븐일레븐 부암역점범천2동부산광역시 부산진구 가야대로 712(범천동)<NA>부산광역시 부산진구청2023-10-10