Overview

Dataset statistics

Number of variables9
Number of observations767
Missing cells36
Missing cells (%)0.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory54.1 KiB
Average record size in memory72.2 B

Variable types

Text3
Boolean5
Categorical1

Dataset

Description인천광역시 부평구 종량제봉투 판매정보(판매처명, 도로명주소, 우편번호, 종량제봉투 취급여부, 음식물납부필증(가정용) 취급여부, 음식물납부필증(120L) 취급여부, 대형폐기물스티커취급여부, 특수규격봉투취급여부) 데이터에 대한 정보입니다.
Author인천광역시 부평구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15094978&srcSe=7661IVAWM27C61E190

Alerts

데이터기준일자 has constant value ""Constant
종량제봉투취급여부 is highly imbalanced (95.3%)Imbalance
음식물납부필증(120L)취급여부 is highly imbalanced (58.6%)Imbalance
우편번호 has 36 (4.7%) missing valuesMissing

Reproduction

Analysis started2024-03-18 04:53:50.398340
Analysis finished2024-03-18 04:53:52.622400
Duration2.22 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct739
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2024-03-18T13:53:52.826099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length15
Mean length8.9413299
Min length2

Characters and Unicode

Total characters6858
Distinct characters384
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique718 ?
Unique (%)93.6%

Sample

1st row매점
2nd row씨유 부평동아점
3rd row지에스(GS)25 부평이즈뷰점
4th row씨유 삼산한길점
5th row씨유 부평향기점
ValueCountFrequency (%)
씨유 98
 
8.3%
세븐일레븐 69
 
5.9%
이마트24 49
 
4.2%
지에스25 43
 
3.7%
지에스(gs)25 34
 
2.9%
주)코리아세븐 19
 
1.6%
gs25 14
 
1.2%
주식회사 10
 
0.8%
싸군마켓 5
 
0.4%
부평점 5
 
0.4%
Other values (759) 831
70.6%
2024-03-18T13:53:53.225066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
412
 
6.0%
396
 
5.8%
335
 
4.9%
262
 
3.8%
237
 
3.5%
219
 
3.2%
2 191
 
2.8%
187
 
2.7%
149
 
2.2%
138
 
2.0%
Other values (374) 4332
63.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5578
81.3%
Space Separator 412
 
6.0%
Decimal Number 404
 
5.9%
Uppercase Letter 232
 
3.4%
Close Punctuation 109
 
1.6%
Open Punctuation 109
 
1.6%
Lowercase Letter 8
 
0.1%
Other Punctuation 4
 
0.1%
Other Symbol 1
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
396
 
7.1%
335
 
6.0%
262
 
4.7%
237
 
4.2%
219
 
3.9%
187
 
3.4%
149
 
2.7%
138
 
2.5%
136
 
2.4%
134
 
2.4%
Other values (332) 3385
60.7%
Uppercase Letter
ValueCountFrequency (%)
S 82
35.3%
G 75
32.3%
C 14
 
6.0%
D 11
 
4.7%
R 11
 
4.7%
K 6
 
2.6%
E 6
 
2.6%
U 5
 
2.2%
H 4
 
1.7%
T 3
 
1.3%
Other values (10) 15
 
6.5%
Decimal Number
ValueCountFrequency (%)
2 191
47.3%
5 131
32.4%
4 57
 
14.1%
1 13
 
3.2%
0 3
 
0.7%
7 3
 
0.7%
3 3
 
0.7%
6 2
 
0.5%
8 1
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
e 2
25.0%
k 2
25.0%
o 1
12.5%
t 1
12.5%
a 1
12.5%
r 1
12.5%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
/ 1
 
25.0%
Space Separator
ValueCountFrequency (%)
412
100.0%
Close Punctuation
ValueCountFrequency (%)
) 109
100.0%
Open Punctuation
ValueCountFrequency (%)
( 109
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5579
81.4%
Common 1039
 
15.2%
Latin 240
 
3.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
396
 
7.1%
335
 
6.0%
262
 
4.7%
237
 
4.2%
219
 
3.9%
187
 
3.4%
149
 
2.7%
138
 
2.5%
136
 
2.4%
134
 
2.4%
Other values (333) 3386
60.7%
Latin
ValueCountFrequency (%)
S 82
34.2%
G 75
31.2%
C 14
 
5.8%
D 11
 
4.6%
R 11
 
4.6%
K 6
 
2.5%
E 6
 
2.5%
U 5
 
2.1%
H 4
 
1.7%
T 3
 
1.2%
Other values (16) 23
 
9.6%
Common
ValueCountFrequency (%)
412
39.7%
2 191
18.4%
5 131
 
12.6%
) 109
 
10.5%
( 109
 
10.5%
4 57
 
5.5%
1 13
 
1.3%
0 3
 
0.3%
7 3
 
0.3%
. 3
 
0.3%
Other values (5) 8
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5578
81.3%
ASCII 1279
 
18.6%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
412
32.2%
2 191
14.9%
5 131
 
10.2%
) 109
 
8.5%
( 109
 
8.5%
S 82
 
6.4%
G 75
 
5.9%
4 57
 
4.5%
C 14
 
1.1%
1 13
 
1.0%
Other values (31) 86
 
6.7%
Hangul
ValueCountFrequency (%)
396
 
7.1%
335
 
6.0%
262
 
4.7%
237
 
4.2%
219
 
3.9%
187
 
3.4%
149
 
2.7%
138
 
2.5%
136
 
2.4%
134
 
2.4%
Other values (332) 3385
60.7%
None
ValueCountFrequency (%)
1
100.0%
Distinct766
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2024-03-18T13:53:53.465682image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length54
Median length40
Mean length20.207301
Min length6

Characters and Unicode

Total characters15499
Distinct characters279
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique765 ?
Unique (%)99.7%

Sample

1st row부평대로 168, 2층(부평동, 부평구청 본청동)
2nd row부흥로 245, 1층 103호
3rd row주부토로65번길 19, 101,102,103,104호
4th row영성중로15번길 6, 1층
5th row부흥로 366, 1층 일부호
ValueCountFrequency (%)
1층 189
 
7.6%
101호 36
 
1.4%
1층(부평동 32
 
1.3%
상가동 32
 
1.3%
주부토로 23
 
0.9%
부평4동 19
 
0.8%
부평대로 18
 
0.7%
십정2동 18
 
0.7%
마장로 16
 
0.6%
1층(십정동 15
 
0.6%
Other values (1199) 2085
84.0%
2024-03-18T13:53:53.863170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3299
21.3%
1 1423
 
9.2%
664
 
4.3%
, 635
 
4.1%
601
 
3.9%
2 556
 
3.6%
0 522
 
3.4%
444
 
2.9%
3 404
 
2.6%
4 371
 
2.4%
Other values (269) 6580
42.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6053
39.1%
Decimal Number 4496
29.0%
Space Separator 3299
21.3%
Other Punctuation 643
 
4.1%
Open Punctuation 349
 
2.3%
Close Punctuation 349
 
2.3%
Dash Punctuation 240
 
1.5%
Uppercase Letter 51
 
0.3%
Math Symbol 14
 
0.1%
Lowercase Letter 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
664
 
11.0%
601
 
9.9%
444
 
7.3%
333
 
5.5%
302
 
5.0%
289
 
4.8%
283
 
4.7%
281
 
4.6%
159
 
2.6%
103
 
1.7%
Other values (227) 2594
42.9%
Uppercase Letter
ValueCountFrequency (%)
B 19
37.3%
A 9
17.6%
E 3
 
5.9%
C 3
 
5.9%
I 3
 
5.9%
H 3
 
5.9%
D 2
 
3.9%
O 1
 
2.0%
Z 1
 
2.0%
G 1
 
2.0%
Other values (6) 6
 
11.8%
Decimal Number
ValueCountFrequency (%)
1 1423
31.7%
2 556
 
12.4%
0 522
 
11.6%
3 404
 
9.0%
4 371
 
8.3%
6 285
 
6.3%
5 270
 
6.0%
7 247
 
5.5%
8 210
 
4.7%
9 208
 
4.6%
Other Punctuation
ValueCountFrequency (%)
, 635
98.8%
/ 2
 
0.3%
@ 2
 
0.3%
? 2
 
0.3%
: 2
 
0.3%
Lowercase Letter
ValueCountFrequency (%)
i 1
25.0%
b 1
25.0%
t 1
25.0%
y 1
25.0%
Math Symbol
ValueCountFrequency (%)
~ 12
85.7%
2
 
14.3%
Space Separator
ValueCountFrequency (%)
3299
100.0%
Open Punctuation
ValueCountFrequency (%)
( 349
100.0%
Close Punctuation
ValueCountFrequency (%)
) 349
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 240
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9391
60.6%
Hangul 6053
39.1%
Latin 55
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
664
 
11.0%
601
 
9.9%
444
 
7.3%
333
 
5.5%
302
 
5.0%
289
 
4.8%
283
 
4.7%
281
 
4.6%
159
 
2.6%
103
 
1.7%
Other values (227) 2594
42.9%
Common
ValueCountFrequency (%)
3299
35.1%
1 1423
15.2%
, 635
 
6.8%
2 556
 
5.9%
0 522
 
5.6%
3 404
 
4.3%
4 371
 
4.0%
( 349
 
3.7%
) 349
 
3.7%
6 285
 
3.0%
Other values (12) 1198
 
12.8%
Latin
ValueCountFrequency (%)
B 19
34.5%
A 9
16.4%
E 3
 
5.5%
C 3
 
5.5%
I 3
 
5.5%
H 3
 
5.5%
D 2
 
3.6%
O 1
 
1.8%
Z 1
 
1.8%
i 1
 
1.8%
Other values (10) 10
18.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9444
60.9%
Hangul 6053
39.1%
Arrows 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3299
34.9%
1 1423
15.1%
, 635
 
6.7%
2 556
 
5.9%
0 522
 
5.5%
3 404
 
4.3%
4 371
 
3.9%
( 349
 
3.7%
) 349
 
3.7%
6 285
 
3.0%
Other values (31) 1251
 
13.2%
Hangul
ValueCountFrequency (%)
664
 
11.0%
601
 
9.9%
444
 
7.3%
333
 
5.5%
302
 
5.0%
289
 
4.8%
283
 
4.7%
281
 
4.6%
159
 
2.6%
103
 
1.7%
Other values (227) 2594
42.9%
Arrows
ValueCountFrequency (%)
2
100.0%

우편번호
Text

MISSING 

Distinct135
Distinct (%)18.5%
Missing36
Missing (%)4.7%
Memory size6.1 KiB
2024-03-18T13:53:54.122188image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.7373461
Min length1

Characters and Unicode

Total characters3463
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)3.0%

Sample

1st row21354
2nd row21378
3rd row21359
4th row21321
5th row21397
ValueCountFrequency (%)
21404 20
 
2.9%
21360 19
 
2.8%
21408 17
 
2.5%
21318 16
 
2.3%
21329 15
 
2.2%
21317 15
 
2.2%
21377 15
 
2.2%
21376 14
 
2.0%
21391 14
 
2.0%
21389 14
 
2.0%
Other values (124) 524
76.7%
2024-03-18T13:53:54.566872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 858
24.8%
2 796
23.0%
3 562
16.2%
4 373
10.8%
0 169
 
4.9%
5 167
 
4.8%
9 149
 
4.3%
7 122
 
3.5%
8 114
 
3.3%
6 105
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3415
98.6%
Space Separator 48
 
1.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 858
25.1%
2 796
23.3%
3 562
16.5%
4 373
10.9%
0 169
 
4.9%
5 167
 
4.9%
9 149
 
4.4%
7 122
 
3.6%
8 114
 
3.3%
6 105
 
3.1%
Space Separator
ValueCountFrequency (%)
48
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3463
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 858
24.8%
2 796
23.0%
3 562
16.2%
4 373
10.8%
0 169
 
4.9%
5 167
 
4.8%
9 149
 
4.3%
7 122
 
3.5%
8 114
 
3.3%
6 105
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3463
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 858
24.8%
2 796
23.0%
3 562
16.2%
4 373
10.8%
0 169
 
4.9%
5 167
 
4.8%
9 149
 
4.3%
7 122
 
3.5%
8 114
 
3.3%
6 105
 
3.0%

종량제봉투취급여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size899.0 B
True
763 
False
 
4
ValueCountFrequency (%)
True 763
99.5%
False 4
 
0.5%
2024-03-18T13:53:54.677348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size899.0 B
False
394 
True
373 
ValueCountFrequency (%)
False 394
51.4%
True 373
48.6%
2024-03-18T13:53:54.754498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size899.0 B
False
703 
True
 
64
ValueCountFrequency (%)
False 703
91.7%
True 64
 
8.3%
2024-03-18T13:53:54.830139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size899.0 B
False
518 
True
249 
ValueCountFrequency (%)
False 518
67.5%
True 249
32.5%
2024-03-18T13:53:54.909011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size899.0 B
True
444 
False
323 
ValueCountFrequency (%)
True 444
57.9%
False 323
42.1%
2024-03-18T13:53:54.986870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.1 KiB
2023-08-23
767 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-23
2nd row2023-08-23
3rd row2023-08-23
4th row2023-08-23
5th row2023-08-23

Common Values

ValueCountFrequency (%)
2023-08-23 767
100.0%

Length

2024-03-18T13:53:55.082970image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T13:53:55.180299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-23 767
100.0%

Correlations

2024-03-18T13:53:55.246700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종량제봉투취급여부음식물납부필증(가정용)취급여부음식물납부필증(120L)취급여부대형폐기물스티커취급여부특수규격봉투취급여부
종량제봉투취급여부1.0000.0000.0000.0460.088
음식물납부필증(가정용)취급여부0.0001.0000.4590.5250.612
음식물납부필증(120L)취급여부0.0000.4591.0000.3170.340
대형폐기물스티커취급여부0.0460.5250.3171.0000.672
특수규격봉투취급여부0.0880.6120.3400.6721.000
2024-03-18T13:53:55.358918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
특수규격봉투취급여부종량제봉투취급여부음식물납부필증(120L)취급여부음식물납부필증(가정용)취급여부대형폐기물스티커취급여부
특수규격봉투취급여부1.0000.0560.2210.4190.469
종량제봉투취급여부0.0561.0000.0000.0000.029
음식물납부필증(120L)취급여부0.2210.0001.0000.3030.206
음식물납부필증(가정용)취급여부0.4190.0000.3031.0000.352
대형폐기물스티커취급여부0.4690.0290.2060.3521.000
2024-03-18T13:53:55.450577image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종량제봉투취급여부음식물납부필증(가정용)취급여부음식물납부필증(120L)취급여부대형폐기물스티커취급여부특수규격봉투취급여부
종량제봉투취급여부1.0000.0000.0000.0290.056
음식물납부필증(가정용)취급여부0.0001.0000.3030.3520.419
음식물납부필증(120L)취급여부0.0000.3031.0000.2060.221
대형폐기물스티커취급여부0.0290.3520.2061.0000.469
특수규격봉투취급여부0.0560.4190.2210.4691.000

Missing values

2024-03-18T13:53:52.422123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T13:53:52.569052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

판매처명도로명 주소우편번호종량제봉투취급여부음식물납부필증(가정용)취급여부음식물납부필증(120L)취급여부대형폐기물스티커취급여부특수규격봉투취급여부데이터기준일자
0매점부평대로 168, 2층(부평동, 부평구청 본청동)21354YYYYY2023-08-23
1씨유 부평동아점부흥로 245, 1층 103호21378YNNNN2023-08-23
2지에스(GS)25 부평이즈뷰점주부토로65번길 19, 101,102,103,104호21359YYNYY2023-08-23
3씨유 삼산한길점영성중로15번길 6, 1층21321YNNNN2023-08-23
4씨유 부평향기점부흥로 366, 1층 일부호21397YYNNY2023-08-23
5DC프라자일신로14번길 12(일신동)21419YNNNY2023-08-23
6지에스25부평피코그램점부평북로 118(청천동)21310YNNNN2023-08-23
7지에스(GS)25 굴포빌리지점장제로249번길 27, 1층(갈산동)21337YYNNY2023-08-23
8세븐일레븐 부평일신본점경인로 1103, 1층 일부호21416YYNNN2023-08-23
9이마트24 R부평대우점세월천로 21, 1층 일부(청천동)21313YNNNY2023-08-23
판매처명도로명 주소우편번호종량제봉투취급여부음식물납부필증(가정용)취급여부음식물납부필증(120L)취급여부대형폐기물스티커취급여부특수규격봉투취급여부데이터기준일자
757씨유부평마이빌점부평4동 439-12 부평대우마이빌 11021376YYNNY2023-08-23
758명보슈퍼(상회)산곡2동 234-2721380YYNNY2023-08-23
759씨유부평이편한시티경원대로1344번길 9, 1층(부평동)21404YYYYY2023-08-23
760지에스25 삼산하이존체육관로 32, 104호(삼산동, 하이존 HI ZONE)21404YNNYN2023-08-23
761지에스(GS)25 부평하정점하정로 10, 1층21404YYNNN2023-08-23
762이마트24R부평로데오점시장로 21, 1층21404YNNNN2023-08-23
763엘할인마트부평1동 182-64 부평프라자 10121404YYYYY2023-08-23
764씨유부평아트점이규보로95번길25, 1층1호(십정동) 십정동186-47921404YNNNN2023-08-23
765지에스(GS)25 부평힘찬장제로84번길 14,1층 101호(부평동,건영아파트)21399YNNNN2023-08-23
766GS25 부개중앙부평문화로 197번길 421398YYNNN2023-08-23