Overview

Dataset statistics

Number of variables: 4
Number of observations: 2022
Missing cells: 93
Missing cells (%): 1.1%
Duplicate rows: 226
Duplicate rows (%): 11.2%
Total size in memory: 65.3 KiB
Average record size in memory: 33.1 B
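
The percentages in this table follow from the raw counts. Below is a minimal sketch of that arithmetic, assuming the missing-cell percentage is taken over all cells (variables × observations) and the duplicate percentage over rows. The variable names are illustrative and not part of the report.

```python
# Counts copied from the "Dataset statistics" table; names are illustrative.
n_variables = 4
n_observations = 2022
missing_cells = 93
duplicate_rows = 226
total_size_kib = 65.3

total_cells = n_variables * n_observations  # 4 * 2022 cells in total
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```

Rounded to one decimal place, the three results match the figures reported above.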

Variable types

Text: 2
DateTime: 1
Numeric: 1

Dataset

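Description: Provides information such as the locations of stores that sell Yongin City volume-based waste fee bags (종량제봉투). For detailed inquiries, contact the Environmental Business Team of Yongin Urban Corporation (용인도시공사).
Author: Yongin Urban Corporation (용인도시공사)
URL: https://www.data.go.kr/data/15060018/fileData.do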

Alerts

Dataset has 226 (11.2%) duplicate rows [Duplicates]
등록일 (registration date) has 93 (4.6%) missing values [Missing]

Reproduction

Analysis started: 2023-12-12 14:34:53.738711
Analysis finished: 2023-12-12 14:34:54.347772
Duration: 0.61 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

거래처명 (business name)
Text

Distinct: 1731
Distinct (%): 85.6%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB

Length

Max length: 30
Median length: 25
Mean length: 10.311573
Min length: 2

Characters and Unicode

Total characters: 20850
Distinct characters: 483
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
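
As an illustration of these character properties, Python's standard `unicodedata` module exposes the general category of each code point. The sketch below tallies categories for three 거래처명 values that appear in this report. The helper `category_counts` and the `CATEGORY_NAMES` mapping are illustrative names, and this is not necessarily how ydata-profiling computes its own counts.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lo": "Other Letter", "Lu": "Uppercase Letter", "Ll": "Lowercase Letter",
    "Nd": "Decimal Number", "Zs": "Space Separator",
    "Ps": "Open Punctuation", "Pe": "Close Punctuation",
    "Po": "Other Punctuation", "Pd": "Dash Punctuation",
    "Pc": "Connector Punctuation",
}


def category_counts(values):
    """Tally characters across all values by Unicode general category."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the two-letter code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


samples = ["(주)농민식자재마트", "(주)에스피에스", "365PLUS 용인신원점"]
for name, count in category_counts(samples).most_common():
    print(name, count)
```

Hangul syllables fall under "Other Letter", which is why that category dominates both text variables in this report.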

Unique

Unique: 1472
Unique (%): 72.8%

Sample

1st row: (주)농민식자재마트
2nd row: (주)에스피에스
3rd row: (주)이플러스마트
4th row: (주)지에스리테일용인포곡점
5th row: (주)코리아세븐 용인전대리점
Value | Count | Frequency (%)
씨유 | 280 | 8.1%
세븐일레븐 | 211 | 6.1%
지에스25 | 112 | 3.2%
이마트24 | 105 | 3.0%
gs25 | 101 | 2.9%
씨유(cu | 61 | 1.8%
지에스25(gs25 | 54 | 1.6%
주)코리아세븐 | 43 | 1.2%
주식회사 | 42 | 1.2%
지에스(gs)25 | 30 | 0.9%
Other values (1707) | 2435 | 70.1%

Most occurring characters

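Value | Count | Frequency (%)
(space) | 1461 | 7.0%
[glyph lost] | 1386 | 6.6%
[glyph lost] | 644 | 3.1%
[glyph lost] | 635 | 3.0%
[glyph lost] | 598 | 2.9%
2 | 564 | 2.7%
[glyph lost] | 564 | 2.7%
[glyph lost] | 521 | 2.5%
[glyph lost] | 506 | 2.4%
[glyph lost] | 460 | 2.2%
Other values (473) | 13511 | 64.8%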

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 16589 | 79.6%
Space Separator | 1461 | 7.0%
Decimal Number | 1193 | 5.7%
Uppercase Letter | 801 | 3.8%
Close Punctuation | 379 | 1.8%
Open Punctuation | 378 | 1.8%
Lowercase Letter | 41 | 0.2%
Other Punctuation | 4 | < 0.1%
Dash Punctuation | 2 | < 0.1%
Connector Punctuation | 2 | < 0.1%

Most frequent character per category

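Other Letter
Value | Count | Frequency (%)
[glyph lost] | 1386 | 8.4%
[glyph lost] | 644 | 3.9%
[glyph lost] | 635 | 3.8%
[glyph lost] | 598 | 3.6%
[glyph lost] | 564 | 3.4%
[glyph lost] | 521 | 3.1%
[glyph lost] | 506 | 3.1%
[glyph lost] | 460 | 2.8%
[glyph lost] | 453 | 2.7%
[glyph lost] | 396 | 2.4%
Other values (422) | 10426 | 62.8%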
Uppercase Letter
Value | Count | Frequency (%)
S | 264 | 33.0%
G | 245 | 30.6%
C | 87 | 10.9%
U | 74 | 9.2%
R | 24 | 3.0%
A | 17 | 2.1%
T | 14 | 1.7%
I | 12 | 1.5%
M | 11 | 1.4%
E | 10 | 1.2%
Other values (11) | 43 | 5.4%
Lowercase Letter
Value | Count | Frequency (%)
e | 9 | 22.0%
l | 8 | 19.5%
s | 7 | 17.1%
f | 5 | 12.2%
p | 3 | 7.3%
h | 2 | 4.9%
u | 1 | 2.4%
c | 1 | 2.4%
t | 1 | 2.4%
k | 1 | 2.4%
Other values (3) | 3 | 7.3%
Decimal Number
Value | Count | Frequency (%)
2 | 564 | 47.3%
5 | 428 | 35.9%
4 | 146 | 12.2%
3 | 16 | 1.3%
1 | 16 | 1.3%
6 | 10 | 0.8%
9 | 9 | 0.8%
0 | 2 | 0.2%
7 | 2 | 0.2%
Other Punctuation
Value | Count | Frequency (%)
. | 2 | 50.0%
, | 1 | 25.0%
/ | 1 | 25.0%
Space Separator
Value | Count | Frequency (%)
(space) | 1461 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 379 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 378 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 2 | 100.0%
Connector Punctuation
Value | Count | Frequency (%)
_ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 16589 | 79.6%
Common | 3419 | 16.4%
Latin | 842 | 4.0%

Most frequent character per script

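Hangul
Value | Count | Frequency (%)
[glyph lost] | 1386 | 8.4%
[glyph lost] | 644 | 3.9%
[glyph lost] | 635 | 3.8%
[glyph lost] | 598 | 3.6%
[glyph lost] | 564 | 3.4%
[glyph lost] | 521 | 3.1%
[glyph lost] | 506 | 3.1%
[glyph lost] | 460 | 2.8%
[glyph lost] | 453 | 2.7%
[glyph lost] | 396 | 2.4%
Other values (422) | 10426 | 62.8%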
Latin
Value | Count | Frequency (%)
S | 264 | 31.4%
G | 245 | 29.1%
C | 87 | 10.3%
U | 74 | 8.8%
R | 24 | 2.9%
A | 17 | 2.0%
T | 14 | 1.7%
I | 12 | 1.4%
M | 11 | 1.3%
E | 10 | 1.2%
Other values (24) | 84 | 10.0%
Common
Value | Count | Frequency (%)
(space) | 1461 | 42.7%
2 | 564 | 16.5%
5 | 428 | 12.5%
) | 379 | 11.1%
( | 378 | 11.1%
4 | 146 | 4.3%
3 | 16 | 0.5%
1 | 16 | 0.5%
6 | 10 | 0.3%
9 | 9 | 0.3%
Other values (7) | 12 | 0.4%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 16589 | 79.6%
ASCII | 4261 | 20.4%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 1461 | 34.3%
2 | 564 | 13.2%
5 | 428 | 10.0%
) | 379 | 8.9%
( | 378 | 8.9%
S | 264 | 6.2%
G | 245 | 5.7%
4 | 146 | 3.4%
C | 87 | 2.0%
U | 74 | 1.7%
Other values (41) | 235 | 5.5%
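Hangul
Value | Count | Frequency (%)
[glyph lost] | 1386 | 8.4%
[glyph lost] | 644 | 3.9%
[glyph lost] | 635 | 3.8%
[glyph lost] | 598 | 3.6%
[glyph lost] | 564 | 3.4%
[glyph lost] | 521 | 3.1%
[glyph lost] | 506 | 3.1%
[glyph lost] | 460 | 2.8%
[glyph lost] | 453 | 2.7%
[glyph lost] | 396 | 2.4%
Other values (422) | 10426 | 62.8%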

주소 (address)
Text

Distinct: 1563
Distinct (%): 77.3%
Missing: 0
Missing (%): 0.0%
Memory size: 15.9 KiB

Length

Max length: 31
Median length: 26
Mean length: 13.647379
Min length: 7

Characters and Unicode

Total characters: 27595
Distinct characters: 153
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1215
Unique (%): 60.1%

Sample

1st row: 처인구 포곡읍 둔전리 387-8
2nd row: 처인구 포곡읍 영문리 188-7
3rd row: 처인구 포곡읍 전대리 120-7
4th row: 처인구 포곡읍 둔전리 139-10
5th row: 용인시 처인구 포곡읍 전대로 128
Value | Count | Frequency (%)
처인구 | 687 | 10.4%
기흥구 | 663 | 10.0%
수지구 | 598 | 9.0%
죽전동 | 187 | 2.8%
동천동 | 126 | 1.9%
포곡읍 | 105 | 1.6%
풍덕천동 | 96 | 1.4%
양지면 | 76 | 1.1%
구갈동 | 72 | 1.1%
상현동 | 71 | 1.1%
Other values (1580) | 3955 | 59.6%

Most occurring characters

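Value | Count | Frequency (%)
(space) | 4630 | 16.8%
[glyph lost] | 2063 | 7.5%
[glyph lost] | 1792 | 6.5%
1 | 1605 | 5.8%
- | 1298 | 4.7%
2 | 870 | 3.2%
3 | 860 | 3.1%
5 | 765 | 2.8%
[glyph lost] | 743 | 2.7%
[glyph lost] | 727 | 2.6%
Other values (143) | 12242 | 44.4%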

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 13913 | 50.4%
Decimal Number | 7729 | 28.0%
Space Separator | 4630 | 16.8%
Dash Punctuation | 1298 | 4.7%
Close Punctuation | 11 | < 0.1%
Open Punctuation | 11 | < 0.1%
Other Punctuation | 3 | < 0.1%

Most frequent character per category

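Other Letter
Value | Count | Frequency (%)
[glyph lost] | 2063 | 14.8%
[glyph lost] | 1792 | 12.9%
[glyph lost] | 743 | 5.3%
[glyph lost] | 727 | 5.2%
[glyph lost] | 709 | 5.1%
[glyph lost] | 687 | 4.9%
[glyph lost] | 665 | 4.8%
[glyph lost] | 612 | 4.4%
[glyph lost] | 463 | 3.3%
[glyph lost] | 294 | 2.1%
Other values (127) | 5158 | 37.1%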
Decimal Number
Value | Count | Frequency (%)
1 | 1605 | 20.8%
2 | 870 | 11.3%
3 | 860 | 11.1%
5 | 765 | 9.9%
4 | 701 | 9.1%
6 | 663 | 8.6%
9 | 612 | 7.9%
8 | 612 | 7.9%
7 | 590 | 7.6%
0 | 451 | 5.8%
Other Punctuation
Value | Count | Frequency (%)
, | 2 | 66.7%
/ | 1 | 33.3%
Space Separator
Value | Count | Frequency (%)
(space) | 4630 | 100.0%
Dash Punctuation
Value | Count | Frequency (%)
- | 1298 | 100.0%
Close Punctuation
Value | Count | Frequency (%)
) | 11 | 100.0%
Open Punctuation
Value | Count | Frequency (%)
( | 11 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 13913 | 50.4%
Common | 13682 | 49.6%

Most frequent character per script

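Hangul
Value | Count | Frequency (%)
[glyph lost] | 2063 | 14.8%
[glyph lost] | 1792 | 12.9%
[glyph lost] | 743 | 5.3%
[glyph lost] | 727 | 5.2%
[glyph lost] | 709 | 5.1%
[glyph lost] | 687 | 4.9%
[glyph lost] | 665 | 4.8%
[glyph lost] | 612 | 4.4%
[glyph lost] | 463 | 3.3%
[glyph lost] | 294 | 2.1%
Other values (127) | 5158 | 37.1%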
Common
Value | Count | Frequency (%)
(space) | 4630 | 33.8%
1 | 1605 | 11.7%
- | 1298 | 9.5%
2 | 870 | 6.4%
3 | 860 | 6.3%
5 | 765 | 5.6%
4 | 701 | 5.1%
6 | 663 | 4.8%
9 | 612 | 4.5%
8 | 612 | 4.5%
Other values (6) | 1066 | 7.8%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 13913 | 50.4%
ASCII | 13682 | 49.6%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 4630 | 33.8%
1 | 1605 | 11.7%
- | 1298 | 9.5%
2 | 870 | 6.4%
3 | 860 | 6.3%
5 | 765 | 5.6%
4 | 701 | 5.1%
6 | 663 | 4.8%
9 | 612 | 4.5%
8 | 612 | 4.5%
Other values (6) | 1066 | 7.8%
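Hangul
Value | Count | Frequency (%)
[glyph lost] | 2063 | 14.8%
[glyph lost] | 1792 | 12.9%
[glyph lost] | 743 | 5.3%
[glyph lost] | 727 | 5.2%
[glyph lost] | 709 | 5.1%
[glyph lost] | 687 | 4.9%
[glyph lost] | 665 | 4.8%
[glyph lost] | 612 | 4.4%
[glyph lost] | 463 | 3.3%
[glyph lost] | 294 | 2.1%
Other values (127) | 5158 | 37.1%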

등록일 (registration date)
Date

MISSING

Distinct: 1171
Distinct (%): 60.7%
Missing: 93
Missing (%): 4.6%
Memory size: 15.9 KiB
Minimum: 2006-06-16 00:00:00
Maximum: 2023-10-18 00:00:00
Histogram with fixed size bins (bins=50)

지정번호 (designation number)
Real number (ℝ)

Distinct: 1791
Distinct (%): 88.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 171674.18
Minimum: 0
Maximum: 460009
Zeros: 2
Zeros (%): 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 17.9 KiB

Quantile statistics

Minimum: 0
5-th percentile: 13140.05
Q1: 82704.5
Median: 173968
Q3: 241461.75
95-th percentile: 350001.95
Maximum: 460009
Range: 460009
Interquartile range (IQR): 158757.25

Descriptive statistics

Standard deviation: 104423.34
Coefficient of variation (CV): 0.60826468
Kurtosis: -0.51931273
Mean: 171674.18
Median Absolute Deviation (MAD): 70872.5
Skewness: 0.33818386
Sum: 3.4712519 × 10^8
Variance: 1.0904234 × 10^10
Monotonicity: Not monotonic
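
Several of these statistics are derived from one another, which gives a quick consistency check on the reported values. The sketch below recomputes them from the rounded figures in this report, so agreement is only expected to within rounding error. The variable names are illustrative.

```python
import math

# Values copied from the quantile and descriptive statistics of 지정번호.
n = 2022  # observations; the variable has no missing values
minimum, maximum = 0, 460009
q1, q3 = 82704.5, 241461.75
mean, std = 171674.18, 104423.34

iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
total = mean * n                 # sum of all values

print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
print(f"CV       = {cv:.8f}")
print(f"Variance = {variance:.7e}")
print(f"Sum      = {total:.7e}")

# The recomputed values agree with the reported ones within rounding error.
assert math.isclose(cv, 0.60826468, rel_tol=1e-5)
assert math.isclose(variance, 1.0904234e10, rel_tol=1e-5)
assert math.isclose(total, 3.4712519e8, rel_tol=1e-5)
```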
Histogram with fixed size bins (bins=50)
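
The caption refers to fixed-size (equal-width) bins. Below is a minimal sketch of that binning in plain Python, applied to a handful of 지정번호 values that appear in this report. The function name `fixed_width_histogram` is hypothetical, and this is not the code that produced the report's plot.

```python
def fixed_width_histogram(values, bins=50):
    """Count values in `bins` equal-width intervals spanning [min, max]."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        if width == 0:
            idx = 0  # all values identical: everything lands in the first bin
        else:
            # The maximum value is placed in the last bin, not in a 51st one.
            idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return edges, counts


# A few 지정번호 values taken from this report (not the full column).
sample_values = [0, 12858, 173968, 241455, 460009]
edges, counts = fixed_width_histogram(sample_values, bins=50)
occupied = [i for i, c in enumerate(counts) if c]
print(occupied)
```

With the full column, the same procedure would yield 50 counts over the range 0 to 460009, each bin about 9200 wide.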

Common values

Value | Count | Frequency (%)
234458 | 2 | 0.1%
241455 | 2 | 0.1%
234470 | 2 | 0.1%
234451 | 2 | 0.1%
234472 | 2 | 0.1%
234440 | 2 | 0.1%
234469 | 2 | 0.1%
234417 | 2 | 0.1%
234448 | 2 | 0.1%
234445 | 2 | 0.1%
Other values (1781) | 2002 | 99.0%

Minimum 10 values

Value | Count | Frequency (%)
0 | 2 | 0.1%
12858 | 1 | < 0.1%
12905 | 1 | < 0.1%
12911 | 1 | < 0.1%
12913 | 1 | < 0.1%
12929 | 1 | < 0.1%
12941 | 1 | < 0.1%
12950 | 1 | < 0.1%
12956 | 1 | < 0.1%
12958 | 1 | < 0.1%

Maximum 10 values

Value | Count | Frequency (%)
460009 | 1 | < 0.1%
460008 | 1 | < 0.1%
460007 | 1 | < 0.1%
460006 | 1 | < 0.1%
460005 | 1 | < 0.1%
460004 | 1 | < 0.1%
460002 | 1 | < 0.1%
450003 | 1 | < 0.1%
450002 | 2 | 0.1%
450001 | 2 | 0.1%

Interactions

[Figure: interactions plot]

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Row | 거래처명 | 주소 | 등록일 | 지정번호
0 | (주)농민식자재마트 | 처인구 포곡읍 둔전리 387-8 | 2019-06-19 | 13091
1 | (주)에스피에스 | 처인구 포곡읍 영문리 188-7 | 2015-11-17 | 13040
2 | (주)이플러스마트 | 처인구 포곡읍 전대리 120-7 | 2017-10-11 | 13072
3 | (주)지에스리테일용인포곡점 | 처인구 포곡읍 둔전리 139-10 | 2016-12-27 | 12974
4 | (주)코리아세븐 용인전대리점 | 용인시 처인구 포곡읍 전대로 128 | 2016-02-26 | 13045
5 | (주)코리아세븐 용인제일메디점 | 처인구 포곡읍 둔전리 340 | 2022-07-27 | 13134
6 | (주)코리아세븐 용인포곡쉐르점 | 처인구 포곡읍 둔전리 449 | 2016-10-19 | 13059
7 | 1004식자재마트 | 처인구 포곡읍 둔전리 179-2 | 2018-04-09 | 13078
8 | 365PLUS 용인신원점 | 처인구 포곡읍 신원리 342-2 | 2015-08-17 | 13038
9 | 365플러스 용인포곡점 | 처인구 포곡읍 전대리 150-1 | 2014-05-19 | 13025

Last rows

Row | 거래처명 | 주소 | 등록일 | 지정번호
2012 | 주식회사 반월농민마트 | 경기 화성시 반월동 66-2 | 2019-09-24 | 310006
2013 | 주식회사 식재료마켓 | 경기 화성시 장지동 807-1 | 2019-07-25 | 310005
2014 | (주)다인유통 | 경기 광주시 오포읍 능평리 146-7 | 2017-07-04 | 350001
2015 | OK마트 | 경기 광주시 오포읍 능평리 153-18 | 2019-01-02 | 22610
2016 | 대한식자재마트 | 경기 광주시 오포읍 문형리 561-5 | 2022-06-30 | 350007
2017 | 씨유 광주오포왕림점 | 광주시 오포읍 왕림로45 | 2021-04-23 | 350005
2018 | 씨유 오포왕림점 | 경기 광주시 오포읍 능평리 71-8 | 2017-11-21 | 350002
2019 | 어썸마켓 경기광주점 | 경기 광주시 오포읍 능평리 175-7 | 2021-08-30 | 350006
2020 | 종량제봉투 판매소 | 처인구 마평동 703 | 2016-12-22 | 0
2021 | 주식회사 태현유통 | 경기 광주시 오포읍 능평리 153-18 | 2020-01-15 | 350003

Duplicate rows

Most frequently occurring

Row | 거래처명 | 주소 | 등록일 | 지정번호 | # duplicates
0 | (GS25)죽전중앙점 | 죽전동1254-3 | 2020-09-15 | 340015 | 2
1 | (주)농민마트현백물류 | 경기 화성시 반월동 66-2 | 2021-04-19 | 310007 | 2
2 | (주)농수산식자재마트 용인점 | 수지구 동천동 969 | 2019-08-01 | 241518 | 2
3 | (주)농협하나로유통 농협성남유통센터 | 경기 성남시 분당구 구미동 174 | 2022-02-24 | 281358 | 2
4 | (주)다인유통 | 경기 광주시 오포읍 능평리 146-7 | 2017-07-04 | 350001 | 2
5 | (주)더바른 | 경기 화성시 반월동 77 | 2022-03-22 | 310009 | 2
6 | (주)비앤에이치 | 수지구 죽전동 1291 | 2010-11-15 | 231396 | 2
7 | (주)새농 용인수지지점 | 수지구 동천동 942-4 | 2018-10-10 | 241507 | 2
8 | (주)신세계 | 수지구 죽전동 1285 | 2020-04-17 | 340013 | 2
9 | (주)오아시스 수지동천점 | 수지구 동천동 948 | 2018-10-26 | 241509 | 2
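
Rows count as duplicates here when all four columns match. Below is a minimal sketch of how such groups can be found with the standard library, assuming each record is an exact-match tuple. The three-row list is a toy input assembled from rows shown in this report, and `duplicate_groups` is an illustrative name rather than part of ydata-profiling.

```python
from collections import Counter


def duplicate_groups(rows):
    """Return {row: occurrences} for rows that appear more than once."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


# Toy input: (거래처명, 주소, 등록일, 지정번호) tuples; the first row is repeated.
rows = [
    ("(주)다인유통", "경기 광주시 오포읍 능평리 146-7", "2017-07-04", 350001),
    ("OK마트", "경기 광주시 오포읍 능평리 153-18", "2019-01-02", 22610),
    ("(주)다인유통", "경기 광주시 오포읍 능평리 146-7", "2017-07-04", 350001),
]

for row, n in duplicate_groups(rows).items():
    print(row[0], n)
```

Because the comparison is exact, two records that differ only in spacing or in how the address is written would not be grouped together.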