Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells297
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory654.3 KiB
Average record size in memory67.0 B

Variable types

Numeric2
Categorical2
Text1
DateTime2

Dataset

Description인천광역시 중구 관내에서 실시한 쓰레기종량제봉투 포장단위에 대한 조사 데이터 입니다.<br/><br/>파일명 인천광역시_중구_쓰레기종량제봉투 포장단위<br/>파일내용 지정코드, 봉투명, 구분 등 <br/>
Author인천광역시 중구
URLhttps://data.incheon.go.kr/findData/publicDataDetail?dataId=15060078&srcSe=7661IVAWM27C61E190

Alerts

지정코드 has constant value ""Constant
데이터기준일자 has constant value ""Constant
만료기간 has 297 (3.0%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2024-03-18 04:35:12.755627
Analysis finished2024-03-18 04:35:15.609271
Duration2.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6150.0923
Minimum1
Maximum12323
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-18T13:35:15.669036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile627.9
Q13074.75
median6146.5
Q39228.25
95-th percentile11687.15
Maximum12323
Range12322
Interquartile range (IQR)6153.5

Descriptive statistics

Standard deviation3550.1583
Coefficient of variation (CV)0.57725285
Kurtosis-1.1983025
Mean6150.0923
Median Absolute Deviation (MAD)3076
Skewness0.0050059999
Sum61500923
Variance12603624
MonotonicityNot monotonic
2024-03-18T13:35:15.775509image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9854 1
 
< 0.1%
3655 1
 
< 0.1%
8950 1
 
< 0.1%
7062 1
 
< 0.1%
1325 1
 
< 0.1%
11094 1
 
< 0.1%
12005 1
 
< 0.1%
7914 1
 
< 0.1%
1915 1
 
< 0.1%
9796 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
1 1
< 0.1%
2 1
< 0.1%
3 1
< 0.1%
5 1
< 0.1%
6 1
< 0.1%
7 1
< 0.1%
8 1
< 0.1%
9 1
< 0.1%
10 1
< 0.1%
11 1
< 0.1%
ValueCountFrequency (%)
12323 1
< 0.1%
12322 1
< 0.1%
12321 1
< 0.1%
12319 1
< 0.1%
12318 1
< 0.1%
12317 1
< 0.1%
12315 1
< 0.1%
12313 1
< 0.1%
12312 1
< 0.1%
12311 1
< 0.1%

지정코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
110301
10000 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row110301
2nd row110301
3rd row110301
4th row110301
5th row110301

Common Values

ValueCountFrequency (%)
110301 10000
100.0%

Length

2024-03-18T13:35:15.873723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T13:35:15.953816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
110301 10000
100.0%
Distinct75
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-18T13:35:16.179427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length9.3479
Min length6

Characters and Unicode

Total characters93479
Distinct characters66
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반용 5L
2nd row일반용 50L(영종)
3rd row음식물 2L
4th row재사용 20L(영종)
5th row일반용 10L(영종)
ValueCountFrequency (%)
음식물 3896
17.2%
일반용 1957
 
8.6%
100l 996
 
4.4%
스티커 986
 
4.4%
필증 957
 
4.2%
용기 930
 
4.1%
20l 883
 
3.9%
60l 853
 
3.8%
10l 837
 
3.7%
50l 836
 
3.7%
Other values (51) 9516
42.0%
2024-03-18T13:35:16.448155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12647
 
13.5%
0 10697
 
11.4%
L 8798
 
9.4%
5831
 
6.2%
4112
 
4.4%
3896
 
4.2%
3896
 
4.2%
1 3384
 
3.6%
5 2956
 
3.2%
) 2414
 
2.6%
Other values (56) 34848
37.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 45520
48.7%
Decimal Number 21610
23.1%
Space Separator 12647
 
13.5%
Uppercase Letter 8846
 
9.5%
Close Punctuation 2414
 
2.6%
Open Punctuation 2414
 
2.6%
Other Punctuation 28
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5831
 
12.8%
4112
 
9.0%
3896
 
8.6%
3896
 
8.6%
1957
 
4.3%
1957
 
4.3%
1718
 
3.8%
1704
 
3.7%
1590
 
3.5%
1465
 
3.2%
Other values (41) 17394
38.2%
Decimal Number
ValueCountFrequency (%)
0 10697
49.5%
1 3384
 
15.7%
5 2956
 
13.7%
2 2298
 
10.6%
3 1219
 
5.6%
6 853
 
3.9%
7 203
 
0.9%
Uppercase Letter
ValueCountFrequency (%)
L 8798
99.5%
T 16
 
0.2%
P 16
 
0.2%
E 16
 
0.2%
Space Separator
ValueCountFrequency (%)
12647
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2414
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2414
100.0%
Other Punctuation
ValueCountFrequency (%)
, 28
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 45520
48.7%
Common 39113
41.8%
Latin 8846
 
9.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5831
 
12.8%
4112
 
9.0%
3896
 
8.6%
3896
 
8.6%
1957
 
4.3%
1957
 
4.3%
1718
 
3.8%
1704
 
3.7%
1590
 
3.5%
1465
 
3.2%
Other values (41) 17394
38.2%
Common
ValueCountFrequency (%)
12647
32.3%
0 10697
27.3%
1 3384
 
8.7%
5 2956
 
7.6%
) 2414
 
6.2%
( 2414
 
6.2%
2 2298
 
5.9%
3 1219
 
3.1%
6 853
 
2.2%
7 203
 
0.5%
Latin
ValueCountFrequency (%)
L 8798
99.5%
T 16
 
0.2%
P 16
 
0.2%
E 16
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 47959
51.3%
Hangul 45520
48.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12647
26.4%
0 10697
22.3%
L 8798
18.3%
1 3384
 
7.1%
5 2956
 
6.2%
) 2414
 
5.0%
( 2414
 
5.0%
2 2298
 
4.8%
3 1219
 
2.5%
6 853
 
1.8%
Other values (5) 279
 
0.6%
Hangul
ValueCountFrequency (%)
5831
 
12.8%
4112
 
9.0%
3896
 
8.6%
3896
 
8.6%
1957
 
4.3%
1957
 
4.3%
1718
 
3.8%
1704
 
3.7%
1590
 
3.5%
1465
 
3.2%
Other values (41) 17394
38.2%

구분
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
3339 
낱장
3332 
박스
3329 

Length

Max length2
Median length2
Mean length1.6661
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row박스
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
3339
33.4%
낱장 3332
33.3%
박스 3329
33.3%

Length

2024-03-18T13:35:16.556252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-18T13:35:16.635420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3339
33.4%
낱장 3332
33.3%
박스 3329
33.3%

만료기간
Date

MISSING 

Distinct47
Distinct (%)0.5%
Missing297
Missing (%)3.0%
Memory size156.2 KiB
Minimum2000-10-31 00:00:00
Maximum2020-04-19 00:00:00
2024-03-18T13:35:16.717813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T13:35:16.824557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)

수량
Real number (ℝ)

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.9891
Minimum1
Maximum200
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-18T13:35:16.915181image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median10
Q320
95-th percentile100
Maximum200
Range199
Interquartile range (IQR)19

Descriptive statistics

Standard deviation37.906673
Coefficient of variation (CV)1.4585604
Kurtosis0.12053798
Mean25.9891
Median Absolute Deviation (MAD)9
Skewness1.3809556
Sum259891
Variance1436.9159
MonotonicityNot monotonic
2024-03-18T13:35:17.013901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
1 3332
33.3%
10 2246
22.5%
100 1996
20.0%
5 928
 
9.3%
20 674
 
6.7%
2 312
 
3.1%
30 279
 
2.8%
15 135
 
1.4%
50 94
 
0.9%
200 3
 
< 0.1%
ValueCountFrequency (%)
1 3332
33.3%
2 312
 
3.1%
5 928
 
9.3%
10 2246
22.5%
15 135
 
1.4%
20 674
 
6.7%
30 279
 
2.8%
50 94
 
0.9%
60 1
 
< 0.1%
100 1996
20.0%
ValueCountFrequency (%)
200 3
 
< 0.1%
100 1996
20.0%
60 1
 
< 0.1%
50 94
 
0.9%
30 279
 
2.8%
20 674
 
6.7%
15 135
 
1.4%
10 2246
22.5%
5 928
9.3%
2 312
 
3.1%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2023-08-29 00:00:00
Maximum2023-08-29 00:00:00
2024-03-18T13:35:17.131722image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T13:35:17.204685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2024-03-18T13:35:15.276996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T13:35:14.991260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T13:35:15.364063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-18T13:35:15.153717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-18T13:35:17.260136image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번봉투명구분만료기간수량
연번1.0000.3940.0000.9960.110
봉투명0.3941.0000.0000.2550.747
구분0.0000.0001.0000.0000.491
만료기간0.9960.2550.0001.0000.186
수량0.1100.7470.4910.1861.000
2024-03-18T13:35:17.334336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번수량구분
연번1.000-0.0020.000
수량-0.0021.0000.425
구분0.0000.4251.000

Missing values

2024-03-18T13:35:15.468085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-18T13:35:15.557972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번지정코드봉투명구분만료기간수량데이터기준일자
98539854110301일반용 5L2020-01-151002023-08-29
78597860110301일반용 50L(영종)박스2019-01-28302023-08-29
1101011011110301음식물 2L2020-02-101002023-08-29
54005401110301재사용 20L(영종)2017-11-141002023-08-29
95539554110301일반용 10L(영종)2019-07-251002023-08-29
41954196110301음식물 20L2017-05-241002023-08-29
52875288110301음식물 필증 10L2017-11-14202023-08-29
985986110301사업장용 100L낱장2007-05-0612023-08-29
98569857110301일반용 5L박스2020-01-15202023-08-29
19571958110301일반용 5L2016-06-021002023-08-29
연번지정코드봉투명구분만료기간수량데이터기준일자
17321733110301일반용 100L박스2014-01-17202023-08-29
99769977110301조개껍데기용 60L박스2020-01-15102023-08-29
45064507110301조개껍데기용 100L2017-08-25102023-08-29
1068610687110301음식물 10L2020-02-091002023-08-29
46894690110301스티커 10000 원권낱장2017-08-2512023-08-29
1023810239110301일반용 10L(영종)낱장2020-01-1612023-08-29
15401541110301일반용 20L박스2014-01-16102023-08-29
91709171110301일반용 5L낱장2019-07-2412023-08-29
16101611110301사업장용 100L박스2014-01-16202023-08-29
56155616110301음식물 필증 2L박스2017-11-151002023-08-29