Overview

Dataset statistics

Number of variables8
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows819
Duplicate rows (%)8.2%
Total size in memory742.2 KiB
Average record size in memory76.0 B

Variable types

Categorical3
Numeric4
DateTime1

Dataset

Description광주광역시 서구 관급봉투관리시스템의 종량제봉투(납부필증) 판매현황 정보입니다.
Author광주광역시 서구
URLhttps://www.data.go.kr/data/15039805/fileData.do

Alerts

기관명 has constant value ""Constant
판매구분 has constant value ""Constant
데이터기준일자 has constant value ""Constant
Dataset has 819 (8.2%) duplicate rowsDuplicates
바코드번호 is highly overall correlated with 봉투구분High correlation
봉투구분 is highly overall correlated with 바코드번호High correlation
금액 is highly skewed (γ1 = 25.47593399)Skewed
금액 has 360 (3.6%) zerosZeros

Reproduction

Analysis started2023-12-12 16:19:25.583059
Analysis finished2023-12-12 16:19:28.428263
Duration2.85 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기관명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
서구청
10000 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서구청
2nd row서구청
3rd row서구청
4th row서구청
5th row서구청

Common Values

ValueCountFrequency (%)
서구청 10000
100.0%

Length

2023-12-13T01:19:28.483901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:19:28.560951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서구청 10000
100.0%

수불날짜
Real number (ℝ)

Distinct167
Distinct (%)1.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20190445
Minimum20190102
Maximum20190902
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:19:28.647552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20190102
5-th percentile20190110
Q120190221
median20190417
Q320190702
95-th percentile20190820
Maximum20190902
Range800
Interquartile range (IQR)481

Descriptive statistics

Standard deviation241.536
Coefficient of variation (CV)1.1962886 × 10-5
Kurtosis-1.3052164
Mean20190445
Median Absolute Deviation (MAD)203
Skewness0.13470176
Sum2.0190445 × 1011
Variance58339.638
MonotonicityNot monotonic
2023-12-13T01:19:28.799667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20190125 156
 
1.6%
20190222 146
 
1.5%
20190129 130
 
1.3%
20190308 119
 
1.2%
20190104 114
 
1.1%
20190109 113
 
1.1%
20190118 111
 
1.1%
20190214 98
 
1.0%
20190131 97
 
1.0%
20190705 97
 
1.0%
Other values (157) 8819
88.2%
ValueCountFrequency (%)
20190102 55
0.5%
20190103 67
0.7%
20190104 114
1.1%
20190107 53
0.5%
20190108 75
0.8%
20190109 113
1.1%
20190110 69
0.7%
20190111 57
0.6%
20190114 49
0.5%
20190115 78
0.8%
ValueCountFrequency (%)
20190902 45
0.4%
20190830 59
0.6%
20190829 42
0.4%
20190828 50
0.5%
20190827 50
0.5%
20190826 35
 
0.4%
20190823 89
0.9%
20190822 51
0.5%
20190821 58
0.6%
20190820 69
0.7%

판매구분
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
일반판매
10000 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반판매
2nd row일반판매
3rd row일반판매
4th row일반판매
5th row일반판매

Common Values

ValueCountFrequency (%)
일반판매 10000
100.0%

Length

2023-12-13T01:19:28.923865image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:19:29.023992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반판매 10000
100.0%

바코드번호
Real number (ℝ)

HIGH CORRELATION 

Distinct896
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.6548727 × 1011
Minimum1.215 × 1010
Maximum9.05 × 1011
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:19:29.135878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.215 × 1010
5-th percentile9.9160001 × 1010
Q13.96 × 1011
median5.9 × 1011
Q36.33 × 1011
95-th percentile9.05 × 1011
Maximum9.05 × 1011
Range8.9285 × 1011
Interquartile range (IQR)2.37 × 1011

Descriptive statistics

Standard deviation2.4887611 × 1011
Coefficient of variation (CV)0.44010913
Kurtosis-0.38765295
Mean5.6548727 × 1011
Median Absolute Deviation (MAD)4.3 × 1010
Skewness-0.59463123
Sum5.6548727 × 1015
Variance6.1939318 × 1022
MonotonicityNot monotonic
2023-12-13T01:19:29.283171image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
584000000000 1609
16.1%
613000000000 1111
11.1%
609000000000 864
8.6%
905000000000 849
8.5%
879000000000 827
8.3%
119000000000 691
 
6.9%
590000000000 666
 
6.7%
854000000000 574
 
5.7%
633000000000 545
 
5.5%
370000000000 469
 
4.7%
Other values (886) 1795
17.9%
ValueCountFrequency (%)
12150000006 1
< 0.1%
12150000010 1
< 0.1%
12150000012 1
< 0.1%
12150000016 1
< 0.1%
12160000735 1
< 0.1%
12160000736 1
< 0.1%
12160000737 1
< 0.1%
12160006515 1
< 0.1%
12160006524 1
< 0.1%
12160006573 1
< 0.1%
ValueCountFrequency (%)
905000000000 849
8.5%
879000000000 827
8.3%
854000000000 574
 
5.7%
844000000000 101
 
1.0%
794000000000 1
 
< 0.1%
633000000000 545
 
5.5%
613000000000 1111
11.1%
609000000000 864
8.6%
590000000000 666
6.7%
584000000000 1609
16.1%

금액
Real number (ℝ)

SKEWED  ZEROS 

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14294.354
Minimum0
Maximum1336000
Zeros360
Zeros (%)3.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:19:29.392641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3600
Q16800
median13400
Q320200
95-th percentile33400
Maximum1336000
Range1336000
Interquartile range (IQR)13400

Descriptive statistics

Standard deviation46363.208
Coefficient of variation (CV)3.243463
Kurtosis672.73035
Mean14294.354
Median Absolute Deviation (MAD)6600
Skewness25.475934
Sum1.4294354 × 108
Variance2.1495471 × 109
MonotonicityNot monotonic
2023-12-13T01:19:29.509984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%)
7000 2168
21.7%
13600 2100
21.0%
20200 1623
16.2%
33400 894
8.9%
6800 877
8.8%
3800 720
 
7.2%
13400 651
 
6.5%
3600 519
 
5.2%
0 360
 
3.6%
1010 32
 
0.3%
Other values (11) 56
 
0.6%
ValueCountFrequency (%)
0 360
3.6%
180 4
 
< 0.1%
350 2
 
< 0.1%
670 2
 
< 0.1%
680 8
 
0.1%
1010 32
 
0.3%
1670 26
 
0.3%
3600 519
5.2%
3800 720
7.2%
6800 877
8.8%
ValueCountFrequency (%)
1336000 2
 
< 0.1%
1260000 4
 
< 0.1%
1224000 4
 
< 0.1%
1212000 2
 
< 0.1%
1206000 1
 
< 0.1%
950000 1
 
< 0.1%
33400 894
8.9%
20200 1623
16.2%
13600 2100
21.0%
13400 651
 
6.5%

수량
Real number (ℝ)

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean58.6182
Minimum1
Maximum5000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-13T01:19:29.605529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile20
Q120
median20
Q320
95-th percentile20
Maximum5000
Range4999
Interquartile range (IQR)0

Descriptive statistics

Standard deviation314.84217
Coefficient of variation (CV)5.3710651
Kurtosis118.25659
Mean58.6182
Median Absolute Deviation (MAD)0
Skewness10.183722
Sum586182
Variance99125.589
MonotonicityNot monotonic
2023-12-13T01:19:29.706964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
20 9694
96.9%
1 102
 
1.0%
1800 68
 
0.7%
1200 49
 
0.5%
800 41
 
0.4%
3600 37
 
0.4%
5000 9
 
0.1%
ValueCountFrequency (%)
1 102
 
1.0%
20 9694
96.9%
800 41
 
0.4%
1200 49
 
0.5%
1800 68
 
0.7%
3600 37
 
0.4%
5000 9
 
0.1%
ValueCountFrequency (%)
5000 9
 
0.1%
3600 37
 
0.4%
1800 68
 
0.7%
1200 49
 
0.5%
800 41
 
0.4%
20 9694
96.9%
1 102
 
1.0%

봉투구분
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
10리터
3199 
20리터
2873 
30리터
1704 
5리터
1263 
50리터
961 

Length

Max length4
Median length4
Mean length3.8737
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row10리터
2nd row10리터
3rd row20리터
4th row5리터
5th row5리터

Common Values

ValueCountFrequency (%)
10리터 3199
32.0%
20리터 2873
28.7%
30리터 1704
17.0%
5리터 1263
 
12.6%
50리터 961
 
9.6%

Length

2023-12-13T01:19:29.812201image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T01:19:29.902295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
10리터 3199
32.0%
20리터 2873
28.7%
30리터 1704
17.0%
5리터 1263
 
12.6%
50리터 961
 
9.6%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-01-07 00:00:00
Maximum2021-01-07 00:00:00
2023-12-13T01:19:29.984806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:30.060548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T01:19:27.845443image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:26.251624image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:27.029470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:27.435175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:27.939573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:26.400726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:27.145715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:27.544205image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:28.017718image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:26.800218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:27.250571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:27.650484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:28.108375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:26.919792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:27.345751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:19:27.753168image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T01:19:30.120558image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수불날짜바코드번호금액수량봉투구분
수불날짜1.0000.5770.0410.1840.390
바코드번호0.5771.0000.0000.1360.691
금액0.0410.0001.0000.6000.012
수량0.1840.1360.6001.0000.221
봉투구분0.3900.6910.0120.2211.000
2023-12-13T01:19:30.201491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
수불날짜바코드번호금액수량봉투구분
수불날짜1.000-0.012-0.297-0.0160.172
바코드번호-0.0121.0000.282-0.0040.538
금액-0.2970.2821.000-0.0760.009
수량-0.016-0.004-0.0761.0000.151
봉투구분0.1720.5380.0090.1511.000

Missing values

2023-12-13T01:19:28.221058image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:19:28.369241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기관명수불날짜판매구분바코드번호금액수량봉투구분데이터기준일자
20500서구청20190308일반판매58400000000070002010리터2021-01-07
25395서구청20190422일반판매58400000000002010리터2021-01-07
44326서구청20190116일반판매396000000000136002020리터2021-01-07
1944서구청20190222일반판매3470000000003800205리터2021-01-07
6480서구청20190614일반판매5900000000003800205리터2021-01-07
23965서구청20190409일반판매58400000000070002010리터2021-01-07
45794서구청20190124일반판매609000000000136002020리터2021-01-07
24756서구청20190416일반판매58400000000070002010리터2021-01-07
94546서구청20190409일반판매9050000000001670150리터2021-01-07
94194서구청20190404일반판매905000000000334002050리터2021-01-07
기관명수불날짜판매구분바코드번호금액수량봉투구분데이터기준일자
40575서구청20190821일반판매61300000000002010리터2021-01-07
92147서구청20190308일반판매905000000000334002050리터2021-01-07
70951서구청20190115일반판매74230005704202002030리터2021-01-07
40449서구청20190821일반판매61300000000068002010리터2021-01-07
70540서구청20190109일반판매74230002453202002030리터2021-01-07
84471서구청20190808일반판매119000000000202002030리터2021-01-07
11906서구청20190102일반판매37000000000070002010리터2021-01-07
62410서구청20190619일반판매633000000000136002020리터2021-01-07
2714서구청20190315일반판매5610000000003800205리터2021-01-07
50773서구청20190305일반판매609000000000136002020리터2021-01-07

Duplicate rows

Most frequently occurring

기관명수불날짜판매구분바코드번호금액수량봉투구분데이터기준일자# duplicates
101서구청20190129일반판매609000000000136002020리터2021-01-0738
5서구청20190104일반판매37000000000070002010리터2021-01-0736
182서구청20190222일반판매609000000000136002020리터2021-01-0736
752서구청20190808일반판매61300000000068002010리터2021-01-0736
113서구청20190131일반판매609000000000136002020리터2021-01-0735
675서구청20190705일반판매61300000000068002010리터2021-01-0735
146서구청20190214일반판매58400000000070002010리터2021-01-0734
151서구청20190215일반판매58400000000070002010리터2021-01-0734
51서구청20190118일반판매37000000000070002010리터2021-01-0732
180서구청20190222일반판매58400000000070002010리터2021-01-0732