Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory585.9 KiB
Average record size in memory60.0 B

Variable types

Numeric3
DateTime1
Categorical2

Dataset

Description부산광역시상수도사업본부_수용가정보시스템_간단e납부시스템연계자료_일반수납내역분배자료_20230126
Author부산광역시 상수도사업본부
URLhttp://data.busan.go.kr/dataSet/detail.nm?contentId=10&publicdatapk=15083440

Alerts

이용기관발행기관 has constant value ""Constant
이용기관지로번호 has constant value ""Constant
수납금액 is highly skewed (γ1 = 93.98457205)Skewed
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-10 16:50:03.330028
Analysis finished2023-12-10 16:50:05.378950
Duration2.05 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47565.17
Minimum2
Maximum95291
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:50:05.590455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile4676.85
Q123117.5
median47754
Q371614
95-th percentile90722.2
Maximum95291
Range95289
Interquartile range (IQR)48496.5

Descriptive statistics

Standard deviation27710.162
Coefficient of variation (CV)0.58257255
Kurtosis-1.2124124
Mean47565.17
Median Absolute Deviation (MAD)24174.5
Skewness5.0850011 × 10-5
Sum4.756517 × 108
Variance7.678531 × 108
MonotonicityNot monotonic
2023-12-11T01:50:06.043206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
33991 1
 
< 0.1%
11119 1
 
< 0.1%
86478 1
 
< 0.1%
52201 1
 
< 0.1%
37116 1
 
< 0.1%
57798 1
 
< 0.1%
68426 1
 
< 0.1%
87519 1
 
< 0.1%
90464 1
 
< 0.1%
77219 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
2 1
< 0.1%
3 1
< 0.1%
4 1
< 0.1%
10 1
< 0.1%
21 1
< 0.1%
42 1
< 0.1%
52 1
< 0.1%
62 1
< 0.1%
74 1
< 0.1%
79 1
< 0.1%
ValueCountFrequency (%)
95291 1
< 0.1%
95287 1
< 0.1%
95256 1
< 0.1%
95255 1
< 0.1%
95252 1
< 0.1%
95236 1
< 0.1%
95215 1
< 0.1%
95214 1
< 0.1%
95210 1
< 0.1%
95202 1
< 0.1%
Distinct86
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2022-01-01 00:00:00
Maximum2022-03-30 00:00:00
2023-12-11T01:50:07.239019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:50:07.704241image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

일련번호
Real number (ℝ)

Distinct4546
Distinct (%)45.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2072.132
Minimum1
Maximum9500
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:50:08.002493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile61
Q1602.75
median1716.5
Q32938
95-th percentile5716.1
Maximum9500
Range9499
Interquartile range (IQR)2335.25

Descriptive statistics

Standard deviation1835.908
Coefficient of variation (CV)0.88599954
Kurtosis2.0640621
Mean2072.132
Median Absolute Deviation (MAD)1158.5
Skewness1.3473313
Sum20721320
Variance3370558.2
MonotonicityNot monotonic
2023-12-11T01:50:08.320537image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
94 18
 
0.2%
64 13
 
0.1%
45 13
 
0.1%
78 13
 
0.1%
89 12
 
0.1%
33 12
 
0.1%
5 12
 
0.1%
42 12
 
0.1%
122 12
 
0.1%
1 12
 
0.1%
Other values (4536) 9871
98.7%
ValueCountFrequency (%)
1 12
0.1%
2 4
 
< 0.1%
3 10
0.1%
4 6
0.1%
5 12
0.1%
6 8
0.1%
7 12
0.1%
8 12
0.1%
9 9
0.1%
10 10
0.1%
ValueCountFrequency (%)
9500 1
< 0.1%
9486 1
< 0.1%
9482 1
< 0.1%
9480 1
< 0.1%
9471 1
< 0.1%
9437 1
< 0.1%
9434 1
< 0.1%
9422 1
< 0.1%
9370 1
< 0.1%
9366 1
< 0.1%

이용기관발행기관
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
부산광역시
10000 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row부산광역시
2nd row부산광역시
3rd row부산광역시
4th row부산광역시
5th row부산광역시

Common Values

ValueCountFrequency (%)
부산광역시 10000
100.0%

Length

2023-12-11T01:50:08.612266image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:50:08.768420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
부산광역시 10000
100.0%

이용기관지로번호
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
1004102
10000 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1004102
2nd row1004102
3rd row1004102
4th row1004102
5th row1004102

Common Values

ValueCountFrequency (%)
1004102 10000
100.0%

Length

2023-12-11T01:50:08.954963image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T01:50:09.124484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1004102 10000
100.0%

수납금액
Real number (ℝ)

SKEWED 

Distinct4538
Distinct (%)45.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean208939.92
Minimum30
Maximum4.0251898 × 108
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T01:50:09.407702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum30
5-th percentile2400
Q118230
median45940
Q3104540
95-th percentile459636
Maximum4.0251898 × 108
Range4.0251895 × 108
Interquartile range (IQR)86310

Descriptive statistics

Standard deviation4109818.4
Coefficient of variation (CV)19.669857
Kurtosis9186.6116
Mean208939.92
Median Absolute Deviation (MAD)34320
Skewness93.984572
Sum2.0893992 × 109
Variance1.6890608 × 1013
MonotonicityNot monotonic
2023-12-11T01:50:09.674231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2400 693
 
6.9%
11620 83
 
0.8%
14260 80
 
0.8%
16900 74
 
0.7%
8980 66
 
0.7%
3700 61
 
0.6%
2470 58
 
0.6%
6340 49
 
0.5%
12940 35
 
0.4%
26150 35
 
0.4%
Other values (4528) 8766
87.7%
ValueCountFrequency (%)
30 1
< 0.1%
230 1
< 0.1%
340 1
< 0.1%
350 1
< 0.1%
420 2
< 0.1%
530 1
< 0.1%
690 1
< 0.1%
730 1
< 0.1%
890 1
< 0.1%
920 1
< 0.1%
ValueCountFrequency (%)
402518980 1
< 0.1%
35410660 1
< 0.1%
21416000 1
< 0.1%
19195950 1
< 0.1%
15768430 1
< 0.1%
15616470 1
< 0.1%
15042000 1
< 0.1%
14843320 1
< 0.1%
14606610 1
< 0.1%
14360280 1
< 0.1%

Interactions

2023-12-11T01:50:04.448415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:50:03.711683image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:50:04.059685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:50:04.563128image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:50:03.820533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:50:04.173196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:50:04.683160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:50:03.947449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T01:50:04.318881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T01:50:09.912562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번납부일자일련번호수납금액
연번1.0000.9720.4790.000
납부일자0.9721.0000.6900.000
일련번호0.4790.6901.0000.000
수납금액0.0000.0000.0001.000
2023-12-11T01:50:10.096762image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번일련번호수납금액
연번1.000-0.025-0.030
일련번호-0.0251.0000.084
수납금액-0.0300.0841.000

Missing values

2023-12-11T01:50:04.916766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T01:50:05.259465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번납부일자일련번호이용기관발행기관이용기관지로번호수납금액
33990339912022-01-271548부산광역시1004102134100
12340123412022-01-244741부산광역시1004102147880
46789467902022-02-232426부산광역시10041022400
915091512022-01-282919부산광역시10041022017010
842384242022-01-28885부산광역시100410215580
69661696622022-03-28121부산광역시1004102546390
48602486032022-02-231091부산광역시1004102116490
8228232022-01-03330부산광역시100410247040
60047600482022-02-289179부산광역시10041025090
79227792282022-03-253039부산광역시10041022889500
연번납부일자일련번호이용기관발행기관이용기관지로번호수납금액
76119761202022-03-241278부산광역시100410215600
65313653142022-02-242765부산광역시1004102233960
40707407082022-02-254411부산광역시10041024569520
58812588132022-02-285384부산광역시1004102116310
79515795162022-03-241974부산광역시100410224840
36913369142022-01-201032부산광역시100410240640
77863778642022-03-253505부산광역시1004102133480
358335842022-01-24177부산광역시100410268020
89864898652022-03-222245부산광역시100410239380
20343203442022-01-254592부산광역시1004102335980