Overview

Dataset statistics

Number of variables5
Number of observations477
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.2 KiB
Average record size in memory41.3 B

Variable types

Categorical3
Text1
Numeric1

Dataset

Description부산광역시 상수도사업본부 물공급운영시스템에 유량계를 통해 수집된 사업소별 유량 현황에 대한 데이터로 사업소명, 블록명, 수집일, 시간, 순시유량 등의 항목을 제공합니다
Author공공데이터포털
URLhttps://www.data.go.kr/data/15119900/fileData.do

Alerts

수집일자 has constant value ""Constant
수집시간 has constant value ""Constant
블록명 has unique valuesUnique

Reproduction

Analysis started2024-04-21 01:18:38.595851
Analysis finished2024-04-21 01:18:39.180179
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업소명
Categorical

Distinct11
Distinct (%)2.3%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
동래통합사업소
89 
북부사업소
61 
부산진사업소
52 
남부사업소
51 
사하사업소
48 
Other values (6)
176 

Length

Max length7
Median length5
Mean length5.6331237
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동래통합사업소
2nd row동래통합사업소
3rd row동래통합사업소
4th row동래통합사업소
5th row동래통합사업소

Common Values

ValueCountFrequency (%)
동래통합사업소 89
18.7%
북부사업소 61
12.8%
부산진사업소 52
10.9%
남부사업소 51
10.7%
사하사업소 48
10.1%
중동부사업소 37
7.8%
해운대사업소 35
 
7.3%
서부사업소 29
 
6.1%
기장사업소 29
 
6.1%
영도사업소 25
 
5.2%

Length

2024-04-21T10:18:39.309047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
동래통합사업소 89
18.7%
북부사업소 61
12.8%
부산진사업소 52
10.9%
남부사업소 51
10.7%
사하사업소 48
10.1%
중동부사업소 37
7.8%
해운대사업소 35
 
7.3%
서부사업소 29
 
6.1%
기장사업소 29
 
6.1%
영도사업소 25
 
5.2%

블록명
Text

UNIQUE 

Distinct477
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2024-04-21T10:18:40.322853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length5
Mean length5.3459119
Min length4

Characters and Unicode

Total characters2550
Distinct characters39
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique477 ?
Unique (%)100.0%

Sample

1st row동래 1
2nd row동래 2
3rd row동래 3
4th row동래 4
5th row동래 5
ValueCountFrequency (%)
북부 61
 
6.4%
동래 58
 
6.1%
부산진 52
 
5.5%
남부 51
 
5.4%
사하 48
 
5.0%
중동부 37
 
3.9%
해운대 35
 
3.7%
금정 31
 
3.3%
서부 29
 
3.0%
기장 29
 
3.0%
Other values (107) 522
54.8%
2024-04-21T10:18:41.817388image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
479
18.8%
1 243
 
9.5%
230
 
9.0%
2 169
 
6.6%
3 110
 
4.3%
96
 
3.8%
4 94
 
3.7%
- 80
 
3.1%
61
 
2.4%
5 58
 
2.3%
Other values (29) 930
36.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1080
42.4%
Decimal Number 908
35.6%
Space Separator 479
18.8%
Dash Punctuation 80
 
3.1%
Open Punctuation 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
230
21.3%
96
 
8.9%
61
 
5.6%
58
 
5.4%
52
 
4.8%
52
 
4.8%
51
 
4.7%
50
 
4.6%
48
 
4.4%
48
 
4.4%
Other values (14) 334
30.9%
Decimal Number
ValueCountFrequency (%)
1 243
26.8%
2 169
18.6%
3 110
12.1%
4 94
 
10.4%
5 58
 
6.4%
8 51
 
5.6%
6 48
 
5.3%
7 47
 
5.2%
9 46
 
5.1%
0 42
 
4.6%
Space Separator
ValueCountFrequency (%)
479
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 80
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
D 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1469
57.6%
Hangul 1080
42.4%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
230
21.3%
96
 
8.9%
61
 
5.6%
58
 
5.4%
52
 
4.8%
52
 
4.8%
51
 
4.7%
50
 
4.6%
48
 
4.4%
48
 
4.4%
Other values (14) 334
30.9%
Common
ValueCountFrequency (%)
479
32.6%
1 243
16.5%
2 169
 
11.5%
3 110
 
7.5%
4 94
 
6.4%
- 80
 
5.4%
5 58
 
3.9%
8 51
 
3.5%
6 48
 
3.3%
7 47
 
3.2%
Other values (4) 90
 
6.1%
Latin
ValueCountFrequency (%)
D 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1470
57.6%
Hangul 1080
42.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
479
32.6%
1 243
16.5%
2 169
 
11.5%
3 110
 
7.5%
4 94
 
6.4%
- 80
 
5.4%
5 58
 
3.9%
8 51
 
3.5%
6 48
 
3.3%
7 47
 
3.2%
Other values (5) 91
 
6.2%
Hangul
ValueCountFrequency (%)
230
21.3%
96
 
8.9%
61
 
5.6%
58
 
5.4%
52
 
4.8%
52
 
4.8%
51
 
4.7%
50
 
4.6%
48
 
4.4%
48
 
4.4%
Other values (14) 334
30.9%

수집일자
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2023-08-01
477 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-08-01
2nd row2023-08-01
3rd row2023-08-01
4th row2023-08-01
5th row2023-08-01

Common Values

ValueCountFrequency (%)
2023-08-01 477
100.0%

Length

2024-04-21T10:18:42.214557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:18:42.495264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-08-01 477
100.0%

수집시간
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
15:00
477 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row15:00
2nd row15:00
3rd row15:00
4th row15:00
5th row15:00

Common Values

ValueCountFrequency (%)
15:00 477
100.0%

Length

2024-04-21T10:18:42.794471image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T10:18:43.075500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
15:00 477
100.0%

순시유량
Real number (ℝ)

Distinct469
Distinct (%)98.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean97.11101
Minimum1.37
Maximum860.85
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 KiB
2024-04-21T10:18:43.396270image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.37
5-th percentile10.188
Q131.9
median60.25
Q3115.42
95-th percentile308.816
Maximum860.85
Range859.48
Interquartile range (IQR)83.52

Descriptive statistics

Standard deviation112.77878
Coefficient of variation (CV)1.1613388
Kurtosis12.821965
Mean97.11101
Median Absolute Deviation (MAD)35.88
Skewness3.071934
Sum46321.952
Variance12719.054
MonotonicityNot monotonic
2024-04-21T10:18:43.819742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
97.3 2
 
0.4%
61.2 2
 
0.4%
53.2 2
 
0.4%
17.58 2
 
0.4%
43.7 2
 
0.4%
23.14 2
 
0.4%
43.31 2
 
0.4%
66.3 2
 
0.4%
31.87 1
 
0.2%
88.15 1
 
0.2%
Other values (459) 459
96.2%
ValueCountFrequency (%)
1.37 1
0.2%
1.46 1
0.2%
1.92 1
0.2%
2.02 1
0.2%
2.19 1
0.2%
2.4 1
0.2%
2.76 1
0.2%
3.35 1
0.2%
4.49 1
0.2%
4.72 1
0.2%
ValueCountFrequency (%)
860.85 1
0.2%
805.19 1
0.2%
748.1 1
0.2%
668.231 1
0.2%
623.6 1
0.2%
550.43 1
0.2%
539.43 1
0.2%
537.8 1
0.2%
451.4 1
0.2%
442.2 1
0.2%

Interactions

2024-04-21T10:18:38.774776image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T10:18:44.071662image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업소명순시유량
사업소명1.0000.166
순시유량0.1661.000
2024-04-21T10:18:44.296763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순시유량사업소명
순시유량1.0000.071
사업소명0.0711.000

Missing values

2024-04-21T10:18:38.950479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T10:18:39.115496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업소명블록명수집일자수집시간순시유량
0동래통합사업소동래 12023-08-0115:0024.8
1동래통합사업소동래 22023-08-0115:00236.5
2동래통합사업소동래 32023-08-0115:0051.13
3동래통합사업소동래 42023-08-0115:0040.4
4동래통합사업소동래 52023-08-0115:00177.86
5동래통합사업소동래 62023-08-0115:00101.09
6동래통합사업소동래 72023-08-0115:0022.99
7동래통합사업소동래 7-12023-08-0115:002.4
8동래통합사업소동래 82023-08-0115:00112.95
9동래통합사업소동래 92023-08-0115:0016.79
사업소명블록명수집일자수집시간순시유량
467기장사업소기장 14-12023-08-0115:00165.9
468기장사업소기장 14-22023-08-0115:00172.1
469기장사업소기장 14-32023-08-0115:0010.49
470기장사업소기장 14-4(D100)2023-08-0115:00166.88
471기장사업소기장 152023-08-0115:004.81
472기장사업소기장 162023-08-0115:0065.73
473기장사업소기장 16-12023-08-0115:0029.62
474기장사업소기장 172023-08-0115:008.0
475기장사업소기장 182023-08-0115:0017.32
476기장사업소기장 192023-08-0115:00100.0