Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 644.5 KiB
Average record size in memory: 66.0 B
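
The average record size follows from the total size and the row count. The sketch below redoes that arithmetic with the figures above; the per-cell figure is an illustrative extra that the report does not list.

```python
# Figures copied from the "Dataset statistics" block above.
TOTAL_SIZE_KIB = 644.5
N_OBSERVATIONS = 10_000
N_VARIABLES = 7


def average_record_size_bytes(total_kib: float, n_rows: int) -> float:
    """Convert a KiB total to bytes and divide by the number of rows."""
    return total_kib * 1024 / n_rows


avg = average_record_size_bytes(TOTAL_SIZE_KIB, N_OBSERVATIONS)
print(f"{avg:.1f} B per record")  # agrees with the reported 66.0 B

# Not in the report: a rough per-cell average, assuming equal-width columns.
print(f"{avg / N_VARIABLES:.1f} B per cell")
```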

Variable types

Numeric: 2
Categorical: 3
DateTime: 2

Dataset

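Description: Hourly aggregated flow data for the Daegu Metropolitan City water supply, by medium district (중구역). The data include the district number, place name, address, date, time, flow rate, and unit.
Author: Daegu Metropolitan City (대구광역시)
URL: https://www.data.go.kr/data/15116330/fileData.do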

Alerts

단위 (unit) has constant value "m³/h" [Constant]
장소명 (place name) is highly overall correlated with 중구역번호 (medium-district number) and 1 other field [High correlation]
주소 (address) is highly overall correlated with 중구역번호 and 1 other field [High correlation]
중구역번호 is highly overall correlated with 장소명 and 1 other field [High correlation]
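
One plausible reason for the correlation between 장소명 and 중구역번호 is that each place name carries the zero-padded district number in parentheses. The sketch below checks this on five (중구역번호, 장소명) pairs taken from the report's sample rows; the regular expression and the helper name `district_from_name` are illustrative assumptions, not part of the dataset's documentation.

```python
import re

# (중구역번호, 장소명) pairs copied from the report's sample rows.
SAMPLE_PAIRS = [
    (226, "K2(0226)"),
    (205, "불로시장(0205)"),
    (111, "경대병원(0111)"),
    (110, "국채보상로(0110)"),
    (206, "해서초등(0206)"),
]

# Assumption: the name ends with a four-digit code in parentheses.
CODE_IN_NAME = re.compile(r"\((\d{4})\)$")


def district_from_name(place_name: str) -> int:
    """Extract the trailing four-digit code from a place name as an integer."""
    match = CODE_IN_NAME.search(place_name)
    if match is None:
        raise ValueError(f"no district code found in {place_name!r}")
    return int(match.group(1))


# Collect every pair whose embedded code disagrees with the district number.
mismatches = [
    (number, name)
    for number, name in SAMPLE_PAIRS
    if district_from_name(name) != number
]
print(mismatches)
```

If the list stays empty on the full data, 장소명 is a relabeling of 중구역번호, and one of the two columns could be dropped before modeling.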

Reproduction

Analysis started: 2024-04-21 02:25:31.113345
Analysis finished: 2024-04-21 02:25:33.309207
Duration: 2.2 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

중구역번호 (medium-district number)
Real number (ℝ)

HIGH CORRELATION

Distinct: 46
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 183.9639
Minimum: 101
Maximum: 233
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 101
5-th percentile: 104
Q1: 113
Median: 209
Q3: 222
95-th percentile: 231
Maximum: 233
Range: 132
Interquartile range (IQR): 109

Descriptive statistics

Standard deviation: 50.617803
Coefficient of variation (CV): 0.27515074
Kurtosis: -1.2734277
Mean: 183.9639
Median Absolute Deviation (MAD): 15
Skewness: -0.77739939
Sum: 1839639
Variance: 2562.162
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=46)

Common values
Value | Count | Frequency (%)
108 | 265 | 2.6%
222 | 247 | 2.5%
231 | 245 | 2.5%
110 | 240 | 2.4%
220 | 238 | 2.4%
227 | 238 | 2.4%
206 | 236 | 2.4%
115 | 236 | 2.4%
209 | 232 | 2.3%
205 | 230 | 2.3%
Other values (36) | 7593 | 75.9%

Minimum 10 values
Value | Count | Frequency (%)
101 | 207 | 2.1%
102 | 212 | 2.1%
104 | 199 | 2.0%
105 | 221 | 2.2%
106 | 207 | 2.1%
107 | 194 | 1.9%
108 | 265 | 2.6%
109 | 212 | 2.1%
110 | 240 | 2.4%
111 | 205 | 2.1%

Maximum 10 values
Value | Count | Frequency (%)
233 | 188 | 1.9%
232 | 209 | 2.1%
231 | 245 | 2.5%
230 | 221 | 2.2%
229 | 192 | 1.9%
228 | 208 | 2.1%
227 | 238 | 2.4%
226 | 213 | 2.1%
225 | 222 | 2.2%
224 | 208 | 2.1%

장소명 (place name)
Categorical

HIGH CORRELATION

Distinct: 46
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

동산병원(0108): 265
귀빈예식장(0222): 247
우방강촌마을(0231): 245
국채보상로(0110): 240
신천주공2,3단지(0220): 238
Other values (41): 8765

Length

Max length: 15
Median length: 13
Mean length: 10.8608
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: K2(0226)
2nd row: 불로시장(0205)
3rd row: 경대병원(0111)
4th row: 국채보상로(0110)
5th row: 해서초등(0206)

Common Values

Value | Count | Frequency (%)
동산병원(0108) | 265 | 2.6%
귀빈예식장(0222) | 247 | 2.5%
우방강촌마을(0231) | 245 | 2.5%
국채보상로(0110) | 240 | 2.4%
신천주공2,3단지(0220) | 238 | 2.4%
방촌시장(0227) | 238 | 2.4%
해서초등(0206) | 236 | 2.4%
방천시장(0115) | 236 | 2.4%
동구청(0209) | 232 | 2.3%
불로시장(0205) | 230 | 2.3%
Other values (36) | 7593 | 75.9%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
동산병원(0108 | 265 | 2.6%
귀빈예식장(0222 | 247 | 2.5%
우방강촌마을(0231 | 245 | 2.5%
국채보상로(0110 | 240 | 2.4%
신천주공2,3단지(0220 | 238 | 2.4%
방촌시장(0227 | 238 | 2.4%
해서초등(0206 | 236 | 2.4%
방천시장(0115 | 236 | 2.4%
동구청(0209 | 232 | 2.3%
불로시장(0205 | 230 | 2.3%
Other values (36) | 7593 | 75.9%

주소 (address)
Categorical

HIGH CORRELATION

Distinct: 44
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

대구광역시 동구 방촌동 858-9: 421
대구광역시 중구 수창동 104: 419
대구광역시 중구 동산동 349: 265
대구광역시 동구 신천동 742: 247
대구광역시 동구 방촌동 1084-506: 245
Other values (39): 8403

Length

Max length: 21
Median length: 20
Mean length: 17.6743
Min length: 15

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 대구광역시 동구 방촌동 858-9
2nd row: 대구광역시 동구 불로동 460-12
3rd row: 대구광역시 중구 삼덕동3가 379
4th row: 대구광역시 중구 삼덕동2가 39
5th row: 대구광역시 동구 불로동 598-3

Common Values

Value | Count | Frequency (%)
대구광역시 동구 방촌동 858-9 | 421 | 4.2%
대구광역시 중구 수창동 104 | 419 | 4.2%
대구광역시 중구 동산동 349 | 265 | 2.6%
대구광역시 동구 신천동 742 | 247 | 2.5%
대구광역시 동구 방촌동 1084-506 | 245 | 2.5%
대구광역시 중구 삼덕동2가 39 | 240 | 2.4%
대구광역시 동구 신천동 522-1 | 238 | 2.4%
대구광역시 동구 방촌동 1139-1 | 238 | 2.4%
대구광역시 동구 불로동 598-3 | 236 | 2.4%
대구광역시 중구 대봉동 121-397 | 236 | 2.4%
Other values (34) | 7215 | 72.2%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
대구광역시 | 10000 | 25.0%
동구 | 6959 | 17.4%
중구 | 3041 | 7.6%
신암동 | 1294 | 3.2%
신천동 | 916 | 2.3%
방촌동 | 904 | 2.3%
불로동 | 897 | 2.2%
효목동 | 835 | 2.1%
남산동 | 643 | 1.6%
봉무동 | 445 | 1.1%
Other values (60) | 14066 | 35.2%

날짜 (date)
Date

Distinct: 91
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2024-01-01 00:00:00
Maximum: 2024-03-31 00:00:00
Histogram with fixed size bins (bins=50)

시간 (time)
Date

Distinct: 24
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2024-04-21 00:00:00
Maximum: 2024-04-21 23:00:00
Histogram with fixed size bins (bins=24)
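
The 시간 column holds only an hour of day (24 distinct values), while the Minimum and Maximum above carry 2024-04-21, the date the analysis ran. That date is presumably a parser default rather than part of the data. For time-series work the two columns are usually merged into one timestamp. The sketch below does this with the standard library, assuming 날짜 is formatted as `YYYY-MM-DD` and 시간 as `HH:MM`, as in the sample rows; the helper name `combine` is illustrative.

```python
from datetime import date, datetime


def combine(date_text: str, time_text: str) -> datetime:
    """Merge a 날짜 string (YYYY-MM-DD) and a 시간 string (HH:MM) into one timestamp."""
    day = date.fromisoformat(date_text)
    hour_of_day = datetime.strptime(time_text, "%H:%M").time()
    return datetime.combine(day, hour_of_day)


# Values taken from the first sample row of the report.
ts = combine("2024-01-11", "18:00")
print(ts.isoformat())  # 2024-01-11T18:00:00
```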

유량 (flow rate)
Real number (ℝ)

Distinct: 503
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 97.0606
Minimum: 0
Maximum: 881
Zeros: 60
Zeros (%): 0.6%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 7
Q1: 34
Median: 70
Q3: 134
95-th percentile: 261
Maximum: 881
Range: 881
Interquartile range (IQR): 100

Descriptive statistics

Standard deviation: 92.565755
Coefficient of variation (CV): 0.95369032
Kurtosis: 8.9947813
Mean: 97.0606
Median Absolute Deviation (MAD): 44
Skewness: 2.3470072
Sum: 970606
Variance: 8568.419
Monotonicity: Not monotonic
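
Several of the 유량 figures above are derived from the others: the range and IQR from the quantiles, the CV and variance from the mean and standard deviation, and the sum from the mean and row count. The sketch below recomputes them from the reported values as a consistency check; it uses only the numbers printed in this report, not the underlying data.

```python
# Values copied from the 유량 statistics above.
MEAN = 97.0606
STD = 92.565755
MINIMUM, Q1, Q3, MAXIMUM = 0, 34, 134, 881
N_OBSERVATIONS = 10_000

derived = {
    "range": MAXIMUM - MINIMUM,    # reported: 881
    "iqr": Q3 - Q1,                # reported: 100
    "cv": STD / MEAN,              # reported: 0.95369032
    "variance": STD ** 2,          # reported: 8568.419
    "sum": MEAN * N_OBSERVATIONS,  # reported: 970606
}

for name, value in derived.items():
    print(f"{name}: {value:.8g}")
```

A CV close to 1 together with a mean (97.06) well above the median (70) matches the positive skewness the report lists.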

Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
6 | 107 | 1.1%
5 | 101 | 1.0%
28 | 98 | 1.0%
33 | 98 | 1.0%
12 | 95 | 0.9%
63 | 91 | 0.9%
11 | 85 | 0.9%
30 | 84 | 0.8%
7 | 84 | 0.8%
40 | 83 | 0.8%
Other values (493) | 9074 | 90.7%

Minimum 10 values
Value | Count | Frequency (%)
0 | 60 | 0.6%
1 | 22 | 0.2%
2 | 33 | 0.3%
3 | 63 | 0.6%
4 | 60 | 0.6%
5 | 101 | 1.0%
6 | 107 | 1.1%
7 | 84 | 0.8%
8 | 67 | 0.7%
9 | 71 | 0.7%

Maximum 10 values
Value | Count | Frequency (%)
881 | 1 | < 0.1%
874 | 1 | < 0.1%
851 | 1 | < 0.1%
825 | 1 | < 0.1%
802 | 1 | < 0.1%
782 | 1 | < 0.1%
769 | 1 | < 0.1%
734 | 2 | < 0.1%
732 | 1 | < 0.1%
717 | 1 | < 0.1%

단위 (unit)
Categorical

CONSTANT

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

m³/h: 10000

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: m³/h
2nd row: m³/h
3rd row: m³/h
4th row: m³/h
5th row: m³/h

Common Values

Value | Count | Frequency (%)
m³/h | 10000 | 100.0%

Length

Histogram of lengths of the category

Interactions

[Four interaction plots; images not preserved.]

Correlations

Matrix 1

 | 중구역번호 | 장소명 | 주소 | 날짜 | 시간 | 유량
중구역번호 | 1.000 | 1.000 | 1.000 | 0.000 | 0.028 | 0.291
장소명 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.702
주소 | 1.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.700
날짜 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000
시간 | 0.028 | 0.000 | 0.000 | 0.000 | 1.000 | 0.292
유량 | 0.291 | 0.702 | 0.700 | 0.000 | 0.292 | 1.000

Matrix 2

 | 장소명 | 주소
장소명 | 1.000 | 1.000
주소 | 1.000 | 1.000

Matrix 3

 | 중구역번호 | 유량 | 장소명 | 주소
중구역번호 | 1.000 | -0.001 | 0.998 | 0.998
유량 | -0.001 | 1.000 | 0.324 | 0.323
장소명 | 0.998 | 0.324 | 1.000 | 1.000
주소 | 0.998 | 0.323 | 1.000 | 1.000
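
The High correlation alerts can be related to the first matrix by listing the variable pairs whose coefficient exceeds a cutoff. The sketch below does this with a cutoff of 0.9. That value is a hypothetical choice for illustration: the report does not state the threshold it used, although 유량 is not flagged despite coefficients near 0.70, which suggests the actual cutoff lies above that level.

```python
from itertools import combinations

# First correlation matrix, copied from the report.
LABELS = ["중구역번호", "장소명", "주소", "날짜", "시간", "유량"]
MATRIX = [
    [1.000, 1.000, 1.000, 0.000, 0.028, 0.291],
    [1.000, 1.000, 1.000, 0.000, 0.000, 0.702],
    [1.000, 1.000, 1.000, 0.000, 0.000, 0.700],
    [0.000, 0.000, 0.000, 1.000, 0.000, 0.000],
    [0.028, 0.000, 0.000, 0.000, 1.000, 0.292],
    [0.291, 0.702, 0.700, 0.000, 0.292, 1.000],
]

THRESHOLD = 0.9  # hypothetical cutoff, not taken from the report


def strong_pairs(labels, matrix, threshold):
    """Return (a, b, value) for each unordered pair with |value| >= threshold."""
    return [
        (labels[i], labels[j], matrix[i][j])
        for i, j in combinations(range(len(labels)), 2)
        if abs(matrix[i][j]) >= threshold
    ]


pairs = strong_pairs(LABELS, MATRIX, THRESHOLD)
for a, b, value in pairs:
    print(f"{a} ~ {b}: {value:.3f}")
```

With this cutoff, the three pairs returned involve only 중구역번호, 장소명, and 주소, which are the variables named in the Alerts section.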

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 중구역번호 | 장소명 | 주소 | 날짜 | 시간 | 유량 | 단위
83250 | 226 | K2(0226) | 대구광역시 동구 방촌동 858-9 | 2024-01-11 | 18:00 | 90 | m³/h
40300 | 205 | 불로시장(0205) | 대구광역시 동구 불로동 460-12 | 2024-02-11 | 04:00 | 28 | m³/h
21563 | 111 | 경대병원(0111) | 대구광역시 중구 삼덕동3가 379 | 2024-03-20 | 11:00 | 180 | m³/h
18734 | 110 | 국채보상로(0110) | 대구광역시 중구 삼덕동2가 39 | 2024-02-22 | 14:00 | 85 | m³/h
43092 | 206 | 해서초등(0206) | 대구광역시 동구 불로동 598-3 | 2024-03-07 | 12:00 | 19 | m³/h
23173 | 112 | 보성황실(0112) | 대구광역시 중구 남산동 2998-4 | 2024-02-25 | 13:00 | 198 | m³/h
28452 | 115 | 방천시장(0115) | 대구광역시 중구 대봉동 121-397 | 2024-01-03 | 12:00 | 0 | m³/h
30753 | 201 | 지묘배수지(0201) | 대구광역시 동구 지묘동 52-10 | 2024-01-08 | 09:00 | 224 | m³/h
48709 | 209 | 동구청(0209) | 대구광역시 동구 신암동 13-5 | 2024-01-28 | 13:00 | 197 | m³/h
5574 | 104 | 동아쇼핑(0104) | 대구광역시 중구 동일동 36 | 2024-02-20 | 06:00 | 57 | m³/h

(index) | 중구역번호 | 장소명 | 주소 | 날짜 | 시간 | 유량 | 단위
31219 | 201 | 지묘배수지(0201) | 대구광역시 동구 지묘동 52-10 | 2024-01-27 | 19:00 | 424 | m³/h
87433 | 228 | 영남네오빌(0228) | 대구광역시 동구 방촌동 858-9 | 2024-01-04 | 01:00 | 33 | m³/h
82639 | 225 | 원부가압장(0225) | 대구광역시 동구 부동 595-0 | 2024-03-17 | 07:00 | 6 | m³/h
46867 | 208 | 대구공항(0208) | 대구광역시 동구 지저동 872-6 | 2024-02-11 | 19:00 | 179 | m³/h
49286 | 209 | 동구청(0209) | 대구광역시 동구 신암동 13-5 | 2024-02-21 | 14:00 | 155 | m³/h
46181 | 208 | 대구공항(0208) | 대구광역시 동구 지저동 872-6 | 2024-01-14 | 05:00 | 63 | m³/h
22505 | 112 | 보성황실(0112) | 대구광역시 중구 남산동 2998-4 | 2024-01-28 | 17:00 | 205 | m³/h
5240 | 104 | 동아쇼핑(0104) | 대구광역시 중구 동일동 36 | 2024-02-06 | 08:00 | 77 | m³/h
76487 | 223 | 효목태왕매트로(0223) | 대구광역시 동구 효목동 647 | 2024-01-02 | 23:00 | 23 | m³/h
81517 | 225 | 원부가압장(0225) | 대구광역시 동구 부동 595-0 | 2024-01-30 | 13:00 | 10 | m³/h