Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 576.2 KiB
Average record size in memory: 59.0 B
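
The average record size follows from the total memory figure divided by the number of observations. A minimal sketch of that arithmetic, assuming 1 KiB = 1024 bytes and using only the two figures reported above (the variable names are illustrative):

```python
# Figures taken from the dataset statistics above.
KIB = 1024                 # bytes per KiB (assumed binary unit)
total_size_kib = 576.2     # "Total size in memory"
n_observations = 10_000    # "Number of observations"

# Average bytes per row = total bytes / number of rows.
avg_record_bytes = total_size_kib * KIB / n_observations
print(f"{avg_record_bytes:.1f} B")
```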

Variable types

Numeric: 2
Categorical: 3
DateTime: 1

Dataset

Description: File download (파일 다운로드)
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-1168/S/1/datasetView.do

Alerts

High correlation: 강우량계명 is highly correlated overall with 강우량계 코드 and 2 other fields
High correlation: 구청명 is highly correlated overall with 강우량계 코드 and 2 other fields
High correlation: 강우량계 코드 is highly correlated overall with 구청 코드 and 2 other fields
High correlation: 구청 코드 is highly correlated overall with 강우량계 코드 and 2 other fields
Imbalance: 10분우량 is highly imbalanced (97.0%)
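
The 97.0% imbalance figure is consistent with an entropy-based score: one minus the Shannon entropy of the class frequencies, normalized by the maximum entropy for that number of classes. The sketch below is an assumption about how the figure can be reproduced, not a statement of the tool's exact implementation. It uses the two value counts reported for 10분우량 (9969 and 31), and the function name is hypothetical.

```python
from math import log2

def imbalance_score(counts):
    """Return 1 - normalized Shannon entropy of the class counts.

    0 means perfectly balanced; values near 1 mean one class dominates.
    Returns 0.0 when there are fewer than two classes.
    """
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    total = sum(counts)
    entropy = -sum((c / total) * log2(c / total) for c in counts if c > 0)
    return 1 - entropy / log2(n_classes)

# Value counts for 10분우량: 9969 rows of 0.0 and 31 rows of 0.5.
score = imbalance_score([9969, 31])
print(f"{score:.1%}")
```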

Reproduction

Analysis started: 2023-12-11 08:24:45.748982
Analysis finished: 2023-12-11 08:24:47.273965
Duration: 1.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
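
The reported duration can be checked against the two timestamps. A small sketch, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2023-12-11 08:24:45.748982")
finished = datetime.fromisoformat("2023-12-11 08:24:47.273965")

# Elapsed wall-clock time in seconds.
duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```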

Variables

강우량계 코드 (rain gauge code)
Real number (ℝ)

HIGH CORRELATION

Distinct: 47
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1394.3975
Minimum: 101
Maximum: 2502
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 101
5-th percentile: 202
Q1: 801
Median: 1501
Q3: 2002
95-th percentile: 2402
Maximum: 2502
Range: 2401
Interquartile range (IQR): 1201

Descriptive statistics

Standard deviation: 708.19166
Coefficient of variation (CV): 0.50788363
Kurtosis: -1.1950043
Mean: 1394.3975
Median Absolute Deviation (MAD): 600
Skewness: -0.13794882
Sum: 13943975
Variance: 501535.43
Monotonicity: Not monotonic
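
Several of the figures for 강우량계 코드 are derived from others in the same block, so they can be cross-checked with basic arithmetic. The sketch below recomputes the mean, coefficient of variation, variance, range, and IQR from the reported sum, standard deviation, quartiles, and extremes. Variable names are illustrative, and small differences are expected because the inputs are rounded.

```python
# Reported figures for 강우량계 코드.
n = 10_000
total = 13_943_975
std = 708.19166
q1, q3 = 801, 2002
minimum, maximum = 101, 2502

mean = total / n                   # mean = sum / count
cv = std / mean                    # coefficient of variation
variance = std ** 2                # variance = standard deviation squared
value_range = maximum - minimum    # range = max - min
iqr = q3 - q1                      # interquartile range = Q3 - Q1

print(mean, round(cv, 8), round(variance, 2), value_range, iqr)
```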
Histogram with fixed size bins (bins=47)

Common values

Value | Count | Frequency (%)
2201 | 278 | 2.8%
2202 | 272 | 2.7%
2001 | 268 | 2.7%
2002 | 267 | 2.7%
801 | 266 | 2.7%
1303 | 265 | 2.6%
2102 | 261 | 2.6%
602 | 257 | 2.6%
802 | 256 | 2.6%
2502 | 255 | 2.5%
Other values (37) | 7355 | 73.6%

Minimum 10 values

Value | Count | Frequency (%)
101 | 45 | 0.4%
102 | 46 | 0.5%
103 | 45 | 0.4%
201 | 245 | 2.5%
202 | 238 | 2.4%
301 | 217 | 2.2%
401 | 235 | 2.4%
402 | 251 | 2.5%
501 | 235 | 2.4%
601 | 236 | 2.4%

Maximum 10 values

Value | Count | Frequency (%)
2502 | 255 | 2.5%
2501 | 230 | 2.3%
2402 | 173 | 1.7%
2401 | 211 | 2.1%
2302 | 237 | 2.4%
2301 | 237 | 2.4%
2202 | 272 | 2.7%
2201 | 278 | 2.8%
2102 | 261 | 2.6%
2101 | 205 | 2.1%

강우량계명 (rain gauge name)
Categorical

HIGH CORRELATION

Distinct: 47
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

금천구청: 278
가산2P: 272
구로구청: 268
개봉2동: 267
동대문구청: 266
Other values (42): 8649

Length

Max length: 5
Median length: 4
Mean length: 3.8149
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 상계1동
2nd row: 반포P
3rd row: 고덕2동
4th row: 목동P
5th row: 한남P

Common Values

Value | Count | Frequency (%)
금천구청 | 278 | 2.8%
가산2P | 272 | 2.7%
구로구청 | 268 | 2.7%
개봉2동 | 267 | 2.7%
동대문구청 | 266 | 2.7%
갈현1동 | 265 | 2.6%
흑석P | 261 | 2.6%
상월곡동 | 257 | 2.6%
휘경P | 256 | 2.6%
마천2동 | 255 | 2.5%
Other values (37) | 7355 | 73.6%

Length

Histogram of lengths of the category

구청 코드 (district office code)
Real number (ℝ)

HIGH CORRELATION

Distinct: 25
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 113.9289
Minimum: 101
Maximum: 125
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 101
5-th percentile: 102
Q1: 108
Median: 115
Q3: 120
95-th percentile: 124
Maximum: 125
Range: 24
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 7.0816001
Coefficient of variation (CV): 0.062158066
Kurtosis: -1.1948503
Mean: 113.9289
Median Absolute Deviation (MAD): 6
Skewness: -0.13786989
Sum: 1139289
Variance: 50.14906
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)

Common values

Value | Count | Frequency (%)
113 | 748 | 7.5%
122 | 550 | 5.5%
120 | 535 | 5.3%
108 | 522 | 5.2%
106 | 493 | 4.9%
104 | 486 | 4.9%
125 | 485 | 4.9%
102 | 483 | 4.8%
115 | 482 | 4.8%
123 | 474 | 4.7%
Other values (15) | 4742 | 47.4%

Minimum 10 values

Value | Count | Frequency (%)
101 | 136 | 1.4%
102 | 483 | 4.8%
103 | 217 | 2.2%
104 | 486 | 4.9%
105 | 235 | 2.4%
106 | 493 | 4.9%
107 | 213 | 2.1%
108 | 522 | 5.2%
109 | 474 | 4.7%
110 | 376 | 3.8%

Maximum 10 values

Value | Count | Frequency (%)
125 | 485 | 4.9%
124 | 384 | 3.8%
123 | 474 | 4.7%
122 | 550 | 5.5%
121 | 466 | 4.7%
120 | 535 | 5.3%
119 | 269 | 2.7%
118 | 457 | 4.6%
117 | 467 | 4.7%
116 | 461 | 4.6%

구청명 (district office name)
Categorical

HIGH CORRELATION

Distinct: 25
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

은평구: 748
금천구: 550
구로구: 535
동대문구: 522
성북구: 493
Other values (20): 7152

Length

Max length: 4
Median length: 3
Mean length: 3.0842
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 노원구
2nd row: 서초구
3rd row: 강동구
4th row: 양천구
5th row: 용산구

Common Values

Value | Count | Frequency (%)
은평구 | 748 | 7.5%
금천구 | 550 | 5.5%
구로구 | 535 | 5.3%
동대문구 | 522 | 5.2%
성북구 | 493 | 4.9%
노원구 | 486 | 4.9%
송파구 | 485 | 4.9%
강동구 | 483 | 4.8%
마포구 | 482 | 4.8%
성동구 | 474 | 4.7%
Other values (15) | 4742 | 47.4%

Length

Histogram of lengths of the category

10분우량 (10-minute rainfall)
Categorical

IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

0.0: 9969
0.5: 31

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0.0
2nd row: 0.0
3rd row: 0.0
4th row: 0.0
5th row: 0.0

Common Values

Value | Count | Frequency (%)
0.0 | 9969 | 99.7%
0.5 | 31 | 0.3%

Length

Histogram of lengths of the category

Common Values (Plot)


자료수집 시각 (data collection time)
DateTime

Distinct: 2432
Distinct (%): 24.3%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2021-02-01 00:09:00
Maximum: 2021-02-17 23:29:00
Histogram with fixed size bins (bins=50)
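
The distinct-timestamp count can be put in context by comparing it with the number of collection slots between the minimum and maximum. The sketch below assumes a 10-minute collection cadence. That cadence is an assumption inferred from the field name 10분우량 and from the sample timestamps all falling on minute 9, 19, 29, and so on. It is not stated in the report.

```python
from datetime import datetime, timedelta

# Minimum and maximum timestamps reported for the DateTime variable.
first = datetime.fromisoformat("2021-02-01 00:09:00")
last = datetime.fromisoformat("2021-02-17 23:29:00")
step = timedelta(minutes=10)   # assumed collection interval

span = last - first
possible_slots = span // step + 1   # inclusive count of 10-minute slots
observed_distinct = 2432            # "Distinct" figure from the report

coverage = observed_distinct / possible_slots
print(span, possible_slots, f"{coverage:.1%}")
```

Under that assumption the sampled rows touch nearly every slot in the period, which suggests the 10000 observations are spread across the whole time range rather than clustered in part of it.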

Interactions


Correlations

Variable | 강우량계 코드 | 강우량계명 | 구청 코드 | 구청명 | 10분우량
강우량계 코드 | 1.000 | 1.000 | 1.000 | 0.999 | 0.015
강우량계명 | 1.000 | 1.000 | 1.000 | 1.000 | 0.048
구청 코드 | 1.000 | 1.000 | 1.000 | 1.000 | 0.024
구청명 | 0.999 | 1.000 | 1.000 | 1.000 | 0.030
10분우량 | 0.015 | 0.048 | 0.024 | 0.030 | 1.000

Variable | 10분우량 | 강우량계명 | 구청명
10분우량 | 1.000 | 0.040 | 0.026
강우량계명 | 0.040 | 1.000 | 0.999
구청명 | 0.026 | 0.999 | 1.000

Variable | 강우량계 코드 | 구청 코드 | 강우량계명 | 구청명 | 10분우량
강우량계 코드 | 1.000 | 0.999 | 0.998 | 0.976 | 0.011
구청 코드 | 0.999 | 1.000 | 0.998 | 0.999 | 0.018
강우량계명 | 0.998 | 0.998 | 1.000 | 0.999 | 0.040
구청명 | 0.976 | 0.999 | 0.999 | 1.000 | 0.026
10분우량 | 0.011 | 0.018 | 0.040 | 0.026 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 강우량계 코드 | 강우량계명 | 구청 코드 | 구청명 | 10분우량 | 자료수집 시각
48040 | 402 | 상계1동 | 104 | 노원구 | 0.0 | 2021-02-08 13:59
58549 | 2402 | 반포P | 124 | 서초구 | 0.0 | 2021-02-10 08:59
26969 | 202 | 고덕2동 | 102 | 강동구 | 0.0 | 2021-02-05 03:39
13979 | 1802 | 목동P | 118 | 양천구 | 0.0 | 2021-02-03 04:19
34 | 1602 | 한남P | 116 | 용산구 | 0.0 | 2021-02-01 00:09
65763 | 801 | 동대문구청 | 108 | 동대문구 | 0.0 | 2021-02-11 14:19
12701 | 2301 | 관악구청 | 123 | 관악구 | 0.0 | 2021-02-02 23:29
26203 | 2302 | 신림P | 123 | 관악구 | 0.0 | 2021-02-05 00:49
32785 | 2102 | 흑석P | 121 | 동작구 | 0.0 | 2021-02-06 01:59
93796 | 2102 | 흑석P | 121 | 동작구 | 0.0 | 2021-02-16 17:39

Index | 강우량계 코드 | 강우량계명 | 구청 코드 | 구청명 | 10분우량 | 자료수집 시각
21790 | 901 | 성동구청 | 109 | 성동구 | 0.0 | 2021-02-04 08:49
67636 | 1303 | 갈현1동 | 113 | 은평구 | 0.0 | 2021-02-11 21:49
86208 | 301 | 도봉구청 | 103 | 도봉구 | 0.0 | 2021-02-15 05:19
384 | 2501 | 송파구청 | 125 | 송파구 | 0.0 | 2021-02-01 01:29
65281 | 1301 | 은평구청 | 113 | 은평구 | 0.0 | 2021-02-11 12:19
92963 | 501 | 강북구청 | 105 | 강북구 | 0.0 | 2021-02-16 13:39
11850 | 1301 | 은평구청 | 113 | 은평구 | 0.0 | 2021-02-02 20:29
32489 | 2401 | 서초구청 | 124 | 서초구 | 0.0 | 2021-02-06 00:49
25131 | 401 | 노원구청 | 104 | 노원구 | 0.0 | 2021-02-04 20:59
51925 | 1702 | 공항동P | 117 | 강서구 | 0.0 | 2021-02-09 05:59
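
The sample rows suggest why the four identifier fields trigger the high-correlation alerts: in every sampled row, 구청 코드 equals 100 plus the leading digits of 강우량계 코드 (for example, 2402 maps to 124). The sketch below checks that pattern on the 20 sample rows. The (강우량계 코드, 구청 코드) pairs are read from the sample rows above, where the digits are fused with the row index in the extracted text, so the split is an interpretation. The rule and the helper name are illustrative assumptions, not documented properties of the dataset.

```python
# (강우량계 코드, 구청 코드) pairs as read from the 20 sample rows.
pairs = [
    (402, 104), (2402, 124), (202, 102), (1802, 118), (1602, 116),
    (801, 108), (2301, 123), (2302, 123), (2102, 121), (2102, 121),
    (901, 109), (1303, 113), (301, 103), (2501, 125), (1301, 113),
    (501, 105), (1301, 113), (2401, 124), (401, 104), (1702, 117),
]

def district_from_gauge(gauge_code: int) -> int:
    """Hypothetical rule: drop the last two digits and add 100."""
    return 100 + gauge_code // 100

# Collect any sample rows that do not follow the rule.
mismatches = [(g, d) for g, d in pairs if district_from_gauge(g) != d]
print(mismatches)
```

If the rule holds across the full dataset, one code column is a deterministic function of the other, so the near-1.000 correlation coefficients reflect redundancy rather than an informative relationship.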