Overview

Dataset statistics

Number of variables: 6
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 576.2 KiB
Average record size in memory: 59.0 B

Variable types

Numeric: 2
Categorical: 3
DateTime: 1

Dataset

Description: File download
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-1168/S/1/datasetView.do

Alerts

강우량계명 is highly overall correlated with 강우량계 코드 and 2 other fields (High correlation)
구청명 is highly overall correlated with 강우량계 코드 and 2 other fields (High correlation)
강우량계 코드 is highly overall correlated with 구청 코드 and 2 other fields (High correlation)
구청 코드 is highly overall correlated with 강우량계 코드 and 2 other fields (High correlation)
10분우량 is highly imbalanced (97.6%) (Imbalance)
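
The 97.6% imbalance figure for 10분우량 can be reproduced from the two class counts reported for that variable (9976 and 24). The sketch below assumes the score is one minus the Shannon entropy of the class frequencies, normalized by the maximum entropy for that number of classes. The function name `imbalance_score` is made up for illustration and is not the profiler's API.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / H_max, with H the Shannon entropy of the class counts."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        # Convention chosen for this sketch: a single class scores 0.
        return 0.0
    # Entropy in bits; zero-count classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    # log2(n_classes) is the entropy of a perfectly balanced variable.
    return 1.0 - entropy / math.log2(n_classes)


# Class counts for 10분우량 taken from the report: 0.0 -> 9976, 0.5 -> 24.
score = imbalance_score([9976, 24])
print(f"{score:.1%}")
```

A perfectly balanced variable scores 0 under this definition, and the score approaches 1 as a single class dominates.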

Reproduction

Analysis started: 2023-12-11 08:25:40.300984
Analysis finished: 2023-12-11 08:25:41.395618
Duration: 1.09 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

강우량계 코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 47
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1300.3674
Minimum: 101
Maximum: 2502
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 101
5-th percentile: 103
Q1: 701
Median: 1302
Q3: 1901
95-th percentile: 2402
Maximum: 2502
Range: 2401
Interquartile range (IQR): 1200

Descriptive statistics

Standard deviation: 727.66567
Coefficient of variation (CV): 0.55958468
Kurtosis: -1.1675442
Mean: 1300.3674
Median Absolute Deviation (MAD): 600
Skewness: -0.016978015
Sum: 13003674
Variance: 529497.33
Monotonicity: Not monotonic
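
Several of the statistics above are derived from others, which gives a quick way to sanity-check the block. The following sketch recomputes the mean, coefficient of variation, variance, range and IQR for 강우량계 코드 from the reported values; the variable names are illustrative only.

```python
import math

# Values copied from the report for 강우량계 코드.
n_observations = 10000
total = 13003674
std = 727.66567
minimum, maximum = 101, 2502
q1, q3 = 701, 1901

mean = total / n_observations   # mean = sum / count
cv = std / mean                 # coefficient of variation = std / mean
variance = std ** 2             # variance = std squared
value_range = maximum - minimum
iqr = q3 - q1

# Compare against the figures printed in the report.
checks = {
    "mean": math.isclose(mean, 1300.3674, rel_tol=1e-9),
    "cv": math.isclose(cv, 0.55958468, rel_tol=1e-6),
    "variance": math.isclose(variance, 529497.33, rel_tol=1e-6),
    "range": value_range == 2401,
    "iqr": iqr == 1200,
}
print(checks)
```

All five checks agree with the reported figures to the printed precision.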
Histogram with fixed size bins (bins=47)
Value | Count | Frequency (%)
1601 | 246 | 2.5%
1101 | 242 | 2.4%
1801 | 240 | 2.4%
1302 | 236 | 2.4%
1401 | 236 | 2.4%
1701 | 235 | 2.4%
301 | 234 | 2.3%
601 | 227 | 2.3%
802 | 225 | 2.2%
401 | 225 | 2.2%
Other values (37) | 7654 | 76.5%

Value | Count | Frequency (%)
101 | 198 | 2.0%
102 | 207 | 2.1%
103 | 187 | 1.9%
201 | 217 | 2.2%
202 | 207 | 2.1%
301 | 234 | 2.3%
401 | 225 | 2.2%
402 | 200 | 2.0%
501 | 183 | 1.8%
601 | 227 | 2.3%

Value | Count | Frequency (%)
2502 | 197 | 2.0%
2501 | 219 | 2.2%
2402 | 201 | 2.0%
2401 | 224 | 2.2%
2302 | 215 | 2.1%
2301 | 213 | 2.1%
2202 | 196 | 2.0%
2201 | 207 | 2.1%
2102 | 223 | 2.2%
2101 | 213 | 2.1%

강우량계명
Categorical

HIGH CORRELATION 

Distinct: 47
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
용산구청: 246
중구청: 242
양천구청: 240
증산P: 236
서대문구청: 236
Other values (42): 8800

Length

Max length: 5
Median length: 4
Mean length: 3.7787
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서초구청
2nd row: 중랑구청
3rd row: 마천2동
4th row: 봉원P
5th row: 중랑구청

Common Values

Value | Count | Frequency (%)
용산구청 | 246 | 2.5%
중구청 | 242 | 2.4%
양천구청 | 240 | 2.4%
증산P | 236 | 2.4%
서대문구청 | 236 | 2.4%
강서구청 | 235 | 2.4%
도봉구청 | 234 | 2.3%
성북구청 | 227 | 2.3%
노원구청 | 225 | 2.2%
휘경P | 225 | 2.2%
Other values (37) | 7654 | 76.5%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
용산구청 | 246 | 2.5%
중구청 | 242 | 2.4%
양천구청 | 240 | 2.4%
증산p | 236 | 2.4%
서대문구청 | 236 | 2.4%
강서구청 | 235 | 2.4%
도봉구청 | 234 | 2.3%
성북구청 | 227 | 2.3%
노원구청 | 225 | 2.2%
휘경p | 225 | 2.2%
Other values (37) | 7654 | 76.5%

구청 코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 25
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 112.9883
Minimum: 101
Maximum: 125
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 101
5-th percentile: 101
Q1: 107
Median: 113
Q3: 119
95-th percentile: 124
Maximum: 125
Range: 24
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 7.2771463
Coefficient of variation (CV): 0.064406194
Kurtosis: -1.1674804
Mean: 112.9883
Median Absolute Deviation (MAD): 6
Skewness: -0.016928599
Sum: 1129883
Variance: 52.956859
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)
Value | Count | Frequency (%)
113 | 621 | 6.2%
101 | 592 | 5.9%
118 | 460 | 4.6%
111 | 457 | 4.6%
117 | 449 | 4.5%
106 | 446 | 4.5%
116 | 444 | 4.4%
110 | 443 | 4.4%
121 | 436 | 4.4%
108 | 436 | 4.4%
Other values (15) | 5216 | 52.2%

Value | Count | Frequency (%)
101 | 592 | 5.9%
102 | 424 | 4.2%
103 | 234 | 2.3%
104 | 425 | 4.2%
105 | 183 | 1.8%
106 | 446 | 4.5%
107 | 424 | 4.2%
108 | 436 | 4.4%
109 | 414 | 4.1%
110 | 443 | 4.4%

Value | Count | Frequency (%)
125 | 416 | 4.2%
124 | 425 | 4.2%
123 | 428 | 4.3%
122 | 403 | 4.0%
121 | 436 | 4.4%
120 | 205 | 2.1%
119 | 374 | 3.7%
118 | 460 | 4.6%
117 | 449 | 4.5%
116 | 444 | 4.4%

구청명
Categorical

HIGH CORRELATION 

Distinct: 25
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
은평구: 621
강남구: 592
양천구: 460
중구: 457
강서구: 449
Other values (20): 7421

Length

Max length: 4
Median length: 3
Mean length: 3.0589
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서초구
2nd row: 중랑구
3rd row: 송파구
4th row: 마포구
5th row: 중랑구

Common Values

Value | Count | Frequency (%)
은평구 | 621 | 6.2%
강남구 | 592 | 5.9%
양천구 | 460 | 4.6%
중구 | 457 | 4.6%
강서구 | 449 | 4.5%
성북구 | 446 | 4.5%
용산구 | 444 | 4.4%
종로구 | 443 | 4.4%
동작구 | 436 | 4.4%
동대문구 | 436 | 4.4%
Other values (15) | 5216 | 52.2%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
은평구 | 621 | 6.2%
강남구 | 592 | 5.9%
양천구 | 460 | 4.6%
중구 | 457 | 4.6%
강서구 | 449 | 4.5%
성북구 | 446 | 4.5%
용산구 | 444 | 4.4%
종로구 | 443 | 4.4%
동작구 | 436 | 4.4%
동대문구 | 436 | 4.4%
Other values (15) | 5216 | 52.2%

10분우량
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
0.0: 9976
0.5: 24

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0.0
2nd row: 0.0
3rd row: 0.0
4th row: 0.0
5th row: 0.0

Common Values

Value | Count | Frequency (%)
0.0 | 9976 | 99.8%
0.5 | 24 | 0.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0.0 | 9976 | 99.8%
0.5 | 24 | 0.2%

자료수집 시각
DateTime

Distinct: 2153
Distinct (%): 21.5%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Minimum: 2022-04-01 00:09:00
Maximum: 2022-04-16 00:19:00
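
If the gauges report on a regular 10-minute cadence, the minimum and maximum above bound how many distinct timestamps the column could contain. That cadence is an assumption, suggested by the column name 10분우량 ("10-minute rainfall") and by the sampled timestamps; the report does not state it. The sketch below counts the possible slots and compares the result with the reported distinct count of 2153.

```python
from datetime import datetime, timedelta

# Bounds and distinct count copied from the report.
first = datetime(2022, 4, 1, 0, 9)
last = datetime(2022, 4, 16, 0, 19)
observed_distinct = 2153

# Assumed collection interval (not stated in the report).
step = timedelta(minutes=10)

# Number of 10-minute slots from first to last, both ends included.
expected_slots = (last - first) // step + 1
missing_slots = expected_slots - observed_distinct

print(expected_slots, missing_slots)
```

Under this assumption only a handful of the possible timestamps are absent from the 10000 profiled rows.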
Histogram with fixed size bins (bins=50)

Interactions


Correlations

Variable | 강우량계 코드 | 강우량계명 | 구청 코드 | 구청명 | 10분우량
강우량계 코드 | 1.000 | 1.000 | 1.000 | 1.000 | 0.031
강우량계명 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000
구청 코드 | 1.000 | 1.000 | 1.000 | 1.000 | 0.028
구청명 | 1.000 | 1.000 | 1.000 | 1.000 | 0.028
10분우량 | 0.031 | 0.000 | 0.028 | 0.028 | 1.000

Variable | 10분우량 | 강우량계명 | 구청명
10분우량 | 1.000 | 0.000 | 0.024
강우량계명 | 0.000 | 1.000 | 0.999
구청명 | 0.024 | 0.999 | 1.000

Variable | 강우량계 코드 | 구청 코드 | 강우량계명 | 구청명 | 10분우량
강우량계 코드 | 1.000 | 0.999 | 0.998 | 0.984 | 0.024
구청 코드 | 0.999 | 1.000 | 0.998 | 0.999 | 0.021
강우량계명 | 0.998 | 0.998 | 1.000 | 0.999 | 0.000
구청명 | 0.984 | 0.999 | 0.999 | 1.000 | 0.024
10분우량 | 0.024 | 0.021 | 0.000 | 0.024 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index | 강우량계 코드 | 강우량계명 | 구청 코드 | 구청명 | 10분우량 | 자료수집 시각
33045 | 2401 | 서초구청 | 124 | 서초구 | 0.0 | 2022-04-05 21:49
45003 | 701 | 중랑구청 | 107 | 중랑구 | 0.0 | 2022-04-07 17:19
70471 | 2502 | 마천2동 | 125 | 송파구 | 0.0 | 2022-04-11 12:49
58635 | 1502 | 봉원P | 115 | 마포구 | 0.0 | 2022-04-09 18:09
51282 | 701 | 중랑구청 | 107 | 중랑구 | 0.0 | 2022-04-08 15:39
81095 | 201 | 강동구청 | 102 | 강동구 | 0.5 | 2022-04-13 02:49
95349 | 602 | 상월곡동 | 106 | 성북구 | 0.0 | 2022-04-15 07:19
67973 | 1902 | 도림2동P | 119 | 영등포구 | 0.0 | 2022-04-11 03:49
45075 | 1701 | 강서구청 | 117 | 강서구 | 0.0 | 2022-04-07 17:29
28321 | 1701 | 강서구청 | 117 | 강서구 | 0.0 | 2022-04-05 04:39

Index | 강우량계 코드 | 강우량계명 | 구청 코드 | 구청명 | 10분우량 | 자료수집 시각
80726 | 1501 | 마포구청 | 115 | 마포구 | 0.0 | 2022-04-13 01:29
43583 | 1401 | 서대문구청 | 114 | 서대문구 | 0.0 | 2022-04-07 12:09
59273 | 402 | 상계1동 | 104 | 노원구 | 0.0 | 2022-04-09 20:29
97663 | 2502 | 마천2동 | 125 | 송파구 | 0.0 | 2022-04-15 15:39
14200 | 401 | 노원구청 | 104 | 노원구 | 0.0 | 2022-04-03 02:29
13533 | 2102 | 흑석P | 121 | 동작구 | 0.0 | 2022-04-02 23:59
18332 | 1303 | 갈현1동 | 113 | 은평구 | 0.0 | 2022-04-03 17:09
7897 | 102 | 세곡동 | 101 | 강남구 | 0.0 | 2022-04-02 04:09
53226 | 401 | 노원구청 | 104 | 노원구 | 0.0 | 2022-04-08 22:39
18443 | 1101 | 중구청 | 111 | 중구 | 0.0 | 2022-04-03 17:29
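
The near-perfect correlations among the code and name columns are consistent with a pattern visible in the sample rows: the district code (구청 코드) equals 100 plus the leading digits of the gauge code (강우량계 코드), so each gauge maps to exactly one district. The sketch below checks this for the 20 sample rows only; whether it holds for the full dataset is an assumption, and the helper name `district_from_gauge` is hypothetical.

```python
# (강우량계 코드, 구청 코드) pairs copied from the 20 sample rows.
sample_pairs = [
    (2401, 124), (701, 107), (2502, 125), (1502, 115), (701, 107),
    (201, 102), (602, 106), (1902, 119), (1701, 117), (1701, 117),
    (1501, 115), (1401, 114), (402, 104), (2502, 125), (401, 104),
    (2102, 121), (1303, 113), (102, 101), (401, 104), (1101, 111),
]


def district_from_gauge(gauge_code):
    """Guess the district code: drop the last two digits, then add 100."""
    return 100 + gauge_code // 100


# Collect any sample rows that break the guessed rule.
mismatches = [
    (gauge, district)
    for gauge, district in sample_pairs
    if district_from_gauge(gauge) != district
]
print(len(mismatches))
```

Because one column determines the other in this way, the high-correlation alerts describe redundant identifiers rather than an independent statistical relationship.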