Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory576.2 KiB
Average record size in memory59.0 B

Variable types

Numeric2
Categorical3
DateTime1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-1168/S/1/datasetView.do

Alerts

구청명 is highly overall correlated with 강우량계 코드 and 2 other fieldsHigh correlation
강우량계명 is highly overall correlated with 강우량계 코드 and 2 other fieldsHigh correlation
강우량계 코드 is highly overall correlated with 구청 코드 and 2 other fieldsHigh correlation
구청 코드 is highly overall correlated with 강우량계 코드 and 2 other fieldsHigh correlation
10분우량 is highly imbalanced (98.2%)Imbalance

Reproduction

Analysis started2024-04-20 20:05:29.377141
Analysis finished2024-04-20 20:05:30.891196
Duration1.51 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

강우량계 코드
Real number (ℝ)

HIGH CORRELATION 

Distinct47
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1344.489
Minimum101
Maximum2502
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T05:05:31.015996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile103
Q1801
median1303
Q31902
95-th percentile2402
Maximum2502
Range2401
Interquartile range (IQR)1101

Descriptive statistics

Standard deviation721.25695
Coefficient of variation (CV)0.53645433
Kurtosis-1.1371742
Mean1344.489
Median Absolute Deviation (MAD)599
Skewness-0.095730273
Sum13444890
Variance520211.59
MonotonicityNot monotonic
2024-04-21T05:05:31.262543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
1801 252
 
2.5%
2302 241
 
2.4%
1902 240
 
2.4%
1802 239
 
2.4%
1702 237
 
2.4%
1001 237
 
2.4%
1701 234
 
2.3%
103 232
 
2.3%
1302 232
 
2.3%
2201 231
 
2.3%
Other values (37) 7625
76.2%
ValueCountFrequency (%)
101 215
2.1%
102 218
2.2%
103 232
2.3%
201 85
 
0.9%
202 80
 
0.8%
301 213
2.1%
401 201
2.0%
402 219
2.2%
501 229
2.3%
601 206
2.1%
ValueCountFrequency (%)
2502 219
2.2%
2501 224
2.2%
2402 223
2.2%
2401 210
2.1%
2302 241
2.4%
2301 223
2.2%
2202 216
2.2%
2201 231
2.3%
2102 225
2.2%
2101 209
2.1%

강우량계명
Categorical

HIGH CORRELATION 

Distinct47
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
양천구청
 
252
신림P
 
241
도림2동P
 
240
목동P
 
239
종로구청
 
237
Other values (42)
8791 

Length

Max length5
Median length4
Mean length3.7796
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row은평구청
2nd row송파구청
3rd row흑석P
4th row목동P
5th row강북구청

Common Values

ValueCountFrequency (%)
양천구청 252
 
2.5%
신림P 241
 
2.4%
도림2동P 240
 
2.4%
목동P 239
 
2.4%
종로구청 237
 
2.4%
공항동P 237
 
2.4%
강서구청 234
 
2.3%
개포2동 232
 
2.3%
증산P 232
 
2.3%
금천구청 231
 
2.3%
Other values (37) 7625
76.2%

Length

2024-04-21T05:05:31.503735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
양천구청 252
 
2.5%
신림p 241
 
2.4%
도림2동p 240
 
2.4%
목동p 239
 
2.4%
종로구청 237
 
2.4%
공항동p 237
 
2.4%
강서구청 234
 
2.3%
개포2동 232
 
2.3%
증산p 232
 
2.3%
금천구청 231
 
2.3%
Other values (37) 7625
76.2%

구청 코드
Real number (ℝ)

HIGH CORRELATION 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean113.4293
Minimum101
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-04-21T05:05:31.720098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile101
Q1108
median113
Q3119
95-th percentile124
Maximum125
Range24
Interquartile range (IQR)11

Descriptive statistics

Standard deviation7.2131134
Coefficient of variation (CV)0.063591271
Kurtosis-1.1370198
Mean113.4293
Median Absolute Deviation (MAD)6
Skewness-0.095729555
Sum1134293
Variance52.029004
MonotonicityNot monotonic
2024-04-21T05:05:31.941808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
113 667
 
6.7%
101 665
 
6.7%
118 491
 
4.9%
117 471
 
4.7%
123 464
 
4.6%
119 459
 
4.6%
110 452
 
4.5%
122 447
 
4.5%
116 447
 
4.5%
111 444
 
4.4%
Other values (15) 4993
49.9%
ValueCountFrequency (%)
101 665
6.7%
102 165
 
1.7%
103 213
 
2.1%
104 420
4.2%
105 229
 
2.3%
106 399
4.0%
107 402
4.0%
108 421
4.2%
109 398
4.0%
110 452
4.5%
ValueCountFrequency (%)
125 443
4.4%
124 433
4.3%
123 464
4.6%
122 447
4.5%
121 434
4.3%
120 217
2.2%
119 459
4.6%
118 491
4.9%
117 471
4.7%
116 447
4.5%

구청명
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
은평구
 
667
강남구
 
665
양천구
 
491
강서구
 
471
관악구
 
464
Other values (20)
7242 

Length

Max length4
Median length3
Mean length3.0629
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row은평구
2nd row송파구
3rd row동작구
4th row양천구
5th row강북구

Common Values

ValueCountFrequency (%)
은평구 667
 
6.7%
강남구 665
 
6.7%
양천구 491
 
4.9%
강서구 471
 
4.7%
관악구 464
 
4.6%
영등포구 459
 
4.6%
종로구 452
 
4.5%
용산구 447
 
4.5%
금천구 447
 
4.5%
중구 444
 
4.4%
Other values (15) 4993
49.9%

Length

2024-04-21T05:05:32.178636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
은평구 667
 
6.7%
강남구 665
 
6.7%
양천구 491
 
4.9%
강서구 471
 
4.7%
관악구 464
 
4.6%
영등포구 459
 
4.6%
종로구 452
 
4.5%
용산구 447
 
4.5%
금천구 447
 
4.5%
중구 444
 
4.4%
Other values (15) 4993
49.9%

10분우량
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0.0
9972 
0.5
 
25
1.0
 
3

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row0.0
3rd row0.0
4th row0.0
5th row0.0

Common Values

ValueCountFrequency (%)
0.0 9972
99.7%
0.5 25
 
0.2%
1.0 3
 
< 0.1%

Length

2024-04-21T05:05:32.392593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-21T05:05:32.554212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0.0 9972
99.7%
0.5 25
 
0.2%
1.0 3
 
< 0.1%
Distinct2201
Distinct (%)22.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2021-12-01 00:09:00
Maximum2021-12-16 22:59:00
2024-04-21T05:05:32.735016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T05:05:32.970882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-04-21T05:05:30.269556image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T05:05:29.958675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T05:05:30.425394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-04-21T05:05:30.110155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-04-21T05:05:33.133157image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강우량계 코드강우량계명구청 코드구청명10분우량
강우량계 코드1.0001.0001.0001.0000.000
강우량계명1.0001.0001.0001.0000.000
구청 코드1.0001.0001.0001.0000.000
구청명1.0001.0001.0001.0000.000
10분우량0.0000.0000.0000.0001.000
2024-04-21T05:05:33.299726image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구청명10분우량강우량계명
구청명1.0000.0000.999
10분우량0.0001.0000.000
강우량계명0.9990.0001.000
2024-04-21T05:05:33.466693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강우량계 코드구청 코드강우량계명구청명10분우량
강우량계 코드1.0000.9990.9980.9820.000
구청 코드0.9991.0000.9980.9990.000
강우량계명0.9980.9981.0000.9990.000
구청명0.9820.9990.9991.0000.000
10분우량0.0000.0000.0000.0001.000

Missing values

2024-04-21T05:05:30.626706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-21T05:05:30.812347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

강우량계 코드강우량계명구청 코드구청명10분우량자료수집 시각
248101301은평구청113은평구0.02021-12-04 21:29
137852501송파구청125송파구0.02021-12-03 03:19
170942102흑석P121동작구0.02021-12-03 16:59
440151802목동P118양천구0.02021-12-07 20:39
19466501강북구청105강북구0.02021-12-04 01:49
966951101중구청111중구0.02021-12-16 11:19
845802302신림P123관악구0.02021-12-14 16:19
743041702공항동P117강서구0.02021-12-12 11:29
591421101중구청111중구0.02021-12-10 04:49
781151101중구청111중구0.02021-12-13 01:29
강우량계 코드강우량계명구청 코드구청명10분우량자료수집 시각
63714301도봉구청103도봉구0.02021-12-10 21:19
711201801양천구청118양천구0.02021-12-11 23:39
14693701중랑구청107중랑구0.02021-12-03 06:49
80622201강동구청102강동구0.02021-12-13 10:39
861902401서초구청124서초구0.02021-12-14 21:59
958102302신림P123관악구0.02021-12-16 08:09
794121201광진구청112광진구0.02021-12-13 06:19
50869101강남구청101강남구0.02021-12-08 22:09
59001701강서구청117강서구0.02021-12-01 21:59
66277401노원구청104노원구0.02021-12-11 06:29