Overview

Dataset statistics

Number of variables6
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory576.2 KiB
Average record size in memory59.0 B

Variable types

Numeric2
Categorical3
DateTime1

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-1168/S/1/datasetView.do

Alerts

강우량계명 is highly overall correlated with 강우량계 코드 and 2 other fieldsHigh correlation
구청명 is highly overall correlated with 강우량계 코드 and 2 other fieldsHigh correlation
강우량계 코드 is highly overall correlated with 구청 코드 and 2 other fieldsHigh correlation
구청 코드 is highly overall correlated with 강우량계 코드 and 2 other fieldsHigh correlation
10분우량 is highly imbalanced (99.4%)Imbalance

Reproduction

Analysis started2023-12-11 08:24:56.010502
Analysis finished2023-12-11 08:24:57.670440
Duration1.66 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

강우량계 코드
Real number (ℝ)

HIGH CORRELATION 

Distinct47
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1236.8082
Minimum101
Maximum2502
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T17:24:57.769161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile103
Q1602
median1251
Q31802
95-th percentile2402
Maximum2502
Range2401
Interquartile range (IQR)1200

Descriptive statistics

Standard deviation714.29141
Coefficient of variation (CV)0.57752803
Kurtosis-1.0846622
Mean1236.8082
Median Absolute Deviation (MAD)551
Skewness0.10536819
Sum12368082
Variance510212.21
MonotonicityNot monotonic
2023-12-11T17:24:58.029288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
1502 284
 
2.8%
2001 267
 
2.7%
1104 267
 
2.7%
701 263
 
2.6%
401 262
 
2.6%
1303 258
 
2.6%
601 257
 
2.6%
201 257
 
2.6%
1101 257
 
2.6%
501 256
 
2.6%
Other values (37) 7372
73.7%
ValueCountFrequency (%)
101 250
2.5%
102 239
2.4%
103 253
2.5%
201 257
2.6%
301 255
2.5%
401 262
2.6%
402 232
2.3%
501 256
2.6%
601 257
2.6%
602 253
2.5%
ValueCountFrequency (%)
2502 248
2.5%
2501 229
2.3%
2402 132
1.3%
2401 122
1.2%
2302 251
2.5%
2301 252
2.5%
2202 89
 
0.9%
2201 3
 
< 0.1%
2102 56
 
0.6%
2101 38
 
0.4%

강우량계명
Categorical

HIGH CORRELATION 

Distinct47
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
봉원P
 
284
구로구청
 
267
서소문
 
267
중랑구청
 
263
노원구청
 
262
Other values (42)
8657 

Length

Max length5
Median length4
Mean length3.7826
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row구로구청
2nd row강동구청
3rd row휘경P
4th row구로구청
5th row양천구청

Common Values

ValueCountFrequency (%)
봉원P 284
 
2.8%
구로구청 267
 
2.7%
서소문 267
 
2.7%
중랑구청 263
 
2.6%
노원구청 262
 
2.6%
갈현1동 258
 
2.6%
강동구청 257
 
2.6%
성북구청 257
 
2.6%
중구청 257
 
2.6%
강북구청 256
 
2.6%
Other values (37) 7372
73.7%

Length

2023-12-11T17:24:58.264982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
봉원p 284
 
2.8%
구로구청 267
 
2.7%
서소문 267
 
2.7%
중랑구청 263
 
2.6%
노원구청 262
 
2.6%
갈현1동 258
 
2.6%
강동구청 257
 
2.6%
성북구청 257
 
2.6%
중구청 257
 
2.6%
강북구청 256
 
2.6%
Other values (37) 7372
73.7%

구청 코드
Real number (ℝ)

HIGH CORRELATION 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean112.3523
Minimum101
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T17:24:58.437324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile101
Q1106
median112.5
Q3118
95-th percentile124
Maximum125
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.1430307
Coefficient of variation (CV)0.063577075
Kurtosis-1.0844642
Mean112.3523
Median Absolute Deviation (MAD)5.5
Skewness0.10551945
Sum1123523
Variance51.022887
MonotonicityNot monotonic
2023-12-11T17:24:58.615678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
101 742
 
7.4%
113 732
 
7.3%
115 528
 
5.3%
111 524
 
5.2%
120 519
 
5.2%
107 517
 
5.2%
106 510
 
5.1%
123 503
 
5.0%
104 494
 
4.9%
109 479
 
4.8%
Other values (15) 4452
44.5%
ValueCountFrequency (%)
101 742
7.4%
102 257
 
2.6%
103 255
 
2.5%
104 494
4.9%
105 256
 
2.6%
106 510
5.1%
107 517
5.2%
108 410
4.1%
109 479
4.8%
110 434
4.3%
ValueCountFrequency (%)
125 477
4.8%
124 254
2.5%
123 503
5.0%
122 92
 
0.9%
121 94
 
0.9%
120 519
5.2%
119 413
4.1%
118 450
4.5%
117 467
4.7%
116 262
2.6%

구청명
Categorical

HIGH CORRELATION 

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
강남구
742 
은평구
732 
마포구
 
528
중구
 
524
구로구
 
519
Other values (20)
6955 

Length

Max length4
Median length3
Mean length3.0508
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row구로구
2nd row강동구
3rd row동대문구
4th row구로구
5th row양천구

Common Values

ValueCountFrequency (%)
강남구 742
 
7.4%
은평구 732
 
7.3%
마포구 528
 
5.3%
중구 524
 
5.2%
구로구 519
 
5.2%
중랑구 517
 
5.2%
성북구 510
 
5.1%
관악구 503
 
5.0%
노원구 494
 
4.9%
성동구 479
 
4.8%
Other values (15) 4452
44.5%

Length

2023-12-11T17:24:58.836754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강남구 742
 
7.4%
은평구 732
 
7.3%
마포구 528
 
5.3%
중구 524
 
5.2%
구로구 519
 
5.2%
중랑구 517
 
5.2%
성북구 510
 
5.1%
관악구 503
 
5.0%
노원구 494
 
4.9%
성동구 479
 
4.8%
Other values (15) 4452
44.5%

10분우량
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0.0
9988 
0.5
 
9
1.5
 
1
38.5
 
1
34.5
 
1

Length

Max length4
Median length3
Mean length3.0002
Min length3

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row0.0
2nd row0.0
3rd row0.0
4th row0.0
5th row0.0

Common Values

ValueCountFrequency (%)
0.0 9988
99.9%
0.5 9
 
0.1%
1.5 1
 
< 0.1%
38.5 1
 
< 0.1%
34.5 1
 
< 0.1%

Length

2023-12-11T17:24:59.041634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T17:24:59.172172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0.0 9988
99.9%
0.5 9
 
0.1%
1.5 1
 
< 0.1%
38.5 1
 
< 0.1%
34.5 1
 
< 0.1%
Distinct2443
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2020-12-01 00:09:00
Maximum2020-12-20 08:29:00
2023-12-11T17:24:59.341853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:24:59.590258image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-11T17:24:57.138610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:24:56.823893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:24:57.272774image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T17:24:56.984349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T17:24:59.756810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강우량계 코드강우량계명구청 코드구청명10분우량
강우량계 코드1.0001.0001.0001.0000.023
강우량계명1.0001.0001.0001.0000.000
구청 코드1.0001.0001.0001.0000.021
구청명1.0001.0001.0001.0000.000
10분우량0.0230.0000.0210.0001.000
2023-12-11T17:24:59.899717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
10분우량강우량계명구청명
10분우량1.0000.0000.000
강우량계명0.0001.0000.999
구청명0.0000.9991.000
2023-12-11T17:25:00.043155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
강우량계 코드구청 코드강우량계명구청명10분우량
강우량계 코드1.0000.9990.9980.9820.009
구청 코드0.9991.0000.9980.9990.009
강우량계명0.9980.9981.0000.9990.000
구청명0.9820.9990.9991.0000.000
10분우량0.0090.0090.0000.0001.000

Missing values

2023-12-11T17:24:57.462478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T17:24:57.597561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

강우량계 코드강우량계명구청 코드구청명10분우량자료수집 시각
351482001구로구청120구로구0.02020-12-07 16:59
69352201강동구청102강동구0.02020-12-13 16:59
820802휘경P108동대문구0.02020-12-01 03:59
872872001구로구청120구로구0.02020-12-18 09:29
16711801양천구청118양천구0.02020-12-01 08:09
70119201강동구청102강동구0.02020-12-13 20:09
241111101중구청111중구0.02020-12-05 17:09
74437101강남구청101강남구0.02020-12-14 14:29
512742301관악구청123관악구0.02020-12-10 14:19
17135101강남구청101강남구0.02020-12-04 12:39
강우량계 코드강우량계명구청 코드구청명10분우량자료수집 시각
336661801양천구청118양천구0.02020-12-07 11:09
187912402반포P124서초구0.02020-12-04 19:09
197461502봉원P115마포구0.02020-12-04 22:49
45824801동대문구청108동대문구0.02020-12-09 13:49
65911701중랑구청107중랑구0.02020-12-13 02:39
76204902뚝섬P109성동구0.02020-12-14 21:39
589342002개봉2동120구로구0.02020-12-11 22:59
62389101강남구청101강남구0.02020-12-12 12:49
316731501마포구청115마포구0.02020-12-07 02:19
667171601용산구청116용산구0.02020-12-13 05:59