Overview

Dataset statistics

Number of variables7
Number of observations812
Missing cells4
Missing cells (%)0.1%
Duplicate rows8
Duplicate rows (%)1.0%
Total size in memory46.9 KiB
Average record size in memory59.2 B

Variable types

DateTime1
Categorical3
Numeric3

Dataset

Description경기도 안산시의 식중독, 자외선지수를 표현한 데이터로 발표시간,지수명,시군명,지점명,오늘예측값,내일예측값,모레예측값 등의 목록을 제공합니다.
Author경기도 안산시
URLhttps://www.data.go.kr/data/15090295/fileData.do

Alerts

시군명 has constant value ""Constant
Dataset has 8 (1.0%) duplicate rowsDuplicates
오늘예측값 is highly overall correlated with 내일예측값 and 1 other fieldsHigh correlation
내일예측값 is highly overall correlated with 오늘예측값 and 1 other fieldsHigh correlation
모레예측값 is highly overall correlated with 오늘예측값 and 1 other fieldsHigh correlation

Reproduction

Analysis started2023-12-12 20:32:09.487173
Analysis finished2023-12-12 20:32:11.128103
Duration1.64 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct201
Distinct (%)24.8%
Missing0
Missing (%)0.0%
Memory size6.5 KiB
Minimum2019-11-27 06:00:00
Maximum2020-07-18 06:00:00
2023-12-13T05:32:11.225158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:32:11.437680image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

지수명
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size6.5 KiB
식중독지수
406 
자외선지수
406 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row식중독지수
2nd row식중독지수
3rd row자외선지수
4th row자외선지수
5th row식중독지수

Common Values

ValueCountFrequency (%)
식중독지수 406
50.0%
자외선지수 406
50.0%

Length

2023-12-13T05:32:11.594254image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:32:11.722538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
식중독지수 406
50.0%
자외선지수 406
50.0%

시군명
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size6.5 KiB
안산시
812 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row안산시
2nd row안산시
3rd row안산시
4th row안산시
5th row안산시

Common Values

ValueCountFrequency (%)
안산시 812
100.0%

Length

2023-12-13T05:32:11.871502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:32:11.997315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
안산시 812
100.0%

지점명
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size6.5 KiB
경기도 안산시 단원구
406 
경기도 안산시 상록구
406 

Length

Max length11
Median length11
Mean length11
Min length11

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 안산시 단원구
2nd row경기도 안산시 상록구
3rd row경기도 안산시 단원구
4th row경기도 안산시 상록구
5th row경기도 안산시 단원구

Common Values

ValueCountFrequency (%)
경기도 안산시 단원구 406
50.0%
경기도 안산시 상록구 406
50.0%

Length

2023-12-13T05:32:12.104014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T05:32:12.208572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
경기도 812
33.3%
안산시 812
33.3%
단원구 406
16.7%
상록구 406
16.7%

오늘예측값
Real number (ℝ)

HIGH CORRELATION 

Distinct75
Distinct (%)9.3%
Missing4
Missing (%)0.5%
Infinite0
Infinite (%)0.0%
Mean26.491337
Minimum-999
Maximum94
Zeros0
Zeros (%)0.0%
Negative4
Negative (%)0.5%
Memory size7.3 KiB
2023-12-13T05:32:12.358606image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-999
5-th percentile2
Q15
median10
Q357.25
95-th percentile79
Maximum94
Range1093
Interquartile range (IQR)52.25

Descriptive statistics

Standard deviation77.770848
Coefficient of variation (CV)2.9357087
Kurtosis147.99226
Mean26.491337
Median Absolute Deviation (MAD)14.5
Skewness-11.346078
Sum21405
Variance6048.3048
MonotonicityNot monotonic
2023-12-13T05:32:12.544847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 64
 
7.9%
7 54
 
6.7%
5 47
 
5.8%
6 45
 
5.5%
4 43
 
5.3%
9 34
 
4.2%
3 30
 
3.7%
1 30
 
3.7%
10 29
 
3.6%
8 28
 
3.4%
Other values (65) 404
49.8%
ValueCountFrequency (%)
-999 4
 
0.5%
1 30
3.7%
2 64
7.9%
3 30
3.7%
4 43
5.3%
5 47
5.8%
6 45
5.5%
7 54
6.7%
8 28
3.4%
9 34
4.2%
ValueCountFrequency (%)
94 4
0.5%
93 1
 
0.1%
92 2
0.2%
91 3
0.4%
90 2
0.2%
89 3
0.4%
88 1
 
0.1%
87 1
 
0.1%
86 4
0.5%
85 4
0.5%

내일예측값
Real number (ℝ)

HIGH CORRELATION 

Distinct74
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.669951
Minimum-999
Maximum94
Zeros0
Zeros (%)0.0%
Negative4
Negative (%)0.5%
Memory size7.3 KiB
2023-12-13T05:32:12.704085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-999
5-th percentile1
Q15
median10
Q356
95-th percentile79
Maximum94
Range1093
Interquartile range (IQR)51

Descriptive statistics

Standard deviation77.650848
Coefficient of variation (CV)2.9115482
Kurtosis148.27843
Mean26.669951
Median Absolute Deviation (MAD)15.5
Skewness-11.348589
Sum21656
Variance6029.6542
MonotonicityNot monotonic
2023-12-13T05:32:12.882894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 55
 
6.8%
6 55
 
6.8%
2 54
 
6.7%
9 40
 
4.9%
4 39
 
4.8%
1 38
 
4.7%
3 35
 
4.3%
10 31
 
3.8%
8 31
 
3.8%
7 28
 
3.4%
Other values (64) 406
50.0%
ValueCountFrequency (%)
-999 4
 
0.5%
1 38
4.7%
2 54
6.7%
3 35
4.3%
4 39
4.8%
5 55
6.8%
6 55
6.8%
7 28
3.4%
8 31
3.8%
9 40
4.9%
ValueCountFrequency (%)
94 1
 
0.1%
93 6
0.7%
92 2
 
0.2%
91 2
 
0.2%
90 1
 
0.1%
89 2
 
0.2%
88 2
 
0.2%
87 2
 
0.2%
86 2
 
0.2%
85 5
0.6%

모레예측값
Real number (ℝ)

HIGH CORRELATION 

Distinct73
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.724138
Minimum-999
Maximum93
Zeros0
Zeros (%)0.0%
Negative4
Negative (%)0.5%
Memory size7.3 KiB
2023-12-13T05:32:13.357013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-999
5-th percentile1
Q15
median10
Q357
95-th percentile79
Maximum93
Range1092
Interquartile range (IQR)52

Descriptive statistics

Standard deviation77.774836
Coefficient of variation (CV)2.9102842
Kurtosis147.34922
Mean26.724138
Median Absolute Deviation (MAD)14.5
Skewness-11.294595
Sum21700
Variance6048.925
MonotonicityNot monotonic
2023-12-13T05:32:13.537422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
5 65
 
8.0%
2 53
 
6.5%
6 51
 
6.3%
9 49
 
6.0%
1 43
 
5.3%
4 40
 
4.9%
8 38
 
4.7%
3 34
 
4.2%
7 24
 
3.0%
44 15
 
1.8%
Other values (63) 400
49.3%
ValueCountFrequency (%)
-999 4
 
0.5%
1 43
5.3%
2 53
6.5%
3 34
4.2%
4 40
4.9%
5 65
8.0%
6 51
6.3%
7 24
 
3.0%
8 38
4.7%
9 49
6.0%
ValueCountFrequency (%)
93 1
 
0.1%
92 6
0.7%
91 3
0.4%
89 3
0.4%
88 1
 
0.1%
87 2
 
0.2%
86 2
 
0.2%
85 2
 
0.2%
84 4
0.5%
83 3
0.4%

Interactions

2023-12-13T05:32:10.458855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:32:09.767429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:32:10.076597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:32:10.582603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:32:09.871603image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:32:10.178902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:32:10.698918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:32:09.977891image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T05:32:10.314170image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T05:32:13.662940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지수명지점명오늘예측값내일예측값모레예측값
지수명1.0000.000NaNNaNNaN
지점명0.0001.000NaNNaNNaN
오늘예측값NaNNaN1.000NaNNaN
내일예측값NaNNaNNaN1.000NaN
모레예측값NaNNaNNaNNaN1.000
2023-12-13T05:32:13.798169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
지수명지점명
지수명1.0000.000
지점명0.0001.000
2023-12-13T05:32:13.913450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
오늘예측값내일예측값모레예측값지수명지점명
오늘예측값1.0000.9560.9420.0400.000
내일예측값0.9561.0000.9600.0390.000
모레예측값0.9420.9601.0000.0390.000
지수명0.0400.0390.0391.0000.000
지점명0.0000.0000.0000.0001.000

Missing values

2023-12-13T05:32:10.882257image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T05:32:11.071166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

발표시간지수명시군명지점명오늘예측값내일예측값모레예측값
02020-07-18 06:00식중독지수안산시경기도 안산시 단원구697882
12020-07-18 06:00식중독지수안산시경기도 안산시 상록구717680
22020-07-18 06:00자외선지수안산시경기도 안산시 단원구1055
32020-07-18 06:00자외선지수안산시경기도 안산시 상록구1055
42020-07-17 06:00식중독지수안산시경기도 안산시 단원구626477
52020-07-17 06:00식중독지수안산시경기도 안산시 상록구646773
62020-07-17 06:00자외선지수안산시경기도 안산시 단원구10105
72020-07-17 06:00자외선지수안산시경기도 안산시 상록구10105
82020-07-16 06:00식중독지수안산시경기도 안산시 단원구565862
92020-07-16 06:00식중독지수안산시경기도 안산시 상록구526267
발표시간지수명시군명지점명오늘예측값내일예측값모레예측값
8022019-11-30 06:00자외선지수안산시경기도 안산시 단원구212
8032019-11-30 06:00자외선지수안산시경기도 안산시 상록구212
8042019-11-28 06:00식중독지수안산시경기도 안산시 단원구373937
8052019-11-28 06:00식중독지수안산시경기도 안산시 상록구384038
8062019-11-28 06:00자외선지수안산시경기도 안산시 단원구222
8072019-11-28 06:00자외선지수안산시경기도 안산시 상록구222
8082019-11-27 06:00식중독지수안산시경기도 안산시 단원구423538
8092019-11-27 06:00식중독지수안산시경기도 안산시 상록구433838
8102019-11-27 06:00자외선지수안산시경기도 안산시 단원구222
8112019-11-27 06:00자외선지수안산시경기도 안산시 상록구222

Duplicate rows

Most frequently occurring

발표시간지수명시군명지점명오늘예측값내일예측값모레예측값# duplicates
02020-05-05 06:00식중독지수안산시경기도 안산시 단원구4436412
12020-05-05 06:00식중독지수안산시경기도 안산시 상록구4533442
22020-05-05 06:00자외선지수안산시경기도 안산시 단원구5882
32020-05-05 06:00자외선지수안산시경기도 안산시 상록구5882
42020-05-07 06:00식중독지수안산시경기도 안산시 단원구3947652
52020-05-07 06:00식중독지수안산시경기도 안산시 상록구4047592
62020-05-07 06:00자외선지수안산시경기도 안산시 단원구9542
72020-05-07 06:00자외선지수안산시경기도 안산시 상록구9542