Overview

Dataset statistics

Number of variables: 4
Number of observations: 30
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.2 KiB
Average record size in memory: 39.4 B

Variable types

Numeric: 3
DateTime: 1

Dataset

Description: Sample data
Author: 식품안전나라 (Food Safety Korea)
URL: https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=65

Alerts

년월(occrrnc_ym) has unique values (Unique)
발생건수(occrrnc_cnt) has 2 (6.7%) zeros (Zeros)
환자수(patnt_co) has 1 (3.3%) zeros (Zeros)

Reproduction

Analysis started: 2024-01-14 06:50:57.985728
Analysis finished: 2024-01-14 06:50:59.285982
Duration: 1.3 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
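
The reported duration follows directly from the two timestamps. As a minimal sketch (the variable names are illustrative, not part of the report), the subtraction can be reproduced with the standard library:

```python
from datetime import datetime

# Timestamps copied from the "Analysis started" and "Analysis finished" fields.
started = datetime.fromisoformat("2024-01-14 06:50:57.985728")
finished = datetime.fromisoformat("2024-01-14 06:50:59.285982")

# Elapsed wall-clock time in seconds.
elapsed = (finished - started).total_seconds()

print(f"{elapsed:.6f} s")  # full precision
print(f"{elapsed:.1f} s")  # rounded the way the report displays it
```

Rounded to one decimal place, the difference agrees with the 1.3 seconds shown in the Duration field.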

Variables

년월(occrrnc_ym): year-month
Real number (ℝ)

UNIQUE

Distinct: 30
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 201643.57
Minimum: 201202
Maximum: 202105
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 201202
5th percentile: 201207.9
Q1: 201334.25
Median: 201659
Q3: 201904
95th percentile: 202102.1
Maximum: 202105
Range: 903
Interquartile range (IQR): 569.75

Descriptive statistics

Standard deviation: 307.02552
Coefficient of variation (CV): 0.001522615
Kurtosis: -1.397795
Mean: 201643.57
Median Absolute Deviation (MAD): 254
Skewness: -0.018856752
Sum: 6049307
Variance: 94264.668
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=30)
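
Several of these figures are derived from others, which gives a quick way to sanity-check the extracted numbers. The sketch below recomputes the range, IQR, coefficient of variation, and variance from values copied out of the report. The variable names are illustrative, and because the inputs are rounded as printed, differences in the last digits are expected.

```python
# Values copied from the report for 년월(occrrnc_ym).
minimum, maximum = 201202, 202105
q1, q3 = 201334.25, 201904
mean, std = 201643.57, 307.02552

value_range = maximum - minimum  # reported Range: 903
iqr = q3 - q1                    # reported IQR: 569.75
cv = std / mean                  # reported CV: 0.001522615
variance = std ** 2              # reported Variance: 94264.668

print(value_range, iqr)
print(f"{cv:.9f}")
print(f"{variance:.3f}")
```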
Common values
Value              Count  Frequency (%)
201805             1      3.3%
201404             1      3.3%
202103             1      3.3%
201912             1      3.3%
201512             1      3.3%
202004             1      3.3%
201211             1      3.3%
201804             1      3.3%
201209             1      3.3%
201510             1      3.3%
Other values (20)  20     66.7%

Minimum 10 values
Value   Count  Frequency (%)
201202  1      3.3%
201207  1      3.3%
201209  1      3.3%
201211  1      3.3%
201301  1      3.3%
201306  1      3.3%
201307  1      3.3%
201311  1      3.3%
201404  1      3.3%
201408  1      3.3%

Maximum 10 values
Value   Count  Frequency (%)
202105  1      3.3%
202103  1      3.3%
202101  1      3.3%
202008  1      3.3%
202004  1      3.3%
202003  1      3.3%
201912  1      3.3%
201905  1      3.3%
201901  1      3.3%
201812  1      3.3%
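
The values of this column (201202 through 202105) look like YYYYMM codes, yet the profiler typed the column as a real number, so statistics such as the range of 903 measure distance between integer codes rather than elapsed calendar time. The sketch below assumes the YYYYMM reading is correct; `parse_yyyymm` and `months_between` are hypothetical helpers introduced only for illustration.

```python
from datetime import date


def parse_yyyymm(code: int) -> date:
    """Convert an integer such as 201202 into the first day of that month."""
    year, month = divmod(code, 100)
    return date(year, month, 1)  # raises ValueError if month is not 1..12


def months_between(start: int, end: int) -> int:
    """Number of calendar months from one YYYYMM code to another."""
    a, b = parse_yyyymm(start), parse_yyyymm(end)
    return (b.year - a.year) * 12 + (b.month - a.month)


print(parse_yyyymm(201202))            # first month in the data
print(months_between(201202, 202105))  # calendar span in months
print(202105 - 201202)                 # numeric range as reported
```

Under that assumption the data spans 111 calendar months, whereas the numeric range is 903, so converting the column to a date type before profiling may give more interpretable summaries.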

발생건수(occrrnc_cnt): number of occurrences
Real number (ℝ)

ZEROS

Distinct: 9
Distinct (%): 30.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.1333333
Minimum: 0
Maximum: 9
Zeros: 2
Zeros (%): 6.7%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 0
5th percentile: 0.45
Q1: 2
Median: 3
Q3: 4
95th percentile: 6.55
Maximum: 9
Range: 9
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 2.0296651
Coefficient of variation (CV): 0.64776544
Kurtosis: 1.1905187
Mean: 3.1333333
Median Absolute Deviation (MAD): 1
Skewness: 0.89320087
Sum: 94
Variance: 4.1195402
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=9)
Common values
Value  Count  Frequency (%)
3      8      26.7%
2      6      20.0%
5      4      13.3%
1      4      13.3%
4      3      10.0%
0      2      6.7%
6      1      3.3%
7      1      3.3%
9      1      3.3%

Minimum values
Value  Count  Frequency (%)
0      2      6.7%
1      4      13.3%
2      6      20.0%
3      8      26.7%
4      3      10.0%
5      4      13.3%
6      1      3.3%
7      1      3.3%
9      1      3.3%

Maximum values
Value  Count  Frequency (%)
9      1      3.3%
7      1      3.3%
6      1      3.3%
5      4      13.3%
4      3      10.0%
3      8      26.7%
2      6      20.0%
1      4      13.3%
0      2      6.7%
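
Because this column has only nine distinct values, the frequency table lists every observation, and the summary statistics can be rebuilt from it. The sketch below expands the table into 30 values and recomputes the sum, mean, sample variance, and percentiles. It assumes percentiles use linear interpolation between neighbouring ranks (the default in pandas and NumPy); `percentile` is an illustrative helper, not a function of the profiling library.

```python
import statistics

# Value -> count, copied from the frequency table for 발생건수(occrrnc_cnt).
counts = {0: 2, 1: 4, 2: 6, 3: 8, 4: 3, 5: 4, 6: 1, 7: 1, 9: 1}

# Expand the table into the sorted list of observations.
values = sorted(v for v, n in counts.items() for _ in range(n))


def percentile(sorted_values, p):
    """Percentile p (0..100) with linear interpolation between ranks."""
    pos = (len(sorted_values) - 1) * p / 100
    lo = int(pos)
    hi = min(lo + 1, len(sorted_values) - 1)
    return sorted_values[lo] + (sorted_values[hi] - sorted_values[lo]) * (pos - lo)


n = len(values)
total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
stdev = statistics.stdev(values)
p5, q1, median, q3, p95 = (percentile(values, p) for p in (5, 25, 50, 75, 95))
zero_share = counts[0] / n

print(n, total, round(mean, 7), round(variance, 7), round(stdev, 7))
print(round(p5, 2), q1, median, q3, round(p95, 2))
print(f"{zero_share:.1%}")
```

The recomputed figures agree with the report (sum 94, variance 4.1195402, 5th percentile 0.45, 95th percentile 6.55), which suggests the variance shown is the sample variance with an n - 1 denominator.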

환자수(patnt_co): number of patients
Real number (ℝ)

ZEROS

Distinct: 27
Distinct (%): 90.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 70.833333
Minimum: 0
Maximum: 466
Zeros: 1
Zeros (%): 3.3%
Negative: 0
Negative (%): 0.0%
Memory size: 402.0 B

Quantile statistics

Minimum: 0
5th percentile: 3.35
Q1: 11
Median: 51.5
Q3: 106
95th percentile: 170.4
Maximum: 466
Range: 466
Interquartile range (IQR): 95

Descriptive statistics

Standard deviation: 90.52189
Coefficient of variation (CV): 1.2779561
Kurtosis: 12.437227
Mean: 70.833333
Median Absolute Deviation (MAD): 43
Skewness: 3.0885395
Sum: 2125
Variance: 8194.2126
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=27)
Common values
Value              Count  Frequency (%)
11                 2      6.7%
16                 2      6.7%
5                  2      6.7%
8                  1      3.3%
58                 1      3.3%
83                 1      3.3%
68                 1      3.3%
95                 1      3.3%
466                1      3.3%
60                 1      3.3%
Other values (17)  17     56.7%

Minimum 10 values
Value  Count  Frequency (%)
0      1      3.3%
2      1      3.3%
5      2      6.7%
8      1      3.3%
9      1      3.3%
10     1      3.3%
11     2      6.7%
15     1      3.3%
16     2      6.7%
37     1      3.3%

Maximum 10 values
Value  Count  Frequency (%)
466    1      3.3%
210    1      3.3%
122    1      3.3%
119    1      3.3%
116    1      3.3%
114    1      3.3%
109    1      3.3%
108    1      3.3%
100    1      3.3%
95     1      3.3%
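
The skewness (3.09) and kurtosis (12.4) of this column point to a heavy right tail, and the largest value, 466, is more than twice the next largest. The sketch below applies Tukey's 1.5 × IQR rule to the reported quartiles. The rule is a common convention chosen here for illustration, not something the report itself applies, and the variable names are illustrative.

```python
# Quartiles, sum, count, and largest values copied from the report
# for 환자수(patnt_co).
q1, q3 = 11, 106
total, n = 2125, 30
largest = [466, 210, 122, 119, 116, 114, 109, 108, 100, 95]

iqr = q3 - q1                 # 95, matching the reported IQR
upper_fence = q3 + 1.5 * iqr  # values above this are flagged
lower_fence = q1 - 1.5 * iqr  # negative, so no low value can be flagged

# The list holds the ten largest values, so anything above the fence is in it.
outliers = [v for v in largest if v > upper_fence]

# Mean after dropping the flagged values, for comparison with 70.83.
mean_without = (total - sum(outliers)) / (n - len(outliers))

print(upper_fence, lower_fence)
print(outliers)
print(round(mean_without, 2))
```

Only 466 exceeds the upper fence of 248.5. Leaving it out lowers the mean from about 70.8 to about 57.2, so the median of 51.5 may be the more robust summary of a typical month.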

적재일시(ldadng_dt): load timestamp
DateTime

Distinct: 6
Distinct (%): 20.0%
Missing: 0
Missing (%): 0.0%
Memory size: 372.0 B
Minimum: 2017-11-21 17:09:11
Maximum: 2021-07-19 13:33:06
Histogram with fixed size bins (bins=6)

Interactions

[Figures: nine interaction plots]

Correlations

Correlation matrix for all four variables:
                       occrrnc_ym  occrrnc_cnt  patnt_co  ldadng_dt
년월(occrrnc_ym)       1.000       0.000        0.127     0.181
발생건수(occrrnc_cnt)  0.000       1.000        0.621     0.000
환자수(patnt_co)       0.127       0.621        1.000     0.396
적재일시(ldadng_dt)    0.181       0.000        0.396     1.000

Correlation matrix for the three numeric variables:
                       occrrnc_ym  occrrnc_cnt  patnt_co
년월(occrrnc_ym)       1.000       -0.007       -0.145
발생건수(occrrnc_cnt)  -0.007      1.000        0.337
환자수(patnt_co)       -0.145      0.337        1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
    년월(occrrnc_ym)  발생건수(occrrnc_cnt)  환자수(patnt_co)  적재일시(ldadng_dt)
0   201805            2                      8                 2018-08-21 16:56:45
1   202003            5                      37                2017-11-21 17:09:11
2   202105            3                      116               2017-11-21 17:09:11
3   201301            5                      52                2019-04-22 14:35:29
4   201812            6                      109               2017-11-21 17:09:11
5   201706            1                      9                 2017-11-21 17:09:11
6   201611            2                      108               2021-01-22 09:59:48
7   201901            2                      0                 2017-11-21 17:09:11
8   201408            3                      11                2018-08-21 16:56:45
9   201709            0                      15                2017-11-21 17:09:11

Last rows
    년월(occrrnc_ym)  발생건수(occrrnc_cnt)  환자수(patnt_co)  적재일시(ldadng_dt)
20  201207            5                      16                2017-11-21 17:09:11
21  201612            1                      2                 2018-08-21 16:56:45
22  201510            3                      210               2021-01-22 09:59:48
23  201209            3                      60                2017-11-21 17:09:11
24  201804            1                      11                2017-11-21 17:09:11
25  201211            9                      466               2017-11-21 17:09:11
26  202004            4                      5                 2017-11-21 17:09:11
27  201512            0                      95                2017-11-21 17:09:11
28  201912            3                      68                2018-08-21 16:56:45
29  202103            2                      83                2017-11-21 17:09:11
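
The sample rows make it possible to look at the zeros flagged in the Alerts section in context. The sketch below lists the rows where exactly one of 발생건수(occrrnc_cnt) and 환자수(patnt_co) is zero, and computes patients per occurrence for the remaining rows. It covers only the 20 rows shown in the sample; the tuple layout and variable names are illustrative, and whether the mismatched rows reflect data-entry issues is not something the report states.

```python
# (row index, occrrnc_ym, occrrnc_cnt, patnt_co) for the 20 sample rows;
# the other 10 rows of the dataset are not listed in the report.
rows = [
    (0, 201805, 2, 8), (1, 202003, 5, 37), (2, 202105, 3, 116),
    (3, 201301, 5, 52), (4, 201812, 6, 109), (5, 201706, 1, 9),
    (6, 201611, 2, 108), (7, 201901, 2, 0), (8, 201408, 3, 11),
    (9, 201709, 0, 15), (20, 201207, 5, 16), (21, 201612, 1, 2),
    (22, 201510, 3, 210), (23, 201209, 3, 60), (24, 201804, 1, 11),
    (25, 201211, 9, 466), (26, 202004, 4, 5), (27, 201512, 0, 95),
    (28, 201912, 3, 68), (29, 202103, 2, 83),
]

# Rows where exactly one of the two counts is zero.
mismatched = [r for r in rows if (r[2] == 0) != (r[3] == 0)]

# Patients per occurrence, skipping rows with no occurrences
# so that the division is always defined.
ratio = {r[0]: r[3] / r[2] for r in rows if r[2] > 0}

for idx, ym, cnt, patients in mismatched:
    print(idx, ym, cnt, patients)
```

In this sample, rows 9 and 27 report patients with zero occurrences, and row 7 reports occurrences with zero patients, so the three zeros counted in the alerts all appear among the listed rows.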