Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows1314
Duplicate rows (%)13.1%
Total size in memory322.3 KiB
Average record size in memory33.0 B

Variable types

Categorical2
Numeric1

Dataset

Description보건소 모바일 헬스케어 사업 대상자들이 모바일앱을 통해 입력한 식사일기 정보로서 식사일자, 칼로리, 끼니구분 정보를 제공합니다.
Author한국건강증진개발원
URLhttps://www.data.go.kr/data/15091584/fileData.do

Alerts

Dataset has 1314 (13.1%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 06:58:00.254196
Analysis finished2023-12-12 06:58:01.017411
Duration0.76 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

식사일시
Categorical

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-09-08
1563 
2023-09-01
1327 
2023-09-07
1255 
2023-09-04
1238 
2023-09-05
1117 
Other values (4)
3500 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2023-09-05
2nd row2023-09-04
3rd row2023-09-08
4th row2023-09-07
5th row2023-09-08

Common Values

ValueCountFrequency (%)
2023-09-08 1563
15.6%
2023-09-01 1327
13.3%
2023-09-07 1255
12.6%
2023-09-04 1238
12.4%
2023-09-05 1117
11.2%
2023-09-03 1091
10.9%
2023-09-06 894
8.9%
2023-09-09 766
7.7%
2023-09-02 749
7.5%

Length

2023-12-12T15:58:01.077888image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:58:01.184199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-09-08 1563
15.6%
2023-09-01 1327
13.3%
2023-09-07 1255
12.6%
2023-09-04 1238
12.4%
2023-09-05 1117
11.2%
2023-09-03 1091
10.9%
2023-09-06 894
8.9%
2023-09-09 766
7.7%
2023-09-02 749
7.5%

칼로리
Real number (ℝ)

Distinct2215
Distinct (%)22.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean142.6943
Minimum0
Maximum2032
Zeros15
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-12T15:58:01.325245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile7.2765
Q136.78
median92.91
Q3190.7625
95-th percentile436.3
Maximum2032
Range2032
Interquartile range (IQR)153.9825

Descriptive statistics

Standard deviation150.03635
Coefficient of variation (CV)1.051453
Kurtosis10.163739
Mean142.6943
Median Absolute Deviation (MAD)69.91
Skewness2.2078483
Sum1426943
Variance22510.905
MonotonicityNot monotonic
2023-12-12T15:58:01.460129image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
163.5 250
 
2.5%
302.4 242
 
2.4%
327.0 231
 
2.3%
151.2 211
 
2.1%
7.5 198
 
2.0%
132.0 176
 
1.8%
4.0 150
 
1.5%
15.0 141
 
1.4%
8.0 87
 
0.9%
79.0 80
 
0.8%
Other values (2205) 8234
82.3%
ValueCountFrequency (%)
0.0 15
0.1%
0.7 2
 
< 0.1%
0.75 1
 
< 0.1%
0.8 1
 
< 0.1%
0.83 1
 
< 0.1%
1.04 1
 
< 0.1%
1.08 1
 
< 0.1%
1.13 1
 
< 0.1%
1.2 1
 
< 0.1%
1.25 1
 
< 0.1%
ValueCountFrequency (%)
2032.0 1
< 0.1%
2019.6 1
< 0.1%
1650.0 1
< 0.1%
1450.0 1
< 0.1%
1356.6 1
< 0.1%
1340.0 1
< 0.1%
1172.18 1
< 0.1%
1054.14 2
< 0.1%
1009.46 1
< 0.1%
1000.0 1
< 0.1%

끼니구분
Categorical

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
점심
3268 
아침
2863 
저녁
2697 
점심간식
579 
아침간식
352 

Length

Max length4
Median length2
Mean length2.2344
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row아침
2nd row저녁
3rd row점심
4th row아침
5th row저녁

Common Values

ValueCountFrequency (%)
점심 3268
32.7%
아침 2863
28.6%
저녁 2697
27.0%
점심간식 579
 
5.8%
아침간식 352
 
3.5%
저녁간식 241
 
2.4%

Length

2023-12-12T15:58:01.638344image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:58:01.767497image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
점심 3268
32.7%
아침 2863
28.6%
저녁 2697
27.0%
점심간식 579
 
5.8%
아침간식 352
 
3.5%
저녁간식 241
 
2.4%

Interactions

2023-12-12T15:58:00.424860image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:58:01.849090image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
식사일시칼로리끼니구분
식사일시1.0000.0000.192
칼로리0.0001.0000.152
끼니구분0.1920.1521.000
2023-12-12T15:58:01.955804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
끼니구분식사일시
끼니구분1.0000.096
식사일시0.0961.000
2023-12-12T15:58:02.069709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
칼로리식사일시끼니구분
칼로리1.0000.0000.080
식사일시0.0001.0000.096
끼니구분0.0800.0961.000

Missing values

2023-12-12T15:58:00.584874image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:58:00.981313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

식사일시칼로리끼니구분
140282023-09-0514.93아침
173462023-09-04168.49저녁
52832023-09-08427.73점심
85192023-09-0772.71아침
38012023-09-08142.58저녁
48472023-09-08766.58저녁
103482023-09-0745.28아침
114892023-09-06327.0아침
290192023-09-01104.1점심
208292023-09-0358.27저녁
식사일시칼로리끼니구분
137892023-09-05130.2아침
136442023-09-05199.5저녁
5592023-09-094.0아침간식
27472023-09-08105.0아침
72542023-09-0710.0아침간식
105372023-09-0710.25아침
24472023-09-0860.0저녁
144712023-09-05110.8점심
2812023-09-0934.02아침
232022023-09-03132.0아침간식

Duplicate rows

Most frequently occurring

식사일시칼로리끼니구분# duplicates
11922023-09-08302.4점심23
11692023-09-08163.5아침20
1302023-09-01132.0아침19
5692023-09-04302.4점심19
5762023-09-04327.0점심19
1392023-09-01151.2점심17
1612023-09-01302.4점심17
12002023-09-08327.0저녁17
12812023-09-09132.0아침17
4022023-09-03302.4점심16