Overview

Dataset statistics

Number of variables: 3
Number of observations: 5512
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 50
Duplicate rows (%): 0.9%
Total size in memory: 129.3 KiB
Average record size in memory: 24.0 B
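
As a quick arithmetic check, the duplicate-row percentage and the average record size can be recomputed from the other figures in this table. The sketch below assumes 1 KiB = 1024 B; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics in this report.
n_rows = 5512
n_duplicate_rows = 50
total_size_kib = 129.3

# Share of duplicate rows, as a percentage of all observations.
duplicate_pct = 100 * n_duplicate_rows / n_rows

# Average in-memory size per record, in bytes (assuming 1 KiB = 1024 B).
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"{duplicate_pct:.1f}%")      # 0.9%
print(f"{avg_record_bytes:.1f} B")  # 24.0 B
```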

Variable types

DateTime: 1
Categorical: 1
Text: 1

Dataset

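Description: Meal-plan data for meals provided to staff at National Sorokdo Hospital (국립소록도병원), organized by date, meal type (조식 = breakfast, 중식 = lunch, 석식 = dinner), and menu name.
Author: Ministry of Health and Welfare, National Sorokdo Hospital (보건복지부 국립소록도병원)
URL: https://www.data.go.kr/data/15075554/fileData.do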

Alerts

Dataset has 50 (0.9%) duplicate rows (alert type: Duplicates)

Reproduction

Analysis started: 2024-03-14 14:14:12.131957
Analysis finished: 2024-03-14 14:14:12.902079
Duration: 0.77 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

일자
Date

Distinct: 343
Distinct (%): 6.2%
Missing: 0
Missing (%): 0.0%
Memory size: 43.2 KiB
Minimum: 2023-01-01 00:00:00
Maximum: 2023-12-31 00:00:00
Histogram with fixed size bins (bins=50)

메뉴구분
Categorical

Distinct: 3
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 43.2 KiB
중식: 2057
석식: 1935
조식: 1520

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 조식
2nd row: 조식
3rd row: 조식
4th row: 조식
5th row: 조식

Common Values

Value   Count   Frequency (%)
중식    2057    37.3%
석식    1935    35.1%
조식    1520    27.6%
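
The frequency column is each count divided by the total number of observations. A minimal sketch of that calculation, using the counts from this table (this is not necessarily how ydata-profiling computes it internally):

```python
from collections import Counter

# Counts copied from the Common Values table for 메뉴구분.
counts = Counter({"중식": 2057, "석식": 1935, "조식": 1520})
total = sum(counts.values())  # equals the number of observations, 5512

# Print each value with its count and share of the total, most common first.
for value, count in counts.most_common():
    print(f"{value}\t{count}\t{100 * count / total:.1f}%")
```

The three counts sum to 5512, which matches the number of observations and the zero missing values reported for this variable.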

Length

Histogram of lengths of the category

Common Values (Plot)

[Figure: plot of the common values of 메뉴구분]

메뉴명
Text

Distinct: 538
Distinct (%): 9.8%
Missing: 0
Missing (%): 0.0%
Memory size: 43.2 KiB

Length

Max length: 16
Median length: 13
Mean length: 5.7958999
Min length: 1
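
The mean length is consistent with the report's other figures: dividing the total character count reported for this variable (31947) by the number of observations (5512) gives the same value. A small check, assuming length is measured in characters and no values are missing:

```python
# Figures copied from this report.
total_characters = 31947  # "Total characters" for this text variable
n_rows = 5512             # "Number of observations"

# Mean length in characters per value.
mean_length = total_characters / n_rows

print(round(mean_length, 4))  # 5.7959
```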

Characters and Unicode

Total characters: 31947
Distinct characters: 357
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
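
To illustrate how character properties can be tallied, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is hypothetical, and this is not necessarily how ydata-profiling implements it. The standard library exposes categories (such as `Lo` for Other Letter and `Po` for Other Punctuation) but not scripts or blocks, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count characters by Unicode general category (e.g. 'Lo', 'Po')."""
    counts = Counter()
    for text in texts:
        # Normalize to NFC so each Hangul syllable is a single code point.
        for ch in unicodedata.normalize("NFC", text):
            counts[unicodedata.category(ch)] += 1
    return counts


# One menu name taken from the sample rows of this report.
sample = ["순살크런치생선까스&소스"]
print(category_counts(sample))  # Counter({'Lo': 11, 'Po': 1})
```

The category codes map to the names used in this report: `Lo` is Other Letter, `Po` is Other Punctuation, `Zs` is Space Separator, `Ps` and `Pe` are Open and Close Punctuation, `Nd` is Decimal Number, and `Ll` is Lowercase Letter.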

Unique

Unique: 82
Unique (%): 1.5%

Sample

1st row: 쇠고기미역국
2nd row: 새우젓계란찜
3rd row: 미트볼칠리소스조림
4th row: 숙주부추나물
5th row: 배추김치
Value                Count   Frequency (%)
배추김치             801     14.5%
알타리김치           76      1.4%
석박지               53      1.0%
깍두기               51      0.9%
콩나물무침           44      0.8%
시금치나물           42      0.8%
물만두국             40      0.7%
육개장               40      0.7%
열무김치             38      0.7%
미역줄기볶음         37      0.7%
Other values (534)   4311    77.9%

Most occurring characters

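Value                Count   Frequency (%)
[unreadable]         1498    4.7%
[unreadable]         1251    3.9%
[unreadable]         1153    3.6%
[unreadable]         1080    3.4%
[unreadable]         833     2.6%
[unreadable]         742     2.3%
[unreadable]         712     2.2%
[unreadable]         704     2.2%
[unreadable]         694     2.2%
[unreadable]         680     2.1%
Other values (347)   22600   70.7%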

Most occurring categories

Value               Count   Frequency (%)
Other Letter        31567   98.8%
Other Punctuation   294     0.9%
Space Separator     21      0.1%
Open Punctuation    21      0.1%
Close Punctuation   21      0.1%
Decimal Number      16      0.1%
Lowercase Letter    7       < 0.1%

Most frequent character per category

Other Letter
Value                Count   Frequency (%)
[unreadable]         1498    4.7%
[unreadable]         1251    4.0%
[unreadable]         1153    3.7%
[unreadable]         1080    3.4%
[unreadable]         833     2.6%
[unreadable]         742     2.4%
[unreadable]         712     2.3%
[unreadable]         704     2.2%
[unreadable]         694     2.2%
[unreadable]         680     2.2%
Other values (337)   22220   70.4%

Decimal Number
Value   Count   Frequency (%)
3       5       31.2%
2       5       31.2%
0       4       25.0%
1       2       12.5%

Other Punctuation
Value   Count   Frequency (%)
&       286     97.3%
/       8       2.7%

Space Separator
Value     Count   Frequency (%)
(space)   21      100.0%

Open Punctuation
Value   Count   Frequency (%)
(       21      100.0%

Close Punctuation
Value   Count   Frequency (%)
)       21      100.0%

Lowercase Letter
Value   Count   Frequency (%)
g       7       100.0%

Most occurring scripts

Value    Count   Frequency (%)
Hangul   31567   98.8%
Common   373     1.2%
Latin    7       < 0.1%

Most frequent character per script

Hangul
Value                Count   Frequency (%)
[unreadable]         1498    4.7%
[unreadable]         1251    4.0%
[unreadable]         1153    3.7%
[unreadable]         1080    3.4%
[unreadable]         833     2.6%
[unreadable]         742     2.4%
[unreadable]         712     2.3%
[unreadable]         704     2.2%
[unreadable]         694     2.2%
[unreadable]         680     2.2%
Other values (337)   22220   70.4%

Common
Value     Count   Frequency (%)
&         286     76.7%
(space)   21      5.6%
(         21      5.6%
)         21      5.6%
/         8       2.1%
3         5       1.3%
2         5       1.3%
0         4       1.1%
1         2       0.5%

Latin
Value   Count   Frequency (%)
g       7       100.0%

Most occurring blocks

Value    Count   Frequency (%)
Hangul   31567   98.8%
ASCII    380     1.2%

Most frequent character per block

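Hangul
Value                Count   Frequency (%)
[unreadable]         1498    4.7%
[unreadable]         1251    4.0%
[unreadable]         1153    3.7%
[unreadable]         1080    3.4%
[unreadable]         833     2.6%
[unreadable]         742     2.4%
[unreadable]         712     2.3%
[unreadable]         704     2.2%
[unreadable]         694     2.2%
[unreadable]         680     2.2%
Other values (337)   22220   70.4%

ASCII
Value     Count   Frequency (%)
&         286     75.3%
(space)   21      5.5%
(         21      5.5%
)         21      5.5%
/         8       2.1%
g         7       1.8%
3         5       1.3%
2         5       1.3%
0         4       1.1%
1         2       0.5%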

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

index   일자         메뉴구분   메뉴명
0       2023-01-01   조식       쇠고기미역국
1       2023-01-01   조식       새우젓계란찜
2       2023-01-01   조식       미트볼칠리소스조림
3       2023-01-01   조식       숙주부추나물
4       2023-01-01   조식       배추김치
5       2023-01-01   중식       삼색수제비국
6       2023-01-01   중식       오리훈제버섯볶음
7       2023-01-01   중식       멸치꽈리고추조림
8       2023-01-01   중식       톳콩나물초무침
9       2023-01-01   중식       배추김치

index   일자         메뉴구분   메뉴명
5502    2023-12-31   중식       조갯살미역국
5503    2023-12-31   중식       쇠고기낙지야채볶음
5504    2023-12-31   중식       쑥갓두부무침
5505    2023-12-31   중식       배추김치
5506    2023-12-31   중식       오복채무침
5507    2023-12-31   석식       홍고추콩나물국
5508    2023-12-31   석식       순살크런치생선까스&소스
5509    2023-12-31   석식       꽃맛살마카로니샐러드
5510    2023-12-31   석식       열무김치
5511    2023-12-31   석식       쌈배추나물

Duplicate rows

Most frequently occurring

index   일자         메뉴구분   메뉴명     # duplicates
0       2023-08-12   중식       배추김치   2
1       2023-11-01   석식       배추김치   2
2       2023-11-02   중식       배추김치   2
3       2023-11-03   석식       배추김치   2
4       2023-11-06   중식       배추김치   2
5       2023-11-07   중식       배추김치   2
6       2023-11-08   석식       배추김치   2
7       2023-11-09   석식       배추김치   2
8       2023-11-10   중식       배추김치   2
9       2023-11-13   중식       배추김치   2
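
A table like this one can be approximated by counting identical (일자, 메뉴구분, 메뉴명) tuples. The sketch below uses only the standard library; the helper name `duplicate_rows` is hypothetical, and the three rows are a small illustration built from values that appear in this report, not the full dataset.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return {row: count} for every row that occurs more than once."""
    counts = Counter(rows)
    return {row: n for row, n in counts.items() if n > 1}


# Rows shaped like (일자, 메뉴구분, 메뉴명); the first two are identical.
rows = [
    ("2023-08-12", "중식", "배추김치"),
    ("2023-08-12", "중식", "배추김치"),
    ("2023-01-01", "중식", "배추김치"),
]

for row, n in duplicate_rows(rows).items():
    print(*row, n)
```

Rows count as duplicates only when all three fields match, which is why the same menu item served on a different date is not flagged.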