Overview

Dataset statistics

Number of variables7
Number of observations24
Missing cells66
Missing cells (%)39.3%
Duplicate rows1
Duplicate rows (%)4.2%
Total size in memory1.4 KiB
Average record size in memory61.5 B

Variable types

Text3
Unsupported4

Dataset

Description홍성군 학교급식(식재료, 지원규모 등) 현황
Author충청남도 홍성군
URLhttps://www.data.go.kr/data/15011541/fileData.do

Alerts

Unnamed: 6 has constant value ""Constant
Dataset has 1 (4.2%) duplicate rowsDuplicates
2014년 학교급식 식재료 공급현황(3~12월) has 11 (45.8%) missing valuesMissing
Unnamed: 1 has 7 (29.2%) missing valuesMissing
Unnamed: 2 has 8 (33.3%) missing valuesMissing
Unnamed: 3 has 5 (20.8%) missing valuesMissing
Unnamed: 4 has 8 (33.3%) missing valuesMissing
Unnamed: 5 has 4 (16.7%) missing valuesMissing
Unnamed: 6 has 23 (95.8%) missing valuesMissing
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 3 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 10:57:09.803975
Analysis finished2023-12-12 10:57:10.753825
Duration0.95 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct13
Distinct (%)100.0%
Missing11
Missing (%)45.8%
Memory size324.0 B
2023-12-12T19:57:10.960222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length17
Mean length9.6923077
Min length1

Characters and Unicode

Total characters126
Distinct characters49
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)100.0%

Sample

1st row(총 괄)
2nd row품 목 군 별
3rd row대 분 류 (공급량 비율)
4th row공산품(41.6%)
5th row농산물(40.2%)
ValueCountFrequency (%)
지원 2
 
7.1%
2
 
7.1%
2
 
7.1%
1
 
3.6%
축산물(14.4 1
 
3.6%
무상급식 1
 
3.6%
1
 
3.6%
지원규모 1
 
3.6%
학교급식 1
 
3.6%
2014년 1
 
3.6%
Other values (15) 15
53.6%
2023-12-12T19:57:11.516656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
27.8%
( 6
 
4.8%
) 6
 
4.8%
4 5
 
4.0%
% 4
 
3.2%
. 4
 
3.2%
4
 
3.2%
3
 
2.4%
3
 
2.4%
3
 
2.4%
Other values (39) 53
42.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54
42.9%
Space Separator 35
27.8%
Decimal Number 15
 
11.9%
Other Punctuation 9
 
7.1%
Open Punctuation 6
 
4.8%
Close Punctuation 6
 
4.8%
Control 1
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4
 
7.4%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
2
 
3.7%
2
 
3.7%
2
 
3.7%
Other values (25) 26
48.1%
Decimal Number
ValueCountFrequency (%)
4 5
33.3%
1 3
20.0%
2 2
 
13.3%
0 2
 
13.3%
8 1
 
6.7%
3 1
 
6.7%
6 1
 
6.7%
Other Punctuation
ValueCountFrequency (%)
% 4
44.4%
. 4
44.4%
1
 
11.1%
Space Separator
ValueCountFrequency (%)
35
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 6
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 72
57.1%
Hangul 54
42.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4
 
7.4%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
2
 
3.7%
2
 
3.7%
2
 
3.7%
Other values (25) 26
48.1%
Common
ValueCountFrequency (%)
35
48.6%
( 6
 
8.3%
) 6
 
8.3%
4 5
 
6.9%
% 4
 
5.6%
. 4
 
5.6%
1 3
 
4.2%
2 2
 
2.8%
0 2
 
2.8%
1
 
1.4%
Other values (4) 4
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 71
56.3%
Hangul 54
42.9%
Punctuation 1
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
35
49.3%
( 6
 
8.5%
) 6
 
8.5%
4 5
 
7.0%
% 4
 
5.6%
. 4
 
5.6%
1 3
 
4.2%
2 2
 
2.8%
0 2
 
2.8%
8 1
 
1.4%
Other values (3) 3
 
4.2%
Hangul
ValueCountFrequency (%)
4
 
7.4%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
2
 
3.7%
2
 
3.7%
2
 
3.7%
Other values (25) 26
48.1%
Punctuation
ValueCountFrequency (%)
1
100.0%

Unnamed: 1
Text

MISSING 

Distinct17
Distinct (%)100.0%
Missing7
Missing (%)29.2%
Memory size324.0 B
2023-12-12T19:57:11.819186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length7
Mean length4.6470588
Min length2

Characters and Unicode

Total characters79
Distinct characters46
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)100.0%

Sample

1st row중 분 류
2nd row가공식품
3rd row채소류
4th row주곡
5th row잡곡
ValueCountFrequency (%)
2
 
8.3%
2
 
8.3%
1
 
4.2%
닭고기 1
 
4.2%
초·중학교 1
 
4.2%
지원대상 1
 
4.2%
1
 
4.2%
1
 
4.2%
1
 
4.2%
달걀,메추리알 1
 
4.2%
Other values (12) 12
50.0%
2023-12-12T19:57:12.357585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
16.5%
5
 
6.3%
4
 
5.1%
4
 
5.1%
3
 
3.8%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
2
 
2.5%
Other values (36) 40
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 60
75.9%
Space Separator 13
 
16.5%
Other Punctuation 3
 
3.8%
Close Punctuation 1
 
1.3%
Control 1
 
1.3%
Open Punctuation 1
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5
 
8.3%
4
 
6.7%
4
 
6.7%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (30) 32
53.3%
Other Punctuation
ValueCountFrequency (%)
, 2
66.7%
· 1
33.3%
Space Separator
ValueCountFrequency (%)
13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 60
75.9%
Common 19
 
24.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5
 
8.3%
4
 
6.7%
4
 
6.7%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (30) 32
53.3%
Common
ValueCountFrequency (%)
13
68.4%
, 2
 
10.5%
· 1
 
5.3%
) 1
 
5.3%
1
 
5.3%
( 1
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 60
75.9%
ASCII 18
 
22.8%
None 1
 
1.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13
72.2%
, 2
 
11.1%
) 1
 
5.6%
1
 
5.6%
( 1
 
5.6%
Hangul
ValueCountFrequency (%)
5
 
8.3%
4
 
6.7%
4
 
6.7%
3
 
5.0%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
2
 
3.3%
Other values (30) 32
53.3%
None
ValueCountFrequency (%)
· 1
100.0%

Unnamed: 2
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing8
Missing (%)33.3%
Memory size324.0 B

Unnamed: 3
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing5
Missing (%)20.8%
Memory size324.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing8
Missing (%)33.3%
Memory size324.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing4
Missing (%)16.7%
Memory size324.0 B

Unnamed: 6
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing23
Missing (%)95.8%
Memory size324.0 B
2023-12-12T19:57:12.527665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters2
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row비고
ValueCountFrequency (%)
비고 1
100.0%
2023-12-12T19:57:12.862523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
50.0%
1
50.0%

Correlations

2023-12-12T19:57:13.014496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2014년 학교급식 식재료 공급현황(3~12월)Unnamed: 1
2014년 학교급식 식재료 공급현황(3~12월)1.0001.000
Unnamed: 11.0001.000

Missing values

2023-12-12T19:57:10.101559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:57:10.334814image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T19:57:10.577750image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

2014년 학교급식 식재료 공급현황(3~12월)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
0(총 괄)<NA>NaNNaNNaN(단위: kg,원)<NA>
1품 목 군 별<NA>공 급 량NaN공 급 액NaN비고
2대 분 류 (공급량 비율)중 분 류공 급 량비율공 급 액비율<NA>
3공산품(41.6%)가공식품523526.099641.6236080462036.7<NA>
4농산물(40.2%)채소류267075.921.213222101290749515.754048<NA>
5<NA>주곡191108.515.1793075913638189.197655<NA>
6<NA>잡곡5829.80.463048515864260.802339<NA>
7<NA>과일42127.73.3461062176120763.384584<NA>
8<NA>소 계506141.940.201682187346981529.2<NA>
9축산물(14.4%)소고기20495.61.6279185039380467.8<NA>
2014년 학교급식 식재료 공급현황(3~12월)Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
14<NA>소 계181565.5814.4170436223926.5<NA>
15수산물(3.8%)수 산 물47773.1833.84908697177.6<NA>
16합 계<NA>1259006.7626100.06429506391100<NA>
17<NA><NA>NaNNaNNaNNaN<NA>
18<NA><NA>NaNNaNNaNNaN<NA>
19※ 2014년 학교급식 지원규모<NA>NaNNaNNaNNaN<NA>
20구 분지원대상NaN개교/학생수NaN지원액<NA>
21<NA>NaN72개교/11,970명NaN3,582백만원<NA>
22무상급식 지원초·중학교NaN36개교/7,588명NaN3,189백만원<NA>
23친환경식품비 지원유치원,고등학교NaN36개교/4,382명NaN393백만원<NA>

Duplicate rows

Most frequently occurring

2014년 학교급식 식재료 공급현황(3~12월)Unnamed: 1Unnamed: 6# duplicates
0<NA><NA><NA>2