Overview

Dataset statistics

Number of variables: 11
Number of observations: 21
Missing cells: 36
Missing cells (%): 15.6%
Duplicate rows: 1
Duplicate rows (%): 4.8%
Total size in memory: 1.9 KiB
Average record size in memory: 94.3 B
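
The two percentages can be checked from the raw counts. The sketch below assumes the missing-cell percentage is taken over all 11 × 21 cells and the duplicate percentage over the 21 observations; the helper name `pct` is illustrative and not part of ydata-profiling.

```python
def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)

n_vars, n_obs = 11, 21           # taken from the statistics above
missing_cells, dup_rows = 36, 1  # taken from the statistics above

missing_pct = pct(missing_cells, n_vars * n_obs)  # share of all cells
dup_pct = pct(dup_rows, n_obs)                    # share of all rows
print(missing_pct, dup_pct)
```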

Variable types

Text: 2
Categorical: 2
Unsupported: 7

Alerts

Dataset has 1 (4.8%) duplicate rows [Duplicates]
비고 is highly overall correlated with 설립 [High correlation]
설립 is highly overall correlated with 비고 [High correlation]
학교명 has 3 (14.3%) missing values [Missing]
학급수 has 5 (23.8%) missing values [Missing]
사 업 비 has 4 (19.0%) missing values [Missing]
Unnamed: 4 has 4 (19.0%) missing values [Missing]
Unnamed: 5 has 4 (19.0%) missing values [Missing]
학 생 수 has 4 (19.0%) missing values [Missing]
Unnamed: 7 has 4 (19.0%) missing values [Missing]
Unnamed: 8 has 4 (19.0%) missing values [Missing]
기숙사정원/ has 4 (19.0%) missing values [Missing]
학급수 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
사 업 비 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 4 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
학 생 수 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
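
One plausible reason for the "unsupported type" alerts is that these columns mix numbers with header fragments and placeholders. The sketch below shows one way such a column might be cleaned before re-profiling. It is an assumption about the cause, not something the report states; `to_number` is a hypothetical helper, and the example values are illustrative.

```python
from typing import Optional

def to_number(value: object) -> Optional[float]:
    """Convert a cell to float, or None when it holds no usable number."""
    if value is None:
        return None
    # Drop thousands separators and surrounding whitespace before parsing.
    text = str(value).replace(",", "").strip()
    try:
        return float(text)
    except ValueError:
        # Header fragments such as "도비", placeholders such as "-", blanks.
        return None

# Illustrative mixed column (hypothetical values, not the full dataset).
column = [None, 188, "도비", "-", " ", "1,466"]
cleaned = [to_number(v) for v in column]
print(cleaned)
```

After a step like this, a column holds only numbers and explicit missing markers, which a profiler can usually treat as numeric.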

Reproduction

Analysis started: 2024-03-13 23:50:15.832513
Analysis finished: 2024-03-13 23:50:16.334665
Duration: 0.5 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

학교명
Text

MISSING 

Distinct: 17
Distinct (%): 94.4%
Missing: 3
Missing (%): 14.3%
Memory size: 300.0 B

Length

Max length: 9
Median length: 3
Mean length: 3.7777778
Min length: 3

Characters and Unicode

Total characters: 68
Distinct characters: 35
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
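
As a rough illustration of these character properties, the sketch below counts Unicode general categories using Python's standard `unicodedata` module. The function name `category_counts` is hypothetical, and the three input strings are taken from this variable's sample values. The standard library exposes categories (for example `Lo` for Other Letter, `Nd` for Decimal Number, `Zs` for Space Separator) but not scripts or blocks, so those breakdowns are not reproduced here.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters in the strings."""
    return Counter(unicodedata.category(ch) for text in values for ch in text)

# Three of the sample values of this variable.
counts = category_counts(["13개교", "2008년 8개교", "한별고"])
print(counts)
```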

Unique

Unique: 16
Unique (%): 88.9%

Sample

1st row: 13개교
2nd row: 2008년 8개교
3rd row: 한별고
4th row: 진 안
5th row: 제일고

| Value | Count | Frequency (%) |
| 제일고 | 2 | 9.1% |
|  | 1 | 4.5% |
| 고창고 | 1 | 4.5% |
| 성원고 | 1 | 4.5% |
| 남원고 | 1 | 4.5% |
| 호남고 | 1 | 4.5% |
| 정읍고 | 1 | 4.5% |
| 5개교 | 1 | 4.5% |
| 2009년 | 1 | 4.5% |
| 부안고 | 1 | 4.5% |
| Other values (11) | 11 | 50.0% |

Most occurring characters

| Value | Count | Frequency (%) |
|  | 14 | 20.6% |
|  | 4 | 5.9% |
| 0 | 4 | 5.9% |
|  | 3 | 4.4% |
|  | 3 | 4.4% |
|  | 3 | 4.4% |
|  | 2 | 2.9% |
| 8 | 2 | 2.9% |
|  | 2 | 2.9% |
| 2 | 2 | 2.9% |
| Other values (25) | 29 | 42.6% |

Most occurring categories

| Value | Count | Frequency (%) |
| Other Letter | 52 | 76.5% |
| Decimal Number | 12 | 17.6% |
| Space Separator | 4 | 5.9% |

Most frequent character per category

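Other Letter
| Value | Count | Frequency (%) |
|  | 14 | 26.9% |
|  | 3 | 5.8% |
|  | 3 | 5.8% |
|  | 3 | 5.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
| Other values (17) | 17 | 32.7% |

Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | 33.3% |
| 8 | 2 | 16.7% |
| 2 | 2 | 16.7% |
| 9 | 1 | 8.3% |
| 5 | 1 | 8.3% |
| 3 | 1 | 8.3% |
| 1 | 1 | 8.3% |

Space Separator
| Value | Count | Frequency (%) |
|  | 4 | 100.0% |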

Most occurring scripts

| Value | Count | Frequency (%) |
| Hangul | 52 | 76.5% |
| Common | 16 | 23.5% |

Most frequent character per script

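Hangul
| Value | Count | Frequency (%) |
|  | 14 | 26.9% |
|  | 3 | 5.8% |
|  | 3 | 5.8% |
|  | 3 | 5.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
| Other values (17) | 17 | 32.7% |

Common
| Value | Count | Frequency (%) |
|  | 4 | 25.0% |
| 0 | 4 | 25.0% |
| 8 | 2 | 12.5% |
| 2 | 2 | 12.5% |
| 9 | 1 | 6.2% |
| 5 | 1 | 6.2% |
| 3 | 1 | 6.2% |
| 1 | 1 | 6.2% |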

Most occurring blocks

| Value | Count | Frequency (%) |
| Hangul | 52 | 76.5% |
| ASCII | 16 | 23.5% |

Most frequent character per block

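Hangul
| Value | Count | Frequency (%) |
|  | 14 | 26.9% |
|  | 3 | 5.8% |
|  | 3 | 5.8% |
|  | 3 | 5.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
|  | 2 | 3.8% |
| Other values (17) | 17 | 32.7% |

ASCII
| Value | Count | Frequency (%) |
|  | 4 | 25.0% |
| 0 | 4 | 25.0% |
| 8 | 2 | 12.5% |
| 2 | 2 | 12.5% |
| 9 | 1 | 6.2% |
| 5 | 1 | 6.2% |
| 3 | 1 | 6.2% |
| 1 | 1 | 6.2% |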

설립
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 38.1%
Missing: 0
Missing (%): 0.0%
Memory size: 300.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.6190476
Min length: 2

Unique

Unique: 5
Unique (%): 23.8%

Sample

1st row: <NA>
2nd row: 공립11
3rd row: 사립 2
4th row: 공립8
5th row: 공립

Common Values

| Value | Count | Frequency (%) |
| 공립 | 11 | 52.4% |
| <NA> | 3 | 14.3% |
| 사립 | 2 | 9.5% |
| 공립11 | 1 | 4.8% |
| 사립 2 | 1 | 4.8% |
| 공립8 | 1 | 4.8% |
| 공립3 | 1 | 4.8% |
| 사립2 | 1 | 4.8% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
| 공립 | 11 | 50.0% |
| na | 3 | 13.6% |
| 사립 | 3 | 13.6% |
| 공립11 | 1 | 4.5% |
| 2 | 1 | 4.5% |
| 공립8 | 1 | 4.5% |
| 공립3 | 1 | 4.5% |
| 사립2 | 1 | 4.5% |

학급수
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 5
Missing (%): 23.8%
Memory size: 300.0 B

사 업 비
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 4
Missing (%): 19.0%
Memory size: 300.0 B

Unnamed: 4
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 4
Missing (%): 19.0%
Memory size: 300.0 B

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 4
Missing (%): 19.0%
Memory size: 300.0 B

학 생 수
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 4
Missing (%): 19.0%
Memory size: 300.0 B

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 4
Missing (%): 19.0%
Memory size: 300.0 B

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 4
Missing (%): 19.0%
Memory size: 300.0 B

기숙사정원/
Text

MISSING 

Distinct: 17
Distinct (%): 100.0%
Missing: 4
Missing (%): 19.0%
Memory size: 300.0 B

Length

Max length: 9
Median length: 6
Mean length: 6.2352941
Min length: 2

Characters and Unicode

Total characters: 106
Distinct characters: 21
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 17
Unique (%): 100.0%

Sample

1st row: 급식비지원대상
2nd row: 1,942/681
3rd row: 1,090/418
4th row: 120/53
5th row: 72/24

| Value | Count | Frequency (%) |
| 급식비지원대상 | 1 | 6.2% |
| 1,090/418 | 1 | 6.2% |
| 120/53 | 1 | 6.2% |
| 72/24 | 1 | 6.2% |
| 144/50 | 1 | 6.2% |
| 120/54 | 1 | 6.2% |
| 112/40 | 1 | 6.2% |
| 1,942/681 | 1 | 6.2% |
| 242/120 | 1 | 6.2% |
| 852/263 | 1 | 6.2% |
| Other values (6) | 6 | 37.5% |

Most occurring characters

| Value | Count | Frequency (%) |
| 1 | 15 | 14.2% |
| / | 15 | 14.2% |
| 2 | 14 | 13.2% |
| 4 | 13 | 12.3% |
| 0 | 11 | 10.4% |
| 5 | 8 | 7.5% |
| 6 | 5 | 4.7% |
| 8 | 4 | 3.8% |
| 7 | 4 | 3.8% |
| 3 | 4 | 3.8% |
| Other values (11) | 13 | 12.3% |

Most occurring categories

| Value | Count | Frequency (%) |
| Decimal Number | 80 | 75.5% |
| Other Punctuation | 17 | 16.0% |
| Other Letter | 7 | 6.6% |
| Space Separator | 2 | 1.9% |

Most frequent character per category

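Decimal Number
| Value | Count | Frequency (%) |
| 1 | 15 | 18.8% |
| 2 | 14 | 17.5% |
| 4 | 13 | 16.2% |
| 0 | 11 | 13.8% |
| 5 | 8 | 10.0% |
| 6 | 5 | 6.2% |
| 8 | 4 | 5.0% |
| 7 | 4 | 5.0% |
| 3 | 4 | 5.0% |
| 9 | 2 | 2.5% |

Other Letter
| Value | Count | Frequency (%) |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |

Other Punctuation
| Value | Count | Frequency (%) |
| / | 15 | 88.2% |
| , | 2 | 11.8% |

Space Separator
| Value | Count | Frequency (%) |
|  | 1 | 50.0% |
|  | 1 | 50.0% |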

Most occurring scripts

| Value | Count | Frequency (%) |
| Common | 99 | 93.4% |
| Hangul | 7 | 6.6% |

Most frequent character per script

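Common
| Value | Count | Frequency (%) |
| 1 | 15 | 15.2% |
| / | 15 | 15.2% |
| 2 | 14 | 14.1% |
| 4 | 13 | 13.1% |
| 0 | 11 | 11.1% |
| 5 | 8 | 8.1% |
| 6 | 5 | 5.1% |
| 8 | 4 | 4.0% |
| 7 | 4 | 4.0% |
| 3 | 4 | 4.0% |
| Other values (4) | 6 | 6.1% |

Hangul
| Value | Count | Frequency (%) |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |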

Most occurring blocks

| Value | Count | Frequency (%) |
| ASCII | 98 | 92.5% |
| Hangul | 7 | 6.6% |
| None | 1 | 0.9% |

Most frequent character per block

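ASCII
| Value | Count | Frequency (%) |
| 1 | 15 | 15.3% |
| / | 15 | 15.3% |
| 2 | 14 | 14.3% |
| 4 | 13 | 13.3% |
| 0 | 11 | 11.2% |
| 5 | 8 | 8.2% |
| 6 | 5 | 5.1% |
| 8 | 4 | 4.1% |
| 7 | 4 | 4.1% |
| 3 | 4 | 4.1% |
| Other values (3) | 5 | 5.1% |

None
| Value | Count | Frequency (%) |
|  | 1 | 100.0% |

Hangul
| Value | Count | Frequency (%) |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |
|  | 1 | 14.3% |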

비고
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 9.5%
Missing: 0
Missing (%): 0.0%
Memory size: 300.0 B

Length

Max length: 4
Median length: 2
Mean length: 2.4761905
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row:
3rd row: <NA>
4th row:
5th row:

Common Values

| Value | Count | Frequency (%) |
|  | 16 | 76.2% |
| <NA> | 5 | 23.8% |

Length

Histogram of lengths of the category

Common Values (Plot)

| Value | Count | Frequency (%) |
| na | 5 | 100.0% |

Correlations

|  | 학교명 | 설립 | 기숙사정원/ |
| 학교명 | 1.000 | 1.000 | 1.000 |
| 설립 | 1.000 | 1.000 | 1.000 |
| 기숙사정원/ | 1.000 | 1.000 | 1.000 |

|  | 비고 | 설립 |
| 비고 | 1.000 | 1.000 |
| 설립 | 1.000 | 1.000 |

|  | 설립 | 비고 |
| 설립 | 1.000 | 1.000 |
| 비고 | 1.000 | 1.000 |

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
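
To make the idea of nullity correlation concrete, the sketch below computes a Pearson correlation between two 0/1 missingness indicators. The function name `nullity_correlation` is hypothetical. The row positions are an assumption read off the sample rows and missing counts in this report (학급수 missing in rows 0, 2, 6, 11, 15 and 사 업 비 missing in rows 2, 6, 11, 15). It is not necessarily how the heatmap itself was computed.

```python
from math import sqrt

def nullity_correlation(missing_a, missing_b):
    """Pearson correlation between two 0/1 missingness indicators.

    Raises ZeroDivisionError if either column is never (or always) missing.
    """
    n = len(missing_a)
    mean_a = sum(missing_a) / n
    mean_b = sum(missing_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(missing_a, missing_b))
    var_a = sum((a - mean_a) ** 2 for a in missing_a)
    var_b = sum((b - mean_b) ** 2 for b in missing_b)
    return cov / sqrt(var_a * var_b)

rows = range(21)  # 21 observations
# Assumed missing positions, inferred from the sample rows of this report.
a = [1 if i in {0, 2, 6, 11, 15} else 0 for i in rows]  # 학급수
b = [1 if i in {2, 6, 11, 15} else 0 for i in rows]     # 사 업 비
r = nullity_correlation(a, b)
print(round(r, 3))
```

A value close to 1 means the two columns tend to be missing in the same rows.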

Sample

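First rows

|  | 학교명 | 설립 | 학급수 | 사 업 비 | Unnamed: 4 | Unnamed: 5 | 학 생 수 | Unnamed: 7 | Unnamed: 8 | 기숙사정원/ | 비고 |
| 0 | <NA> | <NA> | NaN |  | 도비 | 시군비 |  |  |  | 급식비지원대상 | <NA> |
| 1 | 13개교 | 공립11 | 188 | 954000 | 286000 | 668000 | 5076 | 3610 | 1466 | 1,942/681 |  |
| 2 | <NA> | 사립 2 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> | <NA> |
| 3 | 2008년 8개교 | 공립8 | 97 | 522000 | 156460 | 365540 | 2556 | 1633 | 923 | 1,090/418 |  |
| 4 | 한별고 | 공립 | 17 | 73800 | 22140 | 51660 | 447 | 0 | 447 | 120/53 |  |
| 5 | 진 안 | 공립 | 6 | 34200 | 10260 | 23940 | 168 | 93 | 75 | 72/24 |  |
| 6 | 제일고 | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> | <NA> |
| 7 | 무주고 | 공립 | 13 | 66600 | 19980 | 46620 | 325 | 166 | 159 | 144/50 |  |
| 8 | 장수고 | 공립 | 12 | 55800 | 16740 | 39060 | 301 | 170 | 131 | 120/54 |  |
| 9 | 임실고 | 공립 | 9 | 52200 | 15660 | 36540 | 243 | 132 | 111 | 112/40 |  |

Last rows

|  | 학교명 | 설립 | 학급수 | 사 업 비 | Unnamed: 4 | Unnamed: 5 | 학 생 수 | Unnamed: 7 | Unnamed: 8 | 기숙사정원/ | 비고 |
| 11 | 제일고 | <NA> | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> | <NA> |
| 12 | 고창고 | 공립 | 22 | 127800 | 38140 | 89660 | 598 | 598 | 0 | 280/77 |  |
| 13 | 부안고 | 공립 | 18 | 111600 | 33480 | 78120 | 474 | 474 | - | 242/120 |  |
| 14 | 2009년 5개교 | 공립3 | 91 | 432000 | 129540 | 302460 | 2520 | 1977 | 543 | 852/263 |  |
| 15 | <NA> | 사립2 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | <NA> | <NA> |
| 16 | 정읍고 | 공립 | 15 | 99000 | 29700 | 69300 | 428 | 428 |  | 162/33 |  |
| 17 | 호남고 | 사립 | 21 | 115200 | 34500 | 80640 | 599 | 599 |  | 246/64 |  |
| 18 | 남원고 | 공립 | 18 | 67853 | 20356 | 47497 | 531 | 531 |  | 144/71 |  |
| 19 | 성원고 | 사립 | 15 | 68947 | 20684 | 48263 | 419 | 419 |  | 150/45 |  |
| 20 | 김제여고 | 공립 | 22 | 81000 | 24300 | 56700 | 543 | 0 | 543 | 150/50 |  |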

Duplicate rows

Most frequently occurring

|  | 학교명 | 설립 | 기숙사정원/ | 비고 | # duplicates |
| 0 | 제일고 | <NA> | <NA> | <NA> | 2 |
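
A duplicate count like the one above can be reproduced by tallying identical rows. The sketch below is a minimal stand-in that uses `None` for missing values and a hypothetical helper `duplicate_rows`. The three example rows are illustrative, restricted to the four columns shown in the table, and this is not the profiler's own implementation.

```python
from collections import Counter

def duplicate_rows(rows):
    """Return each row that occurs more than once, with its occurrence count."""
    counts = Counter(tuple(row) for row in rows)
    return {row: n for row, n in counts.items() if n > 1}

# Illustrative rows (학교명, 설립, 기숙사정원/, 비고); None stands for a missing value.
rows = [
    ("제일고", None, None, None),
    ("무주고", "공립", "144/50", " "),
    ("제일고", None, None, None),
]
duplicates = duplicate_rows(rows)
print(duplicates)
```

Using `None` rather than a floating-point NaN matters here, because separately created NaN values do not compare equal to each other and would prevent otherwise identical rows from matching.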