Overview

Dataset statistics

Number of variables7
Number of observations66
Missing cells31
Missing cells (%)6.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.7 KiB
Average record size in memory58.0 B

Variable types

Unsupported1
Categorical1
Text5

Dataset

Description세종특별자치시교육청 관내 학교급별 현황 데이터로 2022년 9월 1일 기준으로 각 학교별 구분, 기관명, 전화번호, 팩스번호, 주소, 홈페이지를 제공하고 있습니다.
Author세종특별자치시교육청
URLhttps://www.data.go.kr/data/15050938/fileData.do

Alerts

Unnamed: 1 is highly imbalanced (74.9%)Imbalance
유치원 현황 has 2 (3.0%) missing valuesMissing
Unnamed: 2 has 2 (3.0%) missing valuesMissing
Unnamed: 3 has 2 (3.0%) missing valuesMissing
Unnamed: 4 has 21 (31.8%) missing valuesMissing
Unnamed: 5 has 2 (3.0%) missing valuesMissing
Unnamed: 6 has 2 (3.0%) missing valuesMissing
유치원 현황 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 22:28:51.045527
Analysis finished2023-12-12 22:28:51.712105
Duration0.67 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

유치원 현황
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing2
Missing (%)3.0%
Memory size660.0 B

Unnamed: 1
Categorical

IMBALANCE 

Distinct4
Distinct (%)6.1%
Missing0
Missing (%)0.0%
Memory size660.0 B
공립
61 
<NA>
 
2
사립
 
2
구분
 
1

Length

Max length4
Median length2
Mean length2.0606061
Min length2

Unique

Unique1 ?
Unique (%)1.5%

Sample

1st row<NA>
2nd row<NA>
3rd row구분
4th row공립
5th row공립

Common Values

ValueCountFrequency (%)
공립 61
92.4%
<NA> 2
 
3.0%
사립 2
 
3.0%
구분 1
 
1.5%

Length

2023-12-13T07:28:51.791313image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T07:28:51.892876image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공립 61
92.4%
na 2
 
3.0%
사립 2
 
3.0%
구분 1
 
1.5%

Unnamed: 2
Text

MISSING 

Distinct64
Distinct (%)100.0%
Missing2
Missing (%)3.0%
Memory size660.0 B
2023-12-13T07:28:52.087905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length5
Mean length6.984375
Min length3

Characters and Unicode

Total characters447
Distinct characters92
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)100.0%

Sample

1st row기관명
2nd row조치원대동초등학교병설유치원
3rd row조치원명동초등학교병설유치원
4th row조치원교동초등학교병설유치원
5th row조치원신봉초등학교병설유치원
ValueCountFrequency (%)
기관명 1
 
1.6%
조치원대동초등학교병설유치원 1
 
1.6%
으뜸유치원 1
 
1.6%
올망유치원 1
 
1.6%
종촌유치원 1
 
1.6%
도란유치원 1
 
1.6%
다빛유치원 1
 
1.6%
늘봄유치원 1
 
1.6%
초롱별유치원 1
 
1.6%
가락유치원 1
 
1.6%
Other values (54) 54
84.4%
2023-12-13T07:28:52.683066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
68
15.2%
67
15.0%
63
14.1%
20
 
4.5%
19
 
4.3%
19
 
4.3%
19
 
4.3%
18
 
4.0%
18
 
4.0%
6
 
1.3%
Other values (82) 130
29.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 447
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
68
15.2%
67
15.0%
63
14.1%
20
 
4.5%
19
 
4.3%
19
 
4.3%
19
 
4.3%
18
 
4.0%
18
 
4.0%
6
 
1.3%
Other values (82) 130
29.1%

Most occurring scripts

ValueCountFrequency (%)
Hangul 447
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
68
15.2%
67
15.0%
63
14.1%
20
 
4.5%
19
 
4.3%
19
 
4.3%
19
 
4.3%
18
 
4.0%
18
 
4.0%
6
 
1.3%
Other values (82) 130
29.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 447
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
68
15.2%
67
15.0%
63
14.1%
20
 
4.5%
19
 
4.3%
19
 
4.3%
19
 
4.3%
18
 
4.0%
18
 
4.0%
6
 
1.3%
Other values (82) 130
29.1%

Unnamed: 3
Text

MISSING 

Distinct64
Distinct (%)100.0%
Missing2
Missing (%)3.0%
Memory size660.0 B
2023-12-13T07:28:52.929469image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.9375
Min length4

Characters and Unicode

Total characters508
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)100.0%

Sample

1st row전화번호
2nd row861-4605
3rd row862-5452
4th row866-6169
5th row902-1470
ValueCountFrequency (%)
전화번호 1
 
1.6%
861-4605 1
 
1.6%
716-6103 1
 
1.6%
716-6401 1
 
1.6%
903-2952 1
 
1.6%
903-2801 1
 
1.6%
903-3801 1
 
1.6%
903-2401 1
 
1.6%
903-2601 1
 
1.6%
903-1302 1
 
1.6%
Other values (54) 54
84.4%
2023-12-13T07:28:53.313794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 99
19.5%
- 63
12.4%
9 63
12.4%
2 51
10.0%
6 49
9.6%
8 41
8.1%
1 36
 
7.1%
5 31
 
6.1%
3 28
 
5.5%
7 26
 
5.1%
Other values (5) 21
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 441
86.8%
Dash Punctuation 63
 
12.4%
Other Letter 4
 
0.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 99
22.4%
9 63
14.3%
2 51
11.6%
6 49
11.1%
8 41
9.3%
1 36
 
8.2%
5 31
 
7.0%
3 28
 
6.3%
7 26
 
5.9%
4 17
 
3.9%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 63
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 504
99.2%
Hangul 4
 
0.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 99
19.6%
- 63
12.5%
9 63
12.5%
2 51
10.1%
6 49
9.7%
8 41
8.1%
1 36
 
7.1%
5 31
 
6.2%
3 28
 
5.6%
7 26
 
5.2%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 504
99.2%
Hangul 4
 
0.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 99
19.6%
- 63
12.5%
9 63
12.5%
2 51
10.1%
6 49
9.7%
8 41
8.1%
1 36
 
7.1%
5 31
 
6.2%
3 28
 
5.6%
7 26
 
5.2%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 4
Text

MISSING 

Distinct45
Distinct (%)100.0%
Missing21
Missing (%)31.8%
Memory size660.0 B
2023-12-13T07:28:53.553889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length7.9111111
Min length4

Characters and Unicode

Total characters356
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique45 ?
Unique (%)100.0%

Sample

1st row팩스번호
2nd row864-1795
3rd row410-4523
4th row715-5410
5th row865-3752
ValueCountFrequency (%)
868-3263 1
 
2.2%
999-5129 1
 
2.2%
903-0123 1
 
2.2%
903-0243 1
 
2.2%
999-7939 1
 
2.2%
999-7823 1
 
2.2%
999-6813 1
 
2.2%
999-5823 1
 
2.2%
999-6229 1
 
2.2%
999-7329 1
 
2.2%
Other values (35) 35
77.8%
2023-12-13T07:28:53.989514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9 68
19.1%
- 44
12.4%
2 42
11.8%
0 33
9.3%
3 31
8.7%
6 30
8.4%
1 30
8.4%
7 23
 
6.5%
5 20
 
5.6%
8 17
 
4.8%
Other values (5) 18
 
5.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 308
86.5%
Dash Punctuation 44
 
12.4%
Other Letter 4
 
1.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
9 68
22.1%
2 42
13.6%
0 33
10.7%
3 31
10.1%
6 30
9.7%
1 30
9.7%
7 23
 
7.5%
5 20
 
6.5%
8 17
 
5.5%
4 14
 
4.5%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 44
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 352
98.9%
Hangul 4
 
1.1%

Most frequent character per script

Common
ValueCountFrequency (%)
9 68
19.3%
- 44
12.5%
2 42
11.9%
0 33
9.4%
3 31
8.8%
6 30
8.5%
1 30
8.5%
7 23
 
6.5%
5 20
 
5.7%
8 17
 
4.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 352
98.9%
Hangul 4
 
1.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9 68
19.3%
- 44
12.5%
2 42
11.9%
0 33
9.4%
3 31
8.8%
6 30
8.5%
1 30
8.5%
7 23
 
6.5%
5 20
 
5.7%
8 17
 
4.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 5
Text

MISSING 

Distinct64
Distinct (%)100.0%
Missing2
Missing (%)3.0%
Memory size660.0 B
2023-12-13T07:28:54.295928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length16.53125
Min length2

Characters and Unicode

Total characters1058
Distinct characters90
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique64 ?
Unique (%)100.0%

Sample

1st row주소
2nd row세종특별자치시 조치원읍 대동학교길 11
3rd row세종특별자치시 조치원읍 조치원10길 35
4th row세종특별자치시 조치원읍 새내 18길 21
5th row세종특별자치시 조치원읍 서북부로 10
ValueCountFrequency (%)
세종특별자치시 63
29.9%
조치원읍 5
 
2.4%
연서면 5
 
2.4%
만남로 4
 
1.9%
15 3
 
1.4%
다솜1로 2
 
0.9%
다정남로 2
 
0.9%
금남면 2
 
0.9%
보람로 2
 
0.9%
33 2
 
0.9%
Other values (110) 121
57.3%
2023-12-13T07:28:54.781628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
147
13.9%
69
 
6.5%
66
 
6.2%
65
 
6.1%
65
 
6.1%
63
 
6.0%
63
 
6.0%
63
 
6.0%
1 50
 
4.7%
48
 
4.5%
Other values (80) 359
33.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 721
68.1%
Decimal Number 182
 
17.2%
Space Separator 147
 
13.9%
Dash Punctuation 8
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
 
9.6%
66
 
9.2%
65
 
9.0%
65
 
9.0%
63
 
8.7%
63
 
8.7%
63
 
8.7%
48
 
6.7%
15
 
2.1%
15
 
2.1%
Other values (68) 189
26.2%
Decimal Number
ValueCountFrequency (%)
1 50
27.5%
3 26
14.3%
2 22
12.1%
6 16
 
8.8%
4 15
 
8.2%
0 13
 
7.1%
7 12
 
6.6%
5 12
 
6.6%
9 10
 
5.5%
8 6
 
3.3%
Space Separator
ValueCountFrequency (%)
147
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 721
68.1%
Common 337
31.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
 
9.6%
66
 
9.2%
65
 
9.0%
65
 
9.0%
63
 
8.7%
63
 
8.7%
63
 
8.7%
48
 
6.7%
15
 
2.1%
15
 
2.1%
Other values (68) 189
26.2%
Common
ValueCountFrequency (%)
147
43.6%
1 50
 
14.8%
3 26
 
7.7%
2 22
 
6.5%
6 16
 
4.7%
4 15
 
4.5%
0 13
 
3.9%
7 12
 
3.6%
5 12
 
3.6%
9 10
 
3.0%
Other values (2) 14
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 721
68.1%
ASCII 337
31.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
147
43.6%
1 50
 
14.8%
3 26
 
7.7%
2 22
 
6.5%
6 16
 
4.7%
4 15
 
4.5%
0 13
 
3.9%
7 12
 
3.6%
5 12
 
3.6%
9 10
 
3.0%
Other values (2) 14
 
4.2%
Hangul
ValueCountFrequency (%)
69
 
9.6%
66
 
9.2%
65
 
9.0%
65
 
9.0%
63
 
8.7%
63
 
8.7%
63
 
8.7%
48
 
6.7%
15
 
2.1%
15
 
2.1%
Other values (68) 189
26.2%

Unnamed: 6
Text

MISSING 

Distinct45
Distinct (%)70.3%
Missing2
Missing (%)3.0%
Memory size660.0 B
2023-12-13T07:28:55.051468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length26
Mean length16.734375
Min length1

Characters and Unicode

Total characters1071
Distinct characters39
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)68.8%

Sample

1st row(2022. 9. 1. 기준)
2nd row홈페이지
3rd row·
4th row·
5th row·
ValueCountFrequency (%)
· 20
29.9%
http://mir.sjedukg.kr 1
 
1.5%
http://garak.sjedukg.kr 1
 
1.5%
http://saesaem.sjedukg.kr 1
 
1.5%
http://sodam.sjedukg.kr 1
 
1.5%
http://boram.sjedukg.kr 1
 
1.5%
http://saerom.sjedukg.kr 1
 
1.5%
http://saetteum.sjedukg.kr 1
 
1.5%
http://gadeuk.sjedukg.kr 1
 
1.5%
http://hanbit.sjedukg.kr 1
 
1.5%
Other values (38) 38
56.7%
2023-12-13T07:28:55.369798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 92
 
8.6%
t 91
 
8.5%
. 87
 
8.1%
k 87
 
8.1%
e 72
 
6.7%
s 60
 
5.6%
g 58
 
5.4%
u 56
 
5.2%
d 54
 
5.0%
r 52
 
4.9%
Other values (29) 362
33.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 812
75.8%
Other Punctuation 241
 
22.5%
Decimal Number 6
 
0.6%
Other Letter 6
 
0.6%
Space Separator 3
 
0.3%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%
Uppercase Letter 1
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 91
11.2%
k 87
10.7%
e 72
 
8.9%
s 60
 
7.4%
g 58
 
7.1%
u 56
 
6.9%
d 54
 
6.7%
r 52
 
6.4%
h 51
 
6.3%
j 47
 
5.8%
Other values (11) 184
22.7%
Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Other Punctuation
ValueCountFrequency (%)
/ 92
38.2%
. 87
36.1%
: 42
17.4%
· 20
 
8.3%
Decimal Number
ValueCountFrequency (%)
2 3
50.0%
1 1
 
16.7%
9 1
 
16.7%
0 1
 
16.7%
Space Separator
ValueCountFrequency (%)
3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
U 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 813
75.9%
Common 252
 
23.5%
Hangul 6
 
0.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 91
11.2%
k 87
10.7%
e 72
 
8.9%
s 60
 
7.4%
g 58
 
7.1%
u 56
 
6.9%
d 54
 
6.6%
r 52
 
6.4%
h 51
 
6.3%
j 47
 
5.8%
Other values (12) 185
22.8%
Common
ValueCountFrequency (%)
/ 92
36.5%
. 87
34.5%
: 42
16.7%
· 20
 
7.9%
3
 
1.2%
2 3
 
1.2%
) 1
 
0.4%
1 1
 
0.4%
9 1
 
0.4%
0 1
 
0.4%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1045
97.6%
None 20
 
1.9%
Hangul 6
 
0.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 92
 
8.8%
t 91
 
8.7%
. 87
 
8.3%
k 87
 
8.3%
e 72
 
6.9%
s 60
 
5.7%
g 58
 
5.6%
u 56
 
5.4%
d 54
 
5.2%
r 52
 
5.0%
Other values (22) 336
32.2%
None
ValueCountFrequency (%)
· 20
100.0%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Correlations

2023-12-13T07:28:55.485531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
Unnamed: 11.0001.0001.0001.0001.0000.000
Unnamed: 21.0001.0001.0001.0001.0001.000
Unnamed: 31.0001.0001.0001.0001.0001.000
Unnamed: 41.0001.0001.0001.0001.0001.000
Unnamed: 51.0001.0001.0001.0001.0001.000
Unnamed: 60.0001.0001.0001.0001.0001.000

Missing values

2023-12-13T07:28:51.410347image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T07:28:51.524237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-13T07:28:51.635514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

유치원 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
0NaN<NA><NA><NA><NA><NA>(2022. 9. 1. 기준)
1NaN<NA><NA><NA><NA><NA><NA>
2순번구분기관명전화번호팩스번호주소홈페이지
31공립조치원대동초등학교병설유치원861-4605<NA>세종특별자치시 조치원읍 대동학교길 11·
42공립조치원명동초등학교병설유치원862-5452<NA>세종특별자치시 조치원읍 조치원10길 35·
53공립조치원교동초등학교병설유치원866-6169<NA>세종특별자치시 조치원읍 새내 18길 21·
64공립조치원신봉초등학교병설유치원902-1470<NA>세종특별자치시 조치원읍 서북부로 10·
75공립연남초등학교병설유치원863-8355<NA>세종특별자치시 연기면 연기길 2·
86공립연동초등학교병설유치원864-7831<NA>세종특별자치시 연동면 청연로 606-36·
97공립부강초등학교병설유치원902-1690<NA>세종특별자치시 부강면 부강로 15·
유치원 현황Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6
5654공립한결유치원902-2600902-2629세종특별자치시 다정북로 137http://hangyeol.sjedukg.kr/
5755공립해들유치원902-2400902-2429세종특별자치시 보람서로 29http://haedeul.sjedukg.kr/
5856공립대평유치원902-7000902-7029세종특별자치시 대평1길 17http://daepyeong.sjedukg.kr/
5957공립솔빛숲유치원410-2000410-2029세종특별자치시 반곡3길 48http://solbitsup.sjedukg.kr/
6058공립반곡유치원902-7100902-7170세종특별자치시 시청대로 464http://bangokU.sjedukg.kr
6159공립해밀유치원902-0700902-0719세종특별자치시 해밀1로 75https://haemil.sjedukg.kr
6260공립나성유치원902-0500902-0529세종특별자치시 중앙공원서로 11https://naseong.sjedukg.kr
6361공립집현유치원902-4700902-4719세종특별자치시 집현서로 25http://jiphyeon.sjedukg.kr
6462사립성모유치원863-5322863-1600세종특별자치시 조치원읍 시내6길 30·
6563사립아이마루유치원866-2257867-5757세종특별자치시 연서면 월성로 141·