Overview

Dataset statistics

Number of variables: 7
Number of observations: 135
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 7.6 KiB
Average record size in memory: 58.0 B

Variable types

Numeric: 1
Categorical: 3
Text: 3

Dataset

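Description: 2023 school information for the Seoul Dongbu District Office of Education (서울특별시동부교육지원청), providing each school's establishment type (설립구분), school level (학제구분), school name (학교명), address (주소), and contact number (연락처).
URL: https://www.data.go.kr/data/15113000/fileData.do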

Alerts

연번 is highly overall correlated with 설립구분 and 2 other fields (High correlation)
설립구분 is highly overall correlated with 연번 (High correlation)
학제구분 is highly overall correlated with 연번 and 1 other fields (High correlation)
비고 is highly overall correlated with 연번 and 1 other fields (High correlation)
비고 is highly imbalanced (81.7%) (Imbalance)
연번 has unique values (Unique)
학교명 has unique values (Unique)
연락처 has unique values (Unique)

Reproduction

Analysis started: 2023-12-11 23:35:00.726510
Analysis finished: 2023-12-11 23:35:01.239645
Duration: 0.51 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
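
The reported duration can be checked directly from the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamp layout used by the "Analysis started/finished" fields.
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-12-11 23:35:00.726510", TIMESTAMP_FORMAT)
finished = datetime.strptime("2023-12-11 23:35:01.239645", TIMESTAMP_FORMAT)

# Elapsed wall-clock time in seconds, rounded the way the report shows it.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```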

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 135
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 68
Minimum: 1
Maximum: 135
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 7.7
Q1: 34.5
Median: 68
Q3: 101.5
95-th percentile: 128.3
Maximum: 135
Range: 134
Interquartile range (IQR): 67
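
These quantiles are what linear interpolation gives for a running serial number. The sketch below assumes that 연번 is exactly the integers 1 to 135 (consistent with the reported minimum, maximum, distinct count, and sum) and uses the standard library's inclusive quantile method; it is a cross-check, not the profiler's own code.

```python
import statistics

# Assumption: 연번 is the running serial number 1, 2, ..., 135.
serial_numbers = list(range(1, 136))

# "inclusive" interpolates linearly between the observed min and max.
q1, median, q3 = statistics.quantiles(serial_numbers, n=4, method="inclusive")
twentieths = statistics.quantiles(serial_numbers, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(serial_numbers) - min(serial_numbers)

print(p5, q1, median, q3, p95, value_range, iqr)
```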

Descriptive statistics

Standard deviation: 39.115214
Coefficient of variation (CV): 0.57522374
Kurtosis: -1.2
Mean: 68
Median Absolute Deviation (MAD): 34
Skewness: 0
Sum: 9180
Variance: 1530
Monotonicity: Strictly increasing
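
Most of these figures follow from 연번 being a plain row counter. As a hedged cross-check, the sketch below recomputes them with the standard library, again assuming the values are the integers 1 to 135 and that the reported variance is the sample (n - 1) variance.

```python
import statistics

# Assumption: 연번 is the running serial number 1, 2, ..., 135.
serial_numbers = list(range(1, 136))

total = sum(serial_numbers)                      # reported Sum
mean = statistics.mean(serial_numbers)           # reported Mean
variance = statistics.variance(serial_numbers)   # sample variance (n - 1)
std_dev = statistics.stdev(serial_numbers)       # sample standard deviation
cv = std_dev / mean                              # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
median = statistics.median(serial_numbers)
mad = statistics.median(abs(x - median) for x in serial_numbers)

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad)
```

For the integers 1 to n the sample variance equals n(n + 1)/12, which gives 135 × 136 / 12 = 1530.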
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 0.7%
94 | 1 | 0.7%
88 | 1 | 0.7%
89 | 1 | 0.7%
90 | 1 | 0.7%
91 | 1 | 0.7%
92 | 1 | 0.7%
93 | 1 | 0.7%
95 | 1 | 0.7%
2 | 1 | 0.7%
Other values (125) | 125 | 92.6%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.7%
2 | 1 | 0.7%
3 | 1 | 0.7%
4 | 1 | 0.7%
5 | 1 | 0.7%
6 | 1 | 0.7%
7 | 1 | 0.7%
8 | 1 | 0.7%
9 | 1 | 0.7%
10 | 1 | 0.7%

Maximum 10 values
Value | Count | Frequency (%)
135 | 1 | 0.7%
134 | 1 | 0.7%
133 | 1 | 0.7%
132 | 1 | 0.7%
131 | 1 | 0.7%
130 | 1 | 0.7%
129 | 1 | 0.7%
128 | 1 | 0.7%
127 | 1 | 0.7%
126 | 1 | 0.7%

설립구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
공립: 84
사립: 51

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공립
2nd row: 공립
3rd row: 공립
4th row: 공립
5th row: 공립

Common Values

Value | Count | Frequency (%)
공립 | 84 | 62.2%
사립 | 51 | 37.8%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
공립 | 84 | 62.2%
사립 | 51 | 37.8%

학제구분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
유치원: 61
초등학교: 45
중학교: 29

Length

Max length: 4
Median length: 3
Mean length: 3.3333333
Min length: 3
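
The length statistics for a categorical column follow directly from the category counts. The sketch below recomputes the mean length of 학제구분 from the three counts reported above; it assumes lengths are counted in Unicode code points of the NFC-normalized labels.

```python
import unicodedata

# Category counts as reported for 학제구분.
level_counts = {"유치원": 61, "초등학교": 45, "중학교": 29}

def label_length(label: str) -> int:
    """Length in code points after NFC normalization (one per Hangul syllable)."""
    return len(unicodedata.normalize("NFC", label))

total_rows = sum(level_counts.values())
total_chars = sum(label_length(label) * n for label, n in level_counts.items())
mean_length = total_chars / total_rows

print(total_rows, total_chars, round(mean_length, 7))
```

With 90 labels of length 3 and 45 of length 4, the total is 450 characters over 135 rows, which matches the reported mean and a median of 3.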

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 유치원
2nd row: 유치원
3rd row: 유치원
4th row: 유치원
5th row: 유치원

Common Values

Value | Count | Frequency (%)
유치원 | 61 | 45.2%
초등학교 | 45 | 33.3%
중학교 | 29 | 21.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
유치원 | 61 | 45.2%
초등학교 | 45 | 33.3%
중학교 | 29 | 21.5%

학교명
Text

UNIQUE 

Distinct: 135
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 14
Median length: 13
Mean length: 7.4666667
Min length: 5

Characters and Unicode

Total characters: 1008
Distinct characters: 112
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 135
Unique (%): 100.0%

Sample

1st row: 서울새솔유치원
2nd row: 서울이문유치원
3rd row: 서울휘경유치원
4th row: 서울군자초등학교병설유치원
5th row: 서울답십리초등학교병설유치원

Value | Count | Frequency (%)
서울새솔유치원 | 1 | 0.7%
경희초등학교 | 1 | 0.7%
서울홍파초등학교 | 1 | 0.7%
서울홍릉초등학교 | 1 | 0.7%
서울청량초등학교 | 1 | 0.7%
서울중흥초등학교 | 1 | 0.7%
서울중화초등학교 | 1 | 0.7%
서울중목초등학교 | 1 | 0.7%
서울중랑초등학교 | 1 | 0.7%
서울중곡초등학교 | 1 | 0.7%
Other values (125) | 125 | 92.6%

Most occurring characters

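Value | Count | Frequency (%)
(not recovered) | 96 | 9.5%
(not recovered) | 95 | 9.4%
(not recovered) | 71 | 7.0%
(not recovered) | 66 | 6.5%
(not recovered) | 65 | 6.4%
(not recovered) | 65 | 6.4%
(not recovered) | 65 | 6.4%
(not recovered) | 61 | 6.1%
(not recovered) | 61 | 6.1%
(not recovered) | 41 | 4.1%
Other values (102) | 322 | 31.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1008 | 100.0%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not recovered) | 96 | 9.5%
(not recovered) | 95 | 9.4%
(not recovered) | 71 | 7.0%
(not recovered) | 66 | 6.5%
(not recovered) | 65 | 6.4%
(not recovered) | 65 | 6.4%
(not recovered) | 65 | 6.4%
(not recovered) | 61 | 6.1%
(not recovered) | 61 | 6.1%
(not recovered) | 41 | 4.1%
Other values (102) | 322 | 31.9%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1008 | 100.0%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not recovered) | 96 | 9.5%
(not recovered) | 95 | 9.4%
(not recovered) | 71 | 7.0%
(not recovered) | 66 | 6.5%
(not recovered) | 65 | 6.4%
(not recovered) | 65 | 6.4%
(not recovered) | 65 | 6.4%
(not recovered) | 61 | 6.1%
(not recovered) | 61 | 6.1%
(not recovered) | 41 | 4.1%
Other values (102) | 322 | 31.9%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1008 | 100.0%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not recovered) | 96 | 9.5%
(not recovered) | 95 | 9.4%
(not recovered) | 71 | 7.0%
(not recovered) | 66 | 6.5%
(not recovered) | 65 | 6.4%
(not recovered) | 65 | 6.4%
(not recovered) | 65 | 6.4%
(not recovered) | 61 | 6.1%
(not recovered) | 61 | 6.1%
(not recovered) | 41 | 4.1%
Other values (102) | 322 | 31.9%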

주소
Text

Distinct: 130
Distinct (%): 96.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 37
Median length: 35
Mean length: 22.540741
Min length: 17
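
The "Median length" figures for the text columns can be sanity-checked against the reported character totals. With 135 rows, at least 68 values are at or above the median and the remaining 67 are at or above the minimum, which puts a floor on the total character count. The sketch below applies that bound to 학교명, 주소, and 연락처 using only numbers from this report; the helper name `min_total_for_median` is hypothetical.

```python
N_ROWS = 135  # number of observations in the report

def min_total_for_median(n_rows: int, min_len: int, median_len: int) -> int:
    """Smallest possible total length given a minimum and a median (odd n_rows)."""
    at_or_above_median = n_rows // 2 + 1          # the median itself and everything above it
    below_median = n_rows - at_or_above_median    # each of these is at least min_len
    return at_or_above_median * median_len + below_median * min_len

# Reported per-column figures: Min length, Median length, Total characters.
reported = {
    "학교명": {"min_len": 5, "median_len": 13, "total_chars": 1008},
    "주소": {"min_len": 17, "median_len": 35, "total_chars": 3043},
    "연락처": {"min_len": 11, "median_len": 12, "total_chars": 1555},
}

consistent = {
    name: min_total_for_median(N_ROWS, s["min_len"], s["median_len"]) <= s["total_chars"]
    for name, s in reported.items()
}
print(consistent)
```

Under this bound the medians reported for 학교명 (13) and 주소 (35) would require more characters than the columns contain, so those two figures are probably not true medians and are best treated with caution. The mean and total lengths agree with each other (for example 3043 / 135 ≈ 22.54 for 주소).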

Characters and Unicode

Total characters: 3043
Distinct characters: 115
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 127
Unique (%): 94.1%
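
"Distinct" and "Unique" measure different things: distinct counts the different values present, while unique counts the values that occur exactly once. The sketch below illustrates the difference on a small hypothetical list (`toy_addresses` is made up for illustration) and then applies the same arithmetic to the figures reported for 주소.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# Hypothetical toy data: "A" and "C" repeat, "B" and "D" occur once.
toy_addresses = ["A", "A", "B", "C", "C", "C", "D"]
distinct, unique = distinct_and_unique(toy_addresses)

# Figures reported for 주소.
rows, distinct_reported, unique_reported = 135, 130, 127
repeated_values = distinct_reported - unique_reported   # addresses shared by several rows
rows_sharing_an_address = rows - unique_reported        # rows that use one of those addresses

print(distinct, unique, repeated_values, rows_sharing_an_address)
```

So 3 addresses are shared, covering 8 of the 135 rows. The last rows of the sample show one such case, where 경희여자중학교 and 경희중학교 both list 경희대로 26.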

Sample

1st row: 서울특별시 중랑구 신내역로1길 136
2nd row: 서울특별시 동대문구 신이문로 16
3rd row: 서울특별시 동대문구 망우로6길 48
4th row: 서울특별시 동대문구 한천로6길 21 서울군자초등학교병설유치원
5th row: 서울특별시 동대문구 전농로3길 23 서울답십리초등학교병설유치원

Value | Count | Frequency (%)
서울특별시 | 135 | 23.4%
중랑구 | 71 | 12.3%
동대문구 | 64 | 11.1%
(not recovered) | 8 | 1.4%
사가정로 | 7 | 1.2%
장안벚꽃로 | 6 | 1.0%
20 | 6 | 1.0%
26 | 6 | 1.0%
봉화산로 | 6 | 1.0%
32 | 5 | 0.9%
Other values (186) | 263 | 45.6%

Most occurring characters

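Value | Count | Frequency (%)
(space) | 569 | 18.7%
(not recovered) | 154 | 5.1%
(not recovered) | 154 | 5.1%
(not recovered) | 136 | 4.5%
(not recovered) | 135 | 4.4%
(not recovered) | 135 | 4.4%
(not recovered) | 135 | 4.4%
(not recovered) | 133 | 4.4%
1 | 90 | 3.0%
(not recovered) | 79 | 2.6%
Other values (105) | 1323 | 43.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2004 | 65.9%
Space Separator | 569 | 18.7%
Decimal Number | 458 | 15.1%
Other Punctuation | 9 | 0.3%
Dash Punctuation | 3 | 0.1%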

Most frequent character per category

Other Letter
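Value | Count | Frequency (%)
(not recovered) | 154 | 7.7%
(not recovered) | 154 | 7.7%
(not recovered) | 136 | 6.8%
(not recovered) | 135 | 6.7%
(not recovered) | 135 | 6.7%
(not recovered) | 135 | 6.7%
(not recovered) | 133 | 6.6%
(not recovered) | 79 | 3.9%
(not recovered) | 79 | 3.9%
(not recovered) | 76 | 3.8%
Other values (92) | 788 | 39.3%

Decimal Number
Value | Count | Frequency (%)
1 | 90 | 19.7%
2 | 64 | 14.0%
5 | 55 | 12.0%
6 | 52 | 11.4%
3 | 48 | 10.5%
0 | 36 | 7.9%
7 | 35 | 7.6%
4 | 32 | 7.0%
8 | 24 | 5.2%
9 | 22 | 4.8%

Space Separator
Value | Count | Frequency (%)
(space) | 569 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 9 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3 | 100.0%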

Most occurring scripts

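Value | Count | Frequency (%)
Hangul | 2004 | 65.9%
Common | 1039 | 34.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not recovered) | 154 | 7.7%
(not recovered) | 154 | 7.7%
(not recovered) | 136 | 6.8%
(not recovered) | 135 | 6.7%
(not recovered) | 135 | 6.7%
(not recovered) | 135 | 6.7%
(not recovered) | 133 | 6.6%
(not recovered) | 79 | 3.9%
(not recovered) | 79 | 3.9%
(not recovered) | 76 | 3.8%
Other values (92) | 788 | 39.3%

Common
Value | Count | Frequency (%)
(space) | 569 | 54.8%
1 | 90 | 8.7%
2 | 64 | 6.2%
5 | 55 | 5.3%
6 | 52 | 5.0%
3 | 48 | 4.6%
0 | 36 | 3.5%
7 | 35 | 3.4%
4 | 32 | 3.1%
8 | 24 | 2.3%
Other values (3) | 34 | 3.3%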

Most occurring blocks

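Value | Count | Frequency (%)
Hangul | 2004 | 65.9%
ASCII | 1039 | 34.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 569 | 54.8%
1 | 90 | 8.7%
2 | 64 | 6.2%
5 | 55 | 5.3%
6 | 52 | 5.0%
3 | 48 | 4.6%
0 | 36 | 3.5%
7 | 35 | 3.4%
4 | 32 | 3.1%
8 | 24 | 2.3%
Other values (3) | 34 | 3.3%

Hangul
Value | Count | Frequency (%)
(not recovered) | 154 | 7.7%
(not recovered) | 154 | 7.7%
(not recovered) | 136 | 6.8%
(not recovered) | 135 | 6.7%
(not recovered) | 135 | 6.7%
(not recovered) | 135 | 6.7%
(not recovered) | 133 | 6.6%
(not recovered) | 79 | 3.9%
(not recovered) | 79 | 3.9%
(not recovered) | 76 | 3.8%
Other values (92) | 788 | 39.3%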

연락처
Text

UNIQUE 

Distinct: 135
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.518519
Min length: 11

Characters and Unicode

Total characters: 1555
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 135
Unique (%): 100.0%

Sample

1st row: 02-3422-1331
2nd row: 02-959-2023
3rd row: 02-2242-0356
4th row: 02-2212-8456
5th row: 02-2248-3152

Value | Count | Frequency (%)
02-3422-1331 | 1 | 0.7%
02-962-4300 | 1 | 0.7%
02-967-8161 | 1 | 0.7%
02-968-4701 | 1 | 0.7%
02-962-1341 | 1 | 0.7%
02-495-2912 | 1 | 0.7%
02-433-8993 | 1 | 0.7%
02-2209-0012 | 1 | 0.7%
02-437-4147 | 1 | 0.7%
02-2209-2543 | 1 | 0.7%
Other values (125) | 125 | 92.6%

Most occurring characters

Value | Count | Frequency (%)
2 | 319 | 20.5%
- | 270 | 17.4%
0 | 234 | 15.0%
4 | 150 | 9.6%
3 | 123 | 7.9%
1 | 115 | 7.4%
9 | 99 | 6.4%
6 | 69 | 4.4%
5 | 61 | 3.9%
8 | 59 | 3.8%
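
This table lists only ten of the eleven distinct characters in 연락처. A quick tally, sketched below with the counts from the table and the reported total of 1555 characters, shows how many occurrences the unlisted character accounts for and confirms that every number contains exactly two hyphens.

```python
# Character counts listed in the "Most occurring characters" table for 연락처.
listed_counts = {
    "2": 319, "-": 270, "0": 234, "4": 150, "3": 123,
    "1": 115, "9": 99, "6": 69, "5": 61, "8": 59,
}

TOTAL_CHARACTERS = 1555  # reported "Total characters"
N_ROWS = 135             # reported number of observations

# Occurrences not covered by the ten listed characters.
unlisted_count = TOTAL_CHARACTERS - sum(listed_counts.values())

# Average number of hyphens per phone number.
hyphens_per_number = listed_counts["-"] / N_ROWS

print(unlisted_count, hyphens_per_number)
```

The remainder of 56 matches the count that the per-category Decimal Number table reports for the digit 7.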

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 1285 | 82.6%
Dash Punctuation | 270 | 17.4%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
2 | 319 | 24.8%
0 | 234 | 18.2%
4 | 150 | 11.7%
3 | 123 | 9.6%
1 | 115 | 8.9%
9 | 99 | 7.7%
6 | 69 | 5.4%
5 | 61 | 4.7%
8 | 59 | 4.6%
7 | 56 | 4.4%

Dash Punctuation
Value | Count | Frequency (%)
- | 270 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 1555 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
2 | 319 | 20.5%
- | 270 | 17.4%
0 | 234 | 15.0%
4 | 150 | 9.6%
3 | 123 | 7.9%
1 | 115 | 7.4%
9 | 99 | 6.4%
6 | 69 | 4.4%
5 | 61 | 3.9%
8 | 59 | 3.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 1555 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
2 | 319 | 20.5%
- | 270 | 17.4%
0 | 234 | 15.0%
4 | 150 | 9.6%
3 | 123 | 7.9%
1 | 115 | 7.4%
9 | 99 | 6.4%
6 | 69 | 4.4%
5 | 61 | 3.9%
8 | 59 | 3.8%

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): 3.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
<NA>: 128
행정실: 4
폐원: 2
휴원: 1

Length

Max length: 4
Median length: 4
Mean length: 3.9259259
Min length: 2
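
The length statistics for 비고 only add up if the placeholder `<NA>` is counted as an ordinary four-character string. That would also explain why the column reports 0 missing cells even though 128 of its 135 rows carry no remark. The sketch below recomputes the mean length under that assumption, counting lengths in code points of the NFC-normalized labels.

```python
import unicodedata

# Value counts as reported for 비고; "<NA>" is treated as a literal string.
remark_counts = {"<NA>": 128, "행정실": 4, "폐원": 2, "휴원": 1}

lengths = {
    value: len(unicodedata.normalize("NFC", value)) for value in remark_counts
}

total_rows = sum(remark_counts.values())
total_chars = sum(lengths[value] * n for value, n in remark_counts.items())
mean_length = total_chars / total_rows

print(min(lengths.values()), max(lengths.values()), round(mean_length, 7))
```

When reusing this dataset it is therefore probably worth treating `<NA>` in 비고 as a missing value rather than as a category.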

Unique

Unique: 1
Unique (%): 0.7%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 128 | 94.8%
행정실 | 4 | 3.0%
폐원 | 2 | 1.5%
휴원 | 1 | 0.7%
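
The 81.7% imbalance figure in the alert for 비고 can be reproduced from these four counts as one minus the normalized Shannon entropy of the category distribution. The sketch below assumes that definition; it matches the reported number, but it is a reconstruction rather than the profiler's own code.

```python
import math

# Category counts for 비고: <NA>, 행정실, 폐원, 휴원.
counts = [128, 4, 2, 1]
total = sum(counts)

# Shannon entropy of the observed distribution, in bits.
entropy_bits = -sum((c / total) * math.log2(c / total) for c in counts)

# Maximum possible entropy for this many categories (uniform distribution).
max_entropy_bits = math.log2(len(counts))

# 0 means perfectly balanced, 1 means a single category holds every row.
imbalance = 1 - entropy_bits / max_entropy_bits

print(f"{imbalance:.1%}")
```

A score this high simply reflects that one value (`<NA>`) covers 94.8% of the rows.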

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
<NA> | 128 | 94.8%
행정실 | 4 | 3.0%
폐원 | 2 | 1.5%
휴원 | 1 | 0.7%

Interactions


Correlations

Variable | 연번 | 설립구분 | 학제구분 | 비고
연번 | 1.000 | 0.925 | 0.948 | 1.000
설립구분 | 0.925 | 1.000 | 0.295 | 0.000
학제구분 | 0.948 | 0.295 | 1.000 | 1.000
비고 | 1.000 | 0.000 | 1.000 | 1.000

Variable | 설립구분 | 비고 | 학제구분
설립구분 | 1.000 | 0.000 | 0.474
비고 | 0.000 | 1.000 | 0.894
학제구분 | 0.474 | 0.894 | 1.000

Variable | 연번 | 설립구분 | 학제구분 | 비고
연번 | 1.000 | 0.740 | 0.911 | 0.707
설립구분 | 0.740 | 1.000 | 0.474 | 0.000
학제구분 | 0.911 | 0.474 | 1.000 | 0.894
비고 | 0.707 | 0.000 | 0.894 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 연번 | 설립구분 | 학제구분 | 학교명 | 주소 | 연락처 | 비고
0 | 1 | 공립 | 유치원 | 서울새솔유치원 | 서울특별시 중랑구 신내역로1길 136 | 02-3422-1331 | <NA>
1 | 2 | 공립 | 유치원 | 서울이문유치원 | 서울특별시 동대문구 신이문로 16 | 02-959-2023 | <NA>
2 | 3 | 공립 | 유치원 | 서울휘경유치원 | 서울특별시 동대문구 망우로6길 48 | 02-2242-0356 | <NA>
3 | 4 | 공립 | 유치원 | 서울군자초등학교병설유치원 | 서울특별시 동대문구 한천로6길 21 서울군자초등학교병설유치원 | 02-2212-8456 | <NA>
4 | 5 | 공립 | 유치원 | 서울답십리초등학교병설유치원 | 서울특별시 동대문구 전농로3길 23 서울답십리초등학교병설유치원 | 02-2248-3152 | <NA>
5 | 6 | 공립 | 유치원 | 서울동원초등학교병설유치원 | 서울특별시 중랑구 송림길 114 서울동원초등학교병설유치원 | 02-438-3288 | <NA>
6 | 7 | 공립 | 유치원 | 서울면동초등학교병설유치원 | 서울특별시 중랑구 면목로57길 32 , 서울면동초등학교병설유치원 | 02-496-7922 | <NA>
7 | 8 | 공립 | 유치원 | 서울면목초등학교병설유치원 | 서울특별시 중랑구 면목로 434 서울면목초등학교병설유치원 | 02-2209-4341 | <NA>
8 | 9 | 공립 | 유치원 | 서울면북초등학교병설유치원 | 서울특별시 중랑구 용마공원로5길 32 서울면북초등학교병설유치원 | 02-433-3296 | <NA>
9 | 10 | 공립 | 유치원 | 서울묵현초등학교병설유치원 | 서울특별시 중랑구 동일로157길 75 서울묵현초등학교병설유치원 | 02-977-9105 | <NA>

Last rows
Index | 연번 | 설립구분 | 학제구분 | 학교명 | 주소 | 연락처 | 비고
125 | 126 | 공립 | 중학교 | 휘경중학교 | 서울특별시 동대문구 망우로18나길 20 | 02-2244-1359 | <NA>
126 | 127 | 사립 | 중학교 | 경희여자중학교 | 서울특별시 동대문구 경희대로 26 | 02-2250-8888 | <NA>
127 | 128 | 사립 | 중학교 | 경희중학교 | 서울특별시 동대문구 경희대로 26 | 02-966-6402 | 행정실
128 | 129 | 사립 | 중학교 | 대광중학교 | 서울특별시 동대문구 안암로 6 | 02-940-2246 | <NA>
129 | 130 | 사립 | 중학교 | 동국대학교사범대학부속중학교 | 서울특별시 동대문구 장안벚꽃로 201 | 02-6716-1700 | <NA>
130 | 131 | 사립 | 중학교 | 송곡여자중학교 | 서울특별시 중랑구 양원역로 67 | 070-7124-3502 | <NA>
131 | 132 | 사립 | 중학교 | 영란여자중학교 | 서울특별시 중랑구 망우로73길 56 | 02-2209-0143 | <NA>
132 | 133 | 사립 | 중학교 | 정화여자중학교 | 서울특별시 동대문구 홍릉로15길 50 | 02-967-0178 | 행정실
133 | 134 | 사립 | 중학교 | 혜원여자중학교 | 서울특별시 중랑구 봉우재로 58길 39 | 02-6491-5400 | <NA>
134 | 135 | 사립 | 중학교 | 휘경여자중학교 | 서울특별시 동대문구 한천로 247 | 02-2244-8927 | 행정실