Overview

Dataset statistics

Number of variables10
Number of observations135
Missing cells538
Missing cells (%)39.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.2 KiB
Average record size in memory85.0 B

Variable types

Numeric1
Categorical2
Text4
Unsupported3

Dataset

Description2022년 서울특별시동부교육지원청 관내(동대문구, 중랑구) 학교 정보(설립구분, 학제구분, 학교명, 주소, 전화번호) 파일입니다
Author서울특별시교육청 서울특별시동부교육지원청
URLhttps://www.data.go.kr/data/15053610/fileData.do

Alerts

비고 has constant value ""Constant
연번 is highly overall correlated with 설립구분 and 1 other fieldsHigh correlation
설립구분 is highly overall correlated with 연번High correlation
학제구분 is highly overall correlated with 연번High correlation
비고 has 133 (98.5%) missing valuesMissing
Unnamed: 7 has 135 (100.0%) missing valuesMissing
Unnamed: 8 has 135 (100.0%) missing valuesMissing
Unnamed: 9 has 135 (100.0%) missing valuesMissing
연번 has unique valuesUnique
학교명 has unique valuesUnique
연락처 has unique valuesUnique
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-12 05:21:47.111002
Analysis finished2023-12-12 05:21:48.167695
Duration1.06 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct135
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68
Minimum1
Maximum135
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.3 KiB
2023-12-12T14:21:48.253297image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7.7
Q134.5
median68
Q3101.5
95-th percentile128.3
Maximum135
Range134
Interquartile range (IQR)67

Descriptive statistics

Standard deviation39.115214
Coefficient of variation (CV)0.57522374
Kurtosis-1.2
Mean68
Median Absolute Deviation (MAD)34
Skewness0
Sum9180
Variance1530
MonotonicityStrictly increasing
2023-12-12T14:21:48.424651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.7%
94 1
 
0.7%
88 1
 
0.7%
89 1
 
0.7%
90 1
 
0.7%
91 1
 
0.7%
92 1
 
0.7%
93 1
 
0.7%
95 1
 
0.7%
2 1
 
0.7%
Other values (125) 125
92.6%
ValueCountFrequency (%)
1 1
0.7%
2 1
0.7%
3 1
0.7%
4 1
0.7%
5 1
0.7%
6 1
0.7%
7 1
0.7%
8 1
0.7%
9 1
0.7%
10 1
0.7%
ValueCountFrequency (%)
135 1
0.7%
134 1
0.7%
133 1
0.7%
132 1
0.7%
131 1
0.7%
130 1
0.7%
129 1
0.7%
128 1
0.7%
127 1
0.7%
126 1
0.7%

설립구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
공립
84 
사립
51 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공립
2nd row공립
3rd row공립
4th row공립
5th row공립

Common Values

ValueCountFrequency (%)
공립 84
62.2%
사립 51
37.8%

Length

2023-12-12T14:21:48.586657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:21:48.704022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공립 84
62.2%
사립 51
37.8%

학제구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
유치원
61 
초등학교
45 
중학교
29 

Length

Max length4
Median length3
Mean length3.3333333
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유치원
2nd row유치원
3rd row유치원
4th row유치원
5th row유치원

Common Values

ValueCountFrequency (%)
유치원 61
45.2%
초등학교 45
33.3%
중학교 29
21.5%

Length

2023-12-12T14:21:48.816450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T14:21:48.921678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유치원 61
45.2%
초등학교 45
33.3%
중학교 29
21.5%

학교명
Text

UNIQUE 

Distinct135
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T14:21:49.184301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length13
Mean length7.4592593
Min length5

Characters and Unicode

Total characters1007
Distinct characters111
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)100.0%

Sample

1st row서울새솔유치원
2nd row서울이문유치원
3rd row서울휘경유치원
4th row서울군자초등학교병설유치원
5th row서울답십리초등학교병설유치원
ValueCountFrequency (%)
서울새솔유치원 1
 
0.7%
경희초등학교 1
 
0.7%
서울홍파초등학교 1
 
0.7%
서울홍릉초등학교 1
 
0.7%
서울청량초등학교 1
 
0.7%
서울중흥초등학교 1
 
0.7%
서울중화초등학교 1
 
0.7%
서울중목초등학교 1
 
0.7%
서울중랑초등학교 1
 
0.7%
서울중곡초등학교 1
 
0.7%
Other values (125) 125
92.6%
2023-12-12T14:21:49.550343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
96
 
9.5%
95
 
9.4%
71
 
7.1%
65
 
6.5%
65
 
6.5%
65
 
6.5%
65
 
6.5%
61
 
6.1%
61
 
6.1%
41
 
4.1%
Other values (101) 322
32.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1007
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
 
9.5%
95
 
9.4%
71
 
7.1%
65
 
6.5%
65
 
6.5%
65
 
6.5%
65
 
6.5%
61
 
6.1%
61
 
6.1%
41
 
4.1%
Other values (101) 322
32.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1007
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
 
9.5%
95
 
9.4%
71
 
7.1%
65
 
6.5%
65
 
6.5%
65
 
6.5%
65
 
6.5%
61
 
6.1%
61
 
6.1%
41
 
4.1%
Other values (101) 322
32.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1007
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
96
 
9.5%
95
 
9.4%
71
 
7.1%
65
 
6.5%
65
 
6.5%
65
 
6.5%
65
 
6.5%
61
 
6.1%
61
 
6.1%
41
 
4.1%
Other values (101) 322
32.0%

주소
Text

Distinct130
Distinct (%)96.3%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T14:21:49.935269image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length35
Mean length22.540741
Min length17

Characters and Unicode

Total characters3043
Distinct characters115
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique127 ?
Unique (%)94.1%

Sample

1st row서울특별시 중랑구 신내역로1길 136
2nd row서울특별시 동대문구 신이문로 16
3rd row서울특별시 동대문구 망우로6길 48
4th row서울특별시 동대문구 한천로6길 21 서울군자초등학교병설유치원
5th row서울특별시 동대문구 전농로3길 23 서울답십리초등학교병설유치원
ValueCountFrequency (%)
서울특별시 135
23.4%
중랑구 71
 
12.3%
동대문구 64
 
11.1%
8
 
1.4%
사가정로 7
 
1.2%
장안벚꽃로 6
 
1.0%
20 6
 
1.0%
26 6
 
1.0%
봉화산로 6
 
1.0%
32 5
 
0.9%
Other values (186) 263
45.6%
2023-12-12T14:21:50.509770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
569
18.7%
154
 
5.1%
154
 
5.1%
136
 
4.5%
135
 
4.4%
135
 
4.4%
135
 
4.4%
133
 
4.4%
1 90
 
3.0%
79
 
2.6%
Other values (105) 1323
43.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2004
65.9%
Space Separator 569
 
18.7%
Decimal Number 458
 
15.1%
Other Punctuation 9
 
0.3%
Dash Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
154
 
7.7%
154
 
7.7%
136
 
6.8%
135
 
6.7%
135
 
6.7%
135
 
6.7%
133
 
6.6%
79
 
3.9%
79
 
3.9%
76
 
3.8%
Other values (92) 788
39.3%
Decimal Number
ValueCountFrequency (%)
1 90
19.7%
2 64
14.0%
5 55
12.0%
6 52
11.4%
3 48
10.5%
0 36
 
7.9%
7 35
 
7.6%
4 32
 
7.0%
8 24
 
5.2%
9 22
 
4.8%
Space Separator
ValueCountFrequency (%)
569
100.0%
Other Punctuation
ValueCountFrequency (%)
, 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2004
65.9%
Common 1039
34.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
154
 
7.7%
154
 
7.7%
136
 
6.8%
135
 
6.7%
135
 
6.7%
135
 
6.7%
133
 
6.6%
79
 
3.9%
79
 
3.9%
76
 
3.8%
Other values (92) 788
39.3%
Common
ValueCountFrequency (%)
569
54.8%
1 90
 
8.7%
2 64
 
6.2%
5 55
 
5.3%
6 52
 
5.0%
3 48
 
4.6%
0 36
 
3.5%
7 35
 
3.4%
4 32
 
3.1%
8 24
 
2.3%
Other values (3) 34
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2004
65.9%
ASCII 1039
34.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
569
54.8%
1 90
 
8.7%
2 64
 
6.2%
5 55
 
5.3%
6 52
 
5.0%
3 48
 
4.6%
0 36
 
3.5%
7 35
 
3.4%
4 32
 
3.1%
8 24
 
2.3%
Other values (3) 34
 
3.3%
Hangul
ValueCountFrequency (%)
154
 
7.7%
154
 
7.7%
136
 
6.8%
135
 
6.7%
135
 
6.7%
135
 
6.7%
133
 
6.6%
79
 
3.9%
79
 
3.9%
76
 
3.8%
Other values (92) 788
39.3%

연락처
Text

UNIQUE 

Distinct135
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size1.2 KiB
2023-12-12T14:21:50.839767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length11.503704
Min length11

Characters and Unicode

Total characters1553
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)100.0%

Sample

1st row02-3422-1331
2nd row02-959-2023
3rd row02-2242-0356
4th row02-2212-8456
5th row02-2248-3152
ValueCountFrequency (%)
02-3422-1331 1
 
0.7%
02-962-4300 1
 
0.7%
02-967-8161 1
 
0.7%
02-968-4701 1
 
0.7%
02-962-1341 1
 
0.7%
02-495-2912 1
 
0.7%
02-433-8993 1
 
0.7%
02-2209-0012 1
 
0.7%
02-437-4147 1
 
0.7%
02-2209-2543 1
 
0.7%
Other values (125) 125
92.6%
2023-12-12T14:21:51.290355image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 315
20.3%
- 270
17.4%
0 231
14.9%
4 148
9.5%
3 125
 
8.0%
1 114
 
7.3%
9 102
 
6.6%
6 71
 
4.6%
5 60
 
3.9%
8 59
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1283
82.6%
Dash Punctuation 270
 
17.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 315
24.6%
0 231
18.0%
4 148
11.5%
3 125
 
9.7%
1 114
 
8.9%
9 102
 
8.0%
6 71
 
5.5%
5 60
 
4.7%
8 59
 
4.6%
7 58
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 270
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1553
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 315
20.3%
- 270
17.4%
0 231
14.9%
4 148
9.5%
3 125
 
8.0%
1 114
 
7.3%
9 102
 
6.6%
6 71
 
4.6%
5 60
 
3.9%
8 59
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1553
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 315
20.3%
- 270
17.4%
0 231
14.9%
4 148
9.5%
3 125
 
8.0%
1 114
 
7.3%
9 102
 
6.6%
6 71
 
4.6%
5 60
 
3.9%
8 59
 
3.8%

비고
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)50.0%
Missing133
Missing (%)98.5%
Memory size1.2 KiB
2023-12-12T14:21:51.426834image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters4
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row휴원
2nd row휴원
ValueCountFrequency (%)
휴원 2
100.0%
2023-12-12T14:21:51.679141image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2
50.0%
2
50.0%

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing135
Missing (%)100.0%
Memory size1.3 KiB

Unnamed: 8
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing135
Missing (%)100.0%
Memory size1.3 KiB

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing135
Missing (%)100.0%
Memory size1.3 KiB

Interactions

2023-12-12T14:21:47.507005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T14:21:51.787659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설립구분학제구분
연번1.0000.9250.948
설립구분0.9251.0000.295
학제구분0.9480.2951.000
2023-12-12T14:21:51.873741image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설립구분학제구분
설립구분1.0000.474
학제구분0.4741.000
2023-12-12T14:21:51.948823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번설립구분학제구분
연번1.0000.7400.911
설립구분0.7401.0000.474
학제구분0.9110.4741.000

Missing values

2023-12-12T14:21:47.954329image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T14:21:48.105583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번설립구분학제구분학교명주소연락처비고Unnamed: 7Unnamed: 8Unnamed: 9
01공립유치원서울새솔유치원서울특별시 중랑구 신내역로1길 13602-3422-1331<NA><NA><NA><NA>
12공립유치원서울이문유치원서울특별시 동대문구 신이문로 1602-959-2023<NA><NA><NA><NA>
23공립유치원서울휘경유치원서울특별시 동대문구 망우로6길 4802-2242-0356<NA><NA><NA><NA>
34공립유치원서울군자초등학교병설유치원서울특별시 동대문구 한천로6길 21 서울군자초등학교병설유치원02-2212-8456<NA><NA><NA><NA>
45공립유치원서울답십리초등학교병설유치원서울특별시 동대문구 전농로3길 23 서울답십리초등학교병설유치원02-2248-3152<NA><NA><NA><NA>
56공립유치원서울동원초등학교병설유치원서울특별시 중랑구 송림길 114 서울동원초등학교병설유치원02-438-3288<NA><NA><NA><NA>
67공립유치원서울면동초등학교병설유치원서울특별시 중랑구 면목로57길 32 , 서울면동초등학교병설유치원02-496-7922<NA><NA><NA><NA>
78공립유치원서울면목초등학교병설유치원서울특별시 중랑구 면목로 434 서울면목초등학교병설유치원02-2209-4341<NA><NA><NA><NA>
89공립유치원서울면북초등학교병설유치원서울특별시 중랑구 용마공원로5길 32 서울면북초등학교병설유치원02-433-3296<NA><NA><NA><NA>
910공립유치원서울묵현초등학교병설유치원서울특별시 중랑구 동일로157길 75 서울묵현초등학교병설유치원02-977-9105<NA><NA><NA><NA>
연번설립구분학제구분학교명주소연락처비고Unnamed: 7Unnamed: 8Unnamed: 9
125126공립중학교휘경중학교서울특별시 동대문구 망우로18나길 2002-2244-1359<NA><NA><NA><NA>
126127사립중학교경희여자중학교서울특별시 동대문구 경희대로 2602-2250-8888<NA><NA><NA><NA>
127128사립중학교경희중학교서울특별시 동대문구 경희대로 2602-966-6402<NA><NA><NA><NA>
128129사립중학교대광중학교서울특별시 동대문구 안암로 602-940-2246<NA><NA><NA><NA>
129130사립중학교동국대학교사범대학부속중학교서울특별시 동대문구 장안벚꽃로 20102-6716-1700<NA><NA><NA><NA>
130131사립중학교송곡여자중학교서울특별시 중랑구 양원역로 67070-7124-3569<NA><NA><NA><NA>
131132사립중학교영란여자중학교서울특별시 중랑구 망우로73길 5602-2209-0143<NA><NA><NA><NA>
132133사립중학교정화여자중학교서울특별시 동대문구 홍릉로15길 5002-967-0178<NA><NA><NA><NA>
133134사립중학교혜원여자중학교서울특별시 중랑구 봉우재로 58길 3902-6491-7918<NA><NA><NA><NA>
134135사립중학교휘경여자중학교서울특별시 동대문구 한천로 24702-2244-8927<NA><NA><NA><NA>