Overview

Dataset statistics

Number of variables6
Number of observations53
Missing cells9
Missing cells (%)2.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory50.5 B

Variable types

Text5
Categorical1

Dataset

Description서울특별시 용산구 학교정보 현황(학교명, 구분(유치원, 초등학교, 중학교, 고등학교, 대학교), 우편번호, 주소, 전화번호, 홈페이지주소)에 대한 데이터를 제공합니다.
Author서울특별시 용산구
URLhttps://www.data.go.kr/data/15070940/fileData.do

Alerts

홈페이지주소 has 9 (17.0%) missing valuesMissing

Reproduction

Analysis started2023-12-12 14:02:17.608841
Analysis finished2023-12-12 14:02:18.628236
Duration1.02 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct52
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size556.0 B
2023-12-12T23:02:18.845829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length6.1132075
Min length4

Characters and Unicode

Total characters324
Distinct characters73
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)96.2%

Sample

1st row일민유치원
2nd row한가람유치원
3rd row충신유치원
4th row정우유치원
5th row유성유치원
ValueCountFrequency (%)
중경고등학교 2
 
3.8%
한가람유치원 1
 
1.9%
배문중학교 1
 
1.9%
보광초등학교 1
 
1.9%
수도중학교 1
 
1.9%
한강중학교 1
 
1.9%
용산중학교 1
 
1.9%
용강중학교 1
 
1.9%
오산중학교 1
 
1.9%
신광여자중학교 1
 
1.9%
Other values (42) 42
79.2%
2023-12-12T23:02:19.305884image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
39
 
12.0%
38
 
11.7%
27
 
8.3%
18
 
5.6%
15
 
4.6%
15
 
4.6%
14
 
4.3%
13
 
4.0%
12
 
3.7%
9
 
2.8%
Other values (63) 124
38.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 324
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
12.0%
38
 
11.7%
27
 
8.3%
18
 
5.6%
15
 
4.6%
15
 
4.6%
14
 
4.3%
13
 
4.0%
12
 
3.7%
9
 
2.8%
Other values (63) 124
38.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 324
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
12.0%
38
 
11.7%
27
 
8.3%
18
 
5.6%
15
 
4.6%
15
 
4.6%
14
 
4.3%
13
 
4.0%
12
 
3.7%
9
 
2.8%
Other values (63) 124
38.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 324
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
39
 
12.0%
38
 
11.7%
27
 
8.3%
18
 
5.6%
15
 
4.6%
15
 
4.6%
14
 
4.3%
13
 
4.0%
12
 
3.7%
9
 
2.8%
Other values (63) 124
38.3%

구분
Categorical

Distinct5
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Memory size556.0 B
초등학교
15 
유치원
14 
고등학교
12 
중학교
10 
대학교

Length

Max length4
Median length4
Mean length3.509434
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유치원
2nd row유치원
3rd row유치원
4th row유치원
5th row유치원

Common Values

ValueCountFrequency (%)
초등학교 15
28.3%
유치원 14
26.4%
고등학교 12
22.6%
중학교 10
18.9%
대학교 2
 
3.8%

Length

2023-12-12T23:02:19.451821image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:02:19.576648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
초등학교 15
28.3%
유치원 14
26.4%
고등학교 12
22.6%
중학교 10
18.9%
대학교 2
 
3.8%
Distinct34
Distinct (%)64.2%
Missing0
Missing (%)0.0%
Memory size556.0 B
2023-12-12T23:02:19.820587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters371
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)39.6%

Sample

1st row140-764
2nd row140-030
3rd row140-854
4th row140-847
5th row140-856
ValueCountFrequency (%)
140-833 4
 
7.5%
140-853 3
 
5.7%
140-823 3
 
5.7%
140-830 3
 
5.7%
140-847 3
 
5.7%
140-040 2
 
3.8%
140-240 2
 
3.8%
140-883 2
 
3.8%
140-897 2
 
3.8%
140-870 2
 
3.8%
Other values (24) 27
50.9%
2023-12-12T23:02:20.194325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 73
19.7%
4 63
17.0%
1 60
16.2%
- 53
14.3%
8 40
10.8%
3 24
 
6.5%
7 17
 
4.6%
9 15
 
4.0%
5 10
 
2.7%
2 8
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 318
85.7%
Dash Punctuation 53
 
14.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 73
23.0%
4 63
19.8%
1 60
18.9%
8 40
12.6%
3 24
 
7.5%
7 17
 
5.3%
9 15
 
4.7%
5 10
 
3.1%
2 8
 
2.5%
6 8
 
2.5%
Dash Punctuation
ValueCountFrequency (%)
- 53
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 371
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 73
19.7%
4 63
17.0%
1 60
16.2%
- 53
14.3%
8 40
10.8%
3 24
 
6.5%
7 17
 
4.6%
9 15
 
4.0%
5 10
 
2.7%
2 8
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 371
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 73
19.7%
4 63
17.0%
1 60
16.2%
- 53
14.3%
8 40
10.8%
3 24
 
6.5%
7 17
 
4.6%
9 15
 
4.0%
5 10
 
2.7%
2 8
 
2.2%

주소
Text

Distinct48
Distinct (%)90.6%
Missing0
Missing (%)0.0%
Memory size556.0 B
2023-12-12T23:02:20.535851image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length20.641509
Min length17

Characters and Unicode

Total characters1094
Distinct characters63
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)81.1%

Sample

1st row서울특별시 용산구 효창원로 17
2nd row서울특별시 용산구 이촌동 301-54
3rd row서울특별시 용산구 이촌로64길 72
4th row서울특별시 용산구 원효로2가 72-2
5th row서울특별시 용산구 회나무로13가길 16
ValueCountFrequency (%)
서울특별시 53
24.0%
용산구 52
23.5%
효창원로 5
 
2.3%
17 4
 
1.8%
이촌로 3
 
1.4%
100 3
 
1.4%
이촌동 2
 
0.9%
원효로 2
 
0.9%
두텁바위로 2
 
0.9%
3가 2
 
0.9%
Other values (78) 93
42.1%
2023-12-12T23:02:21.035302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
251
22.9%
57
 
5.2%
53
 
4.8%
53
 
4.8%
53
 
4.8%
53
 
4.8%
53
 
4.8%
52
 
4.8%
52
 
4.8%
46
 
4.2%
Other values (53) 371
33.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 649
59.3%
Space Separator 251
 
22.9%
Decimal Number 185
 
16.9%
Dash Punctuation 9
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
57
 
8.8%
53
 
8.2%
53
 
8.2%
53
 
8.2%
53
 
8.2%
53
 
8.2%
52
 
8.0%
52
 
8.0%
46
 
7.1%
26
 
4.0%
Other values (41) 151
23.3%
Decimal Number
ValueCountFrequency (%)
1 35
18.9%
3 28
15.1%
4 25
13.5%
2 23
12.4%
7 16
8.6%
0 14
 
7.6%
6 12
 
6.5%
5 12
 
6.5%
9 11
 
5.9%
8 9
 
4.9%
Space Separator
ValueCountFrequency (%)
251
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 649
59.3%
Common 445
40.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
57
 
8.8%
53
 
8.2%
53
 
8.2%
53
 
8.2%
53
 
8.2%
53
 
8.2%
52
 
8.0%
52
 
8.0%
46
 
7.1%
26
 
4.0%
Other values (41) 151
23.3%
Common
ValueCountFrequency (%)
251
56.4%
1 35
 
7.9%
3 28
 
6.3%
4 25
 
5.6%
2 23
 
5.2%
7 16
 
3.6%
0 14
 
3.1%
6 12
 
2.7%
5 12
 
2.7%
9 11
 
2.5%
Other values (2) 18
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 649
59.3%
ASCII 445
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
251
56.4%
1 35
 
7.9%
3 28
 
6.3%
4 25
 
5.6%
2 23
 
5.2%
7 16
 
3.6%
0 14
 
3.1%
6 12
 
2.7%
5 12
 
2.7%
9 11
 
2.5%
Other values (2) 18
 
4.0%
Hangul
ValueCountFrequency (%)
57
 
8.8%
53
 
8.2%
53
 
8.2%
53
 
8.2%
53
 
8.2%
53
 
8.2%
52
 
8.0%
52
 
8.0%
46
 
7.1%
26
 
4.0%
Other values (41) 151
23.3%
Distinct49
Distinct (%)92.5%
Missing0
Missing (%)0.0%
Memory size556.0 B
2023-12-12T23:02:21.307508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length11
Mean length11.301887
Min length11

Characters and Unicode

Total characters599
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)86.8%

Sample

1st row02-704-0915
2nd row02-796-8865
3rd row02-793-7618
4th row02-712-2211
5th row02-792-8691
ValueCountFrequency (%)
02-710-6903 3
 
5.7%
070-4000-3191 2
 
3.8%
02-702-5501 2
 
3.8%
02-3480-7281 1
 
1.9%
02-795-7454 1
 
1.9%
02-3480-7282 1
 
1.9%
02-704-0915 1
 
1.9%
02-712-8015 1
 
1.9%
02-793-4032 1
 
1.9%
02-754-3616 1
 
1.9%
Other values (39) 39
73.6%
2023-12-12T23:02:21.726783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 109
18.2%
- 106
17.7%
2 77
12.9%
7 62
10.4%
9 49
8.2%
1 48
8.0%
6 36
 
6.0%
4 31
 
5.2%
5 30
 
5.0%
3 29
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 493
82.3%
Dash Punctuation 106
 
17.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 109
22.1%
2 77
15.6%
7 62
12.6%
9 49
9.9%
1 48
9.7%
6 36
 
7.3%
4 31
 
6.3%
5 30
 
6.1%
3 29
 
5.9%
8 22
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 106
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 599
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 109
18.2%
- 106
17.7%
2 77
12.9%
7 62
10.4%
9 49
8.2%
1 48
8.0%
6 36
 
6.0%
4 31
 
5.2%
5 30
 
5.0%
3 29
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 599
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 109
18.2%
- 106
17.7%
2 77
12.9%
7 62
10.4%
9 49
8.2%
1 48
8.0%
6 36
 
6.0%
4 31
 
5.2%
5 30
 
5.0%
3 29
 
4.8%

홈페이지주소
Text

MISSING 

Distinct43
Distinct (%)97.7%
Missing9
Missing (%)17.0%
Memory size556.0 B
2023-12-12T23:02:21.960616image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length23
Mean length17.5
Min length8

Characters and Unicode

Total characters770
Distinct characters25
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique42 ?
Unique (%)95.5%

Sample

1st rowwww.ilminkids.com
2nd rowkids.choongshin.or.kr/
3rd rowwww.yusungkids.org/
4th rowwww.sung-kids.com/
5th rowwww.bokja.co.kr/
ValueCountFrequency (%)
www.jungkyung.hs.kr 2
 
4.5%
www.yongsan.hs.kr 1
 
2.3%
www.ilminkids.com 1
 
2.3%
www.yonggang.ms.kr 1
 
2.3%
os.ms.kr 1
 
2.3%
www.shinkwang.ms.kr 1
 
2.3%
sacredheart.ms.kr 1
 
2.3%
www.sunrin.ms.kr 1
 
2.3%
www.bosung-g.ms.kr 1
 
2.3%
www.baemoon.ms.kr 1
 
2.3%
Other values (33) 33
75.0%
2023-12-12T23:02:22.362173image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 125
16.2%
w 118
15.3%
s 71
9.2%
k 56
 
7.3%
n 51
 
6.6%
r 49
 
6.4%
o 39
 
5.1%
g 35
 
4.5%
a 34
 
4.4%
e 33
 
4.3%
Other values (15) 159
20.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 631
81.9%
Other Punctuation 132
 
17.1%
Dash Punctuation 7
 
0.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 118
18.7%
s 71
11.3%
k 56
8.9%
n 51
8.1%
r 49
7.8%
o 39
 
6.2%
g 35
 
5.5%
a 34
 
5.4%
e 33
 
5.2%
h 28
 
4.4%
Other values (12) 117
18.5%
Other Punctuation
ValueCountFrequency (%)
. 125
94.7%
/ 7
 
5.3%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 631
81.9%
Common 139
 
18.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 118
18.7%
s 71
11.3%
k 56
8.9%
n 51
8.1%
r 49
7.8%
o 39
 
6.2%
g 35
 
5.5%
a 34
 
5.4%
e 33
 
5.2%
h 28
 
4.4%
Other values (12) 117
18.5%
Common
ValueCountFrequency (%)
. 125
89.9%
- 7
 
5.0%
/ 7
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 770
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 125
16.2%
w 118
15.3%
s 71
9.2%
k 56
 
7.3%
n 51
 
6.6%
r 49
 
6.4%
o 39
 
5.1%
g 35
 
4.5%
a 34
 
4.4%
e 33
 
4.3%
Other values (15) 159
20.6%

Correlations

2023-12-12T23:02:22.500206image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학교명구분우편번호주소전화번호홈페이지주소
학교명1.0001.0001.0001.0000.9881.000
구분1.0001.0000.4410.7270.5231.000
우편번호1.0000.4411.0001.0000.9941.000
주소1.0000.7271.0001.0000.9921.000
전화번호0.9880.5230.9940.9921.0000.983
홈페이지주소1.0001.0001.0001.0000.9831.000

Missing values

2023-12-12T23:02:18.434418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:02:18.556456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

학교명구분우편번호주소전화번호홈페이지주소
0일민유치원유치원140-764서울특별시 용산구 효창원로 1702-704-0915www.ilminkids.com
1한가람유치원유치원140-030서울특별시 용산구 이촌동 301-5402-796-8865<NA>
2충신유치원유치원140-854서울특별시 용산구 이촌로64길 7202-793-7618kids.choongshin.or.kr/
3정우유치원유치원140-847서울특별시 용산구 원효로2가 72-202-712-2211<NA>
4유성유치원유치원140-856서울특별시 용산구 회나무로13가길 1602-792-8691www.yusungkids.org/
5원유치원유치원140-909서울특별시 용산구 이촌동 40202-797-0037<NA>
6신동아큰나무유치원유치원140-751서울특별시 용산구 서빙고동 241-9402-790-8858<NA>
7성심유치원유치원140-823서울특별시 용산구 보광로24길 9 -402-794-3045www.sung-kids.com/
8복자유치원유치원140-899서울특별시 용산구 후암로34길 21 -802-319-2915www.bokja.co.kr/
9보윤유치원유치원140-830서울특별시 용산구 서계동 10002-718-0999<NA>
학교명구분우편번호주소전화번호홈페이지주소
43오산고등학교고등학교140-823서울특별시 용산구 보광로7길 1702-799-9601www.osan.hs.kr
44신광여자고등학교고등학교140-870서울특별시 용산구 청파로 26302-710-6903www.shinkwang.hs.kr
45성심여자고등학교고등학교140-847서울특별시 용산구 원효로 19길 4902-702-5501sacredheart.hs.kr
46선린인터넷고등학교고등학교140-869서울특별시 용산구 원효로97길 33 -402-713-6211www.sunrint.hs.kr
47서울자동차고등학교고등학교140-897서울특별시 용산구 백범로45길 1502-718-4970www.seoulauto.hs.kr
48서울디지텍고등학교고등학교140-857서울특별시 용산구 회나무로12길 2702-795-7454www.sdh.hs.kr
49보성여자고등학교고등학교140-833서울특별시 용산구 신흥로3가길 66070-4000-3191www.bosung.hs.kr
50배문고등학교고등학교140-830서울특별시 용산구 효창원로 25802-3480-7281www.baemoon.hs.kr
51폴리텍대학대학교121-757서울특별시 마포구 백범로31길 2102-2125-6500www.kopo.ac.kr/index.asp
52숙명여자대학교대학교140-742서울특별시 용산구 청파로47길 10002-710-9114www.sookmyung.ac.kr/index.jsp