Overview

Dataset statistics

Number of variables5
Number of observations1021
Missing cells0
Missing cells (%)0.0%
Duplicate rows6
Duplicate rows (%)0.6%
Total size in memory40.0 KiB
Average record size in memory40.1 B

Variable types

Categorical1
Text4

Dataset

Description강원특별자치도교육청 소속 학교현황 자료 입니다. 관할 지역별 학교이름, 주소, 전화번호, 홈페이지 정보를 확인하실 수 있습니다.
URLhttps://www.data.go.kr/data/15106694/fileData.do

Alerts

Dataset has 6 (0.6%) duplicate rowsDuplicates

Reproduction

Analysis started2023-12-12 01:15:57.165318
Analysis finished2023-12-12 01:15:57.742531
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

관할
Categorical

Distinct20
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
원주
164 
춘천
122 
강릉
100 
홍천
69 
속초양양
62 
Other values (15)
504 

Length

Max length4
Median length2
Mean length2.1234084
Min length2

Unique

Unique3 ?
Unique (%)0.3%

Sample

1st row삼척
2nd row삼척
3rd row춘천
4th row강릉
5th row춘천

Common Values

ValueCountFrequency (%)
원주 164
16.1%
춘천 122
11.9%
강릉 100
 
9.8%
홍천 69
 
6.8%
속초양양 62
 
6.1%
횡성 60
 
5.9%
삼척 52
 
5.1%
정선 50
 
4.9%
평창 46
 
4.5%
동해 45
 
4.4%
Other values (10) 251
24.6%

Length

2023-12-12T10:15:57.822059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
원주 164
16.1%
춘천 123
12.0%
강릉 100
 
9.8%
홍천 69
 
6.8%
속초양양 62
 
6.1%
횡성 60
 
5.9%
삼척 52
 
5.1%
정선 50
 
4.9%
평창 46
 
4.5%
동해 45
 
4.4%
Other values (8) 250
24.5%
Distinct991
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-12T10:15:58.097187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length13
Mean length6.6278159
Min length5

Characters and Unicode

Total characters6767
Distinct characters250
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique966 ?
Unique (%)94.6%

Sample

1st row가곡고등학교
2nd row가곡중학교
3rd row가산초등학교
4th row가온유치원
5th row가정중학교
ValueCountFrequency (%)
중앙초등학교 4
 
0.4%
교동초병설유치원 3
 
0.3%
남산초등학교 3
 
0.3%
교동초등학교 3
 
0.3%
남산초병설유치원 2
 
0.2%
원당초등학교 2
 
0.2%
청일초병설유치원 2
 
0.2%
신동초등학교 2
 
0.2%
신동초병설유치원 2
 
0.2%
청일초등학교 2
 
0.2%
Other values (981) 996
97.6%
2023-12-12T10:15:58.571872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
692
 
10.2%
674
 
10.0%
635
 
9.4%
488
 
7.2%
432
 
6.4%
369
 
5.5%
367
 
5.4%
260
 
3.8%
252
 
3.7%
173
 
2.6%
Other values (240) 2425
35.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 6759
99.9%
Close Punctuation 4
 
0.1%
Open Punctuation 4
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
692
 
10.2%
674
 
10.0%
635
 
9.4%
488
 
7.2%
432
 
6.4%
369
 
5.5%
367
 
5.4%
260
 
3.8%
252
 
3.7%
173
 
2.6%
Other values (238) 2417
35.8%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 6759
99.9%
Common 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
692
 
10.2%
674
 
10.0%
635
 
9.4%
488
 
7.2%
432
 
6.4%
369
 
5.5%
367
 
5.4%
260
 
3.8%
252
 
3.7%
173
 
2.6%
Other values (238) 2417
35.8%
Common
ValueCountFrequency (%)
) 4
50.0%
( 4
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 6759
99.9%
ASCII 8
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
692
 
10.2%
674
 
10.0%
635
 
9.4%
488
 
7.2%
432
 
6.4%
369
 
5.5%
367
 
5.4%
260
 
3.8%
252
 
3.7%
173
 
2.6%
Other values (238) 2417
35.8%
ASCII
ValueCountFrequency (%)
) 4
50.0%
( 4
50.0%

주소
Text

Distinct829
Distinct (%)81.2%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-12T10:15:59.002638image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length22
Mean length15.360431
Min length9

Characters and Unicode

Total characters15683
Distinct characters296
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique644 ?
Unique (%)63.1%

Sample

1st row삼척시 가곡면 가곡천로 1427
2nd row삼척시 가곡면 가곡천로 1427
3rd row춘천시 동면 가산로 52
4th row강릉시 하슬라로20번길 15(홍제동)
5th row춘천시 남면 여우내길 20
ValueCountFrequency (%)
원주시 163
 
4.3%
춘천시 123
 
3.3%
강릉시 100
 
2.7%
홍천군 69
 
1.8%
횡성군 60
 
1.6%
삼척시 52
 
1.4%
정선군 50
 
1.3%
평창군 46
 
1.2%
동해시 45
 
1.2%
영월군 43
 
1.1%
Other values (1188) 3013
80.0%
2023-12-12T10:15:59.673260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2876
 
18.3%
1 716
 
4.6%
681
 
4.3%
559
 
3.6%
2 489
 
3.1%
477
 
3.0%
475
 
3.0%
457
 
2.9%
3 347
 
2.2%
332
 
2.1%
Other values (286) 8274
52.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 9151
58.3%
Decimal Number 3222
 
20.5%
Space Separator 2876
 
18.3%
Dash Punctuation 158
 
1.0%
Open Punctuation 138
 
0.9%
Close Punctuation 138
 
0.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
681
 
7.4%
559
 
6.1%
477
 
5.2%
475
 
5.2%
457
 
5.0%
332
 
3.6%
305
 
3.3%
276
 
3.0%
206
 
2.3%
194
 
2.1%
Other values (272) 5189
56.7%
Decimal Number
ValueCountFrequency (%)
1 716
22.2%
2 489
15.2%
3 347
10.8%
4 280
 
8.7%
5 277
 
8.6%
7 250
 
7.8%
0 227
 
7.0%
6 222
 
6.9%
9 215
 
6.7%
8 199
 
6.2%
Space Separator
ValueCountFrequency (%)
2876
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 158
100.0%
Open Punctuation
ValueCountFrequency (%)
( 138
100.0%
Close Punctuation
ValueCountFrequency (%)
) 138
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 9151
58.3%
Common 6532
41.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
681
 
7.4%
559
 
6.1%
477
 
5.2%
475
 
5.2%
457
 
5.0%
332
 
3.6%
305
 
3.3%
276
 
3.0%
206
 
2.3%
194
 
2.1%
Other values (272) 5189
56.7%
Common
ValueCountFrequency (%)
2876
44.0%
1 716
 
11.0%
2 489
 
7.5%
3 347
 
5.3%
4 280
 
4.3%
5 277
 
4.2%
7 250
 
3.8%
0 227
 
3.5%
6 222
 
3.4%
9 215
 
3.3%
Other values (4) 633
 
9.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 9151
58.3%
ASCII 6532
41.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2876
44.0%
1 716
 
11.0%
2 489
 
7.5%
3 347
 
5.3%
4 280
 
4.3%
5 277
 
4.2%
7 250
 
3.8%
0 227
 
3.5%
6 222
 
3.4%
9 215
 
3.3%
Other values (4) 633
 
9.7%
Hangul
ValueCountFrequency (%)
681
 
7.4%
559
 
6.1%
477
 
5.2%
475
 
5.2%
457
 
5.0%
332
 
3.6%
305
 
3.3%
276
 
3.0%
206
 
2.3%
194
 
2.1%
Other values (272) 5189
56.7%
Distinct918
Distinct (%)89.9%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-12T10:15:59.961931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.004897
Min length12

Characters and Unicode

Total characters12257
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique817 ?
Unique (%)80.0%

Sample

1st row033-570-6940
2nd row033-570-6909
3rd row033-241-1002
4th row033-645-1231
5th row033-260-0300
ValueCountFrequency (%)
033-340-5400 4
 
0.4%
033-339-7700 2
 
0.2%
033-582-1436 2
 
0.2%
033-461-6171 2
 
0.2%
033-241-8505 2
 
0.2%
033-435-4037 2
 
0.2%
033-562-4014 2
 
0.2%
033-463-2047 2
 
0.2%
033-461-3016 2
 
0.2%
033-243-1012 2
 
0.2%
Other values (908) 999
97.8%
2023-12-12T10:16:00.447172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 2909
23.7%
0 2092
17.1%
- 2042
16.7%
4 810
 
6.6%
2 786
 
6.4%
6 781
 
6.4%
5 716
 
5.8%
7 662
 
5.4%
1 646
 
5.3%
8 487
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 10215
83.3%
Dash Punctuation 2042
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 2909
28.5%
0 2092
20.5%
4 810
 
7.9%
2 786
 
7.7%
6 781
 
7.6%
5 716
 
7.0%
7 662
 
6.5%
1 646
 
6.3%
8 487
 
4.8%
9 326
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 2042
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12257
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 2909
23.7%
0 2092
17.1%
- 2042
16.7%
4 810
 
6.6%
2 786
 
6.4%
6 781
 
6.4%
5 716
 
5.8%
7 662
 
5.4%
1 646
 
5.3%
8 487
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12257
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 2909
23.7%
0 2092
17.1%
- 2042
16.7%
4 810
 
6.6%
2 786
 
6.4%
6 781
 
6.4%
5 716
 
5.8%
7 662
 
5.4%
1 646
 
5.3%
8 487
 
4.0%
Distinct195
Distinct (%)19.1%
Missing0
Missing (%)0.0%
Memory size8.1 KiB
2023-12-12T10:16:00.746935image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length70
Median length7
Mean length9.5308521
Min length7

Characters and Unicode

Total characters9731
Distinct characters46
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)13.2%

Sample

1st rowgagog.gwe.hs.kr
2nd rowgagog.gwe.ms.kr
3rd rowc-kasan.gwe.es.kr
4th rowhttp://
5th rowgajeong.gwe.ms.kr
ValueCountFrequency (%)
http 762
74.6%
hoengseong.gwe.es.kr 4
 
0.4%
c-namsan.gwe.es.kr 4
 
0.4%
kwangsan.gwe.es.kr 3
 
0.3%
girin.gwe.es.kr 3
 
0.3%
gonghyeonjin.gwe.es.kr 2
 
0.2%
kumkwang.gwe.es.kr 2
 
0.2%
geomun.gwe.es.kr 2
 
0.2%
kh.gwe.es.kr 2
 
0.2%
bangok.gwe.es.kr 2
 
0.2%
Other values (182) 235
 
23.0%
2023-12-12T10:16:01.261321image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 1547
15.9%
t 1546
15.9%
h 908
9.3%
p 788
8.1%
. 767
7.9%
: 766
7.9%
e 501
 
5.1%
g 470
 
4.8%
k 344
 
3.5%
w 330
 
3.4%
Other values (36) 1764
18.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 6587
67.7%
Other Punctuation 3091
31.8%
Decimal Number 22
 
0.2%
Dash Punctuation 19
 
0.2%
Uppercase Letter 11
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 1546
23.5%
h 908
13.8%
p 788
12.0%
e 501
 
7.6%
g 470
 
7.1%
k 344
 
5.2%
w 330
 
5.0%
s 308
 
4.7%
r 286
 
4.3%
n 272
 
4.1%
Other values (14) 834
12.7%
Decimal Number
ValueCountFrequency (%)
1 5
22.7%
3 3
13.6%
8 3
13.6%
0 3
13.6%
2 2
 
9.1%
7 2
 
9.1%
9 1
 
4.5%
6 1
 
4.5%
4 1
 
4.5%
5 1
 
4.5%
Other Punctuation
ValueCountFrequency (%)
/ 1547
50.0%
. 767
24.8%
: 766
24.8%
% 9
 
0.3%
? 2
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
C 4
36.4%
E 3
27.3%
A 2
18.2%
D 1
 
9.1%
B 1
 
9.1%
Dash Punctuation
ValueCountFrequency (%)
- 19
100.0%
Math Symbol
ValueCountFrequency (%)
= 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 6598
67.8%
Common 3133
32.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 1546
23.4%
h 908
13.8%
p 788
11.9%
e 501
 
7.6%
g 470
 
7.1%
k 344
 
5.2%
w 330
 
5.0%
s 308
 
4.7%
r 286
 
4.3%
n 272
 
4.1%
Other values (19) 845
12.8%
Common
ValueCountFrequency (%)
/ 1547
49.4%
. 767
24.5%
: 766
24.4%
- 19
 
0.6%
% 9
 
0.3%
1 5
 
0.2%
3 3
 
0.1%
8 3
 
0.1%
0 3
 
0.1%
? 2
 
0.1%
Other values (7) 9
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9731
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 1547
15.9%
t 1546
15.9%
h 908
9.3%
p 788
8.1%
. 767
7.9%
: 766
7.9%
e 501
 
5.1%
g 470
 
4.8%
k 344
 
3.5%
w 330
 
3.4%
Other values (36) 1764
18.1%

Missing values

2023-12-12T10:15:57.585083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T10:15:57.703808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관할학교이름주소전화번호홈페이지
0삼척가곡고등학교삼척시 가곡면 가곡천로 1427033-570-6940gagog.gwe.hs.kr
1삼척가곡중학교삼척시 가곡면 가곡천로 1427033-570-6909gagog.gwe.ms.kr
2춘천가산초등학교춘천시 동면 가산로 52033-241-1002c-kasan.gwe.es.kr
3강릉가온유치원강릉시 하슬라로20번길 15(홍제동)033-645-1231http://
4춘천가정중학교춘천시 남면 여우내길 20033-260-0300gajeong.gwe.ms.kr
5화천간동고등학교화천군 간동면 파로호로 945-21033-440-1911gandong.gwe.hs.kr
6화천간동중학교화천군 간동면 파로호로 945-21033-440-1920gandong.gwe.ms.kr
7고성간성초등학교고성군 간성읍 간성로 39번길 20033-681-2013kansung.gwe.es.kr
8고성간성초병설유치원고성군 간성읍 간성로39번길 20033-681-2003kansung.gwe.es.kr
9정선갈래초등학교정선군 고한읍 고한로 300033-591-2530kallae.gwe.es.kr
관할학교이름주소전화번호홈페이지
1011횡성청일초병설유치원횡성군 청일면 청정로 795033-342-5627chongil.gwe.es.kr/
1012횡성춘당초등학교횡성군 청일면 청정로 1461033-342-5108chundang.gwe.es.kr
1013횡성춘당초병설유치원횡성군 청일면 청정로 1461033-342-5106chundang.gwe.es.kr
1014횡성현천고등학교횡성군 둔내면 경강로 4119033-900-3300hchs.gwe.hs.kr
1015횡성화성유치원횡성군 횡성읍 북천서로 20033-343-1280http://
1016횡성횡성고등학교횡성군 횡성읍 교항로 33033-340-8807hshigh.gwe.hs.kr
1017횡성횡성여자고등학교횡성군 횡성읍 횡성로 252033-343-2011hs.gwe.hs.kr
1018횡성횡성중학교횡성군 횡성읍 교항로 33033-343-2241hoengseong.gwe.ms.kr
1019횡성횡성초등학교횡성군 횡성읍 교항북로 35033-340-5400hoengseong.gwe.es.kr
1020횡성횡성초병설유치원횡성군 횡성읍 교항북로 35033-340-5400hoengseong.gwe.es.kr

Duplicate rows

Most frequently occurring

관할학교이름주소전화번호홈페이지# duplicates
0횡성화성유치원횡성군 횡성읍 북천서로 20033-343-1280http://2
1횡성횡성고등학교횡성군 횡성읍 교항로 33033-340-8807hshigh.gwe.hs.kr2
2횡성횡성여자고등학교횡성군 횡성읍 횡성로 252033-343-2011hs.gwe.hs.kr2
3횡성횡성중학교횡성군 횡성읍 교항로 33033-343-2241hoengseong.gwe.ms.kr2
4횡성횡성초등학교횡성군 횡성읍 교항북로 35033-340-5400hoengseong.gwe.es.kr2
5횡성횡성초병설유치원횡성군 횡성읍 교항북로 35033-340-5400hoengseong.gwe.es.kr2