Overview

Dataset statistics

Number of variables28
Number of observations3908
Missing cells1070
Missing cells (%)1.0%
Duplicate rows1
Duplicate rows (%)< 0.1%
Total size in memory874.1 KiB
Average record size in memory229.0 B

Variable types

Categorical14
Numeric5
Text8
Boolean1

Dataset

Description학교종류명,설립구분,표준학교코드,학교명,영문학교명,관할조직명,도로명우편번호,도로명주소,도로명상세주소,전화번호,홈페이지주소,팩스번호,남녀공학구분명,고등학교구분명,산업체특별학급존재여부,고등학교일반실업구분명,특수목적고등학교계열명,입시전후기구분명,주야구분명,설립일자,개교기념일,시도교육청코드,시도교육청명,소재지명,주야과정,계열명,학과명,적재일시
Author서울특별시교육청
URLhttps://data.seoul.go.kr/dataList/OA-20502/S/1/datasetView.do

Alerts

시도교육청코드 has constant value ""Constant
시도교육청명 has constant value ""Constant
소재지명 has constant value ""Constant
Dataset has 1 (< 0.1%) duplicate rowsDuplicates
학교종류명 is highly imbalanced (57.6%)Imbalance
관할조직명 is highly imbalanced (52.5%)Imbalance
산업체특별학급존재여부 is highly imbalanced (91.6%)Imbalance
특수목적고등학교계열명 is highly imbalanced (86.7%)Imbalance
주야구분명 is highly imbalanced (89.6%)Imbalance
주야과정 is highly imbalanced (53.5%)Imbalance
학과명 has 1064 (27.2%) missing valuesMissing

Reproduction

Analysis started2024-05-04 05:55:13.932120
Analysis finished2024-05-04 05:55:16.566790
Duration2.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

학교종류명
Categorical

IMBALANCE 

Distinct16
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
고등학교
2512 
초등학교
610 
중학교
390 
각종학교(고)
256 
특수학교
 
35
Other values (11)
 
105

Length

Max length13
Median length4
Mean length4.2290174
Min length3

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row각종학교(중)
2nd row초등학교
3rd row중학교
4th row중학교
5th row중학교

Common Values

ValueCountFrequency (%)
고등학교 2512
64.3%
초등학교 610
 
15.6%
중학교 390
 
10.0%
각종학교(고) 256
 
6.6%
특수학교 35
 
0.9%
방송통신고등학교 32
 
0.8%
외국인학교 17
 
0.4%
평생학교(고)-3년6학기 15
 
0.4%
평생학교(고)-2년6학기 14
 
0.4%
고등기술학교 10
 
0.3%
Other values (6) 17
 
0.4%

Length

2024-05-04T05:55:16.871613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
고등학교 2512
64.3%
초등학교 610
 
15.6%
중학교 390
 
10.0%
각종학교(고 256
 
6.6%
특수학교 35
 
0.9%
방송통신고등학교 32
 
0.8%
외국인학교 17
 
0.4%
평생학교(고)-3년6학기 15
 
0.4%
평생학교(고)-2년6학기 14
 
0.4%
고등기술학교 10
 
0.3%
Other values (6) 17
 
0.4%

설립구분
Categorical

Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
공립
1976 
사립
1907 
국립
 
25

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row사립
2nd row공립
3rd row공립
4th row사립
5th row공립

Common Values

ValueCountFrequency (%)
공립 1976
50.6%
사립 1907
48.8%
국립 25
 
0.6%

Length

2024-05-04T05:55:17.239841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:17.801845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
공립 1976
50.6%
사립 1907
48.8%
국립 25
 
0.6%

표준학교코드
Real number (ℝ)

Distinct1414
Distinct (%)36.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7002005.1
Minimum0
Maximum7134155
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size34.5 KiB
2024-05-04T05:55:18.237628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile7010092
Q17010268
median7010852.5
Q37021111.2
95-th percentile7130163.7
Maximum7134155
Range7134155
Interquartile range (IQR)10843.25

Descriptive statistics

Standard deviation390837.75
Coefficient of variation (CV)0.055817976
Kurtosis211.79072
Mean7002005.1
Median Absolute Deviation (MAD)644.5
Skewness-14.52017
Sum2.7363836 × 1010
Variance1.5275415 × 1011
MonotonicityDecreasing
2024-05-04T05:55:18.821715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7010565 65
 
1.7%
7010567 63
 
1.6%
7010566 61
 
1.6%
7010278 41
 
1.0%
7010808 37
 
0.9%
7010737 34
 
0.9%
7010271 31
 
0.8%
7011539 30
 
0.8%
7010833 29
 
0.7%
7010738 28
 
0.7%
Other values (1404) 3489
89.3%
ValueCountFrequency (%)
0 1
 
< 0.1%
1342098 1
 
< 0.1%
1342099 1
 
< 0.1%
1342102 1
 
< 0.1%
1371661 2
 
0.1%
1371662 1
 
< 0.1%
1371663 10
0.3%
1371664 1
 
< 0.1%
7010057 4
 
0.1%
7010058 4
 
0.1%
ValueCountFrequency (%)
7134155 1
< 0.1%
7134150 1
< 0.1%
7134142 1
< 0.1%
7134141 1
< 0.1%
7134140 1
< 0.1%
7134139 1
< 0.1%
7134138 1
< 0.1%
7134137 1
< 0.1%
7134136 1
< 0.1%
7134135 1
< 0.1%
Distinct1414
Distinct (%)36.2%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
2024-05-04T05:55:19.511610image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length20
Mean length8.1532753
Min length4

Characters and Unicode

Total characters31863
Distinct characters298
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1086 ?
Unique (%)27.8%

Sample

1st row선화예술중학교
2nd row서울숭신초등학교
3rd row행당중학교
4th row한양대학교사범대학부속중학교
5th row자양중학교
ValueCountFrequency (%)
서울산업정보학교 65
 
1.6%
종로산업정보학교 63
 
1.6%
아현산업정보학교 61
 
1.5%
서울공업고등학교 41
 
1.0%
덕수고등학교 37
 
0.9%
학력인정 36
 
0.9%
성동공업고등학교 34
 
0.9%
경기기계공업고등학교 31
 
0.8%
서울동구고등학교 30
 
0.8%
대동세무고등학교 29
 
0.7%
Other values (1405) 3519
89.2%
2024-05-04T05:55:20.859938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4326
 
13.6%
4066
 
12.8%
3294
 
10.3%
2659
 
8.3%
1343
 
4.2%
1233
 
3.9%
632
 
2.0%
513
 
1.6%
469
 
1.5%
462
 
1.4%
Other values (288) 12866
40.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 31759
99.7%
Space Separator 38
 
0.1%
Open Punctuation 21
 
0.1%
Close Punctuation 21
 
0.1%
Decimal Number 20
 
0.1%
Other Punctuation 4
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4326
 
13.6%
4066
 
12.8%
3294
 
10.4%
2659
 
8.4%
1343
 
4.2%
1233
 
3.9%
632
 
2.0%
513
 
1.6%
469
 
1.5%
462
 
1.5%
Other values (281) 12762
40.2%
Decimal Number
ValueCountFrequency (%)
2 19
95.0%
4 1
 
5.0%
Other Punctuation
ValueCountFrequency (%)
. 3
75.0%
· 1
 
25.0%
Space Separator
ValueCountFrequency (%)
38
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 31759
99.7%
Common 104
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4326
 
13.6%
4066
 
12.8%
3294
 
10.4%
2659
 
8.4%
1343
 
4.2%
1233
 
3.9%
632
 
2.0%
513
 
1.6%
469
 
1.5%
462
 
1.5%
Other values (281) 12762
40.2%
Common
ValueCountFrequency (%)
38
36.5%
( 21
20.2%
) 21
20.2%
2 19
18.3%
. 3
 
2.9%
· 1
 
1.0%
4 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 31759
99.7%
ASCII 103
 
0.3%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4326
 
13.6%
4066
 
12.8%
3294
 
10.4%
2659
 
8.4%
1343
 
4.2%
1233
 
3.9%
632
 
2.0%
513
 
1.6%
469
 
1.5%
462
 
1.5%
Other values (281) 12762
40.2%
ASCII
ValueCountFrequency (%)
38
36.9%
( 21
20.4%
) 21
20.4%
2 19
18.4%
. 3
 
2.9%
4 1
 
1.0%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct1408
Distinct (%)36.0%
Missing2
Missing (%)0.1%
Memory size30.7 KiB
2024-05-04T05:55:21.648215image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length76
Median length68
Mean length27.670507
Min length1

Characters and Unicode

Total characters108081
Distinct characters86
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1080 ?
Unique (%)27.6%

Sample

1st rowSunhwa Arts Middle School
2nd rowSeoul Soongshin Elementary School
3rd rowHaengdang Middle School
4th rowHanyang University Middle School
5th rowJayang Middle School
ValueCountFrequency (%)
school 3863
25.1%
high 2598
16.9%
seoul 1189
 
7.7%
elementary 609
 
4.0%
middle 395
 
2.6%
girls’ 287
 
1.9%
technical 236
 
1.5%
arts 132
 
0.9%
polytechnic 128
 
0.8%
science 124
 
0.8%
Other values (1048) 5835
37.9%
2024-05-04T05:55:23.020831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 12346
 
11.4%
11505
 
10.6%
l 7320
 
6.8%
h 7289
 
6.7%
S 6342
 
5.9%
e 6186
 
5.7%
n 5872
 
5.4%
i 5806
 
5.4%
g 5328
 
4.9%
c 5266
 
4.9%
Other values (76) 34821
32.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 73656
68.1%
Uppercase Letter 22270
 
20.6%
Space Separator 11506
 
10.6%
Final Punctuation 314
 
0.3%
Dash Punctuation 144
 
0.1%
Other Punctuation 80
 
0.1%
Other Letter 65
 
0.1%
Modifier Symbol 46
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
9
13.8%
8
 
12.3%
6
 
9.2%
6
 
9.2%
4
 
6.2%
3
 
4.6%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
Other values (20) 21
32.3%
Lowercase Letter
ValueCountFrequency (%)
o 12346
16.8%
l 7320
9.9%
h 7289
9.9%
e 6186
8.4%
n 5872
8.0%
i 5806
7.9%
g 5328
7.2%
c 5266
7.1%
a 3637
 
4.9%
u 2731
 
3.7%
Other values (14) 11875
16.1%
Uppercase Letter
ValueCountFrequency (%)
S 6342
28.5%
H 3633
16.3%
E 1224
 
5.5%
G 1203
 
5.4%
O 1170
 
5.3%
I 956
 
4.3%
M 804
 
3.6%
C 801
 
3.6%
N 792
 
3.6%
D 733
 
3.3%
Other values (13) 4612
20.7%
Other Punctuation
ValueCountFrequency (%)
& 48
60.0%
/ 25
31.2%
' 4
 
5.0%
. 3
 
3.8%
Space Separator
ValueCountFrequency (%)
11505
> 99.9%
  1
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
314
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 144
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 46
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 95926
88.8%
Common 12090
 
11.2%
Hangul 65
 
0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 12346
12.9%
l 7320
 
7.6%
h 7289
 
7.6%
S 6342
 
6.6%
e 6186
 
6.4%
n 5872
 
6.1%
i 5806
 
6.1%
g 5328
 
5.6%
c 5266
 
5.5%
a 3637
 
3.8%
Other values (37) 30534
31.8%
Hangul
ValueCountFrequency (%)
9
13.8%
8
 
12.3%
6
 
9.2%
6
 
9.2%
4
 
6.2%
3
 
4.6%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
Other values (20) 21
32.3%
Common
ValueCountFrequency (%)
11505
95.2%
314
 
2.6%
- 144
 
1.2%
& 48
 
0.4%
` 46
 
0.4%
/ 25
 
0.2%
' 4
 
< 0.1%
. 3
 
< 0.1%
  1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 107701
99.6%
Punctuation 314
 
0.3%
Hangul 65
 
0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 12346
 
11.5%
11505
 
10.7%
l 7320
 
6.8%
h 7289
 
6.8%
S 6342
 
5.9%
e 6186
 
5.7%
n 5872
 
5.5%
i 5806
 
5.4%
g 5328
 
4.9%
c 5266
 
4.9%
Other values (44) 34441
32.0%
Punctuation
ValueCountFrequency (%)
314
100.0%
Hangul
ValueCountFrequency (%)
9
13.8%
8
 
12.3%
6
 
9.2%
6
 
9.2%
4
 
6.2%
3
 
4.6%
2
 
3.1%
2
 
3.1%
2
 
3.1%
2
 
3.1%
Other values (20) 21
32.3%
None
ValueCountFrequency (%)
  1
100.0%

관할조직명
Categorical

IMBALANCE 

Distinct13
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
서울특별시교육청
2878 
서울특별시강동송파교육지원청
 
118
서울특별시서부교육지원청
 
118
서울특별시강서양천교육지원청
 
107
서울특별시남부교육지원청
 
107
Other values (8)
580 

Length

Max length14
Median length8
Mean length9.2709826
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시성동광진교육지원청
2nd row서울특별시성동광진교육지원청
3rd row서울특별시성동광진교육지원청
4th row서울특별시성동광진교육지원청
5th row서울특별시성동광진교육지원청

Common Values

ValueCountFrequency (%)
서울특별시교육청 2878
73.6%
서울특별시강동송파교육지원청 118
 
3.0%
서울특별시서부교육지원청 118
 
3.0%
서울특별시강서양천교육지원청 107
 
2.7%
서울특별시남부교육지원청 107
 
2.7%
서울특별시북부교육지원청 105
 
2.7%
서울특별시강남서초교육지원청 97
 
2.5%
서울특별시동작관악교육지원청 75
 
1.9%
서울특별시동부교육지원청 74
 
1.9%
서울특별시성북강북교육지원청 73
 
1.9%
Other values (3) 156
 
4.0%

Length

2024-05-04T05:55:23.572138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시교육청 2878
73.6%
서울특별시강동송파교육지원청 118
 
3.0%
서울특별시서부교육지원청 118
 
3.0%
서울특별시강서양천교육지원청 107
 
2.7%
서울특별시남부교육지원청 107
 
2.7%
서울특별시북부교육지원청 105
 
2.7%
서울특별시강남서초교육지원청 97
 
2.5%
서울특별시동작관악교육지원청 75
 
1.9%
서울특별시동부교육지원청 74
 
1.9%
서울특별시성북강북교육지원청 73
 
1.9%
Other values (3) 156
 
4.0%

도로명우편번호
Real number (ℝ)

Distinct954
Distinct (%)24.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5046.4867
Minimum1006
Maximum15188
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size34.5 KiB
2024-05-04T05:55:23.974993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1006
5-th percentile1672
Q13057
median4764
Q37310
95-th percentile8783.95
Maximum15188
Range14182
Interquartile range (IQR)4253

Descriptive statistics

Standard deviation2352.2496
Coefficient of variation (CV)0.46611628
Kurtosis-1.1083431
Mean5046.4867
Median Absolute Deviation (MAD)1986
Skewness0.11589399
Sum19721670
Variance5533078.2
MonotonicityNot monotonic
2024-05-04T05:55:24.608302image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3114 69
 
1.8%
8815 69
 
1.8%
4117 64
 
1.6%
3047 44
 
1.1%
4939 44
 
1.1%
4566 44
 
1.1%
8248 44
 
1.1%
6955 42
 
1.1%
4764 38
 
1.0%
3322 37
 
0.9%
Other values (944) 3413
87.3%
ValueCountFrequency (%)
1006 1
 
< 0.1%
1015 1
 
< 0.1%
1020 1
 
< 0.1%
1051 1
 
< 0.1%
1061 2
 
0.1%
1085 1
 
< 0.1%
1095 19
0.5%
1103 1
 
< 0.1%
1109 1
 
< 0.1%
1116 1
 
< 0.1%
ValueCountFrequency (%)
15188 2
 
0.1%
8863 1
 
< 0.1%
8859 2
 
0.1%
8858 1
 
< 0.1%
8857 1
 
< 0.1%
8854 5
 
0.1%
8850 1
 
< 0.1%
8847 21
0.5%
8846 1
 
< 0.1%
8842 1
 
< 0.1%
Distinct1235
Distinct (%)31.6%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
2024-05-04T05:55:25.225703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length22
Mean length18.486438
Min length12

Characters and Unicode

Total characters72245
Distinct characters265
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique932 ?
Unique (%)23.8%

Sample

1st row서울특별시 광진구 천호대로 664
2nd row서울특별시 성동구 마장로 161
3rd row서울특별시 성동구 왕십리로 189
4th row서울특별시 성동구 마조로 42
5th row서울특별시 광진구 뚝섬로41길 33
ValueCountFrequency (%)
서울특별시 3889
24.9%
노원구 313
 
2.0%
관악구 255
 
1.6%
강서구 254
 
1.6%
은평구 245
 
1.6%
종로구 220
 
1.4%
강남구 210
 
1.3%
중구 184
 
1.2%
성북구 171
 
1.1%
마포구 171
 
1.1%
Other values (1330) 9691
62.1%
2024-05-04T05:55:26.433492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11695
16.2%
4568
 
6.3%
4098
 
5.7%
4061
 
5.6%
3945
 
5.5%
3914
 
5.4%
3889
 
5.4%
3889
 
5.4%
1 2469
 
3.4%
2240
 
3.1%
Other values (255) 27477
38.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 46978
65.0%
Decimal Number 13279
 
18.4%
Space Separator 11695
 
16.2%
Dash Punctuation 293
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4568
 
9.7%
4098
 
8.7%
4061
 
8.6%
3945
 
8.4%
3914
 
8.3%
3889
 
8.3%
3889
 
8.3%
2240
 
4.8%
791
 
1.7%
770
 
1.6%
Other values (243) 14813
31.5%
Decimal Number
ValueCountFrequency (%)
1 2469
18.6%
2 2051
15.4%
3 1295
9.8%
6 1266
9.5%
4 1253
9.4%
5 1246
9.4%
9 1114
8.4%
7 1003
7.6%
0 792
 
6.0%
8 790
 
5.9%
Space Separator
ValueCountFrequency (%)
11695
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 293
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 46978
65.0%
Common 25267
35.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4568
 
9.7%
4098
 
8.7%
4061
 
8.6%
3945
 
8.4%
3914
 
8.3%
3889
 
8.3%
3889
 
8.3%
2240
 
4.8%
791
 
1.7%
770
 
1.6%
Other values (243) 14813
31.5%
Common
ValueCountFrequency (%)
11695
46.3%
1 2469
 
9.8%
2 2051
 
8.1%
3 1295
 
5.1%
6 1266
 
5.0%
4 1253
 
5.0%
5 1246
 
4.9%
9 1114
 
4.4%
7 1003
 
4.0%
0 792
 
3.1%
Other values (2) 1083
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 46978
65.0%
ASCII 25267
35.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
11695
46.3%
1 2469
 
9.8%
2 2051
 
8.1%
3 1295
 
5.1%
6 1266
 
5.0%
4 1253
 
5.0%
5 1246
 
4.9%
9 1114
 
4.4%
7 1003
 
4.0%
0 792
 
3.1%
Other values (2) 1083
 
4.3%
Hangul
ValueCountFrequency (%)
4568
 
9.7%
4098
 
8.7%
4061
 
8.6%
3945
 
8.4%
3914
 
8.3%
3889
 
8.3%
3889
 
8.3%
2240
 
4.8%
791
 
1.7%
770
 
1.6%
Other values (243) 14813
31.5%
Distinct1217
Distinct (%)31.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
2024-05-04T05:55:26.943795image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length29
Mean length12.903787
Min length2

Characters and Unicode

Total characters50428
Distinct characters299
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique879 ?
Unique (%)22.5%

Sample

1st row/ 선화예술중고등학교 (능동)
2nd row(하왕십리동)
3rd row(행당동/행당중학교)
4th row(사근동/한양사대부속중?고등학교)
5th row(자양동/서울자양중학교)
ValueCountFrequency (%)
1462
 
19.3%
신림동 112
 
1.5%
대방동 97
 
1.3%
신당동 79
 
1.0%
아현동 75
 
1.0%
숭인동 72
 
1.0%
갈현동 67
 
0.9%
서울산업정보학교 65
 
0.9%
종로산업정보학교 63
 
0.8%
중계동 62
 
0.8%
Other values (1331) 5415
71.5%
2024-05-04T05:55:27.895545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4058
 
8.0%
( 3869
 
7.7%
) 3868
 
7.7%
3672
 
7.3%
3252
 
6.4%
3092
 
6.1%
/ 2988
 
5.9%
2438
 
4.8%
1934
 
3.8%
907
 
1.8%
Other values (289) 20350
40.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35403
70.2%
Open Punctuation 3869
 
7.7%
Close Punctuation 3868
 
7.7%
Space Separator 3672
 
7.3%
Other Punctuation 3085
 
6.1%
Decimal Number 499
 
1.0%
Dash Punctuation 32
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4058
 
11.5%
3252
 
9.2%
3092
 
8.7%
2438
 
6.9%
1934
 
5.5%
907
 
2.6%
751
 
2.1%
746
 
2.1%
572
 
1.6%
566
 
1.6%
Other values (271) 17087
48.3%
Decimal Number
ValueCountFrequency (%)
2 123
24.6%
3 114
22.8%
1 110
22.0%
4 54
10.8%
0 47
 
9.4%
6 26
 
5.2%
7 8
 
1.6%
8 8
 
1.6%
5 5
 
1.0%
9 4
 
0.8%
Other Punctuation
ValueCountFrequency (%)
/ 2988
96.9%
, 94
 
3.0%
. 2
 
0.1%
? 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 3869
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3868
100.0%
Space Separator
ValueCountFrequency (%)
3672
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35403
70.2%
Common 15025
29.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4058
 
11.5%
3252
 
9.2%
3092
 
8.7%
2438
 
6.9%
1934
 
5.5%
907
 
2.6%
751
 
2.1%
746
 
2.1%
572
 
1.6%
566
 
1.6%
Other values (271) 17087
48.3%
Common
ValueCountFrequency (%)
( 3869
25.8%
) 3868
25.7%
3672
24.4%
/ 2988
19.9%
2 123
 
0.8%
3 114
 
0.8%
1 110
 
0.7%
, 94
 
0.6%
4 54
 
0.4%
0 47
 
0.3%
Other values (8) 86
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35403
70.2%
ASCII 15025
29.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4058
 
11.5%
3252
 
9.2%
3092
 
8.7%
2438
 
6.9%
1934
 
5.5%
907
 
2.6%
751
 
2.1%
746
 
2.1%
572
 
1.6%
566
 
1.6%
Other values (271) 17087
48.3%
ASCII
ValueCountFrequency (%)
( 3869
25.8%
) 3868
25.7%
3672
24.4%
/ 2988
19.9%
2 123
 
0.8%
3 114
 
0.8%
1 110
 
0.7%
, 94
 
0.6%
4 54
 
0.4%
0 47
 
0.3%
Other values (8) 86
 
0.6%
Distinct1398
Distinct (%)35.8%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
2024-05-04T05:55:28.585599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length11
Mean length11.521238
Min length1

Characters and Unicode

Total characters45025
Distinct characters15
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1069 ?
Unique (%)27.4%

Sample

1st row02-2204-1100
2nd row02-2252-5950
3rd row02-2292-2721
4th row02-2200-3700
5th row02-446-0365
ValueCountFrequency (%)
02-6331-1900 65
 
1.7%
02-2237-0465 63
 
1.6%
02-390-5800 61
 
1.6%
02-2082-1810 41
 
1.0%
02-2292-5707 37
 
0.9%
070-8685-7600 34
 
0.9%
02-2289-1600 31
 
0.8%
02-762-1301 30
 
0.8%
02-763-1631 29
 
0.7%
02-2226-2141 28
 
0.7%
Other values (1390) 3492
89.3%
2024-05-04T05:55:29.677301image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 8357
18.6%
- 7794
17.3%
2 7482
16.6%
1 3410
7.6%
3 3008
 
6.7%
6 2804
 
6.2%
7 2552
 
5.7%
5 2534
 
5.6%
4 2365
 
5.3%
8 2353
 
5.2%
Other values (5) 2366
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 37205
82.6%
Dash Punctuation 7794
 
17.3%
Other Punctuation 22
 
< 0.1%
Space Separator 3
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 8357
22.5%
2 7482
20.1%
1 3410
9.2%
3 3008
 
8.1%
6 2804
 
7.5%
7 2552
 
6.9%
5 2534
 
6.8%
4 2365
 
6.4%
8 2353
 
6.3%
9 2340
 
6.3%
Other Punctuation
ValueCountFrequency (%)
/ 21
95.5%
. 1
 
4.5%
Dash Punctuation
ValueCountFrequency (%)
- 7794
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 45025
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 8357
18.6%
- 7794
17.3%
2 7482
16.6%
1 3410
7.6%
3 3008
 
6.7%
6 2804
 
6.2%
7 2552
 
5.7%
5 2534
 
5.6%
4 2365
 
5.3%
8 2353
 
5.2%
Other values (5) 2366
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 45025
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 8357
18.6%
- 7794
17.3%
2 7482
16.6%
1 3410
7.6%
3 3008
 
6.7%
6 2804
 
6.2%
7 2552
 
5.7%
5 2534
 
5.6%
4 2365
 
5.3%
8 2353
 
5.2%
Other values (5) 2366
 
5.3%
Distinct1401
Distinct (%)35.9%
Missing2
Missing (%)0.1%
Memory size30.7 KiB
2024-05-04T05:55:30.253324image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length33
Mean length23.06042
Min length7

Characters and Unicode

Total characters90074
Distinct characters40
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1075 ?
Unique (%)27.5%

Sample

1st rowhttps://sunhwaarts.sen.ms.kr
2nd rowhttp://soongshin.sen.es.kr
3rd rowhttp://www.haengdang.ms.kr
4th rowhttps://hyu.sen.ms.kr
5th rowhttp://jayang.sen.ms.kr
ValueCountFrequency (%)
http://www.sis.sc.kr 65
 
1.7%
http://jongno.sen.hs.kr 63
 
1.6%
http://ahyeon.sen.sc.kr 61
 
1.6%
http://seoul-th.sen.hs.kr 41
 
1.0%
http://duksoo.sen.hs.kr 37
 
0.9%
http://www.sdth.hs.kr 34
 
0.9%
http://www.ggmt.hs.kr 32
 
0.8%
donggoo.sen.hs.kr 30
 
0.8%
http://daedong.sen.hs.kr 29
 
0.7%
http://srobot.sen.hs.kr 28
 
0.7%
Other values (1391) 3486
89.2%
2024-05-04T05:55:31.283411image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 11343
12.6%
s 8131
 
9.0%
t 7871
 
8.7%
/ 7743
 
8.6%
h 7543
 
8.4%
w 5422
 
6.0%
n 5138
 
5.7%
k 4611
 
5.1%
e 4262
 
4.7%
r 4253
 
4.7%
Other values (30) 23757
26.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 66890
74.3%
Other Punctuation 22786
 
25.3%
Dash Punctuation 361
 
0.4%
Decimal Number 31
 
< 0.1%
Uppercase Letter 6
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 8131
12.2%
t 7871
11.8%
h 7543
11.3%
w 5422
8.1%
n 5138
 
7.7%
k 4611
 
6.9%
e 4262
 
6.4%
r 4253
 
6.4%
p 3854
 
5.8%
o 2884
 
4.3%
Other values (14) 12921
19.3%
Decimal Number
ValueCountFrequency (%)
8 7
22.6%
6 6
19.4%
1 6
19.4%
2 3
9.7%
3 2
 
6.5%
9 2
 
6.5%
0 2
 
6.5%
5 1
 
3.2%
4 1
 
3.2%
7 1
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 11343
49.8%
/ 7743
34.0%
: 3700
 
16.2%
Uppercase Letter
ValueCountFrequency (%)
G 4
66.7%
J 2
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 361
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 66896
74.3%
Common 23178
 
25.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 8131
12.2%
t 7871
11.8%
h 7543
11.3%
w 5422
8.1%
n 5138
 
7.7%
k 4611
 
6.9%
e 4262
 
6.4%
r 4253
 
6.4%
p 3854
 
5.8%
o 2884
 
4.3%
Other values (16) 12927
19.3%
Common
ValueCountFrequency (%)
. 11343
48.9%
/ 7743
33.4%
: 3700
 
16.0%
- 361
 
1.6%
8 7
 
< 0.1%
6 6
 
< 0.1%
1 6
 
< 0.1%
2 3
 
< 0.1%
3 2
 
< 0.1%
9 2
 
< 0.1%
Other values (4) 5
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90074
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 11343
12.6%
s 8131
 
9.0%
t 7871
 
8.7%
/ 7743
 
8.6%
h 7543
 
8.4%
w 5422
 
6.0%
n 5138
 
5.7%
k 4611
 
5.1%
e 4262
 
4.7%
r 4253
 
4.7%
Other values (30) 23757
26.4%
Distinct1383
Distinct (%)35.4%
Missing2
Missing (%)0.1%
Memory size30.7 KiB
2024-05-04T05:55:31.961127image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length11
Mean length11.365591
Min length8

Characters and Unicode

Total characters44394
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1060 ?
Unique (%)27.1%

Sample

1st row02-453-4641
2nd row02-2236-6134
3rd row02-2293-0473
4th row02-2298-3173
5th row02-458-7047
ValueCountFrequency (%)
02-874-0535 65
 
1.7%
02-2238-0598 63
 
1.6%
02-313-7606 61
 
1.6%
02-825-2030 41
 
1.0%
02-2299-0703 37
 
0.9%
02-2234-1950 34
 
0.9%
02-978-4327 32
 
0.8%
02-747-6525 30
 
0.8%
02-763-1632 29
 
0.7%
02-2226-5346 28
 
0.7%
Other values (1374) 3487
89.3%
2024-05-04T05:55:33.072492image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 7802
17.6%
2 7577
17.1%
0 6595
14.9%
3 3268
7.4%
6 3116
 
7.0%
9 2980
 
6.7%
5 2848
 
6.4%
8 2697
 
6.1%
7 2588
 
5.8%
4 2549
 
5.7%
Other values (3) 2374
 
5.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36589
82.4%
Dash Punctuation 7802
 
17.6%
Other Punctuation 2
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 7577
20.7%
0 6595
18.0%
3 3268
8.9%
6 3116
8.5%
9 2980
 
8.1%
5 2848
 
7.8%
8 2697
 
7.4%
7 2588
 
7.1%
4 2549
 
7.0%
1 2371
 
6.5%
Dash Punctuation
ValueCountFrequency (%)
- 7802
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 2
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 44394
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 7802
17.6%
2 7577
17.1%
0 6595
14.9%
3 3268
7.4%
6 3116
 
7.0%
9 2980
 
6.7%
5 2848
 
6.4%
8 2697
 
6.1%
7 2588
 
5.8%
4 2549
 
5.7%
Other values (3) 2374
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 44394
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 7802
17.6%
2 7577
17.1%
0 6595
14.9%
3 3268
7.4%
6 3116
 
7.0%
9 2980
 
6.7%
5 2848
 
6.4%
8 2697
 
6.1%
7 2588
 
5.8%
4 2549
 
5.7%
Other values (3) 2374
 
5.3%
Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
남여공학
2774 
677 
457 

Length

Max length4
Median length4
Mean length3.129478
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row남여공학
2nd row남여공학
3rd row남여공학
4th row남여공학
5th row남여공학

Common Values

ValueCountFrequency (%)
남여공학 2774
71.0%
677
 
17.3%
457
 
11.7%

Length

2024-05-04T05:55:33.590242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:33.935512image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
남여공학 2774
71.0%
677
 
17.3%
457
 
11.7%
Distinct6
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
특성화고
1427 
<NA>
1099 
일반고
1023 
특목고
183 
자율고
174 

Length

Max length4
Median length4
Mean length3.6458547
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
특성화고 1427
36.5%
<NA> 1099
28.1%
일반고 1023
26.2%
특목고 183
 
4.7%
자율고 174
 
4.5%
99 2
 
0.1%

Length

2024-05-04T05:55:34.445140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:34.874646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
특성화고 1427
36.5%
na 1099
28.1%
일반고 1023
26.2%
특목고 183
 
4.7%
자율고 174
 
4.5%
99 2
 
0.1%
Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
False
3867 
True
 
41
ValueCountFrequency (%)
False 3867
99.0%
True 41
 
1.0%
2024-05-04T05:55:35.288439image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
일반계
2196 
전문계
1551 
해당없음
 
129
<NA>
 
32

Length

Max length4
Median length3
Mean length3.0411975
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row일반계
2nd row일반계
3rd row일반계
4th row일반계
5th row일반계

Common Values

ValueCountFrequency (%)
일반계 2196
56.2%
전문계 1551
39.7%
해당없음 129
 
3.3%
<NA> 32
 
0.8%

Length

2024-05-04T05:55:35.752947image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:36.142794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반계 2196
56.2%
전문계 1551
39.7%
해당없음 129
 
3.3%
na 32
 
0.8%

특수목적고등학교계열명
Categorical

IMBALANCE 

Distinct7
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
<NA>
3716 
산업수요 맞춤형 고등학교
 
76
외국어계열
 
54
예술계열
 
52
과학계열
 
8
Other values (2)
 
2

Length

Max length13
Median length4
Mean length4.1888434
Min length4

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 3716
95.1%
산업수요 맞춤형 고등학교 76
 
1.9%
외국어계열 54
 
1.4%
예술계열 52
 
1.3%
과학계열 8
 
0.2%
국제계열 1
 
< 0.1%
체육계열 1
 
< 0.1%

Length

2024-05-04T05:55:36.542536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:36.924273image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 3716
91.5%
산업수요 76
 
1.9%
맞춤형 76
 
1.9%
고등학교 76
 
1.9%
외국어계열 54
 
1.3%
예술계열 52
 
1.3%
과학계열 8
 
0.2%
국제계열 1
 
< 0.1%
체육계열 1
 
< 0.1%
Distinct3
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
전기
2904 
후기
998 
전후기
 
6

Length

Max length3
Median length2
Mean length2.0015353
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전기
2nd row전기
3rd row전기
4th row전기
5th row전기

Common Values

ValueCountFrequency (%)
전기 2904
74.3%
후기 998
 
25.5%
전후기 6
 
0.2%

Length

2024-05-04T05:55:37.338023image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:37.675132image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전기 2904
74.3%
후기 998
 
25.5%
전후기 6
 
0.2%

주야구분명
Categorical

IMBALANCE 

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
주간
3855 
주야간
 
53

Length

Max length3
Median length2
Mean length2.0135619
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row주간
2nd row주간
3rd row주간
4th row주간
5th row주간

Common Values

ValueCountFrequency (%)
주간 3855
98.6%
주야간 53
 
1.4%

Length

2024-05-04T05:55:38.057445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:38.459295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주간 3855
98.6%
주야간 53
 
1.4%

설립일자
Real number (ℝ)

Distinct888
Distinct (%)22.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19696974
Minimum18820908
Maximum20240301
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size34.5 KiB
2024-05-04T05:55:38.848298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18820908
5-th percentile19080401
Q119550422
median19770901
Q319873443
95-th percentile20090102
Maximum20240301
Range1419393
Interquartile range (IQR)323020.5

Descriptive statistics

Standard deviation277040.06
Coefficient of variation (CV)0.014065107
Kurtosis0.53903164
Mean19696974
Median Absolute Deviation (MAD)140206
Skewness-0.9295473
Sum7.6975776 × 1010
Variance7.6751193 × 1010
MonotonicityNot monotonic
2024-05-04T05:55:39.419713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19830302 65
 
1.7%
19901110 63
 
1.6%
19540806 61
 
1.6%
19831226 52
 
1.3%
19841217 50
 
1.3%
19900120 43
 
1.1%
19871215 41
 
1.0%
18990505 41
 
1.0%
19940105 40
 
1.0%
19100413 38
 
1.0%
Other values (878) 3414
87.4%
ValueCountFrequency (%)
18820908 1
 
< 0.1%
18850509 6
0.2%
18850608 4
0.1%
18850803 1
 
< 0.1%
18860531 4
0.1%
18871020 5
0.1%
18940406 6
0.2%
18940918 1
 
< 0.1%
18950416 1
 
< 0.1%
18950719 1
 
< 0.1%
ValueCountFrequency (%)
20240301 1
 
< 0.1%
20230301 2
0.1%
20220301 4
0.1%
20210301 3
0.1%
20200301 4
0.1%
20190901 2
0.1%
20190501 1
 
< 0.1%
20190301 3
0.1%
20180901 1
 
< 0.1%
20180208 1
 
< 0.1%

개교기념일
Real number (ℝ)

Distinct1105
Distinct (%)28.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19706405
Minimum18820908
Maximum20240304
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size34.5 KiB
2024-05-04T05:55:40.139990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum18820908
5-th percentile19080608
Q119550301
median19780315
Q319900426
95-th percentile20090506
Maximum20240304
Range1419396
Interquartile range (IQR)350125.25

Descriptive statistics

Standard deviation279096.63
Coefficient of variation (CV)0.014162737
Kurtosis0.48776495
Mean19706405
Median Absolute Deviation (MAD)149101
Skewness-0.90698251
Sum7.7012631 × 1010
Variance7.7894928 × 1010
MonotonicityNot monotonic
2024-05-04T05:55:40.651079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19831002 65
 
1.7%
19901110 63
 
1.6%
19540806 61
 
1.6%
18990505 41
 
1.0%
19100413 38
 
1.0%
19440401 34
 
0.9%
20020101 32
 
0.8%
19420617 31
 
0.8%
19570215 31
 
0.8%
19841217 30
 
0.8%
Other values (1095) 3482
89.1%
ValueCountFrequency (%)
18820908 1
 
< 0.1%
18850509 6
0.2%
18850608 5
0.1%
18860531 4
0.1%
18871020 1
 
< 0.1%
18940406 6
0.2%
18940918 1
 
< 0.1%
18950416 1
 
< 0.1%
18950918 1
 
< 0.1%
18950930 1
 
< 0.1%
ValueCountFrequency (%)
20240304 1
 
< 0.1%
20220501 2
 
0.1%
20220301 2
 
0.1%
20211205 16
0.4%
20211013 1
 
< 0.1%
20210301 2
 
0.1%
20200420 1
 
< 0.1%
20200301 3
 
0.1%
20190902 1
 
< 0.1%
20190901 2
 
0.1%

시도교육청코드
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
B10
3908 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowB10
2nd rowB10
3rd rowB10
4th rowB10
5th rowB10

Common Values

ValueCountFrequency (%)
B10 3908
100.0%

Length

2024-05-04T05:55:41.052918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:41.320508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
b10 3908
100.0%

시도교육청명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
서울특별시교육청
3908 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시교육청
2nd row서울특별시교육청
3rd row서울특별시교육청
4th row서울특별시교육청
5th row서울특별시교육청

Common Values

ValueCountFrequency (%)
서울특별시교육청 3908
100.0%

Length

2024-05-04T05:55:41.594123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:41.889560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시교육청 3908
100.0%

소재지명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
서울특별시
3908 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 3908
100.0%

Length

2024-05-04T05:55:42.275432image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:42.667583image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 3908
100.0%

주야과정
Categorical

IMBALANCE 

Distinct4
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
주간
2802 
<NA>
1064 
야간
 
38
산업체특별
 
4

Length

Max length5
Median length2
Mean length2.5475947
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
주간 2802
71.7%
<NA> 1064
 
27.2%
야간 38
 
1.0%
산업체특별 4
 
0.1%

Length

2024-05-04T05:55:43.140967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:55:43.488538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
주간 2802
71.7%
na 1064
 
27.2%
야간 38
 
1.0%
산업체특별 4
 
0.1%

계열명
Categorical

Distinct21
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size30.7 KiB
일반계
1138 
<NA>
1064 
공업계
538 
상업계
387 
특성화
381 
Other values (16)
400 

Length

Max length7
Median length3
Mean length3.2978506
Min length3

Unique

Unique5 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
일반계 1138
29.1%
<NA> 1064
27.2%
공업계 538
13.8%
상업계 387
 
9.9%
특성화 381
 
9.7%
통합계 191
 
4.9%
예술계 64
 
1.6%
가사계 51
 
1.3%
외국어계 44
 
1.1%
가사실업계 16
 
0.4%
Other values (11) 34
 
0.9%

Length

2024-05-04T05:55:44.021931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
일반계 1138
29.1%
na 1064
27.2%
공업계 538
13.8%
상업계 387
 
9.9%
특성화 381
 
9.7%
통합계 191
 
4.9%
예술계 64
 
1.6%
가사계 51
 
1.3%
외국어계 44
 
1.1%
가사실업계 16
 
0.4%
Other values (11) 34
 
0.9%

학과명
Text

MISSING 

Distinct869
Distinct (%)30.6%
Missing1064
Missing (%)27.2%
Memory size30.7 KiB
2024-05-04T05:55:44.772001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length10
Mean length5.3565401
Min length2

Characters and Unicode

Total characters15234
Distinct characters319
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique592 ?
Unique (%)20.8%

Sample

1st row일반학과
2nd row멀티미디어과
3rd row인터랙티브미디어과
4th row뉴미디어웹솔루션과
5th row뉴미디어솔루션과
ValueCountFrequency (%)
일반학과 251
 
8.8%
공통과정 234
 
8.2%
인문사회과정 203
 
7.1%
자연과정 199
 
7.0%
정보처리과 51
 
1.8%
경영정보과 39
 
1.4%
전기과 38
 
1.3%
전자과 25
 
0.9%
시각디자인과 25
 
0.9%
기계과 25
 
0.9%
Other values (862) 1766
61.8%
2024-05-04T05:55:45.997548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2821
 
18.5%
1000
 
6.6%
613
 
4.0%
513
 
3.4%
373
 
2.4%
369
 
2.4%
318
 
2.1%
296
 
1.9%
282
 
1.9%
279
 
1.8%
Other values (309) 8370
54.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 14803
97.2%
Uppercase Letter 122
 
0.8%
Close Punctuation 109
 
0.7%
Open Punctuation 109
 
0.7%
Decimal Number 22
 
0.1%
Other Punctuation 21
 
0.1%
Lowercase Letter 19
 
0.1%
Dash Punctuation 17
 
0.1%
Space Separator 12
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2821
 
19.1%
1000
 
6.8%
613
 
4.1%
513
 
3.5%
373
 
2.5%
369
 
2.5%
318
 
2.1%
296
 
2.0%
282
 
1.9%
279
 
1.9%
Other values (277) 7939
53.6%
Uppercase Letter
ValueCountFrequency (%)
I 23
18.9%
D 23
18.9%
T 17
13.9%
A 17
13.9%
C 13
10.7%
M 9
 
7.4%
O 5
 
4.1%
S 3
 
2.5%
R 2
 
1.6%
V 2
 
1.6%
Other values (6) 8
 
6.6%
Lowercase Letter
ValueCountFrequency (%)
e 9
47.4%
z 3
 
15.8%
i 3
 
15.8%
b 3
 
15.8%
u 1
 
5.3%
Other Punctuation
ValueCountFrequency (%)
? 8
38.1%
· 8
38.1%
/ 4
19.0%
1
 
4.8%
Decimal Number
ValueCountFrequency (%)
3 10
45.5%
2 6
27.3%
1 6
27.3%
Close Punctuation
ValueCountFrequency (%)
) 109
100.0%
Open Punctuation
ValueCountFrequency (%)
( 109
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 14803
97.2%
Common 290
 
1.9%
Latin 141
 
0.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2821
 
19.1%
1000
 
6.8%
613
 
4.1%
513
 
3.5%
373
 
2.5%
369
 
2.5%
318
 
2.1%
296
 
2.0%
282
 
1.9%
279
 
1.9%
Other values (277) 7939
53.6%
Latin
ValueCountFrequency (%)
I 23
16.3%
D 23
16.3%
T 17
12.1%
A 17
12.1%
C 13
9.2%
M 9
 
6.4%
e 9
 
6.4%
O 5
 
3.5%
S 3
 
2.1%
z 3
 
2.1%
Other values (11) 19
13.5%
Common
ValueCountFrequency (%)
) 109
37.6%
( 109
37.6%
- 17
 
5.9%
12
 
4.1%
3 10
 
3.4%
? 8
 
2.8%
· 8
 
2.8%
2 6
 
2.1%
1 6
 
2.1%
/ 4
 
1.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 14802
97.2%
ASCII 422
 
2.8%
None 9
 
0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2821
 
19.1%
1000
 
6.8%
613
 
4.1%
513
 
3.5%
373
 
2.5%
369
 
2.5%
318
 
2.1%
296
 
2.0%
282
 
1.9%
279
 
1.9%
Other values (276) 7938
53.6%
ASCII
ValueCountFrequency (%)
) 109
25.8%
( 109
25.8%
I 23
 
5.5%
D 23
 
5.5%
T 17
 
4.0%
- 17
 
4.0%
A 17
 
4.0%
C 13
 
3.1%
12
 
2.8%
3 10
 
2.4%
Other values (20) 72
17.1%
None
ValueCountFrequency (%)
· 8
88.9%
1
 
11.1%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

적재일시
Real number (ℝ)

Distinct33
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20231076
Minimum20230615
Maximum20240428
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size34.5 KiB
2024-05-04T05:55:46.363295image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20230615
5-th percentile20230615
Q120230615
median20230615
Q320230615
95-th percentile20231112
Maximum20240428
Range9813
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2042.0904
Coefficient of variation (CV)0.0001009383
Kurtosis16.518342
Mean20231076
Median Absolute Deviation (MAD)0
Skewness4.3004036
Sum7.9063047 × 1010
Variance4170133
MonotonicityNot monotonic
2024-05-04T05:55:46.937098image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
20230615 3253
83.2%
20230627 251
 
6.4%
20240324 60
 
1.5%
20230629 56
 
1.4%
20230709 47
 
1.2%
20240225 37
 
0.9%
20230705 33
 
0.8%
20240310 31
 
0.8%
20240414 27
 
0.7%
20240303 19
 
0.5%
Other values (23) 94
 
2.4%
ValueCountFrequency (%)
20230615 3253
83.2%
20230623 7
 
0.2%
20230624 4
 
0.1%
20230627 251
 
6.4%
20230628 5
 
0.1%
20230629 56
 
1.4%
20230702 6
 
0.2%
20230705 33
 
0.8%
20230709 47
 
1.2%
20230827 1
 
< 0.1%
ValueCountFrequency (%)
20240428 1
 
< 0.1%
20240421 1
 
< 0.1%
20240414 27
0.7%
20240407 1
 
< 0.1%
20240331 2
 
0.1%
20240324 60
1.5%
20240310 31
0.8%
20240303 19
 
0.5%
20240225 37
0.9%
20240204 1
 
< 0.1%

Sample

학교종류명설립구분표준학교코드학교명영문학교명관할조직명도로명우편번호도로명주소도로명상세주소전화번호홈페이지주소팩스번호남녀공학구분명고등학교구분명산업체특별학급존재여부고등학교일반실업구분명특수목적고등학교계열명입시전후기구분명주야구분명설립일자개교기념일시도교육청코드시도교육청명소재지명주야과정계열명학과명적재일시
0각종학교(중)사립7134155선화예술중학교Sunhwa Arts Middle School서울특별시성동광진교육지원청4991서울특별시 광진구 천호대로 664/ 선화예술중고등학교 (능동)02-2204-1100https://sunhwaarts.sen.ms.kr02-453-4641남여공학<NA>N일반계<NA>전기주간1973120119730705B10서울특별시교육청서울특별시<NA><NA><NA>20230627
1초등학교공립7134150서울숭신초등학교Seoul Soongshin Elementary School서울특별시성동광진교육지원청4702서울특별시 성동구 마장로 161(하왕십리동)02-2252-5950http://soongshin.sen.es.kr02-2236-6134남여공학<NA>N일반계<NA>전기주간1959040319590403B10서울특별시교육청서울특별시<NA><NA><NA>20230615
2중학교공립7134142행당중학교Haengdang Middle School서울특별시성동광진교육지원청4764서울특별시 성동구 왕십리로 189(행당동/행당중학교)02-2292-2721http://www.haengdang.ms.kr02-2293-0473남여공학<NA>N일반계<NA>전기주간1968080619681002B10서울특별시교육청서울특별시<NA><NA><NA>20230615
3중학교사립7134141한양대학교사범대학부속중학교Hanyang University Middle School서울특별시성동광진교육지원청4761서울특별시 성동구 마조로 42(사근동/한양사대부속중?고등학교)02-2200-3700https://hyu.sen.ms.kr02-2298-3173남여공학<NA>N일반계<NA>전기주간1960011819600118B10서울특별시교육청서울특별시<NA><NA><NA>20230615
4중학교공립7134140자양중학교Jayang Middle School서울특별시성동광진교육지원청5069서울특별시 광진구 뚝섬로41길 33(자양동/서울자양중학교)02-446-0365http://jayang.sen.ms.kr02-458-7047남여공학<NA>N일반계<NA>전기주간1984030119840301B10서울특별시교육청서울특별시<NA><NA><NA>20230615
5중학교공립7134139용곡중학교Yong-gok Middle School서울특별시성동광진교육지원청4940서울특별시 광진구 용마산로22길 76(중곡동/용곡중학교)02-452-2622http://yonggok.sen.ms.kr/index.do02-458-8566남여공학<NA>N일반계<NA>전기주간1982120919830504B10서울특별시교육청서울특별시<NA><NA><NA>20230615
6중학교공립7134138옥정중학교Ok-Jung Middle School서울특별시성동광진교육지원청4734서울특별시 성동구 한림말길 11/ 옥정중학교 (옥수동)02-6021-2100http://okjung.sen.ms.kr02-2298-1303남여공학<NA>N일반계<NA>전기주간1984010919840511B10서울특별시교육청서울특별시<NA><NA><NA>20230615
7중학교공립7134137양진중학교Yangjin Middle School서울특별시성동광진교육지원청4982서울특별시 광진구 워커힐로 32/ 양진중학교02-2049-1200http://www.yangjin.ms.kr02-2049-1208/1220남여공학<NA>N해당없음<NA>전기주간2005122920060514B10서울특별시교육청서울특별시<NA><NA><NA>20230615
8중학교공립7134136신양중학교Shinyang Middle School서울특별시성동광진교육지원청5087서울특별시 광진구 자양강변길 73/ 신양중학교 (자양동)02-461-1070http://shinyang.ms.kr02-469-8386남여공학<NA>N일반계<NA>전기주간1984010919840509B10서울특별시교육청서울특별시<NA><NA><NA>20230615
9중학교공립7134135성원중학교Sungwon Middle School서울특별시성동광진교육지원청4775서울특별시 성동구 성덕정9가길 13(성수동2가/ 성원중학교)02-3408-2558http://www.sungwon.ms.kr02-463-0676남여공학<NA>N일반계<NA>전기주간1968080619681002B10서울특별시교육청서울특별시<NA><NA><NA>20230615
학교종류명설립구분표준학교코드학교명영문학교명관할조직명도로명우편번호도로명주소도로명상세주소전화번호홈페이지주소팩스번호남녀공학구분명고등학교구분명산업체특별학급존재여부고등학교일반실업구분명특수목적고등학교계열명입시전후기구분명주야구분명설립일자개교기념일시도교육청코드시도교육청명소재지명주야과정계열명학과명적재일시
3898고등학교국립1371663국립전통예술고등학교NATIONAL HIGH SCHOOL OF TRADITIONAL KOREAN ARTS교육부8650서울특별시 금천구 시흥대로38길 62(시흥동)02-896-1094http://kugak-am.hs.kr02-896-1096남여공학특목고N일반계예술계열전기주간1960051319600513B10서울특별시교육청서울특별시주간예술계창작연희과20230615
3899고등학교국립1371663국립전통예술고등학교NATIONAL HIGH SCHOOL OF TRADITIONAL KOREAN ARTS교육부8650서울특별시 금천구 시흥대로38길 62(시흥동)02-896-1094http://kugak-am.hs.kr02-896-1096남여공학특목고N일반계예술계열전기주간1960051319600513B10서울특별시교육청서울특별시주간예술계타악과20230615
3900고등학교국립1371663국립전통예술고등학교NATIONAL HIGH SCHOOL OF TRADITIONAL KOREAN ARTS교육부8650서울특별시 금천구 시흥대로38길 62(시흥동)02-896-1094http://kugak-am.hs.kr02-896-1096남여공학특목고N일반계예술계열전기주간1960051319600513B10서울특별시교육청서울특별시주간예술계한국음악과20230615
3901각종학교(중)국립1371662국립국악중학교Gukak National Middle School교육부6311서울특별시 강남구 개포로22길 65/ 국립국악중고등학교 (개포동)02-3460-0500http://gugak.sen.ms.kr02-579-6013남여공학<NA>N일반계<NA>전기주간1955040119910301B10서울특별시교육청서울특별시<NA><NA><NA>20230709
3902고등학교국립1371661국립국악고등학교Gugak National High School교육부6311서울특별시 강남구 개포로22길 65(개포동/ 국악고등학교)02-3460-0500http://gugak.sen.hs.kr02-3460-0555남여공학특목고N일반계예술계열전기주간1955040119720909B10서울특별시교육청서울특별시주간예술계무용과20230615
3903고등학교국립1371661국립국악고등학교Gugak National High School교육부6311서울특별시 강남구 개포로22길 65(개포동/ 국악고등학교)02-3460-0500http://gugak.sen.hs.kr02-3460-0555남여공학특목고N일반계예술계열전기주간1955040119720909B10서울특별시교육청서울특별시주간예술계국악과20230615
3904특수학교국립1342102한국우진학교Hanguk Woojin School교육부3934서울특별시 마포구 월드컵북로38길 21/ 한국우진학교 (중동)02-6388-5800http://woojin.sen.sc.kr02-6388-5999남여공학<NA>N해당없음<NA>전기주간2000030120000306B10서울특별시교육청서울특별시<NA><NA><NA>20230709
3905특수학교국립1342099서울농학교Seoul National school for the Deaf교육부3032서울특별시 종로구 필운대로 103(신교동)02-737-0659http://seoulnong.sen.sc.kr02-723-5848남여공학<NA>N해당없음<NA>전기주간1913040119130401B10서울특별시교육청서울특별시<NA><NA><NA>20230709
3906특수학교국립1342098서울맹학교Seoul National School for the Blind교육부3032서울특별시 종로구 필운대로 97(신교동)02-731-6772www.bl.sc.kr02-722-0845남여공학<NA>N해당없음<NA>전기주간1913040119130401B10서울특별시교육청서울특별시<NA><NA><NA>20230709
3907공동실습소공립0경기기계공업고등학교부설미래기술교육센터.서울특별시교육청1810서울특별시 노원구 공릉로 264(하계동/ 경기기계공업고등학교)02-970-8922http://www.ggmt.hs.kr02-978-4327남여공학<NA>N해당없음<NA>전기주간1982060119820601B10서울특별시교육청서울특별시주간기계공동실습소공동실습소20230615

Duplicate rows

Most frequently occurring

학교종류명설립구분표준학교코드학교명영문학교명관할조직명도로명우편번호도로명주소도로명상세주소전화번호홈페이지주소팩스번호남녀공학구분명고등학교구분명산업체특별학급존재여부고등학교일반실업구분명특수목적고등학교계열명입시전후기구분명주야구분명설립일자개교기념일시도교육청코드시도교육청명소재지명주야과정계열명학과명적재일시# duplicates
0각종학교(고)공립7010566아현산업정보학교Ahyeon Vocational School서울특별시교육청4117서울특별시 마포구 마포대로 249(아현동/ 아현산업정보학교/ 아현직업학교)02-390-5800http://ahyeon.sen.sc.kr02-313-7606남여공학특성화고N전문계<NA>전기주간1954080619540806B10서울특별시교육청서울특별시주간통합계웹미디어과202306272