Overview

Dataset statistics

Number of variables22
Number of observations10000
Missing cells22850
Missing cells (%)10.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 MiB
Average record size in memory193.0 B

Variable types

Categorical5
Numeric9
Text6
Boolean2

Dataset

Description행정구역명,학원/교습소,학원지정번호,학원명,도로명주소,도로명상세주소,분야명,교습계열명,교습과정목록명,교습과정명,정원합계,일시수용능력인원합계,인당수강료내용,수강료공개여부,기숙사학원여부,도로명우편번호,등록상태명,등록일자,휴원시작일자,휴원종료일자,개설일자,적재일시
Author서울특별시교육청
URLhttps://data.seoul.go.kr/dataList/OA-20528/S/1/datasetView.do

Alerts

등록상태명 has constant value ""Constant
수강료공개여부 is highly imbalanced (54.4%)Imbalance
기숙사학원여부 is highly imbalanced (97.7%)Imbalance
교습과정목록명 has 2937 (29.4%) missing valuesMissing
교습과정명 has 939 (9.4%) missing valuesMissing
인당수강료내용 has 7813 (78.1%) missing valuesMissing
기숙사학원여부 has 513 (5.1%) missing valuesMissing
휴원시작일자 has 9847 (98.5%) missing valuesMissing
휴원종료일자 has 782 (7.8%) missing valuesMissing
정원합계 is highly skewed (γ1 = 67.57890299)Skewed
일시수용능력인원합계 is highly skewed (γ1 = 27.2173483)Skewed
학원지정번호 has unique valuesUnique
정원합계 has 674 (6.7%) zerosZeros
일시수용능력인원합계 has 238 (2.4%) zerosZeros

Reproduction

Analysis started2024-05-04 05:49:36.599581
Analysis finished2024-05-04 05:49:41.217709
Duration4.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

행정구역명
Categorical

Distinct26
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
강남구
1390 
양천구
839 
송파구
771 
서초구
718 
강서구
 
534
Other values (21)
5748 

Length

Max length4
Median length3
Mean length3.0876
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강서구
2nd row강남구
3rd row마포구
4th row강서구
5th row강서구

Common Values

ValueCountFrequency (%)
강남구 1390
13.9%
양천구 839
 
8.4%
송파구 771
 
7.7%
서초구 718
 
7.2%
강서구 534
 
5.3%
노원구 523
 
5.2%
강동구 502
 
5.0%
마포구 491
 
4.9%
은평구 448
 
4.5%
동작구 365
 
3.6%
Other values (16) 3419
34.2%

Length

2024-05-04T05:49:41.534852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
강남구 1390
13.9%
양천구 839
 
8.4%
송파구 771
 
7.7%
서초구 718
 
7.2%
강서구 534
 
5.3%
노원구 523
 
5.2%
강동구 502
 
5.0%
마포구 491
 
4.9%
은평구 448
 
4.5%
동작구 365
 
3.6%
Other values (16) 3419
34.2%

학원/교습소
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
학원
5997 
교습소
4003 

Length

Max length3
Median length2
Mean length2.4003
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row교습소
2nd row학원
3rd row교습소
4th row교습소
5th row교습소

Common Values

ValueCountFrequency (%)
학원 5997
60.0%
교습소 4003
40.0%

Length

2024-05-04T05:49:42.025424image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:49:42.330657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학원 5997
60.0%
교습소 4003
40.0%

학원지정번호
Real number (ℝ)

UNIQUE 

Distinct10000
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.2424287 × 109
Minimum289
Maximum3.0000503 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T05:49:42.764941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum289
5-th percentile10100.3
Q11.0000368 × 109
median3.0000271 × 109
Q33.0000385 × 109
95-th percentile3.0000466 × 109
Maximum3.0000503 × 109
Range3.00005 × 109
Interquartile range (IQR)2.0000017 × 109

Descriptive statistics

Standard deviation1.2413748 × 109
Coefficient of variation (CV)0.55358495
Kurtosis-0.62256649
Mean2.2424287 × 109
Median Absolute Deviation (MAD)13811
Skewness-1.1226602
Sum2.2424287 × 1013
Variance1.5410114 × 1018
MonotonicityNot monotonic
2024-05-04T05:49:43.261119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3000033849 1
 
< 0.1%
3000021911 1
 
< 0.1%
3000010496 1
 
< 0.1%
3000042934 1
 
< 0.1%
19450 1
 
< 0.1%
3000019336 1
 
< 0.1%
3000040333 1
 
< 0.1%
3000016777 1
 
< 0.1%
3000042534 1
 
< 0.1%
14032 1
 
< 0.1%
Other values (9990) 9990
99.9%
ValueCountFrequency (%)
289 1
< 0.1%
296 1
< 0.1%
301 1
< 0.1%
305 1
< 0.1%
306 1
< 0.1%
331 1
< 0.1%
366 1
< 0.1%
370 1
< 0.1%
383 1
< 0.1%
390 1
< 0.1%
ValueCountFrequency (%)
3000050283 1
< 0.1%
3000050279 1
< 0.1%
3000050277 1
< 0.1%
3000050275 1
< 0.1%
3000050272 1
< 0.1%
3000050271 1
< 0.1%
3000050267 1
< 0.1%
3000050265 1
< 0.1%
3000050263 1
< 0.1%
3000050261 1
< 0.1%
Distinct9615
Distinct (%)96.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-05-04T05:49:43.816442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length36
Mean length9.4928
Min length1

Characters and Unicode

Total characters94928
Distinct characters966
Distinct categories12 ?
Distinct scripts5 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9293 ?
Unique (%)92.9%

Sample

1st row가우스엠(M)수학교습소
2nd row개포닥터정이클래스학원
3rd row아우름첼로아카데미첼로교습소
4th row뮤엠영어신정교습소
5th row리드나인주니어영어교습소
ValueCountFrequency (%)
english)영어교습소 13
 
0.1%
english)학원 7
 
0.1%
academy)학원 5
 
< 0.1%
오하운폴댄스학원 5
 
< 0.1%
아름다운음악교습소 4
 
< 0.1%
정수학교습소 4
 
< 0.1%
연세음악학원 4
 
< 0.1%
화가마을미술교습소 4
 
< 0.1%
리드앤톡영어학원 4
 
< 0.1%
다빈치미술교습소 4
 
< 0.1%
Other values (9726) 10105
99.5%
2024-05-04T05:49:44.797007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7814
 
8.2%
6283
 
6.6%
4337
 
4.6%
4167
 
4.4%
4139
 
4.4%
2360
 
2.5%
1966
 
2.1%
1951
 
2.1%
1895
 
2.0%
1748
 
1.8%
Other values (956) 58268
61.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 88154
92.9%
Uppercase Letter 2462
 
2.6%
Lowercase Letter 1875
 
2.0%
Close Punctuation 728
 
0.8%
Open Punctuation 728
 
0.8%
Decimal Number 633
 
0.7%
Space Separator 159
 
0.2%
Other Punctuation 159
 
0.2%
Dash Punctuation 20
 
< 0.1%
Math Symbol 8
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7814
 
8.9%
6283
 
7.1%
4337
 
4.9%
4167
 
4.7%
4139
 
4.7%
2360
 
2.7%
1966
 
2.2%
1951
 
2.2%
1895
 
2.1%
1748
 
2.0%
Other values (873) 51494
58.4%
Lowercase Letter
ValueCountFrequency (%)
e 227
12.1%
i 168
 
9.0%
n 157
 
8.4%
a 146
 
7.8%
s 143
 
7.6%
o 127
 
6.8%
l 120
 
6.4%
t 110
 
5.9%
h 96
 
5.1%
r 88
 
4.7%
Other values (17) 493
26.3%
Uppercase Letter
ValueCountFrequency (%)
E 262
 
10.6%
S 251
 
10.2%
M 207
 
8.4%
A 170
 
6.9%
T 165
 
6.7%
C 142
 
5.8%
I 117
 
4.8%
L 113
 
4.6%
N 110
 
4.5%
B 93
 
3.8%
Other values (16) 832
33.8%
Decimal Number
ValueCountFrequency (%)
2 202
31.9%
1 165
26.1%
3 85
13.4%
0 67
 
10.6%
4 35
 
5.5%
5 27
 
4.3%
7 22
 
3.5%
6 13
 
2.1%
9 9
 
1.4%
8 8
 
1.3%
Other Punctuation
ValueCountFrequency (%)
. 51
32.1%
& 44
27.7%
? 20
 
12.6%
, 14
 
8.8%
' 11
 
6.9%
: 7
 
4.4%
! 6
 
3.8%
# 2
 
1.3%
% 2
 
1.3%
/ 2
 
1.3%
Open Punctuation
ValueCountFrequency (%)
( 722
99.2%
[ 5
 
0.7%
{ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 723
99.3%
] 5
 
0.7%
Space Separator
ValueCountFrequency (%)
159
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%
Math Symbol
ValueCountFrequency (%)
+ 8
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Other Number
ValueCountFrequency (%)
² 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 88138
92.8%
Latin 4336
 
4.6%
Common 2437
 
2.6%
Han 16
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7814
 
8.9%
6283
 
7.1%
4337
 
4.9%
4167
 
4.7%
4139
 
4.7%
2360
 
2.7%
1966
 
2.2%
1951
 
2.2%
1895
 
2.2%
1748
 
2.0%
Other values (860) 51478
58.4%
Latin
ValueCountFrequency (%)
E 262
 
6.0%
S 251
 
5.8%
e 227
 
5.2%
M 207
 
4.8%
A 170
 
3.9%
i 168
 
3.9%
T 165
 
3.8%
n 157
 
3.6%
a 146
 
3.4%
s 143
 
3.3%
Other values (42) 2440
56.3%
Common
ValueCountFrequency (%)
) 723
29.7%
( 722
29.6%
2 202
 
8.3%
1 165
 
6.8%
159
 
6.5%
3 85
 
3.5%
0 67
 
2.7%
. 51
 
2.1%
& 44
 
1.8%
4 35
 
1.4%
Other values (20) 184
 
7.6%
Han
ValueCountFrequency (%)
3
18.8%
2
12.5%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
1
 
6.2%
Other values (3) 3
18.8%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 88137
92.8%
ASCII 6772
 
7.1%
CJK 15
 
< 0.1%
None 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%
CJK Compat Ideographs 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7814
 
8.9%
6283
 
7.1%
4337
 
4.9%
4167
 
4.7%
4139
 
4.7%
2360
 
2.7%
1966
 
2.2%
1951
 
2.2%
1895
 
2.2%
1748
 
2.0%
Other values (859) 51477
58.4%
ASCII
ValueCountFrequency (%)
) 723
 
10.7%
( 722
 
10.7%
E 262
 
3.9%
S 251
 
3.7%
e 227
 
3.4%
M 207
 
3.1%
2 202
 
3.0%
A 170
 
2.5%
i 168
 
2.5%
1 165
 
2.4%
Other values (71) 3675
54.3%
CJK
ValueCountFrequency (%)
3
20.0%
2
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Other values (2) 2
13.3%
None
ValueCountFrequency (%)
α 1
50.0%
² 1
50.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
CJK Compat Ideographs
ValueCountFrequency (%)
1
100.0%
Distinct6672
Distinct (%)66.7%
Missing2
Missing (%)< 0.1%
Memory size156.2 KiB
2024-05-04T05:49:45.463991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length39
Median length32
Mean length18.490398
Min length1

Characters and Unicode

Total characters184867
Distinct characters303
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5169 ?
Unique (%)51.7%

Sample

1st row서울특별시 강서구 공항대로41길 66
2nd row서울특별시 강남구 개포로 512
3rd row서울특별시 마포구 백범로31길 8
4th row서울특별시 강서구 곰달래로33가길 35
5th row서울특별시 강서구 공항대로41길 51
ValueCountFrequency (%)
서울특별시 9991
25.0%
강남구 1388
 
3.5%
양천구 838
 
2.1%
송파구 774
 
1.9%
서초구 731
 
1.8%
강서구 535
 
1.3%
노원구 523
 
1.3%
강동구 502
 
1.3%
마포구 496
 
1.2%
은평구 449
 
1.1%
Other values (3845) 23736
59.4%
2024-05-04T05:49:46.370930image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
29967
16.2%
12361
 
6.7%
10483
 
5.7%
10163
 
5.5%
10072
 
5.4%
10018
 
5.4%
9992
 
5.4%
9992
 
5.4%
1 6407
 
3.5%
2 4829
 
2.6%
Other values (293) 70583
38.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 120944
65.4%
Decimal Number 32973
 
17.8%
Space Separator 29967
 
16.2%
Dash Punctuation 909
 
0.5%
Other Punctuation 47
 
< 0.1%
Close Punctuation 13
 
< 0.1%
Open Punctuation 13
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12361
 
10.2%
10483
 
8.7%
10163
 
8.4%
10072
 
8.3%
10018
 
8.3%
9992
 
8.3%
9992
 
8.3%
4707
 
3.9%
2849
 
2.4%
2671
 
2.2%
Other values (275) 37636
31.1%
Decimal Number
ValueCountFrequency (%)
1 6407
19.4%
2 4829
14.6%
3 3929
11.9%
4 3127
9.5%
5 3069
9.3%
6 2716
8.2%
7 2521
 
7.6%
0 2215
 
6.7%
8 2185
 
6.6%
9 1975
 
6.0%
Other Punctuation
ValueCountFrequency (%)
? 35
74.5%
. 8
 
17.0%
, 4
 
8.5%
Space Separator
ValueCountFrequency (%)
29967
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 909
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%
Open Punctuation
ValueCountFrequency (%)
( 13
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 120944
65.4%
Common 63922
34.6%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12361
 
10.2%
10483
 
8.7%
10163
 
8.4%
10072
 
8.3%
10018
 
8.3%
9992
 
8.3%
9992
 
8.3%
4707
 
3.9%
2849
 
2.4%
2671
 
2.2%
Other values (275) 37636
31.1%
Common
ValueCountFrequency (%)
29967
46.9%
1 6407
 
10.0%
2 4829
 
7.6%
3 3929
 
6.1%
4 3127
 
4.9%
5 3069
 
4.8%
6 2716
 
4.2%
7 2521
 
3.9%
0 2215
 
3.5%
8 2185
 
3.4%
Other values (7) 2957
 
4.6%
Latin
ValueCountFrequency (%)
A 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 120944
65.4%
ASCII 63923
34.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
29967
46.9%
1 6407
 
10.0%
2 4829
 
7.6%
3 3929
 
6.1%
4 3127
 
4.9%
5 3069
 
4.8%
6 2716
 
4.2%
7 2521
 
3.9%
0 2215
 
3.5%
8 2185
 
3.4%
Other values (8) 2958
 
4.6%
Hangul
ValueCountFrequency (%)
12361
 
10.2%
10483
 
8.7%
10163
 
8.4%
10072
 
8.3%
10018
 
8.3%
9992
 
8.3%
9992
 
8.3%
4707
 
3.9%
2849
 
2.4%
2671
 
2.2%
Other values (275) 37636
31.1%
Distinct8410
Distinct (%)84.2%
Missing17
Missing (%)0.2%
Memory size156.2 KiB
2024-05-04T05:49:46.960133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length52
Mean length17.527196
Min length2

Characters and Unicode

Total characters174974
Distinct characters587
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7738 ?
Unique (%)77.5%

Sample

1st row, 407호 (등촌동, 세신등촌종합상가)
2nd row, 403호 (개포동, 개포종합상가)
3rd row, 201동534호 (공덕동, 공덕SK리더스뷰)
4th row, 1층일부 (화곡동)
5th row, 414호 (등촌동, 세신그린코아빌딩)
ValueCountFrequency (%)
7774
 
21.1%
2층 1866
 
5.1%
3층 1265
 
3.4%
일부 1113
 
3.0%
4층 691
 
1.9%
1층 663
 
1.8%
대치동 489
 
1.3%
5층 368
 
1.0%
목동 346
 
0.9%
상가동 343
 
0.9%
Other values (6436) 21905
59.5%
2024-05-04T05:49:48.053627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27010
 
15.4%
, 13566
 
7.8%
11672
 
6.7%
) 10156
 
5.8%
( 10152
 
5.8%
7053
 
4.0%
2 6546
 
3.7%
5880
 
3.4%
1 5210
 
3.0%
0 4975
 
2.8%
Other values (577) 72754
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 84208
48.1%
Decimal Number 27903
 
15.9%
Space Separator 27010
 
15.4%
Other Punctuation 13713
 
7.8%
Close Punctuation 10156
 
5.8%
Open Punctuation 10152
 
5.8%
Uppercase Letter 947
 
0.5%
Dash Punctuation 461
 
0.3%
Math Symbol 298
 
0.2%
Lowercase Letter 107
 
0.1%
Other values (3) 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11672
 
13.9%
7053
 
8.4%
5880
 
7.0%
2417
 
2.9%
2398
 
2.8%
2184
 
2.6%
2162
 
2.6%
1705
 
2.0%
1609
 
1.9%
1581
 
1.9%
Other values (507) 45547
54.1%
Uppercase Letter
ValueCountFrequency (%)
B 193
20.4%
A 179
18.9%
S 65
 
6.9%
M 60
 
6.3%
C 59
 
6.2%
K 51
 
5.4%
D 46
 
4.9%
E 40
 
4.2%
L 31
 
3.3%
I 29
 
3.1%
Other values (15) 194
20.5%
Lowercase Letter
ValueCountFrequency (%)
e 24
22.4%
s 17
15.9%
r 12
11.2%
i 10
9.3%
o 9
 
8.4%
l 7
 
6.5%
n 6
 
5.6%
a 4
 
3.7%
h 4
 
3.7%
u 3
 
2.8%
Other values (6) 11
10.3%
Decimal Number
ValueCountFrequency (%)
2 6546
23.5%
1 5210
18.7%
0 4975
17.8%
3 4198
15.0%
4 2531
 
9.1%
5 1651
 
5.9%
6 1059
 
3.8%
7 731
 
2.6%
8 556
 
2.0%
9 446
 
1.6%
Other Punctuation
ValueCountFrequency (%)
, 13566
98.9%
? 68
 
0.5%
. 34
 
0.2%
@ 30
 
0.2%
/ 12
 
0.1%
& 3
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 295
99.0%
< 1
 
0.3%
> 1
 
0.3%
+ 1
 
0.3%
Letter Number
ValueCountFrequency (%)
15
88.2%
1
 
5.9%
1
 
5.9%
Space Separator
ValueCountFrequency (%)
27010
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10156
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10152
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 461
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 89694
51.3%
Hangul 84209
48.1%
Latin 1071
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11672
 
13.9%
7053
 
8.4%
5880
 
7.0%
2417
 
2.9%
2398
 
2.8%
2184
 
2.6%
2162
 
2.6%
1705
 
2.0%
1609
 
1.9%
1581
 
1.9%
Other values (508) 45548
54.1%
Latin
ValueCountFrequency (%)
B 193
18.0%
A 179
16.7%
S 65
 
6.1%
M 60
 
5.6%
C 59
 
5.5%
K 51
 
4.8%
D 46
 
4.3%
E 40
 
3.7%
L 31
 
2.9%
I 29
 
2.7%
Other values (34) 318
29.7%
Common
ValueCountFrequency (%)
27010
30.1%
, 13566
15.1%
) 10156
 
11.3%
( 10152
 
11.3%
2 6546
 
7.3%
1 5210
 
5.8%
0 4975
 
5.5%
3 4198
 
4.7%
4 2531
 
2.8%
5 1651
 
1.8%
Other values (15) 3699
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 90748
51.9%
Hangul 84202
48.1%
Number Forms 17
 
< 0.1%
Compat Jamo 6
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
27010
29.8%
, 13566
14.9%
) 10156
 
11.2%
( 10152
 
11.2%
2 6546
 
7.2%
1 5210
 
5.7%
0 4975
 
5.5%
3 4198
 
4.6%
4 2531
 
2.8%
5 1651
 
1.8%
Other values (56) 4753
 
5.2%
Hangul
ValueCountFrequency (%)
11672
 
13.9%
7053
 
8.4%
5880
 
7.0%
2417
 
2.9%
2398
 
2.8%
2184
 
2.6%
2162
 
2.6%
1705
 
2.0%
1609
 
1.9%
1581
 
1.9%
Other values (506) 45541
54.1%
Number Forms
ValueCountFrequency (%)
15
88.2%
1
 
5.9%
1
 
5.9%
Compat Jamo
ValueCountFrequency (%)
6
100.0%
None
ValueCountFrequency (%)
1
100.0%

분야명
Categorical

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
입시.검정 및 보습
5406 
예능(대)
2599 
국제화
 
522
직업기술
 
338
기타(대)
 
308
Other values (6)
827 

Length

Max length10
Median length10
Mean length7.5383
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row입시.검정 및 보습
2nd row입시.검정 및 보습
3rd row예능(대)
4th row입시.검정 및 보습
5th row입시.검정 및 보습

Common Values

ValueCountFrequency (%)
입시.검정 및 보습 5406
54.1%
예능(대) 2599
26.0%
국제화 522
 
5.2%
직업기술 338
 
3.4%
기타(대) 308
 
3.1%
기예(대) 304
 
3.0%
독서실 216
 
2.2%
종합(대) 201
 
2.0%
인문사회(대) 96
 
1.0%
정보 9
 
0.1%

Length

2024-05-04T05:49:48.514790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
입시.검정 5406
26.0%
5406
26.0%
보습 5406
26.0%
예능(대 2599
12.5%
국제화 522
 
2.5%
직업기술 338
 
1.6%
기타(대 308
 
1.5%
기예(대 304
 
1.5%
독서실 216
 
1.0%
종합(대 201
 
1.0%
Other values (3) 106
 
0.5%

교습계열명
Categorical

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
보통교과
4905 
예능(중)
2461 
<NA>
940 
외국어
 
411
기타(중)
 
286
Other values (15)
997 

Length

Max length7
Median length4
Mean length4.2731
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row보통교과
2nd row보통교과
3rd row예능(중)
4th row보통교과
5th row보통교과

Common Values

ValueCountFrequency (%)
보통교과 4905
49.0%
예능(중) 2461
24.6%
<NA> 940
 
9.4%
외국어 411
 
4.1%
기타(중) 286
 
2.9%
기예(중) 270
 
2.7%
독서 213
 
2.1%
산업응용기술 127
 
1.3%
인문사회(중) 86
 
0.9%
국제 78
 
0.8%
Other values (10) 223
 
2.2%

Length

2024-05-04T05:49:48.947371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보통교과 4905
49.0%
예능(중 2461
24.6%
na 940
 
9.4%
외국어 411
 
4.1%
기타(중 286
 
2.9%
기예(중 270
 
2.7%
독서 213
 
2.1%
산업응용기술 127
 
1.3%
인문사회(중 86
 
0.9%
국제 78
 
0.8%
Other values (10) 223
 
2.2%

교습과정목록명
Text

MISSING 

Distinct1603
Distinct (%)22.7%
Missing2937
Missing (%)29.4%
Memory size156.2 KiB
2024-05-04T05:49:49.571461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length39
Mean length6.1990656
Min length2

Characters and Unicode

Total characters43784
Distinct characters336
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1274 ?
Unique (%)18.0%

Sample

1st row초등1
2nd row보습
3rd row영어A(초등)
4th row보습?논술,
5th row음악,
ValueCountFrequency (%)
보습 1691
23.1%
보습?논술 631
 
8.6%
음악 404
 
5.5%
미술 306
 
4.2%
실용외국어(유아/초?중?고 254
 
3.5%
초등수학 139
 
1.9%
독서실(유아/초?중?고 118
 
1.6%
무용 88
 
1.2%
피아노 81
 
1.1%
초등영어 74
 
1.0%
Other values (1519) 3535
48.3%
2024-05-04T05:49:50.692106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
, 3661
 
8.4%
2543
 
5.8%
2472
 
5.6%
2034
 
4.6%
( 1905
 
4.4%
) 1903
 
4.3%
? 1602
 
3.7%
1512
 
3.5%
1309
 
3.0%
931
 
2.1%
Other values (326) 23912
54.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29934
68.4%
Other Punctuation 6288
 
14.4%
Decimal Number 2852
 
6.5%
Open Punctuation 1906
 
4.4%
Close Punctuation 1904
 
4.3%
Uppercase Letter 422
 
1.0%
Space Separator 259
 
0.6%
Lowercase Letter 139
 
0.3%
Math Symbol 32
 
0.1%
Dash Punctuation 22
 
0.1%
Other values (3) 26
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2543
 
8.5%
2472
 
8.3%
2034
 
6.8%
1512
 
5.1%
1309
 
4.4%
931
 
3.1%
849
 
2.8%
811
 
2.7%
783
 
2.6%
762
 
2.5%
Other values (259) 15928
53.2%
Lowercase Letter
ValueCountFrequency (%)
e 24
17.3%
i 20
14.4%
t 14
10.1%
h 13
9.4%
a 13
9.4%
r 10
7.2%
n 9
 
6.5%
s 6
 
4.3%
o 5
 
3.6%
c 4
 
2.9%
Other values (9) 21
15.1%
Uppercase Letter
ValueCountFrequency (%)
A 366
86.7%
B 10
 
2.4%
P 9
 
2.1%
W 8
 
1.9%
S 5
 
1.2%
C 4
 
0.9%
E 3
 
0.7%
K 3
 
0.7%
L 3
 
0.7%
G 2
 
0.5%
Other values (8) 9
 
2.1%
Decimal Number
ValueCountFrequency (%)
1 880
30.9%
0 443
15.5%
2 398
14.0%
5 327
 
11.5%
4 313
 
11.0%
6 188
 
6.6%
3 161
 
5.6%
9 70
 
2.5%
8 42
 
1.5%
7 30
 
1.1%
Other Punctuation
ValueCountFrequency (%)
, 3661
58.2%
? 1602
25.5%
/ 442
 
7.0%
* 384
 
6.1%
. 198
 
3.1%
: 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 27
84.4%
+ 3
 
9.4%
> 1
 
3.1%
< 1
 
3.1%
Open Punctuation
ValueCountFrequency (%)
( 1905
99.9%
[ 1
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 1903
99.9%
] 1
 
0.1%
Letter Number
ValueCountFrequency (%)
12
92.3%
1
 
7.7%
Space Separator
ValueCountFrequency (%)
259
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 10
100.0%
Other Number
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29934
68.4%
Common 13276
30.3%
Latin 574
 
1.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2543
 
8.5%
2472
 
8.3%
2034
 
6.8%
1512
 
5.1%
1309
 
4.4%
931
 
3.1%
849
 
2.8%
811
 
2.7%
783
 
2.6%
762
 
2.5%
Other values (259) 15928
53.2%
Latin
ValueCountFrequency (%)
A 366
63.8%
e 24
 
4.2%
i 20
 
3.5%
t 14
 
2.4%
h 13
 
2.3%
a 13
 
2.3%
12
 
2.1%
B 10
 
1.7%
r 10
 
1.7%
n 9
 
1.6%
Other values (29) 83
 
14.5%
Common
ValueCountFrequency (%)
, 3661
27.6%
( 1905
14.3%
) 1903
14.3%
? 1602
12.1%
1 880
 
6.6%
0 443
 
3.3%
/ 442
 
3.3%
2 398
 
3.0%
* 384
 
2.9%
5 327
 
2.5%
Other values (18) 1331
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29933
68.4%
ASCII 13834
31.6%
Number Forms 13
 
< 0.1%
Enclosed Alphanum 3
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
, 3661
26.5%
( 1905
13.8%
) 1903
13.8%
? 1602
11.6%
1 880
 
6.4%
0 443
 
3.2%
/ 442
 
3.2%
2 398
 
2.9%
* 384
 
2.8%
A 366
 
2.6%
Other values (54) 1850
13.4%
Hangul
ValueCountFrequency (%)
2543
 
8.5%
2472
 
8.3%
2034
 
6.8%
1512
 
5.1%
1309
 
4.4%
931
 
3.1%
849
 
2.8%
811
 
2.7%
783
 
2.6%
762
 
2.5%
Other values (258) 15927
53.2%
Number Forms
ValueCountFrequency (%)
12
92.3%
1
 
7.7%
Enclosed Alphanum
ValueCountFrequency (%)
3
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

교습과정명
Text

MISSING 

Distinct96
Distinct (%)1.1%
Missing939
Missing (%)9.4%
Memory size156.2 KiB
2024-05-04T05:49:51.081689image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length2
Mean length3.590884
Min length2

Characters and Unicode

Total characters32537
Distinct characters157
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique20 ?
Unique (%)0.2%

Sample

1st row보습
2nd row보습
3rd row음악
4th row보습
5th row보습
ValueCountFrequency (%)
보습 3872
42.7%
음악 1373
 
15.2%
보습?논술 999
 
11.0%
미술 973
 
10.7%
실용외국어(유아/초?중?고 398
 
4.4%
독서실(유아/초?중?고 148
 
1.6%
무용 136
 
1.5%
실용음악(성악 81
 
0.9%
기타(소 81
 
0.9%
독서실(일반인 59
 
0.7%
Other values (86) 941
 
10.4%
2024-05-04T05:49:51.819621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4934
15.2%
4871
15.0%
? 2152
 
6.6%
2000
 
6.1%
1536
 
4.7%
1501
 
4.6%
) 1044
 
3.2%
( 1044
 
3.2%
1038
 
3.2%
1005
 
3.1%
Other values (147) 11412
35.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 27469
84.4%
Other Punctuation 2980
 
9.2%
Close Punctuation 1044
 
3.2%
Open Punctuation 1044
 
3.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4934
18.0%
4871
17.7%
2000
 
7.3%
1536
 
5.6%
1501
 
5.5%
1038
 
3.8%
1005
 
3.7%
797
 
2.9%
692
 
2.5%
598
 
2.2%
Other values (142) 8497
30.9%
Other Punctuation
ValueCountFrequency (%)
? 2152
72.2%
/ 546
 
18.3%
, 282
 
9.5%
Close Punctuation
ValueCountFrequency (%)
) 1044
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1044
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 27469
84.4%
Common 5068
 
15.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4934
18.0%
4871
17.7%
2000
 
7.3%
1536
 
5.6%
1501
 
5.5%
1038
 
3.8%
1005
 
3.7%
797
 
2.9%
692
 
2.5%
598
 
2.2%
Other values (142) 8497
30.9%
Common
ValueCountFrequency (%)
? 2152
42.5%
) 1044
20.6%
( 1044
20.6%
/ 546
 
10.8%
, 282
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 27469
84.4%
ASCII 5068
 
15.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
4934
18.0%
4871
17.7%
2000
 
7.3%
1536
 
5.6%
1501
 
5.5%
1038
 
3.8%
1005
 
3.7%
797
 
2.9%
692
 
2.5%
598
 
2.2%
Other values (142) 8497
30.9%
ASCII
ValueCountFrequency (%)
? 2152
42.5%
) 1044
20.6%
( 1044
20.6%
/ 546
 
10.8%
, 282
 
5.6%

정원합계
Real number (ℝ)

SKEWED  ZEROS 

Distinct848
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1526.9666
Minimum0
Maximum5525439
Zeros674
Zeros (%)6.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T05:49:52.212708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q118
median48
Q3140
95-th percentile720
Maximum5525439
Range5525439
Interquartile range (IQR)122

Descriptive statistics

Standard deviation70587.54
Coefficient of variation (CV)46.227298
Kurtosis4814.9443
Mean1526.9666
Median Absolute Deviation (MAD)38
Skewness67.578903
Sum15269666
Variance4.9826008 × 109
MonotonicityNot monotonic
2024-05-04T05:49:52.629245image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 674
 
6.7%
20 315
 
3.1%
12 296
 
3.0%
30 285
 
2.9%
18 254
 
2.5%
24 243
 
2.4%
15 236
 
2.4%
60 215
 
2.1%
40 212
 
2.1%
10 185
 
1.8%
Other values (838) 7085
70.9%
ValueCountFrequency (%)
0 674
6.7%
1 15
 
0.1%
2 31
 
0.3%
3 61
 
0.6%
4 97
 
1.0%
5 86
 
0.9%
6 156
 
1.6%
7 49
 
0.5%
8 131
 
1.3%
9 182
 
1.8%
ValueCountFrequency (%)
5525439 1
< 0.1%
3999996 1
< 0.1%
1711000 1
< 0.1%
408408 1
< 0.1%
400045 1
< 0.1%
99990 1
< 0.1%
69993 1
< 0.1%
68800 1
< 0.1%
65292 1
< 0.1%
63592 1
< 0.1%

일시수용능력인원합계
Real number (ℝ)

SKEWED  ZEROS 

Distinct406
Distinct (%)4.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean71.5982
Minimum0
Maximum9999
Zeros238
Zeros (%)2.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T05:49:53.102405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q16
median45
Q379
95-th percentile191
Maximum9999
Range9999
Interquartile range (IQR)73

Descriptive statistics

Standard deviation331.08029
Coefficient of variation (CV)4.6241426
Kurtosis807.16299
Mean71.5982
Median Absolute Deviation (MAD)38
Skewness27.217348
Sum715982
Variance109614.16
MonotonicityNot monotonic
2024-05-04T05:49:53.372126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9 873
 
8.7%
5 763
 
7.6%
4 573
 
5.7%
6 562
 
5.6%
7 428
 
4.3%
3 333
 
3.3%
8 287
 
2.9%
70 268
 
2.7%
0 238
 
2.4%
50 213
 
2.1%
Other values (396) 5462
54.6%
ValueCountFrequency (%)
0 238
 
2.4%
1 28
 
0.3%
2 100
 
1.0%
3 333
 
3.3%
4 573
5.7%
5 763
7.6%
6 562
5.6%
7 428
4.3%
8 287
 
2.9%
9 873
8.7%
ValueCountFrequency (%)
9999 10
0.1%
3029 1
 
< 0.1%
1701 1
 
< 0.1%
1679 1
 
< 0.1%
1600 2
 
< 0.1%
1500 2
 
< 0.1%
1373 1
 
< 0.1%
1261 1
 
< 0.1%
1223 1
 
< 0.1%
1208 1
 
< 0.1%

인당수강료내용
Text

MISSING 

Distinct2151
Distinct (%)98.4%
Missing7813
Missing (%)78.1%
Memory size156.2 KiB
2024-05-04T05:49:54.002967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length900
Median length313
Mean length99.219936
Min length4

Characters and Unicode

Total characters216994
Distinct characters452
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2122 ?
Unique (%)97.0%

Sample

1st row초등1:170000, 초등2:190000, 중등1:210000, 중등2:250000, 고등1:300000, 고등2:300000, 고등3:350000, 고등4:400000, 고등5:450000, 고등6:600000, 중등3:220000, 고등7:500000, 초등3:260000, 초등4:140000, 중등4:290000, 중등5:260000
2nd row초등 미술:70000, 초등 미술:160000, 초등 미술:200000
3rd row초급미술:100000, 중급미술:100000, 고급미술:100000
4th row초등미술:50000, 초등미술:70000
5th row바이엘:120000, 체르니100:130000, 체르니30:140000, 체르니40:150000, 듀오연주:170000
ValueCountFrequency (%)
피아노 181
 
1.3%
초등 97
 
0.7%
중등 59
 
0.4%
바이올린 39
 
0.3%
미술 37
 
0.3%
고등 33
 
0.2%
초등수학:200000 29
 
0.2%
중등수학:250000 28
 
0.2%
고급 27
 
0.2%
영어 24
 
0.2%
Other values (10814) 13740
96.1%
2024-05-04T05:49:54.907468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 54195
25.0%
: 12877
 
5.9%
12161
 
5.6%
, 12063
 
5.6%
1 10139
 
4.7%
2 8294
 
3.8%
6615
 
3.0%
( 5659
 
2.6%
) 5635
 
2.6%
3 4843
 
2.2%
Other values (442) 84513
38.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 94144
43.4%
Other Letter 65677
30.3%
Other Punctuation 29121
 
13.4%
Space Separator 12161
 
5.6%
Open Punctuation 5669
 
2.6%
Close Punctuation 5645
 
2.6%
Uppercase Letter 2885
 
1.3%
Lowercase Letter 966
 
0.4%
Dash Punctuation 339
 
0.2%
Math Symbol 149
 
0.1%
Other values (3) 238
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6615
 
10.1%
4343
 
6.6%
4267
 
6.5%
3885
 
5.9%
3278
 
5.0%
3020
 
4.6%
2716
 
4.1%
2657
 
4.0%
2528
 
3.8%
2410
 
3.7%
Other values (356) 29958
45.6%
Uppercase Letter
ValueCountFrequency (%)
A 872
30.2%
B 804
27.9%
C 439
15.2%
D 210
 
7.3%
E 109
 
3.8%
F 69
 
2.4%
G 52
 
1.8%
I 43
 
1.5%
H 42
 
1.5%
S 41
 
1.4%
Other values (14) 204
 
7.1%
Lowercase Letter
ValueCountFrequency (%)
e 162
16.8%
a 77
 
8.0%
i 76
 
7.9%
n 70
 
7.2%
r 66
 
6.8%
s 62
 
6.4%
l 55
 
5.7%
o 53
 
5.5%
t 53
 
5.5%
c 37
 
3.8%
Other values (14) 255
26.4%
Decimal Number
ValueCountFrequency (%)
0 54195
57.6%
1 10139
 
10.8%
2 8294
 
8.8%
3 4843
 
5.1%
5 4809
 
5.1%
4 4665
 
5.0%
6 2326
 
2.5%
8 1952
 
2.1%
7 1596
 
1.7%
9 1325
 
1.4%
Other Punctuation
ValueCountFrequency (%)
: 12877
44.2%
, 12063
41.4%
* 2711
 
9.3%
. 1381
 
4.7%
/ 66
 
0.2%
? 14
 
< 0.1%
# 6
 
< 0.1%
& 3
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
~ 93
62.4%
+ 48
32.2%
> 4
 
2.7%
< 4
 
2.7%
Letter Number
ValueCountFrequency (%)
35
41.2%
35
41.2%
10
 
11.8%
5
 
5.9%
Open Punctuation
ValueCountFrequency (%)
( 5659
99.8%
[ 8
 
0.1%
{ 2
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 5635
99.8%
] 8
 
0.1%
} 2
 
< 0.1%
Other Number
ValueCountFrequency (%)
15
36.6%
15
36.6%
11
26.8%
Space Separator
ValueCountFrequency (%)
12161
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 339
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 112
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 147381
67.9%
Hangul 65677
30.3%
Latin 3936
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6615
 
10.1%
4343
 
6.6%
4267
 
6.5%
3885
 
5.9%
3278
 
5.0%
3020
 
4.6%
2716
 
4.1%
2657
 
4.0%
2528
 
3.8%
2410
 
3.7%
Other values (356) 29958
45.6%
Latin
ValueCountFrequency (%)
A 872
22.2%
B 804
20.4%
C 439
11.2%
D 210
 
5.3%
e 162
 
4.1%
E 109
 
2.8%
a 77
 
2.0%
i 76
 
1.9%
n 70
 
1.8%
F 69
 
1.8%
Other values (42) 1048
26.6%
Common
ValueCountFrequency (%)
0 54195
36.8%
: 12877
 
8.7%
12161
 
8.3%
, 12063
 
8.2%
1 10139
 
6.9%
2 8294
 
5.6%
( 5659
 
3.8%
) 5635
 
3.8%
3 4843
 
3.3%
5 4809
 
3.3%
Other values (24) 16706
 
11.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 151191
69.7%
Hangul 65662
30.3%
Number Forms 85
 
< 0.1%
Enclosed Alphanum 41
 
< 0.1%
Compat Jamo 15
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 54195
35.8%
: 12877
 
8.5%
12161
 
8.0%
, 12063
 
8.0%
1 10139
 
6.7%
2 8294
 
5.5%
( 5659
 
3.7%
) 5635
 
3.7%
3 4843
 
3.2%
5 4809
 
3.2%
Other values (69) 20516
 
13.6%
Hangul
ValueCountFrequency (%)
6615
 
10.1%
4343
 
6.6%
4267
 
6.5%
3885
 
5.9%
3278
 
5.0%
3020
 
4.6%
2716
 
4.1%
2657
 
4.0%
2528
 
3.9%
2410
 
3.7%
Other values (349) 29943
45.6%
Number Forms
ValueCountFrequency (%)
35
41.2%
35
41.2%
10
 
11.8%
5
 
5.9%
Enclosed Alphanum
ValueCountFrequency (%)
15
36.6%
15
36.6%
11
26.8%
Compat Jamo
ValueCountFrequency (%)
3
20.0%
3
20.0%
3
20.0%
2
13.3%
2
13.3%
1
 
6.7%
1
 
6.7%

수강료공개여부
Boolean

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
True
9041 
False
959 
ValueCountFrequency (%)
True 9041
90.4%
False 959
 
9.6%
2024-05-04T05:49:55.244659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

기숙사학원여부
Boolean

IMBALANCE  MISSING 

Distinct2
Distinct (%)< 0.1%
Missing513
Missing (%)5.1%
Memory size97.7 KiB
False
9466 
True
 
21
(Missing)
 
513
ValueCountFrequency (%)
False 9466
94.7%
True 21
 
0.2%
(Missing) 513
 
5.1%
2024-05-04T05:49:55.495566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

도로명우편번호
Real number (ℝ)

Distinct3671
Distinct (%)36.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45231.023
Minimum0
Maximum158885
Zeros8
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T05:49:55.697235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1762.95
Q14745
median6674.5
Q3132786
95-th percentile156760.05
Maximum158885
Range158885
Interquartile range (IQR)128041

Descriptive statistics

Standard deviation62025.488
Coefficient of variation (CV)1.3713041
Kurtosis-1.0772661
Mean45231.023
Median Absolute Deviation (MAD)2605.5
Skewness0.93439211
Sum4.5231023 × 108
Variance3.8471611 × 109
MonotonicityNot monotonic
2024-05-04T05:49:55.971992image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7983 104
 
1.0%
6593 58
 
0.6%
5849 39
 
0.4%
6279 35
 
0.4%
6202 32
 
0.3%
6512 31
 
0.3%
5269 30
 
0.3%
134825 29
 
0.3%
139860 29
 
0.3%
135500 29
 
0.3%
Other values (3661) 9584
95.8%
ValueCountFrequency (%)
0 8
0.1%
1021 1
 
< 0.1%
1031 3
 
< 0.1%
1033 1
 
< 0.1%
1041 3
 
< 0.1%
1042 2
 
< 0.1%
1043 2
 
< 0.1%
1049 1
 
< 0.1%
1051 1
 
< 0.1%
1054 1
 
< 0.1%
ValueCountFrequency (%)
158885 14
0.1%
158884 6
0.1%
158879 2
 
< 0.1%
158878 1
 
< 0.1%
158877 14
0.1%
158876 3
 
< 0.1%
158875 1
 
< 0.1%
158872 1
 
< 0.1%
158865 2
 
< 0.1%
158863 3
 
< 0.1%

등록상태명
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
개원
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개원
2nd row개원
3rd row개원
4th row개원
5th row개원

Common Values

ValueCountFrequency (%)
개원 10000
100.0%

Length

2024-05-04T05:49:56.216372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-05-04T05:49:56.379097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개원 10000
100.0%

등록일자
Real number (ℝ)

Distinct4792
Distinct (%)47.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20139925
Minimum19560401
Maximum20240426
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T05:49:56.687140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19560401
5-th percentile19970619
Q120091203
median20160817
Q320210209
95-th percentile20231025
Maximum20240426
Range680025
Interquartile range (IQR)119006.25

Descriptive statistics

Standard deviation86127.353
Coefficient of variation (CV)0.0042764486
Kurtosis2.9868746
Mean20139925
Median Absolute Deviation (MAD)50299
Skewness-1.4641968
Sum2.0139925 × 1011
Variance7.4179209 × 109
MonotonicityNot monotonic
2024-05-04T05:49:57.139308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20230313 10
 
0.1%
20231012 9
 
0.1%
20221229 9
 
0.1%
20220711 9
 
0.1%
20220905 9
 
0.1%
20230224 9
 
0.1%
20230713 9
 
0.1%
20240222 9
 
0.1%
20230704 9
 
0.1%
20230322 9
 
0.1%
Other values (4782) 9909
99.1%
ValueCountFrequency (%)
19560401 1
< 0.1%
19561209 1
< 0.1%
19620127 1
< 0.1%
19620213 1
< 0.1%
19630427 1
< 0.1%
19631010 1
< 0.1%
19650416 1
< 0.1%
19650423 1
< 0.1%
19680210 1
< 0.1%
19680628 1
< 0.1%
ValueCountFrequency (%)
20240426 1
 
< 0.1%
20240424 4
< 0.1%
20240423 3
< 0.1%
20240422 6
0.1%
20240419 1
 
< 0.1%
20240418 1
 
< 0.1%
20240417 3
< 0.1%
20240416 6
0.1%
20240415 2
 
< 0.1%
20240412 2
 
< 0.1%

휴원시작일자
Real number (ℝ)

MISSING 

Distinct124
Distinct (%)81.0%
Missing9847
Missing (%)98.5%
Infinite0
Infinite (%)0.0%
Mean20182926
Minimum20101021
Maximum20231130
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T05:49:57.574396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20101021
5-th percentile20111214
Q120160418
median20200201
Q320201201
95-th percentile20230701
Maximum20231130
Range130109
Interquartile range (IQR)40783

Descriptive statistics

Standard deviation35682.392
Coefficient of variation (CV)0.0017679494
Kurtosis-0.61313259
Mean20182926
Median Absolute Deviation (MAD)20725
Skewness-0.6373904
Sum3.0879876 × 109
Variance1.2732331 × 109
MonotonicityNot monotonic
2024-05-04T05:49:58.040209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20200224 16
 
0.2%
20200225 6
 
0.1%
20230120 2
 
< 0.1%
20230701 2
 
< 0.1%
20200201 2
 
< 0.1%
20230818 2
 
< 0.1%
20200226 2
 
< 0.1%
20200302 2
 
< 0.1%
20180820 2
 
< 0.1%
20180921 2
 
< 0.1%
Other values (114) 115
 
1.1%
(Missing) 9847
98.5%
ValueCountFrequency (%)
20101021 1
< 0.1%
20101129 1
< 0.1%
20110201 1
< 0.1%
20110418 1
< 0.1%
20110627 1
< 0.1%
20110704 1
< 0.1%
20111209 1
< 0.1%
20111210 1
< 0.1%
20111217 1
< 0.1%
20120101 1
< 0.1%
ValueCountFrequency (%)
20231130 1
< 0.1%
20230818 2
< 0.1%
20230812 1
< 0.1%
20230809 1
< 0.1%
20230731 1
< 0.1%
20230717 1
< 0.1%
20230701 2
< 0.1%
20230630 1
< 0.1%
20230426 1
< 0.1%
20230313 1
< 0.1%

휴원종료일자
Real number (ℝ)

MISSING 

Distinct135
Distinct (%)1.5%
Missing782
Missing (%)7.8%
Infinite0
Infinite (%)0.0%
Mean98666619
Minimum20101028
Maximum99991231
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T05:49:58.473772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20101028
5-th percentile99991231
Q199991231
median99991231
Q399991231
95-th percentile99991231
Maximum99991231
Range79890203
Interquartile range (IQR)0

Descriptive statistics

Standard deviation10196488
Coefficient of variation (CV)0.10334284
Kurtosis55.295932
Mean98666619
Median Absolute Deviation (MAD)0
Skewness-7.5686143
Sum9.0950889 × 1011
Variance1.0396837 × 1014
MonotonicityNot monotonic
2024-05-04T05:49:58.892754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99991231 9065
90.6%
20200405 9
 
0.1%
20200430 3
 
< 0.1%
20200419 3
 
< 0.1%
20230714 2
 
< 0.1%
20230831 2
 
< 0.1%
20160930 2
 
< 0.1%
20200414 2
 
< 0.1%
20200417 2
 
< 0.1%
20200420 2
 
< 0.1%
Other values (125) 126
 
1.3%
(Missing) 782
 
7.8%
ValueCountFrequency (%)
20101028 1
< 0.1%
20110214 1
< 0.1%
20110424 1
< 0.1%
20110828 1
< 0.1%
20111215 1
< 0.1%
20111223 2
< 0.1%
20111231 1
< 0.1%
20120122 1
< 0.1%
20120228 1
< 0.1%
20120506 1
< 0.1%
ValueCountFrequency (%)
99991231 9065
90.6%
20241129 1
 
< 0.1%
20240811 1
 
< 0.1%
20240312 1
 
< 0.1%
20240112 1
 
< 0.1%
20231124 1
 
< 0.1%
20230930 1
 
< 0.1%
20230831 2
 
< 0.1%
20230830 1
 
< 0.1%
20230815 1
 
< 0.1%

개설일자
Real number (ℝ)

Distinct4783
Distinct (%)47.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20140419
Minimum19560401
Maximum20240426
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T05:49:59.276176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum19560401
5-th percentile19970898
Q120091222
median20160828
Q320210216
95-th percentile20231026
Maximum20240426
Range680025
Interquartile range (IQR)118994

Descriptive statistics

Standard deviation85689.365
Coefficient of variation (CV)0.0042545969
Kurtosis3.0314896
Mean20140419
Median Absolute Deviation (MAD)50288.5
Skewness-1.470887
Sum2.0140419 × 1011
Variance7.3426673 × 109
MonotonicityNot monotonic
2024-05-04T05:49:59.729808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
20230322 11
 
0.1%
20221229 9
 
0.1%
20230313 9
 
0.1%
20220711 9
 
0.1%
20240408 9
 
0.1%
20210607 9
 
0.1%
20230714 9
 
0.1%
20220905 9
 
0.1%
20230224 9
 
0.1%
20240105 9
 
0.1%
Other values (4773) 9908
99.1%
ValueCountFrequency (%)
19560401 1
< 0.1%
19561209 1
< 0.1%
19620127 1
< 0.1%
19620213 1
< 0.1%
19630427 1
< 0.1%
19631010 1
< 0.1%
19650416 1
< 0.1%
19650423 1
< 0.1%
19680210 1
< 0.1%
19680628 1
< 0.1%
ValueCountFrequency (%)
20240426 1
 
< 0.1%
20240424 4
< 0.1%
20240423 3
< 0.1%
20240422 6
0.1%
20240419 1
 
< 0.1%
20240418 1
 
< 0.1%
20240417 3
< 0.1%
20240416 5
0.1%
20240415 2
 
< 0.1%
20240412 3
< 0.1%

적재일시
Real number (ℝ)

Distinct19
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20232992
Minimum20231018
Maximum20240428
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-05-04T05:50:00.111115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20231018
5-th percentile20231018
Q120231018
median20231018
Q320231206
95-th percentile20240407
Maximum20240428
Range9410
Interquartile range (IQR)188

Descriptive statistics

Standard deviation3783.0517
Coefficient of variation (CV)0.0001869744
Kurtosis-0.0031396546
Mean20232992
Median Absolute Deviation (MAD)0
Skewness1.4125162
Sum2.0232992 × 1011
Variance14311480
MonotonicityNot monotonic
2024-05-04T05:50:00.360464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=19)
ValueCountFrequency (%)
20231018 6894
68.9%
20240317 459
 
4.6%
20240225 442
 
4.4%
20240428 418
 
4.2%
20240128 376
 
3.8%
20231029 220
 
2.2%
20240331 154
 
1.5%
20240407 134
 
1.3%
20240324 133
 
1.3%
20231206 106
 
1.1%
Other values (9) 664
 
6.6%
ValueCountFrequency (%)
20231018 6894
68.9%
20231023 28
 
0.3%
20231029 220
 
2.2%
20231105 91
 
0.9%
20231113 74
 
0.7%
20231119 82
 
0.8%
20231126 77
 
0.8%
20231206 106
 
1.1%
20231210 49
 
0.5%
20231217 95
 
0.9%
ValueCountFrequency (%)
20240428 418
4.2%
20240407 134
 
1.3%
20240331 154
 
1.5%
20240324 133
 
1.3%
20240317 459
4.6%
20240225 442
4.4%
20240128 376
3.8%
20231231 93
 
0.9%
20231224 75
 
0.8%
20231217 95
 
0.9%

Sample

행정구역명학원/교습소학원지정번호학원명도로명주소도로명상세주소분야명교습계열명교습과정목록명교습과정명정원합계일시수용능력인원합계인당수강료내용수강료공개여부기숙사학원여부도로명우편번호등록상태명등록일자휴원시작일자휴원종료일자개설일자적재일시
15101강서구교습소3000033849가우스엠(M)수학교습소서울특별시 강서구 공항대로41길 66, 407호 (등촌동, 세신등촌종합상가)입시.검정 및 보습보통교과초등1보습847초등1:170000, 초등2:190000, 중등1:210000, 중등2:250000, 고등1:300000, 고등2:300000, 고등3:350000, 고등4:400000, 고등5:450000, 고등6:600000, 중등3:220000, 고등7:500000, 초등3:260000, 초등4:140000, 중등4:290000, 중등5:260000YN7587개원20190306<NA>999912312019030620231231
2914강남구학원18846개포닥터정이클래스학원서울특별시 강남구 개포로 512, 403호 (개포동, 개포종합상가)입시.검정 및 보습보통교과보습보습10072<NA>YN6329개원20071120<NA>999912312007112020240128
22361마포구교습소3000043635아우름첼로아카데미첼로교습소서울특별시 마포구 백범로31길 8, 201동534호 (공덕동, 공덕SK리더스뷰)예능(대)예능(중)<NA>음악153<NA>YN4147개원20230322<NA>999912312023032220231018
15918강서구교습소3000035135뮤엠영어신정교습소서울특별시 강서구 곰달래로33가길 35, 1층일부 (화곡동)입시.검정 및 보습보통교과영어A(초등)보습366<NA>YN7750개원20190902<NA>999912312019090220231018
18603강서구교습소3000039061리드나인주니어영어교습소서울특별시 강서구 공항대로41길 51, 414호 (등촌동, 세신그린코아빌딩)입시.검정 및 보습보통교과<NA>보습244<NA>YN7586개원20210521<NA>999912312021052120231018
14543서초구학원3000032895입시전문코벤트학원서울특별시 서초구 방배로 121, 3층전체 (방배동, 미광빌딩)입시.검정 및 보습보통교과보습?논술,보습?논술14672<NA>YN6682개원20181024<NA>999912312018102420231018
887강서구학원1000034899벨라뮤직아카데미음악학원서울특별시 강서구 양천로14길 742층 (방화동)예능(대)예능(중)음악,음악15042<NA>YN157850개원20020608<NA>999912312002060820231018
21827강서구학원3000043023스마트해법영어학원서울특별시 강서구 강서로47길 169, 401호 (내발산동, 웨스트엔드아트센터)입시.검정 및 보습보통교과<NA>보습6060<NA>YN7635개원20221209<NA>999912312022120920231018
22983강남구학원3000044313골든블랑아카데미학원서울특별시 서초구 강남대로 279,7층 일부(서초동)직업기술<NA><NA><NA>060<NA>YN6729개원20230725<NA><NA>2023072420231224
860양천구학원1000034660늘채움학원서울특별시 양천구 목동남로4길 6-23목동2차우성아파트상가씨동 302,303호(신정동)입시.검정 및 보습보통교과보습,보습6685<NA>YN158776개원2001042620131226201412312001042620231018
행정구역명학원/교습소학원지정번호학원명도로명주소도로명상세주소분야명교습계열명교습과정목록명교습과정명정원합계일시수용능력인원합계인당수강료내용수강료공개여부기숙사학원여부도로명우편번호등록상태명등록일자휴원시작일자휴원종료일자개설일자적재일시
2701송파구학원17866M스칼라영어보습학원서울특별시 송파구 토성로 36, 3층 (풍납동)입시.검정 및 보습보통교과보습보습4050<NA>YN138874개원20070716<NA>999912312007071620231126
10681강북구교습소3000024859윤선생우리집앞화계영어교습소서울특별시 강북구 솔매로50길 78, 2층 (미아동)입시.검정 및 보습보통교과영어초등(1~4학년)보습549영어초등(1~4학년):180000, 영어초등(5~6학년):200000, 영어초등:140000, 영어중등(1~3학년):230000, 방학특강(초등):120000YN1158개원20151211<NA>999912312015121120240225
6089강남구학원29932청담아카데미학원서울특별시 강남구 학동로82길 21?3층(삼성동)기예(대)기예(중)실용음악(성악),실용음악(성악)911<NA>NN135870개원20100929<NA>999912312010092920231018
7827동대문구교습소3000016464윤선생크레시티영어교습소서울특별시 동대문구 답십리로 76, 2층 (전농동)입시.검정 및 보습보통교과초등영어A보습1049초등영어A:190000, 초등영어B:192000, 초등영어C:203000, 초등영어D:210000, 초등영어E:212000, 초등영어F:223000, 중등영어G:250000, 중등영어H:252000, 중등영어I:263000, 문법특강:138000, 배가학습:70000, 고등영어:230000NN130859개원20130222<NA>999912312013022220240317
20631서대문구교습소3000041606피아노힐(PianoHill)피아노교습소서울특별시 서대문구 독립문로8길 40, 1층 (천연동)예능(대)예능(중)<NA>음악93<NA>YN3744개원20220609<NA>999912312022061020231018
19792양천구교습소3000040590딘스잉글리쉬영어교습소서울특별시 양천구 목동동로 385412호(목동,벽산미라지타워)국제화외국어초등영어A실용외국어(유아/초?중?고)285초등영어A:290000, 초등영어B:310000, 초등영어C:330000, 초등영어D:450000, 초등영어G:80000, 초등영어E:200000, 초등영어F:110000YN7983개원20220107<NA>999912312022010720240225
11812은평구교습소3000027500조이바이올린교습소서울특별시 은평구 불광로 122-10, 상가동 지3층 110-1호 (불광동, 북한산현대힐스테이트3차아파트)예능(대)예능(중)초급바이올린음악45초급바이올린:150000, 중급바이올린:170000, 고급바이올린:190000, 입시바이올린:250000YN3362개원20161117<NA>999912312016111720231018
18841동작구학원3000039393메가군무원학원서울특별시 동작구 노량진로 140, 701호 일부, 702호 일부 (노량진동, 메가스터디타워)인문사회(대)인문사회(중)성인고시성인고시4002174<NA>YN6922개원20210714<NA>999912312021071420240317
2769영등포구학원18179대명학원서울특별시 영등포구 대림로 1492층(대림동, 후지빌딩)입시.검정 및 보습보통교과<NA>보습078<NA>YN7417개원20070810<NA>999912312007081020231206
10186서초구교습소3000023628러빈아트(Lovin' art)미술교습소서울특별시 서초구 명달로4길 32, 401호 (서초동)예능(대)예능(중)<NA>미술369<NA>YN137868개원20150701<NA>999912312015070120231018