Overview

Dataset statistics

Number of variables22
Number of observations10000
Missing cells4137
Missing cells (%)1.9%
Duplicate rows41
Duplicate rows (%)0.4%
Total size in memory1.8 MiB
Average record size in memory192.0 B

Variable types

Categorical6
Text7
Numeric9

Dataset

Description매분기 업데이트 되는 경기도교육청 관할 학원 정보이며, 해당 정보 관련하여 자세한 문의는 관할 교육지원청에서 담당하고 있음
Author경기도교육청
URLhttps://www.data.go.kr/data/3044325/fileData.do

Alerts

○ 문의사항 : 수원교육지원청 평생교육건강과로 문의 바랍니다. has constant value ""Constant
Dataset has 41 (0.4%) duplicate rowsDuplicates
Unnamed: 17 is highly imbalanced (96.9%)Imbalance
Unnamed: 6 has 3656 (36.6%) missing valuesMissing
Unnamed: 8 has 161 (1.6%) missing valuesMissing
Unnamed: 12 is highly skewed (γ1 = 97.11123229)Skewed
Unnamed: 16 is highly skewed (γ1 = 29.06169873)Skewed
Unnamed: 19 is highly skewed (γ1 = 44.86089775)Skewed
Unnamed: 12 has 191 (1.9%) zerosZeros
Unnamed: 13 has 129 (1.3%) zerosZeros
Unnamed: 14 has 9556 (95.6%) zerosZeros
Unnamed: 15 has 9536 (95.4%) zerosZeros
Unnamed: 16 has 9940 (99.4%) zerosZeros
Unnamed: 18 has 9651 (96.5%) zerosZeros
Unnamed: 19 has 9948 (99.5%) zerosZeros

Reproduction

Analysis started2024-03-14 10:40:32.951523
Analysis finished2024-03-14 10:40:35.656413
Duration2.7 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수원
10000 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row수원
2nd row수원
3rd row수원
4th row수원
5th row수원

Common Values

ValueCountFrequency (%)
수원 10000
100.0%

Length

2024-03-14T19:40:35.845719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T19:40:36.135918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
수원 10000
100.0%
Distinct1946
Distinct (%)19.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T19:40:36.894899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length20
Mean length8.9034
Min length3

Characters and Unicode

Total characters89034
Distinct characters610
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique534 ?
Unique (%)5.3%

Sample

1st row피아체레음악학원
2nd row리드인다독다독독서논술전문학원
3rd row수학원
4th row이자경수학퍼스트학원
5th row와이즈리더영어학원
ValueCountFrequency (%)
러셀영통학원 137
 
1.4%
피켈학원 103
 
1.0%
아발론랭콘수원영통어학원 98
 
1.0%
수원아이비알뷰티미용학원 91
 
0.9%
눈높이러닝센터다솔학원 69
 
0.7%
에스비에스아카데미미용학원 65
 
0.7%
수원장안에이닷영어학원 62
 
0.6%
아발론녹지원광교어학원 61
 
0.6%
에듀코치개별지도영통학원 61
 
0.6%
에스쓰리미용학원 58
 
0.6%
Other values (1936) 9195
92.0%
2024-03-14T19:40:38.159400image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11612
 
13.0%
11065
 
12.4%
3094
 
3.5%
2549
 
2.9%
2147
 
2.4%
2111
 
2.4%
1913
 
2.1%
1760
 
2.0%
1394
 
1.6%
1266
 
1.4%
Other values (600) 50123
56.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 88257
99.1%
Uppercase Letter 471
 
0.5%
Decimal Number 131
 
0.1%
Close Punctuation 86
 
0.1%
Open Punctuation 86
 
0.1%
Space Separator 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11612
 
13.2%
11065
 
12.5%
3094
 
3.5%
2549
 
2.9%
2147
 
2.4%
2111
 
2.4%
1913
 
2.2%
1760
 
2.0%
1394
 
1.6%
1266
 
1.4%
Other values (571) 49346
55.9%
Uppercase Letter
ValueCountFrequency (%)
S 188
39.9%
B 62
 
13.2%
E 41
 
8.7%
A 32
 
6.8%
M 27
 
5.7%
I 17
 
3.6%
N 17
 
3.6%
P 14
 
3.0%
K 13
 
2.8%
R 9
 
1.9%
Other values (8) 51
 
10.8%
Decimal Number
ValueCountFrequency (%)
1 43
32.8%
2 32
24.4%
9 31
23.7%
3 12
 
9.2%
4 5
 
3.8%
7 5
 
3.8%
6 2
 
1.5%
0 1
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 86
100.0%
Open Punctuation
ValueCountFrequency (%)
( 86
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 88257
99.1%
Latin 471
 
0.5%
Common 306
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11612
 
13.2%
11065
 
12.5%
3094
 
3.5%
2549
 
2.9%
2147
 
2.4%
2111
 
2.4%
1913
 
2.2%
1760
 
2.0%
1394
 
1.6%
1266
 
1.4%
Other values (571) 49346
55.9%
Latin
ValueCountFrequency (%)
S 188
39.9%
B 62
 
13.2%
E 41
 
8.7%
A 32
 
6.8%
M 27
 
5.7%
I 17
 
3.6%
N 17
 
3.6%
P 14
 
3.0%
K 13
 
2.8%
R 9
 
1.9%
Other values (8) 51
 
10.8%
Common
ValueCountFrequency (%)
) 86
28.1%
( 86
28.1%
1 43
14.1%
2 32
 
10.5%
9 31
 
10.1%
3 12
 
3.9%
4 5
 
1.6%
7 5
 
1.6%
3
 
1.0%
6 2
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 88257
99.1%
ASCII 777
 
0.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11612
 
13.2%
11065
 
12.5%
3094
 
3.5%
2549
 
2.9%
2147
 
2.4%
2111
 
2.4%
1913
 
2.2%
1760
 
2.0%
1394
 
1.6%
1266
 
1.4%
Other values (571) 49346
55.9%
ASCII
ValueCountFrequency (%)
S 188
24.2%
) 86
11.1%
( 86
11.1%
B 62
 
8.0%
1 43
 
5.5%
E 41
 
5.3%
A 32
 
4.1%
2 32
 
4.1%
9 31
 
4.0%
M 27
 
3.5%
Other values (19) 149
19.2%

Unnamed: 2
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
학교교과교습학원
8752 
평생직업교육학원
1248 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학교교과교습학원
2nd row학교교과교습학원
3rd row학교교과교습학원
4th row학교교과교습학원
5th row학교교과교습학원

Common Values

ValueCountFrequency (%)
학교교과교습학원 8752
87.5%
평생직업교육학원 1248
 
12.5%

Length

2024-03-14T19:40:38.565240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T19:40:38.862456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학교교과교습학원 8752
87.5%
평생직업교육학원 1248
 
12.5%

Unnamed: 3
Categorical

Distinct10
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
입시.검정 및 보습
5150 
예능(대)
1725 
종합(대)
962 
국제화
842 
직업기술
665 
Other values (5)
656 

Length

Max length10
Median length10
Mean length7.3384
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row예능(대)
2nd row입시.검정 및 보습
3rd row입시.검정 및 보습
4th row입시.검정 및 보습
5th row입시.검정 및 보습

Common Values

ValueCountFrequency (%)
입시.검정 및 보습 5150
51.5%
예능(대) 1725
 
17.2%
종합(대) 962
 
9.6%
국제화 842
 
8.4%
직업기술 665
 
6.7%
기타(대) 242
 
2.4%
인문사회(대) 147
 
1.5%
독서실 145
 
1.5%
기예(대) 115
 
1.1%
정보 7
 
0.1%

Length

2024-03-14T19:40:39.238972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T19:40:39.612508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
입시.검정 5150
25.4%
5150
25.4%
보습 5150
25.4%
예능(대 1725
 
8.5%
종합(대 962
 
4.7%
국제화 842
 
4.1%
직업기술 665
 
3.3%
기타(대 242
 
1.2%
인문사회(대 147
 
0.7%
독서실 145
 
0.7%
Other values (2) 122
 
0.6%
Distinct1934
Distinct (%)19.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T19:40:40.750267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length68
Mean length43.9385
Min length22

Characters and Unicode

Total characters439385
Distinct characters411
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique528 ?
Unique (%)5.3%

Sample

1st row경기도 수원시 장안구 파장로68번길 61 , 1층 (파장동 572-13)
2nd row경기도 수원시 영통구 에듀타운로 25 , 301호 (이의동, 명품프라자)
3rd row경기도 수원시 팔달구 동말로 95 , 6층 (화서동)
4th row경기도 수원시 권선구 곡반정로 160 , 503호 (곡반정동, 라퍼스트)
5th row경기도 수원시 영통구 반달로7번길 16 이폴리스 6층 605호 (영통동)
ValueCountFrequency (%)
경기도 10000
 
10.5%
수원시 10000
 
10.5%
8385
 
8.8%
영통구 4388
 
4.6%
일부 2092
 
2.2%
장안구 2007
 
2.1%
팔달구 1861
 
2.0%
영통동 1814
 
1.9%
권선구 1744
 
1.8%
전체 1349
 
1.4%
Other values (2159) 51741
54.2%
2024-03-14T19:40:42.279100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
88039
 
20.0%
, 16935
 
3.9%
1 14726
 
3.4%
0 13475
 
3.1%
11951
 
2.7%
2 11559
 
2.6%
11287
 
2.6%
10970
 
2.5%
10801
 
2.5%
10600
 
2.4%
Other values (401) 239042
54.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 225911
51.4%
Space Separator 88039
 
20.0%
Decimal Number 81105
 
18.5%
Other Punctuation 17123
 
3.9%
Open Punctuation 10483
 
2.4%
Close Punctuation 10463
 
2.4%
Uppercase Letter 2727
 
0.6%
Dash Punctuation 2295
 
0.5%
Math Symbol 879
 
0.2%
Lowercase Letter 346
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11951
 
5.3%
11287
 
5.0%
10970
 
4.9%
10801
 
4.8%
10600
 
4.7%
10449
 
4.6%
10286
 
4.6%
10188
 
4.5%
10142
 
4.5%
10112
 
4.5%
Other values (345) 119125
52.7%
Uppercase Letter
ValueCountFrequency (%)
A 327
12.0%
S 282
10.3%
K 259
9.5%
W 240
8.8%
I 239
8.8%
E 229
8.4%
L 222
8.1%
B 215
7.9%
V 182
6.7%
D 101
 
3.7%
Other values (15) 431
15.8%
Decimal Number
ValueCountFrequency (%)
1 14726
18.2%
0 13475
16.6%
2 11559
14.3%
3 9555
11.8%
4 7427
9.2%
5 7141
8.8%
6 5735
 
7.1%
7 4112
 
5.1%
8 3943
 
4.9%
9 3432
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
e 134
38.7%
a 97
28.0%
k 97
28.0%
l 6
 
1.7%
i 4
 
1.2%
g 2
 
0.6%
n 2
 
0.6%
d 2
 
0.6%
u 2
 
0.6%
Other Punctuation
ValueCountFrequency (%)
, 16935
98.9%
. 94
 
0.5%
· 86
 
0.5%
6
 
< 0.1%
/ 2
 
< 0.1%
Letter Number
ValueCountFrequency (%)
9
64.3%
5
35.7%
Space Separator
ValueCountFrequency (%)
88039
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10483
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10463
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2295
100.0%
Math Symbol
ValueCountFrequency (%)
~ 879
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 225911
51.4%
Common 210387
47.9%
Latin 3087
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11951
 
5.3%
11287
 
5.0%
10970
 
4.9%
10801
 
4.8%
10600
 
4.7%
10449
 
4.6%
10286
 
4.6%
10188
 
4.5%
10142
 
4.5%
10112
 
4.5%
Other values (345) 119125
52.7%
Latin
ValueCountFrequency (%)
A 327
10.6%
S 282
 
9.1%
K 259
 
8.4%
W 240
 
7.8%
I 239
 
7.7%
E 229
 
7.4%
L 222
 
7.2%
B 215
 
7.0%
V 182
 
5.9%
e 134
 
4.3%
Other values (26) 758
24.6%
Common
ValueCountFrequency (%)
88039
41.8%
, 16935
 
8.0%
1 14726
 
7.0%
0 13475
 
6.4%
2 11559
 
5.5%
( 10483
 
5.0%
) 10463
 
5.0%
3 9555
 
4.5%
4 7427
 
3.5%
5 7141
 
3.4%
Other values (10) 20584
 
9.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 225911
51.4%
ASCII 213368
48.6%
None 86
 
< 0.1%
Number Forms 14
 
< 0.1%
Katakana 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
88039
41.3%
, 16935
 
7.9%
1 14726
 
6.9%
0 13475
 
6.3%
2 11559
 
5.4%
( 10483
 
4.9%
) 10463
 
4.9%
3 9555
 
4.5%
4 7427
 
3.5%
5 7141
 
3.3%
Other values (42) 23565
 
11.0%
Hangul
ValueCountFrequency (%)
11951
 
5.3%
11287
 
5.0%
10970
 
4.9%
10801
 
4.8%
10600
 
4.7%
10449
 
4.6%
10286
 
4.6%
10188
 
4.5%
10142
 
4.5%
10112
 
4.5%
Other values (345) 119125
52.7%
None
ValueCountFrequency (%)
· 86
100.0%
Number Forms
ValueCountFrequency (%)
9
64.3%
5
35.7%
Katakana
ValueCountFrequency (%)
6
100.0%
Distinct1694
Distinct (%)16.9%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-14T19:40:43.239595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length3
Mean length5.2905
Min length2

Characters and Unicode

Total characters52905
Distinct characters405
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique423 ?
Unique (%)4.2%

Sample

1st row정화순
2nd row오현주
3rd row조명한
4th row정원석
5th row안상희
ValueCountFrequency (%)
주식회사 2016
 
16.7%
주)대교 520
 
4.3%
메가스터디교육(주 304
 
2.5%
아발론교육 193
 
1.6%
케이뷰티 146
 
1.2%
엠코드에듀 120
 
1.0%
엠코드하이에듀 114
 
0.9%
동화세상에듀코 99
 
0.8%
플로우교육 81
 
0.7%
배경선 73
 
0.6%
Other values (1702) 8387
69.6%
2024-03-14T19:40:44.579708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3707
 
7.0%
2193
 
4.1%
2173
 
4.1%
2129
 
4.0%
2105
 
4.0%
2058
 
3.9%
1524
 
2.9%
1309
 
2.5%
( 1198
 
2.3%
) 1198
 
2.3%
Other values (395) 33311
63.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 47952
90.6%
Space Separator 2058
 
3.9%
Open Punctuation 1198
 
2.3%
Close Punctuation 1198
 
2.3%
Other Punctuation 314
 
0.6%
Uppercase Letter 152
 
0.3%
Lowercase Letter 26
 
< 0.1%
Decimal Number 7
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3707
 
7.7%
2193
 
4.6%
2173
 
4.5%
2129
 
4.4%
2105
 
4.4%
1524
 
3.2%
1309
 
2.7%
908
 
1.9%
883
 
1.8%
854
 
1.8%
Other values (354) 30167
62.9%
Uppercase Letter
ValueCountFrequency (%)
A 20
13.2%
N 14
 
9.2%
O 12
 
7.9%
Y 11
 
7.2%
I 11
 
7.2%
L 10
 
6.6%
R 10
 
6.6%
K 10
 
6.6%
E 9
 
5.9%
U 8
 
5.3%
Other values (10) 37
24.3%
Lowercase Letter
ValueCountFrequency (%)
a 5
19.2%
e 3
11.5%
d 2
 
7.7%
g 2
 
7.7%
r 2
 
7.7%
n 2
 
7.7%
i 2
 
7.7%
l 2
 
7.7%
t 2
 
7.7%
o 2
 
7.7%
Other values (2) 2
 
7.7%
Decimal Number
ValueCountFrequency (%)
1 2
28.6%
2 2
28.6%
6 1
14.3%
3 1
14.3%
4 1
14.3%
Space Separator
ValueCountFrequency (%)
2058
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1198
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1198
100.0%
Other Punctuation
ValueCountFrequency (%)
, 314
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 47952
90.6%
Common 4775
 
9.0%
Latin 178
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3707
 
7.7%
2193
 
4.6%
2173
 
4.5%
2129
 
4.4%
2105
 
4.4%
1524
 
3.2%
1309
 
2.7%
908
 
1.9%
883
 
1.8%
854
 
1.8%
Other values (354) 30167
62.9%
Latin
ValueCountFrequency (%)
A 20
 
11.2%
N 14
 
7.9%
O 12
 
6.7%
Y 11
 
6.2%
I 11
 
6.2%
L 10
 
5.6%
R 10
 
5.6%
K 10
 
5.6%
E 9
 
5.1%
U 8
 
4.5%
Other values (22) 63
35.4%
Common
ValueCountFrequency (%)
2058
43.1%
( 1198
25.1%
) 1198
25.1%
, 314
 
6.6%
1 2
 
< 0.1%
2 2
 
< 0.1%
6 1
 
< 0.1%
3 1
 
< 0.1%
4 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 47952
90.6%
ASCII 4953
 
9.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3707
 
7.7%
2193
 
4.6%
2173
 
4.5%
2129
 
4.4%
2105
 
4.4%
1524
 
3.2%
1309
 
2.7%
908
 
1.9%
883
 
1.8%
854
 
1.8%
Other values (354) 30167
62.9%
ASCII
ValueCountFrequency (%)
2058
41.6%
( 1198
24.2%
) 1198
24.2%
, 314
 
6.3%
A 20
 
0.4%
N 14
 
0.3%
O 12
 
0.2%
Y 11
 
0.2%
I 11
 
0.2%
L 10
 
0.2%
Other values (31) 107
 
2.2%

Unnamed: 6
Text

MISSING 

Distinct1061
Distinct (%)16.7%
Missing3656
Missing (%)36.6%
Memory size156.2 KiB
2024-03-14T19:40:45.575398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.035467
Min length12

Characters and Unicode

Total characters76353
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique253 ?
Unique (%)4.0%

Sample

1st row031-307-5532
2nd row031-242-6399
3rd row031-202-0516
4th row031-269-7084
5th row031-204-0021
ValueCountFrequency (%)
031-251-1010 137
 
2.2%
031-206-1117 121
 
1.9%
031-306-3214 100
 
1.6%
031-203-0026 98
 
1.5%
031-237-9090 72
 
1.1%
031-271-9109 69
 
1.1%
031-242-6200 65
 
1.0%
031-222-6222 62
 
1.0%
031-211-0605 61
 
1.0%
031-204-5155 53
 
0.8%
Other values (1051) 5506
86.8%
2024-03-14T19:40:46.963513image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 13046
17.1%
- 12688
16.6%
1 11125
14.6%
3 9686
12.7%
2 9196
12.0%
5 4626
 
6.1%
9 3821
 
5.0%
6 3335
 
4.4%
7 3252
 
4.3%
4 2911
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 63665
83.4%
Dash Punctuation 12688
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 13046
20.5%
1 11125
17.5%
3 9686
15.2%
2 9196
14.4%
5 4626
 
7.3%
9 3821
 
6.0%
6 3335
 
5.2%
7 3252
 
5.1%
4 2911
 
4.6%
8 2667
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 12688
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 76353
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 13046
17.1%
- 12688
16.6%
1 11125
14.6%
3 9686
12.7%
2 9196
12.0%
5 4626
 
6.1%
9 3821
 
5.0%
6 3335
 
4.4%
7 3252
 
4.3%
4 2911
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 76353
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 13046
17.1%
- 12688
16.6%
1 11125
14.6%
3 9686
12.7%
2 9196
12.0%
5 4626
 
6.1%
9 3821
 
5.0%
6 3335
 
4.4%
7 3252
 
4.3%
4 2911
 
3.8%

Unnamed: 7
Categorical

Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
보통교과
5483 
예능(중)
1773 
외국어
912 
산업응용기술
603 
기타(중)
 
278
Other values (13)
951 

Length

Max length7
Median length4
Mean length4.2852
Min length2

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row예능(중)
2nd row보통교과
3rd row보통교과
4th row보통교과
5th row보통교과

Common Values

ValueCountFrequency (%)
보통교과 5483
54.8%
예능(중) 1773
 
17.7%
외국어 912
 
9.1%
산업응용기술 603
 
6.0%
기타(중) 278
 
2.8%
<NA> 161
 
1.6%
독서실 144
 
1.4%
인문사회(중) 144
 
1.4%
기예(중) 119
 
1.2%
컴퓨터 105
 
1.1%
Other values (8) 278
 
2.8%

Length

2024-03-14T19:40:47.389534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보통교과 5483
54.8%
예능(중 1773
 
17.7%
외국어 912
 
9.1%
산업응용기술 603
 
6.0%
기타(중 278
 
2.8%
na 161
 
1.6%
독서실 144
 
1.4%
인문사회(중 144
 
1.4%
기예(중 119
 
1.2%
컴퓨터 105
 
1.1%
Other values (8) 278
 
2.8%

Unnamed: 8
Text

MISSING 

Distinct59
Distinct (%)0.6%
Missing161
Missing (%)1.6%
Memory size156.2 KiB
2024-03-14T19:40:48.077416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length2
Mean length4.0039638
Min length2

Characters and Unicode

Total characters39395
Distinct characters115
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row음악
2nd row보습·논술
3rd row보습
4th row보습
5th row보습
ValueCountFrequency (%)
보습 5205
52.9%
음악 1223
 
12.4%
실용외국어(유아/초·중·고 903
 
9.2%
이·미용 455
 
4.6%
미술 406
 
4.1%
식음료품(바리스타,소믈리에 153
 
1.6%
입시 146
 
1.5%
독서실(유아/초·중·고 144
 
1.5%
기타(소 142
 
1.4%
무용 139
 
1.4%
Other values (49) 923
 
9.4%
2024-03-14T19:40:49.174068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5365
 
13.6%
5240
 
13.3%
· 2586
 
6.6%
( 1592
 
4.0%
) 1592
 
4.0%
1563
 
4.0%
1463
 
3.7%
1318
 
3.3%
1125
 
2.9%
1098
 
2.8%
Other values (105) 16453
41.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32246
81.9%
Other Punctuation 3965
 
10.1%
Open Punctuation 1592
 
4.0%
Close Punctuation 1592
 
4.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5365
16.6%
5240
16.3%
1563
 
4.8%
1463
 
4.5%
1318
 
4.1%
1125
 
3.5%
1098
 
3.4%
1083
 
3.4%
1047
 
3.2%
1047
 
3.2%
Other values (100) 11897
36.9%
Other Punctuation
ValueCountFrequency (%)
· 2586
65.2%
/ 1047
26.4%
, 332
 
8.4%
Open Punctuation
ValueCountFrequency (%)
( 1592
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1592
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32246
81.9%
Common 7149
 
18.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5365
16.6%
5240
16.3%
1563
 
4.8%
1463
 
4.5%
1318
 
4.1%
1125
 
3.5%
1098
 
3.4%
1083
 
3.4%
1047
 
3.2%
1047
 
3.2%
Other values (100) 11897
36.9%
Common
ValueCountFrequency (%)
· 2586
36.2%
( 1592
22.3%
) 1592
22.3%
/ 1047
14.6%
, 332
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32246
81.9%
ASCII 4563
 
11.6%
None 2586
 
6.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5365
16.6%
5240
16.3%
1563
 
4.8%
1463
 
4.5%
1318
 
4.1%
1125
 
3.5%
1098
 
3.4%
1083
 
3.4%
1047
 
3.2%
1047
 
3.2%
Other values (100) 11897
36.9%
None
ValueCountFrequency (%)
· 2586
100.0%
ASCII
ValueCountFrequency (%)
( 1592
34.9%
) 1592
34.9%
/ 1047
22.9%
, 332
 
7.3%
Distinct6041
Distinct (%)60.6%
Missing32
Missing (%)0.3%
Memory size156.2 KiB
2024-03-14T19:40:50.344044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length37
Mean length7.7966493
Min length1

Characters and Unicode

Total characters77717
Distinct characters624
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5016 ?
Unique (%)50.3%

Sample

1st row중급
2nd row초등(독서,논술)
3rd row고등단과(영,수)
4th row수학(고)
5th rowI-80
ValueCountFrequency (%)
영어 153
 
1.3%
중등단과(수학 136
 
1.2%
고등단과(수학 135
 
1.2%
피아노 126
 
1.1%
수학 110
 
0.9%
초등단과(수학 105
 
0.9%
중등수학 104
 
0.9%
중등단과(영어 92
 
0.8%
초등수학 92
 
0.8%
초등단과(영어 89
 
0.8%
Other values (5945) 10558
90.2%
2024-03-14T19:40:51.984781image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5033
 
6.5%
( 4245
 
5.5%
) 4223
 
5.4%
3117
 
4.0%
3105
 
4.0%
2909
 
3.7%
2735
 
3.5%
2361
 
3.0%
2355
 
3.0%
2289
 
2.9%
Other values (614) 45345
58.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54506
70.1%
Decimal Number 5629
 
7.2%
Open Punctuation 4399
 
5.7%
Close Punctuation 4377
 
5.6%
Uppercase Letter 2776
 
3.6%
Other Punctuation 2016
 
2.6%
Space Separator 1737
 
2.2%
Lowercase Letter 1541
 
2.0%
Dash Punctuation 531
 
0.7%
Math Symbol 120
 
0.2%
Other values (2) 85
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5033
 
9.2%
3117
 
5.7%
3105
 
5.7%
2909
 
5.3%
2735
 
5.0%
2361
 
4.3%
2355
 
4.3%
2289
 
4.2%
2224
 
4.1%
2172
 
4.0%
Other values (529) 26206
48.1%
Uppercase Letter
ValueCountFrequency (%)
A 600
21.6%
B 410
14.8%
C 277
10.0%
D 155
 
5.6%
L 144
 
5.2%
S 132
 
4.8%
M 126
 
4.5%
E 126
 
4.5%
I 95
 
3.4%
T 88
 
3.2%
Other values (16) 623
22.4%
Lowercase Letter
ValueCountFrequency (%)
e 157
 
10.2%
a 131
 
8.5%
i 125
 
8.1%
m 112
 
7.3%
t 112
 
7.3%
n 110
 
7.1%
s 102
 
6.6%
o 88
 
5.7%
l 87
 
5.6%
r 85
 
5.5%
Other values (16) 432
28.0%
Other Punctuation
ValueCountFrequency (%)
, 1705
84.6%
/ 185
 
9.2%
. 64
 
3.2%
& 26
 
1.3%
: 8
 
0.4%
; 7
 
0.3%
* 7
 
0.3%
% 6
 
0.3%
' 6
 
0.3%
· 2
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 1463
26.0%
2 1333
23.7%
3 885
15.7%
4 502
 
8.9%
0 418
 
7.4%
5 346
 
6.1%
6 275
 
4.9%
8 152
 
2.7%
9 131
 
2.3%
7 124
 
2.2%
Letter Number
ValueCountFrequency (%)
27
69.2%
6
 
15.4%
3
 
7.7%
3
 
7.7%
Open Punctuation
ValueCountFrequency (%)
( 4245
96.5%
[ 154
 
3.5%
Close Punctuation
ValueCountFrequency (%)
) 4223
96.5%
] 154
 
3.5%
Math Symbol
ValueCountFrequency (%)
~ 76
63.3%
+ 44
36.7%
Space Separator
ValueCountFrequency (%)
1737
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 531
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 46
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 54506
70.1%
Common 18855
 
24.3%
Latin 4356
 
5.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5033
 
9.2%
3117
 
5.7%
3105
 
5.7%
2909
 
5.3%
2735
 
5.0%
2361
 
4.3%
2355
 
4.3%
2289
 
4.2%
2224
 
4.1%
2172
 
4.0%
Other values (529) 26206
48.1%
Latin
ValueCountFrequency (%)
A 600
 
13.8%
B 410
 
9.4%
C 277
 
6.4%
e 157
 
3.6%
D 155
 
3.6%
L 144
 
3.3%
S 132
 
3.0%
a 131
 
3.0%
M 126
 
2.9%
E 126
 
2.9%
Other values (46) 2098
48.2%
Common
ValueCountFrequency (%)
( 4245
22.5%
) 4223
22.4%
1737
9.2%
, 1705
9.0%
1 1463
 
7.8%
2 1333
 
7.1%
3 885
 
4.7%
- 531
 
2.8%
4 502
 
2.7%
0 418
 
2.2%
Other values (19) 1813
9.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 54505
70.1%
ASCII 23170
29.8%
Number Forms 39
 
0.1%
None 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5033
 
9.2%
3117
 
5.7%
3105
 
5.7%
2909
 
5.3%
2735
 
5.0%
2361
 
4.3%
2355
 
4.3%
2289
 
4.2%
2224
 
4.1%
2172
 
4.0%
Other values (528) 26205
48.1%
ASCII
ValueCountFrequency (%)
( 4245
18.3%
) 4223
18.2%
1737
 
7.5%
, 1705
 
7.4%
1 1463
 
6.3%
2 1333
 
5.8%
3 885
 
3.8%
A 600
 
2.6%
- 531
 
2.3%
4 502
 
2.2%
Other values (70) 5946
25.7%
Number Forms
ValueCountFrequency (%)
27
69.2%
6
 
15.4%
3
 
7.7%
3
 
7.7%
None
ValueCountFrequency (%)
· 2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%

Unnamed: 10
Real number (ℝ)

Distinct102
Distinct (%)1.0%
Missing32
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean21.327247
Minimum0
Maximum384
Zeros67
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T19:40:52.408790image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4
Q110
median13
Q320
95-th percentile60
Maximum384
Range384
Interquartile range (IQR)10

Descriptive statistics

Standard deviation27.82951
Coefficient of variation (CV)1.3048806
Kurtosis30.803161
Mean21.327247
Median Absolute Deviation (MAD)5
Skewness4.5772202
Sum212590
Variance774.48164
MonotonicityNot monotonic
2024-03-14T19:40:52.703050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 1737
17.4%
20 1140
11.4%
15 1084
10.8%
13 664
 
6.6%
8 638
 
6.4%
12 622
 
6.2%
30 529
 
5.3%
5 365
 
3.6%
6 309
 
3.1%
40 252
 
2.5%
Other values (92) 2628
26.3%
ValueCountFrequency (%)
0 67
 
0.7%
1 146
 
1.5%
2 85
 
0.9%
3 68
 
0.7%
4 139
 
1.4%
5 365
3.6%
6 309
3.1%
7 87
 
0.9%
8 638
6.4%
9 129
 
1.3%
ValueCountFrequency (%)
384 5
 
0.1%
360 1
 
< 0.1%
320 1
 
< 0.1%
300 1
 
< 0.1%
293 1
 
< 0.1%
250 1
 
< 0.1%
210 1
 
< 0.1%
200 17
0.2%
194 12
0.1%
170 1
 
< 0.1%
Distinct67
Distinct (%)0.7%
Missing32
Missing (%)0.3%
Memory size156.2 KiB
2024-03-14T19:40:53.311799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length5
Mean length5.0331059
Min length5

Characters and Unicode

Total characters50170
Distinct characters13
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)0.2%

Sample

1st row1개월0일
2nd row1개월0일
3rd row1개월0일
4th row1개월0일
5th row1개월0일
ValueCountFrequency (%)
1개월0일 8828
88.6%
2개월0일 181
 
1.8%
3개월0일 166
 
1.7%
0개월1일 134
 
1.3%
0개월0일 104
 
1.0%
0개월28일 95
 
1.0%
4개월0일 48
 
0.5%
0개월21일 39
 
0.4%
6개월0일 38
 
0.4%
0개월10일 31
 
0.3%
Other values (57) 304
 
3.0%
2024-03-14T19:40:54.133738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10115
20.2%
9968
19.9%
9968
19.9%
9968
19.9%
1 9146
18.2%
2 396
 
0.8%
3 194
 
0.4%
8 121
 
0.2%
4 113
 
0.2%
6 67
 
0.1%
Other values (3) 114
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 29904
59.6%
Decimal Number 20266
40.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10115
49.9%
1 9146
45.1%
2 396
 
2.0%
3 194
 
1.0%
8 121
 
0.6%
4 113
 
0.6%
6 67
 
0.3%
5 61
 
0.3%
7 48
 
0.2%
9 5
 
< 0.1%
Other Letter
ValueCountFrequency (%)
9968
33.3%
9968
33.3%
9968
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 29904
59.6%
Common 20266
40.4%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10115
49.9%
1 9146
45.1%
2 396
 
2.0%
3 194
 
1.0%
8 121
 
0.6%
4 113
 
0.6%
6 67
 
0.3%
5 61
 
0.3%
7 48
 
0.2%
9 5
 
< 0.1%
Hangul
ValueCountFrequency (%)
9968
33.3%
9968
33.3%
9968
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 29904
59.6%
ASCII 20266
40.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10115
49.9%
1 9146
45.1%
2 396
 
2.0%
3 194
 
1.0%
8 121
 
0.6%
4 113
 
0.6%
6 67
 
0.3%
5 61
 
0.3%
7 48
 
0.2%
9 5
 
< 0.1%
Hangul
ValueCountFrequency (%)
9968
33.3%
9968
33.3%
9968
33.3%

Unnamed: 12
Real number (ℝ)

SKEWED  ZEROS 

Distinct732
Distinct (%)7.3%
Missing32
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean1205604.9
Minimum0
Maximum1 × 1010
Zeros191
Zeros (%)1.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T19:40:54.401143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile215
Q1860
median1290
Q31806
95-th percentile3780
Maximum1 × 1010
Range1 × 1010
Interquartile range (IQR)946

Descriptive statistics

Standard deviation1.0115492 × 108
Coefficient of variation (CV)83.903875
Kurtosis9582.7461
Mean1205604.9
Median Absolute Deviation (MAD)474
Skewness97.111232
Sum1.2017469 × 1010
Variance1.0232318 × 1016
MonotonicityNot monotonic
2024-03-14T19:40:54.796721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1260 659
 
6.6%
1512 456
 
4.6%
1290 415
 
4.2%
1548 384
 
3.8%
1008 253
 
2.5%
1032 212
 
2.1%
0 191
 
1.9%
2322 187
 
1.9%
2268 177
 
1.8%
756 169
 
1.7%
Other values (722) 6865
68.7%
ValueCountFrequency (%)
0 191
1.9%
1 11
 
0.1%
10 1
 
< 0.1%
20 1
 
< 0.1%
24 1
 
< 0.1%
30 1
 
< 0.1%
40 2
 
< 0.1%
48 1
 
< 0.1%
50 10
 
0.1%
55 5
 
0.1%
ValueCountFrequency (%)
9999999999 1
 
< 0.1%
999999999 2
 
< 0.1%
91200 4
< 0.1%
73200 1
 
< 0.1%
69360 2
 
< 0.1%
67200 2
 
< 0.1%
62400 1
 
< 0.1%
52800 6
0.1%
48000 1
 
< 0.1%
46440 2
 
< 0.1%

Unnamed: 13
Real number (ℝ)

ZEROS 

Distinct864
Distinct (%)8.7%
Missing32
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean321032.88
Minimum0
Maximum10000000
Zeros129
Zeros (%)1.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T19:40:55.232111image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile35000
Q1140000
median220000
Q3346250
95-th percentile820000
Maximum10000000
Range10000000
Interquartile range (IQR)206250

Descriptive statistics

Standard deviation488627.73
Coefficient of variation (CV)1.5220489
Kurtosis102.08218
Mean321032.88
Median Absolute Deviation (MAD)95000
Skewness8.2125574
Sum3.2000557 × 109
Variance2.3875706 × 1011
MonotonicityNot monotonic
2024-03-14T19:40:55.698644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
150000 461
 
4.6%
300000 427
 
4.3%
200000 394
 
3.9%
250000 368
 
3.7%
180000 290
 
2.9%
120000 264
 
2.6%
160000 259
 
2.6%
130000 228
 
2.3%
100000 222
 
2.2%
140000 220
 
2.2%
Other values (854) 6835
68.3%
ValueCountFrequency (%)
0 129
1.3%
1 2
 
< 0.1%
4500 1
 
< 0.1%
4700 1
 
< 0.1%
5000 9
 
0.1%
6000 4
 
< 0.1%
6500 2
 
< 0.1%
7000 4
 
< 0.1%
7200 1
 
< 0.1%
8000 3
 
< 0.1%
ValueCountFrequency (%)
10000000 1
 
< 0.1%
9000000 2
< 0.1%
8896580 1
 
< 0.1%
8747460 1
 
< 0.1%
8534340 1
 
< 0.1%
8062980 1
 
< 0.1%
8000000 1
 
< 0.1%
6000000 4
< 0.1%
5500000 1
 
< 0.1%
5400000 2
< 0.1%

Unnamed: 14
Real number (ℝ)

ZEROS 

Distinct81
Distinct (%)0.8%
Missing32
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean2877.5732
Minimum0
Maximum880000
Zeros9556
Zeros (%)95.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T19:40:56.140587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum880000
Range880000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation30605.233
Coefficient of variation (CV)10.635779
Kurtosis305.22906
Mean2877.5732
Median Absolute Deviation (MAD)0
Skewness16.337879
Sum28683650
Variance9.3668028 × 108
MonotonicityNot monotonic
2024-03-14T19:40:56.570882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 9556
95.6%
20000 112
 
1.1%
10000 45
 
0.4%
30000 27
 
0.3%
15000 20
 
0.2%
4000 17
 
0.2%
50000 10
 
0.1%
5000 9
 
0.1%
45000 9
 
0.1%
40000 9
 
0.1%
Other values (71) 154
 
1.5%
(Missing) 32
 
0.3%
ValueCountFrequency (%)
0 9556
95.6%
2000 3
 
< 0.1%
3000 8
 
0.1%
3200 5
 
0.1%
4000 17
 
0.2%
5000 9
 
0.1%
6000 7
 
0.1%
7000 1
 
< 0.1%
8000 2
 
< 0.1%
9000 4
 
< 0.1%
ValueCountFrequency (%)
880000 1
 
< 0.1%
720000 1
 
< 0.1%
660000 2
< 0.1%
630000 1
 
< 0.1%
600000 2
< 0.1%
550000 2
< 0.1%
520000 3
< 0.1%
507500 2
< 0.1%
495000 1
 
< 0.1%
475000 1
 
< 0.1%

Unnamed: 15
Real number (ℝ)

ZEROS 

Distinct115
Distinct (%)1.2%
Missing32
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean21279.697
Minimum0
Maximum6500000
Zeros9536
Zeros (%)95.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T19:40:57.166960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum6500000
Range6500000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation154747.01
Coefficient of variation (CV)7.2720494
Kurtosis367.29458
Mean21279.697
Median Absolute Deviation (MAD)0
Skewness13.852332
Sum2.1211602 × 108
Variance2.3946637 × 1010
MonotonicityNot monotonic
2024-03-14T19:40:57.607529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 9536
95.4%
20000 29
 
0.3%
40000 25
 
0.2%
700000 23
 
0.2%
500000 21
 
0.2%
1000000 18
 
0.2%
600000 17
 
0.2%
300000 16
 
0.2%
130000 16
 
0.2%
400000 14
 
0.1%
Other values (105) 253
 
2.5%
(Missing) 32
 
0.3%
ValueCountFrequency (%)
0 9536
95.4%
4000 2
 
< 0.1%
5000 1
 
< 0.1%
5200 1
 
< 0.1%
6000 2
 
< 0.1%
7000 1
 
< 0.1%
8000 2
 
< 0.1%
10000 6
 
0.1%
10680 1
 
< 0.1%
11000 1
 
< 0.1%
ValueCountFrequency (%)
6500000 1
 
< 0.1%
2610000 1
 
< 0.1%
2400000 1
 
< 0.1%
2000000 1
 
< 0.1%
1990000 1
 
< 0.1%
1800000 4
 
< 0.1%
1700000 1
 
< 0.1%
1690000 1
 
< 0.1%
1590000 10
0.1%
1500000 2
 
< 0.1%

Unnamed: 16
Real number (ℝ)

SKEWED  ZEROS 

Distinct15
Distinct (%)0.2%
Missing32
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean222.21108
Minimum0
Maximum220000
Zeros9940
Zeros (%)99.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T19:40:57.880478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum220000
Range220000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation5325.1158
Coefficient of variation (CV)23.964223
Kurtosis941.09254
Mean222.21108
Median Absolute Deviation (MAD)0
Skewness29.061699
Sum2215000
Variance28356858
MonotonicityNot monotonic
2024-03-14T19:40:58.171877image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0 9940
99.4%
20000 6
 
0.1%
10000 3
 
< 0.1%
80000 3
 
< 0.1%
130000 3
 
< 0.1%
200000 2
 
< 0.1%
100000 2
 
< 0.1%
90000 2
 
< 0.1%
5000 1
 
< 0.1%
70000 1
 
< 0.1%
Other values (5) 5
 
0.1%
(Missing) 32
 
0.3%
ValueCountFrequency (%)
0 9940
99.4%
5000 1
 
< 0.1%
10000 3
 
< 0.1%
20000 6
 
0.1%
40000 1
 
< 0.1%
50000 1
 
< 0.1%
70000 1
 
< 0.1%
80000 3
 
< 0.1%
90000 2
 
< 0.1%
100000 2
 
< 0.1%
ValueCountFrequency (%)
220000 1
 
< 0.1%
200000 2
< 0.1%
150000 1
 
< 0.1%
130000 3
< 0.1%
120000 1
 
< 0.1%
100000 2
< 0.1%
90000 2
< 0.1%
80000 3
< 0.1%
70000 1
 
< 0.1%
50000 1
 
< 0.1%

Unnamed: 17
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9968 
<NA>
 
32

Length

Max length4
Median length1
Mean length1.0096
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9968
99.7%
<NA> 32
 
0.3%

Length

2024-03-14T19:40:58.416401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T19:40:58.594333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9968
99.7%
na 32
 
0.3%

Unnamed: 18
Real number (ℝ)

ZEROS 

Distinct47
Distinct (%)0.5%
Missing32
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean979.89366
Minimum0
Maximum150000
Zeros9651
Zeros (%)96.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T19:40:58.794246image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum150000
Range150000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation6272.2396
Coefficient of variation (CV)6.400939
Kurtosis95.213298
Mean979.89366
Median Absolute Deviation (MAD)0
Skewness8.4584122
Sum9767580
Variance39340989
MonotonicityNot monotonic
2024-03-14T19:40:59.062466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=47)
ValueCountFrequency (%)
0 9651
96.5%
30000 97
 
1.0%
20000 76
 
0.8%
40000 28
 
0.3%
50000 14
 
0.1%
10000 13
 
0.1%
25000 11
 
0.1%
70000 9
 
0.1%
15000 7
 
0.1%
60000 6
 
0.1%
Other values (37) 56
 
0.6%
(Missing) 32
 
0.3%
ValueCountFrequency (%)
0 9651
96.5%
220 1
 
< 0.1%
960 1
 
< 0.1%
1880 1
 
< 0.1%
3075 2
 
< 0.1%
3295 1
 
< 0.1%
4420 1
 
< 0.1%
4880 1
 
< 0.1%
5500 1
 
< 0.1%
6000 5
 
0.1%
ValueCountFrequency (%)
150000 1
 
< 0.1%
100000 2
 
< 0.1%
80000 4
< 0.1%
75000 1
 
< 0.1%
70000 9
0.1%
65000 3
 
< 0.1%
63000 1
 
< 0.1%
60000 6
0.1%
55000 3
 
< 0.1%
51000 2
 
< 0.1%

Unnamed: 19
Real number (ℝ)

SKEWED  ZEROS 

Distinct14
Distinct (%)0.1%
Missing32
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean316.33226
Minimum0
Maximum640000
Zeros9948
Zeros (%)99.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T19:40:59.368840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum640000
Range640000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation10995.372
Coefficient of variation (CV)34.758934
Kurtosis2230.056
Mean316.33226
Median Absolute Deviation (MAD)0
Skewness44.860898
Sum3153200
Variance1.2089821 × 108
MonotonicityNot monotonic
2024-03-14T19:40:59.562350image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
0 9948
99.5%
300000 3
 
< 0.1%
40000 2
 
< 0.1%
35000 2
 
< 0.1%
43600 2
 
< 0.1%
13000 2
 
< 0.1%
25000 2
 
< 0.1%
50000 1
 
< 0.1%
30000 1
 
< 0.1%
585000 1
 
< 0.1%
Other values (4) 4
 
< 0.1%
(Missing) 32
 
0.3%
ValueCountFrequency (%)
0 9948
99.5%
13000 2
 
< 0.1%
25000 2
 
< 0.1%
30000 1
 
< 0.1%
35000 2
 
< 0.1%
40000 2
 
< 0.1%
43600 2
 
< 0.1%
50000 1
 
< 0.1%
85000 1
 
< 0.1%
200000 1
 
< 0.1%
ValueCountFrequency (%)
640000 1
 
< 0.1%
585000 1
 
< 0.1%
350000 1
 
< 0.1%
300000 3
< 0.1%
200000 1
 
< 0.1%
85000 1
 
< 0.1%
50000 1
 
< 0.1%
43600 2
< 0.1%
40000 2
< 0.1%
35000 2
< 0.1%

Unnamed: 20
Real number (ℝ)

Distinct937
Distinct (%)9.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean345599.12
Minimum0
Maximum12500000
Zeros36
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-14T19:40:59.808430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile35000
Q1140000
median230000
Q3350000
95-th percentile1000000
Maximum12500000
Range12500000
Interquartile range (IQR)210000

Descriptive statistics

Standard deviation548574.41
Coefficient of variation (CV)1.5873143
Kurtosis97.423826
Mean345599.12
Median Absolute Deviation (MAD)100000
Skewness7.7817119
Sum3.4559912 × 109
Variance3.0093388 × 1011
MonotonicityNot monotonic
2024-03-14T19:41:00.181330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
150000 445
 
4.5%
300000 415
 
4.2%
200000 385
 
3.9%
250000 369
 
3.7%
180000 283
 
2.8%
160000 268
 
2.7%
120000 252
 
2.5%
130000 234
 
2.3%
100000 222
 
2.2%
140000 220
 
2.2%
Other values (927) 6907
69.1%
ValueCountFrequency (%)
0 36
0.4%
2000 3
 
< 0.1%
3000 2
 
< 0.1%
3200 5
 
0.1%
4000 2
 
< 0.1%
4500 1
 
< 0.1%
4700 1
 
< 0.1%
5000 10
 
0.1%
6000 10
 
0.1%
6500 2
 
< 0.1%
ValueCountFrequency (%)
12500000 1
< 0.1%
10700000 1
< 0.1%
9700000 1
< 0.1%
9000000 2
< 0.1%
8896580 1
< 0.1%
8747460 1
< 0.1%
8534340 1
< 0.1%
8062980 1
< 0.1%
6700000 1
< 0.1%
6000000 2
< 0.1%

Unnamed: 21
Categorical

Distinct50
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2
1534 
1
1154 
3
1053 
4
848 
5
653 
Other values (45)
4758 

Length

Max length2
Median length1
Mean length1.3025
Min length1

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row1
2nd row4
3rd row1
4th row9
5th row5

Common Values

ValueCountFrequency (%)
2 1534
15.3%
1 1154
11.5%
3 1053
 
10.5%
4 848
 
8.5%
5 653
 
6.5%
6 560
 
5.6%
10 454
 
4.5%
7 397
 
4.0%
8 360
 
3.6%
0 278
 
2.8%
Other values (40) 2709
27.1%

Length

2024-03-14T19:41:00.573663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2 1534
15.3%
1 1154
11.5%
3 1053
 
10.5%
4 848
 
8.5%
5 653
 
6.5%
6 560
 
5.6%
10 454
 
4.5%
7 397
 
4.0%
8 360
 
3.6%
0 278
 
2.8%
Other values (40) 2709
27.1%

Sample

○ 문의사항 : 수원교육지원청 평생교육건강과로 문의 바랍니다.Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21
157수원피아체레음악학원학교교과교습학원예능(대)경기도 수원시 장안구 파장로68번길 61 , 1층 (파장동 572-13)정화순031-307-5532예능(중)음악중급101개월0일10801200000000001200001
31434수원리드인다독다독독서논술전문학원학교교과교습학원입시.검정 및 보습경기도 수원시 영통구 에듀타운로 25 , 301호 (이의동, 명품프라자)오현주<NA>보통교과보습·논술초등(독서,논술)201개월0일90315000004000000001900004
10273수원수학원학교교과교습학원입시.검정 및 보습경기도 수원시 팔달구 동말로 95 , 6층 (화서동)조명한031-242-6399보통교과보습고등단과(영,수)81개월0일12152360000000002360001
32331수원이자경수학퍼스트학원학교교과교습학원입시.검정 및 보습경기도 수원시 권선구 곡반정로 160 , 503호 (곡반정동, 라퍼스트)정원석<NA>보통교과보습수학(고)1941개월0일23224852980000004852989
9293수원와이즈리더영어학원학교교과교습학원입시.검정 및 보습경기도 수원시 영통구 반달로7번길 16 이폴리스 6층 605호 (영통동)안상희031-202-0516보통교과보습I-80201개월0일20641462500000001462505
1926수원정자에이스학원학교교과교습학원입시.검정 및 보습경기도 수원시 장안구 대평로 86 , 5층 507호 일부, 507-1호 전체 (정자동877-1외1필지,화이트쇼핑몰)김경원,차주향031-269-7084보통교과보습중등단과(수학)A171개월0일14702300000000002300002
1003수원그랜드음악학원학교교과교습학원예능(대)경기도 수원시 영통구 봉영로 1605 , 모던타운 602호일부 (영통동)정현주031-204-0021예능(중)음악바이올린51개월0일504600000000006000010
15775수원씨엠수학전문학원학교교과교습학원입시.검정 및 보습경기도 수원시 장안구 정자천로 179 , 408호, 409호 일부 (정자동)이지훈<NA>보통교과보습수학(초)2101개월0일10321700000000001700001
16693수원공부의정석학원학교교과교습학원입시.검정 및 보습경기도 수원시 영통구 매영로 85 , 504호, 302호 (매탄동, 성일코아빌딩)박두선031-217-1300보통교과보습수학초등101개월0일15122313300000002313303
16639수원동수원요리전문학원평생직업교육학원직업기술경기도 수원시 팔달구 중부대로223번길 4 , 403호 (우만동, 블루하우스)오현주031-213-5330산업응용기술식음료품(바리스타,소믈리에)중식조리기능사 자격증 취득반161개월0일3024200000025000000004500001
○ 문의사항 : 수원교육지원청 평생교육건강과로 문의 바랍니다.Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21
30524수원컬컴수원점학원평생직업교육학원국제화경기도 수원시 권선구 권선로 472 , 3층 일부 (세류동, 세지빌딩)주식회사 컬컴평생교육원<NA>국제어학(성인)한국어회화2001개월0일8601200000000001200001
19121수원싹수학학원학교교과교습학원입시.검정 및 보습경기도 수원시 팔달구 권광로276번길 4 , 2층 일부 및 3층 일부 (인계동)주식회사 싹아카데미<NA>보통교과보습수학1151개월0일18902100000000002100007
20975수원수원영통아이들의작업실미술학원학교교과교습학원예능(대)경기도 수원시 영통구 봉영로 1612 , 411호 (영통동)김연진<NA>예능(중)미술초등미술(분기)43개월0일1134192000031800000005100004
27168수원지트에듀케이션광교해마루학원학교교과교습학원입시.검정 및 보습경기도 수원시 영통구 도청로17번길 40 , 3층 301호~305호 전체 (이의동)황재옥<NA>보통교과보습중등수학9201개월0일55911850800001149201300009
30931수원두런영어학원학교교과교습학원입시.검정 및 보습경기도 수원시 장안구 만석로159번길 56 , 4층 전체 (파장동, 승리속셈보습학원)이여주,임화숙<NA>보통교과보습초등영어151개월0일10321700000000001700003
22275수원러셀영통학원학교교과교습학원종합(대)경기도 수원시 영통구 봉영로 1623 , 408호, 409호, 414호, 415호 일부, 416호 일부 (영통동 958-1, 드림피아빌딩)메가스터디교육(주)031-251-1010보통교과보습고등 국어, 수학, 영어, 사회, 과학, 제2외국어811351개월0일132527500000000027500076
28015수원수학의아침사이언스카이학원학교교과교습학원종합(대)경기도 수원시 영통구 법조로 25 , 602~605호, 607~613호 (하동, 광교 SK VIEW Lake)플로우교육 주식회사<NA>보통교과보습초등보습(더블랙 정규J)w101개월0일152220000000000020000035
29949수원망포피아체레음악학원학교교과교습학원예능(대)경기도 수원시 영통구 태장로7번길 57 , 402호 일부 (망포동)이유미<NA>예능(중)음악악기고급21개월0일5161400000000001400004
25559수원실력상승학원학교교과교습학원입시.검정 및 보습경기도 수원시 영통구 매영로 58 , 2층 전체 (매탄동, 홍식빌딩)임현수031-278-2777보통교과보습초등단과(영어)101개월0일12041800000000001800004
26157수원세영코딩연구소학원학교교과교습학원기타(대)경기도 수원시 영통구 영통로 127 , 6층 602호 전체 (망포동, 센타프라자)최민국<NA>기타(중)기타(소)프로그래밍심화A(성인)81개월0일5042000000000002000001

Duplicate rows

Most frequently occurring

○ 문의사항 : 수원교육지원청 평생교육건강과로 문의 바랍니다.Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21# duplicates
39수원탑올림피아드학원학교교과교습학원입시.검정 및 보습경기도 수원시 팔달구 경수대로604번길 15 , 대창빌딩 2층 (우만동)오지혜031-893-9010보통교과보습중등단과(수학)151개월0일151225000000000025000063
0수원SAP영어학원학교교과교습학원입시.검정 및 보습경기도 수원시 영통구 월드컵로87번길 7 , 301호, 호반프라자 (원천동)서희란031-2160-553보통교과보습중등영어301개월0일154829000000000029000042
1수원SR영어독서학원학교교과교습학원입시.검정 및 보습경기도 수원시 장안구 파장로 53 , 정자 벽산블루밍상가1 2층 201호 (정자동)김주아031-255-5567보통교과보습초등단과(영어2)201개월0일116118000000000018000022
2수원경기목공기술학원평생직업교육학원직업기술경기도 수원시 권선구 덕영대로 1150 , 1층 일부, 5층 일부 (세류동, 동일빌딩)이승찬031-221-7893산업기반기술건축목공인테리어(1)151개월0일193521510000000021510012
3수원곰수학학원학교교과교습학원입시.검정 및 보습경기도 수원시 영통구 효원로 381 , 901호 일부 (매탄동 1267-1)(주)곰수학학원031-217-9449보통교과보습고등단과(국어)251개월0일126025000000000025000062
4수원눈높이러닝센터조원학원학교교과교습학원입시.검정 및 보습경기도 수원시 장안구 금당로 42 외1필지 그린프라자 201호 (조원동)(주)대교031-253-9509보통교과보습수학공부와락151개월0일1683000000000030000122
5수원눈높이러닝센터조원학원학교교과교습학원입시.검정 및 보습경기도 수원시 장안구 금당로 42 외1필지 그린프라자 201호 (조원동)(주)대교031-253-9509보통교과보습아이리스닝151개월0일2103200000000032000122
6수원눈높이러닝센터조원학원학교교과교습학원입시.검정 및 보습경기도 수원시 장안구 금당로 42 외1필지 그린프라자 201호 (조원동)(주)대교031-253-9509보통교과보습중등사회역사151개월0일2103200000000032000122
7수원뉴영통서울학원학교교과교습학원입시.검정 및 보습경기도 수원시 장안구 대평로 128 외6필지 파크프라자 205호 (정자동)(주)정자영통서울학원031-247-0011보통교과입시수학/초등101개월0일1161180000000000180000232
8수원뉴영통서울학원학교교과교습학원입시.검정 및 보습경기도 수원시 장안구 대평로 128 외6필지 파크프라자 205호 (정자동)(주)정자영통서울학원031-247-0011보통교과입시영어/중등101개월0일1548280000000000280000232