Overview

Dataset statistics

Number of variables21
Number of observations10000
Missing cells16540
Missing cells (%)7.9%
Duplicate rows3
Duplicate rows (%)< 0.1%
Total size in memory1.8 MiB
Average record size in memory185.0 B

Variable types

Text10
Categorical3
Numeric8

Dataset

Description전북특별자치도교육청 14개 시군 학원 현황 데이터로 학원명, 학원종류, 학원주소, 설립자, 교습계열, 교습과정, 교습비 등을 제공합니다.
Author전북특별자치도교육청
URLhttps://www.data.go.kr/data/15053372/fileData.do

Alerts

Dataset has 3 (< 0.1%) duplicate rowsDuplicates
학원종류 is highly imbalanced (65.0%)Imbalance
피복비 is highly imbalanced (61.7%)Imbalance
전화번호 has 2811 (28.1%) missing valuesMissing
교습과정 has 144 (1.4%) missing valuesMissing
모의고사비 has 2212 (22.1%) missing valuesMissing
재료비 has 2154 (21.5%) missing valuesMissing
급식비 has 2391 (23.9%) missing valuesMissing
기숙사비 has 2219 (22.2%) missing valuesMissing
차량비 has 2338 (23.4%) missing valuesMissing
기타경비합계 has 2116 (21.2%) missing valuesMissing
정원 is highly skewed (γ1 = 30.06614959)Skewed
모의고사비 is highly skewed (γ1 = 23.83810154)Skewed
급식비 is highly skewed (γ1 = 26.88086505)Skewed
차량비 is highly skewed (γ1 = 29.50850813)Skewed
모의고사비 has 7729 (77.3%) zerosZeros
재료비 has 7599 (76.0%) zerosZeros
급식비 has 7594 (75.9%) zerosZeros
기숙사비 has 7710 (77.1%) zerosZeros
차량비 has 7594 (75.9%) zerosZeros
기타경비합계 has 7484 (74.8%) zerosZeros
강사수 has 249 (2.5%) zerosZeros

Reproduction

Analysis started2024-03-14 15:30:16.076190
Analysis finished2024-03-14 15:30:18.902425
Duration2.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct3243
Distinct (%)32.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T00:30:19.595728image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length26
Mean length8.8354
Min length3

Characters and Unicode

Total characters88354
Distinct characters729
Distinct categories10 ?
Distinct scripts5 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique962 ?
Unique (%)9.6%

Sample

1st row멜로디음악학원
2nd row쿼크수학영어전문학원
3rd row참빛학원
4th row삼천카이스트학원
5th row솔내지앤비어학원
ValueCountFrequency (%)
눈높이러닝센터영등학원 36
 
0.4%
이젠컴퓨터아트서비스학원 31
 
0.3%
눈높이러닝센터부송학원 31
 
0.3%
그린컴퓨터아트학원 31
 
0.3%
눈높이러닝센터수성학원 30
 
0.3%
진평생직업교육학원 30
 
0.3%
눈높이러닝센터학원 30
 
0.3%
전주동양컴퓨터학원 28
 
0.3%
동양컴퓨터회계학원 27
 
0.3%
등용문아카데미회계컴퓨터아트학원 26
 
0.3%
Other values (3300) 9912
97.1%
2024-03-15T00:30:20.889852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
11836
 
13.4%
10091
 
11.4%
2100
 
2.4%
2014
 
2.3%
2001
 
2.3%
1849
 
2.1%
1783
 
2.0%
1766
 
2.0%
1577
 
1.8%
1453
 
1.6%
Other values (719) 51884
58.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 83947
95.0%
Uppercase Letter 2358
 
2.7%
Lowercase Letter 790
 
0.9%
Open Punctuation 295
 
0.3%
Close Punctuation 295
 
0.3%
Space Separator 223
 
0.3%
Other Punctuation 218
 
0.2%
Decimal Number 185
 
0.2%
Math Symbol 25
 
< 0.1%
Dash Punctuation 18
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11836
 
14.1%
10091
 
12.0%
2100
 
2.5%
2014
 
2.4%
2001
 
2.4%
1849
 
2.2%
1783
 
2.1%
1766
 
2.1%
1577
 
1.9%
1453
 
1.7%
Other values (648) 47477
56.6%
Uppercase Letter
ValueCountFrequency (%)
S 270
 
11.5%
E 243
 
10.3%
M 234
 
9.9%
I 172
 
7.3%
A 147
 
6.2%
C 141
 
6.0%
T 132
 
5.6%
Y 128
 
5.4%
B 108
 
4.6%
P 97
 
4.1%
Other values (15) 686
29.1%
Lowercase Letter
ValueCountFrequency (%)
n 93
11.8%
s 79
10.0%
e 77
9.7%
i 75
9.5%
o 70
8.9%
g 60
7.6%
a 56
 
7.1%
l 51
 
6.5%
h 48
 
6.1%
t 40
 
5.1%
Other values (11) 141
17.8%
Other Punctuation
ValueCountFrequency (%)
& 67
30.7%
. 49
22.5%
· 45
20.6%
, 18
 
8.3%
? 12
 
5.5%
' 8
 
3.7%
! 6
 
2.8%
# 5
 
2.3%
: 5
 
2.3%
2
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 69
37.3%
2 46
24.9%
3 37
20.0%
6 10
 
5.4%
9 10
 
5.4%
8 5
 
2.7%
7 4
 
2.2%
4 3
 
1.6%
0 1
 
0.5%
Open Punctuation
ValueCountFrequency (%)
( 295
100.0%
Close Punctuation
ValueCountFrequency (%)
) 295
100.0%
Space Separator
ValueCountFrequency (%)
223
100.0%
Math Symbol
ValueCountFrequency (%)
+ 25
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 83915
95.0%
Latin 3147
 
3.6%
Common 1259
 
1.4%
Han 32
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11836
 
14.1%
10091
 
12.0%
2100
 
2.5%
2014
 
2.4%
2001
 
2.4%
1849
 
2.2%
1783
 
2.1%
1766
 
2.1%
1577
 
1.9%
1453
 
1.7%
Other values (641) 47445
56.5%
Latin
ValueCountFrequency (%)
S 270
 
8.6%
E 243
 
7.7%
M 234
 
7.4%
I 172
 
5.5%
A 147
 
4.7%
C 141
 
4.5%
T 132
 
4.2%
Y 128
 
4.1%
B 108
 
3.4%
P 97
 
3.1%
Other values (35) 1475
46.9%
Common
ValueCountFrequency (%)
( 295
23.4%
) 295
23.4%
223
17.7%
1 69
 
5.5%
& 67
 
5.3%
. 49
 
3.9%
2 46
 
3.7%
· 45
 
3.6%
3 37
 
2.9%
+ 25
 
2.0%
Other values (15) 108
 
8.6%
Han
ValueCountFrequency (%)
12
37.5%
9
28.1%
4
 
12.5%
3
 
9.4%
2
 
6.2%
1
 
3.1%
1
 
3.1%
Greek
ValueCountFrequency (%)
α 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 83908
95.0%
ASCII 4359
 
4.9%
None 48
 
0.1%
CJK 32
 
< 0.1%
Compat Jamo 7
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11836
 
14.1%
10091
 
12.0%
2100
 
2.5%
2014
 
2.4%
2001
 
2.4%
1849
 
2.2%
1783
 
2.1%
1766
 
2.1%
1577
 
1.9%
1453
 
1.7%
Other values (640) 47438
56.5%
ASCII
ValueCountFrequency (%)
( 295
 
6.8%
) 295
 
6.8%
S 270
 
6.2%
E 243
 
5.6%
M 234
 
5.4%
223
 
5.1%
I 172
 
3.9%
A 147
 
3.4%
C 141
 
3.2%
T 132
 
3.0%
Other values (58) 2207
50.6%
None
ValueCountFrequency (%)
· 45
93.8%
2
 
4.2%
α 1
 
2.1%
CJK
ValueCountFrequency (%)
12
37.5%
9
28.1%
4
 
12.5%
3
 
9.4%
2
 
6.2%
1
 
3.1%
1
 
3.1%
Compat Jamo
ValueCountFrequency (%)
7
100.0%

학원종류
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
학교교과교습학원
8720 
평생직업교육학원
1277 
<NA>
 
3

Length

Max length8
Median length8
Mean length7.9988
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학교교과교습학원
2nd row학교교과교습학원
3rd row학교교과교습학원
4th row학교교과교습학원
5th row학교교과교습학원

Common Values

ValueCountFrequency (%)
학교교과교습학원 8720
87.2%
평생직업교육학원 1277
 
12.8%
<NA> 3
 
< 0.1%

Length

2024-03-15T00:30:21.332958image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:30:21.679850image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학교교과교습학원 8720
87.2%
평생직업교육학원 1277
 
12.8%
na 3
 
< 0.1%
Distinct3323
Distinct (%)33.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T00:30:23.182290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length90
Median length61
Mean length39.2525
Min length10

Characters and Unicode

Total characters392525
Distinct characters544
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks5 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique999 ?
Unique (%)10.0%

Sample

1st row전북특별자치도 전주시 완산구 중산중앙로 9-3 (중화산동2가)
2nd row전북특별자치도 전주시 완산구 효천천변길 24 , 302호 (효자동2가)
3rd row전북특별자치도 전주시 덕진구 석소로 27 , 201호 (인후동1가, 아중제일아파트)
4th row전북특별자치도 전주시 완산구 효자천변1길 26-3 , 2층 (효자동1가)
5th row전북특별자치도 전주시 덕진구 두간로 6 , 3층 (송천동1가)
ValueCountFrequency (%)
전북특별자치도 9980
 
12.3%
9004
 
11.1%
전주시 5680
 
7.0%
덕진구 2945
 
3.6%
완산구 2735
 
3.4%
2층 2486
 
3.1%
3층 1525
 
1.9%
익산시 1462
 
1.8%
군산시 1325
 
1.6%
4층 821
 
1.0%
Other values (2993) 43185
53.2%
2024-03-15T00:30:25.027658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
71408
 
18.2%
16414
 
4.2%
2 13168
 
3.4%
, 13165
 
3.4%
11284
 
2.9%
1 11229
 
2.9%
11193
 
2.9%
( 10895
 
2.8%
) 10890
 
2.8%
10220
 
2.6%
Other values (534) 212659
54.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 226815
57.8%
Space Separator 71408
 
18.2%
Decimal Number 56543
 
14.4%
Other Punctuation 13357
 
3.4%
Open Punctuation 10895
 
2.8%
Close Punctuation 10890
 
2.8%
Dash Punctuation 1971
 
0.5%
Uppercase Letter 476
 
0.1%
Lowercase Letter 91
 
< 0.1%
Math Symbol 76
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
16414
 
7.2%
11284
 
5.0%
11193
 
4.9%
10220
 
4.5%
10209
 
4.5%
10078
 
4.4%
10001
 
4.4%
9990
 
4.4%
9647
 
4.3%
7994
 
3.5%
Other values (478) 119785
52.8%
Uppercase Letter
ValueCountFrequency (%)
A 71
14.9%
S 62
13.0%
K 34
 
7.1%
B 31
 
6.5%
Y 29
 
6.1%
M 29
 
6.1%
T 28
 
5.9%
E 26
 
5.5%
C 26
 
5.5%
R 21
 
4.4%
Other values (12) 119
25.0%
Lowercase Letter
ValueCountFrequency (%)
e 28
30.8%
r 12
13.2%
k 7
 
7.7%
i 7
 
7.7%
w 6
 
6.6%
a 6
 
6.6%
s 6
 
6.6%
o 6
 
6.6%
d 6
 
6.6%
t 3
 
3.3%
Other values (2) 4
 
4.4%
Decimal Number
ValueCountFrequency (%)
2 13168
23.3%
1 11229
19.9%
3 7629
13.5%
0 7120
12.6%
4 5175
 
9.2%
5 3101
 
5.5%
6 2605
 
4.6%
7 2587
 
4.6%
8 2033
 
3.6%
9 1896
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 13165
98.6%
@ 78
 
0.6%
. 78
 
0.6%
/ 21
 
0.2%
· 10
 
0.1%
? 5
 
< 0.1%
Space Separator
ValueCountFrequency (%)
71408
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10895
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10890
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1971
100.0%
Math Symbol
ValueCountFrequency (%)
~ 76
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 226815
57.8%
Common 165143
42.1%
Latin 567
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
16414
 
7.2%
11284
 
5.0%
11193
 
4.9%
10220
 
4.5%
10209
 
4.5%
10078
 
4.4%
10001
 
4.4%
9990
 
4.4%
9647
 
4.3%
7994
 
3.5%
Other values (478) 119785
52.8%
Latin
ValueCountFrequency (%)
A 71
 
12.5%
S 62
 
10.9%
K 34
 
6.0%
B 31
 
5.5%
Y 29
 
5.1%
M 29
 
5.1%
e 28
 
4.9%
T 28
 
4.9%
E 26
 
4.6%
C 26
 
4.6%
Other values (24) 203
35.8%
Common
ValueCountFrequency (%)
71408
43.2%
2 13168
 
8.0%
, 13165
 
8.0%
1 11229
 
6.8%
( 10895
 
6.6%
) 10890
 
6.6%
3 7629
 
4.6%
0 7120
 
4.3%
4 5175
 
3.1%
5 3101
 
1.9%
Other values (12) 11363
 
6.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 226811
57.8%
ASCII 165697
42.2%
None 10
 
< 0.1%
Compat Jamo 4
 
< 0.1%
CJK Compat 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
71408
43.1%
2 13168
 
7.9%
, 13165
 
7.9%
1 11229
 
6.8%
( 10895
 
6.6%
) 10890
 
6.6%
3 7629
 
4.6%
0 7120
 
4.3%
4 5175
 
3.1%
5 3101
 
1.9%
Other values (44) 11917
 
7.2%
Hangul
ValueCountFrequency (%)
16414
 
7.2%
11284
 
5.0%
11193
 
4.9%
10220
 
4.5%
10209
 
4.5%
10078
 
4.4%
10001
 
4.4%
9990
 
4.4%
9647
 
4.3%
7994
 
3.5%
Other values (477) 119781
52.8%
None
ValueCountFrequency (%)
· 10
100.0%
Compat Jamo
ValueCountFrequency (%)
4
100.0%
CJK Compat
ValueCountFrequency (%)
3
100.0%
Distinct2845
Distinct (%)28.5%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-03-15T00:30:26.286158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length3
Mean length3.6311631
Min length2

Characters and Unicode

Total characters36308
Distinct characters370
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique789 ?
Unique (%)7.9%

Sample

1st row신은정
2nd row김윤빈
3rd row오치훈
4th row최문희
5th row이은경
ValueCountFrequency (%)
주)대교 407
 
3.9%
주식회사 273
 
2.6%
웅진씽크빅 122
 
1.2%
조완순 55
 
0.5%
주)대교-박명규 55
 
0.5%
이환배 34
 
0.3%
강경일,우선정 31
 
0.3%
유한회사 31
 
0.3%
염정호 31
 
0.3%
진정화 30
 
0.3%
Other values (2844) 9250
89.6%
2024-03-15T00:30:27.798024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1960
 
5.4%
1462
 
4.0%
1274
 
3.5%
1272
 
3.5%
1036
 
2.9%
801
 
2.2%
766
 
2.1%
755
 
2.1%
729
 
2.0%
694
 
1.9%
Other values (360) 25559
70.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34023
93.7%
Close Punctuation 642
 
1.8%
Open Punctuation 639
 
1.8%
Space Separator 346
 
1.0%
Other Punctuation 340
 
0.9%
Lowercase Letter 133
 
0.4%
Uppercase Letter 130
 
0.4%
Dash Punctuation 55
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1960
 
5.8%
1462
 
4.3%
1274
 
3.7%
1272
 
3.7%
1036
 
3.0%
801
 
2.4%
766
 
2.3%
755
 
2.2%
729
 
2.1%
694
 
2.0%
Other values (324) 23274
68.4%
Uppercase Letter
ValueCountFrequency (%)
R 17
13.1%
N 12
 
9.2%
Y 11
 
8.5%
O 11
 
8.5%
S 10
 
7.7%
C 9
 
6.9%
E 8
 
6.2%
M 7
 
5.4%
T 7
 
5.4%
D 7
 
5.4%
Other values (10) 31
23.8%
Lowercase Letter
ValueCountFrequency (%)
r 21
15.8%
e 21
15.8%
o 21
15.8%
h 14
10.5%
t 14
10.5%
y 7
 
5.3%
b 7
 
5.3%
i 7
 
5.3%
s 7
 
5.3%
p 7
 
5.3%
Close Punctuation
ValueCountFrequency (%)
) 642
100.0%
Open Punctuation
ValueCountFrequency (%)
( 639
100.0%
Space Separator
ValueCountFrequency (%)
346
100.0%
Other Punctuation
ValueCountFrequency (%)
, 340
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 55
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34023
93.7%
Common 2022
 
5.6%
Latin 263
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1960
 
5.8%
1462
 
4.3%
1274
 
3.7%
1272
 
3.7%
1036
 
3.0%
801
 
2.4%
766
 
2.3%
755
 
2.2%
729
 
2.1%
694
 
2.0%
Other values (324) 23274
68.4%
Latin
ValueCountFrequency (%)
r 21
 
8.0%
e 21
 
8.0%
o 21
 
8.0%
R 17
 
6.5%
h 14
 
5.3%
t 14
 
5.3%
N 12
 
4.6%
Y 11
 
4.2%
O 11
 
4.2%
S 10
 
3.8%
Other values (21) 111
42.2%
Common
ValueCountFrequency (%)
) 642
31.8%
( 639
31.6%
346
17.1%
, 340
16.8%
- 55
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34023
93.7%
ASCII 2285
 
6.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1960
 
5.8%
1462
 
4.3%
1274
 
3.7%
1272
 
3.7%
1036
 
3.0%
801
 
2.4%
766
 
2.3%
755
 
2.2%
729
 
2.1%
694
 
2.0%
Other values (324) 23274
68.4%
ASCII
ValueCountFrequency (%)
) 642
28.1%
( 639
28.0%
346
15.1%
, 340
14.9%
- 55
 
2.4%
r 21
 
0.9%
e 21
 
0.9%
o 21
 
0.9%
R 17
 
0.7%
h 14
 
0.6%
Other values (26) 169
 
7.4%

전화번호
Text

MISSING 

Distinct2302
Distinct (%)32.0%
Missing2811
Missing (%)28.1%
Memory size156.2 KiB
2024-03-15T00:30:28.767671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.010154
Min length11

Characters and Unicode

Total characters86341
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique676 ?
Unique (%)9.4%

Sample

1st row063-223-4387
2nd row063-245-8891
3rd row063-224-3625
4th row063-277-0522
5th row063-223-8889
ValueCountFrequency (%)
063-835-9109 36
 
0.5%
063-232-2111 31
 
0.4%
063-835-9509 31
 
0.4%
063-276-2381 31
 
0.4%
063-538-9509 30
 
0.4%
063-536-9509 30
 
0.4%
063-252-8814 28
 
0.4%
063-236-8814 27
 
0.4%
063-271-5505 27
 
0.4%
063-285-9191 26
 
0.4%
Other values (2292) 6892
95.9%
2024-03-15T00:30:30.013758image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 14378
16.7%
0 12302
14.2%
3 11908
13.8%
6 11018
12.8%
2 9086
10.5%
5 5922
6.9%
8 4733
 
5.5%
4 4456
 
5.2%
1 4412
 
5.1%
7 4227
 
4.9%
Other values (2) 3899
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 71960
83.3%
Dash Punctuation 14378
 
16.7%
Space Separator 3
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 12302
17.1%
3 11908
16.5%
6 11018
15.3%
2 9086
12.6%
5 5922
8.2%
8 4733
 
6.6%
4 4456
 
6.2%
1 4412
 
6.1%
7 4227
 
5.9%
9 3896
 
5.4%
Dash Punctuation
ValueCountFrequency (%)
- 14378
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 86341
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 14378
16.7%
0 12302
14.2%
3 11908
13.8%
6 11018
12.8%
2 9086
10.5%
5 5922
6.9%
8 4733
 
5.5%
4 4456
 
5.2%
1 4412
 
5.1%
7 4227
 
4.9%
Other values (2) 3899
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 86341
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 14378
16.7%
0 12302
14.2%
3 11908
13.8%
6 11018
12.8%
2 9086
10.5%
5 5922
6.9%
8 4733
 
5.5%
4 4456
 
5.2%
1 4412
 
5.1%
7 4227
 
4.9%
Other values (2) 3899
 
4.5%

교습계열
Categorical

Distinct20
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
보통교과
5019 
예능(중)
2460 
외국어
897 
산업응용기술
 
374
컴퓨터
 
356
Other values (15)
894 

Length

Max length7
Median length4
Mean length4.2579
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row예능(중)
2nd row보통교과
3rd row보통교과
4th row보통교과
5th row외국어

Common Values

ValueCountFrequency (%)
보통교과 5019
50.2%
예능(중) 2460
24.6%
외국어 897
 
9.0%
산업응용기술 374
 
3.7%
컴퓨터 356
 
3.6%
기예(중) 249
 
2.5%
<NA> 136
 
1.4%
기타(중) 111
 
1.1%
독서실 96
 
1.0%
산업기반기술 65
 
0.7%
Other values (10) 237
 
2.4%

Length

2024-03-15T00:30:30.449533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보통교과 5019
50.2%
예능(중 2460
24.6%
외국어 897
 
9.0%
산업응용기술 374
 
3.7%
컴퓨터 356
 
3.6%
기예(중 249
 
2.5%
na 136
 
1.4%
기타(중 111
 
1.1%
독서실 96
 
1.0%
산업기반기술 65
 
0.7%
Other values (10) 237
 
2.4%

교습과정
Text

MISSING 

Distinct61
Distinct (%)0.6%
Missing144
Missing (%)1.4%
Memory size156.2 KiB
2024-03-15T00:30:31.200060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length2
Mean length4.5777192
Min length2

Characters and Unicode

Total characters45118
Distinct characters120
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)0.1%

Sample

1st row음악
2nd row입시
3rd row입시·논술
4th row보습
5th row실용외국어(유아/초·중·고)
ValueCountFrequency (%)
보습 3487
35.4%
음악 1828
18.5%
입시 1378
 
14.0%
실용외국어(유아/초·중·고 894
 
9.1%
미술 520
 
5.3%
컴퓨터(정보처리,통신기기,인터넷,소프트웨어 350
 
3.6%
이·미용 221
 
2.2%
무용 118
 
1.2%
식음료품(바리스타,소믈리에 110
 
1.1%
보습·논술 105
 
1.1%
Other values (51) 845
 
8.6%
2024-03-15T00:30:32.359949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3952
 
8.8%
3592
 
8.0%
· 2328
 
5.2%
1998
 
4.4%
1954
 
4.3%
( 1785
 
4.0%
) 1785
 
4.0%
1418
 
3.1%
1404
 
3.1%
1350
 
3.0%
Other values (110) 23552
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 36885
81.8%
Other Punctuation 4663
 
10.3%
Open Punctuation 1785
 
4.0%
Close Punctuation 1785
 
4.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3952
 
10.7%
3592
 
9.7%
1998
 
5.4%
1954
 
5.3%
1418
 
3.8%
1404
 
3.8%
1350
 
3.7%
1293
 
3.5%
1112
 
3.0%
1002
 
2.7%
Other values (105) 17810
48.3%
Other Punctuation
ValueCountFrequency (%)
· 2328
49.9%
, 1347
28.9%
/ 988
21.2%
Open Punctuation
ValueCountFrequency (%)
( 1785
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1785
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 36885
81.8%
Common 8233
 
18.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3952
 
10.7%
3592
 
9.7%
1998
 
5.4%
1954
 
5.3%
1418
 
3.8%
1404
 
3.8%
1350
 
3.7%
1293
 
3.5%
1112
 
3.0%
1002
 
2.7%
Other values (105) 17810
48.3%
Common
ValueCountFrequency (%)
· 2328
28.3%
( 1785
21.7%
) 1785
21.7%
, 1347
16.4%
/ 988
12.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 36885
81.8%
ASCII 5905
 
13.1%
None 2328
 
5.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3952
 
10.7%
3592
 
9.7%
1998
 
5.4%
1954
 
5.3%
1418
 
3.8%
1404
 
3.8%
1350
 
3.7%
1293
 
3.5%
1112
 
3.0%
1002
 
2.7%
Other values (105) 17810
48.3%
None
ValueCountFrequency (%)
· 2328
100.0%
ASCII
ValueCountFrequency (%)
( 1785
30.2%
) 1785
30.2%
, 1347
22.8%
/ 988
16.7%
Distinct5532
Distinct (%)55.5%
Missing30
Missing (%)0.3%
Memory size156.2 KiB
2024-03-15T00:30:33.698645image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length35
Mean length7.1528586
Min length1

Characters and Unicode

Total characters71314
Distinct characters603
Distinct categories14 ?
Distinct scripts3 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4567 ?
Unique (%)45.8%

Sample

1st row피아노기초-고
2nd row고등영어B
3rd row중등수학논술1
4th row수학A(초)
5th row영어중등D
ValueCountFrequency (%)
중등수학 185
 
1.7%
초등수학 158
 
1.4%
고등수학 149
 
1.4%
중등영어 145
 
1.3%
고등영어 127
 
1.2%
초등영어 125
 
1.1%
피아노 95
 
0.9%
수학(초 92
 
0.8%
수학 89
 
0.8%
초등 88
 
0.8%
Other values (5438) 9757
88.6%
2024-03-15T00:30:35.263592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
) 4882
 
6.8%
( 4876
 
6.8%
4355
 
6.1%
3725
 
5.2%
2819
 
4.0%
2781
 
3.9%
2735
 
3.8%
2482
 
3.5%
2320
 
3.3%
2203
 
3.1%
Other values (593) 38136
53.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 52023
72.9%
Close Punctuation 4914
 
6.9%
Open Punctuation 4908
 
6.9%
Decimal Number 2985
 
4.2%
Uppercase Letter 2435
 
3.4%
Other Punctuation 1978
 
2.8%
Space Separator 1057
 
1.5%
Lowercase Letter 505
 
0.7%
Dash Punctuation 308
 
0.4%
Math Symbol 107
 
0.2%
Other values (4) 94
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
4355
 
8.4%
3725
 
7.2%
2819
 
5.4%
2781
 
5.3%
2735
 
5.3%
2482
 
4.8%
2320
 
4.5%
2203
 
4.2%
1703
 
3.3%
1196
 
2.3%
Other values (505) 25704
49.4%
Uppercase Letter
ValueCountFrequency (%)
A 681
28.0%
B 546
22.4%
C 308
12.6%
D 147
 
6.0%
E 109
 
4.5%
T 77
 
3.2%
P 68
 
2.8%
I 63
 
2.6%
S 62
 
2.5%
G 51
 
2.1%
Other values (15) 323
13.3%
Lowercase Letter
ValueCountFrequency (%)
n 57
11.3%
o 56
11.1%
e 46
 
9.1%
i 38
 
7.5%
r 37
 
7.3%
a 35
 
6.9%
c 33
 
6.5%
t 30
 
5.9%
s 28
 
5.5%
l 23
 
4.6%
Other values (15) 122
24.2%
Other Punctuation
ValueCountFrequency (%)
, 1651
83.5%
. 202
 
10.2%
/ 77
 
3.9%
& 17
 
0.9%
; 14
 
0.7%
: 8
 
0.4%
* 5
 
0.3%
# 1
 
0.1%
! 1
 
0.1%
\ 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 780
26.1%
2 595
19.9%
3 567
19.0%
0 496
16.6%
4 237
 
7.9%
5 177
 
5.9%
6 85
 
2.8%
7 21
 
0.7%
8 18
 
0.6%
9 9
 
0.3%
Letter Number
ValueCountFrequency (%)
38
51.4%
20
27.0%
7
 
9.5%
4
 
5.4%
4
 
5.4%
1
 
1.4%
Close Punctuation
ValueCountFrequency (%)
) 4882
99.3%
] 32
 
0.7%
Open Punctuation
ValueCountFrequency (%)
( 4876
99.3%
[ 32
 
0.7%
Math Symbol
ValueCountFrequency (%)
~ 62
57.9%
+ 45
42.1%
Space Separator
ValueCountFrequency (%)
1057
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 308
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 16
100.0%
Other Number
ValueCountFrequency (%)
2
100.0%
Control
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 52023
72.9%
Common 16277
 
22.8%
Latin 3014
 
4.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
4355
 
8.4%
3725
 
7.2%
2819
 
5.4%
2781
 
5.3%
2735
 
5.3%
2482
 
4.8%
2320
 
4.5%
2203
 
4.2%
1703
 
3.3%
1196
 
2.3%
Other values (505) 25704
49.4%
Latin
ValueCountFrequency (%)
A 681
22.6%
B 546
18.1%
C 308
 
10.2%
D 147
 
4.9%
E 109
 
3.6%
T 77
 
2.6%
P 68
 
2.3%
I 63
 
2.1%
S 62
 
2.1%
n 57
 
1.9%
Other values (46) 896
29.7%
Common
ValueCountFrequency (%)
) 4882
30.0%
( 4876
30.0%
, 1651
 
10.1%
1057
 
6.5%
1 780
 
4.8%
2 595
 
3.7%
3 567
 
3.5%
0 496
 
3.0%
- 308
 
1.9%
4 237
 
1.5%
Other values (22) 828
 
5.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 52022
72.9%
ASCII 19214
 
26.9%
Number Forms 74
 
0.1%
Enclosed Alphanum 2
 
< 0.1%
Compat Jamo 1
 
< 0.1%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
) 4882
25.4%
( 4876
25.4%
, 1651
 
8.6%
1057
 
5.5%
1 780
 
4.1%
A 681
 
3.5%
2 595
 
3.1%
3 567
 
3.0%
B 546
 
2.8%
0 496
 
2.6%
Other values (70) 3083
16.0%
Hangul
ValueCountFrequency (%)
4355
 
8.4%
3725
 
7.2%
2819
 
5.4%
2781
 
5.3%
2735
 
5.3%
2482
 
4.8%
2320
 
4.5%
2203
 
4.2%
1703
 
3.3%
1196
 
2.3%
Other values (504) 25703
49.4%
Number Forms
ValueCountFrequency (%)
38
51.4%
20
27.0%
7
 
9.5%
4
 
5.4%
4
 
5.4%
1
 
1.4%
Enclosed Alphanum
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
· 1
100.0%

정원
Real number (ℝ)

SKEWED 

Distinct134
Distinct (%)1.3%
Missing30
Missing (%)0.3%
Infinite0
Infinite (%)0.0%
Mean23.440722
Minimum0
Maximum3000
Zeros21
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T00:30:35.665791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile5
Q110
median15
Q323
95-th percentile60
Maximum3000
Range3000
Interquartile range (IQR)13

Descriptive statistics

Standard deviation92.833675
Coefficient of variation (CV)3.960359
Kurtosis955.40266
Mean23.440722
Median Absolute Deviation (MAD)5
Skewness30.06615
Sum233704
Variance8618.0912
MonotonicityNot monotonic
2024-03-15T00:30:36.095960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 1610
16.1%
15 1200
 
12.0%
20 1010
 
10.1%
8 717
 
7.2%
12 535
 
5.3%
30 427
 
4.3%
6 390
 
3.9%
5 350
 
3.5%
9 261
 
2.6%
40 224
 
2.2%
Other values (124) 3246
32.5%
ValueCountFrequency (%)
0 21
 
0.2%
1 103
 
1.0%
2 40
 
0.4%
3 58
 
0.6%
4 123
 
1.2%
5 350
3.5%
6 390
3.9%
7 171
 
1.7%
8 717
7.2%
9 261
 
2.6%
ValueCountFrequency (%)
3000 9
0.1%
1300 1
 
< 0.1%
328 1
 
< 0.1%
323 1
 
< 0.1%
247 1
 
< 0.1%
245 1
 
< 0.1%
210 1
 
< 0.1%
200 2
 
< 0.1%
182 10
0.1%
180 2
 
< 0.1%
Distinct51
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2024-03-15T00:30:36.768520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.0581
Min length5

Characters and Unicode

Total characters50581
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.1%

Sample

1st row1개월0일
2nd row1개월0일
3rd row1개월0일
4th row1개월0일
5th row1개월0일
ValueCountFrequency (%)
1개월0일 8985
89.5%
1개월20일 335
 
3.3%
0개월0일 123
 
1.2%
2개월0일 91
 
0.9%
3개월0일 48
 
0.5%
0개월1일 47
 
0.5%
1개월 43
 
0.4%
0일 43
 
0.4%
1개월5일 41
 
0.4%
0개월19일 39
 
0.4%
Other values (42) 248
 
2.5%
2024-03-15T00:30:37.893260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 10058
19.9%
10000
19.8%
10000
19.8%
10000
19.8%
1 9684
19.1%
2 510
 
1.0%
3 74
 
0.1%
5 68
 
0.1%
6 44
 
0.1%
43
 
0.1%
Other values (4) 100
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 30000
59.3%
Decimal Number 20538
40.6%
Space Separator 43
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 10058
49.0%
1 9684
47.2%
2 510
 
2.5%
3 74
 
0.4%
5 68
 
0.3%
6 44
 
0.2%
9 41
 
0.2%
4 41
 
0.2%
8 17
 
0.1%
7 1
 
< 0.1%
Other Letter
ValueCountFrequency (%)
10000
33.3%
10000
33.3%
10000
33.3%
Space Separator
ValueCountFrequency (%)
43
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 30000
59.3%
Common 20581
40.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 10058
48.9%
1 9684
47.1%
2 510
 
2.5%
3 74
 
0.4%
5 68
 
0.3%
6 44
 
0.2%
43
 
0.2%
9 41
 
0.2%
4 41
 
0.2%
8 17
 
0.1%
Hangul
ValueCountFrequency (%)
10000
33.3%
10000
33.3%
10000
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 30000
59.3%
ASCII 20581
40.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 10058
48.9%
1 9684
47.1%
2 510
 
2.5%
3 74
 
0.4%
5 68
 
0.3%
6 44
 
0.2%
43
 
0.2%
9 41
 
0.2%
4 41
 
0.2%
8 17
 
0.1%
Hangul
ValueCountFrequency (%)
10000
33.3%
10000
33.3%
10000
33.3%
Distinct585
Distinct (%)5.9%
Missing30
Missing (%)0.3%
Memory size156.2 KiB
2024-03-15T00:30:39.677076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length4
Mean length3.7296891
Min length1

Characters and Unicode

Total characters37185
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique240 ?
Unique (%)2.4%

Sample

1st row1200
2nd row1784
3rd row1170
4th row1060
5th row1800
ValueCountFrequency (%)
1200 1508
 
15.1%
1000 606
 
6.1%
1440 451
 
4.5%
720 341
 
3.4%
1800 316
 
3.2%
1400 295
 
3.0%
960 277
 
2.8%
1600 230
 
2.3%
2400 226
 
2.3%
1080 203
 
2.0%
Other values (575) 5517
55.3%
2024-03-15T00:30:41.771449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 15596
41.9%
1 6828
18.4%
2 4548
 
12.2%
4 2748
 
7.4%
8 1943
 
5.2%
6 1841
 
5.0%
3 1088
 
2.9%
5 996
 
2.7%
9 802
 
2.2%
7 756
 
2.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 37146
99.9%
Other Punctuation 39
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 15596
42.0%
1 6828
18.4%
2 4548
 
12.2%
4 2748
 
7.4%
8 1943
 
5.2%
6 1841
 
5.0%
3 1088
 
2.9%
5 996
 
2.7%
9 802
 
2.2%
7 756
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 39
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 37185
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 15596
41.9%
1 6828
18.4%
2 4548
 
12.2%
4 2748
 
7.4%
8 1943
 
5.2%
6 1841
 
5.0%
3 1088
 
2.9%
5 996
 
2.7%
9 802
 
2.2%
7 756
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37185
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 15596
41.9%
1 6828
18.4%
2 4548
 
12.2%
4 2748
 
7.4%
8 1943
 
5.2%
6 1841
 
5.0%
3 1088
 
2.9%
5 996
 
2.7%
9 802
 
2.2%
7 756
 
2.0%
Distinct552
Distinct (%)5.5%
Missing32
Missing (%)0.3%
Memory size156.2 KiB
2024-03-15T00:30:43.222077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length6
Mean length5.8923555
Min length1

Characters and Unicode

Total characters58735
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique276 ?
Unique (%)2.8%

Sample

1st row120000
2nd row401000
3rd row250000
4th row150000
5th row300000
ValueCountFrequency (%)
200000 833
 
8.4%
150000 814
 
8.2%
250000 718
 
7.2%
300000 578
 
5.8%
180000 398
 
4.0%
120000 357
 
3.6%
130000 354
 
3.6%
160000 352
 
3.5%
140000 322
 
3.2%
100000 281
 
2.8%
Other values (542) 4961
49.8%
2024-03-15T00:30:45.146496image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 40724
69.3%
1 4311
 
7.3%
2 3804
 
6.5%
5 2936
 
5.0%
3 2310
 
3.9%
4 1410
 
2.4%
8 1023
 
1.7%
6 996
 
1.7%
7 756
 
1.3%
9 455
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 58725
> 99.9%
Other Punctuation 10
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 40724
69.3%
1 4311
 
7.3%
2 3804
 
6.5%
5 2936
 
5.0%
3 2310
 
3.9%
4 1410
 
2.4%
8 1023
 
1.7%
6 996
 
1.7%
7 756
 
1.3%
9 455
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 58735
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 40724
69.3%
1 4311
 
7.3%
2 3804
 
6.5%
5 2936
 
5.0%
3 2310
 
3.9%
4 1410
 
2.4%
8 1023
 
1.7%
6 996
 
1.7%
7 756
 
1.3%
9 455
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 58735
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 40724
69.3%
1 4311
 
7.3%
2 3804
 
6.5%
5 2936
 
5.0%
3 2310
 
3.9%
4 1410
 
2.4%
8 1023
 
1.7%
6 996
 
1.7%
7 756
 
1.3%
9 455
 
0.8%

모의고사비
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct15
Distinct (%)0.2%
Missing2212
Missing (%)22.1%
Infinite0
Infinite (%)0.0%
Mean143.09194
Minimum0
Maximum100000
Zeros7729
Zeros (%)77.3%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T00:30:45.438374image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum100000
Range100000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2000.934
Coefficient of variation (CV)13.983555
Kurtosis892.76394
Mean143.09194
Median Absolute Deviation (MAD)0
Skewness23.838102
Sum1114400
Variance4003736.8
MonotonicityNot monotonic
2024-03-15T00:30:45.637249image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0 7729
77.3%
20000 19
 
0.2%
30000 10
 
0.1%
10000 10
 
0.1%
15000 5
 
0.1%
22000 3
 
< 0.1%
8000 3
 
< 0.1%
3000 2
 
< 0.1%
24000 1
 
< 0.1%
12000 1
 
< 0.1%
Other values (5) 5
 
0.1%
(Missing) 2212
 
22.1%
ValueCountFrequency (%)
0 7729
77.3%
3000 2
 
< 0.1%
3400 1
 
< 0.1%
4000 1
 
< 0.1%
8000 3
 
< 0.1%
9000 1
 
< 0.1%
10000 10
 
0.1%
11000 1
 
< 0.1%
12000 1
 
< 0.1%
15000 5
 
0.1%
ValueCountFrequency (%)
100000 1
 
< 0.1%
30000 10
0.1%
24000 1
 
< 0.1%
22000 3
 
< 0.1%
20000 19
0.2%
15000 5
 
0.1%
12000 1
 
< 0.1%
11000 1
 
< 0.1%
10000 10
0.1%
9000 1
 
< 0.1%

재료비
Real number (ℝ)

MISSING  ZEROS 

Distinct73
Distinct (%)0.9%
Missing2154
Missing (%)21.5%
Infinite0
Infinite (%)0.0%
Mean11626.609
Minimum0
Maximum3000000
Zeros7599
Zeros (%)76.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T00:30:45.862277image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum3000000
Range3000000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation121928.17
Coefficient of variation (CV)10.486993
Kurtosis279.87076
Mean11626.609
Median Absolute Deviation (MAD)0
Skewness15.234302
Sum91222375
Variance1.4866479 × 1010
MonotonicityNot monotonic
2024-03-15T00:30:46.115131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7599
76.0%
20000 24
 
0.2%
30000 21
 
0.2%
10000 16
 
0.2%
5000 13
 
0.1%
300000 13
 
0.1%
25000 10
 
0.1%
600000 9
 
0.1%
50000 9
 
0.1%
800000 8
 
0.1%
Other values (63) 124
 
1.2%
(Missing) 2154
 
21.5%
ValueCountFrequency (%)
0 7599
76.0%
800 1
 
< 0.1%
3000 1
 
< 0.1%
3125 1
 
< 0.1%
4000 1
 
< 0.1%
4400 2
 
< 0.1%
5000 13
 
0.1%
8800 1
 
< 0.1%
10000 16
 
0.2%
13200 1
 
< 0.1%
ValueCountFrequency (%)
3000000 3
< 0.1%
2600000 1
 
< 0.1%
2500000 2
< 0.1%
2100000 1
 
< 0.1%
1900000 1
 
< 0.1%
1850000 1
 
< 0.1%
1800000 3
< 0.1%
1500000 2
< 0.1%
1490000 1
 
< 0.1%
1400000 1
 
< 0.1%

급식비
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct6
Distinct (%)0.1%
Missing2391
Missing (%)23.9%
Infinite0
Infinite (%)0.0%
Mean184.12406
Minimum0
Maximum143000
Zeros7594
Zeros (%)75.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T00:30:46.324918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum143000
Range143000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation4818.5946
Coefficient of variation (CV)26.170369
Kurtosis726.46796
Mean184.12406
Median Absolute Deviation (MAD)0
Skewness26.880865
Sum1401000
Variance23218854
MonotonicityNot monotonic
2024-03-15T00:30:46.591855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 7594
75.9%
130000 9
 
0.1%
6000 3
 
< 0.1%
143000 1
 
< 0.1%
5000 1
 
< 0.1%
65000 1
 
< 0.1%
(Missing) 2391
 
23.9%
ValueCountFrequency (%)
0 7594
75.9%
5000 1
 
< 0.1%
6000 3
 
< 0.1%
65000 1
 
< 0.1%
130000 9
 
0.1%
143000 1
 
< 0.1%
ValueCountFrequency (%)
143000 1
 
< 0.1%
130000 9
 
0.1%
65000 1
 
< 0.1%
6000 3
 
< 0.1%
5000 1
 
< 0.1%
0 7594
75.9%

기숙사비
Real number (ℝ)

MISSING  ZEROS 

Distinct15
Distinct (%)0.2%
Missing2219
Missing (%)22.2%
Infinite0
Infinite (%)0.0%
Mean361.77869
Minimum0
Maximum110000
Zeros7710
Zeros (%)77.1%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T00:30:46.788227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum110000
Range110000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation4596.5869
Coefficient of variation (CV)12.705521
Kurtosis289.62515
Mean361.77869
Median Absolute Deviation (MAD)0
Skewness16.046832
Sum2815000
Variance21128611
MonotonicityNot monotonic
2024-03-15T00:30:46.976925image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0 7710
77.1%
20000 19
 
0.2%
30000 14
 
0.1%
10000 8
 
0.1%
62000 7
 
0.1%
100000 5
 
0.1%
50000 3
 
< 0.1%
80000 3
 
< 0.1%
24000 2
 
< 0.1%
60000 2
 
< 0.1%
Other values (5) 8
 
0.1%
(Missing) 2219
 
22.2%
ValueCountFrequency (%)
0 7710
77.1%
10000 8
 
0.1%
15000 2
 
< 0.1%
20000 19
 
0.2%
24000 2
 
< 0.1%
30000 14
 
0.1%
40000 2
 
< 0.1%
50000 3
 
< 0.1%
60000 2
 
< 0.1%
62000 7
 
0.1%
ValueCountFrequency (%)
110000 1
 
< 0.1%
100000 5
 
0.1%
83000 1
 
< 0.1%
80000 3
 
< 0.1%
70000 2
 
< 0.1%
62000 7
0.1%
60000 2
 
< 0.1%
50000 3
 
< 0.1%
40000 2
 
< 0.1%
30000 14
0.1%

차량비
Real number (ℝ)

MISSING  SKEWED  ZEROS 

Distinct10
Distinct (%)0.1%
Missing2338
Missing (%)23.4%
Infinite0
Infinite (%)0.0%
Mean378.36074
Minimum0
Maximum230000
Zeros7594
Zeros (%)75.9%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T00:30:47.161112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum230000
Range230000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation7236.5096
Coefficient of variation (CV)19.125953
Kurtosis925.4545
Mean378.36074
Median Absolute Deviation (MAD)0
Skewness29.508508
Sum2899000
Variance52367071
MonotonicityNot monotonic
2024-03-15T00:30:47.340135image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
0 7594
75.9%
20000 38
 
0.4%
10000 10
 
0.1%
230000 7
 
0.1%
30000 4
 
< 0.1%
35000 3
 
< 0.1%
50000 2
 
< 0.1%
40000 2
 
< 0.1%
15000 1
 
< 0.1%
9000 1
 
< 0.1%
(Missing) 2338
 
23.4%
ValueCountFrequency (%)
0 7594
75.9%
9000 1
 
< 0.1%
10000 10
 
0.1%
15000 1
 
< 0.1%
20000 38
 
0.4%
30000 4
 
< 0.1%
35000 3
 
< 0.1%
40000 2
 
< 0.1%
50000 2
 
< 0.1%
230000 7
 
0.1%
ValueCountFrequency (%)
230000 7
 
0.1%
50000 2
 
< 0.1%
40000 2
 
< 0.1%
35000 3
 
< 0.1%
30000 4
 
< 0.1%
20000 38
 
0.4%
15000 1
 
< 0.1%
10000 10
 
0.1%
9000 1
 
< 0.1%
0 7594
75.9%

피복비
Categorical

IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
7780 
<NA>
2218 
104000
 
1
350000
 
1

Length

Max length6
Median length1
Mean length1.6664
Min length1

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 7780
77.8%
<NA> 2218
 
22.2%
104000 1
 
< 0.1%
350000 1
 
< 0.1%

Length

2024-03-15T00:30:47.581039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:30:47.910764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 7780
77.8%
na 2218
 
22.2%
104000 1
 
< 0.1%
350000 1
 
< 0.1%

기타경비합계
Real number (ℝ)

MISSING  ZEROS 

Distinct89
Distinct (%)1.1%
Missing2116
Missing (%)21.2%
Infinite0
Infinite (%)0.0%
Mean12673.997
Minimum0
Maximum3000000
Zeros7484
Zeros (%)74.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T00:30:48.268262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile3358.75
Maximum3000000
Range3000000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation122466.14
Coefficient of variation (CV)9.6627872
Kurtosis273.51824
Mean12673.997
Median Absolute Deviation (MAD)0
Skewness14.995295
Sum99921795
Variance1.4997955 × 1010
MonotonicityNot monotonic
2024-03-15T00:30:48.708613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7484
74.8%
20000 71
 
0.7%
30000 34
 
0.3%
10000 29
 
0.3%
300000 13
 
0.1%
40000 12
 
0.1%
50000 12
 
0.1%
35000 10
 
0.1%
25000 10
 
0.1%
15000 10
 
0.1%
Other values (79) 199
 
2.0%
(Missing) 2116
 
21.2%
ValueCountFrequency (%)
0 7484
74.8%
800 1
 
< 0.1%
3000 3
 
< 0.1%
3125 1
 
< 0.1%
3400 1
 
< 0.1%
4000 2
 
< 0.1%
5000 8
 
0.1%
8000 3
 
< 0.1%
9000 2
 
< 0.1%
10000 29
 
0.3%
ValueCountFrequency (%)
3000000 3
< 0.1%
2600000 1
 
< 0.1%
2500000 2
< 0.1%
2100000 1
 
< 0.1%
1900000 1
 
< 0.1%
1850000 1
 
< 0.1%
1800000 3
< 0.1%
1500000 2
< 0.1%
1490000 1
 
< 0.1%
1400000 1
 
< 0.1%
Distinct585
Distinct (%)5.9%
Missing32
Missing (%)0.3%
Memory size156.2 KiB
2024-03-15T00:30:50.272809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length6
Mean length5.899378
Min length1

Characters and Unicode

Total characters58805
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique308 ?
Unique (%)3.1%

Sample

1st row120000
2nd row401000
3rd row250000
4th row150000
5th row300000
ValueCountFrequency (%)
200000 837
 
8.4%
150000 811
 
8.1%
250000 714
 
7.2%
300000 566
 
5.7%
180000 394
 
4.0%
160000 359
 
3.6%
130000 353
 
3.5%
120000 346
 
3.5%
140000 328
 
3.3%
100000 274
 
2.7%
Other values (575) 4986
50.0%
2024-03-15T00:30:51.908488image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 40729
69.3%
1 4320
 
7.3%
2 3818
 
6.5%
5 2964
 
5.0%
3 2323
 
4.0%
4 1408
 
2.4%
8 1016
 
1.7%
6 1002
 
1.7%
7 748
 
1.3%
9 467
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 58795
> 99.9%
Other Punctuation 10
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 40729
69.3%
1 4320
 
7.3%
2 3818
 
6.5%
5 2964
 
5.0%
3 2323
 
4.0%
4 1408
 
2.4%
8 1016
 
1.7%
6 1002
 
1.7%
7 748
 
1.3%
9 467
 
0.8%
Other Punctuation
ValueCountFrequency (%)
, 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 58805
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 40729
69.3%
1 4320
 
7.3%
2 3818
 
6.5%
5 2964
 
5.0%
3 2323
 
4.0%
4 1408
 
2.4%
8 1016
 
1.7%
6 1002
 
1.7%
7 748
 
1.3%
9 467
 
0.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 58805
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 40729
69.3%
1 4320
 
7.3%
2 3818
 
6.5%
5 2964
 
5.0%
3 2323
 
4.0%
4 1408
 
2.4%
8 1016
 
1.7%
6 1002
 
1.7%
7 748
 
1.3%
9 467
 
0.8%

강사수
Real number (ℝ)

ZEROS 

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.1491
Minimum0
Maximum41
Zeros249
Zeros (%)2.5%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-15T00:30:52.136861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q34
95-th percentile9
Maximum41
Range41
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.4750938
Coefficient of variation (CV)1.1035197
Kurtosis28.907351
Mean3.1491
Median Absolute Deviation (MAD)1
Skewness4.0856276
Sum31491
Variance12.076277
MonotonicityNot monotonic
2024-03-15T00:30:52.511401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
1 3441
34.4%
2 2180
21.8%
3 1243
 
12.4%
4 904
 
9.0%
5 609
 
6.1%
6 400
 
4.0%
0 249
 
2.5%
7 220
 
2.2%
8 186
 
1.9%
9 123
 
1.2%
Other values (14) 445
 
4.5%
ValueCountFrequency (%)
0 249
 
2.5%
1 3441
34.4%
2 2180
21.8%
3 1243
 
12.4%
4 904
 
9.0%
5 609
 
6.1%
6 400
 
4.0%
7 220
 
2.2%
8 186
 
1.9%
9 123
 
1.2%
ValueCountFrequency (%)
41 17
 
0.2%
24 13
 
0.1%
23 5
 
0.1%
21 3
 
< 0.1%
20 34
 
0.3%
18 8
 
0.1%
17 32
 
0.3%
16 21
 
0.2%
15 27
 
0.3%
14 97
1.0%

Sample

학원명학원종류학원주소설립자-성명전화번호교습계열교습과정교습과목(반)정원교습기간총교습시간(분)교습비모의고사비재료비급식비기숙사비차량비피복비기타경비합계총교습비강사수
7216멜로디음악학원학교교과교습학원전북특별자치도 전주시 완산구 중산중앙로 9-3 (중화산동2가)신은정063-223-4387예능(중)음악피아노기초-고81개월0일120012000000000001200001
13900쿼크수학영어전문학원학교교과교습학원전북특별자치도 전주시 완산구 효천천변길 24 , 302호 (효자동2가)김윤빈<NA>보통교과입시고등영어B101개월0일178440100000000004010003
5371참빛학원학교교과교습학원전북특별자치도 전주시 덕진구 석소로 27 , 201호 (인후동1가, 아중제일아파트)오치훈063-245-8891보통교과입시·논술중등수학논술1101개월0일117025000000000002500004
8161삼천카이스트학원학교교과교습학원전북특별자치도 전주시 완산구 효자천변1길 26-3 , 2층 (효자동1가)최문희063-224-3625보통교과보습수학A(초)281개월0일106015000000000001500002
1973솔내지앤비어학원학교교과교습학원전북특별자치도 전주시 덕진구 두간로 6 , 3층 (송천동1가)이은경063-277-0522외국어실용외국어(유아/초·중·고)영어중등D1231개월0일180030000000000003000002
6979한상훈영어전문학원학교교과교습학원전북특별자치도 전주시 덕진구 기지로 86 , 501호 (중동)한상훈063-223-8889보통교과보습고등 보습101개월0일1490340000000000034000013
26786부안탑학원학교교과교습학원전북특별자치도 부안군 부안읍 석정로 296 , 3층 (부안읍)이준호063-583-6818보통교과입시중고등(국,사,과)121개월20일900117000<NA><NA><NA><NA><NA><NA><NA>1170001
7853개념원리수학전문학원학교교과교습학원전북특별자치도 전주시 덕진구 시천로 40 , 2층 일부 (송천동1가)정현미<NA>보통교과보습수학(고2이과)101개월0일156030000000000003000003
3490백수림수학전문학원학교교과교습학원전북특별자치도 전주시 완산구 난전들로 249 , 3층 (삼천동1가)김동빈063-228-3372보통교과입시수학(초)191개월0일132018000000000001800002
17645YM플러스학원학교교과교습학원전북특별자치도 군산시 계산로 87-6 , 2층 201호 (지곡동)김선희<NA>보통교과보습중등영어101개월0일900160000<NA><NA><NA><NA><NA><NA><NA>1600002
학원명학원종류학원주소설립자-성명전화번호교습계열교습과정교습과목(반)정원교습기간총교습시간(분)교습비모의고사비재료비급식비기숙사비차량비피복비기타경비합계총교습비강사수
20855이안서가학원학교교과교습학원전북특별자치도 익산시 마한로5길 18-3 , 2층 (영등동, 킨더슐레어린이집)오정미063-834-5525보통교과보습국어(유치부,초등)1121개월0일90014800000000001480005
11988눈높이러닝센터에코자연학원학교교과교습학원전북특별자치도 전주시 덕진구 세병로 174-10 , 2층 (송천동2가, 대강빌딩2)(주)대교063-244-0906보통교과보습국어151개월0일252380000000000380005
3971눈높이러닝센터솔내학원학교교과교습학원전북특별자치도 전주시 덕진구 오송1길 37-7 , 302호 (송천동1가)(주)대교-박명규063-271-9109보통교과보습써밋어휘력151개월0일252400000000000400005
16694뻔뻔수학과학학원학교교과교습학원전북특별자치도 군산시 수송로 119 , 4층 (나운동, 은하빌딩)정대철063-466-6318보통교과보습과학B(고등)121개월0일1200250000<NA><NA><NA><NA><NA><NA><NA>2500002
24776그림마을미술학원학교교과교습학원전북특별자치도 김제시 동서로 207 (요촌동,그림마을미술학원)최혜라063-547-1475예능(중)미술미술(초등)201개월0일150018000000<NA>0<NA>001800001
27043영쓰영어학교교과교습학원전북특별자치도 부안군 부안읍 매창로 162 (부안읍)이영미<NA>보통교과보습고등영어141개월0일2300300000<NA><NA><NA><NA><NA><NA><NA>3000001
1679위너스영어수학학원학교교과교습학원전북특별자치도 전주시 완산구 양지2길 52-13 , 상가2동 302호 (평화동2가)송영희063-227-5900보통교과입시초등단과A(영어)61개월0일92013000000000001300001
9967TOP(탑)영어전문학원학교교과교습학원전북특별자치도 전주시 덕진구 세병로 174-9 , 501호 (송천동2가)박준규063-255-0413보통교과보습중등과학B121개월0일8101500000300000000300001800002
8523꿈트리클래스학원학교교과교습학원전북특별자치도 전주시 덕진구 오공로 49 , 4층 404호 (중동)윤인옥<NA>기타(중)기타(소)블록C201개월0일114019000000000001900004
23496연지음악학원학교교과교습학원전북특별자치도 정읍시 명륜길 29-4 , 솔로몬빌딩 2층 (연지동)김경선063-538-3323예능(중)음악성악중급150개월20일1000160000<NA><NA><NA><NA><NA><NA><NA>1600003

Duplicate rows

Most frequently occurring

학원명학원종류학원주소설립자-성명전화번호교습계열교습과정교습과목(반)정원교습기간총교습시간(분)교습비모의고사비재료비급식비기숙사비차량비피복비기타경비합계총교습비강사수# duplicates
0배쌤과학학원학교교과교습학원전북특별자치도 전주시 덕진구 오공로 43-17 , 301호 일부 (중동)배주희<NA>보통교과입시고등화학1371개월0일1264300000000000030000012
1수석어학원학교교과교습학원전북특별자치도 전주시 덕진구 세병로 8 , 602호 (송천동2가, 에코시티 데시앙 5블럭)김상은<NA>외국어실용외국어(유아/초·중·고)내국인(중)81개월0일1280280000000000028000042
2잉글리쉬무무송천보습학원학교교과교습학원전북특별자치도 전주시 덕진구 오송1길 19 , 301호 (송천동1가)박인권063-255-9905보통교과보습중등영어(문법)941개월0일1200125000000000012500032