Overview

Dataset statistics

Number of variables23
Number of observations8596
Missing cells101
Missing cells (%)0.1%
Duplicate rows29
Duplicate rows (%)0.3%
Total size in memory1.6 MiB
Average record size in memory197.0 B

Variable types

Categorical11
Text5
Numeric7

Dataset

Description경상북도 내 교습소에 대하여 교습소명, 교습자명, 교습소주소, 교습과목, 교습비, 정원, 기타경비 등의 항목을 제공합니다.
Author경상북도교육청
URLhttps://www.data.go.kr/data/15048353/fileData.do

Alerts

간식비 has constant value ""Constant
기숙사비 has constant value ""Constant
차량비 has constant value ""Constant
Dataset has 29 (0.3%) duplicate rowsDuplicates
등록상태 is highly imbalanced (95.5%)Imbalance
교습계열 is highly imbalanced (52.1%)Imbalance
교습과정 is highly imbalanced (53.7%)Imbalance
모의고사비 is highly imbalanced (99.8%)Imbalance
급식비 is highly imbalanced (99.7%)Imbalance
피복비 is highly imbalanced (99.7%)Imbalance
교습소주소 has 86 (1.0%) missing valuesMissing
기타경비합계 is highly skewed (γ1 = 22.42170524)Skewed
재료비 has 8453 (98.3%) zerosZeros
기타경비합계 has 8447 (98.3%) zerosZeros
총교습비 has 1121 (13.0%) zerosZeros
총교습비(시간당) has 1280 (14.9%) zerosZeros

Reproduction

Analysis started2024-03-14 15:30:55.528184
Analysis finished2024-03-14 15:30:57.471380
Duration1.94 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

지역
Categorical

Distinct22
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
포항시
2379 
구미시
1778 
경산시
1058 
경주시
963 
안동시
515 
Other values (17)
1903 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row포항시
2nd row포항시
3rd row포항시
4th row포항시
5th row포항시

Common Values

ValueCountFrequency (%)
포항시 2379
27.7%
구미시 1778
20.7%
경산시 1058
12.3%
경주시 963
11.2%
안동시 515
 
6.0%
김천시 459
 
5.3%
상주시 287
 
3.3%
영주시 238
 
2.8%
영천시 225
 
2.6%
칠곡군 169
 
2.0%
Other values (12) 525
 
6.1%

Length

2024-03-15T00:30:57.655208image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
포항시 2379
27.7%
구미시 1778
20.7%
경산시 1058
12.3%
경주시 963
11.2%
안동시 515
 
6.0%
김천시 459
 
5.3%
상주시 287
 
3.3%
영주시 238
 
2.8%
영천시 225
 
2.6%
칠곡군 169
 
2.0%
Other values (12) 525
 
6.1%
Distinct1628
Distinct (%)18.9%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
2024-03-15T00:30:58.988707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length4.9866217
Min length2

Characters and Unicode

Total characters42865
Distinct characters38
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique76 ?
Unique (%)0.9%

Sample

1st row2008-26
2nd row2008-26
3rd row2008-26
4th row2008-26
5th row2008-26
ValueCountFrequency (%)
2521 51
 
0.6%
2528 42
 
0.5%
2556 42
 
0.5%
2475 42
 
0.5%
690 38
 
0.4%
01월 37
 
0.4%
1052 27
 
0.3%
1781 25
 
0.3%
apr-22 25
 
0.3%
may-11 21
 
0.2%
Other values (1619) 8303
96.0%
2024-03-15T00:31:01.003047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 8135
19.0%
1 5676
13.2%
0 5277
12.3%
- 3710
8.7%
3 2796
 
6.5%
4 2590
 
6.0%
5 2489
 
5.8%
6 2252
 
5.3%
9 2128
 
5.0%
7 2124
 
5.0%
Other values (28) 5688
13.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 35571
83.0%
Dash Punctuation 3710
 
8.7%
Lowercase Letter 2236
 
5.2%
Uppercase Letter 1118
 
2.6%
Other Letter 173
 
0.4%
Space Separator 57
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 342
15.3%
e 275
12.3%
u 253
11.3%
n 225
10.1%
p 223
10.0%
r 220
9.8%
y 125
 
5.6%
c 125
 
5.6%
b 110
 
4.9%
g 83
 
3.7%
Other values (4) 255
11.4%
Decimal Number
ValueCountFrequency (%)
2 8135
22.9%
1 5676
16.0%
0 5277
14.8%
3 2796
 
7.9%
4 2590
 
7.3%
5 2489
 
7.0%
6 2252
 
6.3%
9 2128
 
6.0%
7 2124
 
6.0%
8 2104
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
J 294
26.3%
M 218
19.5%
A 210
18.8%
F 110
 
9.8%
S 96
 
8.6%
D 69
 
6.2%
N 65
 
5.8%
O 56
 
5.0%
Other Letter
ValueCountFrequency (%)
57
32.9%
57
32.9%
32
18.5%
27
15.6%
Dash Punctuation
ValueCountFrequency (%)
- 3710
100.0%
Space Separator
ValueCountFrequency (%)
57
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 39338
91.8%
Latin 3354
 
7.8%
Hangul 173
 
0.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 342
 
10.2%
J 294
 
8.8%
e 275
 
8.2%
u 253
 
7.5%
n 225
 
6.7%
p 223
 
6.6%
r 220
 
6.6%
M 218
 
6.5%
A 210
 
6.3%
y 125
 
3.7%
Other values (12) 969
28.9%
Common
ValueCountFrequency (%)
2 8135
20.7%
1 5676
14.4%
0 5277
13.4%
- 3710
9.4%
3 2796
 
7.1%
4 2590
 
6.6%
5 2489
 
6.3%
6 2252
 
5.7%
9 2128
 
5.4%
7 2124
 
5.4%
Other values (2) 2161
 
5.5%
Hangul
ValueCountFrequency (%)
57
32.9%
57
32.9%
32
18.5%
27
15.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 42692
99.6%
Hangul 173
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 8135
19.1%
1 5676
13.3%
0 5277
12.4%
- 3710
8.7%
3 2796
 
6.5%
4 2590
 
6.1%
5 2489
 
5.8%
6 2252
 
5.3%
9 2128
 
5.0%
7 2124
 
5.0%
Other values (24) 5515
12.9%
Hangul
ValueCountFrequency (%)
57
32.9%
57
32.9%
32
18.5%
27
15.6%
Distinct1778
Distinct (%)20.7%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
2024-03-15T00:31:02.002006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length37
Median length31
Mean length9.8182876
Min length5

Characters and Unicode

Total characters84398
Distinct characters637
Distinct categories11 ?
Distinct scripts4 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)1.2%

Sample

1st row강선생수학교습소
2nd row강선생수학교습소
3rd row강선생수학교습소
4th row강선생수학교습소
5th row강선생수학교습소
ValueCountFrequency (%)
플라톤아카데미비산독서논술교습소 51
 
0.6%
플라톤아카데미봉곡독서논술교습소 42
 
0.5%
플라톤아카데미도량5주공독서논술교습소 42
 
0.5%
플라톤독서토론논술교습소 42
 
0.5%
수학교습소 37
 
0.4%
하이어영어교습소 30
 
0.3%
리드인독서논술교습소 30
 
0.3%
생각나무 27
 
0.3%
독서논술교습소 27
 
0.3%
교습소 26
 
0.3%
Other values (1815) 8541
96.0%
2024-03-15T00:31:03.440267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8801
 
10.4%
8770
 
10.4%
8576
 
10.2%
2672
 
3.2%
2470
 
2.9%
2319
 
2.7%
2308
 
2.7%
2249
 
2.7%
1770
 
2.1%
1758
 
2.1%
Other values (627) 42705
50.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 81160
96.2%
Uppercase Letter 1111
 
1.3%
Lowercase Letter 956
 
1.1%
Space Separator 299
 
0.4%
Open Punctuation 295
 
0.3%
Close Punctuation 295
 
0.3%
Decimal Number 189
 
0.2%
Other Punctuation 79
 
0.1%
Dash Punctuation 10
 
< 0.1%
Connector Punctuation 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8801
 
10.8%
8770
 
10.8%
8576
 
10.6%
2672
 
3.3%
2470
 
3.0%
2319
 
2.9%
2308
 
2.8%
2249
 
2.8%
1770
 
2.2%
1758
 
2.2%
Other values (564) 39467
48.6%
Uppercase Letter
ValueCountFrequency (%)
M 97
 
8.7%
E 86
 
7.7%
I 84
 
7.6%
S 81
 
7.3%
T 74
 
6.7%
A 68
 
6.1%
N 68
 
6.1%
C 57
 
5.1%
U 53
 
4.8%
O 52
 
4.7%
Other values (14) 391
35.2%
Lowercase Letter
ValueCountFrequency (%)
a 117
12.2%
l 103
10.8%
e 90
9.4%
o 82
 
8.6%
s 81
 
8.5%
i 79
 
8.3%
n 56
 
5.9%
h 55
 
5.8%
t 50
 
5.2%
r 35
 
3.7%
Other values (12) 208
21.8%
Decimal Number
ValueCountFrequency (%)
0 58
30.7%
3 56
29.6%
5 42
22.2%
2 13
 
6.9%
7 12
 
6.3%
1 8
 
4.2%
Other Punctuation
ValueCountFrequency (%)
' 29
36.7%
. 23
29.1%
& 12
15.2%
, 11
 
13.9%
· 4
 
5.1%
Space Separator
ValueCountFrequency (%)
299
100.0%
Open Punctuation
ValueCountFrequency (%)
( 295
100.0%
Close Punctuation
ValueCountFrequency (%)
) 295
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Math Symbol
ValueCountFrequency (%)
+ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 81145
96.1%
Latin 2067
 
2.4%
Common 1171
 
1.4%
Han 15
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8801
 
10.8%
8770
 
10.8%
8576
 
10.6%
2672
 
3.3%
2470
 
3.0%
2319
 
2.9%
2308
 
2.8%
2249
 
2.8%
1770
 
2.2%
1758
 
2.2%
Other values (562) 39452
48.6%
Latin
ValueCountFrequency (%)
a 117
 
5.7%
l 103
 
5.0%
M 97
 
4.7%
e 90
 
4.4%
E 86
 
4.2%
I 84
 
4.1%
o 82
 
4.0%
s 81
 
3.9%
S 81
 
3.9%
i 79
 
3.8%
Other values (36) 1167
56.5%
Common
ValueCountFrequency (%)
299
25.5%
( 295
25.2%
) 295
25.2%
0 58
 
5.0%
3 56
 
4.8%
5 42
 
3.6%
' 29
 
2.5%
. 23
 
2.0%
2 13
 
1.1%
& 12
 
1.0%
Other values (7) 49
 
4.2%
Han
ValueCountFrequency (%)
12
80.0%
3
 
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 81145
96.1%
ASCII 3234
 
3.8%
CJK 15
 
< 0.1%
None 4
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
8801
 
10.8%
8770
 
10.8%
8576
 
10.6%
2672
 
3.3%
2470
 
3.0%
2319
 
2.9%
2308
 
2.8%
2249
 
2.8%
1770
 
2.2%
1758
 
2.2%
Other values (562) 39452
48.6%
ASCII
ValueCountFrequency (%)
299
 
9.2%
( 295
 
9.1%
) 295
 
9.1%
a 117
 
3.6%
l 103
 
3.2%
M 97
 
3.0%
e 90
 
2.8%
E 86
 
2.7%
I 84
 
2.6%
o 82
 
2.5%
Other values (52) 1686
52.1%
CJK
ValueCountFrequency (%)
12
80.0%
3
 
20.0%
None
ValueCountFrequency (%)
· 4
100.0%
Distinct1662
Distinct (%)19.3%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
2024-03-15T00:31:04.698687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length35
Median length3
Mean length3.0638669
Min length2

Characters and Unicode

Total characters26337
Distinct characters261
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique86 ?
Unique (%)1.0%

Sample

1st row강태욱
2nd row강태욱
3rd row강태욱
4th row강태욱
5th row강태욱
ValueCountFrequency (%)
김현숙 51
 
0.6%
이영주 49
 
0.6%
김혜영 43
 
0.5%
정재희 42
 
0.5%
김은정 36
 
0.4%
김지은 32
 
0.4%
김은영 32
 
0.4%
이광현 30
 
0.3%
이현주 28
 
0.3%
정호정 27
 
0.3%
Other values (1666) 8283
95.7%
2024-03-15T00:31:06.368006image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1899
 
7.2%
1564
 
5.9%
1395
 
5.3%
975
 
3.7%
902
 
3.4%
793
 
3.0%
775
 
2.9%
719
 
2.7%
651
 
2.5%
642
 
2.4%
Other values (251) 16022
60.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 25737
97.7%
Uppercase Letter 507
 
1.9%
Space Separator 57
 
0.2%
Close Punctuation 18
 
0.1%
Open Punctuation 18
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1899
 
7.4%
1564
 
6.1%
1395
 
5.4%
975
 
3.8%
902
 
3.5%
793
 
3.1%
775
 
3.0%
719
 
2.8%
651
 
2.5%
642
 
2.5%
Other values (226) 15422
59.9%
Uppercase Letter
ValueCountFrequency (%)
A 71
14.0%
N 53
10.5%
I 47
 
9.3%
O 45
 
8.9%
G 37
 
7.3%
H 32
 
6.3%
E 30
 
5.9%
Y 27
 
5.3%
K 22
 
4.3%
W 21
 
4.1%
Other values (12) 122
24.1%
Space Separator
ValueCountFrequency (%)
57
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 25737
97.7%
Latin 507
 
1.9%
Common 93
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1899
 
7.4%
1564
 
6.1%
1395
 
5.4%
975
 
3.8%
902
 
3.5%
793
 
3.1%
775
 
3.0%
719
 
2.8%
651
 
2.5%
642
 
2.5%
Other values (226) 15422
59.9%
Latin
ValueCountFrequency (%)
A 71
14.0%
N 53
10.5%
I 47
 
9.3%
O 45
 
8.9%
G 37
 
7.3%
H 32
 
6.3%
E 30
 
5.9%
Y 27
 
5.3%
K 22
 
4.3%
W 21
 
4.1%
Other values (12) 122
24.1%
Common
ValueCountFrequency (%)
57
61.3%
) 18
 
19.4%
( 18
 
19.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 25737
97.7%
ASCII 600
 
2.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1899
 
7.4%
1564
 
6.1%
1395
 
5.4%
975
 
3.8%
902
 
3.5%
793
 
3.1%
775
 
3.0%
719
 
2.8%
651
 
2.5%
642
 
2.5%
Other values (226) 15422
59.9%
ASCII
ValueCountFrequency (%)
A 71
 
11.8%
57
 
9.5%
N 53
 
8.8%
I 47
 
7.8%
O 45
 
7.5%
G 37
 
6.2%
H 32
 
5.3%
E 30
 
5.0%
Y 27
 
4.5%
K 22
 
3.7%
Other values (15) 179
29.8%

등록상태
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
개원
8526 
자진휴원(소)
 
66
직권휴원(소)
 
4

Length

Max length7
Median length2
Mean length2.0407166
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개원
2nd row개원
3rd row개원
4th row개원
5th row개원

Common Values

ValueCountFrequency (%)
개원 8526
99.2%
자진휴원(소) 66
 
0.8%
직권휴원(소) 4
 
< 0.1%

Length

2024-03-15T00:31:06.786404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:31:07.110456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개원 8526
99.2%
자진휴원(소 66
 
0.8%
직권휴원(소 4
 
< 0.1%

우편번호
Real number (ℝ)

Distinct629
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean129737.43
Minimum36025
Maximum791946
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size75.7 KiB
2024-03-15T00:31:07.407557image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36025
5-th percentile36641
Q137618
median38450
Q339311
95-th percentile790885
Maximum791946
Range755921
Interquartile range (IQR)1693

Descriptive statistics

Standard deviation241627.68
Coefficient of variation (CV)1.8624361
Kurtosis3.1558035
Mean129737.43
Median Absolute Deviation (MAD)847
Skewness2.2666136
Sum1.115223 × 109
Variance5.8383934 × 1010
MonotonicityNot monotonic
2024-03-15T00:31:07.677338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
39660 284
 
3.3%
38069 129
 
1.5%
38662 107
 
1.2%
39147 94
 
1.1%
36722 91
 
1.1%
38084 89
 
1.0%
39146 85
 
1.0%
38687 81
 
0.9%
37591 79
 
0.9%
39205 72
 
0.8%
Other values (619) 7485
87.1%
ValueCountFrequency (%)
36025 1
 
< 0.1%
36026 10
0.1%
36028 3
 
< 0.1%
36056 5
0.1%
36066 4
 
< 0.1%
36068 3
 
< 0.1%
36072 12
0.1%
36077 2
 
< 0.1%
36079 4
 
< 0.1%
36082 4
 
< 0.1%
ValueCountFrequency (%)
791946 5
 
0.1%
791943 15
0.2%
791852 8
0.1%
791851 1
 
< 0.1%
791850 1
 
< 0.1%
791848 12
0.1%
791847 9
0.1%
791846 5
 
0.1%
791844 3
 
< 0.1%
791842 6
 
0.1%

교습소주소
Text

MISSING 

Distinct1815
Distinct (%)21.3%
Missing86
Missing (%)1.0%
Memory size67.3 KiB
2024-03-15T00:31:08.926969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length75
Median length55
Mean length37.76886
Min length19

Characters and Unicode

Total characters321413
Distinct characters428
Distinct categories12 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique97 ?
Unique (%)1.1%

Sample

1st row경상북도 포항시 남구 대이로 97, , 3층 (대잠동)
2nd row경상북도 포항시 남구 대이로 97, , 3층 (대잠동)
3rd row경상북도 포항시 남구 대이로 97, , 3층 (대잠동)
4th row경상북도 포항시 남구 대이로 97, , 3층 (대잠동)
5th row경상북도 포항시 남구 대이로 97, , 3층 (대잠동)
ValueCountFrequency (%)
경상북도 8510
 
12.2%
6789
 
9.8%
포항시 2332
 
3.3%
구미시 1763
 
2.5%
북구 1541
 
2.2%
2층 1369
 
2.0%
1층 1294
 
1.9%
경산시 1052
 
1.5%
경주시 963
 
1.4%
상가동 852
 
1.2%
Other values (2347) 43147
62.0%
2024-03-15T00:31:10.744228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
61560
 
19.2%
, 18052
 
5.6%
1 12639
 
3.9%
11752
 
3.7%
11162
 
3.5%
10781
 
3.4%
10358
 
3.2%
9430
 
2.9%
2 9175
 
2.9%
8436
 
2.6%
Other values (418) 158068
49.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 172178
53.6%
Space Separator 61560
 
19.2%
Decimal Number 49636
 
15.4%
Other Punctuation 18126
 
5.6%
Open Punctuation 8417
 
2.6%
Close Punctuation 8408
 
2.6%
Dash Punctuation 2431
 
0.8%
Uppercase Letter 571
 
0.2%
Lowercase Letter 64
 
< 0.1%
Letter Number 11
 
< 0.1%
Other values (2) 11
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11752
 
6.8%
11162
 
6.5%
10781
 
6.3%
10358
 
6.0%
9430
 
5.5%
8436
 
4.9%
6854
 
4.0%
5388
 
3.1%
4816
 
2.8%
4745
 
2.8%
Other values (372) 88456
51.4%
Uppercase Letter
ValueCountFrequency (%)
W 83
14.5%
A 70
12.3%
K 61
10.7%
B 56
9.8%
S 46
8.1%
E 43
7.5%
V 40
7.0%
I 40
7.0%
C 37
6.5%
L 16
 
2.8%
Other values (11) 79
13.8%
Decimal Number
ValueCountFrequency (%)
1 12639
25.5%
2 9175
18.5%
0 6756
13.6%
3 5522
11.1%
4 3471
 
7.0%
5 3374
 
6.8%
6 2504
 
5.0%
8 2281
 
4.6%
7 2206
 
4.4%
9 1708
 
3.4%
Other Punctuation
ValueCountFrequency (%)
, 18052
99.6%
@ 44
 
0.2%
· 15
 
0.1%
/ 12
 
0.1%
: 3
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
e 42
65.6%
c 12
 
18.8%
k 10
 
15.6%
Space Separator
ValueCountFrequency (%)
61560
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8417
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8408
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2431
100.0%
Letter Number
ValueCountFrequency (%)
11
100.0%
Math Symbol
ValueCountFrequency (%)
~ 7
100.0%
Other Symbol
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 172182
53.6%
Common 148585
46.2%
Latin 646
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11752
 
6.8%
11162
 
6.5%
10781
 
6.3%
10358
 
6.0%
9430
 
5.5%
8436
 
4.9%
6854
 
4.0%
5388
 
3.1%
4816
 
2.8%
4745
 
2.8%
Other values (373) 88460
51.4%
Latin
ValueCountFrequency (%)
W 83
12.8%
A 70
10.8%
K 61
9.4%
B 56
8.7%
S 46
 
7.1%
E 43
 
6.7%
e 42
 
6.5%
V 40
 
6.2%
I 40
 
6.2%
C 37
 
5.7%
Other values (15) 128
19.8%
Common
ValueCountFrequency (%)
61560
41.4%
, 18052
 
12.1%
1 12639
 
8.5%
2 9175
 
6.2%
( 8417
 
5.7%
) 8408
 
5.7%
0 6756
 
4.5%
3 5522
 
3.7%
4 3471
 
2.3%
5 3374
 
2.3%
Other values (10) 11211
 
7.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 172178
53.6%
ASCII 149205
46.4%
None 19
 
< 0.1%
Number Forms 11
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
61560
41.3%
, 18052
 
12.1%
1 12639
 
8.5%
2 9175
 
6.1%
( 8417
 
5.6%
) 8408
 
5.6%
0 6756
 
4.5%
3 5522
 
3.7%
4 3471
 
2.3%
5 3374
 
2.3%
Other values (33) 11831
 
7.9%
Hangul
ValueCountFrequency (%)
11752
 
6.8%
11162
 
6.5%
10781
 
6.3%
10358
 
6.0%
9430
 
5.5%
8436
 
4.9%
6854
 
4.0%
5388
 
3.1%
4816
 
2.8%
4745
 
2.8%
Other values (372) 88456
51.4%
None
ValueCountFrequency (%)
· 15
78.9%
4
 
21.1%
Number Forms
ValueCountFrequency (%)
11
100.0%

일시수용능력인원
Real number (ℝ)

Distinct14
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.8700558
Minimum0
Maximum29
Zeros76
Zeros (%)0.9%
Negative0
Negative (%)0.0%
Memory size75.7 KiB
2024-03-15T00:31:11.108473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile4
Q15
median7
Q39
95-th percentile9
Maximum29
Range29
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.1826928
Coefficient of variation (CV)0.31771108
Kurtosis8.5581431
Mean6.8700558
Median Absolute Deviation (MAD)2
Skewness0.5363353
Sum59055
Variance4.764148
MonotonicityNot monotonic
2024-03-15T00:31:11.474342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
9 2989
34.8%
5 1556
18.1%
6 980
 
11.4%
7 925
 
10.8%
8 921
 
10.7%
4 839
 
9.8%
3 242
 
2.8%
0 76
 
0.9%
2 42
 
0.5%
29 7
 
0.1%
Other values (4) 19
 
0.2%
ValueCountFrequency (%)
0 76
 
0.9%
2 42
 
0.5%
3 242
 
2.8%
4 839
 
9.8%
5 1556
18.1%
6 980
 
11.4%
7 925
 
10.8%
8 921
 
10.7%
9 2989
34.8%
10 3
 
< 0.1%
ValueCountFrequency (%)
29 7
 
0.1%
20 5
 
0.1%
16 5
 
0.1%
12 6
 
0.1%
10 3
 
< 0.1%
9 2989
34.8%
8 921
 
10.7%
7 925
 
10.8%
6 980
 
11.4%
5 1556
18.1%

분야구분
Categorical

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
입시.검정 및 보습
4700 
예능(대)
3364 
국제화
 
315
기타(대)
 
168
정보
 
34

Length

Max length10
Median length10
Mean length7.6469288
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row입시.검정 및 보습
2nd row입시.검정 및 보습
3rd row입시.검정 및 보습
4th row입시.검정 및 보습
5th row입시.검정 및 보습

Common Values

ValueCountFrequency (%)
입시.검정 및 보습 4700
54.7%
예능(대) 3364
39.1%
국제화 315
 
3.7%
기타(대) 168
 
2.0%
정보 34
 
0.4%
<NA> 15
 
0.2%

Length

2024-03-15T00:31:11.901737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:31:12.263933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
입시.검정 4700
26.1%
4700
26.1%
보습 4700
26.1%
예능(대 3364
18.7%
국제화 315
 
1.8%
기타(대 168
 
0.9%
정보 34
 
0.2%
na 15
 
0.1%

교습계열
Categorical

IMBALANCE 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
보통교과
4696 
예능(중)
3364 
외국어
 
315
기타(중)
 
168
정보
 
34
Other values (2)
 
19

Length

Max length5
Median length4
Mean length4.3663332
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보통교과
2nd row보통교과
3rd row보통교과
4th row보통교과
5th row보통교과

Common Values

ValueCountFrequency (%)
보통교과 4696
54.6%
예능(중) 3364
39.1%
외국어 315
 
3.7%
기타(중) 168
 
2.0%
정보 34
 
0.4%
<NA> 15
 
0.2%
진학지도 4
 
< 0.1%

Length

2024-03-15T00:31:12.616856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:31:12.840013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보통교과 4696
54.6%
예능(중 3364
39.1%
외국어 315
 
3.7%
기타(중 168
 
2.0%
정보 34
 
0.4%
na 15
 
0.2%
진학지도 4
 
< 0.1%

교습과정
Categorical

IMBALANCE 

Distinct23
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
보습
4271 
음악
2228 
미술
1099 
실용외국어(유아/초·중·고)
 
311
보습·논술
 
273
Other values (18)
 
414

Length

Max length15
Median length2
Mean length2.6528618
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row보습
2nd row보습
3rd row보습
4th row보습
5th row보습

Common Values

ValueCountFrequency (%)
보습 4271
49.7%
음악 2228
25.9%
미술 1099
 
12.8%
실용외국어(유아/초·중·고) 311
 
3.6%
보습·논술 273
 
3.2%
보통교과 129
 
1.5%
컴퓨터(소) 55
 
0.6%
서예 41
 
0.5%
예능(중) 28
 
0.3%
정보 22
 
0.3%
Other values (13) 139
 
1.6%

Length

2024-03-15T00:31:13.067196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보습 4271
49.7%
음악 2228
25.9%
미술 1099
 
12.8%
실용외국어(유아/초·중·고 311
 
3.6%
보습·논술 273
 
3.2%
보통교과 129
 
1.5%
컴퓨터(소 55
 
0.6%
서예 41
 
0.5%
예능(중 28
 
0.3%
정보 22
 
0.3%
Other values (13) 139
 
1.6%
Distinct3024
Distinct (%)35.2%
Missing15
Missing (%)0.2%
Memory size67.3 KiB
2024-03-15T00:31:14.149378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length17
Mean length5.7309171
Min length1

Characters and Unicode

Total characters49177
Distinct characters340
Distinct categories13 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2186 ?
Unique (%)25.5%

Sample

1st row수학중등D
2nd row수학초등A
3rd row수학고등C
4th row수학고등B
5th row수학고등A
ValueCountFrequency (%)
피아노 362
 
3.7%
미술 200
 
2.1%
피아노(초급 196
 
2.0%
피아노(중급 195
 
2.0%
피아노(고급 191
 
2.0%
수학 181
 
1.9%
영어 153
 
1.6%
고급 130
 
1.3%
중등수학 121
 
1.3%
초급 117
 
1.2%
Other values (2686) 7814
80.9%
2024-03-15T00:31:15.513948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3990
 
8.1%
( 3619
 
7.4%
) 3618
 
7.4%
2978
 
6.1%
2556
 
5.2%
2454
 
5.0%
2219
 
4.5%
2093
 
4.3%
2071
 
4.2%
2005
 
4.1%
Other values (330) 21574
43.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 35812
72.8%
Open Punctuation 3624
 
7.4%
Close Punctuation 3623
 
7.4%
Uppercase Letter 2460
 
5.0%
Decimal Number 1973
 
4.0%
Space Separator 1081
 
2.2%
Other Punctuation 213
 
0.4%
Connector Punctuation 111
 
0.2%
Dash Punctuation 108
 
0.2%
Letter Number 94
 
0.2%
Other values (3) 78
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3990
 
11.1%
2978
 
8.3%
2556
 
7.1%
2454
 
6.9%
2219
 
6.2%
2093
 
5.8%
2071
 
5.8%
2005
 
5.6%
1814
 
5.1%
1661
 
4.6%
Other values (264) 11971
33.4%
Uppercase Letter
ValueCountFrequency (%)
A 896
36.4%
B 857
34.8%
C 331
 
13.5%
D 119
 
4.8%
E 60
 
2.4%
I 31
 
1.3%
F 31
 
1.3%
H 20
 
0.8%
S 18
 
0.7%
P 14
 
0.6%
Other values (12) 83
 
3.4%
Lowercase Letter
ValueCountFrequency (%)
c 6
14.3%
a 6
14.3%
e 5
11.9%
d 4
9.5%
r 4
9.5%
t 4
9.5%
n 3
7.1%
i 3
7.1%
b 1
 
2.4%
o 1
 
2.4%
Other values (5) 5
11.9%
Decimal Number
ValueCountFrequency (%)
1 564
28.6%
2 551
27.9%
3 366
18.6%
4 179
 
9.1%
5 119
 
6.0%
0 87
 
4.4%
6 77
 
3.9%
7 13
 
0.7%
8 12
 
0.6%
9 5
 
0.3%
Other Punctuation
ValueCountFrequency (%)
, 160
75.1%
. 25
 
11.7%
· 13
 
6.1%
/ 12
 
5.6%
& 3
 
1.4%
Letter Number
ValueCountFrequency (%)
45
47.9%
38
40.4%
9
 
9.6%
2
 
2.1%
Open Punctuation
ValueCountFrequency (%)
( 3619
99.9%
[ 5
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 3618
99.9%
] 5
 
0.1%
Math Symbol
ValueCountFrequency (%)
~ 28
80.0%
+ 7
 
20.0%
Space Separator
ValueCountFrequency (%)
1081
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 111
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 108
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 35812
72.8%
Common 10769
 
21.9%
Latin 2596
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3990
 
11.1%
2978
 
8.3%
2556
 
7.1%
2454
 
6.9%
2219
 
6.2%
2093
 
5.8%
2071
 
5.8%
2005
 
5.6%
1814
 
5.1%
1661
 
4.6%
Other values (264) 11971
33.4%
Latin
ValueCountFrequency (%)
A 896
34.5%
B 857
33.0%
C 331
 
12.8%
D 119
 
4.6%
E 60
 
2.3%
45
 
1.7%
38
 
1.5%
I 31
 
1.2%
F 31
 
1.2%
H 20
 
0.8%
Other values (31) 168
 
6.5%
Common
ValueCountFrequency (%)
( 3619
33.6%
) 3618
33.6%
1081
 
10.0%
1 564
 
5.2%
2 551
 
5.1%
3 366
 
3.4%
4 179
 
1.7%
, 160
 
1.5%
5 119
 
1.1%
_ 111
 
1.0%
Other values (15) 401
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 35812
72.8%
ASCII 13258
 
27.0%
Number Forms 94
 
0.2%
None 13
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
3990
 
11.1%
2978
 
8.3%
2556
 
7.1%
2454
 
6.9%
2219
 
6.2%
2093
 
5.8%
2071
 
5.8%
2005
 
5.6%
1814
 
5.1%
1661
 
4.6%
Other values (264) 11971
33.4%
ASCII
ValueCountFrequency (%)
( 3619
27.3%
) 3618
27.3%
1081
 
8.2%
A 896
 
6.8%
B 857
 
6.5%
1 564
 
4.3%
2 551
 
4.2%
3 366
 
2.8%
C 331
 
2.5%
4 179
 
1.4%
Other values (51) 1196
 
9.0%
Number Forms
ValueCountFrequency (%)
45
47.9%
38
40.4%
9
 
9.6%
2
 
2.1%
None
ValueCountFrequency (%)
· 13
100.0%

정원
Real number (ℝ)

Distinct40
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.1795021
Minimum0
Maximum72
Zeros42
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size75.7 KiB
2024-03-15T00:31:15.744502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q15
median7
Q39
95-th percentile10
Maximum72
Range72
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.7718536
Coefficient of variation (CV)0.52536423
Kurtosis49.653226
Mean7.1795021
Median Absolute Deviation (MAD)2
Skewness4.8916441
Sum61715
Variance14.226879
MonotonicityNot monotonic
2024-03-15T00:31:16.126534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=40)
ValueCountFrequency (%)
9 2564
29.8%
5 1678
19.5%
6 893
 
10.4%
8 870
 
10.1%
7 834
 
9.7%
4 803
 
9.3%
3 301
 
3.5%
10 131
 
1.5%
2 105
 
1.2%
1 66
 
0.8%
Other values (30) 351
 
4.1%
ValueCountFrequency (%)
0 42
 
0.5%
1 66
 
0.8%
2 105
 
1.2%
3 301
 
3.5%
4 803
 
9.3%
5 1678
19.5%
6 893
 
10.4%
7 834
 
9.7%
8 870
 
10.1%
9 2564
29.8%
ValueCountFrequency (%)
72 1
 
< 0.1%
63 3
 
< 0.1%
60 1
 
< 0.1%
54 1
 
< 0.1%
45 3
 
< 0.1%
42 2
 
< 0.1%
40 2
 
< 0.1%
36 8
0.1%
35 2
 
< 0.1%
32 1
 
< 0.1%

모의고사비
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
0
8594 
20000
 
1
10000
 
1

Length

Max length5
Median length1
Mean length1.0009307
Min length1

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8594
> 99.9%
20000 1
 
< 0.1%
10000 1
 
< 0.1%

Length

2024-03-15T00:31:16.572570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:31:16.831663image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 8594
> 99.9%
20000 1
 
< 0.1%
10000 1
 
< 0.1%

재료비
Real number (ℝ)

ZEROS 

Distinct62
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean481.16112
Minimum0
Maximum118000
Zeros8453
Zeros (%)98.3%
Negative0
Negative (%)0.0%
Memory size75.7 KiB
2024-03-15T00:31:17.190429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum118000
Range118000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation4853.8722
Coefficient of variation (CV)10.087831
Kurtosis205.98738
Mean481.16112
Median Absolute Deviation (MAD)0
Skewness13.367401
Sum4136061
Variance23560075
MonotonicityNot monotonic
2024-03-15T00:31:17.632147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 8453
98.3%
20000 26
 
0.3%
10000 19
 
0.2%
5000 13
 
0.2%
15000 7
 
0.1%
26560 5
 
0.1%
45000 4
 
< 0.1%
30000 4
 
< 0.1%
50000 3
 
< 0.1%
12000 3
 
< 0.1%
Other values (52) 59
 
0.7%
ValueCountFrequency (%)
0 8453
98.3%
2000 1
 
< 0.1%
3600 1
 
< 0.1%
4000 1
 
< 0.1%
4300 1
 
< 0.1%
5000 13
 
0.2%
6000 1
 
< 0.1%
7220 3
 
< 0.1%
9600 1
 
< 0.1%
10000 19
 
0.2%
ValueCountFrequency (%)
118000 1
< 0.1%
96000 1
< 0.1%
90000 1
< 0.1%
88000 1
< 0.1%
85000 1
< 0.1%
83000 1
< 0.1%
82000 1
< 0.1%
80000 2
< 0.1%
78000 1
< 0.1%
77000 1
< 0.1%

급식비
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
0
8594 
10000
 
2

Length

Max length5
Median length1
Mean length1.0009307
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8594
> 99.9%
10000 2
 
< 0.1%

Length

2024-03-15T00:31:17.913014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:31:18.105314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 8594
> 99.9%
10000 2
 
< 0.1%

간식비
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
0
8596 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8596
100.0%

Length

2024-03-15T00:31:18.434221image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:31:18.755990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 8596
100.0%

기숙사비
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
0
8596 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8596
100.0%

Length

2024-03-15T00:31:19.044463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:31:19.309212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 8596
100.0%

차량비
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
0
8596 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8596
100.0%

Length

2024-03-15T00:31:19.651292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:31:19.968405image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 8596
100.0%

피복비
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size67.3 KiB
0
8592 
190000
 
1
300000
 
1
230000
 
1
200000
 
1

Length

Max length6
Median length1
Mean length1.0023267
Min length1

Unique

Unique4 ?
Unique (%)< 0.1%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 8592
> 99.9%
190000 1
 
< 0.1%
300000 1
 
< 0.1%
230000 1
 
< 0.1%
200000 1
 
< 0.1%

Length

2024-03-15T00:31:20.143442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-15T00:31:20.336005image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 8592
> 99.9%
190000 1
 
< 0.1%
300000 1
 
< 0.1%
230000 1
 
< 0.1%
200000 1
 
< 0.1%

기타경비합계
Real number (ℝ)

SKEWED  ZEROS 

Distinct66
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean596.33097
Minimum0
Maximum300000
Zeros8447
Zeros (%)98.3%
Negative0
Negative (%)0.0%
Memory size75.7 KiB
2024-03-15T00:31:20.584799image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum300000
Range300000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation7034.6283
Coefficient of variation (CV)11.796517
Kurtosis695.46214
Mean596.33097
Median Absolute Deviation (MAD)0
Skewness22.421705
Sum5126061
Variance49485995
MonotonicityNot monotonic
2024-03-15T00:31:20.892481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 8447
98.3%
20000 27
 
0.3%
10000 20
 
0.2%
5000 13
 
0.2%
15000 7
 
0.1%
26560 5
 
0.1%
30000 4
 
< 0.1%
45000 4
 
< 0.1%
12000 3
 
< 0.1%
7220 3
 
< 0.1%
Other values (56) 63
 
0.7%
ValueCountFrequency (%)
0 8447
98.3%
2000 1
 
< 0.1%
3600 1
 
< 0.1%
4000 1
 
< 0.1%
4300 1
 
< 0.1%
5000 13
 
0.2%
6000 1
 
< 0.1%
7220 3
 
< 0.1%
9600 1
 
< 0.1%
10000 20
 
0.2%
ValueCountFrequency (%)
300000 1
< 0.1%
230000 1
< 0.1%
200000 1
< 0.1%
190000 1
< 0.1%
118000 1
< 0.1%
96000 1
< 0.1%
90000 1
< 0.1%
88000 1
< 0.1%
85000 1
< 0.1%
83000 1
< 0.1%

총교습비
Real number (ℝ)

ZEROS 

Distinct269
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean151431.35
Minimum0
Maximum864000
Zeros1121
Zeros (%)13.0%
Negative0
Negative (%)0.0%
Memory size75.7 KiB
2024-03-15T00:31:21.405498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q1100000
median147500
Q3200000
95-th percentile300000
Maximum864000
Range864000
Interquartile range (IQR)100000

Descriptive statistics

Standard deviation96249.241
Coefficient of variation (CV)0.63559651
Kurtosis1.6785061
Mean151431.35
Median Absolute Deviation (MAD)52500
Skewness0.66707211
Sum1.3017039 × 109
Variance9.2639164 × 109
MonotonicityNot monotonic
2024-03-15T00:31:21.825178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1121
 
13.0%
150000 695
 
8.1%
200000 565
 
6.6%
120000 539
 
6.3%
130000 493
 
5.7%
250000 462
 
5.4%
140000 412
 
4.8%
100000 399
 
4.6%
300000 356
 
4.1%
160000 318
 
3.7%
Other values (259) 3236
37.6%
ValueCountFrequency (%)
0 1121
13.0%
10000 7
 
0.1%
12000 1
 
< 0.1%
13000 1
 
< 0.1%
18000 1
 
< 0.1%
20000 21
 
0.2%
21000 2
 
< 0.1%
24000 1
 
< 0.1%
25000 4
 
< 0.1%
30000 25
 
0.3%
ValueCountFrequency (%)
864000 1
 
< 0.1%
850000 1
 
< 0.1%
750000 1
 
< 0.1%
650000 1
 
< 0.1%
600000 9
0.1%
590000 1
 
< 0.1%
560000 1
 
< 0.1%
550000 3
 
< 0.1%
540000 2
 
< 0.1%
520000 1
 
< 0.1%

총교습비(시간당)
Real number (ℝ)

ZEROS 

Distinct916
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7988.9012
Minimum0
Maximum234000
Zeros1280
Zeros (%)14.9%
Negative0
Negative (%)0.0%
Memory size75.7 KiB
2024-03-15T00:31:22.110647image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q16000
median8400
Q310500
95-th percentile12500
Maximum234000
Range234000
Interquartile range (IQR)4500

Descriptive statistics

Standard deviation7189.5354
Coefficient of variation (CV)0.89994045
Kurtosis358.3038
Mean7988.9012
Median Absolute Deviation (MAD)2229
Skewness14.788411
Sum68672595
Variance51689419
MonotonicityNot monotonic
2024-03-15T00:31:22.352754image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1280
 
14.9%
10000 338
 
3.9%
7500 307
 
3.6%
6000 239
 
2.8%
8333 231
 
2.7%
9000 183
 
2.1%
10714 154
 
1.8%
9375 154
 
1.8%
12500 143
 
1.7%
11250 137
 
1.6%
Other values (906) 5430
63.2%
ValueCountFrequency (%)
0 1280
14.9%
650 1
 
< 0.1%
833 1
 
< 0.1%
926 1
 
< 0.1%
1200 3
 
< 0.1%
1286 1
 
< 0.1%
2000 1
 
< 0.1%
3000 2
 
< 0.1%
3125 1
 
< 0.1%
3333 2
 
< 0.1%
ValueCountFrequency (%)
234000 1
 
< 0.1%
216000 1
 
< 0.1%
200000 1
 
< 0.1%
150000 3
< 0.1%
140000 1
 
< 0.1%
130000 1
 
< 0.1%
127500 1
 
< 0.1%
125000 1
 
< 0.1%
105556 1
 
< 0.1%
102857 1
 
< 0.1%

Sample

지역신고번호교습소명교습자-성명등록상태우편번호교습소주소일시수용능력인원분야구분교습계열교습과정교습과목(반)정원모의고사비재료비급식비간식비기숙사비차량비피복비기타경비합계총교습비총교습비(시간당)
0포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학중등D90000000000
1포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학초등A9000000001700009444
2포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학고등C90000000000
3포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학고등B90000000000
4포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학고등A90000000000
5포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학중등E90000000000
6포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학중등C90000000040000013793
7포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학중등B90000000035000012805
8포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학중등A90000000030000010714
9포항시2008-26강선생수학교습소강태욱개원37678경상북도 포항시 남구 대이로 97, , 3층 (대잠동)9입시.검정 및 보습보통교과보습수학초등B90000000025000011364
지역신고번호교습소명교습자-성명등록상태우편번호교습소주소일시수용능력인원분야구분교습계열교습과정교습과목(반)정원모의고사비재료비급식비간식비기숙사비차량비피복비기타경비합계총교습비총교습비(시간당)
8586울진군222피아니스트피아노교습소남상희개원36324경상북도 울진군 울진읍 읍내8길 , 144예능(대)예능(중)음악피아노(초급)4000000001300006500
8587울진군222피아니스트피아노교습소남상희개원36324경상북도 울진군 울진읍 읍내8길 , 144예능(대)예능(중)음악피아노(중급)4000000001400007000
8588울진군222피아니스트피아노교습소남상희개원36324경상북도 울진군 울진읍 읍내8길 , 144예능(대)예능(중)음악피아노(고급)4000000001500007500
8589울진군288하쌤수학교습소하지연개원36305경상북도 울진군 북면 울진북로 2120, 2층0입시.검정 및 보습보통교과보통교과초등 수학50000000000
8590울진군288하쌤수학교습소하지연개원36305경상북도 울진군 북면 울진북로 2120, 2층0입시.검정 및 보습보통교과보통교과심화 수학40000000000
8591울진군288하쌤수학교습소하지연개원36305경상북도 울진군 북면 울진북로 2120, 2층0입시.검정 및 보습보통교과보통교과고등 수학40000000000
8592울진군288하쌤수학교습소하지연개원36305경상북도 울진군 북면 울진북로 2120, 2층0입시.검정 및 보습보통교과보통교과중등 수학50000000000
8593울릉군제933호실로피아노교습소황제영개원40218경상북도 울릉군 울릉읍 저동길 68, (울릉읍)7예능(대)예능(중)음악피아노 초급7000000001000005000
8594울릉군제933호실로피아노교습소황제영개원40218경상북도 울릉군 울릉읍 저동길 68, (울릉읍)7예능(대)예능(중)음악피아노 중급7000000001100005500
8595울릉군제933호실로피아노교습소황제영개원40218경상북도 울릉군 울릉읍 저동길 68, (울릉읍)7예능(대)예능(중)음악피아노 고급7000000001200006000

Duplicate rows

Most frequently occurring

지역신고번호교습소명교습자-성명등록상태우편번호교습소주소일시수용능력인원분야구분교습계열교습과정교습과목(반)정원모의고사비재료비급식비간식비기숙사비차량비피복비기타경비합계총교습비총교습비(시간당)# duplicates
2경주시2005-31샤갈미술교습소정경희개원38083경상북도 경주시 황성로27번길 15, , 206호 (황성동)6입시.검정 및 보습보통교과보습수학30000000010000003
3구미시1437최선생미술교습소최선형개원730300경상북도 구미시 인동36길 35, , 상가동 208호 (구평동, 구미구평푸르지오)9입시.검정 및 보습보통교과보습수학10000000030000003
19영주시896푸르넷수학교습소김미숙개원36138경상북도 영주시 대동로 20, (가흥동) 2층9입시.검정 및 보습보통교과보습수학900000000003
21예천군05월 30일계명서당한자교습소김보현개원36828경상북도 예천군 예천읍 효자로 40, , 세종타운상가 102호 (예천읍, 세종타운)8입시.검정 및 보습보통교과보습수학5000000005000003
22예천군05월 30일계명서당한자교습소김보현개원36828경상북도 예천군 예천읍 효자로 40, , 세종타운상가 102호 (예천읍, 세종타운)8입시.검정 및 보습보통교과보습영어5000000005000003
0경산시1687안쌤수학교습소안정미개원38655경상북도 경산시 대학로16길 22-2, , 101호 (정평동)6입시.검정 및 보습보통교과보습수학(중)600000000200000100002
1경산시1757바르다수학교습소장희경개원38438경상북도 경산시 하양읍 대학로296길 9-13, , 201호 (하양읍)8입시.검정 및 보습보통교과보습수학(중)500000000270000112502
4구미시2109피아노와우쿨이야기음악교습소김은아개원730130경상북도 구미시 왕산로 68, , 상가 302/201 (임은동,임은 코오롱하늘채)9예능(대)예능(중)음악우쿨중급9000000003000075002
5구미시2109피아노와우쿨이야기음악교습소김은아개원730130경상북도 구미시 왕산로 68, , 상가 302/201 (임은동,임은 코오롱하늘채)9예능(대)예능(중)음악우쿨초급B9000000003000075002
6구미시2109피아노와우쿨이야기음악교습소김은아개원730130경상북도 구미시 왕산로 68, , 상가 302/201 (임은동,임은 코오롱하늘채)9예능(대)예능(중)음악피아노중급90000000010000083332