Overview

Dataset statistics

Number of variables: 23
Number of observations: 10000
Missing cells: 24
Missing cells (%): < 0.1%
Duplicate rows: 11
Duplicate rows (%): 0.1%
Total size in memory: 2.0 MiB
Average record size in memory: 205.0 B
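
The percentage and size rows can be cross-checked from the counts in this block. The following is a minimal sketch, assuming the missing-cell percentage is taken over all cells (variables × observations) and the duplicate percentage over all rows; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" block of the report.
n_variables = 23
n_observations = 10_000
missing_cells = 24
duplicate_rows = 11
avg_record_bytes = 205.0

# Assumption: missing cells are measured against every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: duplicate rows are measured against the number of observations.
duplicate_pct = 100 * duplicate_rows / n_observations

# Average record size times row count, expressed in MiB (2**20 bytes).
total_mib = avg_record_bytes * n_observations / 2**20

print(f"missing cells:  {missing_pct:.4f}%")
print(f"duplicate rows: {duplicate_pct:.2f}%")
print(f"total size:     {total_mib:.3f} MiB")
```

Under these assumptions the missing-cell share is far below the 0.1% display floor, which is why the report shows it as "< 0.1%".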

Variable types

Categorical: 7
Text: 6
Numeric: 10

Dataset

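Description: Provides items such as academy name, academy type, field, academy address, subjects taught, enrollment capacity, and tuition fees for private academies (hagwon) in Gyeongsangbuk-do.
Author: Gyeongsangbuk-do Office of Education (경상북도교육청)
URL: https://www.data.go.kr/data/15048352/fileData.do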

Alerts

기숙사비 has constant value "" [Constant]
Dataset has 11 (0.1%) duplicate rows [Duplicates]
학원종류 is highly imbalanced (56.9%) [Imbalance]
등록상태 is highly imbalanced (93.8%) [Imbalance]
급식비 is highly imbalanced (99.7%) [Imbalance]
피복비 is highly imbalanced (99.8%) [Imbalance]
정원 is highly skewed (γ1 = 97.04349249) [Skewed]
모의고사비 is highly skewed (γ1 = 58.20902393) [Skewed]
재료비 is highly skewed (γ1 = 21.16122338) [Skewed]
차량비 is highly skewed (γ1 = 21.34991848) [Skewed]
기타경비합계 is highly skewed (γ1 = 20.63672693) [Skewed]
총교습비(시간당) is highly skewed (γ1 = 22.3586759) [Skewed]
정원합계 has 320 (3.2%) zeros [Zeros]
모의고사비 has 9978 (99.8%) zeros [Zeros]
재료비 has 9833 (98.3%) zeros [Zeros]
차량비 has 9936 (99.4%) zeros [Zeros]
기타경비합계 has 9752 (97.5%) zeros [Zeros]
총교습비 has 2095 (20.9%) zeros [Zeros]
총교습비(시간당) has 2233 (22.3%) zeros [Zeros]
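
The skewness alerts report a coefficient γ1 per column. The sketch below shows one common way to compute such a coefficient, the bias-adjusted Fisher-Pearson sample skewness. It is an assumption that the report uses this exact estimator (it is the one pandas' `Series.skew` implements); the function name and the toy data are hypothetical.

```python
import math

def sample_skewness(values):
    """Bias-adjusted Fisher-Pearson skewness (G1) of a sequence of numbers."""
    n = len(values)
    if n < 3:
        raise ValueError("skewness needs at least three observations")
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    if m2 == 0:
        return 0.0  # constant column: no asymmetry to measure
    g1 = m3 / m2 ** 1.5
    # Small-sample correction factor applied to the moment ratio.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1

# Hypothetical toy column: almost all zeros plus one large fee,
# the same shape that triggers the Skewed and Zeros alerts together.
toy = [0] * 99 + [450000]
print(sample_skewness(toy))
```

A column dominated by zeros with a few large values, as in 모의고사비 or 재료비, produces a large positive coefficient, so the Skewed and Zeros alerts tend to appear on the same variables.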

Reproduction

Analysis started: 2024-05-04 07:52:21.157023
Analysis finished: 2024-05-04 07:52:25.699649
Duration: 4.54 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

지역
Categorical

Distinct: 23
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

구미시: 2852
포항시: 2318
경산시: 1356
경주시: 763
안동시: 540
Other values (18): 2171

Length

Max length: 4
Median length: 3
Mean length: 3.001
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%
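
The Distinct and Unique rows measure different things: Distinct counts how many different values occur, while Unique counts values that occur exactly once. A minimal sketch of that distinction, using the five sample rows listed for this variable (the function name is illustrative):

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# The five sample rows shown for 지역 in this report.
sample = ["구미시", "포항시", "구미시", "구미시", "포항시"]
print(distinct_and_unique(sample))
```

For 지역 every one of the 23 regions appears more than once across the 10000 rows, which is why Distinct is 23 while Unique is 0.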

Sample

1st row: 구미시
2nd row: 포항시
3rd row: 구미시
4th row: 구미시
5th row: 포항시

Common Values

Value | Count | Frequency (%)
구미시 | 2852 | 28.5%
포항시 | 2318 | 23.2%
경산시 | 1356 | 13.6%
경주시 | 763 | 7.6%
안동시 | 540 | 5.4%
칠곡군 | 355 | 3.5%
김천시 | 352 | 3.5%
영주시 | 332 | 3.3%
영천시 | 243 | 2.4%
예천군 | 188 | 1.9%
Other values (13) | 701 | 7.0%

Length

Histogram of lengths of the category
Distinct2458
Distinct (%)24.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB

Length

Max length8
Median length4
Mean length3.9289
Min length1

Characters and Unicode

Total characters: 39289
Distinct characters: 42
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
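
As an illustration of the character properties mentioned above, Python's standard `unicodedata` module exposes the general category of each code point as a two-letter code (for example `Nd` for Decimal Number, `Lo` for Other Letter, `Pd` for Dash Punctuation, `Zs` for Space Separator). This is a sketch of the idea only; the report itself may rely on a different Unicode library, and the sample string is made up rather than taken from the column.

```python
import unicodedata
from collections import Counter

def category_counts(text):
    """Tally Unicode general categories (e.g. 'Lo', 'Nd', 'Pd') in a string."""
    return Counter(unicodedata.category(ch) for ch in text)

# Illustrative value mixing Latin letters, a dash, digits, a space and Hangul.
counts = category_counts("AB-12 가")
for code, n in sorted(counts.items()):
    print(code, n)
```

Summing such tallies over every value of a text column gives the kind of per-category totals listed under "Most occurring categories".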

Unique

Unique468 ?
Unique (%)4.7%

Sample

1st row2458
2nd row등1044
3rd row2445
4th row1479
5th row2496
ValueCountFrequency (%)
711 61
 
0.6%
2327 34
 
0.3%
1595 31
 
0.3%
2025 31
 
0.3%
1678 30
 
0.3%
625 29
 
0.3%
429 27
 
0.3%
2297 27
 
0.3%
1638 26
 
0.3%
2380 26
 
0.3%
Other values (2443) 9733
96.8%

Most occurring characters

ValueCountFrequency (%)
2 7204
18.3%
1 5535
14.1%
0 3386
8.6%
5 3206
8.2%
4 2969
7.6%
6 2937
7.5%
3 2914
7.4%
7 2901
7.4%
8 2813
 
7.2%
9 2576
 
6.6%
Other values (32) 2848
 
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36441
92.8%
Dash Punctuation 943
 
2.4%
Lowercase Letter 764
 
1.9%
Other Letter 704
 
1.8%
Uppercase Letter 382
 
1.0%
Space Separator 55
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 124
16.2%
e 91
11.9%
n 86
11.3%
u 79
10.3%
r 57
7.5%
c 55
7.2%
p 45
 
5.9%
o 40
 
5.2%
v 40
 
5.2%
b 39
 
5.1%
Other values (4) 108
14.1%
Decimal Number
ValueCountFrequency (%)
2 7204
19.8%
1 5535
15.2%
0 3386
9.3%
5 3206
8.8%
4 2969
8.1%
6 2937
8.1%
3 2914
8.0%
7 2901
8.0%
8 2813
 
7.7%
9 2576
 
7.1%
Other Letter
ValueCountFrequency (%)
547
77.7%
55
 
7.8%
55
 
7.8%
16
 
2.3%
10
 
1.4%
10
 
1.4%
9
 
1.3%
2
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
J 114
29.8%
M 69
18.1%
A 43
 
11.3%
N 40
 
10.5%
F 39
 
10.2%
D 30
 
7.9%
O 25
 
6.5%
S 22
 
5.8%
Dash Punctuation
ValueCountFrequency (%)
- 943
100.0%
Space Separator
ValueCountFrequency (%)
55
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 37439
95.3%
Latin 1146
 
2.9%
Hangul 704
 
1.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 124
 
10.8%
J 114
 
9.9%
e 91
 
7.9%
n 86
 
7.5%
u 79
 
6.9%
M 69
 
6.0%
r 57
 
5.0%
c 55
 
4.8%
p 45
 
3.9%
A 43
 
3.8%
Other values (12) 383
33.4%
Common
ValueCountFrequency (%)
2 7204
19.2%
1 5535
14.8%
0 3386
9.0%
5 3206
8.6%
4 2969
7.9%
6 2937
7.8%
3 2914
7.8%
7 2901
7.7%
8 2813
 
7.5%
9 2576
 
6.9%
Other values (2) 998
 
2.7%
Hangul
ValueCountFrequency (%)
547
77.7%
55
 
7.8%
55
 
7.8%
16
 
2.3%
10
 
1.4%
10
 
1.4%
9
 
1.3%
2
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 38585
98.2%
Hangul 704
 
1.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 7204
18.7%
1 5535
14.3%
0 3386
8.8%
5 3206
8.3%
4 2969
7.7%
6 2937
7.6%
3 2914
7.6%
7 2901
7.5%
8 2813
 
7.3%
9 2576
 
6.7%
Other values (24) 2144
 
5.6%
Hangul
ValueCountFrequency (%)
547
77.7%
55
 
7.8%
55
 
7.8%
16
 
2.3%
10
 
1.4%
10
 
1.4%
9
 
1.3%
2
 
0.3%
Distinct3201
Distinct (%)32.0%
Missing0
Missing (%)0.0%
Memory size156.2 KiB

Length

Max length34
Median length26
Mean length8.8274
Min length3

Characters and Unicode

Total characters88274
Distinct characters713
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique914 ?
Unique (%)9.1%

Sample

1st row아뜰리에뷰티아카데미학원
2nd row문덕시몬독서실
3rd row초지학원
4th row가온학원
5th row장성해법수학학원
ValueCountFrequency (%)
정직한선생님들사동점영어수학학원 59
 
0.6%
구미직업능력개발학원 31
 
0.3%
눈높이러닝센터상모학원 30
 
0.3%
눈높이러닝센터봉곡학원 29
 
0.3%
한결요리제과제빵학원 27
 
0.3%
눈높이러닝센터듀클라스학원 26
 
0.3%
눈높이러닝센터옥곡학원 23
 
0.2%
양덕점와와학습코칭학원 23
 
0.2%
구미에이닷영어학원 23
 
0.2%
눈높이러닝센터옥산1지구학원 22
 
0.2%
Other values (3241) 9888
97.1%

Most occurring characters

ValueCountFrequency (%)
11707
 
13.3%
10099
 
11.4%
2261
 
2.6%
1977
 
2.2%
1964
 
2.2%
1936
 
2.2%
1868
 
2.1%
1266
 
1.4%
1247
 
1.4%
1239
 
1.4%
Other values (703) 52710
59.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 84056
95.2%
Uppercase Letter 1890
 
2.1%
Lowercase Letter 717
 
0.8%
Space Separator 410
 
0.5%
Open Punctuation 372
 
0.4%
Close Punctuation 372
 
0.4%
Decimal Number 267
 
0.3%
Other Punctuation 148
 
0.2%
Dash Punctuation 21
 
< 0.1%
Math Symbol 19
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
11707
 
13.9%
10099
 
12.0%
2261
 
2.7%
1977
 
2.4%
1964
 
2.3%
1936
 
2.3%
1868
 
2.2%
1266
 
1.5%
1247
 
1.5%
1239
 
1.5%
Other values (633) 48492
57.7%
Uppercase Letter
ValueCountFrequency (%)
E 280
14.8%
S 233
12.3%
M 128
 
6.8%
T 124
 
6.6%
B 119
 
6.3%
C 100
 
5.3%
I 96
 
5.1%
A 83
 
4.4%
N 82
 
4.3%
K 82
 
4.3%
Other values (15) 563
29.8%
Lowercase Letter
ValueCountFrequency (%)
e 109
15.2%
i 88
12.3%
n 59
 
8.2%
o 56
 
7.8%
l 55
 
7.7%
s 54
 
7.5%
a 38
 
5.3%
t 34
 
4.7%
h 32
 
4.5%
d 29
 
4.0%
Other values (13) 163
22.7%
Decimal Number
ValueCountFrequency (%)
1 79
29.6%
3 62
23.2%
0 58
21.7%
2 34
12.7%
5 16
 
6.0%
4 7
 
2.6%
7 7
 
2.6%
9 2
 
0.7%
8 2
 
0.7%
Other Punctuation
ValueCountFrequency (%)
& 58
39.2%
. 41
27.7%
· 24
16.2%
, 15
 
10.1%
' 10
 
6.8%
Open Punctuation
ValueCountFrequency (%)
( 371
99.7%
[ 1
 
0.3%
Close Punctuation
ValueCountFrequency (%)
) 371
99.7%
] 1
 
0.3%
Space Separator
ValueCountFrequency (%)
410
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Math Symbol
ValueCountFrequency (%)
+ 19
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 84056
95.2%
Latin 2607
 
3.0%
Common 1611
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
11707
 
13.9%
10099
 
12.0%
2261
 
2.7%
1977
 
2.4%
1964
 
2.3%
1936
 
2.3%
1868
 
2.2%
1266
 
1.5%
1247
 
1.5%
1239
 
1.5%
Other values (633) 48492
57.7%
Latin
ValueCountFrequency (%)
E 280
 
10.7%
S 233
 
8.9%
M 128
 
4.9%
T 124
 
4.8%
B 119
 
4.6%
e 109
 
4.2%
C 100
 
3.8%
I 96
 
3.7%
i 88
 
3.4%
A 83
 
3.2%
Other values (38) 1247
47.8%
Common
ValueCountFrequency (%)
410
25.5%
( 371
23.0%
) 371
23.0%
1 79
 
4.9%
3 62
 
3.8%
0 58
 
3.6%
& 58
 
3.6%
. 41
 
2.5%
2 34
 
2.1%
· 24
 
1.5%
Other values (12) 103
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 84056
95.2%
ASCII 4194
 
4.8%
None 24
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
11707
 
13.9%
10099
 
12.0%
2261
 
2.7%
1977
 
2.4%
1964
 
2.3%
1936
 
2.3%
1868
 
2.2%
1266
 
1.5%
1247
 
1.5%
1239
 
1.5%
Other values (633) 48492
57.7%
ASCII
ValueCountFrequency (%)
410
 
9.8%
( 371
 
8.8%
) 371
 
8.8%
E 280
 
6.7%
S 233
 
5.6%
M 128
 
3.1%
T 124
 
3.0%
B 119
 
2.8%
e 109
 
2.6%
C 100
 
2.4%
Other values (59) 1949
46.5%
None
ValueCountFrequency (%)
· 24
100.0%

학원종류
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

학교교과교습학원: 9115
평생직업교육학원: 885

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row평생직업교육학원
2nd row학교교과교습학원
3rd row학교교과교습학원
4th row학교교과교습학원
5th row학교교과교습학원

Common Values

Value | Count | Frequency (%)
학교교과교습학원 | 9115 | 91.1%
평생직업교육학원 | 885 | 8.8%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
학교교과교습학원 9115
91.1%
평생직업교육학원 885
 
8.8%
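
The 56.9% imbalance reported for 학원종류 is consistent with one minus the normalized Shannon entropy of the class counts. The sketch below works under that assumption; the function name is illustrative, and this is not claimed to be the library's exact code.

```python
import math

def imbalance_score(counts):
    """1 - (Shannon entropy / maximum entropy) for a list of class counts."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(k)

# Class counts taken from the Common Values tables in this report.
print(round(100 * imbalance_score([9115, 885]), 1))    # 학원종류
print(round(100 * imbalance_score([9927, 73]), 1))     # 등록상태
print(round(100 * imbalance_score([9997, 2, 1]), 1))   # 급식비
```

A score of 0 means perfectly balanced classes and a score near 1 means almost all rows fall in one class, which matches the ordering of the three Imbalance alerts (56.9%, 93.8%, 99.7%).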

분야구분
Categorical

Distinct: 11
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

입시.검정 및 보습: 5943
예능(대): 1768
국제화: 610
종합(대): 590
직업기술: 559
Other values (6): 530

Length

Max length10
Median length10
Mean length7.7492
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row직업기술
2nd row독서실
3rd row입시.검정 및 보습
4th row입시.검정 및 보습
5th row입시.검정 및 보습

Common Values

Value | Count | Frequency (%)
입시.검정 및 보습 | 5943 | 59.4%
예능(대) | 1768 | 17.7%
국제화 | 610 | 6.1%
종합(대) | 590 | 5.9%
직업기술 | 559 | 5.6%
독서실 | 206 | 2.1%
기타(대) | 154 | 1.5%
기예(대) | 106 | 1.1%
정보 | 32 | 0.3%
인문사회(대) | 28 | 0.3%

Length

Histogram of lengths of the category
Value | Count | Frequency (%)
입시.검정 | 5943 | 27.2%
및 | 5943 | 27.2%
보습 | 5943 | 27.2%
예능(대 | 1768 | 8.1%
국제화 | 610 | 2.8%
종합(대 | 590 | 2.7%
직업기술 | 559 | 2.6%
독서실 | 206 | 0.9%
기타(대 | 154 | 0.7%
기예(대 | 106 | 0.5%
Other values (3) | 64 | 0.3%
Distinct3272
Distinct (%)32.7%
Missing8
Missing (%)0.1%
Memory size156.2 KiB

Length

Max length67
Median length58
Mean length32.89982
Min length18

Characters and Unicode

Total characters328735
Distinct characters455
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks4 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
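
The mean length reported for this variable can be reproduced from the figures in this section, assuming it is the total character count divided by the number of non-missing values. A short check (variable names are illustrative):

```python
# Figures copied from this variable's summary.
total_characters = 328_735
n_observations = 10_000
missing = 8

# Assumption: missing values are excluded before averaging.
non_missing = n_observations - missing
mean_length = total_characters / non_missing

print(non_missing, round(mean_length, 5))
```

Dividing by all 10000 rows instead would give 32.8735, which does not match the reported 32.89982, so the non-missing denominator appears to be the one in use.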

Unique

Unique946 ?
Unique (%)9.5%

Sample

1st row경상북도 구미시 구미중앙로 77, , 3층 일부 (원평동)
2nd row경상북도 구미시 인동38길 9-16, , 204호 (구평동, 종합프라자)
3rd row경상북도 구미시 상사서로 42, , 3층 (상모동)
4th row경상북도 포항시 북구 새천년대로1249번길 10-6, , 3층 일부 (장성동)
5th row경상북도 칠곡군 왜관읍 평장길 42, , 2층 (왜관읍)
ValueCountFrequency (%)
경상북도 9992
 
13.4%
7842
 
10.5%
2층 3058
 
4.1%
구미시 2852
 
3.8%
포항시 2311
 
3.1%
3층 1833
 
2.5%
경산시 1360
 
1.8%
북구 1245
 
1.7%
남구 1066
 
1.4%
경주시 763
 
1.0%
Other values (2892) 42378
56.7%

Most occurring characters

ValueCountFrequency (%)
65312
19.9%
, 19912
 
6.1%
12681
 
3.9%
11826
 
3.6%
11397
 
3.5%
10637
 
3.2%
( 9781
 
3.0%
) 9781
 
3.0%
9723
 
3.0%
1 9458
 
2.9%
Other values (445) 158227
48.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 172639
52.5%
Space Separator 65312
 
19.9%
Decimal Number 48505
 
14.8%
Other Punctuation 19984
 
6.1%
Open Punctuation 9787
 
3.0%
Close Punctuation 9787
 
3.0%
Dash Punctuation 2215
 
0.7%
Uppercase Letter 358
 
0.1%
Math Symbol 115
 
< 0.1%
Lowercase Letter 31
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12681
 
7.3%
11826
 
6.9%
11397
 
6.6%
10637
 
6.2%
9723
 
5.6%
9297
 
5.4%
8214
 
4.8%
7642
 
4.4%
5719
 
3.3%
4997
 
2.9%
Other values (399) 80506
46.6%
Uppercase Letter
ValueCountFrequency (%)
A 92
25.7%
B 64
17.9%
M 40
11.2%
G 33
 
9.2%
C 16
 
4.5%
J 16
 
4.5%
S 13
 
3.6%
K 12
 
3.4%
E 12
 
3.4%
W 12
 
3.4%
Other values (9) 48
13.4%
Decimal Number
ValueCountFrequency (%)
1 9458
19.5%
2 9397
19.4%
3 7204
14.9%
0 4792
9.9%
4 4379
9.0%
5 3811
7.9%
6 2812
 
5.8%
7 2394
 
4.9%
9 2204
 
4.5%
8 2054
 
4.2%
Other Punctuation
ValueCountFrequency (%)
, 19912
99.6%
. 49
 
0.2%
· 12
 
0.1%
/ 7
 
< 0.1%
: 3
 
< 0.1%
@ 1
 
< 0.1%
Lowercase Letter
ValueCountFrequency (%)
i 21
67.7%
o 6
 
19.4%
e 4
 
12.9%
Open Punctuation
ValueCountFrequency (%)
( 9781
99.9%
[ 6
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 9781
99.9%
] 6
 
0.1%
Space Separator
ValueCountFrequency (%)
65312
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2215
100.0%
Math Symbol
ValueCountFrequency (%)
~ 115
100.0%
Letter Number
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 172639
52.5%
Common 155705
47.4%
Latin 391
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12681
 
7.3%
11826
 
6.9%
11397
 
6.6%
10637
 
6.2%
9723
 
5.6%
9297
 
5.4%
8214
 
4.8%
7642
 
4.4%
5719
 
3.3%
4997
 
2.9%
Other values (399) 80506
46.6%
Common
ValueCountFrequency (%)
65312
41.9%
, 19912
 
12.8%
( 9781
 
6.3%
) 9781
 
6.3%
1 9458
 
6.1%
2 9397
 
6.0%
3 7204
 
4.6%
0 4792
 
3.1%
4 4379
 
2.8%
5 3811
 
2.4%
Other values (13) 11878
 
7.6%
Latin
ValueCountFrequency (%)
A 92
23.5%
B 64
16.4%
M 40
10.2%
G 33
 
8.4%
i 21
 
5.4%
C 16
 
4.1%
J 16
 
4.1%
S 13
 
3.3%
K 12
 
3.1%
E 12
 
3.1%
Other values (13) 72
18.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 172639
52.5%
ASCII 156082
47.5%
None 12
 
< 0.1%
Number Forms 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
65312
41.8%
, 19912
 
12.8%
( 9781
 
6.3%
) 9781
 
6.3%
1 9458
 
6.1%
2 9397
 
6.0%
3 7204
 
4.6%
0 4792
 
3.1%
4 4379
 
2.8%
5 3811
 
2.4%
Other values (34) 12255
 
7.9%
Hangul
ValueCountFrequency (%)
12681
 
7.3%
11826
 
6.9%
11397
 
6.6%
10637
 
6.2%
9723
 
5.6%
9297
 
5.4%
8214
 
4.8%
7642
 
4.4%
5719
 
3.3%
4997
 
2.9%
Other values (399) 80506
46.6%
None
ValueCountFrequency (%)
· 12
100.0%
Number Forms
ValueCountFrequency (%)
2
100.0%

우편번호
Real number (ℝ)

Distinct: 831
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 108304.28
Minimum: 0
Maximum: 799801
Zeros: 2
Zeros (%): < 0.1%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 36639
Q1: 37676
Median: 38662
Q3: 39322
95-th percentile: 790723
Maximum: 799801
Range: 799801
Interquartile range (IQR): 1646

Descriptive statistics

Standard deviation: 215532.51
Coefficient of variation (CV): 1.9900646
Kurtosis: 5.6493265
Mean: 108304.28
Median Absolute Deviation (MAD): 779
Skewness: 2.7619441
Sum: 1.0830428 × 10^9
Variance: 4.6454263 × 10^10
Monotonicity: Not monotonic
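
Several of the statistics above are derived from others in the same section. The following sketch recomputes them from the reported figures for 우편번호, assuming the usual definitions (CV = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum, variance = standard deviation squared); the variable names are illustrative.

```python
# Figures copied from the 우편번호 summary.
mean = 108304.28
std = 215532.51
q1, q3 = 37676, 39322
minimum, maximum = 0, 799801
n_observations = 10_000

cv = std / mean                  # coefficient of variation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
variance = std ** 2              # variance
total = mean * n_observations    # sum of all values

print(f"CV={cv:.7f} IQR={iqr} range={value_range}")
print(f"variance={variance:.7e} sum={total:.7e}")
```

The recomputed variance and sum come out on the order of 10^10 and 10^9 respectively, in line with the values listed under Descriptive statistics.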
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
39464 203
 
2.0%
39660 198
 
2.0%
38662 185
 
1.8%
39184 157
 
1.6%
38069 155
 
1.6%
38687 152
 
1.5%
36849 145
 
1.5%
39146 137
 
1.4%
37883 135
 
1.4%
37836 123
 
1.2%
Other values (821) 8410
84.1%
ValueCountFrequency (%)
0 2
 
< 0.1%
13149 3
 
< 0.1%
36025 5
0.1%
36026 1
 
< 0.1%
36028 1
 
< 0.1%
36029 2
 
< 0.1%
36030 4
 
< 0.1%
36056 12
0.1%
36065 2
 
< 0.1%
36066 5
0.1%
ValueCountFrequency (%)
799801 2
 
< 0.1%
791946 4
 
< 0.1%
791944 9
0.1%
791943 1
 
< 0.1%
791852 4
 
< 0.1%
791851 10
0.1%
791850 2
 
< 0.1%
791848 1
 
< 0.1%
791846 8
0.1%
791843 5
0.1%

등록상태
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

개원: 9927
자진휴원(소): 73

Length

Max length7
Median length2
Mean length2.0365
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개원
2nd row개원
3rd row개원
4th row개원
5th row개원

Common Values

Value | Count | Frequency (%)
개원 | 9927 | 99.3%
자진휴원(소) | 73 | 0.7%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
개원 9927
99.3%
자진휴원(소 73
 
0.7%
Distinct2805
Distinct (%)28.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB

Length

Max length20
Median length3
Mean length3.6387
Min length2

Characters and Unicode

Total characters36387
Distinct characters350
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique778 ?
Unique (%)7.8%

Sample

1st row배승연
2nd row이옥분
3rd row박환선
4th row노태문
5th row최정아
ValueCountFrequency (%)
주식회사 811
 
7.5%
대교 569
 
5.2%
주)웅진씽크빅 162
 
1.5%
주)재능교육 79
 
0.7%
박동준 59
 
0.5%
동화세상에듀코 48
 
0.4%
이동하 43
 
0.4%
디쉐어 31
 
0.3%
엑츠어학원 30
 
0.3%
조경화 27
 
0.2%
Other values (2810) 9003
82.9%

Most occurring characters

ValueCountFrequency (%)
2116
 
5.8%
1590
 
4.4%
1489
 
4.1%
1258
 
3.5%
957
 
2.6%
948
 
2.6%
898
 
2.5%
871
 
2.4%
847
 
2.3%
780
 
2.1%
Other values (340) 24633
67.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 34666
95.3%
Space Separator 957
 
2.6%
Close Punctuation 312
 
0.9%
Open Punctuation 312
 
0.9%
Uppercase Letter 138
 
0.4%
Lowercase Letter 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2116
 
6.1%
1590
 
4.6%
1489
 
4.3%
1258
 
3.6%
948
 
2.7%
898
 
2.6%
871
 
2.5%
847
 
2.4%
780
 
2.3%
767
 
2.2%
Other values (315) 23102
66.6%
Uppercase Letter
ValueCountFrequency (%)
E 16
11.6%
O 14
10.1%
N 12
 
8.7%
S 12
 
8.7%
A 10
 
7.2%
R 10
 
7.2%
H 10
 
7.2%
I 9
 
6.5%
Y 6
 
4.3%
L 6
 
4.3%
Other values (10) 33
23.9%
Lowercase Letter
ValueCountFrequency (%)
n 1
50.0%
c 1
50.0%
Space Separator
ValueCountFrequency (%)
957
100.0%
Close Punctuation
ValueCountFrequency (%)
) 312
100.0%
Open Punctuation
ValueCountFrequency (%)
( 312
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 34666
95.3%
Common 1581
 
4.3%
Latin 140
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2116
 
6.1%
1590
 
4.6%
1489
 
4.3%
1258
 
3.6%
948
 
2.7%
898
 
2.6%
871
 
2.5%
847
 
2.4%
780
 
2.3%
767
 
2.2%
Other values (315) 23102
66.6%
Latin
ValueCountFrequency (%)
E 16
11.4%
O 14
 
10.0%
N 12
 
8.6%
S 12
 
8.6%
A 10
 
7.1%
R 10
 
7.1%
H 10
 
7.1%
I 9
 
6.4%
Y 6
 
4.3%
L 6
 
4.3%
Other values (12) 35
25.0%
Common
ValueCountFrequency (%)
957
60.5%
) 312
 
19.7%
( 312
 
19.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 34666
95.3%
ASCII 1721
 
4.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2116
 
6.1%
1590
 
4.6%
1489
 
4.3%
1258
 
3.6%
948
 
2.7%
898
 
2.6%
871
 
2.5%
847
 
2.4%
780
 
2.3%
767
 
2.2%
Other values (315) 23102
66.6%
ASCII
ValueCountFrequency (%)
957
55.6%
) 312
 
18.1%
( 312
 
18.1%
E 16
 
0.9%
O 14
 
0.8%
N 12
 
0.7%
S 12
 
0.7%
A 10
 
0.6%
R 10
 
0.6%
H 10
 
0.6%
Other values (15) 56
 
3.3%

일시수용능력인원
Real number (ℝ)

Distinct163
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean48.4005
Minimum0
Maximum1000
Zeros40
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile15
Q126
median40
Q360
95-th percentile100
Maximum1000
Range1000
Interquartile range (IQR)34

Descriptive statistics

Standard deviation46.030335
Coefficient of variation (CV)0.95103016
Kurtosis217.07321
Mean48.4005
Median Absolute Deviation (MAD)15
Skewness11.479765
Sum484005
Variance2118.7918
MonotonicityNot monotonic
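
The Median Absolute Deviation listed above is a robust spread measure: the median of the absolute distances from the median. A minimal sketch, assuming the unscaled form (no normal-consistency constant); the function name and the toy capacities are hypothetical and not taken from the dataset.

```python
from statistics import median

def median_absolute_deviation(values):
    """Unscaled MAD: median of absolute deviations from the median."""
    values = list(values)
    center = median(values)
    return median(abs(x - center) for x in values)

# Hypothetical capacities with one extreme value.
toy = [20, 30, 30, 40, 1000]
print(median_absolute_deviation(toy))
```

Unlike the standard deviation, the MAD is barely affected by a handful of extreme values, which is why it stays small (15) for this variable even though the maximum is 1000.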
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
30 807
 
8.1%
40 639
 
6.4%
20 584
 
5.8%
50 551
 
5.5%
60 415
 
4.2%
25 309
 
3.1%
15 278
 
2.8%
80 220
 
2.2%
24 218
 
2.2%
32 216
 
2.2%
Other values (153) 5763
57.6%
ValueCountFrequency (%)
0 40
 
0.4%
1 2
 
< 0.1%
6 12
 
0.1%
7 28
 
0.3%
8 36
 
0.4%
9 29
 
0.3%
10 122
1.2%
11 38
 
0.4%
12 72
0.7%
13 36
 
0.4%
ValueCountFrequency (%)
1000 10
0.1%
840 4
 
< 0.1%
362 2
 
< 0.1%
300 2
 
< 0.1%
298 2
 
< 0.1%
260 2
 
< 0.1%
240 3
 
< 0.1%
237 5
0.1%
227 1
 
< 0.1%
223 2
 
< 0.1%

정원합계
Real number (ℝ)

ZEROS 

Distinct329
Distinct (%)3.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean222.3927
Minimum0
Maximum3280
Zeros320
Zeros (%)3.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile21
Q160
median100
Q3200
95-th percentile645
Maximum3280
Range3280
Interquartile range (IQR)140

Descriptive statistics

Standard deviation426.22596
Coefficient of variation (CV)1.9165465
Kurtosis27.576744
Mean222.3927
Median Absolute Deviation (MAD)50
Skewness5.0074967
Sum2223927
Variance181668.57
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
60 477
 
4.8%
80 417
 
4.2%
100 373
 
3.7%
90 331
 
3.3%
0 320
 
3.2%
120 302
 
3.0%
150 243
 
2.4%
70 211
 
2.1%
40 194
 
1.9%
30 176
 
1.8%
Other values (319) 6956
69.6%
ValueCountFrequency (%)
0 320
3.2%
4 4
 
< 0.1%
6 3
 
< 0.1%
7 4
 
< 0.1%
8 4
 
< 0.1%
9 7
 
0.1%
10 6
 
0.1%
12 6
 
0.1%
13 2
 
< 0.1%
14 5
 
0.1%
ValueCountFrequency (%)
3280 59
0.6%
2860 19
 
0.2%
2806 30
0.3%
2520 23
 
0.2%
2470 13
 
0.1%
2410 20
 
0.2%
2254 29
0.3%
2173 39
0.4%
1824 6
 
0.1%
1686 6
 
0.1%
Distinct72
Distinct (%)0.7%
Missing8
Missing (%)0.1%
Memory size156.2 KiB

Length

Max length24
Median length2
Mean length3.8705965
Min length2

Characters and Unicode

Total characters38675
Distinct characters125
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)0.1%

Sample

1st row이·미용
2nd row독서실(유아/초·중·고)
3rd row보습
4th row보습
5th row보습
ValueCountFrequency (%)
보습 5762
57.7%
음악 1179
 
11.8%
실용외국어(유아/초·중·고 634
 
6.3%
미술 517
 
5.2%
보습·논술 262
 
2.6%
이·미용 219
 
2.2%
독서실(유아/초·중·고 198
 
2.0%
무용 188
 
1.9%
식음료품(바리스타,소믈리에 164
 
1.6%
컴퓨터(정보처리,통신기기,인터넷,소프트웨어 128
 
1.3%
Other values (62) 741
 
7.4%

Most occurring characters

ValueCountFrequency (%)
6263
 
16.2%
6024
 
15.6%
· 2151
 
5.6%
1387
 
3.6%
) 1375
 
3.6%
( 1375
 
3.6%
1264
 
3.3%
1138
 
2.9%
882
 
2.3%
861
 
2.2%
Other values (115) 15955
41.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 32292
83.5%
Other Punctuation 3633
 
9.4%
Close Punctuation 1375
 
3.6%
Open Punctuation 1375
 
3.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6263
19.4%
6024
18.7%
1387
 
4.3%
1264
 
3.9%
1138
 
3.5%
882
 
2.7%
861
 
2.7%
841
 
2.6%
834
 
2.6%
832
 
2.6%
Other values (110) 11966
37.1%
Other Punctuation
ValueCountFrequency (%)
· 2151
59.2%
/ 832
 
22.9%
, 650
 
17.9%
Close Punctuation
ValueCountFrequency (%)
) 1375
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1375
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 32292
83.5%
Common 6383
 
16.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6263
19.4%
6024
18.7%
1387
 
4.3%
1264
 
3.9%
1138
 
3.5%
882
 
2.7%
861
 
2.7%
841
 
2.6%
834
 
2.6%
832
 
2.6%
Other values (110) 11966
37.1%
Common
ValueCountFrequency (%)
· 2151
33.7%
) 1375
21.5%
( 1375
21.5%
/ 832
 
13.0%
, 650
 
10.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 32292
83.5%
ASCII 4232
 
10.9%
None 2151
 
5.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
6263
19.4%
6024
18.7%
1387
 
4.3%
1264
 
3.9%
1138
 
3.5%
882
 
2.7%
861
 
2.7%
841
 
2.6%
834
 
2.6%
832
 
2.6%
Other values (110) 11966
37.1%
None
ValueCountFrequency (%)
· 2151
100.0%
ASCII
ValueCountFrequency (%)
) 1375
32.5%
( 1375
32.5%
/ 832
19.7%
, 650
15.4%
Distinct5163
Distinct (%)51.7%
Missing8
Missing (%)0.1%
Memory size156.2 KiB

Length

Max length40
Median length36
Mean length6.6723379
Min length1

Characters and Unicode

Total characters66670
Distinct characters576
Distinct categories12 ?
Distinct scripts4 ?
Distinct blocks6 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4140 ?
Unique (%)41.4%

Sample

1st row바디페인팅
2nd row일반실(학생)-일
3rd row중등영어
4th row초등(국과사)
5th row중등과학
ValueCountFrequency (%)
중등수학 214
 
2.0%
중등영어 180
 
1.7%
초등수학 180
 
1.7%
고등수학 143
 
1.3%
초등영어 141
 
1.3%
고등영어 133
 
1.2%
영어 92
 
0.9%
수학 91
 
0.8%
초등 77
 
0.7%
피아노 70
 
0.7%
Other values (5063) 9388
87.7%

Most occurring characters

ValueCountFrequency (%)
5239
 
7.9%
) 4464
 
6.7%
( 4453
 
6.7%
3116
 
4.7%
2984
 
4.5%
2867
 
4.3%
2748
 
4.1%
2716
 
4.1%
2412
 
3.6%
2210
 
3.3%
Other values (566) 33461
50.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 49808
74.7%
Close Punctuation 4466
 
6.7%
Open Punctuation 4455
 
6.7%
Uppercase Letter 2587
 
3.9%
Decimal Number 2411
 
3.6%
Other Punctuation 1469
 
2.2%
Space Separator 780
 
1.2%
Lowercase Letter 375
 
0.6%
Dash Punctuation 147
 
0.2%
Math Symbol 94
 
0.1%
Other values (2) 78
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5239
 
10.5%
3116
 
6.3%
2984
 
6.0%
2867
 
5.8%
2748
 
5.5%
2716
 
5.5%
2412
 
4.8%
2210
 
4.4%
1508
 
3.0%
1252
 
2.5%
Other values (486) 22756
45.7%
Uppercase Letter
ValueCountFrequency (%)
A 834
32.2%
B 706
27.3%
C 344
13.3%
D 157
 
6.1%
E 80
 
3.1%
T 63
 
2.4%
I 59
 
2.3%
S 57
 
2.2%
F 48
 
1.9%
L 36
 
1.4%
Other values (14) 203
 
7.8%
Lowercase Letter
ValueCountFrequency (%)
i 44
 
11.7%
t 33
 
8.8%
l 30
 
8.0%
e 28
 
7.5%
a 28
 
7.5%
c 22
 
5.9%
o 21
 
5.6%
m 20
 
5.3%
s 18
 
4.8%
n 17
 
4.5%
Other values (12) 114
30.4%
Decimal Number
ValueCountFrequency (%)
1 820
34.0%
2 703
29.2%
3 379
15.7%
4 166
 
6.9%
5 117
 
4.9%
6 83
 
3.4%
0 77
 
3.2%
7 27
 
1.1%
8 27
 
1.1%
9 12
 
0.5%
Other Punctuation
ValueCountFrequency (%)
, 1288
87.7%
. 84
 
5.7%
/ 59
 
4.0%
& 17
 
1.2%
: 13
 
0.9%
· 7
 
0.5%
1
 
0.1%
Letter Number
ValueCountFrequency (%)
39
52.7%
24
32.4%
5
 
6.8%
5
 
6.8%
1
 
1.4%
Math Symbol
ValueCountFrequency (%)
+ 53
56.4%
~ 37
39.4%
< 2
 
2.1%
> 2
 
2.1%
Close Punctuation
ValueCountFrequency (%)
) 4464
> 99.9%
] 2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 4453
> 99.9%
[ 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
779
99.9%
  1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 147
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 49806
74.7%
Common 13826
 
20.7%
Latin 3036
 
4.6%
Han 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5239
 
10.5%
3116
 
6.3%
2984
 
6.0%
2867
 
5.8%
2748
 
5.5%
2716
 
5.5%
2412
 
4.8%
2210
 
4.4%
1508
 
3.0%
1252
 
2.5%
Other values (485) 22754
45.7%
Latin
ValueCountFrequency (%)
A 834
27.5%
B 706
23.3%
C 344
11.3%
D 157
 
5.2%
E 80
 
2.6%
T 63
 
2.1%
I 59
 
1.9%
S 57
 
1.9%
F 48
 
1.6%
i 44
 
1.4%
Other values (41) 644
21.2%
Common
ValueCountFrequency (%)
) 4464
32.3%
( 4453
32.2%
, 1288
 
9.3%
1 820
 
5.9%
779
 
5.6%
2 703
 
5.1%
3 379
 
2.7%
4 166
 
1.2%
- 147
 
1.1%
5 117
 
0.8%
Other values (19) 510
 
3.7%
Han
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 49804
74.7%
ASCII 16779
 
25.2%
Number Forms 74
 
0.1%
None 9
 
< 0.1%
CJK 2
 
< 0.1%
Compat Jamo 2
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5239
 
10.5%
3116
 
6.3%
2984
 
6.0%
2867
 
5.8%
2748
 
5.5%
2716
 
5.5%
2412
 
4.8%
2210
 
4.4%
1508
 
3.0%
1252
 
2.5%
Other values (483) 22752
45.7%
ASCII
ValueCountFrequency (%)
) 4464
26.6%
( 4453
26.5%
, 1288
 
7.7%
A 834
 
5.0%
1 820
 
4.9%
779
 
4.6%
B 706
 
4.2%
2 703
 
4.2%
3 379
 
2.3%
C 344
 
2.1%
Other values (62) 2009
12.0%
Number Forms
ValueCountFrequency (%)
39
52.7%
24
32.4%
5
 
6.8%
5
 
6.8%
1
 
1.4%
None
ValueCountFrequency (%)
· 7
77.8%
  1
 
11.1%
1
 
11.1%
CJK
ValueCountFrequency (%)
2
100.0%
Compat Jamo
ValueCountFrequency (%)
1
50.0%
1
50.0%

정원
Real number (ℝ)

SKEWED 

Distinct97
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.6432
Minimum0
Maximum35000
Zeros42
Zeros (%)0.4%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile4
Q18
median10
Q320
95-th percentile40
Maximum35000
Range35000
Interquartile range (IQR)12

Descriptive statistics

Standard deviation353.69582
Coefficient of variation (CV)18.006018
Kurtosis9574.6653
Mean19.6432
Median Absolute Deviation (MAD)5
Skewness97.043492
Sum196432
Variance125100.73
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 2458
24.6%
20 1182
11.8%
15 850
 
8.5%
8 821
 
8.2%
5 699
 
7.0%
6 466
 
4.7%
12 436
 
4.4%
30 418
 
4.2%
25 215
 
2.1%
9 193
 
1.9%
Other values (87) 2262
22.6%
ValueCountFrequency (%)
0 42
 
0.4%
1 101
 
1.0%
2 139
 
1.4%
3 157
 
1.6%
4 182
 
1.8%
5 699
7.0%
6 466
4.7%
7 165
 
1.7%
8 821
8.2%
9 193
 
1.9%
ValueCountFrequency (%)
35000 1
 
< 0.1%
5000 1
 
< 0.1%
300 3
< 0.1%
207 4
< 0.1%
200 1
 
< 0.1%
180 1
 
< 0.1%
170 1
 
< 0.1%
150 2
< 0.1%
146 1
 
< 0.1%
120 4
< 0.1%

모의고사비
Real number (ℝ)

SKEWED  ZEROS 

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean139.3
Minimum0
Maximum450000
Zeros9978
Zeros (%)99.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum450000
Range450000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation6175.3631
Coefficient of variation (CV)44.331393
Kurtosis3674.4898
Mean139.3
Median Absolute Deviation (MAD)0
Skewness58.209024
Sum1393000
Variance38135109
MonotonicityNot monotonic
Histogram with fixed size bins (bins=13)
Common values

Value               Count   Frequency (%)
0                   9978    99.8%
20000               6       0.1%
10000               4       < 0.1%
5000                3       < 0.1%
7000                1       < 0.1%
15000               1       < 0.1%
25000               1       < 0.1%
13000               1       < 0.1%
300000              1       < 0.1%
8000                1       < 0.1%
Other values (3)    3       < 0.1%

Minimum 10 values

Value               Count   Frequency (%)
0                   9978    99.8%
5000                3       < 0.1%
7000                1       < 0.1%
8000                1       < 0.1%
10000               4       < 0.1%
13000               1       < 0.1%
15000               1       < 0.1%
20000               6       0.1%
25000               1       < 0.1%
150000              1       < 0.1%

Maximum 10 values

Value               Count   Frequency (%)
450000              1       < 0.1%
300000              1       < 0.1%
250000              1       < 0.1%
150000              1       < 0.1%
25000               1       < 0.1%
20000               6       0.1%
15000               1       < 0.1%
13000               1       < 0.1%
10000               4       < 0.1%
8000                1       < 0.1%
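
The minimum and maximum value listings for 모의고사비 together cover all 13 distinct values, so the column can be rebuilt from them and the summary figures rechecked. The sketch below does this in plain Python. The adjusted Fisher-Pearson skewness formula is an assumption about how the report computes skewness, and the names `FREQ`, `expand` and `adjusted_skewness` are illustrative only.

```python
import math
from statistics import mean, stdev

# Value -> count, read off the minimum and maximum value listings for 모의고사비.
# Together they cover all 13 distinct values of the column.
FREQ = {
    0: 9978, 5000: 3, 7000: 1, 8000: 1, 10000: 4, 13000: 1, 15000: 1,
    20000: 6, 25000: 1, 150000: 1, 250000: 1, 300000: 1, 450000: 1,
}


def expand(freq):
    """Rebuild the column as a flat list (row order does not matter here)."""
    return [value for value, count in freq.items() for _ in range(count)]


def adjusted_skewness(xs):
    """Adjusted Fisher-Pearson skewness (assumed to match the report)."""
    n = len(xs)
    mu = sum(xs) / n
    m2 = sum((x - mu) ** 2 for x in xs) / n  # second central moment
    m3 = sum((x - mu) ** 3 for x in xs) / n  # third central moment
    return (m3 / m2 ** 1.5) * math.sqrt(n * (n - 1)) / (n - 2)


values = expand(FREQ)
summary = {
    "n": len(values),
    "sum": sum(values),
    "mean": mean(values),
    "std": stdev(values),                 # sample std (n - 1 denominator)
    "cv": stdev(values) / mean(values),   # coefficient of variation
    "zeros_pct": 100 * FREQ[0] / len(values),
    "skewness": adjusted_skewness(values),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

Under these assumptions the recomputed mean, sample standard deviation and CV agree with the printed 139.3, 6175.3631 and 44.331393 up to rounding, which suggests the report uses the n - 1 denominator for the standard deviation.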

재료비 (materials fee)
Real number (ℝ)

SKEWED  ZEROS 

Distinct: 83
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3392.24
Minimum: 0
Maximum: 1600000
Zeros: 9833
Zeros (%): 98.3%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 1600000
Range: 1600000
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 50482.671
Coefficient of variation (CV): 14.88181
Kurtosis: 509.4759
Mean: 3392.24
Median Absolute Deviation (MAD): 0
Skewness: 21.161223
Sum: 33922400
Variance: 2.5485 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count   Frequency (%)
0                   9833    98.3%
10000               14      0.1%
300000              11      0.1%
20000               11      0.1%
200000              5       0.1%
5000                5       0.1%
150000              5       0.1%
27000               5       0.1%
64000               4       < 0.1%
15000               4       < 0.1%
Other values (73)   103     1.0%

Minimum 10 values

Value               Count   Frequency (%)
0                   9833    98.3%
1000                2       < 0.1%
3000                1       < 0.1%
5000                5       0.1%
6000                3       < 0.1%
8000                1       < 0.1%
10000               14      0.1%
10400               1       < 0.1%
12000               1       < 0.1%
14000               1       < 0.1%

Maximum 10 values

Value               Count   Frequency (%)
1600000             1       < 0.1%
1500000             2       < 0.1%
1360000             1       < 0.1%
1260000             1       < 0.1%
1200000             1       < 0.1%
1050000             1       < 0.1%
1000000             4       < 0.1%
950000              1       < 0.1%
900000              1       < 0.1%
800000              2       < 0.1%

급식비 (meal fee)
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value               Count
0                   9997
20000               2
100000              1

Length

Max length: 6
Median length: 1
Mean length: 1.0013
Min length: 1

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value               Count   Frequency (%)
0                   9997    > 99.9%
20000               2       < 0.1%
100000              1       < 0.1%

Length

Histogram of lengths of the category


기숙사비 (dormitory fee)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value               Count
0                   10000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value               Count   Frequency (%)
0                   10000   100.0%

Length

Histogram of lengths of the category


차량비 (vehicle fee)
Real number (ℝ)

SKEWED  ZEROS 

Distinct: 13
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 153.1
Minimum: 0
Maximum: 100000
Zeros: 9936
Zeros (%): 99.4%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 100000
Range: 100000
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 2328.2617
Coefficient of variation (CV): 15.207457
Kurtosis: 587.67323
Mean: 153.1
Median Absolute Deviation (MAD): 0
Skewness: 21.349918
Sum: 1531000
Variance: 5420802.5
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=13)
Common values

Value               Count   Frequency (%)
0                   9936    99.4%
10000               16      0.2%
20000               13      0.1%
50000               10      0.1%
12500               8       0.1%
25000               6       0.1%
30000               3       < 0.1%
28000               2       < 0.1%
12000               2       < 0.1%
16000               1       < 0.1%
Other values (3)    3       < 0.1%

Minimum 10 values

Value               Count   Frequency (%)
0                   9936    99.4%
10000               16      0.2%
12000               2       < 0.1%
12500               8       0.1%
16000               1       < 0.1%
20000               13      0.1%
25000               6       0.1%
28000               2       < 0.1%
30000               3       < 0.1%
35000               1       < 0.1%

Maximum 10 values

Value               Count   Frequency (%)
100000              1       < 0.1%
50000               10      0.1%
40000               1       < 0.1%
35000               1       < 0.1%
30000               3       < 0.1%
28000               2       < 0.1%
25000               6       0.1%
20000               13      0.1%
16000               1       < 0.1%
12500               8       0.1%

피복비 (clothing fee)
Categorical

IMBALANCE 

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Value               Count
0                   9997
50                  1
180000              1
15                  1

Length

Max length: 6
Median length: 1
Mean length: 1.0007
Min length: 1

Unique

Unique: 3
Unique (%): < 0.1%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value               Count   Frequency (%)
0                   9997    > 99.9%
50                  1       < 0.1%
180000              1       < 0.1%
15                  1       < 0.1%

Length

Histogram of lengths of the category


기타경비합계 (total other expenses)
Real number (ℝ)

SKEWED  ZEROS 

Distinct: 90
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3725.1465
Minimum: 0
Maximum: 1600000
Zeros: 9752
Zeros (%): 97.5%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 1600000
Range: 1600000
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 50988.556
Coefficient of variation (CV): 13.687665
Kurtosis: 489.73739
Mean: 3725.1465
Median Absolute Deviation (MAD): 0
Skewness: 20.636727
Sum: 37251465
Variance: 2.5998328 × 10^9
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count   Frequency (%)
0                   9752    97.5%
10000               29      0.3%
20000               26      0.3%
50000               14      0.1%
300000              12      0.1%
12500               8       0.1%
5000                7       0.1%
25000               7       0.1%
30000               7       0.1%
16000               7       0.1%
Other values (80)   131     1.3%

Minimum 10 values

Value               Count   Frequency (%)
0                   9752    97.5%
15                  1       < 0.1%
50                  1       < 0.1%
1000                2       < 0.1%
3000                1       < 0.1%
5000                7       0.1%
8000                2       < 0.1%
10000               29      0.3%
10400               1       < 0.1%
12000               3       < 0.1%

Maximum 10 values

Value               Count   Frequency (%)
1600000             1       < 0.1%
1500000             2       < 0.1%
1360000             1       < 0.1%
1260000             1       < 0.1%
1200000             1       < 0.1%
1050000             1       < 0.1%
1000000             4       < 0.1%
950000              1       < 0.1%
900000              1       < 0.1%
800000              2       < 0.1%
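
The column name suggests that 기타경비합계 is the row-wise total of the six fee columns (모의고사비, 재료비, 급식비, 기숙사비, 차량비, 피복비). The report does not state this, so treat it as an assumption. Its smallest non-zero values (15 and 50) match 피복비 and its largest values match 재료비, which is at least consistent with that reading. The sketch below compares the column sums printed in this report. The sums for the two categorical columns are rebuilt from their value counts, and the variable names are illustrative.

```python
# Column sums as printed in the report. 급식비 and 피복비 are profiled as
# categorical, so their sums are rebuilt from the value counts shown above.
component_sums = {
    "모의고사비": 1_393_000,
    "재료비": 33_922_400,
    "급식비": 2 * 20_000 + 1 * 100_000,
    "기숙사비": 0,
    "차량비": 1_531_000,
    "피복비": 50 + 180_000 + 15,
}

# Printed "Sum" of 기타경비합계.
reported_total = 37_251_465

component_total = sum(component_sums.values())
gap = reported_total - component_total

print(f"sum of components: {component_total:,}")
print(f"reported total:    {reported_total:,}")
print(f"gap:               {gap:,}")
```

If the assumption holds, a non-zero gap would mean that some rows carry a total that differs from the sum of their components. Checking this row by row on the raw file would show which records are affected.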

총교습비 (total tuition fee)
Real number (ℝ)

ZEROS 

Distinct: 428
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 186862.04
Minimum: 0
Maximum: 8050680
Zeros: 2095
Zeros (%): 20.9%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 40000
Median: 150000
Q3: 250000
95-th percentile: 480000
Maximum: 8050680
Range: 8050680
Interquartile range (IQR): 210000
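
One common way to read these quartiles is Tukey's 1.5 × IQR rule for flagging outliers. The report itself does not apply this rule; it is brought in here only as an illustration, and `tukey_fences` is a hypothetical helper name.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) outlier fences from the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles of 총교습비 as printed in the quantile statistics.
q1, q3 = 40_000, 250_000
lower, upper = tukey_fences(q1, q3)

print(f"IQR:         {q3 - q1}")
print(f"lower fence: {lower}")
print(f"upper fence: {upper}")
```

With the printed quartiles the upper fence is 565000, above the 95-th percentile (480000) and far below the maximum (8050680). Under this rule fewer than 5% of rows would be flagged as high outliers.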

Descriptive statistics

Standard deviation: 269827.91
Coefficient of variation (CV): 1.4439954
Kurtosis: 200.95839
Mean: 186862.04
Median Absolute Deviation (MAD): 100000
Skewness: 10.216516
Sum: 1.8686204 × 10^9
Variance: 7.2807103 × 10^10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count   Frequency (%)
0                   2095    20.9%
200000              593     5.9%
150000              590     5.9%
250000              555     5.5%
300000              465     4.7%
180000              298     3.0%
160000              292     2.9%
100000              260     2.6%
120000              260     2.6%
140000              260     2.6%
Other values (418)  4332    43.3%

Minimum 10 values

Value               Count   Frequency (%)
0                   2095    20.9%
1                   1       < 0.1%
110                 1       < 0.1%
1560                1       < 0.1%
3500                1       < 0.1%
4000                1       < 0.1%
5000                16      0.2%
5500                3       < 0.1%
6000                9       0.1%
7000                16      0.2%

Maximum 10 values

Value               Count   Frequency (%)
8050680             1       < 0.1%
8000000             1       < 0.1%
5000000             3       < 0.1%
3500000             1       < 0.1%
3300000             1       < 0.1%
3200000             1       < 0.1%
3199600             1       < 0.1%
3192000             1       < 0.1%
3000000             9       0.1%
2780400             1       < 0.1%

총교습비(시간당) (total tuition fee per hour)
Real number (ℝ)

SKEWED  ZEROS 

Distinct: 1214
Distinct (%): 12.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 9002.6018
Minimum: 0
Maximum: 800000
Zeros: 2233
Zeros (%): 22.3%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 4167
Median: 8640
Q3: 10714
95-th percentile: 14225.15
Maximum: 800000
Range: 800000
Interquartile range (IQR): 6547

Descriptive statistics

Standard deviation: 16829.736
Coefficient of variation (CV): 1.8694302
Kurtosis: 839.25093
Mean: 9002.6018
Median Absolute Deviation (MAD): 2588
Skewness: 22.358676
Sum: 90026018
Variance: 2.8324 × 10^8
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value               Count   Frequency (%)
0                   2233    22.3%
10000               381     3.8%
7500                281     2.8%
12500               241     2.4%
8333                217     2.2%
9000                210     2.1%
9375                167     1.7%
10714               153     1.5%
10417               145     1.5%
11250               140     1.4%
Other values (1204) 5832    58.3%

Minimum 10 values

Value               Count   Frequency (%)
0                   2233    22.3%
6                   1       < 0.1%
60                  2       < 0.1%
97                  1       < 0.1%
111                 3       < 0.1%
125                 4       < 0.1%
132                 1       < 0.1%
139                 8       0.1%
140                 1       < 0.1%
146                 2       < 0.1%

Maximum 10 values

Value               Count   Frequency (%)
800000              1       < 0.1%
700000              1       < 0.1%
375000              1       < 0.1%
312500              1       < 0.1%
300000              1       < 0.1%
275000              1       < 0.1%
225000              1       < 0.1%
200000              3       < 0.1%
187500              3       < 0.1%
166667              2       < 0.1%

Sample

지역 | 등록번호 | 학원명 | 학원종류 | 분야구분 | 학원주소 | 우편번호 | 등록상태 | 설립자-성명 | 일시수용능력인원 | 정원합계 | 교습과정 | 교습과목(반) | 정원 | 모의고사비 | 재료비 | 급식비 | 기숙사비 | 차량비 | 피복비 | 기타경비합계 | 총교습비 | 총교습비(시간당)
17172구미시2458아뜰리에뷰티아카데미학원평생직업교육학원직업기술경상북도 구미시 구미중앙로 77, , 3층 일부 (원평동)39220개원배승연37102이·미용바디페인팅80300000000030000000
1520포항시등1044문덕시몬독서실학교교과교습학원독서실<NA>790900개원이옥분6663독서실(유아/초·중·고)일반실(학생)-일800000007000292
19991구미시2445초지학원학교교과교습학원입시.검정 및 보습경상북도 구미시 인동38길 9-16, , 204호 (구평동, 종합프라자)39449개원박환선3681보습중등영어9000000000
13049구미시1479가온학원학교교과교습학원입시.검정 및 보습경상북도 구미시 상사서로 42, , 3층 (상모동)39339개원노태문30178보습초등(국과사)1000000001000007500
5191포항시2496장성해법수학학원학교교과교습학원입시.검정 및 보습경상북도 포항시 북구 새천년대로1249번길 10-6, , 3층 일부 (장성동)37585개원최정아2060보습중등과학10000000000
29892칠곡군556비스타영어학원학교교과교습학원입시.검정 및 보습경상북도 칠곡군 왜관읍 평장길 42, , 2층 (왜관읍)39899개원윤유정8019보습고등영어8000000028000011667
8089경주시17-Jan메타수학영어학원학교교과교습학원입시.검정 및 보습경상북도 경주시 황성로 35-3, , 광림프라자 301호 (황성동)38084개원조한울3348보습과학(중)6000000020000011538
21435영주시675글로벌독서실학교교과교습학원독서실경상북도 영주시 남간로 8, , (2층, 3층) (휴천동)36162개원박중광113113독서실(유아/초·중·고)독서실10000000005500688
13395구미시799구미공인중개사학원법학원평생직업교육학원인문사회(대)경상북도 구미시 구미중앙로5길 5, , 3층 일부 (원평동)39221개원엄기송60540성인고시주택관리사반(동영상)300000000650000
9225경주시23-Jun제니스영어학원학교교과교습학원입시.검정 및 보습경상북도 경주시 용황로 91-34, , 2층 (용강동)38069개원김민정20100보습초등영어C20000000030000012500
지역 | 등록번호 | 학원명 | 학원종류 | 분야구분 | 학원주소 | 우편번호 | 등록상태 | 설립자-성명 | 일시수용능력인원 | 정원합계 | 교습과정 | 교습과목(반) | 정원 | 모의고사비 | 재료비 | 급식비 | 기숙사비 | 차량비 | 피복비 | 기타경비합계 | 총교습비 | 총교습비(시간당)
22630영천시181라이크이스턴외국어학원학교교과교습학원입시.검정 및 보습경상북도 영천시 호국로 58, (야사동)38860개원김대한130290보습영어중등(정규반)22000000024000010909
18727구미시2776윤선생우리집앞영어교실구미원호학원학교교과교습학원입시.검정 및 보습경상북도 구미시 고아읍 문장로22길 38, , 302호 (고아읍, 한누리타운2차)39148개원김윤경28168보습고등국어14000000030000014400
13219구미시2558곰쌤학원학교교과교습학원입시.검정 및 보습경상북도 구미시 산동읍 신당1로2길 15, , 305호, 407호, 408호, 409호 (산동읍)39464개원송혜경78226보습초등(수학)5000000020000010000
25207경산시732눈높이러닝센터옥산1지구학원학교교과교습학원입시.검정 및 보습경상북도 경산시 성암로 42, , 4층 (옥산동)38680개원주식회사 대교80645보습초등주제논술15000000000
1910포항시2113비오비수학학원학교교과교습학원입시.검정 및 보습경상북도 포항시 북구 대곡로 29, , 4층 (두호동)791804개원이미향3880보습수학(고등C)5000000040000013793
19591구미시2531지수학학원학교교과교습학원입시.검정 및 보습경상북도 구미시 인동36길 17, , 4층 (구평동, 보람빌딩)39449개원김지수3880보습수학(초)210000000020000010000
14454구미시1678눈높이러닝센터상모학원학교교과교습학원입시.검정 및 보습경상북도 구미시 상모로9길 1, , 2층 (상모동)39335개원주식회사 대교252806보습스쿨수학(초)61000000000
24784경산시704권샘입시어학원학교교과교습학원종합(대)경상북도 경산시 진량읍 봉황길 29, , 2층 (진량읍)38450개원김민경104263보습특강(고)20000000000
13439구미시1229구미뮤지컬실용음악학원학교교과교습학원예능(대)경상북도 구미시 여헌로14길 2, 2층39434개원나한아40180음악시창과청음1000000003400010200
28629의성군31종로아인스학원학교교과교습학원입시.검정 및 보습경상북도 의성군 의성읍 후죽4길 3, (의성읍)37337개원손미향133135보습사회,과학150000000600007500

Duplicate rows

Most frequently occurring

지역 | 등록번호 | 학원명 | 학원종류 | 분야구분 | 학원주소 | 우편번호 | 등록상태 | 설립자-성명 | 일시수용능력인원 | 정원합계 | 교습과정 | 교습과목(반) | 정원 | 모의고사비 | 재료비 | 급식비 | 기숙사비 | 차량비 | 피복비 | 기타경비합계 | 총교습비 | 총교습비(시간당) | # duplicates
0경산시495공부나누기입시학원학교교과교습학원입시.검정 및 보습경상북도 경산시 하양읍 하양로 82, , 3층 (하양읍)38436개원박필자3935보습국어,영어,수학9000000020000002
1경산시976수학의힘영어의힘학원학교교과교습학원입시.검정 및 보습경상북도 경산시 대학로12길 30, , 2층(201호) (정평동)38657개원박선영34212보습초등영어20000000017000085002
2구미시1021최강학원학교교과교습학원입시.검정 및 보습경상북도 구미시 옥계북로 40, 601호39185개원최수영7050보습과학1000000015000002
3구미시1021최강학원학교교과교습학원입시.검정 및 보습경상북도 구미시 옥계북로 40, 601호39185개원최수영7050보습사회1000000015000002
4구미시981이상욱논리속독상모학원학교교과교습학원입시.검정 및 보습경상북도 구미시 상사동로 77, (임은동)39335개원김순화40130보습영어.수학300000005000002
5봉화군63하이엔드 영수학원학교교과교습학원입시.검정 및 보습경상북도 봉화군 봉화읍 내성로 88, (봉화읍)36238개원김재연102234보습수학110000000002
6안동시450드림아이음악학원학교교과교습학원예능(대)경상북도 안동시 복주3길 23-16, , 3층 (옥동)36660개원임경미30106보습수학3000000010000002
7안동시478에스케이씨(SKC)영어학원학교교과교습학원입시.검정 및 보습경상북도 안동시 경동로 514, (태화동)36668개원신기철6464보습영어3000000025000002
8영천시197포인트입시학원학교교과교습학원입시.검정 및 보습경상북도 영천시 망정2길 184, , 3층 (망정동)38831개원김태진65130미술미술500000008000002
9예천군23-Nov오늘,공부학원학교교과교습학원입시.검정 및 보습경상북도 예천군 호명면 양지3길 16, 301호, 302호36849개원안세윤1950보통교과영어100000000002
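
The duplicate listing above groups identical records and counts how often each occurs. A minimal sketch of that idea in plain Python follows. It is not the profiler's own implementation, and the rows are a four-column excerpt (지역, 등록번호, 학원명, 교습과목) taken from the listings in this report, not full records. `duplicate_groups` and `drop_duplicates` are hypothetical helper names.

```python
from collections import Counter

# Abbreviated rows: (지역, 등록번호, 학원명, 교습과목). Real records have 23
# columns; only four are kept here to keep the illustration short.
rows = [
    ("구미시", "1021", "최강학원", "과학"),
    ("구미시", "1021", "최강학원", "과학"),
    ("구미시", "1021", "최강학원", "사회"),
    ("구미시", "1021", "최강학원", "사회"),
    ("구미시", "2445", "초지학원", "중등영어"),
]


def duplicate_groups(records):
    """Map each record that occurs more than once to its occurrence count."""
    counts = Counter(records)
    return {record: n for record, n in counts.items() if n > 1}


def drop_duplicates(records):
    """Keep the first occurrence of each record, preserving order."""
    return list(dict.fromkeys(records))


groups = duplicate_groups(rows)
unique_rows = drop_duplicates(rows)

for record, n in groups.items():
    print(n, record)
print(len(unique_rows), "rows after de-duplication")
```

Records must match on every compared column to count as duplicates, which is why the 과학 and 사회 rows of the same academy form two separate groups.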