Overview

Dataset statistics

Number of variables12
Number of observations51
Missing cells15
Missing cells (%)2.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.0 KiB
Average record size in memory100.6 B

Variable types

Text5
Categorical5
Numeric2

Dataset

Description울진교육지원청의 관내 학원 및 교습소 현황 데이터이며, 학원명, 학원종류, 분야구분, 학원주소, 우편번호, 등록일, 등록상태, 설립자-성명, 전화번호, 교습계열, 교습과정, 강사수 등의 항목이 있습니다.
Author경상북도교육청 경상북도울진교육지원청
URLhttps://www.data.go.kr/data/3069581/fileData.do

Alerts

교습계열 is highly overall correlated with 학원종류 and 2 other fieldsHigh correlation
분야구분 is highly overall correlated with 학원종류 and 2 other fieldsHigh correlation
교습과정 is highly overall correlated with 학원종류 and 2 other fieldsHigh correlation
학원종류 is highly overall correlated with 분야구분 and 2 other fieldsHigh correlation
학원종류 is highly imbalanced (86.1%)Imbalance
등록상태 is highly imbalanced (76.1%)Imbalance
전화번호 has 15 (29.4%) missing valuesMissing
학원명 has unique valuesUnique
설립자-성명 has unique valuesUnique
강사수 has 2 (3.9%) zerosZeros

Reproduction

Analysis started2023-12-12 06:22:41.934450
Analysis finished2023-12-12 06:22:43.776991
Duration1.84 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

학원명
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T15:22:43.953597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length16
Mean length7.9411765
Min length4

Characters and Unicode

Total characters405
Distinct characters162
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row경북학원
2nd row동아스쿨학원
3rd row한별학원
4th row굿모닝스터디학원
5th row정철영어주니어울진어학원
ValueCountFrequency (%)
경북학원 1
 
1.8%
재능스스로학습센터 1
 
1.8%
하나학원 1
 
1.8%
인재학원 1
 
1.8%
울진컴퓨터학원 1
 
1.8%
해온국어논술학원 1
 
1.8%
교육그룹강한학원 1
 
1.8%
어썸잉글리쉬학원 1
 
1.8%
두란노피아노학원 1
 
1.8%
에듀퀘스트(eduqwest)외국어학원 1
 
1.8%
Other values (45) 45
81.8%
2023-12-12T15:22:44.322630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
52
 
12.8%
50
 
12.3%
14
 
3.5%
12
 
3.0%
10
 
2.5%
8
 
2.0%
7
 
1.7%
7
 
1.7%
6
 
1.5%
6
 
1.5%
Other values (152) 233
57.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 373
92.1%
Uppercase Letter 22
 
5.4%
Space Separator 4
 
1.0%
Close Punctuation 2
 
0.5%
Open Punctuation 2
 
0.5%
Lowercase Letter 1
 
0.2%
Math Symbol 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
 
13.9%
50
 
13.4%
14
 
3.8%
12
 
3.2%
10
 
2.7%
8
 
2.1%
7
 
1.9%
7
 
1.9%
6
 
1.6%
6
 
1.6%
Other values (132) 201
53.9%
Uppercase Letter
ValueCountFrequency (%)
S 2
 
9.1%
Z 2
 
9.1%
U 2
 
9.1%
E 2
 
9.1%
W 2
 
9.1%
D 2
 
9.1%
M 2
 
9.1%
G 1
 
4.5%
A 1
 
4.5%
K 1
 
4.5%
Other values (5) 5
22.7%
Space Separator
ValueCountFrequency (%)
4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Lowercase Letter
ValueCountFrequency (%)
n 1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 373
92.1%
Latin 23
 
5.7%
Common 9
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
 
13.9%
50
 
13.4%
14
 
3.8%
12
 
3.2%
10
 
2.7%
8
 
2.1%
7
 
1.9%
7
 
1.9%
6
 
1.6%
6
 
1.6%
Other values (132) 201
53.9%
Latin
ValueCountFrequency (%)
S 2
 
8.7%
Z 2
 
8.7%
U 2
 
8.7%
E 2
 
8.7%
W 2
 
8.7%
D 2
 
8.7%
M 2
 
8.7%
G 1
 
4.3%
A 1
 
4.3%
n 1
 
4.3%
Other values (6) 6
26.1%
Common
ValueCountFrequency (%)
4
44.4%
) 2
22.2%
( 2
22.2%
+ 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 373
92.1%
ASCII 32
 
7.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
52
 
13.9%
50
 
13.4%
14
 
3.8%
12
 
3.2%
10
 
2.7%
8
 
2.1%
7
 
1.9%
7
 
1.9%
6
 
1.6%
6
 
1.6%
Other values (132) 201
53.9%
ASCII
ValueCountFrequency (%)
4
 
12.5%
S 2
 
6.2%
Z 2
 
6.2%
) 2
 
6.2%
U 2
 
6.2%
E 2
 
6.2%
( 2
 
6.2%
W 2
 
6.2%
D 2
 
6.2%
M 2
 
6.2%
Other values (10) 10
31.2%

학원종류
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size540.0 B
학교교과교습학원
50 
평생직업교육학원
 
1

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique1 ?
Unique (%)2.0%

Sample

1st row학교교과교습학원
2nd row학교교과교습학원
3rd row학교교과교습학원
4th row학교교과교습학원
5th row학교교과교습학원

Common Values

ValueCountFrequency (%)
학교교과교습학원 50
98.0%
평생직업교육학원 1
 
2.0%

Length

2023-12-12T15:22:44.490036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:22:44.593664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학교교과교습학원 50
98.0%
평생직업교육학원 1
 
2.0%

분야구분
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)13.7%
Missing0
Missing (%)0.0%
Memory size540.0 B
입시.검정 및 보습
27 
예능(대)
12 
국제화
종합(대)
독서실
 
2
Other values (2)
 
2

Length

Max length10
Median length10
Mean length7.3529412
Min length3

Unique

Unique2 ?
Unique (%)3.9%

Sample

1st row입시.검정 및 보습
2nd row입시.검정 및 보습
3rd row입시.검정 및 보습
4th row입시.검정 및 보습
5th row국제화

Common Values

ValueCountFrequency (%)
입시.검정 및 보습 27
52.9%
예능(대) 12
23.5%
국제화 5
 
9.8%
종합(대) 3
 
5.9%
독서실 2
 
3.9%
직업기술 1
 
2.0%
기타(대) 1
 
2.0%

Length

2023-12-12T15:22:44.719366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:22:44.832919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
입시.검정 27
25.7%
27
25.7%
보습 27
25.7%
예능(대 12
11.4%
국제화 5
 
4.8%
종합(대 3
 
2.9%
독서실 2
 
1.9%
직업기술 1
 
1.0%
기타(대 1
 
1.0%
Distinct47
Distinct (%)92.2%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T15:22:45.099973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length41
Mean length35.313725
Min length25

Characters and Unicode

Total characters1801
Distinct characters123
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique44 ?
Unique (%)86.3%

Sample

1st row경상북도 울진군 울진읍 울진중앙로 159 , 2층 (울진읍)
2nd row경상북도 울진군 죽변면 죽변북로 37 (죽변면, 동아스쿨)
3rd row경상북도 울진군 울진읍 울진중앙로 142 , 142 (울진읍)
4th row경상북도 울진군 울진읍 읍내7길 10 (울진읍, 광명빌딩)
5th row경상북도 울진군 울진읍 읍내6길 12 (울진읍, 정철어학원울진캠퍼스)
ValueCountFrequency (%)
울진읍 64
15.8%
경상북도 51
 
12.6%
울진군 51
 
12.6%
28
 
6.9%
죽변면 14
 
3.5%
후포면 14
 
3.5%
2층 13
 
3.2%
울진중앙로 12
 
3.0%
읍내8길 11
 
2.7%
북면 10
 
2.5%
Other values (86) 137
33.8%
2023-12-12T15:22:45.591588image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
354
19.7%
134
 
7.4%
134
 
7.4%
83
 
4.6%
72
 
4.0%
, 63
 
3.5%
57
 
3.2%
52
 
2.9%
) 52
 
2.9%
( 52
 
2.9%
Other values (113) 748
41.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1070
59.4%
Space Separator 354
 
19.7%
Decimal Number 200
 
11.1%
Other Punctuation 63
 
3.5%
Close Punctuation 52
 
2.9%
Open Punctuation 52
 
2.9%
Dash Punctuation 10
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
134
 
12.5%
134
 
12.5%
83
 
7.8%
72
 
6.7%
57
 
5.3%
52
 
4.9%
51
 
4.8%
51
 
4.8%
38
 
3.6%
29
 
2.7%
Other values (98) 369
34.5%
Decimal Number
ValueCountFrequency (%)
2 41
20.5%
1 36
18.0%
5 21
10.5%
3 18
9.0%
4 18
9.0%
8 16
 
8.0%
0 15
 
7.5%
9 14
 
7.0%
7 12
 
6.0%
6 9
 
4.5%
Space Separator
ValueCountFrequency (%)
354
100.0%
Other Punctuation
ValueCountFrequency (%)
, 63
100.0%
Close Punctuation
ValueCountFrequency (%)
) 52
100.0%
Open Punctuation
ValueCountFrequency (%)
( 52
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1070
59.4%
Common 731
40.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
134
 
12.5%
134
 
12.5%
83
 
7.8%
72
 
6.7%
57
 
5.3%
52
 
4.9%
51
 
4.8%
51
 
4.8%
38
 
3.6%
29
 
2.7%
Other values (98) 369
34.5%
Common
ValueCountFrequency (%)
354
48.4%
, 63
 
8.6%
) 52
 
7.1%
( 52
 
7.1%
2 41
 
5.6%
1 36
 
4.9%
5 21
 
2.9%
3 18
 
2.5%
4 18
 
2.5%
8 16
 
2.2%
Other values (5) 60
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1070
59.4%
ASCII 731
40.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
354
48.4%
, 63
 
8.6%
) 52
 
7.1%
( 52
 
7.1%
2 41
 
5.6%
1 36
 
4.9%
5 21
 
2.9%
3 18
 
2.5%
4 18
 
2.5%
8 16
 
2.2%
Other values (5) 60
 
8.2%
Hangul
ValueCountFrequency (%)
134
 
12.5%
134
 
12.5%
83
 
7.8%
72
 
6.7%
57
 
5.3%
52
 
4.9%
51
 
4.8%
51
 
4.8%
38
 
3.6%
29
 
2.7%
Other values (98) 369
34.5%

우편번호
Real number (ℝ)

Distinct8
Distinct (%)15.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36327.176
Minimum36304
Maximum36370
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size591.0 B
2023-12-12T15:22:45.758950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36304
5-th percentile36305
Q136323
median36324
Q336324
95-th percentile36370
Maximum36370
Range66
Interquartile range (IQR)1

Descriptive statistics

Standard deviation18.257827
Coefficient of variation (CV)0.00050259416
Kurtosis1.8718273
Mean36327.176
Median Absolute Deviation (MAD)1
Skewness1.6451187
Sum1852686
Variance333.34824
MonotonicityNot monotonic
2023-12-12T15:22:45.890024image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
36324 22
43.1%
36323 7
 
13.7%
36370 7
 
13.7%
36305 4
 
7.8%
36315 4
 
7.8%
36316 3
 
5.9%
36325 3
 
5.9%
36304 1
 
2.0%
ValueCountFrequency (%)
36304 1
 
2.0%
36305 4
 
7.8%
36315 4
 
7.8%
36316 3
 
5.9%
36323 7
 
13.7%
36324 22
43.1%
36325 3
 
5.9%
36370 7
 
13.7%
ValueCountFrequency (%)
36370 7
 
13.7%
36325 3
 
5.9%
36324 22
43.1%
36323 7
 
13.7%
36316 3
 
5.9%
36315 4
 
7.8%
36305 4
 
7.8%
36304 1
 
2.0%
Distinct47
Distinct (%)92.2%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T15:22:46.137518image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters510
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)84.3%

Sample

1st row1976-08-28
2nd row2003-10-20
3rd row2003-12-12
4th row2004-07-47
5th row2004-12-42
ValueCountFrequency (%)
2010-10-10 2
 
3.9%
2014-08-28 2
 
3.9%
2020-02-12 2
 
3.9%
2006-12-12 2
 
3.9%
2021-02-12 1
 
2.0%
1996-04-64 1
 
2.0%
1976-08-28 1
 
2.0%
2019-04-14 1
 
2.0%
2019-08-28 1
 
2.0%
2020-02-32 1
 
2.0%
Other values (37) 37
72.5%
2023-12-12T15:22:46.471841image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 118
23.1%
- 102
20.0%
2 89
17.5%
1 80
15.7%
9 30
 
5.9%
8 21
 
4.1%
3 21
 
4.1%
4 15
 
2.9%
6 14
 
2.7%
5 11
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 408
80.0%
Dash Punctuation 102
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 118
28.9%
2 89
21.8%
1 80
19.6%
9 30
 
7.4%
8 21
 
5.1%
3 21
 
5.1%
4 15
 
3.7%
6 14
 
3.4%
5 11
 
2.7%
7 9
 
2.2%
Dash Punctuation
ValueCountFrequency (%)
- 102
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 510
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 118
23.1%
- 102
20.0%
2 89
17.5%
1 80
15.7%
9 30
 
5.9%
8 21
 
4.1%
3 21
 
4.1%
4 15
 
2.9%
6 14
 
2.7%
5 11
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 510
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 118
23.1%
- 102
20.0%
2 89
17.5%
1 80
15.7%
9 30
 
5.9%
8 21
 
4.1%
3 21
 
4.1%
4 15
 
2.9%
6 14
 
2.7%
5 11
 
2.2%

등록상태
Categorical

IMBALANCE 

Distinct2
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size540.0 B
개원
49 
자진휴원(소)
 
2

Length

Max length7
Median length2
Mean length2.1960784
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개원
2nd row자진휴원(소)
3rd row개원
4th row개원
5th row개원

Common Values

ValueCountFrequency (%)
개원 49
96.1%
자진휴원(소) 2
 
3.9%

Length

2023-12-12T15:22:46.614920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:22:46.742362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개원 49
96.1%
자진휴원(소 2
 
3.9%

설립자-성명
Text

UNIQUE 

Distinct51
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size540.0 B
2023-12-12T15:22:47.006046image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length3
Mean length3.3137255
Min length3

Characters and Unicode

Total characters169
Distinct characters79
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)100.0%

Sample

1st row홍유진
2nd row이성애
3rd row황수연
4th row남주현
5th row조현국
ValueCountFrequency (%)
홍유진 1
 
1.9%
최성희 1
 
1.9%
이효정 1
 
1.9%
이윤정 1
 
1.9%
김유진 1
 
1.9%
송미희 1
 
1.9%
강다연 1
 
1.9%
정은경 1
 
1.9%
주식회사 1
 
1.9%
에듀퀘스트 1
 
1.9%
Other values (42) 42
80.8%
2023-12-12T15:22:47.510713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
10
 
5.9%
9
 
5.3%
9
 
5.3%
8
 
4.7%
7
 
4.1%
6
 
3.6%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
Other values (69) 102
60.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 164
97.0%
Open Punctuation 2
 
1.2%
Close Punctuation 2
 
1.2%
Space Separator 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
6.1%
9
 
5.5%
9
 
5.5%
8
 
4.9%
7
 
4.3%
6
 
3.7%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
Other values (66) 97
59.1%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 164
97.0%
Common 5
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
10
 
6.1%
9
 
5.5%
9
 
5.5%
8
 
4.9%
7
 
4.3%
6
 
3.7%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
Other values (66) 97
59.1%
Common
ValueCountFrequency (%)
( 2
40.0%
) 2
40.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 164
97.0%
ASCII 5
 
3.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
10
 
6.1%
9
 
5.5%
9
 
5.5%
8
 
4.9%
7
 
4.3%
6
 
3.7%
5
 
3.0%
5
 
3.0%
4
 
2.4%
4
 
2.4%
Other values (66) 97
59.1%
ASCII
ValueCountFrequency (%)
( 2
40.0%
) 2
40.0%
1
20.0%

전화번호
Text

MISSING 

Distinct36
Distinct (%)100.0%
Missing15
Missing (%)29.4%
Memory size540.0 B
2023-12-12T15:22:47.792352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.027778
Min length12

Characters and Unicode

Total characters433
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique36 ?
Unique (%)100.0%

Sample

1st row054-783-3884
2nd row054-781-7210
3rd row054-781-0078
4th row054-781-3670
5th row054-782-0509
ValueCountFrequency (%)
054-788-2839 1
 
2.8%
054-783-9376 1
 
2.8%
054-783-4728 1
 
2.8%
054-782-3003 1
 
2.8%
070-8699-0903 1
 
2.8%
054-782-0585 1
 
2.8%
054-788-8777 1
 
2.8%
054-782-2460 1
 
2.8%
054-783-2128 1
 
2.8%
054-782-4523 1
 
2.8%
Other values (26) 26
72.2%
2023-12-12T15:22:48.214957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 72
16.6%
0 61
14.1%
7 51
11.8%
8 51
11.8%
4 50
11.5%
5 49
11.3%
3 27
 
6.2%
2 21
 
4.8%
1 20
 
4.6%
9 19
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 361
83.4%
Dash Punctuation 72
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 61
16.9%
7 51
14.1%
8 51
14.1%
4 50
13.9%
5 49
13.6%
3 27
7.5%
2 21
 
5.8%
1 20
 
5.5%
9 19
 
5.3%
6 12
 
3.3%
Dash Punctuation
ValueCountFrequency (%)
- 72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 433
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 72
16.6%
0 61
14.1%
7 51
11.8%
8 51
11.8%
4 50
11.5%
5 49
11.3%
3 27
 
6.2%
2 21
 
4.8%
1 20
 
4.6%
9 19
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 433
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 72
16.6%
0 61
14.1%
7 51
11.8%
8 51
11.8%
4 50
11.5%
5 49
11.3%
3 27
 
6.2%
2 21
 
4.8%
1 20
 
4.6%
9 19
 
4.4%

교습계열
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)13.7%
Missing0
Missing (%)0.0%
Memory size540.0 B
보통교과
27 
예능(중)
12 
외국어
<NA>
독서실
 
2
Other values (2)
 
2

Length

Max length5
Median length4
Mean length4.0980392
Min length3

Unique

Unique2 ?
Unique (%)3.9%

Sample

1st row보통교과
2nd row보통교과
3rd row보통교과
4th row보통교과
5th row외국어

Common Values

ValueCountFrequency (%)
보통교과 27
52.9%
예능(중) 12
23.5%
외국어 5
 
9.8%
<NA> 3
 
5.9%
독서실 2
 
3.9%
컴퓨터 1
 
2.0%
기타(중) 1
 
2.0%

Length

2023-12-12T15:22:48.443095image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:22:48.620455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보통교과 27
52.9%
예능(중 12
23.5%
외국어 5
 
9.8%
na 3
 
5.9%
독서실 2
 
3.9%
컴퓨터 1
 
2.0%
기타(중 1
 
2.0%

교습과정
Categorical

HIGH CORRELATION 

Distinct10
Distinct (%)19.6%
Missing0
Missing (%)0.0%
Memory size540.0 B
보습
26 
음악
실용외국어(유아/초·중·고)
미술
<NA>
Other values (5)

Length

Max length24
Median length2
Mean length4.3137255
Min length2

Unique

Unique4 ?
Unique (%)7.8%

Sample

1st row보습
2nd row보습
3rd row보습
4th row보습
5th row실용외국어(유아/초·중·고)

Common Values

ValueCountFrequency (%)
보습 26
51.0%
음악 7
 
13.7%
실용외국어(유아/초·중·고) 5
 
9.8%
미술 4
 
7.8%
<NA> 3
 
5.9%
독서실(유아/초·중·고) 2
 
3.9%
무용 1
 
2.0%
컴퓨터(정보처리,통신기기,인터넷,소프트웨어) 1
 
2.0%
기타(소) 1
 
2.0%
입시 1
 
2.0%

Length

2023-12-12T15:22:48.785651image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T15:22:48.946636image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보습 26
51.0%
음악 7
 
13.7%
실용외국어(유아/초·중·고 5
 
9.8%
미술 4
 
7.8%
na 3
 
5.9%
독서실(유아/초·중·고 2
 
3.9%
무용 1
 
2.0%
컴퓨터(정보처리,통신기기,인터넷,소프트웨어 1
 
2.0%
기타(소 1
 
2.0%
입시 1
 
2.0%

강사수
Real number (ℝ)

ZEROS 

Distinct8
Distinct (%)15.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9607843
Minimum0
Maximum8
Zeros2
Zeros (%)3.9%
Negative0
Negative (%)0.0%
Memory size591.0 B
2023-12-12T15:22:49.114410image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q32
95-th percentile4.5
Maximum8
Range8
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.4827108
Coefficient of variation (CV)0.75618252
Kurtosis5.1621595
Mean1.9607843
Median Absolute Deviation (MAD)1
Skewness1.9856576
Sum100
Variance2.1984314
MonotonicityNot monotonic
2023-12-12T15:22:49.260818image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
1 23
45.1%
2 14
27.5%
3 6
 
11.8%
4 3
 
5.9%
0 2
 
3.9%
5 1
 
2.0%
8 1
 
2.0%
6 1
 
2.0%
ValueCountFrequency (%)
0 2
 
3.9%
1 23
45.1%
2 14
27.5%
3 6
 
11.8%
4 3
 
5.9%
5 1
 
2.0%
6 1
 
2.0%
8 1
 
2.0%
ValueCountFrequency (%)
8 1
 
2.0%
6 1
 
2.0%
5 1
 
2.0%
4 3
 
5.9%
3 6
 
11.8%
2 14
27.5%
1 23
45.1%
0 2
 
3.9%

Interactions

2023-12-12T15:22:43.209474image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:22:42.694810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:22:43.330720image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T15:22:42.794634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T15:22:49.397861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학원명학원종류분야구분학원주소우편번호등록일등록상태설립자-성명전화번호교습계열교습과정강사수
학원명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
학원종류1.0001.0001.0001.0000.0001.0000.0001.0001.0001.0001.0000.000
분야구분1.0001.0001.0000.7580.1630.9160.0001.0001.0001.0001.0000.561
학원주소1.0001.0000.7581.0001.0000.9371.0001.0001.0000.9010.0000.958
우편번호1.0000.0000.1631.0001.0000.0000.1401.0001.0000.0000.2690.103
등록일1.0001.0000.9160.9370.0001.0001.0001.0001.0000.8800.4281.000
등록상태1.0000.0000.0001.0000.1401.0001.0001.0001.0000.0000.0000.000
설립자-성명1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
교습계열1.0001.0001.0000.9010.0000.8800.0001.0001.0001.0001.0000.602
교습과정1.0001.0001.0000.0000.2690.4280.0001.0001.0001.0001.0000.410
강사수1.0000.0000.5610.9580.1031.0000.0001.0001.0000.6020.4101.000
2023-12-12T15:22:49.553671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록상태교습계열분야구분교습과정학원종류
등록상태1.0000.0000.0000.0000.000
교습계열0.0001.0001.0000.9640.956
분야구분0.0001.0001.0000.9640.948
교습과정0.0000.9640.9641.0000.921
학원종류0.0000.9560.9480.9211.000
2023-12-12T15:22:49.690341image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
우편번호강사수학원종류분야구분등록상태교습계열교습과정
우편번호1.0000.0910.0000.0460.1690.0000.000
강사수0.0911.0000.0000.3350.0000.3760.202
학원종류0.0000.0001.0000.9480.0000.9560.921
분야구분0.0460.3350.9481.0000.0001.0000.964
등록상태0.1690.0000.0000.0001.0000.0000.000
교습계열0.0000.3760.9561.0000.0001.0000.964
교습과정0.0000.2020.9210.9640.0000.9641.000

Missing values

2023-12-12T15:22:43.466441image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:22:43.711547image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

학원명학원종류분야구분학원주소우편번호등록일등록상태설립자-성명전화번호교습계열교습과정강사수
0경북학원학교교과교습학원입시.검정 및 보습경상북도 울진군 울진읍 울진중앙로 159 , 2층 (울진읍)363231976-08-28개원홍유진054-783-3884보통교과보습2
1동아스쿨학원학교교과교습학원입시.검정 및 보습경상북도 울진군 죽변면 죽변북로 37 (죽변면, 동아스쿨)363162003-10-20자진휴원(소)이성애054-781-7210보통교과보습2
2한별학원학교교과교습학원입시.검정 및 보습경상북도 울진군 울진읍 울진중앙로 142 , 142 (울진읍)363242003-12-12개원황수연054-781-0078보통교과보습1
3굿모닝스터디학원학교교과교습학원입시.검정 및 보습경상북도 울진군 울진읍 읍내7길 10 (울진읍, 광명빌딩)363242004-07-47개원남주현054-781-3670보통교과보습4
4정철영어주니어울진어학원학교교과교습학원국제화경상북도 울진군 울진읍 읍내6길 12 (울진읍, 정철어학원울진캠퍼스)363232004-12-42개원조현국054-782-0509외국어실용외국어(유아/초·중·고)4
5글로리아피아노전문학원학교교과교습학원예능(대)경상북도 울진군 북면 울진북로 2095 (북면, 성용빌딩)363052005-09-19개원고수희054-781-0900예능(중)음악1
6명성학원학교교과교습학원입시.검정 및 보습경상북도 울진군 북면 울진북로 2232 (북면)363042005-11-11개원진성민054-782-7714보통교과보습1
7아이네트천재학원학교교과교습학원입시.검정 및 보습경상북도 울진군 울진읍 울진중앙로 135-1 (울진읍, 아이네트학원)363232006-07-37개원최용준054-783-0314보통교과보습3
8큐브스쿨학원학교교과교습학원입시.검정 및 보습경상북도 울진군 울진읍 연호로 2 , 201호 (울진읍, 아디다스)363252006-12-12개원곽순자054-783-9376보통교과보습1
9JW아카데미학원학교교과교습학원입시.검정 및 보습경상북도 울진군 죽변면 죽변북로 53 (죽변면, 원동종합상사)363162006-12-12개원이현희054-781-3409보통교과보습1
학원명학원종류분야구분학원주소우편번호등록일등록상태설립자-성명전화번호교습계열교습과정강사수
41미예뜰미술학원학교교과교습학원예능(대)경상북도 울진군 울진읍 읍내7길 25 (울진읍, 2층 미예뜰미술학원, 1층 불타는막창, 다도)363241996-07-17개원남은숙054-783-4728예능(중)미술1
42피카소미술학원학교교과교습학원예능(대)경상북도 울진군 울진읍 읍내8길 18-4 (울진읍, 피카소미술학원)363241995-03-23개원남미옥054-782-4523예능(중)미술2
43한샘학원학교교과교습학원입시.검정 및 보습경상북도 울진군 후포면 삼율5길 7 (후포면)363701995-10-40개원김정화054-787-1613보통교과보습4
44샤론피아노학원학교교과교습학원예능(대)경상북도 울진군 울진읍 울진중앙로 144 , 2층 (울진읍, 삼성생명)363241996-08-18개원이유진054-783-9814예능(중)음악1
45상아탑학원학교교과교습학원입시.검정 및 보습경상북도 울진군 울진읍 읍내8길 46 (울진읍, 상아탑학원)363241998-03-23개원장미영054-782-4439보통교과보습2
46푸른학원학교교과교습학원입시.검정 및 보습경상북도 울진군 북면 울진북로 2095 (북면, 성용빌딩)363052001-02-12개원홍준호054-782-9955보통교과입시1
47청담입시세잔아트학원학교교과교습학원종합(대)경상북도 울진군 후포면 삼율6길 16-9 (후포면, 금호상가)363702001-03-33개원변정순054-788-5674<NA><NA>3
48한솔+종합학원학교교과교습학원종합(대)경상북도 울진군 북면 울진북로 2095 (북면, 성용빌딩)363052001-11-21개원홍규찬054-783-0562<NA><NA>2
49GnB영어전문학원학교교과교습학원국제화경상북도 울진군 후포면 삼율5길 7 (후포면)363702002-01-21개원윤동용054-787-1605외국어실용외국어(유아/초·중·고)2
50이룸수학전문학원학교교과교습학원입시.검정 및 보습경상북도 울진군 울진읍 울진중앙로 138-5 (울진읍, 이룸수학전문학원)363242002-04-24개원이종건054-781-3998보통교과보습2