Overview

Dataset statistics

Number of variables6
Number of observations598
Missing cells170
Missing cells (%)4.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory28.7 KiB
Average record size in memory49.2 B

Variable types

Numeric1
Text3
Categorical2

Dataset

Description경상북도경산교육지원청 관할의 학원 및 교습소에 대한 데이터로 학원명, 종류, 주소, 전화번호 등의 항목을 제공합니다.
Author경상북도교육청 경상북도경산교육지원청
URLhttps://www.data.go.kr/data/3070741/fileData.do

Alerts

등록상태 has constant value ""Constant
번호 is highly overall correlated with 종류High correlation
종류 is highly overall correlated with 번호High correlation
전화번호 has 170 (28.4%) missing valuesMissing
번호 has unique valuesUnique
학원명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 04:10:06.448626
Analysis finished2023-12-12 04:10:07.714933
Duration1.27 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct598
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean299.5
Minimum1
Maximum598
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size5.4 KiB
2023-12-12T13:10:07.811703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile30.85
Q1150.25
median299.5
Q3448.75
95-th percentile568.15
Maximum598
Range597
Interquartile range (IQR)298.5

Descriptive statistics

Standard deviation172.77201
Coefficient of variation (CV)0.57686814
Kurtosis-1.2
Mean299.5
Median Absolute Deviation (MAD)149.5
Skewness0
Sum179101
Variance29850.167
MonotonicityStrictly increasing
2023-12-12T13:10:07.996771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
395 1
 
0.2%
397 1
 
0.2%
398 1
 
0.2%
399 1
 
0.2%
400 1
 
0.2%
401 1
 
0.2%
402 1
 
0.2%
403 1
 
0.2%
404 1
 
0.2%
Other values (588) 588
98.3%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
598 1
0.2%
597 1
0.2%
596 1
0.2%
595 1
0.2%
594 1
0.2%
593 1
0.2%
592 1
0.2%
591 1
0.2%
590 1
0.2%
589 1
0.2%

학원명
Text

UNIQUE 

Distinct598
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2023-12-12T13:10:08.318784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length18
Mean length8.4381271
Min length4

Characters and Unicode

Total characters5046
Distinct characters446
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique598 ?
Unique (%)100.0%

Sample

1st row대신입시학원
2nd row이아이이(EiE)영어&입시학원
3rd row성균관독서실
4th row한맥입시학원
5th row공신클럽학원
ValueCountFrequency (%)
대신입시학원 1
 
0.2%
조쌤수학학원 1
 
0.2%
제3교실수학학원 1
 
0.2%
열림수학학원 1
 
0.2%
튼튼영어마스터클럽옥곡어학원 1
 
0.2%
쎈수학러닝센터옥곡학원 1
 
0.2%
어린음악대중산원음악학원 1
 
0.2%
더올림입시학원 1
 
0.2%
강철에프엠영어수학학원 1
 
0.2%
유봉재수학학원 1
 
0.2%
Other values (589) 589
98.3%
2023-12-12T13:10:08.823984image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
507
 
10.0%
397
 
7.9%
205
 
4.1%
203
 
4.0%
195
 
3.9%
145
 
2.9%
141
 
2.8%
109
 
2.2%
105
 
2.1%
94
 
1.9%
Other values (436) 2945
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4833
95.8%
Uppercase Letter 108
 
2.1%
Lowercase Letter 32
 
0.6%
Decimal Number 31
 
0.6%
Open Punctuation 16
 
0.3%
Close Punctuation 16
 
0.3%
Other Punctuation 8
 
0.2%
Space Separator 1
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
507
 
10.5%
397
 
8.2%
205
 
4.2%
203
 
4.2%
195
 
4.0%
145
 
3.0%
141
 
2.9%
109
 
2.3%
105
 
2.2%
94
 
1.9%
Other values (389) 2732
56.5%
Uppercase Letter
ValueCountFrequency (%)
E 20
18.5%
M 14
13.0%
S 10
9.3%
B 10
9.3%
T 7
 
6.5%
Y 7
 
6.5%
I 6
 
5.6%
C 6
 
5.6%
U 4
 
3.7%
J 4
 
3.7%
Other values (10) 20
18.5%
Lowercase Letter
ValueCountFrequency (%)
e 7
21.9%
s 4
12.5%
o 4
12.5%
l 3
9.4%
i 3
9.4%
u 2
 
6.2%
v 2
 
6.2%
n 2
 
6.2%
y 1
 
3.1%
k 1
 
3.1%
Other values (3) 3
9.4%
Decimal Number
ValueCountFrequency (%)
0 11
35.5%
3 7
22.6%
1 7
22.6%
2 4
 
12.9%
8 1
 
3.2%
9 1
 
3.2%
Other Punctuation
ValueCountFrequency (%)
. 3
37.5%
& 3
37.5%
' 1
 
12.5%
, 1
 
12.5%
Open Punctuation
ValueCountFrequency (%)
( 16
100.0%
Close Punctuation
ValueCountFrequency (%)
) 16
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4833
95.8%
Latin 140
 
2.8%
Common 73
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
507
 
10.5%
397
 
8.2%
205
 
4.2%
203
 
4.2%
195
 
4.0%
145
 
3.0%
141
 
2.9%
109
 
2.3%
105
 
2.2%
94
 
1.9%
Other values (389) 2732
56.5%
Latin
ValueCountFrequency (%)
E 20
14.3%
M 14
 
10.0%
S 10
 
7.1%
B 10
 
7.1%
T 7
 
5.0%
e 7
 
5.0%
Y 7
 
5.0%
I 6
 
4.3%
C 6
 
4.3%
s 4
 
2.9%
Other values (23) 49
35.0%
Common
ValueCountFrequency (%)
( 16
21.9%
) 16
21.9%
0 11
15.1%
3 7
9.6%
1 7
9.6%
2 4
 
5.5%
. 3
 
4.1%
& 3
 
4.1%
' 1
 
1.4%
8 1
 
1.4%
Other values (4) 4
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4833
95.8%
ASCII 213
 
4.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
507
 
10.5%
397
 
8.2%
205
 
4.2%
203
 
4.2%
195
 
4.0%
145
 
3.0%
141
 
2.9%
109
 
2.3%
105
 
2.2%
94
 
1.9%
Other values (389) 2732
56.5%
ASCII
ValueCountFrequency (%)
E 20
 
9.4%
( 16
 
7.5%
) 16
 
7.5%
M 14
 
6.6%
0 11
 
5.2%
S 10
 
4.7%
B 10
 
4.7%
T 7
 
3.3%
3 7
 
3.3%
e 7
 
3.3%
Other values (37) 95
44.6%

종류
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
학원
406 
교습소
192 

Length

Max length3
Median length2
Mean length2.3210702
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학원
2nd row학원
3rd row학원
4th row학원
5th row학원

Common Values

ValueCountFrequency (%)
학원 406
67.9%
교습소 192
32.1%

Length

2023-12-12T13:10:08.974955image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:10:09.089406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학원 406
67.9%
교습소 192
32.1%
Distinct562
Distinct (%)94.0%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
2023-12-12T13:10:09.456014image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length67
Median length51
Mean length32.289298
Min length18

Characters and Unicode

Total characters19309
Distinct characters210
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique532 ?
Unique (%)89.0%

Sample

1st row경상북도 경산시 하양읍 대경로159길 11 , 3층 (하양읍)
2nd row경상북도 경산시 대학로12길 21-2 , 3층 (정평동)
3rd row경상북도 경산시 대학로12길 14 , 4층 (정평동)
4th row경상북도 경산시 하양읍 하양로 92 , 4층 (하양읍)
5th row경상북도 경산시 대학로10길 24 , 3층 (정평동)
ValueCountFrequency (%)
경상북도 598
 
13.3%
경산시 598
 
13.3%
549
 
12.2%
하양읍 196
 
4.4%
2층 153
 
3.4%
3층 115
 
2.6%
진량읍 85
 
1.9%
옥곡동 81
 
1.8%
사동 80
 
1.8%
1층 70
 
1.6%
Other values (575) 1975
43.9%
2023-12-12T13:10:10.021489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3926
20.3%
1359
 
7.0%
832
 
4.3%
2 685
 
3.5%
, 667
 
3.5%
657
 
3.4%
) 613
 
3.2%
( 613
 
3.2%
601
 
3.1%
1 601
 
3.1%
Other values (200) 8755
45.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10278
53.2%
Space Separator 3926
 
20.3%
Decimal Number 3028
 
15.7%
Other Punctuation 670
 
3.5%
Close Punctuation 613
 
3.2%
Open Punctuation 613
 
3.2%
Dash Punctuation 156
 
0.8%
Uppercase Letter 23
 
0.1%
Math Symbol 2
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1359
 
13.2%
832
 
8.1%
657
 
6.4%
601
 
5.8%
600
 
5.8%
599
 
5.8%
565
 
5.5%
481
 
4.7%
465
 
4.5%
360
 
3.5%
Other values (172) 3759
36.6%
Decimal Number
ValueCountFrequency (%)
2 685
22.6%
1 601
19.8%
3 402
13.3%
0 286
9.4%
4 244
 
8.1%
5 215
 
7.1%
6 178
 
5.9%
7 159
 
5.3%
9 137
 
4.5%
8 121
 
4.0%
Uppercase Letter
ValueCountFrequency (%)
R 6
26.1%
K 3
13.0%
B 2
 
8.7%
I 2
 
8.7%
E 2
 
8.7%
P 2
 
8.7%
A 2
 
8.7%
V 2
 
8.7%
W 1
 
4.3%
S 1
 
4.3%
Other Punctuation
ValueCountFrequency (%)
, 667
99.6%
· 2
 
0.3%
@ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
3926
100.0%
Close Punctuation
ValueCountFrequency (%)
) 613
100.0%
Open Punctuation
ValueCountFrequency (%)
( 613
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 156
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10278
53.2%
Common 9008
46.7%
Latin 23
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1359
 
13.2%
832
 
8.1%
657
 
6.4%
601
 
5.8%
600
 
5.8%
599
 
5.8%
565
 
5.5%
481
 
4.7%
465
 
4.5%
360
 
3.5%
Other values (172) 3759
36.6%
Common
ValueCountFrequency (%)
3926
43.6%
2 685
 
7.6%
, 667
 
7.4%
) 613
 
6.8%
( 613
 
6.8%
1 601
 
6.7%
3 402
 
4.5%
0 286
 
3.2%
4 244
 
2.7%
5 215
 
2.4%
Other values (8) 756
 
8.4%
Latin
ValueCountFrequency (%)
R 6
26.1%
K 3
13.0%
B 2
 
8.7%
I 2
 
8.7%
E 2
 
8.7%
P 2
 
8.7%
A 2
 
8.7%
V 2
 
8.7%
W 1
 
4.3%
S 1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10278
53.2%
ASCII 9029
46.8%
None 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3926
43.5%
2 685
 
7.6%
, 667
 
7.4%
) 613
 
6.8%
( 613
 
6.8%
1 601
 
6.7%
3 402
 
4.5%
0 286
 
3.2%
4 244
 
2.7%
5 215
 
2.4%
Other values (17) 777
 
8.6%
Hangul
ValueCountFrequency (%)
1359
 
13.2%
832
 
8.1%
657
 
6.4%
601
 
5.8%
600
 
5.8%
599
 
5.8%
565
 
5.5%
481
 
4.7%
465
 
4.5%
360
 
3.5%
Other values (172) 3759
36.6%
None
ValueCountFrequency (%)
· 2
100.0%

등록상태
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size4.8 KiB
개원
598 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개원
2nd row개원
3rd row개원
4th row개원
5th row개원

Common Values

ValueCountFrequency (%)
개원 598
100.0%

Length

2023-12-12T13:10:10.237056image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T13:10:10.345612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개원 598
100.0%

전화번호
Text

MISSING 

Distinct420
Distinct (%)98.1%
Missing170
Missing (%)28.4%
Memory size4.8 KiB
2023-12-12T13:10:10.626883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.023364
Min length12

Characters and Unicode

Total characters5146
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique413 ?
Unique (%)96.5%

Sample

1st row053-851-6528
2nd row053-811-0300
3rd row053-811-9518
4th row053-853-3789
5th row053-814-8333
ValueCountFrequency (%)
053-814-3450 3
 
0.7%
053-816-4007 2
 
0.5%
053-802-4500 2
 
0.5%
053-818-3310 2
 
0.5%
053-815-2754 2
 
0.5%
053-249-0577 2
 
0.5%
053-811-9998 2
 
0.5%
053-851-0046 1
 
0.2%
053-851-6528 1
 
0.2%
053-817-0430 1
 
0.2%
Other values (410) 410
95.8%
2023-12-12T13:10:11.166520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 856
16.6%
0 822
16.0%
5 744
14.5%
3 625
12.1%
8 561
10.9%
1 512
9.9%
2 269
 
5.2%
7 244
 
4.7%
4 195
 
3.8%
6 182
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4290
83.4%
Dash Punctuation 856
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 822
19.2%
5 744
17.3%
3 625
14.6%
8 561
13.1%
1 512
11.9%
2 269
 
6.3%
7 244
 
5.7%
4 195
 
4.5%
6 182
 
4.2%
9 136
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 856
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5146
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 856
16.6%
0 822
16.0%
5 744
14.5%
3 625
12.1%
8 561
10.9%
1 512
9.9%
2 269
 
5.2%
7 244
 
4.7%
4 195
 
3.8%
6 182
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5146
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 856
16.6%
0 822
16.0%
5 744
14.5%
3 625
12.1%
8 561
10.9%
1 512
9.9%
2 269
 
5.2%
7 244
 
4.7%
4 195
 
3.8%
6 182
 
3.5%

Interactions

2023-12-12T13:10:06.918517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T13:10:11.278853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호종류
번호1.0000.998
종류0.9981.000
2023-12-12T13:10:11.371599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호종류
번호1.0000.956
종류0.9561.000

Missing values

2023-12-12T13:10:07.517792image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T13:10:07.658593image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호학원명종류학원주소등록상태전화번호
01대신입시학원학원경상북도 경산시 하양읍 대경로159길 11 , 3층 (하양읍)개원053-851-6528
12이아이이(EiE)영어&입시학원학원경상북도 경산시 대학로12길 21-2 , 3층 (정평동)개원053-811-0300
23성균관독서실학원경상북도 경산시 대학로12길 14 , 4층 (정평동)개원053-811-9518
34한맥입시학원학원경상북도 경산시 하양읍 하양로 92 , 4층 (하양읍)개원053-853-3789
45공신클럽학원학원경상북도 경산시 대학로10길 24 , 3층 (정평동)개원053-814-8333
56더올림학원학원경상북도 경산시 원효로28길 32 , 2층 (계양동)개원053-817-8860
67영남간호학원학원경상북도 경산시 원효로 4 , 2층 (중방동)개원053-811-7509
78EP(이피)아동놀이학원학원경상북도 경산시 하양읍 대학로296길 9-9 , 1층 (하양읍)개원053-856-0484
89라온수학학원학원경상북도 경산시 백자로20길 26 , 3층 (사동)개원053-812-1999
910투드림하양학원학원경상북도 경산시 하양읍 동서2길 43 , 4층 (하양읍)개원053-854-0579
번호학원명종류학원주소등록상태전화번호
588589헬로우미술교습소교습소경상북도 경산시 진량읍 선화로20길 25-1 , 상가동 214호 (진량읍,경산1차 윤성아파트)개원053-851-1505
589590영피아노교습소교습소경상북도 경산시 성암로21길 70 , 상가 201호 (중산동)개원053-814-6621
590591안나음악교습소교습소경상북도 경산시 경산로44길 36-1개원053-813-3539
591592그림터미술교습소교습소경상북도 경산시 백양로35길 18 2층 (사동)개원053-817-9877
592593바른풀이수학교습소교습소경상북도 경산시 백자로20길 9(사동, 부영원앙) 상가 301호개원053-801-6972
593594그림이야기미술교습소교습소경상북도 경산시 하양읍하양로33길 6 청구1차 상가202호개원053-851-0029
594595행복피아노교습소교습소경상북도 경산시 하양읍 대학로305길 28-2 (하양읍)개원053-852-3156
595596오름수학교습소교습소경상북도 경산시 펜타힐즈2로 45 , 707호 (중산동)개원053-813-0426
596597ARTTIME미술교습소교습소경상북도 경산시 진량읍영청길 22 청구아파트 상가202호개원<NA>
597598월드메르디앙피아노교습소교습소경상북도 경산시 경청로221길 12 , 201호 (백천동, 경산백천월드메르디앙)개원053-801-9081