Overview

Dataset statistics

Number of variables5
Number of observations357
Missing cells94
Missing cells (%)5.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory14.1 KiB
Average record size in memory40.4 B

Variable types

Text4
Categorical1

Dataset

Description경상북도교육청 경상북도경주교육지원청 학원 현황
Author경상북도교육청 경상북도경주교육지원청
URLhttps://www.data.go.kr/data/15053454/fileData.do

Alerts

전화번호 has 94 (26.3%) missing valuesMissing
학원명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 06:14:03.612169
Analysis finished2023-12-12 06:14:04.289155
Duration0.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

학원명
Text

UNIQUE 

Distinct357
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-12T15:14:04.509436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length7.7955182
Min length4

Characters and Unicode

Total characters2783
Distinct characters368
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique357 ?
Unique (%)100.0%

Sample

1st row상현서당한문학원
2nd row예성피아노학원
3rd row로사음악학원
4th row하나음악학원
5th row꿈나무음악학원
ValueCountFrequency (%)
상현서당한문학원 1
 
0.3%
ybm잉글루영어학원 1
 
0.3%
푸르넷드림학원 1
 
0.3%
왕셈학원 1
 
0.3%
유레카영어학원 1
 
0.3%
필즈수학학원 1
 
0.3%
빨간펜수학의달인용황학원 1
 
0.3%
오선피아노학원 1
 
0.3%
청담솔루션학원 1
 
0.3%
타임스영어학원 1
 
0.3%
Other values (347) 347
97.2%
2023-12-12T15:14:05.001528image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
372
 
13.4%
339
 
12.2%
76
 
2.7%
75
 
2.7%
47
 
1.7%
44
 
1.6%
43
 
1.5%
42
 
1.5%
41
 
1.5%
34
 
1.2%
Other values (358) 1670
60.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2666
95.8%
Uppercase Letter 76
 
2.7%
Lowercase Letter 21
 
0.8%
Decimal Number 11
 
0.4%
Other Punctuation 7
 
0.3%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
372
 
14.0%
339
 
12.7%
76
 
2.9%
75
 
2.8%
47
 
1.8%
44
 
1.7%
43
 
1.6%
42
 
1.6%
41
 
1.5%
34
 
1.3%
Other values (317) 1553
58.3%
Uppercase Letter
ValueCountFrequency (%)
E 14
18.4%
S 10
13.2%
M 9
11.8%
T 8
10.5%
C 5
 
6.6%
I 4
 
5.3%
B 4
 
5.3%
Y 4
 
5.3%
R 3
 
3.9%
A 3
 
3.9%
Other values (7) 12
15.8%
Lowercase Letter
ValueCountFrequency (%)
e 3
14.3%
s 2
9.5%
k 2
9.5%
y 2
9.5%
i 2
9.5%
o 2
9.5%
n 1
 
4.8%
g 1
 
4.8%
l 1
 
4.8%
h 1
 
4.8%
Other values (4) 4
19.0%
Decimal Number
ValueCountFrequency (%)
1 4
36.4%
0 4
36.4%
2 2
18.2%
3 1
 
9.1%
Other Punctuation
ValueCountFrequency (%)
. 3
42.9%
& 2
28.6%
· 1
 
14.3%
, 1
 
14.3%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2666
95.8%
Latin 97
 
3.5%
Common 20
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
372
 
14.0%
339
 
12.7%
76
 
2.9%
75
 
2.8%
47
 
1.8%
44
 
1.7%
43
 
1.6%
42
 
1.6%
41
 
1.5%
34
 
1.3%
Other values (317) 1553
58.3%
Latin
ValueCountFrequency (%)
E 14
14.4%
S 10
 
10.3%
M 9
 
9.3%
T 8
 
8.2%
C 5
 
5.2%
I 4
 
4.1%
B 4
 
4.1%
Y 4
 
4.1%
R 3
 
3.1%
e 3
 
3.1%
Other values (21) 33
34.0%
Common
ValueCountFrequency (%)
1 4
20.0%
0 4
20.0%
. 3
15.0%
2 2
10.0%
& 2
10.0%
( 1
 
5.0%
) 1
 
5.0%
3 1
 
5.0%
· 1
 
5.0%
, 1
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2666
95.8%
ASCII 116
 
4.2%
None 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
372
 
14.0%
339
 
12.7%
76
 
2.9%
75
 
2.8%
47
 
1.8%
44
 
1.7%
43
 
1.6%
42
 
1.6%
41
 
1.5%
34
 
1.3%
Other values (317) 1553
58.3%
ASCII
ValueCountFrequency (%)
E 14
 
12.1%
S 10
 
8.6%
M 9
 
7.8%
T 8
 
6.9%
C 5
 
4.3%
I 4
 
3.4%
1 4
 
3.4%
B 4
 
3.4%
Y 4
 
3.4%
0 4
 
3.4%
Other values (30) 50
43.1%
None
ValueCountFrequency (%)
· 1
100.0%
Distinct343
Distinct (%)96.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-12T15:14:05.330022image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length48
Median length41
Mean length29.179272
Min length21

Characters and Unicode

Total characters10417
Distinct characters166
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique329 ?
Unique (%)92.2%

Sample

1st row경상북도 경주시 황성로1번길 11-2 2층 (황성동)
2nd row경상북도 경주시 용담로92번길 46 (황성동)
3rd row경상북도 경주시 안강읍 비화원로 122 (안강읍)
4th row경상북도 경주시 불국사초등2길 4 , 2층 (구정동,토마토어린이집)
5th row경상북도 경주시 외동읍 입실로1길 80 (외동읍)
ValueCountFrequency (%)
경상북도 357
 
15.3%
경주시 357
 
15.3%
191
 
8.2%
황성동 115
 
4.9%
2층 97
 
4.1%
안강읍 80
 
3.4%
현곡면 52
 
2.2%
3층 51
 
2.2%
용강동 39
 
1.7%
동천동 37
 
1.6%
Other values (397) 964
41.2%
2023-12-12T15:14:05.778894image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2033
19.5%
718
 
6.9%
373
 
3.6%
371
 
3.6%
366
 
3.5%
363
 
3.5%
( 361
 
3.5%
361
 
3.5%
) 361
 
3.5%
358
 
3.4%
Other values (156) 4752
45.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5778
55.5%
Space Separator 2033
 
19.5%
Decimal Number 1514
 
14.5%
Open Punctuation 361
 
3.5%
Close Punctuation 361
 
3.5%
Other Punctuation 242
 
2.3%
Dash Punctuation 122
 
1.2%
Math Symbol 3
 
< 0.1%
Uppercase Letter 2
 
< 0.1%
Other Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
718
 
12.4%
373
 
6.5%
371
 
6.4%
366
 
6.3%
363
 
6.3%
361
 
6.2%
358
 
6.2%
250
 
4.3%
239
 
4.1%
217
 
3.8%
Other values (137) 2162
37.4%
Decimal Number
ValueCountFrequency (%)
2 309
20.4%
1 298
19.7%
3 221
14.6%
4 186
12.3%
0 107
 
7.1%
5 98
 
6.5%
6 92
 
6.1%
9 71
 
4.7%
8 70
 
4.6%
7 62
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 241
99.6%
. 1
 
0.4%
Space Separator
ValueCountFrequency (%)
2033
100.0%
Open Punctuation
ValueCountFrequency (%)
( 361
100.0%
Close Punctuation
ValueCountFrequency (%)
) 361
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 122
100.0%
Math Symbol
ValueCountFrequency (%)
~ 3
100.0%
Uppercase Letter
ValueCountFrequency (%)
A 2
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5779
55.5%
Common 4636
44.5%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
718
 
12.4%
373
 
6.5%
371
 
6.4%
366
 
6.3%
363
 
6.3%
361
 
6.2%
358
 
6.2%
250
 
4.3%
239
 
4.1%
217
 
3.8%
Other values (138) 2163
37.4%
Common
ValueCountFrequency (%)
2033
43.9%
( 361
 
7.8%
) 361
 
7.8%
2 309
 
6.7%
1 298
 
6.4%
, 241
 
5.2%
3 221
 
4.8%
4 186
 
4.0%
- 122
 
2.6%
0 107
 
2.3%
Other values (7) 397
 
8.6%
Latin
ValueCountFrequency (%)
A 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5778
55.5%
ASCII 4638
44.5%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2033
43.8%
( 361
 
7.8%
) 361
 
7.8%
2 309
 
6.7%
1 298
 
6.4%
, 241
 
5.2%
3 221
 
4.8%
4 186
 
4.0%
- 122
 
2.6%
0 107
 
2.3%
Other values (8) 399
 
8.6%
Hangul
ValueCountFrequency (%)
718
 
12.4%
373
 
6.5%
371
 
6.4%
366
 
6.3%
363
 
6.3%
361
 
6.2%
358
 
6.2%
250
 
4.3%
239
 
4.1%
217
 
3.8%
Other values (137) 2162
37.4%
None
ValueCountFrequency (%)
1
100.0%
Distinct343
Distinct (%)96.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2023-12-12T15:14:06.152898image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length16
Median length3
Mean length3.32493
Min length2

Characters and Unicode

Total characters1187
Distinct characters168
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique332 ?
Unique (%)93.0%

Sample

1st row김주호
2nd row이옥주
3rd row이상희
4th row최유진
5th row정광희
ValueCountFrequency (%)
주)대교(대표자:박수완 5
 
1.4%
윤태호 2
 
0.6%
권미경 2
 
0.6%
김현미 2
 
0.6%
임종완 2
 
0.6%
사건민사준 2
 
0.6%
조경화 2
 
0.6%
이은정 2
 
0.6%
최민수 2
 
0.6%
김성재 2
 
0.6%
Other values (334) 335
93.6%
2023-12-12T15:14:06.733901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
86
 
7.2%
78
 
6.6%
50
 
4.2%
36
 
3.0%
33
 
2.8%
30
 
2.5%
30
 
2.5%
27
 
2.3%
27
 
2.3%
26
 
2.2%
Other values (158) 764
64.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1146
96.5%
Open Punctuation 15
 
1.3%
Close Punctuation 15
 
1.3%
Other Punctuation 5
 
0.4%
Uppercase Letter 3
 
0.3%
Lowercase Letter 2
 
0.2%
Space Separator 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
86
 
7.5%
78
 
6.8%
50
 
4.4%
36
 
3.1%
33
 
2.9%
30
 
2.6%
30
 
2.6%
27
 
2.4%
27
 
2.4%
26
 
2.3%
Other values (149) 723
63.1%
Uppercase Letter
ValueCountFrequency (%)
I 1
33.3%
R 1
33.3%
J 1
33.3%
Lowercase Letter
ValueCountFrequency (%)
n 1
50.0%
c 1
50.0%
Open Punctuation
ValueCountFrequency (%)
( 15
100.0%
Close Punctuation
ValueCountFrequency (%)
) 15
100.0%
Other Punctuation
ValueCountFrequency (%)
: 5
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1146
96.5%
Common 36
 
3.0%
Latin 5
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
86
 
7.5%
78
 
6.8%
50
 
4.4%
36
 
3.1%
33
 
2.9%
30
 
2.6%
30
 
2.6%
27
 
2.4%
27
 
2.4%
26
 
2.3%
Other values (149) 723
63.1%
Latin
ValueCountFrequency (%)
I 1
20.0%
n 1
20.0%
c 1
20.0%
R 1
20.0%
J 1
20.0%
Common
ValueCountFrequency (%)
( 15
41.7%
) 15
41.7%
: 5
 
13.9%
1
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1146
96.5%
ASCII 41
 
3.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
86
 
7.5%
78
 
6.8%
50
 
4.4%
36
 
3.1%
33
 
2.9%
30
 
2.6%
30
 
2.6%
27
 
2.4%
27
 
2.4%
26
 
2.3%
Other values (149) 723
63.1%
ASCII
ValueCountFrequency (%)
( 15
36.6%
) 15
36.6%
: 5
 
12.2%
1
 
2.4%
I 1
 
2.4%
n 1
 
2.4%
c 1
 
2.4%
R 1
 
2.4%
J 1
 
2.4%

전화번호
Text

MISSING 

Distinct262
Distinct (%)99.6%
Missing94
Missing (%)26.3%
Memory size2.9 KiB
2023-12-12T15:14:06.988231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.019011
Min length12

Characters and Unicode

Total characters3161
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique261 ?
Unique (%)99.2%

Sample

1st row054-743-1983
2nd row054-742-4354
3rd row054-761-4955
4th row054-748-7173
5th row054-744-6144
ValueCountFrequency (%)
054-762-2020 2
 
0.8%
054-774-9905 1
 
0.4%
054-742-8595 1
 
0.4%
054-743-1983 1
 
0.4%
054-771-9728 1
 
0.4%
054-749-5256 1
 
0.4%
054-773-0078 1
 
0.4%
054-751-5095 1
 
0.4%
054-748-8207 1
 
0.4%
054-742-3570 1
 
0.4%
Other values (252) 252
95.8%
2023-12-12T15:14:07.544065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7 531
16.8%
- 526
16.6%
0 456
14.4%
4 451
14.3%
5 413
13.1%
1 162
 
5.1%
6 143
 
4.5%
2 130
 
4.1%
3 120
 
3.8%
9 119
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2635
83.4%
Dash Punctuation 526
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
7 531
20.2%
0 456
17.3%
4 451
17.1%
5 413
15.7%
1 162
 
6.1%
6 143
 
5.4%
2 130
 
4.9%
3 120
 
4.6%
9 119
 
4.5%
8 110
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 526
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3161
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
7 531
16.8%
- 526
16.6%
0 456
14.4%
4 451
14.3%
5 413
13.1%
1 162
 
5.1%
6 143
 
4.5%
2 130
 
4.1%
3 120
 
3.8%
9 119
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3161
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7 531
16.8%
- 526
16.6%
0 456
14.4%
4 451
14.3%
5 413
13.1%
1 162
 
5.1%
6 143
 
4.5%
2 130
 
4.1%
3 120
 
3.8%
9 119
 
3.8%

교습과정
Categorical

Distinct31
Distinct (%)8.7%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
보습
187 
음악
36 
독서실(유아/초·중·고)
25 
미술
24 
실용외국어(유아/초·중·고)
22 
Other values (26)
63 

Length

Max length24
Median length2
Mean length4.162465
Min length2

Unique

Unique16 ?
Unique (%)4.5%

Sample

1st row보습
2nd row음악
3rd row음악
4th row음악
5th row음악

Common Values

ValueCountFrequency (%)
보습 187
52.4%
음악 36
 
10.1%
독서실(유아/초·중·고) 25
 
7.0%
미술 24
 
6.7%
실용외국어(유아/초·중·고) 22
 
6.2%
<NA> 16
 
4.5%
무용 7
 
2.0%
보습·논술 5
 
1.4%
입시 3
 
0.8%
컴퓨터(정보처리,통신기기,인터넷,소프트웨어) 3
 
0.8%
Other values (21) 29
 
8.1%

Length

2023-12-12T15:14:07.816772image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
보습 187
52.4%
음악 36
 
10.1%
독서실(유아/초·중·고 25
 
7.0%
미술 24
 
6.7%
실용외국어(유아/초·중·고 22
 
6.2%
na 16
 
4.5%
무용 7
 
2.0%
보습·논술 5
 
1.4%
식음료품(바리스타,소믈리에 3
 
0.8%
실용음악(성악 3
 
0.8%
Other values (21) 29
 
8.1%

Missing values

2023-12-12T15:14:04.107027image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T15:14:04.239407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

학원명학원주소설립자-성명전화번호교습과정
0상현서당한문학원경상북도 경주시 황성로1번길 11-2 2층 (황성동)김주호054-743-1983보습
1예성피아노학원경상북도 경주시 용담로92번길 46 (황성동)이옥주054-742-4354음악
2로사음악학원경상북도 경주시 안강읍 비화원로 122 (안강읍)이상희054-761-4955음악
3하나음악학원경상북도 경주시 불국사초등2길 4 , 2층 (구정동,토마토어린이집)최유진054-748-7173음악
4꿈나무음악학원경상북도 경주시 외동읍 입실로1길 80 (외동읍)정광희054-744-6144음악
5문창독서실경상북도 경주시 대안길11번길 10 (동천동)신경금054-741-4410독서실(유아/초·중·고)
6백전백승학원경상북도 경주시 안강읍 구부랑두림길 126-11 (안강읍)배재명054-762-1956보습
7경남학원경상북도 경주시 외동읍 입실로3길 18-2 (외동읍)윤용현054-776-8506보습
8아인스학원경상북도 경주시 안강읍 화전중앙길 31 (안강읍)정인숙054-761-8373보습
9육영학원경상북도 경주시 북문로 30 2.3층 (성건동)신영철054-771-6008보습
학원명학원주소설립자-성명전화번호교습과정
347관곡서예학원경상북도 경주시 강동면 강동로 66-28 (강동면)조희덕054-762-4054서예
348경주JR학원경상북도 경주시 황성로 59 (황성동)(주)경주제이알(JR Inc)054-774-2800<NA>
349아트캐드그래픽컴퓨터학원경상북도 경주시 금성로 380 , 3층 (성건동)최윤정054-742-7722컴퓨터(정보처리,통신기기,인터넷,소프트웨어)
350이나연무용학원경상북도 경주시 갓뒤길 29-1 (황성동)이나연054-748-8010무용
351푸르넷프렌즈초등중등A급학원경상북도 경주시 다불로 11 , 2층 (용강동)백경애054-748-0833보습
352ez컴퓨터학원경상북도 경주시 황성로 21-2 (황성동)성종태이미화054-771-2277컴퓨터(소)
353제일외국어학원경상북도 경주시 안강읍 안강중앙로 188 (안강읍)윤우식이혜숙054-763-8119<NA>
354유림학원경상북도 경주시 용담로116번길 7 , 3층 (황성동)이쌍순054-776-0460보습
355엘리트컴퓨터외국어스쿨학원경상북도 경주시 동천로72번길 5-1 (동천동)김동환054-742-2530<NA>
356영광학원경상북도 경주시 외동읍 입실로 80 (외동읍)박진환054-743-6784보습