Overview

Dataset statistics

Number of variables5
Number of observations255
Missing cells145
Missing cells (%)11.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory10.1 KiB
Average record size in memory40.5 B

Variable types

Text4
Categorical1

Dataset

Description경상북도교육청 경상북도경주교육지원청 교습소 현황 정보 제공(교습소명, 전화번호, 주소 등)
Author경상북도교육청 경상북도경주교육지원청
URLhttps://www.data.go.kr/data/15053453/fileData.do

Alerts

전화번호 has 145 (56.9%) missing valuesMissing
교습소명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:20:52.306411
Analysis finished2023-12-12 12:20:53.438385
Duration1.13 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

교습소명
Text

UNIQUE 

Distinct255
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T21:20:53.633175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length29
Median length22
Mean length9.3607843
Min length6

Characters and Unicode

Total characters2387
Distinct characters333
Distinct categories9 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique255 ?
Unique (%)100.0%

Sample

1st row소리사랑피아노교습소
2nd row햇님피아노교습소
3rd row계명세종피아노교습소
4th row청아미술교습소
5th row효성피아노교습소
ValueCountFrequency (%)
english 2
 
0.8%
소리사랑피아노교습소 1
 
0.4%
뮤직블라썸음악교습소 1
 
0.4%
플러스정쌤수학교습소 1
 
0.4%
놀작황성키즈센터미술교습소 1
 
0.4%
자아의신화국어논술교습소 1
 
0.4%
제이플루트음악교습소 1
 
0.4%
벨칸토성악음악교습소 1
 
0.4%
유림국어교습소 1
 
0.4%
송쌤과학교습소 1
 
0.4%
Other values (250) 250
95.8%
2023-12-12T21:20:54.069057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
263
 
11.0%
261
 
10.9%
254
 
10.6%
78
 
3.3%
77
 
3.2%
77
 
3.2%
70
 
2.9%
65
 
2.7%
54
 
2.3%
52
 
2.2%
Other values (323) 1136
47.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2239
93.8%
Uppercase Letter 64
 
2.7%
Lowercase Letter 56
 
2.3%
Open Punctuation 7
 
0.3%
Close Punctuation 7
 
0.3%
Space Separator 6
 
0.3%
Decimal Number 4
 
0.2%
Other Punctuation 3
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
263
 
11.7%
261
 
11.7%
254
 
11.3%
78
 
3.5%
77
 
3.4%
77
 
3.4%
70
 
3.1%
65
 
2.9%
54
 
2.4%
52
 
2.3%
Other values (278) 988
44.1%
Uppercase Letter
ValueCountFrequency (%)
E 8
12.5%
N 6
 
9.4%
G 5
 
7.8%
L 5
 
7.8%
A 4
 
6.2%
M 4
 
6.2%
S 4
 
6.2%
I 4
 
6.2%
T 3
 
4.7%
H 3
 
4.7%
Other values (11) 18
28.1%
Lowercase Letter
ValueCountFrequency (%)
i 7
12.5%
e 6
10.7%
n 6
10.7%
l 6
10.7%
o 5
8.9%
a 4
 
7.1%
h 3
 
5.4%
u 3
 
5.4%
r 3
 
5.4%
t 3
 
5.4%
Other values (6) 10
17.9%
Other Punctuation
ValueCountFrequency (%)
' 1
33.3%
& 1
33.3%
. 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 7
100.0%
Close Punctuation
ValueCountFrequency (%)
) 7
100.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Decimal Number
ValueCountFrequency (%)
1 4
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2238
93.8%
Latin 120
 
5.0%
Common 28
 
1.2%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
263
 
11.8%
261
 
11.7%
254
 
11.3%
78
 
3.5%
77
 
3.4%
77
 
3.4%
70
 
3.1%
65
 
2.9%
54
 
2.4%
52
 
2.3%
Other values (277) 987
44.1%
Latin
ValueCountFrequency (%)
E 8
 
6.7%
i 7
 
5.8%
e 6
 
5.0%
N 6
 
5.0%
n 6
 
5.0%
l 6
 
5.0%
G 5
 
4.2%
o 5
 
4.2%
L 5
 
4.2%
A 4
 
3.3%
Other values (27) 62
51.7%
Common
ValueCountFrequency (%)
( 7
25.0%
) 7
25.0%
6
21.4%
1 4
14.3%
' 1
 
3.6%
& 1
 
3.6%
. 1
 
3.6%
+ 1
 
3.6%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2238
93.8%
ASCII 148
 
6.2%
CJK 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
263
 
11.8%
261
 
11.7%
254
 
11.3%
78
 
3.5%
77
 
3.4%
77
 
3.4%
70
 
3.1%
65
 
2.9%
54
 
2.4%
52
 
2.3%
Other values (277) 987
44.1%
ASCII
ValueCountFrequency (%)
E 8
 
5.4%
i 7
 
4.7%
( 7
 
4.7%
) 7
 
4.7%
6
 
4.1%
e 6
 
4.1%
N 6
 
4.1%
n 6
 
4.1%
l 6
 
4.1%
G 5
 
3.4%
Other values (35) 84
56.8%
CJK
ValueCountFrequency (%)
1
100.0%
Distinct249
Distinct (%)97.6%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T21:20:54.401600image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length3
Mean length3.0117647
Min length3

Characters and Unicode

Total characters768
Distinct characters133
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique244 ?
Unique (%)95.7%

Sample

1st row김헌숙
2nd row양윤정
3rd row김새별
4th row김원정
5th row박혜정
ValueCountFrequency (%)
김지현 3
 
1.2%
이은진 2
 
0.8%
이민영 2
 
0.8%
이정민 2
 
0.8%
김진희 2
 
0.8%
김현주 1
 
0.4%
원종석 1
 
0.4%
박영희 1
 
0.4%
조은별 1
 
0.4%
박숙희 1
 
0.4%
Other values (239) 239
93.7%
2023-12-12T21:20:54.855169image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
65
 
8.5%
41
 
5.3%
39
 
5.1%
38
 
4.9%
34
 
4.4%
25
 
3.3%
24
 
3.1%
24
 
3.1%
22
 
2.9%
21
 
2.7%
Other values (123) 435
56.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 768
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
8.5%
41
 
5.3%
39
 
5.1%
38
 
4.9%
34
 
4.4%
25
 
3.3%
24
 
3.1%
24
 
3.1%
22
 
2.9%
21
 
2.7%
Other values (123) 435
56.6%

Most occurring scripts

ValueCountFrequency (%)
Hangul 768
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
8.5%
41
 
5.3%
39
 
5.1%
38
 
4.9%
34
 
4.4%
25
 
3.3%
24
 
3.1%
24
 
3.1%
22
 
2.9%
21
 
2.7%
Other values (123) 435
56.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 768
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
65
 
8.5%
41
 
5.3%
39
 
5.1%
38
 
4.9%
34
 
4.4%
25
 
3.3%
24
 
3.1%
24
 
3.1%
22
 
2.9%
21
 
2.7%
Other values (123) 435
56.6%

전화번호
Text

MISSING 

Distinct109
Distinct (%)99.1%
Missing145
Missing (%)56.9%
Memory size2.1 KiB
2023-12-12T21:20:55.109384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.063636
Min length12

Characters and Unicode

Total characters1327
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique108 ?
Unique (%)98.2%

Sample

1st row054-772-7910
2nd row054-743-2003
3rd row054-773-1780
4th row054-741-6631
5th row054-284-2275
ValueCountFrequency (%)
054-748-3915 2
 
1.8%
054-741-1131 1
 
0.9%
054-772-7910 1
 
0.9%
054-761-0882 1
 
0.9%
054-761-4658 1
 
0.9%
070-7368-5529 1
 
0.9%
054-775-2717 1
 
0.9%
054-772-3424 1
 
0.9%
054-773-4788 1
 
0.9%
070-7574-1135 1
 
0.9%
Other values (99) 99
90.0%
2023-12-12T21:20:55.547878image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7 225
17.0%
- 220
16.6%
4 189
14.2%
0 167
12.6%
5 163
12.3%
1 79
 
6.0%
2 77
 
5.8%
3 62
 
4.7%
6 52
 
3.9%
9 50
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1107
83.4%
Dash Punctuation 220
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
7 225
20.3%
4 189
17.1%
0 167
15.1%
5 163
14.7%
1 79
 
7.1%
2 77
 
7.0%
3 62
 
5.6%
6 52
 
4.7%
9 50
 
4.5%
8 43
 
3.9%
Dash Punctuation
ValueCountFrequency (%)
- 220
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1327
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
7 225
17.0%
- 220
16.6%
4 189
14.2%
0 167
12.6%
5 163
12.3%
1 79
 
6.0%
2 77
 
5.8%
3 62
 
4.7%
6 52
 
3.9%
9 50
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1327
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
7 225
17.0%
- 220
16.6%
4 189
14.2%
0 167
12.6%
5 163
12.3%
1 79
 
6.0%
2 77
 
5.8%
3 62
 
4.7%
6 52
 
3.9%
9 50
 
3.8%
Distinct247
Distinct (%)96.9%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
2023-12-12T21:20:55.833857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length57
Median length47
Mean length32.976471
Min length21

Characters and Unicode

Total characters8409
Distinct characters163
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique240 ?
Unique (%)94.1%

Sample

1st row경상북도 경주시 황성로1번길 23-4 , D동 206호 (황성동)
2nd row경상북도 경주시 북문로85번길 42 (성건동)
3rd row경상북도 경주시 승삼북길 7-6 (용강동)
4th row경상북도 경주시 북성로29번길 14 (성건동, 대광빌라)
5th row경상북도 경주시 강동면 강동로 66-26 , 상가 104동 204호 (강동면, 벽산반도타운)
ValueCountFrequency (%)
경상북도 255
 
13.9%
경주시 255
 
13.9%
171
 
9.3%
황성동 90
 
4.9%
충효동 43
 
2.4%
2층 42
 
2.3%
현곡면 32
 
1.7%
용강동 29
 
1.6%
상가 29
 
1.6%
안강읍 26
 
1.4%
Other values (344) 857
46.9%
2023-12-12T21:20:56.330944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1575
 
18.7%
520
 
6.2%
311
 
3.7%
308
 
3.7%
1 301
 
3.6%
2 291
 
3.5%
273
 
3.2%
264
 
3.1%
260
 
3.1%
260
 
3.1%
Other values (153) 4046
48.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4537
54.0%
Space Separator 1575
 
18.7%
Decimal Number 1426
 
17.0%
Open Punctuation 256
 
3.0%
Close Punctuation 255
 
3.0%
Other Punctuation 227
 
2.7%
Dash Punctuation 116
 
1.4%
Uppercase Letter 14
 
0.2%
Lowercase Letter 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
520
 
11.5%
311
 
6.9%
308
 
6.8%
273
 
6.0%
264
 
5.8%
260
 
5.7%
260
 
5.7%
204
 
4.5%
197
 
4.3%
179
 
3.9%
Other values (129) 1761
38.8%
Decimal Number
ValueCountFrequency (%)
1 301
21.1%
2 291
20.4%
0 174
12.2%
3 172
12.1%
4 137
9.6%
5 101
 
7.1%
6 87
 
6.1%
7 72
 
5.0%
9 64
 
4.5%
8 27
 
1.9%
Uppercase Letter
ValueCountFrequency (%)
C 4
28.6%
D 3
21.4%
K 2
14.3%
B 2
14.3%
A 1
 
7.1%
F 1
 
7.1%
G 1
 
7.1%
Lowercase Letter
ValueCountFrequency (%)
c 2
66.7%
k 1
33.3%
Space Separator
ValueCountFrequency (%)
1575
100.0%
Open Punctuation
ValueCountFrequency (%)
( 256
100.0%
Close Punctuation
ValueCountFrequency (%)
) 255
100.0%
Other Punctuation
ValueCountFrequency (%)
, 227
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 116
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4537
54.0%
Common 3855
45.8%
Latin 17
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
520
 
11.5%
311
 
6.9%
308
 
6.8%
273
 
6.0%
264
 
5.8%
260
 
5.7%
260
 
5.7%
204
 
4.5%
197
 
4.3%
179
 
3.9%
Other values (129) 1761
38.8%
Common
ValueCountFrequency (%)
1575
40.9%
1 301
 
7.8%
2 291
 
7.5%
( 256
 
6.6%
) 255
 
6.6%
, 227
 
5.9%
0 174
 
4.5%
3 172
 
4.5%
4 137
 
3.6%
- 116
 
3.0%
Other values (5) 351
 
9.1%
Latin
ValueCountFrequency (%)
C 4
23.5%
D 3
17.6%
K 2
11.8%
c 2
11.8%
B 2
11.8%
k 1
 
5.9%
A 1
 
5.9%
F 1
 
5.9%
G 1
 
5.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4537
54.0%
ASCII 3872
46.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1575
40.7%
1 301
 
7.8%
2 291
 
7.5%
( 256
 
6.6%
) 255
 
6.6%
, 227
 
5.9%
0 174
 
4.5%
3 172
 
4.4%
4 137
 
3.5%
- 116
 
3.0%
Other values (14) 368
 
9.5%
Hangul
ValueCountFrequency (%)
520
 
11.5%
311
 
6.9%
308
 
6.8%
273
 
6.0%
264
 
5.8%
260
 
5.7%
260
 
5.7%
204
 
4.5%
197
 
4.3%
179
 
3.9%
Other values (129) 1761
38.8%

교습과정
Categorical

Distinct10
Distinct (%)3.9%
Missing0
Missing (%)0.0%
Memory size2.1 KiB
보습
144 
음악
72 
미술
21 
실용외국어(유아/초·중·고)
 
7
보습·논술
 
6
Other values (5)
 
5

Length

Max length15
Median length2
Mean length2.4666667
Min length2

Unique

Unique5 ?
Unique (%)2.0%

Sample

1st row음악
2nd row음악
3rd row음악
4th row미술
5th row음악

Common Values

ValueCountFrequency (%)
보습 144
56.5%
음악 72
28.2%
미술 21
 
8.2%
실용외국어(유아/초·중·고) 7
 
2.7%
보습·논술 6
 
2.4%
컴퓨터(소) 1
 
0.4%
입시 1
 
0.4%
기타(소) 1
 
0.4%
바둑 1
 
0.4%
입시·논술 1
 
0.4%

Length

2023-12-12T21:20:56.487110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T21:20:56.629473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보습 144
56.5%
음악 72
28.2%
미술 21
 
8.2%
실용외국어(유아/초·중·고 7
 
2.7%
보습·논술 6
 
2.4%
컴퓨터(소 1
 
0.4%
입시 1
 
0.4%
기타(소 1
 
0.4%
바둑 1
 
0.4%
입시·논술 1
 
0.4%

Missing values

2023-12-12T21:20:53.206595image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:20:53.344536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

교습소명교습자-성명전화번호교습소주소교습과정
0소리사랑피아노교습소김헌숙054-772-7910경상북도 경주시 황성로1번길 23-4 , D동 206호 (황성동)음악
1햇님피아노교습소양윤정054-743-2003경상북도 경주시 북문로85번길 42 (성건동)음악
2계명세종피아노교습소김새별054-773-1780경상북도 경주시 승삼북길 7-6 (용강동)음악
3청아미술교습소김원정054-741-6631경상북도 경주시 북성로29번길 14 (성건동, 대광빌라)미술
4효성피아노교습소박혜정054-284-2275경상북도 경주시 강동면 강동로 66-26 , 상가 104동 204호 (강동면, 벽산반도타운)음악
5강하은피아노교습소강진미054-748-1948경상북도 경주시 충효녹지길 142-7 , 상가 201호 (충효동, 대우아파트 1차)음악
6배남주피아노교습소배남주054-749-4383경상북도 경주시 용담로92번길 43-10 (황성동)음악
7열린미술교습소정해경054-776-1733경상북도 경주시 황성로69번길 27-1 (황성동)음악
8최예화피아노교습소최예화054-748-6789경상북도 경주시 황성로27번길 15 , 상가 107호 (황성동, 현대아파트 1차)음악
9진솔피아노교습소류미영054-774-9650경상북도 경주시 천북면 천북로 344 (천북면)음악
교습소명교습자-성명전화번호교습소주소교습과정
245라이즈(RISE)수학교습소김화준<NA>경상북도 경주시 황성로1번길 15 , 3층 (황성동)보습
246온바이올린교습소김보석<NA>경상북도 경주시 황성로64번길 33 , 1층 (황성동)음악
247소르본역사논술청어람한국사교습소김은진<NA>경상북도 경주시 황성로27번길 15 , 203호 (황성동)보습·논술
248민수학교습소이민희<NA>경상북도 경주시 소금강로 33 , 2층 (용강동)보습
249퍼펙트영어교습소이명숙<NA>경상북도 경주시 황성로1번길 27-1 , 203호 (황성동)보습
250영창음악교습소최유희054-772-6707경상북도 경주시 임해로 117 (황오동)음악
251영음피아노과외교습소김영아054-749-0009경상북도 경주시 현곡면 금장5길 20-11 , 상가 나동 203호 (현곡면, 삼성강변타운)음악
252한상선피아노교습소한상선054-775-0770경상북도 경주시 현곡면 금장5길 34-15 , 상가 126동 204호 (현곡면, 신한토탈)음악
253김신애피아노교습소김신애054-772-4763경상북도 경주시 탈해로 20-3 (동천동)음악
254계명피아노교습소김희숙054-771-1383경상북도 경주시 한빛길20번길 22 (성건동)음악