Overview

Dataset statistics

Number of variables5
Number of observations34
Missing cells24
Missing cells (%)14.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.5 KiB
Average record size in memory43.9 B

Variable types

Text3
Categorical2

Dataset

Description경상북도 문경지역 내 교습소에 대하여 교습소명, 전화번화, 교습소주소, 분야구분, 교습과정의 항목을 제공합니다.
Author경상북도교육청 경상북도문경교육지원청
URLhttps://www.data.go.kr/data/15053241/fileData.do

Alerts

교습과정 is highly overall correlated with 분야구분High correlation
분야구분 is highly overall correlated with 교습과정High correlation
전화번호 has 24 (70.6%) missing valuesMissing
교습소명 has unique valuesUnique

Reproduction

Analysis started2023-12-13 00:15:36.989560
Analysis finished2023-12-13 00:15:37.313872
Duration0.32 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

교습소명
Text

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
2023-12-13T09:15:37.441815image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length12
Mean length9.6176471
Min length7

Characters and Unicode

Total characters327
Distinct characters119
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row혜림피아노교습소
2nd row리라피아노교습소
3rd row남부바둑교습소
4th row호서남피아노교습소
5th row하늘그리기미술교습소
ValueCountFrequency (%)
혜림피아노교습소 1
 
2.8%
백소영피아노교습소 1
 
2.8%
희피아노교습소 1
 
2.8%
스마트해법수학교습소 1
 
2.8%
에스클라비어(s 1
 
2.8%
klavier)음악교습소 1
 
2.8%
공부의신영어교습소 1
 
2.8%
창의레고교습소 1
 
2.8%
리라피아노교습소 1
 
2.8%
보이앤걸스페이스미술교습소 1
 
2.8%
Other values (26) 26
72.2%
2023-12-13T09:15:37.730308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
35
 
10.7%
34
 
10.4%
34
 
10.4%
11
 
3.4%
10
 
3.1%
10
 
3.1%
9
 
2.8%
9
 
2.8%
8
 
2.4%
7
 
2.1%
Other values (109) 160
48.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 300
91.7%
Lowercase Letter 13
 
4.0%
Uppercase Letter 5
 
1.5%
Open Punctuation 3
 
0.9%
Close Punctuation 3
 
0.9%
Space Separator 2
 
0.6%
Other Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
35
 
11.7%
34
 
11.3%
34
 
11.3%
11
 
3.7%
10
 
3.3%
10
 
3.3%
9
 
3.0%
9
 
3.0%
8
 
2.7%
7
 
2.3%
Other values (90) 133
44.3%
Lowercase Letter
ValueCountFrequency (%)
e 2
15.4%
i 2
15.4%
l 2
15.4%
s 1
7.7%
r 1
7.7%
g 1
7.7%
v 1
7.7%
a 1
7.7%
n 1
7.7%
h 1
7.7%
Uppercase Letter
ValueCountFrequency (%)
E 1
20.0%
S 1
20.0%
K 1
20.0%
Y 1
20.0%
J 1
20.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 300
91.7%
Latin 18
 
5.5%
Common 9
 
2.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
35
 
11.7%
34
 
11.3%
34
 
11.3%
11
 
3.7%
10
 
3.3%
10
 
3.3%
9
 
3.0%
9
 
3.0%
8
 
2.7%
7
 
2.3%
Other values (90) 133
44.3%
Latin
ValueCountFrequency (%)
e 2
 
11.1%
i 2
 
11.1%
l 2
 
11.1%
s 1
 
5.6%
r 1
 
5.6%
g 1
 
5.6%
E 1
 
5.6%
v 1
 
5.6%
a 1
 
5.6%
S 1
 
5.6%
Other values (5) 5
27.8%
Common
ValueCountFrequency (%)
( 3
33.3%
) 3
33.3%
2
22.2%
. 1
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 300
91.7%
ASCII 27
 
8.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
35
 
11.7%
34
 
11.3%
34
 
11.3%
11
 
3.7%
10
 
3.3%
10
 
3.3%
9
 
3.0%
9
 
3.0%
8
 
2.7%
7
 
2.3%
Other values (90) 133
44.3%
ASCII
ValueCountFrequency (%)
( 3
 
11.1%
) 3
 
11.1%
2
 
7.4%
e 2
 
7.4%
i 2
 
7.4%
l 2
 
7.4%
s 1
 
3.7%
r 1
 
3.7%
g 1
 
3.7%
E 1
 
3.7%
Other values (9) 9
33.3%

전화번호
Text

MISSING 

Distinct10
Distinct (%)100.0%
Missing24
Missing (%)70.6%
Memory size404.0 B
2023-12-13T09:15:37.867959image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters120
Distinct characters10
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)100.0%

Sample

1st row054-553-2633
2nd row054-556-6646
3rd row054-556-2111
4th row054-556-5531
5th row054-554-5085
ValueCountFrequency (%)
054-553-2633 1
10.0%
054-556-6646 1
10.0%
054-556-2111 1
10.0%
054-556-5531 1
10.0%
054-554-5085 1
10.0%
054-552-3139 1
10.0%
054-556-8153 1
10.0%
054-554-2211 1
10.0%
054-555-5298 1
10.0%
054-555-0334 1
10.0%
2023-12-13T09:15:38.089288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 38
31.7%
- 20
16.7%
4 14
 
11.7%
0 12
 
10.0%
3 9
 
7.5%
6 8
 
6.7%
1 8
 
6.7%
2 6
 
5.0%
8 3
 
2.5%
9 2
 
1.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100
83.3%
Dash Punctuation 20
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 38
38.0%
4 14
 
14.0%
0 12
 
12.0%
3 9
 
9.0%
6 8
 
8.0%
1 8
 
8.0%
2 6
 
6.0%
8 3
 
3.0%
9 2
 
2.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 120
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 38
31.7%
- 20
16.7%
4 14
 
11.7%
0 12
 
10.0%
3 9
 
7.5%
6 8
 
6.7%
1 8
 
6.7%
2 6
 
5.0%
8 3
 
2.5%
9 2
 
1.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 38
31.7%
- 20
16.7%
4 14
 
11.7%
0 12
 
10.0%
3 9
 
7.5%
6 8
 
6.7%
1 8
 
6.7%
2 6
 
5.0%
8 3
 
2.5%
9 2
 
1.7%
Distinct31
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size404.0 B
2023-12-13T09:15:38.271844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length32
Mean length26.558824
Min length21

Characters and Unicode

Total characters903
Distinct characters62
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique28 ?
Unique (%)82.4%

Sample

1st row경상북도 문경시 호서로 116 (흥덕동)
2nd row경상북도 문경시 매봉2길 15-19 (모전동)
3rd row경상북도 문경시 매봉3길 9-1 (모전동)
4th row경상북도 문경시 중앙로 221-10
5th row경상북도 문경시 매봉2길 11-1 (모전동)
ValueCountFrequency (%)
경상북도 34
16.1%
문경시 34
16.1%
모전동 23
 
10.9%
16
 
7.6%
흥덕동 8
 
3.8%
2층 7
 
3.3%
호서로 7
 
3.3%
매봉2길 5
 
2.4%
매봉로 5
 
2.4%
매봉3길 4
 
1.9%
Other values (49) 68
32.2%
2023-12-13T09:15:38.529929image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
179
19.8%
69
 
7.6%
36
 
4.0%
35
 
3.9%
35
 
3.9%
35
 
3.9%
35
 
3.9%
2 35
 
3.9%
( 34
 
3.8%
34
 
3.8%
Other values (52) 376
41.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 492
54.5%
Space Separator 179
 
19.8%
Decimal Number 132
 
14.6%
Open Punctuation 34
 
3.8%
Close Punctuation 34
 
3.8%
Other Punctuation 21
 
2.3%
Dash Punctuation 11
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
69
14.0%
36
 
7.3%
35
 
7.1%
35
 
7.1%
35
 
7.1%
35
 
7.1%
34
 
6.9%
23
 
4.7%
23
 
4.7%
18
 
3.7%
Other values (37) 149
30.3%
Decimal Number
ValueCountFrequency (%)
2 35
26.5%
1 34
25.8%
0 15
11.4%
6 12
 
9.1%
3 9
 
6.8%
5 8
 
6.1%
4 7
 
5.3%
9 5
 
3.8%
7 4
 
3.0%
8 3
 
2.3%
Space Separator
ValueCountFrequency (%)
179
100.0%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Other Punctuation
ValueCountFrequency (%)
, 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 492
54.5%
Common 411
45.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
69
14.0%
36
 
7.3%
35
 
7.1%
35
 
7.1%
35
 
7.1%
35
 
7.1%
34
 
6.9%
23
 
4.7%
23
 
4.7%
18
 
3.7%
Other values (37) 149
30.3%
Common
ValueCountFrequency (%)
179
43.6%
2 35
 
8.5%
( 34
 
8.3%
1 34
 
8.3%
) 34
 
8.3%
, 21
 
5.1%
0 15
 
3.6%
6 12
 
2.9%
- 11
 
2.7%
3 9
 
2.2%
Other values (5) 27
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 492
54.5%
ASCII 411
45.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
179
43.6%
2 35
 
8.5%
( 34
 
8.3%
1 34
 
8.3%
) 34
 
8.3%
, 21
 
5.1%
0 15
 
3.6%
6 12
 
2.9%
- 11
 
2.7%
3 9
 
2.2%
Other values (5) 27
 
6.6%
Hangul
ValueCountFrequency (%)
69
14.0%
36
 
7.3%
35
 
7.1%
35
 
7.1%
35
 
7.1%
35
 
7.1%
34
 
6.9%
23
 
4.7%
23
 
4.7%
18
 
3.7%
Other values (37) 149
30.3%

분야구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Memory size404.0 B
입시.검정 및 보습
17 
예능(대)
15 
기타(대)

Length

Max length10
Median length7.5
Mean length7.5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row예능(대)
2nd row예능(대)
3rd row기타(대)
4th row예능(대)
5th row예능(대)

Common Values

ValueCountFrequency (%)
입시.검정 및 보습 17
50.0%
예능(대) 15
44.1%
기타(대) 2
 
5.9%

Length

2023-12-13T09:15:38.639775image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T09:15:38.715420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
입시.검정 17
25.0%
17
25.0%
보습 17
25.0%
예능(대 15
22.1%
기타(대 2
 
2.9%

교습과정
Categorical

HIGH CORRELATION 

Distinct5
Distinct (%)14.7%
Missing0
Missing (%)0.0%
Memory size404.0 B
보습
17 
음악
10 
미술
바둑
 
1
서예
 
1

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique2 ?
Unique (%)5.9%

Sample

1st row음악
2nd row음악
3rd row바둑
4th row음악
5th row미술

Common Values

ValueCountFrequency (%)
보습 17
50.0%
음악 10
29.4%
미술 5
 
14.7%
바둑 1
 
2.9%
서예 1
 
2.9%

Length

2023-12-13T09:15:38.793034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T09:15:38.872314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보습 17
50.0%
음악 10
29.4%
미술 5
 
14.7%
바둑 1
 
2.9%
서예 1
 
2.9%

Correlations

2023-12-13T09:15:38.932642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교습소명전화번호교습소주소분야구분교습과정
교습소명1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
교습소주소1.0001.0001.0000.9540.982
분야구분1.0001.0000.9541.0001.000
교습과정1.0001.0000.9821.0001.000
2023-12-13T09:15:39.026735image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
교습과정분야구분
교습과정1.0000.967
분야구분0.9671.000
2023-12-13T09:15:39.106856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분야구분교습과정
분야구분1.0000.967
교습과정0.9671.000

Missing values

2023-12-13T09:15:37.218418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T09:15:37.285944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

교습소명전화번호교습소주소분야구분교습과정
0혜림피아노교습소054-553-2633경상북도 문경시 호서로 116 (흥덕동)예능(대)음악
1리라피아노교습소054-556-6646경상북도 문경시 매봉2길 15-19 (모전동)예능(대)음악
2남부바둑교습소054-556-2111경상북도 문경시 매봉3길 9-1 (모전동)기타(대)바둑
3호서남피아노교습소054-556-5531경상북도 문경시 중앙로 221-10예능(대)음악
4하늘그리기미술교습소054-554-5085경상북도 문경시 매봉2길 11-1 (모전동)예능(대)미술
5우등생스터디수학교습소054-552-3139경상북도 문경시 매봉로 74 , 2층 (모전동)입시.검정 및 보습보습
6행복한음악교습소054-556-8153경상북도 문경시 중앙8길 20 , 2층 (점촌동)예능(대)음악
7이네트수학교습소<NA>경상북도 문경시 중앙로 223-18 (흥덕동)입시.검정 및 보습보습
8로제타스톤영어교습소<NA>경상북도 문경시 매봉로 25 , 3층 (모전동)입시.검정 및 보습보습
9백소영피아노교습소<NA>경상북도 문경시 당교6길 25 , 상가 108호 (모전동, 문경코아루아파트)예능(대)음악
교습소명전화번호교습소주소분야구분교습과정
24최상위학습관수학교습소<NA>경상북도 문경시 매봉4길 21 , 202호 일부 (모전동, 대동타운)입시.검정 및 보습보습
25신성자바이올린교습소<NA>경상북도 문경시 당교6길 10 , 2층 201호 (모전동)예능(대)음악
26보이앤걸스페이스미술교습소<NA>경상북도 문경시 매봉2길 4 (모전동)예능(대)미술
27창의레고교습소<NA>경상북도 문경시 호서로 101 , 1층 (흥덕동, 건영웰스파크)예능(대)미술
28공부의신영어교습소<NA>경상북도 문경시 당교6길 10 , 202호 (모전동)입시.검정 및 보습보습
29에스클라비어(S Klavier)음악교습소<NA>경상북도 문경시 당교2길 6 , 2층 (모전동)예능(대)음악
30스마트해법수학교습소<NA>경상북도 문경시 호서로 105 , 1층 (흥덕동)입시.검정 및 보습보습
31희피아노교습소054-555-5298경상북도 문경시 중앙로 69 (점촌동)예능(대)음악
32현단서예교습소054-555-0334경상북도 문경시 매봉2길 24 (모전동)기타(대)서예
33Y.J 잉글리시(English)영어교습소<NA>경상북도 문경시 호서로 102-1 (흥덕동)입시.검정 및 보습보습