Overview

Dataset statistics

Number of variables5
Number of observations85
Missing cells48
Missing cells (%)11.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.5 KiB
Average record size in memory42.6 B

Variable types

Numeric1
Text4

Dataset

Description경상북도상주교육지원청 관할 학원 및 교습소 현황 정보를 제공하는 서비스로서 학원명, 설립자, 전화번호, 학원주소를 제공
Author경상북도교육청 경상북도상주교육지원청
URLhttps://www.data.go.kr/data/15053524/fileData.do

Alerts

전화번호 has 48 (56.5%) missing valuesMissing
연번 has unique valuesUnique
교습소명 has unique valuesUnique
교습자 has unique valuesUnique

Reproduction

Analysis started2023-12-12 12:53:53.267679
Analysis finished2023-12-12 12:53:53.952738
Duration0.69 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct85
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43
Minimum1
Maximum85
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size897.0 B
2023-12-12T21:53:54.041704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.2
Q122
median43
Q364
95-th percentile80.8
Maximum85
Range84
Interquartile range (IQR)42

Descriptive statistics

Standard deviation24.681302
Coefficient of variation (CV)0.57398377
Kurtosis-1.2
Mean43
Median Absolute Deviation (MAD)21
Skewness0
Sum3655
Variance609.16667
MonotonicityStrictly increasing
2023-12-12T21:53:54.190472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.2%
55 1
 
1.2%
63 1
 
1.2%
62 1
 
1.2%
61 1
 
1.2%
60 1
 
1.2%
59 1
 
1.2%
58 1
 
1.2%
57 1
 
1.2%
56 1
 
1.2%
Other values (75) 75
88.2%
ValueCountFrequency (%)
1 1
1.2%
2 1
1.2%
3 1
1.2%
4 1
1.2%
5 1
1.2%
6 1
1.2%
7 1
1.2%
8 1
1.2%
9 1
1.2%
10 1
1.2%
ValueCountFrequency (%)
85 1
1.2%
84 1
1.2%
83 1
1.2%
82 1
1.2%
81 1
1.2%
80 1
1.2%
79 1
1.2%
78 1
1.2%
77 1
1.2%
76 1
1.2%

교습소명
Text

UNIQUE 

Distinct85
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-12T21:53:54.463057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length16
Mean length9.2
Min length6

Characters and Unicode

Total characters782
Distinct characters188
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)100.0%

Sample

1st row솔로몬영탑수학교습소
2nd row중앙뮤엠영어교습소
3rd row패스영어교습소
4th row매쓰탑수학교습소
5th row상산뮤엠영어교습소
ValueCountFrequency (%)
솔로몬영탑수학교습소 1
 
1.2%
준수학교습소 1
 
1.2%
드림음악교습소 1
 
1.2%
피닉스영어교습소 1
 
1.2%
라온국어논술교습소 1
 
1.2%
뮤엠영어교습소 1
 
1.2%
낙양해법수학교습소 1
 
1.2%
미술이천진한세계로가다미술교습소 1
 
1.2%
해법영어교습소 1
 
1.2%
최강수학교습소 1
 
1.2%
Other values (75) 75
88.2%
2023-12-12T21:53:54.849545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
88
 
11.3%
86
 
11.0%
85
 
10.9%
38
 
4.9%
37
 
4.7%
23
 
2.9%
21
 
2.7%
20
 
2.6%
18
 
2.3%
15
 
1.9%
Other values (178) 351
44.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 768
98.2%
Uppercase Letter 7
 
0.9%
Lowercase Letter 3
 
0.4%
Open Punctuation 2
 
0.3%
Close Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
88
 
11.5%
86
 
11.2%
85
 
11.1%
38
 
4.9%
37
 
4.8%
23
 
3.0%
21
 
2.7%
20
 
2.6%
18
 
2.3%
15
 
2.0%
Other values (167) 337
43.9%
Uppercase Letter
ValueCountFrequency (%)
L 2
28.6%
G 1
14.3%
Y 1
14.3%
E 1
14.3%
I 1
14.3%
P 1
14.3%
Lowercase Letter
ValueCountFrequency (%)
e 1
33.3%
h 1
33.3%
t 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 768
98.2%
Latin 10
 
1.3%
Common 4
 
0.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
88
 
11.5%
86
 
11.2%
85
 
11.1%
38
 
4.9%
37
 
4.8%
23
 
3.0%
21
 
2.7%
20
 
2.6%
18
 
2.3%
15
 
2.0%
Other values (167) 337
43.9%
Latin
ValueCountFrequency (%)
L 2
20.0%
G 1
10.0%
Y 1
10.0%
E 1
10.0%
e 1
10.0%
h 1
10.0%
t 1
10.0%
I 1
10.0%
P 1
10.0%
Common
ValueCountFrequency (%)
( 2
50.0%
) 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 768
98.2%
ASCII 14
 
1.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
88
 
11.5%
86
 
11.2%
85
 
11.1%
38
 
4.9%
37
 
4.8%
23
 
3.0%
21
 
2.7%
20
 
2.6%
18
 
2.3%
15
 
2.0%
Other values (167) 337
43.9%
ASCII
ValueCountFrequency (%)
L 2
14.3%
( 2
14.3%
) 2
14.3%
G 1
7.1%
Y 1
7.1%
E 1
7.1%
e 1
7.1%
h 1
7.1%
t 1
7.1%
I 1
7.1%

교습자
Text

UNIQUE 

Distinct85
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-12T21:53:55.146982image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length3
Mean length3.2352941
Min length3

Characters and Unicode

Total characters275
Distinct characters108
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique85 ?
Unique (%)100.0%

Sample

1st row금병서
2nd row김현미
3rd row박지훈
4th row홍진곤
5th row조가희
ValueCountFrequency (%)
금병서 1
 
1.1%
강문숙 1
 
1.1%
이시형 1
 
1.1%
안영화 1
 
1.1%
차현미 1
 
1.1%
변광일 1
 
1.1%
임수진 1
 
1.1%
조인혜 1
 
1.1%
채미영 1
 
1.1%
김미경 1
 
1.1%
Other values (77) 77
88.5%
2023-12-12T21:53:55.634167image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
23
 
8.4%
12
 
4.4%
12
 
4.4%
9
 
3.3%
9
 
3.3%
9
 
3.3%
8
 
2.9%
8
 
2.9%
8
 
2.9%
6
 
2.2%
Other values (98) 171
62.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 253
92.0%
Uppercase Letter 20
 
7.3%
Space Separator 2
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
23
 
9.1%
12
 
4.7%
12
 
4.7%
9
 
3.6%
9
 
3.6%
9
 
3.6%
8
 
3.2%
8
 
3.2%
8
 
3.2%
6
 
2.4%
Other values (84) 149
58.9%
Uppercase Letter
ValueCountFrequency (%)
N 3
15.0%
T 3
15.0%
I 2
10.0%
S 2
10.0%
A 2
10.0%
P 1
 
5.0%
L 1
 
5.0%
E 1
 
5.0%
O 1
 
5.0%
B 1
 
5.0%
Other values (3) 3
15.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 253
92.0%
Latin 20
 
7.3%
Common 2
 
0.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
23
 
9.1%
12
 
4.7%
12
 
4.7%
9
 
3.6%
9
 
3.6%
9
 
3.6%
8
 
3.2%
8
 
3.2%
8
 
3.2%
6
 
2.4%
Other values (84) 149
58.9%
Latin
ValueCountFrequency (%)
N 3
15.0%
T 3
15.0%
I 2
10.0%
S 2
10.0%
A 2
10.0%
P 1
 
5.0%
L 1
 
5.0%
E 1
 
5.0%
O 1
 
5.0%
B 1
 
5.0%
Other values (3) 3
15.0%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 253
92.0%
ASCII 22
 
8.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
23
 
9.1%
12
 
4.7%
12
 
4.7%
9
 
3.6%
9
 
3.6%
9
 
3.6%
8
 
3.2%
8
 
3.2%
8
 
3.2%
6
 
2.4%
Other values (84) 149
58.9%
ASCII
ValueCountFrequency (%)
N 3
13.6%
T 3
13.6%
I 2
9.1%
S 2
9.1%
A 2
9.1%
2
9.1%
P 1
 
4.5%
L 1
 
4.5%
E 1
 
4.5%
O 1
 
4.5%
Other values (4) 4
18.2%

전화번호
Text

MISSING 

Distinct37
Distinct (%)100.0%
Missing48
Missing (%)56.5%
Memory size812.0 B
2023-12-12T21:53:55.867569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters444
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row054-531-1518
2nd row054-535-0509
3rd row054-534-1931
4th row054-534-7890
5th row054-536-0777
ValueCountFrequency (%)
054-536-2098 1
 
2.7%
054-536-1697 1
 
2.7%
054-532-0942 1
 
2.7%
054-532-0521 1
 
2.7%
054-535-2042 1
 
2.7%
054-536-8500 1
 
2.7%
054-536-9545 1
 
2.7%
054-541-2777 1
 
2.7%
054-534-1949 1
 
2.7%
054-531-2280 1
 
2.7%
Other values (27) 27
73.0%
2023-12-12T21:53:56.309752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 95
21.4%
- 74
16.7%
0 56
12.6%
4 53
11.9%
3 49
11.0%
6 23
 
5.2%
2 21
 
4.7%
7 21
 
4.7%
9 20
 
4.5%
1 18
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 370
83.3%
Dash Punctuation 74
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 95
25.7%
0 56
15.1%
4 53
14.3%
3 49
13.2%
6 23
 
6.2%
2 21
 
5.7%
7 21
 
5.7%
9 20
 
5.4%
1 18
 
4.9%
8 14
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 74
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 444
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 95
21.4%
- 74
16.7%
0 56
12.6%
4 53
11.9%
3 49
11.0%
6 23
 
5.2%
2 21
 
4.7%
7 21
 
4.7%
9 20
 
4.5%
1 18
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 444
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 95
21.4%
- 74
16.7%
0 56
12.6%
4 53
11.9%
3 49
11.0%
6 23
 
5.2%
2 21
 
4.7%
7 21
 
4.7%
9 20
 
4.5%
1 18
 
4.1%

주소
Text

Distinct79
Distinct (%)92.9%
Missing0
Missing (%)0.0%
Memory size812.0 B
2023-12-12T21:53:56.667688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length49
Median length44
Mean length27.870588
Min length21

Characters and Unicode

Total characters2369
Distinct characters83
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique73 ?
Unique (%)85.9%

Sample

1st row경상북도 상주시 냉림로 35 , 상가 101-1호 (냉림동,냉림드림뷰 103동)
2nd row경상북도 상주시 서문길 83 (서문동)
3rd row경상북도 상주시 상서문3길 136 , 2층 (남성동)
4th row경상북도 상주시 상산로 189-1 , 2층 (남성동)
5th row경상북도 상주시 왕산로 343 , 1층 (냉림동)
ValueCountFrequency (%)
경상북도 85
 
15.5%
상주시 85
 
15.5%
49
 
8.9%
남성동 32
 
5.8%
상산로 21
 
3.8%
2층 19
 
3.5%
냉림동 16
 
2.9%
1층 11
 
2.0%
서문동 8
 
1.5%
중앙로 7
 
1.3%
Other values (136) 217
39.5%
2023-12-12T21:53:57.095784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
465
19.6%
227
 
9.6%
106
 
4.5%
1 99
 
4.2%
91
 
3.8%
88
 
3.7%
) 87
 
3.7%
( 86
 
3.6%
86
 
3.6%
85
 
3.6%
Other values (73) 949
40.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1299
54.8%
Space Separator 465
 
19.6%
Decimal Number 358
 
15.1%
Close Punctuation 87
 
3.7%
Open Punctuation 86
 
3.6%
Other Punctuation 57
 
2.4%
Dash Punctuation 17
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
227
17.5%
106
 
8.2%
91
 
7.0%
88
 
6.8%
86
 
6.6%
85
 
6.5%
85
 
6.5%
45
 
3.5%
45
 
3.5%
40
 
3.1%
Other values (58) 401
30.9%
Decimal Number
ValueCountFrequency (%)
1 99
27.7%
2 73
20.4%
3 54
15.1%
9 27
 
7.5%
0 26
 
7.3%
4 21
 
5.9%
6 16
 
4.5%
5 16
 
4.5%
8 15
 
4.2%
7 11
 
3.1%
Space Separator
ValueCountFrequency (%)
465
100.0%
Close Punctuation
ValueCountFrequency (%)
) 87
100.0%
Open Punctuation
ValueCountFrequency (%)
( 86
100.0%
Other Punctuation
ValueCountFrequency (%)
, 57
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1299
54.8%
Common 1070
45.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
227
17.5%
106
 
8.2%
91
 
7.0%
88
 
6.8%
86
 
6.6%
85
 
6.5%
85
 
6.5%
45
 
3.5%
45
 
3.5%
40
 
3.1%
Other values (58) 401
30.9%
Common
ValueCountFrequency (%)
465
43.5%
1 99
 
9.3%
) 87
 
8.1%
( 86
 
8.0%
2 73
 
6.8%
, 57
 
5.3%
3 54
 
5.0%
9 27
 
2.5%
0 26
 
2.4%
4 21
 
2.0%
Other values (5) 75
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1299
54.8%
ASCII 1070
45.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
465
43.5%
1 99
 
9.3%
) 87
 
8.1%
( 86
 
8.0%
2 73
 
6.8%
, 57
 
5.3%
3 54
 
5.0%
9 27
 
2.5%
0 26
 
2.4%
4 21
 
2.0%
Other values (5) 75
 
7.0%
Hangul
ValueCountFrequency (%)
227
17.5%
106
 
8.2%
91
 
7.0%
88
 
6.8%
86
 
6.6%
85
 
6.5%
85
 
6.5%
45
 
3.5%
45
 
3.5%
40
 
3.1%
Other values (58) 401
30.9%

Interactions

2023-12-12T21:53:53.643803image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T21:53:57.441673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번교습소명교습자전화번호주소
연번1.0001.0001.0001.0000.832
교습소명1.0001.0001.0001.0001.000
교습자1.0001.0001.0001.0001.000
전화번호1.0001.0001.0001.0001.000
주소0.8321.0001.0001.0001.000

Missing values

2023-12-12T21:53:53.786847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T21:53:53.907138image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번교습소명교습자전화번호주소
01솔로몬영탑수학교습소금병서054-531-1518경상북도 상주시 냉림로 35 , 상가 101-1호 (냉림동,냉림드림뷰 103동)
12중앙뮤엠영어교습소김현미054-535-0509경상북도 상주시 서문길 83 (서문동)
23패스영어교습소박지훈<NA>경상북도 상주시 상서문3길 136 , 2층 (남성동)
34매쓰탑수학교습소홍진곤054-534-1931경상북도 상주시 상산로 189-1 , 2층 (남성동)
45상산뮤엠영어교습소조가희<NA>경상북도 상주시 왕산로 343 , 1층 (냉림동)
56뿌리깊은나무수학교습소서영애054-534-7890경상북도 상주시 동수4길 130 (냉림동)
67마일즈영어교습소STAPLETON BRIAN JUSTIN<NA>경상북도 상주시 상산로 209 , 1층 (남성동)
78이룸수학교습소이현숙054-536-0777경상북도 상주시 남성3길 27 , 1층 (남성동)
89벨영어교습소박명숙<NA>경상북도 상주시 상산로 145 , 동아아파트상가동 113호 (신봉동)
910토론하는아이들논술교습소배지현<NA>경상북도 상주시 상산로 199 , 3층 (남성동)
연번교습소명교습자전화번호주소
7576홍샘수학교습소김기홍<NA>경상북도 상주시 중앙로 251 , 2층 (성동동)
7677선수학교습소박정희<NA>경상북도 상주시 상서문2길 114-10 , 3층 (남성동)
7778렛씽잉글리쉬교습소이다희<NA>경상북도 상주시 상산로 366 , 상가동 1호 (냉림동, 냉림 1주공아파트)
7879제이미영어교습소원제이미<NA>경상북도 상주시 경상대로 2926 , 상가동 105 (낙양동)
7980엔이능률주니어랩영어교습소김송지<NA>경상북도 상주시 냉림3길 48 (냉림동)
8081토론하는아이들성동초센터논술교습소황미주<NA>경상북도 상주시 중앙로 292 , 2층 (성동동)
8182줄리아니기타교습소조민규<NA>경상북도 상주시 성동로 22 , 201호 (성동동)
8283멘토수학교습소유명실<NA>경상북도 상주시 상산로 178 , 상가 11호 (신봉동, 현대아파트)
8384놀이아트미술교습소김성아<NA>경상북도 상주시 중앙로 217-8 , 2층 (서성동)
8485한스잉글리시외국어교습소한동익<NA>경상북도 상주시 상서문2길 123 (남성동)