Overview

Dataset statistics

Number of variables5
Number of observations243
Missing cells98
Missing cells (%)8.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.9 KiB
Average record size in memory41.5 B

Variable types

Numeric1
Text3
Categorical1

Dataset

Description김천교육지원청내 학원 및 교습소 현황에 대한 데이터로
Author경상북도교육청 경상북도김천교육지원청
URLhttps://www.data.go.kr/data/3070214/fileData.do

Alerts

is highly overall correlated with 학원종류High correlation
학원종류 is highly overall correlated with High correlation
전화번호 has 98 (40.3%) missing valuesMissing
has unique valuesUnique
학원명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 23:07:55.401763
Analysis finished2023-12-12 23:07:55.984349
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables


Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct243
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean122
Minimum1
Maximum243
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size2.3 KiB
2023-12-13T08:07:56.058304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.1
Q161.5
median122
Q3182.5
95-th percentile230.9
Maximum243
Range242
Interquartile range (IQR)121

Descriptive statistics

Standard deviation70.292247
Coefficient of variation (CV)0.57616596
Kurtosis-1.2
Mean122
Median Absolute Deviation (MAD)61
Skewness0
Sum29646
Variance4941
MonotonicityStrictly increasing
2023-12-13T08:07:56.187165image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.4%
154 1
 
0.4%
156 1
 
0.4%
157 1
 
0.4%
158 1
 
0.4%
159 1
 
0.4%
160 1
 
0.4%
161 1
 
0.4%
162 1
 
0.4%
163 1
 
0.4%
Other values (233) 233
95.9%
ValueCountFrequency (%)
1 1
0.4%
2 1
0.4%
3 1
0.4%
4 1
0.4%
5 1
0.4%
6 1
0.4%
7 1
0.4%
8 1
0.4%
9 1
0.4%
10 1
0.4%
ValueCountFrequency (%)
243 1
0.4%
242 1
0.4%
241 1
0.4%
240 1
0.4%
239 1
0.4%
238 1
0.4%
237 1
0.4%
236 1
0.4%
235 1
0.4%
234 1
0.4%

학원명
Text

UNIQUE 

Distinct243
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-13T08:07:56.419827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length19
Mean length9.308642
Min length4

Characters and Unicode

Total characters2262
Distinct characters342
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique243 ?
Unique (%)100.0%

Sample

1st row중앙컴퓨터정보처리학원
2nd row리스트피아노학원
3rd row평화이제마체질독서실
4th row새청학독서실
5th row필탑클래스학원
ValueCountFrequency (%)
더(the)학원 2
 
0.8%
수학교습소 2
 
0.8%
천재해법수학교습소 1
 
0.4%
피아노 1
 
0.4%
음악교습소 1
 
0.4%
차일드유영어교습소 1
 
0.4%
학익진과학교습소 1
 
0.4%
크리씨영어교습소 1
 
0.4%
한우리독서토론논술삼도뷰엔빌교습소 1
 
0.4%
입큰달팽이영어교습소 1
 
0.4%
Other values (244) 244
95.3%
2023-12-13T08:07:57.107809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
192
 
8.5%
136
 
6.0%
103
 
4.6%
101
 
4.5%
99
 
4.4%
60
 
2.7%
59
 
2.6%
57
 
2.5%
47
 
2.1%
46
 
2.0%
Other values (332) 1362
60.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2085
92.2%
Uppercase Letter 73
 
3.2%
Lowercase Letter 35
 
1.5%
Close Punctuation 18
 
0.8%
Open Punctuation 18
 
0.8%
Space Separator 13
 
0.6%
Other Punctuation 13
 
0.6%
Decimal Number 7
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
192
 
9.2%
136
 
6.5%
103
 
4.9%
101
 
4.8%
99
 
4.7%
60
 
2.9%
59
 
2.8%
57
 
2.7%
47
 
2.3%
46
 
2.2%
Other values (288) 1185
56.8%
Uppercase Letter
ValueCountFrequency (%)
E 13
17.8%
K 7
9.6%
S 7
9.6%
M 7
9.6%
C 6
 
8.2%
Y 5
 
6.8%
T 4
 
5.5%
J 3
 
4.1%
D 3
 
4.1%
I 3
 
4.1%
Other values (9) 15
20.5%
Lowercase Letter
ValueCountFrequency (%)
e 5
14.3%
a 4
11.4%
s 4
11.4%
t 3
8.6%
r 3
8.6%
b 3
8.6%
n 3
8.6%
l 2
 
5.7%
g 2
 
5.7%
k 2
 
5.7%
Other values (3) 4
11.4%
Other Punctuation
ValueCountFrequency (%)
. 4
30.8%
& 4
30.8%
, 2
15.4%
· 2
15.4%
' 1
 
7.7%
Decimal Number
ValueCountFrequency (%)
1 2
28.6%
3 2
28.6%
2 2
28.6%
0 1
14.3%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Space Separator
ValueCountFrequency (%)
13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2085
92.2%
Latin 108
 
4.8%
Common 69
 
3.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
192
 
9.2%
136
 
6.5%
103
 
4.9%
101
 
4.8%
99
 
4.7%
60
 
2.9%
59
 
2.8%
57
 
2.7%
47
 
2.3%
46
 
2.2%
Other values (288) 1185
56.8%
Latin
ValueCountFrequency (%)
E 13
 
12.0%
K 7
 
6.5%
S 7
 
6.5%
M 7
 
6.5%
C 6
 
5.6%
e 5
 
4.6%
Y 5
 
4.6%
a 4
 
3.7%
T 4
 
3.7%
s 4
 
3.7%
Other values (22) 46
42.6%
Common
ValueCountFrequency (%)
) 18
26.1%
( 18
26.1%
13
18.8%
. 4
 
5.8%
& 4
 
5.8%
, 2
 
2.9%
1 2
 
2.9%
3 2
 
2.9%
2 2
 
2.9%
· 2
 
2.9%
Other values (2) 2
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2085
92.2%
ASCII 175
 
7.7%
None 2
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
192
 
9.2%
136
 
6.5%
103
 
4.9%
101
 
4.8%
99
 
4.7%
60
 
2.9%
59
 
2.8%
57
 
2.7%
47
 
2.3%
46
 
2.2%
Other values (288) 1185
56.8%
ASCII
ValueCountFrequency (%)
) 18
 
10.3%
( 18
 
10.3%
E 13
 
7.4%
13
 
7.4%
K 7
 
4.0%
S 7
 
4.0%
M 7
 
4.0%
C 6
 
3.4%
e 5
 
2.9%
Y 5
 
2.9%
Other values (33) 76
43.4%
None
ValueCountFrequency (%)
· 2
100.0%

학원종류
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
학교교과교습학원
132 
교습소
98 
평생직업교육학원
 
13

Length

Max length8
Median length8
Mean length5.9835391
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학교교과교습학원
2nd row학교교과교습학원
3rd row학교교과교습학원
4th row학교교과교습학원
5th row학교교과교습학원

Common Values

ValueCountFrequency (%)
학교교과교습학원 132
54.3%
교습소 98
40.3%
평생직업교육학원 13
 
5.3%

Length

2023-12-13T08:07:57.256025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:07:57.352487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학교교과교습학원 132
54.3%
교습소 98
40.3%
평생직업교육학원 13
 
5.3%
Distinct232
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
2023-12-13T08:07:57.620209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length49
Mean length31.329218
Min length21

Characters and Unicode

Total characters7613
Distinct characters171
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique222 ?
Unique (%)91.4%

Sample

1st row경상북도 김천시 김천로 139-1 , 3층 (평화동)
2nd row경상북도 김천시 양금로 194 (황금동)
3rd row경상북도 김천시 구름다리길 30 , 4층 (평화동)
4th row경상북도 김천시 시청6길 46 (신음동)
5th row경상북도 김천시 구읍길 69 (삼락동)
ValueCountFrequency (%)
경상북도 243
 
13.9%
김천시 243
 
13.9%
187
 
10.7%
율곡동 92
 
5.3%
2층 50
 
2.9%
신음동 46
 
2.6%
부곡동 44
 
2.5%
혁신4로 30
 
1.7%
3층 26
 
1.5%
용전1로 25
 
1.4%
Other values (313) 761
43.6%
2023-12-13T08:07:58.066335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1508
19.8%
295
 
3.9%
294
 
3.9%
275
 
3.6%
261
 
3.4%
259
 
3.4%
) 253
 
3.3%
( 253
 
3.3%
, 253
 
3.3%
251
 
3.3%
Other values (161) 3711
48.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4178
54.9%
Space Separator 1508
 
19.8%
Decimal Number 1121
 
14.7%
Other Punctuation 254
 
3.3%
Close Punctuation 253
 
3.3%
Open Punctuation 253
 
3.3%
Dash Punctuation 27
 
0.4%
Uppercase Letter 13
 
0.2%
Letter Number 5
 
0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
295
 
7.1%
294
 
7.0%
275
 
6.6%
261
 
6.2%
259
 
6.2%
251
 
6.0%
245
 
5.9%
245
 
5.9%
179
 
4.3%
144
 
3.4%
Other values (134) 1730
41.4%
Decimal Number
ValueCountFrequency (%)
1 245
21.9%
2 205
18.3%
0 136
12.1%
3 130
11.6%
4 121
10.8%
6 87
 
7.8%
5 74
 
6.6%
8 56
 
5.0%
7 35
 
3.1%
9 32
 
2.9%
Uppercase Letter
ValueCountFrequency (%)
T 2
15.4%
K 2
15.4%
X 2
15.4%
W 2
15.4%
S 1
7.7%
H 1
7.7%
G 1
7.7%
Y 1
7.7%
L 1
7.7%
Other Punctuation
ValueCountFrequency (%)
, 253
99.6%
. 1
 
0.4%
Space Separator
ValueCountFrequency (%)
1508
100.0%
Close Punctuation
ValueCountFrequency (%)
) 253
100.0%
Open Punctuation
ValueCountFrequency (%)
( 253
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Letter Number
ValueCountFrequency (%)
5
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4178
54.9%
Common 3417
44.9%
Latin 18
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
295
 
7.1%
294
 
7.0%
275
 
6.6%
261
 
6.2%
259
 
6.2%
251
 
6.0%
245
 
5.9%
245
 
5.9%
179
 
4.3%
144
 
3.4%
Other values (134) 1730
41.4%
Common
ValueCountFrequency (%)
1508
44.1%
) 253
 
7.4%
( 253
 
7.4%
, 253
 
7.4%
1 245
 
7.2%
2 205
 
6.0%
0 136
 
4.0%
3 130
 
3.8%
4 121
 
3.5%
6 87
 
2.5%
Other values (7) 226
 
6.6%
Latin
ValueCountFrequency (%)
5
27.8%
T 2
 
11.1%
K 2
 
11.1%
X 2
 
11.1%
W 2
 
11.1%
S 1
 
5.6%
H 1
 
5.6%
G 1
 
5.6%
Y 1
 
5.6%
L 1
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4178
54.9%
ASCII 3430
45.1%
Number Forms 5
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1508
44.0%
) 253
 
7.4%
( 253
 
7.4%
, 253
 
7.4%
1 245
 
7.1%
2 205
 
6.0%
0 136
 
4.0%
3 130
 
3.8%
4 121
 
3.5%
6 87
 
2.5%
Other values (16) 239
 
7.0%
Hangul
ValueCountFrequency (%)
295
 
7.1%
294
 
7.0%
275
 
6.6%
261
 
6.2%
259
 
6.2%
251
 
6.0%
245
 
5.9%
245
 
5.9%
179
 
4.3%
144
 
3.4%
Other values (134) 1730
41.4%
Number Forms
ValueCountFrequency (%)
5
100.0%

전화번호
Text

MISSING 

Distinct145
Distinct (%)100.0%
Missing98
Missing (%)40.3%
Memory size2.0 KiB
2023-12-13T08:07:58.354179image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.006897
Min length12

Characters and Unicode

Total characters1741
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique145 ?
Unique (%)100.0%

Sample

1st row054-433-4554
2nd row054-432-2790
3rd row054-434-2664
4th row054-436-3168
5th row054-433-8844
ValueCountFrequency (%)
054-436-2456 1
 
0.7%
054-439-1410 1
 
0.7%
054-4358-284 1
 
0.7%
054-437-4991 1
 
0.7%
054-435-5571 1
 
0.7%
054-437-0579 1
 
0.7%
054-436-0509 1
 
0.7%
054-435-7766 1
 
0.7%
054-431-1132 1
 
0.7%
054-433-4554 1
 
0.7%
Other values (135) 135
93.1%
2023-12-13T08:07:58.804466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 351
20.2%
- 290
16.7%
5 244
14.0%
0 235
13.5%
3 211
12.1%
7 94
 
5.4%
1 81
 
4.7%
6 67
 
3.8%
9 57
 
3.3%
2 56
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1451
83.3%
Dash Punctuation 290
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 351
24.2%
5 244
16.8%
0 235
16.2%
3 211
14.5%
7 94
 
6.5%
1 81
 
5.6%
6 67
 
4.6%
9 57
 
3.9%
2 56
 
3.9%
8 55
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 290
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1741
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 351
20.2%
- 290
16.7%
5 244
14.0%
0 235
13.5%
3 211
12.1%
7 94
 
5.4%
1 81
 
4.7%
6 67
 
3.8%
9 57
 
3.3%
2 56
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1741
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 351
20.2%
- 290
16.7%
5 244
14.0%
0 235
13.5%
3 211
12.1%
7 94
 
5.4%
1 81
 
4.7%
6 67
 
3.8%
9 57
 
3.3%
2 56
 
3.2%

Interactions

2023-12-13T08:07:55.767186image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:07:58.923599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학원종류
1.0000.817
학원종류0.8171.000
2023-12-13T08:07:59.042695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
학원종류
1.0000.706
학원종류0.7061.000

Missing values

2023-12-13T08:07:55.869189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:07:55.950895image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

학원명학원종류학원주소전화번호
01중앙컴퓨터정보처리학원학교교과교습학원경상북도 김천시 김천로 139-1 , 3층 (평화동)054-433-4554
12리스트피아노학원학교교과교습학원경상북도 김천시 양금로 194 (황금동)054-432-2790
23평화이제마체질독서실학교교과교습학원경상북도 김천시 구름다리길 30 , 4층 (평화동)054-434-2664
34새청학독서실학교교과교습학원경상북도 김천시 시청6길 46 (신음동)054-436-3168
45필탑클래스학원학교교과교습학원경상북도 김천시 구읍길 69 (삼락동)054-433-8844
56해피한샘물학원학교교과교습학원경상북도 김천시 신음3길 3 (신음동)054-436-4447
67창조의아침미술학원학교교과교습학원경상북도 김천시 김천로 12-1 (부곡동)054-431-2796
78리나요리학원평생직업교육학원경상북도 김천시 김천로 86 (평화동, 평화프라자)054-435-7751
89예쁜음피아노학원학교교과교습학원경상북도 김천시 시청5길 16-17 , 1층 (신음동)054-433-4638
910이만수바둑학원학교교과교습학원경상북도 김천시 중앙공원1길 3 (남산동)054-434-6550
학원명학원종류학원주소전화번호
233234푸르넷 영어 교습소교습소경상북도 김천시 시청6길 27 , 104호,1층 (신음동, 덕일아파트상가)<NA>
234235해바라기미술교습소교습소경상북도 김천시 시청5길 6 , 2층 (신음동)<NA>
235236하오하오중국어교습소교습소경상북도 김천시 혁신2로 80 , 골드클래스 상가 204호 (율곡동)<NA>
236237미미그리미술교습소교습소경상북도 김천시 혁신6로 63 , 106호 (율곡동, 중흥S-클래스 상가)<NA>
237238한국수학교습소교습소경상북도 김천시 혁신4로 54 , 114동 212호 (율곡동, 한신휴플러스 상가)054-437-2994
238239율곡바이블수학교습소교습소경상북도 김천시 혁신1로 81 , 상가동 308동 (율곡동, 한신휴시티오피스텔)<NA>
239240림스미술교습소교습소경상북도 김천시 용전1로 8 , 107호 (율곡동, 사랑으로부영1단지 상가동)<NA>
240241포인트영어교습소교습소경상북도 김천시 혁신4로 54 , 114동 206호 (율곡동, 한신휴플러스 상가)<NA>
241242메이드바이 미술교습소교습소경상북도 김천시 혁신4로 46 , 114동 213호 (율곡동, 한신휴플러스)<NA>
242243학익진 수학교습소교습소경상북도 김천시 시청로 109 , 2층 (신음동)<NA>