Overview

Dataset statistics

Number of variables6
Number of observations34
Missing cells16
Missing cells (%)7.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.8 KiB
Average record size in memory52.9 B

Variable types

Numeric1
Categorical2
Text3

Dataset

Description성주교육지원청 내 학원 교습소 현황
Author경상북도교육청 경상북도성주교육지원청
URLhttps://www.data.go.kr/data/15053329/fileData.do

Alerts

등록상태 has constant value ""Constant
번호 is highly overall correlated with 구분High correlation
구분 is highly overall correlated with 번호High correlation
전화번호 has 16 (47.1%) missing valuesMissing
번호 has unique valuesUnique
학원명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:18:15.054256
Analysis finished2023-12-12 17:18:15.526206
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.5
Minimum1
Maximum34
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size438.0 B
2023-12-13T02:18:15.588617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.65
Q19.25
median17.5
Q325.75
95-th percentile32.35
Maximum34
Range33
Interquartile range (IQR)16.5

Descriptive statistics

Standard deviation9.9582462
Coefficient of variation (CV)0.56904264
Kurtosis-1.2
Mean17.5
Median Absolute Deviation (MAD)8.5
Skewness0
Sum595
Variance99.166667
MonotonicityStrictly increasing
2023-12-13T02:18:15.716383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
1 1
 
2.9%
27 1
 
2.9%
21 1
 
2.9%
22 1
 
2.9%
23 1
 
2.9%
24 1
 
2.9%
25 1
 
2.9%
26 1
 
2.9%
28 1
 
2.9%
19 1
 
2.9%
Other values (24) 24
70.6%
ValueCountFrequency (%)
1 1
2.9%
2 1
2.9%
3 1
2.9%
4 1
2.9%
5 1
2.9%
6 1
2.9%
7 1
2.9%
8 1
2.9%
9 1
2.9%
10 1
2.9%
ValueCountFrequency (%)
34 1
2.9%
33 1
2.9%
32 1
2.9%
31 1
2.9%
30 1
2.9%
29 1
2.9%
28 1
2.9%
27 1
2.9%
26 1
2.9%
25 1
2.9%

구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)5.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
학원
25 
교습소

Length

Max length3
Median length2
Mean length2.2647059
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학원
2nd row학원
3rd row학원
4th row학원
5th row학원

Common Values

ValueCountFrequency (%)
학원 25
73.5%
교습소 9
 
26.5%

Length

2023-12-13T02:18:15.845457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:18:15.944923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학원 25
73.5%
교습소 9
 
26.5%

학원명
Text

UNIQUE 

Distinct34
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size404.0 B
2023-12-13T02:18:16.107956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length10
Mean length8.3823529
Min length6

Characters and Unicode

Total characters285
Distinct characters106
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique34 ?
Unique (%)100.0%

Sample

1st row뉴아이들무용학원
2nd row솔로몬영수학원
3rd row소나타음악전문학원
4th row엠플러스수학전문학원
5th rowBB음악학원
ValueCountFrequency (%)
뉴아이들 3
 
7.5%
영어교습소 2
 
5.0%
뉴아이들무용학원 1
 
2.5%
피카소미술학원 1
 
2.5%
손샘컴퓨터교습소 1
 
2.5%
드림수학교습소 1
 
2.5%
생각나무영어교습소 1
 
2.5%
미술교습소 1
 
2.5%
수학교습소 1
 
2.5%
솔로몬영수학원 1
 
2.5%
Other values (27) 27
67.5%
2023-12-13T02:18:16.398232image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
32
 
11.2%
24
 
8.4%
11
 
3.9%
10
 
3.5%
10
 
3.5%
8
 
2.8%
8
 
2.8%
8
 
2.8%
6
 
2.1%
6
 
2.1%
Other values (96) 162
56.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 273
95.8%
Space Separator 6
 
2.1%
Uppercase Letter 4
 
1.4%
Lowercase Letter 1
 
0.4%
Other Punctuation 1
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
32
 
11.7%
24
 
8.8%
11
 
4.0%
10
 
3.7%
10
 
3.7%
8
 
2.9%
8
 
2.9%
8
 
2.9%
6
 
2.2%
5
 
1.8%
Other values (91) 151
55.3%
Uppercase Letter
ValueCountFrequency (%)
B 3
75.0%
G 1
 
25.0%
Space Separator
ValueCountFrequency (%)
6
100.0%
Lowercase Letter
ValueCountFrequency (%)
n 1
100.0%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 273
95.8%
Common 7
 
2.5%
Latin 5
 
1.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
32
 
11.7%
24
 
8.8%
11
 
4.0%
10
 
3.7%
10
 
3.7%
8
 
2.9%
8
 
2.9%
8
 
2.9%
6
 
2.2%
5
 
1.8%
Other values (91) 151
55.3%
Latin
ValueCountFrequency (%)
B 3
60.0%
G 1
 
20.0%
n 1
 
20.0%
Common
ValueCountFrequency (%)
6
85.7%
& 1
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 273
95.8%
ASCII 12
 
4.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
32
 
11.7%
24
 
8.8%
11
 
4.0%
10
 
3.7%
10
 
3.7%
8
 
2.9%
8
 
2.9%
8
 
2.9%
6
 
2.2%
5
 
1.8%
Other values (91) 151
55.3%
ASCII
ValueCountFrequency (%)
6
50.0%
B 3
25.0%
G 1
 
8.3%
n 1
 
8.3%
& 1
 
8.3%
Distinct31
Distinct (%)91.2%
Missing0
Missing (%)0.0%
Memory size404.0 B
2023-12-13T02:18:16.615660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length50
Median length34
Mean length30.558824
Min length20

Characters and Unicode

Total characters1039
Distinct characters59
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)88.2%

Sample

1st row경상북도 성주군 성주읍 성주읍2길 36-3 , 2층 (성주읍)
2nd row경상북도 성주군 성주읍 성주로 3218 3층
3rd row경상북도 성주군 성주읍 성주읍 3길 15-5 2층
4th row경상북도 성주군 성주읍 경산길 7 2층
5th row경상북도 성주군 성주읍 성주읍4길 4 2층
ValueCountFrequency (%)
성주읍 52
21.3%
경상북도 34
13.9%
성주군 34
13.9%
17
 
7.0%
2층 16
 
6.6%
성주읍2길 7
 
2.9%
성주읍3길 7
 
2.9%
성주읍4길 6
 
2.5%
성주로 5
 
2.0%
1층 4
 
1.6%
Other values (48) 62
25.4%
2023-12-13T02:18:16.981190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
216
20.8%
115
 
11.1%
114
 
11.0%
73
 
7.0%
2 43
 
4.1%
36
 
3.5%
34
 
3.3%
34
 
3.3%
34
 
3.3%
34
 
3.3%
Other values (49) 306
29.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 594
57.2%
Space Separator 216
 
20.8%
Decimal Number 147
 
14.1%
Open Punctuation 22
 
2.1%
Close Punctuation 22
 
2.1%
Other Punctuation 20
 
1.9%
Dash Punctuation 17
 
1.6%
Math Symbol 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
19.4%
114
19.2%
73
12.3%
36
 
6.1%
34
 
5.7%
34
 
5.7%
34
 
5.7%
34
 
5.7%
29
 
4.9%
24
 
4.0%
Other values (33) 67
11.3%
Decimal Number
ValueCountFrequency (%)
2 43
29.3%
3 29
19.7%
1 28
19.0%
4 13
 
8.8%
5 11
 
7.5%
6 10
 
6.8%
0 5
 
3.4%
7 4
 
2.7%
8 3
 
2.0%
9 1
 
0.7%
Space Separator
ValueCountFrequency (%)
216
100.0%
Open Punctuation
ValueCountFrequency (%)
( 22
100.0%
Close Punctuation
ValueCountFrequency (%)
) 22
100.0%
Other Punctuation
ValueCountFrequency (%)
, 20
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 594
57.2%
Common 445
42.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
19.4%
114
19.2%
73
12.3%
36
 
6.1%
34
 
5.7%
34
 
5.7%
34
 
5.7%
34
 
5.7%
29
 
4.9%
24
 
4.0%
Other values (33) 67
11.3%
Common
ValueCountFrequency (%)
216
48.5%
2 43
 
9.7%
3 29
 
6.5%
1 28
 
6.3%
( 22
 
4.9%
) 22
 
4.9%
, 20
 
4.5%
- 17
 
3.8%
4 13
 
2.9%
5 11
 
2.5%
Other values (6) 24
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 594
57.2%
ASCII 445
42.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
216
48.5%
2 43
 
9.7%
3 29
 
6.5%
1 28
 
6.3%
( 22
 
4.9%
) 22
 
4.9%
, 20
 
4.5%
- 17
 
3.8%
4 13
 
2.9%
5 11
 
2.5%
Other values (6) 24
 
5.4%
Hangul
ValueCountFrequency (%)
115
19.4%
114
19.2%
73
12.3%
36
 
6.1%
34
 
5.7%
34
 
5.7%
34
 
5.7%
34
 
5.7%
29
 
4.9%
24
 
4.0%
Other values (33) 67
11.3%

등록상태
Categorical

CONSTANT 

Distinct1
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Memory size404.0 B
개원
34 

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row개원
2nd row개원
3rd row개원
4th row개원
5th row개원

Common Values

ValueCountFrequency (%)
개원 34
100.0%

Length

2023-12-13T02:18:17.115237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:18:17.235882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
개원 34
100.0%

전화번호
Text

MISSING 

Distinct17
Distinct (%)94.4%
Missing16
Missing (%)47.1%
Memory size404.0 B
2023-12-13T02:18:17.394033image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.055556
Min length12

Characters and Unicode

Total characters217
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)88.9%

Sample

1st row054-933-5174
2nd row054-932-8133
3rd row054-932-2126
4th row054-933-3678
5th row054-933-6879
ValueCountFrequency (%)
054-932-2126 2
 
11.1%
054-933-4646 1
 
5.6%
054-933-5174 1
 
5.6%
054-931-1494 1
 
5.6%
054-933-0509 1
 
5.6%
054-931-7813 1
 
5.6%
054-932-0594 1
 
5.6%
054-933-3929 1
 
5.6%
054-933-6900 1
 
5.6%
054-933-3678 1
 
5.6%
Other values (7) 7
38.9%
2023-12-13T02:18:17.679760image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 36
16.6%
3 34
15.7%
0 28
12.9%
9 27
12.4%
4 26
12.0%
5 20
9.2%
2 14
 
6.5%
1 11
 
5.1%
6 9
 
4.1%
7 7
 
3.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 181
83.4%
Dash Punctuation 36
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 34
18.8%
0 28
15.5%
9 27
14.9%
4 26
14.4%
5 20
11.0%
2 14
7.7%
1 11
 
6.1%
6 9
 
5.0%
7 7
 
3.9%
8 5
 
2.8%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 217
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 36
16.6%
3 34
15.7%
0 28
12.9%
9 27
12.4%
4 26
12.0%
5 20
9.2%
2 14
 
6.5%
1 11
 
5.1%
6 9
 
4.1%
7 7
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 217
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 36
16.6%
3 34
15.7%
0 28
12.9%
9 27
12.4%
4 26
12.0%
5 20
9.2%
2 14
 
6.5%
1 11
 
5.1%
6 9
 
4.1%
7 7
 
3.2%

Interactions

2023-12-13T02:18:15.280881image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:18:17.767477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호구분학원명학원주소전화번호
번호1.0000.9941.0000.9590.922
구분0.9941.0001.0000.2251.000
학원명1.0001.0001.0001.0001.000
학원주소0.9590.2251.0001.0001.000
전화번호0.9221.0001.0001.0001.000
2023-12-13T02:18:17.850340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
번호구분
번호1.0000.804
구분0.8041.000

Missing values

2023-12-13T02:18:15.398055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:18:15.488510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

번호구분학원명학원주소등록상태전화번호
01학원뉴아이들무용학원경상북도 성주군 성주읍 성주읍2길 36-3 , 2층 (성주읍)개원054-933-5174
12학원솔로몬영수학원경상북도 성주군 성주읍 성주로 3218 3층개원054-932-8133
23학원소나타음악전문학원경상북도 성주군 성주읍 성주읍 3길 15-5 2층개원054-932-2126
34학원엠플러스수학전문학원경상북도 성주군 성주읍 경산길 7 2층개원054-933-3678
45학원BB음악학원경상북도 성주군 성주읍 성주읍4길 4 2층개원054-933-6879
56학원성주다올학원경상북도 성주군 성주읍 성주읍3길 4-1 , 2층 (성주읍)개원<NA>
67학원수학의달인 성주학원경상북도 성주군 성주읍 성주로 3209 (성주읍)개원054-933-7890
78학원김쌤스터디학원경상북도 성주군 초전면 대장길 115 , 2층 (초전면)개원<NA>
89학원제이제이어학원경상북도 성주군 성주읍 성주읍2길 48 , 1층 일부분 (성주읍)개원<NA>
910학원오른스터디학원경상북도 성주군 성주읍 성주순환로3길 25 , 성주하나로아파트 102동 301호 (성주읍)개원<NA>
번호구분학원명학원주소등록상태전화번호
2425학원경북대스터디클럽학원경상북도 성주군 성주읍 성주로 3265 2층개원<NA>
2526교습소한우리독서토론논술성주교습소경상북도 성주군 성주읍 성주읍3길 25, 2층 2호개원070-4140-4627
2627교습소다이룸 영어교습소경상북도 성주군 성주읍 성주로 3233 (성주읍)개원<NA>
2728교습소뉴아이들 영어교습소경상북도 성주군 성주읍 성주읍2길 36-3 , 2층 (성주읍)개원<NA>
2829교습소뉴아이들 수학교습소경상북도 성주군 성주읍 성주읍2길 36-3 , 2층 (성주읍)개원<NA>
2930교습소뉴아이들 미술교습소경상북도 성주군 성주읍 성주읍2길 36-3 , 2층 (성주읍)개원<NA>
3031교습소생각나무영어교습소경상북도 성주군 성주읍 성주읍2길 33 , 1층 일부분 (성주읍)개원<NA>
3132교습소드림수학교습소경상북도 성주군 성주읍 경산2길 12-5 , 1층 일부분 (성주읍)개원<NA>
3233교습소손샘컴퓨터교습소경상북도 성주군 성주읍 성주읍3길 25 , 2층 2호 일부분 (성주읍, 수정2차아파트)개원<NA>
3334교습소시온수학교습소경상북도 성주군 성주읍 성주읍4길 6-1 , 1층 (성주읍)개원<NA>