Overview

Dataset statistics

Number of variables5
Number of observations106
Missing cells55
Missing cells (%)10.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.4 KiB
Average record size in memory42.2 B

Variable types

Categorical1
Numeric1
Text3

Dataset

Description경상북도 예천군 관내 학원 및 교습소 현황, 인가 정보로 구분, 순번, 학원명, 주소, 전화번호 구성됨(경상북도예천교육지원청 관리)
Author경상북도교육청 경상북도예천교육지원청
URLhttps://www.data.go.kr/data/3069334/fileData.do

Alerts

전화번호 has 55 (51.9%) missing valuesMissing
학 원 명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 17:02:32.004072
Analysis finished2023-12-12 17:02:32.594489
Duration0.59 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct2
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size980.0 B
학원
85 
교습소
21 

Length

Max length3
Median length2
Mean length2.1981132
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row학원
2nd row학원
3rd row학원
4th row학원
5th row학원

Common Values

ValueCountFrequency (%)
학원 85
80.2%
교습소 21
 
19.8%

Length

2023-12-13T02:02:32.659045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:02:32.775015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
학원 85
80.2%
교습소 21
 
19.8%

순번
Real number (ℝ)

Distinct85
Distinct (%)80.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.660377
Minimum1
Maximum85
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.1 KiB
2023-12-13T02:02:32.895737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.25
Q114
median32.5
Q358.75
95-th percentile79.75
Maximum85
Range84
Interquartile range (IQR)44.75

Descriptive statistics

Standard deviation25.668913
Coefficient of variation (CV)0.70018136
Kurtosis-1.2268726
Mean36.660377
Median Absolute Deviation (MAD)21.5
Skewness0.33331547
Sum3886
Variance658.89308
MonotonicityNot monotonic
2023-12-13T02:02:33.060248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 2
 
1.9%
12 2
 
1.9%
2 2
 
1.9%
21 2
 
1.9%
20 2
 
1.9%
18 2
 
1.9%
17 2
 
1.9%
16 2
 
1.9%
15 2
 
1.9%
14 2
 
1.9%
Other values (75) 86
81.1%
ValueCountFrequency (%)
1 2
1.9%
2 2
1.9%
3 2
1.9%
4 2
1.9%
5 2
1.9%
6 2
1.9%
7 2
1.9%
8 2
1.9%
9 2
1.9%
10 2
1.9%
ValueCountFrequency (%)
85 1
0.9%
84 1
0.9%
83 1
0.9%
82 1
0.9%
81 1
0.9%
80 1
0.9%
79 1
0.9%
78 1
0.9%
77 1
0.9%
76 1
0.9%

학 원 명
Text

UNIQUE 

Distinct106
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size980.0 B
2023-12-13T02:02:33.330230image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length14
Mean length7.8962264
Min length4

Characters and Unicode

Total characters837
Distinct characters211
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique106 ?
Unique (%)100.0%

Sample

1st rowSKY영수학원
2nd rowTOP입시학원
3rd row감성탄탄미술학원
4th row경도간호학원
5th row경북이미용직업전문학원
ValueCountFrequency (%)
sky영수학원 1
 
0.9%
포카수학학원 1
 
0.9%
클랑음악학원 1
 
0.9%
콩나물음악학원 1
 
0.9%
캠브리지학원 1
 
0.9%
청어람스터디학원 1
 
0.9%
청담학원 1
 
0.9%
차쌤회계컴퓨터학원 1
 
0.9%
진솔oneclass학원 1
 
0.9%
지앤비영어전문학원 1
 
0.9%
Other values (96) 96
90.6%
2023-12-13T02:02:33.817184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
96
 
11.5%
86
 
10.3%
24
 
2.9%
24
 
2.9%
22
 
2.6%
22
 
2.6%
21
 
2.5%
20
 
2.4%
20
 
2.4%
16
 
1.9%
Other values (201) 486
58.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 821
98.1%
Uppercase Letter 10
 
1.2%
Lowercase Letter 6
 
0.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
96
 
11.7%
86
 
10.5%
24
 
2.9%
24
 
2.9%
22
 
2.7%
22
 
2.7%
21
 
2.6%
20
 
2.4%
20
 
2.4%
16
 
1.9%
Other values (187) 470
57.2%
Uppercase Letter
ValueCountFrequency (%)
O 2
20.0%
C 1
10.0%
E 1
10.0%
M 1
10.0%
P 1
10.0%
T 1
10.0%
S 1
10.0%
Y 1
10.0%
K 1
10.0%
Lowercase Letter
ValueCountFrequency (%)
s 2
33.3%
a 1
16.7%
n 1
16.7%
e 1
16.7%
l 1
16.7%

Most occurring scripts

ValueCountFrequency (%)
Hangul 821
98.1%
Latin 16
 
1.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
96
 
11.7%
86
 
10.5%
24
 
2.9%
24
 
2.9%
22
 
2.7%
22
 
2.7%
21
 
2.6%
20
 
2.4%
20
 
2.4%
16
 
1.9%
Other values (187) 470
57.2%
Latin
ValueCountFrequency (%)
s 2
12.5%
O 2
12.5%
a 1
 
6.2%
n 1
 
6.2%
e 1
 
6.2%
C 1
 
6.2%
l 1
 
6.2%
E 1
 
6.2%
M 1
 
6.2%
P 1
 
6.2%
Other values (4) 4
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 821
98.1%
ASCII 16
 
1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
96
 
11.7%
86
 
10.5%
24
 
2.9%
24
 
2.9%
22
 
2.7%
22
 
2.7%
21
 
2.6%
20
 
2.4%
20
 
2.4%
16
 
1.9%
Other values (187) 470
57.2%
ASCII
ValueCountFrequency (%)
s 2
12.5%
O 2
12.5%
a 1
 
6.2%
n 1
 
6.2%
e 1
 
6.2%
C 1
 
6.2%
l 1
 
6.2%
E 1
 
6.2%
M 1
 
6.2%
P 1
 
6.2%
Other values (4) 4
25.0%

주소
Text

Distinct104
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size980.0 B
2023-12-13T02:02:34.091322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length28
Mean length21.377358
Min length15

Characters and Unicode

Total characters2266
Distinct characters108
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique102 ?
Unique (%)96.2%

Sample

1st row경상북도예천군예천읍시장로145
2nd row경상북도예천군예천읍효자로52
3rd row경상북도예천군호명면새움3로60-4,메디컬센터702호
4th row경상북도예천군예천읍충효로241,4층
5th row경상북도예천군호명면새움3로52,JM골드스퀘어빌딩6층
ValueCountFrequency (%)
경상북도예천군예천읍상설시장1길26 2
 
1.9%
경상북도예천군예천읍군청앞길7-1 2
 
1.9%
경상북도예천군예천읍맛고을길26-1,2층 2
 
1.9%
경상북도예천군예천읍효자로65,1층 1
 
0.9%
경상북도예천군예천읍시장로145 1
 
0.9%
경상북도예천군호명면새움3로30,402호 1
 
0.9%
경상북도예천군호명면양지3길16,401호 1
 
0.9%
경상북도예천군호명면새움3로20,신성프라자501호 1
 
0.9%
경상북도예천군호명면새움3로52,504호 1
 
0.9%
경상북도예천군호명면새움3로20,703호 1
 
0.9%
Other values (93) 93
87.7%
2023-12-13T02:02:34.609617image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
142
 
6.3%
142
 
6.3%
137
 
6.0%
0 115
 
5.1%
112
 
4.9%
111
 
4.9%
106
 
4.7%
106
 
4.7%
106
 
4.7%
3 95
 
4.2%
Other values (98) 1094
48.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1591
70.2%
Decimal Number 562
 
24.8%
Other Punctuation 92
 
4.1%
Dash Punctuation 18
 
0.8%
Uppercase Letter 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
142
 
8.9%
142
 
8.9%
137
 
8.6%
112
 
7.0%
111
 
7.0%
106
 
6.7%
106
 
6.7%
106
 
6.7%
91
 
5.7%
71
 
4.5%
Other values (83) 467
29.4%
Decimal Number
ValueCountFrequency (%)
0 115
20.5%
3 95
16.9%
1 88
15.7%
2 72
12.8%
4 59
10.5%
6 46
 
8.2%
5 41
 
7.3%
7 23
 
4.1%
8 13
 
2.3%
9 10
 
1.8%
Uppercase Letter
ValueCountFrequency (%)
H 1
33.3%
M 1
33.3%
J 1
33.3%
Other Punctuation
ValueCountFrequency (%)
, 92
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1591
70.2%
Common 672
29.7%
Latin 3
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
142
 
8.9%
142
 
8.9%
137
 
8.6%
112
 
7.0%
111
 
7.0%
106
 
6.7%
106
 
6.7%
106
 
6.7%
91
 
5.7%
71
 
4.5%
Other values (83) 467
29.4%
Common
ValueCountFrequency (%)
0 115
17.1%
3 95
14.1%
, 92
13.7%
1 88
13.1%
2 72
10.7%
4 59
8.8%
6 46
 
6.8%
5 41
 
6.1%
7 23
 
3.4%
- 18
 
2.7%
Other values (2) 23
 
3.4%
Latin
ValueCountFrequency (%)
H 1
33.3%
M 1
33.3%
J 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1591
70.2%
ASCII 675
29.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
142
 
8.9%
142
 
8.9%
137
 
8.6%
112
 
7.0%
111
 
7.0%
106
 
6.7%
106
 
6.7%
106
 
6.7%
91
 
5.7%
71
 
4.5%
Other values (83) 467
29.4%
ASCII
ValueCountFrequency (%)
0 115
17.0%
3 95
14.1%
, 92
13.6%
1 88
13.0%
2 72
10.7%
4 59
8.7%
6 46
 
6.8%
5 41
 
6.1%
7 23
 
3.4%
- 18
 
2.7%
Other values (5) 26
 
3.9%

전화번호
Text

MISSING 

Distinct49
Distinct (%)96.1%
Missing55
Missing (%)51.9%
Memory size980.0 B
2023-12-13T02:02:34.867435image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.019608
Min length12

Characters and Unicode

Total characters613
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)92.2%

Sample

1st row054-655-0158
2nd row054-654-5425
3rd row054-655-9871
4th row054-655-5778
5th row054-655-0204
ValueCountFrequency (%)
054-655-1191 2
 
3.9%
054-654-8882 2
 
3.9%
054-653-8500 1
 
2.0%
054-654-5700 1
 
2.0%
054-654-6070 1
 
2.0%
054-652-6003 1
 
2.0%
054-655-0158 1
 
2.0%
054-655-1046 1
 
2.0%
054-653-1473 1
 
2.0%
054-652-0579 1
 
2.0%
Other values (39) 39
76.5%
2023-12-13T02:02:35.216142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5 142
23.2%
- 102
16.6%
4 86
14.0%
0 83
13.5%
6 61
10.0%
2 28
 
4.6%
7 28
 
4.6%
1 25
 
4.1%
8 24
 
3.9%
3 19
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 511
83.4%
Dash Punctuation 102
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 142
27.8%
4 86
16.8%
0 83
16.2%
6 61
11.9%
2 28
 
5.5%
7 28
 
5.5%
1 25
 
4.9%
8 24
 
4.7%
3 19
 
3.7%
9 15
 
2.9%
Dash Punctuation
ValueCountFrequency (%)
- 102
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 613
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 142
23.2%
- 102
16.6%
4 86
14.0%
0 83
13.5%
6 61
10.0%
2 28
 
4.6%
7 28
 
4.6%
1 25
 
4.1%
8 24
 
3.9%
3 19
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 613
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 142
23.2%
- 102
16.6%
4 86
14.0%
0 83
13.5%
6 61
10.0%
2 28
 
4.6%
7 28
 
4.6%
1 25
 
4.1%
8 24
 
3.9%
3 19
 
3.1%

Interactions

2023-12-13T02:02:32.332327image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:02:35.306639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분순번전화번호
구분1.0000.6661.000
순번0.6661.0001.000
전화번호1.0001.0001.000
2023-12-13T02:02:35.380096image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번구분
순번1.0000.497
구분0.4971.000

Missing values

2023-12-13T02:02:32.452450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:02:32.550575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분순번학 원 명주소전화번호
0학원1SKY영수학원경상북도예천군예천읍시장로145054-655-0158
1학원2TOP입시학원경상북도예천군예천읍효자로52054-654-5425
2학원3감성탄탄미술학원경상북도예천군호명면새움3로60-4,메디컬센터702호<NA>
3학원4경도간호학원경상북도예천군예천읍충효로241,4층054-655-9871
4학원5경북이미용직업전문학원경상북도예천군호명면새움3로52,JM골드스퀘어빌딩6층<NA>
5학원6고감도미술학원경상북도예천군예천읍효자로117,2층054-655-5778
6학원7권지연피아노학원경상북도예천군호명면행복로146,401호<NA>
7학원8글로리아음악학원경상북도예천군호명면행복로177,801동104호054-655-0204
8학원9금터학원경상북도예천군호명면행복로177,801동101호<NA>
9학원10김지은음악학원경상북도예천군호명면새움3로60-4,메디컬빌딩704호054-652-0411
구분순번학 원 명주소전화번호
96교습소12윤선생우리집앞영어교실예천호명영어교습소경상북도예천군호명면양지로55,101호<NA>
97교습소13임마누엘피아노교습소경상북도예천군풍양면낙상1길83-22054-653-8500
98교습소14임선생독서논술교습소경상북도예천군예천읍상설시장1길26054-654-1119
99교습소15임수진피아노음악교습소경상북도예천군호명면새움3로52,제이엠골드스퀘어4층404호<NA>
100교습소16장원수학교습소경상북도예천군예천읍군청길13-3054-654-6070
101교습소17존슨영어스튜디오교습소경상북도예천군호명면행복로177,803동103호<NA>
102교습소18창의력발전소로봇코딩교습소경상북도예천군호명면새움3로60-4,5층502호<NA>
103교습소19콩처럼쑥쑥크는수학교습소경상북도예천군예천읍효자로65,1층<NA>
104교습소20파스칼수학교습소경상북도예천군예천읍상설시장1길26<NA>
105교습소21피카소미술교습소경상북도예천군예천읍효자로40세종타운상가109호054-654-1092