Overview

Dataset statistics

Number of variables6
Number of observations43
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.2 KiB
Average record size in memory52.1 B

Variable types

Categorical2
Text3
Numeric1

Dataset

Description경상북도 봉화군 유치원, 초등학교, 중학교, 고등학교 명칭, 소재지, 연락처, 학생수, 직원수
Author경상북도교육청 경상북도봉화교육지원청
URLhttps://www.data.go.kr/data/15006234/fileData.do

Alerts

유치원명 has unique valuesUnique
원아수 has 3 (7.0%) zerosZeros

Reproduction

Analysis started2023-12-12 17:33:02.435882
Analysis finished2023-12-12 17:33:03.443488
Duration1.01 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

Distinct4
Distinct (%)9.3%
Missing0
Missing (%)0.0%
Memory size476.0 B
유치원
17 
초등학교
17 
중학교
고등학교

Length

Max length4
Median length3
Mean length3.4418605
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row유치원
2nd row유치원
3rd row유치원
4th row유치원
5th row유치원

Common Values

ValueCountFrequency (%)
유치원 17
39.5%
초등학교 17
39.5%
중학교 7
16.3%
고등학교 2
 
4.7%

Length

2023-12-13T02:33:03.522584image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:33:03.647780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
유치원 17
39.5%
초등학교 17
39.5%
중학교 7
16.3%
고등학교 2
 
4.7%

유치원명
Text

UNIQUE 

Distinct43
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T02:33:03.908247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length8
Mean length5.2093023
Min length3

Characters and Unicode

Total characters224
Distinct characters52
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique43 ?
Unique (%)100.0%

Sample

1st row봉화초교병설
2nd row도촌초교병설
3rd row내성초교병설
4th row물야초교병설
5th row동양초교병설
ValueCountFrequency (%)
봉화초교병설 1
 
2.3%
동양초 1
 
2.3%
춘양초 1
 
2.3%
서벽초 1
 
2.3%
소천초 1
 
2.3%
소천초분천분교 1
 
2.3%
소천초임기분교 1
 
2.3%
소천초두음분교 1
 
2.3%
석포초 1
 
2.3%
재산초 1
 
2.3%
Other values (33) 33
76.7%
2023-12-13T02:33:04.295630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33
 
14.7%
21
 
9.4%
18
 
8.0%
16
 
7.1%
11
 
4.9%
9
 
4.0%
9
 
4.0%
9
 
4.0%
6
 
2.7%
5
 
2.2%
Other values (42) 87
38.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 218
97.3%
Close Punctuation 3
 
1.3%
Open Punctuation 3
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
15.1%
21
 
9.6%
18
 
8.3%
16
 
7.3%
11
 
5.0%
9
 
4.1%
9
 
4.1%
9
 
4.1%
6
 
2.8%
5
 
2.3%
Other values (40) 81
37.2%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 218
97.3%
Common 6
 
2.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
15.1%
21
 
9.6%
18
 
8.3%
16
 
7.3%
11
 
5.0%
9
 
4.1%
9
 
4.1%
9
 
4.1%
6
 
2.8%
5
 
2.3%
Other values (40) 81
37.2%
Common
ValueCountFrequency (%)
) 3
50.0%
( 3
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 218
97.3%
ASCII 6
 
2.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
33
15.1%
21
 
9.6%
18
 
8.3%
16
 
7.3%
11
 
5.0%
9
 
4.1%
9
 
4.1%
9
 
4.1%
6
 
2.8%
5
 
2.3%
Other values (40) 81
37.2%
ASCII
ValueCountFrequency (%)
) 3
50.0%
( 3
50.0%

원아수
Real number (ℝ)

ZEROS 

Distinct32
Distinct (%)74.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean45.674419
Minimum0
Maximum310
Zeros3
Zeros (%)7.0%
Negative0
Negative (%)0.0%
Memory size519.0 B
2023-12-13T02:33:04.442559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.2
Q17
median15
Q344
95-th percentile256.8
Maximum310
Range310
Interquartile range (IQR)37

Descriptive statistics

Standard deviation73.629291
Coefficient of variation (CV)1.6120466
Kurtosis6.316683
Mean45.674419
Median Absolute Deviation (MAD)11
Skewness2.593991
Sum1964
Variance5421.2724
MonotonicityNot monotonic
2023-12-13T02:33:04.584699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=32)
ValueCountFrequency (%)
11 3
 
7.0%
4 3
 
7.0%
0 3
 
7.0%
10 2
 
4.7%
15 2
 
4.7%
44 2
 
4.7%
12 2
 
4.7%
5 2
 
4.7%
47 1
 
2.3%
51 1
 
2.3%
Other values (22) 22
51.2%
ValueCountFrequency (%)
0 3
7.0%
2 1
 
2.3%
3 1
 
2.3%
4 3
7.0%
5 2
4.7%
6 1
 
2.3%
8 1
 
2.3%
10 2
4.7%
11 3
7.0%
12 2
4.7%
ValueCountFrequency (%)
310 1
2.3%
272 1
2.3%
270 1
2.3%
138 1
2.3%
121 1
2.3%
99 1
2.3%
96 1
2.3%
63 1
2.3%
51 1
2.3%
47 1
2.3%

주소
Text

Distinct26
Distinct (%)60.5%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T02:33:04.866037image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length31
Mean length21.325581
Min length8

Characters and Unicode

Total characters917
Distinct characters70
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)53.5%

Sample

1st row초등학교와 통합
2nd row초등학교와 통합
3rd row초등학교와 통합
4th row초등학교와 통합
5th row초등학교와 통합
ValueCountFrequency (%)
봉화군 27
 
13.9%
경상북도 26
 
13.4%
초등학교와 16
 
8.2%
통합 16
 
8.2%
봉화읍 6
 
3.1%
소천면 5
 
2.6%
춘양면 5
 
2.6%
36238 3
 
1.5%
36266 2
 
1.0%
석포면 2
 
1.0%
Other values (71) 86
44.3%
2023-12-13T02:33:05.276913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
170
 
18.5%
3 47
 
5.1%
2 46
 
5.0%
6 40
 
4.4%
38
 
4.1%
34
 
3.7%
( 27
 
2.9%
27
 
2.9%
) 27
 
2.9%
27
 
2.9%
Other values (60) 434
47.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 470
51.3%
Decimal Number 218
23.8%
Space Separator 170
 
18.5%
Open Punctuation 27
 
2.9%
Close Punctuation 27
 
2.9%
Dash Punctuation 5
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
8.1%
34
 
7.2%
27
 
5.7%
27
 
5.7%
27
 
5.7%
27
 
5.7%
26
 
5.5%
21
 
4.5%
18
 
3.8%
17
 
3.6%
Other values (46) 208
44.3%
Decimal Number
ValueCountFrequency (%)
3 47
21.6%
2 46
21.1%
6 40
18.3%
1 25
11.5%
4 15
 
6.9%
5 12
 
5.5%
0 9
 
4.1%
8 9
 
4.1%
7 8
 
3.7%
9 7
 
3.2%
Space Separator
ValueCountFrequency (%)
170
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 470
51.3%
Common 447
48.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
8.1%
34
 
7.2%
27
 
5.7%
27
 
5.7%
27
 
5.7%
27
 
5.7%
26
 
5.5%
21
 
4.5%
18
 
3.8%
17
 
3.6%
Other values (46) 208
44.3%
Common
ValueCountFrequency (%)
170
38.0%
3 47
 
10.5%
2 46
 
10.3%
6 40
 
8.9%
( 27
 
6.0%
) 27
 
6.0%
1 25
 
5.6%
4 15
 
3.4%
5 12
 
2.7%
0 9
 
2.0%
Other values (4) 29
 
6.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 470
51.3%
ASCII 447
48.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
170
38.0%
3 47
 
10.5%
2 46
 
10.3%
6 40
 
8.9%
( 27
 
6.0%
) 27
 
6.0%
1 25
 
5.6%
4 15
 
3.4%
5 12
 
2.7%
0 9
 
2.0%
Other values (4) 29
 
6.5%
Hangul
ValueCountFrequency (%)
38
 
8.1%
34
 
7.2%
27
 
5.7%
27
 
5.7%
27
 
5.7%
27
 
5.7%
26
 
5.5%
21
 
4.5%
18
 
3.8%
17
 
3.6%
Other values (46) 208
44.3%

직원수
Categorical

Distinct8
Distinct (%)18.6%
Missing0
Missing (%)0.0%
Memory size476.0 B
초등학교와 통합
16 
1
3
2
4
Other values (3)

Length

Max length8
Median length1
Mean length3.6046512
Min length1

Unique

Unique1 ?
Unique (%)2.3%

Sample

1st row초등학교와 통합
2nd row초등학교와 통합
3rd row초등학교와 통합
4th row초등학교와 통합
5th row초등학교와 통합

Common Values

ValueCountFrequency (%)
초등학교와 통합 16
37.2%
1 6
 
14.0%
3 5
 
11.6%
2 5
 
11.6%
4 4
 
9.3%
5 4
 
9.3%
0 2
 
4.7%
7 1
 
2.3%

Length

2023-12-13T02:33:05.463552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T02:33:05.600924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
초등학교와 16
27.1%
통합 16
27.1%
1 6
 
10.2%
3 5
 
8.5%
2 5
 
8.5%
4 4
 
6.8%
5 4
 
6.8%
0 2
 
3.4%
7 1
 
1.7%

전화
Text

Distinct40
Distinct (%)93.0%
Missing0
Missing (%)0.0%
Memory size476.0 B
2023-12-13T02:33:05.820365image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters516
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)86.0%

Sample

1st row054-673-0566
2nd row054-672-8392
3rd row054-673-7190
4th row054-673-7044
5th row054-672-8850
ValueCountFrequency (%)
054-672-3011 2
 
4.7%
054-679-3305 2
 
4.7%
054-672-7413 2
 
4.7%
054-672-7438 1
 
2.3%
054-672-4068 1
 
2.3%
054-672-9054 1
 
2.3%
054-672-2008 1
 
2.3%
054-673-0566 1
 
2.3%
054-672-6365 1
 
2.3%
054-672-8858 1
 
2.3%
Other values (30) 30
69.8%
2023-12-13T02:33:06.158110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 86
16.7%
0 75
14.5%
4 61
11.8%
5 57
11.0%
7 57
11.0%
6 53
10.3%
2 46
8.9%
3 33
 
6.4%
1 23
 
4.5%
8 14
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 430
83.3%
Dash Punctuation 86
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 75
17.4%
4 61
14.2%
5 57
13.3%
7 57
13.3%
6 53
12.3%
2 46
10.7%
3 33
7.7%
1 23
 
5.3%
8 14
 
3.3%
9 11
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 86
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 516
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 86
16.7%
0 75
14.5%
4 61
11.8%
5 57
11.0%
7 57
11.0%
6 53
10.3%
2 46
8.9%
3 33
 
6.4%
1 23
 
4.5%
8 14
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 516
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 86
16.7%
0 75
14.5%
4 61
11.8%
5 57
11.0%
7 57
11.0%
6 53
10.3%
2 46
8.9%
3 33
 
6.4%
1 23
 
4.5%
8 14
 
2.7%

Interactions

2023-12-13T02:33:02.792224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T02:33:06.296185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분유치원명원아수주소직원수전화
구분1.0001.0000.6640.8900.8540.000
유치원명1.0001.0001.0001.0001.0001.000
원아수0.6641.0001.0000.9650.6190.000
주소0.8901.0000.9651.0001.0000.984
직원수0.8541.0000.6191.0001.0000.949
전화0.0001.0000.0000.9840.9491.000
2023-12-13T02:33:06.433697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
직원수구분
직원수1.0000.499
구분0.4991.000
2023-12-13T02:33:06.538228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
원아수구분직원수
원아수1.0000.4980.367
구분0.4981.0000.499
직원수0.3670.4991.000

Missing values

2023-12-13T02:33:03.248649image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T02:33:03.390784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

구분유치원명원아수주소직원수전화
0유치원봉화초교병설11초등학교와 통합초등학교와 통합054-673-0566
1유치원도촌초교병설5초등학교와 통합초등학교와 통합054-672-8392
2유치원내성초교병설11초등학교와 통합초등학교와 통합054-673-7190
3유치원물야초교병설4초등학교와 통합초등학교와 통합054-673-7044
4유치원동양초교병설10초등학교와 통합초등학교와 통합054-672-8850
5유치원법전중앙초교병설6초등학교와 통합초등학교와 통합054-673-6981
6유치원춘양초교병설11초등학교와 통합초등학교와 통합054-673-3042
7유치원서벽초교병설8초등학교와 통합초등학교와 통합054-673-4172
8유치원소천초교병설0초등학교와 통합초등학교와 통합054-673-1761
9유치원소천초분천분교병설5초등학교와 통합초등학교와 통합054-672-7921
구분유치원명원아수주소직원수전화
33초등학교상운초15(36244) 경상북도 봉화군 상운면 예봉로 12312054-672-5011
34중학교봉화중(병)272(36240) 경상북도 봉화군 봉화읍 내성로 1383054-679-3305
35중학교청량중63(36252) 경상북도 봉화군 명호면 양지마을길 145054-672-1043
36중학교춘양중(병)47(36213) 경상북도 봉화군 춘양면 서원촌길 8-142054-672-3011
37중학교춘양중학교서벽분교13(36209) 경상북도 봉화군 춘양면 춘양로 14900054-672-4068
38중학교소천중19(36266) 경상북도 봉화군 소천면 소천로 12372054-672-7438
39중학교물야중26(36202) 경상북도 봉화군 물야면 오록4길 111054-672-2008
40중학교석포중33(36272) 경상북도 봉화군 석포면 석포로 226-11054-672-6105
41고등학교봉화고270(36240) 경상북도 봉화군 봉화읍 내성로 1383054-679-3305
42고등학교한국산림과학고138(36213) 경상북도 봉화군 춘양면 서원촌길 8-142054-672-3011