Overview

Dataset statistics

Number of variables6
Number of observations27
Missing cells1
Missing cells (%)0.6%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.4 KiB
Average record size in memory54.9 B

Variable types

Text2
Numeric2
Categorical2

Dataset

Description시군별 청소년 방과 후 아카데미에 대한 현황을 사업장 소재지, 면적, 운영주체, 정원, 확정예산으로 나열하여 개방합니다.
URLhttps://www.data.go.kr/data/15095089/fileData.do

Alerts

확정예산(천원) is highly overall correlated with 정원High correlation
정원 is highly overall correlated with 확정예산(천원)High correlation
면적(제곱미터) has 1 (3.7%) missing valuesMissing

Reproduction

Analysis started2023-12-12 03:07:25.418850
Analysis finished2023-12-12 03:07:26.960698
Duration1.54 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시군
Text

Distinct15
Distinct (%)55.6%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-12T12:07:27.083819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters54
Distinct characters23
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)33.3%

Sample

1st row천안
2nd row천안
3rd row천안
4th row천안
5th row천안
ValueCountFrequency (%)
천안 5
18.5%
공주 3
11.1%
아산 3
11.1%
당진 3
11.1%
서천 2
 
7.4%
홍성 2
 
7.4%
보령 1
 
3.7%
서산 1
 
3.7%
논산 1
 
3.7%
계룡 1
 
3.7%
Other values (5) 5
18.5%
2023-12-12T12:07:27.480399image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
7
13.0%
7
13.0%
6
11.1%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
2
 
3.7%
Other values (13) 14
25.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 54
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
7
13.0%
7
13.0%
6
11.1%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
2
 
3.7%
Other values (13) 14
25.9%

Most occurring scripts

ValueCountFrequency (%)
Hangul 54
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
7
13.0%
7
13.0%
6
11.1%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
2
 
3.7%
Other values (13) 14
25.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 54
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
7
13.0%
7
13.0%
6
11.1%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
3
 
5.6%
2
 
3.7%
Other values (13) 14
25.9%
Distinct25
Distinct (%)92.6%
Missing0
Missing (%)0.0%
Memory size348.0 B
2023-12-12T12:07:27.813787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length27
Mean length22.111111
Min length2

Characters and Unicode

Total characters597
Distinct characters99
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique23 ?
Unique (%)85.2%

Sample

1st row청소년수련관 (동남구 중앙로 111)
2nd row태조산청소년수련관 (동남구 태조산길 261)
3rd row광풍중학교 (동남구 풍세면 광풍로 1021-11)
4th row성정청소년문화의집 (서북구 서부1길 57)
5th row천안 서여자중학교(동남구 천안여상로44)
ValueCountFrequency (%)
청소년수련관 6
 
5.4%
청소년 5
 
4.5%
청소년문화의집 4
 
3.6%
동남구 3
 
2.7%
57 3
 
2.7%
청소년문화센터 2
 
1.8%
500 2
 
1.8%
아산시 2
 
1.8%
홍성읍 2
 
1.8%
교육문화센터 2
 
1.8%
Other values (77) 81
72.3%
2023-12-12T12:07:28.407097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
85
 
14.2%
) 26
 
4.4%
( 26
 
4.4%
25
 
4.2%
22
 
3.7%
21
 
3.5%
20
 
3.4%
18
 
3.0%
18
 
3.0%
1 18
 
3.0%
Other values (89) 318
53.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 369
61.8%
Decimal Number 86
 
14.4%
Space Separator 85
 
14.2%
Close Punctuation 26
 
4.4%
Open Punctuation 26
 
4.4%
Dash Punctuation 5
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
6.8%
22
 
6.0%
21
 
5.7%
20
 
5.4%
18
 
4.9%
18
 
4.9%
13
 
3.5%
12
 
3.3%
10
 
2.7%
10
 
2.7%
Other values (75) 200
54.2%
Decimal Number
ValueCountFrequency (%)
1 18
20.9%
3 14
16.3%
5 10
11.6%
2 10
11.6%
4 10
11.6%
0 7
 
8.1%
7 6
 
7.0%
6 5
 
5.8%
9 5
 
5.8%
8 1
 
1.2%
Space Separator
ValueCountFrequency (%)
85
100.0%
Close Punctuation
ValueCountFrequency (%)
) 26
100.0%
Open Punctuation
ValueCountFrequency (%)
( 26
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 369
61.8%
Common 228
38.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
6.8%
22
 
6.0%
21
 
5.7%
20
 
5.4%
18
 
4.9%
18
 
4.9%
13
 
3.5%
12
 
3.3%
10
 
2.7%
10
 
2.7%
Other values (75) 200
54.2%
Common
ValueCountFrequency (%)
85
37.3%
) 26
 
11.4%
( 26
 
11.4%
1 18
 
7.9%
3 14
 
6.1%
5 10
 
4.4%
2 10
 
4.4%
4 10
 
4.4%
0 7
 
3.1%
7 6
 
2.6%
Other values (4) 16
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 369
61.8%
ASCII 228
38.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
85
37.3%
) 26
 
11.4%
( 26
 
11.4%
1 18
 
7.9%
3 14
 
6.1%
5 10
 
4.4%
2 10
 
4.4%
4 10
 
4.4%
0 7
 
3.1%
7 6
 
2.6%
Other values (4) 16
 
7.0%
Hangul
ValueCountFrequency (%)
25
 
6.8%
22
 
6.0%
21
 
5.7%
20
 
5.4%
18
 
4.9%
18
 
4.9%
13
 
3.5%
12
 
3.3%
10
 
2.7%
10
 
2.7%
Other values (75) 200
54.2%

면적(제곱미터)
Real number (ℝ)

MISSING 

Distinct25
Distinct (%)96.2%
Missing1
Missing (%)3.7%
Infinite0
Infinite (%)0.0%
Mean3806.3462
Minimum267
Maximum23056
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size375.0 B
2023-12-12T12:07:28.613470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum267
5-th percentile372.75
Q11118
median2696
Q33658.75
95-th percentile16622.5
Maximum23056
Range22789
Interquartile range (IQR)2540.75

Descriptive statistics

Standard deviation5440.8121
Coefficient of variation (CV)1.4294055
Kurtosis8.3818633
Mean3806.3462
Median Absolute Deviation (MAD)1389
Skewness2.9396947
Sum98965
Variance29602436
MonotonicityNot monotonic
2023-12-12T12:07:28.783839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
2756 2
 
7.4%
4410 1
 
3.7%
2636 1
 
3.7%
2438 1
 
3.7%
2954 1
 
3.7%
346 1
 
3.7%
2870 1
 
3.7%
1574 1
 
3.7%
658 1
 
3.7%
1148 1
 
3.7%
Other values (15) 15
55.6%
ValueCountFrequency (%)
267 1
3.7%
346 1
3.7%
453 1
3.7%
658 1
3.7%
669 1
3.7%
902 1
3.7%
1108 1
3.7%
1148 1
3.7%
1258 1
3.7%
1447 1
3.7%
ValueCountFrequency (%)
23056 1
3.7%
19746 1
3.7%
7252 1
3.7%
4410 1
3.7%
4036 1
3.7%
4014 1
3.7%
3803 1
3.7%
3226 1
3.7%
3182 1
3.7%
2954 1
3.7%
Distinct3
Distinct (%)11.1%
Missing0
Missing (%)0.0%
Memory size348.0 B
위탁
21 
직영
<NA>
 
1

Length

Max length4
Median length2
Mean length2.0740741
Min length2

Unique

Unique1 ?
Unique (%)3.7%

Sample

1st row위탁
2nd row위탁
3rd row위탁
4th row위탁
5th row위탁

Common Values

ValueCountFrequency (%)
위탁 21
77.8%
직영 5
 
18.5%
<NA> 1
 
3.7%

Length

2023-12-12T12:07:28.976143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:07:29.143950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
위탁 21
77.8%
직영 5
 
18.5%
na 1
 
3.7%

정원
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)29.6%
Missing0
Missing (%)0.0%
Memory size348.0 B
중등 40
14 
초등 40
중등 30
초등 15
중등 15
 
1
Other values (3)

Length

Max length11
Median length5
Mean length5.3703704
Min length5

Unique

Unique4 ?
Unique (%)14.8%

Sample

1st row중등 40
2nd row중등 40
3rd row중등 30
4th row초등 15
5th row중등 30

Common Values

ValueCountFrequency (%)
중등 40 14
51.9%
초등 40 5
 
18.5%
중등 30 2
 
7.4%
초등 15 2
 
7.4%
중등 15 1
 
3.7%
초등 30 1
 
3.7%
초등20,중등20 1
 
3.7%
초등 15,중등 15 1
 
3.7%

Length

2023-12-12T12:07:29.287660image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T12:07:29.451308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
40 19
35.2%
중등 17
31.5%
초등 9
16.7%
15 4
 
7.4%
30 3
 
5.6%
초등20,중등20 1
 
1.9%
15,중등 1
 
1.9%

확정예산(천원)
Real number (ℝ)

HIGH CORRELATION 

Distinct12
Distinct (%)44.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean148838.52
Minimum49502
Maximum178502
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size375.0 B
2023-12-12T12:07:29.616147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum49502
5-th percentile72729.4
Q1140149
median170222
Q3173452
95-th percentile178502
Maximum178502
Range129000
Interquartile range (IQR)33303

Descriptive statistics

Standard deviation40653.136
Coefficient of variation (CV)0.27313586
Kurtosis0.33660944
Mean148838.52
Median Absolute Deviation (MAD)5050
Skewness-1.3763315
Sum4018640
Variance1.6526775 × 109
MonotonicityNot monotonic
2023-12-12T12:07:29.755442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
165172 6
22.2%
170222 5
18.5%
173452 4
14.8%
178502 4
14.8%
111326 1
 
3.7%
70792 1
 
3.7%
178062 1
 
3.7%
77250 1
 
3.7%
77372 1
 
3.7%
115126 1
 
3.7%
Other values (2) 2
 
7.4%
ValueCountFrequency (%)
49502 1
 
3.7%
70792 1
 
3.7%
77250 1
 
3.7%
77372 1
 
3.7%
89252 1
 
3.7%
111326 1
 
3.7%
115126 1
 
3.7%
165172 6
22.2%
170222 5
18.5%
173452 4
14.8%
ValueCountFrequency (%)
178502 4
14.8%
178062 1
 
3.7%
173452 4
14.8%
170222 5
18.5%
165172 6
22.2%
115126 1
 
3.7%
111326 1
 
3.7%
89252 1
 
3.7%
77372 1
 
3.7%
77250 1
 
3.7%

Interactions

2023-12-12T12:07:26.075885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:07:25.823339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:07:26.209085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T12:07:25.948708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T12:07:29.889687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시군사업장 소재지면적(제곱미터)운영주체 (직영_위탁)정원확정예산(천원)
시군1.0001.0000.0001.0000.7010.000
사업장 소재지1.0001.0001.0001.0000.7041.000
면적(제곱미터)0.0001.0001.0000.0000.4100.528
운영주체 (직영_위탁)1.0001.0000.0001.0000.0000.454
정원0.7010.7040.4100.0001.0000.909
확정예산(천원)0.0001.0000.5280.4540.9091.000
2023-12-12T12:07:30.051699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
운영주체 (직영_위탁)정원
운영주체 (직영_위탁)1.0000.000
정원0.0001.000
2023-12-12T12:07:30.189812image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
면적(제곱미터)확정예산(천원)운영주체 (직영_위탁)정원
면적(제곱미터)1.0000.0030.0000.251
확정예산(천원)0.0031.0000.2340.614
운영주체 (직영_위탁)0.0000.2341.0000.000
정원0.2510.6140.0001.000

Missing values

2023-12-12T12:07:26.761393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T12:07:26.895693image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

시군사업장 소재지면적(제곱미터)운영주체 (직영_위탁)정원확정예산(천원)
0천안청소년수련관 (동남구 중앙로 111)4410위탁중등 40165172
1천안태조산청소년수련관 (동남구 태조산길 261)2636위탁중등 40165172
2천안광풍중학교 (동남구 풍세면 광풍로 1021-11)19746위탁중등 30111326
3천안성정청소년문화의집 (서북구 서부1길 57)1108위탁초등 1570792
4천안천안 서여자중학교(동남구 천안여상로44)23056위탁중등 30178062
5공주청소년 문화센터 (대통1길 57)1447위탁초등 40165172
6공주청소년 문화센터 (대통1길 57)267위탁중등 40165172
7공주공주시청소년꿈창작소(무령로 550-53)4036위탁초등 1577250
8보령청소년 수련관 (성주면 성주산로 500)3803직영초등 40173452
9아산청소년 교육문화센터 (아산시 시민로 500)7252위탁중등 40165172
시군사업장 소재지면적(제곱미터)운영주체 (직영_위탁)정원확정예산(천원)
17당진송악 청소년문화의집 (송악읍 기지시리 524)902직영중등 4089252
18금산금산문화의 집 (금산읍 방아동4길 17)1148위탁중등 40170222
19부여청소년수련원 (부여읍 의열로43)658위탁초등 40178502
20서천청소년문화센터 (서천읍 군청로 16-2)1574위탁중등 40170222
21서천청소년수련관 (장항읍 장항산단로 34번길 60-23)2870위탁중등 40170222
22청양청양문화의집 (충남 청양군 청양읍 문화예술로 187)346위탁초등20,중등20170222
23홍성청소년수련관 (홍성읍 문화로33번길 33)2756위탁중등 40170222
24홍성청소년수련관 (홍성읍 문화로33번길 33)2756위탁초등 15,중등 1549502
25예산청소년수련관 (예산읍 벚꽃로 214)2954위탁중등 40178502
26태안청소년수련관 (태안읍 백화로 199)2438직영초등 40178502