Overview

Dataset statistics

Number of variables9
Number of observations1244
Missing cells4
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory93.7 KiB
Average record size in memory77.1 B

Variable types

Categorical3
Text1
Numeric5

Dataset

Description전국 초/중/고등학교 대상의 학교흡연예방사업 계획 현황 데이터로 예산, 프로그램 운영계획(SENSE, Jr-END, END, CROWN) 항목을 제공합니다.
Author한국건강증진개발원
URLhttps://www.data.go.kr/data/15092426/fileData.do

Alerts

예산(원) is highly overall correlated with 프로그램 운영계획(SENSE)High correlation
프로그램 운영계획(SENSE) is highly overall correlated with 예산(원) and 1 other fieldsHigh correlation
프로그램 운영계획(Jr.END) is highly overall correlated with 프로그램 운영계획(SENSE)High correlation
프로그램 운영계획(SENSE) has 235 (18.9%) zerosZeros
프로그램 운영계획(Jr.END) has 807 (64.9%) zerosZeros
프로그램 운영계획(END) has 425 (34.2%) zerosZeros
프로그램 운영계획(CROWN) has 1117 (89.8%) zerosZeros

Reproduction

Analysis started2023-12-12 10:47:05.699796
Analysis finished2023-12-12 10:47:11.314136
Duration5.61 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

시도명
Categorical

Distinct18
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
경기도
219 
서울특별시
117 
경상남도
114 
경상북도
111 
전라남도
99 
Other values (13)
584 

Length

Max length7
Median length5
Mean length4.0956592
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row강원도
2nd row강원도
3rd row강원도
4th row강원도
5th row강원도

Common Values

ValueCountFrequency (%)
경기도 219
17.6%
서울특별시 117
9.4%
경상남도 114
9.2%
경상북도 111
8.9%
전라남도 99
8.0%
부산광역시 84
 
6.8%
충청남도 84
 
6.8%
강원도 83
 
6.7%
전라북도 78
 
6.3%
충청북도 63
 
5.1%
Other values (8) 192
15.4%

Length

2023-12-12T19:47:11.415571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
경기도 219
17.6%
서울특별시 117
9.4%
경상남도 114
9.2%
경상북도 111
8.9%
전라남도 99
8.0%
부산광역시 84
 
6.8%
충청남도 84
 
6.8%
강원도 83
 
6.7%
전라북도 78
 
6.3%
충청북도 63
 
5.1%
Other values (8) 192
15.4%
Distinct229
Distinct (%)18.4%
Missing2
Missing (%)0.2%
Memory size9.8 KiB
2023-12-12T19:47:11.884827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length3
Mean length3.5297907
Min length2

Characters and Unicode

Total characters4384
Distinct characters145
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row강릉시
2nd row강릉시
3rd row강릉시
4th row강릉시
5th row강릉시
ValueCountFrequency (%)
남구 32
 
2.3%
동구 30
 
2.1%
중구 28
 
2.0%
서구 25
 
1.8%
북구 25
 
1.8%
창원시 25
 
1.8%
청주시 22
 
1.6%
수원시 19
 
1.3%
고양시 16
 
1.1%
성남시 16
 
1.1%
Other values (228) 1179
83.2%
2023-12-12T19:47:12.557243image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
554
 
12.6%
550
 
12.5%
354
 
8.1%
175
 
4.0%
134
 
3.1%
128
 
2.9%
118
 
2.7%
104
 
2.4%
101
 
2.3%
101
 
2.3%
Other values (135) 2065
47.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4209
96.0%
Space Separator 175
 
4.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
554
 
13.2%
550
 
13.1%
354
 
8.4%
134
 
3.2%
128
 
3.0%
118
 
2.8%
104
 
2.5%
101
 
2.4%
101
 
2.4%
98
 
2.3%
Other values (134) 1967
46.7%
Space Separator
ValueCountFrequency (%)
175
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4209
96.0%
Common 175
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
554
 
13.2%
550
 
13.1%
354
 
8.4%
134
 
3.2%
128
 
3.0%
118
 
2.8%
104
 
2.5%
101
 
2.4%
101
 
2.4%
98
 
2.3%
Other values (134) 1967
46.7%
Common
ValueCountFrequency (%)
175
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4209
96.0%
ASCII 175
 
4.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
554
 
13.2%
550
 
13.1%
354
 
8.4%
134
 
3.2%
128
 
3.0%
118
 
2.8%
104
 
2.5%
101
 
2.4%
101
 
2.4%
98
 
2.3%
Other values (134) 1967
46.7%
ASCII
ValueCountFrequency (%)
175
100.0%

학교급
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
고등학교
393 
초등학교
379 
중학교
353 
특수/기타학교
119 

Length

Max length7
Median length4
Mean length4.0032154
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row고등학교
2nd row고등학교
3rd row중학교
4th row중학교
5th row초등학교

Common Values

ValueCountFrequency (%)
고등학교 393
31.6%
초등학교 379
30.5%
중학교 353
28.4%
특수/기타학교 119
 
9.6%

Length

2023-12-12T19:47:12.749861image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:47:12.901677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
고등학교 393
31.6%
초등학교 379
30.5%
중학교 353
28.4%
특수/기타학교 119
 
9.6%

구분
Categorical

Distinct2
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size9.8 KiB
기본형
868 
심화형
376 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row기본형
2nd row심화형
3rd row기본형
4th row심화형
5th row기본형

Common Values

ValueCountFrequency (%)
기본형 868
69.8%
심화형 376
30.2%

Length

2023-12-12T19:47:13.046902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T19:47:13.198231image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
기본형 868
69.8%
심화형 376
30.2%

예산(원)
Real number (ℝ)

HIGH CORRELATION 

Distinct460
Distinct (%)37.0%
Missing2
Missing (%)0.2%
Infinite0
Infinite (%)0.0%
Mean13908797
Minimum0
Maximum1.364 × 108
Zeros4
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size11.1 KiB
2023-12-12T19:47:13.329817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1000000
Q14000000
median8500000
Q318975000
95-th percentile42995000
Maximum1.364 × 108
Range1.364 × 108
Interquartile range (IQR)14975000

Descriptive statistics

Standard deviation14937357
Coefficient of variation (CV)1.0739504
Kurtosis9.7958943
Mean13908797
Median Absolute Deviation (MAD)5500000
Skewness2.5193722
Sum1.7274725 × 1010
Variance2.2312464 × 1014
MonotonicityNot monotonic
2023-12-12T19:47:13.490142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
3000000 58
 
4.7%
5000000 44
 
3.5%
4000000 39
 
3.1%
1000000 36
 
2.9%
6000000 31
 
2.5%
2000000 24
 
1.9%
8000000 23
 
1.8%
7000000 23
 
1.8%
3300000 19
 
1.5%
3500000 18
 
1.4%
Other values (450) 927
74.5%
ValueCountFrequency (%)
0 4
 
0.3%
450 1
 
0.1%
3000 1
 
0.1%
200000 1
 
0.1%
300000 2
 
0.2%
468000 1
 
0.1%
500000 10
0.8%
600000 1
 
0.1%
700000 2
 
0.2%
800000 6
0.5%
ValueCountFrequency (%)
136400000 1
0.1%
111600000 1
0.1%
103400000 1
0.1%
99723000 1
0.1%
98601800 1
0.1%
94790000 1
0.1%
81000000 1
0.1%
78500000 1
0.1%
76500000 1
0.1%
76000000 1
0.1%

프로그램 운영계획(SENSE)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct31
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.0321543
Minimum0
Maximum37
Zeros235
Zeros (%)18.9%
Negative0
Negative (%)0.0%
Memory size11.1 KiB
2023-12-12T19:47:13.643669image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q36
95-th percentile14.85
Maximum37
Range37
Interquartile range (IQR)5

Descriptive statistics

Standard deviation5.087229
Coefficient of variation (CV)1.2616652
Kurtosis6.3635836
Mean4.0321543
Median Absolute Deviation (MAD)2
Skewness2.2401413
Sum5016
Variance25.879898
MonotonicityNot monotonic
2023-12-12T19:47:13.799172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=31)
ValueCountFrequency (%)
1 297
23.9%
0 235
18.9%
2 150
12.1%
3 112
 
9.0%
4 71
 
5.7%
5 63
 
5.1%
6 57
 
4.6%
7 41
 
3.3%
9 34
 
2.7%
8 34
 
2.7%
Other values (21) 150
12.1%
ValueCountFrequency (%)
0 235
18.9%
1 297
23.9%
2 150
12.1%
3 112
 
9.0%
4 71
 
5.7%
5 63
 
5.1%
6 57
 
4.6%
7 41
 
3.3%
8 34
 
2.7%
9 34
 
2.7%
ValueCountFrequency (%)
37 1
 
0.1%
33 1
 
0.1%
32 1
 
0.1%
30 2
0.2%
27 3
0.2%
26 1
 
0.1%
25 2
0.2%
24 3
0.2%
22 1
 
0.1%
21 3
0.2%

프로그램 운영계획(Jr.END)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct26
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.6784566
Minimum0
Maximum28
Zeros807
Zeros (%)64.9%
Negative0
Negative (%)0.0%
Memory size11.1 KiB
2023-12-12T19:47:13.933013image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile10
Maximum28
Range28
Interquartile range (IQR)1

Descriptive statistics

Standard deviation3.597776
Coefficient of variation (CV)2.1435026
Kurtosis11.312936
Mean1.6784566
Median Absolute Deviation (MAD)0
Skewness3.050298
Sum2088
Variance12.943992
MonotonicityNot monotonic
2023-12-12T19:47:14.075025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
0 807
64.9%
1 159
 
12.8%
2 44
 
3.5%
5 37
 
3.0%
4 31
 
2.5%
7 27
 
2.2%
3 22
 
1.8%
6 22
 
1.8%
10 18
 
1.4%
9 17
 
1.4%
Other values (16) 60
 
4.8%
ValueCountFrequency (%)
0 807
64.9%
1 159
 
12.8%
2 44
 
3.5%
3 22
 
1.8%
4 31
 
2.5%
5 37
 
3.0%
6 22
 
1.8%
7 27
 
2.2%
8 12
 
1.0%
9 17
 
1.4%
ValueCountFrequency (%)
28 1
 
0.1%
26 1
 
0.1%
25 1
 
0.1%
24 1
 
0.1%
21 2
0.2%
20 2
0.2%
19 1
 
0.1%
18 1
 
0.1%
17 3
0.2%
16 2
0.2%

프로그램 운영계획(END)
Real number (ℝ)

ZEROS 

Distinct15
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.6374598
Minimum0
Maximum15
Zeros425
Zeros (%)34.2%
Negative0
Negative (%)0.0%
Memory size11.1 KiB
2023-12-12T19:47:14.222740image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile6
Maximum15
Range15
Interquartile range (IQR)2

Descriptive statistics

Standard deviation2.0495657
Coefficient of variation (CV)1.2516739
Kurtosis6.8785315
Mean1.6374598
Median Absolute Deviation (MAD)1
Skewness2.1946824
Sum2037
Variance4.2007195
MonotonicityNot monotonic
2023-12-12T19:47:14.353905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=15)
ValueCountFrequency (%)
0 425
34.2%
1 358
28.8%
2 179
14.4%
3 106
 
8.5%
4 65
 
5.2%
5 41
 
3.3%
6 26
 
2.1%
7 18
 
1.4%
8 9
 
0.7%
9 8
 
0.6%
Other values (5) 9
 
0.7%
ValueCountFrequency (%)
0 425
34.2%
1 358
28.8%
2 179
14.4%
3 106
 
8.5%
4 65
 
5.2%
5 41
 
3.3%
6 26
 
2.1%
7 18
 
1.4%
8 9
 
0.7%
9 8
 
0.6%
ValueCountFrequency (%)
15 1
 
0.1%
14 3
 
0.2%
12 2
 
0.2%
11 1
 
0.1%
10 2
 
0.2%
9 8
 
0.6%
8 9
 
0.7%
7 18
1.4%
6 26
2.1%
5 41
3.3%

프로그램 운영계획(CROWN)
Real number (ℝ)

ZEROS 

Distinct6
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.12620579
Minimum0
Maximum6
Zeros1117
Zeros (%)89.8%
Negative0
Negative (%)0.0%
Memory size11.1 KiB
2023-12-12T19:47:14.490142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum6
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.43126569
Coefficient of variation (CV)3.4171626
Kurtosis41.946222
Mean0.12620579
Median Absolute Deviation (MAD)0
Skewness5.1934616
Sum157
Variance0.1859901
MonotonicityNot monotonic
2023-12-12T19:47:14.636070image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 1117
89.8%
1 107
 
8.6%
2 14
 
1.1%
3 4
 
0.3%
6 1
 
0.1%
4 1
 
0.1%
ValueCountFrequency (%)
0 1117
89.8%
1 107
 
8.6%
2 14
 
1.1%
3 4
 
0.3%
4 1
 
0.1%
6 1
 
0.1%
ValueCountFrequency (%)
6 1
 
0.1%
4 1
 
0.1%
3 4
 
0.3%
2 14
 
1.1%
1 107
 
8.6%
0 1117
89.8%

Interactions

2023-12-12T19:47:09.791602image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:06.450672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:07.356545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:08.212951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:09.019068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:09.945194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:06.636937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:07.535511image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:08.382591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:09.189144image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:10.092658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:06.834700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:07.705913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:08.527923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:09.327289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:10.225299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:07.003061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:07.860572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:08.670940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:09.456146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:10.366539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:07.194303image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:08.034791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:08.849417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:47:09.607742image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:47:14.751675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명학교급구분예산(원)프로그램 운영계획(SENSE)프로그램 운영계획(Jr.END)프로그램 운영계획(END)프로그램 운영계획(CROWN)
시도명1.0000.0000.0700.3900.2510.1820.3080.157
학교급0.0001.0000.2960.3920.5000.6090.3840.261
구분0.0700.2961.0000.3220.5770.3200.4460.156
예산(원)0.3900.3920.3221.0000.9170.8950.4810.084
프로그램 운영계획(SENSE)0.2510.5000.5770.9171.0000.8940.5410.000
프로그램 운영계획(Jr.END)0.1820.6090.3200.8950.8941.0000.2990.000
프로그램 운영계획(END)0.3080.3840.4460.4810.5410.2991.0000.422
프로그램 운영계획(CROWN)0.1570.2610.1560.0840.0000.0000.4221.000
2023-12-12T19:47:14.911967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
시도명학교급구분
시도명1.0000.0000.063
학교급0.0001.0000.197
구분0.0630.1971.000
2023-12-12T19:47:15.039417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
예산(원)프로그램 운영계획(SENSE)프로그램 운영계획(Jr.END)프로그램 운영계획(END)프로그램 운영계획(CROWN)시도명학교급구분
예산(원)1.0000.7550.4590.4780.0890.1620.2440.246
프로그램 운영계획(SENSE)0.7551.0000.6010.480-0.0500.1000.3220.444
프로그램 운영계획(Jr.END)0.4590.6011.000-0.037-0.1310.0680.4120.245
프로그램 운영계획(END)0.4780.480-0.0371.0000.1710.1180.2430.339
프로그램 운영계획(CROWN)0.089-0.050-0.1310.1711.0000.0740.1700.112
시도명0.1620.1000.0680.1180.0741.0000.0000.063
학교급0.2440.3220.4120.2430.1700.0001.0000.197
구분0.2460.4440.2450.3390.1120.0630.1971.000

Missing values

2023-12-12T19:47:10.539316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:47:10.750627image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-12T19:47:11.244587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

시도명시군구명학교급구분예산(원)프로그램 운영계획(SENSE)프로그램 운영계획(Jr.END)프로그램 운영계획(END)프로그램 운영계획(CROWN)
0강원도강릉시고등학교기본형135000003030
1강원도강릉시고등학교심화형40000000011
2강원도강릉시중학교기본형124000004010
3강원도강릉시중학교심화형80000001020
4강원도강릉시초등학교기본형3310000013720
5강원도강릉시초등학교심화형75000002110
6강원도강릉시특수/기타학교기본형10000000000
7강원도고성군고등학교기본형26000000020
8강원도고성군중학교기본형21000002000
9강원도고성군초등학교기본형41000004110
시도명시군구명학교급구분예산(원)프로그램 운영계획(SENSE)프로그램 운영계획(Jr.END)프로그램 운영계획(END)프로그램 운영계획(CROWN)
1234충청북도청주시 흥덕구초등학교기본형49600000131110
1235충청북도청주시 흥덕구특수/기타학교기본형10000001100
1236충청북도충주시고등학교기본형190000006030
1237충청북도충주시고등학교심화형50000000000
1238충청북도충주시중학교기본형2450000010050
1239충청북도충주시중학교심화형70000001010
1240충청북도충주시초등학교기본형43700000191310
1241충청북도충주시특수/기타학교기본형40000001110
1242<NA><NA>중학교기본형23000001000
1243<NA><NA>초등학교기본형40000004400