Overview

Dataset statistics

Number of variables20
Number of observations10000
Missing cells93577
Missing cells (%)46.8%
Duplicate rows1596
Duplicate rows (%)16.0%
Total size in memory1.6 MiB
Average record size in memory173.0 B

Variable types

Numeric3
Categorical5
Text2
Boolean8
Unsupported2

Dataset

Description교복 구매 유형 및 단가 현황
Author경기도
URLhttps://data.gg.go.kr/portal/data/service/selectServicePage.do?&infId=UOO00DLMWBU8XNQ292ZD23603470&infSeq=2

Alerts

동복학교주관구매여부 has constant value ""Constant
동복공동구매여부 has constant value ""Constant
동복개별구매여부 has constant value ""Constant
하복학교주관구매여부 has constant value ""Constant
하복공동구매여부 has constant value ""Constant
하복개별구매여부 has constant value ""Constant
하복해당없음여부 has constant value ""Constant
Dataset has 1596 (16.0%) duplicate rowsDuplicates
제외여부 is highly overall correlated with 동복평균가격(원) and 3 other fieldsHigh correlation
지역명 is highly overall correlated with 시군명 and 1 other fieldsHigh correlation
학교급명 is highly overall correlated with 제외여부High correlation
시군명 is highly overall correlated with 지역교육청명 and 1 other fieldsHigh correlation
동복평균가격(원) is highly overall correlated with 하복평균가격(원) and 1 other fieldsHigh correlation
하복평균가격(원) is highly overall correlated with 동복평균가격(원) and 1 other fieldsHigh correlation
지역교육청명 is highly overall correlated with 시군명 and 3 other fieldsHigh correlation
설립구분명 is highly overall correlated with 지역교육청명High correlation
지역교육청명 is highly imbalanced (68.4%)Imbalance
지역명 is highly imbalanced (68.5%)Imbalance
학교급명 is highly imbalanced (68.2%)Imbalance
설립구분명 is highly imbalanced (60.1%)Imbalance
학교명 has 8078 (80.8%) missing valuesMissing
제외사유 has 4417 (44.2%) missing valuesMissing
비대상여부 has 10000 (100.0%) missing valuesMissing
동복학교주관구매여부 has 9593 (95.9%) missing valuesMissing
동복공동구매여부 has 9709 (97.1%) missing valuesMissing
동복해당없음여부 has 10000 (100.0%) missing valuesMissing
동복평균가격(원) has 6222 (62.2%) missing valuesMissing
하복학교주관구매여부 has 9600 (96.0%) missing valuesMissing
하복공동구매여부 has 9714 (97.1%) missing valuesMissing
하복해당없음여부 has 9994 (99.9%) missing valuesMissing
하복평균가격(원) has 6248 (62.5%) missing valuesMissing
비대상여부 is an unsupported type, check if it needs cleaning or further analysisUnsupported
동복해당없음여부 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2023-12-10 21:04:53.410911
Analysis finished2023-12-10 21:04:56.716652
Duration3.31 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기준년도
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2016.993
Minimum2014
Maximum2020
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:04:56.778076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2014
5-th percentile2014
Q12015
median2016
Q32019
95-th percentile2020
Maximum2020
Range6
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.0465507
Coefficient of variation (CV)0.0010146543
Kurtosis-1.4205623
Mean2016.993
Median Absolute Deviation (MAD)2
Skewness0.036753667
Sum20169930
Variance4.1883698
MonotonicityNot monotonic
2023-12-11T06:04:56.887131image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
2018 1995
20.0%
2015 1927
19.3%
2016 1860
18.6%
2019 1529
15.3%
2020 1443
14.4%
2014 1246
12.5%
ValueCountFrequency (%)
2014 1246
12.5%
2015 1927
19.3%
2016 1860
18.6%
2018 1995
20.0%
2019 1529
15.3%
2020 1443
14.4%
ValueCountFrequency (%)
2020 1443
14.4%
2019 1529
15.3%
2018 1995
20.0%
2016 1860
18.6%
2015 1927
19.3%
2014 1246
12.5%

시군명
Categorical

HIGH CORRELATION 

Distinct31
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수원시
851 
용인시
759 
고양시
693 
화성시
 
642
성남시
 
625
Other values (26)
6430 

Length

Max length4
Median length3
Mean length3.0858
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row양주시
2nd row의정부시
3rd row이천시
4th row의정부시
5th row수원시

Common Values

ValueCountFrequency (%)
수원시 851
 
8.5%
용인시 759
 
7.6%
고양시 693
 
6.9%
화성시 642
 
6.4%
성남시 625
 
6.2%
부천시 539
 
5.4%
안산시 466
 
4.7%
파주시 460
 
4.6%
남양주시 452
 
4.5%
평택시 438
 
4.4%
Other values (21) 4075
40.8%

Length

2023-12-11T06:04:57.008306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수원시 851
 
8.5%
용인시 759
 
7.6%
고양시 693
 
6.9%
화성시 642
 
6.4%
성남시 625
 
6.2%
부천시 539
 
5.4%
안산시 466
 
4.7%
파주시 460
 
4.6%
남양주시 452
 
4.5%
평택시 438
 
4.4%
Other values (21) 4075
40.8%

지역교육청명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct28
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8078 
경기도교육청
 
423
경기도수원교육지원청
 
125
경기도용인교육지원청
 
124
경기도화성오산교육지원청
 
122
Other values (23)
1128 

Length

Max length13
Median length4
Mean length5.0855
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도동두천양주교육지원청
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8078
80.8%
경기도교육청 423
 
4.2%
경기도수원교육지원청 125
 
1.2%
경기도용인교육지원청 124
 
1.2%
경기도화성오산교육지원청 122
 
1.2%
경기도구리남양주교육지원청 89
 
0.9%
경기도고양교육지원청 88
 
0.9%
경기도성남교육지원청 83
 
0.8%
경기도안산교육지원청 75
 
0.8%
경기도부천교육지원청 73
 
0.7%
Other values (18) 720
 
7.2%

Length

2023-12-11T06:04:57.123361image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 8078
80.8%
경기도교육청 423
 
4.2%
경기도수원교육지원청 125
 
1.2%
경기도용인교육지원청 124
 
1.2%
경기도화성오산교육지원청 122
 
1.2%
경기도구리남양주교육지원청 89
 
0.9%
경기도고양교육지원청 88
 
0.9%
경기도성남교육지원청 83
 
0.8%
경기도안산교육지원청 75
 
0.8%
경기도부천교육지원청 73
 
0.7%
Other values (18) 720
 
7.2%

지역명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct43
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8078 
경기도 화성시
 
120
경기도 부천시
 
102
경기도 남양주시
 
91
경기도 파주시
 
88
Other values (38)
1521 

Length

Max length12
Median length4
Mean length4.8865
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row경기도 양주시
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8078
80.8%
경기도 화성시 120
 
1.2%
경기도 부천시 102
 
1.0%
경기도 남양주시 91
 
0.9%
경기도 파주시 88
 
0.9%
경기도 평택시 82
 
0.8%
경기도 김포시 62
 
0.6%
경기도 시흥시 61
 
0.6%
경기도 성남시 분당구 61
 
0.6%
경기도 의정부시 60
 
0.6%
Other values (33) 1195
 
11.9%

Length

2023-12-11T06:04:57.244228image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 8078
63.9%
경기도 1922
 
15.2%
수원시 163
 
1.3%
용인시 150
 
1.2%
화성시 120
 
0.9%
고양시 115
 
0.9%
성남시 114
 
0.9%
부천시 102
 
0.8%
안산시 92
 
0.7%
남양주시 91
 
0.7%
Other values (40) 1691
 
13.4%

학교명
Text

MISSING 

Distinct1408
Distinct (%)73.3%
Missing8078
Missing (%)80.8%
Memory size156.2 KiB
2023-12-11T06:04:57.486639image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length6
Mean length6.2819979
Min length4

Characters and Unicode

Total characters12074
Distinct characters298
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique960 ?
Unique (%)49.9%

Sample

1st row은봉초등학교
2nd row장촌초등학교
3rd row마산초등학교
4th row연무중학교
5th row운암초등학교
ValueCountFrequency (%)
관인고등학교 4
 
0.2%
창현초등학교 4
 
0.2%
의정부부용초등학교 4
 
0.2%
안화중학교 4
 
0.2%
포일초등학교 4
 
0.2%
양평전자과학고등학교 4
 
0.2%
중흥초등학교 4
 
0.2%
송북초등학교 3
 
0.2%
평촌초등학교 3
 
0.2%
상품초등학교 3
 
0.2%
Other values (1399) 1886
98.1%
2023-12-11T06:04:57.846140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1957
16.2%
1954
16.2%
1439
 
11.9%
1039
 
8.6%
518
 
4.3%
447
 
3.7%
132
 
1.1%
124
 
1.0%
119
 
1.0%
112
 
0.9%
Other values (288) 4233
35.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 12051
99.8%
Lowercase Letter 14
 
0.1%
Uppercase Letter 4
 
< 0.1%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%
Space Separator 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1957
16.2%
1954
16.2%
1439
 
11.9%
1039
 
8.6%
518
 
4.3%
447
 
3.7%
132
 
1.1%
124
 
1.0%
119
 
1.0%
112
 
0.9%
Other values (273) 4210
34.9%
Lowercase Letter
ValueCountFrequency (%)
s 4
28.6%
e 2
14.3%
i 2
14.3%
n 2
14.3%
g 1
 
7.1%
l 1
 
7.1%
h 1
 
7.1%
u 1
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
E 1
25.0%
B 1
25.0%
I 1
25.0%
T 1
25.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 12051
99.8%
Latin 18
 
0.1%
Common 5
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1957
16.2%
1954
16.2%
1439
 
11.9%
1039
 
8.6%
518
 
4.3%
447
 
3.7%
132
 
1.1%
124
 
1.0%
119
 
1.0%
112
 
0.9%
Other values (273) 4210
34.9%
Latin
ValueCountFrequency (%)
s 4
22.2%
e 2
11.1%
i 2
11.1%
n 2
11.1%
E 1
 
5.6%
g 1
 
5.6%
l 1
 
5.6%
h 1
 
5.6%
B 1
 
5.6%
u 1
 
5.6%
Other values (2) 2
11.1%
Common
ValueCountFrequency (%)
( 2
40.0%
) 2
40.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 12051
99.8%
ASCII 23
 
0.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1957
16.2%
1954
16.2%
1439
 
11.9%
1039
 
8.6%
518
 
4.3%
447
 
3.7%
132
 
1.1%
124
 
1.0%
119
 
1.0%
112
 
0.9%
Other values (273) 4210
34.9%
ASCII
ValueCountFrequency (%)
s 4
17.4%
e 2
 
8.7%
i 2
 
8.7%
( 2
 
8.7%
) 2
 
8.7%
n 2
 
8.7%
E 1
 
4.3%
g 1
 
4.3%
l 1
 
4.3%
h 1
 
4.3%
Other values (5) 5
21.7%

학교급명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct9
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8078 
초등학교
1028 
중학교
 
463
고등학교
 
399
특수학교
 
19
Other values (4)
 
13

Length

Max length7
Median length4
Mean length3.9544
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row초등학교
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8078
80.8%
초등학교 1028
 
10.3%
중학교 463
 
4.6%
고등학교 399
 
4.0%
특수학교 19
 
0.2%
방통고 6
 
0.1%
각종학교(고) 3
 
< 0.1%
각종학교(중) 2
 
< 0.1%
방통중 2
 
< 0.1%

Length

2023-12-11T06:04:58.000339image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:04:58.135847image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8078
80.8%
초등학교 1028
 
10.3%
중학교 463
 
4.6%
고등학교 399
 
4.0%
특수학교 19
 
0.2%
방통고 6
 
0.1%
각종학교(고 3
 
< 0.1%
각종학교(중 2
 
< 0.1%
방통중 2
 
< 0.1%

설립구분명
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
<NA>
8078 
공립
1732 
사립
 
188
국립
 
2

Length

Max length4
Median length4
Mean length3.6156
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row공립
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 8078
80.8%
공립 1732
 
17.3%
사립 188
 
1.9%
국립 2
 
< 0.1%

Length

2023-12-11T06:04:58.281626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T06:04:58.394264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 8078
80.8%
공립 1732
 
17.3%
사립 188
 
1.9%
국립 2
 
< 0.1%

제외여부
Boolean

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size97.7 KiB
True
5583 
False
4415 
(Missing)
 
2
ValueCountFrequency (%)
True 5583
55.8%
False 4415
44.1%
(Missing) 2
 
< 0.1%
2023-12-11T06:04:58.471087image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

제외사유
Text

MISSING 

Distinct1347
Distinct (%)24.1%
Missing4417
Missing (%)44.2%
Memory size156.2 KiB
2023-12-11T06:04:58.703097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length69
Median length46
Mean length14.121082
Min length2

Characters and Unicode

Total characters78838
Distinct characters232
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique778 ?
Unique (%)13.9%

Sample

1st row교복미착용학교이므로 제외처리함.
2nd row본교는 교복미착용으로 공시내용이 없으므로 제외함.
3rd row본교는 해당항목에 대해서 통계자료가 없으므로 제외함.
4th row교복미착용
5th row교복 없음
ValueCountFrequency (%)
교복 2002
 
10.4%
본교는 1455
 
7.5%
미착용 1233
 
6.4%
교복을 948
 
4.9%
착용하지 775
 
4.0%
없음 755
 
3.9%
해당없음 689
 
3.6%
제외함 619
 
3.2%
없으므로 591
 
3.1%
않으므로 583
 
3.0%
Other values (704) 9649
50.0%
2023-12-11T06:04:59.239250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13964
 
17.7%
6391
 
8.1%
3759
 
4.8%
3383
 
4.3%
3149
 
4.0%
2828
 
3.6%
2655
 
3.4%
2365
 
3.0%
2309
 
2.9%
1962
 
2.5%
Other values (222) 36073
45.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 62712
79.5%
Space Separator 13964
 
17.7%
Other Punctuation 1662
 
2.1%
Decimal Number 211
 
0.3%
Close Punctuation 140
 
0.2%
Open Punctuation 139
 
0.2%
Dash Punctuation 7
 
< 0.1%
Uppercase Letter 2
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
6391
 
10.2%
3759
 
6.0%
3383
 
5.4%
3149
 
5.0%
2828
 
4.5%
2655
 
4.2%
2365
 
3.8%
2309
 
3.7%
1962
 
3.1%
1903
 
3.0%
Other values (194) 32008
51.0%
Other Punctuation
ValueCountFrequency (%)
. 1585
95.4%
, 39
 
2.3%
12
 
0.7%
? 5
 
0.3%
/ 4
 
0.2%
# 4
 
0.2%
; 4
 
0.2%
& 4
 
0.2%
· 4
 
0.2%
: 1
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 53
25.1%
2 48
22.7%
0 46
21.8%
9 19
 
9.0%
3 13
 
6.2%
5 12
 
5.7%
4 8
 
3.8%
7 6
 
2.8%
6 5
 
2.4%
8 1
 
0.5%
Close Punctuation
ValueCountFrequency (%)
) 139
99.3%
] 1
 
0.7%
Open Punctuation
ValueCountFrequency (%)
( 138
99.3%
[ 1
 
0.7%
Space Separator
ValueCountFrequency (%)
13964
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Uppercase Letter
ValueCountFrequency (%)
X 2
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 62712
79.5%
Common 16124
 
20.5%
Latin 2
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
6391
 
10.2%
3759
 
6.0%
3383
 
5.4%
3149
 
5.0%
2828
 
4.5%
2655
 
4.2%
2365
 
3.8%
2309
 
3.7%
1962
 
3.1%
1903
 
3.0%
Other values (194) 32008
51.0%
Common
ValueCountFrequency (%)
13964
86.6%
. 1585
 
9.8%
) 139
 
0.9%
( 138
 
0.9%
1 53
 
0.3%
2 48
 
0.3%
0 46
 
0.3%
, 39
 
0.2%
9 19
 
0.1%
3 13
 
0.1%
Other values (17) 80
 
0.5%
Latin
ValueCountFrequency (%)
X 2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 62712
79.5%
ASCII 16110
 
20.4%
None 16
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
13964
86.7%
. 1585
 
9.8%
) 139
 
0.9%
( 138
 
0.9%
1 53
 
0.3%
2 48
 
0.3%
0 46
 
0.3%
, 39
 
0.2%
9 19
 
0.1%
3 13
 
0.1%
Other values (16) 66
 
0.4%
Hangul
ValueCountFrequency (%)
6391
 
10.2%
3759
 
6.0%
3383
 
5.4%
3149
 
5.0%
2828
 
4.5%
2655
 
4.2%
2365
 
3.8%
2309
 
3.7%
1962
 
3.1%
1903
 
3.0%
Other values (194) 32008
51.0%
None
ValueCountFrequency (%)
12
75.0%
· 4
 
25.0%

비대상여부
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

동복학교주관구매여부
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)0.2%
Missing9593
Missing (%)95.9%
Memory size97.7 KiB
True
 
407
(Missing)
9593 
ValueCountFrequency (%)
True 407
 
4.1%
(Missing) 9593
95.9%
2023-12-11T06:04:59.381813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

동복공동구매여부
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)0.3%
Missing9709
Missing (%)97.1%
Memory size97.7 KiB
True
 
291
(Missing)
9709 
ValueCountFrequency (%)
True 291
 
2.9%
(Missing) 9709
97.1%
2023-12-11T06:04:59.473686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

동복개별구매여부
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
10000 
ValueCountFrequency (%)
False 10000
100.0%
2023-12-11T06:04:59.552817image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

동복해당없음여부
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing10000
Missing (%)100.0%
Memory size166.0 KiB

동복평균가격(원)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct311
Distinct (%)8.2%
Missing6222
Missing (%)62.2%
Infinite0
Infinite (%)0.0%
Mean191671.17
Minimum58000
Maximum320000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:04:59.681678image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum58000
5-th percentile137850
Q1175000
median197000
Q3210000
95-th percentile245000
Maximum320000
Range262000
Interquartile range (IQR)35000

Descriptive statistics

Standard deviation30589.496
Coefficient of variation (CV)0.15959362
Kurtosis1.4076303
Mean191671.17
Median Absolute Deviation (MAD)16000
Skewness-0.21627204
Sum7.2413368 × 108
Variance9.3571726 × 108
MonotonicityNot monotonic
2023-12-11T06:04:59.844965image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
213000 133
 
1.3%
210000 132
 
1.3%
198000 114
 
1.1%
203000 101
 
1.0%
212000 96
 
1.0%
200000 93
 
0.9%
175000 90
 
0.9%
185000 87
 
0.9%
195000 78
 
0.8%
189000 76
 
0.8%
Other values (301) 2778
27.8%
(Missing) 6222
62.2%
ValueCountFrequency (%)
58000 2
< 0.1%
69000 2
< 0.1%
75000 2
< 0.1%
76000 1
< 0.1%
79000 1
< 0.1%
83000 1
< 0.1%
88000 1
< 0.1%
90000 1
< 0.1%
93900 2
< 0.1%
98000 2
< 0.1%
ValueCountFrequency (%)
320000 1
 
< 0.1%
319375 2
< 0.1%
305000 1
 
< 0.1%
300000 4
< 0.1%
293000 1
 
< 0.1%
292000 1
 
< 0.1%
290000 2
< 0.1%
287000 1
 
< 0.1%
285000 3
< 0.1%
284000 3
< 0.1%

하복학교주관구매여부
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)0.2%
Missing9600
Missing (%)96.0%
Memory size97.7 KiB
True
 
400
(Missing)
9600 
ValueCountFrequency (%)
True 400
 
4.0%
(Missing) 9600
96.0%
2023-12-11T06:04:59.963831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

하복공동구매여부
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)0.3%
Missing9714
Missing (%)97.1%
Memory size97.7 KiB
True
 
286
(Missing)
9714 
ValueCountFrequency (%)
True 286
 
2.9%
(Missing) 9714
97.1%
2023-12-11T06:05:00.061025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

하복개별구매여부
Boolean

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size87.9 KiB
False
10000 
ValueCountFrequency (%)
False 10000
100.0%
2023-12-11T06:05:00.130599image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

하복해당없음여부
Boolean

CONSTANT  MISSING 

Distinct1
Distinct (%)16.7%
Missing9994
Missing (%)99.9%
Memory size97.7 KiB
True
 
6
(Missing)
9994 
ValueCountFrequency (%)
True 6
 
0.1%
(Missing) 9994
99.9%
2023-12-11T06:05:00.201961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

하복평균가격(원)
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct228
Distinct (%)6.1%
Missing6248
Missing (%)62.5%
Infinite0
Infinite (%)0.0%
Mean81593.48
Minimum17000
Maximum272000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2023-12-11T06:05:00.308450image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17000
5-th percentile60000
Q174000
median81000
Q386000
95-th percentile110000
Maximum272000
Range255000
Interquartile range (IQR)12000

Descriptive statistics

Standard deviation16275.632
Coefficient of variation (CV)0.19947221
Kurtosis11.072584
Mean81593.48
Median Absolute Deviation (MAD)6000
Skewness1.73595
Sum3.0613874 × 108
Variance2.6489619 × 108
MonotonicityNot monotonic
2023-12-11T06:05:00.469635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
86000 260
 
2.6%
85000 235
 
2.4%
80000 231
 
2.3%
87000 226
 
2.3%
82000 225
 
2.2%
75000 204
 
2.0%
70000 178
 
1.8%
79000 166
 
1.7%
83000 150
 
1.5%
78000 109
 
1.1%
Other values (218) 1768
 
17.7%
(Missing) 6248
62.5%
ValueCountFrequency (%)
17000 1
 
< 0.1%
19000 1
 
< 0.1%
20000 1
 
< 0.1%
22000 1
 
< 0.1%
25000 4
< 0.1%
27500 1
 
< 0.1%
28000 2
 
< 0.1%
30000 8
0.1%
32000 1
 
< 0.1%
38000 4
< 0.1%
ValueCountFrequency (%)
272000 1
< 0.1%
198000 2
< 0.1%
176000 1
< 0.1%
170000 2
< 0.1%
169000 1
< 0.1%
165000 1
< 0.1%
163400 1
< 0.1%
161000 1
< 0.1%
160000 1
< 0.1%
158000 1
< 0.1%

Interactions

2023-12-11T06:04:55.563378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:04:54.892431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:04:55.208688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:04:55.667156image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:04:54.995057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:04:55.335749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:04:55.771079image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:04:55.093288image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T06:04:55.444527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T06:05:00.560536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도시군명지역교육청명지역명학교급명설립구분명제외여부동복평균가격(원)하복평균가격(원)
기준년도1.0000.0000.0000.0000.0000.1100.0230.4970.287
시군명0.0001.0000.9911.0000.1950.2250.1070.3530.263
지역교육청명0.0000.9911.0000.9930.7240.9400.6030.3720.269
지역명0.0001.0000.9931.0000.2230.3570.0980.4860.400
학교급명0.0000.1950.7240.2231.0000.5150.9990.0380.187
설립구분명0.1100.2250.9400.3570.5151.0000.2010.1460.066
제외여부0.0230.1070.6030.0980.9990.2011.000NaNNaN
동복평균가격(원)0.4970.3530.3720.4860.0380.146NaN1.0000.533
하복평균가격(원)0.2870.2630.2690.4000.1870.066NaN0.5331.000
2023-12-11T06:05:00.954921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
제외여부지역명학교급명시군명지역교육청명설립구분명
제외여부1.0000.0770.9640.0910.5200.330
지역명0.0771.0000.0860.9970.8460.176
학교급명0.9640.0861.0000.0760.3890.380
시군명0.0910.9970.0761.0000.8470.114
지역교육청명0.5200.8460.3890.8471.0000.750
설립구분명0.3300.1760.3800.1140.7501.000
2023-12-11T06:05:01.069811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준년도동복평균가격(원)하복평균가격(원)시군명지역교육청명지역명학교급명설립구분명제외여부
기준년도1.0000.1230.2160.0000.0000.0000.0000.0210.082
동복평균가격(원)0.1231.0000.6350.1300.1420.1840.0150.1121.000
하복평균가격(원)0.2160.6351.0000.1000.1160.1580.1200.0711.000
시군명0.0000.1300.1001.0000.8470.9970.0760.1140.091
지역교육청명0.0000.1420.1160.8471.0000.8460.3890.7500.520
지역명0.0000.1840.1580.9970.8461.0000.0860.1760.077
학교급명0.0000.0150.1200.0760.3890.0861.0000.3800.964
설립구분명0.0210.1120.0710.1140.7500.1760.3801.0000.330
제외여부0.0821.0001.0000.0910.5200.0770.9640.3301.000

Missing values

2023-12-11T06:04:55.941856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T06:04:56.252307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2023-12-11T06:04:56.525224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유비대상여부동복학교주관구매여부동복공동구매여부동복개별구매여부동복해당없음여부동복평균가격(원)하복학교주관구매여부하복공동구매여부하복개별구매여부하복해당없음여부하복평균가격(원)
204902018양주시경기도동두천양주교육지원청경기도 양주시은봉초등학교초등학교공립Y교복미착용학교이므로 제외처리함.<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
482332014의정부시<NA><NA><NA><NA><NA>Y본교는 교복미착용으로 공시내용이 없으므로 제외함.<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
414572015이천시<NA><NA><NA><NA><NA>Y본교는 해당항목에 대해서 통계자료가 없으므로 제외함.<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
412962015의정부시<NA><NA><NA><NA><NA>N<NA><NA><NA><NA>N<NA>170000<NA><NA>N<NA><NA>
373322015수원시<NA><NA><NA><NA><NA>Y교복미착용<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
250802016고양시경기도고양교육지원청경기도 고양시 일산서구장촌초등학교초등학교공립Y교복 없음<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
71282020화성시<NA><NA><NA><NA><NA>Y교복 미착용<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
432472015화성시경기도화성오산교육지원청경기도 화성시마산초등학교초등학교공립Y해당사항 없음<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
97722019성남시<NA><NA><NA><NA><NA>Y교복미착용<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
270502016부천시<NA><NA><NA><NA><NA>Y교복 입지 않음.<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유비대상여부동복학교주관구매여부동복공동구매여부동복개별구매여부동복해당없음여부동복평균가격(원)하복학교주관구매여부하복공동구매여부하복개별구매여부하복해당없음여부하복평균가격(원)
319852016이천시<NA><NA><NA><NA><NA>N<NA><NA><NA><NA>N<NA>191000<NA><NA>N<NA>80000
18472020동두천시<NA><NA><NA><NA><NA>Y교복 없음<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
255562016광주시<NA><NA><NA><NA><NA>Y해당없음<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
46292020양평군<NA><NA><NA><NA><NA>Y교복미착용<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
451212014부천시<NA><NA><NA><NA><NA>Y교복 미착용으로 제외처리<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
391022015안양시경기도안양과천교육지원청경기도 안양시 동안구달안초등학교초등학교공립Y교복 미착용<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
246672016고양시<NA><NA><NA><NA><NA>Y해당 없음<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
175062018부천시<NA><NA><NA><NA><NA>N<NA><NA><NA><NA>N<NA>198000<NA><NA>N<NA>81000
475132014연천군<NA><NA><NA><NA><NA>Y해당사항 없음 (교복 미착용)<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>
173812018부천시<NA><NA><NA><NA><NA>Y해당사항 없음<NA><NA><NA>N<NA><NA><NA><NA>N<NA><NA>

Duplicate rows

Most frequently occurring

기준년도시군명지역교육청명지역명학교명학교급명설립구분명제외여부제외사유동복학교주관구매여부동복공동구매여부동복개별구매여부동복평균가격(원)하복학교주관구매여부하복공동구매여부하복개별구매여부하복해당없음여부하복평균가격(원)# duplicates
3162015용인시<NA><NA><NA><NA><NA>N<NA><NA><NA>N<NA><NA><NA>N<NA><NA>23
11672019안산시<NA><NA><NA><NA><NA>N<NA><NA><NA>N212000<NA><NA>N<NA>8600022
2092015성남시<NA><NA><NA><NA><NA>N<NA><NA><NA>N<NA><NA><NA>N<NA><NA>18
2262015수원시<NA><NA><NA><NA><NA>N<NA><NA><NA>N<NA><NA><NA>N<NA><NA>13
272014부천시<NA><NA><NA><NA><NA>Y본교는 해당항목에 대해서 공시내용이 없으므로 제외함.<NA><NA>N<NA><NA><NA>N<NA><NA>12
7032018고양시<NA><NA><NA><NA><NA>Y본교는 교복을 착용하지 않으므로 해당 항목을 제외처리 함<NA><NA>N<NA><NA><NA>N<NA><NA>12
15062020용인시<NA><NA><NA><NA><NA>N<NA><NA><NA>N213000<NA><NA>N<NA>8700012
15852020화성시<NA><NA><NA><NA><NA>Y교복 미착용<NA><NA>N<NA><NA><NA>N<NA><NA>12
422014수원시<NA><NA><NA><NA><NA>Y본교는 해당항목에 대해서 공시내용이 없으므로 제외함.<NA><NA>N<NA><NA><NA>N<NA><NA>11
1342015고양시<NA><NA><NA><NA><NA>N<NA><NA><NA>N<NA><NA><NA>N<NA><NA>11