Overview

Dataset statistics

Number of variables: 8
Number of observations: 811
Missing cells: 811
Missing cells (%): 12.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 53.2 KiB
Average record size in memory: 67.2 B
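
The missing-cell percentage follows from the counts above: missing cells divided by rows times columns. A minimal Python sketch of that arithmetic (the function name is illustrative and not part of ydata-profiling):

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Return missing cells as a percentage of all cells in the table."""
    total_cells = n_rows * n_cols
    return 100.0 * missing_cells / total_cells


# Values taken from the dataset statistics above.
pct = missing_cell_pct(missing_cells=811, n_rows=811, n_cols=8)
print(f"{pct:.1f}%")  # 12.5%
```

All 811 missing cells belong to the single empty column 비고, which is why the share is exactly one eighth.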

Variable types

Numeric: 2
Categorical: 4
Text: 1
Unsupported: 1

Dataset

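Description: Status of school play facilities under the Gangwon State Office of Education (강원특별자치도교육청). For each school, the data lists the play facility name, safety inspection status, safety training completion, and insurance enrollment status.
URL: https://www.data.go.kr/data/15061998/fileData.do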

Alerts

안전검사여부 is highly imbalanced (94.4%) [Imbalance]
보험가입여부 is highly imbalanced (76.6%) [Imbalance]
비고 has 811 (100.0%) missing values [Missing]
연번 has unique values [Unique]
시설번호 has unique values [Unique]
비고 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
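
The two imbalance percentages are consistent with an entropy-based score: one minus the Shannon entropy of the class counts, normalized by the maximum possible entropy for that number of classes. The sketch below reproduces the reported figures from the category counts given later in this report. It is an illustration under that assumption, not a copy of the library's code, and the function name is hypothetical.

```python
import math


def imbalance_score(counts):
    """1 - normalized Shannon entropy: 0 = perfectly balanced, 1 = a single class."""
    n_classes = len(counts)
    if n_classes <= 1:
        return 0.0  # assumption: a single class is reported as 0
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)


inspection = imbalance_score([803, 5, 3])  # 안전검사여부 counts
insurance = imbalance_score([780, 31])     # 보험가입여부 counts
print(f"{inspection:.1%}, {insurance:.1%}")  # 94.4%, 76.6%
```

By the same measure, 안전교육이수 (720 vs. 91) scores lower than both, which fits its absence from the alert list.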

Reproduction

Analysis started: 2023-12-12 20:18:02.483465
Analysis finished: 2023-12-12 20:18:04.014309
Duration: 1.53 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
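
A report like this one can be regenerated from the source CSV with ydata-profiling. The sketch below is a minimal example under stated assumptions: the file path, the report title, and the CP949 encoding are hypothetical, and the original run may have used different settings (see config.json).

```python
def build_report(csv_path: str, output_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report.

    Requires the third-party packages pandas and ydata-profiling.
    """
    # Imported here so the sketch can be loaded without the optional packages.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the encoding is a guess; Korean public-data CSV files
    # are commonly distributed as CP949 or UTF-8.
    df = pd.read_csv(csv_path, encoding="cp949")
    profile = ProfileReport(df, title="School play facilities")
    profile.to_file(output_path)
```

Calling `build_report("play_facilities.csv")` with a hypothetical local copy of the file would write `report.html` to the working directory.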

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 811
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 406
Minimum: 1
Maximum: 811
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 41.5
Q1: 203.5
Median: 406
Q3: 608.5
95-th percentile: 770.5
Maximum: 811
Range: 810
Interquartile range (IQR): 405
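
These quantiles are what linear interpolation gives for a column holding the integers 1 to 811, which is consistent with the other statistics reported for 연번. A short check using only the Python standard library, assuming the column really is that gap-free sequence:

```python
import statistics

values = list(range(1, 812))  # assumption: 연번 runs from 1 to 811 without gaps

# method="inclusive" interpolates linearly between the minimum and the maximum.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

print(q1, median, q3, q3 - q1)  # 203.5 406.0 608.5 405.0
print(p5, p95)                  # 41.5 770.5
```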

Descriptive statistics

Standard deviation: 234.25983
Coefficient of variation (CV): 0.57699465
Kurtosis: -1.2
Mean: 406
Median Absolute Deviation (MAD): 203
Skewness: 0
Sum: 329266
Variance: 54877.667
Monotonicity: Strictly increasing
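
The descriptive statistics can be cross-checked the same way. The sketch below assumes, as the sum and the strictly increasing order suggest, that 연번 is the sequence 1 to 811. It uses the sample variance (n - 1 denominator), which matches the reported value.

```python
import math
import statistics

values = list(range(1, 812))  # assumption: 연번 is 1..811

total = sum(values)
mean = total / len(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                         # coefficient of variation
center = statistics.median(values)
mad = statistics.median(abs(v - center) for v in values)

print(total, mean, mad)                # 329266 406.0 203
print(f"{variance:.3f} {std:.5f} {cv:.8f}")
```

The second line prints the variance, standard deviation, and CV at the report's precision for comparison with the values listed above.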
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
1                   1      0.1%
546                 1      0.1%
536                 1      0.1%
537                 1      0.1%
538                 1      0.1%
539                 1      0.1%
540                 1      0.1%
541                 1      0.1%
542                 1      0.1%
543                 1      0.1%
Other values (801)  801    98.8%

Minimum 10 values

Value  Count  Frequency (%)
1      1      0.1%
2      1      0.1%
3      1      0.1%
4      1      0.1%
5      1      0.1%
6      1      0.1%
7      1      0.1%
8      1      0.1%
9      1      0.1%
10     1      0.1%

Maximum 10 values

Value  Count  Frequency (%)
811    1      0.1%
810    1      0.1%
809    1      0.1%
808    1      0.1%
807    1      0.1%
806    1      0.1%
805    1      0.1%
804    1      0.1%
803    1      0.1%
802    1      0.1%

시설번호 (facility number)
Real number (ℝ)

UNIQUE 

Distinct: 811
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 460077.41
Minimum: 459
Maximum: 588109
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 7.3 KiB

Quantile statistics

Minimum: 459
5-th percentile: 2797
Q1: 508561
Median: 512360
Q3: 544731
95-th percentile: 581522.5
Maximum: 588109
Range: 587650
Interquartile range (IQR): 36170

Descriptive statistics

Standard deviation: 179738.99
Coefficient of variation (CV): 0.39067119
Kurtosis: 2.1542433
Mean: 460077.41
Median Absolute Deviation (MAD): 17838
Skewness: -1.9948135
Sum: 3.7312278 × 10^8
Variance: 3.2306104 × 10^10
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value               Count  Frequency (%)
508138              1      0.1%
509997              1      0.1%
579026              1      0.1%
580179              1      0.1%
530639              1      0.1%
510202              1      0.1%
584618              1      0.1%
511000              1      0.1%
573168              1      0.1%
508166              1      0.1%
Other values (801)  801    98.8%

Minimum 10 values

Value  Count  Frequency (%)
459    1      0.1%
595    1      0.1%
621    1      0.1%
676    1      0.1%
1131   1      0.1%
1277   1      0.1%
1278   1      0.1%
1443   1      0.1%
1447   1      0.1%
1449   1      0.1%

Maximum 10 values

Value   Count  Frequency (%)
588109  1      0.1%
588014  1      0.1%
586275  1      0.1%
586103  1      0.1%
585934  1      0.1%
585714  1      0.1%
585531  1      0.1%
585501  1      0.1%
585405  1      0.1%
585269  1      0.1%

설치장소 (installation location)
Categorical

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB
학교: 455
유치원: 356

Length

Max length: 3
Median length: 2
Mean length: 2.4389642
Min length: 2
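
The mean length is the count-weighted average of the category string lengths. A small sketch using the two category counts from this section (the helper name is illustrative):

```python
def mean_length(value_counts: dict[str, int]) -> float:
    """Average string length over all rows, given per-value row counts."""
    total_chars = sum(len(value) * count for value, count in value_counts.items())
    total_rows = sum(value_counts.values())
    return total_chars / total_rows


# 학교 has 2 characters and 유치원 has 3: (2 * 455 + 3 * 356) / 811
result = mean_length({"학교": 455, "유치원": 356})
print(f"{result:.7f}")  # 2.4389642
```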

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 학교
2nd row: 유치원
3rd row: 학교
4th row: 유치원
5th row: 학교

Common Values

Value                 Count  Frequency (%)
학교 (school)          455    56.1%
유치원 (kindergarten)  356    43.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the counts listed under Common Values]

놀이시설명 (play facility name)
Text

Distinct: 804
Distinct (%): 99.1%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB

Length

Max length: 23
Median length: 20
Mean length: 12.625154
Min length: 4

Characters and Unicode

Total characters: 10239
Distinct characters: 260
Distinct categories: 8
Distinct scripts: 2
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
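
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module, using two facility names from this dataset as input. The module exposes general categories (for example `Lo` for Other Letter, `Zs` for Space Separator, `No` for Other Number) but not scripts or blocks, so only the category breakdown is shown. The helper name is illustrative.

```python
import unicodedata
from collections import Counter


def category_counts(texts):
    """Count Unicode general categories over every character in the given strings."""
    return Counter(unicodedata.category(ch) for text in texts for ch in text)


# Two facility names taken from the sample rows of this report.
samples = ["가산초등학교 놀이터", "흥양초등학교병설유치원놀이시설②"]
counts = category_counts(samples)
for category, count in counts.most_common():
    print(category, count)
```

Hangul syllables fall under `Lo`, the space under `Zs`, and the circled digit ② under `No`, which matches the category names used in the tables for this variable.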

Unique

Unique: 797
Unique (%): 98.3%

Sample

1st row: 가산초등학교 놀이터
2nd row: 가온유치원내 어린이놀이시설
3rd row: 간성초등학교 놀이터
4th row: 간성초병설유치원 놀이터
5th row: 갈래초등학교 놀이시설

Words

Value               Count  Frequency (%)
놀이시설             300    17.7%
놀이터               205    12.1%
병설유치원           118    7.0%
상상놀이터           53     3.1%
어린이놀이시설       28     1.7%
어린이               12     0.7%
운동장               11     0.7%
친환경               11     0.7%
실내놀이시설         9      0.5%
실외놀이터           9      0.5%
Other values (684)  936    55.3%

Most occurring characters

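Value               Count  Frequency (%)
(space)             881    8.6%
(not preserved)     779    7.6%
(not preserved)     726    7.1%
(not preserved)     707    6.9%
(not preserved)     700    6.8%
(not preserved)     680    6.6%
(not preserved)     653    6.4%
(not preserved)     645    6.3%
(not preserved)     410    4.0%
(not preserved)     390    3.8%
Other values (250)  3668   35.8%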

Most occurring categories

Value              Count  Frequency (%)
Other Letter       9295   90.8%
Space Separator    881    8.6%
Open Punctuation   24     0.2%
Close Punctuation  24     0.2%
Decimal Number     11     0.1%
Dash Punctuation   2      < 0.1%
Other Punctuation  1      < 0.1%
Other Number       1      < 0.1%

Most frequent character per category

Other Letter

Value               Count  Frequency (%)
(not preserved)     779    8.4%
(not preserved)     726    7.8%
(not preserved)     707    7.6%
(not preserved)     700    7.5%
(not preserved)     680    7.3%
(not preserved)     653    7.0%
(not preserved)     645    6.9%
(not preserved)     410    4.4%
(not preserved)     390    4.2%
(not preserved)     372    4.0%
Other values (241)  3233   34.8%

Decimal Number

Value  Count  Frequency (%)
2      9      81.8%
1      1      9.1%
3      1      9.1%

Space Separator

Value    Count  Frequency (%)
(space)  881    100.0%

Open Punctuation

Value  Count  Frequency (%)
(      24     100.0%

Close Punctuation

Value  Count  Frequency (%)
)      24     100.0%

Dash Punctuation

Value  Count  Frequency (%)
-      2      100.0%

Other Punctuation

Value  Count  Frequency (%)
/      1      100.0%

Other Number

Value  Count  Frequency (%)
②      1      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  9295   90.8%
Common  944    9.2%

Most frequent character per script

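Hangul

Value               Count  Frequency (%)
(not preserved)     779    8.4%
(not preserved)     726    7.8%
(not preserved)     707    7.6%
(not preserved)     700    7.5%
(not preserved)     680    7.3%
(not preserved)     653    7.0%
(not preserved)     645    6.9%
(not preserved)     410    4.4%
(not preserved)     390    4.2%
(not preserved)     372    4.0%
Other values (241)  3233   34.8%

Common

Value    Count  Frequency (%)
(space)  881    93.3%
(        24     2.5%
)        24     2.5%
2        9      1.0%
-        2      0.2%
/        1      0.1%
②        1      0.1%
1        1      0.1%
3        1      0.1%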

Most occurring blocks

Value              Count  Frequency (%)
Hangul             9295   90.8%
ASCII              943    9.2%
Enclosed Alphanum  1      < 0.1%

Most frequent character per block

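ASCII

Value    Count  Frequency (%)
(space)  881    93.4%
(        24     2.5%
)        24     2.5%
2        9      1.0%
-        2      0.2%
/        1      0.1%
1        1      0.1%
3        1      0.1%

Hangul

Value               Count  Frequency (%)
(not preserved)     779    8.4%
(not preserved)     726    7.8%
(not preserved)     707    7.6%
(not preserved)     700    7.5%
(not preserved)     680    7.3%
(not preserved)     653    7.0%
(not preserved)     645    6.9%
(not preserved)     410    4.4%
(not preserved)     390    4.2%
(not preserved)     372    4.0%
Other values (241)  3233   34.8%

Enclosed Alphanum

Value  Count  Frequency (%)
②      1      100.0%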

안전검사여부 (safety inspection status)
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB
검사완료: 803
불합격: 5
미검사: 3

Length

Max length: 4
Median length: 4
Mean length: 3.9901356
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 검사완료
2nd row: 검사완료
3rd row: 검사완료
4th row: 검사완료
5th row: 검사완료

Common Values

Value                            Count  Frequency (%)
검사완료 (inspection completed)   803    99.0%
불합격 (failed)                   5      0.6%
미검사 (not inspected)            3      0.4%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the counts listed under Common Values]

안전교육이수 (safety training completion)
Categorical

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB
이수: 720
미이수: 91

Length

Max length: 3
Median length: 2
Mean length: 2.1122072
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 이수
2nd row: 이수
3rd row: 이수
4th row: 이수
5th row: 이수

Common Values

Value                   Count  Frequency (%)
이수 (completed)         720    88.8%
미이수 (not completed)   91     11.2%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the counts listed under Common Values]

보험가입여부 (insurance enrollment status)
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 6.5 KiB
가입: 780
미가입: 31

Length

Max length: 3
Median length: 2
Mean length: 2.0382244
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 가입
2nd row: 가입
3rd row: 가입
4th row: 가입
5th row: 가입

Common Values

Value                  Count  Frequency (%)
가입 (enrolled)         780    96.2%
미가입 (not enrolled)   31     3.8%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the counts listed under Common Values]

비고 (remarks)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 811
Missing (%): 100.0%
Memory size: 7.3 KiB

Interactions

[Figures: four interaction plots; images not preserved]

Correlations

              연번    시설번호  설치장소  안전검사여부  안전교육이수  보험가입여부
연번          1.000   0.085     0.000     0.061         0.242         0.000
시설번호      0.085   1.000     0.081     0.000         0.043         0.000
설치장소      0.000   0.081     1.000     0.000         0.000         0.254
안전검사여부  0.061   0.000     0.000     1.000         0.034         0.197
안전교육이수  0.242   0.043     0.000     0.034         1.000         0.000
보험가입여부  0.000   0.000     0.254     0.197         0.000         1.000

              안전교육이수  설치장소  보험가입여부  안전검사여부
안전교육이수  1.000         0.000     0.000         0.056
설치장소      0.000         1.000     0.163         0.000
보험가입여부  0.000         0.163     1.000         0.324
안전검사여부  0.056         0.000     0.324         1.000

              연번    시설번호  설치장소  안전검사여부  안전교육이수  보험가입여부
연번          1.000   0.017     0.000     0.036         0.184         0.000
시설번호      0.017   1.000     0.132     0.000         0.071         0.000
설치장소      0.000   0.132     1.000     0.000         0.000         0.163
안전검사여부  0.036   0.000     0.000     1.000         0.056         0.324
안전교육이수  0.184   0.071     0.000     0.056         1.000         0.000
보험가입여부  0.000   0.000     0.163     0.324         0.000         1.000
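
This export does not label the three correlation matrices. The 4×4 matrix covers only the categorical variables, which suggests a categorical association measure such as Cramér's V, but that is an assumption. The sketch below shows the plain, uncorrected form of Cramér's V on hypothetical contingency tables. The counts are made up for illustration and are not taken from this dataset, and a profiling tool may apply a bias correction that this version omits.

```python
import math


def cramers_v(table):
    """Uncorrected Cramér's V for a contingency table given as a list of rows."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Hypothetical counts, NOT taken from the dataset.
independent = [[40, 10], [40, 10]]  # identical row distributions
associated = [[50, 0], [0, 50]]     # each row maps to exactly one column
print(cramers_v(independent), cramers_v(associated))  # 0.0 1.0
```

Values near 0 indicate no association and values near 1 a strong one, which is how the coefficients in the matrices above are usually read.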

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index)  연번  시설번호  설치장소  놀이시설명  안전검사여부  안전교육이수  보험가입여부  비고
0  1  508138  학교  가산초등학교 놀이터  검사완료  이수  가입  <NA>
1  2  547625  유치원  가온유치원내 어린이놀이시설  검사완료  이수  가입  <NA>
2  3  2111  학교  간성초등학교 놀이터  검사완료  이수  가입  <NA>
3  4  39079  유치원  간성초병설유치원 놀이터  검사완료  이수  가입  <NA>
4  5  507254  학교  갈래초등학교 놀이시설  검사완료  이수  가입  <NA>
5  6  511625  유치원  갈래초등학교병설유치원 놀이시설  검사완료  이수  가입  <NA>
6  7  508722  학교  갑천초등학교 놀이터  검사완료  이수  가입  <NA>
7  8  510538  학교  강동초등학교 놀이터  검사완료  이수  가입  <NA>
8  9  25720  유치원  강동초등학교병설유치원 놀이터  검사완료  이수  가입  <NA>
9  10  511790  유치원  강룡사유치원놀이터  검사완료  이수  가입  <NA>

Last rows

(index)  연번  시설번호  설치장소  놀이시설명  안전검사여부  안전교육이수  보험가입여부  비고
801  802  565796  학교  효제초등학교 내 실내놀이시설  검사완료  이수  가입  <NA>
802  803  2246  학교  효제초등학교 놀이터  검사완료  이수  가입  <NA>
803  804  554594  유치원  후평숲속유치원 놀이시설  검사완료  이수  가입  <NA>
804  805  509925  학교  후평초등학교 놀이터  검사완료  이수  가입  <NA>
805  806  509694  학교  흥양초등학교 놀이시설  검사완료  이수  가입  <NA>
806  807  510507  유치원  흥양초등학교병설유치원놀이시설  검사완료  이수  가입  <NA>
807  808  585501  유치원  흥양초등학교병설유치원놀이시설②  검사완료  이수  가입  <NA>
808  809  580178  학교  흥업초등학교 친환경 상상놀이터  검사완료  이수  가입  <NA>
809  810  544952  유치원  흥업초등학교병설유치원 놀이시설  검사완료  이수  가입  <NA>
810  811  507397  학교  흥전초등학교 놀이시설  검사완료  이수  가입  <NA>