Overview

Dataset statistics

Number of variables8
Number of observations1738
Missing cells114
Missing cells (%)0.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory108.8 KiB
Average record size in memory64.1 B

Variable types

Unsupported2
Categorical3
Text3

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-2799/F/1/datasetView.do

Alerts

Unnamed: 6 is highly overall correlated with 서울시 어린이보호구역 지정 현황High correlation
서울시 어린이보호구역 지정 현황 is highly overall correlated with Unnamed: 2 and 1 other fieldsHigh correlation
Unnamed: 2 is highly overall correlated with 서울시 어린이보호구역 지정 현황High correlation
서울시 어린이보호구역 지정 현황 is highly imbalanced (94.7%)Imbalance
Unnamed: 0 has 28 (1.6%) missing valuesMissing
Unnamed: 4 has 28 (1.6%) missing valuesMissing
Unnamed: 5 has 28 (1.6%) missing valuesMissing
Unnamed: 7 has 28 (1.6%) missing valuesMissing
Unnamed: 0 is an unsupported type, check if it needs cleaning or further analysisUnsupported
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-13 07:32:02.467095
Analysis finished2024-03-13 07:32:03.298933
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Unnamed: 0
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing28
Missing (%)1.6%
Memory size13.7 KiB

서울시 어린이보호구역 지정 현황
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct6
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size13.7 KiB
서울특별시
1709 
소계
 
25
<NA>
 
1
지방자치단체
 
1
시∙도
 
1

Length

Max length6
Median length5
Mean length4.9539701
Min length2

Unique

Unique4 ?
Unique (%)0.2%

Sample

1st row<NA>
2nd row지방자치단체
3rd row시∙도
4th row합계
5th row소계

Common Values

ValueCountFrequency (%)
서울특별시 1709
98.3%
소계 25
 
1.4%
<NA> 1
 
0.1%
지방자치단체 1
 
0.1%
시∙도 1
 
0.1%
합계 1
 
0.1%

Length

2024-03-13T16:32:03.361242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T16:32:03.473939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
서울특별시 1709
98.3%
소계 25
 
1.4%
na 1
 
0.1%
지방자치단체 1
 
0.1%
시∙도 1
 
0.1%
합계 1
 
0.1%

Unnamed: 2
Categorical

HIGH CORRELATION 

Distinct28
Distinct (%)1.6%
Missing0
Missing (%)0.0%
Memory size13.7 KiB
노원구
 
117
강남구
 
113
성북구
 
103
송파구
 
92
서초구
 
92
Other values (23)
1221 

Length

Max length4
Median length3
Mean length3.0817031
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row<NA>
2nd row<NA>
3rd row자치구
4th row서울시
5th row종로구

Common Values

ValueCountFrequency (%)
노원구 117
 
6.7%
강남구 113
 
6.5%
성북구 103
 
5.9%
송파구 92
 
5.3%
서초구 92
 
5.3%
강서구 90
 
5.2%
양천구 90
 
5.2%
강동구 86
 
4.9%
광진구 76
 
4.4%
동대문구 74
 
4.3%
Other values (18) 805
46.3%

Length

2024-03-13T16:32:03.691979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
노원구 117
 
6.7%
강남구 113
 
6.5%
성북구 103
 
5.9%
송파구 92
 
5.3%
서초구 92
 
5.3%
강서구 90
 
5.2%
양천구 90
 
5.2%
강동구 86
 
4.9%
광진구 76
 
4.4%
동대문구 74
 
4.3%
Other values (18) 805
46.3%
Distinct375
Distinct (%)21.6%
Missing2
Missing (%)0.1%
Memory size13.7 KiB
2024-03-13T16:32:03.949949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length3
Mean length3.1952765
Min length2

Characters and Unicode

Total characters5547
Distinct characters190
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)7.8%

Sample

1st row행정동
2nd row1,709개소
3rd row47개소
4th row홍지동
5th row경운동
ValueCountFrequency (%)
중계동 38
 
2.2%
상계동 37
 
2.1%
신림동 36
 
2.1%
신정동 35
 
2.0%
봉천동 30
 
1.7%
화곡동 29
 
1.7%
목동 28
 
1.6%
대치동 28
 
1.6%
신월동 26
 
1.5%
신당동 21
 
1.2%
Other values (356) 1428
82.3%
2024-03-13T16:32:04.303933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1714
30.9%
186
 
3.4%
101
 
1.8%
91
 
1.6%
78
 
1.4%
78
 
1.4%
68
 
1.2%
65
 
1.2%
65
 
1.2%
64
 
1.2%
Other values (180) 3037
54.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5304
95.6%
Decimal Number 214
 
3.9%
Space Separator 23
 
0.4%
Other Punctuation 6
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1714
32.3%
186
 
3.5%
101
 
1.9%
91
 
1.7%
78
 
1.5%
78
 
1.5%
68
 
1.3%
65
 
1.2%
65
 
1.2%
64
 
1.2%
Other values (167) 2794
52.7%
Decimal Number
ValueCountFrequency (%)
1 59
27.6%
3 41
19.2%
2 40
18.7%
4 19
 
8.9%
5 14
 
6.5%
7 11
 
5.1%
6 11
 
5.1%
9 7
 
3.3%
0 6
 
2.8%
8 6
 
2.8%
Other Punctuation
ValueCountFrequency (%)
. 4
66.7%
, 2
33.3%
Space Separator
ValueCountFrequency (%)
23
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5304
95.6%
Common 243
 
4.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1714
32.3%
186
 
3.5%
101
 
1.9%
91
 
1.7%
78
 
1.5%
78
 
1.5%
68
 
1.3%
65
 
1.2%
65
 
1.2%
64
 
1.2%
Other values (167) 2794
52.7%
Common
ValueCountFrequency (%)
1 59
24.3%
3 41
16.9%
2 40
16.5%
23
 
9.5%
4 19
 
7.8%
5 14
 
5.8%
7 11
 
4.5%
6 11
 
4.5%
9 7
 
2.9%
0 6
 
2.5%
Other values (3) 12
 
4.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5304
95.6%
ASCII 243
 
4.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1714
32.3%
186
 
3.5%
101
 
1.9%
91
 
1.7%
78
 
1.5%
78
 
1.5%
68
 
1.3%
65
 
1.2%
65
 
1.2%
64
 
1.2%
Other values (167) 2794
52.7%
ASCII
ValueCountFrequency (%)
1 59
24.3%
3 41
16.9%
2 40
16.5%
23
 
9.5%
4 19
 
7.8%
5 14
 
5.8%
7 11
 
4.5%
6 11
 
4.5%
9 7
 
2.9%
0 6
 
2.5%
Other values (3) 12
 
4.9%

Unnamed: 4
Text

MISSING 

Distinct1545
Distinct (%)90.4%
Missing28
Missing (%)1.6%
Memory size13.7 KiB
2024-03-13T16:32:04.628857image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length14
Median length12
Mean length8.8192982
Min length5

Characters and Unicode

Total characters15081
Distinct characters277
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1382 ?
Unique (%)80.8%

Sample

1st row도로명 주소
2nd row홍지문2길 1
3rd row삼일대로 446
4th row대학로 64
5th row통일로12길 23
ValueCountFrequency (%)
23 31
 
0.9%
22 31
 
0.9%
21 29
 
0.8%
14 27
 
0.8%
9 26
 
0.8%
13 25
 
0.7%
32 25
 
0.7%
11 25
 
0.7%
17 24
 
0.7%
31 24
 
0.7%
Other values (1721) 3145
92.2%
2024-03-13T16:32:05.057304image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1723
 
11.4%
1614
 
10.7%
1 1209
 
8.0%
1157
 
7.7%
2 889
 
5.9%
3 773
 
5.1%
4 592
 
3.9%
5 544
 
3.6%
6 525
 
3.5%
7 404
 
2.7%
Other values (267) 5651
37.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7131
47.3%
Decimal Number 6031
40.0%
Space Separator 1724
 
11.4%
Dash Punctuation 195
 
1.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1614
22.6%
1157
 
16.2%
181
 
2.5%
159
 
2.2%
119
 
1.7%
89
 
1.2%
87
 
1.2%
85
 
1.2%
83
 
1.2%
80
 
1.1%
Other values (254) 3477
48.8%
Decimal Number
ValueCountFrequency (%)
1 1209
20.0%
2 889
14.7%
3 773
12.8%
4 592
9.8%
5 544
9.0%
6 525
8.7%
7 404
 
6.7%
0 386
 
6.4%
9 371
 
6.2%
8 338
 
5.6%
Space Separator
ValueCountFrequency (%)
1723
99.9%
  1
 
0.1%
Dash Punctuation
ValueCountFrequency (%)
- 195
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7950
52.7%
Hangul 7131
47.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1614
22.6%
1157
 
16.2%
181
 
2.5%
159
 
2.2%
119
 
1.7%
89
 
1.2%
87
 
1.2%
85
 
1.2%
83
 
1.2%
80
 
1.1%
Other values (254) 3477
48.8%
Common
ValueCountFrequency (%)
1723
21.7%
1 1209
15.2%
2 889
11.2%
3 773
9.7%
4 592
 
7.4%
5 544
 
6.8%
6 525
 
6.6%
7 404
 
5.1%
0 386
 
4.9%
9 371
 
4.7%
Other values (3) 534
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7949
52.7%
Hangul 7131
47.3%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1723
21.7%
1 1209
15.2%
2 889
11.2%
3 773
9.7%
4 592
 
7.4%
5 544
 
6.8%
6 525
 
6.6%
7 404
 
5.1%
0 386
 
4.9%
9 371
 
4.7%
Other values (2) 533
 
6.7%
Hangul
ValueCountFrequency (%)
1614
22.6%
1157
 
16.2%
181
 
2.5%
159
 
2.2%
119
 
1.7%
89
 
1.2%
87
 
1.2%
85
 
1.2%
83
 
1.2%
80
 
1.1%
Other values (254) 3477
48.8%
None
ValueCountFrequency (%)
  1
100.0%

Unnamed: 5
Text

MISSING 

Distinct1656
Distinct (%)96.8%
Missing28
Missing (%)1.6%
Memory size13.7 KiB
2024-03-13T16:32:05.255017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length18
Mean length7.7619883
Min length3

Characters and Unicode

Total characters13273
Distinct characters434
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1609 ?
Unique (%)94.1%

Sample

1st row시설명
2nd row상명대학교사범대학부속초등학교
3rd row서울교동초등학교
4th row서울대학교사범대학부설초등학교
5th row서울독립문초등학교
ValueCountFrequency (%)
구립 62
 
3.5%
이화어린이집 5
 
0.3%
사랑유치원 4
 
0.2%
예일유치원 3
 
0.2%
선재어린이집 3
 
0.2%
행복한어린이집 3
 
0.2%
어린왕자어린이집 2
 
0.1%
초롱별어린이집 2
 
0.1%
성결유치원 2
 
0.1%
청운어린이집 2
 
0.1%
Other values (1655) 1699
95.1%
2024-03-13T16:32:05.559161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
863
 
6.5%
809
 
6.1%
809
 
6.1%
769
 
5.8%
758
 
5.7%
756
 
5.7%
638
 
4.8%
561
 
4.2%
528
 
4.0%
520
 
3.9%
Other values (424) 6262
47.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 13088
98.6%
Space Separator 77
 
0.6%
Decimal Number 47
 
0.4%
Uppercase Letter 24
 
0.2%
Close Punctuation 18
 
0.1%
Open Punctuation 18
 
0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
863
 
6.6%
809
 
6.2%
809
 
6.2%
769
 
5.9%
758
 
5.8%
756
 
5.8%
638
 
4.9%
561
 
4.3%
528
 
4.0%
520
 
4.0%
Other values (398) 6077
46.4%
Uppercase Letter
ValueCountFrequency (%)
S 4
16.7%
E 3
12.5%
K 3
12.5%
O 2
8.3%
L 2
8.3%
G 2
8.3%
T 2
8.3%
Y 1
 
4.2%
B 1
 
4.2%
D 1
 
4.2%
Other values (3) 3
12.5%
Decimal Number
ValueCountFrequency (%)
2 13
27.7%
1 8
17.0%
3 7
14.9%
4 7
14.9%
5 4
 
8.5%
9 3
 
6.4%
0 3
 
6.4%
8 1
 
2.1%
6 1
 
2.1%
Space Separator
ValueCountFrequency (%)
77
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 13088
98.6%
Common 161
 
1.2%
Latin 24
 
0.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
863
 
6.6%
809
 
6.2%
809
 
6.2%
769
 
5.9%
758
 
5.8%
756
 
5.8%
638
 
4.9%
561
 
4.3%
528
 
4.0%
520
 
4.0%
Other values (398) 6077
46.4%
Common
ValueCountFrequency (%)
77
47.8%
) 18
 
11.2%
( 18
 
11.2%
2 13
 
8.1%
1 8
 
5.0%
3 7
 
4.3%
4 7
 
4.3%
5 4
 
2.5%
9 3
 
1.9%
0 3
 
1.9%
Other values (3) 3
 
1.9%
Latin
ValueCountFrequency (%)
S 4
16.7%
E 3
12.5%
K 3
12.5%
O 2
8.3%
L 2
8.3%
G 2
8.3%
T 2
8.3%
Y 1
 
4.2%
B 1
 
4.2%
D 1
 
4.2%
Other values (3) 3
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 13088
98.6%
ASCII 185
 
1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
863
 
6.6%
809
 
6.2%
809
 
6.2%
769
 
5.9%
758
 
5.8%
756
 
5.8%
638
 
4.9%
561
 
4.3%
528
 
4.0%
520
 
4.0%
Other values (398) 6077
46.4%
ASCII
ValueCountFrequency (%)
77
41.6%
) 18
 
9.7%
( 18
 
9.7%
2 13
 
7.0%
1 8
 
4.3%
3 7
 
3.8%
4 7
 
3.8%
S 4
 
2.2%
5 4
 
2.2%
9 3
 
1.6%
Other values (16) 26
 
14.1%

Unnamed: 6
Categorical

HIGH CORRELATION 

Distinct11
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size13.7 KiB
초등학교
606 
어린이집
496 
유치원
363 
병설유치원
143 
학원
 
57
Other values (6)
73 

Length

Max length17
Median length4
Mean length3.8245109
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row(2022. 12. 31.기준)
2nd row시설유형
3rd row<NA>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
초등학교 606
34.9%
어린이집 496
28.5%
유치원 363
20.9%
병설유치원 143
 
8.2%
학원 57
 
3.3%
특수학교 28
 
1.6%
<NA> 27
 
1.6%
외국인학교 12
 
0.7%
단설유치원 4
 
0.2%
(2022. 12. 31.기준) 1
 
0.1%

Length

2024-03-13T16:32:05.690322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
초등학교 606
34.8%
어린이집 496
28.5%
유치원 363
20.9%
병설유치원 143
 
8.2%
학원 57
 
3.3%
특수학교 28
 
1.6%
na 27
 
1.6%
외국인학교 12
 
0.7%
단설유치원 4
 
0.2%
2022 1
 
0.1%
Other values (3) 3
 
0.2%

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing28
Missing (%)1.6%
Memory size13.7 KiB

Correlations

2024-03-13T16:32:05.801180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서울시 어린이보호구역 지정 현황Unnamed: 2Unnamed: 6
서울시 어린이보호구역 지정 현황1.0000.9441.000
Unnamed: 20.9441.0000.332
Unnamed: 61.0000.3321.000
2024-03-13T16:32:05.917995image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 6서울시 어린이보호구역 지정 현황Unnamed: 2
Unnamed: 61.0000.9980.138
서울시 어린이보호구역 지정 현황0.9981.0000.808
Unnamed: 20.1380.8081.000
2024-03-13T16:32:05.994403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
서울시 어린이보호구역 지정 현황Unnamed: 2Unnamed: 6
서울시 어린이보호구역 지정 현황1.0000.8080.998
Unnamed: 20.8081.0000.138
Unnamed: 60.9980.1381.000

Missing values

2024-03-13T16:32:03.000648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T16:32:03.106178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-03-13T16:32:03.218504image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Unnamed: 0서울시 어린이보호구역 지정 현황Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
0NaN<NA><NA><NA><NA><NA>(2022. 12. 31.기준)NaN
1연번지방자치단체<NA><NA><NA>시설명시설유형지정연도
2NaN시∙도자치구행정동도로명 주소<NA><NA>NaN
3NaN합계서울시1,709개소<NA><NA><NA>NaN
4NaN소계종로구47개소<NA><NA><NA>NaN
51서울특별시종로구홍지동홍지문2길 1상명대학교사범대학부속초등학교초등학교2006
62서울특별시종로구경운동삼일대로 446서울교동초등학교초등학교2006
73서울특별시종로구이화동대학로 64서울대학교사범대학부설초등학교초등학교1995
84서울특별시종로구무악동통일로12길 23서울독립문초등학교초등학교2005
95서울특별시종로구필운동사직로9길 19서울매동초등학교초등학교2005
Unnamed: 0서울시 어린이보호구역 지정 현황Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7
172876서울특별시강동구상일1동상암로 369주몽학교특수학교1997-02-24 00:00:00
172977서울특별시강동구고덕제2동고덕로 295-59한국구화학교특수학교1997-02-24 00:00:00
173078서울특별시강동구길동천호대로187길 73-10예크트리(YEK TREE)학원학원2021-03-29 00:00:00
173179서울특별시강동구암사동올림픽로104길 36오르다샘앤클래스암사학원학원2021-03-29 00:00:00
173280서울특별시강동구고덕동동남로82길 94-54서울강덕초등학교병설유치원병설유치원2010-08-03 00:00:00
173381서울특별시강동구명일동고덕로46길 53서울고명초등학교병설유치원병설유치원2010-08-03 00:00:00
173482서울특별시강동구길동양재대로116길 69서울길동초등학교병설유치원병설유치원2010-08-03 00:00:00
173583서울특별시강동구둔촌동진황도로61길 29서울선린초등학교병설유치원병설유치원2010-08-03 00:00:00
173684서울특별시강동구둔촌동명일로 23서울위례초등학교병설유치원병설유치원2010-08-03 00:00:00
173785서울특별시강동구상일동동남로 832한영중고등학교병설한영유치원병설유치원2020-10-08 00:00:00