Overview

Dataset statistics

Number of variables: 7
Number of observations: 142
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 8.2 KiB
Average record size in memory: 58.9 B

Variable types

Numeric: 1
Categorical: 4
Text: 2

Dataset

Description: File download
Author: 서울특별시 (Seoul Metropolitan Government)
URL: https://data.seoul.go.kr/dataList/OA-22256/F/1/datasetView.do

Alerts

시도 has constant value "서울특별시" (Constant)
연번 is highly overall correlated with 자치구명 (High correlation)
자치구명 is highly overall correlated with 연번 (High correlation)
연번 has unique values (Unique)
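
One plausible reading of the high-correlation alert, offered as an assumption rather than something the report states: the rows appear to be sorted by district, so 연번 (a running row number) rises in step with 자치구명. The sketch below checks this on the first ten rows listed in the Sample section. `head_districts` and `run_lengths` are illustrative names, and ten rows cannot confirm the ordering for all 142 records.

```python
from itertools import groupby

# 자치구명 of the first ten rows (연번 1-10), copied from the Sample section.
head_districts = [
    "종로구", "종로구", "중구", "중구", "중구",
    "용산구", "성동구", "광진구", "광진구", "광진구",
]


def run_lengths(values):
    """Collapse consecutive equal values into (value, run length) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


runs = run_lengths(head_districts)

# If every district forms exactly one run, the rows are grouped by district.
is_grouped = len(runs) == len(set(head_districts))

print(runs)
print("grouped by district:", is_grouped)
```

If the same check holds on the full file, a row counter and a sorted categorical column will be strongly associated by construction, which says little about the facilities themselves.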

Reproduction

Analysis started: 2024-01-05 22:27:37.702032
Analysis finished: 2024-01-05 22:27:39.592752
Duration: 1.89 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

HIGH CORRELATION, UNIQUE

Distinct: 142
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 71.5
Minimum: 1
Maximum: 142
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 8.05
Q1: 36.25
Median: 71.5
Q3: 106.75
95-th percentile: 134.95
Maximum: 142
Range: 141
Interquartile range (IQR): 70.5
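
The quantile values above are consistent with linear interpolation over the sequence 1 to 142. The following is a minimal sketch using only Python's standard library. It assumes that 연번 is exactly that sequence (which the Distinct, Minimum and Maximum fields suggest) and that the report interpolates inclusively between the smallest and largest value; `serial` is an illustrative name.

```python
import statistics

# Assumption: 연번 is the integer sequence 1..142.
serial = list(range(1, 143))

# method="inclusive" interpolates linearly between min and max.
q1, median, q3 = statistics.quantiles(serial, n=4, method="inclusive")

# Cutting into 20 groups gives the 5th, 10th, ..., 95th percentiles.
twentieths = statistics.quantiles(serial, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

value_range = max(serial) - min(serial)
iqr = q3 - q1

print(p5, q1, median, q3, p95)
print("range:", value_range, "IQR:", iqr)
```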

Descriptive statistics

Standard deviation: 41.135953
Coefficient of variation (CV): 0.57532802
Kurtosis: -1.2
Mean: 71.5
Median Absolute Deviation (MAD): 35.5
Skewness: 0
Sum: 10153
Variance: 1692.1667
Monotonicity: Strictly increasing
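
Most of the descriptive statistics above can be re-derived by hand. The sketch below again assumes that 연번 is the sequence 1 to 142 and that the variance uses the sample (n - 1) denominator, which is the choice that matches the reported 1692.1667. Kurtosis and skewness are left out because their exact estimator is not stated in the report.

```python
import math
import statistics

# Assumption: 연번 is the integer sequence 1..142.
serial = list(range(1, 143))

total = sum(serial)
mean = statistics.mean(serial)
variance = statistics.variance(serial)  # sample variance, n - 1 denominator
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation

# Median absolute deviation: median distance from the median.
median = statistics.median(serial)
mad = statistics.median(abs(x - median) for x in serial)

# Strictly increasing: every value is larger than its predecessor.
strictly_increasing = all(a < b for a, b in zip(serial, serial[1:]))

print(total, mean, round(variance, 4), round(std_dev, 6), round(cv, 8), mad)
print("strictly increasing:", strictly_increasing)
```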
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.7%
99 | 1 | 0.7%
93 | 1 | 0.7%
94 | 1 | 0.7%
95 | 1 | 0.7%
96 | 1 | 0.7%
97 | 1 | 0.7%
98 | 1 | 0.7%
100 | 1 | 0.7%
91 | 1 | 0.7%
Other values (132) | 132 | 93.0%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.7%
2 | 1 | 0.7%
3 | 1 | 0.7%
4 | 1 | 0.7%
5 | 1 | 0.7%
6 | 1 | 0.7%
7 | 1 | 0.7%
8 | 1 | 0.7%
9 | 1 | 0.7%
10 | 1 | 0.7%

Maximum 10 values

Value | Count | Frequency (%)
142 | 1 | 0.7%
141 | 1 | 0.7%
140 | 1 | 0.7%
139 | 1 | 0.7%
138 | 1 | 0.7%
137 | 1 | 0.7%
136 | 1 | 0.7%
135 | 1 | 0.7%
134 | 1 | 0.7%
133 | 1 | 0.7%

시도 (province or metropolitan city)
Categorical

CONSTANT

Distinct: 1
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
서울특별시: 142

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울특별시
2nd row: 서울특별시
3rd row: 서울특별시
4th row: 서울특별시
5th row: 서울특별시

Common Values

Value | Count | Frequency (%)
서울특별시 | 142 | 100.0%

Length

Histogram of lengths of the category


자치구명 (district name)
Categorical

HIGH CORRELATION

Distinct: 21
Distinct (%): 14.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
강남구: 22
서초구: 19
노원구: 15
강서구: 11
동작구: 10
Other values (16): 65

Length

Max length: 4
Median length: 3
Mean length: 3.0492958
Min length: 2

Unique

Unique: 3
Unique (%): 2.1%

Sample

1st row: 종로구
2nd row: 종로구
3rd row: 중구
4th row: 중구
5th row: 중구

Common Values

Value | Count | Frequency (%)
강남구 | 22 | 15.5%
서초구 | 19 | 13.4%
노원구 | 15 | 10.6%
강서구 | 11 | 7.7%
동작구 | 10 | 7.0%
동대문구 | 9 | 6.3%
은평구 | 8 | 5.6%
강동구 | 7 | 4.9%
광진구 | 6 | 4.2%
관악구 | 6 | 4.2%
Other values (11) | 29 | 20.4%

Length

Histogram of lengths of the category

도로명 주소(동명) (road-name address with neighborhood name)
Text

Distinct: 123
Distinct (%): 86.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 19
Median length: 16
Mean length: 11.584507
Min length: 3

Characters and Unicode

Total characters: 1645
Distinct characters: 168
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 119
Unique (%): 83.8%

Sample

1st row: 우정국로 51-4(수송동)
2nd row: 숭인동길 47(숭인동)
3rd row: 퇴계로 447(황학동)
4th row: 동호로14길 11-8(신당동)
5th row: 만리재로35길 12(만리동1가)

Words

Value | Count | Frequency (%)
대치동 | 16 | 6.0%
중계로 | 9 | 3.4%
한글비석로 | 6 | 2.3%
232(중계동 | 3 | 1.1%
통일로 | 2 | 0.8%
논현로5길 | 2 | 0.8%
송파대로37길 | 2 | 0.8%
90 | 2 | 0.8%
사당로16길 | 2 | 0.8%
36-8 | 2 | 0.8%
Other values (217) | 219 | 82.6%

Most occurring characters

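Value | Count | Frequency (%)
(space) | 139 | 8.4%
(character lost) | 120 | 7.3%
(character lost) | 114 | 6.9%
( | 89 | 5.4%
) | 89 | 5.4%
1 | 89 | 5.4%
(character lost) | 78 | 4.7%
2 | 72 | 4.4%
3 | 55 | 3.3%
5 | 42 | 2.6%
Other values (158) | 758 | 46.1%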

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 848 | 51.6%
Decimal Number | 457 | 27.8%
Space Separator | 139 | 8.4%
Open Punctuation | 89 | 5.4%
Close Punctuation | 89 | 5.4%
Dash Punctuation | 23 | 1.4%

Most frequent character per category

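Other Letter
Value | Count | Frequency (%)
(character lost) | 120 | 14.2%
(character lost) | 114 | 13.4%
(character lost) | 78 | 9.2%
(character lost) | 31 | 3.7%
(character lost) | 27 | 3.2%
(character lost) | 26 | 3.1%
(character lost) | 18 | 2.1%
(character lost) | 15 | 1.8%
(character lost) | 12 | 1.4%
(character lost) | 11 | 1.3%
Other values (144) | 396 | 46.7%

Decimal Number
Value | Count | Frequency (%)
1 | 89 | 19.5%
2 | 72 | 15.8%
3 | 55 | 12.0%
5 | 42 | 9.2%
6 | 42 | 9.2%
0 | 37 | 8.1%
8 | 34 | 7.4%
7 | 31 | 6.8%
4 | 28 | 6.1%
9 | 27 | 5.9%

Space Separator
Value | Count | Frequency (%)
(space) | 139 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 89 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 89 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 23 | 100.0%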

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 848 | 51.6%
Common | 797 | 48.4%

Most frequent character per script

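Hangul
Value | Count | Frequency (%)
(character lost) | 120 | 14.2%
(character lost) | 114 | 13.4%
(character lost) | 78 | 9.2%
(character lost) | 31 | 3.7%
(character lost) | 27 | 3.2%
(character lost) | 26 | 3.1%
(character lost) | 18 | 2.1%
(character lost) | 15 | 1.8%
(character lost) | 12 | 1.4%
(character lost) | 11 | 1.3%
Other values (144) | 396 | 46.7%

Common
Value | Count | Frequency (%)
(space) | 139 | 17.4%
( | 89 | 11.2%
) | 89 | 11.2%
1 | 89 | 11.2%
2 | 72 | 9.0%
3 | 55 | 6.9%
5 | 42 | 5.3%
6 | 42 | 5.3%
0 | 37 | 4.6%
8 | 34 | 4.3%
Other values (4) | 109 | 13.7%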

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 848 | 51.6%
ASCII | 797 | 48.4%

Most frequent character per block

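ASCII
Value | Count | Frequency (%)
(space) | 139 | 17.4%
( | 89 | 11.2%
) | 89 | 11.2%
1 | 89 | 11.2%
2 | 72 | 9.0%
3 | 55 | 6.9%
5 | 42 | 5.3%
6 | 42 | 5.3%
0 | 37 | 4.6%
8 | 34 | 4.3%
Other values (4) | 109 | 13.7%

Hangul
Value | Count | Frequency (%)
(character lost) | 120 | 14.2%
(character lost) | 114 | 13.4%
(character lost) | 78 | 9.2%
(character lost) | 31 | 3.7%
(character lost) | 27 | 3.2%
(character lost) | 26 | 3.1%
(character lost) | 18 | 2.1%
(character lost) | 15 | 1.8%
(character lost) | 12 | 1.4%
(character lost) | 11 | 1.3%
Other values (144) | 396 | 46.7%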

시설명 (facility name)
Text

Distinct: 141
Distinct (%): 99.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB

Length

Max length: 14
Median length: 12
Mean length: 5.028169
Min length: 2

Characters and Unicode

Total characters: 714
Distinct characters: 241
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 140
Unique (%): 98.6%

Sample

1st row: 선재
2nd row: 도담도담
3rd row: 미담
4th row: 햇살
5th row: 중림

Words

Value | Count | Frequency (%)
보습학원 | 3 | 2.0%
미담 | 2 | 1.3%
서울프랑스학교 | 1 | 0.7%
방배4동 | 1 | 0.7%
서초1동하은 | 1 | 0.7%
남태령 | 1 | 0.7%
구립서초남서울 | 1 | 0.7%
텀블랜드 | 1 | 0.7%
양재별하 | 1 | 0.7%
양재목련 | 1 | 0.7%
Other values (137) | 137 | 91.3%

Most occurring characters

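Value | Count | Frequency (%)
(character lost) | 40 | 5.6%
(character lost) | 38 | 5.3%
(character lost) | 15 | 2.1%
(character lost) | 15 | 2.1%
(character lost) | 14 | 2.0%
(character lost) | 13 | 1.8%
(character lost) | 11 | 1.5%
(character lost) | 10 | 1.4%
(character lost) | 10 | 1.4%
(character lost) | 10 | 1.4%
Other values (231) | 538 | 75.4%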

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 682 | 95.5%
Decimal Number | 15 | 2.1%
Space Separator | 8 | 1.1%
Open Punctuation | 3 | 0.4%
Close Punctuation | 3 | 0.4%
Uppercase Letter | 3 | 0.4%

Most frequent character per category

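Other Letter
Value | Count | Frequency (%)
(character lost) | 40 | 5.9%
(character lost) | 38 | 5.6%
(character lost) | 15 | 2.2%
(character lost) | 15 | 2.2%
(character lost) | 14 | 2.1%
(character lost) | 13 | 1.9%
(character lost) | 11 | 1.6%
(character lost) | 10 | 1.5%
(character lost) | 10 | 1.5%
(character lost) | 10 | 1.5%
Other values (218) | 506 | 74.2%

Decimal Number
Value | Count | Frequency (%)
2 | 3 | 20.0%
0 | 3 | 20.0%
9 | 3 | 20.0%
1 | 2 | 13.3%
4 | 2 | 13.3%
5 | 1 | 6.7%
8 | 1 | 6.7%

Uppercase Letter
Value | Count | Frequency (%)
C | 1 | 33.3%
M | 1 | 33.3%
S | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 8 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 3 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 3 | 100.0%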

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 682 | 95.5%
Common | 29 | 4.1%
Latin | 3 | 0.4%

Most frequent character per script

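Hangul
Value | Count | Frequency (%)
(character lost) | 40 | 5.9%
(character lost) | 38 | 5.6%
(character lost) | 15 | 2.2%
(character lost) | 15 | 2.2%
(character lost) | 14 | 2.1%
(character lost) | 13 | 1.9%
(character lost) | 11 | 1.6%
(character lost) | 10 | 1.5%
(character lost) | 10 | 1.5%
(character lost) | 10 | 1.5%
Other values (218) | 506 | 74.2%

Common
Value | Count | Frequency (%)
(space) | 8 | 27.6%
2 | 3 | 10.3%
0 | 3 | 10.3%
9 | 3 | 10.3%
( | 3 | 10.3%
) | 3 | 10.3%
1 | 2 | 6.9%
4 | 2 | 6.9%
5 | 1 | 3.4%
8 | 1 | 3.4%

Latin
Value | Count | Frequency (%)
C | 1 | 33.3%
M | 1 | 33.3%
S | 1 | 33.3%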

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 682 | 95.5%
ASCII | 32 | 4.5%

Most frequent character per block

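Hangul
Value | Count | Frequency (%)
(character lost) | 40 | 5.9%
(character lost) | 38 | 5.6%
(character lost) | 15 | 2.2%
(character lost) | 15 | 2.2%
(character lost) | 14 | 2.1%
(character lost) | 13 | 1.9%
(character lost) | 11 | 1.6%
(character lost) | 10 | 1.5%
(character lost) | 10 | 1.5%
(character lost) | 10 | 1.5%
Other values (218) | 506 | 74.2%

ASCII
Value | Count | Frequency (%)
(space) | 8 | 25.0%
2 | 3 | 9.4%
0 | 3 | 9.4%
9 | 3 | 9.4%
( | 3 | 9.4%
) | 3 | 9.4%
1 | 2 | 6.2%
4 | 2 | 6.2%
5 | 1 | 3.1%
8 | 1 | 3.1%
Other values (3) | 3 | 9.4%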

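시설종류 (facility type)
Categorical

Distinct: 8
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
어린이집: 77
학원: 43
유치원: 8
초등학교: 7
외국인학교: 3
Other values (3): 4

Length

Max length: 5
Median length: 4
Mean length: 3.3591549
Min length: 2

Unique

Unique: 2
Unique (%): 1.4%

Sample

1st row: 어린이집
2nd row: 어린이집
3rd row: 어린이집
4th row: 어린이집
5th row: 어린이집

Common Values

Value | Count | Frequency (%)
어린이집 (daycare center) | 77 | 54.2%
학원 (private academy) | 43 | 30.3%
유치원 (kindergarten) | 8 | 5.6%
초등학교 (elementary school) | 7 | 4.9%
외국인학교 (foreign school) | 3 | 2.1%
특수학교 (special school) | 2 | 1.4%
독서실 (study room) | 1 | 0.7%
병설유치원 (attached kindergarten) | 1 | 0.7%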
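
Because this frequency table lists every category, the summary figures for 시설종류 can be cross-checked from it. The sketch below rebuilds the column from the counts and recomputes the distinct count, the unique count (values that occur exactly once), the frequency percentages and the length statistics. `facility_counts` is transcribed from the table; the other names are illustrative.

```python
from collections import Counter
from statistics import mean, median

# Counts transcribed from the Common Values table for 시설종류.
facility_counts = {
    "어린이집": 77, "학원": 43, "유치원": 8, "초등학교": 7,
    "외국인학교": 3, "특수학교": 2, "독서실": 1, "병설유치원": 1,
}

# Rebuild the column (row order does not matter for these statistics).
column = [value for value, n in facility_counts.items() for _ in range(n)]
counts = Counter(column)
n_rows = len(column)

distinct = len(counts)                                # different values
unique = sum(1 for c in counts.values() if c == 1)    # values seen once
frequency_pct = {v: round(100 * c / n_rows, 1) for v, c in counts.items()}

# Length statistics are taken over rows, not over distinct values.
lengths = [len(value) for value in column]
mean_length = mean(lengths)
median_length = median(lengths)

print(n_rows, distinct, unique, round(100 * unique / n_rows, 1))
print(frequency_pct)
print(round(mean_length, 7), median_length, min(lengths), max(lengths))
```

The counts sum to 142 and reproduce the reported Distinct (8), Unique (2, 1.4%), Mean length (3.3591549) and Median length (4).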

Length

Histogram of lengths of the category


지정연도 (designation year)
Categorical

Distinct: 3
Distinct (%): 2.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.2 KiB
2020: 73
2019: 42
2018: 27

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2019
2nd row: 2019
3rd row: 2019
4th row: 2020
5th row: 2018

Common Values

Value | Count | Frequency (%)
2020 | 73 | 51.4%
2019 | 42 | 29.6%
2018 | 27 | 19.0%

Length

Histogram of lengths of the category



Correlations

Variable | 연번 | 자치구명 | 시설종류 | 지정연도
연번 | 1.000 | 0.973 | 0.529 | 0.605
자치구명 | 0.973 | 1.000 | 0.717 | 0.766
시설종류 | 0.529 | 0.717 | 1.000 | 0.599
지정연도 | 0.605 | 0.766 | 0.599 | 1.000

Variable | 자치구명 | 시설종류 | 지정연도
자치구명 | 1.000 | 0.375 | 0.466
시설종류 | 0.375 | 1.000 | 0.459
지정연도 | 0.466 | 0.459 | 1.000

Variable | 연번 | 자치구명 | 시설종류 | 지정연도
연번 | 1.000 | 0.809 | 0.283 | 0.438
자치구명 | 0.809 | 1.000 | 0.375 | 0.466
시설종류 | 0.283 | 0.375 | 1.000 | 0.459
지정연도 | 0.438 | 0.466 | 0.459 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

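First rows

(index) | 연번 | 시도 | 자치구명 | 도로명 주소(동명) | 시설명 | 시설종류 | 지정연도
0 | 1 | 서울특별시 | 종로구 | 우정국로 51-4(수송동) | 선재 | 어린이집 | 2019
1 | 2 | 서울특별시 | 종로구 | 숭인동길 47(숭인동) | 도담도담 | 어린이집 | 2019
2 | 3 | 서울특별시 | 중구 | 퇴계로 447(황학동) | 미담 | 어린이집 | 2019
3 | 4 | 서울특별시 | 중구 | 동호로14길 11-8(신당동) | 햇살 | 어린이집 | 2020
4 | 5 | 서울특별시 | 중구 | 만리재로35길 12(만리동1가) | 중림 | 어린이집 | 2018
5 | 6 | 서울특별시 | 용산구 | 한남대로40길 76(한남동) | 구립맑은숲 | 어린이집 | 2018
6 | 7 | 서울특별시 | 성동구 | 살곶이길 208(사근동) | 한양 | 초등학교 | 2020
7 | 8 | 서울특별시 | 광진구 | 아차산로44가길 5(자양동) | 자양도담 | 어린이집 | 2018
8 | 9 | 서울특별시 | 광진구 | 자양로23길 79(구의동) | 아이터 | 어린이집 | 2019
9 | 10 | 서울특별시 | 광진구 | 자양로35길 13(구의동) | 한국켄트 | 외국인학교 | 2018

Last rows

(index) | 연번 | 시도 | 자치구명 | 도로명 주소(동명) | 시설명 | 시설종류 | 지정연도
132 | 133 | 서울특별시 | 송파구 | 송파대로37길 95(가락동) | 해누리 | 초등학교 | 2019
133 | 134 | 서울특별시 | 송파구 | 백제고분로36가길 21(석촌동) | 예랑 | 어린이집 | 2018
134 | 135 | 서울특별시 | 송파구 | 동남로9길 17-10(가락동) | 은송 | 어린이집 | 2020
135 | 136 | 서울특별시 | 강동구 | 상일로11길 110(상일동) | 고현 | 초등학교 | 2020
136 | 137 | 서울특별시 | 강동구 | 천호대로187길 53-18(길동) | 그리새 | 유치원 | 2020
137 | 138 | 서울특별시 | 강동구 | 동남로 832(상일동) | 한영병설 | 병설유치원 | 2020
138 | 139 | 서울특별시 | 강동구 | 아리수로 46-14(암사동) | 구립해나 | 어린이집 | 2018
139 | 140 | 서울특별시 | 강동구 | 성내로18길 19(성내동) | 성내참사랑 | 어린이집 | 2018
140 | 141 | 서울특별시 | 강동구 | 고덕로80길 134(상일동) | 고덕숲 | 어린이집 | 2019
141 | 142 | 서울특별시 | 강동구 | 올림픽로79길 46(천호동) | 구립다온 | 어린이집 | 2020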