Overview

Dataset statistics

Number of variables: 6
Number of observations: 1590
Missing cells: 3
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 74.7 KiB
Average record size in memory: 48.1 B
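
The derived figures in this block follow from the raw counts. As a quick sanity check (a sketch, not part of the profiling tool; the variable names are illustrative and the total size is the rounded 74.7 KiB printed in the report), the missing-cell percentage and the average record size can be recomputed:

```python
# Recompute two derived figures from the dataset statistics.
n_variables = 6
n_observations = 1590
missing_cells = 3
total_size_kib = 74.7  # rounded value as printed in the report

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"missing cells: {missing_pct:.3f}%")              # 0.031%, shown as "< 0.1%"
print(f"average record size: {avg_record_bytes:.1f} B")  # 48.1 B
```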

Variable types

Categorical: 3
Text: 3

Dataset

Description: Daycare center status (어린이집현황) as of December 2014
Author: Daegu Metropolitan City (대구광역시)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=15054165&dataSetDetailId=15054165823ddf80c2b5&provdMethod=FILE

Alerts

Unnamed: 1 is highly overall correlated with 대구광역시 어린이집 현황 (High correlation)
Unnamed: 2 is highly overall correlated with 대구광역시 어린이집 현황 (High correlation)
대구광역시 어린이집 현황 is highly overall correlated with Unnamed: 1 and 1 other field (High correlation)
대구광역시 어린이집 현황 is highly imbalanced (99.0%) (Imbalance)
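
The 99.0% imbalance figure can be reproduced from the value counts reported for this column (1588, 1, 1). The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum, log2 of the number of classes. That definition is an assumption here, chosen because it matches the reported number, and `imbalance_score` is an illustrative name rather than the tool's API.

```python
import math

def imbalance_score(counts):
    """1 - H(p) / log2(k): 0 for a uniform distribution, near 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Value counts of 대구광역시 어린이집 현황 as listed under Common Values.
score = imbalance_score([1588, 1, 1])
print(f"{score:.1%}")  # 99.0%
```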

Reproduction

Analysis started: 2024-04-18 07:46:52.955891
Analysis finished: 2024-04-18 07:46:53.806672
Duration: 0.85 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

대구광역시 어린이집 현황
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 3
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Value | Count
대구광역시 | 1588
기준년월 : 2014년 12월 | 1
시도 | 1

Length

Max length: 25
Median length: 5
Mean length: 5.0106918
Min length: 2

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: 기준년월 : 2014년 12월
2nd row: 시도
3rd row: 대구광역시
4th row: 대구광역시
5th row: 대구광역시

Common Values

Value | Count | Frequency (%)
대구광역시 (Daegu Metropolitan City) | 1588 | 99.9%
기준년월 : 2014년 12월 (reference month: December 2014) | 1 | 0.1%
시도 (province/city) | 1 | 0.1%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
대구광역시 | 1588 | 99.7%
기준년월 | 1 | 0.1%
(not shown) | 1 | 0.1%
2014년 | 1 | 0.1%
12월 | 1 | 0.1%
시도 | 1 | 0.1%

Unnamed: 1
Categorical

HIGH CORRELATION

Distinct: 10
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Value | Count
달서구 | 424
북구 | 360
수성구 | 218
동구 | 206
달성군 | 141
Other values (5) | 241

Length

Max length: 4
Median length: 2
Mean length: 2.4943396
Min length: 2

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: <NA>
2nd row: 시군구
3rd row: 중구
4th row: 중구
5th row: 중구

Common Values

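Value | Count | Frequency (%)
달서구 (Dalseo-gu) | 424 | 26.7%
북구 (Buk-gu) | 360 | 22.6%
수성구 (Suseong-gu) | 218 | 13.7%
동구 (Dong-gu) | 206 | 13.0%
달성군 (Dalseong-gun) | 141 | 8.9%
서구 (Seo-gu) | 136 | 8.6%
남구 (Nam-gu) | 64 | 4.0%
중구 (Jung-gu) | 39 | 2.5%
<NA> | 1 | 0.1%
시군구 (city/county/district) | 1 | 0.1%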

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
달서구 | 424 | 26.7%
북구 | 360 | 22.6%
수성구 | 218 | 13.7%
동구 | 206 | 13.0%
달성군 | 141 | 8.9%
서구 | 136 | 8.6%
남구 | 64 | 4.0%
중구 | 39 | 2.5%
na | 1 | 0.1%
시군구 | 1 | 0.1%

Unnamed: 2
Categorical

HIGH CORRELATION

Distinct: 9
Distinct (%): 0.6%
Missing: 0
Missing (%): 0.0%
Memory size: 12.6 KiB

Value | Count
민간 | 728
가정 | 639
사회복지법인 | 121
국공립 | 42
법인・단체등 | 34
Other values (4) | 26

Length

Max length: 7
Median length: 2
Mean length: 2.4283019
Min length: 2

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: <NA>
2nd row: 어린이집 유형
3rd row: 민간
4th row: 직장
5th row: 법인・단체등

Common Values

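Value | Count | Frequency (%)
민간 (private) | 728 | 45.8%
가정 (home-based) | 639 | 40.2%
사회복지법인 (social welfare corporation) | 121 | 7.6%
국공립 (national/public) | 42 | 2.6%
법인・단체등 (corporation, organization, etc.) | 34 | 2.1%
직장 (workplace) | 18 | 1.1%
부모협동 (parent cooperative) | 6 | 0.4%
<NA> | 1 | 0.1%
어린이집 유형 (daycare center type) | 1 | 0.1%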

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
민간 | 728 | 45.8%
가정 | 639 | 40.2%
사회복지법인 | 121 | 7.6%
국공립 | 42 | 2.6%
법인・단체등 | 34 | 2.1%
직장 | 18 | 1.1%
부모협동 | 6 | 0.4%
na | 1 | 0.1%
어린이집 | 1 | 0.1%
유형 | 1 | 0.1%

Unnamed: 3
Text

Distinct: 1217
Distinct (%): 76.6%
Missing: 1
Missing (%): 0.1%
Memory size: 12.6 KiB

Length

Max length: 18
Median length: 17
Mean length: 7.2284456
Min length: 5

Characters and Unicode

Total characters: 11486
Distinct characters: 459
Distinct categories: 9
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
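
The category names used in the tables of this report ("Other Letter", "Decimal Number", "Dash Punctuation", "Space Separator") correspond to the Unicode general categories `Lo`, `Nd`, `Pd` and `Zs`. A minimal sketch of counting them with Python's standard `unicodedata` module follows; `category_counts` is an illustrative helper, not the profiler's own code, and the two input strings are taken from the report's sample rows.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories (e.g. 'Lo', 'Nd', 'Pd') over all characters."""
    counts = Counter()
    for value in values:
        counts.update(unicodedata.category(ch) for ch in value)
    return counts

# One daycare name and one phone number from the sample rows.
counts = category_counts(["비둘기어린이집", "053-428-2599"])
print(counts["Lo"], counts["Nd"], counts["Pd"])  # 7 10 2
```

Script and block statistics would need data beyond what `unicodedata` exposes, so they are left out of this sketch.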

Unique

Unique: 976
Unique (%): 61.4%

Sample

1st row: 어린이집명
2nd row: 비둘기어린이집
3rd row: 대구프뢰벨어린이집
4th row: 남산교회어린이집
5th row: 대구삼성어린이집

Value | Count | Frequency (%)
어린이집 | 38 | 2.3%
아이숲어린이집 | 7 | 0.4%
하늘꿈어린이집 | 6 | 0.4%
꿈나무어린이집 | 5 | 0.3%
아이사랑어린이집 | 5 | 0.3%
무지개어린이집 | 5 | 0.3%
사랑어린이집 | 5 | 0.3%
늘푸른어린이집 | 5 | 0.3%
미소어린이집 | 4 | 0.2%
아이뜰어린이집 | 4 | 0.2%
Other values (1213) | 1557 | 94.9%

Most occurring characters

Value | Count | Frequency (%)
(not shown) | 1748 | 15.2%
(not shown) | 1612 | 14.0%
(not shown) | 1597 | 13.9%
(not shown) | 1586 | 13.8%
(not shown) | 226 | 2.0%
(not shown) | 123 | 1.1%
(not shown) | 105 | 0.9%
(not shown) | 93 | 0.8%
(not shown) | 89 | 0.8%
(not shown) | 78 | 0.7%
Other values (449) | 4229 | 36.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 11381 | 99.1%
Space Separator | 52 | 0.5%
Uppercase Letter | 19 | 0.2%
Lowercase Letter | 18 | 0.2%
Dash Punctuation | 8 | 0.1%
Decimal Number | 5 | < 0.1%
Open Punctuation | 1 | < 0.1%
Close Punctuation | 1 | < 0.1%
Other Punctuation | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 1748 | 15.4%
(not shown) | 1612 | 14.2%
(not shown) | 1597 | 14.0%
(not shown) | 1586 | 13.9%
(not shown) | 226 | 2.0%
(not shown) | 123 | 1.1%
(not shown) | 105 | 0.9%
(not shown) | 93 | 0.8%
(not shown) | 89 | 0.8%
(not shown) | 78 | 0.7%
Other values (421) | 4124 | 36.2%

Uppercase Letter
Value | Count | Frequency (%)
B | 4 | 21.1%
A | 3 | 15.8%
G | 3 | 15.8%
I | 2 | 10.5%
W | 1 | 5.3%
D | 1 | 5.3%
S | 1 | 5.3%
H | 1 | 5.3%
K | 1 | 5.3%
C | 1 | 5.3%

Lowercase Letter
Value | Count | Frequency (%)
i | 6 | 33.3%
n | 2 | 11.1%
a | 2 | 11.1%
r | 2 | 11.1%
e | 2 | 11.1%
s | 1 | 5.6%
c | 1 | 5.6%
t | 1 | 5.6%
k | 1 | 5.6%

Decimal Number
Value | Count | Frequency (%)
2 | 3 | 60.0%
3 | 1 | 20.0%
4 | 1 | 20.0%

Space Separator
Value | Count | Frequency (%)
(space) | 52 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 8 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
. | 1 | 100.0%

Most occurring scripts

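Value | Count | Frequency (%)
Hangul | 11381 | 99.1%
Common | 68 | 0.6%
Latin | 37 | 0.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not shown) | 1748 | 15.4%
(not shown) | 1612 | 14.2%
(not shown) | 1597 | 14.0%
(not shown) | 1586 | 13.9%
(not shown) | 226 | 2.0%
(not shown) | 123 | 1.1%
(not shown) | 105 | 0.9%
(not shown) | 93 | 0.8%
(not shown) | 89 | 0.8%
(not shown) | 78 | 0.7%
Other values (421) | 4124 | 36.2%

Latin
Value | Count | Frequency (%)
i | 6 | 16.2%
B | 4 | 10.8%
A | 3 | 8.1%
G | 3 | 8.1%
I | 2 | 5.4%
n | 2 | 5.4%
a | 2 | 5.4%
r | 2 | 5.4%
e | 2 | 5.4%
W | 1 | 2.7%
Other values (10) | 10 | 27.0%

Common
Value | Count | Frequency (%)
(space) | 52 | 76.5%
- | 8 | 11.8%
2 | 3 | 4.4%
3 | 1 | 1.5%
( | 1 | 1.5%
) | 1 | 1.5%
4 | 1 | 1.5%
. | 1 | 1.5%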

Most occurring blocks

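Value | Count | Frequency (%)
Hangul | 11381 | 99.1%
ASCII | 105 | 0.9%

Most frequent character per block

Hangul
Value | Count | Frequency (%)
(not shown) | 1748 | 15.4%
(not shown) | 1612 | 14.2%
(not shown) | 1597 | 14.0%
(not shown) | 1586 | 13.9%
(not shown) | 226 | 2.0%
(not shown) | 123 | 1.1%
(not shown) | 105 | 0.9%
(not shown) | 93 | 0.8%
(not shown) | 89 | 0.8%
(not shown) | 78 | 0.7%
Other values (421) | 4124 | 36.2%

ASCII
Value | Count | Frequency (%)
(space) | 52 | 49.5%
- | 8 | 7.6%
i | 6 | 5.7%
B | 4 | 3.8%
2 | 3 | 2.9%
A | 3 | 2.9%
G | 3 | 2.9%
I | 2 | 1.9%
n | 2 | 1.9%
a | 2 | 1.9%
Other values (18) | 20 | 19.0%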

Unnamed: 4
Text

Distinct: 1587
Distinct (%): 99.9%
Missing: 1
Missing (%): 0.1%
Memory size: 12.6 KiB

Length

Max length: 13
Median length: 12
Mean length: 11.998741
Min length: 4

Characters and Unicode

Total characters: 19066
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1585
Unique (%): 99.7%

Sample

1st row: 전화번호
2nd row: 053-428-2599
3rd row: 053-215-9000
4th row: 053-253-0130
5th row: 053-255-7851

Value | Count | Frequency (%)
053-564-2997 | 2 | 0.1%
053-425-3190 | 2 | 0.1%
053-632-7942 | 1 | 0.1%
053-641-2666 | 1 | 0.1%
053-636-6636 | 1 | 0.1%
053-631-1033 | 1 | 0.1%
053-639-7789 | 1 | 0.1%
053-631-0708 | 1 | 0.1%
053-644-7970 | 1 | 0.1%
053-636-0770 | 1 | 0.1%
Other values (1577) | 1577 | 99.2%
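
The counts for this column show two things worth checking before analysis: a stray header value (전화번호) mixed in with the phone numbers, and two numbers that occur twice. The sketch below flags both. The pattern is an assumption (area code, 3 to 4 digit exchange, 4 digit line number, separated by dashes) and `check_phones` is an illustrative helper, not part of the report.

```python
import re
from collections import Counter

# Assumed format: e.g. 053-428-2599; other valid formats may exist in the data.
PHONE_RE = re.compile(r"0\d{1,2}-\d{3,4}-\d{4}")

def check_phones(values):
    """Return (values not matching the assumed format, values occurring more than once)."""
    invalid = [v for v in values if not PHONE_RE.fullmatch(v)]
    counts = Counter(values)
    duplicated = sorted(v for v, n in counts.items() if n > 1)
    return invalid, duplicated

phones = ["전화번호", "053-428-2599", "053-564-2997", "053-564-2997"]
invalid, duplicated = check_phones(phones)
print(invalid)     # ['전화번호']
print(duplicated)  # ['053-564-2997']
```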

Most occurring characters

Value | Count | Frequency (%)
- | 3176 | 16.7%
5 | 3085 | 16.2%
3 | 2864 | 15.0%
0 | 2451 | 12.9%
6 | 1307 | 6.9%
2 | 1240 | 6.5%
7 | 1122 | 5.9%
1 | 1103 | 5.8%
9 | 951 | 5.0%
4 | 891 | 4.7%
Other values (5) | 876 | 4.6%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 15886 | 83.3%
Dash Punctuation | 3176 | 16.7%
Other Letter | 4 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
5 | 3085 | 19.4%
3 | 2864 | 18.0%
0 | 2451 | 15.4%
6 | 1307 | 8.2%
2 | 1240 | 7.8%
7 | 1122 | 7.1%
1 | 1103 | 6.9%
9 | 951 | 6.0%
4 | 891 | 5.6%
8 | 872 | 5.5%

Other Letter
Value | Count | Frequency (%)
(not shown) | 1 | 25.0%
(not shown) | 1 | 25.0%
(not shown) | 1 | 25.0%
(not shown) | 1 | 25.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 3176 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 19062 | > 99.9%
Hangul | 4 | < 0.1%

Most frequent character per script

Common
Value | Count | Frequency (%)
- | 3176 | 16.7%
5 | 3085 | 16.2%
3 | 2864 | 15.0%
0 | 2451 | 12.9%
6 | 1307 | 6.9%
2 | 1240 | 6.5%
7 | 1122 | 5.9%
1 | 1103 | 5.8%
9 | 951 | 5.0%
4 | 891 | 4.7%

Hangul
Value | Count | Frequency (%)
(not shown) | 1 | 25.0%
(not shown) | 1 | 25.0%
(not shown) | 1 | 25.0%
(not shown) | 1 | 25.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 19062 | > 99.9%
Hangul | 4 | < 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
- | 3176 | 16.7%
5 | 3085 | 16.2%
3 | 2864 | 15.0%
0 | 2451 | 12.9%
6 | 1307 | 6.9%
2 | 1240 | 6.5%
7 | 1122 | 5.9%
1 | 1103 | 5.8%
9 | 951 | 5.0%
4 | 891 | 4.7%

Hangul
Value | Count | Frequency (%)
(not shown) | 1 | 25.0%
(not shown) | 1 | 25.0%
(not shown) | 1 | 25.0%
(not shown) | 1 | 25.0%

Unnamed: 5
Text

Distinct: 1584
Distinct (%): 99.7%
Missing: 1
Missing (%): 0.1%
Memory size: 12.6 KiB

Length

Max length: 51
Median length: 43
Mean length: 31.075519
Min length: 2

Characters and Unicode

Total characters: 49379
Distinct characters: 343
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1580
Unique (%): 99.4%

Sample

1st row: 주소
2nd row: 대구광역시 중구 명륜로26안길 12 (남산1동)
3rd row: 대구광역시 중구 중앙대로62길 15 (남산1동)
4th row: 대구광역시 중구 관덕정길 16 (남산2동)
5th row: 대구광역시 중구 달구벌대로 2016-40 (남산2동)

Value | Count | Frequency (%)
대구광역시 | 1589 | 17.3%
달서구 | 424 | 4.6%
북구 | 363 | 4.0%
수성구 | 218 | 2.4%
동구 | 206 | 2.2%
달성군 | 141 | 1.5%
서구 | 136 | 1.5%
남구 | 64 | 0.7%
101동 | 59 | 0.6%
다사읍 | 55 | 0.6%
Other values (2521) | 5915 | 64.5%

Most occurring characters

Value | Count | Frequency (%)
(space) | 7601 | 15.4%
(not shown) | 3246 | 6.6%
1 | 2458 | 5.0%
(not shown) | 2310 | 4.7%
(not shown) | 1923 | 3.9%
(not shown) | 1674 | 3.4%
(not shown) | 1627 | 3.3%
(not shown) | 1597 | 3.2%
(not shown) | 1530 | 3.1%
0 | 1466 | 3.0%
Other values (333) | 23947 | 48.5%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 29237 | 59.2%
Decimal Number | 9143 | 18.5%
Space Separator | 7601 | 15.4%
Open Punctuation | 1225 | 2.5%
Close Punctuation | 1225 | 2.5%
Other Punctuation | 586 | 1.2%
Dash Punctuation | 327 | 0.7%
Uppercase Letter | 25 | 0.1%
Lowercase Letter | 9 | < 0.1%
Modifier Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not shown) | 3246 | 11.1%
(not shown) | 2310 | 7.9%
(not shown) | 1923 | 6.6%
(not shown) | 1674 | 5.7%
(not shown) | 1627 | 5.6%
(not shown) | 1597 | 5.5%
(not shown) | 1530 | 5.2%
(not shown) | 925 | 3.2%
(not shown) | 834 | 2.9%
(not shown) | 724 | 2.5%
Other values (307) | 12847 | 43.9%

Decimal Number
Value | Count | Frequency (%)
1 | 2458 | 26.9%
0 | 1466 | 16.0%
2 | 1239 | 13.6%
3 | 859 | 9.4%
5 | 750 | 8.2%
4 | 645 | 7.1%
6 | 521 | 5.7%
7 | 496 | 5.4%
9 | 384 | 4.2%
8 | 325 | 3.6%

Uppercase Letter
Value | Count | Frequency (%)
E | 15 | 60.0%
U | 3 | 12.0%
S | 3 | 12.0%
D | 2 | 8.0%
L | 1 | 4.0%
H | 1 | 4.0%

Other Punctuation
Value | Count | Frequency (%)
, | 563 | 96.1%
/ | 19 | 3.2%
. | 4 | 0.7%

Lowercase Letter
Value | Count | Frequency (%)
e | 8 | 88.9%
b | 1 | 11.1%

Space Separator
Value | Count | Frequency (%)
(space) | 7601 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1225 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1225 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 327 | 100.0%

Modifier Symbol
Value | Count | Frequency (%)
` | 1 | 100.0%

Most occurring scripts

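Value | Count | Frequency (%)
Hangul | 29237 | 59.2%
Common | 20108 | 40.7%
Latin | 34 | 0.1%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not shown) | 3246 | 11.1%
(not shown) | 2310 | 7.9%
(not shown) | 1923 | 6.6%
(not shown) | 1674 | 5.7%
(not shown) | 1627 | 5.6%
(not shown) | 1597 | 5.5%
(not shown) | 1530 | 5.2%
(not shown) | 925 | 3.2%
(not shown) | 834 | 2.9%
(not shown) | 724 | 2.5%
Other values (307) | 12847 | 43.9%

Common
Value | Count | Frequency (%)
(space) | 7601 | 37.8%
1 | 2458 | 12.2%
0 | 1466 | 7.3%
2 | 1239 | 6.2%
( | 1225 | 6.1%
) | 1225 | 6.1%
3 | 859 | 4.3%
5 | 750 | 3.7%
4 | 645 | 3.2%
, | 563 | 2.8%
Other values (8) | 2077 | 10.3%

Latin
Value | Count | Frequency (%)
E | 15 | 44.1%
e | 8 | 23.5%
U | 3 | 8.8%
S | 3 | 8.8%
D | 2 | 5.9%
b | 1 | 2.9%
L | 1 | 2.9%
H | 1 | 2.9%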

Most occurring blocks

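Value | Count | Frequency (%)
Hangul | 29237 | 59.2%
ASCII | 20142 | 40.8%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 7601 | 37.7%
1 | 2458 | 12.2%
0 | 1466 | 7.3%
2 | 1239 | 6.2%
( | 1225 | 6.1%
) | 1225 | 6.1%
3 | 859 | 4.3%
5 | 750 | 3.7%
4 | 645 | 3.2%
, | 563 | 2.8%
Other values (16) | 2111 | 10.5%

Hangul
Value | Count | Frequency (%)
(not shown) | 3246 | 11.1%
(not shown) | 2310 | 7.9%
(not shown) | 1923 | 6.6%
(not shown) | 1674 | 5.7%
(not shown) | 1627 | 5.6%
(not shown) | 1597 | 5.5%
(not shown) | 1530 | 5.2%
(not shown) | 925 | 3.2%
(not shown) | 834 | 2.9%
(not shown) | 724 | 2.5%
Other values (307) | 12847 | 43.9%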

Correlations

 | 대구광역시 어린이집 현황 | Unnamed: 1 | Unnamed: 2
대구광역시 어린이집 현황 | 1.000 | 1.000 | 1.000
Unnamed: 1 | 1.000 | 1.000 | 0.650
Unnamed: 2 | 1.000 | 0.650 | 1.000

 | Unnamed: 1 | Unnamed: 2 | 대구광역시 어린이집 현황
Unnamed: 1 | 1.000 | 0.390 | 0.998
Unnamed: 2 | 0.390 | 1.000 | 0.998
대구광역시 어린이집 현황 | 0.998 | 0.998 | 1.000

 | 대구광역시 어린이집 현황 | Unnamed: 1 | Unnamed: 2
대구광역시 어린이집 현황 | 1.000 | 0.998 | 0.998
Unnamed: 1 | 0.998 | 1.000 | 0.390
Unnamed: 2 | 0.998 | 0.390 | 1.000
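
The matrices in this section report pairwise association between the three categorical columns. The exact statistic is not stated in this extract. One common choice for pairs of categorical variables is Cramér's V, sketched below in plain Python under that assumption (uncorrected form; `cramers_v` is an illustrative helper and is not claimed to reproduce the reported values).

```python
import math
from collections import Counter

def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two equally long sequences of category labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row = Counter(xs)
    col = Counter(ys)
    k = min(len(row), len(col))
    if n == 0 or k < 2:
        return 0.0
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * (k - 1)))

# Toy data: the second column is fully determined by the first.
districts = ["중구", "중구", "북구", "북구"]
types = ["민간", "민간", "가정", "가정"]
print(cramers_v(districts, types))  # 1.0
```

In this report, the near-perfect association involving 대구광역시 어린이집 현황 plausibly comes from the two non-data rows at the top of the file, where all three columns take otherwise unseen values at once, rather than from a real relationship in the data.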

Missing values

[Figure: nullity matrix]
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completeness.
[Figure: nullity correlation heatmap]
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

Index | 대구광역시 어린이집 현황 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5
0 | 기준년월 : 2014년 12월 | <NA> | <NA> | <NA> | <NA> | <NA>
1 | 시도 | 시군구 | 어린이집 유형 | 어린이집명 | 전화번호 | 주소
2 | 대구광역시 | 중구 | 민간 | 비둘기어린이집 | 053-428-2599 | 대구광역시 중구 명륜로26안길 12 (남산1동)
3 | 대구광역시 | 중구 | 직장 | 대구프뢰벨어린이집 | 053-215-9000 | 대구광역시 중구 중앙대로62길 15 (남산1동)
4 | 대구광역시 | 중구 | 법인・단체등 | 남산교회어린이집 | 053-253-0130 | 대구광역시 중구 관덕정길 16 (남산2동)
5 | 대구광역시 | 중구 | 민간 | 대구삼성어린이집 | 053-255-7851 | 대구광역시 중구 달구벌대로 2016-40 (남산2동)
6 | 대구광역시 | 중구 | 사회복지법인 | 백합어린이집 | 053-256-6862 | 대구광역시 중구 남산로4길 111 (남산3동)
7 | 대구광역시 | 중구 | 국공립 | 꿈나무어린이집 | 053-252-0655 | 대구광역시 중구 남산로7길 35 (남산4동)
8 | 대구광역시 | 중구 | 국공립 | 동그라미어린이집 | 053-959-7942 | 대구광역시 중구 달구벌대로 1960 남산휴먼시아1단지 103동 101호(남산4동)
9 | 대구광역시 | 중구 | 민간 | 가온어린이집 | 053-422-0707 | 대구광역시 중구 달구벌대로 1970 남산휴먼시아2단지 206동(남산4동)

Index | 대구광역시 어린이집 현황 | Unnamed: 1 | Unnamed: 2 | Unnamed: 3 | Unnamed: 4 | Unnamed: 5
1580 | 대구광역시 | 달성군 | 가정 | 삐아제어린이집 | 053-644-1629 | 대구광역시 달성군 화원읍 명곡로 11 104동 101호(명곡리, 명곡미래빌1단지아파트)
1581 | 대구광역시 | 달성군 | 가정 | 삼주어린이집 | 053-637-6165 | 대구광역시 달성군 화원읍 사문진로 349 화원삼주타운 101동109호
1582 | 대구광역시 | 달성군 | 가정 | 신동아어린이집 | 053-632-1388 | 대구광역시 달성군 화원읍 비슬로539길 38 신동아파밀리에 102동101호
1583 | 대구광역시 | 달성군 | 가정 | 아이랜드어린이집 | 053-634-0770 | 대구광역시 달성군 화원읍 화원로2길 11
1584 | 대구광역시 | 달성군 | 가정 | 아이세상어린이집 | 053-634-2519 | 대구광역시 달성군 명천로17길 7-32 1층
1585 | 대구광역시 | 달성군 | 가정 | 임마누엘어린이집 | 053-642-5919 | 대구광역시 달성군 화원읍 명천로 57
1586 | 대구광역시 | 달성군 | 가정 | 키즈어린이집 | 053-643-0304 | 대구광역시 달성군 화원읍 명곡로 11
1587 | 대구광역시 | 달성군 | 가정 | 태왕리더스 어린이집 | 053-634-6546 | 대구광역시 달성군 화원읍 화원로3길 66 101동 102호(천내리, 화원태왕리더스)
1588 | 대구광역시 | 달성군 | 가정 | 평광어린이집 | 053-635-3566 | 대구광역시 달성군 화원읍 비슬로525길 37 207동 105호(천내리, 화원평광2차아파트)
1589 | 대구광역시 | 달성군 | 가정 | 하늘정원어린이집 | 053-268-0228 | 대구광역시 달성군 화원읍 사문진로6길 17 한샘아파트 101동101호
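
The first rows of the sample suggest that the source file begins with a title line (which became the column name 대구광역시 어린이집 현황), then a reference-month line, and only then the real header (시도, 시군구, 어린이집 유형, 어린이집명, 전화번호, 주소). A minimal sketch of re-reading such a file so that the real header is used follows. It assumes a comma-separated layout and exactly two preamble lines; the inline sample and `load_rows` are illustrative, not the original file or the publisher's code.

```python
import csv
import io

def load_rows(text, preamble_lines=2):
    """Skip the preamble lines, then parse the rest using its first line as the header."""
    lines = io.StringIO(text)
    for _ in range(preamble_lines):
        next(lines)
    return list(csv.DictReader(lines))

# Hypothetical excerpt shaped like the first rows shown in the sample.
sample = (
    "대구광역시 어린이집 현황,,,,,\n"
    "기준년월 : 2014년 12월,,,,,\n"
    "시도,시군구,어린이집 유형,어린이집명,전화번호,주소\n"
    "대구광역시,중구,민간,비둘기어린이집,053-428-2599,대구광역시 중구 명륜로26안길 12 (남산1동)\n"
)
rows = load_rows(sample)
print(rows[0]["어린이집명"])  # 비둘기어린이집
```

With pandas, `pandas.read_csv(path, skiprows=2)` would have a similar effect under the same assumptions. Profiling the re-read data should then show named columns instead of Unnamed: 1 through Unnamed: 5 and drop the stray header values (시도, 시군구, 어린이집 유형) from the value counts.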