Overview

Dataset statistics

Number of variables7
Number of observations1291
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory73.3 KiB
Average record size in memory58.1 B

Variable types

Categorical2
Numeric1
Text3
DateTime1

Dataset

Description근로복지공단에서 지원하는 전국의 직장어린이집 현황 데이터입니다. * 2022년 기준으로 위치, 명칭, 연락처를 제공합니다.
URLhttps://www.data.go.kr/data/3044314/fileData.do

Alerts

연도 has constant value ""Constant
연번 is highly overall correlated with 지역High correlation
지역 is highly overall correlated with 연번High correlation
연번 has unique valuesUnique
전화번호 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:46:13.806907
Analysis finished2023-12-12 09:46:14.887483
Duration1.08 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연도
Categorical

CONSTANT 

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2022
1291 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2022
2nd row2022
3rd row2022
4th row2022
5th row2022

Common Values

ValueCountFrequency (%)
2022 1291
100.0%

Length

2023-12-12T18:46:14.967392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:46:15.094802image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2022 1291
100.0%

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1291
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean646
Minimum1
Maximum1291
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size11.5 KiB
2023-12-12T18:46:15.231429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile65.5
Q1323.5
median646
Q3968.5
95-th percentile1226.5
Maximum1291
Range1290
Interquartile range (IQR)645

Descriptive statistics

Standard deviation372.82391
Coefficient of variation (CV)0.57712679
Kurtosis-1.2
Mean646
Median Absolute Deviation (MAD)323
Skewness0
Sum833986
Variance138997.67
MonotonicityStrictly increasing
2023-12-12T18:46:15.386594image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.1%
889 1
 
0.1%
867 1
 
0.1%
866 1
 
0.1%
865 1
 
0.1%
864 1
 
0.1%
863 1
 
0.1%
862 1
 
0.1%
861 1
 
0.1%
860 1
 
0.1%
Other values (1281) 1281
99.2%
ValueCountFrequency (%)
1 1
0.1%
2 1
0.1%
3 1
0.1%
4 1
0.1%
5 1
0.1%
6 1
0.1%
7 1
0.1%
8 1
0.1%
9 1
0.1%
10 1
0.1%
ValueCountFrequency (%)
1291 1
0.1%
1290 1
0.1%
1289 1
0.1%
1288 1
0.1%
1287 1
0.1%
1286 1
0.1%
1285 1
0.1%
1284 1
0.1%
1283 1
0.1%
1282 1
0.1%

지역
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)1.3%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
서울특별시
304 
경기도
300 
인천광역시
80 
대전광역시
62 
경상남도
62 
Other values (12)
483 

Length

Max length7
Median length5
Mean length4.2850503
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row서울특별시
2nd row서울특별시
3rd row서울특별시
4th row서울특별시
5th row서울특별시

Common Values

ValueCountFrequency (%)
서울특별시 304
23.5%
경기도 300
23.2%
인천광역시 80
 
6.2%
대전광역시 62
 
4.8%
경상남도 62
 
4.8%
부산광역시 60
 
4.6%
충청남도 58
 
4.5%
경상북도 58
 
4.5%
강원도 57
 
4.4%
대구광역시 41
 
3.2%
Other values (7) 209
16.2%

Length

2023-12-12T18:46:15.558852image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
서울특별시 304
23.5%
경기도 300
23.2%
인천광역시 80
 
6.2%
대전광역시 62
 
4.8%
경상남도 62
 
4.8%
부산광역시 60
 
4.6%
충청남도 58
 
4.5%
경상북도 58
 
4.5%
강원도 57
 
4.4%
대구광역시 41
 
3.2%
Other values (7) 209
16.2%
Distinct1265
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2023-12-12T18:46:15.782571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length28
Median length23
Mean length10.134005
Min length6

Characters and Unicode

Total characters13083
Distinct characters514
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1243 ?
Unique (%)96.3%

Sample

1st row101경비단 어린이집
2nd rowAJ어린이집
3rd rowCJ키즈빌 어린이집
4th rowCJ키즈빌어린이집
5th rowDB손해보험 아이사랑 어린이집
ValueCountFrequency (%)
어린이집 114
 
6.9%
직장어린이집 35
 
2.1%
ibk 7
 
0.4%
우리누리어린이집 6
 
0.4%
5
 
0.3%
공동직장어린이집 5
 
0.3%
도담어린이집 4
 
0.2%
행복어린이집 4
 
0.2%
좋은 4
 
0.2%
꿈나무어린이집 3
 
0.2%
Other values (1375) 1455
88.6%
2023-12-12T18:46:16.208430image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1406
 
10.7%
1307
 
10.0%
1301
 
9.9%
1294
 
9.9%
351
 
2.7%
206
 
1.6%
175
 
1.3%
148
 
1.1%
146
 
1.1%
145
 
1.1%
Other values (504) 6604
50.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 11893
90.9%
Uppercase Letter 548
 
4.2%
Space Separator 351
 
2.7%
Lowercase Letter 154
 
1.2%
Decimal Number 41
 
0.3%
Open Punctuation 36
 
0.3%
Close Punctuation 36
 
0.3%
Other Punctuation 21
 
0.2%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1406
 
11.8%
1307
 
11.0%
1301
 
10.9%
1294
 
10.9%
206
 
1.7%
175
 
1.5%
148
 
1.2%
146
 
1.2%
145
 
1.2%
141
 
1.2%
Other values (444) 5624
47.3%
Uppercase Letter
ValueCountFrequency (%)
K 86
15.7%
S 66
12.0%
I 53
9.7%
D 42
7.7%
G 42
7.7%
B 42
7.7%
L 38
 
6.9%
C 30
 
5.5%
T 25
 
4.6%
N 23
 
4.2%
Other values (15) 101
18.4%
Lowercase Letter
ValueCountFrequency (%)
m 21
13.6%
e 16
10.4%
i 14
 
9.1%
a 13
 
8.4%
o 12
 
7.8%
s 11
 
7.1%
r 9
 
5.8%
t 8
 
5.2%
l 7
 
4.5%
c 6
 
3.9%
Other values (11) 37
24.0%
Decimal Number
ValueCountFrequency (%)
2 20
48.8%
1 12
29.3%
3 3
 
7.3%
5 2
 
4.9%
4 2
 
4.9%
0 1
 
2.4%
9 1
 
2.4%
Other Punctuation
ValueCountFrequency (%)
! 12
57.1%
& 6
28.6%
. 3
 
14.3%
Space Separator
ValueCountFrequency (%)
351
100.0%
Open Punctuation
ValueCountFrequency (%)
( 36
100.0%
Close Punctuation
ValueCountFrequency (%)
) 36
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 11893
90.9%
Latin 702
 
5.4%
Common 488
 
3.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1406
 
11.8%
1307
 
11.0%
1301
 
10.9%
1294
 
10.9%
206
 
1.7%
175
 
1.5%
148
 
1.2%
146
 
1.2%
145
 
1.2%
141
 
1.2%
Other values (444) 5624
47.3%
Latin
ValueCountFrequency (%)
K 86
 
12.3%
S 66
 
9.4%
I 53
 
7.5%
D 42
 
6.0%
G 42
 
6.0%
B 42
 
6.0%
L 38
 
5.4%
C 30
 
4.3%
T 25
 
3.6%
N 23
 
3.3%
Other values (36) 255
36.3%
Common
ValueCountFrequency (%)
351
71.9%
( 36
 
7.4%
) 36
 
7.4%
2 20
 
4.1%
! 12
 
2.5%
1 12
 
2.5%
& 6
 
1.2%
. 3
 
0.6%
- 3
 
0.6%
3 3
 
0.6%
Other values (4) 6
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 11893
90.9%
ASCII 1190
 
9.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1406
 
11.8%
1307
 
11.0%
1301
 
10.9%
1294
 
10.9%
206
 
1.7%
175
 
1.5%
148
 
1.2%
146
 
1.2%
145
 
1.2%
141
 
1.2%
Other values (444) 5624
47.3%
ASCII
ValueCountFrequency (%)
351
29.5%
K 86
 
7.2%
S 66
 
5.5%
I 53
 
4.5%
D 42
 
3.5%
G 42
 
3.5%
B 42
 
3.5%
L 38
 
3.2%
( 36
 
3.0%
) 36
 
3.0%
Other values (50) 398
33.4%
Distinct1280
Distinct (%)99.1%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2023-12-12T18:46:16.613381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length55
Median length46
Mean length26.264911
Min length13

Characters and Unicode

Total characters33908
Distinct characters515
Distinct categories10 ?
Distinct scripts4 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1270 ?
Unique (%)98.4%

Sample

1st row서울특별시 성북구 정릉로10라길 18-5
2nd row서울특별시 송파구 정의로8길 9 문정동 640-5
3rd row서울특별시 중구 동호로 330 (쌍림동)
4th row서울특별시 서초구 전원말4길 7
5th row서울특별시 용산구 후암로 107 게이트웨이타워
ValueCountFrequency (%)
서울특별시 305
 
4.3%
경기도 300
 
4.2%
인천광역시 80
 
1.1%
1층 72
 
1.0%
경상남도 62
 
0.9%
대전광역시 62
 
0.9%
부산광역시 60
 
0.8%
경상북도 58
 
0.8%
충청남도 58
 
0.8%
성남시 58
 
0.8%
Other values (2990) 5982
84.3%
2023-12-12T18:46:17.246862image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5833
 
17.2%
1304
 
3.8%
1226
 
3.6%
1 1200
 
3.5%
997
 
2.9%
815
 
2.4%
2 786
 
2.3%
741
 
2.2%
3 565
 
1.7%
535
 
1.6%
Other values (505) 19906
58.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 21609
63.7%
Space Separator 5833
 
17.2%
Decimal Number 5063
 
14.9%
Close Punctuation 470
 
1.4%
Open Punctuation 469
 
1.4%
Dash Punctuation 193
 
0.6%
Uppercase Letter 140
 
0.4%
Other Punctuation 115
 
0.3%
Lowercase Letter 11
 
< 0.1%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1304
 
6.0%
1226
 
5.7%
997
 
4.6%
815
 
3.8%
741
 
3.4%
535
 
2.5%
484
 
2.2%
437
 
2.0%
429
 
2.0%
407
 
1.9%
Other values (453) 14234
65.9%
Uppercase Letter
ValueCountFrequency (%)
S 18
12.9%
K 16
11.4%
L 11
7.9%
B 11
7.9%
I 11
7.9%
T 11
7.9%
A 10
 
7.1%
D 10
 
7.1%
G 8
 
5.7%
C 7
 
5.0%
Other values (12) 27
19.3%
Decimal Number
ValueCountFrequency (%)
1 1200
23.7%
2 786
15.5%
3 565
11.2%
0 479
 
9.5%
5 416
 
8.2%
4 390
 
7.7%
6 357
 
7.1%
7 333
 
6.6%
9 279
 
5.5%
8 258
 
5.1%
Lowercase Letter
ValueCountFrequency (%)
s 2
18.2%
e 1
9.1%
k 1
9.1%
t 1
9.1%
d 1
9.1%
y 1
9.1%
a 1
9.1%
l 1
9.1%
p 1
9.1%
i 1
9.1%
Other Punctuation
ValueCountFrequency (%)
, 106
92.2%
. 8
 
7.0%
/ 1
 
0.9%
Close Punctuation
ValueCountFrequency (%)
) 466
99.1%
] 4
 
0.9%
Open Punctuation
ValueCountFrequency (%)
( 465
99.1%
[ 4
 
0.9%
Space Separator
ValueCountFrequency (%)
5833
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 193
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 21608
63.7%
Common 12148
35.8%
Latin 151
 
0.4%
Han 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1304
 
6.0%
1226
 
5.7%
997
 
4.6%
815
 
3.8%
741
 
3.4%
535
 
2.5%
484
 
2.2%
437
 
2.0%
429
 
2.0%
407
 
1.9%
Other values (452) 14233
65.9%
Latin
ValueCountFrequency (%)
S 18
11.9%
K 16
10.6%
L 11
 
7.3%
B 11
 
7.3%
I 11
 
7.3%
T 11
 
7.3%
A 10
 
6.6%
D 10
 
6.6%
G 8
 
5.3%
C 7
 
4.6%
Other values (22) 38
25.2%
Common
ValueCountFrequency (%)
5833
48.0%
1 1200
 
9.9%
2 786
 
6.5%
3 565
 
4.7%
0 479
 
3.9%
) 466
 
3.8%
( 465
 
3.8%
5 416
 
3.4%
4 390
 
3.2%
6 357
 
2.9%
Other values (10) 1191
 
9.8%
Han
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 21608
63.7%
ASCII 12299
36.3%
CJK 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5833
47.4%
1 1200
 
9.8%
2 786
 
6.4%
3 565
 
4.6%
0 479
 
3.9%
) 466
 
3.8%
( 465
 
3.8%
5 416
 
3.4%
4 390
 
3.2%
6 357
 
2.9%
Other values (42) 1342
 
10.9%
Hangul
ValueCountFrequency (%)
1304
 
6.0%
1226
 
5.7%
997
 
4.6%
815
 
3.8%
741
 
3.4%
535
 
2.5%
484
 
2.2%
437
 
2.0%
429
 
2.0%
407
 
1.9%
Other values (452) 14233
65.9%
CJK
ValueCountFrequency (%)
1
100.0%

전화번호
Text

UNIQUE 

Distinct1291
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
2023-12-12T18:46:17.577237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.001549
Min length11

Characters and Unicode

Total characters15494
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1291 ?
Unique (%)100.0%

Sample

1st row02-911-2101
2nd row02-6240-1490
3rd row02-722-1070
4th row02-586-2022
5th row02-771-8607
ValueCountFrequency (%)
02 20
 
1.5%
0901 2
 
0.1%
7942 2
 
0.1%
0112 2
 
0.1%
02-911-2101 1
 
0.1%
031-594-9977 1
 
0.1%
031-572-0698 1
 
0.1%
031-622-7540 1
 
0.1%
031-911 1
 
0.1%
031-777-2671 1
 
0.1%
Other values (1338) 1338
97.7%
2023-12-12T18:46:18.041692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 2582
16.7%
0 2522
16.3%
2 1609
10.4%
3 1441
9.3%
1 1385
8.9%
5 1275
8.2%
7 1040
6.7%
6 1000
 
6.5%
4 946
 
6.1%
8 854
 
5.5%
Other values (3) 840
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12829
82.8%
Dash Punctuation 2582
 
16.7%
Space Separator 79
 
0.5%
Other Punctuation 4
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2522
19.7%
2 1609
12.5%
3 1441
11.2%
1 1385
10.8%
5 1275
9.9%
7 1040
8.1%
6 1000
 
7.8%
4 946
 
7.4%
8 854
 
6.7%
9 757
 
5.9%
Dash Punctuation
ValueCountFrequency (%)
- 2582
100.0%
Space Separator
ValueCountFrequency (%)
79
100.0%
Other Punctuation
ValueCountFrequency (%)
* 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15494
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 2582
16.7%
0 2522
16.3%
2 1609
10.4%
3 1441
9.3%
1 1385
8.9%
5 1275
8.2%
7 1040
6.7%
6 1000
 
6.5%
4 946
 
6.1%
8 854
 
5.5%
Other values (3) 840
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15494
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 2582
16.7%
0 2522
16.3%
2 1609
10.4%
3 1441
9.3%
1 1385
8.9%
5 1275
8.2%
7 1040
6.7%
6 1000
 
6.5%
4 946
 
6.1%
8 854
 
5.5%
Other values (3) 840
 
5.4%
Distinct875
Distinct (%)67.8%
Missing0
Missing (%)0.0%
Memory size10.2 KiB
Minimum1992-01-13 00:00:00
Maximum2022-12-01 00:00:00
2023-12-12T18:46:18.206872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:46:18.358389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-12T18:46:14.527225image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:46:18.457573image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역
연번1.0000.950
지역0.9501.000
2023-12-12T18:46:18.565695image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번지역
연번1.0000.783
지역0.7831.000

Missing values

2023-12-12T18:46:14.679252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:46:14.819154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연도연번지역직장어린이집명소재지전화번호설치일(인가일)
020221서울특별시101경비단 어린이집서울특별시 성북구 정릉로10라길 18-502-911-21012022-03-07
120222서울특별시AJ어린이집서울특별시 송파구 정의로8길 9 문정동 640-502-6240-14902016-01-13
220223서울특별시CJ키즈빌 어린이집서울특별시 중구 동호로 330 (쌍림동)02-722-10702011-05-27
320224서울특별시CJ키즈빌어린이집서울특별시 서초구 전원말4길 702-586-20222020-03-01
420225서울특별시DB손해보험 아이사랑 어린이집서울특별시 용산구 후암로 107 게이트웨이타워02-771-86072014-02-20
520226서울특별시DYPNF행복한어린이집서울특별시 강서구 마곡중앙8로7길 3902 -2160-82102022-02-22
620227서울특별시GKL행복(강남 코엑스2호점)어린이집서울특별시 강남구 봉은사로68길 2702-501-33852018-02-27
720228서울특별시GKL행복어린이집(강남)서울특별시 강남구 삼성로122길 7 GKL 행복 어린이집02-3448-45712015-03-02
820229서울특별시GKL행복어린이집(강북)서울특별시 중구 남대문로5가 한강대로 416 서울스퀘어 5층02-6456-88162015-02-27
9202210서울특별시GS SHOP 도담도담 어린이집서울특별시 영등포구 문래동6가 선유로 7502-3667-89352015-03-01
연도연번지역직장어린이집명소재지전화번호설치일(인가일)
128120221282제주특별자치도제주대학교직장어린이집제주특별자치도 제주시 제주대학로 102 교직원 아파트 아라인빌 내 (아라동)064-751-22282010-04-02
128220221283제주특별자치도제주시청직장어린이집제주특별자치도 제주시 광양9길 10 (이도이동)064-723-66032003-06-01
128320221284제주특별자치도제주신화월드어린이집제주특별자치도 서귀포시 안덕면 신화역사로188번길 151070-4548-10602019-02-28
128420221285제주특별자치도제주지방해양경찰청어린이집제주특별자치도 제주시 구산로 63 (아라일동)064-755-30192015-08-26
128520221286제주특별자치도제주특별자치도청어린이집제주특별자치도 제주시 문연로 30 (연동)064-746-75572009-03-20
128620221287제주특별자치도제주한라병원직장어린이집제주특별자치도 제주시 남녕로 5-3064-745-20152015-03-02
128720221288제주특별자치도탐라해군어린이집제주특별자치도 서귀포시 이어도로 642070-4543-20012022-03-30
128820221289제주특별자치도한국가스공사제주늘푸른어린이집제주특별자치도 제주시 광평서길 16064-744 -38112022-12-01
128920221290제주특별자치도한국비엠아이어린이집제주특별자치도 제주시 월평4길 12064-724-51052020-08-10
129020221291제주특별자치도한마음 어린이집제주특별자치도 제주시 남광로 104 (이도이동)064-750-94552000-03-03