Overview

Dataset statistics

Number of variables: 13
Number of observations: 492
Missing cells: 568
Missing cells (%): 8.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 50.6 KiB
Average record size in memory: 105.3 B
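
The derived figures in this block follow from the raw counts. As a rough check, the sketch below recomputes the missing-cell percentage and the average record size from the numbers reported above; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" block of the report.
n_variables = 13
n_observations = 492
missing_cells = 568
total_size_kib = 50.6

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# 1 KiB = 1024 bytes; the average is taken over observations (rows).
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Both results round to the values shown in the report (8.9% and 105.3 B).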

Variable types

Categorical: 5
Text: 5
Boolean: 2
Numeric: 1

Dataset

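Description: 교육청명 (office of education), 교육지원청명 (district office of education), 유치원코드 (kindergarten code), 유치원명 (kindergarten name), 설립유형 (establishment type), 가입 보험명 (insurance name), 대상여부 (subject to the insurance requirement), 가입여부 (enrolled), 업체명1, 업체명2, 업체명3 (insurer names 1 to 3), 공시차수 (disclosure round), 주소 (address)
Author: 노원구 (Nowon-gu)
URL: https://data.seoul.go.kr/dataList/OA-20864/S/1/datasetView.do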

Alerts

교육청명 has constant value "서울특별시교육청" [Constant]
교육지원청명 has constant value "북부교육지원청" [Constant]
업체명3 has constant value "DB손해보험" [Constant]
대상여부 is highly overall correlated with 가입여부 and 1 other field [High correlation]
가입여부 is highly overall correlated with 대상여부 and 1 other field [High correlation]
업체명2 is highly overall correlated with 대상여부 and 1 other field [High correlation]
공시차수 is highly overall correlated with 가입 보험명 [High correlation]
가입 보험명 is highly overall correlated with 공시차수 [High correlation]
업체명2 is highly imbalanced (83.8%) [Imbalance]
업체명1 has 78 (15.9%) missing values [Missing]
업체명3 has 490 (99.6%) missing values [Missing]
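
The 83.8% imbalance figure for 업체명2 can be checked by hand. The sketch below assumes the score is one minus the Shannon entropy of the value counts divided by the maximum possible entropy (log2 of the number of distinct values); that definition is an assumption about how the profiler computes the score, not something the report states. The counts come from the Common Values table for 업체명2, with the three values grouped as "Other values (3)" taken as one occurrence each.

```python
import math

# Value counts for 업체명2 from its Common Values table: "<NA>" (456), two
# values with 9 rows, then 5, 3, 2, 2, and six values that occur once each
# (three listed individually plus the three under "Other values (3)").
counts = [456, 9, 9, 5, 3, 2, 2, 1, 1, 1, 1, 1, 1]


def imbalance_score(value_counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy in bits."""
    total = sum(value_counts)
    k = len(value_counts)
    if k < 2:
        raise ValueError("need at least two distinct values")
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in value_counts if c > 0
    )
    return 1.0 - entropy / math.log2(k)


score = imbalance_score(counts)
print(f"{100 * score:.1f}%")
```

Under these assumptions the score comes out at roughly 0.838, in line with the alert.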

Reproduction

Analysis started: 2024-03-13 12:39:11.536736
Analysis finished: 2024-03-13 12:39:13.447249
Duration: 1.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
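
A report of this kind is typically produced with a few lines of Python. The following is a minimal sketch, not the configuration actually used here (that is in config.json): the CSV path, the output path, the file encoding, and the report title are all hypothetical, and the `dataset` metadata keys are assumed to map onto the Description, Author, and URL fields shown in the Overview.

```python
from pathlib import Path


def dataset_metadata():
    """Dataset block as shown in the report's Overview section."""
    return {
        "description": "교육청명,교육지원청명,유치원코드,유치원명,설립유형,가입 보험명,"
        "대상여부,가입여부,업체명1,업체명2,업체명3,공시차수,주소",
        "author": "노원구",
        "url": "https://data.seoul.go.kr/dataList/OA-20864/S/1/datasetView.do",
    }


def build_report(csv_path, html_path):
    """Profile the CSV at csv_path and write an HTML report to html_path."""
    # Third-party imports are kept local so the metadata helper above can be
    # used without pandas or ydata-profiling installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # The encoding is an assumption; adjust it to match the actual file.
    df = pd.read_csv(csv_path, encoding="utf-8")
    # The title is a placeholder chosen for this sketch.
    profile = ProfileReport(
        df, title="Kindergarten insurance profile", dataset=dataset_metadata()
    )
    profile.to_file(Path(html_path))
    return profile
```

Calling `build_report("kindergarten_insurance.csv", "report.html")` (both file names hypothetical) would then write the HTML report next to the script.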

Variables

교육청명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB
서울특별시교육청: 492

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울특별시교육청
2nd row: 서울특별시교육청
3rd row: 서울특별시교육청
4th row: 서울특별시교육청
5th row: 서울특별시교육청

Common Values

Value | Count | Frequency (%)
서울특별시교육청 | 492 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
서울특별시교육청 | 492 | 100.0%

교육지원청명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB
북부교육지원청: 492

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 북부교육지원청
2nd row: 북부교육지원청
3rd row: 북부교육지원청
4th row: 북부교육지원청
5th row: 북부교육지원청

Common Values

Value | Count | Frequency (%)
북부교육지원청 | 492 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
북부교육지원청 | 492 | 100.0%

유치원코드
Text

Distinct: 78
Distinct (%): 15.9%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB

Length

Max length: 36
Median length: 36
Mean length: 36
Min length: 36

Characters and Unicode

Total characters: 17712
Distinct characters: 17
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
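
As a small illustration of these character properties, the sketch below uses Python's standard `unicodedata` module to tally the general category of each character in the sample value listed for this variable. The mapping from two-letter category codes to the long names used in the report is written out by hand for the three categories that occur.

```python
import unicodedata
from collections import Counter

# Sample value of the kindergarten code column, taken from the report.
sample = "0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3"

# Long names for the general-category codes that occur in this value.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Ll": "Lowercase Letter",
    "Pd": "Dash Punctuation",
}

category_counts = Counter(
    CATEGORY_NAMES[unicodedata.category(ch)] for ch in sample
)

for name, count in category_counts.most_common():
    print(f"{name}: {count}")

# Every value is 36 characters long, so 492 rows give the reported total.
total_characters = 492 * len(sample)
print(total_characters)
```

The last line reproduces the "Total characters" figure of 17712 reported for this column.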

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3
2nd row: 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3
3rd row: 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3
4th row: 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3
5th row: 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3

Value | Count | Frequency (%)
7594e46b-0646-4ce2-9bce-ad3493a6b4ad | 11 | 2.2%
1ecec08d-0ef1-b044-e053-0a32095ab044 | 9 | 1.8%
1ecec08d-0e2e-b044-e053-0a32095ab044 | 9 | 1.8%
1ecec08c-fc37-b044-e053-0a32095ab044 | 8 | 1.6%
1ecec08d-08f6-b044-e053-0a32095ab044 | 8 | 1.6%
1ecec08d-04a7-b044-e053-0a32095ab044 | 8 | 1.6%
1ecec08c-ed88-b044-e053-0a32095ab044 | 7 | 1.4%
1ecec08d-008e-b044-e053-0a32095ab044 | 7 | 1.4%
1ecec08d-0742-b044-e053-0a32095ab044 | 7 | 1.4%
1ecec08d-08f7-b044-e053-0a32095ab044 | 7 | 1.4%
Other values (68) | 411 | 83.5%

Most occurring characters

ValueCountFrequency (%)
0 2919
16.5%
4 2024
11.4%
- 1968
11.1%
e 1572
8.9%
c 1273
 
7.2%
3 1053
 
5.9%
b 1043
 
5.9%
5 1022
 
5.8%
a 985
 
5.6%
1 659
 
3.7%
Other values (7) 3194
18.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 9958
56.2%
Lowercase Letter 5786
32.7%
Dash Punctuation 1968
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2919
29.3%
4 2024
20.3%
3 1053
 
10.6%
5 1022
 
10.3%
1 659
 
6.6%
8 625
 
6.3%
2 623
 
6.3%
9 598
 
6.0%
6 244
 
2.5%
7 191
 
1.9%
Lowercase Letter
ValueCountFrequency (%)
e 1572
27.2%
c 1273
22.0%
b 1043
18.0%
a 985
17.0%
d 476
 
8.2%
f 437
 
7.6%
Dash Punctuation
ValueCountFrequency (%)
- 1968
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 11926
67.3%
Latin 5786
32.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2919
24.5%
4 2024
17.0%
- 1968
16.5%
3 1053
 
8.8%
5 1022
 
8.6%
1 659
 
5.5%
8 625
 
5.2%
2 623
 
5.2%
9 598
 
5.0%
6 244
 
2.0%
Latin
ValueCountFrequency (%)
e 1572
27.2%
c 1273
22.0%
b 1043
18.0%
a 985
17.0%
d 476
 
8.2%
f 437
 
7.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17712
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2919
16.5%
4 2024
11.4%
- 1968
11.1%
e 1572
8.9%
c 1273
 
7.2%
3 1053
 
5.9%
b 1043
 
5.9%
5 1022
 
5.8%
a 985
 
5.6%
1 659
 
3.7%
Other values (7) 3194
18.0%

유치원명
Text

Distinct: 78
Distinct (%): 15.9%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB

Length

Max length: 14
Median length: 5
Mean length: 7.3231707
Min length: 5

Characters and Unicode

Total characters: 3603
Distinct characters: 103
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울상계초등학교병설유치원
2nd row: 서울상계초등학교병설유치원
3rd row: 서울상계초등학교병설유치원
4th row: 서울상계초등학교병설유치원
5th row: 서울상계초등학교병설유치원

Value | Count | Frequency (%)
꿈동산아이유치원 | 11 | 2.2%
노원 | 9 | 1.8%
삼육유치원 | 9 | 1.8%
서울동일초등학교병설유치원 | 9 | 1.8%
산내들유치원 | 8 | 1.6%
삼육대학교부속유치원 | 8 | 1.6%
예진유치원 | 8 | 1.6%
서울여자대학교부속유치원 | 7 | 1.4%
무지개유치원 | 7 | 1.4%
꿈동산유치원 | 7 | 1.4%
Other values (70) | 424 | 83.6%

Most occurring characters

ValueCountFrequency (%)
531
14.7%
499
13.8%
492
 
13.7%
142
 
3.9%
132
 
3.7%
130
 
3.6%
120
 
3.3%
111
 
3.1%
105
 
2.9%
105
 
2.9%
Other values (93) 1236
34.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3588
99.6%
Space Separator 15
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
531
14.8%
499
13.9%
492
13.7%
142
 
4.0%
132
 
3.7%
130
 
3.6%
120
 
3.3%
111
 
3.1%
105
 
2.9%
105
 
2.9%
Other values (92) 1221
34.0%
Space Separator
ValueCountFrequency (%)
15
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3588
99.6%
Common 15
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
531
14.8%
499
13.9%
492
13.7%
142
 
4.0%
132
 
3.7%
130
 
3.6%
120
 
3.3%
111
 
3.1%
105
 
2.9%
105
 
2.9%
Other values (92) 1221
34.0%
Common
ValueCountFrequency (%)
15
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3588
99.6%
ASCII 15
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
531
14.8%
499
13.9%
492
13.7%
142
 
4.0%
132
 
3.7%
130
 
3.6%
120
 
3.3%
111
 
3.1%
105
 
2.9%
105
 
2.9%
Other values (92) 1221
34.0%
ASCII
ValueCountFrequency (%)
15
100.0%

설립유형
Categorical

Distinct: 4
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB
사립(사인): 303
공립(병설): 105
사립(법인): 66
공립(단설): 18

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 공립(병설)
2nd row: 공립(병설)
3rd row: 공립(병설)
4th row: 공립(병설)
5th row: 공립(병설)

Common Values

Value | Count | Frequency (%)
사립(사인) | 303 | 61.6%
공립(병설) | 105 | 21.3%
사립(법인) | 66 | 13.4%
공립(단설) | 18 | 3.7%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
사립(사인 | 303 | 61.6%
공립(병설 | 105 | 21.3%
사립(법인 | 66 | 13.4%
공립(단설 | 18 | 3.7%

가입 보험명
Categorical

HIGH CORRELATION 

Distinct: 25
Distinct (%): 5.1%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB
가스배상책임보험: 78
통학버스 종합보험: 78
통학버스 책임보험: 78
화재보험: 78
놀이시설 안전보험: 78
Other values (20): 102

Length

Max length: 15
Median length: 11
Mean length: 8.9227642
Min length: 3

Unique

Unique: 15
Unique (%): 3.0%

Sample

1st row: 가스배상책임보험
2nd row: 놀이시설 안전보험
3rd row: 영유아생명신체피해(상해보험)
4th row: 통학버스 종합보험
5th row: 통학버스 책임보험

Common Values

Value | Count | Frequency (%)
가스배상책임보험 | 78 | 15.9%
통학버스 종합보험 | 78 | 15.9%
통학버스 책임보험 | 78 | 15.9%
화재보험 | 78 | 15.9%
놀이시설 안전보험 | 78 | 15.9%
영유아생명신체피해(상해보험) | 78 | 15.9%
생산물배상책임보험 | 3 | 0.6%
승강기사고배상책임보험 | 2 | 0.4%
교직원상해보험 | 2 | 0.4%
유아교육기관종합보험 | 2 | 0.4%
Other values (15) | 15 | 3.0%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
통학버스 | 158 | 21.7%
가스배상책임보험 | 78 | 10.7%
종합보험 | 78 | 10.7%
책임보험 | 78 | 10.7%
화재보험 | 78 | 10.7%
놀이시설 | 78 | 10.7%
안전보험 | 78 | 10.7%
영유아생명신체피해(상해보험 | 78 | 10.7%
생산물배상책임보험 | 3 | 0.4%
교직원상해보험 | 3 | 0.4%
Other values (16) | 18 | 2.5%

대상여부
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 624.0 B

Value | Count | Frequency (%)
True | 409 | 83.1%
False | 83 | 16.9%

가입여부
Boolean

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 624.0 B

Value | Count | Frequency (%)
True | 414 | 84.1%
False | 78 | 15.9%

업체명1
Text

MISSING 

Distinct: 71
Distinct (%): 17.1%
Missing: 78
Missing (%): 15.9%
Memory size: 4.0 KiB

Length

Max length: 23
Median length: 12
Mean length: 7.089372
Min length: 3

Characters and Unicode

Total characters: 2935
Distinct characters: 96
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2

Unique

Unique: 30
Unique (%): 7.2%

Sample

1st row: KB손해보험
2nd row: 한국교육시설안전원
3rd row: 학교안전공제회
4th row: 한국교육시설안전원
5th row: 현대해상

Value | Count | Frequency (%)
학교안전공제회 | 82 | 18.6%
현대해상 | 53 | 12.0%
한국교육시설안전원 | 45 | 10.2%
현대해상화재보험 | 41 | 9.3%
삼성화재 | 21 | 4.8%
db손해보험 | 19 | 4.3%
kb손해보험 | 19 | 4.3%
손해보험 | 17 | 3.9%
메리츠화재 | 10 | 2.3%
삼성화재해상보험 | 7 | 1.6%
Other values (61) | 126 | 28.6%

Most occurring characters

ValueCountFrequency (%)
180
 
6.1%
166
 
5.7%
157
 
5.3%
141
 
4.8%
141
 
4.8%
139
 
4.7%
124
 
4.2%
124
 
4.2%
116
 
4.0%
114
 
3.9%
Other values (86) 1533
52.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2776
94.6%
Uppercase Letter 87
 
3.0%
Space Separator 34
 
1.2%
Lowercase Letter 32
 
1.1%
Dash Punctuation 4
 
0.1%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
180
 
6.5%
166
 
6.0%
157
 
5.7%
141
 
5.1%
141
 
5.1%
139
 
5.0%
124
 
4.5%
124
 
4.5%
116
 
4.2%
114
 
4.1%
Other values (72) 1374
49.5%
Uppercase Letter
ValueCountFrequency (%)
B 40
46.0%
D 20
23.0%
K 20
23.0%
T 4
 
4.6%
A 2
 
2.3%
X 1
 
1.1%
Lowercase Letter
ValueCountFrequency (%)
k 12
37.5%
b 10
31.2%
h 4
 
12.5%
e 4
 
12.5%
d 2
 
6.2%
Space Separator
ValueCountFrequency (%)
34
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2776
94.6%
Latin 119
 
4.1%
Common 40
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
180
 
6.5%
166
 
6.0%
157
 
5.7%
141
 
5.1%
141
 
5.1%
139
 
5.0%
124
 
4.5%
124
 
4.5%
116
 
4.2%
114
 
4.1%
Other values (72) 1374
49.5%
Latin
ValueCountFrequency (%)
B 40
33.6%
D 20
16.8%
K 20
16.8%
k 12
 
10.1%
b 10
 
8.4%
h 4
 
3.4%
e 4
 
3.4%
T 4
 
3.4%
A 2
 
1.7%
d 2
 
1.7%
Common
ValueCountFrequency (%)
34
85.0%
- 4
 
10.0%
, 2
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2776
94.6%
ASCII 159
 
5.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
180
 
6.5%
166
 
6.0%
157
 
5.7%
141
 
5.1%
141
 
5.1%
139
 
5.0%
124
 
4.5%
124
 
4.5%
116
 
4.2%
114
 
4.1%
Other values (72) 1374
49.5%
ASCII
ValueCountFrequency (%)
B 40
25.2%
34
21.4%
D 20
12.6%
K 20
12.6%
k 12
 
7.5%
b 10
 
6.3%
h 4
 
2.5%
- 4
 
2.5%
e 4
 
2.5%
T 4
 
2.5%
Other values (4) 7
 
4.4%

업체명2
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 13
Distinct (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB
<NA>: 456
현대해상: 9
현대해상화재보험: 9
DB손해보험: 5
삼성화재: 3
Other values (8): 10

Length

Max length: 14
Median length: 4
Mean length: 4.1707317
Min length: 4

Unique

Unique: 6
Unique (%): 1.2%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 456 | 92.7%
현대해상 | 9 | 1.8%
현대해상화재보험 | 9 | 1.8%
DB손해보험 | 5 | 1.0%
삼성화재 | 3 | 0.6%
삼성화재해상보험 | 2 | 0.4%
삼성가스배상책임보험 | 2 | 0.4%
한화손해보험 | 1 | 0.2%
버스운송협회 | 1 | 0.2%
현대해상화재 | 1 | 0.2%
Other values (3) | 3 | 0.6%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
na | 456 | 92.5%
현대해상 | 9 | 1.8%
현대해상화재보험 | 9 | 1.8%
db손해보험 | 5 | 1.0%
삼성화재 | 3 | 0.6%
삼성화재해상보험 | 2 | 0.4%
삼성가스배상책임보험 | 2 | 0.4%
한화손해보험 | 1 | 0.2%
버스운송협회 | 1 | 0.2%
현대해상화재 | 1 | 0.2%
Other values (4) | 4 | 0.8%

업체명3
Text

CONSTANT  MISSING 

Distinct: 1
Distinct (%): 50.0%
Missing: 490
Missing (%): 99.6%
Memory size: 4.0 KiB

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Characters and Unicode

Total characters: 12
Distinct characters: 6
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: DB손해보험
2nd row: DB손해보험

Value | Count | Frequency (%)
db손해보험 | 2 | 100.0%

Most occurring characters

ValueCountFrequency (%)
D 2
16.7%
B 2
16.7%
2
16.7%
2
16.7%
2
16.7%
2
16.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8
66.7%
Uppercase Letter 4
33.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
2
25.0%
Uppercase Letter
ValueCountFrequency (%)
D 2
50.0%
B 2
50.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8
66.7%
Latin 4
33.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
2
25.0%
Latin
ValueCountFrequency (%)
D 2
50.0%
B 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8
66.7%
ASCII 4
33.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
D 2
50.0%
B 2
50.0%
Hangul
ValueCountFrequency (%)
2
25.0%
2
25.0%
2
25.0%
2
25.0%

공시차수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 11
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20222.768
Minimum: 20181
Maximum: 20232
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.5 KiB

Quantile statistics

Minimum: 20181
5-th percentile: 20182
Q1: 20212
Median: 20232
Q3: 20232
95-th percentile: 20232
Maximum: 20232
Range: 51
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 16.3902
Coefficient of variation (CV): 0.00081048253
Kurtosis: 0.5358581
Mean: 20222.768
Median Absolute Deviation (MAD): 0
Skewness: -1.4523546
Sum: 9949602
Variance: 268.63867
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=11)

Common values

Value | Count | Frequency (%)
20232 | 358 | 72.8%
20202 | 36 | 7.3%
20192 | 31 | 6.3%
20182 | 30 | 6.1%
20212 | 20 | 4.1%
20201 | 5 | 1.0%
20222 | 5 | 1.0%
20221 | 4 | 0.8%
20211 | 1 | 0.2%
20181 | 1 | 0.2%

Minimum 10 values

Value | Count | Frequency (%)
20181 | 1 | 0.2%
20182 | 30 | 6.1%
20192 | 31 | 6.3%
20201 | 5 | 1.0%
20202 | 36 | 7.3%
20211 | 1 | 0.2%
20212 | 20 | 4.1%
20221 | 4 | 0.8%
20222 | 5 | 1.0%
20231 | 1 | 0.2%

Maximum 10 values

Value | Count | Frequency (%)
20232 | 358 | 72.8%
20231 | 1 | 0.2%
20222 | 5 | 1.0%
20221 | 4 | 0.8%
20212 | 20 | 4.1%
20211 | 1 | 0.2%
20202 | 36 | 7.3%
20201 | 5 | 1.0%
20192 | 31 | 6.3%
20182 | 30 | 6.1%
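
The descriptive statistics for 공시차수 can be recomputed from its value counts. The sketch below rebuilds the sum, mean, variance, and standard deviation from the eleven distinct values listed in the frequency tables above. It assumes the variance is the sample variance (divisor n - 1), which is what reproduces the reported figures. The split of each code into a year and a round number is only a guess at the encoding (for example 20232 read as 2023, round 2), not something the report states.

```python
import math

# Value counts for 공시차수, combined from the frequency tables in the report.
value_counts = {
    20181: 1, 20182: 30, 20192: 31, 20201: 5, 20202: 36, 20211: 1,
    20212: 20, 20221: 4, 20222: 5, 20231: 1, 20232: 358,
}

n = sum(value_counts.values())
total = sum(value * count for value, count in value_counts.items())
mean = total / n

# Sample variance: squared deviations from the mean, divided by n - 1.
squared_dev = sum(
    count * (value - mean) ** 2 for value, count in value_counts.items()
)
variance = squared_dev / (n - 1)
std_dev = math.sqrt(variance)

print(n, total, round(mean, 3), round(variance, 3), round(std_dev, 4))


def split_code(code):
    """Hypothetical decoding: leading four digits as year, last digit as round."""
    return divmod(code, 10)


print(split_code(20232))
```

The recomputed sum (9949602), mean, variance, and standard deviation agree with the values in the Descriptive statistics block.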

주소
Text

Distinct: 75
Distinct (%): 15.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB

Length

Max length: 23
Median length: 22
Mean length: 19.083333
Min length: 15

Characters and Unicode

Total characters: 9389
Distinct characters: 54
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 서울특별시 노원구 상계로9길 39
2nd row: 서울특별시 노원구 상계로9길 39
3rd row: 서울특별시 노원구 상계로9길 39
4th row: 서울특별시 노원구 상계로9길 39
5th row: 서울특별시 노원구 상계로9길 39

Value | Count | Frequency (%)
노원구 | 492 | 25.0%
서울특별시 | 486 | 24.7%
한글비석로 | 33 | 1.7%
동일로227길 | 24 | 1.2%
26 | 24 | 1.2%
섬밭로 | 21 | 1.1%
23 | 19 | 1.0%
월계로45가길 | 18 | 0.9%
16 | 18 | 0.9%
화랑로 | 15 | 0.8%
Other values (112) | 818 | 41.6%

Most occurring characters

ValueCountFrequency (%)
1476
15.7%
522
 
5.6%
522
 
5.6%
492
 
5.2%
492
 
5.2%
492
 
5.2%
486
 
5.2%
486
 
5.2%
486
 
5.2%
486
 
5.2%
Other values (44) 3449
36.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5913
63.0%
Decimal Number 1955
 
20.8%
Space Separator 1476
 
15.7%
Dash Punctuation 45
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
522
8.8%
522
8.8%
492
8.3%
492
8.3%
492
8.3%
486
8.2%
486
8.2%
486
8.2%
486
8.2%
326
 
5.5%
Other values (32) 1123
19.0%
Decimal Number
ValueCountFrequency (%)
2 394
20.2%
1 382
19.5%
3 206
10.5%
5 193
9.9%
4 188
9.6%
6 166
8.5%
8 135
 
6.9%
7 127
 
6.5%
9 104
 
5.3%
0 60
 
3.1%
Space Separator
ValueCountFrequency (%)
1476
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 45
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5913
63.0%
Common 3476
37.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
522
8.8%
522
8.8%
492
8.3%
492
8.3%
492
8.3%
486
8.2%
486
8.2%
486
8.2%
486
8.2%
326
 
5.5%
Other values (32) 1123
19.0%
Common
ValueCountFrequency (%)
1476
42.5%
2 394
 
11.3%
1 382
 
11.0%
3 206
 
5.9%
5 193
 
5.6%
4 188
 
5.4%
6 166
 
4.8%
8 135
 
3.9%
7 127
 
3.7%
9 104
 
3.0%
Other values (2) 105
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5913
63.0%
ASCII 3476
37.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1476
42.5%
2 394
 
11.3%
1 382
 
11.0%
3 206
 
5.9%
5 193
 
5.6%
4 188
 
5.4%
6 166
 
4.8%
8 135
 
3.9%
7 127
 
3.7%
9 104
 
3.0%
Other values (2) 105
 
3.0%
Hangul
ValueCountFrequency (%)
522
8.8%
522
8.8%
492
8.3%
492
8.3%
492
8.3%
486
8.2%
486
8.2%
486
8.2%
486
8.2%
326
 
5.5%
Other values (32) 1123
19.0%

Interactions


Correlations

 | 유치원코드 | 유치원명 | 설립유형 | 가입 보험명 | 대상여부 | 가입여부 | 업체명1 | 업체명2 | 공시차수 | 주소
유치원코드 | 1.000 | 1.000 | 1.000 | 0.000 | 0.317 | 0.296 | 0.942 | 0.983 | 0.942 | 1.000
유치원명 | 1.000 | 1.000 | 1.000 | 0.000 | 0.317 | 0.296 | 0.942 | 0.983 | 0.942 | 1.000
설립유형 | 1.000 | 1.000 | 1.000 | 0.217 | 0.458 | 0.460 | 0.557 | 0.651 | 0.392 | 0.987
가입 보험명 | 0.000 | 0.000 | 0.217 | 1.000 | 0.501 | 0.470 | 0.791 | 0.759 | 0.915 | 0.000
대상여부 | 0.317 | 0.317 | 0.458 | 0.501 | 1.000 | 0.993 | 0.120 | NaN | 0.125 | 0.297
가입여부 | 0.296 | 0.296 | 0.460 | 0.470 | 0.993 | 1.000 | NaN | NaN | 0.118 | 0.274
업체명1 | 0.942 | 0.942 | 0.557 | 0.791 | 0.120 | NaN | 1.000 | 0.889 | 0.797 | 0.934
업체명2 | 0.983 | 0.983 | 0.651 | 0.759 | NaN | NaN | 0.889 | 1.000 | 0.558 | 0.983
공시차수 | 0.942 | 0.942 | 0.392 | 0.915 | 0.125 | 0.118 | 0.797 | 0.558 | 1.000 | 0.929
주소 | 1.000 | 1.000 | 0.987 | 0.000 | 0.297 | 0.274 | 0.934 | 0.983 | 0.929 | 1.000

 | 대상여부 | 가입여부 | 설립유형 | 업체명2 | 가입 보험명
대상여부 | 1.000 | 0.926 | 0.309 | 1.000 | 0.425
가입여부 | 0.926 | 1.000 | 0.310 | 1.000 | 0.398
설립유형 | 0.309 | 0.310 | 1.000 | 0.422 | 0.113
업체명2 | 1.000 | 1.000 | 0.422 | 1.000 | 0.348
가입 보험명 | 0.425 | 0.398 | 0.113 | 0.348 | 1.000

 | 공시차수 | 설립유형 | 가입 보험명 | 대상여부 | 가입여부 | 업체명2
공시차수 | 1.000 | 0.260 | 0.658 | 0.125 | 0.118 | 0.294
설립유형 | 0.260 | 1.000 | 0.113 | 0.309 | 0.310 | 0.422
가입 보험명 | 0.658 | 0.113 | 1.000 | 0.425 | 0.398 | 0.348
대상여부 | 0.125 | 0.309 | 0.425 | 1.000 | 0.926 | 1.000
가입여부 | 0.118 | 0.310 | 0.398 | 0.926 | 1.000 | 1.000
업체명2 | 0.294 | 0.422 | 0.348 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

First rows

Row | 교육청명 | 교육지원청명 | 유치원코드 | 유치원명 | 설립유형 | 가입 보험명 | 대상여부 | 가입여부 | 업체명1 | 업체명2 | 업체명3 | 공시차수 | 주소
0 | 서울특별시교육청 | 북부교육지원청 | 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3 | 서울상계초등학교병설유치원 | 공립(병설) | 가스배상책임보험 | Y | Y | KB손해보험 | <NA> | <NA> | 20232 | 서울특별시 노원구 상계로9길 39
1 | 서울특별시교육청 | 북부교육지원청 | 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3 | 서울상계초등학교병설유치원 | 공립(병설) | 놀이시설 안전보험 | Y | Y | 한국교육시설안전원 | <NA> | <NA> | 20232 | 서울특별시 노원구 상계로9길 39
2 | 서울특별시교육청 | 북부교육지원청 | 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3 | 서울상계초등학교병설유치원 | 공립(병설) | 영유아생명신체피해(상해보험) | Y | Y | 학교안전공제회 | <NA> | <NA> | 20232 | 서울특별시 노원구 상계로9길 39
3 | 서울특별시교육청 | 북부교육지원청 | 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3 | 서울상계초등학교병설유치원 | 공립(병설) | 통학버스 종합보험 | N | N | <NA> | <NA> | <NA> | 20232 | 서울특별시 노원구 상계로9길 39
4 | 서울특별시교육청 | 북부교육지원청 | 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3 | 서울상계초등학교병설유치원 | 공립(병설) | 통학버스 책임보험 | N | N | <NA> | <NA> | <NA> | 20232 | 서울특별시 노원구 상계로9길 39
5 | 서울특별시교육청 | 북부교육지원청 | 0c4bc461-9f36-4d28-94e3-3dfb9c6cd4e3 | 서울상계초등학교병설유치원 | 공립(병설) | 화재보험 | Y | Y | 한국교육시설안전원 | <NA> | <NA> | 20232 | 서울특별시 노원구 상계로9길 39
6 | 서울특별시교육청 | 북부교육지원청 | 1ecec08c-ed86-b044-e053-0a32095ab044 | 한울유치원 | 사립(사인) | 가스배상책임보험 | Y | Y | 현대해상 | <NA> | <NA> | 20212 | 서울특별시 노원구 덕릉로122길 12-6
7 | 서울특별시교육청 | 북부교육지원청 | 1ecec08c-ed86-b044-e053-0a32095ab044 | 한울유치원 | 사립(사인) | 놀이시설 안전보험 | Y | Y | 현대해상 | <NA> | <NA> | 20212 | 서울특별시 노원구 덕릉로122길 12-6
8 | 서울특별시교육청 | 북부교육지원청 | 1ecec08c-ed86-b044-e053-0a32095ab044 | 한울유치원 | 사립(사인) | 영유아생명신체피해(상해보험) | Y | Y | 학교안전공제회 | 현대해상 | <NA> | 20212 | 서울특별시 노원구 덕릉로122길 12-6
9 | 서울특별시교육청 | 북부교육지원청 | 1ecec08c-ed86-b044-e053-0a32095ab044 | 한울유치원 | 사립(사인) | 통학버스 종합보험 | Y | Y | 현대해상 | <NA> | <NA> | 20212 | 서울특별시 노원구 덕릉로122길 12-6

Last rows

Row | 교육청명 | 교육지원청명 | 유치원코드 | 유치원명 | 설립유형 | 가입 보험명 | 대상여부 | 가입여부 | 업체명1 | 업체명2 | 업체명3 | 공시차수 | 주소
482 | 서울특별시교육청 | 북부교육지원청 | d2ef6fed-c1e4-417d-8794-73f74106491f | 서울계상초등학교병설유치원 | 공립(병설) | 영유아생명신체피해(상해보험) | Y | Y | 학교안전공제회 | <NA> | <NA> | 20232 | 서울특별시 노원구 한글비석로41가길 24
483 | 서울특별시교육청 | 북부교육지원청 | d2ef6fed-c1e4-417d-8794-73f74106491f | 서울계상초등학교병설유치원 | 공립(병설) | 통학버스 종합보험 | N | N | <NA> | <NA> | <NA> | 20232 | 서울특별시 노원구 한글비석로41가길 24
484 | 서울특별시교육청 | 북부교육지원청 | d2ef6fed-c1e4-417d-8794-73f74106491f | 서울계상초등학교병설유치원 | 공립(병설) | 통학버스 책임보험 | N | N | <NA> | <NA> | <NA> | 20232 | 서울특별시 노원구 한글비석로41가길 24
485 | 서울특별시교육청 | 북부교육지원청 | d2ef6fed-c1e4-417d-8794-73f74106491f | 서울계상초등학교병설유치원 | 공립(병설) | 화재보험 | Y | Y | 한국교육시설안전원 | <NA> | <NA> | 20232 | 서울특별시 노원구 한글비석로41가길 24
486 | 서울특별시교육청 | 북부교육지원청 | f989a1f7-1610-4bc9-8ead-ddb5fdfdfe1e | 서울태릉초등학교병설유치원 | 공립(병설) | 가스배상책임보험 | Y | Y | KB손해보험 | <NA> | <NA> | 20232 | 서울특별시 노원구 노원로1길 36
487 | 서울특별시교육청 | 북부교육지원청 | f989a1f7-1610-4bc9-8ead-ddb5fdfdfe1e | 서울태릉초등학교병설유치원 | 공립(병설) | 놀이시설 안전보험 | Y | Y | 한국교육시설안전원 | <NA> | <NA> | 20232 | 서울특별시 노원구 노원로1길 36
488 | 서울특별시교육청 | 북부교육지원청 | f989a1f7-1610-4bc9-8ead-ddb5fdfdfe1e | 서울태릉초등학교병설유치원 | 공립(병설) | 영유아생명신체피해(상해보험) | Y | Y | 학교안전공제회 | <NA> | <NA> | 20232 | 서울특별시 노원구 노원로1길 36
489 | 서울특별시교육청 | 북부교육지원청 | f989a1f7-1610-4bc9-8ead-ddb5fdfdfe1e | 서울태릉초등학교병설유치원 | 공립(병설) | 통학버스 종합보험 | N | N | <NA> | <NA> | <NA> | 20232 | 서울특별시 노원구 노원로1길 36
490 | 서울특별시교육청 | 북부교육지원청 | f989a1f7-1610-4bc9-8ead-ddb5fdfdfe1e | 서울태릉초등학교병설유치원 | 공립(병설) | 통학버스 책임보험 | N | N | <NA> | <NA> | <NA> | 20232 | 서울특별시 노원구 노원로1길 36
491 | 서울특별시교육청 | 북부교육지원청 | f989a1f7-1610-4bc9-8ead-ddb5fdfdfe1e | 서울태릉초등학교병설유치원 | 공립(병설) | 화재보험 | Y | Y | 한국교육시설안전원 | <NA> | <NA> | 20232 | 서울특별시 노원구 노원로1길 36
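
One pattern visible in the sample rows is that 업체명1 is `<NA>` exactly when 가입여부 is N. The sketch below checks this on rows 0 to 9 only, using (가입여부, 업체명1) pairs copied from the sample; it does not establish the rule for the full dataset, although the overall counts (78 rows with 가입여부 False and 78 missing values in 업체명1) are consistent with it.

```python
# (가입여부, 업체명1) pairs for rows 0-9 of the sample; None stands for <NA>.
head_rows = [
    ("Y", "KB손해보험"),
    ("Y", "한국교육시설안전원"),
    ("Y", "학교안전공제회"),
    ("N", None),
    ("N", None),
    ("Y", "한국교육시설안전원"),
    ("Y", "현대해상"),
    ("Y", "현대해상"),
    ("Y", "학교안전공제회"),
    ("Y", "현대해상"),
]

# A row is consistent when the insurer is missing if and only if the
# kindergarten is not enrolled in that insurance.
consistent = [
    (enrolled == "N") == (insurer is None) for enrolled, insurer in head_rows
]
missing_in_head = sum(insurer is None for _, insurer in head_rows)

print(all(consistent))
print(missing_in_head)
```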