Dataset statistics
Number of variables | 14 |
---|---|
Number of observations | 705 |
Missing cells | 1776 |
Missing cells (%) | 18.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 78.6 KiB |
Average record size in memory | 114.2 B |
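The missing-cell percentage and the average record size follow from the raw counts in this table. A minimal sketch of that arithmetic, using only figures quoted in the report; the function names are illustrative and are not part of ydata-profiling:

```python
def missing_cell_share(missing_cells: int, n_vars: int, n_obs: int) -> float:
    """Fraction of all cells (variables x observations) that are missing."""
    return missing_cells / (n_vars * n_obs)


def average_record_size_bytes(total_kib: float, n_obs: int) -> float:
    """Average in-memory size of one row in bytes (1 KiB = 1024 B)."""
    return total_kib * 1024 / n_obs


# Figures copied from the "Dataset statistics" table.
share = missing_cell_share(missing_cells=1776, n_vars=14, n_obs=705)
record = average_record_size_bytes(total_kib=78.6, n_obs=705)

print(f"Missing cells: {share:.1%}")            # Missing cells: 18.0%
print(f"Average record size: {record:.1f} B")   # Average record size: 114.2 B
```

The 1776 missing cells also equal the sum of the per-variable missing counts reported further down (353 + 353 + 354 + 705 + 3 + 3 + 5).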
Variable types
Text | 6 |
---|---|
Categorical | 5 |
Numeric | 1 |
Boolean | 1 |
Unsupported | 1 |
Dataset
Description | 2021-01-05 |
---|---|
Author | Busan City Public Data Portal (부산시공공데이터포털) |
URL | https://bigdata.busan.go.kr/data/bigDataDetailView.do?menuCode=M00000000007&hdfs_file_sn=20230901054001157000 |
Alerts
apr_at has constant value "" | Constant |
gubun is highly overall correlated with data_day and 1 other fields | High correlation |
school_kind is highly overall correlated with data_day and 1 other fields | High correlation |
last_load_dttm is highly overall correlated with lng and 4 other fields | High correlation |
data_day is highly overall correlated with lng and 4 other fields | High correlation |
inst_center is highly overall correlated with data_day and 1 other fields | High correlation |
lng is highly overall correlated with data_day and 1 other fields | High correlation |
gubun is highly imbalanced (51.0%) | Imbalance |
school_kind is highly imbalanced (62.4%) | Imbalance |
last_load_dttm is highly imbalanced (93.9%) | Imbalance |
tel has 353 (50.1%) missing values | Missing |
school_addr has 353 (50.1%) missing values | Missing |
apr_at has 354 (50.2%) missing values | Missing |
instt_code has 705 (100.0%) missing values | Missing |
skey has unique values | Unique |
instt_code is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
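The imbalance percentages in the alerts above can be reproduced from the class counts listed in each variable's Common Values table if the score is taken to be one minus the normalized Shannon entropy. That reading is an assumption checked only against the three figures in this report; the helper below is a sketch, not the library's own code:

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k) for a list of class counts.

    H is the Shannon entropy of the class frequencies and k the number of
    classes: 0 means perfectly balanced, values near 1 mean one class dominates.
    """
    counts = [c for c in counts if c > 0]
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # convention chosen for this sketch
    entropy = -sum((c / total) * math.log(c / total) for c in counts)
    return 1.0 - entropy / math.log(k)


# Class counts copied from the "Common Values" tables in this report.
gubun = [351, 273, 52, 22, 2, 2, 1, 1, 1]
school_kind = [472, 203, 14, 6, 5, 2, 1, 1, 1]
last_load_dttm = [700, 5]

for name, counts in [("gubun", gubun), ("school_kind", school_kind),
                     ("last_load_dttm", last_load_dttm)]:
    print(f"{name}: {imbalance_score(counts):.1%}")
# gubun: 51.0%
# school_kind: 62.4%
# last_load_dttm: 93.9%
```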
Reproduction
Analysis started | 2024-04-20 19:17:46.321594 |
---|---|
Analysis finished | 2024-04-20 19:17:49.067247 |
Duration | 2.75 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
skey
Text
UNIQUE
 
Distinct | 705 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
Value | Count | Frequency (%) |
4542 | 1 | 0.1% |
4143 | 1 | 0.1% |
4077 | 1 | 0.1% |
4066 | 1 | 0.1% |
4058 | 1 | 0.1% |
4059 | 1 | 0.1% |
4060 | 1 | 0.1% |
4061 | 1 | 0.1% |
4062 | 1 | 0.1% |
4063 | 1 | 0.1% |
Other values (698) | 698 |
Most occurring characters
Value | Count | Frequency (%) |
4 | 834 | 29.3% |
3 | 349 | 12.3% |
9 | 242 | 8.5% |
2 | 241 | 8.5% |
0 | 240 | 8.4% |
1 | 240 | 8.4% |
5 | 235 | 8.3% |
8 | 148 | 5.2% |
7 | 140 | 4.9% |
6 | 140 | 4.9% |
Other values (31) | 36 | 1.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2809 | 98.7% |
Other Letter | 28 | 1.0% |
Space Separator | 3 | 0.1% |
Other Punctuation | 2 | 0.1% |
Open Punctuation | 1 | < 0.1% |
Uppercase Letter | 1 | < 0.1% |
Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
영 | 2 | 7.1% |
부 | 2 | 7.1% |
동 | 2 | 7.1% |
이 | 1 | 3.6% |
로 | 1 | 3.6% |
어 | 1 | 3.6% |
린 | 1 | 3.6% |
집 | 1 | 3.6% |
신 | 1 | 3.6% |
호 | 1 | 3.6% |
Other values (15) | 15 |
Decimal Number
Value | Count | Frequency (%) |
4 | 834 | |
3 | 349 | |
9 | 242 | 8.6% |
2 | 241 | 8.6% |
0 | 240 | 8.5% |
1 | 240 | 8.5% |
5 | 235 | 8.4% |
8 | 148 | 5.3% |
7 | 140 | 5.0% |
6 | 140 | 5.0% |
Other Punctuation
Value | Count | Frequency (%) |
, | 1 | |
@ | 1 |
Space Separator
Value | Count | Frequency (%) |
3 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2816 | |
Hangul | 28 | 1.0% |
Latin | 1 | < 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
영 | 2 | 7.1% |
부 | 2 | 7.1% |
동 | 2 | 7.1% |
이 | 1 | 3.6% |
로 | 1 | 3.6% |
어 | 1 | 3.6% |
린 | 1 | 3.6% |
집 | 1 | 3.6% |
신 | 1 | 3.6% |
호 | 1 | 3.6% |
Other values (15) | 15 |
Common
Value | Count | Frequency (%) |
4 | 834 | |
3 | 349 | |
9 | 242 | 8.6% |
2 | 241 | 8.6% |
0 | 240 | 8.5% |
1 | 240 | 8.5% |
5 | 235 | 8.3% |
8 | 148 | 5.3% |
7 | 140 | 5.0% |
6 | 140 | 5.0% |
Other values (5) | 7 | 0.2% |
Latin
Value | Count | Frequency (%) |
S | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2817 | |
Hangul | 28 | 1.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
4 | 834 | |
3 | 349 | |
9 | 242 | 8.6% |
2 | 241 | 8.6% |
0 | 240 | 8.5% |
1 | 240 | 8.5% |
5 | 235 | 8.3% |
8 | 148 | 5.3% |
7 | 140 | 5.0% |
6 | 140 | 5.0% |
Other values (6) | 8 | 0.3% |
Hangul
Value | Count | Frequency (%) |
영 | 2 | 7.1% |
부 | 2 | 7.1% |
동 | 2 | 7.1% |
이 | 1 | 3.6% |
로 | 1 | 3.6% |
어 | 1 | 3.6% |
린 | 1 | 3.6% |
집 | 1 | 3.6% |
신 | 1 | 3.6% |
호 | 1 | 3.6% |
Other values (15) | 15 |
gubun
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 9 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.9163121 |
Min length | 2 |
Unique
Unique | 3 |
---|---|
Unique (%) | 0.4% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 351 | 49.8% |
어린이집 | 273 | 38.7% |
유치원 | 52 | 7.4% |
초등학교 | 22 | 3.1% |
기존 | 2 | 0.3% |
고등학교 | 2 | 0.3% |
40 | 1 | 0.1% |
특수학교 | 1 | 0.1% |
중학교 | 1 | 0.1% |
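The profile reports 0 missing values for gubun, yet its most common value is the literal string `<NA>` (351 of 705 rows). One plausible reading is that missing entries were serialized as text before profiling, so they were counted as a category. A sketch of how the effective missing share could be recounted; the placeholder set is an assumption, and the list is rebuilt from the counts in the table above rather than read from the source file:

```python
# Assumed spelling of the "no value" placeholder seen in this report.
PLACEHOLDERS = {"<NA>"}


def effective_missing(values):
    """Count entries that are None or an (assumed) placeholder string."""
    return sum(1 for v in values if v is None or v.strip() in PLACEHOLDERS)


# gubun values reconstructed from the Common Values counts.
gubun_counts = {
    "<NA>": 351, "어린이집": 273, "유치원": 52, "초등학교": 22, "기존": 2,
    "고등학교": 2, "40": 1, "특수학교": 1, "중학교": 1,
}
gubun_values = [value for value, n in gubun_counts.items() for _ in range(n)]

missing = effective_missing(gubun_values)
print(missing, f"{missing / len(gubun_values):.1%}")  # 351 49.8%
```

The same check would apply to inst_center, data_day, school_kind and last_load_dttm, which also list `<NA>` among their values.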
school_name
Text
Distinct | 349 |
---|---|
Distinct (%) | 49.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
Value | Count | Frequency (%) |
어린이집 | 17 | 2.3% |
동심어린이집 | 4 | 0.5% |
늘푸른어린이집 | 4 | 0.5% |
유치원 | 4 | 0.5% |
병설유치원 | 4 | 0.5% |
한솔어린이집 | 4 | 0.5% |
한마음어린이집 | 4 | 0.5% |
꿈나무유치원 | 4 | 0.5% |
꿈동산어린이집 | 4 | 0.5% |
큰나무어린이집 | 4 | 0.5% |
Other values (346) | 685 |
Most occurring characters
Value | Count | Frequency (%) |
이 | 570 | 11.2% |
린 | 550 | 10.8% |
어 | 544 | 10.7% |
집 | 544 | 10.7% |
원 | 132 | 2.6% |
유 | 106 | 2.1% |
치 | 106 | 2.1% |
동 | 79 | 1.5% |
초 | 76 | 1.5% |
교 | 76 | 1.5% |
Other values (291) | 2320 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 4949 | |
Decimal Number | 50 | 1.0% |
Space Separator | 48 | 0.9% |
Uppercase Letter | 46 | 0.9% |
Lowercase Letter | 4 | 0.1% |
Other Punctuation | 4 | 0.1% |
Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 570 | 11.5% |
린 | 550 | 11.1% |
어 | 544 | 11.0% |
집 | 544 | 11.0% |
원 | 132 | 2.7% |
유 | 106 | 2.1% |
치 | 106 | 2.1% |
동 | 79 | 1.6% |
초 | 76 | 1.5% |
교 | 76 | 1.5% |
Other values (267) | 2166 |
Decimal Number
Value | Count | Frequency (%) |
2 | 14 | |
1 | 12 | |
4 | 8 | |
5 | 5 | 10.0% |
6 | 3 | 6.0% |
3 | 3 | 6.0% |
7 | 2 | 4.0% |
0 | 2 | 4.0% |
8 | 1 | 2.0% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 10 | |
G | 10 | |
K | 6 | |
B | 6 | |
I | 4 | 8.7% |
C | 4 | 8.7% |
F | 2 | 4.3% |
R | 2 | 4.3% |
A | 2 | 4.3% |
Lowercase Letter
Value | Count | Frequency (%) |
k | 2 | |
s | 2 |
Other Punctuation
Value | Count | Frequency (%) |
? | 2 | |
! | 2 |
Space Separator
Value | Count | Frequency (%) |
48 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 4946 | |
Common | 104 | 2.0% |
Latin | 50 | 1.0% |
Han | 3 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 570 | 11.5% |
린 | 550 | 11.1% |
어 | 544 | 11.0% |
집 | 544 | 11.0% |
원 | 132 | 2.7% |
유 | 106 | 2.1% |
치 | 106 | 2.1% |
동 | 79 | 1.6% |
초 | 76 | 1.5% |
교 | 76 | 1.5% |
Other values (264) | 2163 |
Common
Value | Count | Frequency (%) |
48 | ||
2 | 14 | 13.5% |
1 | 12 | 11.5% |
4 | 8 | 7.7% |
5 | 5 | 4.8% |
6 | 3 | 2.9% |
3 | 3 | 2.9% |
7 | 2 | 1.9% |
- | 2 | 1.9% |
0 | 2 | 1.9% |
Other values (3) | 5 | 4.8% |
Latin
Value | Count | Frequency (%) |
L | 10 | |
G | 10 | |
K | 6 | |
B | 6 | |
I | 4 | 8.0% |
C | 4 | 8.0% |
k | 2 | 4.0% |
F | 2 | 4.0% |
s | 2 | 4.0% |
R | 2 | 4.0% |
Han
Value | Count | Frequency (%) |
恃 | 1 | |
訣 | 1 | |
低 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 4946 | |
ASCII | 154 | 3.0% |
CJK | 3 | 0.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 570 | 11.5% |
린 | 550 | 11.1% |
어 | 544 | 11.0% |
집 | 544 | 11.0% |
원 | 132 | 2.7% |
유 | 106 | 2.1% |
치 | 106 | 2.1% |
동 | 79 | 1.6% |
초 | 76 | 1.5% |
교 | 76 | 1.5% |
Other values (264) | 2163 |
ASCII
Value | Count | Frequency (%) |
48 | ||
2 | 14 | 9.1% |
1 | 12 | 7.8% |
L | 10 | 6.5% |
G | 10 | 6.5% |
4 | 8 | 5.2% |
K | 6 | 3.9% |
B | 6 | 3.9% |
5 | 5 | 3.2% |
I | 4 | 2.6% |
Other values (14) | 31 |
CJK
Value | Count | Frequency (%) |
恃 | 1 | |
訣 | 1 | |
低 | 1 |
student_num
Text
Distinct | 143 |
---|---|
Distinct (%) | 20.4% |
Missing | 3 |
Missing (%) | 0.4% |
Memory size | 5.6 KiB |
Value | Count | Frequency (%) |
20 | 34 | 4.8% |
19 | 18 | 2.5% |
45 | 16 | 2.3% |
46 | 16 | 2.3% |
65 | 16 | 2.3% |
60 | 14 | 2.0% |
49 | 14 | 2.0% |
16 | 14 | 2.0% |
40 | 13 | 1.8% |
33 | 12 | 1.7% |
Other values (137) | 539 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 236 | |
2 | 171 | |
3 | 171 | |
6 | 161 | |
4 | 159 | |
0 | 156 | |
5 | 147 | |
7 | 140 | |
8 | 113 | |
9 | 100 | |
Other values (20) | 28 | 1.8% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1554 | |
Other Letter | 16 | 1.0% |
Space Separator | 4 | 0.3% |
Other Punctuation | 4 | 0.3% |
Close Punctuation | 2 | 0.1% |
Open Punctuation | 2 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 2 | 12.5% |
부 | 1 | 6.2% |
호 | 1 | 6.2% |
동 | 1 | 6.2% |
리 | 1 | 6.2% |
관 | 1 | 6.2% |
로 | 1 | 6.2% |
단 | 1 | 6.2% |
구 | 1 | 6.2% |
신 | 1 | 6.2% |
Other values (5) | 5 |
Decimal Number
Value | Count | Frequency (%) |
1 | 236 | |
2 | 171 | |
3 | 171 | |
6 | 161 | |
4 | 159 | |
0 | 156 | |
5 | 147 | |
7 | 140 | |
8 | 113 | |
9 | 100 |
Other Punctuation
Value | Count | Frequency (%) |
, | 2 | |
. | 2 |
Space Separator
Value | Count | Frequency (%) |
4 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1566 | |
Hangul | 16 | 1.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 236 | |
2 | 171 | |
3 | 171 | |
6 | 161 | |
4 | 159 | |
0 | 156 | |
5 | 147 | |
7 | 140 | |
8 | 113 | |
9 | 100 | |
Other values (5) | 12 | 0.8% |
Hangul
Value | Count | Frequency (%) |
산 | 2 | 12.5% |
부 | 1 | 6.2% |
호 | 1 | 6.2% |
동 | 1 | 6.2% |
리 | 1 | 6.2% |
관 | 1 | 6.2% |
로 | 1 | 6.2% |
단 | 1 | 6.2% |
구 | 1 | 6.2% |
신 | 1 | 6.2% |
Other values (5) | 5 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1566 | |
Hangul | 16 | 1.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 236 | |
2 | 171 | |
3 | 171 | |
6 | 161 | |
4 | 159 | |
0 | 156 | |
5 | 147 | |
7 | 140 | |
8 | 113 | |
9 | 100 | |
Other values (5) | 12 | 0.8% |
Hangul
Value | Count | Frequency (%) |
산 | 2 | 12.5% |
부 | 1 | 6.2% |
호 | 1 | 6.2% |
동 | 1 | 6.2% |
리 | 1 | 6.2% |
관 | 1 | 6.2% |
로 | 1 | 6.2% |
단 | 1 | 6.2% |
구 | 1 | 6.2% |
신 | 1 | 6.2% |
Other values (5) | 5 |
tel
Text
MISSING
 
Distinct | 351 |
---|---|
Distinct (%) | 99.7% |
Missing | 353 |
Missing (%) | 50.1% |
Memory size | 5.6 KiB |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 12.039773 |
Min length | 10 |
Characters and Unicode
Total characters | 4238 |
---|---|
Distinct characters | 13 |
Distinct categories | 4 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 350 |
---|---|
Unique (%) | 99.4% |
Sample
1st row | 051-464-0570 |
---|---|
2nd row | 051-242-3583 |
3rd row | 051-244-3362 |
4th row | 051-620-6088 |
5th row | 051-256-1843 |
Value | Count | Frequency (%) |
051-325-7650 | 2 | 0.6% |
051-315-3580 | 2 | 0.6% |
051-532-3211 | 1 | 0.3% |
051-646-2218 | 1 | 0.3% |
051-893-6723 | 1 | 0.3% |
051-894-5486 | 1 | 0.3% |
051-524-0014 | 1 | 0.3% |
051-507-2005 | 1 | 0.3% |
051-524-2340 | 1 | 0.3% |
051-501-4321 | 1 | 0.3% |
Other values (340) | 340 |
Most occurring characters
Value | Count | Frequency (%) |
- | 700 | |
5 | 625 | |
0 | 622 | |
1 | 603 | |
2 | 317 | |
3 | 282 | |
7 | 248 | 5.9% |
4 | 247 | 5.8% |
6 | 234 | 5.5% |
8 | 194 | 4.6% |
Other values (3) | 166 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 3524 | |
Dash Punctuation | 700 | 16.5% |
Space Separator | 12 | 0.3% |
Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 625 | |
0 | 622 | |
1 | 603 | |
2 | 317 | |
3 | 282 | |
7 | 248 | 7.0% |
4 | 247 | 7.0% |
6 | 234 | 6.6% |
8 | 194 | 5.5% |
9 | 152 | 4.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 700 |
Space Separator
Value | Count | Frequency (%) |
12 |
Other Punctuation
Value | Count | Frequency (%) |
. | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4238 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 700 | |
5 | 625 | |
0 | 622 | |
1 | 603 | |
2 | 317 | |
3 | 282 | |
7 | 248 | 5.9% |
4 | 247 | 5.8% |
6 | 234 | 5.5% |
8 | 194 | 4.6% |
Other values (3) | 166 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4238 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 700 | |
5 | 625 | |
0 | 622 | |
1 | 603 | |
2 | 317 | |
3 | 282 | |
7 | 248 | 5.9% |
4 | 247 | 5.8% |
6 | 234 | 5.5% |
8 | 194 | 4.6% |
Other values (3) | 166 | 3.9% |
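The character counts above show 12 spaces and 2 dots among the tel values, while the five sample rows all follow the form `051-464-0570`. A sketch of a format check and a separator cleanup; the pattern is an assumption inferred from those samples only, and the non-conforming inputs in the usage lines are hypothetical:

```python
import re

# Assumed canonical form: area code, 3-4 digit exchange, 4 digit line number.
TEL_PATTERN = re.compile(r"^0\d{1,2}-\d{3,4}-\d{4}$")


def is_canonical_tel(value: str) -> bool:
    """True if the value already matches the assumed canonical form."""
    return bool(TEL_PATTERN.match(value))


def normalize_tel(value: str) -> str:
    """Collapse runs of spaces, dots and hyphens into single hyphens."""
    return re.sub(r"[\s.\-]+", "-", value.strip())


print(is_canonical_tel("051-464-0570"))                 # True
print(is_canonical_tel("051.464.0570"))                 # False
print(normalize_tel(" 051 - 464 . 0570 "))              # 051-464-0570
print(is_canonical_tel(normalize_tel("051 464 0570")))  # True
```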
school_addr
Text
MISSING
 
Distinct | 344 |
---|---|
Distinct (%) | 97.7% |
Missing | 353 |
Missing (%) | 50.1% |
Memory size | 5.6 KiB |
Length
Max length | 47 |
---|---|
Median length | 39 |
Mean length | 23.360795 |
Min length | 10 |
Characters and Unicode
Total characters | 8223 |
---|---|
Distinct characters | 281 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 336 |
---|---|
Unique (%) | 95.5% |
Sample
1st row | 부산광역시 중구 망양로 309 |
---|---|
2nd row | 부산광역시 중구 망양로 319번길 |
3rd row | 부산광역시 중구 흑교로 31번길 34-1 |
4th row | 부산광역시 중구 충장대로 20, 별관 2층 |
5th row | 부산광역시 중구 고가길 40 |
Value | Count | Frequency (%) |
부산광역시 | 347 | 19.7% |
북구 | 35 | 2.0% |
해운대구 | 32 | 1.8% |
금정구 | 29 | 1.6% |
영도구 | 29 | 1.6% |
사상구 | 29 | 1.6% |
기장군 | 28 | 1.6% |
사하구 | 23 | 1.3% |
남구 | 22 | 1.2% |
동구 | 21 | 1.2% |
Other values (693) | 1167 |
Most occurring characters
Value | Count | Frequency (%) |
1429 | 17.4% | |
산 | 390 | 4.7% |
1 | 382 | 4.6% |
부 | 374 | 4.5% |
시 | 362 | 4.4% |
광 | 355 | 4.3% |
역 | 347 | 4.2% |
로 | 333 | 4.0% |
구 | 332 | 4.0% |
2 | 186 | 2.3% |
Other values (271) | 3733 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 5092 | |
Decimal Number | 1549 | 18.8% |
Space Separator | 1429 | 17.4% |
Dash Punctuation | 63 | 0.8% |
Open Punctuation | 31 | 0.4% |
Close Punctuation | 31 | 0.4% |
Other Punctuation | 22 | 0.3% |
Uppercase Letter | 3 | < 0.1% |
Lowercase Letter | 3 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 390 | 7.7% |
부 | 374 | 7.3% |
시 | 362 | 7.1% |
광 | 355 | 7.0% |
역 | 347 | 6.8% |
로 | 333 | 6.5% |
구 | 332 | 6.5% |
길 | 178 | 3.5% |
번 | 159 | 3.1% |
동 | 135 | 2.7% |
Other values (249) | 2127 |
Decimal Number
Value | Count | Frequency (%) |
1 | 382 | |
2 | 186 | |
3 | 181 | |
0 | 168 | |
4 | 138 | 8.9% |
6 | 123 | 7.9% |
5 | 111 | 7.2% |
7 | 96 | 6.2% |
9 | 82 | 5.3% |
8 | 82 | 5.3% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 1 | |
G | 1 | |
L | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1 | |
k | 1 | |
s | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 19 | |
@ | 3 | 13.6% |
Space Separator
Value | Count | Frequency (%) |
1429 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 63 |
Open Punctuation
Value | Count | Frequency (%) |
( | 31 |
Close Punctuation
Value | Count | Frequency (%) |
) | 31 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 5092 | |
Common | 3125 | |
Latin | 6 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 390 | 7.7% |
부 | 374 | 7.3% |
시 | 362 | 7.1% |
광 | 355 | 7.0% |
역 | 347 | 6.8% |
로 | 333 | 6.5% |
구 | 332 | 6.5% |
길 | 178 | 3.5% |
번 | 159 | 3.1% |
동 | 135 | 2.7% |
Other values (249) | 2127 |
Common
Value | Count | Frequency (%) |
1429 | ||
1 | 382 | 12.2% |
2 | 186 | 6.0% |
3 | 181 | 5.8% |
0 | 168 | 5.4% |
4 | 138 | 4.4% |
6 | 123 | 3.9% |
5 | 111 | 3.6% |
7 | 96 | 3.1% |
9 | 82 | 2.6% |
Other values (6) | 229 | 7.3% |
Latin
Value | Count | Frequency (%) |
A | 1 | |
e | 1 | |
k | 1 | |
s | 1 | |
G | 1 | |
L | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 5092 | |
ASCII | 3131 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1429 | ||
1 | 382 | 12.2% |
2 | 186 | 5.9% |
3 | 181 | 5.8% |
0 | 168 | 5.4% |
4 | 138 | 4.4% |
6 | 123 | 3.9% |
5 | 111 | 3.5% |
7 | 96 | 3.1% |
9 | 82 | 2.6% |
Other values (12) | 235 | 7.5% |
Hangul
Value | Count | Frequency (%) |
산 | 390 | 7.7% |
부 | 374 | 7.3% |
시 | 362 | 7.1% |
광 | 355 | 7.0% |
역 | 347 | 6.8% |
로 | 333 | 6.5% |
구 | 332 | 6.5% |
길 | 178 | 3.5% |
번 | 159 | 3.1% |
동 | 135 | 2.7% |
Other values (249) | 2127 |
school_kind
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 9 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.0765957 |
Min length | 2 |
Unique
Unique | 3 |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 기존 |
---|---|
2nd row | 기존 |
3rd row | 기존 |
4th row | 기존 |
5th row | 기존 |
Common Values
Value | Count | Frequency (%) |
기존 | 472 | 67.0% |
신규 | 203 | 28.8% |
기존 | 14 | 2.0% |
신규 | 6 | 0.9% |
<NA> | 5 | 0.7% |
기존) | 2 | 0.3% |
가존 | 1 | 0.1% |
기존 | 1 | 0.1% |
신규 | 1 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
기존 | 489 | |
신규 | 210 | |
na | 5 | 0.7% |
가존 | 1 | 0.1% |
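The school_kind tables list 기존 and 신규 several times as separate values, alongside `기존)` and `가존`. The repeated rows are presumably variants that differ only in surrounding whitespace, although the extracted report does not show the difference. A cleanup sketch under that assumption; the whitespace variants and the typo mapping below are hypothetical:

```python
import re
from collections import Counter

# Assumption: "가존" is a misspelling of "기존".
TYPO_FIXES = {"가존": "기존"}


def normalize_kind(value: str) -> str:
    """Drop whitespace and stray parentheses, then apply known typo fixes."""
    cleaned = re.sub(r"[\s()]+", "", value)
    return TYPO_FIXES.get(cleaned, cleaned)


# Raw values rebuilt from the Common Values counts; the exact whitespace
# in the variant spellings is hypothetical.
raw = (["기존"] * 472 + ["신규"] * 203 + ["기존 "] * 14 + ["신규 "] * 6
       + ["<NA>"] * 5 + ["기존)"] * 2 + ["가존", " 기존", " 신규"])

print(Counter(normalize_kind(v) for v in raw))
# Counter({'기존': 490, '신규': 210, '<NA>': 5})
```

After this cleanup the nine reported categories collapse to two real classes plus the `<NA>` placeholder.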
inst_center
Categorical
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 2.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
Length
Max length | 8 |
---|---|
Median length | 4 |
Mean length | 5.2312057 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 356 | 50.5% |
부산북구보건소 | 35 | 5.0% |
해운대구보건소 | 32 | 4.5% |
금정구보건소 | 29 | 4.1% |
사상구보건소 | 29 | 4.1% |
영도구보건소 | 29 | 4.1% |
기장군보건소 | 28 | 4.0% |
사하구보건소 | 23 | 3.3% |
부산남구보건소 | 22 | 3.1% |
부산동구보건소 | 21 | 3.0% |
Other values (7) | 101 | 14.3% |
lat
Text
Distinct | 462 |
---|---|
Distinct (%) | 65.8% |
Missing | 3 |
Missing (%) | 0.4% |
Memory size | 5.6 KiB |
Length
Max length | 19 |
---|---|
Median length | 11 |
Mean length | 10.193732 |
Min length | 4 |
Characters and Unicode
Total characters | 7156 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 243 |
---|---|
Unique (%) | 34.6% |
Sample
1st row | 35.07961019 |
---|---|
2nd row | 35.08991597 |
3rd row | 35.09202146 |
4th row | 35.0909773 |
5th row | 35.09400744 |
Value | Count | Frequency (%) |
35.148 | 6 | 0.9% |
35.18408371 | 4 | 0.6% |
35.15574257 | 4 | 0.6% |
35.13663383 | 4 | 0.6% |
35.14389408 | 4 | 0.6% |
35.127 | 3 | 0.4% |
35.268 | 3 | 0.4% |
35.24953561 | 3 | 0.4% |
35.18959425 | 3 | 0.4% |
35.20578417 | 3 | 0.4% |
Other values (453) | 667 |
Most occurring characters
Value | Count | Frequency (%) |
3 | 1185 | |
5 | 1129 | |
1 | 801 | |
. | 700 | |
2 | 640 | |
0 | 478 | |
7 | 476 | |
4 | 448 | 6.3% |
9 | 436 | 6.1% |
6 | 430 | 6.0% |
Other values (4) | 433 | 6.1% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 6446 | |
Other Punctuation | 704 | 9.8% |
Dash Punctuation | 4 | 0.1% |
Space Separator | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 1185 | |
5 | 1129 | |
1 | 801 | |
2 | 640 | |
0 | 478 | |
7 | 476 | |
4 | 448 | 7.0% |
9 | 436 | 6.8% |
6 | 430 | 6.7% |
8 | 423 | 6.6% |
Other Punctuation
Value | Count | Frequency (%) |
. | 700 | |
: | 4 | 0.6% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Space Separator
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 7156 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
3 | 1185 | |
5 | 1129 | |
1 | 801 | |
. | 700 | |
2 | 640 | |
0 | 478 | |
7 | 476 | |
4 | 448 | 6.3% |
9 | 436 | 6.1% |
6 | 430 | 6.0% |
Other values (4) | 433 | 6.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 7156 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3 | 1185 | |
5 | 1129 | |
1 | 801 | |
. | 700 | |
2 | 640 | |
0 | 478 | |
7 | 476 | |
4 | 448 | 6.3% |
9 | 436 | 6.1% |
6 | 430 | 6.0% |
Other values (4) | 433 | 6.1% |
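lat is profiled as Text rather than as a number. Its maximum length of 19 characters, together with the 4 colons, 4 hyphens and 2 spaces counted above, would be consistent with a couple of timestamp-like strings (for example `2021-01-05 14:02:45`) sitting in the column, though the report does not show the offending rows. A sketch of a tolerant parser that separates usable coordinates from everything else; the latitude bounds are an illustrative assumption:

```python
def parse_coordinate(text, low, high):
    """Return the value as float if it parses and lies in [low, high], else None."""
    try:
        value = float(text)
    except (TypeError, ValueError):
        return None
    return value if low <= value <= high else None


# Hypothetical bounds, roughly the latitude range of South Korea.
LAT_LOW, LAT_HIGH = 33.0, 39.0

samples = ["35.07961019", "35.148", "2021-01-05 14:02:45", None]
parsed = [parse_coordinate(s, LAT_LOW, LAT_HIGH) for s in samples]
print(parsed)  # [35.07961019, 35.148, None, None]
```

Rows that come back as `None` can then be inspected by hand instead of silently forcing the whole column to text.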
lng
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 466 |
---|---|
Distinct (%) | 66.6% |
Missing | 5 |
Missing (%) | 0.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 129.04377 |
Minimum | 123.423 |
---|---|
Maximum | 129.28265 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 6.3 KiB |
Quantile statistics
Minimum | 123.423 |
---|---|
5-th percentile | 128.96774 |
Q1 | 129.017 |
median | 129.06175 |
Q3 | 129.09871 |
95-th percentile | 129.1805 |
Maximum | 129.28265 |
Range | 5.8596453 |
Interquartile range (IQR) | 0.0817144 |
Descriptive statistics
Standard deviation | 0.26897566 |
---|---|
Coefficient of variation (CV) | 0.0020843754 |
Kurtosis | 308.38948 |
Mean | 129.04377 |
Median Absolute Deviation (MAD) | 0.0426522 |
Skewness | -16.299949 |
Sum | 90330.641 |
Variance | 0.072347908 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
129.108 | 4 | 0.6% |
129.0650406 | 4 | 0.6% |
128.9886692 | 4 | 0.6% |
129.0802187 | 4 | 0.6% |
128.9902706 | 4 | 0.6% |
129.2048929 | 3 | 0.4% |
129.1275334 | 3 | 0.4% |
129.213 | 3 | 0.4% |
129.0333375 | 3 | 0.4% |
129.0144 | 3 | 0.4% |
Other values (456) | 665 | 94.3% |
(Missing) | 5 | 0.7% |
Minimum 10 values
Value | Count | Frequency (%) |
123.423 | 1 | 0.1% |
125.7758087 | 1 | 0.1% |
127.008 | 1 | 0.1% |
128.503 | 1 | 0.1% |
128.531 | 1 | 0.1% |
128.661 | 1 | 0.1% |
128.7402002 | 1 | 0.1% |
128.816 | 1 | 0.1% |
128.83574 | 1 | 0.1% |
128.8513166 | 1 | 0.1% |
Maximum 10 values
Value | Count | Frequency (%) |
129.2826453 | 1 | 0.1% |
129.258043 | 1 | 0.1% |
129.243607 | 2 | 0.3% |
129.2238676 | 1 | 0.1% |
129.223 | 1 | 0.1% |
129.2163239 | 2 | 0.3% |
129.2154125 | 2 | 0.3% |
129.2152822 | 1 | 0.1% |
129.213 | 3 | 0.4% |
129.2127674 | 2 | 0.3% |
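The lng statistics point to a few values far from the bulk of the data: the interquartile range is about 0.08 degrees, yet the minimum sits more than 5 degrees below the median, and skewness and kurtosis are extreme. The report itself does not flag outliers; the sketch below applies Tukey's 1.5 x IQR rule, a technique brought in here for illustration, to the quartiles and extreme values quoted in this section:

```python
def tukey_fences(q1: float, q3: float, k: float = 1.5):
    """Return (lower, upper) fences at k * IQR beyond the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Q1 and Q3 as reported under "Quantile statistics".
low, high = tukey_fences(129.017, 129.09871)

# Extreme values copied from the smallest and largest values listed for lng.
smallest = [123.423, 125.7758087, 127.008, 128.503, 128.531,
            128.661, 128.7402002, 128.816, 128.83574, 128.8513166]
largest = [129.2826453, 129.258043, 129.243607, 129.2238676, 129.223,
           129.2163239, 129.2154125, 129.2152822, 129.213, 129.2127674]

below = [v for v in smallest if v < low]
above = [v for v in largest if v > high]
print(f"fences: {low:.4f} .. {high:.4f}")  # fences: 128.8944 .. 129.2213
print(len(below), len(above))              # 10 5
```

Whether the flagged points are data-entry errors or genuine locations cannot be decided from the profile alone; the rule only marks candidates for review.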
data_day
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
Length
Max length | 10 |
---|---|
Median length | 4 |
Mean length | 6.9702128 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | <NA> |
---|---|
2nd row | <NA> |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 356 | 50.5% |
2020-06-30 | 349 | 49.5% |
apr_at
Boolean
CONSTANT
  MISSING
 
Distinct | 1 |
---|---|
Distinct (%) | 0.3% |
Missing | 354 |
Missing (%) | 50.2% |
Memory size | 1.5 KiB |
Value | Count | Frequency (%) |
False | 351 | 49.8% |
(Missing) | 354 | 50.2% |
instt_code
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 705 |
---|---|
Missing (%) | 100.0% |
Memory size | 6.3 KiB |
last_load_dttm
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 5.6 KiB |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 18.893617 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-01-05 14:02:45 |
---|---|
2nd row | 2021-01-05 14:02:45 |
3rd row | 2021-01-05 14:02:45 |
4th row | 2021-01-05 14:02:45 |
5th row | 2021-01-05 14:02:45 |
Common Values
Value | Count | Frequency (%) |
2021-01-05 14:02:45 | 700 | 99.3% |
<NA> | 5 | 0.7% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021-01-05 | 700 | |
14:02:45 | 700 | |
na | 5 | 0.4% |
Correlations
Variable | gubun | school_kind | inst_center | lng |
---|---|---|---|---|
gubun | 1.000 | 0.000 | 0.206 | 0.000 |
school_kind | 0.000 | 1.000 | 0.326 | 0.000 |
inst_center | 0.206 | 0.326 | 1.000 | 0.000 |
lng | 0.000 | 0.000 | 0.000 | 1.000 |
Variable | gubun | school_kind | last_load_dttm | data_day | inst_center |
---|---|---|---|---|---|
gubun | 1.000 | 0.000 | 1.000 | 1.000 | 0.098 |
school_kind | 0.000 | 1.000 | 1.000 | 1.000 | 0.160 |
last_load_dttm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
data_day | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
inst_center | 0.098 | 0.160 | 1.000 | 1.000 | 1.000 |
Variable | lng | gubun | school_kind | inst_center | data_day | last_load_dttm |
---|---|---|---|---|---|---|
lng | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
gubun | 0.000 | 1.000 | 0.000 | 0.098 | 1.000 | 1.000 |
school_kind | 0.000 | 0.000 | 1.000 | 0.160 | 1.000 | 1.000 |
inst_center | 0.000 | 0.098 | 0.160 | 1.000 | 1.000 | 1.000 |
data_day | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
last_load_dttm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
First rows
Index | skey | gubun | school_name | student_num | tel | school_addr | school_kind | inst_center | lat | lng | data_day | apr_at | instt_code | last_load_dttm |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 4542 | <NA> | 영선어린이집 | 60 | <NA> | <NA> | 기존 | <NA> | 35.07961019 | 129.046556 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
1 | 4543 | <NA> | 영지어린이집 | 75 | <NA> | <NA> | 기존 | <NA> | 35.08991597 | 129.057655 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
2 | 4544 | <NA> | 와치어린이집 | 62 | <NA> | <NA> | 기존 | <NA> | 35.09202146 | 129.057057 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
3 | 4545 | <NA> | 원광어린이집 | 46 | <NA> | <NA> | 기존 | <NA> | 35.0909773 | 129.067166 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
4 | 4546 | <NA> | 은혜어린이집 | 33 | <NA> | <NA> | 기존 | <NA> | 35.09400744 | 129.052586 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
5 | 4547 | <NA> | 자비유치원 | 69 | <NA> | <NA> | 기존 | <NA> | 35.08639435 | 129.064529 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
6 | 4548 | <NA> | 절영어린이집 | 84 | <NA> | <NA> | 기존 | <NA> | 35.07221265 | 129.061746 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
7 | 4549 | <NA> | 지성어린이집 | 90 | <NA> | <NA> | 기존 | <NA> | 35.06688592 | 129.079021 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
8 | 4550 | <NA> | 큰나무어린이집 | 35 | <NA> | <NA> | 기존 | <NA> | 35.09401683 | 129.046302 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
9 | 4551 | <NA> | 해돋이어린이집 | 85 | <NA> | <NA> | 기존 | <NA> | 35.09335357 | 129.040634 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
Last rows
Index | skey | gubun | school_name | student_num | tel | school_addr | school_kind | inst_center | lat | lng | data_day | apr_at | instt_code | last_load_dttm |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
695 | 4307 | <NA> | 해누리유치원 | 119 | <NA> | <NA> | 기존 | <NA> | 35.32925993 | 129.171977 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
696 | 4308 | <NA> | 행복엔젤유치원 | 46 | <NA> | <NA> | 기존 | <NA> | 35.23712301 | 129.210353 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
697 | 4309 | <NA> | 유니어린이집 | 30 | <NA> | <NA> | 신규 | <NA> | 35.11638703 | 129.10936 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
698 | 4310 | <NA> | 21세기영아전담어린이집 | 36 | <NA> | <NA> | 신규 | <NA> | 35.12232511 | 129.081706 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
699 | 4311 | <NA> | BIFC어린이집 | 96 | <NA> | <NA> | 기존 | <NA> | 35.14648787 | 129.065861 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
700 | 4312 | <NA> | LG메트로시티어린이집 | 60 | <NA> | <NA> | 신규 | <NA> | 35.12692281 | 129.109525 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
701 | 4313 | <NA> | LG삐아제어린이집 | 14 | <NA> | <NA> | 신규 | <NA> | 35.12883325 | 129.108596 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
702 | 4314 | <NA> | 경성어린이집 | 71 | <NA> | <NA> | 기존 | <NA> | 35.12717204 | 129.074912 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
703 | 4315 | <NA> | 공립감만어린이집 | 78 | <NA> | <NA> | 기존 | <NA> | 35.11651132 | 129.08571 | <NA> | N | <NA> | 2021-01-05 14:02:45 |
704 | 4316 | <NA> | 대천유치원 | 136 | <NA> | <NA> | 신규 | <NA> | 35.13088489 | 129.0991 | <NA> | N | <NA> | 2021-01-05 14:02:45 |