Dataset statistics
Number of variables | 14 |
---|---|
Number of observations | 354 |
Missing cells | 723 |
Missing cells (%) | 14.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 39.9 KiB |
Average record size in memory | 115.4 B |
Variable types
Text | 6 |
---|---|
Categorical | 5 |
Numeric | 1 |
Unsupported | 2 |
Dataset
Description | 2020-12-22 |
---|---|
Author | 부산시공공데이터포털 |
URL | https://bigdata.busan.go.kr/data/bigDataDetailView.do?menuCode=M00000000007&hdfs_file_sn=20230901054001157000 |
gubun is highly overall correlated with data_day and 1 other fields | High correlation |
school_kind is highly overall correlated with data_day and 1 other fields | High correlation |
last_load_dttm is highly overall correlated with lng and 4 other fields | High correlation |
data_day is highly overall correlated with lng and 4 other fields | High correlation |
inst_center is highly overall correlated with data_day and 1 other fields | High correlation |
lng is highly overall correlated with data_day and 1 other fields | High correlation |
gubun is highly imbalanced (63.3%) | Imbalance |
school_kind is highly imbalanced (60.8%) | Imbalance |
data_day is highly imbalanced (89.3%) | Imbalance |
last_load_dttm is highly imbalanced (89.3%) | Imbalance |
lng has 5 (1.4%) missing values | Missing |
apr_at has 354 (100.0%) missing values | Missing |
instt_code has 354 (100.0%) missing values | Missing |
skey has unique values | Unique |
apr_at is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
instt_code is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-04-20 19:16:12.973265 |
---|---|
Analysis finished | 2024-04-20 19:16:15.564958 |
Duration | 2.59 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
skey
Text
UNIQUE
 
Distinct | 354 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
Value | Count | Frequency (%) |
3892 | 1 | 0.3% |
3980 | 1 | 0.3% |
4000 | 1 | 0.3% |
3999 | 1 | 0.3% |
3998 | 1 | 0.3% |
3997 | 1 | 0.3% |
3996 | 1 | 0.3% |
3995 | 1 | 0.3% |
3994 | 1 | 0.3% |
4002 | 1 | 0.3% |
Other values (347) | 347 |
Most occurring characters
Value | Count | Frequency (%) |
4 | 311 | |
3 | 183 | |
1 | 175 | |
0 | 175 | |
9 | 173 | |
2 | 119 | 8.3% |
8 | 73 | 5.1% |
5 | 66 | 4.6% |
7 | 65 | 4.5% |
6 | 65 | 4.5% |
Other values (31) | 36 | 2.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1405 | |
Other Letter | 28 | 1.9% |
Space Separator | 3 | 0.2% |
Other Punctuation | 2 | 0.1% |
Uppercase Letter | 1 | 0.1% |
Close Punctuation | 1 | 0.1% |
Open Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
부 | 2 | 7.1% |
영 | 2 | 7.1% |
동 | 2 | 7.1% |
클 | 1 | 3.6% |
중 | 1 | 3.6% |
흥 | 1 | 3.6% |
움 | 1 | 3.6% |
래 | 1 | 3.6% |
스 | 1 | 3.6% |
프 | 1 | 3.6% |
Other values (15) | 15 |
Decimal Number
Value | Count | Frequency (%) |
4 | 311 | |
3 | 183 | |
1 | 175 | |
0 | 175 | |
9 | 173 | |
2 | 119 | 8.5% |
8 | 73 | 5.2% |
5 | 66 | 4.7% |
7 | 65 | 4.6% |
6 | 65 | 4.6% |
Other Punctuation
Value | Count | Frequency (%) |
@ | 1 | |
, | 1 |
Space Separator
Value | Count | Frequency (%) |
3 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1412 | |
Hangul | 28 | 1.9% |
Latin | 1 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
부 | 2 | 7.1% |
영 | 2 | 7.1% |
동 | 2 | 7.1% |
클 | 1 | 3.6% |
중 | 1 | 3.6% |
흥 | 1 | 3.6% |
움 | 1 | 3.6% |
래 | 1 | 3.6% |
스 | 1 | 3.6% |
프 | 1 | 3.6% |
Other values (15) | 15 |
Common
Value | Count | Frequency (%) |
4 | 311 | |
3 | 183 | |
1 | 175 | |
0 | 175 | |
9 | 173 | |
2 | 119 | 8.4% |
8 | 73 | 5.2% |
5 | 66 | 4.7% |
7 | 65 | 4.6% |
6 | 65 | 4.6% |
Other values (5) | 7 | 0.5% |
Latin
Value | Count | Frequency (%) |
S | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1413 | |
Hangul | 28 | 1.9% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
4 | 311 | |
3 | 183 | |
1 | 175 | |
0 | 175 | |
9 | 173 | |
2 | 119 | 8.4% |
8 | 73 | 5.2% |
5 | 66 | 4.7% |
7 | 65 | 4.6% |
6 | 65 | 4.6% |
Other values (6) | 8 | 0.6% |
Hangul
Value | Count | Frequency (%) |
부 | 2 | 7.1% |
영 | 2 | 7.1% |
동 | 2 | 7.1% |
클 | 1 | 3.6% |
중 | 1 | 3.6% |
흥 | 1 | 3.6% |
움 | 1 | 3.6% |
래 | 1 | 3.6% |
스 | 1 | 3.6% |
프 | 1 | 3.6% |
Other values (15) | 15 |
gubun
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 8 |
---|---|
Distinct (%) | 2.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
어린이집 | |
---|---|
유치원 | |
초등학교 | 22 |
기존 | 2 |
고등학교 | 2 |
Other values (3) | 3 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.8333333 |
Min length | 2 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 0.8% |
Sample
1st row | 어린이집 |
---|---|
2nd row | 어린이집 |
3rd row | 어린이집 |
4th row | 어린이집 |
5th row | 어린이집 |
Common Values
Value | Count | Frequency (%) |
어린이집 | 273 | |
유치원 | 52 | 14.7% |
초등학교 | 22 | 6.2% |
기존 | 2 | 0.6% |
고등학교 | 2 | 0.6% |
40 | 1 | 0.3% |
특수학교 | 1 | 0.3% |
중학교 | 1 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
어린이집 | 273 | |
유치원 | 52 | 14.7% |
초등학교 | 22 | 6.2% |
기존 | 2 | 0.6% |
고등학교 | 2 | 0.6% |
40 | 1 | 0.3% |
특수학교 | 1 | 0.3% |
중학교 | 1 | 0.3% |
school_name
Text
Distinct | 343 |
---|---|
Distinct (%) | 96.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
Value | Count | Frequency (%) |
어린이집 | 7 | 1.9% |
미래어린이집 | 2 | 0.5% |
늘푸른어린이집 | 2 | 0.5% |
한솔어린이집 | 2 | 0.5% |
병설유치원 | 2 | 0.5% |
동심어린이집 | 2 | 0.5% |
한마음어린이집 | 2 | 0.5% |
꿈동산어린이집 | 2 | 0.5% |
유치원 | 2 | 0.5% |
우신어린이집 | 2 | 0.5% |
Other values (338) | 343 |
Most occurring characters
Value | Count | Frequency (%) |
이 | 285 | 11.1% |
린 | 275 | 10.7% |
어 | 272 | 10.6% |
집 | 272 | 10.6% |
원 | 66 | 2.6% |
유 | 53 | 2.1% |
치 | 53 | 2.1% |
동 | 40 | 1.6% |
교 | 38 | 1.5% |
초 | 38 | 1.5% |
Other values (286) | 1171 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2483 | |
Decimal Number | 30 | 1.2% |
Uppercase Letter | 23 | 0.9% |
Space Separator | 22 | 0.9% |
Dash Punctuation | 2 | 0.1% |
Lowercase Letter | 2 | 0.1% |
Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 285 | 11.5% |
린 | 275 | 11.1% |
어 | 272 | 11.0% |
집 | 272 | 11.0% |
원 | 66 | 2.7% |
유 | 53 | 2.1% |
치 | 53 | 2.1% |
동 | 40 | 1.6% |
교 | 38 | 1.5% |
초 | 38 | 1.5% |
Other values (263) | 1091 |
Decimal Number
Value | Count | Frequency (%) |
2 | 8 | |
1 | 7 | |
4 | 4 | |
5 | 3 | 10.0% |
0 | 2 | 6.7% |
3 | 2 | 6.7% |
6 | 2 | 6.7% |
8 | 1 | 3.3% |
7 | 1 | 3.3% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 5 | |
G | 5 | |
K | 3 | |
B | 3 | |
I | 2 | 8.7% |
C | 2 | 8.7% |
F | 1 | 4.3% |
R | 1 | 4.3% |
A | 1 | 4.3% |
Lowercase Letter
Value | Count | Frequency (%) |
k | 1 | |
s | 1 |
Space Separator
Value | Count | Frequency (%) |
22 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2 |
Other Punctuation
Value | Count | Frequency (%) |
! | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 2483 | |
Common | 55 | 2.1% |
Latin | 25 | 1.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 285 | 11.5% |
린 | 275 | 11.1% |
어 | 272 | 11.0% |
집 | 272 | 11.0% |
원 | 66 | 2.7% |
유 | 53 | 2.1% |
치 | 53 | 2.1% |
동 | 40 | 1.6% |
교 | 38 | 1.5% |
초 | 38 | 1.5% |
Other values (263) | 1091 |
Common
Value | Count | Frequency (%) |
22 | ||
2 | 8 | 14.5% |
1 | 7 | 12.7% |
4 | 4 | 7.3% |
5 | 3 | 5.5% |
0 | 2 | 3.6% |
- | 2 | 3.6% |
3 | 2 | 3.6% |
6 | 2 | 3.6% |
8 | 1 | 1.8% |
Other values (2) | 2 | 3.6% |
Latin
Value | Count | Frequency (%) |
L | 5 | |
G | 5 | |
K | 3 | |
B | 3 | |
I | 2 | 8.0% |
C | 2 | 8.0% |
F | 1 | 4.0% |
R | 1 | 4.0% |
A | 1 | 4.0% |
k | 1 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 2483 | |
ASCII | 80 | 3.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
이 | 285 | 11.5% |
린 | 275 | 11.1% |
어 | 272 | 11.0% |
집 | 272 | 11.0% |
원 | 66 | 2.7% |
유 | 53 | 2.1% |
치 | 53 | 2.1% |
동 | 40 | 1.6% |
교 | 38 | 1.5% |
초 | 38 | 1.5% |
Other values (263) | 1091 |
ASCII
Value | Count | Frequency (%) |
22 | ||
2 | 8 | 10.0% |
1 | 7 | 8.8% |
L | 5 | 6.2% |
G | 5 | 6.2% |
4 | 4 | 5.0% |
K | 3 | 3.8% |
B | 3 | 3.8% |
5 | 3 | 3.8% |
I | 2 | 2.5% |
Other values (13) | 18 |
student_num
Text
Distinct | 140 |
---|---|
Distinct (%) | 39.9% |
Missing | 3 |
Missing (%) | 0.8% |
Memory size | 2.9 KiB |
Value | Count | Frequency (%) |
20 | 17 | 4.8% |
19 | 9 | 2.5% |
65 | 8 | 2.3% |
46 | 8 | 2.3% |
45 | 8 | 2.3% |
60 | 7 | 2.0% |
16 | 7 | 2.0% |
49 | 7 | 2.0% |
40 | 6 | 1.7% |
13 | 6 | 1.7% |
Other values (134) | 272 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 121 | |
2 | 86 | |
3 | 86 | |
6 | 81 | |
4 | 80 | |
0 | 79 | |
5 | 74 | |
7 | 70 | |
8 | 57 | |
9 | 51 | |
Other values (19) | 24 | 3.0% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 785 | |
Other Letter | 16 | 2.0% |
Space Separator | 4 | 0.5% |
Other Punctuation | 2 | 0.2% |
Open Punctuation | 1 | 0.1% |
Close Punctuation | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 2 | 12.5% |
리 | 1 | 6.2% |
로 | 1 | 6.2% |
관 | 1 | 6.2% |
동 | 1 | 6.2% |
호 | 1 | 6.2% |
단 | 1 | 6.2% |
시 | 1 | 6.2% |
신 | 1 | 6.2% |
구 | 1 | 6.2% |
Other values (5) | 5 |
Decimal Number
Value | Count | Frequency (%) |
1 | 121 | |
2 | 86 | |
3 | 86 | |
6 | 81 | |
4 | 80 | |
0 | 79 | |
5 | 74 | |
7 | 70 | |
8 | 57 | |
9 | 51 |
Space Separator
Value | Count | Frequency (%) |
4 |
Other Punctuation
Value | Count | Frequency (%) |
. | 2 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 793 | |
Hangul | 16 | 2.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 2 | 12.5% |
리 | 1 | 6.2% |
로 | 1 | 6.2% |
관 | 1 | 6.2% |
동 | 1 | 6.2% |
호 | 1 | 6.2% |
단 | 1 | 6.2% |
시 | 1 | 6.2% |
신 | 1 | 6.2% |
구 | 1 | 6.2% |
Other values (5) | 5 |
Common
Value | Count | Frequency (%) |
1 | 121 | |
2 | 86 | |
3 | 86 | |
6 | 81 | |
4 | 80 | |
0 | 79 | |
5 | 74 | |
7 | 70 | |
8 | 57 | |
9 | 51 | |
Other values (4) | 8 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 793 | |
Hangul | 16 | 2.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 121 | |
2 | 86 | |
3 | 86 | |
6 | 81 | |
4 | 80 | |
0 | 79 | |
5 | 74 | |
7 | 70 | |
8 | 57 | |
9 | 51 | |
Other values (4) | 8 | 1.0% |
Hangul
Value | Count | Frequency (%) |
산 | 2 | 12.5% |
리 | 1 | 6.2% |
로 | 1 | 6.2% |
관 | 1 | 6.2% |
동 | 1 | 6.2% |
호 | 1 | 6.2% |
단 | 1 | 6.2% |
시 | 1 | 6.2% |
신 | 1 | 6.2% |
구 | 1 | 6.2% |
Other values (5) | 5 |
tel
Text
Distinct | 351 |
---|---|
Distinct (%) | 99.7% |
Missing | 2 |
Missing (%) | 0.6% |
Memory size | 2.9 KiB |
Length
Max length | 13 |
---|---|
Median length | 12 |
Mean length | 12.039773 |
Min length | 10 |
Characters and Unicode
Total characters | 4238 |
---|---|
Distinct characters | 13 |
Distinct categories | 4 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 350 ? |
---|---|
Unique (%) | 99.4% |
Sample
1st row | 051-464-0570 |
---|---|
2nd row | 051-242-3583 |
3rd row | 051-244-3362 |
4th row | 051-620-6088 |
5th row | 051-256-1843 |
Value | Count | Frequency (%) |
051-325-7650 | 2 | 0.6% |
051-315-3580 | 2 | 0.6% |
051-532-3211 | 1 | 0.3% |
051-894-5486 | 1 | 0.3% |
051-464-0570 | 1 | 0.3% |
051-935-4258 | 1 | 0.3% |
051-524-0014 | 1 | 0.3% |
051-507-2005 | 1 | 0.3% |
051-524-2340 | 1 | 0.3% |
051-501-4321 | 1 | 0.3% |
Other values (340) | 340 |
Most occurring characters
Value | Count | Frequency (%) |
- | 700 | |
5 | 625 | |
0 | 622 | |
1 | 603 | |
2 | 317 | |
3 | 282 | |
7 | 248 | 5.9% |
4 | 247 | 5.8% |
6 | 234 | 5.5% |
8 | 194 | 4.6% |
Other values (3) | 166 | 3.9% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 3524 | |
Dash Punctuation | 700 | 16.5% |
Space Separator | 12 | 0.3% |
Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
5 | 625 | |
0 | 622 | |
1 | 603 | |
2 | 317 | |
3 | 282 | |
7 | 248 | 7.0% |
4 | 247 | 7.0% |
6 | 234 | 6.6% |
8 | 194 | 5.5% |
9 | 152 | 4.3% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 700 |
Space Separator
Value | Count | Frequency (%) |
12 |
Other Punctuation
Value | Count | Frequency (%) |
. | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4238 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
- | 700 | |
5 | 625 | |
0 | 622 | |
1 | 603 | |
2 | 317 | |
3 | 282 | |
7 | 248 | 5.9% |
4 | 247 | 5.8% |
6 | 234 | 5.5% |
8 | 194 | 4.6% |
Other values (3) | 166 | 3.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4238 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
- | 700 | |
5 | 625 | |
0 | 622 | |
1 | 603 | |
2 | 317 | |
3 | 282 | |
7 | 248 | 5.9% |
4 | 247 | 5.8% |
6 | 234 | 5.5% |
8 | 194 | 4.6% |
Other values (3) | 166 | 3.9% |
school_addr
Text
Distinct | 344 |
---|---|
Distinct (%) | 97.7% |
Missing | 2 |
Missing (%) | 0.6% |
Memory size | 2.9 KiB |
Length
Max length | 47 |
---|---|
Median length | 39 |
Mean length | 23.360795 |
Min length | 10 |
Characters and Unicode
Total characters | 8223 |
---|---|
Distinct characters | 281 |
Distinct categories | 9 ? |
Distinct scripts | 3 ? |
Distinct blocks | 2 ? |
Unique
Unique | 336 ? |
---|---|
Unique (%) | 95.5% |
Sample
1st row | 부산광역시 중구 망양로 309 |
---|---|
2nd row | 부산광역시 중구 망양로 319번길 |
3rd row | 부산광역시 중구 흑교로 31번길 34-1 |
4th row | 부산광역시 중구 충장대로 20, 별관 2층 |
5th row | 부산광역시 중구 고가길 40 |
Value | Count | Frequency (%) |
부산광역시 | 347 | 19.7% |
북구 | 35 | 2.0% |
해운대구 | 32 | 1.8% |
영도구 | 29 | 1.6% |
금정구 | 29 | 1.6% |
사상구 | 29 | 1.6% |
기장군 | 28 | 1.6% |
사하구 | 23 | 1.3% |
남구 | 22 | 1.2% |
동구 | 21 | 1.2% |
Other values (693) | 1167 |
Most occurring characters
Value | Count | Frequency (%) |
1429 | 17.4% | |
산 | 390 | 4.7% |
1 | 382 | 4.6% |
부 | 374 | 4.5% |
시 | 362 | 4.4% |
광 | 355 | 4.3% |
역 | 347 | 4.2% |
로 | 333 | 4.0% |
구 | 332 | 4.0% |
2 | 186 | 2.3% |
Other values (271) | 3733 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 5092 | |
Decimal Number | 1549 | 18.8% |
Space Separator | 1429 | 17.4% |
Dash Punctuation | 63 | 0.8% |
Close Punctuation | 31 | 0.4% |
Open Punctuation | 31 | 0.4% |
Other Punctuation | 22 | 0.3% |
Uppercase Letter | 3 | < 0.1% |
Lowercase Letter | 3 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
산 | 390 | 7.7% |
부 | 374 | 7.3% |
시 | 362 | 7.1% |
광 | 355 | 7.0% |
역 | 347 | 6.8% |
로 | 333 | 6.5% |
구 | 332 | 6.5% |
길 | 178 | 3.5% |
번 | 159 | 3.1% |
동 | 135 | 2.7% |
Other values (249) | 2127 |
Decimal Number
Value | Count | Frequency (%) |
1 | 382 | |
2 | 186 | |
3 | 181 | |
0 | 168 | |
4 | 138 | 8.9% |
6 | 123 | 7.9% |
5 | 111 | 7.2% |
7 | 96 | 6.2% |
9 | 82 | 5.3% |
8 | 82 | 5.3% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 1 | |
L | 1 | |
G | 1 |
Lowercase Letter
Value | Count | Frequency (%) |
s | 1 | |
k | 1 | |
e | 1 |
Other Punctuation
Value | Count | Frequency (%) |
, | 19 | |
@ | 3 | 13.6% |
Space Separator
Value | Count | Frequency (%) |
1429 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 63 |
Close Punctuation
Value | Count | Frequency (%) |
) | 31 |
Open Punctuation
Value | Count | Frequency (%) |
( | 31 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 5092 | |
Common | 3125 | |
Latin | 6 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
산 | 390 | 7.7% |
부 | 374 | 7.3% |
시 | 362 | 7.1% |
광 | 355 | 7.0% |
역 | 347 | 6.8% |
로 | 333 | 6.5% |
구 | 332 | 6.5% |
길 | 178 | 3.5% |
번 | 159 | 3.1% |
동 | 135 | 2.7% |
Other values (249) | 2127 |
Common
Value | Count | Frequency (%) |
1429 | ||
1 | 382 | 12.2% |
2 | 186 | 6.0% |
3 | 181 | 5.8% |
0 | 168 | 5.4% |
4 | 138 | 4.4% |
6 | 123 | 3.9% |
5 | 111 | 3.6% |
7 | 96 | 3.1% |
9 | 82 | 2.6% |
Other values (6) | 229 | 7.3% |
Latin
Value | Count | Frequency (%) |
A | 1 | |
L | 1 | |
G | 1 | |
s | 1 | |
k | 1 | |
e | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 5092 | |
ASCII | 3131 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1429 | ||
1 | 382 | 12.2% |
2 | 186 | 5.9% |
3 | 181 | 5.8% |
0 | 168 | 5.4% |
4 | 138 | 4.4% |
6 | 123 | 3.9% |
5 | 111 | 3.5% |
7 | 96 | 3.1% |
9 | 82 | 2.6% |
Other values (12) | 235 | 7.5% |
Hangul
Value | Count | Frequency (%) |
산 | 390 | 7.7% |
부 | 374 | 7.3% |
시 | 362 | 7.1% |
광 | 355 | 7.0% |
역 | 347 | 6.8% |
로 | 333 | 6.5% |
구 | 332 | 6.5% |
길 | 178 | 3.5% |
번 | 159 | 3.1% |
동 | 135 | 2.7% |
Other values (249) | 2127 |
school_kind
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 7 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
기존 | |
---|---|
신규 | |
<NA> | 5 |
기존) | 2 |
가존 | 1 |
Other values (2) | 2 |
Length
Max length | 4 |
---|---|
Median length | 2 |
Mean length | 2.039548 |
Min length | 2 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 0.8% |
Sample
1st row | 기존 |
---|---|
2nd row | 기존 |
3rd row | 기존 |
4th row | 기존 |
5th row | 기존 |
Common Values
Value | Count | Frequency (%) |
기존 | 240 | |
신규 | 104 | |
<NA> | 5 | 1.4% |
기존) | 2 | 0.6% |
가존 | 1 | 0.3% |
기존 | 1 | 0.3% |
신규 | 1 | 0.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
기존 | 243 | |
신규 | 105 | |
na | 5 | 1.4% |
가존 | 1 | 0.3% |
inst_center
Categorical
HIGH CORRELATION
 
Distinct | 17 |
---|---|
Distinct (%) | 4.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
부산북구보건소 | |
---|---|
해운대구보건소 | |
금정구보건소 | |
영도구보건소 | |
사상구보건소 | |
Other values (12) |
Length
Max length | 8 |
---|---|
Median length | 6 |
Mean length | 6.4519774 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 부산중구보건소 |
---|---|
2nd row | 부산중구보건소 |
3rd row | 부산중구보건소 |
4th row | 부산중구보건소 |
5th row | 부산중구보건소 |
Common Values
Value | Count | Frequency (%) |
부산북구보건소 | 35 | |
해운대구보건소 | 32 | 9.0% |
금정구보건소 | 29 | 8.2% |
영도구보건소 | 29 | 8.2% |
사상구보건소 | 29 | 8.2% |
기장군보건소 | 28 | 7.9% |
사하구보건소 | 23 | 6.5% |
부산남구보건소 | 22 | 6.2% |
부산동구보건소 | 21 | 5.9% |
부산진구보건소 | 20 | 5.6% |
Other values (7) | 86 |
Length
Value | Count | Frequency (%) |
부산북구보건소 | 35 | |
해운대구보건소 | 32 | 9.0% |
금정구보건소 | 29 | 8.2% |
영도구보건소 | 29 | 8.2% |
사상구보건소 | 29 | 8.2% |
기장군보건소 | 28 | 7.9% |
사하구보건소 | 23 | 6.5% |
부산남구보건소 | 22 | 6.2% |
부산동구보건소 | 21 | 5.9% |
부산진구보건소 | 20 | 5.6% |
Other values (7) | 86 |
lat
Text
Distinct | 316 |
---|---|
Distinct (%) | 90.0% |
Missing | 3 |
Missing (%) | 0.8% |
Memory size | 2.9 KiB |
Length
Max length | 19 |
---|---|
Median length | 11 |
Mean length | 9.4900285 |
Min length | 4 |
Characters and Unicode
Total characters | 3331 |
---|---|
Distinct characters | 14 |
Distinct categories | 4 ? |
Distinct scripts | 1 ? |
Distinct blocks | 1 ? |
Unique
Unique | 289 ? |
---|---|
Unique (%) | 82.3% |
Sample
1st row | 35.10578162 |
---|---|
2nd row | 35.10662134 |
3rd row | 35.1025574 |
4th row | 35.10450764 |
5th row | 35.10201442 |
Value | Count | Frequency (%) |
35.148 | 6 | 1.7% |
35.162 | 3 | 0.8% |
35.198 | 3 | 0.8% |
35.127 | 3 | 0.8% |
35.268 | 3 | 0.8% |
35.129 | 2 | 0.6% |
35.2 | 2 | 0.6% |
35.202 | 2 | 0.6% |
35.131 | 2 | 0.6% |
35.083 | 2 | 0.6% |
Other values (307) | 325 |
Most occurring characters
Value | Count | Frequency (%) |
3 | 572 | |
5 | 530 | |
1 | 363 | |
. | 349 | |
2 | 301 | |
7 | 220 | 6.6% |
0 | 216 | 6.5% |
4 | 209 | 6.3% |
6 | 198 | 5.9% |
9 | 182 | 5.5% |
Other values (4) | 191 | 5.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2972 | |
Other Punctuation | 353 | 10.6% |
Dash Punctuation | 4 | 0.1% |
Space Separator | 2 | 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 572 | |
5 | 530 | |
1 | 363 | |
2 | 301 | |
7 | 220 | 7.4% |
0 | 216 | 7.3% |
4 | 209 | 7.0% |
6 | 198 | 6.7% |
9 | 182 | 6.1% |
8 | 181 | 6.1% |
Other Punctuation
Value | Count | Frequency (%) |
. | 349 | |
: | 4 | 1.1% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 4 |
Space Separator
Value | Count | Frequency (%) |
2 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 3331 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
3 | 572 | |
5 | 530 | |
1 | 363 | |
. | 349 | |
2 | 301 | |
7 | 220 | 6.6% |
0 | 216 | 6.5% |
4 | 209 | 6.3% |
6 | 198 | 5.9% |
9 | 182 | 5.5% |
Other values (4) | 191 | 5.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3331 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3 | 572 | |
5 | 530 | |
1 | 363 | |
. | 349 | |
2 | 301 | |
7 | 220 | 6.6% |
0 | 216 | 6.5% |
4 | 209 | 6.3% |
6 | 198 | 5.9% |
9 | 182 | 5.5% |
Other values (4) | 191 | 5.7% |
lng
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 320 |
---|---|
Distinct (%) | 91.7% |
Missing | 5 |
Missing (%) | 1.4% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 129.02334 |
Minimum | 123.423 |
---|---|
Maximum | 129.25804 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.2 KiB |
Quantile statistics
Minimum | 123.423 |
---|---|
5-th percentile | 128.9604 |
Q1 | 129.012 |
median | 129.0584 |
Q3 | 129.09497 |
95-th percentile | 129.17758 |
Maximum | 129.25804 |
Range | 5.835043 |
Interquartile range (IQR) | 0.0829665 |
Descriptive statistics
Standard deviation | 0.37403519 |
---|---|
Coefficient of variation (CV) | 0.0028989731 |
Kurtosis | 163.01325 |
Mean | 129.02334 |
Median Absolute Deviation (MAD) | 0.0414914 |
Skewness | -12.052053 |
Sum | 45029.146 |
Variance | 0.13990233 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
129.108 | 4 | 1.1% |
128.971 | 3 | 0.8% |
129.077 | 3 | 0.8% |
129.213 | 3 | 0.8% |
129.028 | 2 | 0.6% |
129.023 | 2 | 0.6% |
129.017 | 2 | 0.6% |
129.012 | 2 | 0.6% |
129.006 | 2 | 0.6% |
129.16 | 2 | 0.6% |
Other values (310) | 324 | |
(Missing) | 5 | 1.4% |
Value | Count | Frequency (%) |
123.423 | 1 | |
125.7758087 | 1 | |
127.008 | 1 | |
128.503 | 1 | |
128.531 | 1 | |
128.661 | 1 | |
128.7402002 | 1 | |
128.816 | 1 | |
128.8513166 | 1 | |
128.8735538 | 1 |
Value | Count | Frequency (%) |
129.258043 | 1 | 0.3% |
129.243607 | 1 | 0.3% |
129.223 | 1 | 0.3% |
129.2163239 | 1 | 0.3% |
129.213 | 3 | |
129.2127674 | 1 | 0.3% |
129.2104953 | 1 | 0.3% |
129.208 | 1 | 0.3% |
129.2048929 | 1 | 0.3% |
129.2045203 | 1 | 0.3% |
data_day
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
2020-06-30 | |
---|---|
<NA> | 5 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 9.9152542 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-06-30 |
---|---|
2nd row | 2020-06-30 |
3rd row | 2020-06-30 |
4th row | 2020-06-30 |
5th row | 2020-06-30 |
Common Values
Value | Count | Frequency (%) |
2020-06-30 | 349 | |
<NA> | 5 | 1.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-06-30 | 349 | |
na | 5 | 1.4% |
apr_at
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 354 |
---|---|
Missing (%) | 100.0% |
Memory size | 3.2 KiB |
instt_code
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 354 |
---|---|
Missing (%) | 100.0% |
Memory size | 3.2 KiB |
last_load_dttm
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.9 KiB |
2020-12-22 14:32:51 | |
---|---|
<NA> | 5 |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 18.788136 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2020-12-22 14:32:51 |
---|---|
2nd row | 2020-12-22 14:32:51 |
3rd row | 2020-12-22 14:32:51 |
4th row | 2020-12-22 14:32:51 |
5th row | 2020-12-22 14:32:51 |
Common Values
Value | Count | Frequency (%) |
2020-12-22 14:32:51 | 349 | |
<NA> | 5 | 1.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2020-12-22 | 349 | |
14:32:51 | 349 | |
na | 5 | 0.7% |
gubun | school_kind | inst_center | lng | |
---|---|---|---|---|
gubun | 1.000 | 0.000 | 0.206 | 0.000 |
school_kind | 0.000 | 1.000 | 0.326 | 0.000 |
inst_center | 0.206 | 0.326 | 1.000 | 0.000 |
lng | 0.000 | 0.000 | 0.000 | 1.000 |
gubun | school_kind | last_load_dttm | data_day | inst_center | |
---|---|---|---|---|---|
gubun | 1.000 | 0.000 | 1.000 | 1.000 | 0.098 |
school_kind | 0.000 | 1.000 | 1.000 | 1.000 | 0.160 |
last_load_dttm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
data_day | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
inst_center | 0.098 | 0.160 | 1.000 | 1.000 | 1.000 |
lng | gubun | school_kind | inst_center | data_day | last_load_dttm | |
---|---|---|---|---|---|---|
lng | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 |
gubun | 0.000 | 1.000 | 0.000 | 0.098 | 1.000 | 1.000 |
school_kind | 0.000 | 0.000 | 1.000 | 0.160 | 1.000 | 1.000 |
inst_center | 0.000 | 0.098 | 0.160 | 1.000 | 1.000 | 1.000 |
data_day | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
last_load_dttm | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
skey | gubun | school_name | student_num | tel | school_addr | school_kind | inst_center | lat | lng | data_day | apr_at | instt_code | last_load_dttm | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 3892 | 어린이집 | 노틀담어린이집 | 73 | 051-464-0570 | 부산광역시 중구 망양로 309 | 기존 | 부산중구보건소 | 35.10578162 | 129.028244 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
1 | 3893 | 어린이집 | 보수어린이집 | 45 | 051-242-3583 | 부산광역시 중구 망양로 319번길 | 기존 | 부산중구보건소 | 35.10662134 | 129.027015 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
2 | 3894 | 어린이집 | 보현어린이집 | 40 | 051-244-3362 | 부산광역시 중구 흑교로 31번길 34-1 | 기존 | 부산중구보건소 | 35.1025574 | 129.021524 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
3 | 3895 | 어린이집 | 부산세관어린이집 | 58 | 051-620-6088 | 부산광역시 중구 충장대로 20, 별관 2층 | 기존 | 부산중구보건소 | 35.10450764 | 129.039153 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
4 | 3896 | 어린이집 | 숲속어린이집 | 60 | 051-256-1843 | 부산광역시 중구 고가길 40 | 기존 | 부산중구보건소 | 35.10201442 | 129.031344 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
5 | 3897 | 어린이집 | 성모희보어린이집 | 65 | 051-466-0363 | 부산광역시 중구 중구로 97번길 7 | 신규 | 부산중구보건소 | 35.105868 | 129.029 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
6 | 3898 | 어린이집 | 영주어린이집 | 74 | 051-462-2855 | 부산광역시 중구 망양로 396 | 기존 | 부산중구보건소 | 35.11186392 | 129.02956 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
7 | 4201 | 어린이집 | 금강어린이집 | 16 | 051-317-8880 | 부산광역시 사상구 대동로64번길 25 금강아파트 103동 110호 | 가존 | 사상구보건소 | 35.13612818 | 128.978677 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
8 | 4202 | 어린이집 | 꼬마나라어린이집 | 18 | 051-315-4566 | 부산광역시 사상구 백양대로 372-15 주례한일유엔아이아파트 109동 103호 | 기존 | 사상구보건소 | 35.15394769 | 129.01062 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
9 | 4203 | 유치원 | 감전초등학교병설유치원 | 47 | 051-310-8393 | 부산광역시 사상구 괘감로 132 감전초등학교병설유치원 | 기존 | 사상구보건소 | 35.15574257 | 128.988669 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
skey | gubun | school_name | student_num | tel | school_addr | school_kind | inst_center | lat | lng | data_day | apr_at | instt_code | last_load_dttm | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
344 | 3910 | 유치원 | 영락유치원 | 73 | 051-254-2509 | 부산광역시 서구 대청로 8 | 기존 | 부산서구보건소 | 35.103 | 129.019 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
345 | 3911 | 유치원 | 토성초등학교 병설유치원 | 46 | 051-250-0847 | 부산광역시 서구 구덕로 134번길 45 | 기존 | 부산서구보건소 | 35.09949563 | 129.021167 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
346 | 3912 | 어린이집 | 꼬망쎼어린이집 | 80 | 051-255-9222 | 부산광역시 서구 대영로 73번길 76 | 신규 | 부산서구보건소 | 35.114 | 129.015 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
347 | 3913 | 유치원 | 동신초등학교 병설유치원 | 65 | 051-240-0794 | 부산광역시 서구 대영로 85번길 81-19 | 기존 | 부산서구보건소 | 35.11385804 | 129.017595 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
348 | 3914 | 어린이집 | 한마음어린이집 | 49 | 051-253-1968 | 부산광역시 서구 망양로 193번길 104 | 기존 | 부산서구보건소 | 35.11117899 | 129.026359 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
349 | 3915 | 유치원 | 노틀담유치원 | 103 | 051-253-1843 | 부산광역시 서구 임시수도기념로 61-32 | 기존 | 부산서구보건소 | 35.10275915 | 129.016831 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
350 | 3916 | 초등학교 | 부민초등학교 | 590 | 051-603-2119 | 부산광역시 서구 고운들로 12 | 신규 | 부산서구보건소 | 35.107 | 129.017 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
351 | 3917 | 고등학교 | 경남고등학교 | 574 | 051-250-5090 | 부산광역시 서구 망양로 111번길 65 | 기존 | 부산서구보건소 | 35.12020133 | 129.020044 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
352 | 3918 | 어린이집 | GKL행복어린이집 | 27 | 051-647-9551 | 부산광역시 동구 자성로 134(눌원빌딩 1층) | 기존 | 부산동구보건소 | 35.13663383 | 129.065041 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |
353 | 3919 | 어린이집 | IBK참!좋은어린이집 | 25 | 051-633-3800 | 부산광역시 동구 중앙대로 489 (2층) | 기존 | 부산동구보건소 | 35.13694596 | 129.056167 | 2020-06-30 | <NA> | <NA> | 2020-12-22 14:32:51 |