Dataset statistics
Number of variables | 20 |
---|---|
Number of observations | 200 |
Missing cells | 463 |
Missing cells (%) | 11.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 33.1 KiB |
Average record size in memory | 169.7 B |
Variable types
Text | 6 |
---|---|
Numeric | 10 |
Categorical | 4 |
Dataset
Description | Sample |
---|---|
Author | 오픈메이트 |
URL | https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=OPMSCHOOL00000000010 |
CAMPUS_CLSS is highly overall correlated with AREA and 4 other fields | High correlation |
SEX is highly overall correlated with STU_CNT and 3 other fields | High correlation |
STUDY_TIME is highly overall correlated with AREA and 7 other fields | High correlation |
AREA is highly overall correlated with STU_CNT and 5 other fields | High correlation |
OPEN_DATE is highly overall correlated with CAMPUS_CLSS and 1 other fields | High correlation |
HOUS_ID is highly overall correlated with BLD_CD and 2 other fields | High correlation |
BLD_CD is highly overall correlated with HOUS_ID and 2 other fields | High correlation |
X_AXIS is highly overall correlated with HOUS_ID and 1 other fields | High correlation |
Y_AXIS is highly overall correlated with HOUS_ID and 1 other fields | High correlation |
STU_CNT is highly overall correlated with AREA and 4 other fields | High correlation |
TEA_CNT is highly overall correlated with AREA and 4 other fields | High correlation |
CLASS_CNT is highly overall correlated with AREA and 5 other fields | High correlation |
SCHOOL_CLSS1 is highly overall correlated with AREA and 2 other fields | High correlation |
CAMPUS_CLSS is highly imbalanced (95.5%) | Imbalance |
STUDY_TIME is highly imbalanced (60.3%) | Imbalance |
SEX is highly imbalanced (54.1%) | Imbalance |
AREA has 140 (70.0%) missing values | Missing |
OPEN_DATE has 123 (61.5%) missing values | Missing |
SCHOOL_CLSS2 has 193 (96.5%) missing values | Missing |
CLASS_CNT has 7 (3.5%) missing values | Missing |
SCHOOL_CD has unique values | Unique |
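The missing-value alerts above can be cross-checked against the dataset-level totals. The following is a minimal sketch in plain Python, not part of the original report; it sums the per-column missing counts from the alerts and divides by the number of cells. The variable names are illustrative.

```python
# Per-column missing counts, copied from the "Missing" alerts in this report.
missing_by_column = {
    "AREA": 140,
    "OPEN_DATE": 123,
    "SCHOOL_CLSS2": 193,
    "CLASS_CNT": 7,
}

n_variables = 20      # "Number of variables"
n_observations = 200  # "Number of observations"

missing_cells = sum(missing_by_column.values())
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(missing_cells)          # 463
print(f"{missing_pct:.1f}%")  # 11.6%
```

The four alerted columns account for every missing cell reported in the dataset statistics.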
Reproduction
Analysis started | 2023-12-10 06:37:38.041921 |
---|---|
Analysis finished | 2023-12-10 06:38:15.392920 |
Duration | 37.35 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
SCHOOL_CD
Text
UNIQUE
Distinct | 200 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Value | Count | Frequency (%) |
s11942 | 1 | 0.5% |
c01119 | 1 | 0.5% |
c01127 | 1 | 0.5% |
k00380 | 1 | 0.5% |
k00383 | 1 | 0.5% |
c01086 | 1 | 0.5% |
s00556 | 1 | 0.5% |
k00774 | 1 | 0.5% |
s09660 | 1 | 0.5% |
c00970 | 1 | 0.5% |
Other values (190) | 190 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 247 | |
C | 116 | |
2 | 111 | |
9 | 94 | 7.8% |
1 | 92 | 7.7% |
3 | 92 | 7.7% |
4 | 79 | 6.6% |
6 | 77 | 6.4% |
7 | 72 | 6.0% |
8 | 68 | 5.7% |
Other values (4) | 152 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 1000 | |
Uppercase Letter | 200 | 16.7% |
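The labels "Decimal Number" and "Uppercase Letter" correspond to the Unicode general categories `Nd` and `Lu`. The sketch below shows how such counts can be obtained with Python's standard `unicodedata` module. The sample codes are hypothetical stand-ins shaped like SCHOOL_CD values, and they assume the stored codes start with an uppercase letter, as the category table suggests. The 1000 to 200 split is what one letter plus five digits per code would give across 200 rows.

```python
import unicodedata
from collections import Counter

# Hypothetical sample codes shaped like SCHOOL_CD values
# (one uppercase letter followed by five digits); assumed, not read from the data.
sample_codes = ["S11942", "C01119", "K00380"]

# Count characters by Unicode general category across all codes.
category_counts = Counter(
    unicodedata.category(ch) for code in sample_codes for ch in code
)

# "Lu" is Uppercase Letter and "Nd" is Decimal Number.
print(category_counts)
```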
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 247 | |
2 | 111 | |
9 | 94 | 9.4% |
1 | 92 | 9.2% |
3 | 92 | 9.2% |
4 | 79 | 7.9% |
6 | 77 | 7.7% |
7 | 72 | 7.2% |
8 | 68 | 6.8% |
5 | 68 | 6.8% |
Uppercase Letter
Value | Count | Frequency (%) |
C | 116 | |
S | 52 | |
K | 25 | 12.5% |
U | 7 | 3.5% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1000 | |
Latin | 200 | 16.7% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 247 | |
2 | 111 | |
9 | 94 | 9.4% |
1 | 92 | 9.2% |
3 | 92 | 9.2% |
4 | 79 | 7.9% |
6 | 77 | 7.7% |
7 | 72 | 7.2% |
8 | 68 | 6.8% |
5 | 68 | 6.8% |
Latin
Value | Count | Frequency (%) |
C | 116 | |
S | 52 | |
K | 25 | 12.5% |
U | 7 | 3.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1200 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 247 | |
C | 116 | |
2 | 111 | |
9 | 94 | 7.8% |
1 | 92 | 7.7% |
3 | 92 | 7.7% |
4 | 79 | 6.6% |
6 | 77 | 6.4% |
7 | 72 | 6.0% |
8 | 68 | 5.7% |
Other values (4) | 152 |
SCHOOL_NM
Text
Distinct | 198 |
---|---|
Distinct (%) | 99.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Value | Count | Frequency (%) |
구립 | 29 | 12.2% |
어린이집 | 5 | 2.1% |
정화예술대학교 | 2 | 0.8% |
엄마품어린이집 | 2 | 0.8% |
금빛 | 1 | 0.4% |
중경고등학교 | 1 | 0.4% |
서마 | 1 | 0.4% |
코알라베이비 | 1 | 0.4% |
센트라스아띠 | 1 | 0.4% |
서울농학교 | 1 | 0.4% |
Other values (193) | 193 |
Most occurring characters
Value | Count | Frequency (%) |
학 | 82 | 5.6% |
교 | 80 | 5.5% |
이 | 77 | 5.3% |
린 | 62 | 4.2% |
집 | 61 | 4.2% |
어 | 61 | 4.2% |
등 | 48 | 3.3% |
서 | 41 | 2.8% |
(space) | 37 | 2.5% |
울 | 37 | 2.5% |
Other values (222) | 877 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1406 | |
Space Separator | 37 | 2.5% |
Uppercase Letter | 18 | 1.2% |
Other Punctuation | 1 | 0.1% |
Decimal Number | 1 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
학 | 82 | 5.8% |
교 | 80 | 5.7% |
이 | 77 | 5.5% |
린 | 62 | 4.4% |
집 | 61 | 4.3% |
어 | 61 | 4.3% |
등 | 48 | 3.4% |
서 | 41 | 2.9% |
울 | 37 | 2.6% |
구 | 34 | 2.4% |
Other values (209) | 823 |
Uppercase Letter
Value | Count | Frequency (%) |
K | 4 | |
G | 3 | |
S | 3 | |
I | 2 | |
L | 1 | 5.6% |
N | 1 | 5.6% |
A | 1 | 5.6% |
H | 1 | 5.6% |
C | 1 | 5.6% |
M | 1 | 5.6% |
Space Separator
Value | Count | Frequency (%) |
(space) | 37 |
Other Punctuation
Value | Count | Frequency (%) |
& | 1 |
Decimal Number
Value | Count | Frequency (%) |
5 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1406 | |
Common | 39 | 2.7% |
Latin | 18 | 1.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
학 | 82 | 5.8% |
교 | 80 | 5.7% |
이 | 77 | 5.5% |
린 | 62 | 4.4% |
집 | 61 | 4.3% |
어 | 61 | 4.3% |
등 | 48 | 3.4% |
서 | 41 | 2.9% |
울 | 37 | 2.6% |
구 | 34 | 2.4% |
Other values (209) | 823 |
Latin
Value | Count | Frequency (%) |
K | 4 | |
G | 3 | |
S | 3 | |
I | 2 | |
L | 1 | 5.6% |
N | 1 | 5.6% |
A | 1 | 5.6% |
H | 1 | 5.6% |
C | 1 | 5.6% |
M | 1 | 5.6% |
Common
Value | Count | Frequency (%) |
(space) | 37 | |
& | 1 | 2.6% |
5 | 1 | 2.6% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1406 | |
ASCII | 57 | 3.9% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
학 | 82 | 5.8% |
교 | 80 | 5.7% |
이 | 77 | 5.5% |
린 | 62 | 4.4% |
집 | 61 | 4.3% |
어 | 61 | 4.3% |
등 | 48 | 3.4% |
서 | 41 | 2.9% |
울 | 37 | 2.6% |
구 | 34 | 2.4% |
Other values (209) | 823 |
ASCII
Value | Count | Frequency (%) |
(space) | 37 | |
K | 4 | 7.0% |
G | 3 | 5.3% |
S | 3 | 5.3% |
I | 2 | 3.5% |
L | 1 | 1.8% |
N | 1 | 1.8% |
A | 1 | 1.8% |
H | 1 | 1.8% |
C | 1 | 1.8% |
Other values (3) | 3 | 5.3% |
ADDRESS
Text
Distinct | 197 |
---|---|
Distinct (%) | 98.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Length
Max length | 47 |
---|---|
Median length | 38 |
Mean length | 28.37 |
Min length | 15 |
Characters and Unicode
Total characters | 5674 |
---|---|
Distinct characters | 242 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 2 |
Unique
Unique | 194 |
---|---|
Unique (%) | 97.0% |
Sample
1st row | 서울특별시 종로구 필운대로 103(신교동) |
---|---|
2nd row | 서울특별시 종로구 필운대로 97(신교동) |
3rd row | 서울특별시 종로구 통일로 246-20 111동 101호9(무악동무악현대아파트) |
4th row | 서울특별시 종로구 통일로 246-11 무악현대아파트 단지내 |
5th row | 서울특별시 종로구 김상옥로 29 2층 |
Value | Count | Frequency (%) |
서울특별시 | 199 | 19.3% |
성동구 | 72 | 7.0% |
종로구 | 63 | 6.1% |
중구 | 41 | 4.0% |
용산구 | 25 | 2.4% |
왕십리로 | 12 | 1.2% |
금호로 | 11 | 1.1% |
마장로 | 8 | 0.8% |
신당동 | 7 | 0.7% |
홍지문2길 | 6 | 0.6% |
Other values (429) | 589 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 833 | 14.7% |
로 | 255 | 4.5% |
동 | 237 | 4.2% |
서 | 231 | 4.1% |
1 | 224 | 3.9% |
울 | 219 | 3.9% |
구 | 208 | 3.7% |
별 | 200 | 3.5% |
시 | 200 | 3.5% |
특 | 199 | 3.5% |
Other values (232) | 2868 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 3669 | |
Space Separator | 833 | 14.7% |
Decimal Number | 826 | 14.6% |
Open Punctuation | 127 | 2.2% |
Close Punctuation | 127 | 2.2% |
Other Punctuation | 53 | 0.9% |
Dash Punctuation | 24 | 0.4% |
Uppercase Letter | 14 | 0.2% |
Lowercase Letter | 1 | < 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 255 | 7.0% |
동 | 237 | 6.5% |
서 | 231 | 6.3% |
울 | 219 | 6.0% |
구 | 208 | 5.7% |
별 | 200 | 5.5% |
시 | 200 | 5.5% |
특 | 199 | 5.4% |
길 | 95 | 2.6% |
성 | 87 | 2.4% |
Other values (206) | 1738 |
Decimal Number
Value | Count | Frequency (%) |
1 | 224 | |
2 | 122 | |
0 | 108 | |
3 | 92 | |
4 | 70 | 8.5% |
5 | 52 | 6.3% |
7 | 44 | 5.3% |
6 | 42 | 5.1% |
8 | 41 | 5.0% |
9 | 31 | 3.8% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 3 | |
I | 2 | |
S | 2 | |
K | 2 | |
E | 1 | 7.1% |
G | 1 | 7.1% |
W | 1 | 7.1% |
Z | 1 | 7.1% |
B | 1 | 7.1% |
Other Punctuation
Value | Count | Frequency (%) |
. | 52 | |
@ | 1 | 1.9% |
Space Separator
Value | Count | Frequency (%) |
(space) | 833 |
Open Punctuation
Value | Count | Frequency (%) |
( | 127 |
Close Punctuation
Value | Count | Frequency (%) |
) | 127 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 24 |
Lowercase Letter
Value | Count | Frequency (%) |
e | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 3669 | |
Common | 1990 | |
Latin | 15 | 0.3% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 255 | 7.0% |
동 | 237 | 6.5% |
서 | 231 | 6.3% |
울 | 219 | 6.0% |
구 | 208 | 5.7% |
별 | 200 | 5.5% |
시 | 200 | 5.5% |
특 | 199 | 5.4% |
길 | 95 | 2.6% |
성 | 87 | 2.4% |
Other values (206) | 1738 |
Common
Value | Count | Frequency (%) |
(space) | 833 | |
1 | 224 | 11.3% |
( | 127 | 6.4% |
) | 127 | 6.4% |
2 | 122 | 6.1% |
0 | 108 | 5.4% |
3 | 92 | 4.6% |
4 | 70 | 3.5% |
5 | 52 | 2.6% |
. | 52 | 2.6% |
Other values (6) | 183 | 9.2% |
Latin
Value | Count | Frequency (%) |
L | 3 | |
I | 2 | |
S | 2 | |
K | 2 | |
E | 1 | 6.7% |
G | 1 | 6.7% |
W | 1 | 6.7% |
Z | 1 | 6.7% |
B | 1 | 6.7% |
e | 1 | 6.7% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 3669 | |
ASCII | 2005 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 833 | |
1 | 224 | 11.2% |
( | 127 | 6.3% |
) | 127 | 6.3% |
2 | 122 | 6.1% |
0 | 108 | 5.4% |
3 | 92 | 4.6% |
4 | 70 | 3.5% |
5 | 52 | 2.6% |
. | 52 | 2.6% |
Other values (16) | 198 | 9.9% |
Hangul
Value | Count | Frequency (%) |
로 | 255 | 7.0% |
동 | 237 | 6.5% |
서 | 231 | 6.3% |
울 | 219 | 6.0% |
구 | 208 | 5.7% |
별 | 200 | 5.5% |
시 | 200 | 5.5% |
특 | 199 | 5.4% |
길 | 95 | 2.6% |
성 | 87 | 2.4% |
Other values (206) | 1738 |
AREA
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 60 |
---|---|
Distinct (%) | 100.0% |
Missing | 140 |
Missing (%) | 70.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13535.033 |
Minimum | 423 |
---|---|
Maximum | 49294 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 423 |
---|---|
5-th percentile | 585.6 |
Q1 | 3240.75 |
median | 10731.5 |
Q3 | 18101.5 |
95-th percentile | 46032.45 |
Maximum | 49294 |
Range | 48871 |
Interquartile range (IQR) | 14860.75 |
Descriptive statistics
Standard deviation | 12505.624 |
---|---|
Coefficient of variation (CV) | 0.92394485 |
Kurtosis | 1.5853991 |
Mean | 13535.033 |
Median Absolute Deviation (MAD) | 7507 |
Skewness | 1.3484923 |
Sum | 812102 |
Variance | 1.5639064 × 10^8 |
Monotonicity | Not monotonic |
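Several of the AREA figures above are derived from one another. As a minimal sketch (plain Python, with values copied from the tables in this section), the range, interquartile range, coefficient of variation and variance can be recomputed from the reported minimum, maximum, quartiles, mean and standard deviation. Small differences in the last digits are expected because the inputs are already rounded.

```python
import math

# Values copied from the AREA summary in this report.
mean = 13535.033
std = 12505.624
minimum, maximum = 423, 49294
q1, q3 = 3240.75, 18101.5

value_range = maximum - minimum  # reported Range: 48871
iqr = q3 - q1                    # reported IQR: 14860.75
cv = std / mean                  # reported CV: 0.92394485
variance = std ** 2              # reported Variance: about 1.5639064e8

print(value_range, iqr, round(cv, 6), f"{variance:.7e}")
```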
Value | Count | Frequency (%) |
18562 | 1 | 0.5% |
997 | 1 | 0.5% |
19648 | 1 | 0.5% |
12301 | 1 | 0.5% |
9514 | 1 | 0.5% |
17645 | 1 | 0.5% |
9826 | 1 | 0.5% |
15588 | 1 | 0.5% |
20930 | 1 | 0.5% |
9561 | 1 | 0.5% |
Other values (50) | 50 | 25.0% |
(Missing) | 140 |
Value | Count | Frequency (%) |
423 | 1 | |
469 | 1 | |
521 | 1 | |
589 | 1 | |
694 | 1 | |
764 | 1 | |
829 | 1 | |
927 | 1 | |
972 | 1 | |
997 | 1 |
Value | Count | Frequency (%) |
49294 | 1 | |
48149 | 1 | |
46231 | 1 | |
46022 | 1 | |
35128 | 1 | |
30877 | 1 | |
30005 | 1 | |
27058 | 1 | |
25504 | 1 | |
25277 | 1 |
OPEN_DATE
Real number (ℝ)
HIGH CORRELATION
MISSING
Distinct | 70 |
---|---|
Distinct (%) | 90.9% |
Missing | 123 |
Missing (%) | 61.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 19582682 |
Minimum | 18850509 |
---|---|
Maximum | 20180301 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 18850509 |
---|---|
5-th percentile | 19060923 |
Q1 | 19430924 |
median | 19610303 |
Q3 | 19810118 |
95-th percentile | 20162238 |
Maximum | 20180301 |
Range | 1329792 |
Interquartile range (IQR) | 379194 |
Descriptive statistics
Standard deviation | 337282.58 |
---|---|
Coefficient of variation (CV) | 0.017223513 |
Kurtosis | -0.51621114 |
Mean | 19582682 |
Median Absolute Deviation (MAD) | 199815 |
Skewness | -0.22632657 |
Sum | 1.5078665 × 10^9 |
Variance | 1.1375954 × 10^11 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
18850509 | 2 | 1.0% |
19071010 | 2 | 1.0% |
19451101 | 2 | 1.0% |
19380405 | 2 | 1.0% |
19060923 | 2 | 1.0% |
19080620 | 2 | 1.0% |
19130401 | 2 | 1.0% |
19880914 | 1 | 0.5% |
19590403 | 1 | 0.5% |
19530608 | 1 | 0.5% |
Other values (60) | 60 | |
(Missing) | 123 |
Value | Count | Frequency (%) |
18850509 | 2 | |
18951115 | 1 | |
19060923 | 2 | |
19071010 | 2 | |
19080620 | 2 | |
19100125 | 1 | |
19100413 | 1 | |
19130401 | 2 | |
19150915 | 1 | |
19210502 | 1 |
Value | Count | Frequency (%) |
20180301 | 1 | |
20170310 | 1 | |
20170302 | 1 | |
20170301 | 1 | |
20160222 | 1 | |
20130305 | 1 | |
20030301 | 1 | |
20001201 | 1 | |
19991224 | 1 | |
19990319 | 1 |
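The OPEN_DATE values look like calendar dates encoded as YYYYMMDD integers (for example 18850509 and 20180301), which would mean the numeric mean, range and variance above are not calendar quantities. The sketch below assumes that encoding, which the report itself does not state, and converts the reported minimum and maximum into dates using only the standard library. `parse_yyyymmdd` is a hypothetical helper name.

```python
from datetime import date, datetime

def parse_yyyymmdd(value: int) -> date:
    """Interpret an integer such as 18850509 as the date 1885-05-09."""
    return datetime.strptime(str(value), "%Y%m%d").date()

earliest = parse_yyyymmdd(18850509)  # reported Minimum
latest = parse_yyyymmdd(20180301)    # reported Maximum

print(earliest.isoformat(), latest.isoformat())
print((latest - earliest).days)      # span in days, unlike the numeric Range
```

Parsing the column as dates before profiling would likely give more interpretable summaries than treating it as a real number.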
HOUS_ID
Real number (ℝ)
HIGH CORRELATION
Distinct | 156 |
---|---|
Distinct (%) | 78.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1155734 × 10^18 |
Minimum | 1.1110101 × 10^18 |
---|---|
Maximum | 1.1200115 × 10^18 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1.1110101 × 10^18 |
---|---|
5-th percentile | 1.1110116 × 10^18 |
Q1 | 1.1110185 × 10^18 |
median | 1.1140168 × 10^18 |
Q3 | 1.1200107 × 10^18 |
95-th percentile | 1.1200114 × 10^18 |
Maximum | 1.1200115 × 10^18 |
Range | 9.0014 × 10^15 |
Interquartile range (IQR) | 8.9922 × 10^15 |
Descriptive statistics
Standard deviation | 3.789723 × 10^15 |
---|---|
Coefficient of variation (CV) | 0.0033971077 |
Kurtosis | -1.663252 |
Mean | 1.1155734 × 10^18 |
Median Absolute Deviation (MAD) | 2.9999 × 10^15 |
Skewness | 0.014834528 |
Sum | 1.7537494 × 10^18 |
Variance | 1.4362 × 10^31 |
Monotonicity | Not monotonic |
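The reported Sum (about 1.7537494e18) is far smaller than Mean times 200 rows (about 2.23e20). One plausible explanation, offered here as an assumption rather than something the report states, is that HOUS_ID was stored as a signed 64-bit integer and the sum wrapped around. The sketch below reproduces the effect with Python's exact integers, using the rounded mean printed above, so only the leading digits are expected to agree. `wrap_int64` is a hypothetical helper.

```python
INT64_MODULUS = 2 ** 64
INT64_MAX = 2 ** 63 - 1

def wrap_int64(value: int) -> int:
    """Reduce an exact integer to the value a signed 64-bit register would hold."""
    wrapped = value % INT64_MODULUS
    return wrapped - INT64_MODULUS if wrapped > INT64_MAX else wrapped

n_rows = 200
rounded_mean = 1_115_573_400_000_000_000  # 1.1155734e18, as printed in the report

exact_sum = rounded_mean * n_rows          # about 2.23e20, beyond the int64 range
wrapped_sum = wrap_int64(exact_sum)

print(f"{exact_sum:.7e}")
print(f"{wrapped_sum:.4e}")
```

If this reading is right, identifier-like columns such as HOUS_ID are probably better loaded as strings, since sums and means of codes carry no meaning.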
Value | Count | Frequency (%) |
1120010900006330000 | 6 | 3.0% |
1120011200003400000 | 4 | 2.0% |
1117011100001000000 | 3 | 1.5% |
1120011000012170000 | 3 | 1.5% |
1120010200010700000 | 3 | 1.5% |
1114016500025450000 | 3 | 1.5% |
1117011000000010042 | 2 | 1.0% |
1114014400001730007 | 2 | 1.0% |
1117013100007260001 | 2 | 1.0% |
1120011200002350000 | 2 | 1.0% |
Other values (146) | 170 |
Value | Count | Frequency (%) |
1111010100000890003 | 1 | |
1111010100000890009 | 1 | |
1111010100001230000 | 2 | |
1111010200000010001 | 1 | |
1111010200000010004 | 1 | |
1111011100000180000 | 1 | |
1111011300000320000 | 1 | |
1111011300002780004 | 1 | |
1111011400002010011 | 1 | |
1111011600000320000 | 1 |
Value | Count | Frequency (%) |
1120011500006610002 | 1 | |
1120011500004520000 | 1 | |
1120011500003510000 | 1 | |
1120011500003330128 | 1 | |
1120011500003330016 | 1 | |
1120011500002990128 | 1 | |
1120011500002630010 | 1 | |
1120011400007180000 | 1 | |
1120011400007100000 | 1 | |
1120011400006560032 | 2 |
BLD_CD
Real number (ℝ)
HIGH CORRELATION
Distinct | 147 |
---|---|
Distinct (%) | 73.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.1155734 × 10^24 |
Minimum | 1.1110101 × 10^24 |
---|---|
Maximum | 1.1200115 × 10^24 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1.1110101 × 10^24 |
---|---|
5-th percentile | 1.1110116 × 10^24 |
Q1 | 1.1110183 × 10^24 |
median | 1.1140168 × 10^24 |
Q3 | 1.1200107 × 10^24 |
95-th percentile | 1.1200114 × 10^24 |
Maximum | 1.1200115 × 10^24 |
Range | 9.0014 × 10^21 |
Interquartile range (IQR) | 8.9924 × 10^21 |
Descriptive statistics
Standard deviation | 3.7897269 × 10^21 |
---|---|
Coefficient of variation (CV) | 0.0033971112 |
Kurtosis | -1.6632507 |
Mean | 1.1155734 × 10^24 |
Median Absolute Deviation (MAD) | 2.9999 × 10^21 |
Skewness | 0.014832906 |
Sum | 2.2311468 × 10^26 |
Variance | 1.436203 × 10^43 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1.12001090010346e+24 | 6 | 3.0% |
1.12001120010274e+24 | 4 | 2.0% |
1.11401620010067e+24 | 3 | 1.5% |
1.12001020010842e+24 | 3 | 1.5% |
1.12001050010527e+24 | 3 | 1.5% |
1.117011100101e+24 | 3 | 1.5% |
1.12001090010955e+24 | 3 | 1.5% |
1.11101820010007e+24 | 3 | 1.5% |
1.11401650011871e+24 | 3 | 1.5% |
1.11401620010842e+24 | 2 | 1.0% |
Other values (137) | 167 |
Value | Count | Frequency (%) |
1.11101010010089e+24 | 2 | |
1.11101010010123e+24 | 2 | |
1.11101020010001e+24 | 2 | |
1.11101110010018e+24 | 1 | |
1.11101130010032e+24 | 1 | |
1.11101130010278e+24 | 1 | |
1.11101140010201e+24 | 1 | |
1.11101160010032e+24 | 1 | |
1.11101180010095e+24 | 1 | |
1.11101200010058e+24 | 1 |
Value | Count | Frequency (%) |
1.12001150010661e+24 | 1 | |
1.12001150010452e+24 | 1 | |
1.12001150010351e+24 | 1 | |
1.12001150010333e+24 | 2 | |
1.12001150010299e+24 | 1 | |
1.12001150010263e+24 | 1 | |
1.1200114001071e+24 | 1 | |
1.12001140010656e+24 | 2 | |
1.12001140010547e+24 | 1 | |
1.12001140010171e+24 | 1 |
HOUS_ADDR
Text
Distinct | 156 |
---|---|
Distinct (%) | 78.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Length
Max length | 25 |
---|---|
Median length | 23 |
Mean length | 20.575 |
Min length | 16 |
Characters and Unicode
Total characters | 4115 |
---|---|
Distinct characters | 104 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 122 |
---|---|
Unique (%) | 61.0% |
Sample
1st row | 서울특별시 종로구 신교동 1-1번지 |
---|---|
2nd row | 서울특별시 종로구 신교동 1-4번지 |
3rd row | 서울특별시 종로구 무악동 82번지 |
4th row | 서울특별시 종로구 무악동 83번지 |
5th row | 서울특별시 종로구 연지동 136-74번지 |
Value | Count | Frequency (%) |
서울특별시 | 200 | |
성동구 | 71 | 8.9% |
종로구 | 63 | 7.9% |
중구 | 41 | 5.1% |
용산구 | 25 | 3.1% |
신당동 | 17 | 2.1% |
금호동4가 | 10 | 1.2% |
금호동1가 | 9 | 1.1% |
하왕십리동 | 7 | 0.9% |
성수동2가 | 7 | 0.9% |
Other values (216) | 350 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 600 | 14.6% |
동 | 261 | 6.3% |
지 | 210 | 5.1% |
서 | 205 | 5.0% |
구 | 201 | 4.9% |
시 | 200 | 4.9% |
별 | 200 | 4.9% |
특 | 200 | 4.9% |
울 | 200 | 4.9% |
번 | 200 | 4.9% |
Other values (94) | 1638 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2646 | |
Decimal Number | 766 | 18.6% |
Space Separator | 600 | 14.6% |
Dash Punctuation | 103 | 2.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
동 | 261 | 9.9% |
지 | 210 | 7.9% |
서 | 205 | 7.7% |
구 | 201 | 7.6% |
시 | 200 | 7.6% |
별 | 200 | 7.6% |
특 | 200 | 7.6% |
울 | 200 | 7.6% |
번 | 200 | 7.6% |
성 | 83 | 3.1% |
Other values (82) | 686 |
Decimal Number
Value | Count | Frequency (%) |
1 | 180 | |
2 | 128 | |
3 | 101 | |
0 | 59 | 7.7% |
5 | 57 | 7.4% |
4 | 56 | 7.3% |
8 | 54 | 7.0% |
6 | 54 | 7.0% |
7 | 52 | 6.8% |
9 | 25 | 3.3% |
Space Separator
Value | Count | Frequency (%) |
(space) | 600 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 103 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 2646 | |
Common | 1469 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
동 | 261 | 9.9% |
지 | 210 | 7.9% |
서 | 205 | 7.7% |
구 | 201 | 7.6% |
시 | 200 | 7.6% |
별 | 200 | 7.6% |
특 | 200 | 7.6% |
울 | 200 | 7.6% |
번 | 200 | 7.6% |
성 | 83 | 3.1% |
Other values (82) | 686 |
Common
Value | Count | Frequency (%) |
(space) | 600 | |
1 | 180 | 12.3% |
2 | 128 | 8.7% |
- | 103 | 7.0% |
3 | 101 | 6.9% |
0 | 59 | 4.0% |
5 | 57 | 3.9% |
4 | 56 | 3.8% |
8 | 54 | 3.7% |
6 | 54 | 3.7% |
Other values (2) | 77 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 2646 | |
ASCII | 1469 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 600 | |
1 | 180 | 12.3% |
2 | 128 | 8.7% |
- | 103 | 7.0% |
3 | 101 | 6.9% |
0 | 59 | 4.0% |
5 | 57 | 3.9% |
4 | 56 | 3.8% |
8 | 54 | 3.7% |
6 | 54 | 3.7% |
Other values (2) | 77 | 5.2% |
Hangul
Value | Count | Frequency (%) |
동 | 261 | 9.9% |
지 | 210 | 7.9% |
서 | 205 | 7.7% |
구 | 201 | 7.6% |
시 | 200 | 7.6% |
별 | 200 | 7.6% |
특 | 200 | 7.6% |
울 | 200 | 7.6% |
번 | 200 | 7.6% |
성 | 83 | 3.1% |
Other values (82) | 686 |
ROAD_ADDR
Text
Distinct | 156 |
---|---|
Distinct (%) | 78.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
Length
Max length | 22 |
---|---|
Median length | 21 |
Mean length | 17.715 |
Min length | 14 |
Characters and Unicode
Total characters | 3543 |
---|---|
Distinct characters | 111 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 2 |
Unique
Unique | 123 |
---|---|
Unique (%) | 61.5% |
Sample
1st row | 서울특별시 종로구 필운대로 103 |
---|---|
2nd row | 서울특별시 종로구 필운대로 97 |
3rd row | 서울특별시 종로구 통일로 246-20 |
4th row | 서울특별시 종로구 통일로 246-11 |
5th row | 서울특별시 종로구 김상옥로 29 |
Value | Count | Frequency (%) |
서울특별시 | 200 | |
성동구 | 71 | 8.9% |
종로구 | 63 | 7.9% |
중구 | 41 | 5.1% |
용산구 | 25 | 3.1% |
금호로 | 22 | 2.8% |
왕십리로 | 12 | 1.5% |
청계천로 | 7 | 0.9% |
15 | 7 | 0.9% |
마장로 | 7 | 0.9% |
Other values (200) | 345 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 600 | |
로 | 240 | 6.8% |
서 | 206 | 5.8% |
구 | 205 | 5.8% |
별 | 200 | 5.6% |
특 | 200 | 5.6% |
시 | 200 | 5.6% |
울 | 200 | 5.6% |
1 | 139 | 3.9% |
길 | 95 | 2.7% |
Other values (101) | 1258 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 2319 | |
Decimal Number | 606 | 17.1% |
Space Separator | 600 | 16.9% |
Dash Punctuation | 18 | 0.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
로 | 240 | 10.3% |
서 | 206 | 8.9% |
구 | 205 | 8.8% |
별 | 200 | 8.6% |
특 | 200 | 8.6% |
시 | 200 | 8.6% |
울 | 200 | 8.6% |
길 | 95 | 4.1% |
동 | 75 | 3.2% |
성 | 72 | 3.1% |
Other values (89) | 626 |
Decimal Number
Value | Count | Frequency (%) |
1 | 139 | |
2 | 83 | |
3 | 71 | |
4 | 61 | |
0 | 61 | |
5 | 46 | 7.6% |
7 | 41 | 6.8% |
6 | 39 | 6.4% |
8 | 37 | 6.1% |
9 | 28 | 4.6% |
Space Separator
Value | Count | Frequency (%) |
(space) | 600 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 18 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 2319 | |
Common | 1224 |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
로 | 240 | 10.3% |
서 | 206 | 8.9% |
구 | 205 | 8.8% |
별 | 200 | 8.6% |
특 | 200 | 8.6% |
시 | 200 | 8.6% |
울 | 200 | 8.6% |
길 | 95 | 4.1% |
동 | 75 | 3.2% |
성 | 72 | 3.1% |
Other values (89) | 626 |
Common
Value | Count | Frequency (%) |
(space) | 600 | |
1 | 139 | 11.4% |
2 | 83 | 6.8% |
3 | 71 | 5.8% |
4 | 61 | 5.0% |
0 | 61 | 5.0% |
5 | 46 | 3.8% |
7 | 41 | 3.3% |
6 | 39 | 3.2% |
8 | 37 | 3.0% |
Other values (2) | 46 | 3.8% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 2319 | |
ASCII | 1224 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 600 | |
1 | 139 | 11.4% |
2 | 83 | 6.8% |
3 | 71 | 5.8% |
4 | 61 | 5.0% |
0 | 61 | 5.0% |
5 | 46 | 3.8% |
7 | 41 | 3.3% |
6 | 39 | 3.2% |
8 | 37 | 3.0% |
Other values (2) | 46 | 3.8% |
Hangul
Value | Count | Frequency (%) |
로 | 240 | 10.3% |
서 | 206 | 8.9% |
구 | 205 | 8.8% |
별 | 200 | 8.6% |
특 | 200 | 8.6% |
시 | 200 | 8.6% |
울 | 200 | 8.6% |
길 | 95 | 4.1% |
동 | 75 | 3.2% |
성 | 72 | 3.1% |
Other values (89) | 626 |
X_AXIS
Real number (ℝ)
HIGH CORRELATION
Distinct | 167 |
---|---|
Distinct (%) | 83.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 312039.64 |
Minimum | 307413 |
---|---|
Maximum | 316912 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 307413 |
---|---|
5-th percentile | 308262.7 |
Q1 | 309424.5 |
median | 312621.5 |
Q3 | 313981.25 |
95-th percentile | 315958 |
Maximum | 316912 |
Range | 9499 |
Interquartile range (IQR) | 4556.75 |
Descriptive statistics
Standard deviation | 2554.866 |
---|---|
Coefficient of variation (CV) | 0.0081876329 |
Kurtosis | -1.2399954 |
Mean | 312039.64 |
Median Absolute Deviation (MAD) | 1946.5 |
Skewness | -0.093769064 |
Sum | 62407928 |
Variance | 6527340.4 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
307939 | 3 | 1.5% |
309155 | 3 | 1.5% |
314152 | 3 | 1.5% |
309126 | 3 | 1.5% |
314087 | 2 | 1.0% |
315958 | 2 | 1.0% |
310894 | 2 | 1.0% |
310768 | 2 | 1.0% |
313985 | 2 | 1.0% |
313762 | 2 | 1.0% |
Other values (157) | 176 |
Value | Count | Frequency (%) |
307413 | 1 | 0.5% |
307644 | 1 | 0.5% |
307817 | 1 | 0.5% |
307923 | 1 | 0.5% |
307939 | 3 | |
308072 | 2 | |
308219 | 1 | 0.5% |
308265 | 1 | 0.5% |
308292 | 1 | 0.5% |
308327 | 1 | 0.5% |
Value | Count | Frequency (%) |
316912 | 1 | |
316718 | 1 | |
316538 | 1 | |
316501 | 1 | |
316498 | 1 | |
316369 | 1 | |
316356 | 1 | |
316041 | 1 | |
316005 | 1 | |
315958 | 2 |
Y_AXIS
Real number (ℝ)
HIGH CORRELATION
Distinct | 167 |
---|---|
Distinct (%) | 83.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 551595.18 |
Minimum | 546657 |
---|---|
Maximum | 557018 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 546657 |
---|---|
5-th percentile | 548628.4 |
Q1 | 550145 |
median | 551505 |
Q3 | 552890 |
95-th percentile | 555861 |
Maximum | 557018 |
Range | 10361 |
Interquartile range (IQR) | 2745 |
Descriptive statistics
Standard deviation | 2121.3885 |
---|---|
Coefficient of variation (CV) | 0.0038459156 |
Kurtosis | 0.0015087394 |
Mean | 551595.18 |
Median Absolute Deviation (MAD) | 1380.5 |
Skewness | 0.27152904 |
Sum | 1.1031904 × 10^8 |
Variance | 4500289.1 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
556269 | 3 | 1.5% |
550697 | 3 | 1.5% |
549462 | 3 | 1.5% |
552435 | 2 | 1.0% |
550446 | 2 | 1.0% |
552176 | 2 | 1.0% |
552184 | 2 | 1.0% |
550978 | 2 | 1.0% |
551273 | 2 | 1.0% |
551505 | 2 | 1.0% |
Other values (157) | 177 |
Value | Count | Frequency (%) |
546657 | 1 | |
546728 | 1 | |
547030 | 1 | |
547090 | 1 | |
547129 | 2 | |
547405 | 1 | |
548002 | 1 | |
548287 | 1 | |
548541 | 1 | |
548633 | 1 |
Value | Count | Frequency (%) |
557018 | 1 | 0.5% |
556552 | 1 | 0.5% |
556519 | 1 | 0.5% |
556440 | 1 | 0.5% |
556269 | 3 | |
556194 | 1 | 0.5% |
555931 | 1 | 0.5% |
555861 | 2 | |
555839 | 1 | 0.5% |
555649 | 1 | 0.5% |
BLK_CD
Real number (ℝ)
Distinct | 146 |
---|---|
Distinct (%) | 73.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 291545.78 |
Minimum | 12386 |
---|---|
Maximum | 519998 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 12386 |
---|---|
5-th percentile | 168543 |
Q1 | 207619.5 |
median | 210261.5 |
Q3 | 361582.5 |
95-th percentile | 501458.05 |
Maximum | 519998 |
Range | 507612 |
Interquartile range (IQR) | 153963 |
Descriptive statistics
Standard deviation | 108049.14 |
---|---|
Coefficient of variation (CV) | 0.37060782 |
Kurtosis | -0.72101815 |
Mean | 291545.78 |
Median Absolute Deviation (MAD) | 92528 |
Skewness | 0.25634816 |
Sum | 58309155 |
Variance | 1.1674617 × 10^10 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
415019 | 6 | 3.0% |
337542 | 4 | 2.0% |
519998 | 4 | 2.0% |
208198 | 3 | 1.5% |
414903 | 3 | 1.5% |
168543 | 3 | 1.5% |
210175 | 3 | 1.5% |
519833 | 3 | 1.5% |
415624 | 2 | 1.0% |
209479 | 2 | 1.0% |
Other values (136) | 167 |
Value | Count | Frequency (%) |
12386 | 1 | 0.5% |
35187 | 1 | 0.5% |
35619 | 1 | 0.5% |
74639 | 1 | 0.5% |
165103 | 1 | 0.5% |
165655 | 2 | |
167588 | 1 | 0.5% |
168543 | 3 | |
175436 | 1 | 0.5% |
177773 | 2 |
Value | Count | Frequency (%) |
519998 | 4 | |
519833 | 3 | |
509117 | 1 | 0.5% |
502600 | 1 | 0.5% |
502523 | 1 | 0.5% |
501402 | 2 | |
416121 | 1 | 0.5% |
416083 | 1 | 0.5% |
415985 | 1 | 0.5% |
415794 | 1 | 0.5% |
CAMPUS_CLSS
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
본교 | 199 |
---|---|
제2캠퍼스 | 1 |
Length
Max length | 5 |
---|---|
Median length | 2 |
Mean length | 2.015 |
Min length | 2 |
Unique
Unique | 1 |
---|---|
Unique (%) | 0.5% |
Sample
1st row | 본교 |
---|---|
2nd row | 본교 |
3rd row | 본교 |
4th row | 본교 |
5th row | 본교 |
Common Values
Value | Count | Frequency (%) |
본교 | 199 | |
제2캠퍼스 | 1 | 0.5% |
SCHOOL_CLSS1
Categorical
HIGH CORRELATION
Distinct | 7 |
---|---|
Distinct (%) | 3.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
어린이집 | 116 |
---|---|
유치원 | 25 |
고등학교 | 21 |
초등학교 | 18 |
중학교 | 11 |
Other values (2) | 9 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.785 |
Min length | 3 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 특수학교 |
---|---|
2nd row | 특수학교 |
3rd row | 어린이집 |
4th row | 어린이집 |
5th row | 어린이집 |
Common Values
Value | Count | Frequency (%) |
어린이집 | 116 | |
유치원 | 25 | 12.5% |
고등학교 | 21 | 10.5% |
초등학교 | 18 | 9.0% |
중학교 | 11 | 5.5% |
대학교 | 7 | 3.5% |
특수학교 | 2 | 1.0% |
SCHOOL_CLSS2
Text
MISSING
Distinct | 4 |
---|---|
Distinct (%) | 57.1% |
Missing | 193 |
Missing (%) | 96.5% |
Memory size | 1.7 KiB |
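The Distinct (%) figures in this report are consistent with dividing the distinct count by the number of non-missing rows rather than by all 200 rows. This is inferred from the numbers shown here, not taken from the tool's documentation. A minimal sketch with a hypothetical helper, `distinct_pct`:

```python
def distinct_pct(n_distinct: int, n_rows: int, n_missing: int) -> float:
    """Share of distinct values among the non-missing rows, in percent."""
    return 100 * n_distinct / (n_rows - n_missing)

print(round(distinct_pct(4, 200, 193), 1))   # SCHOOL_CLSS2: 57.1
print(round(distinct_pct(70, 200, 123), 1))  # OPEN_DATE: 90.9
print(round(distinct_pct(60, 200, 140), 1))  # AREA: 100.0
```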
Value | Count | Frequency (%) |
사이버대학(대학 | 2 | |
대학교 | 2 | |
전공대학 | 2 | |
기능대학 | 1 |
Most occurring characters
Value | Count | Frequency (%) |
대 | 9 | |
학 | 9 | |
사 | 2 | 5.6% |
이 | 2 | 5.6% |
버 | 2 | 5.6% |
( | 2 | 5.6% |
) | 2 | 5.6% |
교 | 2 | 5.6% |
전 | 2 | 5.6% |
공 | 2 | 5.6% |
Other values (2) | 2 | 5.6% |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 32 | |
Open Punctuation | 2 | 5.6% |
Close Punctuation | 2 | 5.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
대 | 9 | |
학 | 9 | |
사 | 2 | 6.2% |
이 | 2 | 6.2% |
버 | 2 | 6.2% |
교 | 2 | 6.2% |
전 | 2 | 6.2% |
공 | 2 | 6.2% |
기 | 1 | 3.1% |
능 | 1 | 3.1% |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 32 | |
Common | 4 | 11.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
대 | 9 | |
학 | 9 | |
사 | 2 | 6.2% |
이 | 2 | 6.2% |
버 | 2 | 6.2% |
교 | 2 | 6.2% |
전 | 2 | 6.2% |
공 | 2 | 6.2% |
기 | 1 | 3.1% |
능 | 1 | 3.1% |
Common
Value | Count | Frequency (%) |
( | 2 | |
) | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 32 | |
ASCII | 4 | 11.1% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
대 | 9 | |
학 | 9 | |
사 | 2 | 6.2% |
이 | 2 | 6.2% |
버 | 2 | 6.2% |
교 | 2 | 6.2% |
전 | 2 | 6.2% |
공 | 2 | 6.2% |
기 | 1 | 3.1% |
능 | 1 | 3.1% |
ASCII
Value | Count | Frequency (%) |
( | 2 | |
) | 2 |
STUDY_TIME
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 5 |
---|---|
Distinct (%) | 2.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
<NA> | 159 |
---|---|
주간 | 34 |
주야간 | 3 |
원격 | 2 |
주야간+원격 | 2 |
Length
Max length | 6 |
---|---|
Median length | 4 |
Mean length | 3.645 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 주간 |
---|---|
2nd row | 주간 |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 159 | |
주간 | 34 | 17.0% |
주야간 | 3 | 1.5% |
원격 | 2 | 1.0% |
주야간+원격 | 2 | 1.0% |
SEX
Categorical
HIGH CORRELATION
IMBALANCE
Distinct | 4 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 KiB |
<NA> | 166 |
---|---|
남여공학 | 16 |
남자 | 9 |
여자 | 9 |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 3.82 |
Min length | 2 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 남여공학 |
---|---|
2nd row | 남여공학 |
3rd row | <NA> |
4th row | <NA> |
5th row | <NA> |
Common Values
Value | Count | Frequency (%) |
<NA> | 166 | 83.0% |
남여공학 | 16 | 8.0% |
남자 | 9 | 4.5% |
여자 | 9 | 4.5% |
Common Values (Plot)
Value | Count | Frequency (%) |
na | 166 | 83.0% |
남여공학 | 16 | 8.0% |
남자 | 9 | 4.5% |
여자 | 9 | 4.5% |
STU_CNT
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 120 |
---|---|
Distinct (%) | 60.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 474.355 |
Minimum | 16 |
---|---|
Maximum | 22620 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 16 |
---|---|
5-th percentile | 20 |
Q1 | 41 |
median | 65.5 |
Q3 | 285.5 |
95-th percentile | 966.95 |
Maximum | 22620 |
Range | 22604 |
Interquartile range (IQR) | 244.5 |
Descriptive statistics
Standard deviation | 2118.937 |
---|---|
Coefficient of variation (CV) | 4.4669856 |
Kurtosis | 77.775617 |
Mean | 474.355 |
Median Absolute Deviation (MAD) | 39.5 |
Skewness | 8.4798177 |
Sum | 94871 |
Variance | 4489893.9 |
Monotonicity | Not monotonic |
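Several of the STU_CNT figures above are derived from one another. As a quick consistency check, the sketch below recomputes the mean, coefficient of variation, IQR, range, and variance from the other reported values. The inputs are copied from the tables; small differences in the last digits come from the rounded standard deviation.

```python
# Values copied from the STU_CNT tables above.
n = 200
total = 94871
std = 2118.937
q1, q3 = 41, 285.5
minimum, maximum = 16, 22620

mean = total / n                   # Mean = Sum / number of observations
cv = std / mean                    # Coefficient of variation
iqr = q3 - q1                      # Interquartile range
value_range = maximum - minimum    # Range
variance = std ** 2                # Variance (from the rounded std)

print(mean, cv, iqr, value_range, variance)
```

A mean of 474.355 against a median of 65.5, together with a CV above 4, reflects the handful of very large values at the top of the distribution.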
Common Values
Value | Count | Frequency (%) |
20 | 19 | 9.5% |
49 | 13 | 6.5% |
45 | 6 | 3.0% |
19 | 6 | 3.0% |
39 | 4 | 2.0% |
96 | 4 | 2.0% |
41 | 4 | 2.0% |
64 | 4 | 2.0% |
77 | 4 | 2.0% |
34 | 4 | 2.0% |
Other values (110) | 132 | 66.0% |
Minimum 10 values
Value | Count | Frequency (%) |
16 | 1 | 0.5% |
19 | 6 | 3.0% |
20 | 19 | 9.5% |
22 | 1 | 0.5% |
23 | 1 | 0.5% |
26 | 1 | 0.5% |
27 | 1 | 0.5% |
30 | 2 | 1.0% |
33 | 1 | 0.5% |
34 | 4 | 2.0% |
Maximum 10 values
Value | Count | Frequency (%) |
22620 | 1 | 0.5% |
16569 | 1 | 0.5% |
9194 | 1 | 0.5% |
6329 | 1 | 0.5% |
2261 | 1 | 0.5% |
2021 | 1 | 0.5% |
1254 | 1 | 0.5% |
1074 | 1 | 0.5% |
1014 | 1 | 0.5% |
1004 | 1 | 0.5% |
TEA_CNT
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 63 |
---|---|
Distinct (%) | 31.5% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 39.495 |
Minimum | 2 |
---|---|
Maximum | 2460 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 2 |
---|---|
5-th percentile | 4 |
Q1 | 8 |
median | 12 |
Q3 | 33.5 |
95-th percentile | 75.6 |
Maximum | 2460 |
Range | 2458 |
Interquartile range (IQR) | 25.5 |
Descriptive statistics
Standard deviation | 177.79295 |
---|---|
Coefficient of variation (CV) | 4.5016571 |
Kurtosis | 175.07167 |
Mean | 39.495 |
Median Absolute Deviation (MAD) | 6 |
Skewness | 12.895497 |
Sum | 7899 |
Variance | 31610.332 |
Monotonicity | Not monotonic |
Common Values
Value | Count | Frequency (%) |
6 | 20 | 10.0% |
9 | 19 | 9.5% |
10 | 13 | 6.5% |
8 | 11 | 5.5% |
11 | 9 | 4.5% |
15 | 8 | 4.0% |
5 | 7 | 3.5% |
7 | 7 | 3.5% |
4 | 6 | 3.0% |
12 | 6 | 3.0% |
Other values (53) | 94 | 47.0% |
Minimum 10 values
Value | Count | Frequency (%) |
2 | 2 | 1.0% |
3 | 4 | 2.0% |
4 | 6 | 3.0% |
5 | 7 | 3.5% |
6 | 20 | 10.0% |
7 | 7 | 3.5% |
8 | 11 | 5.5% |
9 | 19 | 9.5% |
10 | 13 | 6.5% |
11 | 9 | 4.5% |
Maximum 10 values
Value | Count | Frequency (%) |
2460 | 1 | 0.5% |
476 | 1 | 0.5% |
246 | 1 | 0.5% |
215 | 1 | 0.5% |
154 | 1 | 0.5% |
125 | 1 | 0.5% |
110 | 1 | 0.5% |
90 | 1 | 0.5% |
87 | 2 | 1.0% |
75 | 1 | 0.5% |
CLASS_CNT
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 33 |
---|---|
Distinct (%) | 17.1% |
Missing | 7 |
Missing (%) | 3.5% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 9.7098446 |
Minimum | 1 |
---|---|
Maximum | 47 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 3 |
Q1 | 3 |
median | 5 |
Q3 | 12 |
95-th percentile | 30.4 |
Maximum | 47 |
Range | 46 |
Interquartile range (IQR) | 9 |
Descriptive statistics
Standard deviation | 9.8651721 |
---|---|
Coefficient of variation (CV) | 1.0159969 |
Kurtosis | 1.5357681 |
Mean | 9.7098446 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 1.6007859 |
Sum | 1874 |
Variance | 97.321621 |
Monotonicity | Not monotonic |
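CLASS_CNT is the only numeric column in this section with missing values, so it is worth confirming which denominators the report uses. The sketch below, built only from the figures reported above, suggests that Mean and Distinct (%) are computed over the 193 non-missing rows, while Missing (%) is computed over all 200 rows.

```python
# Values copied from the CLASS_CNT tables above.
n_rows = 200
n_missing = 7
n_distinct = 33
total = 1874

n_valid = n_rows - n_missing                 # rows with a recorded class count
mean = total / n_valid                       # Mean over non-missing rows
distinct_pct = 100 * n_distinct / n_valid    # Distinct (%) over non-missing rows
missing_pct = 100 * n_missing / n_rows       # Missing (%) over all rows

print(n_valid, round(mean, 7), round(distinct_pct, 1), missing_pct)
```

The frequency percentages in the value tables for this column, by contrast, are relative to all 200 rows (for example, 19 occurrences is listed as 9.5%).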
Common Values
Value | Count | Frequency (%) |
3 | 48 | 24.0% |
4 | 41 | 20.5% |
5 | 19 | 9.5% |
6 | 13 | 6.5% |
12 | 6 | 3.0% |
15 | 5 | 2.5% |
7 | 5 | 2.5% |
8 | 4 | 2.0% |
27 | 4 | 2.0% |
9 | 4 | 2.0% |
Other values (23) | 44 | 22.0% |
(Missing) | 7 | 3.5% |
Minimum 10 values
Value | Count | Frequency (%) |
1 | 1 | 0.5% |
2 | 3 | 1.5% |
3 | 48 | 24.0% |
4 | 41 | 20.5% |
5 | 19 | 9.5% |
6 | 13 | 6.5% |
7 | 5 | 2.5% |
8 | 4 | 2.0% |
9 | 4 | 2.0% |
10 | 1 | 0.5% |
Maximum 10 values
Value | Count | Frequency (%) |
47 | 1 | 0.5% |
39 | 1 | 0.5% |
37 | 2 | 1.0% |
36 | 3 | 1.5% |
33 | 1 | 0.5% |
32 | 1 | 0.5% |
31 | 1 | 0.5% |
30 | 3 | 1.5% |
29 | 3 | 1.5% |
28 | 2 | 1.0% |
Variable | AREA | OPEN_DATE | HOUS_ID | BLD_CD | X_AXIS | Y_AXIS | BLK_CD | CAMPUS_CLSS | SCHOOL_CLSS1 | SCHOOL_CLSS2 | STUDY_TIME | SEX | STU_CNT | TEA_CNT | CLASS_CNT |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
AREA | 1.000 | 0.175 | 0.223 | 0.223 | 0.000 | 0.426 | 0.078 | NaN | 0.715 | NaN | NaN | 0.000 | NaN | NaN | 0.734 |
OPEN_DATE | 0.175 | 1.000 | 0.250 | 0.250 | 0.433 | 0.619 | 0.419 | NaN | 0.654 | NaN | NaN | 0.600 | NaN | NaN | 0.237 |
HOUS_ID | 0.223 | 0.250 | 1.000 | 1.000 | 0.735 | 0.869 | 0.850 | 0.100 | 0.375 | 0.928 | 0.577 | 0.370 | 0.000 | 0.000 | 0.368 |
BLD_CD | 0.223 | 0.250 | 1.000 | 1.000 | 0.735 | 0.869 | 0.850 | 0.100 | 0.375 | 0.928 | 0.577 | 0.370 | 0.000 | 0.000 | 0.368 |
X_AXIS | 0.000 | 0.433 | 0.735 | 0.735 | 1.000 | 0.754 | 0.383 | 0.068 | 0.159 | 0.797 | 0.060 | 0.495 | 0.410 | 0.246 | 0.182 |
Y_AXIS | 0.426 | 0.619 | 0.869 | 0.869 | 0.754 | 1.000 | 0.572 | 0.000 | 0.337 | 0.749 | 0.555 | 0.515 | 0.184 | 0.265 | 0.457 |
BLK_CD | 0.078 | 0.419 | 0.850 | 0.850 | 0.383 | 0.572 | 1.000 | 0.000 | 0.085 | 0.626 | 0.437 | 0.211 | 0.272 | 0.318 | 0.000 |
CAMPUS_CLSS | NaN | NaN | 0.100 | 0.100 | 0.068 | 0.000 | 0.000 | 1.000 | 0.313 | 0.000 | 0.867 | NaN | 0.000 | 0.000 | NaN |
SCHOOL_CLSS1 | 0.715 | 0.654 | 0.375 | 0.375 | 0.159 | 0.337 | 0.085 | 0.313 | 1.000 | NaN | 0.867 | 0.000 | 0.486 | 0.446 | 0.799 |
SCHOOL_CLSS2 | NaN | NaN | 0.928 | 0.928 | 0.797 | 0.749 | 0.626 | 0.000 | NaN | 1.000 | 1.000 | NaN | 0.544 | 0.364 | NaN |
STUDY_TIME | NaN | NaN | 0.577 | 0.577 | 0.060 | 0.555 | 0.437 | 0.867 | 0.867 | 1.000 | 1.000 | NaN | 0.760 | 0.519 | NaN |
SEX | 0.000 | 0.600 | 0.370 | 0.370 | 0.495 | 0.515 | 0.211 | NaN | 0.000 | NaN | NaN | 1.000 | NaN | NaN | 0.208 |
STU_CNT | NaN | NaN | 0.000 | 0.000 | 0.410 | 0.184 | 0.272 | 0.000 | 0.486 | 0.544 | 0.760 | NaN | 1.000 | 1.000 | NaN |
TEA_CNT | NaN | NaN | 0.000 | 0.000 | 0.246 | 0.265 | 0.318 | 0.000 | 0.446 | 0.364 | 0.519 | NaN | 1.000 | 1.000 | NaN |
CLASS_CNT | 0.734 | 0.237 | 0.368 | 0.368 | 0.182 | 0.457 | 0.000 | NaN | 0.799 | NaN | NaN | 0.208 | NaN | NaN | 1.000 |
Variable | CAMPUS_CLSS | SEX | STUDY_TIME | SCHOOL_CLSS1 |
---|---|---|---|---|
CAMPUS_CLSS | 1.000 | 1.000 | 0.650 | 0.330 |
SEX | 1.000 | 1.000 | 1.000 | 0.000 |
STUDY_TIME | 0.650 | 1.000 | 1.000 | 0.528 |
SCHOOL_CLSS1 | 0.330 | 0.000 | 0.528 | 1.000 |
Variable | AREA | OPEN_DATE | HOUS_ID | BLD_CD | X_AXIS | Y_AXIS | BLK_CD | STU_CNT | TEA_CNT | CLASS_CNT | CAMPUS_CLSS | SCHOOL_CLSS1 | STUDY_TIME | SEX |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
AREA | 1.000 | -0.472 | -0.047 | -0.052 | 0.106 | 0.095 | 0.167 | 0.697 | 0.761 | 0.694 | 1.000 | 0.501 | 1.000 | 0.000 |
OPEN_DATE | -0.472 | 1.000 | 0.342 | 0.335 | 0.175 | -0.160 | 0.002 | -0.422 | -0.415 | -0.479 | 1.000 | 0.307 | 1.000 | 0.347 |
HOUS_ID | -0.047 | 0.342 | 1.000 | 0.999 | 0.687 | -0.798 | -0.028 | -0.164 | -0.127 | -0.093 | 0.067 | 0.266 | 0.262 | 0.353 |
BLD_CD | -0.052 | 0.335 | 0.999 | 1.000 | 0.687 | -0.798 | -0.033 | -0.167 | -0.132 | -0.097 | 0.471 | 0.000 | 0.395 | 0.460 |
X_AXIS | 0.106 | 0.175 | 0.687 | 0.687 | 1.000 | -0.383 | -0.017 | -0.241 | -0.144 | -0.170 | 0.052 | 0.088 | 0.000 | 0.289 |
Y_AXIS | 0.095 | -0.160 | -0.798 | -0.798 | -0.383 | 1.000 | 0.235 | 0.033 | 0.043 | 0.037 | 0.000 | 0.176 | 0.364 | 0.369 |
BLK_CD | 0.167 | 0.002 | -0.028 | -0.033 | -0.017 | 0.235 | 1.000 | -0.106 | -0.009 | -0.047 | 0.000 | 0.028 | 0.177 | 0.191 |
STU_CNT | 0.697 | -0.422 | -0.164 | -0.167 | -0.241 | 0.033 | -0.106 | 1.000 | 0.829 | 0.873 | 0.000 | 0.336 | 0.697 | 1.000 |
TEA_CNT | 0.761 | -0.415 | -0.127 | -0.132 | -0.144 | 0.043 | -0.009 | 0.829 | 1.000 | 0.857 | 0.000 | 0.332 | 0.513 | 1.000 |
CLASS_CNT | 0.694 | -0.479 | -0.093 | -0.097 | -0.170 | 0.037 | -0.047 | 0.873 | 0.857 | 1.000 | 1.000 | 0.573 | 1.000 | 0.000 |
CAMPUS_CLSS | 1.000 | 1.000 | 0.067 | 0.471 | 0.052 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.330 | 0.650 | 1.000 |
SCHOOL_CLSS1 | 0.501 | 0.307 | 0.266 | 0.000 | 0.088 | 0.176 | 0.028 | 0.336 | 0.332 | 0.573 | 0.330 | 1.000 | 0.528 | 0.000 |
STUDY_TIME | 1.000 | 1.000 | 0.262 | 0.395 | 0.000 | 0.364 | 0.177 | 0.697 | 0.513 | 1.000 | 0.650 | 0.528 | 1.000 | 1.000 |
SEX | 0.000 | 0.347 | 0.353 | 0.460 | 0.289 | 0.369 | 0.191 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 1.000 | 1.000 |
First rows
Row | SCHOOL_CD | SCHOOL_NM | ADDRESS | AREA | OPEN_DATE | HOUS_ID | BLD_CD | HOUS_ADDR | ROAD_ADDR | X_AXIS | Y_AXIS | BLK_CD | CAMPUS_CLSS | SCHOOL_CLSS1 | SCHOOL_CLSS2 | STUDY_TIME | SEX | STU_CNT | TEA_CNT | CLASS_CNT |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | S11942 | 서울농학교 | 서울특별시 종로구 필운대로 103(신교동) | 27058 | 19130401 | 1111010200000010001 | 1111010200100010001030811 | 서울특별시 종로구 신교동 1-1번지 | 서울특별시 종로구 필운대로 103 | 309118 | 554207 | 361665 | 본교 | 특수학교 | <NA> | 주간 | 남여공학 | 91 | 61 | 29 |
1 | S11940 | 서울맹학교 | 서울특별시 종로구 필운대로 97(신교동) | 10184 | 19130401 | 1111010200000010004 | 1111010200100010004031118 | 서울특별시 종로구 신교동 1-4번지 | 서울특별시 종로구 필운대로 97 | 309028 | 554111 | 361665 | 본교 | 특수학교 | <NA> | 주간 | 남여공학 | 194 | 73 | 39 |
2 | C00287 | 가람어린이집 | 서울특별시 종로구 통일로 246-20 111동 101호9(무악동무악현대아파트) | <NA> | <NA> | 1111018700000820000 | 1111018700100820000021145 | 서울특별시 종로구 무악동 82번지 | 서울특별시 종로구 통일로 246-20 | 308292 | 552979 | 323899 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 20 | 4 | 4 |
3 | C28551 | 동화속아이들어린이집 | 서울특별시 종로구 통일로 246-11 무악현대아파트 단지내 | <NA> | <NA> | 1111018700000830000 | 1111018700100830000021163 | 서울특별시 종로구 무악동 83번지 | 서울특별시 종로구 통일로 246-11 | 308265 | 553037 | 323449 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 65 | 12 | 6 |
4 | C00009 | SGI서울보증 어린이집 | 서울특별시 종로구 김상옥로 29 2층 | <NA> | <NA> | 1111016000001360074 | 1111016000101360074012513 | 서울특별시 종로구 연지동 136-74번지 | 서울특별시 종로구 김상옥로 29 | 311915 | 552845 | 361220 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 49 | 23 | 4 |
5 | S09678 | 대신고등학교 | 서울특별시 종로구 사직로 9(행촌동) | 19636 | 19380405 | 1111018100001710001 | 1111018100101710010020841 | 서울특별시 종로구 행촌동 171-1번지 | 서울특별시 종로구 사직로 9 | 308439 | 552825 | 354308 | 본교 | 고등학교 | <NA> | 주간 | 남자 | 762 | 60 | 27 |
6 | S06467 | 대신중학교 | 서울특별시 종로구 사직로 9(행촌동) | <NA> | 19380405 | 1111018100001710001 | 1111018100101710010020841 | 서울특별시 종로구 행촌동 171-1번지 | 서울특별시 종로구 사직로 9 | 308439 | 552825 | 354308 | 본교 | 중학교 | <NA> | 주간 | 남자 | 384 | 29 | 15 |
7 | C17847 | 종로구청직장어린이집 | 서울특별시 종로구 삼봉로 43 (수송동) | <NA> | <NA> | 1111012400001460002 | 1111012400101460002013567 | 서울특별시 종로구 수송동 146-2번지 | 서울특별시 종로구 삼봉로 43 | 309990 | 552816 | 353143 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 35 | 10 | 3 |
8 | K00393 | 세검정유치원 | 서울특별시 종로구 자하문로 310(홍지동) | 972 | 19810118 | 1111018500000940003 | 1111018500100940003025510 | 서울특별시 종로구 홍지동 94-3번지 | 서울특별시 종로구 자하문로 310 | 308331 | 555839 | 360833 | 본교 | 유치원 | <NA> | <NA> | <NA> | 72 | 6 | 3 |
9 | K00396 | 옥인유치원 | 서울특별시 종로구 자하문로 69(옥인동) | 423 | 19930227 | 1111011100000180000 | 1111011100100180000030505 | 서울특별시 종로구 옥인동 18번지 | 서울특별시 종로구 자하문로 69 | 309253 | 553826 | 352278 | 본교 | 유치원 | <NA> | <NA> | <NA> | 69 | 6 | 3 |
Last rows
Row | SCHOOL_CD | SCHOOL_NM | ADDRESS | AREA | OPEN_DATE | HOUS_ID | BLD_CD | HOUS_ADDR | ROAD_ADDR | X_AXIS | Y_AXIS | BLK_CD | CAMPUS_CLSS | SCHOOL_CLSS1 | SCHOOL_CLSS2 | STUDY_TIME | SEX | STU_CNT | TEA_CNT | CLASS_CNT |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
190 | C29338 | 딩동댕 | 서울특별시 성동구 둘레13길 15 (성수동2가) | <NA> | <NA> | 1120011500004520000 | 1120011500104520000006841 | 서울특별시 성동구 성수동2가 452번지 | 서울특별시 성동구 둘레13길 15 | 316501 | 548633 | 209730 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 46 | 45 | 4 |
191 | C33384 | 즐거운 | 서울특별시 성동구 둘레15길15-1 | <NA> | <NA> | 1120011500006610002 | 1120011500106610002006813 | 서울특별시 성동구 성수동2가 661-2번지 | 서울특별시 성동구 둘레15길 15-1 | 316718 | 548644 | 209743 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 49 | 8 | 6 |
192 | C25722 | 구립 진터마루 | 서울특별시 성동구 둘레3길18 | <NA> | <NA> | 1120011400001710000 | 1120011400101710000008778 | 서울특별시 성동구 성수동1가 171번지 | 서울특별시 성동구 둘레3길 18 | 316005 | 548930 | 209605 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 96 | 15 | 6 |
193 | C29544 | 예사랑 | 서울특별시 성동구 둘레9길 20-5 | <NA> | <NA> | 1120011500003510000 | 1120011500103510000000001 | 서울특별시 성동구 성수동2가 351번지 | 서울특별시 성동구 둘레9길 20-5 | 316369 | 548803 | 209702 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 39 | 32 | 4 |
194 | C01487 | 금빛 | 서울특별시 성동구 마장로 37길 7 101동 102호 (마장동대성유니드아파트) | <NA> | <NA> | 1120010500008250000 | 1120010500108250000027551 | 서울특별시 성동구 마장동 825번지 | 서울특별시 성동구 마장로37길 7 | 315280 | 552022 | 413926 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 20 | 5 | 3 |
195 | C26034 | 구립 마장 | 서울특별시 성동구 마장로 44길 10로 | <NA> | <NA> | 1120010500007800000 | 1120010500107800000021090 | 서울특별시 성동구 마장동 780번지 | 서울특별시 성동구 마장로44길 10 | 315880 | 552001 | 207831 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 116 | 22 | 7 |
196 | C01104 | 구립 매봉도담 | 서울특별시 성동구 매봉길88 | <NA> | <NA> | 1120011300005280003 | 1120011300105470000010561 | 서울특별시 성동구 옥수동 528-3번지 | 서울특별시 성동구 매봉길 88 | 312966 | 550124 | 415755 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 49 | 10 | 4 |
197 | C01103 | 구립 맑은샘 | 서울특별시 성동구 매봉길 51 | <NA> | <NA> | 1120011300005280002 | 1120011300105280002000001 | 서울특별시 성동구 옥수동 528-2번지 | 서울특별시 성동구 매봉길 51 | 313013 | 549830 | 415755 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 68 | 9 | 5 |
198 | C25710 | 구립 옥수힐스 | 서울특별시 성동구 매봉길 50 (옥수동 옥수파크힐스 2단지) | <NA> | <NA> | 1120011300005280000 | 1120011300105340000010923 | 서울특별시 성동구 옥수동 528번지 | 서울특별시 성동구 매봉길 50 | 312901 | 549903 | 415639 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 45 | 44 | 4 |
199 | C00923 | 구립 옥수파크 | 서울특별시 성동구 매봉길50 | <NA> | <NA> | 1120011300005280000 | 1120011300105340000010923 | 서울특별시 성동구 옥수동 528번지 | 서울특별시 성동구 매봉길 50 | 312901 | 549903 | 415639 | 본교 | 어린이집 | <NA> | <NA> | <NA> | 41 | 10 | 4 |