Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 1984 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 102.8 KiB |
Average record size in memory | 53.1 B |
Variable types
Numeric | 5 |
---|---|
Text | 1 |
Dataset
Description | 샘플 데이터 |
---|---|
Author | 신한은행 |
URL | https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=320 |
OBJECTID_1(OBJECTID_1) is highly overall correlated with Y중심좌표(Y_Center) | High correlation |
X중심좌표(X_Center) is highly overall correlated with 공간길이(SHAPE_LENG) and 1 other fields | High correlation |
Y중심좌표(Y_Center) is highly overall correlated with OBJECTID_1(OBJECTID_1) | High correlation |
공간길이(SHAPE_LENG) is highly overall correlated with X중심좌표(X_Center) and 1 other fields | High correlation |
공간면적(SHAPE_AREA) is highly overall correlated with X중심좌표(X_Center) and 1 other fields | High correlation |
OBJECTID_1(OBJECTID_1) has unique values | Unique |
그리드코드(GRID50_CD) has unique values | Unique |
Reproduction
Analysis started | 2023-12-10 14:59:30.842842 |
---|---|
Analysis finished | 2023-12-10 14:59:37.629460 |
Duration | 6.79 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
OBJECTID_1(OBJECTID_1)
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 1984 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 74272.196 |
Minimum | 5418 |
---|---|
Maximum | 241647 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.6 KiB |
Quantile statistics
Minimum | 5418 |
---|---|
5-th percentile | 11724.3 |
Q1 | 24753.5 |
median | 46361 |
Q3 | 119475 |
95-th percentile | 209714.3 |
Maximum | 241647 |
Range | 236229 |
Interquartile range (IQR) | 94721.5 |
Descriptive statistics
Standard deviation | 64439.208 |
---|---|
Coefficient of variation (CV) | 0.86760875 |
Kurtosis | -0.11336939 |
Mean | 74272.196 |
Median Absolute Deviation (MAD) | 26105.5 |
Skewness | 1.0802499 |
Sum | 1.4735604 × 108 |
Variance | 4.1524115 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5418 | 1 | 0.1% |
13887 | 1 | 0.1% |
34243 | 1 | 0.1% |
31705 | 1 | 0.1% |
28131 | 1 | 0.1% |
26061 | 1 | 0.1% |
24671 | 1 | 0.1% |
23342 | 1 | 0.1% |
22150 | 1 | 0.1% |
21121 | 1 | 0.1% |
Other values (1974) | 1974 |
Value | Count | Frequency (%) |
5418 | 1 | |
5424 | 1 | |
5427 | 1 | |
5545 | 1 | |
5551 | 1 | |
5553 | 1 | |
5555 | 1 | |
5560 | 1 | |
5563 | 1 | |
5566 | 1 |
Value | Count | Frequency (%) |
241647 | 1 | |
241464 | 1 | |
241269 | 1 | |
240880 | 1 | |
240588 | 1 | |
239826 | 1 | |
239518 | 1 | |
239381 | 1 | |
239259 | 1 | |
237691 | 1 |
그리드코드(GRID50_CD)
Text
UNIQUE
 
Distinct | 1984 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 15.6 KiB |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Characters and Unicode
Total characters | 19840 |
---|---|
Distinct characters | 12 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 1984 ? |
---|---|
Unique (%) | 100.0% |
Sample
1st row | GS00005418 |
---|---|
2nd row | GS00031992 |
3rd row | GS00052227 |
4th row | GS00067035 |
5th row | GS00097864 |
Value | Count | Frequency (%) |
gs00005418 | 1 | 0.1% |
gs00007513 | 1 | 0.1% |
gs00031705 | 1 | 0.1% |
gs00028131 | 1 | 0.1% |
gs00026061 | 1 | 0.1% |
gs00024671 | 1 | 0.1% |
gs00023342 | 1 | 0.1% |
gs00022150 | 1 | 0.1% |
gs00021121 | 1 | 0.1% |
gs00020161 | 1 | 0.1% |
Other values (1974) | 1974 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 6274 | |
G | 1984 | 10.0% |
S | 1984 | 10.0% |
1 | 1622 | 8.2% |
2 | 1254 | 6.3% |
3 | 1191 | 6.0% |
4 | 1148 | 5.8% |
5 | 978 | 4.9% |
7 | 867 | 4.4% |
6 | 851 | 4.3% |
Other values (2) | 1687 | 8.5% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 15872 | |
Uppercase Letter | 3968 | 20.0% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 6274 | |
1 | 1622 | 10.2% |
2 | 1254 | 7.9% |
3 | 1191 | 7.5% |
4 | 1148 | 7.2% |
5 | 978 | 6.2% |
7 | 867 | 5.5% |
6 | 851 | 5.4% |
8 | 847 | 5.3% |
9 | 840 | 5.3% |
Uppercase Letter
Value | Count | Frequency (%) |
G | 1984 | |
S | 1984 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 15872 | |
Latin | 3968 | 20.0% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 6274 | |
1 | 1622 | 10.2% |
2 | 1254 | 7.9% |
3 | 1191 | 7.5% |
4 | 1148 | 7.2% |
5 | 978 | 6.2% |
7 | 867 | 5.5% |
6 | 851 | 5.4% |
8 | 847 | 5.3% |
9 | 840 | 5.3% |
Latin
Value | Count | Frequency (%) |
G | 1984 | |
S | 1984 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 19840 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 6274 | |
G | 1984 | 10.0% |
S | 1984 | 10.0% |
1 | 1622 | 8.2% |
2 | 1254 | 6.3% |
3 | 1191 | 6.0% |
4 | 1148 | 5.8% |
5 | 978 | 4.9% |
7 | 867 | 4.4% |
6 | 851 | 4.3% |
Other values (2) | 1687 | 8.5% |
X중심좌표(X_Center)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 1930 |
---|---|
Distinct (%) | 97.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 200383.96 |
Minimum | 179752 |
---|---|
Maximum | 215862 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.6 KiB |
Quantile statistics
Minimum | 179752 |
---|---|
5-th percentile | 187167.5 |
Q1 | 195216.25 |
median | 201345 |
Q3 | 205887.25 |
95-th percentile | 209570.45 |
Maximum | 215862 |
Range | 36110 |
Interquartile range (IQR) | 10671 |
Descriptive statistics
Standard deviation | 6919.2007 |
---|---|
Coefficient of variation (CV) | 0.034529714 |
Kurtosis | -0.27655431 |
Mean | 200383.96 |
Median Absolute Deviation (MAD) | 5075.5 |
Skewness | -0.50136917 |
Sum | 3.9756177 × 108 |
Variance | 47875338 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
198172 | 2 | 0.1% |
193162 | 2 | 0.1% |
197472 | 2 | 0.1% |
204576 | 2 | 0.1% |
198805 | 2 | 0.1% |
199702 | 2 | 0.1% |
192257 | 2 | 0.1% |
208575 | 2 | 0.1% |
204776 | 2 | 0.1% |
202416 | 2 | 0.1% |
Other values (1920) | 1964 |
Value | Count | Frequency (%) |
179752 | 1 | |
180795 | 1 | |
180799 | 1 | |
181002 | 1 | |
181104 | 1 | |
181608 | 1 | |
181637 | 1 | |
181809 | 1 | |
181810 | 1 | |
181837 | 1 |
Value | Count | Frequency (%) |
215862 | 1 | |
215819 | 1 | |
215757 | 1 | |
215220 | 1 | |
215210 | 1 | |
215055 | 1 | |
215004 | 1 | |
214857 | 1 | |
214760 | 1 | |
214617 | 1 |
Y중심좌표(Y_Center)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 1631 |
---|---|
Distinct (%) | 82.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 455025.76 |
Minimum | 438706 |
---|---|
Maximum | 464461 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.6 KiB |
Quantile statistics
Minimum | 438706 |
---|---|
5-th percentile | 442667.2 |
Q1 | 450139.25 |
median | 456951.5 |
Q3 | 460006.25 |
95-th percentile | 462731 |
Maximum | 464461 |
Range | 25755 |
Interquartile range (IQR) | 9867 |
Descriptive statistics
Standard deviation | 6332.5253 |
---|---|
Coefficient of variation (CV) | 0.01391685 |
Kurtosis | -0.4256305 |
Mean | 455025.76 |
Median Absolute Deviation (MAD) | 3701 |
Skewness | -0.77321136 |
Sum | 9.0277111 × 108 |
Variance | 40100877 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
464432 | 3 | 0.2% |
459843 | 3 | 0.2% |
456951 | 3 | 0.2% |
460625 | 3 | 0.2% |
457199 | 3 | 0.2% |
457386 | 3 | 0.2% |
457713 | 3 | 0.2% |
459823 | 3 | 0.2% |
458123 | 3 | 0.2% |
457331 | 3 | 0.2% |
Other values (1621) | 1954 |
Value | Count | Frequency (%) |
438706 | 1 | |
438765 | 1 | |
438797 | 1 | |
438822 | 1 | |
438972 | 1 | |
439106 | 1 | |
439108 | 1 | |
439153 | 1 | |
439215 | 1 | |
439446 | 1 |
Value | Count | Frequency (%) |
464461 | 1 | 0.1% |
464460 | 1 | 0.1% |
464459 | 1 | 0.1% |
464437 | 1 | 0.1% |
464436 | 1 | 0.1% |
464435 | 1 | 0.1% |
464433 | 1 | 0.1% |
464432 | 3 | |
464431 | 2 | |
464430 | 1 | 0.1% |
공간길이(SHAPE_LENG)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 1313 |
---|---|
Distinct (%) | 66.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 200.07531 |
Minimum | 200.07083 |
---|---|
Maximum | 200.07863 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.6 KiB |
Quantile statistics
Minimum | 200.07083 |
---|---|
5-th percentile | 200.07243 |
Q1 | 200.07424 |
median | 200.07554 |
Q3 | 200.07644 |
95-th percentile | 200.07724 |
Maximum | 200.07863 |
Range | 0.007800952 |
Interquartile range (IQR) | 0.0022066298 |
Descriptive statistics
Standard deviation | 0.0015083557 |
---|---|
Coefficient of variation (CV) | 7.5389396 × 10-6 |
Kurtosis | -0.26970886 |
Mean | 200.07531 |
Median Absolute Deviation (MAD) | 0.001101051 |
Skewness | -0.50182829 |
Sum | 396949.42 |
Variance | 2.2751369 × 10-6 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
200.077237683 | 17 | 0.9% |
200.073639863 | 10 | 0.5% |
200.076841952 | 9 | 0.5% |
200.075246239 | 9 | 0.5% |
200.076441957 | 9 | 0.5% |
200.0760377 | 8 | 0.4% |
200.076841951 | 8 | 0.4% |
200.075637703 | 8 | 0.4% |
200.076837689 | 8 | 0.4% |
200.075637706 | 8 | 0.4% |
Other values (1303) | 1890 |
Value | Count | Frequency (%) |
200.070831388 | 1 | |
200.071031384 | 1 | |
200.071032447 | 1 | |
200.071129785 | 1 | |
200.071229252 | 1 | |
200.071229253 | 1 | |
200.071229254 | 1 | |
200.071230315 | 1 | |
200.071233508 | 1 | |
200.071233509 | 1 |
Value | Count | Frequency (%) |
200.07863234 | 1 | |
200.078631276 | 2 | |
200.078629148 | 1 | |
200.078532873 | 1 | |
200.078531808 | 1 | |
200.078431279 | 2 | |
200.078429153 | 1 | |
200.07842915 | 1 | |
200.078427026 | 1 | |
200.078332875 | 1 |
공간면적(SHAPE_AREA)
Real number (ℝ)
HIGH CORRELATION
 
Distinct | 1333 |
---|---|
Distinct (%) | 67.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2501.8832 |
Minimum | 2501.7711 |
---|---|
Maximum | 2501.9662 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 17.6 KiB |
Quantile statistics
Minimum | 2501.7711 |
---|---|
5-th percentile | 2501.811 |
Q1 | 2501.8562 |
median | 2501.8889 |
Q3 | 2501.9114 |
95-th percentile | 2501.9314 |
Maximum | 2501.9662 |
Range | 0.19509668 |
Interquartile range (IQR) | 0.055186525 |
Descriptive statistics
Standard deviation | 0.037723021 |
---|---|
Coefficient of variation (CV) | 1.5077851 × 10-5 |
Kurtosis | -0.26973135 |
Mean | 2501.8832 |
Median Absolute Deviation (MAD) | 0.027536595 |
Skewness | -0.50181161 |
Sum | 4963736.2 |
Variance | 0.0014230263 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2501.93131493 | 17 | 0.9% |
2501.84133551 | 10 | 0.5% |
2501.92141783 | 9 | 0.5% |
2501.91141414 | 9 | 0.5% |
2501.88150985 | 9 | 0.5% |
2501.91130756 | 8 | 0.4% |
2501.92131123 | 8 | 0.4% |
2501.89130014 | 8 | 0.4% |
2501.89130023 | 8 | 0.4% |
2501.92141781 | 8 | 0.4% |
Other values (1323) | 1890 |
Value | Count | Frequency (%) |
2501.77109826 | 1 | |
2501.77609995 | 1 | |
2501.77612652 | 1 | |
2501.77856084 | 1 | |
2501.78104839 | 1 | |
2501.78104842 | 1 | |
2501.78104844 | 1 | |
2501.78107499 | 1 | |
2501.78115483 | 1 | |
2501.78115486 | 1 |
Value | Count | Frequency (%) |
2501.96619494 | 1 | |
2501.96616833 | 2 | |
2501.96611509 | 1 | |
2501.96370728 | 1 | |
2501.96368065 | 1 | |
2501.96116645 | 2 | |
2501.96111326 | 1 | |
2501.96111319 | 1 | |
2501.96106007 | 1 | |
2501.95870538 | 1 |
OBJECTID_1(OBJECTID_1) | X중심좌표(X_Center) | Y중심좌표(Y_Center) | 공간길이(SHAPE_LENG) | 공간면적(SHAPE_AREA) | |
---|---|---|---|---|---|
OBJECTID_1(OBJECTID_1) | 1.000 | 0.552 | 0.991 | 0.547 | 0.547 |
X중심좌표(X_Center) | 0.552 | 1.000 | 0.569 | 0.998 | 0.998 |
Y중심좌표(Y_Center) | 0.991 | 0.569 | 1.000 | 0.568 | 0.568 |
공간길이(SHAPE_LENG) | 0.547 | 0.998 | 0.568 | 1.000 | 1.000 |
공간면적(SHAPE_AREA) | 0.547 | 0.998 | 0.568 | 1.000 | 1.000 |
OBJECTID_1(OBJECTID_1) | X중심좌표(X_Center) | Y중심좌표(Y_Center) | 공간길이(SHAPE_LENG) | 공간면적(SHAPE_AREA) | |
---|---|---|---|---|---|
OBJECTID_1(OBJECTID_1) | 1.000 | -0.171 | -1.000 | -0.192 | -0.192 |
X중심좌표(X_Center) | -0.171 | 1.000 | 0.180 | 0.998 | 0.998 |
Y중심좌표(Y_Center) | -1.000 | 0.180 | 1.000 | 0.201 | 0.201 |
공간길이(SHAPE_LENG) | -0.192 | 0.998 | 0.201 | 1.000 | 1.000 |
공간면적(SHAPE_AREA) | -0.192 | 0.998 | 0.201 | 1.000 | 1.000 |
OBJECTID_1(OBJECTID_1) | 그리드코드(GRID50_CD) | X중심좌표(X_Center) | Y중심좌표(Y_Center) | 공간길이(SHAPE_LENG) | 공간면적(SHAPE_AREA) | |
---|---|---|---|---|---|---|
0 | 5418 | GS00005418 | 202136 | 464459 | 200.075947 | 2501.89903 |
1 | 31992 | GS00031992 | 206766 | 459031 | 200.07674 | 2501.918877 |
2 | 52227 | GS00052227 | 198028 | 456234 | 200.074737 | 2501.868779 |
3 | 67035 | GS00067035 | 208441 | 454538 | 200.077036 | 2501.92626 |
4 | 97864 | GS00097864 | 185398 | 451615 | 200.072032 | 2501.801135 |
5 | 133249 | GS00133249 | 184561 | 449109 | 200.071828 | 2501.796027 |
6 | 134667 | GS00134667 | 192464 | 449051 | 200.073529 | 2501.838555 |
7 | 153977 | GS00153977 | 211179 | 447550 | 200.077627 | 2501.941052 |
8 | 177763 | GS00177763 | 204388 | 445413 | 200.076224 | 2501.90596 |
9 | 212313 | GS00212313 | 203803 | 442459 | 200.076022 | 2501.900905 |
OBJECTID_1(OBJECTID_1) | 그리드코드(GRID50_CD) | X중심좌표(X_Center) | Y중심좌표(Y_Center) | 공간길이(SHAPE_LENG) | 공간면적(SHAPE_AREA) | |
---|---|---|---|---|---|---|
1974 | 52182 | GS00052182 | 195777 | 456222 | 200.074337 | 2501.858775 |
1975 | 53167 | GS00053167 | 205831 | 456175 | 200.076637 | 2501.916283 |
1976 | 5827 | GS00005827 | 204587 | 464372 | 200.076246 | 2501.906519 |
1977 | 7653 | GS00007653 | 206591 | 463882 | 200.076746 | 2501.91901 |
1978 | 11927 | GS00011927 | 199594 | 462644 | 200.075145 | 2501.878969 |
1979 | 13934 | GS00013934 | 204449 | 462170 | 200.076243 | 2501.906439 |
1980 | 15627 | GS00015627 | 202500 | 461760 | 200.075843 | 2501.896435 |
1981 | 16852 | GS00016852 | 204952 | 461522 | 200.076344 | 2501.908953 |
1982 | 17945 | GS00017945 | 198902 | 461240 | 200.075042 | 2501.876401 |
1983 | 19125 | GS00019125 | 207706 | 461087 | 200.076843 | 2501.921444 |