Overview

Dataset statistics

Number of variables6
Number of observations1549
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory80.3 KiB
Average record size in memory53.1 B

Variable types

Numeric5
Text1

Dataset

Description샘플 데이터
Author서울시, 신한은행
URLhttps://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=319

Alerts

OBJECTID_1(OBJECTID_1) is highly overall correlated with Y중심좌표(Y_Center)High correlation
X중심좌표(X_Center) is highly overall correlated with 공간길이(SHAPE_LENG) and 1 other fieldsHigh correlation
Y중심좌표(Y_Center) is highly overall correlated with OBJECTID_1(OBJECTID_1)High correlation
공간길이(SHAPE_LENG) is highly overall correlated with X중심좌표(X_Center) and 1 other fieldsHigh correlation
공간면적(SHAPE_AREA) is highly overall correlated with X중심좌표(X_Center) and 1 other fieldsHigh correlation
OBJECTID_1(OBJECTID_1) has unique valuesUnique
그리드코드(GRID50_CD) has unique valuesUnique

Reproduction

Analysis started2023-12-10 15:01:15.676772
Analysis finished2023-12-10 15:01:24.080353
Duration8.4 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

OBJECTID_1(OBJECTID_1)
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct1549
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean105780.49
Minimum39363
Maximum186311
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.7 KiB
2023-12-11T00:01:24.230036image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum39363
5-th percentile41475.4
Q147227
median116542
Q3149043
95-th percentile182272.6
Maximum186311
Range146948
Interquartile range (IQR)101816

Descriptive statistics

Standard deviation50615.707
Coefficient of variation (CV)0.47849756
Kurtosis-1.4216628
Mean105780.49
Median Absolute Deviation (MAD)45444
Skewness0.033982143
Sum1.6385398 × 108
Variance2.5619498 × 109
MonotonicityNot monotonic
2023-12-11T00:01:24.602940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
39363 1
 
0.1%
132289 1
 
0.1%
130153 1
 
0.1%
130152 1
 
0.1%
130151 1
 
0.1%
130150 1
 
0.1%
130149 1
 
0.1%
130148 1
 
0.1%
130147 1
 
0.1%
130146 1
 
0.1%
Other values (1539) 1539
99.4%
ValueCountFrequency (%)
39363 1
0.1%
39679 1
0.1%
39680 1
0.1%
39681 1
0.1%
39682 1
0.1%
39683 1
0.1%
39684 1
0.1%
39997 1
0.1%
39998 1
0.1%
39999 1
0.1%
ValueCountFrequency (%)
186311 1
0.1%
186310 1
0.1%
186309 1
0.1%
186308 1
0.1%
186307 1
0.1%
185733 1
0.1%
185732 1
0.1%
185731 1
0.1%
185730 1
0.1%
185729 1
0.1%
Distinct1549
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size12.2 KiB
2023-12-11T00:01:25.294395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters15490
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1549 ?
Unique (%)100.0%

Sample

1st rowGS00039363
2nd rowGS00039679
3rd rowGS00039680
4th rowGS00039681
5th rowGS00039683
ValueCountFrequency (%)
gs00039363 1
 
0.1%
gs00128481 1
 
0.1%
gs00130153 1
 
0.1%
gs00130152 1
 
0.1%
gs00130151 1
 
0.1%
gs00130150 1
 
0.1%
gs00130149 1
 
0.1%
gs00130148 1
 
0.1%
gs00130147 1
 
0.1%
gs00130146 1
 
0.1%
Other values (1539) 1539
99.4%
2023-12-11T00:01:26.174988image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4488
29.0%
1 1593
 
10.3%
G 1549
 
10.0%
S 1549
 
10.0%
4 1111
 
7.2%
8 952
 
6.1%
9 803
 
5.2%
2 761
 
4.9%
6 727
 
4.7%
5 724
 
4.7%
Other values (2) 1233
 
8.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 12392
80.0%
Uppercase Letter 3098
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 4488
36.2%
1 1593
 
12.9%
4 1111
 
9.0%
8 952
 
7.7%
9 803
 
6.5%
2 761
 
6.1%
6 727
 
5.9%
5 724
 
5.8%
7 648
 
5.2%
3 585
 
4.7%
Uppercase Letter
ValueCountFrequency (%)
G 1549
50.0%
S 1549
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 12392
80.0%
Latin 3098
 
20.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 4488
36.2%
1 1593
 
12.9%
4 1111
 
9.0%
8 952
 
7.7%
9 803
 
6.5%
2 761
 
6.1%
6 727
 
5.9%
5 724
 
5.8%
7 648
 
5.2%
3 585
 
4.7%
Latin
ValueCountFrequency (%)
G 1549
50.0%
S 1549
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15490
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 4488
29.0%
1 1593
 
10.3%
G 1549
 
10.0%
S 1549
 
10.0%
4 1111
 
7.2%
8 952
 
6.1%
9 803
 
5.2%
2 761
 
4.9%
6 727
 
4.7%
5 724
 
4.7%
Other values (2) 1233
 
8.0%

X중심좌표(X_Center)
Real number (ℝ)

HIGH CORRELATION 

Distinct1353
Distinct (%)87.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean200288.95
Minimum180356
Maximum216219
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.7 KiB
2023-12-11T00:01:26.549419image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum180356
5-th percentile185163
Q1194523
median201050
Q3207329
95-th percentile210604
Maximum216219
Range35863
Interquartile range (IQR)12806

Descriptive statistics

Standard deviation7988.5743
Coefficient of variation (CV)0.039885248
Kurtosis-0.4441949
Mean200288.95
Median Absolute Deviation (MAD)6427
Skewness-0.39183641
Sum3.1024758 × 108
Variance63817320
MonotonicityNot monotonic
2023-12-11T00:01:26.925998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
209131 4
 
0.3%
209531 4
 
0.3%
209532 4
 
0.3%
209431 4
 
0.3%
209632 4
 
0.3%
209481 4
 
0.3%
209081 4
 
0.3%
209582 4
 
0.3%
208981 3
 
0.2%
209031 3
 
0.2%
Other values (1343) 1511
97.5%
ValueCountFrequency (%)
180356 1
0.1%
180554 1
0.1%
180604 1
0.1%
180654 1
0.1%
180704 1
0.1%
180754 1
0.1%
180795 1
0.1%
180804 1
0.1%
180854 1
0.1%
180904 1
0.1%
ValueCountFrequency (%)
216219 1
0.1%
216169 1
0.1%
216167 1
0.1%
216119 1
0.1%
216117 1
0.1%
216069 1
0.1%
216067 1
0.1%
216019 1
0.1%
216017 1
0.1%
215969 1
0.1%

Y중심좌표(Y_Center)
Real number (ℝ)

HIGH CORRELATION 

Distinct551
Distinct (%)35.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean451606.68
Minimum444629
Maximum457963
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.7 KiB
2023-12-11T00:01:27.263559image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum444629
5-th percentile445002
Q1447865
median450364
Q3456883
95-th percentile457669
Maximum457963
Range13334
Interquartile range (IQR)9018

Descriptive statistics

Standard deviation4498.3374
Coefficient of variation (CV)0.0099607415
Kurtosis-1.4032704
Mean451606.68
Median Absolute Deviation (MAD)3520
Skewness0.087113961
Sum6.9953875 × 108
Variance20235040
MonotonicityNot monotonic
2023-12-11T00:01:27.600672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
450364 8
 
0.5%
449636 7
 
0.5%
450363 7
 
0.5%
449667 7
 
0.5%
449377 6
 
0.4%
449739 6
 
0.4%
449635 6
 
0.4%
450365 6
 
0.4%
449461 5
 
0.3%
449668 5
 
0.3%
Other values (541) 1486
95.9%
ValueCountFrequency (%)
444629 1
 
0.1%
444630 4
0.3%
444686 4
0.3%
444687 4
0.3%
444741 3
0.2%
444742 4
0.3%
444743 3
0.2%
444797 4
0.3%
444798 2
0.1%
444799 2
0.1%
ValueCountFrequency (%)
457963 1
 
0.1%
457904 3
0.2%
457903 3
0.2%
457843 1
 
0.1%
457842 2
0.1%
457841 2
0.1%
457840 2
0.1%
457780 3
0.2%
457779 2
0.1%
457778 3
0.2%

공간길이(SHAPE_LENG)
Real number (ℝ)

HIGH CORRELATION 

Distinct685
Distinct (%)44.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean200.07529
Minimum200.07083
Maximum200.07883
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.7 KiB
2023-12-11T00:01:27.916487image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum200.07083
5-th percentile200.07203
Q1200.07404
median200.07543
Q3200.07683
95-th percentile200.07762
Maximum200.07883
Range0.008000951
Interquartile range (IQR)0.002795703

Descriptive statistics

Standard deviation0.0017407778
Coefficient of variation (CV)8.7006136 × 10-6
Kurtosis-0.44175017
Mean200.07529
Median Absolute Deviation (MAD)0.001399978
Skewness-0.38946534
Sum309916.63
Variance3.0303074 × 10-6
MonotonicityNot monotonic
2023-12-11T00:01:28.207846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
200.077425975 26
 
1.7%
200.077227041 25
 
1.6%
200.077225978 19
 
1.2%
200.077237683 19
 
1.2%
200.077326508 18
 
1.2%
200.075423877 18
 
1.2%
200.075237713 17
 
1.1%
200.075823872 15
 
1.0%
200.073438801 10
 
0.6%
200.077629161 9
 
0.6%
Other values (675) 1373
88.6%
ValueCountFrequency (%)
200.070829258 1
 
0.1%
200.070929788 2
0.1%
200.071029255 1
 
0.1%
200.071029256 1
 
0.1%
200.071030317 1
 
0.1%
200.071030319 3
0.2%
200.071030321 1
 
0.1%
200.071032447 1
 
0.1%
200.071129783 1
 
0.1%
200.071129786 1
 
0.1%
ValueCountFrequency (%)
200.078830209 1
 
0.1%
200.078829143 2
0.1%
200.078730743 1
 
0.1%
200.07872968 2
0.1%
200.078729679 3
0.2%
200.078729677 2
0.1%
200.07863021 1
 
0.1%
200.078529682 1
 
0.1%
200.078430213 1
 
0.1%
200.078429153 1
 
0.1%

공간면적(SHAPE_AREA)
Real number (ℝ)

HIGH CORRELATION 

Distinct702
Distinct (%)45.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2501.8827
Minimum2501.771
Maximum2501.9711
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size13.7 KiB
2023-12-11T00:01:28.504159image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2501.771
5-th percentile2501.8011
Q12501.8513
median2501.8862
Q32501.9212
95-th percentile2501.941
Maximum2501.9711
Range0.20009861
Interquartile range (IQR)0.06991893

Descriptive statistics

Standard deviation0.043535755
Coefficient of variation (CV)1.7401198 × 10-5
Kurtosis-0.44176965
Mean2501.8827
Median Absolute Deviation (MAD)0.03501276
Skewness-0.38944702
Sum3875416.3
Variance0.001895362
MonotonicityNot monotonic
2023-12-11T00:01:28.819890image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2501.93602404 26
 
1.7%
2501.93104878 25
 
1.6%
2501.93131493 19
 
1.2%
2501.93102219 19
 
1.2%
2501.88595248 18
 
1.2%
2501.93353641 18
 
1.2%
2501.88129662 17
 
1.1%
2501.89595612 15
 
1.0%
2501.83630711 10
 
0.6%
2501.8959561 9
 
0.6%
Other values (692) 1373
88.6%
ValueCountFrequency (%)
2501.771045 1
 
0.1%
2501.77355914 1
 
0.1%
2501.77355915 1
 
0.1%
2501.77604669 1
 
0.1%
2501.77604671 1
 
0.1%
2501.77607325 1
 
0.1%
2501.7760733 3
0.2%
2501.77607337 1
 
0.1%
2501.77612652 1
 
0.1%
2501.7785608 1
 
0.1%
ValueCountFrequency (%)
2501.97114361 1
 
0.1%
2501.97111696 2
0.1%
2501.96865597 1
 
0.1%
2501.96862939 2
0.1%
2501.96862936 3
0.2%
2501.96862932 2
0.1%
2501.96614168 1
 
0.1%
2501.96362748 1
 
0.1%
2501.96113977 1
 
0.1%
2501.96111326 1
 
0.1%

Interactions

2023-12-11T00:01:22.338566image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:16.409066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:17.723871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:19.564510image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:20.833074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:22.591778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:16.682581image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:17.979793image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:19.801951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:21.079653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:22.815068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:16.894753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:18.229700image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:20.059634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:21.353142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:23.040457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:17.131892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:18.490840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:20.292964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:21.609461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:23.371782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:17.381187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:19.236961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:20.555707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T00:01:21.863824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T00:01:29.093597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
OBJECTID_1(OBJECTID_1)X중심좌표(X_Center)Y중심좌표(Y_Center)공간길이(SHAPE_LENG)공간면적(SHAPE_AREA)
OBJECTID_1(OBJECTID_1)1.0000.6800.9720.6360.636
X중심좌표(X_Center)0.6801.0000.6610.9950.995
Y중심좌표(Y_Center)0.9720.6611.0000.6190.619
공간길이(SHAPE_LENG)0.6360.9950.6191.0001.000
공간면적(SHAPE_AREA)0.6360.9950.6191.0001.000
2023-12-11T00:01:29.337517image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
OBJECTID_1(OBJECTID_1)X중심좌표(X_Center)Y중심좌표(Y_Center)공간길이(SHAPE_LENG)공간면적(SHAPE_AREA)
OBJECTID_1(OBJECTID_1)1.0000.192-0.9990.1710.171
X중심좌표(X_Center)0.1921.000-0.1630.9980.998
Y중심좌표(Y_Center)-0.999-0.1631.000-0.143-0.143
공간길이(SHAPE_LENG)0.1710.998-0.1431.0001.000
공간면적(SHAPE_AREA)0.1710.998-0.1431.0001.000

Missing values

2023-12-11T00:01:23.695026image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T00:01:23.971308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

OBJECTID_1(OBJECTID_1)그리드코드(GRID50_CD)X중심좌표(X_Center)Y중심좌표(Y_Center)공간길이(SHAPE_LENG)공간면적(SHAPE_AREA)
039363GS00039363203421457963200.076042501.901357
139679GS00039679201520457903200.0754392501.886325
239680GS00039680201570457903200.0755382501.888813
339681GS00039681201620457903200.0756392501.891327
439683GS00039683201720457904200.0755392501.888839
539997GS00039997199120457840200.0750392501.876321
639998GS00039998199170457840200.0750392501.876321
739999GS00039999199220457841200.0750392501.876321
840000GS00040000199270457841200.0750392501.876321
940003GS00040003199420457842200.0751392501.878836
OBJECTID_1(OBJECTID_1)그리드코드(GRID50_CD)X중심좌표(X_Center)Y중심좌표(Y_Center)공간길이(SHAPE_LENG)공간면적(SHAPE_AREA)
1539128977GS00128977204466449565200.0764292501.911095
1540145890GS00145890213026448260200.0780282501.951083
1541146954GS00146954207725448132200.0770282501.926074
1542149556GS00149556192020447798200.0736272501.841016
1543149559GS00149559192171447799200.0736272501.841016
1544150576GS00150576185219447662200.0720272501.801002
1545158549GS00158549209581447141200.0773272501.933536
1546162569GS00162569210183446794200.0774262501.936024
1547162574GS00162574210433446796200.0775252501.938512
1548185732GS00185732199540444687200.0751242501.878463